Update README.md
#4 by cyente · opened

README.md CHANGED
@@ -27,7 +27,7 @@ Qwen2.5-Coder is the latest series of Code-Specific Qwen large language models (
 
 **This repo contains the 7B Qwen2.5-Coder model**, which has the following features:
 - Type: Causal Language Models
-- Training Stage: Pretraining
+- Training Stage: Pretraining
 - Architecture: transformers with RoPE, SwiGLU, RMSNorm, and Attention QKV bias
 - Number of Parameters: 7.61B
 - Number of Paramaters (Non-Embedding): 6.53B
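The README lines above report both a total parameter count (7.61B) and a non-embedding count (6.53B). The gap between the two should roughly equal the embedding matrices. A minimal sanity-check sketch, assuming config values typical of this model family (hidden_size=3584, vocab_size=152064, untied input/output embeddings) — these numbers are assumptions, not taken from the diff:

```python
# Assumed config values (not stated in the README diff above):
hidden_size = 3584      # model hidden dimension
vocab_size = 152064     # tokenizer vocabulary size

# Input embedding + output (LM head) matrix, assuming they are untied.
embedding_params = 2 * vocab_size * hidden_size

# Gap implied by the README: total minus non-embedding parameters.
reported_gap = 7.61e9 - 6.53e9

print(f"embedding params ≈ {embedding_params / 1e9:.2f}B")
print(f"reported gap     ≈ {reported_gap / 1e9:.2f}B")
```

Under these assumptions the two figures agree to within about 1%, which is consistent with the rounded counts quoted in the README.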