Update README.md
]

response = tokenizer.batch_decode(generated_ids, skip_special_tokens=True)[0]
```
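The `generated_ids` list comprehension that feeds the `batch_decode` call above strips the prompt tokens from each output sequence, so only the newly generated continuation is decoded. A minimal self-contained sketch of that slicing step (the token IDs below are made up for illustration; in the snippet above they come from the tokenizer and `model.generate()`):

```python
# Toy illustration of the prompt-stripping step that precedes batch_decode.
# Token IDs here are invented; no model or tokenizer is required.
input_ids_batch = [[101, 7, 8, 9]]           # prompt tokens for one sequence
full_outputs = [[101, 7, 8, 9, 42, 43, 44]]  # generate() returns prompt + continuation

# Keep only the tokens generated after the prompt for each sequence.
generated_ids = [
    output_ids[len(input_ids):]
    for input_ids, output_ids in zip(input_ids_batch, full_outputs)
]
print(generated_ids)  # [[42, 43, 44]]
```

Decoding these sliced IDs (rather than the full outputs) is what keeps the prompt text out of `response`.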

## Reference

For more detailed information about the model, we encourage you to refer to our paper:

- **DistilQwen2.5: Industrial Practices of Training Distilled Open Lightweight Language Models**
  Chengyu Wang, Junbing Yan, Yuanhao Yue, Jun Huang
  [arXiv:2504.15027](https://arxiv.org/abs/2504.15027)

You can cite the paper using the following citation format:

```bibtex
@misc{wang2025distilqwen25industrialpracticestraining,
      title={DistilQwen2.5: Industrial Practices of Training Distilled Open Lightweight Language Models},
      author={Chengyu Wang and Junbing Yan and Yuanhao Yue and Jun Huang},
      year={2025},
      eprint={2504.15027},
      archivePrefix={arXiv},
      primaryClass={cs.CL},
      url={https://arxiv.org/abs/2504.15027}
}
```