legolasyiu commited on
Commit
5b8b033
·
verified ·
1 Parent(s): bdb5463

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +5 -0
README.md CHANGED
@@ -18,6 +18,10 @@ SA stands for Safety and alignment. We fine tuned DeepCoder-1.5B-Preview with ST
18
 
19
  This model is fine-tuned with policy-grounded data to be safe and aligned with human values while coding. Specifically, it utilizes the STAR-1 dataset, which integrates diverse, deliberative reasoning examples evaluated rigorously by GPT-4o. This ensures the model maintains robust safety standards and minimizes biases, promoting responsible, secure, and effective coding practices without compromising its core reasoning capabilities.
20
 
 
 
 
 
21
  @misc{deepcoder2025,
22
  title={DeepCoder: A Fully Open-Source 14B Coder at O3-mini Level},
23
  author={Michael Luo, Sijun Tan, Roy Huang, Ameen Patel, Alpay Ariyak, Qingyang Wu, Xiaoxiang Shi, Rachel Xin, Colin Cai, Maurice Weber, Ce Zhang, Li Erran Li, Raluca Ada Popa, Ion Stoica},
@@ -25,6 +29,7 @@ This model is fine-tuned with policy-grounded data to be safe and aligned with h
25
  note={Notion Blog},
26
  year={2025}
27
  }
 
28
 
29
 
30
  # Uploaded model
 
18
 
19
  This model is fine-tuned with policy-grounded data to be safe and aligned with human values while coding. Specifically, it utilizes the STAR-1 dataset, which integrates diverse, deliberative reasoning examples evaluated rigorously by GPT-4o. This ensures the model maintains robust safety standards and minimizes biases, promoting responsible, secure, and effective coding practices without compromising its core reasoning capabilities.
20
 
21
+
22
+ ## Citation
23
+
24
+ ```bibtex
25
  @misc{deepcoder2025,
26
  title={DeepCoder: A Fully Open-Source 14B Coder at O3-mini Level},
27
  author={Michael Luo, Sijun Tan, Roy Huang, Ameen Patel, Alpay Ariyak, Qingyang Wu, Xiaoxiang Shi, Rachel Xin, Colin Cai, Maurice Weber, Ce Zhang, Li Erran Li, Raluca Ada Popa, Ion Stoica},
 
29
  note={Notion Blog},
30
  year={2025}
31
  }
32
+ ```
33
 
34
 
35
  # Uploaded model