Triangle104 commited on
Commit
069eead
·
verified ·
1 Parent(s): 4127f25

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +13 -0
README.md CHANGED
@@ -13,6 +13,19 @@ tags:
13
  This model was converted to GGUF format from [`UCSC-VLAA/STAR1-R1-Distill-1.5B`](https://huggingface.co/UCSC-VLAA/STAR1-R1-Distill-1.5B) using llama.cpp via the ggml.ai's [GGUF-my-repo](https://huggingface.co/spaces/ggml-org/gguf-my-repo) space.
14
  Refer to the [original model card](https://huggingface.co/UCSC-VLAA/STAR1-R1-Distill-1.5B) for more details on the model.
15
 
 
 
 
 
 
 
 
 
 
 
 
 
 
16
  ## Use with llama.cpp
17
  Install llama.cpp through brew (works on Mac and Linux)
18
 
 
13
  This model was converted to GGUF format from [`UCSC-VLAA/STAR1-R1-Distill-1.5B`](https://huggingface.co/UCSC-VLAA/STAR1-R1-Distill-1.5B) using llama.cpp via the ggml.ai's [GGUF-my-repo](https://huggingface.co/spaces/ggml-org/gguf-my-repo) space.
14
  Refer to the [original model card](https://huggingface.co/UCSC-VLAA/STAR1-R1-Distill-1.5B) for more details on the model.
15
 
16
+ ---
17
+ STAR-1 is a high-quality safety dataset designed to enhance safety alignment in large reasoning models (LRMs) like DeepSeek-R1.
18
+
19
+
20
+ Built on the principles of diversity, deliberative reasoning, and
21
+ rigorous filtering, STAR-1 integrates and refines data from multiple
22
+ sources to provide policy-grounded reasoning samples.
23
+ The dataset contains 1,000 carefully selected examples, each aligned with best safety practices through GPT-4o-based evaluation.
24
+ Fine-tuning with STAR-1 leads to significant safety improvements
25
+ across multiple benchmarks, with minimal impact on reasoning
26
+ capabilities.
27
+
28
+ ---
29
  ## Use with llama.cpp
30
  Install llama.cpp through brew (works on Mac and Linux)
31