CALISTA-INDUSTRY
/

DeepSeek-R1-Distill-Qwen-1.5B-FineTune

Model card Files Files and versions Community

rizkysulaeman commited on Jan 28

Commit

5f01434

·

verified ·

1 Parent(s): eb1e8fe

Update README.md

Files changed (1) hide show

README.md +52 -5

README.md CHANGED Viewed

@@ -1,5 +1,52 @@
----
-license: mit
-tags:
-- unsloth
----

+---
+license: mit
+tags:
+- unsloth
+- deepseek_v3
+---
+DeepSeek-R1 Release
+__________________________________________________________________________________________
+⚡ Performance on par with OpenAI-o1
+📖 Fully open-source model & technical report
+🏆 MIT licensed: Distill & commercialize freely!
+🌐 Website & API are live now! Try DeepThink at chat.deepseek.com today!
+__________________________________________________________________________________________
+🔥 Bonus: Open-Source Distilled Models!
+🔬 Distilled from DeepSeek-R1, 6 small models fully open-sourced
+📏 32B & 70B models on par with OpenAI-o1-mini
+🤝 Empowering the open-source community
+🌍 Pushing the boundaries of open AI!
+_____________________________________________________________________
+🛠️ DeepSeek-R1: Technical Highlights
+📈 Large-scale RL in post-training
+🏆 Significant performance boost with minimal labeled data
+🔢 Math, code, and reasoning tasks on par with OpenAI-o1
+📄 More details: https://github.com/deepseek-ai/DeepSeek-R1/blob/main/DeepSeek_R1.pdf
+_____________________________________________________________________
+🌐 API Access & Pricing
+⚙️ Use DeepSeek-R1 by setting model=deepseek-reasoner
+💰 $0.14 / million input tokens (cache hit)
+💰 $0.55 / million input tokens (cache miss)
+💰 $2.19 / million output tokens
+📖 API guide: https://api-docs.deepseek.com/guides/reasoning_model