Minh141120 commited on
Commit
123c9a6
·
verified ·
1 Parent(s): 7ff1959

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +37 -0
README.md ADDED
@@ -0,0 +1,37 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: apache-2.0
3
+ ---
4
+ ## Overview
5
+
6
+ **NovaSky Team** developed and released the [Sky-T1](https://huggingface.co/novasky-ai/Sky-T1-32B-Preview), a 32-billion parameter reasoning model adapted from Qwen2.5-32B-Instruct. This model is designed for advanced reasoning, coding, and mathematical tasks, achieving performance comparable to state-of-the-art models like o1-preview while being cost-efficient. Sky-T1 was trained on 17K verified responses from Qwen/QwQ-32B-Preview, with additional science data from the Still-2 dataset, ensuring high-quality and diverse learning sources.
7
+
8
+ The model supports complex reasoning via long chain-of-thought processes and excels in both coding and mathematical challenges. Utilizing Llama-Factory with DeepSpeed Zero-3 Offload, Sky-T1 training was completed in just 19 hours on 8 H100 GPUs, demonstrating efficient resource utilization. These capabilities make Sky-T1 an exceptional tool for applications in programming, academic research, and reasoning-intensive tasks.
9
+
10
+ ## Variants
11
+
12
+ | No | Variant | Cortex CLI command |
13
+ | --- | --- | --- |
14
+ | 1 | [gguf](https://huggingface.co/cortexso/sky-t1/tree/main) | `cortex run sky-t1` |
15
+
16
+ ## Use it with Jan (UI)
17
+
18
+ 1. Install **Jan** using [Quickstart](https://jan.ai/docs/quickstart)
19
+ 2. Use in Jan model Hub:
20
+ ```text
21
+ cortexso/sky-t1
22
+ ```
23
+
24
+ ## Use it with Cortex (CLI)
25
+
26
+ 1. Install **Cortex** using [Quickstart](https://cortex.jan.ai/docs/quickstart)
27
+ 2. Run the model with command:
28
+ ```bash
29
+ cortex run sky-t1
30
+ ```
31
+
32
+ ## Credits
33
+
34
+ - **Author:** NovaSky Team
35
+ - **Converter:** [Homebrew](https://www.homebrew.ltd/)
36
+ - **Original License:** [License](https://huggingface.co/novasky-ai/Sky-T1-32B-Preview#license)
37
+ - **Papers:** [Sky-T1: Fully Open-Source Reasoning Model](https://novasky-ai.github.io/posts/sky-t1/)