Create README.md
Browse files
README.md
ADDED
@@ -0,0 +1,37 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
+
---
|
2 |
+
license: apache-2.0
|
3 |
+
---
|
4 |
+
## Overview
|
5 |
+
|
6 |
+
**NovaSky Team** developed and released the [Sky-T1](https://huggingface.co/novasky-ai/Sky-T1-32B-Preview), a 32-billion parameter reasoning model adapted from Qwen2.5-32B-Instruct. This model is designed for advanced reasoning, coding, and mathematical tasks, achieving performance comparable to state-of-the-art models like o1-preview while being cost-efficient. Sky-T1 was trained on 17K verified responses from Qwen/QwQ-32B-Preview, with additional science data from the Still-2 dataset, ensuring high-quality and diverse learning sources.
|
7 |
+
|
8 |
+
The model supports complex reasoning via long chain-of-thought processes and excels in both coding and mathematical challenges. Utilizing Llama-Factory with DeepSpeed Zero-3 Offload, Sky-T1 training was completed in just 19 hours on 8 H100 GPUs, demonstrating efficient resource utilization. These capabilities make Sky-T1 an exceptional tool for applications in programming, academic research, and reasoning-intensive tasks.
|
9 |
+
|
10 |
+
## Variants
|
11 |
+
|
12 |
+
| No | Variant | Cortex CLI command |
|
13 |
+
| --- | --- | --- |
|
14 |
+
| 1 | [gguf](https://huggingface.co/cortexso/sky-t1/tree/main) | `cortex run sky-t1` |
|
15 |
+
|
16 |
+
## Use it with Jan (UI)
|
17 |
+
|
18 |
+
1. Install **Jan** using [Quickstart](https://jan.ai/docs/quickstart)
|
19 |
+
2. Use in Jan model Hub:
|
20 |
+
```text
|
21 |
+
cortexso/sky-t1
|
22 |
+
```
|
23 |
+
|
24 |
+
## Use it with Cortex (CLI)
|
25 |
+
|
26 |
+
1. Install **Cortex** using [Quickstart](https://cortex.jan.ai/docs/quickstart)
|
27 |
+
2. Run the model with command:
|
28 |
+
```bash
|
29 |
+
cortex run sky-t1
|
30 |
+
```
|
31 |
+
|
32 |
+
## Credits
|
33 |
+
|
34 |
+
- **Author:** NovaSky Team
|
35 |
+
- **Converter:** [Homebrew](https://www.homebrew.ltd/)
|
36 |
+
- **Original License:** [License](https://huggingface.co/novasky-ai/Sky-T1-32B-Preview#license)
|
37 |
+
- **Papers:** [Sky-T1: Fully Open-Source Reasoning Model](https://novasky-ai.github.io/posts/sky-t1/)
|