Update README.md
Browse files
README.md
CHANGED
@@ -1,3 +1,14 @@
|
|
1 |
-
---
|
2 |
-
license: apache-2.0
|
3 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
+
---
|
2 |
+
license: apache-2.0
|
3 |
+
datasets:
|
4 |
+
- ash001/arxiv-abstract
|
5 |
+
base_model:
|
6 |
+
- Qwen/Qwen2-0.5B-Instruct
|
7 |
+
---
|
8 |
+
# Model Card for hivemind-torchtune-Qwen2-0.5B
|
9 |
+
|
10 |
+
This model is a fine-tuned version of [Qwen2-0.5B-Instruct](https://huggingface.co/Qwen/Qwen2-0.5B-Instruct), optimized using Hivemind and TorchTune for collaborative training over the internet.
|
11 |
+
|
12 |
+
This model was fine-tuned using the [arxiv-abstract-dataset](https://huggingface.co/datasets/ash001/arxiv-abstract).
|
13 |
+
|
14 |
+
For explaining more on how we did this, please check out this [article](https://medium.com/@kannansarat9/finetuning-qwen-0-5b-using-hivemind-data-parallelism-over-the-internet-e20af1b15c05)!
|