ash001 commited on
Commit
341c6fb
·
verified ·
1 Parent(s): ce93108

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +14 -3
README.md CHANGED
@@ -1,3 +1,14 @@
1
- ---
2
- license: apache-2.0
3
- ---
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: apache-2.0
3
+ datasets:
4
+ - ash001/arxiv-abstract
5
+ base_model:
6
+ - Qwen/Qwen2-0.5B-Instruct
7
+ ---
8
+ # Model Card for hivemind-torchtune-Qwen2-0.5B
9
+
10
+ This model is a fine-tuned version of [Qwen2-0.5B-Instruct](https://huggingface.co/Qwen/Qwen2-0.5B-Instruct), optimized using Hivemind and TorchTune for collaborative training over the internet.
11
+
12
+ This model was fine-tuned using the [arxiv-abstract-dataset](https://huggingface.co/datasets/ash001/arxiv-abstract).
13
+
14
+ For explaining more on how we did this, please check out this [article](https://medium.com/@kannansarat9/finetuning-qwen-0-5b-using-hivemind-data-parallelism-over-the-internet-e20af1b15c05)!