ash001
/

hivemind-torchtune-Qwen2-0.5B

Model card Files Files and versions Community

ash001 commited on Feb 26

Commit

341c6fb

·

verified ·

1 Parent(s): ce93108

Update README.md

Files changed (1) hide show

README.md +14 -3

README.md CHANGED Viewed

@@ -1,3 +1,14 @@
----
-license: apache-2.0
----

+---
+license: apache-2.0
+datasets:
+- ash001/arxiv-abstract
+base_model:
+- Qwen/Qwen2-0.5B-Instruct
+---
+# Model Card for hivemind-torchtune-Qwen2-0.5B
+This model is a fine-tuned version of [Qwen2-0.5B-Instruct](https://huggingface.co/Qwen/Qwen2-0.5B-Instruct), optimized using Hivemind and TorchTune for collaborative training over the internet.
+This model was fine-tuned using the [arxiv-abstract-dataset](https://huggingface.co/datasets/ash001/arxiv-abstract).
+For explaining more on how we did this, please check out this [article](https://medium.com/@kannansarat9/finetuning-qwen-0-5b-using-hivemind-data-parallelism-over-the-internet-e20af1b15c05)!