Add model card
Browse filesThis PR adds a model card, linking the model to the [paper page](https://huggingface.co/papers/2412.04139).
Feel free to update the other model cards too!
README.md
ADDED
@@ -0,0 +1,7 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
+
---
|
2 |
+
pipeline_tag: text-generation
|
3 |
+
---
|
4 |
+
|
5 |
+
This repository contains the monet-vd-4.1B-100BT model introduced in [Monet: Mixture of Monosemantic Experts for Transformers](https://huggingface.co/papers/2412.04139).
|
6 |
+
|
7 |
+
Code: https://github.com/dmis-lab/Monet
|