Lambent
/

CosMoEAlpacaLisa-4x1b

Text Generation

Generated from Trainer

text-generation-inference

Model card Files Files and versions

Lambent commited on Apr 5, 2024

Commit

db61d16

·

verified ·

1 Parent(s): 403b4e5

Update README.md

Files changed (1) hide show

README.md +5 -2

README.md CHANGED Viewed

@@ -8,8 +8,11 @@ model-index:
   results: []
 ---
-<!-- This model card has been generated automatically according to the information the Trainer had access to. You
-should probably proofread and complete it, then remove this comment. -->
 [<img src="https://raw.githubusercontent.com/OpenAccess-AI-Collective/axolotl/main/image/axolotl-badge-web.png" alt="Built with Axolotl" width="200" height="32"/>](https://github.com/OpenAccess-AI-Collective/axolotl)
 <details><summary>See axolotl config</summary>

   results: []
 ---
+Intuitively it seemed like LISA training should suit a MoE pretty well; though I don't know how well calibrated my intuitions are.
+Interesting thing about this one is it looks like it wasn't converging at the end of one epoch. Still more to learn.
+Generalization eval pending.
 [<img src="https://raw.githubusercontent.com/OpenAccess-AI-Collective/axolotl/main/image/axolotl-badge-web.png" alt="Built with Axolotl" width="200" height="32"/>](https://github.com/OpenAccess-AI-Collective/axolotl)
 <details><summary>See axolotl config</summary>