zerofata
/

L3.3-GeneticLemonade-Unleashed-v3-70B

Text Generation

text-generation-inference

Model card Files Files and versions

zerofata commited on 27 days ago

Commit

4507177

·

verified ·

1 Parent(s): b647fdc

Update README.md

Files changed (1) hide show

README.md +4 -4

README.md CHANGED Viewed

@@ -214,11 +214,11 @@ a:hover {text-decoration: underline;}
     <div class="section-content">
       <p>The model first went through SFT with a small synthetic dataset of 2.9 million tokens, approximately 750 conversations. Primarily RP data with small amounts of random instruct / assistant data and creative writing.</p>
       <p>The model then went through DPO training using approx 1100 chosen examples from the SFT dataset that were of exceptional quality or showed verifiable instruction following. Rejected samples were generated using another Llama 3.3 finetune that is known for poor instruction following.</p>
-      <h3 class="subheading">Axolotl configs</h3>
-      <p>Neither are optimized for cost / performance efficiency, YMMV.</p>
     </div>
   </div>
 </div>
 <h3>SFT 1*H200</h3>
 ```yml
@@ -329,8 +329,8 @@ save_safetensors: true
 # WANDB TRACKING
 # ====================
 wandb_project: project_name
-wandb_entity: your_entity
-wandb_name: your_run_name
 ```
 <h3>DPO 2*H200</h3>

     <div class="section-content">
       <p>The model first went through SFT with a small synthetic dataset of 2.9 million tokens, approximately 750 conversations. Primarily RP data with small amounts of random instruct / assistant data and creative writing.</p>
       <p>The model then went through DPO training using approx 1100 chosen examples from the SFT dataset that were of exceptional quality or showed verifiable instruction following. Rejected samples were generated using another Llama 3.3 finetune that is known for poor instruction following.</p>
     </div>
   </div>
 </div>
+<h3 class="subheading">Axolotl configs</h3>
+<p>Neither are optimized for cost / performance efficiency, YMMV.</p>
 <h3>SFT 1*H200</h3>
 ```yml
 # WANDB TRACKING
 # ====================
 wandb_project: project_name
+# wandb_entity: your_entity
+# wandb_name: your_run_name
 ```
 <h3>DPO 2*H200</h3>