zerofata
/

L3.3-GeneticLemonade-Unleashed-v3-70B

@@ -150,6 +150,7 @@ a:hover {text-decoration: underline;}
       <p>This is a creative model intended to excel at character driven RP / ERP. It has not been tested or trained on adventure stories or any large amounts of creative writing.</p>
     </div>
   </div>
   <div class="section-container">
     <div class="section-header">
       <div class="section-indicator"></div>
@@ -181,6 +182,7 @@ a:hover {text-decoration: underline;}
       </div>
     </div>
   </div>
   <div class="section-container">
     <div class="section-header">
       <div class="section-indicator"></div>
@@ -203,6 +205,7 @@ a:hover {text-decoration: underline;}
       </div>
     </div>
   </div>
   <div class="section-container">
     <div class="section-header">
       <div class="section-indicator"></div>
@@ -211,8 +214,10 @@ a:hover {text-decoration: underline;}
     <div class="section-content">
       <p>The model first went through SFT with a small synthetic dataset of 2.9 million tokens, approximately 750 conversations. Primarily RP data with small amounts of random instruct / assistant data and creative writing.</p>
       <p>The model then went through DPO training using approx 1100 chosen examples from the SFT dataset that were of exceptional quality or showed verifiable instruction following. Rejected samples were generated using another Llama 3.3 finetune that is known for poor instruction following.</p>
       <h3 class="subheading">SFT 1*H200</h3>
-```yml
 # ====================
 # MODEL CONFIGURATION
 # ====================
@@ -322,10 +327,9 @@ save_safetensors: true
 wandb_project: project_name
 # wandb_entity: your_entity  # Uncomment and set if needed
 # wandb_name: your_run_name  # Uncomment and set if needed
-```
-      <h3 class="subheading">DPO 2*H200</h3>
-```yml
 # ====================
 # MODEL CONFIGURATION
 # ====================
@@ -421,7 +425,7 @@ save_safetensors: true
 wandb_project: project_name
 # wandb_entity: your_entity  # Uncomment and set if needed
 # wandb_name: your_run_name  # Uncomment and set if needed
-```
     </div>
   </div>
 </div>

       <p>This is a creative model intended to excel at character driven RP / ERP. It has not been tested or trained on adventure stories or any large amounts of creative writing.</p>
     </div>
   </div>
   <div class="section-container">
     <div class="section-header">
       <div class="section-indicator"></div>
       </div>
     </div>
   </div>
   <div class="section-container">
     <div class="section-header">
       <div class="section-indicator"></div>
       </div>
     </div>
   </div>
   <div class="section-container">
     <div class="section-header">
       <div class="section-indicator"></div>
     <div class="section-content">
       <p>The model first went through SFT with a small synthetic dataset of 2.9 million tokens, approximately 750 conversations. Primarily RP data with small amounts of random instruct / assistant data and creative writing.</p>
       <p>The model then went through DPO training using approx 1100 chosen examples from the SFT dataset that were of exceptional quality or showed verifiable instruction following. Rejected samples were generated using another Llama 3.3 finetune that is known for poor instruction following.</p>
+      <h3 class="subheading">Axolotl configs</h3>
+      <p>Neither are optimized for cost / performance efficiency, YMMV.</p>
       <h3 class="subheading">SFT 1*H200</h3>
+<pre><code>
 # ====================
 # MODEL CONFIGURATION
 # ====================
 wandb_project: project_name
 # wandb_entity: your_entity  # Uncomment and set if needed
 # wandb_name: your_run_name  # Uncomment and set if needed
+</code></pre>
+<h3 class="subheading">DPO 2*H200</h3>
+<pre><code>
 # ====================
 # MODEL CONFIGURATION
 # ====================
 wandb_project: project_name
 # wandb_entity: your_entity  # Uncomment and set if needed
 # wandb_name: your_run_name  # Uncomment and set if needed
+</code></pre>
     </div>
   </div>
 </div>