zerofata commited on
Commit
f3a8ca8
·
verified ·
1 Parent(s): 7333d57

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +9 -9
README.md CHANGED
@@ -216,8 +216,11 @@ a:hover {text-decoration: underline;}
216
  <p>The model then went through DPO training using approx 1100 chosen examples from the SFT dataset that were of exceptional quality or showed verifiable instruction following. Rejected samples were generated using another Llama 3.3 finetune that is known for poor instruction following.</p>
217
  <h3 class="subheading">Axolotl configs</h3>
218
  <p>Neither are optimized for cost / performance efficiency, YMMV.</p>
219
- <h3 class="subheading">SFT 1*H200</h3>
220
- <pre><code>
 
 
 
221
  # ====================
222
  # MODEL CONFIGURATION
223
  # ====================
@@ -327,9 +330,9 @@ save_safetensors: true
327
  wandb_project: project_name
328
  # wandb_entity: your_entity # Uncomment and set if needed
329
  # wandb_name: your_run_name # Uncomment and set if needed
330
- </code></pre>
331
- <h3 class="subheading">DPO 2*H200</h3>
332
- <pre><code>
333
  # ====================
334
  # MODEL CONFIGURATION
335
  # ====================
@@ -425,9 +428,6 @@ save_safetensors: true
425
  wandb_project: project_name
426
  # wandb_entity: your_entity # Uncomment and set if needed
427
  # wandb_name: your_run_name # Uncomment and set if needed
428
- </code></pre>
429
- </div>
430
- </div>
431
- </div>
432
  </body>
433
  </html>
 
216
  <p>The model then went through DPO training using approx 1100 chosen examples from the SFT dataset that were of exceptional quality or showed verifiable instruction following. Rejected samples were generated using another Llama 3.3 finetune that is known for poor instruction following.</p>
217
  <h3 class="subheading">Axolotl configs</h3>
218
  <p>Neither are optimized for cost / performance efficiency, YMMV.</p>
219
+ </div>
220
+ </div>
221
+ </div>
222
+ <h3>SFT 1*H200</h3>
223
+ ```yml
224
  # ====================
225
  # MODEL CONFIGURATION
226
  # ====================
 
330
  wandb_project: project_name
331
  # wandb_entity: your_entity # Uncomment and set if needed
332
  # wandb_name: your_run_name # Uncomment and set if needed
333
+ ```
334
+ <h3>DPO 2*H200</h3>
335
+ ```yml
336
  # ====================
337
  # MODEL CONFIGURATION
338
  # ====================
 
428
  wandb_project: project_name
429
  # wandb_entity: your_entity # Uncomment and set if needed
430
  # wandb_name: your_run_name # Uncomment and set if needed
431
+ ```
 
 
 
432
  </body>
433
  </html>