What is this

This the KTO checkpoint of my MS3.2 Austral winton train. Use the MS3.2 Winton train for the best experience.

wandb: https://wandb.ai/new-eden/austral/runs/2iaj6moy?nw=nwuserdeltavector

Datasets:

datasets:
  - path: Delta-Vector/Tauri-IFeval-Dans-Tulu-KTO
    split: train
    type: chatml.argilla
  - path: Delta-Vector/Tauri-Opus-accepted-hermes-rejected-shuffled 
    split: train
    type: chatml.argilla
  - path: Delta-Vector/Tauri-Opus-Accepted-GPT-Rejected-Opus-Writing-Prompts
    split: train
    type: chatml.argilla
  - path: Delta-Vector/Tauri-Helpsteer3-Edit
    split: train
    type: chatml.argilla
  - path: Delta-Vector/Tauri-Helpsteer-3-Preference-KTO
    split: train
    type: chatml.argilla
  - path: NewEden/Purpura-Arkhaios-CC-KTO
    split: train
    type: chatml.argilla
  - path: Delta-Vector/Tauri-KTO-Instruct-Mix
    split: train
    type: chatml.argilla
  - path: Delta-Vector/Tauri-LIT-RL-KTO
    split: train
    type: chatml.argilla
  - path: Delta-Vector/Tauri-Synth-1-KTO-R1-No-Think
    split: train
    type: chatml.argilla

Trained on 8xA100s using Axolotl. Ty to my work & Auri <3

Downloads last month
16
Safetensors
Model size
23.6B params
Tensor type
BF16
ยท
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Model tree for Delta-Vector/MS3.2-Austral-24B-KTO