What is this
This the KTO checkpoint of my MS3.2 Austral winton train. Use the MS3.2 Winton train for the best experience.
wandb: https://wandb.ai/new-eden/austral/runs/2iaj6moy?nw=nwuserdeltavector
Datasets:
datasets:
- path: Delta-Vector/Tauri-IFeval-Dans-Tulu-KTO
split: train
type: chatml.argilla
- path: Delta-Vector/Tauri-Opus-accepted-hermes-rejected-shuffled
split: train
type: chatml.argilla
- path: Delta-Vector/Tauri-Opus-Accepted-GPT-Rejected-Opus-Writing-Prompts
split: train
type: chatml.argilla
- path: Delta-Vector/Tauri-Helpsteer3-Edit
split: train
type: chatml.argilla
- path: Delta-Vector/Tauri-Helpsteer-3-Preference-KTO
split: train
type: chatml.argilla
- path: NewEden/Purpura-Arkhaios-CC-KTO
split: train
type: chatml.argilla
- path: Delta-Vector/Tauri-KTO-Instruct-Mix
split: train
type: chatml.argilla
- path: Delta-Vector/Tauri-LIT-RL-KTO
split: train
type: chatml.argilla
- path: Delta-Vector/Tauri-Synth-1-KTO-R1-No-Think
split: train
type: chatml.argilla
Trained on 8xA100s using Axolotl. Ty to my work & Auri <3
- Downloads last month
- 16
Inference Providers
NEW
This model isn't deployed by any Inference Provider.
๐
Ask for provider support
Model tree for Delta-Vector/MS3.2-Austral-24B-KTO
Base model
mistralai/Mistral-Small-3.1-24B-Base-2503
Finetuned
Gryphe/Codex-24B-Small-3.2
Finetuned
Delta-Vector/MS3.2-Austral-24B-SFT