mlfoundations-dev/hp_ablations_mistral_adambeta2_0.9995_dcftv1.2 Text Generation • 7B • Updated Dec 5, 2024 • 2
mlfoundations-dev/hp_ablations_mistral_adambeta2_0.98_dcftv1.2 Text Generation • 7B • Updated Dec 5, 2024 • 2
mlfoundations-dev/hp_ablations_mistral_bsz512_dcftv1.2 Text Generation • 7B • Updated Dec 5, 2024 • 2
mlfoundations-dev/hp_ablations_mistral_lr1e-6_dcftv1.2 Text Generation • 7B • Updated Dec 5, 2024 • 2
mlfoundations-dev/hp_ablations_mistral_lr5e-6_dcftv1.2 Text Generation • 7B • Updated Dec 5, 2024 • 2
mlfoundations-dev/hp_ablations_mistral_lr1e-5_dcftv1.2 Text Generation • 7B • Updated Dec 5, 2024 • 3
mlfoundations-dev/hp_ablations_mistral_lr2e-6_dcftv1.2 Text Generation • 7B • Updated Dec 5, 2024 • 2
mlfoundations-dev/hp_ablations_mistral_lr8e-6_dcftv1.2 Text Generation • 7B • Updated Dec 5, 2024 • 2
mlfoundations-dev/hp_ablations_mistral_scheduler_constant_dcftv1.2 Text Generation • 7B • Updated Dec 5, 2024 • 2
mlfoundations-dev/hp_ablations_mistral_scheduler_cosine_warmup0.05_minlr1e-6_dcftv1.2 Text Generation • 7B • Updated Dec 5, 2024 • 2
mlfoundations-dev/hp_ablations_mistral_scheduler_cosine_warmup0.05_minlr5e-7_dcftv1.2 Text Generation • 7B • Updated Dec 5, 2024 • 2
mlfoundations-dev/hp_ablations_mistral_scheduler_cosine_warmup0.05_dcftv1.2 Text Generation • 7B • Updated Dec 5, 2024 • 2
mlfoundations-dev/hp_ablations_mistral_scheduler_cosine_warmup0.05_minlr1e-7_dcftv1.2 Text Generation • 7B • Updated Dec 5, 2024 • 2
mlfoundations-dev/hp_ablations_mistral_bsz1024_dcftv1.2 Text Generation • 7B • Updated Dec 4, 2024 • 2
mlfoundations-dev/hp_ablations_mistral_scheduler_linear_warmup0.05_dcftv1.2 Text Generation • 7B • Updated Dec 4, 2024 • 3
mlfoundations-dev/hp_ablations_mistral_scheduler_cosine_warmup0.15_dcftv1.2 Text Generation • 7B • Updated Dec 4, 2024 • 2
mlfoundations-dev/hp_ablations_mistral_scheduler_inverse_sqrt_dcftv1.2 Text Generation • 7B • Updated Dec 4, 2024 • 3
mlfoundations-dev/hp_ablations_mistral_scheduler_linear_warmup0.10_dcftv1.2 Text Generation • 7B • Updated Dec 4, 2024 • 2
mlfoundations-dev/hp_ablations_mistral_scheduler_cosine_warmup0.10_minlr1e-6_dcftv1.2 Text Generation • 7B • Updated Dec 5, 2024 • 2
mlfoundations-dev/hp_ablations_mistral_bsz2048_dcftv1.2 Text Generation • 7B • Updated Dec 4, 2024 • 2
mlfoundations-dev/hp_ablations_mistral_scheduler_cosine_warmup0.10_dcftv1.2 Text Generation • 7B • Updated Dec 5, 2024 • 2
mlfoundations-dev/hp_ablations_mistral_scheduler_cosine_warmup0.10_minlr5e-7_dcftv1.2 Text Generation • 7B • Updated Dec 5, 2024 • 2
mlfoundations-dev/hp_ablations_mistral_scheduler_cosine_warmup0.10_minlr1e-7_dcftv1.2 Text Generation • 7B • Updated Dec 5, 2024 • 2
plaguss/Mistral-7B-v0.1-Math-Shepherd-PRM-token-0.1 Token Classification • 7B • Updated Dec 5, 2024 • 2
mlfoundations-dev/hp_ablations_mistral_epoch2_dcftv1.2 Text Generation • 7B • Updated Dec 9, 2024 • 3
mlfoundations-dev/hp_ablations_mistral_epoch3_dcftv1.2 Text Generation • 7B • Updated Dec 9, 2024 • 2
mlfoundations-dev/hp_ablations_mistral_epoch4_dcftv1.2 Text Generation • 7B • Updated Dec 9, 2024 • 2
mlfoundations-dev/hp_ablations_mistral_epoch5_dcftv1.2 Text Generation • 7B • Updated Dec 9, 2024 • 2
mlfoundations-dev/hp_ablations_mistral_epoch1_dcftv1.2 Text Generation • 7B • Updated Dec 9, 2024 • 2