mlfoundations-dev/hp_ablations_grid_mistral_bsz2048_lr5e-6_scheduler-cosine-warmup0.15 Text Generation • 7B • Updated Dec 16, 2024 • 3
mlfoundations-dev/hp_ablations_grid_mistral_bsz4096_lr5e-6_scheduler-cosine-warmup0.15-minlr5e-7 Text Generation • 7B • Updated Dec 16, 2024 • 3
mlfoundations-dev/hp_ablations_grid_mistral_bsz2048_lr5e-6_scheduler-cosine-warmup0.15-minlr5e-7 Text Generation • 7B • Updated Dec 16, 2024 • 3
mlfoundations-dev/hp_ablations_grid_mistral_bsz512_lr5e-6_scheduler-cosine-warmup0.05-minlr5e-7-mistralv0.3 Text Generation • 7B • Updated Dec 19, 2024 • 3
mlfoundations-dev/hp_ablations_grid_mistral_bsz512_lr5e-6_scheduler-cosine-warmup0.15-minlr5e-7-mistralv0.3 Text Generation • 7B • Updated Dec 19, 2024 • 2
skymizer/mistral-7B-v0.1-relufication-stage-1-on-slim-orca Text Generation • 7B • Updated Dec 27, 2024 • 2
skymizer/Mistral-7B-v0.1-q-sparse-fineweb-edu-10000steps-4M-bs Text Generation • 7B • Updated Jan 16 • 2
Xinging/mistral-7b_sft_0.1_ratio_alpaca_gpt4_proj_by_comprehensive_ntrain_1890 Text Generation • 7B • Updated Jan 30 • 2