mlfoundations-dev/hp_ablations_grid_mistral_bsz512_lr5e-6_scheduler-cosine-warmup0.05-minlr5e-7 Text Generation • 7B • Updated Dec 17, 2024 • 3
mlfoundations-dev/hp_ablations_grid_mistral_bsz512_lr2e-6_scheduler-cosine-warmup0.15 Text Generation • 7B • Updated Dec 17, 2024 • 3
mlfoundations-dev/hp_ablations_grid_mistral_bsz512_lr2e-6_scheduler-cosine-warmup0.15-minlr5e-7 Text Generation • 7B • Updated Dec 17, 2024 • 3
mlfoundations-dev/hp_ablations_grid_mistral_bsz512_lr5e-6_scheduler-cosine-warmup0.15 Text Generation • 7B • Updated Dec 17, 2024 • 3
mlfoundations-dev/hp_ablations_grid_mistral_bsz512_lr2e-6_scheduler-cosine-warmup0.05-minlr5e-7 Text Generation • 7B • Updated Dec 17, 2024 • 3
mlfoundations-dev/hp_ablations_grid_mistral_bsz512_lr5e-6_scheduler-cosine-warmup0.15-minlr5e-7 Text Generation • 7B • Updated Dec 17, 2024 • 3
mlfoundations-dev/hp_ablations_grid_mistral_bsz1024_lr2e-6_scheduler-cosine-warmup0.05-minlr5e-7 Text Generation • 7B • Updated Dec 16, 2024 • 3
mlfoundations-dev/hp_ablations_grid_mistral_bsz1024_lr2e-6_scheduler-cosine-warmup0.15-minlr5e-7 Text Generation • 7B • Updated Dec 16, 2024 • 3
mlfoundations-dev/hp_ablations_grid_mistral_bsz1024_lr5e-6_scheduler-cosine-warmup0.15-minlr5e-7 Text Generation • 7B • Updated Dec 16, 2024 • 3
mlfoundations-dev/hp_ablations_grid_mistral_bsz1024_lr2e-6_scheduler-cosine-warmup0.15 Text Generation • 7B • Updated Dec 16, 2024 • 3
mlfoundations-dev/hp_ablations_grid_mistral_bsz1024_lr5e-6_scheduler-cosine-warmup0.15 Text Generation • 7B • Updated Dec 16, 2024 • 3
mlfoundations-dev/hp_ablations_grid_mistral_bsz1024_lr5e-6_scheduler-cosine-warmup0.05-minlr5e-7 Text Generation • 7B • Updated Dec 16, 2024 • 3
mlfoundations-dev/hp_ablations_grid_mistral_bsz4096_lr2e-6_scheduler-cosine-warmup0.05-minlr5e-7 Text Generation • 7B • Updated Dec 16, 2024 • 3
mlfoundations-dev/hp_ablations_grid_mistral_bsz2048_lr2e-6_scheduler-cosine-warmup0.05-minlr5e-7 Text Generation • 7B • Updated Dec 16, 2024 • 3
mlfoundations-dev/hp_ablations_grid_mistral_bsz4096_lr2e-6_scheduler-cosine-warmup0.15-minlr5e-7 Text Generation • 7B • Updated Dec 16, 2024 • 3
mlfoundations-dev/hp_ablations_grid_mistral_bsz2048_lr2e-6_scheduler-cosine-warmup0.15-minlr5e-7 Text Generation • 7B • Updated Dec 16, 2024 • 3
mlfoundations-dev/hp_ablations_grid_mistral_bsz2048_lr2e-6_scheduler-cosine-warmup0.15 Text Generation • 7B • Updated Dec 16, 2024 • 3
mlfoundations-dev/hp_ablations_grid_mistral_bsz4096_lr2e-6_scheduler-cosine-warmup0.15 Text Generation • 7B • Updated Dec 16, 2024 • 6
mlfoundations-dev/hp_ablations_grid_mistral_bsz4096_lr5e-6_scheduler-cosine-warmup0.05-minlr5e-7 Text Generation • 7B • Updated Dec 16, 2024 • 3
mlfoundations-dev/hp_ablations_grid_mistral_bsz2048_lr5e-6_scheduler-cosine-warmup0.05-minlr5e-7 Text Generation • 7B • Updated Dec 16, 2024 • 3
mlfoundations-dev/hp_ablations_grid_mistral_bsz4096_lr5e-6_scheduler-cosine-warmup0.15 Text Generation • 7B • Updated Dec 16, 2024 • 3