Tristan/llama3.2_piqa_custom_splits_sft_broad_1e-5_lr1e-5_wd0.001_ep5_piqa_custom_splits Text Generation β’ 1B β’ Updated Sep 30, 2025 β’ 1
Tristan/llama3.2_piqa_custom_splits_sft_broad_1e-5_lr1e-5_wd0.001_ep10_piqa_custom_splits Text Generation β’ 1B β’ Updated Sep 30, 2025 β’ 2
Tristan/llama3.2_piqa_custom_splits_sft_broad_1e-5_lr1e-5_wd0.0001_ep10_piqa_custom_splits Text Generation β’ 1B β’ Updated Sep 30, 2025 β’ 1
Tristan/llama3.2_piqa_custom_splits_sft_broad_1e-5_lr1e-5_wd0.0001_ep5_piqa_custom_splits Text Generation β’ 1B β’ Updated Sep 30, 2025 β’ 1
Tristan/llama3.2_piqa_custom_splits_sft_broad_1e-5_lr1e-5_wd0.0001_ep1_piqa_custom_splits Text Generation β’ 1B β’ Updated Sep 28, 2025 β’ 1
Tristan/llama3.2_piqa_custom_splits_sft_broad_lr1e-6_wd0.0001_ep10_piqa_custom_splits Text Generation β’ 1B β’ Updated Sep 28, 2025 β’ 1
Tristan/sft_test_llama3.2_5e-6_5e-5_lr5e-5_wd0.001_ep5_arc_easy Text Generation β’ 1B β’ Updated Sep 28, 2025 β’ 2
Tristan/sft_test_llama3.2_5e-6_5e-5_lr5e-6_wd0.001_ep5_arc_easy Text Generation β’ 1B β’ Updated Sep 28, 2025 β’ 1
Tristan/sft_test_llama3.2_5e-6_5e-5_lr5e-6_wd0.0001_ep1_arc_easy Text Generation β’ 1B β’ Updated Sep 28, 2025 β’ 1
Tristan/sft_test_llama3.2_broad_coverage_lr1e-5_wd0.0001_ep10_arc_easy Text Generation β’ 1B β’ Updated Sep 25, 2025 β’ 2
Tristan/sft_test_llama3.2_broad_coverage_lr1e-5_wd0.0001_ep5_arc_easy Text Generation β’ 1B β’ Updated Sep 25, 2025 β’ 2
Tristan/sft_test_llama3.2_broad_coverage_lr1e-5_wd0.0001_ep1_arc_easy Text Generation β’ 1B β’ Updated Sep 25, 2025 β’ 1
Tristan/sft_test_llama3.2_broad_coverage_lr1e-6_wd0.0001_ep10_arc_easy Text Generation β’ 1B β’ Updated Sep 25, 2025 β’ 2
Tristan/sft_test_arc_easy_broader_lr1e-5_wd0.0001_ep5_arc_easy Text Generation β’ 0.6B β’ Updated Sep 25, 2025 β’ 2
Tristan/sft_test_arc_easy_broader_lr1e-5_wd0.001_ep10_arc_easy Text Generation β’ 0.6B β’ Updated Sep 25, 2025 β’ 2
Tristan/sft_test_arc_easy_broader_lr1e-5_wd0.001_ep5_arc_easy Text Generation β’ 0.6B β’ Updated Sep 25, 2025 β’ 2
Tristan/sft_test_arc_easy_broader_lr1e-5_wd0.01_ep1_arc_easy Text Generation β’ 0.6B β’ Updated Sep 24, 2025 β’ 2
Tristan/sft_test_arc_easy_broader_lr1e-6_wd0.0001_ep10_arc_easy Text Generation β’ 0.6B β’ Updated Sep 24, 2025 β’ 2
Tristan/llama3.2_arc_easy_sft_lr1e-6_wd0.0001_ep10_arc_easy Text Generation β’ 1B β’ Updated Sep 22, 2025 β’ 1
Tristan/qwen3_sft_meta_sft_arc_easy_lr1e-6_wd0.0001_ep10_arc_easy Text Generation β’ 0.6B β’ Updated Sep 8, 2025
Tristan/sft_qwen3_lambada_it_custom_splits_lr1e-6_wd0.0001_ep10_lambada_openai_mt_it_custom_splits Text Generation β’ 0.6B β’ Updated Sep 6, 2025 β’ 2
Tristan/sft_qwen3_lambada_es_custom_splits_lr1e-6_wd0.0001_ep10_lambada_openai_mt_es_custom_splits Text Generation β’ 0.6B β’ Updated Sep 6, 2025 β’ 1
Tristan/sft_qwen3_lambada_fr_custom_splits_lr1e-6_wd0.0001_ep10_lambada_openai_mt_fr_custom_splits Text Generation β’ 0.6B β’ Updated Sep 6, 2025 β’ 1
Tristan/sft_qwen3_lambada_de_custom_splits_lr1e-6_wd0.0001_ep10_lambada_openai_mt_de_custom_splits Text Generation β’ 0.6B β’ Updated Sep 6, 2025 β’ 1
Tristan/sft_qwen3_lambada_en_custom_splits_lr1e-6_wd0.0001_ep10_lambada_openai_mt_en_custom_splits Text Generation β’ 0.6B β’ Updated Sep 6, 2025 β’ 1
Tristan/sft_qwen3_piqa_custom_splits_lr1e-6_wd0.001_ep10_piqa_custom_splits Text Generation β’ 0.6B β’ Updated Sep 6, 2025 β’ 1
Tristan/sft_qwen3_sciq_lr1e-6_wd0.001_ep5_sciq Text Generation β’ 0.6B β’ Updated Sep 6, 2025 β’ 1
Tristan/sft_qwen3_arc_easy_lr1e-6_wd0.0001_ep10_arc_easy Text Generation β’ 0.6B β’ Updated Sep 6, 2025 β’ 2