llavallava/smolvlm-instruct-trl-dpo-0_0.5_qa_epochs1_ref Image-Text-to-Text • Updated 15 days ago • 10
llavallava/qwen2vl2b-instruct-trl-dpo-0_0.5_qa_epochs1_ref Image-Text-to-Text • Updated 15 days ago • 16
llavallava/qwen2vl2b-instruct-trl-dpo-0_0.1_epochs1_nonref Image-Text-to-Text • Updated 15 days ago • 4
prosecalign/phi3m0128-cds-0.75-kendall-onof-ofif-corr-max-2-simpo-max1500-default Updated 15 days ago
prosecalign/phi3m0128-cds-0.85-kendall-onof-ofif-corr-max-2-simpo-max1500-default Updated 15 days ago
prosecalign/phi3m0128-cds-0.65-kendall-onof-ofif-corr-max-2-simpo-max1500-default Updated 15 days ago
prosecalign/phi3m0128-wds-0.85-kendall-onof-ofif-corr-max-2-simpo-max1500-default Updated 15 days ago
prosecalign/phi3m0128-wds-0.75-kendall-onof-ofif-corr-max-2-simpo-max1500-default Updated 15 days ago
RyanYr/reflect_mini8B_Om2SftT2_om2-20to40kIpsdpIter2T02_b0.5 Text Generation • Updated 15 days ago • 160
RyanYr/reflect_mini8B_Om2SftT1-om2-20to40kIpsdpIter2T1_b0.5 Text Generation • Updated 15 days ago • 32
RyanYr/reflect_mini8B_Om2SftT2_om2-20to40kIpsdpIter2T02_b1.0 Text Generation • Updated 15 days ago • 3
prosecalign/phi3m0128-cds-0.8-kendall-onof-decrease-corr-max-2-simpo-max1500-default Updated 15 days ago
prosecalign/phi3m0128-cds-0.8-kendall-onof-neg_if-corr-max-2-simpo-max1500-default Updated 15 days ago