Predict human preferences for LLM responses.
Binfeng Xu
billxbf
models (9)

billxbf/nemo-sft-orpo • 12B • 3
billxbf/chai-nemo13b-sft-orpo-merge_v2 • Text Generation • 12B • 3
billxbf/chai-nemo-sft-orpo-merge • Text Generation • 12B • 3
billxbf/wsdm-qwen14b_dare_dslerp-gptq-q4 • Text Classification • 3B • 5
billxbf/phi4_4k_dare • Text Classification • 14B • 4
billxbf/wsdm-qwen14b_dare_dslerp • Text Classification • 14B • 6
billxbf/bulla_7b • 7B • 5
billxbf/mmos-deepseek-math-7b • Text Generation • 2
billxbf/specialized-rewoo-planner-7b

datasets (10)

billxbf/aimo_hard_bilingual • 3.56k • 13
billxbf/aimo-hard-bilingual • 2
billxbf/aimo-dataset • 3.79k • 19
billxbf/aimo-math-problems • 19.2k • 7
billxbf/lmsys61k • 110k • 11
billxbf/ppt127k • 127k • 6
billxbf/arxiv_dump • 11.1k • 19 • 1
billxbf/yfdump_5m • 5.18M • 4
billxbf/rewoo-instruction-finetuning • 2.04k • 13 • 2
billxbf/sotu2023-qa • 876 • 9