arxiv:2501.04682
Violet Xiang
violetxi
AI & ML interests
None yet
Organizations
Papers
2
models
386
violetxi/sft_tir_3e-5_b32_warmup0.1_checkpoint-epoch2
Text Generation
•
Updated
•
29.6k
violetxi/ak-prm-full_sft-steptok_lr1e-5_wa0.03_wd0.01_epoch0
Text Generation
•
Updated
•
8
violetxi/ak-prm-full_sft-steptok_lr1e-5_wa0.03_wd0.01_checkpoint12000
Text Generation
•
Updated
•
7
violetxi/ak-prm-full_sft-steptok_lr1e-5_wa0.03_wd0.01_checkpoint6000
Text Generation
•
Updated
•
5
violetxi/ak-prm-full_sft-steptok_lr1e-5_wa0.03_epoch0
Text Generation
•
Updated
•
10
violetxi/ak-prm-full_sft-steptok_lr1e-5_wa0.03_checkpoint12000
Text Generation
•
Updated
•
5
violetxi/ak-prm-full_sft-steptok_lr1e-5_wa0.03_checkpoint11400
Text Generation
•
Updated
•
7
violetxi/ak-prm-full_sft-steptok_lr1e-5_wa0.03_checkpoint10800
Text Generation
•
Updated
•
6
violetxi/ak-prm-full_sft-steptok_lr1e-5_wa0.03_checkpoint10200
Text Generation
•
Updated
•
5
violetxi/ak-prm-subfull_base_lr1e-5_wa0.03_wd0.01_checkpoint1260
Updated
datasets
262
violetxi/MATH-500_L3_best_first_N128_B16_D15_T0.0001_0-105
Viewer
•
Updated
•
21
•
25
violetxi/MATH-500_L5_best_first_N128_B16_D15_T0.0001_0-134
Viewer
•
Updated
•
18
•
24
violetxi/MATH-500_L4_best_first_N128_B8_D15_T0.0001_86-100
Viewer
•
Updated
•
14
•
21
violetxi/MATH-500_L2_best_first_N128_B16_D15_T0.0001_0-90
Viewer
•
Updated
•
90
•
20
violetxi/MATH-500_L4_best_first_N128_B16_D15_T0.0001_0-128
Viewer
•
Updated
•
21
•
21
violetxi/MATH-500_L1_best_first_N128_B16_D15_T0.0001_0-43
Viewer
•
Updated
•
43
•
20
violetxi/MATH-500_L3_best_first_N128_B8_D15_T0.0001_0-75
Viewer
•
Updated
•
75
•
21
violetxi/MATH-500_L4_best_first_N128_B8_D15_T0.0001_62-80
Viewer
•
Updated
•
18
•
27
violetxi/MATH-500_L4_best_first_N128_B8_D15_T0.0001_80-86
Viewer
•
Updated
•
6
•
18
violetxi/MATH-500_L5_best_first_N128_B8_D15_T0.0001_116-134
Viewer
•
Updated
•
18
•
26