-
TinyV: Reducing False Negatives in Verification Improves RL for LLM Reasoning
Paper • 2505.14625 • Published • 13 -
1
TinyV
💬Verify model answers against ground truth
-
zhangchenxu/TinyV-Qwen3-1.7B
Text Generation • 2B • Updated • 27 -
zhangchenxu/TinyV-Qwen3-1.7B-Think
Text Generation • 2B • Updated • 28 • 2
Zhangchen Xu PRO
zhangchenxu
AI & ML interests
LLM Data, Alignment, Post-Training, Safety
Recent Activity
updated
a model
about 9 hours ago
zhangchenxu/Qwen2.5-14B-Instruct-SFT-LR1.0e-5-EPOCHS2-KimiK2-20250820_055153
updated
a model
about 9 hours ago
zhangchenxu/GLM-4-9B-0414-SFT-LR1.0e-5-EPOCHS2-OSS-20250820_060458_G4
updated
a model
about 10 hours ago
zhangchenxu/Qwen2.5-7B-Instruct-SFT-LR1.0e-5-EPOCHS2-OSS-20250820_055005_TB