Reasoning_eval

university

https://chtholly17.github.io/

AI & ML interests

None defined yet.

Recent Activity

dwenlong submitted a paper 17 days ago

For-Value: Efficient Forward-Only Data Valuation for finetuning LLMs and VLMs

dwenlong authored a paper about 2 months ago

Spend Less, Reason Better: Budget-Aware Value Tree Search for LLM Agents

xk-huang authored a paper 3 months ago

TUNA: Taming Unified Visual Representations for Native Unified Multimodal Models

View all activity

models 18

ReasoningEval/huatuo_sft_m23k_grpo_qwen3-14b

15B • Updated Nov 3, 2025 • 1

ReasoningEval/huatuo_sft_m23k_grpo_qwen3-8b

8B • Updated Nov 3, 2025 • 1

ReasoningEval/huatuo_sft_m23k_grpo_llama31-8b

8B • Updated Nov 3, 2025 • 1

ReasoningEval/openr1_sft_PRIME_grpo_qwen3-14b

15B • Updated Nov 3, 2025 • 3

ReasoningEval/openr1_sft_PRIME_grpo_qwen3-8b

8B • Updated Nov 3, 2025 • 1

ReasoningEval/openr1_sft_PRIME_grpo_llama31-8b

8B • Updated Nov 3, 2025 • 1

ReasoningEval/openr1_sft_qwen3-8b

8B • Updated Oct 29, 2025 • 1

ReasoningEval/openr1_sft_qwen3-14b

425k • Updated Oct 28, 2025 • 2

ReasoningEval/openr1_sft_llama31-8b

8B • Updated Oct 28, 2025 • 1

ReasoningEval/huatuo_sft_qwen3-8b

8B • Updated Oct 28, 2025 • 1

datasets 0

None public yet