yubo's picture

yubo

ubowang

·

AI & ML interests

None yet

Recent Activity

upvoted a paper 3 days ago

FutureX: An Advanced Live Benchmark for LLM Agents in Future Prediction

updated a dataset 3 days ago

TIGER-Lab/mmlu_pro_leaderboard_submission

upvoted a paper 11 days ago

BrowseComp-Plus: A More Fair and Transparent Evaluation Benchmark of Deep-Research Agent

View all activity

Organizations

upvoted a paper 3 days ago

FutureX: An Advanced Live Benchmark for LLM Agents in Future Prediction

Paper • 2508.11987 • Published 8 days ago • 57

updated a dataset 3 days ago

TIGER-Lab/mmlu_pro_leaderboard_submission

Viewer • Updated 3 days ago • 226 • 166

upvoted a paper 11 days ago

BrowseComp-Plus: A More Fair and Transparent Evaluation Benchmark of Deep-Research Agent

Paper • 2508.06600 • Published 16 days ago • 36

updated 2 datasets 20 days ago

ubowang/agent_cpt_0802

Updated 20 days ago • 51

ubowang/test_data_0804

Viewer • Updated 20 days ago • 4.42k • 25

published a dataset 20 days ago

ubowang/test_data_0804

Viewer • Updated 20 days ago • 4.42k • 25

published a dataset 22 days ago

ubowang/agent_cpt_0802

Updated 20 days ago • 51

updated a dataset about 1 month ago

ubowang/critique_rl

Preview • Updated Jul 11 • 56

upvoted a paper about 2 months ago

Agent KB: Leveraging Cross-Domain Experience for Agentic Problem Solving

Paper • 2507.06229 • Published Jul 8 • 73

published a dataset about 2 months ago

ubowang/critique_rl

Preview • Updated Jul 11 • 56

updated a model 2 months ago

ubowang/qwen3_4b_cft_ckpt40

4B • Updated Jun 21 • 6

published a model 2 months ago

ubowang/qwen3_4b_cft_ckpt40

4B • Updated Jun 21 • 6

New activity in TIGER-Lab/One-Shot-CFT-Logic-Qwen-7B-TimeArithmetic 3 months ago

Add pipeline_tag and library_name

#1 opened 3 months ago by

New activity in TIGER-Lab/One-Shot-CFT-Logic-Qwen-7B-DisambiguationQA 3 months ago

Improve model card: Add pipeline tag, library name and license

#1 opened 3 months ago by

New activity in TIGER-Lab/One-Shot-CFT-Logic-Qwen-7B-CausalUnderstanding 3 months ago

Add library_name, pipeline_tag and license

#1 opened 3 months ago by

New activity in TIGER-Lab/One-Shot-CFT-Math-Qwen-14B 3 months ago

Add pipeline tag and library name

#1 opened 3 months ago by

New activity in TIGER-Lab/One-Shot-CFT-Math-Llama-3B 3 months ago

Improve model card: add pipeline tag, library name and license

#1 opened 3 months ago by

New activity in TIGER-Lab/One-Shot-CFT-Data 3 months ago

Add question-answering task category

#1 opened 3 months ago by

New activity in TIGER-Lab/One-Shot-CFT-Math-Qwen-7B 3 months ago

Add pipeline tag, library name and license

#1 opened 3 months ago by

New activity in TIGER-Lab/One-Shot-CFT-Math-Qwen-1.5B 3 months ago

Improve model card metadata: add pipeline tag and library name

#1 opened 3 months ago by