kang zhu

kangz

AI & ML interests

None yet

Recent Activity

authored a paper about 22 hours ago

Chain-of-Agents: End-to-End Agent Foundation Models via Multi-Agent Distillation and Agentic RL

Organizations

upvoted a paper 1 day ago

Chain-of-Agents: End-to-End Agent Foundation Models via Multi-Agent Distillation and Agentic RL

Paper • 2508.13167 • Published 15 days ago • 78

upvoted a paper 9 days ago

WideSearch: Benchmarking Agentic Broad Info-Seeking

Paper • 2508.07999 • Published 10 days ago • 103

upvoted 2 papers 14 days ago

VeriGUI: Verifiable Long-Chain GUI Dataset

Paper • 2508.04026 • Published 15 days ago • 137

Efficient Agents: Building Effective Agents While Reducing Cost

Paper • 2508.02694 • Published 28 days ago • 81

upvoted a paper 20 days ago

Seed-Prover: Deep and Broad Reasoning for Automated Theorem Proving

Paper • 2507.23726 • Published 21 days ago • 108

upvoted 4 papers about 1 month ago

upvoted 3 papers about 2 months ago

SciMMIR: Benchmarking Scientific Multi-modal Information Retrieval

Paper • 2401.13478 • Published Jan 24, 2024 • 3

PIN: A Knowledge-Intensive Dataset for Paired and Interleaved Multimodal Documents

Paper • 2406.13923 • Published Jun 20, 2024 • 24

OAgents: An Empirical Study of Building Effective Agents

Paper • 2506.15741 • Published Jun 17 • 35

upvoted 3 papers 2 months ago

PersonaFeedback: A Large-scale Human-annotated Benchmark For Personalization

Paper • 2506.12915 • Published Jun 15 • 21

TaskCraft: Automated Generation of Agentic Tasks

Paper • 2506.10055 • Published Jun 11 • 32

Scaling Test-time Compute for LLM Agents

Paper • 2506.12928 • Published Jun 15 • 61

upvoted 2 papers 4 months ago

IV-Bench: A Benchmark for Image-Grounded Video Perception and Reasoning in Multimodal LLMs

Paper • 2504.15415 • Published Apr 21 • 22

COIG-P: A High-Quality and Large-Scale Chinese Preference Dataset for Alignment with Human Values

Paper • 2504.05535 • Published Apr 7 • 45

upvoted a paper 6 months ago

SuperGPQA: Scaling LLM Evaluation across 285 Graduate Disciplines

Paper • 2502.14739 • Published Feb 20 • 106

upvoted a paper 8 months ago

MAmmoTH-VL: Eliciting Multimodal Reasoning with Instruction Tuning at Scale

Paper • 2412.05237 • Published Dec 6, 2024 • 48

upvoted a paper 10 months ago

CMMMU: A Chinese Massive Multi-discipline Multimodal Understanding Benchmark

Paper • 2401.11944 • Published Jan 22, 2024 • 28
