Dan Zhang

zd21 · https://zhangdan0602.github.io/

AI & ML interests

None yet

Organizations

None yet

upvoted a paper 2 months ago

TDRM: Smooth Reward Models with Temporal Difference for LLM RL and Inference

Paper • 2509.15110 • Published Sep 18, 2025 • 1

upvoted 2 collections 6 months ago

TDRM

Learning Smooth Reward Models with Temporal Difference for LLM RL and Inference • 15 items • Updated Nov 12, 2025 • 2

GLM-4.5

GLM-4.5: An open-source large language model designed for intelligent agents by Z.ai • 11 items • Updated Aug 11, 2025 • 252

upvoted a paper about 1 year ago

AndroidLab: Training and Systematic Benchmarking of Android Autonomous Agents

Paper • 2410.24024 • Published Oct 31, 2024 • 49