Shizhe Diao

shizhediao2

39 43 33

https://shizhediao.github.io/

AI & ML interests

LLM pre-training and reasoning

Recent Activity

upvoted a paper about 2 months ago

AutoLab: Can Frontier Models Solve Long-Horizon Auto Research and Engineering Tasks?

Paper • 2606.05080 • Published Jun 3 • 31

upvoted a paper 2 months ago

ScientistOne: Towards Human-Level Autonomous Research via Chain-of-Evidence

Paper • 2605.26340 • Published May 25 • 37

liked a model 3 months ago

nvidia/nemotron-climb-fasttext-classifiers

Updated May 11 • 14

liked a dataset 5 months ago

karpathy/climbmix-400b-shuffle

Preview • Updated Mar 3 • 38.8k • 57

upvoted 2 papers 5 months ago

MetaClaw: Just Talk -- An Agent That Meta-Learns and Evolves in the Wild

Paper • 2603.17187 • Published Mar 17 • 141

VGGT-Det: Mining VGGT Internal Priors for Sensor-Geometry-Free Multi-View Indoor 3D Object Detection

Paper • 2603.00912 • Published Mar 1 • 40

liked a dataset 5 months ago

nvidia/ProfBench

Viewer • Updated Mar 4 • 40 • 381 • 29

upvoted 2 papers 5 months ago

Query-focused and Memory-aware Reranker for Long Context Processing

Paper • 2602.12192 • Published Feb 12 • 58

On Data Engineering for Scaling LLM Terminal Capabilities

Paper • 2602.21193 • Published Feb 24 • 103

upvoted an article 5 months ago

Can Your LLM Think Like a Professional? Introducing ProfBench

Article • nvidia • Oct 28, 2025 • 21

upvoted 2 papers 6 months ago

PhyCritic: Multimodal Critic Models for Physical AI

Paper • 2602.11124 • Published Feb 11 • 55

Golden Goose: A Simple Trick to Synthesize Unlimited RLVR Tasks from Unverifiable Internet Text

Paper • 2601.22975 • Published Jan 30 • 113

upvoted a paper 7 months ago

GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization

Paper • 2601.05242 • Published Jan 8 • 235

liked a model 7 months ago

nvidia/Nemotron-Flash-1B

Text Generation • 1.0B • Updated Jan 9 • 553 • 30

updated a dataset 8 months ago

nvidia/ToolScale

Viewer • Updated Dec 17, 2025 • 4.06k • 1.73k • 202

New activity in nvidia/ToolScale 8 months ago

Add metadata and refactor to ToolScale Dataset Card

#3 opened 8 months ago by nielsr

posted an update 8 months ago

Post • 189

ToolOrchestra: Elevating Intelligence via Efficient Model and Tool Orchestration (2511.21689)

reacted to di-zhang-fdu's post with 🔥 8 months ago

Post • 1951

ToolOrchestra: Elevating Intelligence via Efficient Model and Tool Orchestration (2511.21689)

upvoted 2 papers 8 months ago

Nemotron-Flash: Towards Latency-Optimal Hybrid Small Language Models

Paper • 2511.18890 • Published Nov 24, 2025 • 37

ToolOrchestra: Elevating Intelligence via Efficient Model and Tool Orchestration

Paper • 2511.21689 • Published Nov 26, 2025 • 129