Zhuokai Zhao

zhuokai

https://zhuokai-zhao.com/

AI & ML interests

Data-Efficient Learning, LLM Reasoning and Safety, Active Learning, Recommender System

Recent Activity

authored a paper 27 days ago

Scaling Agent Learning via Experience Synthesis

upvoted a paper 29 days ago

Scaling Agent Learning via Experience Synthesis

authored a paper about 1 month ago

From Uncertainty to Trust: Enhancing Reliability in Vision-Language Models with Uncertainty-Guided Dropout Decoding

authored a paper 27 days ago

Scaling Agent Learning via Experience Synthesis

Paper • 2511.03773 • Published Nov 5 • 80

authored 10 papers about 1 month ago

From Uncertainty to Trust: Enhancing Reliability in Vision-Language Models with Uncertainty-Guided Dropout Decoding

Paper • 2412.06474 • Published Dec 9, 2024

CAFe: Unifying Representation and Generation with Contrastive-Autoregressive Finetuning

Paper • 2503.19900 • Published Mar 25

Boosting LLM Reasoning via Spontaneous Self-Correction

Paper • 2506.06923 • Published Jun 7

RecoWorld: Building Simulated Environments for Agentic Recommender Systems

Paper • 2509.10397 • Published Sep 12 • 7

StreamMem: Query-Agnostic KV Cache Memory for Streaming Video Understanding

Paper • 2508.15717 • Published Aug 21 • 1

Let it Calm: Exploratory Annealed Decoding for Verifiable Reinforcement Learning

Paper • 2510.05251 • Published Oct 6 • 7

Thought Communication in Multiagent Collaboration

Paper • 2510.20733 • Published Oct 23 • 14

authored a paper 9 months ago

HumanMM: Global Human Motion Recovery from Multi-shot Videos

Paper • 2503.07597 • Published Mar 10 • 2

authored a paper about 1 year ago

Quantifying Generalization Complexity for Large Language Models

Paper • 2410.01769 • Published Oct 2, 2024 • 13

authored 7 papers over 1 year ago

MJ-Bench: Is Your Multimodal Reward Model Really a Good Judge for Text-to-Image Generation?

Paper • 2407.04842 • Published Jul 5, 2024 • 56

HALC: Object Hallucination Reduction via Adaptive Focal-Contrast Decoding

Paper • 2403.00425 • Published Mar 1, 2024 • 1

AutoPRM: Automating Procedural Supervision for Multi-Step Reasoning via Controllable Question Decomposition

Paper • 2402.11452 • Published Feb 18, 2024 • 1

Safe Reinforcement Learning via Hierarchical Adaptive Chance-Constraint Safeguards

Paper • 2310.03379 • Published Oct 5, 2023

Multi-Modality Guidance Network For Missing Modality Inference

Paper • 2309.03452 • Published Sep 7, 2023

Evaluating Machine Learning Models with NERO: Non-Equivariance Revealed on Orbits

Paper • 2305.19889 • Published May 31, 2023

Breaking the Curse of Quality Saturation with User-Centric Ranking

Paper • 2305.15333 • Published May 24, 2023
