Zhenting Wang

ztwang

AI & ML interests

None yet

Recent Activity

upvoted a paper 2 days ago

Vision-Zero: Scalable VLM Self-Improvement via Strategic Gamified Self-Play

authored a paper 4 days ago

EPO: Entropy-regularized Policy Optimization for LLM Agents Reinforcement Learning

commented on a paper 4 days ago

EPO: Entropy-regularized Policy Optimization for LLM Agents Reinforcement Learning

commented on a paper 4 days ago

EPO: Entropy-regularized Policy Optimization for LLM Agents Reinforcement Learning

Paper • 2509.22576 • Published 7 days ago • 123

commented twice on a paper about 1 month ago

MCP-Bench: Benchmarking Tool-Using LLM Agents with Complex Real-World Tasks via MCP Servers

Paper • 2508.20453 • Published Aug 28 • 62

commented on a paper 6 months ago

DUMP: Automated Distribution-Level Curriculum Learning for RL-based LLM Post-training

Paper • 2504.09710 • Published Apr 13 • 19

commented twice on a paper 8 months ago

The Hidden Life of Tokens: Reducing Hallucination of Large Vision-Language Models via Visual Information Steering

Paper • 2502.03628 • Published Feb 5 • 12

commented on 2 papers 9 months ago

MLLM-as-a-Judge for Image Safety without Human Labeling

Paper • 2501.00192 • Published Dec 31, 2024 • 31

Token-Budget-Aware LLM Reasoning

Paper • 2412.18547 • Published Dec 24, 2024 • 46