Tianbao Xie PRO

tianbaoxiexxx

https://tianbaoxie.com

AI & ML interests

NLP, AI, RL, Robotics

tianbaoxiexxx's activity

upvoted a paper 5 days ago

Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning

Paper • 2506.01939 • Published 6 days ago • 132

upvoted a paper 6 days ago

ProRL: Prolonged Reinforcement Learning Expands Reasoning Boundaries in Large Language Models

Paper • 2505.24864 • Published 9 days ago • 115

upvoted a paper 8 days ago

One-shot Entropy Minimization

Paper • 2505.20282 • Published 13 days ago • 7

upvoted a paper 10 days ago

ZeroGUI: Automating Online GUI Learning at Zero Human Cost

Paper • 2505.23762 • Published 10 days ago • 45

upvoted a paper 11 days ago

ScienceBoard: Evaluating Multimodal Autonomous Agents in Realistic Scientific Workflows

Paper • 2505.19897 • Published 13 days ago • 101

upvoted 2 papers 19 days ago

Fractured Chain-of-Thought Reasoning

Paper • 2505.12992 • Published 20 days ago • 21

Scaling Computer-Use Grounding via User Interface Decomposition and Synthesis

Paper • 2505.13227 • Published 20 days ago • 45

upvoted a paper about 1 month ago

Scalable Chain of Thoughts via Elastic Reasoning

Paper • 2505.05315 • Published May 8 • 24

upvoted 2 papers 3 months ago

VisualPRM: An Effective Process Reward Model for Multimodal Reasoning

Paper • 2503.10291 • Published Mar 13 • 36

START: Self-taught Reasoner with Tools

Paper • 2503.04625 • Published Mar 6 • 114

upvoted 3 papers 4 months ago

Qwen2.5-VL Technical Report

Paper • 2502.13923 • Published Feb 19 • 191

Magma: A Foundation Model for Multimodal AI Agents

Paper • 2502.13130 • Published Feb 18 • 58

Teaching Language Models to Critique via Reinforcement Learning

Paper • 2502.03492 • Published Feb 5 • 24

upvoted an article 5 months ago

✴️ ScreenSpot-Pro: GUI Grounding for Professional High-Resolution Computer Use

Article • Jan 3 • 18

upvoted a paper 5 months ago

OS-Genesis: Automating GUI Agent Trajectory Construction via Reverse Task Synthesis

Paper • 2412.19723 • Published Dec 27, 2024 • 88

upvoted 3 papers 6 months ago

AgentTrek: Agent Trajectory Synthesis via Guiding Replay with Web Tutorials

Paper • 2412.09605 • Published Dec 12, 2024 • 30

Expanding Performance Boundaries of Open-Source Multimodal Models with Model, Data, and Test-Time Scaling

Paper • 2412.05271 • Published Dec 6, 2024 • 158

Aguvis: Unified Pure Vision Agents for Autonomous GUI Interaction

Paper • 2412.04454 • Published Dec 5, 2024 • 66

upvoted 2 papers 7 months ago

Enhancing the Reasoning Ability of Multimodal Large Language Models via Mixed Preference Optimization

Paper • 2411.10442 • Published Nov 15, 2024 • 80

The Dawn of GUI Agent: A Preliminary Case Study with Claude 3.5 Computer Use

Paper • 2411.10323 • Published Nov 15, 2024 • 35