11 62 54

Pengxiang Li

pengxiang

pixeli99

AI & ML interests

Video generation, Image editing, AD

Recent Activity

upvoted a paper 4 days ago

GUI-Reflection: Empowering Multimodal GUI Models with Self-Reflection Behavior

upvoted a paper 4 days ago

BitVLA: 1-bit Vision-Language-Action Models for Robotics Manipulation

commented on a paper 15 days ago

Adaptive Classifier-Free Guidance via Dynamic Low-Confidence Masking

View all activity

Organizations

None yet

pengxiang's activity

upvoted 2 papers 4 days ago

GUI-Reflection: Empowering Multimodal GUI Models with Self-Reflection Behavior

Paper • 2506.08012 • Published 4 days ago • 7

BitVLA: 1-bit Vision-Language-Action Models for Robotics Manipulation

Paper • 2506.07530 • Published 5 days ago • 18

upvoted 2 papers 22 days ago

LLaDA-V: Large Language Diffusion Models with Visual Instruction Tuning

Paper • 2505.16933 • Published 23 days ago • 30

LaViDa: A Large Diffusion Language Model for Multimodal Understanding

Paper • 2505.16839 • Published 23 days ago • 12

upvoted a paper 23 days ago

MMaDA: Multimodal Large Diffusion Language Models

Paper • 2505.15809 • Published 23 days ago • 88

upvoted a paper about 1 month ago

Reinforcement Learning for Reasoning in Large Language Models with One Training Example

Paper • 2504.20571 • Published Apr 29 • 94

upvoted 2 papers about 2 months ago

Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model?

Paper • 2504.13837 • Published Apr 18 • 128

InfiGUI-R1: Advancing Multimodal GUI Agents from Reactive Actors to Deliberative Reasoners

Paper • 2504.14239 • Published Apr 19 • 13

upvoted 4 papers 2 months ago

upvoted 4 papers 3 months ago

Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models

Paper • 2503.09573 • Published Mar 12 • 71

Frac-Connections: Fractional Extension of Hyper-Connections

Paper • 2503.14125 • Published Mar 18 • 21

HybridNorm: Towards Stable and Efficient Transformer Training via Hybrid Normalization

Paper • 2503.04598 • Published Mar 6 • 20

Cognitive Behaviors that Enable Self-Improving Reasoners, or, Four Habits of Highly Effective STaRs

Paper • 2503.01307 • Published Mar 3 • 39

upvoted 2 papers 4 months ago

Large Language Diffusion Models

Paper • 2502.09992 • Published Feb 14 • 120

The Curse of Depth in Large Language Models

Paper • 2502.05795 • Published Feb 9 • 40

upvoted 2 papers 5 months ago

UI-TARS: Pioneering Automated GUI Interaction with Native Agents

Paper • 2501.12326 • Published Jan 21 • 62

Inference-Time Scaling for Diffusion Models beyond Scaling Denoising Steps

Paper • 2501.09732 • Published Jan 16 • 72