11 62 54

Pengxiang Li

pengxiang

pixeli99

AI & ML interests

Video generation, Image editing, AD

Recent Activity

upvoted a paper 4 days ago

GUI-Reflection: Empowering Multimodal GUI Models with Self-Reflection Behavior

upvoted a paper 4 days ago

BitVLA: 1-bit Vision-Language-Action Models for Robotics Manipulation

commented on a paper 15 days ago

Adaptive Classifier-Free Guidance via Dynamic Low-Confidence Masking

View all activity

Organizations

None yet

pengxiang's activity

upvoted 2 papers 4 days ago

GUI-Reflection: Empowering Multimodal GUI Models with Self-Reflection Behavior

Paper • 2506.08012 • Published 4 days ago • 7

BitVLA: 1-bit Vision-Language-Action Models for Robotics Manipulation

Paper • 2506.07530 • Published 5 days ago • 18

commented a paper 15 days ago

Adaptive Classifier-Free Guidance via Dynamic Low-Confidence Masking

Paper • 2505.20199 • Published 19 days ago • 2 •

upvoted 2 papers 22 days ago

LLaDA-V: Large Language Diffusion Models with Visual Instruction Tuning

Paper • 2505.16933 • Published 23 days ago • 30

LaViDa: A Large Diffusion Language Model for Multimodal Understanding

Paper • 2505.16839 • Published 23 days ago • 12

upvoted a paper 23 days ago

MMaDA: Multimodal Large Diffusion Language Models

Paper • 2505.15809 • Published 23 days ago • 88

upvoted a paper about 1 month ago

Reinforcement Learning for Reasoning in Large Language Models with One Training Example

Paper • 2504.20571 • Published Apr 29 • 94

upvoted a paper about 2 months ago

Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model?

Paper • 2504.13837 • Published Apr 18 • 128

updated 2 models about 2 months ago

pengxiang/Qwen2.5-1.5B-Open-R1-Distill-loop

Updated Apr 25 • 9

pengxiang/Qwen2.5-1.5B-Open-R1-Distill

Text Generation • Updated Apr 25 • 9

liked a dataset about 2 months ago

Anthropic/values-in-the-wild

Viewer • Updated Apr 28 • 6.91k • 353 • 131

updated 2 models about 2 months ago

pengxiang/Qwen2.5-1.5B-Open-R1-Distill-loop

Updated Apr 25 • 9

pengxiang/Qwen2.5-1.5B-Open-R1-Distill

Text Generation • Updated Apr 25 • 9

published a model about 2 months ago

pengxiang/Qwen2.5-1.5B-Open-R1-Distill-loop

Updated Apr 25 • 9

updated a model about 2 months ago

pengxiang/Qwen2.5-1.5B-Open-R1-Distill

Text Generation • Updated Apr 25 • 9

published 2 models about 2 months ago

pengxiang/Qwen2.5-1.5B-Open-R1-Distill

Text Generation • Updated Apr 25 • 9

pengxiang/Qwen2.5-1.5B-Open-R1-GRPO

Updated Apr 23

updated a dataset about 2 months ago

pengxiang/coins_new

Viewer • Updated Apr 23 • 4.91k • 2.23k

authored a paper about 2 months ago

InfiGUI-R1: Advancing Multimodal GUI Agents from Reactive Actors to Deliberative Reasoners

Paper • 2504.14239 • Published Apr 19 • 13

updated a dataset about 2 months ago

pengxiang/COIN

Viewer • Updated Apr 22 • 528 • 10