Pengxiang Li
pengxiang
AI & ML interests
Video generation, Image editing, AD
Recent Activity
upvoted a paper 3 days ago
GUI-Reflection: Empowering Multimodal GUI Models with Self-Reflection Behavior
upvoted a paper 3 days ago
BitVLA: 1-bit Vision-Language-Action Models for Robotics Manipulation
commented on a paper 14 days ago
Adaptive Classifier-Free Guidance via Dynamic Low-Confidence Masking
Organizations
None yet
Collections
2
- Direct-a-Video: Customized Video Generation with User-Directed Camera Movement and Object Motion
  Paper • 2402.03162 • Published • 19
- ShortGPT: Layers in Large Language Models are More Redundant Than You Expect
  Paper • 2403.03853 • Published • 65
- OpenVid-1M: A Large-Scale High-Quality Dataset for Text-to-video Generation
  Paper • 2407.02371 • Published • 55
- Large Language Diffusion Models
  Paper • 2502.09992 • Published • 120
Models
10

pengxiang/Qwen2.5-1.5B-Open-R1-Distill-loop • Updated • 9
pengxiang/Qwen2.5-1.5B-Open-R1-Distill • Text Generation • Updated • 8
pengxiang/Qwen2.5-1.5B-Open-R1-GRPO • Updated
pengxiang/LNS_1B • Updated • 12 • 1
pengxiang/TrackDiffusion_SVD_Stage2 • Text-to-Video • Updated
pengxiang/TrackDiffusion_SVD_Stage1 • Text-to-Video • Updated
pengxiang/TrackDiffusion_Pretrain • Updated • 8 • 1
pengxiang/GLIGEN_1_4 • Updated • 9
pengxiang/TrackDiffusion_ModelScope • Text-to-Video • Updated
pengxiang/trackdiffusion_ytvis • Text-to-Video • Updated • 2
Datasets
16
pengxiang/coins_new • Viewer • Updated • 4.91k • 2.23k
pengxiang/COIN • Viewer • Updated • 528 • 10
pengxiang/tvqa • Preview • Updated • 14
pengxiang/COINs • Viewer • Updated • 1.59k • 1.06k
pengxiang/sthv2 • Updated • 10
pengxiang/youcook2 • Updated • 69
pengxiang/UVO • Viewer • Updated • 799 • 9
pengxiang/youcook • Viewer • Updated • 407 • 872
pengxiang/clevrer • Viewer • Updated • 10k • 8
pengxiang/oops • Updated • 9