-
DeepPerception: Advancing R1-like Cognitive Visual Perception in MLLMs for Knowledge-Intensive Visual Grounding
Paper • 2503.12797 • Published • 30 -
MaxyLee/DeepPerception
Image-Text-to-Text • Updated • 24 • 2 -
MaxyLee/KVG-Bench
Viewer • Updated • 1.34k • 64 -
MaxyLee/DeepPerception-FGVR
Image-Text-to-Text • Updated • 15
Xinyu Ma
MaxyLee
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
4 days ago
MiniCPM4: Ultra-Efficient LLMs on End Devices
upvoted
a
paper
about 2 months ago
InternVL3: Exploring Advanced Training and Test-Time Recipes for
Open-Source Multimodal Models
Organizations
None yet