DX Y
HeartofSheep
AI & ML interests
None yet
Recent Activity
updated
a collection
6 days ago
VLMs
updated
a collection
6 days ago
VLMs
liked
a model
12 days ago
THUDM/CogView4-6B
Organizations
None yet
Collections
4
-
DeepPerception: Advancing R1-like Cognitive Visual Perception in MLLMs for Knowledge-Intensive Visual Grounding
Paper • 2503.12797 • Published • 28 -
CapArena: Benchmarking and Analyzing Detailed Image Captioning in the LLM Era
Paper • 2503.12329 • Published • 23 -
GoT: Unleashing Reasoning Capability of Multimodal Large Language Model for Visual Generation and Editing
Paper • 2503.10639 • Published • 45
models
None public yet
datasets
None public yet