A comprehensive framework designed to cultivate VLMs with human-like visuospatial abilities.
Ray Yang
rayruiyang
AI & ML interests
None yet
Recent Activity
liked a dataset 12 days ago
jdopensource/JoyAI-Image-OpenSpatial upvoted a paper 27 days ago
TriAttention: Efficient Long Reasoning with Trigonometric KV Compression upvoted a paper about 1 month ago
ViGoR-Bench: How Far Are Visual Generative Models From Zero-Shot Visual Reasoners?Organizations
None yet