Diffusion Classifiers Understand Compositionality, but Conditions Apply Paper • 2505.17955 • Published May 23 • 21
Spatial-MLLM: Boosting MLLM Capabilities in Visual-based Spatial Intelligence Paper • 2505.23747 • Published May 2025 • 67
Absolute Zero: Reinforced Self-play Reasoning with Zero Data Paper • 2505.03335 • Published May 6 • 175
Efficient Process Reward Model Training via Active Learning Paper • 2504.10559 • Published Apr 14 • 13
InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models Paper • 2504.10479 • Published Apr 14 • 274
Advances and Challenges in Foundation Agents: From Brain-Inspired Intelligence to Evolutionary, Collaborative, and Safe Systems Paper • 2504.01990 • Published Mar 31 • 291
One-Step Residual Shifting Diffusion for Image Super-Resolution via Distillation Paper • 2503.13358 • Published Mar 17 • 96
Cohere Labs Aya Vision Collection Aya Vision is a state-of-the-art family of vision models that brings multimodal capabilities to 23 languages. • 5 items • Updated Apr 15 • 68
Article: A Deepdive into Aya Vision: Advancing the Frontier of Multilingual Multimodality By saurabhdash and 3 others • Mar 4 • 75
Unified Reward Model for Multimodal Understanding and Generation Paper • 2503.05236 • Published Mar 7 • 124
Token-Efficient Long Video Understanding for Multimodal LLMs Paper • 2503.04130 • Published Mar 6 • 95
SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution Paper • 2502.18449 • Published Feb 25 • 74
DICEPTION: A Generalist Diffusion Model for Visual Perceptual Tasks Paper • 2502.17157 • Published Feb 24 • 53