18 84

cagatay odabasi

cagatayodabasi · cagbal

AI & ML interests

None yet

Recent Activity

liked a Space about 5 hours ago

VAST-AI/MV-Adapter-Img2Texture

cagatayodabasi's activity

upvoted a paper 12 days ago

Cosmos World Foundation Model Platform for Physical AI

Paper • 2501.03575 • Published Jan 7 • 76

upvoted a collection 12 days ago

Physical AI

Collection of commercial-grade datasets for physical AI developers • 10 items • Updated 4 days ago • 34

upvoted a paper about 2 months ago

SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training

Paper • 2501.17161 • Published Jan 28 • 118

upvoted a collection 4 months ago

PixMo

A set of vision-language datasets built by Ai2 and used to train the Molmo family of models. Read more at https://molmo.allenai.org/blog • 10 items • Updated 17 days ago • 68

upvoted a paper 4 months ago

LLaVA-o1: Let Vision Language Models Reason Step-by-Step

Paper • 2411.10440 • Published Nov 15, 2024 • 123

upvoted a collection 7 months ago

Theia

Distilling Diverse Vision Foundation Models for Robot Learning • 6 items • Updated Sep 30, 2024 • 9

upvoted 2 papers 7 months ago

Sapiens: Foundation for Human Vision Models

Paper • 2408.12569 • Published Aug 22, 2024 • 91

3D-VLA: A 3D Vision-Language-Action Generative World Model

Paper • 2403.09631 • Published Mar 14, 2024 • 10

upvoted a collection 8 months ago

Minitron

A family of compressed models obtained via pruning and knowledge distillation • 12 items • Updated 4 days ago • 60

upvoted 6 papers 8 months ago

OpenResearcher: Unleashing AI for Accelerated Scientific Research

Paper • 2408.06941 • Published Aug 13, 2024 • 32

DC3DO: Diffusion Classifier for 3D Objects

Paper • 2408.06693 • Published Aug 13, 2024 • 11

Imagen 3

Paper • 2408.07009 • Published Aug 13, 2024 • 61

Task-oriented Sequential Grounding in 3D Scenes

Paper • 2408.04034 • Published Aug 7, 2024 • 8

Optimus-1: Hybrid Multimodal Memory Empowered Agents Excel in Long-Horizon Tasks

Paper • 2408.03615 • Published Aug 7, 2024 • 31

Achieving Human Level Competitive Robot Table Tennis

Paper • 2408.03906 • Published Aug 7, 2024 • 27

upvoted an article 8 months ago

Vision Language Models Explained

Article • Apr 11, 2024 • 294

upvoted a paper over 1 year ago

Zero-Shot Metric Depth with a Field-of-View Conditioned Diffusion Model

Paper • 2312.13252 • Published Dec 20, 2023 • 28