Yoni Gozlan

yonigozlan

AI & ML interests

Vision, Multimodal, Pose Estimation

Recent Activity

new activity 15 days ago

facebook/sam3: 'Sam3Processor' 'transformers Error

new activity 20 days ago

facebook/sam3: can run Streaming Video Inference

upvoted an article 22 days ago

Article

Transformers v5: Simple model definitions powering the AI ecosystem

24 days ago • 253

upvoted an article 5 months ago

Article

Welcome GPT OSS, the new open-source model family from OpenAI!

Aug 5 • 508

upvoted a collection 7 months ago

V-JEPA 2

A frontier video understanding model developed by FAIR, Meta, which extends the pretraining objectives of https://ai.meta.com/blog/v-jepa-yann • 8 items • Updated Jun 13 • 174

upvoted an article 8 months ago

Article

Vision Language Models (Better, faster, stronger)

May 12 • 572

upvoted a paper 8 months ago

InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models

Paper • 2504.10479 • Published Apr 14 • 306

upvoted an article 10 months ago

Article

A Deepdive into Aya Vision: Advancing the Frontier of Multilingual Multimodality

Mar 4 • 78

upvoted 3 papers about 1 year ago

Phi-4 Technical Report

Paper • 2412.08905 • Published Dec 12, 2024 • 122

GenEx: Generating an Explorable World

Paper • 2412.09624 • Published Dec 12, 2024 • 97

Apollo: An Exploration of Video Understanding in Large Multimodal Models

Paper • 2412.10360 • Published Dec 13, 2024 • 147