199 24 38

Raushan Turganbay

RaushanTurganbay

zucchini-nlp

AI & ML interests

Generation and Multimodality

Recent Activity

updated a model 6 days ago

hf-internal-testing/tiny-random-BlipForConditionalGeneration

updated a model 11 days ago

llava-hf/llava-1.5-7b-hf

new activity 12 days ago

Hcompany/Holo1-3B:Wrong co ordinates return

View all activity

Organizations

RaushanTurganbay's activity

upvoted a collection 21 days ago

Releases 23 May

Collection

34 items • Updated 21 days ago • 8

upvoted a changelog 25 days ago

Changelog

AI-generated Abstract summaries on Hugging Face Papers

25 days ago

• 70

upvoted an article 28 days ago

Article

NVIDIA Cosmos Now Available On Hugging Face For Physical AI Reasoning

and 1 other •

28 days ago

• 24

upvoted an article about 1 month ago

Article

Page-to-Video: Generate videos from webpages 🪄🎬

•

May 6

• 27

upvoted an article 3 months ago

Article

Tensor Parallelism

•

Aug 20, 2024

• 18

upvoted a paper 4 months ago

Qwen2.5-VL Technical Report

Paper • 2502.13923 • Published Feb 19 • 193

upvoted 2 articles 4 months ago

Article

What's Automatic Differentiation?

•

Mar 19, 2024

• 15

Article

π0 and π0-FAST: Vision-Language-Action Models for General Robot Control

and 3 others •

Feb 4

• 163

upvoted an article 5 months ago

Article

Mastering Long Contexts in LLMs with KVPress

and 1 other •

Jan 23

• 68

upvoted a paper 8 months ago

LongVU: Spatiotemporal Adaptive Compression for Long Video-Language Understanding

Paper • 2410.17434 • Published Oct 22, 2024 • 30

upvoted an article 9 months ago

Article

Saving Memory Using Padding-Free Transformer Layers during Finetuning

•

Jun 11, 2024

• 18

upvoted a collection 9 months ago

Molmo

Collection

Artifacts for open multimodal language models. • 5 items • Updated Apr 30 • 305

upvoted an article 9 months ago

Article

Key Insights into the Law of Vision Representations in MLLMs

•

Sep 2, 2024

• 18

upvoted a paper 9 months ago

Paper Copilot: A Self-Evolving and Efficient LLM System for Personalized Academic Assistance

Paper • 2409.04593 • Published Sep 6, 2024 • 27

upvoted a collection 10 months ago

Vision Language Models Papers 🖼️💬📝

Collection

Papers about vision-language models, most important ones are on top of the list. • 27 items • Updated Apr 30, 2024 • 37

upvoted an article 10 months ago

Article

Introduction to ggml

and 2 others •

Aug 13, 2024

• 206

upvoted 3 papers 10 months ago

mPLUG-Owl3: Towards Long Image-Sequence Understanding in Multi-Modal Large Language Models

Paper • 2408.04840 • Published Aug 9, 2024 • 35

VITA: Towards Open-Source Interactive Omni Multimodal LLM

Paper • 2408.05211 • Published Aug 9, 2024 • 50

LLaVA-OneVision: Easy Visual Task Transfer

Paper • 2408.03326 • Published Aug 6, 2024 • 61

upvoted a paper 11 months ago

SlowFast-LLaVA: A Strong Training-Free Baseline for Video Large Language Models

Paper • 2407.15841 • Published Jul 22, 2024 • 41