Omar Sanseviero's picture

Omar Sanseviero

osanseviero

·

https://osanseviero.github.io/hackerllama/

AI & ML interests

Llamas, model merging, massive ASR for data collection, 3D ML, on-device ML, quantization, model judging, ML in browser, healthcare applications, education, intersection of art and ML.🦙

Recent Activity

liked a model 9 days ago

google/videoprism-lvt-large-f8r288

new activity 15 days ago

google/magenta-realtime:Update library tag

liked a model 20 days ago

google/t5gemma-2b-2b-ul2

View all activity

Organizations

upvoted a collection 20 days ago

T5Gemma

32 items • Updated 20 days ago • 59

upvoted an article about 1 month ago

Article

Gemma 3n fully available in the open-source ecosystem!

By

and 7 others •

Jun 26

• 113

upvoted a paper about 1 month ago

VideoPrism: A Foundational Visual Encoder for Video Understanding

Paper • 2402.13217 • Published Feb 20, 2024 • 35

upvoted a changelog about 2 months ago

Changelog

New Inference Providers Dashboard

Jun 5

• 61

upvoted a collection about 2 months ago

GRMR V3 Models

An improved set of models for grammar correction. (Chat template should work, no "responding as an LLM" anymore, that kind of stuff). • 6 items • Updated Jun 4 • 10

upvoted a paper about 2 months ago

One RL to See Them All: Visual Triple Unified Reinforcement Learning

Paper • 2505.18129 • Published May 23 • 60

upvoted an article about 2 months ago

Article

The Transformers Library: standardizing model definitions

By

and 3 others •

May 15

• 116

upvoted 2 collections 2 months ago

MedGemma Release

Collection of Gemma 3 variants for performance on medical text and image comprehension to accelerate building healthcare-based AI applications. • 7 items • Updated 18 days ago • 263

Gemma 3n Preview

4 items • Updated 20 days ago • 162

upvoted an article 3 months ago

Article

17 Reasons Why Gradio Isn't Just Another UI Library

By

and 1 other •

Apr 16

• 41

upvoted a collection 3 months ago

Gemma 3 QAT

Quantization Aware Trained (QAT) Gemma 3 checkpoints. The model preserves similar quality as half precision while using 3x less memory. • 19 items • Updated Apr 18 • 28

upvoted a paper 3 months ago

InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models

Paper • 2504.10479 • Published Apr 14 • 279

upvoted an article 4 months ago

Article

The Large Language Model Course

By

•

Jan 16

• 197

upvoted a collection 4 months ago

Gemma 3 QAT

Quantization Aware Trained (QAT) Gemma 3 checkpoints. The model preserves similar quality as half precision while using 3x less memory • 15 items • Updated 20 days ago • 207

upvoted an article 4 months ago

Article

Custom Vibe Coding Quest Part 2: 🚙 Fine-Tuning Gemma 3 for Code Reasoning

By

•

Apr 1

• 25

upvoted a paper 4 months ago

Gemma 3 Technical Report

Paper • 2503.19786 • Published Mar 25 • 53

upvoted a collection 5 months ago

Gemma 3 Release

24 items • Updated 20 days ago • 425

upvoted an article 5 months ago

Article

Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM

By

and 3 others •

Mar 12

• 447

upvoted a collection 5 months ago

Cohere Labs Aya Vision

Aya Vision is a state-of-the-art family of vision models that brings multimodal capabilities to 23 languages. • 5 items • Updated Apr 15 • 69

upvoted an article 5 months ago

Article

A Deepdive into Aya Vision: Advancing the Frontier of Multilingual Multimodality

By

and 3 others •

Mar 4

• 75