Omar Sanseviero

osanseviero

https://osanseviero.github.io/hackerllama/

AI & ML interests

Llamas, model merging, massive ASR for data collection, 3D ML, on-device ML, quantization, model judging, ML in browser, healthcare applications, education, intersection of art and ML.🦙

Recent Activity

liked a model 3 days ago

google/gemma-3-270m-it

liked a model 3 days ago

google/gemma-3-270m

published a model 4 days ago

google/gemma-3-270m-qat-q4_0-unquantized

Organizations

Posts 19

Post

14607

Diaries of Open Source. Part 15 🤗

🕵️‍♀️Idefics 2 is out, a multimodal open-source model with very nice capabilities
Models, demo, and datasets: HuggingFaceM4/idefics2-661d1971b7c50831dd3ce0fe
Blog: https://hf.co/blog/idefics2

💾Snowflake released snowflake-arctic-embed, a family of powerful small embedding models
Model: Snowflake/snowflake-arctic-embed-m
Blog: https://www.snowflake.com/blog/introducing-snowflake-arctic-embed-snowflakes-state-of-the-art-text-embedding-family-of-models/

✨Pile-T5, EleutherAI's T5 model trained on 2T tokens
Blog: https://blog.eleuther.ai/pile-t5/
Models: EleutherAI/pile-t5-65a76a0d0022dd270b385a66
GitHub: https://github.com/EleutherAI/improved-t5

🤖CodeQwen1.5-7B base and chat models, trained on 3T tokens, with strong benchmark results for code generation, editing, and SQL
Blog post: https://qwenlm.github.io/blog/codeqwen1.5/
Demo: https://hf.co/spaces/Qwen/CodeQwen1.5-7b-Chat-demo
Models: Qwen/CodeQwen1.5-7B and Qwen/CodeQwen1.5-7B-Chat

Misc
🦉 DocOwl1.5: Unified Structure Learning for OCR-free Document Understanding mPLUG/DocOwl
👀Cerule - a tiny Vision LM model Tensoic/Cerule-v0.1
⚗️ChemLLM - an LLM for chemistry and molecule science https://hf.co/AI4Chem/ChemLLM-7B-Chat-1.5-DPO
Distil Whisper Large
📝New PDF/OCR datasets with 19 samples pixparse/pdf-document-ocr-datasets-660701430b0346f97c4bc628
🔥Gretel AI high-quality text-to-SQL synthetic dataset gretelai/synthetic_text_to_sql

Articles 26

Article

191

Llama can now see and run on your device - welcome Llama 3.2

Collections 13

View 13 collections

Papers 5

arxiv:2503.19786

arxiv:2310.16944

arxiv:2303.12582

arxiv:2211.05100

Spaces 180

InstantCoder

Generate app code from ideas

Co2 Estimator

Estimate CO2 activities from an image

How Much Do I Cost

Distilabel Dataset Generator

Create datasets with FAQs and SFT prompts

Mistral Super Fast

Non Streaming Example

Models 300

osanseviero/qwen2.5-0.5b-instruct-q2_K

0.5B • Updated Oct 10, 2024 • 10 • 1

osanseviero/o-blob-3.2

1B • Updated Oct 10, 2024 • 7

osanseviero/test-in-go7

Updated Oct 8, 2024

osanseviero/test-in-go6

Updated Oct 8, 2024

osanseviero/test-in-go5

Updated Oct 8, 2024

osanseviero/Reflection-Llama-3.1-70B-GGUF

Text Generation • 71B • Updated Sep 16, 2024 • 43

osanseviero/test-in-go4

Updated Sep 13, 2024

osanseviero/test-in-go3

Updated Sep 13, 2024

osanseviero/test-in-go

Updated Sep 12, 2024

osanseviero/test-repo-go2

Updated Sep 12, 2024

Datasets 38

osanseviero/super-fun-llamas

Viewer • Updated Sep 13, 2024 • 10 • 16 • 1

osanseviero/fun_llamas

Viewer • Updated Sep 12, 2024 • 50 • 14

osanseviero/my-llamas

Viewer • Updated Sep 11, 2024 • 100 • 10

osanseviero/bill_summary_us_chunks-similarity

Viewer • Updated Jul 12, 2024 • 2k • 13

osanseviero/bill_summary_us_chunks

Viewer • Updated Jul 12, 2024 • 3.45M • 17

osanseviero/testing_geospatial

Updated Jul 8, 2024 • 4

osanseviero/ag_misclassifications

Viewer • Updated Oct 8, 2023 • 200 • 12

osanseviero/test_hacks

Updated Apr 28, 2023 • 6

osanseviero/example_ola

Viewer • Updated Mar 24, 2023 • 2 • 2

osanseviero/langchain_hub_test

Viewer • Updated Jan 30, 2023 • 1 • 7
