Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
26760000000177.6
TFLOPS
1528
1744
4716
Omar Sanseviero
osanseviero
Follow
commonface's profile picture
Embedwith's profile picture
wath5's profile picture
3078 followers
·
453 following
https://osanseviero.github.io/hackerllama/
osanseviero
osanseviero
omarsanseviero
osanseviero.bsky.social
AI & ML interests
Llamas, model merging, massive ASR for data collection, 3D ML, on-device ML, quantization, model judging, ML in browser, healthcare applications, education, intersection of art and ML.🦙
Recent Activity
liked
a model
about 15 hours ago
MiniMaxAI/MiniMax-Text-01
liked
a model
about 15 hours ago
openbmb/MiniCPM-o-2_6
upvoted
a
paper
about 15 hours ago
The Lessons of Developing Process Reward Models in Mathematical Reasoning
View all activity
Articles
Llama can now see and run on your device - welcome Llama 3.2
Sep 25, 2024
•
181
Fine-tuning LLMs to 1.58bit: extreme quantization made easy
Sep 18, 2024
•
216
Llama 3.1 - 405B, 70B & 8B with multilinguality and long context
Jul 23, 2024
•
226
WWDC 24: Running Mistral 7B with Core ML
Jul 22, 2024
•
56
How we leveraged distilabel to create an Argilla 2.0 Chatbot
Jul 16, 2024
•
32
Welcome Gemma 2 - Google's new open LLM
Jun 27, 2024
•
124
Welcome Llama 3 - Meta's new open LLM
Apr 18, 2024
•
282
CodeGemma - an official Google release for code LLMs
Apr 9, 2024
•
99
🪆 Introduction to Matryoshka Embedding Models
Feb 23, 2024
•
68
Welcome Gemma - Google's new open LLM
Feb 21, 2024
•
21
Constitutional AI with Open LLMs
Feb 1, 2024
•
13
Preference Tuning LLMs with Direct Preference Optimization Methods
Jan 18, 2024
•
42
Mixture of Experts Explained
Dec 11, 2023
•
255
Welcome Mixtral - a SOTA Mixture of Experts on Hugging Face
Dec 11, 2023
•
11
Inference for PROs
Sep 22, 2023
•
52
Spread Your Wings: Falcon 180B is here
Sep 6, 2023
•
4
Code Llama: Llama 2 learns to code
Aug 25, 2023
•
9
Results of the Open Source AI Game Jam
Jul 21, 2023
Llama 2 is here - get it on Hugging Face
Jul 18, 2023
•
23
The Falcon has landed in the Hugging Face ecosystem
Jun 5, 2023
•
11
Hugging Face Machine Learning Demos on arXiv
Nov 17, 2022
What's new in Diffusers? 🎨
Sep 12, 2022
Announcing Evaluation on the Hub
Jun 28, 2022
An Introduction to Deep Reinforcement Learning
May 4, 2022
•
3
Welcome spaCy to the 🤗 Hub
Jul 13, 2021
•
1
Sentence Transformers in the 🤗 Hub
Jun 28, 2021
Organizations
osanseviero
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
liked
3 models
about 15 hours ago
MiniMaxAI/MiniMax-Text-01
Text Generation
•
Updated
1 day ago
•
1.53k
•
372
openbmb/MiniCPM-o-2_6
Any-to-Any
•
Updated
about 21 hours ago
•
7.05k
•
545
Qwen/Qwen2.5-Math-PRM-72B
Text Classification
•
Updated
about 22 hours ago
•
358
•
51
liked
2 models
about 16 hours ago
jxm/cde-small-v2
Feature Extraction
•
Updated
1 day ago
•
1.03k
•
55
NovaSky-AI/Sky-T1-32B-Preview
Text Generation
•
Updated
5 days ago
•
4.89k
•
447
liked
a model
2 days ago
Alfaxad/gemma2-2b-swahili-preview
Text Generation
•
Updated
5 days ago
•
32
•
4
liked
a model
6 days ago
google/path-foundation
Updated
Dec 5, 2024
•
140
•
29
liked
a Space
6 days ago
Running
44
🥇
GIFT Eval
GIFT-Eval: A Benchmark for General Time Series Forecasting
liked
2 models
6 days ago
google/timesfm-2.0-500m-pytorch
Time Series Forecasting
•
Updated
19 days ago
•
89
microsoft/phi-4
Text Generation
•
Updated
9 days ago
•
100k
•
1.41k
liked
a Space
13 days ago
Running
414
📈
2024 AI Timeline
liked
2 Spaces
16 days ago
Running
on
Zero
168
📈
Lumina Brush Uniform Lit
Sleeping
43
📊
LMM
liked
2 models
16 days ago
nomic-ai/modernbert-embed-base
Sentence Similarity
•
Updated
7 days ago
•
76.5k
•
165
deepseek-ai/DeepSeek-V3
Updated
19 days ago
•
142k
•
2.01k
liked
a Space
16 days ago
Running
3
⚡
Co2 Estimator
Estimate CO2 activities from an image
liked
a model
21 days ago
yulan-team/YuLan-Mini
Text Generation
•
Updated
15 days ago
•
856
•
33
liked
a model
23 days ago
THUDM/cogagent-9b-20241220
Image-Text-to-Text
•
Updated
24 days ago
•
2.5k
•
39
liked
a model
24 days ago
deepseek-ai/DeepSeek-V3-Base
Updated
19 days ago
•
15.1k
•
1.27k
liked
a Space
25 days ago
Running
512
🌍
QVQ 72B Preview
Load more