Maani

upvoted 3 collections 5 months ago

Nemotron-Post-Training-v3

Olmo 3 Pre-training

Olmo 3 Post-training

upvoted a changelog 5 months ago

Hugging Face Changelog

Hugging Face Docs for Humans and AI Agents

Nov 6, 2025 • 89

upvoted a paper 5 months ago

GigaEvo: An Open Source Optimization Framework Powered By LLMs And Evolution Algorithms

Paper • 2511.17592 • Published Nov 17, 2025 • 121

upvoted a changelog 7 months ago

Hugging Face Changelog

Repositories total file size is now displayed

Sep 18, 2025 • 175

upvoted an article 7 months ago

Article

Introducing Pivotal Token Search (PTS): Targeting Critical Decision Points in LLM Training

May 17, 2025 • 12

upvoted a paper 8 months ago

ThinkDial: An Open Recipe for Controlling Reasoning Effort in Large Language Models

Paper • 2508.18773 • Published Aug 26, 2025 • 16

upvoted a collection 10 months ago

💧 LFM2

Collection

LFM2 is a new generation of hybrid models, designed for on-device deployment. • 28 items • Updated 23 days ago • 153

upvoted a paper 10 months ago

Outlier-Safe Pre-Training for Robust 4-Bit Quantization of Large Language Models

Paper • 2506.19697 • Published Jun 24, 2025 • 44

upvoted an article 11 months ago

Article

Uncensor any LLM with abliteration

Jun 13, 2024 • 845

upvoted an article about 1 year ago

Article

Tiny Agents: an MCP-powered agent in 50 lines of code

Apr 25, 2025 • 308

upvoted a paper about 1 year ago

70% Size, 100% Accuracy: Lossless LLM Compression for Efficient GPU Inference via Dynamic-Length Float

Paper • 2504.11651 • Published Apr 15, 2025 • 31

upvoted an article about 1 year ago

Article

DualPipe could be better without the Dual

Feb 28, 2025 • 17

upvoted a paper about 1 year ago

o3-mini vs DeepSeek-R1: Which One is Safer?

Paper • 2501.18438 • Published Jan 30, 2025 • 23

upvoted an article over 1 year ago

Article

So, what is general intelligence?

Jan 20, 2025 • 2

upvoted 2 collections over 1 year ago

Tools 4 Agents

Collection

This is a collection of spaces on the hub that are useful for building agents. https://huggingface.co/docs/smolagents/en/tutorials/tools • 5 items • Updated Jun 26, 2025 • 7

DataGemma Release

Collection

A series of pioneering open models that help ground LLMs in real-world data through Data Commons. • 2 items • Updated Mar 12 • 89

upvoted 2 papers over 2 years ago

Levels of AGI for Operationalizing Progress on the Path to AGI

Paper • 2311.02462 • Published Nov 4, 2023 • 36

Efficient LLM Inference on CPUs

Paper • 2311.00502 • Published Nov 1, 2023 • 7