Nemotron-Post-Training-v3 Collection Collection of datasets used in the post-training phase of Nemotron Nano v3. • 7 items • Updated 11 days ago • 54
Olmo 3 Pre-training Collection All artifacts related to Olmo 3 pre-training • 10 items • Updated 11 days ago • 32
Olmo 3 Post-training Collection All artifacts for post-training Olmo 3. Datasets follow the model that resulted from training on them. • 32 items • Updated 11 days ago • 46
GigaEvo: An Open Source Optimization Framework Powered By LLMs And Evolution Algorithms Paper • 2511.17592 • Published Nov 17, 2025 • 118
view article Article Introducing Pivotal Token Search (PTS): Targeting Critical Decision Points in LLM Training May 17, 2025 • 11
ThinkDial: An Open Recipe for Controlling Reasoning Effort in Large Language Models Paper • 2508.18773 • Published Aug 26, 2025 • 16
Outlier-Safe Pre-Training for Robust 4-Bit Quantization of Large Language Models Paper • 2506.19697 • Published Jun 24, 2025 • 44
70% Size, 100% Accuracy: Lossless LLM Compression for Efficient GPU Inference via Dynamic-Length Float Paper • 2504.11651 • Published Apr 15, 2025 • 31
Tools 4 Agents Collection This is a collection of spaces on the hub that are useful for building agents. https://huggingface.co/docs/smolagents/en/tutorials/tools • 5 items • Updated Jun 26, 2025 • 7
DataGemma Release Collection A series of pioneering open models that help ground LLMs in real-world data through Data Commons. • 2 items • Updated Jul 10, 2025 • 87
Levels of AGI for Operationalizing Progress on the Path to AGI Paper • 2311.02462 • Published Nov 4, 2023 • 37