Klear-Reasoner: Advancing Reasoning Capability via Gradient-Preserving Clipping Policy Optimization Paper • 2508.07629 • Published 29 days ago • 41
AcceLLM: Accelerating LLM Inference using Redundancy for Load Balancing and Data Locality Paper • 2411.05555 • Published Nov 8, 2024 • 2
Apertus LLM Collection Democratizing Open and Compliant LLMs for Global Language Environments: 8B and 70B open-data open-weights models, multilingual in >1000 languages • 4 items • Updated about 2 hours ago • 215
InternVL3.5 Collection This collection includes all released checkpoints of InternVL3.5, covering different training stages (e.g., Pretraining, SFT, MPO, Cascade RL). • 54 items • Updated 10 days ago • 87
view article Article Luth: Efficient French Specialization for Small Language Models By MaxLSB and 1 other • 29 days ago • 15
Trained on AWS Trainium Collection Collection of models on Hugging Face that have been trained on AWS Trainium. Learn more here: https://huggingface.co/docs/optimum-neuron/index • 7 items • Updated May 7, 2024 • 10
view article Article Transformers backend integration in SGLang By marcsun13 and 4 others • Jun 23 • 53
view article Article NVIDIA Releases 6 Million Multi-Lingual Reasoning Dataset By nvidia and 4 others • 19 days ago • 15
view article Article From Zero to GPU: A Guide to Building and Scaling Production-Ready CUDA Kernels By drbh and 1 other • 22 days ago • 54
view article Article MCP for Research: How to Connect AI to Research Tools By dylanebert • 22 days ago • 47
NVIDIA Nemotron Collection Open, Production-ready Enterprise Models. Nvidia Open Model license. • 4 items • Updated 5 days ago • 56