NVIDIA Releases 3 Million Sample Dataset for OCR, Visual Question Answering, and Captioning Tasks By nvidia and 4 others • 10 days ago • 62
How To Build a News Agent with GPT-OSS, Hugging Face Inference & Gradio By fdaudens • 7 days ago • 19
Supercharge Edge AI With High‑Accuracy Reasoning Using NVIDIA Nemotron Nano 2 9B By nvidia and 9 others • 2 days ago • 14
Old Maps, New Terrain: Updating Labour Taxonomies for the AI Era By frimelle and 1 other • about 21 hours ago • 10
What’s MXFP4? The 4-Bit Secret Powering OpenAI’s GPT‑OSS Models on Modest Hardware By RakshitAralimatti • 13 days ago • 14
RynnVLA-001: Using Human Demonstrations to Improve Robot Manipulation By Alibaba-DAMO-Academy and 9 others • 10 days ago • 25
AutoBench Third Run: Revolutionizing LLM Evaluation with Record-Breaking Scale, Accuracy, and a New Home at autobench.org By PeterKruger • about 23 hours ago • 6
DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge By NormalUhr • Feb 7 • 209
NVIDIA Releases 6 Million Multi-Lingual Reasoning Dataset By nvidia and 4 others • about 10 hours ago • 5
Magpie Speech — Applying an LLM Data Synthesis Method to an LLM-Based TTS Model to Synthesize a Speech Dataset By Aratako • 7 days ago • 5
Introducing Pivotal Token Search (PTS): Targeting Critical Decision Points in LLM Training By codelion • May 17 • 9