LightOnOCR-1B: The Case for End-to-End and Efficient Domain-Specific Vision-Language Models for OCR By lightonai and 2 others • 7 days ago • 53
Cosmos Predict 2.5 & Transfer 2.5: Evolving the World Foundation Models for Physical AI By nvidia • 3 days ago • 15
Australian-made LLM beats OpenAI and Google at legal retrieval By isaacus and 2 others • 8 days ago • 25
NVIDIA Releases 8 Million Sample Open Dataset and Tooling for OCR, Image Reasoning, Image and Video QA Tasks By nvidia and 6 others • 2 days ago • 13
Can Your LLM Think Like a Professional? Introducing ProfBench By nvidia and 7 others • 2 days ago • 12
Aligning to What? Rethinking Agent Generalization in MiniMax M2 By MiniMax-AI • about 14 hours ago • 12
How to Build a Healthcare Robot from Simulation to Deployment with NVIDIA Isaac for Healthcare By nvidia • 2 days ago • 11
Advancing Predictive ADMET Modeling Through Community-Driven Science: The ExpansionRx-OpenADMET Blind Challenge By hugging-science and 1 other • 3 days ago • 9