view article Article Vision Language Models (Better, Faster, Stronger) By merve and 4 others β’ May 12 β’ 502
view article Article Introducing Trackio: A Lightweight Experiment Tracking Library from Hugging Face By abidlabs and 4 others β’ 15 days ago β’ 145
MiroMind-M1: An Open-Source Advancement in Mathematical Reasoning via Context-Aware Multi-Stage Policy Optimization Paper β’ 2507.14683 β’ Published 24 days ago β’ 124
FreshStack: Building Realistic Benchmarks for Evaluating Retrieval on Technical Documents Paper β’ 2504.13128 β’ Published Apr 17 β’ 7
REASONING GYM: Reasoning Environments for Reinforcement Learning with Verifiable Rewards Paper β’ 2505.24760 β’ Published May 30 β’ 69
Instruction-Following Evaluation for Large Language Models Paper β’ 2311.07911 β’ Published Nov 14, 2023 β’ 21
π Speech Enhancement Collection Unlocking a new era in Speech Enhancement, powered by the latest AI technologies, for superior audio quality improvements! π β’ 8 items β’ Updated May 1, 2024 β’ 13
MSA-ASR: Efficient Multilingual Speaker Attribution with frozen ASR Models Paper β’ 2411.18152 β’ Published Nov 27, 2024 β’ 1
PresentAgent: Multimodal Agent for Presentation Video Generation Paper β’ 2507.04036 β’ Published Jul 5 β’ 10
view article Article Introducing the Synthetic Data Generator - Build Datasets with Natural Language By davidberenstein1957 and 5 others β’ Dec 16, 2024 β’ 134
GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning Paper β’ 2507.01006 β’ Published Jul 1 β’ 215
InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models Paper β’ 2504.10479 β’ Published Apr 14 β’ 280