Step-Video-T2V Technical Report: The Practice, Challenges, and Future of Video Foundation Model Paper • 2502.10248 • Published 7 days ago • 49
view article Article Open Preference Dataset for Text-to-Image Generation by the 🤗 Community Dec 9, 2024 • 54
Soundwave: Less is More for Speech-Text Alignment in LLMs Paper • 2502.12900 • Published 3 days ago • 73
view article Article Introducing Three New Serverless Inference Providers: Hyperbolic, Nebius AI Studio, and Novita 🔥 4 days ago • 86
view article Article PaliGemma 2 Mix - New Instruction Vision Language Models by Google 3 days ago • 49
Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling Paper • 2502.06703 • Published 11 days ago • 132
One Example Shown, Many Concepts Known! Counterexample-Driven Conceptual Reasoning in Mathematical LLMs Paper • 2502.10454 • Published 10 days ago • 6
HermesFlow: Seamlessly Closing the Gap in Multimodal Understanding and Generation Paper • 2502.12148 • Published 4 days ago • 16
EmbodiedBench: Comprehensive Benchmarking Multi-modal Large Language Models for Vision-Driven Embodied Agents Paper • 2502.09560 • Published 8 days ago • 32
view article Article Fine-Tuning Your First Large Language Model (LLM) with PyTorch and Hugging Face By dvgodoy • 10 days ago • 6
Tools for learning AI Collection This is a collection of tools on the hub that teachers and students can use to learn AI! • 9 items • Updated 4 days ago • 61
SelfCite: Self-Supervised Alignment for Context Attribution in Large Language Models Paper • 2502.09604 • Published 8 days ago • 30
DPO-Shift: Shifting the Distribution of Direct Preference Optimization Paper • 2502.07599 • Published 10 days ago • 14
CODESIM: Multi-Agent Code Generation and Problem Solving through Simulation-Driven Planning and Debugging Paper • 2502.05664 • Published 13 days ago • 22
DeepSeek-Coder: When the Large Language Model Meets Programming -- The Rise of Code Intelligence Paper • 2401.14196 • Published Jan 25, 2024 • 59
DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models Paper • 2401.06066 • Published Jan 11, 2024 • 48