Going multimodal: How Prezi is leveraging the Hub and the Expert Support Program to accelerate their ML roadmap Jun 19, 2024 • 11
Jan 17 Releases ❄️ Collection Models and datasets of the second week of Jan 2025. • 23 items • Updated about 17 hours ago • 6
view article Article Introducing multi-backends (TRT-LLM, vLLM) support for Text Generation Inference 2 days ago • 42
view article Article Announcing NVIDIA Cosmos World Foundation Models By mingyuliutw • 11 days ago • 22
Moshi v0.1 Release Collection MLX, Candle & PyTorch model checkpoints released as part of the Moshi release from Kyutai. Run inference via: https://github.com/kyutai-labs/moshi • 13 items • Updated Sep 18, 2024 • 225
view article Article Memory-efficient Diffusion Transformers with Quanto and Diffusers Jul 30, 2024 • 62
Writing in the Margins: Better Inference Pattern for Long Context Retrieval Paper • 2408.14906 • Published Aug 27, 2024 • 139
NIM Serverless Inference API Collection Models in this collection are available for inference via a serverless API powered by NVIDIA NIM. • 8 items • Updated 1 day ago • 22
view article Article 🔥 Argilla 2.0: the data-centric tool for AI makers 🤗 By dvilasuero • Jul 30, 2024 • 37
view article Article Fine-tune Llama 3.1 Ultra-Efficiently with Unsloth By mlabonne • Jul 29, 2024 • 264
Consent in Crisis: The Rapid Decline of the AI Data Commons Paper • 2407.14933 • Published Jul 20, 2024 • 12
view article Article From DeepSpeed to FSDP and Back Again with Hugging Face Accelerate Jun 13, 2024 • 45