SWEET-RL: Training Multi-Turn LLM Agents on Collaborative Reasoning Tasks Paper • 2503.15478 • Published 4 days ago • 4
Cluster and Predict Latents Patches for Improved Masked Image Modeling Paper • 2502.08769 • Published Feb 12 • 4
Audiobox: Unified Audio Generation with Natural Language Prompts Paper • 2312.15821 • Published Dec 25, 2023 • 16
XLS-R: Self-supervised Cross-lingual Speech Representation Learning at Scale Paper • 2111.09296 • Published Nov 17, 2021 • 3
Nix-TTS: Lightweight and End-to-End Text-to-Speech via Module-wise Distillation Paper • 2203.15643 • Published Mar 29, 2022 • 1
Large Concept Models: Language Modeling in a Sentence Representation Space Paper • 2412.08821 • Published Dec 11, 2024 • 14
Offline Data Enhanced On-Policy Policy Gradient with Provable Guarantees Paper • 2311.08384 • Published Nov 14, 2023
ArCHer: Training Language Model Agents via Hierarchical Multi-Turn RL Paper • 2402.19446 • Published Feb 29, 2024