ZeroSearch: Incentivize the Search Capability of LLMs without Searching Paper • 2505.04588 • Published 4 days ago • 53
Qwen3 Collection Qwen's new Qwen3 models. In Unsloth Dynamic 2.0, GGUF, 4-bit and 16-bit Safetensor formats. Includes 128K Context Length variants. • 65 items • Updated 2 days ago • 139
Seaweed-7B: Cost-Effective Training of Video Generation Foundation Model Paper • 2504.08685 • Published about 1 month ago • 123
Orpheus Multilingual Research Release Collection Beta Release of multilingual models. • 12 items • Updated Apr 10 • 77
TxGemma Release Collection Collection of open models to accelerate the development of therapeutics. • 5 items • Updated Apr 3 • 54
Beyond Release: Access Considerations for Generative AI Systems Paper • 2502.16701 • Published Feb 23 • 16
DiffSplat: Repurposing Image Diffusion Models for Scalable Gaussian Splat Generation Paper • 2501.16764 • Published Jan 28 • 22
SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training Paper • 2501.17161 • Published Jan 28 • 121