view article Article Human in the Loop for computer use agents (instant handoff from AI to you) By dhruv3006 • 10 days ago • 1
view article Article From Zero to GPU: A Guide to Building and Scaling Production-Ready CUDA Kernels By drbh and 1 other • 22 days ago • 54
view article Article Computer-Use Agents SOTA Challenge @ Hack the North (YC interview for top team) + Global Online ($2000 prize) By dhruv3006 • 13 days ago • 1
view article Article Reachy Mini - The Open-Source Robot for Today's and Tomorrow's AI Builders By thomwolf and 1 other • Jul 9 • 668
view article Article xLSTM-based time series model TiRex significantly outperforms competing models in forecasting accuracy By BobWue • Jun 4 • 12
view article Article Gotchas in Tokenizer Behavior Every Developer Should Know By qgallouedec • Apr 18 • 41
view article Article State of open video generation models in Diffusers By sayakpaul and 2 others • Jan 27 • 63
Flux tools in NF4 Collection Contains Flux Fill, Canny, and Dev checkpoints in NF4. • 3 items • Updated Nov 24, 2024 • 10
Image / Video Gen Collection Image Generation Using Diffusion-Based Methods: Tips and Techniques for Stable Diffusion • 37 items • Updated May 4 • 9
Optimizing diffusion models Collection Provides a list of papers focusing on optimizing T2I diffusion models, targeting fewer timesteps, architecture optimization, and more. • 21 items • Updated Aug 22, 2024 • 21
view article Article Simplifying Alignment: From RLHF to Direct Preference Optimization (DPO) By ariG23498 • Jan 19 • 29
SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model Paper • 2502.02737 • Published Feb 4 • 242
view article Article Timm ❤️ Transformers: Use any timm model with transformers By ariG23498 and 4 others • Jan 16 • 51