Energy-Based Transformers are Scalable Learners and Thinkers Paper • 2507.02092 • Published 7 days ago • 43
view article Article SmolLM3: smol, multilingual, long-context reasoner By loubnabnl and 22 others • 2 days ago • 413
4DSloMo: 4D Reconstruction for High Speed Scene with Asynchronous Capture Paper • 2507.05163 • Published 3 days ago • 37
view article Article No GPU left behind: Unlocking Efficiency with Co-located vLLM in TRL By toslali-ibm and 5 others • Jun 3 • 67
view article Article Gemma 3n fully available in the open-source ecosystem! By ariG23498 and 7 others • 14 days ago • 105
view article Article Vision Language Models (Better, Faster, Stronger) By merve and 4 others • May 12 • 474
ERNIE 4.5 Collection collection of ERNIE 4.5 models. "-Paddle" models use PaddlePaddle weights, while "-PT" models use Transformer-style PyTorch weights. • 23 items • Updated 7 days ago • 147