Running on CPU Upgrade 91 91 Ostris' AI Toolkit 💻 Train FLUX, Qwen and Wan LoRAs with Ostris Ai Toolkit
What Characterizes Effective Reasoning? Revisiting Length, Review, and Structure of CoT Paper • 2509.19284 • Published 7 days ago • 22
view article Article mmBERT: ModernBERT goes Multilingual By orionweller and 5 others • 22 days ago • 101
view article Article Make your ZeroGPU Spaces go brrr with PyTorch ahead-of-time compilation By cbensimon and 3 others • 29 days ago • 64
Analysing Chain of Thought Dynamics: Active Guidance or Unfaithful Post-hoc Rationalisation? Paper • 2508.19827 • Published Aug 27 • 32
Splade Models Collection The collection includes Splade models from different authors that can be load thanks to the Sparse Encoder modules of Sentence Transformers • 16 items • Updated Jul 30 • 8
view article Article Training and Finetuning Sparse Embedding Models with Sentence Transformers v5 By tomaarsen and 1 other • Jul 1 • 123
Reasoning or Memorization? Unreliable Results of Reinforcement Learning Due to Data Contamination Paper • 2507.10532 • Published Jul 14 • 88
view article Article SmolLM3: smol, multilingual, long-context reasoner By loubnabnl and 22 others • Jul 8 • 688
SPIRAL: Self-Play on Zero-Sum Games Incentivizes Reasoning via Multi-Agent Multi-Turn Reinforcement Learning Paper • 2506.24119 • Published Jun 30 • 50