Energy-Based Transformers are Scalable Learners and Thinkers Paper • 2507.02092 • Published 4 days ago • 20
WebSailor: Navigating Super-human Reasoning for Web Agent Paper • 2507.02592 • Published 3 days ago • 74
HalluSegBench: Counterfactual Visual Reasoning for Segmentation Hallucination Evaluation Paper • 2506.21546 • Published 10 days ago • 2
Article Training and Finetuning Sparse Embedding Models with Sentence Transformers v5 By tomaarsen and 1 other • 5 days ago • 75
A Survey on Vision-Language-Action Models: An Action Tokenization Perspective Paper • 2507.01925 • Published 4 days ago • 29
Does Math Reasoning Improve General LLM Capabilities? Understanding Transferability of LLM Reasoning Paper • 2507.00432 • Published 5 days ago • 53
Radial Attention: O(n log n) Sparse Attention with Energy Decay for Long Video Generation Paper • 2506.19852 • Published 12 days ago • 34
GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning Paper • 2507.01006 • Published 5 days ago • 166
Article Bringing Fusion Down to Earth: ML for Stellarator Optimization By cgeorgiaw • 4 days ago • 57
Article (LoRA) Fine-Tuning FLUX.1-dev on Consumer Hardware By derekl35 and 4 others • 17 days ago • 71
ERNIE 4.5 Collection • collection of ERNIE 4.5 models. "-Paddle" models use PaddlePaddle weights, while "-PT" models use Transformer-style PyTorch weights. • 23 items • Updated 3 days ago • 140
Article Welcome the NVIDIA Llama Nemotron Nano VLM to Hugging Face Hub By nvidia and 10 others • 9 days ago • 23
AnimaX: Animating the Inanimate in 3D with Joint Video-Pose Diffusion Models Paper • 2506.19851 • Published 12 days ago • 55
Article 🤔👀🎬🖥️📖 Kimi-VL-A3B-Thinking-2506: A Quick Navigation By moonshotai and 1 other • 15 days ago • 56