Article The 4 Things Qwen-3's Chat Template Teaches Us By cfahlgren1 • 29 days ago • 46
BizFinBench: A Business-Driven Real-World Financial Benchmark for Evaluating LLMs Paper • 2505.19457 • Published 3 days ago • 59
My MCP-ready spaces [WIP] Collection Progressive list of MCP server ready trending spaces maintained by fffiloni • 11 items • Updated about 9 hours ago • 4
SynLogic: Synthesizing Verifiable Reasoning Data at Scale for Learning Logical Reasoning and Beyond Paper • 2505.19641 • Published 3 days ago • 41
OpenS2V-Nexus: A Detailed Benchmark and Million-Scale Dataset for Subject-to-Video Generation Paper • 2505.20292 • Published 2 days ago • 49
Changelog Xet is now the default storage option for new users and organizations 5 days ago • 45
HunyuanPortrait: Implicit Condition Control for Enhanced Portrait Animation Paper • 2503.18860 • Published Mar 24 • 5
StructEval: Benchmarking LLMs' Capabilities to Generate Structural Outputs Paper • 2505.20139 • Published 2 days ago • 17
LLaDA 1.5: Variance-Reduced Preference Optimization for Large Language Diffusion Models Paper • 2505.19223 • Published 3 days ago • 7
LLaDA-V: Large Language Diffusion Models with Visual Instruction Tuning Paper • 2505.16933 • Published 6 days ago • 29
One-RL-to-See-Them-All Collection https://github.com/MiniMax-AI/One-RL-to-See-Them-All • 5 items • Updated 2 days ago • 12
QwenLong-L1: Towards Long-Context Large Reasoning Models with Reinforcement Learning Paper • 2505.17667 • Published 6 days ago • 75
BLIP3-o: A Family of Fully Open Unified Multimodal Models-Architecture, Training and Dataset Paper • 2505.09568 • Published 14 days ago • 85
Article Tiny Agents in Python: a MCP-powered agent in ~70 lines of code By celinah and 3 others • 6 days ago • 83
Article Empowering Public Organizations: Preparing Your Data for the AI Era By evijit and 1 other • Apr 10 • 14
Article nanoVLM: The simplest repository to train your VLM in pure PyTorch By ariG23498 and 6 others • 8 days ago • 106
Emerging Properties in Unified Multimodal Pretraining Paper • 2505.14683 • Published 8 days ago • 124