view article Article Vision Language Models (Better, Faster, Stronger) By merve and 4 others • May 12 • 491
DeepResearch Bench: A Comprehensive Benchmark for Deep Research Agents Paper • 2506.11763 • Published Jun 13 • 69
Sora Reference Papers Collection A collection of all papers referenced in OpenAI's "Video generation models as world simulators" technical report • openai.com/sora • 30 items • Updated Oct 3, 2024 • 52
Qwen2.5-VL Collection Vision-language model series based on Qwen2.5 • 11 items • Updated 11 days ago • 520
view article Article Fine-tuning Florence-2 - Microsoft's Cutting-edge Vision Language Models By andito and 2 others • Jun 24, 2024 • 199