view article Article π¦Έπ»#14: What Is MCP, and Why Is Everyone β Suddenly!β Talking About It? By Kseniase β’ Mar 17 β’ 284
Florence-VL: Enhancing Vision-Language Models with Generative Vision Encoder and Depth-Breadth Fusion Paper β’ 2412.04424 β’ Published Dec 5, 2024 β’ 64
Video Collection Stability AI's suite of image-to-video models β’ 6 items β’ Updated 21 days ago β’ 83
General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model Paper β’ 2409.01704 β’ Published Sep 3, 2024 β’ 85
ReconX: Reconstruct Any Scene from Sparse Views with Video Diffusion Model Paper β’ 2408.16767 β’ Published Aug 29, 2024 β’ 33
view article Article Fine-tuning Florence-2 - Microsoft's Cutting-edge Vision Language Models By andito and 2 others β’ Jun 24, 2024 β’ 194