Detect objects in images from a URL or an upload
nanonets / qwen2vl / rolmocr / aya vision
Extract text from images
Interact with an agent to perform web-based tasks
[Keep updating] Collect everything about o1 and r1!