OmniParser, turn your LLM into GUI agent
ocr images and video understanding
Analysis of data on an invoice