Aguvis: Unified Pure Vision Agents for Autonomous GUI Interaction Paper • 2412.04454 • Published Dec 5, 2024 • 61
ShowUI: One Vision-Language-Action Model for GUI Visual Agent Paper • 2411.17465 • Published Nov 26, 2024 • 80
Article ColPali: Efficient Document Retrieval with Vision Language Models 👀 By manu • Jul 5, 2024 • 196
Article Hugging Face x LangChain : A new partner package in LangChain May 14, 2024 • 125
Article Preference Tuning LLMs with Direct Preference Optimization Methods Jan 18, 2024 • 44
🐐 GEITje 7B ultra 🤖 Collection SFT and DPO models for GEITje 7B Ultra, including the datasets used to train them. • 10 items • Updated Dec 6, 2024 • 9