Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement Learning Paper • 2405.10292 • Published May 16, 2024 • 2
UI Agent Collection a collection of algorithmic agents for user interfaces/interactions, program synthesis, and robotics • 360 items • Updated 3 days ago • 52
cjfcsjt/train_mcts_webshopv-sft300_niter5_b3b3_trajdpo_0.4 Viewer • Updated Nov 10, 2024 • 11.6k • 11 • 1