GameFactory: Creating New Games with Generative Interactive Videos Paper ā¢ 2501.08325 ā¢ Published 16 days ago ā¢ 61
Evaluation Agent: Efficient and Promptable Evaluation Framework for Visual Generative Models Paper ā¢ 2412.09645 ā¢ Published Dec 10, 2024 ā¢ 35
Multi-Dimensional Insights: Benchmarking Real-World Personalization in Large Multimodal Models Paper ā¢ 2412.12606 ā¢ Published Dec 17, 2024 ā¢ 41
Code-as-Monitor: Constraint-aware Visual Programming for Reactive and Proactive Robotic Failure Detection Paper ā¢ 2412.04455 ā¢ Published Dec 5, 2024 ā¢ 37
ChatGen: Automatic Text-to-Image Generation From FreeStyle Chatting Paper ā¢ 2411.17176 ā¢ Published Nov 26, 2024 ā¢ 23
ShowUI: One Vision-Language-Action Model for GUI Visual Agent Paper ā¢ 2411.17465 ā¢ Published Nov 26, 2024 ā¢ 79
Enhancing the Reasoning Ability of Multimodal Large Language Models via Mixed Preference Optimization Paper ā¢ 2411.10442 ā¢ Published Nov 15, 2024 ā¢ 73
DynaMem: Online Dynamic Spatio-Semantic Memory for Open World Mobile Manipulation Paper ā¢ 2411.04999 ā¢ Published Nov 7, 2024 ā¢ 17
Personalization of Large Language Models: A Survey Paper ā¢ 2411.00027 ā¢ Published Oct 29, 2024 ā¢ 31
Survey of Cultural Awareness in Language Models: Text and Beyond Paper ā¢ 2411.00860 ā¢ Published Oct 30, 2024 ā¢ 23
HtmlRAG: HTML is Better Than Plain Text for Modeling Retrieved Knowledge in RAG Systems Paper ā¢ 2411.02959 ā¢ Published Nov 5, 2024 ā¢ 66
CLEAR: Character Unlearning in Textual and Visual Modalities Paper ā¢ 2410.18057 ā¢ Published Oct 23, 2024 ā¢ 200
Ko-BioMistral-7B Collection A Korean Language Model for Biomedical Text ā¢ 3 items ā¢ Updated Jun 2, 2024 ā¢ 1