Deceptive Humor: A Synthetic Multilingual Benchmark Dataset for Bridging Fabricated Claims with Humorous Content Paper ā¢ 2503.16031 ā¢ Published 5 days ago ā¢ 3
Unlock Pose Diversity: Accurate and Efficient Implicit Keypoint-based Spatiotemporal Diffusion for Audio-driven Talking Portrait Paper ā¢ 2503.12963 ā¢ Published 8 days ago ā¢ 7
Creation-MMBench: Assessing Context-Aware Creative Intelligence in MLLM Paper ā¢ 2503.14478 ā¢ Published 6 days ago ā¢ 41
API Agents vs. GUI Agents: Divergence and Convergence Paper ā¢ 2503.11069 ā¢ Published 11 days ago ā¢ 31
Charting and Navigating Hugging Face's Model Atlas Paper ā¢ 2503.10633 ā¢ Published 11 days ago ā¢ 71
LLM-Microscope: Uncovering the Hidden Role of Punctuation in Context Memory of Transformers Paper ā¢ 2502.15007 ā¢ Published Feb 20 ā¢ 164
SurveyX: Academic Survey Automation via Large Language Models Paper ā¢ 2502.14776 ā¢ Published Feb 20 ā¢ 95
MLGym: A New Framework and Benchmark for Advancing AI Research Agents Paper ā¢ 2502.14499 ā¢ Published Feb 20 ā¢ 183
SuperGPQA: Scaling LLM Evaluation across 285 Graduate Disciplines Paper ā¢ 2502.14739 ā¢ Published Feb 20 ā¢ 97
CoSER: Coordinating LLM-Based Persona Simulation of Established Roles Paper ā¢ 2502.09082 ā¢ Published Feb 13 ā¢ 28
Improving Transformer World Models for Data-Efficient RL Paper ā¢ 2502.01591 ā¢ Published Feb 3 ā¢ 9
Virus: Harmful Fine-tuning Attack for Large Language Models Bypassing Guardrail Moderation Paper ā¢ 2501.17433 ā¢ Published Jan 29 ā¢ 9
Over-Tokenized Transformer: Vocabulary is Generally Worth Scaling Paper ā¢ 2501.16975 ā¢ Published Jan 28 ā¢ 27
Evolution and The Knightian Blindspot of Machine Learning Paper ā¢ 2501.13075 ā¢ Published Jan 22 ā¢ 6
FilmAgent: A Multi-Agent Framework for End-to-End Film Automation in Virtual 3D Spaces Paper ā¢ 2501.12909 ā¢ Published Jan 22 ā¢ 68