VLM^2-Bench: A Closer Look at How Well VLMs Implicitly Link Explicit Matching Visual Cues Paper • 2502.12084 • Published Feb 17 • 31
ConText: Driving In-context Learning for Text Removal and Segmentation Paper • 2506.03799 • Published Jun 4 • 1
GameFactory: Creating New Games with Generative Interactive Videos Paper • 2501.08325 • Published Jan 14 • 68