arxiv:2403.15382

DragAPart: Learning a Part-Level Motion Prior for Articulated Objects

Published on Mar 22, 2024

Submitted by akhaliq on Mar 25, 2024

Authors:

Ruining Li, Chuanxia Zheng

Abstract

DragAPart generates new images based on part-level interactions using fine-tuned image generators and a new synthetic dataset, showing improved part-level motion understanding compared to previous methods.

AI-generated summary

We introduce DragAPart, a method that, given an image and a set of drags as input, generates a new image of the same object in a new state compatible with the action of the drags. Unlike prior works that focused on repositioning objects, DragAPart predicts part-level interactions, such as opening and closing a drawer. We study this problem as a proxy for learning a generalist motion model, one not restricted to a specific kinematic structure or object category. To this end, we start from a pre-trained image generator and fine-tune it on Drag-a-Move, a new synthetic dataset that we introduce. Combined with a new encoding for the drags and dataset randomization, the new model generalizes well to real images and different categories. Compared to prior motion-controlled generators, we demonstrate much better part-level motion understanding.
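To make the input described above concrete, the following is a minimal sketch of how "a set of drags" might be represented and turned into a fixed-size conditioning array. It is not the drag encoding proposed in the paper, which the abstract does not describe. The names `Drag`, `normalize_drags`, and `pad_drags`, the padding value, and the choice to normalize by image size are all assumptions made for illustration.

```python
from dataclasses import dataclass
from typing import List, Sequence, Tuple


@dataclass(frozen=True)
class Drag:
    """Hypothetical container: a source pixel and the pixel it is dragged to, as (x, y)."""
    src: Tuple[float, float]
    dst: Tuple[float, float]


def normalize_drags(drags: Sequence[Drag], width: int, height: int) -> List[List[float]]:
    """Map each drag to [src_x, src_y, dst_x, dst_y], scaled by the image size."""
    if width <= 0 or height <= 0:
        raise ValueError("width and height must be positive")
    rows = []
    for drag in drags:
        (sx, sy), (dx, dy) = drag.src, drag.dst
        rows.append([sx / width, sy / height, dx / width, dy / height])
    return rows


def pad_drags(rows: List[List[float]], max_drags: int, pad_value: float = -1.0) -> List[List[float]]:
    """Pad to a fixed (max_drags, 4) layout; pad_value is an assumed 'no drag' marker."""
    if len(rows) > max_drags:
        raise ValueError("more drags than max_drags")
    padding = [[pad_value] * 4 for _ in range(max_drags - len(rows))]
    return rows + padding


if __name__ == "__main__":
    # One drag pulling a point 64 pixels to the right on a 256x256 image,
    # e.g. a drawer handle being pulled open.
    drags = [Drag(src=(64.0, 128.0), dst=(128.0, 128.0))]
    encoded = pad_drags(normalize_drags(drags, width=256, height=256), max_drags=3)
    print(encoded)
```

A fixed-size layout like this is one common way to feed a variable number of user interactions to a conditional generator. How the actual model consumes drags is specified in the paper, not here.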
