license: mit | |
pipeline_tag: image-to-video | |
This repository contains the model of the paper [Unlock Pose Diversity: Accurate and Efficient Implicit Keypoint-based Spatiotemporal Diffusion for Audio-driven Talking Portrait](https://huggingface.co/papers/2503.12963). | |
Code: https://github.com/chaolongy/KDTalker | |