---
license: mit
pipeline_tag: image-to-video
---
This repository contains the model from the paper [Unlock Pose Diversity: Accurate and Efficient Implicit Keypoint-based Spatiotemporal Diffusion for Audio-driven Talking Portrait](https://huggingface.co/papers/2503.12963).

Code: https://github.com/chaolongy/KDTalker