---
license: mit
pipeline_tag: image-to-video
---

This repository contains the model of the paper [Unlock Pose Diversity: Accurate and Efficient Implicit Keypoint-based Spatiotemporal Diffusion for Audio-driven Talking Portrait](https://huggingface.co/papers/2503.12963).

Code: https://github.com/chaolongy/KDTalker