Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
FrancisRing
/
StableAvatar
like
75
Image-to-Video
Diffusers
ONNX
Safetensors
WanPipeline
video-generation
video diffusion transformer
audio-driven avatar animation
arxiv:
2508.08248
License:
mit
Model card
Files
Files and versions
xet
Community
6
Use this model
1511e92
StableAvatar
8.74 GB
3 contributors
History:
10 commits
FrancisRing
Update README.md
1511e92
verified
about 1 month ago
StableAvatar-1.3B
Upload 2 files
about 1 month ago
Wan2.1-Fun-V1.1-1.3B-InP
Upload Wan2.1_VAE.pth
about 1 month ago
wav2vec2-base-960h
Upload 9 files
about 1 month ago
.gitattributes
1.71 kB
Upload 8 files
about 1 month ago
Kim_Vocal_2.onnx
Safe
66.8 MB
xet
Upload Kim_Vocal_2.onnx
about 1 month ago
README.md
25.2 kB
Update README.md
about 1 month ago
config.json
Safe
1.03 kB
Upload config.json
about 1 month ago