---
license: apache-2.0
base_model:
- Wan-AI/Wan2.1-T2V-14B
pipeline_tag: text-to-video
---
Qualitative results of video generation using Wan-Alpha. Our model generates diverse scenes with accurate, clearly rendered transparency. Notably, it can synthesize semi-transparent objects, glowing effects, and fine-grained details such as hair.
## 🔥 News
- [2025.09.30] Released Wan-Alpha v1.0: the weights adapted from Wan2.1-T2V-14B and the inference code are now open-sourced.
## 🎬 Showcase

### Text-to-Video Generation with Alpha Channel
| Prompt | Preview Video | Alpha Video |
|---|---|---|
| "Medium shot. A little girl holds a bubble wand and blows out colorful bubbles that float and pop in the air. The background of this video is transparent. Realistic style." | ![]() |
![]() |
For more results, please visit our project website.
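Each result pairs an RGB preview video with a matching alpha-matte video. As a rough illustration of how the two streams can be combined downstream, here is a minimal Python sketch that merges them into per-frame RGBA PNGs; it is not from the official repo, and the file names `preview.mp4` / `alpha.mp4` are placeholders:

```python
# Minimal sketch: merge an RGB preview video with its alpha-matte video
# into per-frame RGBA PNGs. File names below are placeholders.
import imageio.v2 as imageio
import numpy as np

rgb_reader = imageio.get_reader("preview.mp4")    # RGB preview stream
alpha_reader = imageio.get_reader("alpha.mp4")    # grayscale matte stream

for i, (rgb, alpha) in enumerate(zip(rgb_reader, alpha_reader)):
    # The matte video is typically stored as RGB; one channel carries the alpha.
    matte = alpha[..., :1] if alpha.ndim == 3 else alpha[..., None]
    rgba = np.concatenate([rgb[..., :3], matte], axis=-1)   # H x W x 4, uint8
    imageio.imwrite(f"frame_{i:04d}.png", rgba)             # PNG preserves alpha
```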
## 🚀 Quick Start

Please see the GitHub repository for details on running the code.
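As a starting point before following the repository instructions, the released weights can be fetched with `huggingface_hub`. This is a minimal sketch, and the `repo_id` below is a placeholder rather than the confirmed model id:

```python
# Minimal sketch: download the Wan-Alpha weights from the Hugging Face Hub.
# NOTE: repo_id is a placeholder; substitute the actual model id.
from huggingface_hub import snapshot_download

local_dir = snapshot_download(repo_id="<org>/Wan-Alpha")
print("Weights downloaded to:", local_dir)
```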
## 🤗 Acknowledgements
This project is built upon the following excellent open-source projects:
* [DiffSynth-Studio](https://github.com/modelscope/DiffSynth-Studio) (training/inference framework)
* [Wan2.1](https://github.com/Wan-Video/Wan2.1) (base video generation model)
* [LightX2V](https://github.com/ModelTC/LightX2V) (inference acceleration)
* [WanVideo_comfy](https://huggingface.co/Kijai/WanVideo_comfy) (inference acceleration)
We sincerely thank the authors and contributors of these projects.
---
## ✏️ Citation
If you find our work helpful for your research, please consider citing our paper:
```bibtex
@article{
}
```
## 💬 Contact Us
If you have any questions or suggestions, feel free to reach out via GitHub Issues. We look forward to your feedback!

