question about using image guidance in this ti2v model ?

#3
by Lixiaoming - opened

hi, it is a great work, and thanks for sharing the weights of fast wan model.
I am wondering is this a ti2v model or just i2v model ? Since i do not see the usage of image in the instruction below:

num_gpus=1
export FASTVIDEO_ATTENTION_BACKEND=FLASH_ATTN
export MODEL_BASE=FastVideo/FastWan2.2-TI2V-5B-Full-Diffusers

export MODEL_BASE=hunyuanvideo-community/HunyuanVideo

You can either use --prompt or --prompt-txt, but not both.

fastvideo generate
--model-path $MODEL_BASE
--sp-size $num_gpus
--tp-size 1
--num-gpus $num_gpus
--height 704
--width 1280
--num-frames 121
--num-inference-steps 3
--fps 24
--prompt-txt assets/prompt.txt
--negative-prompt "Bright tones, overexposed, static, blurred details, subtitles, style, works, paintings, images, static, overall gray, worst quality, low quality, JPEG compression residue, ugly, incomplete, extra fingers, poorly drawn hands, poorly drawn faces, deformed, disfigured, misshapen limbs, fused fingers, still picture, messy background, three legs, many people in the background, walking backwards"
--seed 1024
--output-path outputs_video_dmd/
--dmd-denoising-steps "1000,757,522"

Sign up or log in to comment