Update README.md
README.md CHANGED

@@ -91,12 +91,12 @@ This model used weights pretrained by [lxj616](https://huggingface.co/lxj616/mak
 * **Image size:** 512 x 512
 * **Frame count:** 24
 * **Schedule:**
-  * 2 x 10 epochs: LR warmup for
-  * 2 x 20 epochs: LR warmup for
+  * 2 x 10 epochs: LR warmup for 1 epoch then held constant at 5e-5 (10,000 samples per epoch)
+  * 2 x 20 epochs: LR warmup for 1 epoch then held constant at 5e-5 (10,000 samples per epoch)
   * 1 x 9 epochs: LR warmup for 1 epoch to 5e-5 then cosine annealing to 1e-8
     * Additional data mixed in, see [Training Data](#training-data)
-  * 1 x 5 epochs: LR warmup for
-  * 1 x 5 epochs: LR warmup for 0.
+  * 1 x 5 epochs: LR warmup for 0.5 epochs to 2.5e-5 then held constant (17,000 samples per epoch)
+  * 1 x 5 epochs: LR warmup for 0.5 epochs to 5e-6 then cosine annealing to 2.5e-6 (17,000 samples per epoch)
   * some restarts were required due to NaNs appearing in the gradient (see training logs)
 * **Total update steps:** ~200,000
 * **Hardware:** 4 x TPUv4 (provided by Google Cloud for the [HuggingFace JAX/Diffusers Sprint Event](https://github.com/huggingface/community-events/tree/main/jax-controlnet-sprint))
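
For reference, the completed schedules in this diff translate naturally into optax, the optimizer library used with JAX training loops. A minimal sketch follows, with loudly-labeled assumptions: the README gives samples per epoch but not the batch size, so `batch_size` (and therefore `steps_per_epoch`) is hypothetical, and AdamW is assumed only because the optimizer is not stated.

```python
import optax

samples_per_epoch = 10_000   # from the README ("10,000 samples per epoch")
batch_size = 8               # assumption: batch size is not stated in the README
steps_per_epoch = samples_per_epoch // batch_size

# "LR warmup for 1 epoch then held constant at 5e-5"
warmup_then_constant = optax.join_schedules(
    schedules=[
        optax.linear_schedule(0.0, 5e-5, steps_per_epoch),  # linear warmup over 1 epoch
        optax.constant_schedule(5e-5),                      # then hold
    ],
    boundaries=[steps_per_epoch],
)

# "LR warmup for 1 epoch to 5e-5 then cosine annealing to 1e-8" (9 epochs total)
warmup_then_cosine = optax.warmup_cosine_decay_schedule(
    init_value=0.0,
    peak_value=5e-5,
    warmup_steps=steps_per_epoch,
    decay_steps=9 * steps_per_epoch,  # decay_steps counts warmup + decay
    end_value=1e-8,
)

# assumption: the README does not name the optimizer
optimizer = optax.adamw(learning_rate=warmup_then_constant)
```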
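The note about NaNs appearing in the gradient can also be guarded against mechanically. The sketch below shows one generic optax mitigation, not what this run actually did (the README says the runs were restarted manually): `optax.apply_if_finite` skips updates whose gradients contain NaN/Inf.

```python
import optax

# Generic guard against NaN/Inf gradients (illustration only).
# apply_if_finite ignores any update whose gradients are not finite;
# after `max_consecutive_errors` consecutive bad steps it gives up and
# accepts the update anyway, so a persistent failure stays visible.
optimizer = optax.apply_if_finite(
    optax.adamw(learning_rate=5e-5),  # 5e-5 = peak LR from the schedules above
    max_consecutive_errors=5,         # assumption: arbitrary tolerance
)
```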