Create README.md
README.md
ADDED
@@ -0,0 +1,132 @@
---
license: other
language:
- en
base_model:
- Wan-AI/Wan2.1-I2V-14B-480P
- Wan-AI/Wan2.1-I2V-14B-480P-Diffusers
pipeline_tag: image-to-video
tags:
- text-to-image
- lora
- diffusers
- template:diffusion-lora
- image-to-video
widget:
- text: >-
    The video opens on a puppy. A knife, held by a hand, is coming into frame
    and hovering over the puppy. The knife then begins cutting into the puppy to
    c4k3 cakeify it. As the knife slices the puppy open, the inside of the puppy
    is revealed to be cake with chocolate layers. The knife cuts through and the
    contents of the puppy are revealed.
  output:
    url: example_videos/man_deflate.mp4
- text: >-
    The video opens on a woman. A knife, held by a hand, is coming into frame and hovering over the woman. The knife then begins cutting into the woman to c4k3 cakeify it. As the knife slices the woman open, the inside of the woman is revealed to be cake with chocolate layers. The knife cuts through and the contents of the woman are revealed.
  output:
    url: example_videos/lamp_deflate.mp4
- text: >-
    The video opens on a timberland boot. A knife, held by a hand, is coming into frame and hovering over the timberland boot. The knife then begins cutting into the timberland boot to c4k3 cakeify it. As the knife slices the timberland boot open, the inside of the timberland boot is revealed to be cake with chocolate layers. The knife cuts through and the contents of the timberland boot are revealed.
  output:
    url: example_videos/balloon_deflate.mp4
- text: >-
    The video opens on a cat. A knife, held by a hand, is coming into frame and hovering over the cat. The knife then begins cutting into the cat to c4k3 cakeify it. As the knife slices the cat open, the inside of the cat is revealed to be cake with chocolate layers. The knife cuts through and the contents of the cat are revealed.
  output:
    url: example_videos/cat_deflate.mp4
---

<div style="background-color: #f8f9fa; padding: 20px; border-radius: 10px; margin-bottom: 20px;">
<h1 style="color: #24292e; margin-top: 0;">Cakeify Effect LoRA for Wan2.1 14B I2V 480p</h1>

<div style="background-color: white; padding: 15px; border-radius: 8px; margin: 15px 0; box-shadow: 0 2px 4px rgba(0,0,0,0.1);">
<h2 style="color: #24292e; margin-top: 0;">Overview</h2>
<p>This LoRA is trained on the Wan2.1 14B I2V 480p model and allows you to cakeify any object in an image. The effect works on a wide variety of objects, from animals to vehicles to people!</p>
</div>

<div style="background-color: white; padding: 15px; border-radius: 8px; margin: 15px 0; box-shadow: 0 2px 4px rgba(0,0,0,0.1);">
<h2 style="color: #24292e; margin-top: 0;">Features</h2>
<ul style="margin-bottom: 0;">
<li>Transform any image into a video of it being cakeified</li>
<li>Trained on the Wan2.1 14B 480p I2V base model</li>
<li>Consistent results across different object types</li>
<li>Simple prompt structure that's easy to adapt</li>
</ul>
</div>

<div style="background-color: white; padding: 15px; border-radius: 8px; margin: 15px 0; box-shadow: 0 2px 4px rgba(0,0,0,0.1);">
<h2 style="color: #24292e; margin-top: 0;">Community</h2>
<ul style="margin-bottom: 0;">
<li><b>Discord:</b> <a href="https://discord.com/invite/7tsKMCbNFC" style="color: #0366d6; text-decoration: none;">Join our community</a> to generate videos with this LoRA for free</li>
<li><b>Request LoRAs:</b> We're training and open-sourcing Wan2.1 LoRAs for free - join our Discord to make requests!</li>
</ul>
</div>
</div>

<Gallery />


# Model File and Inference Workflow

## 📥 Download Links:

- [cakeify_16_epochs.safetensors](./cakeify_16_epochs.safetensors) - LoRA Model File
- [wan_img2vid_lora_workflow.json](./workflow/wan_img2vid_lora_workflow.json) - Wan I2V with LoRA Workflow for ComfyUI

---
<div style="background-color: #f8f9fa; padding: 20px; border-radius: 10px; margin-bottom: 20px;">
<div style="background-color: white; padding: 15px; border-radius: 8px; margin: 15px 0; box-shadow: 0 2px 4px rgba(0,0,0,0.1);">
<h2 style="color: #24292e; margin-top: 0;">Recommended Settings</h2>
<ul style="margin-bottom: 0;">
<li><b>LoRA Strength:</b> 1.0</li>
<li><b>Embedded Guidance Scale:</b> 6.0</li>
<li><b>Flow Shift:</b> 5.0</li>
</ul>
</div>

<div style="background-color: white; padding: 15px; border-radius: 8px; margin: 15px 0; box-shadow: 0 2px 4px rgba(0,0,0,0.1);">
<h2 style="color: #24292e; margin-top: 0;">Trigger Words</h2>
<p>The key trigger phrase is: <code style="background-color: #f0f0f0; padding: 3px 6px; border-radius: 4px;">c4k3 cakeify it</code></p>
</div>

<div style="background-color: white; padding: 15px; border-radius: 8px; margin: 15px 0; box-shadow: 0 2px 4px rgba(0,0,0,0.1);">
<h2 style="color: #24292e; margin-top: 0;">Prompt Template</h2>
<p>For best results, use this prompt structure:</p>
<div style="background-color: #f0f0f0; padding: 12px; border-radius: 6px; margin: 10px 0;">
<i>The video opens on a [object]. A knife, held by a hand, is coming into frame and hovering over the [object]. The knife then begins cutting into the [object] to c4k3 cakeify it. As the knife slices the [object] open, the inside of the [object] is revealed to be cake with chocolate layers. The knife cuts through and the contents of the [object] are revealed.</i>
</div>
<p>Simply replace every <code style="background-color: #f0f0f0; padding: 3px 6px; border-radius: 4px;">[object]</code> with whatever you want to see cakeified!</p>
</div>

<div style="background-color: white; padding: 15px; border-radius: 8px; margin: 15px 0; box-shadow: 0 2px 4px rgba(0,0,0,0.1);">
<h2 style="color: #24292e; margin-top: 0;">ComfyUI Workflow</h2>
<p>This LoRA works with a modified version of <a href="https://github.com/kijai/ComfyUI-WanVideoWrapper/blob/main/example_workflows/wanvideo_480p_I2V_example_02.json" style="color: #0366d6; text-decoration: none;">Kijai's Wan Video Wrapper workflow</a>. The main modification is adding a Wan LoRA node connected to the base model.</p>
<img src="./workflow/cakeify_workflow_screenshot.png" style="width: 100%; border-radius: 8px; margin: 15px 0; box-shadow: 0 4px 8px rgba(0,0,0,0.1);">
<p>See the Download Links section above for the modified workflow.</p>
</div>
</div>
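
The prompt template above can also be filled in from a script, which helps when generating prompts for many subjects at once. The snippet below is only a minimal sketch: the names `CAKEIFY_TEMPLATE` and `build_cakeify_prompt` are made up for illustration and are not part of this repository or of any library. The template text is copied from the Prompt Template section.

```python
# Prompt template copied from the Prompt Template section of this README.
# "[object]" marks every slot that should receive the subject of the image.
CAKEIFY_TEMPLATE = (
    "The video opens on a [object]. A knife, held by a hand, is coming into "
    "frame and hovering over the [object]. The knife then begins cutting into "
    "the [object] to c4k3 cakeify it. As the knife slices the [object] open, "
    "the inside of the [object] is revealed to be cake with chocolate layers. "
    "The knife cuts through and the contents of the [object] are revealed."
)


def build_cakeify_prompt(subject: str) -> str:
    """Return the template with every "[object]" replaced by `subject`.

    Hypothetical helper for illustration; the trigger phrase
    "c4k3 cakeify it" is left untouched because it is part of the template.
    """
    subject = subject.strip()
    if not subject:
        raise ValueError("subject must be a non-empty string")
    return CAKEIFY_TEMPLATE.replace("[object]", subject)


if __name__ == "__main__":
    # Print one prompt per subject, e.g. to paste into the ComfyUI text node.
    for name in ("puppy", "timberland boot", "cat"):
        print(build_cakeify_prompt(name))
        print()
```

The resulting string goes into the positive prompt of the workflow; the settings listed under Recommended Settings are configured in the workflow itself and are not affected by this helper.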

<div style="background-color: #f8f9fa; padding: 20px; border-radius: 10px; margin-bottom: 20px;">
<div style="background-color: white; padding: 15px; border-radius: 8px; margin: 15px 0; box-shadow: 0 2px 4px rgba(0,0,0,0.1);">
<h2 style="color: #24292e; margin-top: 0;">Model Information</h2>
<p>The model weights are available in Safetensors format. See the Download Links section above.</p>
</div>

<div style="background-color: white; padding: 15px; border-radius: 8px; margin: 15px 0; box-shadow: 0 2px 4px rgba(0,0,0,0.1);">
<h2 style="color: #24292e; margin-top: 0;">Training Details</h2>
<ul style="margin-bottom: 0;">
<li><b>Base Model:</b> Wan2.1 14B I2V 480p</li>
<li><b>Training Data:</b> 1 minute of video (13 short clips of things being cakeified, each clip captioned separately)</li>
<li><b>Epochs:</b> 16</li>
</ul>
</div>

<div style="background-color: white; padding: 15px; border-radius: 8px; margin: 15px 0; box-shadow: 0 2px 4px rgba(0,0,0,0.1);">
<h2 style="color: #24292e; margin-top: 0;">Additional Information</h2>
<p>Training was done using <a href="https://github.com/tdrussell/diffusion-pipe" style="color: #0366d6; text-decoration: none;">diffusion-pipe</a>.</p>
</div>

<div style="background-color: white; padding: 15px; border-radius: 8px; margin: 15px 0; box-shadow: 0 2px 4px rgba(0,0,0,0.1);">
<h2 style="color: #24292e; margin-top: 0;">Acknowledgments</h2>
<p style="margin-bottom: 0;">Special thanks to Kijai for the ComfyUI Wan Video Wrapper and tdrussell for the training scripts!</p>
</div>
</div>