Upload README.md with huggingface_hub
Browse files
README.md
CHANGED
|
@@ -1,3 +1,122 @@
|
|
| 1 |
-
---
|
| 2 |
-
license: apache-2.0
|
| 3 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
---
|
| 2 |
+
license: apache-2.0
|
| 3 |
+
language:
|
| 4 |
+
- en
|
| 5 |
+
base_model:
|
| 6 |
+
- Wan-AI/Wan2.1-I2V-14B-480P
|
| 7 |
+
- Wan-AI/Wan2.1-I2V-14B-480P-Diffusers
|
| 8 |
+
pipeline_tag: image-to-video
|
| 9 |
+
tags:
|
| 10 |
+
- text-to-image
|
| 11 |
+
- lora
|
| 12 |
+
- diffusers
|
| 13 |
+
- template:diffusion-lora
|
| 14 |
+
- image-to-video
|
| 15 |
+
widget:
|
| 16 |
+
- text: >-
|
| 17 |
+
A man with short brown hair wearing a white shirt and a dark coat stands in the red neon light of a motel room doorway. He looks back towards the motel room. The camera performs a cr34sh crash zoom in effect, rapidly zooming closer to the man's face. He turns with a shocked expression, as if he heard a noise, and reaches for his pocket.
|
| 18 |
+
output:
|
| 19 |
+
url: example_videos/1.mp4
|
| 20 |
+
- text: >-
|
| 21 |
+
A young woman with red hair in a ponytail, wearing a t-shirt and jeans, sits in a wooden chair, facing away from the camera, in a room filled with dozens of old CRT televisions, each displaying different images. The camera performs a cr34sh crash zoom in effect, rapidly zooming closer to the woman's face as she turns her head, looking directly at the viewer with a mixture of curiosity and confusion. The image on the central TV begins to change, reflecting the scene.
|
| 22 |
+
output:
|
| 23 |
+
url: example_videos/2.mp4
|
| 24 |
+
---
|
| 25 |
+
|
| 26 |
+
<div style="background-color: #f8f9fa; padding: 20px; border-radius: 10px; margin-bottom: 20px;">
|
| 27 |
+
<h1 style="color: #24292e; margin-top: 0;">Crash zoom in LoRA for Wan2.1 14B I2V 480p</h1>
|
| 28 |
+
|
| 29 |
+
<div style="background-color: white; padding: 15px; border-radius: 8px; margin: 15px 0; box-shadow: 0 2px 4px rgba(0,0,0,0.1);">
|
| 30 |
+
<h2 style="color: #24292e; margin-top: 0;">Overview</h2>
|
| 31 |
+
<p>Abruptly zooms in on the subject, typically the face, to heighten drama, surprise, or comedic timing. Ideal for stylized edits, reaction shots, or sudden emotional emphasis.This LoRA is trained on the Wan2.1 14B I2V 480p model.
|
| 32 |
+
</p>
|
| 33 |
+
</div>
|
| 34 |
+
|
| 35 |
+
<div style="background-color: white; padding: 15px; border-radius: 8px; margin: 15px 0; box-shadow: 0 2px 4px rgba(0,0,0,0.1);">
|
| 36 |
+
<h2 style="color: #24292e; margin-top: 0;">Features</h2>
|
| 37 |
+
<ul style="margin-bottom: 0;">
|
| 38 |
+
<li>Trained on the Wan2.1 14B 480p I2V base model</li>
|
| 39 |
+
<li>Consistent results across different object types</li>
|
| 40 |
+
<li>Simple prompt structure that's easy to adapt</li>
|
| 41 |
+
</ul>
|
| 42 |
+
</div>
|
| 43 |
+
|
| 44 |
+
<div style="background-color: white; padding: 15px; border-radius: 8px; margin: 15px 0; box-shadow: 0 2px 4px rgba(0,0,0,0.1);">
|
| 45 |
+
<h2 style="color: #24292e; margin-top: 0;">Community</h2>
|
| 46 |
+
<ul style="margin-bottom: 0;">
|
| 47 |
+
<li>
|
| 48 |
+
Generate videos with 100+ Camera Control and VFX LoRAs on the
|
| 49 |
+
<a href="https://app.remade.ai/canvas/create" style="color: #0366d6; text-decoration: none;">Remade Canvas</a>.
|
| 50 |
+
</li>
|
| 51 |
+
<li>
|
| 52 |
+
<b>Discord:</b>
|
| 53 |
+
<a href="https://remade.ai/join-discord?utm_source=Huggingface&utm_medium=Social&utm_campaign=model_release&utm_content=crane_up" style="color: #0366d6; text-decoration: none;">
|
| 54 |
+
Join our community
|
| 55 |
+
</a> to generate videos with this LoRA for free
|
| 56 |
+
</li>
|
| 57 |
+
</ul>
|
| 58 |
+
</div>
|
| 59 |
+
|
| 60 |
+
<Gallery />
|
| 61 |
+
|
| 62 |
+
# Model File and Inference Workflow
|
| 63 |
+
|
| 64 |
+
## 📥 Download Links:
|
| 65 |
+
|
| 66 |
+
- [crash_zoom_in.safetensors](./crash_zoom_in.safetensors) - LoRA Model File
|
| 67 |
+
- [wan_img2vid_lora_workflow.json](./workflow_I2V/wan_img2vid_lora_workflow.json) - Wan I2V with LoRA Workflow for ComfyUI
|
| 68 |
+
|
| 69 |
+
---
|
| 70 |
+
<div style="background-color: #f8f9fa; padding: 20px; border-radius: 10px; margin-bottom: 20px;">
|
| 71 |
+
<div style="background-color: white; padding: 15px; border-radius: 8px; margin: 15px 0; box-shadow: 0 2px 4px rgba(0,0,0,0.1);">
|
| 72 |
+
<h2 style="color: #24292e; margin-top: 0;">Recommended Settings</h2>
|
| 73 |
+
<ul style="margin-bottom: 0;">
|
| 74 |
+
<li><b>LoRA Strength:</b> 1.0</li>
|
| 75 |
+
<li><b>Embedded Guidance Scale:</b> 6.0</li>
|
| 76 |
+
<li><b>Flow Shift:</b> 5.0</li>
|
| 77 |
+
</ul>
|
| 78 |
+
</div>
|
| 79 |
+
|
| 80 |
+
<div style="background-color: white; padding: 15px; border-radius: 8px; margin: 15px 0; box-shadow: 0 2px 4px rgba(0,0,0,0.1);">
|
| 81 |
+
<h2 style="color: #24292e; margin-top: 0;">Trigger Words</h2>
|
| 82 |
+
<p>The key trigger phrase is: <code style="background-color: #f0f0f0; padding: 3px 6px; border-radius: 4px;">cr34sh crash zoom in effect</code></p>
|
| 83 |
+
</div>
|
| 84 |
+
|
| 85 |
+
<div style="background-color: white; padding: 15px; border-radius: 8px; margin: 15px 0; box-shadow: 0 2px 4px rgba(0,0,0,0.1);">
|
| 86 |
+
<h2 style="color: #24292e; margin-top: 0;">Prompt Template</h2>
|
| 87 |
+
<p>For prompting, check out the example prompts; this way of prompting seems to work very well.</p>
|
| 88 |
+
|
| 89 |
+
|
| 90 |
+
<div style="background-color: white; padding: 15px; border-radius: 8px; margin: 15px 0; box-shadow: 0 2px 4px rgba(0,0,0,0.1);">
|
| 91 |
+
<h2 style="color: #24292e; margin-top: 0;">ComfyUI Workflow</h2>
|
| 92 |
+
<p>This LoRA works with a modified version of <a href="https://github.com/kijai/ComfyUI-WanVideoWrapper/blob/main/example_workflows/wanvideo_480p_I2V_example_02.json" style="color: #0366d6; text-decoration: none;">Kijai's Wan Video Wrapper workflow</a>. The main modification is adding a Wan LoRA node connected to the base model.</p>
|
| 93 |
+
<img src="./workflow_I2V/workflow_screenshot.png" style="width: 100%; border-radius: 8px; margin: 15px 0; box-shadow: 0 4px 8px rgba(0,0,0,0.1);">
|
| 94 |
+
<p>See the Downloads section above for the modified workflow.</p>
|
| 95 |
+
</div>
|
| 96 |
+
</div>
|
| 97 |
+
|
| 98 |
+
<div style="background-color: #f8f9fa; padding: 20px; border-radius: 10px; margin-bottom: 20px;">
|
| 99 |
+
<div style="background-color: white; padding: 15px; border-radius: 8px; margin: 15px 0; box-shadow: 0 2px 4px rgba(0,0,0,0.1);">
|
| 100 |
+
<h2 style="color: #24292e; margin-top: 0;">Model Information</h2>
|
| 101 |
+
<p>The model weights are available in Safetensors format. See the Downloads section above.</p>
|
| 102 |
+
</div>
|
| 103 |
+
|
| 104 |
+
<div style="background-color: white; padding: 15px; border-radius: 8px; margin: 15px 0; box-shadow: 0 2px 4px rgba(0,0,0,0.1);">
|
| 105 |
+
<h2 style="color: #24292e; margin-top: 0;">Training Details</h2>
|
| 106 |
+
<ul style="margin-bottom: 0;">
|
| 107 |
+
<li><b>Base Model:</b> Wan2.1 14B I2V 480p</li>
|
| 108 |
+
<li><b>Training Data:</b> Trained on 50 seconds of video comprised of 10 short clips (each clip captioned separately) of scenes that used the crash zoom in camera motion.</li>
|
| 109 |
+
<li><b> Epochs:</b> 30</li>
|
| 110 |
+
</ul>
|
| 111 |
+
</div>
|
| 112 |
+
|
| 113 |
+
<div style="background-color: white; padding: 15px; border-radius: 8px; margin: 15px 0; box-shadow: 0 2px 4px rgba(0,0,0,0.1);">
|
| 114 |
+
<h2 style="color: #24292e; margin-top: 0;">Additional Information</h2>
|
| 115 |
+
<p>Training was done using <a href="https://github.com/tdrussell/diffusion-pipe" style="color: #0366d6; text-decoration: none;">Diffusion Pipe for Training</a></p>
|
| 116 |
+
</div>
|
| 117 |
+
|
| 118 |
+
<div style="background-color: white; padding: 15px; border-radius: 8px; margin: 15px 0; box-shadow: 0 2px 4px rgba(0,0,0,0.1);">
|
| 119 |
+
<h2 style="color: #24292e; margin-top: 0;">Acknowledgments</h2>
|
| 120 |
+
<p style="margin-bottom: 0;">Special thanks to Kijai for the ComfyUI Wan Video Wrapper and tdrussell for the training scripts!</p>
|
| 121 |
+
</div>
|
| 122 |
+
</div>
|