valiantcat Qwen-Image-Edit LoRA

Prompt
turn the image into a giant figurine and take a photo with the figurine. The figurine has a large head and a rounded cartoon style. The scene is replaced with a gallery style.

Model Card for valiantcat/Qwen-Image-Edit-Giant-Figurine

This LoRA generates character statues. It was trained on Qwen/Qwen-Image-Edit and is mainly used to produce a photo of the subject posing with a giant figurine of themselves. It is intended for use in ComfyUI.

ComfyUI Workflow

This LoRA works with a modified version of Comfy's Qwen-Image-Edit workflow. The main modification is adding a Qwen-Image-Edit LoRA node connected to the base model.

See the Downloads section above for the modified workflow.

Direct Use

from diffusers import QwenImageEditPipeline
import torch
from PIL import Image

# Load the pipeline
pipeline = QwenImageEditPipeline.from_pretrained("Qwen/Qwen-Image-Edit")
pipeline.to(torch.bfloat16)
pipeline.to("cuda")

# Load the trained giant-figurine LoRA weights
pipeline.load_lora_weights(
    "valiantcat/Qwen-Image-Edit-Giant-Figurine",
    weight_name="qwen-edit-giant-figurine.safetensors",
)

# Load input image
image = Image.open("./result/test.jpg").convert("RGB")

# Define the editing prompt
prompt = "turn the image into a giant figurine and take a photo with the figurine. The figurine has a large head and a rounded cartoon style. The scene is replaced with a gallery style."

# Generate the edited image
inputs = {
    "image": image,
    "prompt": prompt,
    "generator": torch.manual_seed(12345),
    "true_cfg_scale": 4.0,
    "negative_prompt": " ",
    "num_inference_steps": 50,
}

with torch.inference_mode():
    output = pipeline(**inputs)
    output_image = output.images[0]
    output_image.save("result.png")
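
The example above uses a single fixed seed. To compare several seeds with otherwise identical settings, the call arguments can be planned up front. The following is a minimal sketch, not part of the original card: the helper name `plan_seed_sweep`, the `figurine_seed<N>.png` naming scheme, and the second seed value are illustrative assumptions. The sampling values are copied from the example above.

```python
from pathlib import Path

# Prompt copied from the Direct Use example.
PROMPT = (
    "turn the image into a giant figurine and take a photo with the figurine. "
    "The figurine has a large head and a rounded cartoon style. "
    "The scene is replaced with a gallery style."
)


def plan_seed_sweep(prompt, seeds, out_dir="result"):
    """Return one job per seed: the seed, the pipeline kwargs, and an output path.

    Hypothetical helper for illustration; it only builds plain Python data
    and does not touch the pipeline itself.
    """
    jobs = []
    for seed in seeds:
        jobs.append(
            {
                "seed": seed,
                "kwargs": {
                    "prompt": prompt,
                    "true_cfg_scale": 4.0,
                    "negative_prompt": " ",
                    "num_inference_steps": 50,
                },
                "output_path": Path(out_dir) / f"figurine_seed{seed}.png",
            }
        )
    return jobs


jobs = plan_seed_sweep(PROMPT, seeds=[12345, 12346])

# With `pipeline`, `image`, and `torch` set up as in the example above,
# each job could then be run like this (the output directory must exist):
#
# for job in jobs:
#     generator = torch.manual_seed(job["seed"])
#     out = pipeline(image=image, generator=generator, **job["kwargs"])
#     out.images[0].save(job["output_path"])
```

Keeping the seed in the file name makes it easy to reproduce a result that turned out well.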

Trigger phrase

turn the image into a giant figurine and take a photo with the figurine. The figurine has a large head and a rounded cartoon style. The scene is replaced with a gallery style.

There is no fixed trigger word. The exact prompt wording needs further testing.
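
Since the wording still needs testing, one way to explore it is to vary the figurine and scene descriptions while keeping the opening instruction fixed. The sketch below is an assumption about how such variants might be assembled, not a recipe from the authors: `build_prompt_variants` is a hypothetical helper, and "a museum style" is an illustrative value that has not been verified with this LoRA.

```python
from itertools import product

# Opening instruction taken from the trigger phrase above.
BASE_ACTION = "turn the image into a giant figurine and take a photo with the figurine."


def build_prompt_variants(figurine_styles, scene_styles):
    """Return one prompt per (figurine style, scene style) combination."""
    prompts = []
    for figurine, scene in product(figurine_styles, scene_styles):
        prompts.append(
            f"{BASE_ACTION} The figurine has {figurine}. "
            f"The scene is replaced with {scene}."
        )
    return prompts


variants = build_prompt_variants(
    figurine_styles=["a large head and a rounded cartoon style"],
    # "a museum style" is a hypothetical variation for testing only.
    scene_styles=["a gallery style", "a museum style"],
)

for variant in variants:
    print(variant)
```

The first variant reproduces the trigger phrase exactly, so it can serve as the baseline when comparing results.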

Download model

Weights for this model are available in Safetensors format.

Download

Training at Chongqing Valiant Cat

This model was trained by the AI Laboratory of Chongqing Valiant Cat Technology Co., LTD (https://vvicat.com/). Business cooperation is welcome.
