valiantcat Qwen-Image-Edit LoRA

Prompt
turn the image into a giant figurine and take a photo with the figurine. The figurine has a large head and a rounded cartoon style. The scene is replaced with a gallery style.

Model Card for valiantcat/Qwen-Image-Edit-Giant-Figurine

This LoRA generates character statues and was trained on Qwen/Qwen-Image-Edit. Its main use is to turn the subject of an input photo into a giant figurine and produce a photo of the person posing with that figurine. It is intended for use in ComfyUI.

ComfyUI Workflow

This LoRA works with a modified version of Comfy's Qwen-Image-Edit workflow. The main modification is adding a Qwen-Image-Edit LoRA node connected to the base model.

See the Downloads section above for the modified workflow.
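As a rough illustration of what a LoRA node contributes when it is connected to the base model, the sketch below applies a low-rank update to a single weight matrix in pure Python. It is a conceptual example only, not ComfyUI's implementation. The names `apply_lora`, `lora_up`, `lora_down`, and `strength` are assumptions made for illustration, and the tiny matrices are made-up values.

```python
def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def apply_lora(base_weight, lora_down, lora_up, strength=1.0):
    """Return base_weight + strength * (lora_up @ lora_down).

    Hypothetical helper: it only illustrates the low-rank update idea.
    """
    delta = matmul(lora_up, lora_down)
    return [
        [w + strength * d for w, d in zip(w_row, d_row)]
        for w_row, d_row in zip(base_weight, delta)
    ]


# Toy 2x2 base weight with a rank-1 update (illustrative values only).
base = [[1.0, 0.0], [0.0, 1.0]]
lora_up = [[1.0], [2.0]]    # shape (2, 1)
lora_down = [[0.5, 0.5]]    # shape (1, 2)

patched = apply_lora(base, lora_down, lora_up, strength=1.0)
unchanged = apply_lora(base, lora_down, lora_up, strength=0.0)
```

With `strength=0.0` the base weight is returned unchanged, which is why lowering the LoRA strength moves the output back toward the base model's behavior.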

Direct Use

from diffusers import QwenImageEditPipeline
import torch
from PIL import Image

# Load the pipeline
pipeline = QwenImageEditPipeline.from_pretrained("Qwen/Qwen-Image-Edit")
pipeline.to(torch.bfloat16)
pipeline.to("cuda")

# Load the trained giant-figurine LoRA weights
pipeline.load_lora_weights(
    "valiantcat/Qwen-Image-Edit-Giant-Figurine",
    weight_name="qwen-edit-giant-figurine.safetensors",
)

# Load input image
image = Image.open("./result/test.jpg").convert("RGB")

# Define the giant-figurine editing prompt
prompt = "turn the image into a giant figurine and take a photo with the figurine. The figurine has a large head and a rounded cartoon style. The scene is replaced with a gallery style."

# Generate the edited image
inputs = {
    "image": image,
    "prompt": prompt,
    "generator": torch.manual_seed(12345),
    "true_cfg_scale": 4.0,
    "negative_prompt": " ",
    "num_inference_steps": 50,
}

with torch.inference_mode():
    output = pipeline(**inputs)
    output_image = output.images[0]
    output_image.save("result.png")

Trigger phrase

turn the image into a giant figurine and take a photo with the figurine. The figurine has a large head and a rounded cartoon style. The scene is replaced with a gallery style.

There is no fixed trigger word. The prompt above is the one shown to work; which other prompt wordings work well still needs more testing.
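Since there is no fixed trigger word, one way to test prompts systematically is to vary one part of the prompt at a time. The sketch below is a minimal example of that idea. `build_prompt` is a hypothetical helper, and the alternative scene names are untested assumptions rather than recommendations. With its default arguments it reproduces the prompt above exactly.

```python
def build_prompt(
    figurine_traits="a large head and a rounded cartoon style",
    scene="gallery",
):
    """Assemble an editing prompt from the sentence pattern used in this card.

    Hypothetical helper: only the default wording comes from the card.
    """
    return (
        "turn the image into a giant figurine and take a photo with the figurine. "
        f"The figurine has {figurine_traits}. "
        f"The scene is replaced with a {scene} style."
    )


# The default call gives the card's prompt; the other scenes are untested guesses.
candidate_scenes = ["gallery", "museum", "street"]
candidate_prompts = [build_prompt(scene=scene) for scene in candidate_scenes]
```

Each entry in `candidate_prompts` can be passed as `prompt` to the pipeline call under Direct Use, with the seed kept fixed so that differences in the output come from the prompt alone.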