---
license: openrail++
language:
- en
base_model:
- PixArt-alpha/PixArt-XL-2-1024-MS
pipeline_tag: text-to-image
tags:
- pixart
- gguf-node
widget:
- text: a close-up shot of a beautiful girl in a serene world. She has white hair
    and is blindfolded, with a calm expression. Her hands are pressed together in
    a prayer pose, with fingers interlaced and palms touching. The background is softly
    blurred, enhancing her ethereal presence.
  parameters:
    negative_prompt: blurry, cropped, ugly
  output:
    url: samples\ComfyUI_00007_.png
- text: a wizard with a glowing staff and a glowing hat, colorful magic, dramatic
    atmosphere, sharp focus, highly detailed, cinematic, original composition, fine
    detail, intricate, elegant, creative, color spread, shiny, amazing, symmetry,
    illuminated, inspired, pretty, attractive, artistic, dynamic background, relaxed,
    professional, extremely inspirational, beautiful, determined, cute, adorable,
    best
  parameters:
    negative_prompt: blurry, cropped, ugly
  output:
    url: samples\ComfyUI_00008_.png
- text: a girl stands amidst scattered glass shards, surrounded by a beautifully crafted
    and expansive world. The scene is depicted from a dynamic angle, emphasizing her
    determined expression. The background features vast landscapes with floating crystals
    and soft, glowing lights that create a mystical and grand atmosphere.
  parameters:
    negative_prompt: blurry, cropped, ugly
  output:
    url: samples\ComfyUI_00009_.png
- text: close-up portrait of girl
  output:
    url: samples\ComfyUI_00001_.png
- text: close-up portrait of cat
  output:
    url: samples\ComfyUI_00002_.png
- text: close-up portrait of young lady
  output:
    url: samples\ComfyUI_00003_.png
---

# **gguf quantized version of pixart**

<Gallery />

## **setup (once)**
- drag pixart-xl-2-1024-ms-q4_k_m.gguf [[1GB](https://huggingface.co/calcuis/pixart/blob/main/pixart-xl-2-1024-ms-q4_k_m.gguf)] to > ./ComfyUI/models/diffusion_models
- drag t5xxl_fp16-q4_0.gguf [[2.9GB](https://huggingface.co/calcuis/pixart/blob/main/t5xxl_fp16-q4_0.gguf)] to > ./ComfyUI/models/text_encoders
- drag pixart_vae_fp8_e4m3fn.safetensors [[83.7MB](https://huggingface.co/calcuis/pixart/blob/main/pixart_vae_fp8_e4m3fn.safetensors)] to > ./ComfyUI/models/vae

## **run it straight (no installation needed way)**
- run the .bat file in the main directory (assuming you are using the gguf-node [pack](https://github.com/calcuis/gguf/releases) below)
- drag the workflow json file (below) or the demo picture above to > your browser

### **workflow**
- example workflow for [gguf](https://huggingface.co/calcuis/pixart/blob/main/workflow-pixart-gguf.json)
- example workflow for [safetensors](https://huggingface.co/calcuis/pixart/blob/main/workflow-pixart-safetensors.json)

### review
- should set the output image size according to the model stated, i.e., 1024x1024 or 512x512
- pixart-xl-2-1024-ms and pixart-sigma-xl-2-1024-ms are recommended (with 1024x1024 size)
- small size model but good quality pictures; and t5 encoder allows you inputting short description or sentence instead of tag(s)
- more quantized versions of t5xxl encoder can be found [here](https://huggingface.co/chatpig/t5xxl/tree/main)
- upgrade your gguf-node (see the last item in reference list below) to the latest version for pixart model support

### **paper**
- [pixart-α](https://arxiv.org/pdf/2310.00426)
- [pixart-Σ](https://arxiv.org/pdf/2403.04692)
- [high-resolution image synthesis](https://arxiv.org/pdf/2112.10752)

### **reference**
- base model from [pixart-alpha](https://huggingface.co/PixArt-alpha)
- comfyui [comfyanonymous](https://github.com/comfyanonymous/ComfyUI)
- gguf-node ([pypi](https://pypi.org/project/gguf-node)|[repo](https://github.com/calcuis/gguf)|[pack](https://github.com/calcuis/gguf/releases))