--- license: openrail++ language: - en base_model: - PixArt-alpha/PixArt-XL-2-1024-MS pipeline_tag: text-to-image tags: - pixart - gguf-node widget: - text: a close-up shot of a beautiful girl in a serene world. She has white hair and is blindfolded, with a calm expression. Her hands are pressed together in a prayer pose, with fingers interlaced and palms touching. The background is softly blurred, enhancing her ethereal presence. parameters: negative_prompt: blurry, cropped, ugly output: url: samples\ComfyUI_00007_.png - text: a wizard with a glowing staff and a glowing hat, colorful magic, dramatic atmosphere, sharp focus, highly detailed, cinematic, original composition, fine detail, intricate, elegant, creative, color spread, shiny, amazing, symmetry, illuminated, inspired, pretty, attractive, artistic, dynamic background, relaxed, professional, extremely inspirational, beautiful, determined, cute, adorable, best parameters: negative_prompt: blurry, cropped, ugly output: url: samples\ComfyUI_00008_.png - text: a girl stands amidst scattered glass shards, surrounded by a beautifully crafted and expansive world. The scene is depicted from a dynamic angle, emphasizing her determined expression. The background features vast landscapes with floating crystals and soft, glowing lights that create a mystical and grand atmosphere. parameters: negative_prompt: blurry, cropped, ugly output: url: samples\ComfyUI_00009_.png - text: close-up portrait of girl output: url: samples\ComfyUI_00001_.png - text: close-up portrait of cat output: url: samples\ComfyUI_00002_.png - text: close-up portrait of young lady output: url: samples\ComfyUI_00003_.png --- # **gguf quantized version of pixart** ## **setup (once)** - drag pixart-xl-2-1024-ms-q4_k_m.gguf [[1GB](https://huggingface.co/calcuis/pixart/blob/main/pixart-xl-2-1024-ms-q4_k_m.gguf)] to > ./ComfyUI/models/diffusion_models - drag t5xxl_fp16-q4_0.gguf [[2.9GB](https://huggingface.co/calcuis/pixart/blob/main/t5xxl_fp16-q4_0.gguf)] to > ./ComfyUI/models/text_encoders - drag pixart_vae_fp8_e4m3fn.safetensors [[83.7MB](https://huggingface.co/calcuis/pixart/blob/main/pixart_vae_fp8_e4m3fn.safetensors)] to > ./ComfyUI/models/vae ## **run it straight (no installation needed way)** - run the .bat file in the main directory (assuming you are using the gguf-node [pack](https://github.com/calcuis/gguf/releases) below) - drag the workflow json file (below) or the demo picture above to > your browser ### **workflow** - example workflow for [gguf](https://huggingface.co/calcuis/pixart/blob/main/workflow-pixart-gguf.json) - example workflow for [safetensors](https://huggingface.co/calcuis/pixart/blob/main/workflow-pixart-safetensors.json) ### review - should set the output image size according to the model stated, i.e., 1024x1024 or 512x512 - pixart-xl-2-1024-ms and pixart-sigma-xl-2-1024-ms are recommended (with 1024x1024 size) - small size model but good quality pictures; and t5 encoder allows you inputting short description or sentence instead of tag(s) - more quantized versions of t5xxl encoder can be found [here](https://huggingface.co/chatpig/t5xxl/tree/main) - upgrade your gguf-node (see the last item in reference list below) to the latest version for pixart model support ### **paper** - [pixart-α](https://arxiv.org/pdf/2310.00426) - [pixart-Σ](https://arxiv.org/pdf/2403.04692) - [high-resolution image synthesis](https://arxiv.org/pdf/2112.10752) ### **reference** - base model from [pixart-alpha](https://huggingface.co/PixArt-alpha) - comfyui [comfyanonymous](https://github.com/comfyanonymous/ComfyUI) - gguf-node ([pypi](https://pypi.org/project/gguf-node)|[repo](https://github.com/calcuis/gguf)|[pack](https://github.com/calcuis/gguf/releases))