zhang
AI & ML interests
Recent Activity
Organizations
-
allenai/olmOCR-7B-0225-preview
Image-to-Text β’ 8B β’ Updated β’ 168k β’ 687 -
Running on Zero7777
Nanonets OCR
πDemo for Nanonets-OCR
-
Running on ZeroMCP319319
OCR
πolmocr / nanonets ocr / rolmocr / qwen2vl ocr / aya vision
-
Running on ZeroMCP115115
OCR2
π»monkey ocr / nanonets ocr / smoldocling / typhoon ocr
-
Running on Zero1.47k1.47k
Flux.1-dev Upscaler
πUpscale an image to higher resolution
-
Running on Zero410410
InvSR
πImage Super-resolution via Diffusion Inversion
-
Running on Zero237237
FLUX Upsacle Image
π₯Upscale and enhance images
-
Runtime error277277
Thera Arbitrary-Scale Super-Resolution
π₯Enhance image quality by scaling it up
-
Djrango/Qwen2vl-Flux
Text-to-Image β’ Updated β’ 505 -
Running on Zero908908
OminiControl
πGenerate images using text prompts and condition images
-
Running on Zero379379
FLUXllama
π¦mcp_server & FLUX 4-bit Quantization(just 8GB VRAM)
-
Running on L42.03k2.03k
MagicQuill
πͺΆEdit and enhance images using scribbles and prompts
-
Running77
Browser only - Screen Capture & OCR
πOne-minute creation by AI Coding Autonomous Agent MOUSE-I
-
Running550550
First Agent Template
β‘Fetch local time in specified timezone
-
Running on T4127127
OctoTools
πAn Agentic Framework with Tools for Complex Reasoning
-
Running136136
smolagents LLM leaderboard
πA leaderboard for LLMs powering smolagents
-
LiuZichen/MagicQuill-models
Image-to-Image β’ Updated β’ 56 -
Running on L40S573573
Leffa
πGenerate images by trying on clothes or transferring poses
-
Running6161
Dokdo
β‘Image to Video Generation
-
Running6666
Llama-4-Maverick-17B-search
πGenerate answers using chat with optional web search
-
Running on Zero1.4k1.4k
Joy Caption Alpha Two
πGenerate captions for images in various styles
-
Running on Zero4040
Florence Llama
π¬Generate responses using images and text
-
trollek/ImagePromptHelper-danube3-500M
Text Generation β’ 0.5B β’ Updated β’ 518 β’ 2 -
trollek/ImagePromptHelper-danube3-500M-GGUF
0.5B β’ Updated β’ 377 β’ 1
-
laion/laion-audio-preview
Viewer β’ Updated β’ 4.15M β’ 1.98k β’ 11 -
Running on Zero2.54k2.54k
F5-TTS
π£F5-TTS & E2-TTS: Zero-Shot Voice Cloning (Unofficial Demo)
-
Running on L40S2.11k2.11k
FacePoke
πImport a portrait, click to move the head!
-
Running on L4615615
OpenAudio S1
πGenerate audio from text
-
Running77
Browser only - Screen Capture & OCR
πOne-minute creation by AI Coding Autonomous Agent MOUSE-I
-
Running550550
First Agent Template
β‘Fetch local time in specified timezone
-
Running on T4127127
OctoTools
πAn Agentic Framework with Tools for Complex Reasoning
-
Running136136
smolagents LLM leaderboard
πA leaderboard for LLMs powering smolagents
-
allenai/olmOCR-7B-0225-preview
Image-to-Text β’ 8B β’ Updated β’ 168k β’ 687 -
Running on Zero7777
Nanonets OCR
πDemo for Nanonets-OCR
-
Running on ZeroMCP319319
OCR
πolmocr / nanonets ocr / rolmocr / qwen2vl ocr / aya vision
-
Running on ZeroMCP115115
OCR2
π»monkey ocr / nanonets ocr / smoldocling / typhoon ocr
-
LiuZichen/MagicQuill-models
Image-to-Image β’ Updated β’ 56 -
Running on L40S573573
Leffa
πGenerate images by trying on clothes or transferring poses
-
Running6161
Dokdo
β‘Image to Video Generation
-
Running6666
Llama-4-Maverick-17B-search
πGenerate answers using chat with optional web search
-
Running on Zero1.4k1.4k
Joy Caption Alpha Two
πGenerate captions for images in various styles
-
Running on Zero4040
Florence Llama
π¬Generate responses using images and text
-
trollek/ImagePromptHelper-danube3-500M
Text Generation β’ 0.5B β’ Updated β’ 518 β’ 2 -
trollek/ImagePromptHelper-danube3-500M-GGUF
0.5B β’ Updated β’ 377 β’ 1
-
Running on Zero1.47k1.47k
Flux.1-dev Upscaler
πUpscale an image to higher resolution
-
Running on Zero410410
InvSR
πImage Super-resolution via Diffusion Inversion
-
Running on Zero237237
FLUX Upsacle Image
π₯Upscale and enhance images
-
Runtime error277277
Thera Arbitrary-Scale Super-Resolution
π₯Enhance image quality by scaling it up
-
Djrango/Qwen2vl-Flux
Text-to-Image β’ Updated β’ 505 -
Running on Zero908908
OminiControl
πGenerate images using text prompts and condition images
-
Running on Zero379379
FLUXllama
π¦mcp_server & FLUX 4-bit Quantization(just 8GB VRAM)
-
Running on L42.03k2.03k
MagicQuill
πͺΆEdit and enhance images using scribbles and prompts
-
laion/laion-audio-preview
Viewer β’ Updated β’ 4.15M β’ 1.98k β’ 11 -
Running on Zero2.54k2.54k
F5-TTS
π£F5-TTS & E2-TTS: Zero-Shot Voice Cloning (Unofficial Demo)
-
Running on L40S2.11k2.11k
FacePoke
πImport a portrait, click to move the head!
-
Running on L4615615
OpenAudio S1
πGenerate audio from text