AI & ML interests

None defined yet.

Recent Activity

Nymbo 
posted an update 1 day ago
view post
Post
225
I have a few updates to my MCP server I wanna share: New Memory tool, improvements to web search & speech generation.

# Memory_Manager Tool

We now have a Memory_Manager tool. Ask ChatGPT to write all its memories verbatim, then tell gpt-oss-20b to save each one using the tool, then take them anywhere! It stores memories in a memories.json file in the repo, no external database required.

The Memory_Manager tool is currently hidden from the HF space because it's intended for local use. It's enabled by providing a HF_READ_TOKEN in the env secrets, although it doesn't actually use the key for anything. There's probably a cleaner way of ensuring memory is only used locally, I'll come back to this.

# Fetch & Websearch

The Fetch_Webpage tool has been simplified a lot. It now converts the page to Markdown and returns the page with three length settings (Brief, Standard, Full). This is a lot more reliable than the old custom extraction method.

The Search_DuckDuckGo tool has a few small improvements. The input is easier for small models to get right, and the output is more readable.

# Speech Generation

I've added the remaining voices for Kokoro-82M, it now supports all 54 voices with all accents/languages.

I also removed the 30 second cap by making sure it computes all chunks in sequence, not just the first. I've tested it on outputs that are ~10 minutes long. Do note that when used as an MCP server, the tool will timeout after 1 minute, nothing I can do about that for right now.

# Other Thoughts

Lots of MCP use cases involve manipulating media (image editing, ASR, etc.). I've avoided adding tools like this so far for two reasons:

1. Most of these solutions would require assigning it a ZeroGPU slot.
2. The current process of uploading files like images to a Gradio space is still a bit rough. It's doable but requires additional tools.

Both of these points make it a bit painful for local usage. I'm open to suggestions for other tools that rely on text.
ginipick 
posted an update 11 days ago
view post
Post
3909
🍌 Nano Banana + Video: AI Image Style Transfer & Video Generation Tool

🎨 Key Features
1️⃣ Image Style Transfer

ginigen/Nano-Banana-Video

📸 Upload up to 2 images for style fusion
✨ High-quality image generation with Google Nano Banana model
🎭 Apply desired styles with text prompts

2️⃣ Video Generation

🎬 Convert generated images to videos
📐 Maintain original aspect ratio option
⏱️ Adjustable duration (1-4 seconds)

🚀 How to Use
Step-by-Step Guide
Step 1: Image Generation 🖼️

Enter style description
Upload 1-2 images (optional)
Click "Generate Magic ✨"

Step 2: Video Creation 📹

Send generated image to video tab
Set animation style
Generate video!

💡 Use Cases

🏞️ Transform landscape photos into artistic masterpieces
🤖 Bring static images to life
🎨 Mix styles from two different images
📱 Create short videos for social media

⚡ Tech Stack
Google Nano Banana Stable Video Diffusion Gradio Replicate API

#AIVideoGenerator #ImageToVideoConverter #StyleTransferAI #GoogleNanoBanana #StableVideoDiffusion #AIAnimationTool #TextToVideo #ImageAnimationSoftware #AIArtGenerator #VideoCreationTool #MachineLearningVideo #DeepLearningAnimation #HuggingFaceSpaces #ReplicateAPI #GradioApplication #ZeroGPUComputing #AIStyleMixing #AutomatedVideoProduction #NeuralStyleTransfer #AIPoweredCreativity
ginipick 
posted an update 13 days ago
view post
Post
3377
🎉 Fashion Fit 360: The New Standard in AI Virtual Try-On!

🚀 Now Live and Free to Use!Say goodbye to online shopping uncertainty - "Will this look good on me?" - with our revolutionary solution!Fashion Fit 360 is a cutting-edge AI-powered virtual fitting service that transforms your fashion shopping experience.

LINK: ginigen/Fashion-Fit360

✨ Core Features
🔄 360-Degree Multi-Pose Generation
Transform a single front-facing photo into 6 different viewing angles!
Front, side, and back views for complete visualization
Experience a real fitting room mirror effect
Check fit and style from every perspective

👗 15 Fashion Item Categories
Apparel: Tops, bottoms, dresses
Jewelry: Necklaces, earrings, rings, bracelets
Accessories: Sunglasses, eyewear, hats, ties, bow ties, belts
Essentials: Bags, shoes

🎯 Perfect For:
🛍️ Online Shopping Enthusiasts: Preview before purchase - zero return hassles!
💍 Jewelry Lovers: Virtually try expensive pieces before investing
🎁 Thoughtful Gift-Givers: Test items on recipient photos beforehand
👔 Business Professionals: Preview suit and tie combinations
👗 Fashion Designers: Rapidly visualize design samples

💡 Why Fashion Fit 360?Fashion Fit 360 delivers innovation beyond conventional services.While most virtual fitting platforms only support clothing, we offer complete support for 15 accessory types. Unlike competitors providing only front views, Fashion Fit 360 generates 6 poses for true 360-degree visualization, ensuring you can verify actual fit perfectly.Performance is unmatched - get results in under 20 seconds with one-click simplicity and no complex configurations. Plus, download all generated images as a convenient ZIP file, eliminating tedious individual saves.

🔥 Key Differentiators
🎨 360-Degree Multi-Pose Image Generation
🤖 FLUX.1-Fill based OmniTry integrated model with Flux.1 KONTEXT LoRA technology
Nymbo 
posted an update 14 days ago
view post
Post
741
I built a general use MCP space ~ Fetch webpages, DuckDuckGo search, Python code execution, Kokoro TTS, Image Gen, Video Gen.

# Tools

1. Fetch webpage
2. Web search via DuckDuckGo (very concise, low excess context)
3. Python code executor
4. Kokoro-82M speech generation
5. Image Generation (use any model from HF Inference Providers)
6. Video Generation (use any model from HF Inference Providers)

The first four tools can be used without any API keys whatsoever. DDG search is free and the code execution and speech gen is done on CPU. Having a HF_READ_TOKEN in the env variables will show all tools. If there isn't a key present, The Image/Video Gen tools are hidden.

Nymbo/Tools
ginipick 
posted an update 22 days ago
view post
Post
3327
✨ HairPick | Preview Your Perfect Hair Transformation in 360° ✨

🎊 Free Trial for Hugging Face Launch! Hurry! ⏰
Hello! Introducing an innovative AI service that helps you choose the perfect hairstyle without any regrets before visiting the salon!

🎯 Try It Now
ginigen/Hair-Pick

🔄 What Makes HairPick Special? 360° Complete Preview!
Other hair simulators only show the front view? 😑

HairPick is different!
✅ Front + 4 random angles = Total 5 multi-angle images generated
✅ Perfect check from side profile 👤 diagonal 📐 back view 👥!
✅ 100+ trendy hairstyle library 💇‍♀️

💡 Highly Recommended For:
🎯 "I really don't want to fail this time!"
→ Check side volume and back lines thoroughly
🎯 "It's hard to explain exactly to my stylist"
→ Perfect communication with 360° result images!
🎯 "I have a profile photo/photoshoot coming up"
→ Preview your best look from every angle
🚀 Super Simple Usage (Just 1 Minute!)

1️⃣ One Selfie 📸
Take a front-facing photo in bright light (show your forehead and face outline clearly!)
2️⃣ Choose Your Style 💫
Select from 100+ options: short cuts, medium, long hair, layered, bangs, and more
3️⃣ Check 360° Results 🔄
Compare front + side + back + diagonal angles all at once!
4️⃣ Go to the Salon! ✂️
Save your favorite result → Show it to your stylist

📸 Pro Tips for Perfect Results!
💡 Lighting: Natural light or bright, even indoor lighting
💡 Angle: Camera at eye level, facing straight ahead
💡 Preparation: No hats❌ No sunglasses❌ Hair tucked behind ears⭕

🎁 Now's Your Chance!
"The era of deciding based on front view only is over!"
HairPick isn't just simple hair synthesis, it's a next-level AI hair simulator that predicts your actual appearance in 360°.

🔥 Limited free access for Hugging Face launch!
🔥 100+ latest trend styles!
🔥 ZERO failures with 360° perfect prediction!

✂️ Click before you cut! Take on the perfect hair transformation with HairPick! 🌟

#HairPick #AIHairSimulator #360HairPreview
  • 2 replies
·
Nymbo 
posted an update 22 days ago
view post
Post
964
Anyone using Jan-v1-4B for local MCP-based web search, I highly recommend you try out Intelligent-Internet/II-Search-4B

Very impressed with this lil guy and it deserves more downloads. It's based on the original version of Qwen3-4B but find that it questions reality way less often. Jan-v1 seems to think that everything it sees is synthetic data and constantly gaslights me
ginipick 
posted an update 26 days ago
view post
Post
471
🎨 AI Webtoon Creation Platform: Turn Your Ideas into Reality!

🌟 Two Powerful Tools, One Perfect Workflow
📖 Webtoon Generator
ginigen/AGI-WebToon-KOREA
"Transform Your Ideas into 40-Episode Masterpieces" ✨

Automated Story Planning 🎬
One-line idea → Complete 40-episode structure
Automatic cliffhangers for each episode
Customized storytelling for 9 different genres

Consistent Character Design 👥
Maintains consistent character appearance throughout
Memorable and distinctive character visuals
Automatic character generation system

Instant 30-Panel Storyboard 🎞️
Auto-placement of dialogue, narration, and sound effects
Cinematic shot composition (close-ups, wide shots, etc.)
Vertical scroll format optimized for webtoons

🖌️ Editing Studio
ginigen/webtoon-studio
"Professional Finishing Touch for Your Generated Webtoons" 🎯

Intuitive Drag & Drop ✏️
10 speech bubble styles (normal, thought, shout, whisper...)
12 Korean fonts for emotional expression
Real-time preview & editing

Professional-Grade Finishing 💎
Image sequence adjustment & spacing control
Individual panel refinement
Publication-ready final export

💡 Who Should Use This?
🏢 Corporate Marketing Teams
Product Launch Campaigns 📱: Turn complex features into engaging stories
Brand Storytelling 🎯: Make corporate messages approachable and shareable

👨‍🎨 Content Creators
Aspiring Artists 🌱: Create webtoons without drawing skills
Professional Writers ⚡: Transform scripts into visual narratives instantly

🚀 Why Use Both Tools Together?
Perfect 3-Step Workflow:
1️⃣ Generate → Input idea, get complete storyboard
2️⃣ Customize → Add branding, adjust dialogue, insert logos
3️⃣ Publish → Export and share across all platforms
📊 Key Benefits

95% faster than traditional production
80% cost reduction compared to agencies
10x better engagement with Gen MZ audience
Zero artistic skills required

🌈 Start Creating Today!
  • 2 replies
·
ginipick 
posted an update about 1 month ago
view post
Post
2464
🚀 FLUXllama gpt-oss: 4-bit Quantization + GPT-OSS-120B = Perfect AI Image Generation

🎯 One-Line Summary
"Maximum Images with Minimal Memory!" - The perfect fusion of 4-bit quantization and GPT-OSS-120B prompt enhancement

ginipick/FLUXllama

🧠 Core Innovation: Prompt Enhancement System
📝 What You Type:

"cat"

✨ What GPT-OSS-120B Transforms:

"Majestic tabby cat with emerald eyes in golden afternoon light, soft bokeh, cinematic lighting, 8K photorealistic"

💡 Result: Beginners create professional-grade images instantly!

⚡ The Magic of 4-bit Quantization
🔥 Before (Standard Model)

📦 Memory: 24GB VRAM required
⏱️ Loading: 45 seconds
💰 Cost: RTX 4090 essential ($2000+)

🎉 After (FLUXllama gpt-oss 4-bit)

📦 Memory: 6GB VRAM (75% reduction!)
⏱️ Loading: 12 seconds (73% faster!)
💰 Cost: RTX 3060 works great! ($400)

Same quality, 4x efficiency! 🎊

🔧 Simple Model Swapping
python# Switch to any LLM in 1 second!
pipe = pipeline("text-generation", model="your-model")
✅ GPT-OSS-120B (Premium quality)
✅ Phi-3 (Lightning fast)
✅ Custom models (Your unique style)

🏆 Why FLUXllama gpt-oss?
💪 Powerful

Hugging Face 'STAR AI 12' Selected (Dec 2024)
95% quality maintained with 75% memory savings

🤝 Easy

No prompt writing skills needed
GPT-OSS-120B enhances automatically

💸 Economical

Works on consumer GPUs
60% cloud cost reduction

🚀 Start Now
Just 3 Steps!

💭 Enter your idea
✨ Click "Enhance Prompt"
🎨 Click "Generate"

Result: Images that rival pro designers!

🎊 FLUXllama gpt-oss = Less Resources + Smart Prompts = Best Images
Experience the perfect synergy of 4-bit quantization and GPT-OSS-120B!


  • 2 replies
·
ginipick 
posted an update about 1 month ago
view post
Post
432
🚀 **Wan 2.2 TI2V Enhanced Released!**

🎬 **More Natural Motion, More Powerful Video Generation**
The original Wan 2.2 TI2V model has been upgraded to the **Enhanced version**!
Now bring your imagination to life with smoother and more natural movements.

ginigen/Wan-2.2-Enhanced
---

✨ **Core Upgrades**

🌊 **Natural Motion Built-in**
- Automatic "smooth and natural movement" applied to all videos
- Seamless fluid dynamic movements
- Natural motion transitions for objects

🎯 **Smart Prompt Templates**
Various styles with one click!
- 🎥 **Cinematic** - Movie-like camera movements
- 🎨 **Animation** - Vibrant animated style
- 🌿 **Nature** - Nature documentary style
- ⚡ **Action** - Dynamic action sequences
- 🔍 **Slow Motion** - Detailed slow motion

💾 **Enhanced Performance & Stability**
- 🧠 Smart GPU memory management for longer video generation
- ⚡ Optimized processing speed (up to 15% improvement)
- 🛡️ Enhanced error handling for stable generation

🎨 **Intuitive UI/UX**
- 📊 Real-time progress display
- 🖼️ Automatic resolution optimization on image upload
- 💡 Contextual help & tips
- ✅ Smart input validation system

---

🎯 **Recommended For**

**Content Creators** 👨‍🎨
*"I can apply the style I want instantly with prompt templates!"*

**Video Producers** 🎬
*"The quality has definitely improved with natural motion"*

**AI Artists** 🎨
*"Work is much easier now that long videos generate stably"*
---

🚀 **Get Started Now!**
  • 3 replies
·
AtAndDev 
posted an update about 2 months ago
view post
Post
464
Qwen 3 Coder is a personal attack to k2, and I love it.
It achieves near SOTA on LCB while not having reasoning.
Finally people are understanding that reasoning isnt necessary for high benches...

Qwen ftw!

DECENTRALIZE DECENTRALIZE DECENTRALIZE
ginipick 
posted an update about 2 months ago
view post
Post
598
🎨 Flux Styler - AI Art Style Transfer

📝 Project Overview
Flux Styler is a cutting-edge AI web application that transforms ordinary images into stunning artworks using FLUX.1-Kontext-dev model with 22 professional style LoRAs.

ginigen/Flux-Kontext-Style

✨ Key Features

🖼️ One-Click Style Selection: Simply click thumbnails to apply styles instantly
🎯 22 Premium Art Styles: From Ghibli to Van Gogh, Pixel Art to LEGO
⚡ High-Speed GPU Processing: Generate 1024x1024 images in 30-60 seconds
🎮 Intuitive Interface: No complex settings - just upload and transform!

🎨 Style Categories
🌸 Anime & Cartoon
Ghibli | American Cartoon | JoJo | Snoopy | Rick & Morty
🎪 3D & Geometric
3D Chibi | Low Poly | LEGO | Clay Toy
🖌️ Traditional Art
Chinese Ink | Oil Painting | Van Gogh | Picasso | Pop Art
🧵 Craft & Material
Fabric | Origami | Paper Cutting | Macaron
💻 Digital Art
Pixel Art | Line Art | Vector
🚀 How to Use

Upload your image (or use default) 📤
Click any style thumbnail 🖱️
(Optional) Add custom instructions ✏️
Hit "Transform Image" 🎨
Download your masterpiece! 💾

🛠️ Tech Stack

Model: FLUX.1-Kontext-dev by Black Forest Labs
Style LoRAs: Owen777/Kontext-Style-Loras
Framework: Gradio + Diffusers
Acceleration: CUDA GPU (12GB+ VRAM)

💡 Use Cases

📱 Social Media Content Creation
🖼️ NFT Art Generation
🎁 Personalized Gift Design
📚 Educational Visual Materials
🎯 Brand Marketing Assets
Nymbo 
posted an update 2 months ago
view post
Post
2831
Anyone know how to reset Claude web's MCP config? I connected mine when the HF MCP first released with just the default example spaces added. I added lots of other MCP spaces but Claude.ai doesn't update the available tools... "Disconnecting" the HF integration does nothing, deleting it and adding it again does nothing.

Refreshing tools works fine in VS Code because I can manually restart it in mcp.json, but claude.ai has no such option. Anyone got any ideas?
·
ginipick 
posted an update 2 months ago
view post
Post
2953
🎨 Flux-Kontext FaceLORA - AI Portrait Style Transfer

🌟 Introduction
Transform your photos into masterpieces! Flux-Kontext FaceLORA is an innovative AI-powered tool that converts portrait photos into various artistic styles using cutting-edge technology.

ginigen/Flux-Kontext-FaceLORA

✨ Key Features

📸 Easy to Use: Upload photo → Select style → Click Generate!
🎨 7 Art Styles: Famous painter styles including Van Gogh, Monet, Renoir
🤖 Face Preservation: AI maintains your facial features while transforming the style
⚡ Fast Generation: Get results in seconds with ZeroGPU support
🎯 Custom LoRA: Use any LoRA model from HuggingFace

🖼️ Available Styles

🏯 Studio Ghibli - Whimsical anime art style
🌊 Winslow Homer - American realist watercolor
🌻 Van Gogh - Post-impressionist with swirling brushstrokes
🍎 Paul Cézanne - Geometric post-impressionist structure
🌸 Renoir - Impressionist with soft luminous light
🪷 Claude Monet - Impressionist light and color
⚔️ Fantasy Art - Epic magical character portraits

🚀 How to Use
1️⃣ Upload your portrait photo
2️⃣ Select an art style from gallery
3️⃣ Add optional description
4️⃣ Click Generate ✨ button!
💡 Pro Tips

🎭 Front-facing photos work best
🎨 Adjust Style Strength for transformation intensity
🎲 Use Randomize seed for varied results
📝 Add descriptions for more detailed outputs

🛠️ Tech Stack

Model: FLUX.1-Kontext-dev by Black Forest Labs
LoRA: Community-created style adapters
Infrastructure: Hugging Face Spaces + ZeroGPU

🎉 Start Creating Now!
Create your unique AI portrait and share it on social media! #FluxKontextFaceLORA #AIArt #PortraitTransfer
multimodalart 
posted an update 3 months ago
view post
Post
13463
Self-Forcing - a real-time video distilled model from Wan 2.1 by @adobe is out, and they open sourced it 🐐

I've built a live real time demo on Spaces 📹💨

multimodalart/self-forcing
·
ginipick 
posted an update 3 months ago
view post
Post
3641
🎬 VEO3 Directors - All-in-One AI Video Creation Suite

🚀 What is VEO3 Directors?
VEO3 Directors is a revolutionary end-to-end AI video creation platform that transforms your ideas into cinematic reality. From story conception to final video with synchronized audio - all in one seamless workflow!

🔗 Try It Now
ginigen/VEO3-Directors
ginigen/VEO3-Free
ginigen/VEO3-Free-mirror

✨ Key Features
📝 Story Seed Generator

🎲 Instantly generate creative story ideas across multiple genres
🌏 Bilingual support (English/Korean)
🎭 Rich categories: Genre, Setting, Characters, and more

🎥 AI Script & Prompt Crafting

💬 Powered by Friendli API for Hollywood-quality prompts
🤖 AI Director writes detailed cinematography instructions
🎬 Professional elements: camera movements, lighting, VFX

🎬 Video + Audio Generation

🎨 Wan2.1-T2V-14B for stunning visual quality
⚡ NAG 4-step inference - 10x faster generation
🎵 MMAudio auto-generates matching soundscapes
🎛️ Full control over resolution, duration, and style
💬LLM(API): VIDraft/Gemma-3-R1984-27B

💡 How It Works

Generate Story → "The Time Traveler's Final Choice" 🕰️
Create Script → AI writes cinematic scene descriptions 📜
Produce Video → 4-8 second clip with synchronized audio 🎞️

🎯 What Makes It Special

Unified Workflow: From idea to video in one interface
Director-Level Prompts: Professional cinematography language
Lightning Fast: Minutes, not hours
Smart Audio: Context-aware sound generation

🏆 Use Cases

📱 Social Media Content
🎓 Educational Videos
📺 Marketing & Ads
🎮 Game Cutscene Prototyping
🎨 Digital Art Creation
  • 1 reply
·
ginipick 
posted an update 3 months ago
view post
Post
4787
🎨 FLUX VIDEO Generation - All-in-One AI Image/Video/Audio Generator

🚀 Introduction
FLUX VIDEO Generation is an all-in-one AI creative tool that generates images, videos, and audio from text prompts, powered by NVIDIA H100 GPU for lightning-fast processing!

ginigen/Flux-VIDEO

✨ Key Features
1️⃣ Text → Image → Video 🖼️➡️🎬

Generate high-quality images from Korean/English prompts
Transform still images into natural motion videos
Multiple size presets (Instagram, YouTube, Facebook, etc.)
Demo: 1-4 seconds / Full version: up to 60 seconds

2️⃣ Image Aspect Ratio Change 🎭

Freely adjust image aspect ratios
Expand images with outpainting technology
5 alignment options (Center, Left, Right, Top, Bottom)
Real-time preview functionality

3️⃣ Video + Audio Generation 🎵

Add AI-generated audio to videos
Korean prompt support (auto-translation)
Context-aware sound generation
Powered by MMAudio technology

🛠️ Tech Stack

Image Generation: FLUX, Stable Diffusion XL
Video Generation: TeaCache optimization
Audio Generation: MMAudio (44kHz high-quality)
Outpainting: ControlNet Union
Infrastructure: NVIDIA H100 GPU for ultra-fast generation

💡 How to Use

Select your desired tab
Enter your prompt (Korean/English supported!)
Adjust settings
Click generate button

🎯 Use Cases

📱 Social media content creation
🎥 YouTube Shorts/Reels
📊 Presentation materials
🎨 Creative artwork
🎵 Background sound generation
  • 1 reply
·
ginipick 
posted an update 3 months ago
view post
Post
3981
🎨 AI Hairstyle Changer - Transform with 93 Styles! 💇‍♀️✨

🚀 Introduction
Experience 93 different hairstyles and 29 hair colors in real-time with your uploaded photo!
Transform your look instantly with this AI-powered Gradio web app.


✨ Key Features

📸 Simple 3 Steps
Upload Photo - Upload a front-facing photo
Select Style - Choose from 93 hairstyles
Pick Color - Click your desired color from 29 color palette options


💫 Diverse Hairstyles (93 types)

🎯 Short Cuts: Pixie Cut, Bob, Lob, Crew Cut, Undercut
🌊 Waves: Soft Waves, Hollywood Waves, Finger Waves
🎀 Braids: French Braid, Box Braids, Fishtail Braid, Cornrows
👑 Updos: Chignon, Messy Bun, Top Knot, French Twist
🌈 Special Styles: Space Buns, Dreadlocks, Mohawk, Beehive

🎨 Hair Color Palette (29 colors)

🤎 Natural Colors: Black, Browns, Blonde variations
❤️ Red Tones: Red, Auburn, Copper, Burgundy
💜 Fashion Colors: Blue, Purple, Pink, Green, Rose Gold
⚪ Cool Tones: Silver, Ash Blonde, Titanium

🌟 Key Advantages

⚡ Fast Processing: Get results in just 10-30 seconds
🎯 High Accuracy: Natural-looking transformations with AI technology
💎 Professional Quality: High-resolution output suitable for social media
🔄 Unlimited Trials: Try as many combinations as you want
📱 User-Friendly: Intuitive interface with visual color palette


💡 Perfect For

💈 Salon Consultations: Show clients potential new looks before cutting
🛍️ Personal Styling: Experiment before making a big change
🎭 Entertainment: Fun transformations for social media content
🎬 Creative Projects: Character design and visualization
👗 Fashion Industry: Match hairstyles with outfits and makeup
📸 Photography: Pre-visualization for photoshoots

LINK: ginipick/Change-Hair
·
AtAndDev 
posted an update 3 months ago
view post
Post
3001
deepseek-ai/DeepSeek-R1-0528

This is the end
  • 1 reply
·