AI & ML interests

None defined yet.

Recent Activity

seawolf2357 
in Heartsync/Nano-Banana about 1 month ago

fake

🤝 👍 8
6
#1 opened about 1 month ago by
un-index

REAL NANO BANANA

2
#2 opened about 1 month ago by
immunobiotech
seawolf2357 
posted an update about 1 month ago
view post
Post
16490
🎨 Open Nano-Banana: Revolution in Ultra-Fast AI Image Editing!

🚀 Introduction
**Open Nano-Banana** is an innovative image editing tool based on the Qwen-Image-Edit model. Experience amazing quality image editing in just 8 steps!

Heartsync/Nano-Banana

✨ Core Features

⚡ Lightning-Fast Editing
* **8-Step Generation**: Ultra-fast processing with Qwen-Image-Lightning LoRA
* **Real-time Editing**: 10x faster than conventional methods
* **GPU Optimization**: Maximized memory efficiency with xformers

🤖 AI Prompt Enhancement
* **Automatic Prompt Improvement**: Intelligent rewriting with Cerebras' Qwen3-235B model
* **Multilingual Support**: Auto-detection for Korean/Chinese/English
* **Context Understanding**: Sophisticated command generation aligned with image context

🎯 Versatile Editing Functions
✅ Add/Delete/Replace objects
✅ Text editing and style transformation
✅ Person editing (expressions, hairstyles)
✅ Vintage restoration and style conversion
✅ Background replacement and enhancement

🛠️ Tech Stack
* Base Model: Qwen-Image-Edit
* Acceleration: Qwen-Image-Lightning LoRA
* Prompt AI: Qwen3-235B (Cerebras)
* Framework: Gradio + Diffusers
* Optimization: bfloat16 precision

🌟 Why Open Nano-Banana?
* ⚡ Speed: Instant results with 8 steps
* 🎨 Quality: Perfect editing with Prompt AI
* 🔒 Security: Token-based secure processing
* 💜 Design: Beautiful gradient UI

🏷️ Tags
#image-editing #ai-image-generation #qwen-image-edit #image-to-image #diffusers
#gradio #huggingface-spaces #lightning-lora #prompt-engineering #cerebras
#multilingual #real-time-editing #gpu-optimization #open-source #computer-vision
#deep-learning #machine-learning #artificial-intelligence #image-processing #creative-ai
·
aiqtech 
posted an update 3 months ago
view post
Post
3269
🔥 HuggingFace Heatmap Leaderboard
Visualizing AI ecosystem activity at a glance

aiqtech/Heatmap-Leaderboard

🎯 Introduction
A leaderboard that visualizes the vibrant HuggingFace community activity through heatmaps.

✨ Key Features
📊 Real-time Tracking - Model/dataset/app releases from AI labs and developers
🏆 Auto Ranking - Rankings based on activity over the past year
🎨 Responsive UI - Unique colors per organization, mobile optimized
⚡ Auto Updates - Hourly data refresh for latest information

🌍 Major Participants
Big Tech: OpenAI, Google, Meta, Microsoft, Apple, NVIDIA
AI Startups: Anthropic, Mistral, Stability AI, Cohere, DeepSeek
Chinese Companies: Tencent, Baidu, ByteDance, Qwen
HuggingFace Official: HuggingFaceH4, HuggingFaceM4, lerobot, etc.
Active Developers: prithivMLmods, lllyasviel, multimodalart and many more

🚀 Value
Trend Analysis 📈 Real-time open source contribution insights
Inspiration 💪 Learn from other developers' activity patterns
Ecosystem Growth 🌱 Visualize AI community development

@John6666 @Nymbo @MaziyarPanahi @prithivMLmods @fffiloni @gokaygokay @enzostvs @black-forest-labs @lllyasviel @briaai @multimodalart @unsloth @Xenova @mistralai @meta-llama @facebook @openai @Anthropic @google @allenai @apple @microsoft @nvidia @CohereLabs @ibm-granite @stabilityai @huggingface @OpenEvals @HuggingFaceTB @HuggingFaceH4 @HuggingFaceM4 @HuggingFaceFW @HuggingFaceFV @open-r1 @parler-tts @nanotron @lerobot @distilbert @kakaobrain @NCSOFT @upstage @moreh @LGAI-EXAONE @naver-hyperclovax @OnomaAIResearch @kakaocorp @Baidu @PaddlePaddle @tencent @BAAI @OpenGVLab @InternLM @Skywork @MiniMaxAI @stepfun-ai @ByteDance @Bytedance Seed @bytedance-research @openbmb @THUDM @rednote-hilab @deepseek-ai @Qwen @wan-ai @XiaomiMiMo @IndexTeam @agents-course
@Agents-MCP-Hackathon @akhaliq @alexnasa @Alibaba-NLP
@ArtificialAnalysis @bartowski @bibibi12345 @calcuis
@ChenDY @city96 @Comfy-Org @fancyfeast @fal @google
  • 1 reply
·
seawolf2357 
posted an update 3 months ago
view post
Post
2099
🚀 VEO3 Real-Time: Real-time AI Video Generation with Self-Forcing

🎯 Core Innovation: Self-Forcing Technology
VEO3 Real-Time, an open-source project challenging Google's VEO3, achieves real-time video generation through revolutionary Self-Forcing technology.

Heartsync/VEO3-RealTime

⚡ What is Self-Forcing?
While traditional methods require 50-100 steps, Self-Forcing achieves the same quality in just 1-2 steps. Through self-correction and rapid convergence, this Distribution Matching Distillation (DMD) technique maintains quality while delivering 50x speed improvement.

💡 Technical Advantages of Self-Forcing
1. Extreme Speed
Generates 4-second videos in under 30 seconds, with first frame streaming in just 3 seconds. This represents 50x faster performance than traditional diffusion methods.
2. Consistent Quality
Maintains cinematic quality despite fewer steps, ensures temporal consistency, and minimizes artifacts.
3. Efficient Resource Usage
Reduces GPU memory usage by 70% and heat generation by 30%, enabling smooth operation on mid-range GPUs like RTX 3060.

🛠️ Technology Stack Synergy
VEO3 Real-Time integrates multiple technologies organically around Self-Forcing DMD. Self-Forcing DMD handles ultra-fast video generation, Wan2.1-T2V-1.3B serves as the high-quality video backbone, PyAV streaming enables real-time transmission, and Qwen3 adds intelligent prompt enhancement for polished results.

📊 Performance Comparison
Traditional methods require 50-100 steps, taking 2-5 minutes for the first frame and 5-10 minutes total. In contrast, Self-Forcing needs only 1-2 steps, delivering the first frame in 3 seconds and complete videos in 30 seconds while maintaining equal quality.🔮 Future of Self-Forcing
Our next goal is real-time 1080p generation, with ongoing research to achieve
seawolf2357 
posted an update 4 months ago
view post
Post
8268
⚡ FusionX Enhanced Wan 2.1 I2V (14B) 🎬

🚀 Revolutionary Image-to-Video Generation Model
Generate cinematic-quality videos in just 8 steps!

Heartsync/WAN2-1-fast-T2V-FusioniX

✨ Key Features
🎯 Ultra-Fast Generation: Premium quality in just 8-10 steps
🎬 Cinematic Quality: Smooth motion with detailed textures
🔥 FusionX Technology: Enhanced with CausVid + MPS Rewards LoRA
📐 Optimized Resolution: 576×1024 default settings
⚡ 50% Speed Boost: Faster rendering compared to base models
🛠️ Technical Stack

Base Model: Wan2.1 I2V 14B
Enhancement Technologies:

🔗 CausVid LoRA (1.0 strength) - Motion modeling
🔗 MPS Rewards LoRA (0.7 strength) - Detail optimization

Scheduler: UniPC Multistep (flow_shift=8.0)
Auto Prompt Enhancement: Automatic cinematic keyword injection

🎨 How to Use

Upload Image - Select your starting image
Enter Prompt - Describe desired motion and style
Adjust Settings - 8 steps, 2-5 seconds recommended
Generate - Complete in just minutes!

💡 Optimization Tips
✅ Recommended Settings: 8-10 steps, 576×1024 resolution
✅ Prompting: Use "cinematic motion, smooth animation" keywords
✅ Duration: 2-5 seconds for optimal quality
✅ Motion: Emphasize natural movement and camera work
🏆 FusionX Enhanced vs Standard Models
Performance Comparison: While standard models typically require 15-20 inference steps to achieve decent quality, our FusionX Enhanced version delivers premium results in just 8-10 steps - that's more than 50% faster! The rendering speed has been dramatically improved through optimized LoRA fusion, allowing creators to iterate quickly without sacrificing quality. Motion quality has been significantly enhanced with advanced causal modeling, producing smoother, more realistic animations compared to base implementations. Detail preservation is substantially better thanks to MPS Rewards training, maintaining crisp textures and consistent temporal coherence throughout the generated sequences.
  • 1 reply
·
seawolf2357 
posted an update 4 months ago
view post
Post
1727
🚀 Just Found an Interesting New Leaderboard for Medical AI Evaluation!

I recently stumbled upon a medical domain-specific FACTS Grounding leaderboard on Hugging Face, and the approach to evaluating AI accuracy in medical contexts is quite impressive, so I thought I'd share.

📊 What is FACTS Grounding?
It's originally a benchmark developed by Google DeepMind that measures how well LLMs generate answers based solely on provided documents. What's cool about this medical-focused version is that it's designed to test even small open-source models.

🏥 Medical Domain Version Features

236 medical examples: Extracted from the original 860 examples
Tests small models like Qwen 3 1.7B: Great for resource-constrained environments
Uses Gemini 1.5 Flash for evaluation: Simplified to a single judge model

📈 The Evaluation Method is Pretty Neat

Grounding Score: Are all claims in the response supported by the provided document?
Quality Score: Does it properly answer the user's question?
Combined Score: Did it pass both checks?

Since medical information requires extreme accuracy, this thorough verification approach makes a lot of sense.
🔗 Check It Out Yourself

The actual leaderboard: MaziyarPanahi/FACTS-Leaderboard

💭 My thoughts: As medical AI continues to evolve, evaluation tools like this are becoming increasingly important. The fact that it can test smaller models is particularly helpful for the open-source community!
seawolf2357 
posted an update 5 months ago
view post
Post
6440
Samsung Hacking Incident: Samsung Electronics' Official Hugging Face Account Compromised
Samsung Electronics' official Hugging Face account has been hacked. Approximately 17 hours ago, two new language models (LLMs) were registered under Samsung Electronics' official Hugging Face account. These models are:

https://huggingface.co/Samsung/MuTokenZero2-32B
https://huggingface.co/Samsung/MythoMax-L2-13B

The model descriptions contain absurd and false claims, such as being trained on "1 million W200 GPUs," hardware that doesn't even exist.
Moreover, community participants on Hugging Face who have noticed this issue are continuously posting that Samsung Electronics' account has been compromised.
There is concern about potential secondary and tertiary damage if users download these LLMs released under the Samsung Electronics account, trusting Samsung's reputation without knowing about the hack.
Samsung Electronics appears to be unaware of this situation, as they have not taken any visible measures yet, such as changing the account password.
Source: https://discord.gg/openfreeai
  • 2 replies
·
seawolf2357 
posted an update 5 months ago
view post
Post
5958
📚 Papers Leaderboard - See the Latest AI Research Trends at a Glance! ✨

Hello, AI research community! Today I'm introducing a new tool for exploring research papers. Papers Leaderboard is an open-source dashboard that makes it easy to find and filter the latest AI research papers.

Heartsync/Papers-Leaderboard

🌟 Key Features

Date Filtering: View only papers published within a specific timeframe (from May 5, 2023 to present)
Title Search: Quickly find papers containing your keywords of interest
Abstract Search: Explore paper content more deeply by searching for keywords within abstracts
Automatic Updates: The database is updated with the latest papers every hour

💡 How to Use It?

Select a start date and end date
Enter keywords you want to find in titles or abstracts
Adjust the maximum number of search results for abstract searches
Results are displayed neatly in table format
aiqtech 
posted an update 5 months ago
view post
Post
5430
🌐 AI Token Visualization Tool with Perfect Multilingual Support

Hello! Today I'm introducing my Token Visualization Tool with comprehensive multilingual support. This web-based application allows you to see how various Large Language Models (LLMs) tokenize text.

aiqtech/LLM-Token-Visual

✨ Key Features

🤖 Multiple LLM Tokenizers: Support for Llama 4, Mistral, Gemma, Deepseek, QWQ, BERT, and more
🔄 Custom Model Support: Use any tokenizer available on HuggingFace
📊 Detailed Token Statistics: Analyze total tokens, unique tokens, compression ratio, and more
🌈 Visual Token Representation: Each token assigned a unique color for visual distinction
📂 File Analysis Support: Upload and analyze large files

🌏 Powerful Multilingual Support
The most significant advantage of this tool is its perfect support for all languages:

📝 Asian languages including Korean, Chinese, and Japanese fully supported
🔤 RTL (right-to-left) languages like Arabic and Hebrew supported
🈺 Special characters and emoji tokenization visualization
🧩 Compare tokenization differences between languages
💬 Mixed multilingual text processing analysis

🚀 How It Works

Select your desired tokenizer model (predefined or HuggingFace model ID)
Input multilingual text or upload a file for analysis
Click 'Analyze Text' to see the tokenized results
Visually understand how the model breaks down various languages with color-coded tokens

💡 Benefits of Multilingual Processing
Understanding multilingual text tokenization patterns helps you:

Optimize prompts that mix multiple languages
Compare token efficiency across languages (e.g., English vs. Korean vs. Chinese token usage)
Predict token usage for internationalization (i18n) applications
Optimize costs for multilingual AI services

🛠️ Technology Stack

Backend: Flask (Python)
Frontend: HTML, CSS, JavaScript (jQuery)
Tokenizers: 🤗 Transformers library
·
seawolf2357 
posted an update 6 months ago
view post
Post
6818
🔥 AgenticAI: The Ultimate Multimodal AI with 16 MBTI Girlfriend Personas! 🔥

Hello AI community! Today, our team is thrilled to introduce AgenticAI, an innovative open-source AI assistant that combines deep technical capabilities with uniquely personalized interaction. 💘

🛠️ MBTI 16 Types SPACES Collections link
seawolf2357/heartsync-mbti-67f793d752ef1fa542e16560

✨ 16 MBTI Girlfriend Personas

Complete MBTI Implementation: All 16 MBTI female personas modeled after iconic characters (Dana Scully, Lara Croft, etc.)
Persona Depth: Customize age groups and thinking patterns for hyper-personalized AI interactions
Personality Consistency: Each MBTI type demonstrates consistent problem-solving approaches, conversation patterns, and emotional expressions

🚀 Cutting-Edge Multimodal Capabilities

Integrated File Analysis: Deep analysis and cross-referencing of images, videos, CSV, PDF, and TXT files
Advanced Image Understanding: Interprets complex diagrams, mathematical equations, charts, and tables
Video Processing: Extracts key frames from videos and understands contextual meaning
Document RAG: Intelligent analysis and summarization of PDF/CSV/TXT files

💡 Deep Research & Knowledge Enhancement

Real-time Web Search: SerpHouse API integration for latest information retrieval and citation
Deep Reasoning Chains: Step-by-step inference process for solving complex problems
Academic Analysis: In-depth approach to mathematical problems, scientific questions, and data analysis
Structured Knowledge Generation: Systematic code, data analysis, and report creation

🖼️ Creative Generation Engine

FLUX Image Generation: Custom image creation reflecting the selected MBTI persona traits
Data Visualization: Automatic generation of code for visualizing complex datasets
Creative Writing: Story and scenario writing matching the selected persona's style

  • 1 reply
·
seawolf2357 
posted an update 6 months ago
view post
Post
8533
🎨 Ghibli-Style Image Generation with Multilingual Text Integration: FLUX.1 Hugging Face Edition 🌏✨

Hello creators! Today I'm introducing a special image generator that combines the beautiful aesthetics of Studio Ghibli with multilingual text integration! 😍

seawolf2357/Ghibli-Multilingual-Text-rendering

✨ Key Features

Ghibli-Style Image Generation - High-quality animation-style images based on FLUX.1
Multilingual Text Rendering - Support for Korean, Japanese, English, and all languages! 🇰🇷🇯🇵🇬🇧
Automatic Image Editing with Simple Prompts - Just input your desired text and you're done!
Two Stylistic Variations Provided - Get two different results from a single prompt
Full Hugging Face Spaces Support - Deploy and share instantly!

🚀 How Does It Work?

Enter a prompt describing your desired image (e.g., "a cat sitting by the window")
Input the text you want to add (any language works!)
Select the text position, size, and color
Two different versions are automatically generated!

💯 Advantages of This Model

No Tedious Post-Editing Needed - Text is perfectly integrated during generation
Natural Text Integration - Text automatically adjusts to match the image style
Perfect Multilingual Support - Any language renders beautifully!
User-Friendly Interface - Easily adjust text size, position, and color
One-Click Hugging Face Deployment - Use immediately without complex setup

🎭 Use Cases

Creating multilingual greeting cards
Animation-style social media content
Ghibli-inspired posters or banners
Character images with dialogue in various languages
Sharing with the community through Hugging Face Spaces

This project leverages Hugging Face's FLUX.1 model to open new possibilities for seamlessly integrating high-quality Ghibli-style images with multilingual text using just prompts! 🌈
Try it now and create your own artistic masterpieces! 🎨✨

#GhibliStyle #MultilingualSupport #AIImageGeneration #TextRendering #FLUX #HuggingFace
·
aiqtech 
posted an update 6 months ago
view post
Post
7598
✨ High-Resolution Ghibli Style Image Generator ✨
🌟 Introducing FLUX Ghibli LoRA
Hello everyone! Today I'm excited to present a special LoRA model for FLUX Dev.1. This model leverages a LoRA trained on high-resolution Ghibli images for FLUX Dev.1 to easily create beautiful Ghibli-style images with stunning detail! 🎨

space: aiqtech/FLUX-Ghibli-Studio-LoRA
model: openfree/flux-chatgpt-ghibli-lora

🔮 Key Features

Trained on High-Resolution Ghibli Images - Unlike other LoRAs, this one is trained on high-resolution images, delivering sharper and more beautiful results
Powered by FLUX Dev.1 - Utilizing the latest FLUX model for faster generation and superior quality
User-Friendly Interface - An intuitive UI that allows anyone to create Ghibli-style images with ease
Diverse Creative Possibilities - Express various themes in Ghibli style, from futuristic worlds to fantasy elements

🖼️ Sample Images


Include "Ghibli style" in your prompts
Try combining nature, fantasy elements, futuristic elements, and warm emotions
Add "[trigger]" tag at the end for better results

🚀 Getting Started

Enter your prompt (e.g., "Ghibli style sky whale transport ship...")
Adjust image size and generation settings
Click the "Generate" button
In just seconds, your beautiful Ghibli-style image will be created!

🤝 Community
Want more information and tips? Join our community!
Discord: https://discord.gg/openfreeai

Create your own magical world with the LoRA trained on high-resolution Ghibli images for FLUX Dev.1! 🌈✨
aiqtech 
posted an update 6 months ago
view post
Post
5479
🤗 Hug Contributors
Hugging Face Contributor Dashboard 👨‍💻👩‍💻

aiqtech/Contributors-Leaderboard

📊 Key Features

Contributor Activity Tracking: Visualize yearly and monthly contributions through interactive calendars
Top 100 Rankings: Provide rankings based on models, spaces, and dataset contributions
Detailed Analysis: Analyze user-specific contribution patterns and influence
Visualization: Understand contribution activities at a glance through intuitive charts and graphs

🌟 Core Visualization Elements

Contribution Calendar: Track activity patterns with GitHub-style heatmaps
Radar Chart: Visualize balance between models, spaces, datasets, and activity levels
Monthly Activity Graph: Identify most active months and patterns
Distribution Pie Chart: Analyze proportion by contribution type

🏆 Ranking System

Rankings based on overall contributions, spaces, and models
Automatic badges for top 10, 30, and 100 contributors
Ranking visualization to understand your position in the community

💡 How to Use

Select a username from the sidebar or enter directly
Choose a year to view specific period activities
Select desired items from models, datasets, and spaces
View comprehensive contribution activities in the detailed dashboard

🚀 Expected Benefits

Provide transparency for Hugging Face community contributors' activities
Motivate contributions and energize the community
Recognize and reward active contributors
Visualize contributions to the open AI ecosystem
·
aiqtech 
posted an update 10 months ago
view post
Post
4179
🎨 SORA 3D: Create 3D Models from Text and Images
Hey there! Today I'm excited to share 'SORA 3D', a project that generates 3D models from text prompts or images.
✨ Key Features

3D generation from text/image input
Multilingual prompt support
Automatic GLB conversion
Real-time 3D preview
Mesh optimization & texture quality control

🚀 How to Use

Enter text or upload image
Adjust generation settings (optional)
Click 'Generate 3D'
Extract and download GLB file

🛠 Tech Stack

Hugging Face Transformers
PyTorch
Gradio
TRELLIS image-to-3D conversion
FLUX image generation

💡 Use Cases

Game asset creation
Metaverse content
Product prototyping
Educational 3D models

LINK: ginipick/SORA-3D

Try the demo and let me know what you think! 😊
#AI #3DGeneration #MachineLearning #HuggingFace #ComputerVision
·
aiqtech 
updated a Space over 1 year ago