fahrizalfarid
akahana
AI & ML interests
NLP
Recent Activity
reacted
to
seawolf2357's
post
with π₯
about 4 hours ago
β‘ FusionX Enhanced Wan 2.1 I2V (14B) π¬
π Revolutionary Image-to-Video Generation Model
Generate cinematic-quality videos in just 8 steps!
https://huggingface.co/spaces/Heartsync/WAN2-1-fast-T2V-FusioniX
β¨ Key Features
π― Ultra-Fast Generation: Premium quality in just 8-10 steps
π¬ Cinematic Quality: Smooth motion with detailed textures
π₯ FusionX Technology: Enhanced with CausVid + MPS Rewards LoRA
π Optimized Resolution: 576Γ1024 default settings
β‘ 50% Speed Boost: Faster rendering compared to base models
π οΈ Technical Stack
Base Model: Wan2.1 I2V 14B
Enhancement Technologies:
π CausVid LoRA (1.0 strength) - Motion modeling
π MPS Rewards LoRA (0.7 strength) - Detail optimization
Scheduler: UniPC Multistep (flow_shift=8.0)
Auto Prompt Enhancement: Automatic cinematic keyword injection
π¨ How to Use
Upload Image - Select your starting image
Enter Prompt - Describe desired motion and style
Adjust Settings - 8 steps, 2-5 seconds recommended
Generate - Complete in just minutes!
π‘ Optimization Tips
β
Recommended Settings: 8-10 steps, 576Γ1024 resolution
β
Prompting: Use "cinematic motion, smooth animation" keywords
β
Duration: 2-5 seconds for optimal quality
β
Motion: Emphasize natural movement and camera work
π FusionX Enhanced vs Standard Models
Performance Comparison: While standard models typically require 15-20 inference steps to achieve decent quality, our FusionX Enhanced version delivers premium results in just 8-10 steps - that's more than 50% faster! The rendering speed has been dramatically improved through optimized LoRA fusion, allowing creators to iterate quickly without sacrificing quality. Motion quality has been significantly enhanced with advanced causal modeling, producing smoother, more realistic animations compared to base implementations. Detail preservation is substantially better thanks to MPS Rewards training, maintaining crisp textures and consistent temporal coherence throughout the generated sequences.
reacted
to
DualityAI-RebekahBogdanoff's
post
with π
about 4 hours ago
Can AI models trained solely on 100% synthetic data achieve top-tier accuracy in real-world object detection?
π Sergio Sanz, PhD just proved it while winning Duality AIβs Synthetic-to-Real Object Detection Challenge using Falcon-generated imagery. His model achieved perfect real-world detection accuracy without a single real image in the training loop.
In this blog, Dr. Sanz walks us through his method, which includes the design and training of an advanced pipeline to achieve 100% detection accuracy.
His full technical breakdown covers:
π Synthetic-only training
π Data augmentation with an ensemble learning approach for better generalization
π Custom occlusion generation
π A Faster R-CNN model fine-tuned with Falcon generated data
π And much more!
The results speak for themselves!
π Read the blog here: https://www.duality.ai/blog/leveraging-synthetic-data-for-real-world-object-detection
Congratulations Sergio! We can't wait to see what you do next.
π Ready to take on the next Synthetic-to-Real challenge? The third edition of our Kaggle competitionβMulti-Instance Object Detection Challengeβis now live: https://www.kaggle.com/competitions/multi-instance-object-detection-challenge
updated
a dataset
11 days ago
akahana/anti-spoofing-nuaaaa
Organizations
None yet
Collections
2
models
32

akahana/llm-models
Updated
β’
22

akahana/translation
Updated

akahana/dokter-chat-v0.1
Question Answering
β’
Updated

akahana/whisper-small-id
Automatic Speech Recognition
β’
Updated
β’
11

akahana/wikipedia-gpt2
Updated
β’
10

akahana/tebak-gambar-mobilevit
Updated
β’
5

akahana/mnist-mobilevit
Updated
β’
4

akahana/tinybert-javanese
Fill-Mask
β’
Updated
β’
10

akahana/minibert-indonesia
Updated
β’
17

akahana/smallbert-javanese
Fill-Mask
β’
Updated
β’
8
datasets
42
akahana/anti-spoofing-nuaaaa
Viewer
β’
Updated
β’
8.6k
β’
126
akahana/anti-spoofing-casiafasd
Viewer
β’
Updated
β’
4.06k
β’
120
akahana/hifi-gan
Updated
β’
59
akahana/Driver-Drowsiness-Dataset
Viewer
β’
Updated
β’
41.8k
β’
100
akahana/mpii-face-gaze
Updated
β’
14
akahana/common-voice-11-eng-sample
Updated
β’
117
akahana/children-codes-stories
Updated
β’
9
akahana/vlm
Updated
β’
22
akahana/medical
Updated
β’
68
akahana/llm-opus-ParaCrawl-english-id-v2
Updated
β’
10