👉 fffiloni/image-2-music-v3 | Feel free to test it and share feedback.
Just wiring together: merve/moondream3 * victor/ace-step-jam
Image → prompt → audio | Early version, will evolve | Follow: @fffiloni
Deeply interrogate audio file content
Long video understanding with smart attention
Extraction & Reconstruction for Efficient Speech Separation
Detect and split video scenes into separate clips
Generate high‑quality images from text prompts
Apache Licensed Advanced Video Generation Model
Animation Sketches sequence Colorization
Aesthetically Controllable Text-Driven Stylization w/o Train
Deeply interrogate audio file content
Long video understanding with smart attention
Extraction & Reconstruction for Efficient Speech Separation
Detect and split video scenes into separate clips
Generate high‑quality images from text prompts
Apache Licensed Advanced Video Generation Model
Animation Sketches sequence Colorization
Aesthetically Controllable Text-Driven Stylization w/o Train
Easily expand image boundaries
Speech generation from text and acoustic reference
Get a music sample inspired by the mood of an image
Long video understanding with smart attention
Extraction & Reconstruction for Efficient Speech Separation
Every image has a soundtrack