My Projects - a mrfakename Collection

My Projects

updated May 17

Projects I've worked on

Running on CPU Upgrade

854

TTS Arena V2

🏆

Vote on the latest TTS models!
Running on Zero

2.59k

F5-TTS

🗣

F5-TTS & E2-TTS: Zero-Shot Voice Cloning (Unofficial Demo)

Note Unofficial demo for E2/F5-TTS, which supports zero-shot voice cloning. Not affiliated with the authors of F5-TTS.
mrfakename/OpenF5-TTS-Base

Text-to-Speech • Updated May 17 • 300 • 70

Note Apache 2.0 retrain of F5-TTS
Running on Zero

414

OpenDalle V1.1 GPU Demo

🖼

A demo of OpenDalle V1.1 on a ZERO GPU.

Note I did not create the model, just the demo.
Running on Zero

35

Moonshine ASR

🌒

Fast & efficient ASR outperforming Whisper!

Note Unofficial demo for Moonshine ASR, a fast and efficient ASR model by Useful Sensors. Moonshine ASR: https://github.com/usefulsensors/moonshine
Running

10

Did StyleTTS 2 Generate It?

🤔

Did StyleTTS 2 generate that audio?!?

Note A Whisper-based audio classification model that detects audio generated by StyleTTS 2
Running on T4

469

MeloTTS

🗣

Fast, efficient, & multilingual text-to-speech

Note Demo for MeloTTS: Multilingual, multispeaker text-to-speech licensed under the MIT license
Runtime error

74

RWKV Music

🎵

Generate MIDI music using RWKV v4!

Note My newest project, a demo of RWKV 4 Music (the MIDI model).
Running on L4

691

StyleTTS 2

🗣

Efficient, fast, and natural text to speech with StyleTTS 2!

Note My most successful project: an online demo for StyleTTS 2. It was featured in HF Spaces of the Week and was the most popular Space that week. I did not create StyleTTS 2, just the demo.
Paused

35

OpenDalle GPU Demo

🖼
mrfakename/NeuralOrca-7B-v1

Text Generation • 7B • Updated Mar 4, 2024 • 643 • 5

Note A frankenmerge of NeuralHermes 2.5 and OpenOrca
styletts2-community/multilingual-phonemes-10k-alpha

Viewer • Updated Mar 5, 2024 • 259k • 348 • 35

Note A multilingual dataset of text-phoneme pairs supporting 15 languages.
Running

31

seewav-gui

🔊

Generate a video from audio with customizable waveform visualization