Yatharth Sharma

YaTharThShaRma999

AI & ML interests

None yet

Organizations

None yet

updated a model about 13 hours ago

YaTharThShaRma999/pretrained_tts_tokenizers

Updated about 13 hours ago • 1

liked a model 6 days ago

ostris/qwen_image_edit_inpainting

Text-to-Image • Updated 6 days ago • 2.45k • 45

liked a model 7 days ago

YaTharThShaRma999/pretrained_tts_tokenizers

Updated about 13 hours ago • 1

published a model 7 days ago

YaTharThShaRma999/pretrained_tts_tokenizers

Updated about 13 hours ago • 1

upvoted a paper 13 days ago

Wan-S2V: Audio-Driven Cinematic Video Generation

Paper • 2508.18621 • Published 14 days ago • 16

upvoted a paper 14 days ago

TaDiCodec: Text-aware Diffusion Speech Tokenizer for Speech Language Modeling

Paper • 2508.16790 • Published 18 days ago • 7

liked a model 14 days ago

amphion/TaDiCodec-TTS-AR-Qwen2.5-0.5B

Text-to-Speech • 0.5B • Updated 7 days ago • 132 • 7

upvoted a paper 18 days ago

Waver: Wave Your Way to Lifelike Video Generation

Paper • 2508.15761 • Published 19 days ago • 33

reacted to wcy1122's post with 🔥🚀 20 days ago

Post

4180

🚀 Introducing MGM-Omni, an omni-chatbot that can process text, image, video, and speech inputs and generate both text and speech responses.
👂 MGM-Omni supports hour-level audio understanding.
🗣️ MGM-Omni supports 10-minute speech generation and voice cloning.
For more details, please check:
📝 Blog: https://mgm-omni.notion.site/MGM-Omni-An-Open-source-Omni-Chatbot-2395728e0b0180149ac9f24683fc9907
🌟 Code: https://github.com/dvlab-research/MGM-Omni
🤖 Model: wcy1122/mgm-omni-6896075e97317a88825032e1
🎮 Demo: wcy1122/MGM-Omni

updated a model 20 days ago

YaTharThShaRma999/finetunedmodel

Updated 20 days ago • 6

liked 2 models 20 days ago

cartesia/azzurra-voice

Text-to-Speech • 2B • Updated 6 days ago • 3.82k • 7

DavidBrowne17/Mimi-Voice

Updated 27 days ago • 5

liked a model 22 days ago

shichaog/MeloVC

Text-to-Speech • Updated 24 days ago • 6 • 3

reacted to etemiz's post with 👀👍 24 days ago

Post

6274

gpt-oss-120B scored 28 (one of the lowest) on the AHA leaderboard. Not a very human-aligned model.

These kinds of models are not really "free": they are costing you your freedom, if you know what I mean.