DualTurn: Learning Turn-Taking from Dual-Channel Generative Speech Pretraining
Shangeth Rajaa
shangeth
AI & ML interests
Speech Representation Learning, Multi-Modal LLM, Spoken Dialogue Systems, Speech Synthesis
Recent Activity
updated a model 15 days ago
shangeth/Wren-ASR-0.5B-multi updated a Space 18 days ago
shangeth/Wren-ASR-0.5B-multi-demo published a Space 18 days ago
shangeth/Wren-ASR-0.5B-multi-demoOrganizations
Wren
Wren: A Family of Small Open-Weight Models for Unified Speech-Text Modelling
-
shangeth/Wren-TTS-0.5B-multi-expressive
Text-to-Speech • 0.5B • Updated • 142 - SleepingAgents1
Wren-TTS-0.5B-multi-expressive
🎭1Expressive multilingual voice-cloning TTS — 23 style tags
-
shangeth/Wren-TTS-0.5B-multi
Text-to-Speech • 0.5B • Updated • 145 - SleepingAgents
Wren-TTS-0.5B-multi
🐦Multilingual voice-cloning TTS — 8 languages
DualTurn
DualTurn: Learning Turn-Taking from Dual-Channel Generative Speech Pretraining
Wren
Wren: A Family of Small Open-Weight Models for Unified Speech-Text Modelling
-
shangeth/Wren-TTS-0.5B-multi-expressive
Text-to-Speech • 0.5B • Updated • 142 - SleepingAgents1
Wren-TTS-0.5B-multi-expressive
🎭1Expressive multilingual voice-cloning TTS — 23 style tags
-
shangeth/Wren-TTS-0.5B-multi
Text-to-Speech • 0.5B • Updated • 145 - SleepingAgents
Wren-TTS-0.5B-multi
🐦Multilingual voice-cloning TTS — 8 languages
spaces 4
Sleeping
Agents
Wren-ASR-0.5B-multi
🐦
Multilingual ASR — 8 languages
Sleeping
Agents
1
Wren-TTS-0.5B-multi-expressive
🎭
Expressive multilingual voice-cloning TTS — 23 style tags
Sleeping
Agents
Wren-TTS-0.5B-multi
🐦
Multilingual voice-cloning TTS — 8 languages
Sleeping
Agents
Wren-TTS-360M-en
🐦
Voice-cloning TTS — Mimi codec + SmolLM2-360M (English)
models 7
shangeth/Wren-ASR-0.5B-multi
Automatic Speech Recognition • 0.5B • Updated • 93 • 2
shangeth/Wren-TTS-0.5B-multi
Text-to-Speech • 0.5B • Updated • 145
shangeth/Wren-TTS-360M-en
Text-to-Speech • 0.4B • Updated • 72
shangeth/Wren-TTS-0.5B-multi-expressive
Text-to-Speech • 0.5B • Updated • 142
shangeth/phi3-mini-ta_en
Translation • 4B • Updated • 2
shangeth/speechllm-2B
Feature Extraction • 2B • Updated • 2
shangeth/SpeechLLM
Feature Extraction • 2B • Updated • 19
datasets 10
shangeth/expresso-mimi-codes-tagged
Viewer • Updated • 25.7k • 110
shangeth/expresso-mimi-codes
Viewer • Updated • 27.5k • 255 • 1
shangeth/expresso
Viewer • Updated • 27.5k • 885
shangeth/mls-mimi-codes
Viewer • Updated • 1.47M • 1.08k
shangeth/jenny-mimi-codes
Viewer • Updated • 21k • 348
shangeth/vctk-mimi-codes
Viewer • Updated • 44.3k • 100
shangeth/libritts-r-mimi-codes
Viewer • Updated • 375k • 217
shangeth/librispeech-mimi-codes
Viewer • Updated • 292k • 127
shangeth/ljspeech-mimi-codes
Viewer • Updated • 13.1k • 345
shangeth/libriasr-mimi-codes
Preview • Updated • 271