DualTurn: Learning Turn-Taking from Dual-Channel Generative Speech Pretraining
Shangeth Rajaa
shangeth
AI & ML interests
Speech Representation Learning, Multi-Modal LLM, Spoken Dialogue Systems, Speech Synthesis
Recent Activity
updated a model about 8 hours ago: shangeth/Wren-ASR-0.5B-multi
updated a Space 1 day ago: shangeth/Wren-ASR-0.5B-multi-demo
published a Space 1 day ago: shangeth/Wren-ASR-0.5B-multi-demo
Wren
Wren: A Family of Small Open-Weight Models for Unified Speech-Text Modelling
-
shangeth/Wren-TTS-0.5B-multi-expressive
Text-to-Speech • 0.5B • Updated • 114
-
Sleeping
Agents
Wren-TTS-0.5B-multi-expressive
🎭Expressive multilingual voice-cloning TTS — 23 style tags
-
shangeth/Wren-TTS-0.5B-multi
Text-to-Speech • 0.5B • Updated • 137
-
Running
Agents
Wren-TTS-0.5B-multi
🐦Multilingual voice-cloning TTS — 8 languages
DualTurn
DualTurn: Learning Turn-Taking from Dual-Channel Generative Speech Pretraining
Spaces (4)
Running
Agents
Wren-ASR-0.5B-multi
🐦
Multilingual ASR — 8 languages
Sleeping
Agents
Wren-TTS-0.5B-multi-expressive
🎭
Expressive multilingual voice-cloning TTS — 23 style tags
Running
Agents
Wren-TTS-0.5B-multi
🐦
Multilingual voice-cloning TTS — 8 languages
Sleeping
Agents
Wren-TTS-360M-en
🐦
Voice-cloning TTS — Mimi codec + SmolLM2-360M (English)
Models (7)
shangeth/Wren-ASR-0.5B-multi
Automatic Speech Recognition • 0.5B • Updated • 20
shangeth/Wren-TTS-0.5B-multi
Text-to-Speech • 0.5B • Updated • 137
shangeth/Wren-TTS-360M-en
Text-to-Speech • 0.4B • Updated • 146
shangeth/Wren-TTS-0.5B-multi-expressive
Text-to-Speech • 0.5B • Updated • 114
shangeth/phi3-mini-ta_en
Translation • 4B • Updated • 1
shangeth/speechllm-2B
Feature Extraction • 2B • Updated • 1
shangeth/SpeechLLM
Feature Extraction • 2B • Updated • 23
Datasets (10)
shangeth/expresso-mimi-codes-tagged
Viewer • Updated • 25.7k • 85
shangeth/expresso-mimi-codes
Viewer • Updated • 27.5k • 169 • 1
shangeth/expresso
Viewer • Updated • 27.5k • 493
shangeth/mls-mimi-codes
Viewer • Updated • 1.47M • 885
shangeth/jenny-mimi-codes
Viewer • Updated • 21k • 241
shangeth/vctk-mimi-codes
Viewer • Updated • 44.3k • 97
shangeth/libritts-r-mimi-codes
Viewer • Updated • 375k • 242
shangeth/librispeech-mimi-codes
Viewer • Updated • 292k • 79
shangeth/ljspeech-mimi-codes
Viewer • Updated • 13.1k • 262
shangeth/libriasr-mimi-codes
Preview • Updated • 177