Nguyễn Tiến Đạt
datnt114
AI & ML interests
None yet
Recent Activity
liked a model about 1 month ago: reducto/RolmOCR
liked a model about 2 months ago: ByteDance-Seed/BAGEL-7B-MoT
liked a model about 2 months ago: ByteDance/Dolphin
Organizations
3D
music gen
data-SD
Speech to text
LLM
- H2O Open Ecosystem for State-of-the-art Large Language Models
  Paper • 2310.13012 • Published • 8
- SALMONN: Towards Generic Hearing Abilities for Large Language Models
  Paper • 2310.13289 • Published • 17
- MagicDance: Realistic Human Dance Video Generation with Motions & Facial Expressions Transfer
  Paper • 2311.12052 • Published • 32
video
TTS
- Large-Scale Automatic Audiobook Creation
  Paper • 2309.03926 • Published • 54
- Pop2Piano Demo
  🎹 Convert pop audio to piano cover • Runtime error • 115
- khanhld/wav2vec2-base-vietnamese-160h
  Automatic Speech Recognition • Updated • 328 • 10
- 4K4D: Real-Time 4D View Synthesis at 4K Resolution
  Paper • 2310.11448 • Published • 40