Yifan Peng

pyf98

·

https://pyf98.github.io

AI & ML interests

Multimodal LLMs, Speech-to-Speech, Speech Recognition

Organizations

Collections 1

Papers 20

arxiv:2604.24954

arxiv:2506.00338

arxiv:2505.24200

arxiv:2505.13404

spaces 1

OWSM Demo

models 48

pyf98/DPHuBERT

Updated Oct 18, 2023 • 4

pyf98/fisher_callhome_spanish_e_branchformer

Automatic Speech Recognition • Updated Mar 1, 2023 • 1

pyf98/fisher_callhome_spanish_conformer

Automatic Speech Recognition • Updated Mar 1, 2023 • 4

pyf98/slurp_entity_e_branchformer

Automatic Speech Recognition • Updated Feb 28, 2023 • 2

pyf98/aidatatang_200zh_e_branchformer_e16

Automatic Speech Recognition • Updated Feb 24, 2023 • 1

pyf98/librispeech_100_transducer_e_branchformer

Automatic Speech Recognition • Updated Feb 22, 2023 • 1

pyf98/librispeech_100_transducer_conformer

Automatic Speech Recognition • Updated Feb 22, 2023 • 2 • 1

pyf98/jsut_e_branchformer

Automatic Speech Recognition • Updated Feb 22, 2023 • 4

pyf98/aishell_ctc_e_branchformer_e12

Automatic Speech Recognition • Updated Feb 22, 2023 • 9

pyf98/aishell_ctc_conformer_e15_linear1024

Automatic Speech Recognition • Updated Feb 22, 2023 • 2

datasets 0

None public yet