Quan

wq2012

https://wangquan.me/

AI & ML interests

None yet

Recent Activity

Organizations

liked a model about 2 months ago

google/gemma-3n-E4B-it-litert-preview

Image-Text-to-Text • Updated May 26 • 1.37k

upvoted a paper 5 months ago

OctoTools: An Agentic Framework with Extensible Tools for Complex Reasoning

Paper • 2502.11271 • Published Feb 16 • 18

authored 2 papers 6 months ago

Improved Long-Form Speech Recognition by Jointly Modeling the Primary and Non-primary Speakers

Paper • 2312.11123 • Published Dec 18, 2023

CVSS Corpus and Massively Multilingual Speech-to-Speech Translation

Paper • 2201.03713 • Published Jan 11, 2022

updated 2 models 10 months ago

tflite-hub/conformer-lang-id

Updated Sep 19, 2024 • 38

tflite-hub/conformer-speaker-encoder

Updated Sep 19, 2024 • 77 • 5

commented a paper 10 months ago

DiarizationLM: Speaker Diarization Post-Processing with Large Language Models

Paper • 2401.03506 • Published Jan 7, 2024 • 14

updated a collection 10 months ago

DiarizationLM

Collection

5 items • Updated Sep 19, 2024

updated 3 Spaces 10 months ago

authored 2 papers 10 months ago

SpeakerStew: Scaling to Many Languages with a Triaged Multilingual Text-Dependent and Text-Independent Speaker Verification System

Paper • 2104.02125 • Published Apr 5, 2021

Attentive Temporal Pooling for Conformer-based Streaming Language Identification in Long-form Speech

Paper • 2202.12163 • Published Feb 24, 2022

liked a Space 10 months ago

923

Open ASR Leaderboard

🏆

Request evaluation for a speech model

updated a model 11 months ago

google/DiarizationLM-13b-Fisher-v1

Text Generation • 13B • Updated Aug 11, 2024 • 71 • 11

updated a Space 12 months ago

DiarizationLM GGUF

💬

Generate detailed speaker diarization from text input

updated a model 12 months ago

google/DiarizationLM-8b-Fisher-v2

8B • Updated Aug 2, 2024 • 2.47k • 30

updated a collection 12 months ago

DiarizationLM

Collection

5 items • Updated Sep 19, 2024

updated a model 12 months ago

google/DiarizationLM-8b-Fisher-v1

Text Generation • 8B • Updated Aug 2, 2024 • 39 • 3

upvoted a paper 12 months ago

DiarizationLM: Speaker Diarization Post-Processing with Large Language Models

Paper • 2401.03506 • Published Jan 7, 2024 • 14

wq2012's activity

README

Lang Id Demo

Speaker Recognition Demo

Open ASR Leaderboard

DiarizationLM GGUF