Sonal Kumar

sonalkum

AI & ML interests

None yet

Recent Activity

published a model about 2 months ago

sonalkum/GAMA

updated a dataset 5 months ago

sonalkum/MMAU-test-mini

published a dataset 5 months ago

sonalkum/MMAU-test-mini

View all activity

Organizations

published a model about 2 months ago

sonalkum/GAMA

Updated Jun 26

updated a dataset 5 months ago

sonalkum/MMAU-test-mini

Viewer • Updated Mar 21 • 1k • 5

published a dataset 5 months ago

sonalkum/MMAU-test-mini

Viewer • Updated Mar 21 • 1k • 5

authored a paper 5 months ago

Audio Flamingo 2: An Audio-Language Model with Long-Audio Understanding and Expert Reasoning Abilities

Paper • 2503.03983 • Published Mar 6 • 25

upvoted a paper 5 months ago

Audio Flamingo 2: An Audio-Language Model with Long-Audio Understanding and Expert Reasoning Abilities

Paper • 2503.03983 • Published Mar 6 • 25

authored 3 papers 10 months ago

Synthio: Augmenting Small-Scale Audio Classification Datasets with Synthetic Data

Paper • 2410.02056 • Published Oct 2, 2024 • 6

Do Audio-Language Models Understand Linguistic Variations?

Paper • 2410.16505 • Published Oct 21, 2024 • 1

MMAU: A Massive Multi-Task Audio Understanding and Reasoning Benchmark

Paper • 2410.19168 • Published Oct 24, 2024 • 21

upvoted a paper 10 months ago

MMAU: A Massive Multi-Task Audio Understanding and Reasoning Benchmark

Paper • 2410.19168 • Published Oct 24, 2024 • 21

updated a Space 10 months ago

Synthio Stable Audio Open

📚

Stable audio open model from Synthio paper.

updated 2 models 10 months ago

sonalkum/synthio-t5

Updated Oct 26, 2024

sonalkum/synthio-stable-audio-open

Updated Oct 19, 2024 • 2

upvoted a paper 10 months ago

Failing Forward: Improving Generative Error Correction for ASR with Synthetic Data and Retrieval Augmentation

Paper • 2410.13198 • Published Oct 17, 2024 • 10

authored a paper 11 months ago

ReCLAP: Improving Zero Shot Audio Classification by Describing Sounds

Paper • 2409.09213 • Published Sep 13, 2024 • 13

upvoted a paper 11 months ago

ReCLAP: Improving Zero Shot Audio Classification by Describing Sounds

Paper • 2409.09213 • Published Sep 13, 2024 • 13

updated 2 Spaces about 1 year ago

GAMA

🌍

Answer questions about audio

GAMA-IT

🏆

Analyze audio and answer questions about it

liked 2 Spaces about 1 year ago

GAMA-IT

🏆

Analyze audio and answer questions about it

GAMA

🌍

Answer questions about audio

authored a paper about 1 year ago

ASPIRE: Language-Guided Augmentation for Robust Image Classification

Paper • 2308.10103 • Published Aug 19, 2023

Sonal Kumar

AI & ML interests

Recent Activity

Organizations

sonalkum's activity

Synthio Stable Audio Open

GAMA

GAMA-IT

GAMA-IT

GAMA