Vox-Profile
Collection
This collection includes the implementation of models described in Vox-Profile benchmark. (https://arxiv.org/pdf/2505.14648)
•
14 items
•
Updated
•
2
This model includes the implementation of age and sex classification described in Vox-Profile: A Speech Foundation Model Benchmark for Characterizing Diverse Speaker and Speech Traits (https://arxiv.org/pdf/2505.14648)
The sex labels are: ["Female", "Male"]. The age output is from 0-1, and times 100 is the actual age.
@article{feng2025vox,
title={Vox-Profile: A Speech Foundation Model Benchmark for Characterizing Diverse Speaker and Speech Traits},
author={Feng, Tiantian and Lee, Jihwan and Xu, Anfeng and Lee, Yoonjeong and Lertpetchpun, Thanathai and Shi, Xuan and Wang, Helin and Thebaud, Thomas and Moro-Velazquez, Laureano and Byrd, Dani and others},
journal={arXiv preprint arXiv:2505.14648},
year={2025}
}
Base model
microsoft/wavlm-large