ArabCeleb: Speaker Recognition in Arabic

ArabCeleb: Speaker Recognition in Arabic

In this paper we present ArabCeleb, a dataset collected in the wild that specifically focuses on arabic language. The proposed dataset contains utterances from 100 celebrities taken from video on YouTube.com. The dataset might be used for several speaker recognition tasks: identification, verification, gender recognition as well as multimodal recognition tasks thus integrating audio and video tracks.

To complete our study, we evaluated the most recent state-of-the-art methods for speaker recognition by measuring robustness as the length of the utterances increases.

Download the ArabCeleb dataset!

Publications

1.

ArabCeleb: Speaker Recognition in Arabic
(Simone Bianco, Luigi Celona, Intissar Khalifa, Paolo Napoletano, Alexey Petrovsky, Flavio Piccoli, Raimondo Schettini, Ivan Shanin) In International Conference of the Italian Association for Artificial Intelligence (AIxIA), 2021.

@inproceedings{bianco2021arabceleb,
 author = {Bianco, Simone and Celona, Luigi and Khalifa, Intissar and Napoletano, Paolo and Petrovsky, Alexey and Piccoli, Flavio and Schettini, Raimondo and Shanin, Ivan},
 year = {2021},
 title = {ArabCeleb: Speaker Recognition in Arabic},
 booktitle = {International Conference of the Italian Association for Artificial Intelligence (AIxIA)},
 projectref = {http://www.ivl.disco.unimib.it/activities/arabceleb/}}