Toroidal Probabilistic Spherical Discriminant Analysis

Silnova, Anna, Brümmer, Niko, Swart, Albert, Burget, Lukáš

Oct-27-2022–arXiv.org Machine Learning

In speaker recognition, where speech segments are mapped to embeddings on the unit hypersphere, two scoring back-ends are commonly used, namely cosine scoring and PLDA. We have recently proposed PSDA, an analog to PLDA that uses Von Mises-Fisher distributions instead of Gaussians. In this paper, we present toroidal PSDA (T-PSDA). It extends PSDA with the ability to model within and between-speaker variabilities in toroidal submanifolds of the hypersphere. Like PLDA and PSDA, the model allows closed-form scoring and closed-form EM updates for training. On VoxCeleb, we find T-PSDA accuracy on par with cosine scoring, while PLDA accuracy is inferior. On NIST SRE'21 we find that T-PSDA gives large accuracy gains compared to both cosine scoring and PLDA.

artificial intelligence, cosine, machine learning, (13 more...)

arXiv.org Machine Learning

Oct-27-2022

arXiv.org PDF

Add feedback

Country:
- Africa > South Africa (0.04)
- Europe
  - Finland (0.04)
  - Sweden > Stockholm
    - Stockholm (0.04)
  - Italy > Tuscany
    - Florence (0.04)
  - Czechia > South Moravian Region
    - Brno (0.05)
  - Austria > Styria
    - Graz (0.04)
- Asia > China
  - Beijing > Beijing (0.04)

Genre:
- Research Report (0.40)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.47)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found