PDAF: A Phonetic Debiasing Attention Framework For Speaker Verification
Baali, Massa, Aldoobi, Abdulhamid, Dhamyal, Hira, Singh, Rita, Raj, Bhiksha
–arXiv.org Artificial Intelligence
ABSTRACT Speaker verification systems are crucial for authenticating identity through voice. Traditionally, these systems focus on comparing feature vectors, overlooking the speech's content. The lexical content L determines the phonetic structure a measure of the frequency or duration of phonemes, as a P, which in turn determines the acoustics A. Thus, any production crucial cue in speaker verification. A novel Phoneme-Debiasing Attention of a signal actually represents the draws of all three variables. Framework (PDAF) is introduced, integrating with existing Content-agnostic verification systems, however, only consider the attention frameworks to mitigate biases caused by phonetic dominance. This approach paves the way for more accurate and reliable identity authentication through voice.
arXiv.org Artificial Intelligence
Sep-9-2024
- Country:
- Asia > India (0.04)
- North America > United States (0.14)
- Genre:
- Research Report (0.82)
- Industry:
- Information Technology > Security & Privacy (1.00)
- Technology: