AITopics | ai-generated speech

Collaborating Authors

ai-generated speech

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Real-time Detection of AI-Generated Speech for DeepFake Voice Conversion

Bird, Jordan J., Lotfi, Ahmad

arXiv.org Artificial IntelligenceAug-24-2023

There are growing implications surrounding generative AI in the speech domain that enable voice cloning and real-time voice conversion from one individual to another. This technology poses a significant ethical threat and could lead to breaches of privacy and misrepresentation, thus there is an urgent need for real-time detection of AI-generated speech for DeepFake Voice Conversion. To address the above emerging issues, the DEEP-VOICE dataset is generated in this study, comprised of real human speech from eight well-known figures and their speech converted to one another using Retrieval-based Voice Conversion. Presenting as a binary classification problem of whether the speech is real or AI-generated, statistical analysis of temporal audio features through t-testing reveals that there are significantly different distributions. Hyperparameter optimisation is implemented for machine learning models to identify the source of speech. Following the training of 208 individual machine learning models over 10-fold cross validation, it is found that the Extreme Gradient Boosting model can achieve an average classification accuracy of 99.3% and can classify speech in real-time, at around 0.004 milliseconds given one second of speech. All data generated for this study is released publicly for future research on AI speech detection.

ai-generated speech, real-time detection, speech, (10 more...)

arXiv.org Artificial Intelligence

2308.12734

Country:

North America > United States (1.00)
Asia > Middle East > Jordan (0.04)
Europe > United Kingdom > England > Nottinghamshire > Nottingham (0.04)
Asia > Japan > Honshū > Chūbu > Nagano Prefecture > Nagano (0.04)

Genre: Research Report > New Finding (1.00)

Industry:

Information Technology > Security & Privacy (1.00)
Government > Regional Government > North America Government > United States Government (0.68)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.34)

Add feedback

AI voices are hard to spot even if you know audio might be a deepfake

New ScientistAug-2-2023, 19:00:51 GMT

Could you tell if you were listening to an AI-generated voice? Even when people know they may be listening to AI-generated speech, it is still difficult for both English and Mandarin speakers to reliably detect a deepfake voice. That means billions of people who understand the world's most spoken languages are potentially at risk when exposed to deepfake scams or misinformation. Kimberly Mai at University College London and her colleagues challenged more than 500 people to identify speech deepfakes among multiple audio clips. Some clips contained the authentic voice of a female speaker reading generic sentences in either English or Mandarin, while others were deepfakes created by generative AIs trained on female voices.

ai voice, authentic voice, deepfake, (9 more...)

New Scientist

Country: North America > United States > California > Alameda County > Berkeley (0.06)

Genre: Research Report > Experimental Study (0.53)

Industry: Information Technology > Security & Privacy (1.00)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Add feedback