voice data


Cost Analysis of Human-corrected Transcription for Predominately Oral Languages

Diarra, Yacouba, Coulibaly, Nouhoum Souleymane, Leventhal, Michael

arXiv.org Artificial Intelligence

Creating speech datasets for low-resource languages is a critical yet poorly understood challenge, particularly regarding the actual cost in human labor. This paper investigates the time and complexity required to produce high-quality annotated speech data for a subset of low-resource languages, low-literacy Predominately Oral Languages, focusing on Bambara, a Manding language of Mali. Through a one-month field study involving ten transcribers with native proficiency, we analyze the correction of ASR-generated transcriptions of 53 hours of Bambara voice data. We report that it takes, on average, 30 hours of human labor to accurately transcribe one hour of speech data under laboratory conditions and 36 hours under field conditions. The study provides a baseline and practical insights for a large class of languages with comparable profiles undertaking the creation of NLP resources.
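The reported rates lend themselves to a quick back-of-the-envelope budget. The sketch below applies the abstract's 30 human-hours per speech-hour (laboratory) and 36 (field) figures; the helper function is a hypothetical illustration, not code from the paper.

```python
# Labor estimate using the rates reported in the abstract:
# 30 human-hours per hour of speech in the lab, 36 in the field.
# The function name is a hypothetical helper, not from the paper.
def transcription_labor_hours(speech_hours: float, field: bool = False) -> float:
    rate = 36.0 if field else 30.0  # human-hours per speech-hour
    return speech_hours * rate

# Applied to the study's 53-hour Bambara corpus:
print(transcription_labor_hours(53))              # -> 1590.0
print(transcription_labor_hours(53, field=True))  # -> 1908.0
```

At these rates, transcribing the full 53-hour corpus represents roughly 1,590 to 1,908 hours of human labor, which is the kind of baseline the paper aims to establish for comparable languages.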


PRAC3 (Privacy, Reputation, Accountability, Consent, Credit, Compensation): Long Tailed Risks of Voice Actors in AI Data-Economy

Sharma, Tanusree, Zhou, Yihao, Berisha, Visar

arXiv.org Artificial Intelligence

Early large-scale audio datasets, such as LibriSpeech, were built with hundreds of individual contributors whose voices were instrumental in the development of speech technologies, including audiobooks and voice assistants. Yet, a decade later, these same contributions have exposed voice actors to a range of risks. While existing ethical frameworks emphasize Consent, Credit, and Compensation (C3), they do not adequately address the emergent risks involving vocal identities that are increasingly decoupled from context, authorship, and control. Drawing on qualitative interviews with 20 professional voice actors, this paper reveals how synthetic replication of voice without clear provenance or enforceable constraints exposes individuals to both reputational and security threats. Beyond reputational harm, such as re-purposing voice data in erotic content, offensive political messaging, and meme culture, we document concerns about accountability breakdowns when their voice is leveraged to clone voices that are deployed in high-stakes scenarios such as financial fraud, misinformation campaigns, or impersonation scams. In such cases, actors face social and legal fallout without recourse, while very few of them have a legal representative or union protection. To make sense of these shifting dynamics, we introduce the PRAC3 framework - an expansion of C3 that foregrounds Privacy, Reputation, Accountability, Consent, Credit, and Compensation as interdependent pillars of data used in the synthetic voice economy. This framework captures how privacy risks are amplified through non-consensual training, how reputational harm arises from decontextualized deployment, and how accountability can be reimagined in AI data ecosystems. We argue that voice, as both a biometric identifier and creative labor, demands governance models that restore creator agency, ensure traceability, and establish enforceable boundaries for ethical reuse.


Experts reveal sneaky way your phone listens in on your conversations - and how to stop it

Daily Mail - Science & tech

It was long thought to be a myth and dismissed by big tech companies. But experts have revealed how listening in on your conversations has become a multi-billion dollar industry. Earlier this week, a leak from a leading marketing firm appeared to confirm how companies use microphones on your smart devices to eavesdrop before selling the data to advertisers. 'You can be talking to one of your friends about going on a vacation to Portugal through a phone call, and then a day later or that same day, what do you see? An advertisement for a trip,' data security expert Andy LoCascio told DailyMail.com.


Your bank wants your voice. Just say no.

FOX News

You already gave your bank your address, date of birth, Social Security number and your mother's maiden name. Now, they want your voice. Banks say it's an extra layer of biometric protection against fraud and cybercrime. But with the rise of hackers stealing voice data for deepfakes, is it worth the risk?


Smartwatch-derived Acoustic Markers for Deficits in Cognitively Relevant Everyday Functioning

Yamada, Yasunori, Shinkawa, Kaoru, Kobayashi, Masatomo, Nemoto, Miyuki, Ota, Miho, Nemoto, Kiyotaka, Arai, Tetsuaki

arXiv.org Artificial Intelligence

Detection of subtle deficits in everyday functioning due to cognitive impairment is important for early detection of neurodegenerative diseases, particularly Alzheimer's disease. However, current standards for assessment of everyday functioning are based on qualitative, subjective ratings. Speech has been shown to provide good objective markers for cognitive impairments, but the association with cognition-relevant everyday functioning remains uninvestigated. In this study, we demonstrate the feasibility of using a smartwatch-based application to collect acoustic features as objective markers for detecting deficits in everyday functioning. We collected voice data during the performance of cognitive tasks and daily conversation, as possible application scenarios, from 54 older adults, along with a measure of everyday functioning. Machine learning models using acoustic features could detect individuals with deficits in everyday functioning with up to 77.8% accuracy, which was higher than the 68.5% accuracy with standard neuropsychological tests. We also identified common acoustic features for robustly discriminating deficits in everyday functioning across both types of voice data (cognitive tasks and daily conversation). Our results suggest that common acoustic features extracted from different types of voice data can be used as markers for deficits in everyday functioning.
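The abstract's pipeline (acoustic features extracted from voice data, fed to a machine learning classifier evaluated on a 54-person cohort) can be illustrated with a minimal sketch. Everything below is an assumption for illustration: synthetic stand-in features, a logistic-regression pipeline, and 5-fold cross-validation, none of which are specified by the paper.

```python
# Illustrative sketch only: the paper's actual features and models are not
# reproduced here. We mimic the setup with synthetic "acoustic features"
# (stand-ins for measures like pitch or pause ratio) and a cross-validated
# classifier over a 54-participant cohort, matching the abstract's n.
import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import cross_val_score
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler

rng = np.random.default_rng(0)
n = 54                                  # cohort size reported in the abstract
X = rng.normal(size=(n, 8))             # 8 hypothetical acoustic features
# Synthetic labels: deficit present when a weighted feature sum is positive.
y = (X[:, 0] + 0.5 * X[:, 1] + rng.normal(scale=0.5, size=n) > 0).astype(int)

clf = make_pipeline(StandardScaler(), LogisticRegression())
scores = cross_val_score(clf, X, y, cv=5)  # accuracy per fold
print(round(scores.mean(), 3))
```

Cross-validated accuracy, as in this sketch, is the usual way to report a figure like the paper's 77.8% on a small cohort, since a single train/test split would be unreliable at n = 54.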


Early Warning: Changes in Speech May Be the First Sign of Parkinson's Disease

#artificialintelligence

Parkinson's disease is a progressive nervous system disorder that affects movement and muscle control. Lithuanian researchers from Kaunas University of Technology (KTU) utilized AI to identify the early signs of Parkinson's disease using voice data. A diagnosis of Parkinson's disease upends many lives, and over 10 million people are currently living with the condition. Although there is no cure, early detection of symptoms can lead to better management of the disease. As the disease progresses, changes in speech can occur alongside other symptoms.


Talk to me: How AI can diagnose disease - POLITICO

#artificialintelligence

EXPRESSING A DISEASE: Want to know whether you have Covid-19 or even Alzheimer's? Artificial intelligence might soon have an answer just by listening to your voice. Leading researchers are developing technology that sorts through evidence of so-called vocal biomarkers to home in on medical conditions that might not be detectable during routine office visits or exams. "This line might seem to have been lifted from a Star Trek script," said Bertalan Meskó, director of the Medical Futurist Institute. "But we are close to having such conversations with our computers."


Addressing the Selection Bias in Voice Assistance: Training Voice Assistance Model in Python with Equal Data Selection

Piya, Kashav, Shrestha, Srijal, Frank, Cameran, Jebessa, Estephanos, Mohd, Tauheed Khan

arXiv.org Artificial Intelligence

In recent times, voice assistants have become a part of our day-to-day lives, allowing information retrieval by voice synthesis, voice recognition, and natural language processing. These voice assistants can be found in many modern-day devices such as those from Apple, Amazon, Google, and Samsung. This project is primarily focused on Virtual Assistance in Natural Language Processing. Natural Language Processing is a form of AI that helps machines understand people and create feedback loops. This project will use deep learning to create a Voice Recognizer, training the model in Google Colaboratory on Common Voice and data collected from the local community. After recognizing a command, the AI assistant will be able to perform the most suitable actions and then give a response. The motivation for this project comes from the race and gender bias that exists in many virtual assistants. The computer industry is primarily dominated by the male gender, and because of this, many of the products produced do not account for women. This bias has an impact on natural language processing. This project will be utilizing various open-source projects to implement machine learning algorithms and train the assistant algorithm to recognize different types of voices, accents, and dialects. Through this project, the goal is to use voice data from underrepresented groups to build a voice assistant that can recognize voices regardless of gender, race, or accent. Increasing the representation of women in the computer industry is important for the future of the industry. By representing women in the initial study of voice assistants, it can be shown that females play a vital role in the development of this technology. In line with related work, this project will use first-hand data from the college population and middle-aged adults to train a voice assistant to combat gender bias.


The Race to Hide Your Voice

WIRED

Your voice reveals more about you than you realize. To the human ear, your voice can instantly give away your mood, for example--it's easy to tell if you're excited or upset. But machines can learn a lot more: inferring your age, gender, ethnicity, socio-economic status, health conditions, and beyond. Researchers have even been able to generate images of faces based on the information contained in individuals' voice data. As machines become better at understanding you through your voice, companies are cashing in.


Deepdub closes fresh round for dubbing AI that dubs movies, shows, and games - Dataconomy

#artificialintelligence

Dubbing, where recordings in other languages are lip-synced and mixed with a show's original soundtrack, is an exploding business. One localization platform, Zoo Digital, saw revenues jump by 73% to $28.6 million in July 2018 compared to the year prior. Another, BTI Studios, told Television Business International that dubbing grew from 3% of its revenue in 2010 to 61% in 2019. According to Verified Market Research, the film dubbing market alone could be worth $3.6 billion by 2027, growing at a compound annual growth rate of 5.6% from 2020. But barriers stand in the way of expansion.