
Parakeets teach a lesson in friendship

Popular Science

Making new friends (especially as an adult) can be challenging. When new birds are introduced to a group, monk parakeets will "test the waters" to avoid getting injured by defensive strangers. The parakeets gradually approach the new bird, taking some time to get familiar before ramping up to the riskier, more vulnerable interactions needed to form the bonds necessary for survival. "There can be a lot of benefits to being social, but these friendships have to start somewhere," Claire O'Connell, a study co-author and doctoral student at the University of Cincinnati, said in a statement.


Unveiling the Best Practices for Applying Speech Foundation Models to Speech Intelligibility Prediction for Hearing-Impaired People

Zhou, Haoshuai, Cao, Boxuan, Mo, Changgeng, Li, Linkai, Wang, Shan Xiang

arXiv.org Artificial Intelligence

Speech foundation models (SFMs) have demonstrated strong performance across a variety of downstream tasks, including speech intelligibility prediction for hearing-impaired people (SIP-HI). However, optimizing SFMs for SIP-HI has been insufficiently explored. In this paper, we conduct a comprehensive study to identify key design factors affecting SIP-HI performance with 5 SFMs, focusing on encoder layer selection, prediction head architecture, and ensemble configurations. Our findings show that, contrary to traditional use-all-layers methods, selecting a single encoder layer yields better results. Additionally, temporal modeling is crucial for effective prediction heads. We also demonstrate that ensembling multiple SFMs improves performance, with stronger individual models providing greater benefit. Finally, we explore the relationship between key SFM attributes and their impact on SIP-HI performance. Our study offers practical insights into effectively adapting SFMs for speech intelligibility prediction for hearing-impaired populations.
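
The abstract's two central design findings, tapping a single encoder layer rather than combining all layers, and giving the prediction head an explicit temporal model, can be illustrated with a minimal sketch. The code below assumes a generic SFM encoder that exposes per-layer hidden states; the layer index, BiLSTM head, and feature sizes are illustrative assumptions, not the paper's actual configuration. Ensembling, the third factor studied, would amount to averaging such scores across several SFM backbones.

# Minimal sketch (not the paper's code): an intelligibility-prediction
# head that reads ONE chosen encoder layer of a speech foundation model
# and models time explicitly before pooling to a scalar score.
import torch
import torch.nn as nn

class SingleLayerTemporalHead(nn.Module):
    """Predict an intelligibility score from one SFM encoder layer.

    Assumptions (illustrative only): feature dim 768, a BiLSTM as the
    temporal model, mean pooling over time.
    """
    def __init__(self, feat_dim: int = 768, hidden: int = 256,
                 layer_index: int = 9):
        super().__init__()
        self.layer_index = layer_index          # the single layer to tap
        self.temporal = nn.LSTM(feat_dim, hidden, batch_first=True,
                                bidirectional=True)
        self.proj = nn.Linear(2 * hidden, 1)    # scalar intelligibility score

    def forward(self, hidden_states: list[torch.Tensor]) -> torch.Tensor:
        # hidden_states: per-layer features from the (frozen) SFM encoder,
        # each of shape (batch, time, feat_dim).
        x = hidden_states[self.layer_index]     # one layer, not a weighted sum
        x, _ = self.temporal(x)                 # temporal modeling over frames
        return self.proj(x.mean(dim=1)).squeeze(-1)  # pool over time

# Toy usage with random features standing in for real SFM outputs.
if __name__ == "__main__":
    fake_states = [torch.randn(2, 100, 768) for _ in range(13)]
    head = SingleLayerTemporalHead()
    print(head(fake_states).shape)  # torch.Size([2])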


We finally know how parrots 'talk'

Popular Science

Parrots are so adept at mimicking people that the avian moniker has become synonymous with repetition. Yet for as long as we've known about the birds' incredible talent for impressions, how they manage such complex and flexible vocalizations has been a mystery. A new study offers a piece of the puzzle by peeking into the parakeet brain and finding remarkable similarities to the human neural region that controls speech. The research, published March 19 in the journal Nature, suggests parrots (and specifically parakeets) could be a model for studying human speech, helping scientists to better understand and treat speech disorders. It also adds to the growing stack of scientific findings that demonstrate "bird-brained" isn't much of an insult after all.


Generating Data with Text-to-Speech and Large-Language Models for Conversational Speech Recognition

Cornell, Samuele, Darefsky, Jordan, Duan, Zhiyao, Watanabe, Shinji

arXiv.org Artificial Intelligence

Currently, a common approach in many speech processing tasks is to leverage large-scale pre-trained models by fine-tuning them on in-domain data for a particular application. Yet obtaining even a small amount of such data can be problematic, especially for sensitive domains and conversational speech scenarios, due to both privacy issues and annotation costs. To address this, synthetic data generation using single-speaker datasets has been employed. However, for multi-speaker cases, such an approach often requires extensive manual effort and is prone to domain mismatches. In this work, we propose a synthetic data generation pipeline for multi-speaker conversational ASR, leveraging a large language model (LLM) for content creation and a conversational multi-speaker text-to-speech (TTS) model for speech synthesis. We evaluate the pipeline by fine-tuning the Whisper ASR model for telephone and distant conversational speech settings, using both in-domain data and generated synthetic data. Our results show that the proposed method significantly outperforms classical multi-speaker generation approaches that use external, non-conversational speech datasets.
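
The overall pipeline shape the abstract describes (LLM writes a conversation, a multi-speaker TTS renders it, and the audio/transcript pairs feed ASR fine-tuning) can be sketched in a few functions. Everything below is a hypothetical stand-in: generate_dialogue, synthesize_conversation, and the prompt format are invented for illustration and are not the authors' actual tooling.

# Minimal sketch of an LLM -> conversational TTS -> ASR fine-tuning
# data pipeline. All names and interfaces are hypothetical stand-ins.
from dataclasses import dataclass

@dataclass
class Turn:
    speaker: str   # e.g. "A" or "B"
    text: str

def generate_dialogue(llm, topic: str, n_turns: int) -> list[Turn]:
    """HYPOTHETICAL: ask an LLM (any text->text callable) for a
    two-party conversation transcript."""
    prompt = (f"Write a natural {n_turns}-turn phone conversation about "
              f"{topic}. Prefix each line with 'A:' or 'B:'.")
    lines = llm(prompt).splitlines()
    return [Turn(l[0], l[2:].strip()) for l in lines
            if l[:2] in ("A:", "B:")]

def synthesize_conversation(tts, turns: list[Turn]):
    """HYPOTHETICAL: render the transcript with a multi-speaker TTS
    callable, returning an (audio, reference_text) training pair."""
    script = "\n".join(f"{t.speaker}: {t.text}" for t in turns)
    audio = tts(script)                       # one conversational waveform
    text = " ".join(t.text for t in turns)    # reference for ASR training
    return audio, text

def build_corpus(llm, tts, topics: list[str], n_turns: int = 12):
    """Pair each synthetic conversation with its transcript; the
    resulting (audio, text) list can feed any ASR fine-tuning recipe
    (e.g. for Whisper)."""
    return [synthesize_conversation(tts, generate_dialogue(llm, t, n_turns))
            for t in topics]

# usage: build_corpus(my_llm, my_tts, ["travel plans", "a repair call"])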


Intelligently Aiding Human-Guided Correction of Speech Recognition

Vertanen, Keith (University of Cambridge) | Kristensson, Per Ola (University of Cambridge)

AAAI Conferences

Correcting recognition errors is often necessary in a speech interface. These errors not only reduce users' overall entry rate, but can also lead to frustration. While making fewer recognition errors is undoubtedly helpful, facilities for supporting user-guided correction are also critical. We explore how to better support user corrections using Parakeet — a continuous speech recognition system for mobile touch-screen devices. Parakeet's interface is designed for easy error correction on a handheld device. Users correct errors by selecting alternative words from a word confusion network and by typing on a predictive software keyboard. Our interface design was guided by computational experiments and used a variety of information sources to aid the correction process. In user studies, participants were able to write text effectively despite sometimes high initial recognition error rates. Using Parakeet as an example, we discuss principles we found were important for building an effective speech correction interface.
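
Parakeet's central data structure, the word confusion network, is simple to sketch: the recognizer's lattice is collapsed into a sequence of slots, each holding competing word hypotheses with posterior probabilities, and correction becomes tapping a slot and picking a different word. The sketch below is illustrative only; the slot contents and the tiny API are invented for the example and are not Parakeet's actual code.

# Illustrative sketch of a word confusion network (WCN) as used in
# correction UIs like Parakeet's: each slot lists alternative words
# with posterior probabilities, and a correction swaps the selection.
# The data and API here are invented for the example.

# One slot per spoken word; alternatives sorted by posterior.
wcn = [
    [("the", 0.95), ("a", 0.04)],
    [("cat", 0.60), ("hat", 0.30), ("cut", 0.08)],
    [("sat", 0.88), ("sad", 0.10)],
]

# The 1-best hypothesis the user sees first (index 0 in every slot).
selection = [0, 0, 0]

def current_text(wcn, selection) -> str:
    return " ".join(wcn[i][j][0] for i, j in enumerate(selection))

def correct(wcn, selection, slot: int, word: str):
    """User taps a slot and picks an alternative word from it."""
    words = [w for w, _ in wcn[slot]]
    selection[slot] = words.index(word)   # ValueError if word not offered

print(current_text(wcn, selection))       # "the cat sat"
correct(wcn, selection, slot=1, word="hat")
print(current_text(wcn, selection))       # "the hat sat"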