AITopics | soundscape recording

Collaborating Authors

soundscape recording

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Semi-supervised classification of bird vocalizations

Hexeberg, Simen, Chitre, Mandar, Hoffmann-Kuhnt, Matthias, Low, Bing Wen

arXiv.org Artificial IntelligenceFeb-19-2025

Changes in bird populations can indicate broader changes in ecosystems, making birds one of the most important animal groups to monitor. Combining machine learning and passive acoustics enables continuous monitoring over extended periods without direct human involvement. However, most existing techniques require extensive expert-labeled datasets for training and cannot easily detect time-overlapping calls in busy soundscapes. We propose a semi-supervised acoustic bird detector designed to allow both the detection of time-overlapping calls (when separated in frequency) and the use of few labeled training samples. The classifier is trained and evaluated on a combination of community-recorded open-source data and long-duration soundscape recordings from Singapore. It outperforms the state-of-the-art BirdNET classifier on a test set of 103 bird species despite significantly fewer labeled training samples. The detector is further tested on 144 microphone-hours of continuous soundscape data. The rich soundscape in Singapore makes suppression of false positives a challenge on raw, continuous data streams. Nevertheless, we demonstrate that achieving high precision in such environments with minimal labeled training data is possible. Introduction Biodiversity monitoring is a critical aspect of biodiversity conservation, as it helps inform decision making, improves our knowledge and enhances public education and awareness. Birds are one of the most surveyed animal groups in biodiversity monitoring programmes, with point counts and transect surveys being well-established survey techniques for monitoring bird communities [1]. However, birds can be very difficult to detect and identify especially in tropical regions characterised by high avian diversity and numerous rare species [2], [3]. Additionally, such manned survey techniques are manpower-intensive, require highly specialized expertise, and tend to overlook rare species that are sensitive to human presence [4], [5], [6]. Passive monitoring of biodiversity using acoustics is thus an area of great potential, as various animal groups including birds make unique vocalizations, which can be used to validate their presence.

classifier, representation, tfr, (16 more...)

arXiv.org Artificial Intelligence

2502.1344

Country:

Asia > Singapore (0.46)
North America > United States > Florida > Orange County > Orlando (0.04)
North America > Panama (0.04)
(4 more...)

Genre: Research Report (0.50)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.49)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Add feedback

Towards Deep Active Learning in Avian Bioacoustics

Rauch, Lukas, Huseljic, Denis, Wirth, Moritz, Decke, Jens, Sick, Bernhard, Scholz, Christoph

arXiv.org Artificial IntelligenceJun-26-2024

Passive acoustic monitoring (PAM) in avian bioacoustics enables cost-effective and extensive data collection with minimal disruption to natural habitats. Despite advancements in computational avian bioacoustics, deep learning models continue to encounter challenges in adapting to diverse environments in practical PAM scenarios. This is primarily due to the scarcity of annotations, which requires labor-intensive efforts from human experts. Active learning (AL) reduces annotation cost and speed ups adaption to diverse scenarios by querying the most informative instances for labeling. This paper outlines a deep AL approach, introduces key challenges, and conducts a small-scale pilot study.

active learning, avian bioacoustic, learning, (12 more...)

arXiv.org Artificial Intelligence

2406.18621

Country:

North America > United States > Nevada (0.05)
Europe > Switzerland (0.05)
Europe > Germany (0.05)
South America > Peru (0.04)

Genre: Research Report (0.82)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.69)

Add feedback

BirdSet: A Dataset and Benchmark for Classification in Avian Bioacoustics

Rauch, Lukas, Schwinger, Raphael, Wirth, Moritz, Heinrich, René, Huseljic, Denis, Lange, Jonas, Kahl, Stefan, Sick, Bernhard, Tomforde, Sven, Scholz, Christoph

arXiv.org Artificial IntelligenceJun-17-2024

Deep learning (DL) models have emerged as a powerful tool in avian bioacoustics to assess environmental health. To maximize the potential of cost-effective and minimal-invasive passive acoustic monitoring (PAM), DL models must analyze bird vocalizations across a wide range of species and environmental conditions. However, data fragmentation challenges a comprehensive evaluation of generalization performance. Therefore, we introduce the BirdSet dataset, comprising approximately 520,000 global bird recordings for training and over 400 hours of PAM recordings for testing. Our benchmark offers baselines for several DL models to enhance comparability and consolidate research across studies, along with code implementations that include comprehensive training and evaluation protocols.

dataset, soundscape recording, vocalization, (15 more...)

arXiv.org Artificial Intelligence

2403.1038

Country:

North America > United States > New York > Tompkins County > Ithaca (0.14)
North America > United States > Nevada (0.04)
North America > Costa Rica (0.04)
(13 more...)

Genre: Research Report > New Finding (0.46)

Industry:

Law (1.00)
Information Technology (1.00)
Government (0.67)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Data Science (0.92)

Add feedback

AudioProtoPNet: An interpretable deep learning model for bird sound classification

Heinrich, René, Sick, Bernhard, Scholz, Christoph

arXiv.org Artificial IntelligenceMay-29-2024

Recently, scientists have proposed several deep learning models to monitor the diversity of bird species. These models can detect bird species with high accuracy by analyzing acoustic signals. However, traditional deep learning algorithms are black-box models that provide no insight into their decision-making process. For domain experts, such as ornithologists, it is crucial that these models are not only efficient, but also interpretable in order to be used as assistive tools. In this study, we present an adaption of the Prototypical Part Network (ProtoPNet) for audio classification that provides inherent interpretability through its model architecture. Our approach is based on a ConvNeXt backbone architecture for feature extraction and learns prototypical patterns for each bird species using spectrograms of the training data. Classification of new data is done by comparison with these prototypes in latent space, which simultaneously serve as easily understandable explanations for the model's decisions. We evaluated the performance of our model on seven different datasets representing bird species from different geographical regions. In our experiments, the model showed excellent results, achieving an average AUROC of 0.82 and an average cmAP of 0.37 across the seven datasets, making it comparable to state-of-the-art black-box models for bird sound classification. Thus, this work demonstrates that even for the challenging task of bioacoustic bird classification, powerful yet interpretable deep learning models can be developed to provide valuable insights to domain experts.

audioprotopnet, classification, prototype, (12 more...)

arXiv.org Artificial Intelligence

2404.1042

Country:

North America > United States > Nevada (0.04)
North America > Costa Rica (0.04)
South America > Colombia (0.04)
(3 more...)

Genre: Research Report > New Finding (1.00)

Industry:

Health & Medicine (0.68)
Energy (0.46)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Active Bird2Vec: Towards End-to-End Bird Sound Monitoring with Transformers

Rauch, Lukas, Schwinger, Raphael, Wirth, Moritz, Sick, Bernhard, Tomforde, Sven, Scholz, Christoph

arXiv.org Artificial IntelligenceNov-21-2023

We propose a shift towards end-to-end learning in bird sound monitoring by combining self-supervised (SSL) and deep active learning (DAL). Leveraging transformer models, we aim to bypass traditional spectrogram conversions, enabling direct raw audio processing. ActiveBird2Vec is set to generate high-quality bird sound representations through SSL, potentially accelerating the assessment of environmental changes and decision-making processes for wind farms. Additionally, we seek to utilize the wide variety of bird vocalizations through DAL, reducing the reliance on extensively labeled datasets by human experts. We plan to curate a comprehensive set of tasks through Huggingface Datasets, enhancing future comparability and reproducibility of bioacoustic research. A comparative analysis between various transformer models will be conducted to evaluate their proficiency in bird sound recognition tasks. We aim to accelerate the progression of avian bioacoustic research and contribute to more effective conservation strategies.

arxiv, representation, transformer model, (15 more...)

arXiv.org Artificial Intelligence

2308.07121

Country:

North America > United States > Illinois > Cook County > Chicago (0.05)
North America > Dominican Republic (0.04)
North America > Canada > Ontario > Toronto (0.04)
(2 more...)

Genre: Research Report (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.47)

Add feedback

In Search for a Generalizable Method for Source Free Domain Adaptation

Boudiaf, Malik, Denton, Tom, van Merriënboer, Bart, Dumoulin, Vincent, Triantafillou, Eleni

arXiv.org Artificial IntelligenceJun-24-2023

Source-free domain adaptation (SFDA) is compelling because it allows adapting an off-the-shelf model to a new domain using only unlabelled data. In this work, we apply existing SFDA techniques to a challenging set of naturally-occurring distribution shifts in bioacoustics, which are very different from the ones commonly studied in computer vision. We find existing methods perform differently relative to each other than observed in vision benchmarks, and sometimes perform worse than no adaptation at all. We propose a new simple method which outperforms the existing methods on our new shifts while exhibiting strong performance on a range of vision datasets. Our findings suggest that existing SFDA methods are not as generalizable as previously thought and that considering diverse modalities can be a useful avenue for designing more robust models.

artificial intelligence, deep learning, machine learning, (17 more...)

arXiv.org Artificial Intelligence

2302.06658

Country:

North America > United States > Nevada (0.05)
South America > Colombia (0.04)
North America > United States > California (0.04)
(5 more...)

Genre: Research Report > New Finding (0.86)

Industry: Education (0.46)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.94)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.93)

Add feedback

Comparing Western and Chinese classical music using deep learning algorithms

#artificialintelligenceMar-23-2020, 19:47:37 GMT

Deep learning techniques are proving to be extremely useful for analyzing all kinds of data, ranging from images to text, online posts and audio recordings. These techniques are designed to identify patterns in large datasets, separate items in different categories and make predictions far quicker than humans. In a recent study, researchers at Simon Fraser University, Academia Sinica and Dartmouth College have applied deep learning techniques to identify similarities and differences between Chinese and Western classical music. Their paper, pre-published on arXiv, presents a comparative analysis of music recordings using sound event detection (SED) and soundscape emotion recognition (SER) models. "We have listened to both Chinese and Western classical music," Jianyu Fan, one of the researchers who carried out the study, told TechXplore.

chinese and western classical music, classical music, music, (13 more...)

#artificialintelligence

Country: Asia > China (0.05)

Genre: Research Report > New Finding (0.32)

Industry:

Media > Music (1.00)
Leisure & Entertainment (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

BirdCLEF 2018 ImageCLEF / LifeCLEF - Multimedia Retrieval in CLEF

#artificialintelligenceMay-16-2018, 19:00:36 GMT

As in 2017, two scenarios will be evaluated, (i) the identification of a particular bird specimen in a recording of it, and (ii), the recognition of all specimens singing in a long sequence (up to one hour) of raw soundscapes that can contain tens of birds singing simultaneously. The first scenario is aimed at developing new interactive identification tools, to help user and expert who is today equipped with a directional microphone and spend too much time observing and listening the birds to assess their population on the field. The soundscapes, on the other side, correspond to a passive monitoring scenario in which any multi-directional audio recording device could be used without or with very light user's involvement, and thus efficient biodiversity assessment. The goal of the task is to identify the species of the most audible bird (i.e. the one that was intended to be recorded) in each of the provided test recordings. Therefore, the evaluated systems have to return a ranked list of possible species for each of the 12,347 test recordings.

artificial intelligence, machine learning, training data, (16 more...)

#artificialintelligence

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.81)

Add feedback