AITopics | source localization

Learning Spatially-Aware Language and Audio Embeddings

Neural Information Processing SystemsMar-19-2026, 15:54:09 GMT

Humans can picture a sound scene given an imprecise natural language description. For example, it is easy to imagine an acoustic environment given a phrase like the lion roar came from right behind me!. For a machine to have the same degree of comprehension, the machine must know what a lion is (semantic attribute), what the concept of behind is (spatial attribute) and how these pieces of linguistic information align with the semantic and spatial attributes of the sound (what a roar sounds like when its coming from behind). State-of-the-art audio foundation models, such as CLAP, which learn to map between audio scenes and natural textual descriptions, are trained on non-spatial audio and text pairs, and hence lack spatial awareness. In contrast, sound event localization and detection models are limited to recognizing sounds from a fixed number of classes, and they localize the source to absolute position (e.g., 0.2m) rather than a position described using natural language (e.g., next to me).

artificial intelligence, natural language, proceedings, (9 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Natural Language (0.60)

Add feedback

ce953d71deeb33d9ffa2c879b518d273-Paper-Conference.pdf

Neural Information Processing SystemsFeb-18-2026, 05:21:50 GMT

information, machine learning, natural language, (21 more...)

Neural Information Processing Systems

Country:

Asia > China (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Europe > Slovenia > Drava > Municipality of Benedikt > Benedikt (0.04)

Genre: Research Report > Experimental Study (1.00)

Industry:

Health & Medicine (1.00)
Energy (0.94)
Information Technology > Security & Privacy (0.67)

Technology:

Information Technology > Data Science (0.93)
Information Technology > Communications > Networks (0.93)
Information Technology > Artificial Intelligence > Robots (0.93)
(3 more...)

Add feedback

Vocal Call Locator Benchmark (VCL) for localizing rodent vocalizations from multi-channel audio Ralph E Peterson

Neural Information Processing SystemsFeb-17-2026, 22:20:44 GMT

Here, we present the VCL Benchmark: the first large-scale dataset for benchmarking SSL algorithms in rodents.

artificial intelligence, deep learning, machine learning, (17 more...)

Neural Information Processing Systems

Country:

North America > United States (0.46)
Asia > Middle East > Iran (0.04)
Asia > Japan > Honshū > Tōhoku > Fukushima Prefecture > Fukushima (0.04)

Genre: Research Report > New Finding (0.68)

Industry: Health & Medicine > Therapeutic Area > Neurology (0.49)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.95)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.93)
Information Technology > Artificial Intelligence > Speech (0.68)

Add feedback

bf8f6f5b017dc60d0c4e28a7a9a4ee7b-Paper-Datasets_and_Benchmarks_Track.pdf

Neural Information Processing SystemsFeb-17-2026, 22:00:25 GMT

artificial intelligence, machine learning, speech enhancement, (19 more...)

Neural Information Processing Systems

Country:

Asia > Middle East > Israel (0.04)
Asia > China > Beijing > Beijing (0.04)

Technology:

Information Technology > Artificial Intelligence > Speech > Speech Recognition (0.94)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)
Information Technology > Data Science (0.67)
Information Technology > Communications (0.67)

Add feedback

Aligning Audio-Visual Joint Representations with an Agentic Workflow

Neural Information Processing SystemsFeb-15-2026, 15:39:05 GMT

Visual content and accompanied audio signals naturally formulate a joint representation to improve audio-visual (A V) related applications.

large language model, machine learning, natural language, (20 more...)

Neural Information Processing Systems

Genre:

Workflow (1.00)
Research Report > Experimental Study (1.00)

Industry: Information Technology > Security & Privacy (0.46)

Technology:

Information Technology > Data Science (1.00)
Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
(2 more...)

Add feedback

f3f2ff9579ba6deeb89caa2fe1f0b99c-Supplemental-Conference.pdf

Neural Information Processing SystemsFeb-12-2026, 20:48:29 GMT

artificial intelligence, machine learning, slavc, (13 more...)

Neural Information Processing Systems

Country: North America > United States > Wisconsin > Dane County > Madison (0.04)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

A state-space model for inferring effective connectivity of latent neural dynamics from simultaneous EEG/fMRI

Tao Tu, John Paisley, Stefan Haufe, Paul Sajda

Neural Information Processing SystemsFeb-12-2026, 12:05:20 GMT

Finally,an EEG forward model based onapre-estimated lead field matrix wasconstructed together with the ROIsource model togenerate scalp EEG observations.

artificial intelligence, connectivity, effective connectivity, (17 more...)

Neural Information Processing Systems

Country:

North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.04)
Asia > Middle East > Jordan (0.04)

Industry: Health & Medicine > Health Care Technology (0.89)

Technology: Information Technology > Artificial Intelligence (0.30)

Add feedback

46ab9d9645b6975b947231ddb48da1ab-Supplemental-Conference.pdf

Neural Information Processing SystemsFeb-11-2026, 06:07:45 GMT

artificial intelligence, dataset, machine learning, (18 more...)

Neural Information Processing Systems

Country:

South America > Chile > Santiago Metropolitan Region > Santiago Province > Santiago (0.04)
Asia (0.04)

Industry:

Health & Medicine > Therapeutic Area > Infections and Infectious Diseases (1.00)
Health & Medicine > Therapeutic Area > Immunology (1.00)
Health & Medicine > Epidemiology (1.00)

Technology:

Information Technology > Communications (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Data Science (0.94)

Add feedback

46ab9d9645b6975b947231ddb48da1ab-Paper-Conference.pdf

Neural Information Processing SystemsFeb-11-2026, 06:07:41 GMT

ddmsl, diffusion, node, (15 more...)

Neural Information Processing Systems

Country:

Asia > China > Shanghai > Shanghai (0.04)
South America > Chile > Santiago Metropolitan Region > Santiago Province > Santiago (0.04)
Asia > Middle East > Jordan (0.04)

Genre: Research Report > Experimental Study (0.46)

Industry:

Health & Medicine > Therapeutic Area > Infections and Infectious Diseases (1.00)
Health & Medicine > Therapeutic Area > Immunology (1.00)
Health & Medicine > Epidemiology (1.00)
Information Technology (0.68)

Technology:

Information Technology > Data Science (1.00)
Information Technology > Communications (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Add feedback

Systematic Evaluation of Time-Frequency Features for Binaural Sound Source Localization

Panah, Davoud Shariat, Ragano, Alessandro, Barry, Dan, Skoglund, Jan, Hines, Andrew

arXiv.org Artificial IntelligenceNov-19-2025

ABSTRACT This study presents a systematic evaluation of time-frequency feature design for binaural sound source localization (SSL), focusing on how feature selection influences model performance across diverse conditions. We investigate the performance of a convolu-tional neural network (CNN) model using various combinations of amplitude-based features (magnitude spectrogram, interaural level difference - ILD) and phase-based features (phase spectrogram, interaural phase difference - IPD). Evaluations on in-domain and out-of-domain data with mismatched head-related transfer functions (HRTFs) reveal that carefully chosen feature combinations often outperform increases in model complexity. While two-feature sets such as ILD + IPD are sufficient for in-domain SSL, generalization to diverse content requires richer inputs combining channel spectrograms with both ILD and IPD. Using the optimal feature sets, our low-complexity CNN model achieves competitive performance. Our findings underscore the importance of feature design in binaural SSL and provide practical guidance for both domain-specific and general-purpose localization.

artificial intelligence, localization, machine learning, (19 more...)

arXiv.org Artificial Intelligence

2511.13487

Country: North America (0.28)

Genre: Research Report > New Finding (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Filters

Collaborating Authors

source localization

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

Learning Spatially-Aware Language and Audio Embeddings

ce953d71deeb33d9ffa2c879b518d273-Paper-Conference.pdf

Vocal Call Locator Benchmark (VCL) for localizing rodent vocalizations from multi-channel audio Ralph E Peterson

bf8f6f5b017dc60d0c4e28a7a9a4ee7b-Paper-Datasets_and_Benchmarks_Track.pdf

Aligning Audio-Visual Joint Representations with an Agentic Workflow

f3f2ff9579ba6deeb89caa2fe1f0b99c-Supplemental-Conference.pdf

A state-space model for inferring effective connectivity of latent neural dynamics from simultaneous EEG/fMRI

46ab9d9645b6975b947231ddb48da1ab-Supplemental-Conference.pdf

46ab9d9645b6975b947231ddb48da1ab-Paper-Conference.pdf

Systematic Evaluation of Time-Frequency Features for Binaural Sound Source Localization