An Efficient GPU-based Implementation for Noise Robust Sound Source Localization
Lin, Zirui, Takigahira, Masayuki, Terakado, Naoya, Gulzar, Haris, Busto, Monikka Roslianna, Eda, Takeharu, Itoyama, Katsutoshi, Nakadai, Kazuhiro, Amano, Hideharu
–arXiv.org Artificial Intelligence
Dept. of Information and Computer Science, Keio University, Kanagawa, Japan Email: hunga@am.ics.keio.ac.jp Abstract --Robot audition, encompassing Sound Source Localization (SSL), Sound Source Separation (SSS), and Automatic Speech Recognition (ASR), enables robots and smart devices to acquire auditory capabilities similar to human hearing. Despite their wide applicability, processing multi-channel audio signals from microphone arrays in SSL involves computationally intensive matrix operations, which can hinder efficient deployment on Central Processing Units (CPUs), particularly in embedded systems with limited CPU resources. This paper introduces a GPU-based implementation of SSL for robot audition, utilizing the Generalized Singular V alue Decomposition-based Multiple Signal Classification (GSVD-MUSIC), a noise-robust algorithm, within the HARK platform, an open-source software suite. For a 60-channel microphone array, the proposed implementation achieves significant performance improvements. On the Jet-son AGX Orin, an embedded device powered by an NVIDIA GPU and ARM Cortex -A78AE v8.2 64-bit CPUs, we observe speedups of 5648.7 for GSVD calculations and 10.7 for the SSL module, while speedups of 4245.1 for GSVD calculation and 17.3 for the entire SSL module on a server configured with an NVIDIA A100 GPU and AMD EPYC 7352 CPUs, making real-time processing feasible for large-scale microphone arrays and providing ample capacity for real-time processing of potential subsequent machine learning or deep leraning tasks. I NTRODUCTION Audition is a critical aspect of human inter-individual communication [1].
arXiv.org Artificial Intelligence
May-9-2025
- Country:
- South America > Peru
- Loreto Department (0.04)
- North America > United States
- Maryland > Baltimore (0.04)
- Missouri > St. Louis County
- St. Louis (0.04)
- California > San Francisco County
- San Francisco (0.14)
- Arkansas > Washington County
- Fayetteville (0.04)
- Asia
- Middle East > Israel
- Tel Aviv District > Tel Aviv (0.04)
- Japan > Honshū
- Kantō
- Kanagawa Prefecture (0.24)
- Tokyo Metropolis Prefecture > Tokyo (0.14)
- Saitama Prefecture > Saitama (0.04)
- Kantō
- China > Beijing
- Beijing (0.04)
- Middle East > Israel
- South America > Peru
- Genre:
- Research Report (0.40)
- Industry:
- Information Technology (0.87)
- Technology:
- Information Technology
- Hardware (1.00)
- Artificial Intelligence
- Robots (1.00)
- Speech > Speech Recognition (0.68)
- Information Technology