BAN: Neuroanatomical Aligning in Auditory Recognition between Artificial Neural Network and Human Cortex

Wang, Haidong, Xiao, Pengfei, Liu, Ao, Zhang, Jianhua, Shan, Qia

arXiv.org Artificial Intelligence 

--Drawing inspiration from neurosciences, artificial neural networks (ANNs) have evolved from shallow architectures to highly complex, deep structures, yielding exceptional performance in auditory recognition tasks. However, traditional ANNs often struggle to align with brain regions due to their excessive depth and lack of biologically realistic features, like recurrent connection. T o address this, a brain-like auditory network (BAN) is introduced, which incorporates four neuroanatomically mapped areas and recurrent connection, guided by a novel metric called the brain-like auditory score (BAS). BAS serves as a benchmark for evaluating the similarity between BAN and human auditory recognition pathway. We further propose that specific areas in the cerebral cortex, mainly the middle and medial superior temporal (T2/T3) areas, correspond to the designed network structure, drawing parallels with the brain's auditory perception pathway. Our findings suggest that the neuroanatomical similarity in the cortex and auditory classification abilities of the ANN are well-aligned. In addition to delivering excellent performance on a music genre classification task, the BAN demonstrates a high BAS score. In conclusion, this study presents BAN as a recurrent, brain-inspired ANN, representing the first model that mirrors the cortical pathway of auditory recognition. EARING plays a vital role in human sound recognition and is especially important for the comprehension and creation of music. One key task in this domain is genre classification, which involves predicting the genre of a piece of music based on its audio signal. Music genres, such as jazz, rock, and classical, serve as descriptive labels that provide high-level information about a musical piece. As noted by previous work [2], genres are classes introduced by humans to categorize musical works.