Microphone Array Based Surveillance Audio Classification

Silva, Dimitri Leandro de Oliveira, Spadini, Tito, Suyama, Ricardo

arXiv.org Machine Learning 

Several public security systems depend directly on human action in numerous stages of its operation. The monitoring of public areas, for instance, is usually done with the use of cameras spread over the busiest places in large urban centers. In general, these systems depend on an operator to pay attention to the images so that the agencies responsible for security can be activated when events such as thefts, vandalism, and traffic accidents are observed. Considering the amount of information to which the operator is exposed, there is a high probability that surveillance failures will occur, even if the patrol center has a large team [1]. Although the operators are attentive at all times, this type of monitoring has some disadvantages: the images are limited to the direction in which the camera points and have low visibility at dusk and in cases of rain or bright light. Besides, events such as gunshots, alarms, distress calls, among others, are much more noticeable in the auditory field than in the visual [2, 3]. In this sense, the monitoring of risk areas could be done through the use of audio processing techniques, reducing the need for human participation in the surveillance process, and making public security systems more efficient [4]. To support this argument, it is worth recalling two very favorable characteristics concerning these signals: initially, the sound consumes less bandwidth in the transmission of information, reducing the need for high transmission rates, as in the case of high definition images; in addition, sound processing techniques require, in general, less computational power than techniques for video processing and analysis, which would enable the implementation of simpler and therefore less costly embedded systems [3, 5].

Duplicate Docs Excel Report

Title
None found

Similar Docs  Excel Report  more

TitleSimilaritySource
None found