Goto

Collaborating Authors

 asca


Asca: less audio data is more insightful

arXiv.org Artificial Intelligence

Audio recognition in specialized areas such as birdsong and submarine acoustics faces challenges in large-scale pre-training due to the limitations in available samples imposed by sampling environments and specificity requirements. While the Transformer model excels in audio recognition, its dependence on vast amounts of data becomes restrictive in resource-limited settings. Addressing this, we introduce the Audio Spectrogram Convolution Attention (ASCA) based on CoAtNet, integrating a Transformer-convolution hybrid architecture, novel network design, and attention techniques, further augmented with data enhancement and regularization strategies. On the BirdCLEF2023 and AudioSet(Balanced), ASCA achieved accuracies of 81.2% and 35.1%, respectively, significantly outperforming competing methods. The unique structure of our model enriches output, enabling generalization across various audio detection tasks. Our code can be found at https://github.com/LeeCiang/ASCA.


UK researchers use AI model to record keystrokes with accuracy over 90%

FOX News

The first video shows a man who thinks he's talking to a woman (bottom right corner) but is actually talking to a man (top left corner) and the second videos is deepfake demo. Researchers in the United Kingdom have used artificial intelligence technology to record the sound of keystrokes with surprising accuracy. A recent study, reportedly published as part of the IEEE European Symposium on Security and Privacy Workshops, simulated a cyberattack in which a deep learning model classified laptop keystrokes using audio from the video-conferencing platform Zoom and a smartphone-integrated microphone. Computer scientists from Durham University, University of Surrey and Royal Holloway University of London said that, when trained on keystrokes recorded by a nearby phone, the classifier achieved an accuracy of 95%. That's the highest accuracy seen without the use of a language model.


A Practical Deep Learning-Based Acoustic Side Channel Attack on Keyboards

arXiv.org Artificial Intelligence

With recent developments in deep learning, the ubiquity of micro-phones and the rise in online services via personal devices, acoustic side channel attacks present a greater threat to keyboards than ever. This paper presents a practical implementation of a state-of-the-art deep learning model in order to classify laptop keystrokes, using a smartphone integrated microphone. When trained on keystrokes recorded by a nearby phone, the classifier achieved an accuracy of 95%, the highest accuracy seen without the use of a language model. When trained on keystrokes recorded using the video-conferencing software Zoom, an accuracy of 93% was achieved, a new best for the medium. Our results prove the practicality of these side channel attacks via off-the-shelf equipment and algorithms. We discuss a series of mitigation methods to protect users against these series of attacks.


Distributed Swarm Collision Avoidance Based on Angular Calculations

arXiv.org Artificial Intelligence

Collision avoidance is one of the most important topics in the robotics field. The goal is to move the robots from initial locations to target locations such that they follow shortest non-colliding paths in the shortest time and with the least amount of energy. In this paper, a distributed and real-time algorithm for dense and complex 2D and 3D environments is proposed. This algorithm uses angular calculations to select the optimal direction for the movement of each robot and it has been shown that these separate calculations lead to a form of cooperative behavior among agents. We evaluated the proposed approach on various simulation and experimental scenarios and compared the results with FMP and ORCA, two important algorithms in this field. The results show that the proposed approach is at least 25% faster than ORCA and at least 7% faster than FMP and also more reliable than both methods. The proposed method is shown to enable fully autonomous navigation of a swarm of crazyflies.