Abstract
Deep learning has emerged as a pivotal technology across various domains, demonstrating remarkable performance. However, its susceptibility to security threats, particularly adversarial samples, poses a significant concern. These samples can manipulate inputs slightly, deceiving models such as those used in image or speech recognition and potentially leading to incorrect predictions. This undermines the reliability of deep learning in critical applications. In this paper, we propose an effective method utilizing Autoencoder to detect and intercept audio adversarial attacks before they are input to speech recognition models. The proposed approach first uses clean audio data to train the Autoencoder model, then isolates adversarial samples from clean ones by comparing their features against normal features encoded in the Autoencoder. Our method does not require prior knowledge about the target automatic speech recognition model or attack methods. Experimental results show that it can effectively discriminate adversarial attack samples from clean ones with high accuracy.
Original language | English (US) |
---|---|
Title of host publication | AVSS 2024 - 20th IEEE International Conference on Advanced Video and Signal-Based Surveillance |
Publisher | Institute of Electrical and Electronics Engineers Inc. |
Edition | 2024 |
ISBN (Electronic) | 9798350374285 |
DOIs | |
State | Published - 2024 |
Event | 20th IEEE International Conference on Advanced Video and Signal-Based Surveillance, AVSS 2024 - Niagara Falls, Canada Duration: Jul 15 2024 → Jul 16 2024 |
Conference
Conference | 20th IEEE International Conference on Advanced Video and Signal-Based Surveillance, AVSS 2024 |
---|---|
Country/Territory | Canada |
City | Niagara Falls |
Period | 7/15/24 → 7/16/24 |
ASJC Scopus subject areas
- Artificial Intelligence
- Computer Vision and Pattern Recognition
- Signal Processing
- Media Technology