Effective Adversarial Sample Detection for Securing Automatic Speech Recognition

Chih Yang Lin, Yan Zhang Wang, Shou Kuan Lin, Isack Farady, Yih Kuen Jan, Wei Yang Lin

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Deep learning has emerged as a pivotal technology across various domains, demonstrating remarkable performance. However, its susceptibility to security threats, particularly adversarial samples, poses a significant concern. These samples can manipulate inputs slightly, deceiving models such as those used in image or speech recognition and potentially leading to incorrect predictions. This undermines the reliability of deep learning in critical applications. In this paper, we propose an effective method utilizing Autoencoder to detect and intercept audio adversarial attacks before they are input to speech recognition models. The proposed approach first uses clean audio data to train the Autoencoder model, then isolates adversarial samples from clean ones by comparing their features against normal features encoded in the Autoencoder. Our method does not require prior knowledge about the target automatic speech recognition model or attack methods. Experimental results show that it can effectively discriminate adversarial attack samples from clean ones with high accuracy.

Original languageEnglish (US)
Title of host publicationAVSS 2024 - 20th IEEE International Conference on Advanced Video and Signal-Based Surveillance
PublisherInstitute of Electrical and Electronics Engineers Inc.
Edition2024
ISBN (Electronic)9798350374285
DOIs
StatePublished - 2024
Event20th IEEE International Conference on Advanced Video and Signal-Based Surveillance, AVSS 2024 - Niagara Falls, Canada
Duration: Jul 15 2024Jul 16 2024

Conference

Conference20th IEEE International Conference on Advanced Video and Signal-Based Surveillance, AVSS 2024
Country/TerritoryCanada
CityNiagara Falls
Period7/15/247/16/24

ASJC Scopus subject areas

  • Artificial Intelligence
  • Computer Vision and Pattern Recognition
  • Signal Processing
  • Media Technology

Fingerprint

Dive into the research topics of 'Effective Adversarial Sample Detection for Securing Automatic Speech Recognition'. Together they form a unique fingerprint.

Cite this