Adversarial Example Detection by Classification for Deep Speech Recognition

Saeid Samizade; Zheng-Hua Tan; Chao Shen; Guan  Xiaohong

doi:10.1109/ICASSP40776.2020.9054750

Adversarial Example Detection by Classification for Deep Speech Recognition

Saeid Samizade, Zheng-Hua Tan, Chao Shen, Guan Xiaohong

Research output: Contribution to book/anthology/report/conference proceeding › Article in proceeding › Research › peer-review

29 Citations (Scopus)

Abstract

Machine Learning systems are vulnerable to adversarial attacks and will highly likely produce incorrect outputs under these attacks. There are white-box and black-box attacks regarding to adversary’s access level to the victim learning algorithm. To defend the learning systems from these attacks, existing methods in the speech domain focus on modifying input signals and testing the behaviours of speech recognizers. We, however, formulate the defense as a classification problem and present a strategy for systematically generating adversarial example datasets: one for white-box attacks and one for black-box attacks, containing both adversarial and normal examples. The white-box attack is a gradient-based method on Baidu DeepSpeech with the Mozilla Common Voice database while the black-box attack is a gradient-free method on a deep model-based keyword spotting system with the Google Speech Command dataset. The generated datasets are used to train a proposed Convolutional Neural Network (CNN), together with cepstral features, to detect adversarial examples. Experimental results show that, it is possible to accurately distinct between adversarial and normal examples for known attacks, in both single-condition and multi-condition training settings, while the performance degrades dramatically for unknown attacks. The adversarial datasets and the source code are made publicly available.

Original language	English
Title of host publication	ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
Number of pages	5
Publisher	IEEE
Publication date	9 Apr 2020
Pages	3102-3106
Article number	9054750
ISBN (Print)	978-1-5090-6632-2
ISBN (Electronic)	978-1-5090-6631-5
DOIs	https://doi.org/10.1109/ICASSP40776.2020.9054750
Publication status	Published - 9 Apr 2020
Event	ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) - Barcelona, Spain Duration: 4 May 2020 → 8 May 2020

Conference

Conference	ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
Country/Territory	Spain
City	Barcelona
Period	04/05/2020 → 08/05/2020

Series	ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
ISSN	1520-6149

Keywords

Adversarial attack
Cepstral feature
Convolutional neural network
Speech recognition

Access to Document

10.1109/ICASSP40776.2020.9054750

https://arxiv.org/pdf/1910.10013.pdf

AUB Link

Search for the material in Aalborg University Library's search engine

Cite this

@inproceedings{7b7ba5aab4f14bdc84fb9035c4b0f954,

title = "Adversarial Example Detection by Classification for Deep Speech Recognition",

abstract = "Machine Learning systems are vulnerable to adversarial attacks and will highly likely produce incorrect outputs under these attacks. There are white-box and black-box attacks regarding to adversary{\textquoteright}s access level to the victim learning algorithm. To defend the learning systems from these attacks, existing methods in the speech domain focus on modifying input signals and testing the behaviours of speech recognizers. We, however, formulate the defense as a classification problem and present a strategy for systematically generating adversarial example datasets: one for white-box attacks and one for black-box attacks, containing both adversarial and normal examples. The white-box attack is a gradient-based method on Baidu DeepSpeech with the Mozilla Common Voice database while the black-box attack is a gradient-free method on a deep model-based keyword spotting system with the Google Speech Command dataset. The generated datasets are used to train a proposed Convolutional Neural Network (CNN), together with cepstral features, to detect adversarial examples. Experimental results show that, it is possible to accurately distinct between adversarial and normal examples for known attacks, in both single-condition and multi-condition training settings, while the performance degrades dramatically for unknown attacks. The adversarial datasets and the source code are made publicly available.",

keywords = "Adversarial attack, Cepstral feature, Convolutional neural network, Speech recognition",

author = "Saeid Samizade and Zheng-Hua Tan and Chao Shen and Guan Xiaohong",

year = "2020",

month = apr,

day = "9",

doi = "10.1109/ICASSP40776.2020.9054750",

language = "English",

isbn = "978-1-5090-6632-2",

series = "ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings",

publisher = "IEEE",

pages = "3102--3106",

booktitle = "ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)",

address = "United States",

note = "ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) ; Conference date: 04-05-2020 Through 08-05-2020",

}

Samizade, S, Tan, Z-H, Shen, C & Xiaohong, G 2020, Adversarial Example Detection by Classification for Deep Speech Recognition. in ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)., 9054750, IEEE, ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, pp. 3102-3106, ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Barcelona, Spain, 04/05/2020. https://doi.org/10.1109/ICASSP40776.2020.9054750

Adversarial Example Detection by Classification for Deep Speech Recognition. / Samizade, Saeid; Tan, Zheng-Hua; Shen, Chao et al.
ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, 2020. p. 3102-3106 9054750 (ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings).

Research output: Contribution to book/anthology/report/conference proceeding › Article in proceeding › Research › peer-review

TY - GEN

T1 - Adversarial Example Detection by Classification for Deep Speech Recognition

AU - Samizade, Saeid

AU - Tan, Zheng-Hua

AU - Shen, Chao

AU - Xiaohong, Guan

PY - 2020/4/9

Y1 - 2020/4/9

N2 - Machine Learning systems are vulnerable to adversarial attacks and will highly likely produce incorrect outputs under these attacks. There are white-box and black-box attacks regarding to adversary’s access level to the victim learning algorithm. To defend the learning systems from these attacks, existing methods in the speech domain focus on modifying input signals and testing the behaviours of speech recognizers. We, however, formulate the defense as a classification problem and present a strategy for systematically generating adversarial example datasets: one for white-box attacks and one for black-box attacks, containing both adversarial and normal examples. The white-box attack is a gradient-based method on Baidu DeepSpeech with the Mozilla Common Voice database while the black-box attack is a gradient-free method on a deep model-based keyword spotting system with the Google Speech Command dataset. The generated datasets are used to train a proposed Convolutional Neural Network (CNN), together with cepstral features, to detect adversarial examples. Experimental results show that, it is possible to accurately distinct between adversarial and normal examples for known attacks, in both single-condition and multi-condition training settings, while the performance degrades dramatically for unknown attacks. The adversarial datasets and the source code are made publicly available.

AB - Machine Learning systems are vulnerable to adversarial attacks and will highly likely produce incorrect outputs under these attacks. There are white-box and black-box attacks regarding to adversary’s access level to the victim learning algorithm. To defend the learning systems from these attacks, existing methods in the speech domain focus on modifying input signals and testing the behaviours of speech recognizers. We, however, formulate the defense as a classification problem and present a strategy for systematically generating adversarial example datasets: one for white-box attacks and one for black-box attacks, containing both adversarial and normal examples. The white-box attack is a gradient-based method on Baidu DeepSpeech with the Mozilla Common Voice database while the black-box attack is a gradient-free method on a deep model-based keyword spotting system with the Google Speech Command dataset. The generated datasets are used to train a proposed Convolutional Neural Network (CNN), together with cepstral features, to detect adversarial examples. Experimental results show that, it is possible to accurately distinct between adversarial and normal examples for known attacks, in both single-condition and multi-condition training settings, while the performance degrades dramatically for unknown attacks. The adversarial datasets and the source code are made publicly available.

KW - Adversarial attack

KW - Cepstral feature

KW - Convolutional neural network

KW - Speech recognition

UR - http://www.scopus.com/inward/record.url?scp=85091286747&partnerID=8YFLogxK

U2 - 10.1109/ICASSP40776.2020.9054750

DO - 10.1109/ICASSP40776.2020.9054750

M3 - Article in proceeding

SN - 978-1-5090-6632-2

T3 - ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings

SP - 3102

EP - 3106

BT - ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

PB - IEEE

T2 - ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Y2 - 4 May 2020 through 8 May 2020

ER -

Samizade S, Tan Z-H, Shen C, Xiaohong G. Adversarial Example Detection by Classification for Deep Speech Recognition. In ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE. 2020. p. 3102-3106. 9054750. (ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings). doi: 10.1109/ICASSP40776.2020.9054750

Adversarial Example Detection by Classification for Deep Speech Recognition

Abstract

Conference

Keywords

Access to Document

AUB Link

Other files and links

Fingerprint

Cite this