Adversarial Example Detection by Classification for Deep Speech Recognition

Saeid Samizade; Zheng-Hua Tan; Chao Shen; Guan  Xiaohong

doi:10.1109/ICASSP40776.2020.9054750

Adversarial Example Detection by Classification for Deep Speech Recognition

Saeid Samizade, Zheng-Hua Tan, Chao Shen, Guan Xiaohong

Publikation: Bidrag til bog/antologi/rapport/konference proceeding › Konferenceartikel i proceeding › Forskning › peer review

29 Citationer (Scopus)

Abstract

Machine Learning systems are vulnerable to adversarial attacks and will highly likely produce incorrect outputs under these attacks. There are white-box and black-box attacks regarding to adversary’s access level to the victim learning algorithm. To defend the learning systems from these attacks, existing methods in the speech domain focus on modifying input signals and testing the behaviours of speech recognizers. We, however, formulate the defense as a classification problem and present a strategy for systematically generating adversarial example datasets: one for white-box attacks and one for black-box attacks, containing both adversarial and normal examples. The white-box attack is a gradient-based method on Baidu DeepSpeech with the Mozilla Common Voice database while the black-box attack is a gradient-free method on a deep model-based keyword spotting system with the Google Speech Command dataset. The generated datasets are used to train a proposed Convolutional Neural Network (CNN), together with cepstral features, to detect adversarial examples. Experimental results show that, it is possible to accurately distinct between adversarial and normal examples for known attacks, in both single-condition and multi-condition training settings, while the performance degrades dramatically for unknown attacks. The adversarial datasets and the source code are made publicly available.

Originalsprog	Engelsk
Titel	ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
Antal sider	5
Forlag	IEEE
Publikationsdato	9 apr. 2020
Sider	3102-3106
Artikelnummer	9054750
ISBN (Trykt)	978-1-5090-6632-2
ISBN (Elektronisk)	978-1-5090-6631-5
DOI	https://doi.org/10.1109/ICASSP40776.2020.9054750
Status	Udgivet - 9 apr. 2020
Begivenhed	ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) - Barcelona, Spanien Varighed: 4 maj 2020 → 8 maj 2020

Konference

Konference	ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
Land/Område	Spanien
By	Barcelona
Periode	04/05/2020 → 08/05/2020

Navn	ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
ISSN	1520-6149

Adgang til dokumentet

10.1109/ICASSP40776.2020.9054750

https://arxiv.org/pdf/1910.10013.pdf

AUB Link

Søg efter materialet i Aalborg Universitetsbiblioteks søgemaskine

Andre filer og links

http://www.scopus.com/inward/record.url?scp=85091286747&partnerID=8YFLogxK

Citationsformater

@inproceedings{7b7ba5aab4f14bdc84fb9035c4b0f954,

title = "Adversarial Example Detection by Classification for Deep Speech Recognition",

abstract = "Machine Learning systems are vulnerable to adversarial attacks and will highly likely produce incorrect outputs under these attacks. There are white-box and black-box attacks regarding to adversary{\textquoteright}s access level to the victim learning algorithm. To defend the learning systems from these attacks, existing methods in the speech domain focus on modifying input signals and testing the behaviours of speech recognizers. We, however, formulate the defense as a classification problem and present a strategy for systematically generating adversarial example datasets: one for white-box attacks and one for black-box attacks, containing both adversarial and normal examples. The white-box attack is a gradient-based method on Baidu DeepSpeech with the Mozilla Common Voice database while the black-box attack is a gradient-free method on a deep model-based keyword spotting system with the Google Speech Command dataset. The generated datasets are used to train a proposed Convolutional Neural Network (CNN), together with cepstral features, to detect adversarial examples. Experimental results show that, it is possible to accurately distinct between adversarial and normal examples for known attacks, in both single-condition and multi-condition training settings, while the performance degrades dramatically for unknown attacks. The adversarial datasets and the source code are made publicly available.",

keywords = "Adversarial attack, Cepstral feature, Convolutional neural network, Speech recognition",

author = "Saeid Samizade and Zheng-Hua Tan and Chao Shen and Guan Xiaohong",

year = "2020",

month = apr,

day = "9",

doi = "10.1109/ICASSP40776.2020.9054750",

language = "English",

isbn = "978-1-5090-6632-2",

series = "ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings",

publisher = "IEEE",

pages = "3102--3106",

booktitle = "ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)",

address = "United States",

note = "ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) ; Conference date: 04-05-2020 Through 08-05-2020",

}

Samizade, S, Tan, Z-H, Shen, C & Xiaohong, G 2020, Adversarial Example Detection by Classification for Deep Speech Recognition. i ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)., 9054750, IEEE, ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, s. 3102-3106, ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Barcelona, Spanien, 04/05/2020. https://doi.org/10.1109/ICASSP40776.2020.9054750

Adversarial Example Detection by Classification for Deep Speech Recognition. / Samizade, Saeid; Tan, Zheng-Hua; Shen, Chao et al.
ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, 2020. s. 3102-3106 9054750 (ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings).

Publikation: Bidrag til bog/antologi/rapport/konference proceeding › Konferenceartikel i proceeding › Forskning › peer review

TY - GEN

T1 - Adversarial Example Detection by Classification for Deep Speech Recognition

AU - Samizade, Saeid

AU - Tan, Zheng-Hua

AU - Shen, Chao

AU - Xiaohong, Guan

PY - 2020/4/9

Y1 - 2020/4/9

N2 - Machine Learning systems are vulnerable to adversarial attacks and will highly likely produce incorrect outputs under these attacks. There are white-box and black-box attacks regarding to adversary’s access level to the victim learning algorithm. To defend the learning systems from these attacks, existing methods in the speech domain focus on modifying input signals and testing the behaviours of speech recognizers. We, however, formulate the defense as a classification problem and present a strategy for systematically generating adversarial example datasets: one for white-box attacks and one for black-box attacks, containing both adversarial and normal examples. The white-box attack is a gradient-based method on Baidu DeepSpeech with the Mozilla Common Voice database while the black-box attack is a gradient-free method on a deep model-based keyword spotting system with the Google Speech Command dataset. The generated datasets are used to train a proposed Convolutional Neural Network (CNN), together with cepstral features, to detect adversarial examples. Experimental results show that, it is possible to accurately distinct between adversarial and normal examples for known attacks, in both single-condition and multi-condition training settings, while the performance degrades dramatically for unknown attacks. The adversarial datasets and the source code are made publicly available.

AB - Machine Learning systems are vulnerable to adversarial attacks and will highly likely produce incorrect outputs under these attacks. There are white-box and black-box attacks regarding to adversary’s access level to the victim learning algorithm. To defend the learning systems from these attacks, existing methods in the speech domain focus on modifying input signals and testing the behaviours of speech recognizers. We, however, formulate the defense as a classification problem and present a strategy for systematically generating adversarial example datasets: one for white-box attacks and one for black-box attacks, containing both adversarial and normal examples. The white-box attack is a gradient-based method on Baidu DeepSpeech with the Mozilla Common Voice database while the black-box attack is a gradient-free method on a deep model-based keyword spotting system with the Google Speech Command dataset. The generated datasets are used to train a proposed Convolutional Neural Network (CNN), together with cepstral features, to detect adversarial examples. Experimental results show that, it is possible to accurately distinct between adversarial and normal examples for known attacks, in both single-condition and multi-condition training settings, while the performance degrades dramatically for unknown attacks. The adversarial datasets and the source code are made publicly available.

KW - Adversarial attack

KW - Cepstral feature

KW - Convolutional neural network

KW - Speech recognition

UR - http://www.scopus.com/inward/record.url?scp=85091286747&partnerID=8YFLogxK

U2 - 10.1109/ICASSP40776.2020.9054750

DO - 10.1109/ICASSP40776.2020.9054750

M3 - Article in proceeding

SN - 978-1-5090-6632-2

T3 - ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings

SP - 3102

EP - 3106

BT - ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

PB - IEEE

T2 - ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Y2 - 4 May 2020 through 8 May 2020

ER -

Samizade S, Tan Z-H, Shen C, Xiaohong G. Adversarial Example Detection by Classification for Deep Speech Recognition. I ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE. 2020. s. 3102-3106. 9054750. (ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings). doi: 10.1109/ICASSP40776.2020.9054750

Adversarial Example Detection by Classification for Deep Speech Recognition

Abstract

Konference

Adgang til dokumentet

AUB Link

Andre filer og links

Fingeraftryk

Citationsformater