Information Loss in the Human Auditory System

Mohsen Zareian Jahromi; Adel Zahedi; Jesper Jensen; Jan Østergaard

doi:10.1109/TASLP.2018.2882913

Information Loss in the Human Auditory System

Mohsen Zareian Jahromi, Adel Zahedi, Jesper Jensen, Jan Østergaard

Research output: Contribution to journal › Journal article › Research › peer-review

2 Citations (Scopus)

Abstract

From the eardrum to the auditory cortex, where acoustic stimuli are decoded, there are several stages of auditory processing and transmission where information may potentially be lost. In this paper, we aim at quantifying the total information loss in the human auditory system by using information theoretic tools. To do so, we consider a speech communication model, where words are uttered and sent through a noisy channel, and then received and processed by a human listener. We define a notion of information loss that is related to the human word recognition rate. To assess the word recognition rate of humans, we conduct a closed-vocabulary intelligibility test. We derive upper and lower bounds on the information loss. Simulations reveal that the bounds are tight and we observe that the information loss in the human auditory system increases as the signal to noise ratio (SNR) decreases. Our framework also allows us to study whether humans are optimal in terms of speech perception in a noisy environment. Toward that end, we derive optimal classifiers and compare the human and machine performance in terms of information loss and word recognition rate. We observe a higher information loss and lower word recognition rate for humans compared to the optimal classifiers. In fact, depending on the SNR, the machine classifier may outperform humans by as much as 8 dB. This implies that for the speech-in-stationary-noise setup considered here, the human auditory system is suboptimal for recognizing noisy words.

Original language	English
Article number	8579632
Journal	IEEE/ACM Transactions on Audio, Speech, and Language Processing
Volume	27
Issue number	3
Pages (from-to)	472-481
Number of pages	10
ISSN	2329-9290
DOIs	https://doi.org/10.1109/TASLP.2018.2882913
Publication status	Published - Mar 2019

Keywords

Gaussian mixture model
Human auditory system
maximum likelihood classifier
mutual information

Access to Document

10.1109/TASLP.2018.2882913

https://arxiv.org/pdf/1805.00698.pdf

AUB Link

Search for the material in Aalborg University Library's search engine

Cite this

@article{28c24478120e43829ecc4972df890dfb,

title = "Information Loss in the Human Auditory System",

abstract = "From the eardrum to the auditory cortex, where acoustic stimuli are decoded, there are several stages of auditory processing and transmission where information may potentially be lost. In this paper, we aim at quantifying the total information loss in the human auditory system by using information theoretic tools. To do so, we consider a speech communication model, where words are uttered and sent through a noisy channel, and then received and processed by a human listener. We define a notion of information loss that is related to the human word recognition rate. To assess the word recognition rate of humans, we conduct a closed-vocabulary intelligibility test. We derive upper and lower bounds on the information loss. Simulations reveal that the bounds are tight and we observe that the information loss in the human auditory system increases as the signal to noise ratio (SNR) decreases. Our framework also allows us to study whether humans are optimal in terms of speech perception in a noisy environment. Toward that end, we derive optimal classifiers and compare the human and machine performance in terms of information loss and word recognition rate. We observe a higher information loss and lower word recognition rate for humans compared to the optimal classifiers. In fact, depending on the SNR, the machine classifier may outperform humans by as much as 8 dB. This implies that for the speech-in-stationary-noise setup considered here, the human auditory system is suboptimal for recognizing noisy words.",

keywords = "Gaussian mixture model, Human auditory system, maximum likelihood classifier, mutual information",

author = "Jahromi, {Mohsen Zareian} and Adel Zahedi and Jesper Jensen and Jan {\O}stergaard",

year = "2019",

month = mar,

doi = "10.1109/TASLP.2018.2882913",

language = "English",

volume = "27",

pages = "472--481",

journal = "IEEE/ACM Transactions on Audio, Speech, and Language Processing",

issn = "2329-9290",

publisher = "IEEE Signal Processing Society",

number = "3",

}

TY - JOUR

T1 - Information Loss in the Human Auditory System

AU - Jahromi, Mohsen Zareian

AU - Zahedi, Adel

AU - Jensen, Jesper

AU - Østergaard, Jan

PY - 2019/3

Y1 - 2019/3

N2 - From the eardrum to the auditory cortex, where acoustic stimuli are decoded, there are several stages of auditory processing and transmission where information may potentially be lost. In this paper, we aim at quantifying the total information loss in the human auditory system by using information theoretic tools. To do so, we consider a speech communication model, where words are uttered and sent through a noisy channel, and then received and processed by a human listener. We define a notion of information loss that is related to the human word recognition rate. To assess the word recognition rate of humans, we conduct a closed-vocabulary intelligibility test. We derive upper and lower bounds on the information loss. Simulations reveal that the bounds are tight and we observe that the information loss in the human auditory system increases as the signal to noise ratio (SNR) decreases. Our framework also allows us to study whether humans are optimal in terms of speech perception in a noisy environment. Toward that end, we derive optimal classifiers and compare the human and machine performance in terms of information loss and word recognition rate. We observe a higher information loss and lower word recognition rate for humans compared to the optimal classifiers. In fact, depending on the SNR, the machine classifier may outperform humans by as much as 8 dB. This implies that for the speech-in-stationary-noise setup considered here, the human auditory system is suboptimal for recognizing noisy words.

AB - From the eardrum to the auditory cortex, where acoustic stimuli are decoded, there are several stages of auditory processing and transmission where information may potentially be lost. In this paper, we aim at quantifying the total information loss in the human auditory system by using information theoretic tools. To do so, we consider a speech communication model, where words are uttered and sent through a noisy channel, and then received and processed by a human listener. We define a notion of information loss that is related to the human word recognition rate. To assess the word recognition rate of humans, we conduct a closed-vocabulary intelligibility test. We derive upper and lower bounds on the information loss. Simulations reveal that the bounds are tight and we observe that the information loss in the human auditory system increases as the signal to noise ratio (SNR) decreases. Our framework also allows us to study whether humans are optimal in terms of speech perception in a noisy environment. Toward that end, we derive optimal classifiers and compare the human and machine performance in terms of information loss and word recognition rate. We observe a higher information loss and lower word recognition rate for humans compared to the optimal classifiers. In fact, depending on the SNR, the machine classifier may outperform humans by as much as 8 dB. This implies that for the speech-in-stationary-noise setup considered here, the human auditory system is suboptimal for recognizing noisy words.

KW - Gaussian mixture model

KW - Human auditory system

KW - maximum likelihood classifier

KW - mutual information

UR - http://www.scopus.com/inward/record.url?scp=85059371570&partnerID=8YFLogxK

U2 - 10.1109/TASLP.2018.2882913

DO - 10.1109/TASLP.2018.2882913

M3 - Journal article

SN - 2329-9290

VL - 27

SP - 472

EP - 481

JO - IEEE/ACM Transactions on Audio, Speech, and Language Processing

JF - IEEE/ACM Transactions on Audio, Speech, and Language Processing

IS - 3

M1 - 8579632

ER -

Information Loss in the Human Auditory System

Abstract

Keywords

Access to Document

AUB Link

Other files and links

Fingerprint

Cite this