• Fredrik Bajers Vej 7, B4-202

    9220 Aalborg Ø

    Denmark

20022022
If you made any changes in Pure these will be visible here soon.

Research Output 2002 2020

2020

rVAD: An unsupervised segment-based robust voice activity detection method

Tan, Z. H., Sarkar, A. K. & Dehak, N., 1 Jan 2020, In : Computer Speech and Language. 59, p. 1-21 21 p.

Research output: Contribution to journalJournal articleResearchpeer-review

Voice Activity Detection
Speech Signal
High Energy
Speech Enhancement
Speaker Verification
2019
1 Citation (Scopus)

Adaptive protection combined with machine learning for microgrids

Lin, H., Sun, K., Tan, Z. H., Liu, C., Guerrero, J. M. & Vasquez, J. C., Mar 2019, In : IET Generation, Transmission and Distribution. 13, 6, p. 770-779 10 p.

Research output: Contribution to journalJournal articleResearchpeer-review

Learning systems
Adaptive algorithms
Support vector machines
Data mining
Automation

Deep Joint Embeddings of Context and Content for Recommendation

Kristoffersen, M. S., Wieland, J. L., Shepstone, S. E., Tan, Z-H. & Vinayagamoorthy, V., 2019, Context-Aware Recommender Systems Workshop: CARS 2.0 - in conjunction with the 13th ACM Conference on Recommender Systems (RecSys'19).

Research output: Contribution to book/anthology/report/conference proceedingArticle in proceedingResearchpeer-review

Open Access

Deep-Learning-Based Audio-Visual Speech Enhancement in Presence of Lombard Effect

Michelsanti, D., Tan, Z-H., Sigurdsson, S. & Jensen, J., 2019, In : Speech Communication. 115, p. 38-50 13 p.

Research output: Contribution to journalJournal articleResearchpeer-review

Speech Enhancement
Speech enhancement
learning
Speech intelligibility
Speech Intelligibility
1 Citation (Scopus)

Effects of Lombard Reflex on the Performance of Deep-Learning-Based Audio-Visual Speech Enhancement Systems

Michelsanti, D., Tan, Z-H., Sigurdsson, S. & Jensen, J., 17 Apr 2019, ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, p. 6615-6619 5 p. 8682713. (I E E E International Conference on Acoustics, Speech and Signal Processing. Proceedings).

Research output: Contribution to book/anthology/report/conference proceedingArticle in proceedingResearchpeer-review

Speech enhancement
Deep learning

Keyword Spotting for Hearing Assistive Devices Robust to External Speakers

Lopez-Espejo, I., Tan, Z-H. & Jensen, J., Sep 2019, Interspeech 2019. ISCA, p. 3223-3227 5 p. (Proceedings of the International Conference on Spoken Language Processing).

Research output: Contribution to book/anthology/report/conference proceedingArticle in proceedingResearchpeer-review

Audition
Hearing aids
Experiments
25 Downloads (Pure)
Open Access
File
Speech intelligibility
Speech enhancement
intelligibility
Mean square error
augmentation
2 Citations (Scopus)

On Training Targets and Objective Functions for Deep-Learning-Based Audio-Visual Speech Enhancement

Michelsanti, D., Tan, Z-H., Sigurdsson, S. & Jensen, J., 17 Apr 2019, ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, p. 8077-8081 5 p. 8682790. (I E E E International Conference on Acoustics, Speech and Signal Processing. Proceedings).

Research output: Contribution to book/anthology/report/conference proceedingArticle in proceedingResearchpeer-review

Speech enhancement
Speech intelligibility
Masks
Deep learning

Robust Bayesian and Maximum a Posteriori Beamforming for Hearing Assistive Devices

Hoang, P., Jensen, J., de Haan, J. M., Tan, Z-H. & Lunner, T., Sep 2019, (Accepted/In press) IEEE Global Conference on Signal and Information Processing (GlobalSIP). (IEEE Global Conference on Signal and Information Processing (GlobalSIP). Proceedings).

Research output: Contribution to book/anthology/report/conference proceedingArticle in proceedingResearch

2 Downloads (Pure)

SketchSegNet+: An End-to-end Learning of RNN for Multi-Class Sketch Semantic Segmentation

Qi, Y. & Tan, Z., 18 Jul 2019, In : IEEE Access. 7, p. 102717-102726 10 p.

Research output: Contribution to journalJournal articleResearchpeer-review

Open Access
File

Soft Dropout and Its Variational Bayes Approximation

Xie, J., Ma, Z., Zhang, G., Xue, J-H., Tan, Z-H. & Guo, J., 2019, (Accepted/In press) MSLP 2019. IEEE, 6 p. (IEEE International Workshop on Machine Learning for Signal Processing (MLSP). Proceedings.).

Research output: Contribution to book/anthology/report/conference proceedingArticle in proceedingResearchpeer-review

Subjective Annotations for Vision-Based Attention Level Estimation

Coifman, A. L., Rohoska, P., Kristoffersen, M. S., Shepstone, S. E. & Tan, Z-H., 2019, Proceedings of the 14th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications: Volume 5: VISAPP. SCITEPRESS Digital Library, Vol. 5. p. 249-256 7 p.

Research output: Contribution to book/anthology/report/conference proceedingArticle in proceedingResearchpeer-review

Human robot interaction
Human computer interaction
Fusion reactions
Deep learning

The Importance of Context When Recommending TV Content: Dataset and Algorithms

Kristoffersen, M. S., Shepstone, S. E. & Tan, Z-H., 27 Sep 2019, In : I E E E Transactions on Multimedia.

Research output: Contribution to journalJournal articleResearchpeer-review

Open Access

Time-Contrastive Learning Based Deep Bottleneck Features for Text-Dependent Speaker Verification

Sarkar, A. K., Tan, Z-H., Tang, H., Shon, S. & Glass, J., Aug 2019, In : IEEE/ACM Transactions on Audio, Speech, and Language Processing. 27, 8, p. 1267-1279 13 p.

Research output: Contribution to journalJournal articleResearchpeer-review

learning
Speech recognition
Labels
speech recognition
Brain
2018
1 Citation (Scopus)
217 Downloads (Pure)

A Dataset for Inferring Contextual Preferences of Users Watching TV

Kristoffersen, M. S., Shepstone, S. E. & Tan, Z-H., 3 Jul 2018, UMAP 2018 - Proceedings of the 26th Conference on User Modeling, Adaptation and Personalization: UMAP '18. Singapore, Singapore: Association for Computing Machinery, p. 367-368 2 p.

Research output: Contribution to book/anthology/report/conference proceedingArticle in proceedingResearchpeer-review

File
Recommender systems
2 Citations (Scopus)

A Perceptually Motivated LP Residual Estimator in Noisy and Reverberant Environments

Peng, R., Tan, Z-H., Li, X. & Zheng, C., Feb 2018, In : Speech Communication. 96, p. 129-141 13 p.

Research output: Contribution to journalJournal articleResearchpeer-review

Linear Prediction
Reverberation
Generalized Singular Value Decomposition
Singular value decomposition
Estimator

A Spatial Self-Similarity Based Feature Learning Method for Face Recognition under Varying Poses

Duan, X. & Tan, Z-H., 2018, In : Pattern Recognition Letters. 111, p. 109-116 8 p.

Research output: Contribution to journalJournal articleResearchpeer-review

Face recognition
Linear transformations
Experiments

Audio-based Granularity-adapted Emotion Classification

Shepstone, S. E., Tan, Z-H. & Jensen, S. H., 2018, In : IEEE Transactions on Affective Computing. 9, 2, p. 176-190 15 p.

Research output: Contribution to journalJournal articleResearchpeer-review

1 Citation (Scopus)

Bias-compensated informed sound source localization using relative transfer functions

Farmani, M., Pedersen, M. S., Tan, Z. H. & Jensen, J., 1 Jul 2018, In : IEEE/ACM Transactions on Audio Speech and Language Processing. 26, 7, p. 1271-1285 15 p.

Research output: Contribution to journalJournal articleResearchpeer-review

Microphones
microphones
estimators
transfer functions
Direction of arrival
68 Citations (Scopus)

Decorrelation of Neutral Vector Variables: Theory and Applications

Ma, Z., Xue, J-H., Leijon, A., Tan, Z-H., Yang, Z. & Guo, J., Jan 2018, In : I E E E Transactions on Neural Networks and Learning Systems. 29, 1, p. 129-143 14 p.

Research output: Contribution to journalJournal articleResearchpeer-review

Principal component analysis
105 Downloads (Pure)

Effectiveness of Single-Channel BLSTM Enhancement for Language Identification

Sibbern Frederiksen, P., Villalba, J., Watanabe, S., Tan, Z-H. & Dehak, N., Sep 2018, Interspeech 2018. ISCA, Vol. 2018-September. p. 1823-1827 5 p. (Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH).

Research output: Contribution to book/anthology/report/conference proceedingArticle in proceedingResearchpeer-review

Open Access
File
Telephone
Speech enhancement
Masks
Bins
Long short-term memory
1 Citation (Scopus)

Guided spectrogram filtering for speech dereverberation

Zheng, C., Tan, Z. H., Peng, R. & Li, X., 1 May 2018, In : Applied Acoustics. 134, p. 154-159 6 p.

Research output: Contribution to journalJournal articleResearchpeer-review

spectrograms
intelligibility
reverberation
smoothing
preserving
3 Citations (Scopus)

Incorporating Pass-Phrase Dependent Background Models for Text-Dependent Speaker verification

Sarkar, A. K. & Tan, Z-H., 1 Jan 2018, In : Computer Speech and Language. 47, p. 259-271 13 p.

Research output: Contribution to journalJournal articleResearchpeer-review

Speaker Verification
Dependent
Model
Log-likelihood Ratio
Text
4 Citations (Scopus)

iSocioBot; A Multimodal Interactive Social Robot

Tan, Z-H., Thomsen, N. B., Duan, X., Vlachos, E., Shepstone, S. E., Rasmussen, M. H. & Højvang, J. L., 1 Jan 2018, In : International Journal of Social Robotics. 10, 1, p. 5-19 15 p.

Research output: Contribution to journalJournal articleResearchpeer-review

Robots
Face recognition
Robotics
Speech synthesis
Speech recognition
4 Citations (Scopus)

Latent Dirichlet Mixture Model

Chien, J-T., Lee, C-H. & Tan, Z-H., 2018, In : Neurocomputing. 278, p. 12-22 11 p.

Research output: Contribution to journalJournal articleResearchpeer-review

Cluster Analysis
Semantics
Learning
6 Citations (Scopus)

Monaural Speech Enhancement using Deep Neural Networks by Maximizing a Short-Time Objective Intelligibility Measure

Kolbæk, M., Tan, Z-H. & Jensen, J., 2018, International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, p. 5059-5063 (I E E E International Conference on Acoustics, Speech and Signal Processing. Proceedings).

Research output: Contribution to book/anthology/report/conference proceedingArticle in proceedingResearchpeer-review

Speech enhancement
Speech intelligibility
Cost functions
Mean square error
Deep neural networks
1 Citation (Scopus)

Multi-Task Adversarial Network Bottleneck Features for Noise-Robust Speaker Verification

Yu, H., Hu, T., Ma, Z., Tan, Z-H. & Guo, J., 6 Nov 2018, 2018 International Conference on Network Infrastructure and Digital Content (IC-NIDC). IEEE, p. 165-169 5 p. 8525526. (International Conference on Network Infrastructure and Digital Content (IC-NIDC)).

Research output: Contribution to book/anthology/report/conference proceedingArticle in proceedingResearchpeer-review

Labels
Network components
Feature extraction
Signal to noise ratio
24 Downloads (Pure)

Nonintrusive Speech Intelligibility Prediction Using Convolutional Neural Networks

Heidemann Andersen, A., Haan, J. M. D., Tan, Z-H. & Jensen, J., Oct 2018, In : IEEE/ACM Transactions on Audio, Speech, and Language Processing. 26, 10, p. 1925-1939 15 p.

Research output: Contribution to journalJournal articleResearchpeer-review

Open Access
File
Speech intelligibility
intelligibility
Neural networks
predictions
Speech processing

Proceedings of 2018 IEEE International Workshop on Machine Learning for Signal Processing (MLSP 2018)

Pustelnik, N. (ed.), Ma, Z. (ed.), Tan, Z-H. (ed.) & Larsen, J. (ed.), Sep 2018, IEEE. 460 p. (IEEE International Workshop on Machine Learning for Signal Processing (MLSP). Proceedings.).

Research output: Book/ReportAnthologyResearchpeer-review

Public Perception of Android Robots: Indications from an Analysis of YouTube Comments

Vlachos, E. & Tan, Z-H., 27 Dec 2018, 2018 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS). IEEE, p. 1255-1260 6 p. 8594058

Research output: Contribution to book/anthology/report/conference proceedingArticle in proceedingResearchpeer-review

Robots
Learning systems
Robotics
Specifications

Recent Advances in Machine Learning for Non-Gaussian Data Processing

Ma, Z., Chien, J-T., Tan, Z-H., Song, Y-Z., Taghia, J. & Xiao, M., Feb 2018, In : Neurocomputing. 278, p. 1-3 3 p.

Research output: Contribution to journalEditorialResearch

2 Citations (Scopus)

Refinement and validation of the binaural short time objective intelligibility measure for spatially diverse conditions

Heidemann Andersen, A., de Haan, J. M., Tan, Z-H. & Jensen, J., Sep 2018, In : Speech Communication. 102, p. 1-13 13 p.

Research output: Contribution to journalJournal articleResearchpeer-review

5 Citations (Scopus)
26 Downloads (Pure)

Robust Voice Liveness Detection and Speaker Verification Using Throat Microphones

Sahidullah, M., Thomsen, D. A. L., Hautamaki, R. G., Kinnunen, T., Tan, Z-H., Parts, R. & Pitkänen, M., 2018, In : IEEE/ACM Transactions on Audio, Speech, and Language Processing. 26, 1, p. 44-56 13 p.

Research output: Contribution to journalJournal articleResearchpeer-review

Open Access
File
throats
Microphones
microphones
attack
acceptability
17 Citations (Scopus)

Spoofing Detection in Automatic Speaker Verification Systems Using DNN Classifiers and Dynamic Acoustic Features

Yu, H., Tan, Z. H., Ma, Z., Martin, R. & Guo, J., 1 Oct 2018, In : IEEE Transactions on Neural Networks and Learning Systems. 29, 10, p. 4633-4644 12 p., 8128906.

Research output: Contribution to journalJournal articleResearchpeer-review

Open Access
File
Classifiers
Acoustics
Speech synthesis
Filter banks
Deep neural networks

The Sound or Silence: investigating the influence of robot noise on proxemics

Trovato, G., Paredes, P., Balvin, J., Cuellar, F., Thomsen, N. B., Bech, S. & Tan, Z-H., 6 Nov 2018, Proceedings of the 27th IEEE International Symposium on Robot and Human Interactive Communication. IEEE, p. 713-718 6 p. 8525795

Research output: Contribution to book/anthology/report/conference proceedingArticle in proceedingResearchpeer-review

robots
acoustics
music
acceptability
masks
1 Citation (Scopus)
150 Downloads (Pure)

Using Closed-Set Speaker Identification Score Confidence to Enhance Audio-Based Collaborative Filtering for Multiple Users

Shepstone, S. E., Tan, Z-H. & Kristoffersen, M. S., 2018, In : IEEE Transactions on Consumer Electronics. 64, 1, p. 11-18 8 p.

Research output: Contribution to journalJournal articleResearchpeer-review

Open Access
File
Collaborative filtering
Identification (control systems)

Wireless Personal Communications: Machine Learning for Big Data Processing in Mobile Internet

Guo, J., Tan, Z-H., Cho, S. H. & Zhang, G., Oct 2018, In : Wireless Personal Communications. 102, 3, p. 2093-2098 6 p.

Research output: Contribution to journalEditorialResearchpeer-review

Open Access
2017
7 Citations (Scopus)

Adversarial Network Bottleneck Features for Noise Robust Speaker Verification

Yu, H., Tan, Z-H., Ma, Z. & Guo, J., 2017, Proc. Interspeech 2017. ISCA, p. 1492-1496 5 p. (INTERSPEECH ).

Research output: Contribution to book/anthology/report/conference proceedingArticle in proceedingResearchpeer-review

Open Access
6 Citations (Scopus)

A Non-Intrusive Short-Time Objective Intelligibility Measure

Heidemann Andersen, A., de Haan, J. M., Tan, Z-H. & Jensen, J., 7 Mar 2017, 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, 5 p.

Research output: Contribution to book/anthology/report/conference proceedingArticle in proceedingResearchpeer-review

44 Citations (Scopus)

Conditional Generative Adversarial Networks for Speech Enhancement and Noise-Robust Speaker Verification

Michelsanti, D. & Tan, Z-H., 2017, Proc. Interspeech 2017. ISCA, p. 2008-2012 5 p. (INTERSPEECH ).

Research output: Contribution to book/anthology/report/conference proceedingArticle in proceedingResearchpeer-review

Open Access
18 Citations (Scopus)
159 Downloads (Pure)

DNN Filter Bank Cepstral Coefficients for Spoofing Detection

Yu, H., Tan, Z-H., Zhang, Y., Ma, Z. & Guo, J., 24 Mar 2017, In : IEEE Access. 5, p. 4779 - 4787 9 p.

Research output: Contribution to journalJournal articleResearchpeer-review

Open Access
File
Filter banks
Classifiers
Neural networks
Deep neural networks
Speech synthesis
2 Citations (Scopus)

Frame Selection for Robust Speaker Identification: A Hybrid Approach

Prasad, S., Tan, Z. H. & Prasad, R., Nov 2017, In : Wireless Personal Communications. 97, 1, p. 933-950 18 p.

Research output: Contribution to journalJournal articleResearchpeer-review

Feature extraction
Signal to noise ratio
Statistical Models
2 Citations (Scopus)
70 Downloads (Pure)

Improving Speaker Verification Performance in Presence of Spoofing Attacks Using Out-of-Domain Spoofed Data

Sarkar, A. K., Sahidullah, M., Tan, Z-H. & Kinnunen, T., 20 Aug 2017, Interspeech 2017. International Speech Communications Association, p. 2611-2615 (INTERSPEECH ).

Research output: Contribution to book/anthology/report/conference proceedingArticle in proceedingResearchpeer-review

Open Access
File
Speech synthesis
Network protocols
Experiments
11 Citations (Scopus)
229 Downloads (Pure)

Informed Sound Source Localization Using Relative Transfer Functions for Hearing Aid Applications

Farmani, M., Pedersen, M. S., Tan, Z-H. & Jensen, J., 2017, In : IEEE/ACM Transactions on Audio, Speech, and Language Processing. 25, 3, p. 611-623

Research output: Contribution to journalJournal articleResearchpeer-review

Open Access
File
9 Citations (Scopus)

Joint separation and denoising of noisy multi-talker speech using recurrent neural networks and permutation invariant training

Kolbæk, M., Yu, D., Tan, Z-H. & Jensen, J., 2017, International Workshop on Machine Learning for Signal Processing (MLSP). IEEE, 6 p. (IEEE International Workshop on Machine Learning for Signal Processing (MLSP). Proceedings.).

Research output: Contribution to book/anthology/report/conference proceedingArticle in proceedingResearchpeer-review

79 Citations (Scopus)

Multitalker Speech Separation With Utterance-Level Permutation Invariant Training of Deep Recurrent Neural Networks

Kolbæk, M., Yu, D., Tan, Z-H. & Jensen, J., 13 Jul 2017, In : I E E E Transactions on Audio, Speech and Language Processing. 25, 10, p. 1901-1913 13 p.

Research output: Contribution to journalJournal articleResearchpeer-review

51 Downloads (Pure)

On the Use of Band Importance Weighting in the Short-Time Objective Intelligibility Measure

Heidemann Andersen, A., de Haan, J. M., Tan, Z-H. & Jensen, J., 21 Aug 2017, Proc. Interspeech 2017. ISCA, p. 2963-2967 5 p. (INTERSPEECH , Vol. 2017).

Research output: Contribution to book/anthology/report/conference proceedingArticle in proceedingResearchpeer-review

Open Access
File

Performance evaluation of the short-time objective intelligibility measure with different band importance functions

Heidemann Andersen, A., de Haan, J. M., Tan, Z-H. & Jensen, J., 5 Jan 2017.

Research output: Contribution to conference without publisher/journalPosterResearch

88 Citations (Scopus)

Permutation invariant training of deep models for speaker-independent multi-talker speech separation

Yu, D., Kolbæk, M., Tan, Z-H. & Jensen, J., 19 Jun 2017, International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, p. 241 - 245

Research output: Contribution to book/anthology/report/conference proceedingArticle in proceedingResearchpeer-review

42 Citations (Scopus)

RedDots Replayed: A New Replay Spoofing Attack Corpus for Text-dependent Speaker Verification Research

Kinnunen, T., Sahidullah, M., Falcone, M., Costantini, L., Hautamaki, R. G., Thomsen, D. A. L., Sarkar, A. K., Tan, Z-H., Delgado, H., Todisco, M., Evans, N., Hautamaki, V. & Lee, K. A., Mar 2017, 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, p. 5395-5399 5 p.

Research output: Contribution to book/anthology/report/conference proceedingArticle in proceedingResearchpeer-review

Network protocols