20022022
Hvis du har foretaget ændringer i Pure, vil de snart blive vist her.

Publikationer 2002 2020

2020

Robust Bayesian and Maximum a Posteriori Beamforming for Hearing Assistive Devices

Hoang, P., Jensen, J., de Haan, J. M., Tan, Z-H. & Lunner, T., 2020, (Accepteret/In press) IEEE Global Conference on Signal and Information Processing (GlobalSIP). (IEEE Global Conference on Signal and Information Processing (GlobalSIP). Proceedings).

Publikation: Bidrag til bog/antologi/rapport/konference proceedingKonferenceartikel i proceedingForskning

rVAD: An unsupervised segment-based robust voice activity detection method

Tan, Z. H., Sarkar, A. K. & Dehak, N., 1 jan. 2020, I : Computer Speech and Language. 59, s. 1-21 21 s.

Publikation: Bidrag til tidsskriftTidsskriftartikelForskningpeer review

Voice Activity Detection
Speech Signal
High Energy
Speech Enhancement
Speaker Verification

The Importance of Context When Recommending TV Content: Dataset and Algorithms

Kristoffersen, M. S., Shepstone, S. E. & Tan, Z-H., 2020, I : I E E E Transactions on Multimedia.

Publikation: Bidrag til tidsskriftTidsskriftartikelForskningpeer review

Åben adgang
2019
2 Citationer (Scopus)

Adaptive protection combined with machine learning for microgrids

Lin, H., Sun, K., Tan, Z. H., Liu, C., Guerrero, J. M. & Vasquez, J. C., mar. 2019, I : IET Generation, Transmission and Distribution. 13, 6, s. 770-779 10 s.

Publikation: Bidrag til tidsskriftTidsskriftartikelForskningpeer review

Learning systems
Adaptive algorithms
Support vector machines
Data mining
Automation

Deep Joint Embeddings of Context and Content for Recommendation

Kristoffersen, M. S., Wieland, J. L., Shepstone, S. E., Tan, Z-H. & Vinayagamoorthy, V., 2019, Context-Aware Recommender Systems Workshop: CARS 2.0 - in conjunction with the 13th ACM Conference on Recommender Systems (RecSys'19).

Publikation: Bidrag til bog/antologi/rapport/konference proceedingKonferenceartikel i proceedingForskningpeer review

Åben adgang

Deep-Learning-Based Audio-Visual Speech Enhancement in Presence of Lombard Effect

Michelsanti, D., Tan, Z-H., Sigurdsson, S. & Jensen, J., dec. 2019, I : Speech Communication. 115, s. 38-50 13 s.

Publikation: Bidrag til tidsskriftTidsskriftartikelForskningpeer review

Speech Enhancement
Speech enhancement
learning
Speech intelligibility
Speech Intelligibility
3 Citationer (Scopus)

Effects of Lombard Reflex on the Performance of Deep-Learning-Based Audio-Visual Speech Enhancement Systems

Michelsanti, D., Tan, Z-H., Sigurdsson, S. & Jensen, J., 17 apr. 2019, ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, s. 6615-6619 5 s. 8682713. (I E E E International Conference on Acoustics, Speech and Signal Processing. Proceedings).

Publikation: Bidrag til bog/antologi/rapport/konference proceedingKonferenceartikel i proceedingForskningpeer review

Speech enhancement
Deep learning

Keyword Spotting for Hearing Assistive Devices Robust to External Speakers

Lopez-Espejo, I., Tan, Z-H. & Jensen, J., sep. 2019, Interspeech 2019. ISCA, s. 3223-3227 5 s. (Proceedings of the International Conference on Spoken Language Processing).

Publikation: Bidrag til bog/antologi/rapport/konference proceedingKonferenceartikel i proceedingForskningpeer review

Audition
Hearing aids
Experiments
47 Downloads (Pure)
Åben adgang
Fil
Speech intelligibility
Speech enhancement
intelligibility
Mean square error
augmentation
2 Citationer (Scopus)

On Training Targets and Objective Functions for Deep-Learning-Based Audio-Visual Speech Enhancement

Michelsanti, D., Tan, Z-H., Sigurdsson, S. & Jensen, J., 17 apr. 2019, ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, s. 8077-8081 5 s. 8682790. (I E E E International Conference on Acoustics, Speech and Signal Processing. Proceedings).

Publikation: Bidrag til bog/antologi/rapport/konference proceedingKonferenceartikel i proceedingForskningpeer review

Speech enhancement
Speech intelligibility
Masks
Deep learning
7 Downloads (Pure)

SketchSegNet+: An End-to-end Learning of RNN for Multi-Class Sketch Semantic Segmentation

Qi, Y. & Tan, Z., 18 jul. 2019, I : IEEE Access. 7, s. 102717-102726 10 s.

Publikation: Bidrag til tidsskriftTidsskriftartikelForskningpeer review

Åben adgang
Fil

Soft Dropout and Its Variational Bayes Approximation

Xie, J., Ma, Z., Zhang, G., Xue, J-H., Tan, Z-H. & Guo, J., 5 dec. 2019, 2019 IEEE 29th International Workshop on Machine Learning for Signal Processing (MLSP). IEEE, s. 1-6 6 s. (IEEE International Workshop on Machine Learning for Signal Processing (MLSP). Proceedings.).

Publikation: Bidrag til bog/antologi/rapport/konference proceedingKonferenceartikel i proceedingForskningpeer review

Subjective Annotations for Vision-Based Attention Level Estimation

Coifman, A. L., Rohoska, P., Kristoffersen, M. S., Shepstone, S. E. & Tan, Z-H., 2019, Proceedings of the 14th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications: Volume 5: VISAPP. SCITEPRESS Digital Library, Bind 5. s. 249-256 7 s.

Publikation: Bidrag til bog/antologi/rapport/konference proceedingKonferenceartikel i proceedingForskningpeer review

Human robot interaction
Human computer interaction
Fusion reactions
Deep learning
1 Downloads (Pure)

Time-Contrastive Learning Based Deep Bottleneck Features for Text-Dependent Speaker Verification

Sarkar, A. K., Tan, Z-H., Tang, H., Shon, S. & Glass, J., aug. 2019, I : IEEE/ACM Transactions on Audio, Speech, and Language Processing. 27, 8, s. 1267-1279 13 s., 8708955.

Publikation: Bidrag til tidsskriftTidsskriftartikelForskningpeer review

Åben adgang
Fil
learning
Speech recognition
Labels
speech recognition
Brain
2018
1 Citation (Scopus)
246 Downloads (Pure)

A Dataset for Inferring Contextual Preferences of Users Watching TV

Kristoffersen, M. S., Shepstone, S. E. & Tan, Z-H., 3 jul. 2018, UMAP 2018 - Proceedings of the 26th Conference on User Modeling, Adaptation and Personalization: UMAP '18. Singapore, Singapore: Association for Computing Machinery, s. 367-368 2 s.

Publikation: Bidrag til bog/antologi/rapport/konference proceedingKonferenceartikel i proceedingForskningpeer review

Fil
Recommender systems
3 Citationer (Scopus)

A Perceptually Motivated LP Residual Estimator in Noisy and Reverberant Environments

Peng, R., Tan, Z-H., Li, X. & Zheng, C., feb. 2018, I : Speech Communication. 96, s. 129-141 13 s.

Publikation: Bidrag til tidsskriftTidsskriftartikelForskningpeer review

Linear Prediction
Reverberation
Generalized Singular Value Decomposition
Singular value decomposition
Estimator

A Spatial Self-Similarity Based Feature Learning Method for Face Recognition under Varying Poses

Duan, X. & Tan, Z-H., 2018, I : Pattern Recognition Letters. 111, s. 109-116 8 s.

Publikation: Bidrag til tidsskriftTidsskriftartikelForskningpeer review

Face recognition
Linear transformations
Experiments

Audio-based Granularity-adapted Emotion Classification

Shepstone, S. E., Tan, Z-H. & Jensen, S. H., 2018, I : IEEE Transactions on Affective Computing. 9, 2, s. 176-190 15 s.

Publikation: Bidrag til tidsskriftTidsskriftartikelForskningpeer review

1 Citation (Scopus)

Bias-compensated informed sound source localization using relative transfer functions

Farmani, M., Pedersen, M. S., Tan, Z. H. & Jensen, J., 1 jul. 2018, I : IEEE/ACM Transactions on Audio Speech and Language Processing. 26, 7, s. 1271-1285 15 s.

Publikation: Bidrag til tidsskriftTidsskriftartikelForskningpeer review

Microphones
microphones
estimators
transfer functions
Direction of arrival
71 Citationer (Scopus)

Decorrelation of Neutral Vector Variables: Theory and Applications

Ma, Z., Xue, J-H., Leijon, A., Tan, Z-H., Yang, Z. & Guo, J., jan. 2018, I : I E E E Transactions on Neural Networks and Learning Systems. 29, 1, s. 129-143 14 s.

Publikation: Bidrag til tidsskriftTidsskriftartikelForskningpeer review

Principal component analysis
125 Downloads (Pure)

Effectiveness of Single-Channel BLSTM Enhancement for Language Identification

Sibbern Frederiksen, P., Villalba, J., Watanabe, S., Tan, Z-H. & Dehak, N., sep. 2018, Interspeech 2018. ISCA, Bind 2018-September. s. 1823-1827 5 s. (Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH).

Publikation: Bidrag til bog/antologi/rapport/konference proceedingKonferenceartikel i proceedingForskningpeer review

Åben adgang
Fil
Telephone
Speech enhancement
Masks
Bins
Long short-term memory
1 Citation (Scopus)

Guided spectrogram filtering for speech dereverberation

Zheng, C., Tan, Z. H., Peng, R. & Li, X., 1 maj 2018, I : Applied Acoustics. 134, s. 154-159 6 s.

Publikation: Bidrag til tidsskriftTidsskriftartikelForskningpeer review

spectrograms
intelligibility
reverberation
smoothing
preserving
3 Citationer (Scopus)

Incorporating Pass-Phrase Dependent Background Models for Text-Dependent Speaker verification

Sarkar, A. K. & Tan, Z-H., 1 jan. 2018, I : Computer Speech and Language. 47, s. 259-271 13 s.

Publikation: Bidrag til tidsskriftTidsskriftartikelForskningpeer review

Speaker Verification
Dependent
Model
Log-likelihood Ratio
Text
5 Citationer (Scopus)

iSocioBot; A Multimodal Interactive Social Robot

Tan, Z-H., Thomsen, N. B., Duan, X., Vlachos, E., Shepstone, S. E., Rasmussen, M. H. & Højvang, J. L., 1 jan. 2018, I : International Journal of Social Robotics. 10, 1, s. 5-19 15 s.

Publikation: Bidrag til tidsskriftTidsskriftartikelForskningpeer review

Robots
Face recognition
Robotics
Speech synthesis
Speech recognition
4 Citationer (Scopus)

Latent Dirichlet Mixture Model

Chien, J-T., Lee, C-H. & Tan, Z-H., 2018, I : Neurocomputing. 278, s. 12-22 11 s.

Publikation: Bidrag til tidsskriftTidsskriftartikelForskningpeer review

Cluster Analysis
Semantics
Learning
8 Citationer (Scopus)

Monaural Speech Enhancement using Deep Neural Networks by Maximizing a Short-Time Objective Intelligibility Measure

Kolbæk, M., Tan, Z-H. & Jensen, J., 2018, International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, s. 5059-5063 (I E E E International Conference on Acoustics, Speech and Signal Processing. Proceedings).

Publikation: Bidrag til bog/antologi/rapport/konference proceedingKonferenceartikel i proceedingForskningpeer review

Speech enhancement
Speech intelligibility
Cost functions
Mean square error
Deep neural networks
1 Citation (Scopus)

Multi-Task Adversarial Network Bottleneck Features for Noise-Robust Speaker Verification

Yu, H., Hu, T., Ma, Z., Tan, Z-H. & Guo, J., 6 nov. 2018, 2018 International Conference on Network Infrastructure and Digital Content (IC-NIDC). IEEE, s. 165-169 5 s. 8525526. (International Conference on Network Infrastructure and Digital Content (IC-NIDC)).

Publikation: Bidrag til bog/antologi/rapport/konference proceedingKonferenceartikel i proceedingForskningpeer review

Labels
Network components
Feature extraction
Signal to noise ratio
2 Citationer (Scopus)
44 Downloads (Pure)

Nonintrusive Speech Intelligibility Prediction Using Convolutional Neural Networks

Heidemann Andersen, A., Haan, J. M. D., Tan, Z-H. & Jensen, J., okt. 2018, I : IEEE/ACM Transactions on Audio, Speech, and Language Processing. 26, 10, s. 1925-1939 15 s.

Publikation: Bidrag til tidsskriftTidsskriftartikelForskningpeer review

Åben adgang
Fil
Speech intelligibility
intelligibility
Neural networks
predictions
Speech processing

Proceedings of 2018 IEEE International Workshop on Machine Learning for Signal Processing (MLSP 2018)

Pustelnik, N. (red.), Ma, Z. (red.), Tan, Z-H. (red.) & Larsen, J. (red.), sep. 2018, IEEE. 460 s. (IEEE International Workshop on Machine Learning for Signal Processing (MLSP). Proceedings.).

Publikation: Bog/antologi/afhandling/rapportAntologiForskningpeer review

Public Perception of Android Robots: Indications from an Analysis of YouTube Comments

Vlachos, E. & Tan, Z-H., 27 dec. 2018, 2018 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS). IEEE, s. 1255-1260 6 s. 8594058

Publikation: Bidrag til bog/antologi/rapport/konference proceedingKonferenceartikel i proceedingForskningpeer review

Robots
Learning systems
Robotics
Specifications

Recent Advances in Machine Learning for Non-Gaussian Data Processing

Ma, Z., Chien, J-T., Tan, Z-H., Song, Y-Z., Taghia, J. & Xiao, M., feb. 2018, I : Neurocomputing. 278, s. 1-3 3 s.

Publikation: Bidrag til tidsskriftLederForskning

2 Citationer (Scopus)

Refinement and validation of the binaural short time objective intelligibility measure for spatially diverse conditions

Heidemann Andersen, A., de Haan, J. M., Tan, Z-H. & Jensen, J., sep. 2018, I : Speech Communication. 102, s. 1-13 13 s.

Publikation: Bidrag til tidsskriftTidsskriftartikelForskningpeer review

5 Citationer (Scopus)
45 Downloads (Pure)

Robust Voice Liveness Detection and Speaker Verification Using Throat Microphones

Sahidullah, M., Thomsen, D. A. L., Hautamaki, R. G., Kinnunen, T., Tan, Z-H., Parts, R. & Pitkänen, M., 2018, I : IEEE/ACM Transactions on Audio, Speech, and Language Processing. 26, 1, s. 44-56 13 s.

Publikation: Bidrag til tidsskriftTidsskriftartikelForskningpeer review

Åben adgang
Fil
throats
Microphones
microphones
attack
acceptability
18 Citationer (Scopus)
27 Downloads (Pure)

Spoofing Detection in Automatic Speaker Verification Systems Using DNN Classifiers and Dynamic Acoustic Features

Yu, H., Tan, Z. H., Ma, Z., Martin, R. & Guo, J., 1 okt. 2018, I : IEEE Transactions on Neural Networks and Learning Systems. 29, 10, s. 4633-4644 12 s., 8128906.

Publikation: Bidrag til tidsskriftTidsskriftartikelForskningpeer review

Åben adgang
Fil
Classifiers
Acoustics
Speech synthesis
Filter banks
Deep neural networks
1 Citation (Scopus)

The Sound or Silence: investigating the influence of robot noise on proxemics

Trovato, G., Paredes, P., Balvin, J., Cuellar, F., Thomsen, N. B., Bech, S. & Tan, Z-H., 6 nov. 2018, Proceedings of the 27th IEEE International Symposium on Robot and Human Interactive Communication. IEEE, s. 713-718 6 s. 8525795

Publikation: Bidrag til bog/antologi/rapport/konference proceedingKonferenceartikel i proceedingForskningpeer review

robots
acoustics
music
acceptability
masks
1 Citation (Scopus)
177 Downloads (Pure)

Using Closed-Set Speaker Identification Score Confidence to Enhance Audio-Based Collaborative Filtering for Multiple Users

Shepstone, S. E., Tan, Z-H. & Kristoffersen, M. S., 2018, I : IEEE Transactions on Consumer Electronics. 64, 1, s. 11-18 8 s.

Publikation: Bidrag til tidsskriftTidsskriftartikelForskningpeer review

Åben adgang
Fil
Collaborative filtering
Identification (control systems)

Wireless Personal Communications: Machine Learning for Big Data Processing in Mobile Internet

Guo, J., Tan, Z-H., Cho, S. H. & Zhang, G., okt. 2018, I : Wireless Personal Communications. 102, 3, s. 2093-2098 6 s.

Publikation: Bidrag til tidsskriftLederForskningpeer review

Åben adgang
2017
9 Citationer (Scopus)

Adversarial Network Bottleneck Features for Noise Robust Speaker Verification

Yu, H., Tan, Z-H., Ma, Z. & Guo, J., 2017, Proc. Interspeech 2017. ISCA, s. 1492-1496 5 s. (INTERSPEECH ).

Publikation: Bidrag til bog/antologi/rapport/konference proceedingKonferenceartikel i proceedingForskningpeer review

Åben adgang
8 Citationer (Scopus)

A Non-Intrusive Short-Time Objective Intelligibility Measure

Heidemann Andersen, A., de Haan, J. M., Tan, Z-H. & Jensen, J., 7 mar. 2017, 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, 5 s.

Publikation: Bidrag til bog/antologi/rapport/konference proceedingKonferenceartikel i proceedingForskningpeer review

57 Citationer (Scopus)

Conditional Generative Adversarial Networks for Speech Enhancement and Noise-Robust Speaker Verification

Michelsanti, D. & Tan, Z-H., 2017, Proc. Interspeech 2017. ISCA, s. 2008-2012 5 s. (INTERSPEECH ).

Publikation: Bidrag til bog/antologi/rapport/konference proceedingKonferenceartikel i proceedingForskningpeer review

Åben adgang
18 Citationer (Scopus)
204 Downloads (Pure)

DNN Filter Bank Cepstral Coefficients for Spoofing Detection

Yu, H., Tan, Z-H., Zhang, Y., Ma, Z. & Guo, J., 24 mar. 2017, I : IEEE Access. 5, s. 4779 - 4787 9 s.

Publikation: Bidrag til tidsskriftTidsskriftartikelForskningpeer review

Åben adgang
Fil
Filter banks
Classifiers
Neural networks
Deep neural networks
Speech synthesis
2 Citationer (Scopus)

Frame Selection for Robust Speaker Identification: A Hybrid Approach

Prasad, S., Tan, Z. H. & Prasad, R., nov. 2017, I : Wireless Personal Communications. 97, 1, s. 933-950 18 s.

Publikation: Bidrag til tidsskriftTidsskriftartikelForskningpeer review

Feature extraction
Signal to noise ratio
Statistical Models
2 Citationer (Scopus)
89 Downloads (Pure)

Improving Speaker Verification Performance in Presence of Spoofing Attacks Using Out-of-Domain Spoofed Data

Sarkar, A. K., Sahidullah, M., Tan, Z-H. & Kinnunen, T., 20 aug. 2017, Interspeech 2017. International Speech Communications Association, s. 2611-2615 (INTERSPEECH ).

Publikation: Bidrag til bog/antologi/rapport/konference proceedingKonferenceartikel i proceedingForskningpeer review

Åben adgang
Fil
Speech synthesis
Network protocols
Experiments
14 Citationer (Scopus)
253 Downloads (Pure)

Informed Sound Source Localization Using Relative Transfer Functions for Hearing Aid Applications

Farmani, M., Pedersen, M. S., Tan, Z-H. & Jensen, J., 2017, I : IEEE/ACM Transactions on Audio, Speech, and Language Processing. 25, 3, s. 611-623

Publikation: Bidrag til tidsskriftTidsskriftartikelForskningpeer review

Åben adgang
Fil
11 Citationer (Scopus)

Joint separation and denoising of noisy multi-talker speech using recurrent neural networks and permutation invariant training

Kolbæk, M., Yu, D., Tan, Z-H. & Jensen, J., 2017, International Workshop on Machine Learning for Signal Processing (MLSP). IEEE, 6 s. (IEEE International Workshop on Machine Learning for Signal Processing (MLSP). Proceedings.).

Publikation: Bidrag til bog/antologi/rapport/konference proceedingKonferenceartikel i proceedingForskningpeer review

101 Citationer (Scopus)

Multitalker Speech Separation With Utterance-Level Permutation Invariant Training of Deep Recurrent Neural Networks

Kolbæk, M., Yu, D., Tan, Z-H. & Jensen, J., 13 jul. 2017, I : I E E E Transactions on Audio, Speech and Language Processing. 25, 10, s. 1901-1913 13 s.

Publikation: Bidrag til tidsskriftTidsskriftartikelForskningpeer review

74 Downloads (Pure)

On the Use of Band Importance Weighting in the Short-Time Objective Intelligibility Measure

Heidemann Andersen, A., de Haan, J. M., Tan, Z-H. & Jensen, J., 21 aug. 2017, Proc. Interspeech 2017. ISCA, s. 2963-2967 5 s. (INTERSPEECH , Bind 2017).

Publikation: Bidrag til bog/antologi/rapport/konference proceedingKonferenceartikel i proceedingForskningpeer review

Åben adgang
Fil

Performance evaluation of the short-time objective intelligibility measure with different band importance functions

Heidemann Andersen, A., de Haan, J. M., Tan, Z-H. & Jensen, J., 5 jan. 2017.

Publikation: Konferencebidrag uden forlag/tidsskriftPosterForskning

110 Citationer (Scopus)

Permutation invariant training of deep models for speaker-independent multi-talker speech separation

Yu, D., Kolbæk, M., Tan, Z-H. & Jensen, J., 19 jun. 2017, International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, s. 241 - 245

Publikation: Bidrag til bog/antologi/rapport/konference proceedingKonferenceartikel i proceedingForskningpeer review

48 Citationer (Scopus)

RedDots Replayed: A New Replay Spoofing Attack Corpus for Text-dependent Speaker Verification Research

Kinnunen, T., Sahidullah, M., Falcone, M., Costantini, L., Hautamaki, R. G., Thomsen, D. A. L., Sarkar, A. K., Tan, Z-H., Delgado, H., Todisco, M., Evans, N., Hautamaki, V. & Lee, K. A., mar. 2017, 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, s. 5395-5399 5 s.

Publikation: Bidrag til bog/antologi/rapport/konference proceedingKonferenceartikel i proceedingForskningpeer review

Network protocols