Zheng-Hua Tan

2024

PAC-Bayes Generalisation Bounds for Dynamical Systems Including Stable RNNs

Eringis, D., Leth, J-J., Tan, Z-H., Wisniewski, R. & Petreczky, M., 25 Mar 2024, Proceedings of the AAAI Conference on Artificial Intelligence. Wooldridge, M., Dy, J. & Natarajan, S. (eds.). 11 ed. AAAI Press, Vol. 38. p. 11901-11909 9 p.

Research output: Contribution to book/anthology/report/conference proceeding › Article in proceeding › Research › peer-review

Open Access

File

21 Downloads (Pure)

Self-supervised Pretraining for Robust Personalized Voice Activity Detection in Adverse Conditions

Bovbjerg, H. S., Jensen, J., Østergaard, J. & Tan, Z-H., 14 Apr 2024, (Accepted/In press) In: Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing. 5 p.

Research output: Contribution to journal › Conference article in Journal › Research › peer-review

Utilization of acoustic signals with generative Gaussian and autoencoder modeling for condition-based maintenance of injection moulds

Rønsch, G. Ø., Espejo, I. L., Michelsanti, D., Xie, Y., Popovski, P. & Tan, Z-H., 2024, In: International Journal of Computer Integrated Manufacturing. 37, 4, p. 438-453 16 p.

Research output: Contribution to journal › Journal article › Research › peer-review

1 Citation (Scopus)

2023

A Dual-Polarized Reconfigurable Reflectarray with A Thin Liquid Crystal Layer and 2D Beam Scanning

Aghabeyki, P., Cai, Y., Deng, G., Tan, Z-H. & Zhang, S., 1 Apr 2023, In: I E E E Transactions on Antennas and Propagation. 71, 4, p. 3282-3293 12 p.

Research output: Contribution to journal › Journal article › Research › peer-review

Open Access

File

11 Citations (Scopus)

381 Downloads (Pure)

Explicit construction of the minimum error variance estimator for stochastic LTI-ss systems

Eringis, D., Leth, J., Tan, Z. H., Wisniewski, R. & Petreczky, M., Jul 2023, In: Automatica. 153, 111018.

Research output: Contribution to journal › Journal article › Research › peer-review

Open Access

File

16 Downloads (Pure)

Filterbank Learning for Noise-Robust Small-Footprint Keyword Spotting

Espejo, I. L., Shekar, R. C. M. C., Tan, Z-H., Jensen, J. & Hansen, J., May 2023, ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing, Proceedings. IEEE, 5 p. 10095436. (International Conference on Acoustics Speech and Signal Processing (ICASSP)).

Research output: Contribution to book/anthology/report/conference proceeding › Article in proceeding › Research › peer-review

Improving Label-Deficient Keyword Spotting Through Self-Supervised Pretraining

Bovbjerg, H. S. & Tan, Z. H., Aug 2023, ICASSPW 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing Workshops, Proceedings. IEEE, 10193371

Research output: Contribution to book/anthology/report/conference proceeding › Article in proceeding › Research › peer-review

Masked Autoencoders with Multi-Window Local-Global Attention Are Better Audio Learners

Yadav, S., Theodoridis, S., Hansen, L. K. & Tan, Z-H., 1 Jun 2023, arXiv, 17 p.

Research output: Working paper/Preprint › Preprint

Open Access

File

98 Downloads (Pure)

Minimum Processing Near-End Listening Enhancement

Fuglsig, A. J., Jensen, J., Tan, Z. H., Bertelsen, L. S., Lindof, J. C. & Ostergaard, J., 5 Jun 2023, In: IEEE/ACM Transactions on Audio, Speech, and Language Processing. 31, p. 2233-2245 13 p.

Research output: Contribution to journal › Journal article › Research › peer-review

Open Access

File

1 Citation (Scopus)

30 Downloads (Pure)

On the Comparisons of Decorrelation Approaches for Non-Gaussian Neutral Vector Variables

Ma, Z., Lu, X., Xie, J., Yang, Z., Xue, J-H., Tan, Z-H., Xiao, B. & Guo, J., Apr 2023, In: I E E E Transactions on Neural Networks and Learning Systems. 34, 4, p. 1823-1837 15 p.

Research output: Contribution to journal › Journal article › Research › peer-review

Open Access

File

3 Citations (Scopus)

107 Downloads (Pure)

On the Deficiency of Intelligibility Metrics as Proxies for Subjective Intelligibility

Espejo, I. L., Edraki, A., Chan, W-Y., Tan, Z-H. & Jensen, J., May 2023, In: Speech Communication. 150, p. 9-22 14 p.

Research output: Contribution to journal › Journal article › Research › peer-review

Open Access

File

2 Citations (Scopus)

22 Downloads (Pure)

2022

Advanced Dropout: A Model-free Methodology for Bayesian Dropout Optimization

Xie, J., Ma, Z., Lei, J., Zhang, G., Xue, J-H., Tan, Z-H. & Guo, J., 1 Sept 2022, In: I E E E Transactions on Pattern Analysis and Machine Intelligence. 44, 9, p. 4605-4625 21 p.

Research output: Contribution to journal › Journal article › Research › peer-review

Open Access

File

24 Citations (Scopus)

31 Downloads (Pure)

Adversarial Multi-Task Deep Learning for Noise-Robust Voice Activity Detection with Low Algorithmic Delay

Larsen, C. M., Koch, P. & Tan, Z-H., 2022, Interspeech 2022. p. 3759-3763 (Proceedings of the International Conference on Spoken Language Processing).

Research output: Contribution to book/anthology/report/conference proceeding › Article in proceeding › Research › peer-review

An Experimental Study on Light Speech Features for Small-Footprint Keyword Spotting

Espejo, I. L., Tan, Z-H. & Jensen, J., 2022, IberSPEECH 2022.

Research output: Contribution to book/anthology/report/conference proceeding › Article in proceeding › Research › peer-review

Open Access

File

43 Downloads (Pure)

AoI and Throughput Optimization for Hybrid Traffic in Cellular Uplink Using Reinforcement Learning

Wu, C. C., Tan, Z. H. & Stefanovic, C., 2022, 2022 IEEE 95th Vehicular Technology Conference - Spring, VTC 2022-Spring - Proceedings. IEEE, 9861011. (IEEE Vehicular Technology Conference, Vol. 2022-June).

Research output: Contribution to book/anthology/report/conference proceeding › Article in proceeding › Research › peer-review

Data Augmentation for Breakdown Prediction in CLIC RF Cavities

Bovbjerg, H., Obermair, C., Apollonio, A., Cartier-Michaud, T., Millar, W., Tan, Z-H., Shen, M. & Wollmann, D., 2022, Proceedings of the 13th International Particle Accelerator Conference. JACoW Publishing, Vol. IPAC2022. p. 1553-1556 4 p. (Journals of Accelerator Conferences Website (JACoW)).

Research output: Contribution to book/anthology/report/conference proceeding › Article in proceeding › Research › peer-review

Open Access

File

9 Downloads (Pure)

Deep Spoken Keyword Spotting: An Overview

Espejo, I. L., Tan, Z-H., Hansen, J. & Jensen, J., Jan 2022, In: IEEE Access. 10, p. 4169-4199 31 p.

Research output: Contribution to journal › Review article › peer-review

Open Access

File

42 Citations (Scopus)

196 Downloads (Pure)

Floor Map Reconstruction Through Radio Sensing and Learning By a Large Intelligent Surface

Vaca-Rubio, C. J., Pereira, R., Mestre, X., Gregoratti, D., Tan, Z-H., Carvalho, E. D. & Popovski, P., 21 Jun 2022, 2022 IEEE 32nd International Workshop on Machine Learning for Signal Processing, MLSP 2022. IEEE, 7 p. 9943430. (Machine Learning for Signal Processing).

Research output: Contribution to book/anthology/report/conference proceeding › Article in proceeding › Research › peer-review

File

1 Citation (Scopus)

32 Downloads (Pure)

IVAE-GAN: Identifiable VAE-GAN Models for Latent Representation Learning

Dideriksen, B. U., Derosche, K. & Tan, Z. H., 2022, In: IEEE Access. 10, p. 48405-48418 14 p.

Research output: Contribution to journal › Journal article › Research › peer-review

Open Access

File

30 Downloads (Pure)

Joint Far- and Near-End Speech Intelligibility Enhancement Based on the Approximated Speech Intelligibility Index

Fuglsig, A. J., Østergaard, J., Jensen, J., Søndergaard Bertelsen, L., Mariager, P. B. & Tan, Z-H., 2022, ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Singapore: IEEE, p. 7752-7756 5 p. 9746170. (I E E E International Conference on Acoustics, Speech and Signal Processing. Proceedings).

Research output: Contribution to book/anthology/report/conference proceeding › Article in proceeding › Research › peer-review

Open Access

File

2 Citations (Scopus)

48 Downloads (Pure)

Multichannel Speech Enhancement with Own Voice-Based Interfering Speech Suppression for Hearing Assistive Devices

Hoang, P., De Haan, J. M., Tan, Z. H. & Jensen, J., 2022, In: IEEE/ACM Transactions on Audio, Speech, and Language Processing. 30, p. 706-720 15 p.

Research output: Contribution to journal › Journal article › Research › peer-review

Open Access

File

5 Citations (Scopus)

11 Downloads (Pure)

Reduced-Resolution Speech Features for Small-Footprint Keyword Spotting

Espejo, I. L., Tan, Z-H. & Jensen, J., 2022.

Research output: Contribution to conference without publisher/journal › Poster › Communication

Open Access

File

13 Downloads (Pure)

Summary on the ICASSP 2022 Multi-Channel Multi-Party Meeting Transcription Grand Challenge

Yu, F., Zhang, S., Guo, P., Fu, Y., Du, Z., Zheng, S., Huang, W., Xie, L., Tan, Z. H., Wang, D. L., Qian, Y., Lee, K. A., Yan, Z., Ma, B., Xu, X. & Bu, H., 2022, 2022 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2022 - Proceedings. IEEE Signal Processing Society, p. 9156-9160 5 p. 9746270. (ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, Vol. 2022-May).

Research output: Contribution to book/anthology/report/conference proceeding › Article in proceeding › Research › peer-review

17 Citations (Scopus)

The Minimum Overlap-Gap Algorithm for Speech Enhancement

Hoang, P., Tan, Z. H., De Haan, J. M. & Jensen, J., 2022, In: IEEE Access. 10, p. 14698-14716 19 p.

Research output: Contribution to journal › Journal article › Research › peer-review

Open Access

File

3 Citations (Scopus)

72 Downloads (Pure)

Training Data-Driven Speech Intelligibility Predictors on Heterogeneous Listening Test Data

Pedersen, M. B., Andersen, A. H., Jensen, S. H., Tan, Z. H. & Jensen, J., 2022, In: IEEE Access. 10, p. 66175-66189 15 p.

Research output: Contribution to journal › Journal article › Research › peer-review

Open Access

File

2 Citations (Scopus)

25 Downloads (Pure)

User Localization using RF Sensing: A Performance comparison between LIS and mmWave Radars

Vaca-Rubio, C. J., Salami, D., Popovski, P., Carvalho, E. D., Tan, Z-H. & Sigg, S., 2 Sept 2022, 2022 30th European Signal Processing Conference (EUSIPCO). IEEE Communications Society, p. 1916-1920 5 p. 9909583. (Proceedings of the European Signal Processing Conference).

Research output: Contribution to book/anthology/report/conference proceeding › Article in proceeding › Research › peer-review

2021

A Novel Loss Function and Training Strategy for Noise-Robust Keyword Spotting

Espejo, I. L., Tan, Z-H. & Jensen, J., Jul 2021, In: IEEE/ACM Transactions on Audio, Speech, and Language Processing. 29, p. 2254 - 2266 13 p., 9465680.

Research output: Contribution to journal › Journal article › Research › peer-review

Open Access

File

13 Citations (Scopus)

81 Downloads (Pure)

An Overview of Deep-Learning-Based Audio-Visual Speech Enhancement and Separation

Michelsanti, D., Tan, Z-H., Zhang, S-X., Xu, Y., Yu, M., Yu, D. & Jensen, J., 2021, In: IEEE/ACM Transactions on Audio, Speech, and Language Processing. 29, p. 1368-1396 29 p., 9380418.

Research output: Contribution to journal › Journal article › Research › peer-review

Open Access

File

126 Citations (Scopus)

299 Downloads (Pure)

A Primer on Large Intelligent Surface (LIS) for Wireless Sensing in an Industrial Setting

Vaca Rubio, C. J., Espinosa, P. R., Williams, R. J., Kansanen, K., Tan, Z-H., De Carvalho, E. & Popovski, P., 31 Mar 2021, EAI CROWNCOM 2020 - 15th EAI International Conference on Cognitive Radio Oriented Wireless Networks. Caso, G., De Nardis, L. & Gavrilovska, L. (eds.). Springer, p. 126-138 13 p. (Lecture Notes of the Institute for Computer Sciences, Social Informatics and Telecommunications Engineering, Vol. 374).

Research output: Contribution to book/anthology/report/conference proceeding › Article in proceeding › Research › peer-review

1 Citation (Scopus)

Assessing Wireless Sensing Potential with Large Intelligent Surfaces

Vaca Rubio, C. J., Espinosa, P. R., Kansanen, K., Tan, Z-H., De Carvalho, E. & Popovski, P., Apr 2021, In: IEEE Open Journal of the Communications Society. 2, p. 934-947 14 p., 9405304.

Research output: Contribution to journal › Journal article › Research › peer-review

Open Access

File

7 Citations (Scopus)

72 Downloads (Pure)

Audio-Visual Speech Inpainting with Deep Learning

Morrone, G., Michelsanti, D., Tan, Z-H. & Jensen, J., 2021, ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, Vol. 2021-June. p. 6653-6657 5 p. (I E E E International Conference on Acoustics, Speech and Signal Processing. Proceedings).

Research output: Contribution to book/anthology/report/conference proceeding › Article in proceeding › Research › peer-review

20 Citations (Scopus)

CC-LOSS: CHANNEL CORRELATION LOSS FOR IMAGE CLASSIFICATION

Song, Z., Chang, D., Ma, Z., Li, X. & Tan, Z-H., 10 Jan 2021, 2020 25th International Conference on Pattern Recognition (ICPR). IEEE, p. 7601-7608 8 p. 9412069. (Proceeding IEEE International Conference on Pattern Recognition (ICPR)).

Research output: Contribution to book/anthology/report/conference proceeding › Article in proceeding › Research › peer-review

6 Citations (Scopus)

Compression of dnns using magnitude pruning and nonlinear information bottleneck training

Nielsen, M. Ø., Østergaard, J., Jensen, J. & Tan, Z-H., Oct 2021, 2021 IEEE 31st International Workshop on Machine Learning for Signal Processing (MLSP). IEEE, p. 1-6 6 p. 9596128. (IEEE Workshop on Machine Learning for Signal Processing).

Research output: Contribution to book/anthology/report/conference proceeding › Article in proceeding › Research › peer-review

1 Citation (Scopus)

ConferencingSpeech Challenge: Towards Far-field Multi-Channel Speech Enhancement for Video Conferencing

Rao, W., Fu, Y., Hu, Y., Xu, X., Jv, Y., Han, J., Jiang, Z., Xie, L., Wang, Y., Watanabe, S., Tan, Z-H., Bu, H., Yu, T. & Shang, S., 2021, IEEE Automatic Speech Recognition and Understanding Workshop. IEEE, 9688126

Research output: Contribution to book/anthology/report/conference proceeding › Article in proceeding › Research › peer-review

12 Citations (Scopus)

Data Augmentation Enhanced Speaker Enrollment for Text-Dependent Speaker Verification

Sarkar, A. K., Sarma, H., Dwivedi, P. & Tan, Z-H., 21 Apr 2021, 3rd International Conference on Energy, Power and Environment: Towards Clean Energy Technologies, ICEPE 2020. IEEE, 6 p. 9404373

Research output: Contribution to book/anthology/report/conference proceeding › Article in proceeding › Research › peer-review

Deep InterBoost networks for small-sample image classification

Li, X., Chang, D., Ma, Z., Tan, Z-H., Xue, J-H., Cao, J. & Jun, G., 7 Oct 2021, In: Neurocomputing. 456, p. 492-503 12 p.

Research output: Contribution to journal › Journal article › Research › peer-review

Open Access

File

7 Citations (Scopus)

54 Downloads (Pure)

Design of AoI-Aware 5G Uplink Scheduler Using Reinforcement Learning

Wu, C-C., Popovski, P., Tan, Z-H. & Stefanovic, C., 15 Oct 2021, 2021 IEEE 4th 5G World Forum (5GWF). IEEE, p. 176-181 6 p. 9604981

Research output: Contribution to book/anthology/report/conference proceeding › Article in proceeding › Research › peer-review

5 Citations (Scopus)

Disentangled speech representation learning based on factorized hierarchical variational autoencoder with self-supervised objective

Xie, Y., Arildsen, T. & Tan, Z-H., 28 Oct 2021, 2021 IEEE 31st International Workshop on Machine Learning for Signal Processing, MLSP 2021. IEEE, p. 1-6 6 p. 9596320. (IEEE Workshop on Machine Learning for Signal Processing).

Research output: Contribution to book/anthology/report/conference proceeding › Article in proceeding › Research › peer-review

6 Citations (Scopus)

Exploring Filterbank Learning for Keyword Spotting

Espejo, I. L., Tan, Z-H. & Jensen, J., 2021, 28th European Signal Processing Conference (EUSIPCO). IEEE, p. 331-335 5 p. 9287772. (Proceedings of the European Signal Processing Conference).

Research output: Contribution to book/anthology/report/conference proceeding › Article in proceeding › Research › peer-review

12 Citations (Scopus)

Joint Maximum Likelihood Estimation of Power Spectral Densities and Relative Acoustic Transfer Functions for Acoustic Beamforming

Hoang, P., Tan, Z-H., de Haan, J. M. & Jensen, J., 2021, ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, Vol. 2021-June. p. 6119-6123 5 p. (I E E E International Conference on Acoustics, Speech and Signal Processing. Proceedings).

Research output: Contribution to book/anthology/report/conference proceeding › Article in proceeding › Research › peer-review

6 Citations (Scopus)

PAC-Bayesian theory for stochastic LTI systems

Eringis, D., Leth, J-J., Tan, Z-H., Wisniewski, R., Fakhrizadeh Esfahani, A. & Petreczky, M., 2021, 2021 60th IEEE Conference on Decision and Control (CDC). IEEE, p. 6626-6633 8 p. 9682808. (I E E E Conference on Decision and Control. Proceedings).

Research output: Contribution to book/anthology/report/conference proceeding › Article in proceeding › Research › peer-review

3 Citations (Scopus)

Radio Sensing with Large Intelligent Surface for 6G

Vaca-Rubio, C. J., Ramirez-Espinosa, P., Kansanen, K., Tan, Z-H. & Carvalho, E. D., 4 Nov 2021, arXiv, 6 p.

Research output: Working paper/Preprint › Preprint

Open Access

File

23 Downloads (Pure)

Remote Anomaly Detection in Industry 4.0 Using Resource-Constrained Devices

Kalør, A. E., Michelsanti, D., Chiariotti, F., Tan, Z-H. & Popovski, P., 30 Sept 2021, 2021 IEEE 22nd International Workshop on Signal Processing Advances in Wireless Communications (SPAWC). IEEE, p. 251-255 5 p. 9593188. (IEEE International Workshop on Signal Processing Advances in Wireless Communications (SPAWC)).

Research output: Contribution to book/anthology/report/conference proceeding › Article in proceeding › Research › peer-review

1 Citation (Scopus)

Self-Segmentation of Pass-Phrase Utterances for Deep Feature Learning in Text-Dependent Speaker Verification

Sarkar, A. K. & Tan, Z-H., 2021, In: Computer Speech and Language. 70, 101229.

Research output: Contribution to journal › Journal article › Research › peer-review

3 Citations (Scopus)

UIAI System for Short-Duration Speaker Verification Challenge 2020

Sahidullah, M., Sarkar, A. K., Vestman, V., Liu, X., Serizel, R., Kinnunen, T., Tan, Z-H. & Vincent, E., 25 Mar 2021, 2021 IEEE Spoken Language Technology Workshop (SLT). IEEE, p. 323-329 7 p. 9383596

Research output: Contribution to book/anthology/report/conference proceeding › Article in proceeding › Research › peer-review

6 Citations (Scopus)

Vocal Tract Length Perturbation for Text-Dependent Speaker Verification with Autoregressive Prediction Coding

Sarkar, A. & Tan, Z-H., 28 Jan 2021, In: I E E E Signal Processing Letters. 28, p. 364-368 5 p., 9339931.

Research output: Contribution to journal › Journal article › Research › peer-review

Open Access

File

9 Citations (Scopus)

27 Downloads (Pure)

2020

Adversarial Example Detection by Classification for Deep Speech Recognition

Samizade, S., Tan, Z-H., Shen, C. & Xiaohong, G., 9 Apr 2020, ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, p. 3102-3106 5 p. 9054750. (ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings).

Research output: Contribution to book/anthology/report/conference proceeding › Article in proceeding › Research › peer-review

29 Citations (Scopus)

Can robots express facial emotions dominantly enough for use in dementia care?

Vlachos, E. & Tan, Z. H., 1 Jul 2020, In: International Psychogeriatrics. 32, 7, p. 891-892 2 p.

Research output: Contribution to journal › Comment/debate › Research › peer-review

Open Access

Highlights From the Machine Learning for Signal Processing Technical Committee [In the Spotlight]

Rao, B. D. & Tan, Z-H., 2020, In: I E E E - Signal Processing Magazine. 37, 6, p. 202; 200 2 p.

Research output: Contribution to journal › Editorial

Improved External Speaker-Robust Keyword Spotting for Hearing Assistive Devices

Lopez-Espejo, I., Tan, Z-H. & Jensen, J., Apr 2020, In: IEEE/ACM Transactions on Audio, Speech, and Language Processing. 28, p. 1233-1247 15 p., 9054977.

Research output: Contribution to journal › Journal article › Research › peer-review

Open Access

File

12 Citations (Scopus)

98 Downloads (Pure)

Zheng-Hua Tan

Research output

Search results