TY - JOUR
T1 - Analysis and Classification of Frequency-Domain Terahertz Reflection Spectra Using Supervised and Unsupervised Dimensionality Reduction Methods
AU - Cielecki, Pawel Piotr
AU - Kristensen, Mathias Hedegaard
AU - Skovsen, Esben
PY - 2021/9
Y1 - 2021/9
N2 - The unique properties of terahertz (THz) spectroscopy show a great potential for security and defense applications such as safe screening of persons and objects. However, a successful implementation of THz screening systems requires a development of reliable and efficient identification algorithms. Dimensionality reduction (DR) methods aim to reduce the dimensionality of the multivariate data and are therefore commonly used as a preprocessing step for classification algorithms and as an analytical tool allowing data visualization. In this paper, we compare the use of unsupervised and supervised DR methods for analysis and classification of THz reflection spectra based on their most widespread linear representatives, namely principal component analysis and linear discriminant analysis, respectively. To this end, both methods were applied to more than 5000 THz reflection spectra acquired from six active materials mixed at three different concentrations with polyethylene and measured at various humidity conditions. While considering scenarios with different levels of complexity, we found that the supervised approach provides better results because it enables efficient grouping despite intra-class variability. Furthermore, we showed that manipulating labels introduced into the supervised DR algorithm allows conditioning the data for a desired classification task such as security screening. Presented classification results show that simple machine learning algorithms are sufficient for highly accurate classification (>98.6%) of THz spectra, which will be suitable for many real-life applications of THz spectroscopy based on material identification.
AB - The unique properties of terahertz (THz) spectroscopy show a great potential for security and defense applications such as safe screening of persons and objects. However, a successful implementation of THz screening systems requires a development of reliable and efficient identification algorithms. Dimensionality reduction (DR) methods aim to reduce the dimensionality of the multivariate data and are therefore commonly used as a preprocessing step for classification algorithms and as an analytical tool allowing data visualization. In this paper, we compare the use of unsupervised and supervised DR methods for analysis and classification of THz reflection spectra based on their most widespread linear representatives, namely principal component analysis and linear discriminant analysis, respectively. To this end, both methods were applied to more than 5000 THz reflection spectra acquired from six active materials mixed at three different concentrations with polyethylene and measured at various humidity conditions. While considering scenarios with different levels of complexity, we found that the supervised approach provides better results because it enables efficient grouping despite intra-class variability. Furthermore, we showed that manipulating labels introduced into the supervised DR algorithm allows conditioning the data for a desired classification task such as security screening. Presented classification results show that simple machine learning algorithms are sufficient for highly accurate classification (>98.6%) of THz spectra, which will be suitable for many real-life applications of THz spectroscopy based on material identification.
KW - Terahertz
KW - Spectroscopy
KW - Machine learning
KW - THz-FDS
KW - Classification
KW - Dimensionality reduction
KW - Frequency-domain
KW - Machine learning
KW - THz screening
KW - THz spectroscopy
UR - http://www.scopus.com/inward/record.url?scp=85115709956&partnerID=8YFLogxK
U2 - 10.1007/s10762-021-00810-w
DO - 10.1007/s10762-021-00810-w
M3 - Journal article
VL - 42
SP - 1005
EP - 1026
JO - Journal of Infrared, Millimeter and Terahertz Waves
JF - Journal of Infrared, Millimeter and Terahertz Waves
SN - 1866-6892
IS - 9-10
ER -