Using decision trees and their ensembles for analysis of NIR spectroscopic data

Publication: Conference contribution without publisher/journal; Conference abstract for conference; Research

Abstract

Advanced machine learning methods, such as convolutional neural networks and decision trees, have become extremely popular in the last decade. This is directly related to the current boom in Big Data analysis, where traditional statistical methods are not efficient. According to kaggle.com, the most popular online resource for Big Data problems and solutions, methods based on decision trees and their ensembles are the most widely used for solving these problems.

Decision trees and convolutional neural networks are, however, not very popular in Chemometrics. One reason is the shape of the data matrix: modern machine learning methods need the number of measurements to be much larger than the number of variables to avoid overfitting, which is the opposite of the layout of the data we usually deal with. Another drawback is the lack of interactive tools for exploring and interpreting the models.

In this presentation, we discuss the applicability of decision-tree-based methods (including gradient boosting) for solving classification and regression tasks with NIR spectra as predictors. We cover the evaluation, optimization and validation of models, their sensitivity to outliers, and the selection of the most important variables.
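
As a rough illustration of the setting described above, the following minimal sketch fits a gradient boosting ensemble of one-split decision trees (stumps) to a data matrix with far fewer measurements than variables, then compares the training error with the error on held-out data and counts how often each variable is chosen for a split. It is not the implementation used in the presentation. The data are synthetic random numbers standing in for spectra, not real NIR measurements, and every name and setting here (`fit_stump`, `boost`, `make_data`, the 30 by 200 matrix, the learning rate) is an assumption made for the example.

```python
import random
from collections import Counter


def fit_stump(X, r):
    """Return (variable, threshold, left_mean, right_mean) of the best single split on residuals r."""
    n, p = len(X), len(X[0])
    total = sum(r)
    best = None
    for j in range(p):
        order = sorted(range(n), key=lambda i: X[i][j])
        left_sum = 0.0
        for k in range(1, n):
            left_sum += r[order[k - 1]]
            lo, hi = X[order[k - 1]][j], X[order[k]][j]
            if lo == hi:
                continue  # no threshold can separate equal values
            left_mean = left_sum / k
            right_mean = (total - left_sum) / (n - k)
            # Maximizing this quantity is equivalent to minimizing the squared error of the split.
            gain = k * left_mean ** 2 + (n - k) * right_mean ** 2
            if best is None or gain > best[0]:
                best = (gain, j, (lo + hi) / 2, left_mean, right_mean)
    return best[1:]


def boost(X, y, n_rounds=100, rate=0.1):
    """Gradient boosting for squared loss: each stump is fitted to the current residuals."""
    base = sum(y) / len(y)
    pred = [base] * len(y)
    stumps = []
    for _ in range(n_rounds):
        resid = [yi - pi for yi, pi in zip(y, pred)]
        j, t, lm, rm = fit_stump(X, resid)
        stumps.append((j, t, lm, rm))
        pred = [pi + rate * (lm if xi[j] <= t else rm) for xi, pi in zip(X, pred)]
    return base, rate, stumps


def predict(model, x):
    base, rate, stumps = model
    return base + rate * sum(lm if x[j] <= t else rm for j, t, lm, rm in stumps)


def rmse(model, X, y):
    return (sum((predict(model, xi) - yi) ** 2 for xi, yi in zip(X, y)) / len(y)) ** 0.5


def make_data(n, p=200):
    """Synthetic stand-in for spectra: only variables 10 and 50 carry signal (an assumption)."""
    X = [[random.gauss(0, 1) for _ in range(p)] for _ in range(n)]
    y = [2.0 * x[10] - 1.5 * x[50] + random.gauss(0, 0.3) for x in X]
    return X, y


random.seed(0)
X_train, y_train = make_data(30)    # few measurements, many variables
X_test, y_test = make_data(200)     # independent validation set

model = boost(X_train, y_train)
baseline = (sum((yi - model[0]) ** 2 for yi in y_train) / len(y_train)) ** 0.5
train_rmse = rmse(model, X_train, y_train)
test_rmse = rmse(model, X_test, y_test)
importance = Counter(j for j, _, _, _ in model[2])  # split counts per variable

print("baseline RMSE (mean only):", round(baseline, 3))
print("training RMSE:", round(train_rmse, 3))
print("held-out RMSE:", round(test_rmse, 3))
print("most frequently split variables:", importance.most_common(5))
```

A gap between the training and held-out errors is the overfitting symptom mentioned above, which is why validation on independent data matters for wide data matrices. Split counts are only one crude way to rank variables; other importance measures may rank them differently.
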
Original language: English
Publication date: 2018
Number of pages: 2
Status: Published - 2018
Event: 11th Winter Symposium on Chemometrics - Saint-Petersburg, Russia
Duration: 26 Feb 2018 to 2 Mar 2018
Conference number: 11
http://wsc.chemometrics.ru/wsc11/

Conference

Conference: 11th Winter Symposium on Chemometrics
Number: 11
Country/Territory: Russia
City: Saint-Petersburg
Period: 26/02/2018 to 02/03/2018