Using decision trees and their ensembles for analysis of NIR spectroscopic data

Publikation: Konferencebidrag uden forlag/tidsskriftKonferenceabstrakt til konferenceForskning

Resumé

Advanced machine learning methods, like convolutional neural networks and decision trees, became extremely popular in the last decade. This, first of all, is directly related to the current boom in Big data analysis, where traditional statistical methods are not efficient. According to the kaggle.com — the most popular online resource for Big data problems and solutions — methods based on decision trees and their ensembles are most widely used for solving the problems.

It can be noted that the decision trees and convolutional neural networks are not very popular in Chemometrics. One of the reasons for that is the landscape of the data matrix: the modern machine learning methods need number of measurements much larger than the number of variables to avoid overfitting, which is opposite to the layout of the data we usually deal with. Another drawback is a lack of interactive instruments for exploring and interpretation of the models.

In this presentation, we are going to discuss an applicability of decision trees based methods (including gradient boosting) for solving classification and regression tasks with NIR spectra as predictors. We will cover such aspects as evaluation, optimization and validation of models, sensitivity to outliers and selection of most important variables.
OriginalsprogEngelsk
Publikationsdato2018
Antal sider2
StatusUdgivet - 2018
Begivenhed11th Winter Symposium on Chemometrics - Saint-Petersburg, Rusland
Varighed: 26 feb. 20182 mar. 2018
Konferencens nummer: 11
http://wsc.chemometrics.ru/wsc11/

Konference

Konference11th Winter Symposium on Chemometrics
Nummer11
LandRusland
BySaint-Petersburg
Periode26/02/201802/03/2018
Internetadresse

Fingerprint

Decision trees
Learning systems
Neural networks
Gradient methods
Statistical methods
Big data

Citer dette

Kucheryavskiy, S. V. (2018). Using decision trees and their ensembles for analysis of NIR spectroscopic data. 26-27. Abstract fra 11th Winter Symposium on Chemometrics, Saint-Petersburg, Rusland.
Kucheryavskiy, Sergey V. / Using decision trees and their ensembles for analysis of NIR spectroscopic data. Abstract fra 11th Winter Symposium on Chemometrics, Saint-Petersburg, Rusland.2 s.
@conference{d170a766671e44e7b9b325650de0c7ca,
title = "Using decision trees and their ensembles for analysis of NIR spectroscopic data",
abstract = "Advanced machine learning methods, like convolutional neural networks and decision trees, became extremely popular in the last decade. This, first of all, is directly related to the current boom in Big data analysis, where traditional statistical methods are not efficient. According to the kaggle.com — the most popular online resource for Big data problems and solutions — methods based on decision trees and their ensembles are most widely used for solving the problems.It can be noted that the decision trees and convolutional neural networks are not very popular in Chemometrics. One of the reasons for that is the landscape of the data matrix: the modern machine learning methods need number of measurements much larger than the number of variables to avoid overfitting, which is opposite to the layout of the data we usually deal with. Another drawback is a lack of interactive instruments for exploring and interpretation of the models.In this presentation, we are going to discuss an applicability of decision trees based methods (including gradient boosting) for solving classification and regression tasks with NIR spectra as predictors. We will cover such aspects as evaluation, optimization and validation of models, sensitivity to outliers and selection of most important variables.",
author = "Kucheryavskiy, {Sergey V.}",
year = "2018",
language = "English",
pages = "26--27",
note = "11th Winter Symposium on Chemometrics, WSC-11 ; Conference date: 26-02-2018 Through 02-03-2018",
url = "http://wsc.chemometrics.ru/wsc11/",

}

Kucheryavskiy, SV 2018, 'Using decision trees and their ensembles for analysis of NIR spectroscopic data', Saint-Petersburg, Rusland, 26/02/2018 - 02/03/2018, s. 26-27.

Using decision trees and their ensembles for analysis of NIR spectroscopic data. / Kucheryavskiy, Sergey V.

2018. 26-27 Abstract fra 11th Winter Symposium on Chemometrics, Saint-Petersburg, Rusland.

Publikation: Konferencebidrag uden forlag/tidsskriftKonferenceabstrakt til konferenceForskning

TY - ABST

T1 - Using decision trees and their ensembles for analysis of NIR spectroscopic data

AU - Kucheryavskiy, Sergey V.

PY - 2018

Y1 - 2018

N2 - Advanced machine learning methods, like convolutional neural networks and decision trees, became extremely popular in the last decade. This, first of all, is directly related to the current boom in Big data analysis, where traditional statistical methods are not efficient. According to the kaggle.com — the most popular online resource for Big data problems and solutions — methods based on decision trees and their ensembles are most widely used for solving the problems.It can be noted that the decision trees and convolutional neural networks are not very popular in Chemometrics. One of the reasons for that is the landscape of the data matrix: the modern machine learning methods need number of measurements much larger than the number of variables to avoid overfitting, which is opposite to the layout of the data we usually deal with. Another drawback is a lack of interactive instruments for exploring and interpretation of the models.In this presentation, we are going to discuss an applicability of decision trees based methods (including gradient boosting) for solving classification and regression tasks with NIR spectra as predictors. We will cover such aspects as evaluation, optimization and validation of models, sensitivity to outliers and selection of most important variables.

AB - Advanced machine learning methods, like convolutional neural networks and decision trees, became extremely popular in the last decade. This, first of all, is directly related to the current boom in Big data analysis, where traditional statistical methods are not efficient. According to the kaggle.com — the most popular online resource for Big data problems and solutions — methods based on decision trees and their ensembles are most widely used for solving the problems.It can be noted that the decision trees and convolutional neural networks are not very popular in Chemometrics. One of the reasons for that is the landscape of the data matrix: the modern machine learning methods need number of measurements much larger than the number of variables to avoid overfitting, which is opposite to the layout of the data we usually deal with. Another drawback is a lack of interactive instruments for exploring and interpretation of the models.In this presentation, we are going to discuss an applicability of decision trees based methods (including gradient boosting) for solving classification and regression tasks with NIR spectra as predictors. We will cover such aspects as evaluation, optimization and validation of models, sensitivity to outliers and selection of most important variables.

UR - http://wsc.chemometrics.ru/media/files/conferences/wsc11/documents/WSC11%20Abstract%20book.pdf

M3 - Conference abstract for conference

SP - 26

EP - 27

ER -

Kucheryavskiy SV. Using decision trees and their ensembles for analysis of NIR spectroscopic data. 2018. Abstract fra 11th Winter Symposium on Chemometrics, Saint-Petersburg, Rusland.