Resumé
It can be noted that the decision trees and convolutional neural networks are not very popular in Chemometrics. One of the reasons for that is the landscape of the data matrix: the modern machine learning methods need number of measurements much larger than the number of variables to avoid overfitting, which is opposite to the layout of the data we usually deal with. Another drawback is a lack of interactive instruments for exploring and interpretation of the models.
In this presentation, we are going to discuss an applicability of decision trees based methods (including gradient boosting) for solving classification and regression tasks with NIR spectra as predictors. We will cover such aspects as evaluation, optimization and validation of models, sensitivity to outliers and selection of most important variables.
Originalsprog | Engelsk |
---|---|
Publikationsdato | 2018 |
Antal sider | 2 |
Status | Udgivet - 2018 |
Begivenhed | 11th Winter Symposium on Chemometrics - Saint-Petersburg, Rusland Varighed: 26 feb. 2018 → 2 mar. 2018 Konferencens nummer: 11 http://wsc.chemometrics.ru/wsc11/ |
Konference
Konference | 11th Winter Symposium on Chemometrics |
---|---|
Nummer | 11 |
Land | Rusland |
By | Saint-Petersburg |
Periode | 26/02/2018 → 02/03/2018 |
Internetadresse |
Fingerprint
Citer dette
}
Using decision trees and their ensembles for analysis of NIR spectroscopic data. / Kucheryavskiy, Sergey V.
2018. 26-27 Abstract fra 11th Winter Symposium on Chemometrics, Saint-Petersburg, Rusland.Publikation: Konferencebidrag uden forlag/tidsskrift › Konferenceabstrakt til konference › Forskning
TY - ABST
T1 - Using decision trees and their ensembles for analysis of NIR spectroscopic data
AU - Kucheryavskiy, Sergey V.
PY - 2018
Y1 - 2018
N2 - Advanced machine learning methods, like convolutional neural networks and decision trees, became extremely popular in the last decade. This, first of all, is directly related to the current boom in Big data analysis, where traditional statistical methods are not efficient. According to the kaggle.com — the most popular online resource for Big data problems and solutions — methods based on decision trees and their ensembles are most widely used for solving the problems.It can be noted that the decision trees and convolutional neural networks are not very popular in Chemometrics. One of the reasons for that is the landscape of the data matrix: the modern machine learning methods need number of measurements much larger than the number of variables to avoid overfitting, which is opposite to the layout of the data we usually deal with. Another drawback is a lack of interactive instruments for exploring and interpretation of the models.In this presentation, we are going to discuss an applicability of decision trees based methods (including gradient boosting) for solving classification and regression tasks with NIR spectra as predictors. We will cover such aspects as evaluation, optimization and validation of models, sensitivity to outliers and selection of most important variables.
AB - Advanced machine learning methods, like convolutional neural networks and decision trees, became extremely popular in the last decade. This, first of all, is directly related to the current boom in Big data analysis, where traditional statistical methods are not efficient. According to the kaggle.com — the most popular online resource for Big data problems and solutions — methods based on decision trees and their ensembles are most widely used for solving the problems.It can be noted that the decision trees and convolutional neural networks are not very popular in Chemometrics. One of the reasons for that is the landscape of the data matrix: the modern machine learning methods need number of measurements much larger than the number of variables to avoid overfitting, which is opposite to the layout of the data we usually deal with. Another drawback is a lack of interactive instruments for exploring and interpretation of the models.In this presentation, we are going to discuss an applicability of decision trees based methods (including gradient boosting) for solving classification and regression tasks with NIR spectra as predictors. We will cover such aspects as evaluation, optimization and validation of models, sensitivity to outliers and selection of most important variables.
UR - http://wsc.chemometrics.ru/media/files/conferences/wsc11/documents/WSC11%20Abstract%20book.pdf
M3 - Conference abstract for conference
SP - 26
EP - 27
ER -