Abstract
Ensembles are widely used in machine learning and, usually, provide state-of-the-art performance in many prediction tasks. From the very beginning, the diversity of an ensemble has been identified as a key factor for the superior performance of these models. But the exact role that diversity plays in ensemble models is poorly understood, specially in the context of neural networks. In this work, we combine and expand previously published results in a theoretically sound framework that describes the relationship between diversity and ensemble performance for a wide range of ensemble methods. More precisely, we provide sound answers to the following questions: how to measure diversity, how diversity relates to the generalization error of an ensemble, and how diversity is promoted by neural network ensemble algorithms. This analysis covers three widely used loss functions, namely, the squared loss, the cross-entropy loss, and the 0-1 loss; and two widely used model combination strategies, namely, model averaging and weighted majority vote. We empirically validate this theoretical analysis with neural network ensembles.
Originalsprog | Engelsk |
---|---|
Bogserie | Proceedings of Machine Learning Research |
Vol/bind | 151 |
Sider (fra-til) | 11720-11743 |
Antal sider | 24 |
ISSN | 2640-3498 |
Status | Udgivet - 2022 |
Begivenhed | 25th International Conference on Artificial Intelligence and Statistics, AISTATS 2022 - Virtual, Online, Spanien Varighed: 28 mar. 2022 → 30 mar. 2022 |
Konference
Konference | 25th International Conference on Artificial Intelligence and Statistics, AISTATS 2022 |
---|---|
Land/Område | Spanien |
By | Virtual, Online |
Periode | 28/03/2022 → 30/03/2022 |
Bibliografisk note
Funding Information:This research is part of projects PID2019-106758GB-C31, PID2019-106758GB-C32, funded by MCIN/AEI/10.13039/501100011033, FEDER “Una manera de hacer Europa” funds. This research is also partially funded by Junta de Andalućıa grant P20-00091. Finally we would like to thank the “María Zambrano” grant (RR C 2021 01) from the Spanish Ministry of Universities and funded with NextGenerationEU funds.
Funding Information:
This research is part of projects PID2019-106758GB-C31, PID2019-106758GB-C32, funded by MCIN/AEI/10.13039/501100011033, FEDER “Una manera de hacer Europa” funds. This research is also partially funded by Junta de Andalucía grant P20-00091. Finally we would like to thank the “María Zambrano” grant (RR C 2021 01) from the Spanish Ministry of Universities and funded with NextGenerationEU funds.
Publisher Copyright:
Copyright © 2022 by the author(s)