Skip to main content

Research Repository

Advanced Search

VEGAS: A Variable Length-Based Genetic Algorithm for Ensemble Selection in Deep Ensemble Learning

Han, Kate; Pham, Tien; Hieu Vu, Trung; Dang, Truong; McCall, John; Thanh Nguyen, Tien

Authors

Tien Pham

Trung Hieu Vu

Truong Dang

John McCall

Tien Thanh Nguyen



Abstract

In this study, we introduce an ensemble selection method for deep ensemble systems called VEGAS. The deep ensemble models include multiple layers of the ensemble of classifiers (EoC). At each layer, we train the EoC and generates training data for the next layer by concatenating the predictions for training observations and the original training data. The predictions of the classifiers in the last layer are combined by a combining method to obtain the final collaborated prediction. We further improve the prediction accuracy of a deep ensemble model by searching for its optimal configuration, i.e., the optimal set of classifiers in each layer. The optimal configuration is obtained using the Variable-Length Genetic Algorithm (VLGA) to maximize the prediction accuracy of the deep ensemble model on the validation set. We developed three operators of VLGA: roulette wheel selection for breeding, a chunk-based crossover based on the number of classifiers to generate new offsprings, and multiple random points-based mutation on each offspring. The experiments on 20 datasets show that VEGAS outperforms selected benchmark algorithms, including two well-known ensemble methods (Random Forest and XgBoost) and three deep learning methods (Multiple Layer Perceptron, gcForest, and MULES).

Presentation Conference Type Conference Paper (published)
Conference Name 13th Asian Conference, ACIIDS 2021
Start Date Apr 7, 2021
End Date Apr 10, 2021
Acceptance Date Apr 5, 2021
Online Publication Date Apr 5, 2021
Publication Date Apr 5, 2021
Deposit Date Jan 7, 2025
Publisher Springer
Series Title Lecture Notes in Computer Science
Series Number 12672
Series ISSN 1611-3349
Book Title Intelligent Information and Database Systems
ISBN 9783030732790
DOI https://doi.org/10.1007/978-3-030-73280-6_14