Skip to main content

Research Repository

Advanced Search

A new discrete particle swarm algorithm applied to attribute selection in a bioinformatics data set

Correa, ES; Freitas, AA; Johnson, CG

Authors

ES Correa

AA Freitas

CG Johnson



Abstract

Many data mining applications involve the task of building a model for predictive classification. The goal of such a model is to classify examples (records or data instances) into classes or categories of the same type. The use of variables (attributes) not related to the classes can reduce the accuracy and reliability of a classification or prediction model. Superuous variables can also increase the costs of building a model - particularly on large data sets. We propose a discrete Particle Swarm Optimization (PSO) algorithm designed for attribute selection. The proposed algorithm deals with discrete variables, and its population of candidate solutions contains particles of different sizes. The performance of this algorithm is compared with the performance of a standard binary PSO algorithm on the task of selecting attributes in a bioinformatics data set. The criteria used for comparison are: (1) maximizing predictive accuracy; and (2) finding the smallest subset of attributes.

Citation

Correa, E., Freitas, A., & Johnson, C. (2006). A new discrete particle swarm algorithm applied to attribute selection in a bioinformatics data set. In Proceedings of the 8th annual conference on Genetic and evolutionary computation - GECCO '06 (35). ACM Digital Library. https://doi.org/10.1145/1143997.1144003

Start Date Jul 8, 2006
End Date Jul 12, 2006
Publication Date Jan 1, 2006
Deposit Date Feb 10, 2017
Pages 35
Book Title Proceedings of the 8th annual conference on Genetic and evolutionary computation - GECCO '06
ISBN 1595931864
DOI https://doi.org/10.1145/1143997.1144003
Publisher URL http://dx.doi.org/10.1145/1143997.1144003
Additional Information Event Type : Conference



Downloadable Citations