|
|
Selecting optimal feature set in high-dimensional data by Swarm Search
|
|
|
|
|
نویسنده
|
fong s. ,zhuang y. ,tang r. ,yang x.-s. ,deb s.
|
منبع
|
journal of applied mathematics - 2013 - دوره : 2013 - شماره : 0
|
چکیده
|
Selecting the right set of features from data of high dimensionality for inducing an accurate classification model is a tough computational challenge. it is almost a np-hard problem as the combinations of features escalate exponentially as the number of features increases. unfortunately in data mining,as well as other engineering applications and bioinformatics,some data are described by a long array of features. many feature subset selection algorithms have been proposed in the past,but not all of them are effective. since it takes seemingly forever to use brute force in exhaustively trying every possible combination of features,stochastic optimization may be a solution. in this paper,we propose a new feature selection scheme called swarm search to find an optimal feature set by using metaheuristics. the advantage of swarm search is its flexibility in integrating any classifier into its fitness function and plugging in any metaheuristic algorithm to facilitate heuristic search. simulation experiments are carried out by testing the swarm search over some high-dimensional datasets,with different classification algorithms and various metaheuristic algorithms. the comparative experiment results show that swarm search is able to attain relatively low error rates in classification without shrinking the size of the feature subset to its minimum. © 2013 simon fong et al.
|
|
|
آدرس
|
department of computer and information science, Macau, department of computer and information science, Macau, department of computer and information science, Macau, faculty of science and technology, United Kingdom, department of computer science and engineering,cambridge institute of technology, India
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Authors
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|