|
|
a distributed minimum redundancy maximum relevance feature selection approac
|
|
|
|
|
نویسنده
|
sharifnezhad m. ,rahmani m. ,ghaffarian h.
|
منبع
|
مهندسي برق دانشگاه تبريز - 2021 - دوره : 51 - شماره : 2 - صفحه:285 -293
|
چکیده
|
Feature selection (fs) is served in almost all data mining applications along with some benefits such as reducing the computation and storage cost. most of the current feature selection algorithms just work in a centralized manner. however, this process does not apply to high dimensional datasets, effectively. in this paper, we propose a distributed version of minimum redundancy maximum relevance (mrmr) algorithm. the proposed algorithm acts in six steps to solve the problem. it distributes datasets horizontally into subsets, selects and eliminates redundant features, and finally merges the subsets into a single set. we evaluate the performance of the proposed method using different datasets. the results prove that the suggested method can improve classification accuracy and reduce the runtime
|
کلیدواژه
|
minimum redundancy ,maximum relevance ,classification accuracy ,feature reduction ,distributed processing
|
آدرس
|
arak university, faculty of engineering, department of computer engineering, iran, arak university, faculty of engineering, department of computer engineering, iran, arak university, faculty of engineering, department of computer engineering, iran
|
پست الکترونیکی
|
h-ghaffarian@araku.ac.ir
|
|
|
|
|
|
|
|
|
A Distributed Minimum Redundancy Maximum Relevance Feature Selection Approac
|
|
|
Authors
|
Sharifnezhad M. ,Rahmani M. ,Ghaffarian H.
|
Abstract
|
Feature selection (FS) is served in almost all data mining applications along with some benefits such as reducing the computation and storage cost. Most of the current feature selection algorithms just work in a centralized manner. However, this process does not apply to high dimensional datasets, effectively. In this paper, we propose a distributed version of Minimum Redundancy Maximum Relevance (mRMR) algorithm. The proposed algorithm acts in six steps to solve the problem. It distributes datasets horizontally into subsets, selects and eliminates redundant features, and finally merges the subsets into a single set. We evaluate the performance of the proposed method using different datasets. The results prove that the suggested method can improve classification accuracy and reduce the runtime
|
Keywords
|
Minimum Redundancy ,Maximum Relevance ,Classification accuracy ,feature reduction ,Distributed processing
|
|
|
|
|
|
|
|
|
|
|