A Distributed Minimum Redundancy Maximum Relevance Feature Selection Approach




Authors	Sharifnezhad M., Rahmani M., Ghaffarian H.
Source	Journal of Electrical Engineering, University of Tabriz (مهندسي برق دانشگاه تبريز) - 2021 - Volume: 51 - Issue: 2 - Pages: 285-293
Abstract	Feature selection (FS) is used in almost all data mining applications and brings benefits such as reduced computation and storage costs. Most current feature selection algorithms work only in a centralized manner, which does not scale effectively to high-dimensional datasets. In this paper, we propose a distributed version of the Minimum Redundancy Maximum Relevance (mRMR) algorithm. The proposed algorithm works in six steps. It distributes the dataset horizontally into subsets, selects features and eliminates redundant ones, and finally merges the subsets into a single set. We evaluate the performance of the proposed method on different datasets. The results show that the suggested method can improve classification accuracy and reduce runtime.
Keywords	Minimum redundancy, Maximum relevance, Classification accuracy, Feature reduction, Distributed processing
Address	Arak University, Faculty of Engineering, Department of Computer Engineering, Iran (all three authors)
Email	h-ghaffarian@araku.ac.ir
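
The abstract only outlines the method (horizontal partitioning, per-subset selection with redundancy elimination, then a merge), so the following Python sketch is an illustration of that general idea rather than the authors' six-step algorithm. It assumes discrete features, a greedy mRMR criterion of relevance minus mean redundancy, and a merge by vote count across partitions; the function names, the voting rule, and the toy data are all hypothetical.

```python
import math
from collections import Counter

def mutual_information(xs, ys):
    """Mutual information (in nats) between two discrete sequences."""
    n = len(xs)
    px, py, pxy = Counter(xs), Counter(ys), Counter(zip(xs, ys))
    return sum(
        (c / n) * math.log(c * n / (px[x] * py[y]))
        for (x, y), c in pxy.items()
    )

def mrmr(rows, labels, k):
    """Greedy mRMR: repeatedly add the feature with the highest
    relevance to the label minus mean redundancy with chosen features."""
    n_features = len(rows[0])
    cols = [[r[j] for r in rows] for j in range(n_features)]
    relevance = [mutual_information(c, labels) for c in cols]
    selected = []
    while len(selected) < min(k, n_features):
        best, best_score = None, -math.inf
        for j in range(n_features):
            if j in selected:
                continue
            redundancy = 0.0
            if selected:
                redundancy = sum(
                    mutual_information(cols[j], cols[s]) for s in selected
                ) / len(selected)
            score = relevance[j] - redundancy
            if score > best_score:
                best, best_score = j, score
        selected.append(best)
    return selected

def distributed_mrmr(rows, labels, k, n_parts):
    """Assumed scheme: split rows horizontally, run mRMR on each part,
    then merge the per-part selections by vote count."""
    votes = Counter()
    for p in range(n_parts):
        # Every n_parts-th row goes to partition p (horizontal split).
        votes.update(mrmr(rows[p::n_parts], labels[p::n_parts], k))
    # Most-voted features first; ties broken by feature index.
    ranked = sorted(votes, key=lambda j: (-votes[j], j))
    return ranked[:k]

# Toy data: feature 1 duplicates feature 0; feature 2 is independent of both.
rows, labels = [], []
for i in range(32):
    a, b = (i >> 1) & 1, (i >> 2) & 1
    rows.append((a, a, b))
    labels.append(2 * a + b)

print(distributed_mrmr(rows, labels, k=2, n_parts=2))
```

On this toy data the duplicate feature is skipped in favour of the independent one. In a real deployment each partition would run on a separate worker; the loop here is sequential only to keep the sketch self-contained.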
