>
Fa   |   Ar   |   En
   a distributed minimum redundancy maximum relevance feature selection approac  
   
نویسنده sharifnezhad m. ,rahmani m. ,ghaffarian h.
منبع مهندسي برق دانشگاه تبريز - 2021 - دوره : 51 - شماره : 2 - صفحه:285 -293
چکیده    Feature selection (fs) is served in almost all data mining applications along with some benefits such as reducing the computation and storage cost. most of the current feature selection algorithms just work in a centralized manner. however, this process does not apply to high dimensional datasets, effectively. in this paper, we propose a distributed version of minimum redundancy maximum relevance (mrmr) algorithm. the proposed algorithm acts in six steps to solve the problem. it distributes datasets horizontally into subsets, selects and eliminates redundant features, and finally merges the subsets into a single set. we evaluate the performance of the proposed method using different datasets. the results prove that the suggested method can improve classification accuracy and reduce the runtime
کلیدواژه minimum redundancy ,maximum relevance ,classification accuracy ,feature reduction ,distributed processing
آدرس arak university, faculty of engineering, department of computer engineering, iran, arak university, faculty of engineering, department of computer engineering, iran, arak university, faculty of engineering, department of computer engineering, iran
پست الکترونیکی h-ghaffarian@araku.ac.ir
 
   A Distributed Minimum Redundancy Maximum Relevance Feature Selection Approac  
   
Authors Sharifnezhad M. ,Rahmani M. ,Ghaffarian H.
Abstract    Feature selection (FS) is served in almost all data mining applications along with some benefits such as reducing the computation and storage cost. Most of the current feature selection algorithms just work in a centralized manner. However, this process does not apply to high dimensional datasets, effectively. In this paper, we propose a distributed version of Minimum Redundancy Maximum Relevance (mRMR) algorithm. The proposed algorithm acts in six steps to solve the problem. It distributes datasets horizontally into subsets, selects and eliminates redundant features, and finally merges the subsets into a single set. We evaluate the performance of the proposed method using different datasets. The results prove that the suggested method can improve classification accuracy and reduce the runtime
Keywords Minimum Redundancy ,Maximum Relevance ,Classification accuracy ,feature reduction ,Distributed processing
 
 

Copyright 2023
Islamic World Science Citation Center
All Rights Reserved