   The Performance of Clustering Approach with Robust MM-Estimator for Multiple Outlier Detection in Linear Regression  
   
Authors: Mohd Azmi Nurulhuda Firdaus, Midi Habshah, Ismail Noranita Fairus
Source: Jurnal Teknologi, 2006, Volume 45, Issue C, Pages 15-28
Abstract: Identifying outliers is a fundamental step in the regression model building process. Outlying observations should be identified because of their potential effect on the fitted model. Because of this need, numerous outlier measures, such as residuals and the diagonal elements of the hat matrix, have been developed. However, these measures work well only when a regression data set contains a single outlying point, and it is well established that real regression data sets may contain multiple outlying observations that are not easy to identify individually with the same measures. This paper proposes an alternative approach for multiple outlier identification: a clustering technique combined with a robust estimator, namely the MM-estimator. The performance of the clustering approach with the proposed estimator is compared with that of the classical least squares (LS) estimator and another robust estimator, least trimmed squares (LTS). The estimators are evaluated on classical multiple-outlier data sets from the literature and on simulated multiple-outlier data sets. In addition, the root mean square error (RMSE) and the coverage probabilities of bootstrap bias-corrected and accelerated (BCa) confidence intervals are analysed to identify the best estimator for multiple outlier identification. The analysis shows that the MM-estimator performed best on the classical multiple-outlier data sets and on a wide variety of simulated data sets, across the percentages of outliers, numbers of regressor variables, and sample sizes examined, followed by LTS and LS. The RMSE of the proposed estimator was always smaller than that of the other two estimators. The BCa confidence interval results likewise indicate that the MM-estimator meets all the criteria for the best estimator: it has good coverage probability, good equal-tailedness, and the shortest average interval length, followed by LTS and LS.
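To illustrate why LS can fail when several outliers occur together, the following is a minimal sketch and not the authors' procedure. It omits the clustering step entirely and substitutes a simplified "MM-like" fit for a true MM-estimator: a Theil-Sen starting fit (instead of an S-estimate), a fixed MAD-based residual scale, and iteratively reweighted least squares with Tukey bisquare weights. The synthetic data, the tuning constant 4.685, the 2.5 cutoff for flagging, and all function names are assumptions made for illustration only.

```python
from math import sqrt
from statistics import median

def weighted_ls(x, y, w):
    """Weighted least squares for y = a + b*x; returns (a, b)."""
    sw = sum(w)
    xbar = sum(wi * xi for wi, xi in zip(w, x)) / sw
    ybar = sum(wi * yi for wi, yi in zip(w, y)) / sw
    sxx = sum(wi * (xi - xbar) ** 2 for wi, xi in zip(w, x))
    sxy = sum(wi * (xi - xbar) * (yi - ybar) for wi, xi, yi in zip(w, x, y))
    slope = sxy / sxx
    return ybar - slope * xbar, slope

def theil_sen(x, y):
    """Robust starting fit: median of pairwise slopes (stand-in for an S-estimate)."""
    n = len(x)
    slopes = [(y[j] - y[i]) / (x[j] - x[i])
              for i in range(n) for j in range(i + 1, n) if x[j] != x[i]]
    slope = median(slopes)
    intercept = median([yi - slope * xi for xi, yi in zip(x, y)])
    return intercept, slope

def robust_fit(x, y, c=4.685, iters=50):
    """Simplified MM-like fit: robust start, fixed scale, bisquare IRLS."""
    a, b = theil_sen(x, y)
    res = [yi - a - b * xi for xi, yi in zip(x, y)]
    # Scale is estimated once from the starting fit and then held fixed.
    scale = 1.4826 * median([abs(r) for r in res])
    for _ in range(iters):
        # Bisquare weights: residuals beyond c*scale get zero weight.
        w = [(1 - (r / (c * scale)) ** 2) ** 2 if abs(r) < c * scale else 0.0
             for r in res]
        a, b = weighted_ls(x, y, w)
        res = [yi - a - b * xi for xi, yi in zip(x, y)]
    return a, b, scale, res

# Synthetic data (assumed): true line y = 2 + 3x, small deterministic noise,
# and four high-leverage outliers shifted down by 40 units.
x = [float(i) for i in range(20)]
noise = [0.05 * ((7 * i) % 5 - 2) for i in range(20)]
y = [2.0 + 3.0 * xi + ni for xi, ni in zip(x, noise)]
for i in range(16, 20):
    y[i] -= 40.0

a_ls, b_ls = weighted_ls(x, y, [1.0] * len(x))   # ordinary LS
a_rob, b_rob, scale, res = robust_fit(x, y)      # simplified robust fit

# Flag observations whose scaled residual from the robust fit is large.
flagged = [i for i, r in enumerate(res) if abs(r) / scale > 2.5]

def rmse_vs_truth(a, b):
    """RMSE of the fitted line against the known true line (simulation only)."""
    return sqrt(sum((a + b * xi - (2.0 + 3.0 * xi)) ** 2 for xi in x) / len(x))

print("LS fit:     ", a_ls, b_ls, rmse_vs_truth(a_ls, b_ls))
print("Robust fit: ", a_rob, b_rob, rmse_vs_truth(a_rob, b_rob))
print("Flagged indices:", flagged)
```

In this toy setting the grouped outliers pull the LS slope well away from the true value, while the robust fit stays close to it and its residuals single out the shifted points. A production analysis would use an established MM-estimator implementation rather than this sketch.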
Keywords: Multiple outliers, linear regression, robust estimator, MM-estimator, bootstrap bias-corrected and accelerated (BCa) confidence interval
Affiliations: Universiti Teknologi Malaysia City Campus, Centre for Advanced Software Engineering (CASE), Malaysia; Universiti Putra Malaysia, Institut Penyelidikan Matematik (INSPEM), Malaysia; Universiti Teknologi Malaysia, Fakulti Sains Komputer & Sistem Maklumat, Malaysia
Email: anita@utm.my
 
     
   