|
|
|
|
a local density-based outlier detection method for high dimension data
|
|
|
|
|
|
|
|
نویسنده
|
abdulghafoor shahad adel ,mohamed lekaa ali
|
|
منبع
|
international journal of nonlinear analysis and applications - 2022 - دوره : 13 - شماره : 1 - صفحه:1683 -1699
|
|
چکیده
|
The researchers faced challenges in the outlier detection process, mainly when deals with the high dimensional dataset; to handle this problem, we use the principal component analysis. outlier detection or anomaly detection, with local density-based methods, compares the density of observation with the surrounding local density neighbors. we apply the outlier score as a measure of comparison. in this research, we choose different density estimation functions and calculated different distances. weighted kernel density estimation with adaptive bandwidth for multivariate kernel density estimation (gaussian) considered the knn and rnn. knn is considered too for the epanenchnikov kernel density estimation. lastly, we estimate the lof as a base method in detecting outliers. extensive experiments on a synthetic dataset have shown that rkdos and epa are more efficient than lof using the precision evaluation criterion.
|
|
کلیدواژه
|
local density; k-nearest neighbor; r-nearest neighbor; outlier score; wkde
|
|
آدرس
|
central statistical organization, ministry of planning, department of statistics, iraq. university of baghdad, college of management and economics, department of statistics, iraq, university of baghdad, college of management and economics, department of statistics, iraq
|
|
پست الکترونیکی
|
lekaa.a@coadec.uobaghdad.edu.iq
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Authors
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|