|
|
an anomaly data mining method for mass sensor networks using improved pso algorithm based on spark parallel framework
|
|
|
|
|
نویسنده
|
yuan jingzhen
|
منبع
|
journal of grid computing - 2020 - دوره : 18 - شماره : 2 - صفحه:251 -261
|
چکیده
|
Accurate detection and capture of anomaly data in complex network data stream is an important part of ensuring network security. traditional methods cannot adapt to the high dynamic changes of abnormal data characteristics in complex network. thus, the detection accuracy is reduced. in this paper, a k-means parallel clustering algorithm is proposed. it is optimized by particle swarm optimization with dynamic adaptive inertia weight (dspsok-means). and it is used to mine the anomaly data for mass sensor networks. the inertia weight is dynamically adjusted through the fitness function, so that the dspso algorithm has the adaptive characteristics. then, the output of the dspso algorithm is taken as the input of the k-means algorithm. thus, the intelligence and self-adaptability of the k-means algorithm in selecting the initial center point is improved. finally, with the help of spark platform, the parallelization of dspsok-means clustering algorithm in the clustering environment is designed and implemented. it is shown by the experimental results that the traffic among nodes in the execution process can be effectively reduced by the dspsok-means algorithm. and the accuracy of abnormal data mining in complex network data flow is 5% higher than that of the comparison algorithm on average.
|
کلیدواژه
|
spark parallel framework ,anomaly data mining ,fitness function ,improved pso algorithm ,sensor network ,mass data
|
آدرس
|
hanshan normal university, school of physics and electronic engineering, china
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Authors
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|