|
|
|
|
Predicting viral protein subcellular localization with Chou's pseudo amino acid composition and imbalance-weighted multi-label K-nearest neighbor algorithm
|
|
|
|
|
|
|
|
نویسنده
|
cao j.-z. ,liu w.-q. ,gu h.
|
|
منبع
|
protein and peptide letters - 2012 - دوره : 19 - شماره : 11 - صفحه:1163 -1169
|
|
چکیده
|
Machine learning is a kind of reliable technology for automated subcellular localization of viral proteins within a host cell or virus-infected cell. one challenge is that the viral protein samples are not only with multiple location sites,but also class-imbalanced. the imbalanced dataset often decreases the prediction performance. in order to accomplish this challenge,this paper proposes a novel approach named imbalance-weighted multi-label k-nearest neighbor to predict viral protein subcellular location with multiple sites. the experimental results by jackknife test indicate that the presented algorithm achieves a better performance than the existing methods and has great potentials in protein science. © 2012 bentham science publishers.
|
|
کلیدواژه
|
Class-imbalance; K-nearest neighbor; Multi-label learning; Pseudo amino acid composition; Subcellular localization
|
|
آدرس
|
school of control science and engineering,dalian university of technology,#2 ling-gong road,dalian, China, school of control science and engineering,dalian university of technology,#2 ling-gong road,dalian, China, school of control science and engineering,dalian university of technology,#2 ling-gong road,dalian, China
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Authors
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|