|
|
comparative analysis of tree-based machine learning algorithms on thyroid disease prediction using ros technique and hyperparameter optimization
|
|
|
|
|
نویسنده
|
moradi elahe
|
منبع
|
journal of ai and data mining - 2024 - دوره : 12 - شماره : 4 - صفحه:511 -520
|
چکیده
|
Thyroid disease is common worldwide and early diagnosis plays an important role in effective treatment and management. utilizing machine learning techniques is vital in thyroid disease diagnosis. this research proposes tree-based machine learning algorithms using hyperparameter optimization techniques to predict thyroid disease. the thyroid disease dataset from the uci repository is benchmarked to evaluate the performance of the proposed algorithms. after data preprocessing and normalization steps, data balancing has been applied to the data using the random oversampling (ros) technique. also, two methods of grid search (gs) and random search (rs) have been employed to optimize hyperparameters. finally, employing python software, various criteria were used to evaluate the performance of proposed algorithms such as decision tree, random forest, adaboost, and extreme gradient boosting. the results of the simulations indicate that the extreme gradient boosting (xgb) algorithm with the grid search method outperforms all the other algorithms, obtaining an impressive accuracy, auc, sensitivity, precision, and mcc of 99.39%, 99.97%, 98.85%, 99.40%, 98.79%, respectively. these results demonstrated the potential of the proposed method for accurately predicting thyroid disease.
|
کلیدواژه
|
machine learning ,thyroid disease prediction ,imbalanced data ,optimization ,random forest
|
آدرس
|
islamic azad university, yadegar-e-imam khomeini (rah) shahre rey branch, department of electrical and computer engineering, iran
|
پست الکترونیکی
|
elahe.moradi@iau.ac.ir
|
|
|
|
|
|
|
|
|
|
|
|
Authors
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|