|
|
random-splitting random forest with multiple mixed-data covariates
|
|
|
|
|
نویسنده
|
fayaz mohammad ,abadi alireza ,khodakarimd soheila
|
منبع
|
journal of biostatistics and epidemiology - 2023 - دوره : 9 - شماره : 1 - صفحه:20 -29
|
چکیده
|
Introduction:the bagging (bg) and random forest (rf) are famous supervised statistical learning methods based on the classification and regression trees. the bg and rf can deal with different types of responses such as categorical, continuous, etc. there are curves, time series, functional data, or observations that are related to each other based on their domain in many statistical applications. the rf methods are extended to some cases for functional data as covariates or responses in many pieces of literature. among them, random-splitting is used to summarize the functional data to the multiple related summary statistics such as average, etc. methods: this research article extends this method and introduces the mixed data bg (md-bg) and rf (md-rf) algorithm for multiple functional and non-functional, or mixed and hybrid data, covariates and it calculates the variable importance plot (vip) for each covariate. results: the main differences between md-bg and md-rf are in choosing the covariates that in the first, all covariates remain in the model but the second uses a random sample of covariates. the md-rf helps to unmask the most important parts of functional covariates and the most important non-functional covariates. conclusion: we apply our methods on the two datasets of dti and tecator and compare their performances for continuous and categorical responses with developed r package (“rsrf”) in the github.
|
کلیدواژه
|
bagging ,functional data ,random forest ,random splitting ,statistical learning
|
آدرس
|
allameh tabataba'i university, eco college of insurance, iran, shahid beheshti university of medical sciences, social determinants of health research center, faculty of medicine, department of community medicine, iran, shiraz university of medical sciences, school of medicine, department of biostatistics, iran
|
پست الکترونیکی
|
lkhodakarim@gmail.com
|
|
|
|
|
|
|
|
|
|
|
|
Authors
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|