مدل‌ سازی بارش روزانه تبریز با روش‌های درختی ادغام شده با تجزیه فصلی-روند و رویکرد دسته‌بندی

Fa | Ar | En

مدل‌ سازی بارش روزانه تبریز با روش‌های درختی ادغام شده با تجزیه فصلی-روند و رویکرد دسته‌بندی


نویسنده	جاویدان سحر ,ستاری محمدتقی ,محسن زاده شکوه
منبع	آب و خاك - 1401 - دوره : 36 - شماره : 3 - صفحه:407 -420
چکیده	بارش به‌عنوان یک متغیر تصادفی با داشتن تغییرات مکانی و زمانی یکی از عناصر پیچیده در چرخه هیدرولوژی است. هدف پژوهش حاضر برآورد میزان بارش روزانه تبریز در بازه زمانی 36 ساله (1986-2021) با استفاده از گروه روش‌های درختی شامل، مدل درختی m5p، درخت تصادفی، کاهش خطای هرس درخت و روش دسته‌بندی است. بدین منظور از مقادیر بارش ایستگاه‌های حوضه دریاچه ارومیه از جمله سهند، سراب، ارومیه، مراغه و مهاباد در ترکیب‌های ورودی مختلف استفاده شد. ماتریس همبستگی و الگوریتم رلیف مبنای انتخاب سناریوهای ورودی در نظر گرفته شد و تاثیر مولفه‌های تجزیه فصلی-روند در بهبود نتایج مدل‌سازی بررسی شد. عملکرد روش‌های مذکور با معیارهای ضریب همبستگی، ریشه میانگین مربعات خطا، ضریب نش ساتکلیف، میانگین خطای قدر مطلق و ضریب ویلموت اصلاح شده مورد ارزیابی قرار گرفت. بررسی نتایج نشان داد رویکرد دسته‌بندی در اکثر موارد نتایج قابل قبولی ارائه نموده و باعث بهبود نتایج مدل‌سازی می‌گردد. بررسی‌ها مشخص نمود که ایستگاه سهند با بیشترین همبستگی و کمترین فاصله از تبریز، موثرترین ایستگاه مجاور در برآورد میزان بارش تبریز می‌باشد. در حالت اول و بدون اعمال مولفه‌های تجزیه (روند، فصلی و باقیمانده) در بین روش‌های مورد استفاده روش m5p با سناریو اول شامل بارش سهند به‌عنوان روش و سناریو برتر انتخاب شد. در حالت دوم با وارد شدن مولفه‌های تجزیه، دقت تخمین‌ها به‌صورت چشم گیری افزایش یافت. ادغام روش دسته‌بندی با الگوریتم پایه m5p با پارامترهای بارش سهند و باقیمانده بارش تبریز با r=0.98 و ns=0.95 به‌عنوان برترین حالت انتخاب گردید. در حالت کلی نتایج نشان داد، بهره‌گیری توام از رویکرد دسته‌بندی مدل‌ها و الگوریتم پیش‌پردازش مولفه‌های تجزیه باعث بهبود نتایج مدل‌سازی بارش روزانه تبریز می‌شود. به طوریکه مقدار خطای rmse نسبت به حالت اول 64/60 درصد کاهش یافت. بنابراین به علت استفاده از حداقل تعداد پارامتر ورودی و ارائه نتایج قابل قبول، مدل‌های دسته‌بندی با الگوریتم پایه درختی به‌عنوان روش‌های ساده و پرکاربرد پیشنهاد می گردد.
کلیدواژه	تجزیه، حوضه دریاچه ارومیه، رویکرد دسته‌بندی، مدل‌های درختی، ویلموت اصلاح شده
آدرس	دانشگاه تبریز, دانشکده کشاورزی, گروه علوم و مهندسی آب, ایران, دانشگاه تبریز, دانشکده کشاورزی, گروه علوم و مهندسی آب, ایران, دانشگاه تبریز, دانشکده کشاورزی, گروه علوم و مهندسی آب, ایران
پست الکترونیکی	shkmsn2000@gmail.com

tabriz daily rainfalls modeling via hybridized tree based and seasonal-trend component bagging method

Authors	javidan s. ,sattari m.t. ,mohsenzadeh sh.
Abstract	introductionprecipitation is one of the most important components of water cycle. accurate precipitation measurement is essential for flood forecasting and control, drought analysis, runoff modeling, sediment control and management, watershed management, agricultural irrigation planning, and water quality studies. determining the correct amount of precipitation in cities and rural areas is also important for managing floods. the precipitation process is completely non-linear and involves randomness in terms of time and space. therefore, it is not easy to explain that with simple linear models due to various climatic factors and may contain major errors. therefore, various methods and models have been proposed to evaluate, and predict precipitation. this study aimed to estimate the daily precipitation of tabriz based on hybridized tree-based and bagging methods by using neighboring stations.materials and methodsin the present study, the rainfall data of adjacent stations in urmia lake basin (sahand, sarab, urmia, maragheh and mahabad) were employed in 1986-2021 to estimate the daily rainfall in tabriz. about 70% of data were considered for calibration and 30% of data were applied for validation. using the correlation matrix and relief algorithm, various input components were identified. modeling was performed using tree-based data mining methods including m5p, rt and rept and bagging method. the daily precipitations of tabriz was decomposed into their components by seasonal-trend analysis method. its components, including trend, seasonal and residual, were used in different input scenarios to investigate the effect of these components on improving the modeling results. to evaluate the modeling performance, the indices of correlation coefficient, root mean square error, nash-sutcliffe efficiency and modified wilmot coefficient were applied.results and discussionrt and rept methods increased the accuracy of the model and decreased its error when they were used as the basic algorithm of the bagging method. this was not the case with the m5p method, as the results were slightly weaker. it was also observed that tabriz rainfall is largely influenced by sahand rainfall, as the most models gave reliable estimates by using the rainfall data for sahand station. this can be explained by the high correlation between tabriz rainfall and sahand. the results showed that the first scenario (sahand) for m5p, rt, rept and b-m5p method, the fifth scenario (sahand, sarab, urmia, maragheh and mahabad) for the b-rt method, and the fourth scenario (sahand, sarab, urmia and mahabad) for the b-rept method were the best scenarios. the best performance was found for the scenario 1 of the m5p decision tree model, followed by the bagging method with the m5p base algorithm. in general, it was concluded that application of the bagging method produced reliable results. modeling without considering the decomposition components was compared with modeling with decomposition components. adding seasonal, trend and residual components to the modeling input combinations significantly improved the accuracy of the results. application of bagging method in most cases also increased the modeling accuracy. the first scenario (sahand and residual) for m5p and b-m5p methods, the tenth scenario (residual, trend, seasonal, sahand and sarab) for rt, rept and b-rept methods, and the eighth scenario (residual, trend and sahand) for b-rt method were selected as the best scenarios. as a result, among the stations, sahand, due to proximity and high correlation, and sarab, due to greater correlation, had a great impact on precipitation in tabriz. in general, the bagging method with the basic m5p algorithm (b-m5p) was best suited in the first scenario. thus, adding precipitation analysis components and using the bagging method improve the modeling results with tree-based data mining methods.conclusionour results showed that bagging method provided acceptable results in most cases. in the first case, the first scenario of m5p method including sahand precipitation data was selected as the superior method and scenario. as a result, sahand was the most effective station in estimating tabriz rainfall with the highest correlation and the shortest distance from tabriz. in the second case, with the decomposition components, the accuracy of the results increased significantly. the bagging method with the basic m5p algorithm, the parameters of sahand precipitation and the residual of tabriz precipitation was considered as the best modeling algorithm. it can be concluded that using bagging method and decomposition components with the closest station to the studied station results in the highest accuracy. therefore, bagging models with tree-based algorithm can be considered as simple and widely used methods.
Keywords	bagging method ,decomposition ,modified wilmot ,tree models ,urmia lake basin