Improving User Interest Prediction in Twitter Big Data Using an Ensemble Classifier


Authors	Chakerolhoseini Firouzabad Mohamad, Ghaemi Reza
Source	Rasaneh - 1403 - Volume: 35 - Issue: 2 - Pages: 107-131
Abstract	Social networks such as Twitter, Telegram, and Instagram have become part of everyday life and continue to grow. As their user bases expand, the volume of data exchanged and stored on them grows as well, and this volume has turned social networks, Twitter in particular, into sources of big data. Managing, organizing, and pruning this data, and predicting the behavior of social network users, are therefore important tasks. Classification techniques are among the most important and effective methods for predicting user interest in social networks, yet in most applications and studies in the literature they still fall short on criteria such as precision and accuracy of prediction. This paper uses a voting-based ensemble classification method with two main steps to predict user interest on Twitter. In the first step, the base classification algorithms (nearest neighbor, decision tree, random forest, and naive Bayes) each produce their own output. In the second step, the final output of the ensemble is computed by voting. Experiments on a Twitter big-data set, evaluated by precision, accuracy, and recall, indicate that the proposed voting-based ensemble method yields better results than the other algorithms.
Keywords	big data, social network, user interest prediction, ensemble classification, Twitter
Address	Islamic Azad University, Neyshabur Branch, Faculty of Engineering, Iran; Islamic Azad University, Quchan Branch, Faculty of Engineering, Department of Computer Engineering, Iran
Email	r.ghaemi@iauq.ac.ir
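
The second step described in the abstract, combining the outputs of the base classifiers by voting, can be sketched as below. This is a minimal illustration, not the authors' implementation: the abstract does not say whether the vote is weighted or how ties are resolved, so plain majority voting with a first-classifier tie-break is an assumption, and the function name `majority_vote` and the interest labels are hypothetical.

```python
from collections import Counter


def majority_vote(predictions_per_classifier):
    """Combine per-classifier label lists by hard (majority) voting.

    predictions_per_classifier: list of equal-length lists, one list of
    predicted labels per base classifier.
    Tie-break (an assumption, not from the paper): among the labels with
    the highest count, pick the one voted for by the earliest classifier.
    """
    if not predictions_per_classifier:
        raise ValueError("need predictions from at least one classifier")
    n_samples = len(predictions_per_classifier[0])
    if any(len(p) != n_samples for p in predictions_per_classifier):
        raise ValueError("all classifiers must predict the same samples")

    combined = []
    # zip(*...) regroups the data so each tuple holds all votes for one sample
    for votes in zip(*predictions_per_classifier):
        counts = Counter(votes)
        top = max(counts.values())
        combined.append(next(v for v in votes if counts[v] == top))
    return combined


# Hypothetical outputs of the four base classifiers for four users.
knn_out = ["sports", "tech", "tech", "music"]
tree_out = ["sports", "music", "tech", "music"]
forest_out = ["tech", "tech", "tech", "sports"]
bayes_out = ["sports", "tech", "music", "sports"]

final = majority_vote([knn_out, tree_out, forest_out, bayes_out])
print(final)
```

For the last user the vote is tied two to two, so the label from the first classifier in the list wins under the assumed tie-break.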

Improving User Interest Prediction in Twitter Big Data Using an Ensemble Classifier

Authors	Chakerolhoseini Firouzabad Mohamad, Ghaemi Reza
Abstract	In today's world, social networks such as Twitter, Telegram, and Instagram have become part of people's daily lives and are expanding day by day. The number of their users is increasing as well, and as a result a large amount of data is being exchanged and stored in these networks. This volume of data has turned social networks, especially Twitter, into big data. It is very important to manage, organize, and prune this data, as well as to predict the behavior of social network users. Classification techniques are among the most important and effective methods for predicting user interest in social networks, but in most applications and studies in the literature they are still weak on criteria such as precision and accuracy of prediction. In this article, a voting-based ensemble classification method with two basic steps is used to predict user interest on Twitter. In the first step, the outputs of the base classification algorithms, namely nearest neighbor, decision tree, random forest, and naive Bayes, are obtained. In the second step, the final output of the ensemble classifier is calculated by voting. The results of experiments on a Twitter big-data set, based on the criteria of precision, accuracy, and recall, indicate that the proposed voting-based ensemble method achieves better results than the other algorithms.
Keywords	big data, social network, user interest prediction, ensemble classification, Twitter
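
The three evaluation criteria named in the abstract can be computed from predicted and true labels as in the sketch below. It assumes the criteria correspond to the standard definitions of accuracy, precision, and recall for a single positive class. The abstract does not give the label set or say how the metrics are averaged, so the function name `binary_metrics` and the labels `"interested"` and `"not"` are hypothetical.

```python
def binary_metrics(y_true, y_pred, positive):
    """Return accuracy, precision, and recall for one positive class.

    Precision and recall fall back to 0.0 when their denominator is zero
    (a convention chosen for this sketch).
    """
    if len(y_true) != len(y_pred) or not y_true:
        raise ValueError("y_true and y_pred must be non-empty and equal length")

    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    tn = len(pairs) - tp - fp - fn

    return {
        "accuracy": (tp + tn) / len(pairs),               # share of correct predictions
        "precision": tp / (tp + fp) if tp + fp else 0.0,  # correct among predicted positives
        "recall": tp / (tp + fn) if tp + fn else 0.0,     # found among actual positives
    }


# Hypothetical ground truth and ensemble output for five users.
y_true = ["interested", "interested", "not", "not", "interested"]
y_pred = ["interested", "not", "not", "interested", "interested"]

metrics = binary_metrics(y_true, y_pred, positive="interested")
print(metrics)
```

In this toy example there are two true positives, one false positive, one false negative, and one true negative, so accuracy is 3/5 and both precision and recall are 2/3.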