Rumor Detection on the Twitter Social Network Using Tweet and User Features

Authors	Samet Omrani Moslem, Saniee Abadeh Mohammad, Moghaddam Charkari Nasrollah
Source	Signal and Data Processing (پردازش علائم و داده ها), 1403, Issue 2, pp. 15-28
Abstract	Whenever a news item circulates on social networks, reactions to it vary, and it arouses curiosity from different angles. The most important question is whether the news is true. A rumor is unverified news: it has not yet been confirmed and, if it turns out to be invalid, may cause irreparable damage, so detecting it is very important. Rumor detection, in other words determining a rumor's validity, plays a key role in preventing the spread of false news. In this paper, rumors on social networks are detected using new hand-crafted features based on the tweet, the user, and a combination of the two, together with four machine learning classifiers. Because the dataset is imbalanced, oversampling is applied, and because the features differ from one another, normalization is applied. The results show that, despite its simplicity, this method improves considerably on machine learning and deep learning methods, reaching an accuracy of 0.99.
Keywords	rumor detection, machine learning, user feature, tweet feature, hand-crafted feature
Address	Tarbiat Modares University, Faculty of Electrical and Computer Engineering, Iran; Tarbiat Modares University, Faculty of Electrical and Computer Engineering, Iran; Tarbiat Modares University, Faculty of Electrical and Computer Engineering, Iran
Email	moghadam@modares.ac.ir

Rumor Detection on Twitter Using Tweet and User Features

Authors	Samet Omrani Moslem, Saniee Abadeh Mohammad, Moghaddam Charkari Nasrollah
Abstract	When a news item is posted on social media, reactions to it vary and it arouses curiosity from different viewpoints. The most important question is whether the news is accurate. A rumor is unverified news: it has not yet been confirmed, and it may cause irreparable damage if it turns out to be invalid, so detecting it is very important. Rumor detection, in other words determining a rumor's validity, plays an essential role in preventing fake news. Events of every kind, ordinary or anomalous, reach people through social networks. Depending on its importance, a news item may be widely covered or may draw no particular reaction; if it spreads widely, it arouses curiosity from different angles, chiefly about whether it is true or false, valid or invalid. In this work, rumors on social networks are identified using hand-crafted features based on tweets, users, and a combination of the two, together with oversampling, normalization, and machine learning classification. Four machine learning classifiers are used: support vector machine, logistic regression, k-nearest neighbors, and random forest. Two datasets, PHEME 2017 and PHEME 2018, are used. On PHEME 2017, the random forest classifier reaches an accuracy of 0.988 using the tweet and combination features. These features also give a precision of 0.987, better than the other classifiers used in this work. Together with logistic regression, random forest has the best recall, 0.986. It also obtains 0.987 with the two feature sets mentioned. On PHEME 2018, the random forest classifier reaches an accuracy of 0.969 using the tweet and combination features and performs better in precision, recall, and F1. In addition, with the k-nearest neighbors classifier, the user features give better results than the other two feature sets.
Keywords	rumor detection, machine learning, user feature, tweet feature, hand-crafted feature
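
The abstract names three preprocessing and modeling steps: normalization, oversampling of the imbalanced data, and classification with (among others) k-nearest neighbors. The following is a minimal standard-library sketch of how such steps could fit together, not the authors' implementation. The abstract does not say which normalization or oversampling variant was used, so min-max scaling and random duplication of minority rows are assumptions here, and the two toy features (retweet count, follower count) and all values are hypothetical.

```python
import random
from collections import Counter


def min_max_fit(rows):
    """Return per-column (min, max) bounds computed on the given rows."""
    return [(min(col), max(col)) for col in zip(*rows)]


def min_max_apply(rows, bounds):
    """Scale each column with the fitted bounds; constant columns map to 0.0."""
    return [
        [(v - lo) / (hi - lo) if hi > lo else 0.0 for v, (lo, hi) in zip(row, bounds)]
        for row in rows
    ]


def random_oversample(rows, labels, seed=0):
    """Duplicate randomly chosen rows of smaller classes until every class
    has as many rows as the largest class (assumed oversampling variant)."""
    rng = random.Random(seed)
    by_class = {}
    for row, label in zip(rows, labels):
        by_class.setdefault(label, []).append(row)
    target = max(len(members) for members in by_class.values())
    out_rows, out_labels = [], []
    for label, members in by_class.items():
        extra = [rng.choice(members) for _ in range(target - len(members))]
        out_rows.extend(members + extra)
        out_labels.extend([label] * target)
    return out_rows, out_labels


def knn_predict(train_rows, train_labels, query, k=3):
    """Majority vote among the k training rows nearest to the query
    (squared Euclidean distance)."""
    scored = sorted(
        zip(train_rows, train_labels),
        key=lambda pair: sum((a - b) ** 2 for a, b in zip(pair[0], query)),
    )
    votes = Counter(label for _, label in scored[:k])
    return votes.most_common(1)[0][0]


# Hypothetical toy data: [retweet_count, follower_count] per tweet.
train_rows = [
    [120, 50], [150, 80], [130, 40],                             # rumor
    [5, 9000], [8, 12000], [3, 15000], [6, 11000], [4, 9500],    # non-rumor
]
train_labels = ["rumor"] * 3 + ["non-rumor"] * 5

# 1) Normalize with bounds fitted on the training rows only.
bounds = min_max_fit(train_rows)
scaled_rows = min_max_apply(train_rows, bounds)

# 2) Balance the classes in the training set.
balanced_rows, balanced_labels = random_oversample(scaled_rows, train_labels)

# 3) Classify new tweets, scaled with the same training bounds.
queries = min_max_apply([[140, 60], [5, 10000]], bounds)
predictions = [knn_predict(balanced_rows, balanced_labels, q) for q in queries]
print(Counter(balanced_labels), predictions)
```

In this sketch the scaling bounds are fitted on the training rows and reused for new tweets, and only the training set is oversampled, so duplicated rows never leak into evaluation data. Whether the paper follows the same order is not stated in the abstract.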