تحلیل فیلترینگ جرائم فهرست مصادیق محتوای مجرمانه مبتنی بر داده‌کاوی

Fa | Ar | En

تحلیل فیلترینگ جرائم فهرست مصادیق محتوای مجرمانه مبتنی بر داده‌کاوی


نویسنده	لک بهزاد ,نوروزی مهدی
منبع	پژوهش هاي اطلاعاتي و جنايي - 1403 - دوره : 19 - شماره : 2 - صفحه:126 -151
چکیده	زمینه و هدف: با رواج استفاده از اینترنت، گرایش و تمایل برخی افراد به انجام جرم در این محیط افزایش ‌یافته است و در تمامی کشورها، برای جلوگیری از محتوای مضر و آسیب‌رسان به افراد جامعه از پالایش محتوا و فیلترینگ به عنوان یک ابزار قدرتمند استفاده می‌شود. در همین راستا، پژوهش حاضر با هدف تحلیل داده‌های فیلترینگ جرائم فهرست مصادیق محتوای مجرمانه مبتنی بر داده‌کاوی پلیس انجام شده است که می‌تواند برای کنترل دامنه بسیاری از جرائم در حوزه‌های اخلاقی، اقتصادی، اجتماعی، فرهنگی و سیاسی مفید باشد. روش: این تحقیق از نظر نوع و هدف، کاربردی و از لحاظ روش توصیفی- تحلیلی است. در این تحقیق از بانک‌ اطلاعاتی سامانه فیلترینگ استفاده شده و به عوامل و مولفه‌ها و ارتباطات داده‌های این سامانه می‌پردازد. پس از آماده‌سازی داده‌ها و انتخاب بهترین فرایند داده‌کاوی، الگوریتم‌های به‌کاررفته برای پیش‌بینی مصادیق محتوای مجرمانه تفسیر می‌شوند.یافته‌ها: از بین مدل‌های مختلف داده‌کاوی، مدل جنگل تصادفی با معیار دقت 82.88 درصد، بالاترین نتیجه را برای داده‌های پژوهش به دست داد. برابر یافته‌های پژوهش، مولفه‌های تاثیرگذار بر روی داده‌های فیلترینگ جرائم ماموریتی فراجا برای جلوگیری از محتوای مجرمانه در فضای مجازی به ترتیب اهمیت از کمترین به بیشترین عبارت‌اند از: آدرس تارنما، فاصله زمانی بین رصد و اظهارنظر سایت‌ها، کاربر، زمان اظهارنظر، تاریخ رصد، کد مجرمانه، نوع دستور، نوع زبان سایت و زمان رصد.نتیجه‌گیری: نتایج به‌دست‌آمده از مدل‌های طبقه‌بندی با استفاده از معیار دقت مورد ارزیابی قرار گرفتند. از بین الگوریتم‌های مختلفی که بررسی شد از مدل درخت تصمیم ساخته‌شده با الگوریتم جنگل تصادفی می‌توان برای پیش‌بینی کد مصادیق محتوای مجرمانه در داده‌های ثبت‌شده در سامانه فیلترینگ استفاده کرد.
کلیدواژه	فیلترینگ، جرائم رایانه‌ای، فهرست مصادیق محتوای مجرمانه، داده‌کاوی
آدرس	دانشگاه علوم انتظامی امین, ایران, دانشگاه علوم انتظامی امین, ایران
پست الکترونیکی	tanha200886@gmail.com

data mining-based analysis of filtered crimes from the criminal content index

Authors	lak behzad ,norouzi mahdi
Abstract	background and aim: with the widespread use of the internet, some individuals' inclination toward committing crimes in this environment has increased. in all countries, content filtering is employed as a powerful tool to prevent harmful and damaging content from affecting society. accordingly, the present study aims to analyze filtering data of crimes listed in the examples of criminal content based on data mining by the police, which can be useful in controlling a wide range of crimes in ethical, economic, social, cultural, and political domains.method: this research is applied in terms of type and purpose and descriptive-analytical in terms of method. the study utilizes the database of the filtering system and examines the factors, components, and relationships within the system's data. after data preparation and selecting the best data mining process, the algorithms used to predict instances of criminal content are interpreted.findings: among various data mining models, the random forest model achieved the highest accuracy of 88.82% for the research data. according to the findings, the components influencing the filtering data of crimes by the cyber police (faraja) to prevent criminal content in cyberspace, ranked from least to most important, are: website address, time interval between monitoring and site comments, user, comment time, monitoring date, crime code, order type, site language type, and monitoring time.conclusion: the results obtained from classification models were evaluated using the accuracy criterion. among the various algorithms examined, the decision tree model constructed with the random forest algorithm can be used to predict the crime code of criminal content instances in the data recorded in the filtering system.
Keywords	filtering ,cybercrimes ,list of criminal content examples ,data mining