تشخیص اسلحه دستی بااستفاده از مدل شبکه‎‎های عصبی کانولوشنال سه بعدی

Fa | Ar | En

تشخیص اسلحه دستی بااستفاده از مدل شبکه‎‎های عصبی کانولوشنال سه بعدی


نویسنده	معتمد سارا ,عسکری الهام
منبع	پردازش علائم و داده ها - 1402 - شماره : 2 - صفحه:69 -79
چکیده	از آنجایی که رفتار افراد در ویدئوها بصورت سیگنال‎های سه بعدی است و جستجوی یک عمل خاص بسیار دشوار می‎باشد، لذا نیاز به یک تکنیک مناسب جهت تشخیص خودکار دزدان مسلح در ویدئو‎های امنیتی در حال ضبط می‎باشد. در این مقاله روشی سریع و کارا جهت شناسایی موقعیت افراد و سپس تشخیص اسلحه در دست آنها، با استخراج فریم‌های تصاویر برگرفته از ویدئوها و بدون حذف نقاط اصلی، ارائه شده است. در مرحله نخست و به منظور استخراج فریم‌های تصاویر برگرفته از ویدئوها، الگوریتم جداسازی با نرخ فریم مشخص اعمال خواهد شد و تمامی تصاویر در یک پوشه قرار می‎گیرند. سپس روی تمامی تصاویر بدست آمده طبقه‎بند (hc) haar cascade اعمال شده تا نقاط کلیدی یا فریم‌های مربوط به تصاویر کل بدن استخراج شوند و باقی پس‎زمینه‎ها از تصاویر حذف گردند. در انتها، نمونه‌های هر ویدئو در قالب ماتریس چهار بعدی شامل تعداد دنباله فریم‌های هر ویدئو، عرض، ارتفاع و تعداد کانال تصویر به شبکه 3dcnns ارسال می‌شود تا سلاح در تصاویر شناسایی شوند. لذا نوآوری مقاله ترکیب طبقه‎بند hcو 3dcnns بمنظور افزایش سرعت و کارایی تشخیص اسلحه می‎باشد. همچنین بمنظور بررسی دقت مدل پیشنهادی، از پارامترهای نرخ مثبت صحیح و مثبت کاذب، مقدار پیش بینی مثبت و نرخ تشخیص کاذب استفاده‎ می‎شود.
کلیدواژه	شبکه‎های عصبی سه بعدی (3dcnn)، طبقه‎بندی haar cascade (hc)، بازشناسی اشیاء، شناسایی کل بدن
آدرس	دانشگاه آزاد اسلامی واحد فومن و شفت, گروه کامپیوتر, ایران, دانشگاه آزاد اسلامی واحد فومن و شفت, گروه کامپیوتر, ایران
پست الکترونیکی	askary.elham@gmail.com

detection of handgun using 3d convolutional neural network model (3dcnns)

Authors	motamed sara ,askari elham
Abstract	since the behavior of people in the videos are in 3d signals format and they are long, it is difficult to search for a specific action. therefore, a suitable technique in live security videos is required to detect ongoing armed thieves to reduce the occurrence of crime and theft. the innovation of this paper is to provide a rapid and efficient method for detecting guns in frames of images taken from videos without deleting the main points. the hierarchy of object recognition is that in order to extract frames from images derived from videos, the separation algorithm will be applied at a specified frame rate and all images will be placed in a folder. then, video samples are divided into three categories of training, validation and testing, and using haar cascade (hc) classification, the frames of whole body images are extracted and the rest of the backgrounds are removed from the images. the reason for choosing this method is that the hc classification is resistant to rotation of images and also this algorithm has shown good performance compared to complex calculations. therefore, in our proposed model, we will use this algorithm as a whole body diagnosis. this is done by detecting the region of interest (roi) area by cutting the selected areas, followed by subtracting the background to eliminate unwanted backgrounds. all key points of selection and extraction are stored inside a folder. finally, all images are sent to 3d convolutional neural networks (3dcnns) to detect weapons in the images. finally, in order to evaluate the performance of the system in terms of accuracy, it is used with correct positive rate parameters, false positive rate, positive prediction value and false detection rate. as can be seen in the results of the tests, the highest gun detection rate is related to the 3dcnns model with a detection rate of 96.1%, followed by the best detection model rate related to yolo v3 and with a detection rate of 95.6%.
Keywords	3d neural networks (3dcnns) ,haar-cascade (hc) classification ,object recognition ,full body recognition