Development of a Model Based on Gradient Resonance in Deep Convolutional Networks to Identify Targets in Remote Sensing Images


Authors	Farhadi Nima, Kiani Abbas, Ebadi Hamid
Source	Journal of Geomatics Science and Technology (علوم و فنون نقشه برداري) - 1400 - Volume: 11 - Issue: 1 - Pages: 35-50
Abstract	Advances in satellite imaging technology make it possible to obtain diverse information for target detection. Such information facilitates the interpretation of optical remote sensing images. One particular kind of interpretation concerns target detection, and most research in this area is now carried out with neural networks and deep learning techniques. The design of the convolutional neural network plays a major role in detection accuracy. Recent research on deep learning and convolutional networks shows that making these networks deeper increases their accuracy; however, excessive depth sometimes causes problems such as a growing number of trainable parameters, vanishing training gradients, and many generated features going unused, which in turn reduces accuracy in detecting the targets of interest. To address this, the present study develops a method that attempts to overcome these problems by preserving the features produced by the convolution layers and passing them on to subsequent layers. This kind of connection between layers allows convolutional networks to be made deeper with less gradient loss. In addition to mitigating the vanishing gradient problem, the proposed architecture reduces the number of parameters and the time needed to train a deep learning model. First, a training dataset was prepared from remote sensing images and, after preprocessing, the target objects were labeled. The proposed method was then defined as the feature extractor of the Faster R-CNN model and trained on the training data. To evaluate the proposed method, part of Beijing International Airport in China was selected as the first case study and part of Imam Khomeini International Airport as the second study area; the F1-measure values for the two areas are 97.9 and 93.7, respectively. Finally, the results of the proposed model were compared with those of several well-known existing network models. The results indicate that the proposed method is reliable and effective.
Keywords	deep learning, convolutional networks, target detection in remote sensing, feature extractor
Address	K. N. Toosi University of Technology, Faculty of Surveying Engineering, Iran; Babol Noshirvani University of Technology, Faculty of Civil Engineering, Iran; K. N. Toosi University of Technology, Faculty of Surveying Engineering, Iran
Email	ebadi@kntu.ac.ir

Development of a Model Based on Gradient Resonance in Deep Convolutional Networks to Identify Targets in Remote Sensing Images

Authors	Farhadi N., Kiani A., Ebadi H.
Abstract	Advances in remote sensing technologies provide a wide range of information for object detection, and this information makes optical remote sensing images easier to interpret. One class of such interpretation tasks is object detection, where most current research relies on neural networks and deep learning techniques; the design of the network strongly affects detection accuracy. Recent research on deep learning and convolutional neural networks shows that deeper networks can achieve better accuracy. However, networks that are too deep can cause other problems, such as a growing number of trainable parameters, vanishing gradients, and extracted features that go unused, all of which reduce the network's accuracy in recognizing objects. Many studies on convolutional networks have raised this issue and tried to address it by examining different topologies or proposing new training methods. In this article, a model is developed that preserves the extracted features and passes them on to subsequent layers. The proposed architecture is a sequence of stacked blocks, each of which receives its input from the previous block and performs the relevant calculations. Each block consists of several cells, and each cell has two convolution layers. To use all the features of the training images efficiently, the filters in the convolution layers have kernel sizes of 1×1 and 3×3. In the combining stage, the output of the 3×3 layer is merged with the information from the previous layers. The architecture of each cell thus keeps all features extracted by previous layers available to subsequent cells. With these connections between layers, the network can be made deeper while suffering less from vanishing gradients.
In addition to mitigating the vanishing gradient problem, this architecture substantially reduces the number of trainable parameters and the duration of the training phase. The result is an improvement in the ability of existing models to distinguish objects of multiple classes. For this purpose, a collection of 320 training images was first prepared and preprocessed. The proposed method was then defined as the feature extractor of the Faster R-CNN model and trained on this image collection. To evaluate the proposed method, a part of Beijing International Airport and a part of Imam Khomeini International Airport were selected as the first and second case study areas. The F1-measure values for the two regions are 97.9 and 93.7, respectively, whereas a ResNet architecture with 101 convolution layers and 14.4 million more trainable parameters than the proposed architecture achieved 96.7 and 93 for the same criterion. Finally, the results of the proposed model were compared with those of several well-known existing network models. The experimental results indicate the reliability and efficiency of the proposed method. To improve the proposed architecture, dilated convolution operators could be used to extract more prominent features. Moreover, to develop and generalize the approach, the proposed method could be applied in two stages to high-resolution remote sensing images: the first stage would locate the airport, and the second would identify the planes inside each airport.
Keywords	deep learning, convolutional networks, remote sensing imagery, object detection, feature extractor, artificial intelligence
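
The cell structure described in the abstract (a 1×1 convolution followed by a 3×3 convolution whose output is combined with the features of all previous layers) can be illustrated with a small channel and parameter bookkeeping sketch. This is not the authors' implementation. It assumes that "combining" means channel-wise concatenation, and the names `Cell`, `conv_params`, and `block_plan`, as well as the values `growth=32` and `bottleneck_factor=4`, are hypothetical choices made only for illustration; the abstract does not state them.

```python
from dataclasses import dataclass


@dataclass
class Cell:
    in_channels: int   # channels entering the cell (all features kept so far)
    mid_channels: int  # channels produced by the 1x1 convolution
    new_channels: int  # channels produced by the 3x3 convolution
    params: int        # trainable weights and biases in this cell


def conv_params(in_ch: int, out_ch: int, kernel: int) -> int:
    """Weights plus biases of one 2D convolution layer."""
    return kernel * kernel * in_ch * out_ch + out_ch


def block_plan(in_channels: int, num_cells: int,
               growth: int = 32, bottleneck_factor: int = 4):
    """Plan one block of cells that keep and pass on all earlier features.

    Assumption (not stated in the abstract): each cell concatenates its
    3x3 output with its own input, so the channel count grows by `growth`
    per cell. `growth` and `bottleneck_factor` are hypothetical values.
    """
    cells = []
    channels = in_channels
    for _ in range(num_cells):
        mid = bottleneck_factor * growth
        # 1x1 convolution compresses the accumulated features,
        # 3x3 convolution extracts `growth` new feature maps.
        params = conv_params(channels, mid, 1) + conv_params(mid, growth, 3)
        cells.append(Cell(channels, mid, growth, params))
        # Concatenation: earlier features are preserved next to the new ones.
        channels += growth
    total_params = sum(cell.params for cell in cells)
    return cells, channels, total_params


cells, out_channels, total_params = block_plan(in_channels=64, num_cells=3)
for index, cell in enumerate(cells, start=1):
    print(f"cell {index}: in={cell.in_channels} "
          f"1x1->{cell.mid_channels} 3x3->{cell.new_channels} "
          f"params={cell.params}")
print(f"block output channels: {out_channels}, total params: {total_params}")
```

Under these assumptions, each 3×3 layer only ever produces a small, fixed number of new feature maps, which is one plausible reading of how such an architecture could stay deep while using fewer trainable parameters than a plain stack of wide layers.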