Image Saliency Detection Using Compressive Sampling in the Wavelet Domain




Authors	Banitalebi-Dehkordi Mehdi, Ebrahimi-moghadam Abbas, Khademi Morteza, Hadizadeh Hadi
Source	Signal and Data Processing - 1398 (Solar Hijri) - Volume: - Issue: 4 - Pages: 59-72
Abstract	Researchers today make extensive use of models of human visual attention across many fields. The methods proposed for this purpose extract two-dimensional maps, known as saliency maps, in which the value at each point indicates how strongly the corresponding point in the image attracts the viewer's attention. In this paper, to obtain the saliency map, random samples are selected from the wavelet coefficients of the image, based on the compressive sampling technique. Feature maps are then generated from the selected samples. From these feature maps, a local saliency map and a global saliency map are computed. Finally, the final saliency map is computed as a linear combination of the local and global saliency maps. Experimental evaluations show promising results for the proposed method compared with other saliency detection models, both in detecting salient regions and in reducing computational load.
Keywords	Saliency map, visual attention, wavelet transform, sparsity, compressive sampling
Address	Ferdowsi University of Mashhad, Faculty of Engineering, Department of Electrical Engineering, Iran, Ferdowsi University of Mashhad, Faculty of Engineering, Department of Electrical Engineering, Iran, Ferdowsi University of Mashhad, Faculty of Engineering, Department of Electrical Engineering, Iran, Quchan University of Technology, Faculty of Electrical and Computer Engineering, Iran
Email	h.hadizadeh@qiet.ac.ir
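
The pipeline summarized in the abstract above (random sampling of wavelet coefficients, feature maps, local and global saliency maps, linear combination) might be sketched as follows. This is a minimal illustration, not the authors' implementation. Everything specific in it is an assumption made for the example: a single-level Haar transform on a grayscale image, detail-coefficient energy as the feature, a 3x3 center-surround difference for the local map, deviation from the global mean for the global map, and the hypothetical parameters `rate`, `alpha`, and `seed`.

```python
import random

def haar_details(img):
    """One-level 2D Haar transform; returns the three detail sub-bands.
    img is a list of equal-length rows with even height and width."""
    bands = ([], [], [])
    for r in range(0, len(img), 2):
        rows = ([], [], [])
        for c in range(0, len(img[0]), 2):
            a, b = img[r][c], img[r][c + 1]
            d, e = img[r + 1][c], img[r + 1][c + 1]
            rows[0].append((a + b - d - e) / 2.0)  # top row vs. bottom row
            rows[1].append((a - b + d - e) / 2.0)  # left column vs. right column
            rows[2].append((a - b - d + e) / 2.0)  # diagonal difference
        for band, row in zip(bands, rows):
            band.append(row)
    return bands

def random_sample(bands, rate, seed=0):
    """Keep a random fraction `rate` of all detail coefficients; zero the rest."""
    h, w = len(bands[0]), len(bands[0][0])
    positions = [(k, i, j) for k in range(3) for i in range(h) for j in range(w)]
    keep = set(random.Random(seed).sample(positions, round(rate * len(positions))))
    return [[[bands[k][i][j] if (k, i, j) in keep else 0.0 for j in range(w)]
             for i in range(h)] for k in range(3)]

def feature_map(sampled):
    """Assumed feature: energy of the sampled detail coefficients per location."""
    h, w = len(sampled[0]), len(sampled[0][0])
    return [[sum(band[i][j] ** 2 for band in sampled) for j in range(w)]
            for i in range(h)]

def normalize(m):
    """Min-max scale a map to [0, 1]; a constant map becomes all zeros."""
    lo = min(min(row) for row in m)
    hi = max(max(row) for row in m)
    if hi == lo:
        return [[0.0] * len(row) for row in m]
    return [[(v - lo) / (hi - lo) for v in row] for row in m]

def local_saliency(f):
    """Assumed local map: |value - mean of its 3x3 neighborhood|."""
    h, w = len(f), len(f[0])
    out = []
    for i in range(h):
        row = []
        for j in range(w):
            nb = [f[y][x] for y in range(max(0, i - 1), min(h, i + 2))
                  for x in range(max(0, j - 1), min(w, j + 2))]
            row.append(abs(f[i][j] - sum(nb) / len(nb)))
        out.append(row)
    return normalize(out)

def global_saliency(f):
    """Assumed global map: |value - mean of the whole feature map|."""
    mean = sum(map(sum, f)) / (len(f) * len(f[0]))
    return normalize([[abs(v - mean) for v in row] for row in f])

def saliency(img, rate=0.5, alpha=0.5, seed=0):
    """Linear combination of the local and global maps, weighted by alpha."""
    f = feature_map(random_sample(haar_details(img), rate, seed))
    loc, glo = local_saliency(f), global_saliency(f)
    return [[alpha * l + (1 - alpha) * g for l, g in zip(rl, rg)]
            for rl, rg in zip(loc, glo)]

# Usage: a flat 8x8 image with one bright pixel; rate=1.0 keeps every coefficient.
img = [[0.0] * 8 for _ in range(8)]
img[3][3] = 1.0
s = saliency(img, rate=1.0)
```

With `rate=1.0` the result is deterministic and the half-resolution map peaks at the cell covering the bright pixel. Lower rates trade accuracy for fewer coefficients to process, which is the source of the computational saving the abstract describes.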

Compressed-Sampling-Based Image Saliency Detection in the Wavelet Domain

Authors	Banitalebi-Dehkordi Mehdi, Ebrahimi-moghadam Abbas, Khademi Morteza, Hadizadeh Hadi
Abstract	When we watch natural scenes, an overwhelming amount of information is delivered to the Human Visual System (HVS). The optic nerve is estimated to receive around 10^8 bits of information per second. This large amount of information cannot be processed all at once by our neural system. The visual attention mechanism enables the HVS to spend neural resources efficiently, only on selected parts of the scene in turn. This results in better and faster perception of events. To measure saliency on visual data, subjective eye-tracking experiments may be carried out. These experiments use devices to track the eye movements of a number of subjects while they watch images or videos on a screen. However, such devices are not very practical because of the difficulties involved in carrying out the experiments, such as the need for a restricted test environment and the time and expense required. Instead, researchers have developed computational Visual Attention Models (VAMs) in an attempt to mimic the saliency prediction process of the HVS. Visual attention modelling has been widely used in various areas of image processing and understanding. Computational models of visual attention aim to predict the areas of an image that are most interesting to observers. To this end, these models produce saliency maps, in which each pixel is assigned a likelihood of being looked at. In other words, saliency maps highlight where viewers are most likely to look in an image. Knowing the Regions of Interest (ROIs) can be helpful in applications such as image and video compression, object recognition and detection, visual search, retargeting, retrieval, image matching, and segmentation. Saliency prediction is generally done in a bottom-up, top-down, or hybrid fashion. Bottom-up approaches exploit low-level attributes such as brightness, color, edges, and texture.
Top-down approaches focus on context-dependent information from the scene, such as the appearance of humans, animals, or text. Hybrid methods combine the two streams. This paper proposes a new method of saliency prediction using sparse wavelet coefficients selected from low-level bottom-up saliency features. Wavelet-based methods are widely used in image processing algorithms, as they are especially powerful in decomposing images into several scales of resolution. In our method, random compressive sampling is first performed on wavelet coefficients in the Lab color space. Random sampling reduces computational complexity and provides a sparse representation of the coefficients. The number of decomposition levels is chosen based on the information diffusion property of the signal. In the proposed method, sampling can be done at a rate different from the Nyquist rate, based on the sparsity degree of the signal. It is shown that having the basis vectors of a sparse representation of the signal can result in accurate signal reconstruction. In this work, the sparsity degree, and thus the sampling rate, is determined empirically. Next, local and global saliency maps are generated from these random samples to account for small-scale and large-scale (scene-wide) saliency attributes. These maps are then combined to form an overall saliency map, which therefore includes both local and global saliency attributes. The main contribution of this paper is the use of compressive sampling to create a novel wavelet-domain representation for image saliency prediction. Extensive performance evaluations show that the proposed method provides promising saliency prediction performance while the computational complexity remains reasonable, thanks to the dimensionality reduction of compressive sampling.
In particular, the proposed method demonstrated favorable precision, recall, and F-measure when compared to state-of-the-art saliency detection methods over large-scale datasets. We hope the proposed approach brings ideas to the saliency analysis research community.
Keywords	Saliency map, visual attention, wavelet transform, sparsity, compressive sampling
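
The abstract reports precision, recall, and F-measure without stating how they are computed. A common way to score a saliency map against a binary ground-truth mask is sketched below. The threshold, the `beta2` weighting, and the function name `precision_recall_f` are assumptions made for illustration and are not taken from the paper.

```python
def precision_recall_f(saliency, truth, threshold=0.5, beta2=1.0):
    """Binarize a saliency map at `threshold` and score it against a 0/1 mask.

    beta2 is the squared beta of the F-measure; beta2=1.0 gives the plain F1
    score (an assumption; the paper's weighting is not given in the abstract).
    """
    tp = fp = fn = 0
    for s_row, t_row in zip(saliency, truth):
        for s, t in zip(s_row, t_row):
            predicted = s >= threshold
            if predicted and t:
                tp += 1          # salient pixel correctly detected
            elif predicted:
                fp += 1          # detected, but not salient in the mask
            elif t:
                fn += 1          # salient in the mask, but missed
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    denom = beta2 * precision + recall
    f = (1 + beta2) * precision * recall / denom if denom else 0.0
    return precision, recall, f

# Usage: three pixels exceed the threshold, two of them are truly salient.
p, r, f = precision_recall_f([[0.9, 0.8], [0.1, 0.6]], [[1, 0], [0, 1]])
```

In this toy case precision is 2/3, recall is 1, and the F1 score is approximately 0.8.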