Evaluating the Efficiency of Contextual Neural Gas Networks in Clustering of Isfahan's Census Blocks Based on Sustainable Urban Development Variables




Authors	Tarigholizadeh Hadi, Mirbagheri Babak, Matkan Ali Akbar
Source	Geographical Urban Planning Research - 1402 - Volume: 11 - Issue: 4 - Pages: 91-109
Abstract	Clustering large datasets reveals structures and identifies groupings; its main goal is to separate data into clusters with similar characteristics. Artificial neural networks are a standard tool for clustering large, multidimensional data. The aim of this research is to cluster census block data comprising 21 socio-economic and service-accessibility variables related to sustainable urban development, first with the neural gas network without spatial parameters and then with the geometric centers of the census blocks used as the spatial parameter in the clustering process, and to compare the results. The neural gas (NG) algorithm is the most common network for clustering high-dimensional data, and contextual neural gas (CNG) is the spatial version of this algorithm. In this study, the census blocks of Isfahan city were clustered by training these two algorithms on the selected variables. The results indicate a considerable difference between the clusters produced by the two algorithms. Clustering with the NG algorithm yields heterogeneous clusters, whereas the CNG algorithm, because it uses spatial parameters, produces homogeneous clusters. Clustering quality was evaluated by computing the average silhouette coefficient for the census blocks: the CNG algorithm, with an average silhouette coefficient of 0.29, performs better than the NG algorithm, with an average silhouette coefficient of -0.02. These results indicate the positive effect of spatial parameters on creating homogeneous clusters in the urban environment. Clustering urban census blocks using variables related to sustainable development and a location-based approach with the CNG algorithm is among the innovations of this research.
Keywords	Contextual neural gas, clustering, spatial data, sustainable development, census blocks, Isfahan city
Address	Shahid Beheshti University, Center for Remote Sensing and GIS Studies, Faculty of Earth Sciences, Iran; Shahid Beheshti University, Center for Remote Sensing and GIS Studies, Faculty of Earth Sciences, Iran; Shahid Beheshti University, Center for Remote Sensing and GIS Studies, Faculty of Earth Sciences, Iran
Email	a-matkan@sbu.ac.ir

Evaluating the Efficiency of Contextual Neural Gas Networks in Clustering of Isfahan's Census Blocks Based on Sustainable Urban Development Variables

Authors	Tarigholizadeh Hadi, Mirbagheri Babak, Matkan Ali Akbar
Abstract	Clustering is a vital technique for revealing structures and discerning groupings within large datasets, particularly in spatial data analysis, where the primary objective is to separate data into clusters with shared characteristics. Artificial neural networks are established tools for clustering large, multidimensional datasets. This research clusters census block data comprising 21 variables that cover socio-economic conditions and access to services relevant to sustainable urban development. The study first applies the neural gas (NG) network without spatial parameters, then introduces the geographic coordinates of the census blocks as spatial parameters and compares the outcomes of the two approaches (NG and CNG). The NG algorithm, a prevalent choice for clustering high-dimensional data, and its spatially enhanced version, the contextual neural gas (CNG) algorithm, were used to cluster the census blocks of Isfahan city. The results show a notable difference between the clusters produced by the two algorithms: NG yielded heterogeneous clusters, whereas CNG, benefiting from spatial parameters, produced homogeneous clusters. Clustering quality, evaluated by the average silhouette coefficient over the census blocks, was higher for CNG (0.29) than for NG (-0.02). This research affirms the positive impact of spatial parameters on creating homogeneous clusters within the urban environment. Extracting homogeneous areas with the CNG algorithm on the basis of sustainable development variables contributes to streamlined urban planning and management.
The clustering of census blocks using variables related to sustainable urban development and a location-based approach with the CNG algorithm is one of the innovations of this research.
Extended Abstract
Introduction
In recent years, the volume of available spatial data has increased dramatically. Extracting meaningful insights therefore requires a comprehensive assessment of spatial data that considers the distinctive characteristics of each location. Given the abundance and diversity of urban spatial data, the primary challenge is to represent the knowledge derived from these data effectively and to clarify the relationships between the data and their locations across the variables studied. Spatial data mining employs artificial neural networks (ANN) to uncover patterns and unknown relationships within data, transforming this information into new and potentially valuable knowledge. Clustering, a pivotal part of unsupervised machine learning, is an effective method for extracting knowledge from spatial data; it aims to separate data into clusters with similar characteristics. Clustering algorithms for spatial data differ fundamentally from those used for non-spatial data. This study clusters the census blocks of Isfahan city based on sustainable development data, comprising socioeconomic information and access to services. The process employs the contextual neural gas (CNG) algorithm, and the results are compared with those obtained from the neural gas (NG) algorithm. This comparison sheds light on how well the two algorithms cluster spatial data and extract meaningful insights related to sustainable development in the urban fabric.
Methodology
This study used data from the Isfahan census blocks (2015), compiled by the Statistical Centre of Iran, together with information on medical-emergency, cultural-educational, and transportation service points provided by Isfahan Municipality. The dataset covers 13,361 census blocks; 21 socioeconomic variables and indicators of access to urban services associated with sustainable urban development were used for clustering. Both the neural gas (NG) and contextual neural gas (CNG) algorithms were applied to the census block data, and their outcomes were compared.
The neural gas network is a competitive neural network trained without supervision that is suited to clustering problems and topology learning. In the NG algorithm, neurons have no neighborhood connections and distribute themselves dynamically in the input space during training, much like the molecules of a physical gas. At each training step an input vector is selected and every neuron moves toward it; the size of the move depends on the neuron's rank, its distance to the input vector, the learning rate, and the neighborhood range. NG has no predefined topology representing relationships between neurons; topology learning is carried out through competitive Hebbian learning in a post-processing step.
The contextual neural gas network (CNG), an extension of the NG algorithm, integrates the spatial characteristics of the input vectors into the clustering process. Neuron adaptation is the same in NG and CNG; the two differ in how the rank order is defined. CNG accounts for spatial autocorrelation between observations and neurons by using a spatial ordering. Because CNG has no topologically ordered network, a two-step procedure that incorporates spatial autocorrelation is used to determine the rank order.
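The rank-based adaptation and the two-step CNG ordering described above can be illustrated with a small sketch. It is not the authors' implementation: it assumes the form of CNG commonly described in the literature, in which neurons are first ordered by spatial distance and only the `l` spatially closest neurons are then re-ordered by attribute distance. The parameter `l`, the function names, the layout of each vector (spatial coordinates first, attributes after) and the exponential decay schedules are all assumptions made for illustration.

```python
import math
import random


def euclid(a, b):
    """Euclidean distance between two equal-length sequences."""
    return math.sqrt(sum((p - q) ** 2 for p, q in zip(a, b)))


def ng_ranking(neurons, x):
    """Plain NG: order neuron indices by distance to x over the whole vector."""
    return sorted(range(len(neurons)), key=lambda i: euclid(neurons[i], x))


def cng_ranking(neurons, x, n_spatial, l):
    """Assumed two-step CNG ordering: rank all neurons by spatial distance,
    then re-rank only the l spatially closest ones by attribute distance."""
    by_space = sorted(
        range(len(neurons)),
        key=lambda i: euclid(neurons[i][:n_spatial], x[:n_spatial]),
    )
    head = sorted(
        by_space[:l],
        key=lambda i: euclid(neurons[i][n_spatial:], x[n_spatial:]),
    )
    return head + by_space[l:]


def adapt(neurons, x, order, eps, lam):
    """Move every neuron toward x; the step shrinks exponentially with rank."""
    for rank, i in enumerate(order):
        h = math.exp(-rank / lam)
        neurons[i] = [w + eps * h * (v - w) for w, v in zip(neurons[i], x)]


def train(data, n_neurons, n_spatial=0, l=1, epochs=20, seed=0):
    """Train NG (n_spatial == 0) or CNG (n_spatial > 0) on a list of vectors."""
    rng = random.Random(seed)
    neurons = [list(v) for v in rng.sample(data, n_neurons)]
    eps_start, eps_end = 0.5, 0.01          # assumed learning-rate schedule
    lam_start, lam_end = n_neurons / 2, 0.01  # assumed neighborhood-range schedule
    t, t_max = 0, epochs * len(data)
    for _ in range(epochs):
        for x in rng.sample(data, len(data)):  # shuffled pass over the data
            frac = t / t_max
            eps = eps_start * (eps_end / eps_start) ** frac
            lam = lam_start * (lam_end / lam_start) ** frac
            if n_spatial:
                order = cng_ranking(neurons, x, n_spatial, l)
            else:
                order = ng_ranking(neurons, x)
            adapt(neurons, x, order, eps, lam)
            t += 1
    return neurons
```

In this sketch, `l = 1` makes the ordering purely spatial, while setting `l` to the number of neurons ranks by attribute distance alone, so `l` acts as the knob for how strongly spatial proximity constrains the clusters.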
The silhouette coefficient was used to evaluate the clustering results. This coefficient, which can be computed for each sample, each cluster, and the entire dataset, measures similarity within clusters and dissimilarity between clusters. Overall clustering quality was assessed with the average silhouette coefficient over the entire dataset, giving a single measure for comparing the NG and CNG algorithms on the Isfahan census blocks.
Results and Discussion
The outcomes reveal a fundamental distinction between the two algorithms, rooted mainly in how they map input vectors onto network neurons, which leads to different assignments of blocks to clusters. The NG algorithm maps input vectors using a distance criterion, yielding intertwined and heterogeneous clusters. Comparing the graph networks of the clustered census blocks produced by the two algorithms reveals clear differences. The CNG algorithm, with an average silhouette coefficient of 0.29, clusters better than the NG algorithm, whose average silhouette coefficient is -0.02. This underlines the stronger ability of the CNG algorithm to form cohesive and meaningful clusters from socioeconomic and service-access data related to sustainable development in Isfahan city.
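The silhouette evaluation mentioned above can be sketched as follows. For each sample, a is the mean distance to the other members of its own cluster, b is the smallest mean distance to the members of any other cluster, and s = (b - a) / max(a, b). The abstract does not say which distance or which feature space the authors used, so Euclidean distance over the supplied vectors is an assumption here, as are the function names.

```python
import math


def euclid(a, b):
    """Euclidean distance between two equal-length sequences."""
    return math.sqrt(sum((p - q) ** 2 for p, q in zip(a, b)))


def silhouette_samples(points, labels):
    """Return s(i) = (b - a) / max(a, b) for every point.

    Requires at least two distinct cluster labels.
    """
    clusters = {}
    for idx, lab in enumerate(labels):
        clusters.setdefault(lab, []).append(idx)

    scores = []
    for i, lab in enumerate(labels):
        own = [j for j in clusters[lab] if j != i]
        if not own:
            # Singleton cluster: the coefficient is set to 0 by convention.
            scores.append(0.0)
            continue
        # a: mean distance to the other members of the same cluster.
        a = sum(euclid(points[i], points[j]) for j in own) / len(own)
        # b: smallest mean distance to the members of any other cluster.
        b = min(
            sum(euclid(points[i], points[j]) for j in members) / len(members)
            for other, members in clusters.items()
            if other != lab
        )
        denom = max(a, b)
        scores.append((b - a) / denom if denom > 0 else 0.0)
    return scores


def mean_silhouette(points, labels):
    """Average silhouette coefficient over the whole dataset."""
    scores = silhouette_samples(points, labels)
    return sum(scores) / len(scores)
```

Values near 1 indicate compact, well-separated clusters, values near 0 indicate overlapping clusters, and negative values indicate that many samples sit closer to another cluster than to their own. This pure-Python version is quadratic in the number of samples, so for a dataset the size of the one reported here (13,361 blocks) a vectorized library routine would be the practical choice.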
Keywords	Contextual neural gas, clustering, spatial data, sustainable development, census blocks, Isfahan city