روشی کارا بر پایه ترکیب مدل‌های یادگیری ژرف برای تجزیه ‌و تحلیل احساسات در متون

Fa | Ar | En

روشی کارا بر پایه ترکیب مدل‌های یادگیری ژرف برای تجزیه ‌و تحلیل احساسات در متون


نویسنده	صدر حسین ,پدرام محسن ,تشنه لب محمد
منبع	پردازش علائم و داده ها - 1401 - شماره : 1 - صفحه:19 -38
چکیده	یکی از مهم‌ترین داده‌های متنی موجود در سطح وب احساسات و دید‌گاه‌‌های افراد نسبت به یک موضوع یا مفهوم مشخص است. با این حال، یافتن و نظارت بر وبگاه‌های حاوی این احساسات و استخراج اطلاعات موردنیاز از آن‌ها به‌علت گسترش وبگاه‌های گوناگون کاری دشوار محسوب می‌شود. در این راستا، توسعه سامانه‌های تجزیه ‌و تحلیل خودکار احساسات که بتواند نظرات را استخراج کرده و روند فکری مرتبط با آن‌ها را بیان کند، در سال‌های اخیر توجه زیادی را به خود جلب کرده است و روش‌های بر پایه یادگیری ژرف، یکی از راه‌کارهایی هستند که توانسته‌ا‌ند به نتایج چشم‌گیری در کاربردهای مختلف پردازش زبان‌های طبیعی به‌خصوص تجزیه ‌و تحلیل احساسات دست یابند؛ اما این روش‌ها برخلاف عملکرد قابل‌توجه هنوز با چالش‌هایی مواجه هستند و نیاز به پیشرفت در این حوزه همچنان وجود دارد؛ ازاین‌رو، هدف این مقاله ترکیب مدل‌های یادگیری ژرف به‌منظور ارائه یک روش جدید برای تجزیه ‌و تحلیل احساسات متنی است که بتواند ضمن استفاده هم‌زمان از مزایای شبکه‌های عصبی ژرف بر مشکلات آن‌ها چیره شود. در این راستا، در این مقاله روشی بر پایه ترکیب شبکه عصبی پیچشی و شبکه عصبی هم‌گشتی معرفی‌ شده است که در آن به‌منظور حفظ وابستگی‌های بلندمدت در جملات و کاهش از‌دست‌رفتن داده‌های محلی که به‌عنوان چالش‌های شبکه عصبی پیچشی به شمار‌ می‌آیند، از لایه هم‌گشتی تعمیم‌یافته که در آن از یک ویژگی میانی حاصل از ترکیب گره‌های فرزندان استفاده می‌شود، به‌عنوان جایگزین لایه ادغام در شبکه عصبی پیچشی بر پایه ساز‌و‌کار توجه استفاده شده است. بر اساس نتایج آزمایش‌ها، روش پیشنهادی به‌ترتیب با دقت 53.92 و 92.89 درصد روی مجموعه‌داده‌های sst1 و sst2 و دارای دقت بالاتری نسبت به سایر روش‌های موجود است.
کلیدواژه	تجزیه ‌و تحلیل احساسات، یادگیری ژرف، شبکه عصبی پیچشی، شبکه عصبی هم‌گشتی، ساز‌و‌کار توجه
آدرس	موسسه آموزش عالی راهبرد شمال, دانشکده فنی و مهندسی, گروه مهندسی کامپیوتر, ایران, دانشگاه خوارزمی, دانشکده فنی و مهندسی, گروه مهندسی برق و کامپیوتر, ایران, دانشگاه صنعتی خواجه نصیر طوسی, دانشکده مهندسی برق, گروه سیستمها و کنترل, ایران
پست الکترونیکی	teshnehlab@eetd.kntu.ac.ir

Efficient Method Based on Combination of Deep Learning Models for Sentiment Analysis of Text

Authors	Sadr Hossein ,Pedram Mir mohsen ,Teshnehlab Mohammad
Abstract	People #39;s opinions about a specific concept are considered as one of the most important textual data that are available on the web. However, finding and monitoring web pages containing these comments and extracting valuable information from them is very difficult. In this regard, developing automatic sentiment analysis systems that can extract opinions and express their intellectual process has attracted considerable attention in recent years. Sentiment analysis is considered as one of the most active research areas in the field of natural language processing which tries to classify a piece of text containing opinions based on its polarity and determine whether an expressed opinion about a specific topic, event or product is positive or negative. Since about a decade ago, many studies have been carried out to investigate the effects of traditional classification models, such as Support Vector Machine (SVM), Na iuml;ve Bayes, Logistic Regression, etc. in the task of sentiment analysis. Although machine learning models have achieved great success in this filed, they are still confronted with some limitations, notably manual feature engineering requirements. In other words, the classification performance of machine learning models is highly dependent on the extracted features and they play an important role in obtaining higher classification accuracy. To deal with these problems, deep learning models have been extensively employed as an alternative to traditional machine learning models and have achieved impressive results. It is worth mentioning that despite the remarkable performance of these methods, they are still confronted with some limitations and they are on their first steps of progress. Therefore, the goal of this paper is to propose a combinational deep learning model that can overcome their problems as well as utilizing their benefits. In this regard, an efficient method based on combination of convolutional and recursive neural networks is proposed in this paper that employs a generalized recursive neural network, where an intermediate feature is obtained by combining children #39;s nodes, as an alternative of pooling layer in attentionbased convolutional neural network with the aim of capturing long term dependencies and decreasing the loss of local information. Based on empirical results, the proposed method with the accuracy of 53.92% and 92.89% respectively on SST1 and SST2 datasets not only outperforms other existing models but also can be trained much faster.
Keywords	Sentiment analysis ,Deep Leaning ,Convolutional neural network ,Recursive neural network ,Attention mechanism