Investigating the effect of the multilayer perceptron on the accuracy of selecting silkworm (Bombyx mori) microRNA genes

Authors	Seyeddokht Atefe, Rahmaninia Javad
Source	Iranian Journal of Animal Science Research (پژوهشهاي علوم دامي ايران) - 1400 (Solar Hijri) - Volume: 13 - Issue: 4 - Pages: 615-627
Abstract	MicroRNAs are a large family of short non-protein-coding RNA (ncRNA) molecules with important roles in regulating developmental processes in plants and animals. Few studies have addressed the microRNAs of the economically important silkworm, and those have focused on identification, expression analysis, and function prediction. In general, microRNA sequences are highly conserved across species and are produced from a primary stem-loop structure in the nucleus, which is one of the most important features of microRNAs. MicroRNAs are among the most important regulatory factors acting at the post-transcriptional level of gene expression, and they take part in regulating many physiological processes such as growth and development, metabolism, and disease occurrence. Although thousands of microRNAs have been identified in different species, a great many remain unknown. Discovering new microRNA genes is therefore an important step toward understanding the microRNAs that mediate post-transcriptional regulatory mechanisms. Biological methods for identifying microRNA genes may be limited in detecting rare microRNAs, and they are largely restricted to the specific tissues and developmental stages of the organism under study. These limitations have driven the development of advanced computational methods for identifying new candidate microRNAs. Using computational methods will increase the accuracy of identifying silkworm microRNAs. In this study, several types of computational models were used to identify microRNA sequences. Their performance was evaluated using suitable data and extracted, effective biological features. Compared with the other models used in this work, the multilayer perceptron, which had the highest accuracy, F-measure, and Matthews correlation coefficient, was identified as a suitable method for predicting microRNA sequences in the silkworm.
Keywords	computational methods, regulatory factors, silkworm, microRNA
Address	Agricultural Research, Education and Extension Organization, Khorasan Razavi Agricultural and Natural Resources Research and Education Center, Animal Science Research Department, Iran; Agricultural Research, Education and Extension Organization, Animal Science Research Institute of Iran, Iran

A survey on effect of multilayer perceptron on the accuracy of selection of silkworm (Bombyx mori) microRNA genes

Authors	Seyeddokht Atefe, Rahmaninia Javad
Abstract	Introduction: MicroRNAs (miRNAs) constitute a large family of non-protein-coding small RNA (ncRNA) molecules and play important roles in regulating developmental processes in both plants and animals. In general, miRNA sequences are highly conserved across animals and are produced from a primary stem-loop structure in the nucleus, which is an important feature of miRNAs. MiRNAs are among the most important regulatory factors acting at the post-transcriptional level of gene expression, and they contribute to the modulation of many physiological processes such as development, metabolism, and disease occurrence. To date, only a few studies of miRNAs in the economically important silkworm, Bombyx mori, have been carried out, focusing on detection, expression analysis, and function prediction. Machine learning approaches, which can solve classification problems, are crucial for successful prediction.
Materials and Methods: Although hundreds of miRNAs have been detected in different animals, many are still unknown. Finding novel miRNA genes is therefore an essential step toward understanding miRNA-mediated post-transcriptional regulation. Biological methods for recognizing miRNA genes may be inadequate for identifying uncommon miRNAs and are further limited to the tissues surveyed and the developmental phase of the animal under study. These restrictions have led to the development of new computational methods for detecting potential miRNAs. Experimentally verified miRNA sequences in miRBase release 22.0 were extracted for the positive dataset. The secondary structures reported in miRBase were predicted with a collection of RNA folding software packages; for uniformity, all miRNA secondary structures in this study were therefore analyzed with the RNAfold package. The major step in machine learning approaches is the selection of a suitable negative dataset.
This selection is important for a well-trained classifier. If the negative sequences are too artificial, e.g. completely random sequences, there is a risk that the classifier will not learn to differentiate between different categories of real biological sequences. Conversely, if the negative dataset is too similar to the positive dataset, the classifier will be unable to differentiate adequately between the two. We investigated several types of negative sequences and selected those that were best distinguished from the positive dataset. The positive training dataset for classifier development was composed of known silkworm pre-miRNAs, while the negative training dataset was composed of other ncRNA sequences. Our feature set comprised various features; selecting the most discriminative subset increases the performance, efficiency, and comprehensibility of a classifier by reducing its complexity.
Results and Discussion: The secondary structural patterns of pre-miRNAs used in this study, such as intramolecular base pairing, are important and useful features for miRNA classification. The discriminative power of secondary structural conformation (in dot-bracket notation) for the two classes of sequences was analyzed. Secondary structural features such as minimum free energy, Watson-Crick base pairing (AU, GC), wobble base pairing (GU), and unpaired bases (A, G, C, U) were analyzed by different algorithms. We solved the classification problem by developing an effective classification system using machine learning techniques. Our approach includes introducing more representative datasets, extracting new effective biological features, and comprehensively evaluating classification performance via cross-validation.
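The base-pairing features named above (AU, GC, and GU pairs plus unpaired A, G, C, U) can be counted directly from a sequence and its dot-bracket string. The following is a minimal sketch and not the authors' pipeline: the function name `structure_features`, the feature key names, and the toy sequence are all hypothetical, and the minimum free energy is assumed to come from a separate RNAfold run rather than being computed here.

```python
# Illustrative sketch only: names and the toy input are assumptions,
# not taken from the paper.
PAIR_LABELS = {
    frozenset("AU"): "AU",  # Watson-Crick pair
    frozenset("GC"): "GC",  # Watson-Crick pair
    frozenset("GU"): "GU",  # wobble pair
}


def structure_features(sequence: str, dot_bracket: str) -> dict:
    """Count base-pair types and unpaired bases for one folded sequence."""
    if len(sequence) != len(dot_bracket):
        raise ValueError("sequence and structure must have the same length")
    seq = sequence.upper().replace("T", "U")
    counts = {"AU": 0, "GC": 0, "GU": 0, "other_pairs": 0,
              "unpaired_A": 0, "unpaired_G": 0,
              "unpaired_C": 0, "unpaired_U": 0}
    open_positions = []  # stack of indices of unmatched "("
    for i, symbol in enumerate(dot_bracket):
        if symbol == "(":
            open_positions.append(i)
        elif symbol == ")":
            if not open_positions:
                raise ValueError("unbalanced structure: extra ')'")
            j = open_positions.pop()  # partner of position i
            label = PAIR_LABELS.get(frozenset((seq[j], seq[i])), "other_pairs")
            counts[label] += 1
        elif symbol == ".":
            key = "unpaired_" + seq[i]
            counts[key] = counts.get(key, 0) + 1
        else:
            raise ValueError(f"unexpected structure symbol: {symbol!r}")
    if open_positions:
        raise ValueError("unbalanced structure: extra '('")
    return counts


# Toy hairpin (hypothetical data): three stacked pairs around a 3-base loop.
features = structure_features("GAGACUCUU", "(((...)))")
```

A feature vector for a classifier would typically combine these counts with the minimum free energy and, if desired, normalize them by sequence length; that normalization is a design choice and is not stated in the abstract.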
The performance of the different algorithms was measured by the numbers of true negatives (TN), true positives (TP), false positives (FP), and false negatives (FN), and by accuracy (Q). To evaluate the methods developed in this study, the F-measure, Matthews correlation coefficient (MCC), accuracy (Q), and ROC area were calculated. The models were tested on data from miRBase release 22 using ten-fold cross-validation. The multilayer perceptron model could distinguish pre-miRNAs from other noncoding sequences, which can be important for detecting true pre-miRNAs in genomic sequences. A new miRNA prediction model could therefore help in understanding the characteristics of miRNAs associated with miRNA biogenesis.
Conclusion: Research on miRNAs represents important progress in the study of ncRNAs and may provide further information for understanding RNA regulation networks. Practical research on silkworm microRNAs has shown that they can have significant effects on the mechanisms underlying silkworm growth. Together with the research done so far, this work provides a basis for improving our understanding of RNA regulatory networks and the molecular mechanisms involved in gene expression patterns during different stages of silkworm life. Because computational research on silkworm microRNAs is scarce, further research on the microRNAs of this species represents an important advance in the study of noncoding RNAs and can provide further information on their activity. Machine learning algorithms will help researchers discover miRNAs that have so far gone undetected.
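The evaluation measures named in the abstract (accuracy Q, F-measure, and MCC) can all be derived from the four confusion-matrix counts. The sketch below uses their standard definitions; it assumes the F-measure is the balanced F1 score, which the abstract does not specify, and the function name `classification_metrics` and the example counts are hypothetical and are not results from the study.

```python
import math


def classification_metrics(tp: int, tn: int, fp: int, fn: int) -> dict:
    """Accuracy, F-measure (assumed F1) and MCC from confusion-matrix counts."""
    total = tp + tn + fp + fn
    accuracy = (tp + tn) / total if total else 0.0
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    # F1 is the harmonic mean of precision and recall.
    if precision + recall:
        f_measure = 2 * precision * recall / (precision + recall)
    else:
        f_measure = 0.0
    # MCC is conventionally set to 0 when any marginal total is zero.
    denominator = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    mcc = (tp * tn - fp * fn) / denominator if denominator else 0.0
    return {"accuracy": accuracy, "f_measure": f_measure, "mcc": mcc}


# Hypothetical counts for one cross-validation fold (not from the paper).
scores = classification_metrics(tp=90, tn=85, fp=15, fn=10)
```

In ten-fold cross-validation these measures would be computed on each held-out fold, or on the pooled predictions, and then summarized; which of the two the authors used is not stated in the abstract.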
Keywords