>
Fa   |   Ar   |   En
   فراتحلیل داده‌های بیانی rna-seq و microarray برای شناسایی ژن‌های موثر در رشد و نمو عضله گوسفند  
   
نویسنده محمدی فهیمه ,طهمورث پور مجتبی ,جوادمنش علی
منبع پژوهشهاي علوم دامي ايران - 1402 - دوره : 15 - شماره : 4 - صفحه:557 -569
چکیده    امروزه می توان با روش های بیوانفورماتیکی داده های حاصل از مطالعات و پلتفرم‌های مختلف را ادغام و از آن ها بهره برد. در این مطالعه، با ادغام داده های ریزآرایه و rna-seq بافت عضله گوسفند نژاد تکسل موجود در پایگاه داده به مقایسه پروفایل ترنسکریپتومی عضله در دو مقطع سنی جنینی و بلوغ پرداخته شد. برای محاسبه مقادیر بیانی عضله ریزآرایه مربوط به دوران جنینی از بسته های نرم افزاری limma، biobase و geoquery در محیط r و برای محاسبه مقادیر بیانی عضله rna-seq از پروتکل tuxedo و بسته نرم افزاری htseq در محیط لینوکس و بسته نرم افزاری deseq2 در محیط r استفاده شد. سپس دو نوع مقادیر بیانی ادغام شدند. نتایج نشان داد، در بافت عضله بین مقطع سنی بلوغ و جنینی بیان 62 ژن (37 ژن افزایش و 25 ژن کاهش بیان) اختلاف معناداری داشتند. با رسم شبکه ژنی بین ژن های افتراقی، 15 ژن منتخب myh1، actn3، casq1، tmod4، fbp2، slc2a4، mx1، cox4i1، sod2، mfn2،uqcrb، ucp3، prkab2، phkg2، ppp1r3c شناسایی شدند. عملکرد این ژن ها در تکثیر سلولی، تشکیل میوفیبریل ها و متابولیسم های چربی زایی ثابت شده است. آنالیز هستی شناسی ژن های افتراقی نقش برخی از این ژن ها مثل actn3 وcasq1 را در فرآیندهای زیستی مثل توسعه سلول عضلانی مخطط، مسیرهای علامت‌دهی calcineurin-nfat و jak-stat آشکار کرد. این مطالعه علاوه‌بر تایید روش ادغامی داده های ناهمگن، دیدی کلی از تفاوت‌های ترنسکریپتومی بافت عضله گوسفند تکسل در دو مقطع مهم سنی را فراهم آورد تا ژن های منتخب معرفی شده منبع مفیدی برای بررسی‌های زیستی ژن‌های مربوط به رشد و نمو عضله باشد.
کلیدواژه ادغام داده‌های ناهمگن، بیان افتراقی ژن‌ها، شبکه ژنی، هستی‌شناسی ژن‌ها
آدرس دانشگاه فردوسی مشهد, دانشکده کشاورزی, گروه علوم دامی, ایران, دانشگاه فردوسی مشهد, دانشکده کشاورزی, گروه علوم دامی, ایران, دانشگاه فردوسی مشهد, دانشکده کشاورزی, گروه علوم دامی, ایران
پست الکترونیکی javadmanesh@um.ac.ir
 
   meta-analysis of rna-seq and microarray expression data to identify genes effective in sheep muscle growth and development  
   
Authors mohammadi fahime ,tahmoorespur mojtaba ,javadmanesh ali
Abstract    introduction: among different sheep breeds in the world, the texel breed is known as a meaty and muscular breed. skeletal muscle growth is a step-by-step and exponential process from differentiation, development and maturation, which is regulated by gene networks and cell signaling pathways, and several genes and factors are involved in the process of muscle fiber formation and their growth and hypertrophy (badday betti et al. 2022). the study of gene expression is done with several methods, and this gene expression information is used in breeding programs as a tool to improve phenotypic choices. databases are a large source of expression data that can be used by bioinformatics methods to integrate heterogeneous data from different studies and platforms. in this study, by integrating the microarray and rna-seq data available in the database belonging to the muscle tissue of texel breed sheep, the transcriptomic profile of the muscle was compared at two ages of embryonic and adult.materials and methods: microarray data related to longissimus dorsi muscle tissue with three replicates d-70 embryos from geo database with accession number gse23563 and rna-seq data related to muscle tissue from six samples with two replicates from adult individuals from arrayexpress database were selected. limma, biobase and geoquery software packages were used to calculate the expression values of the microarray data related to the embryonic age  in the r environment, and tuxedo, htseq and deseq2 packages were used in the linux and r environment to calculate the expression values of the rna-seq data (kamali et al. 2022; sahraei et al. 2019). then two types of expression values were integrated and to eliminate non-biological effects, the batch effects were also removed. next, differential genes were identified with the limma software package. in order to identify the relationship between the identified differential genes, the gene network was drawn between them by software of cytoscape version 3.7.1 and string 1.5.1 program. next, due to the vastness of the gene network, each network was clustered with mcode 1.6.1 and cytocluster 2.1.0 programs (clusterone algorithm) and significant clusters (p-value < 0.05) were identified (saedi et al. 2022). in order to better understand the ontology and function of the identified differential genes, the gene ontology of the genes was investigated using software of cytoscape version 3.7.1 and cluego 2.5.9 and cluepedia 1.5.9 programs. after receiving the gene ontology results, significant gene ontology terms (p-value < 0.05) related to functional groups were identified. finally, the selected genes (adj p-value < 0.05) were identified and introduced in these two age groups.results and discussion: after quality control, correcting and normalizing the microarray data, the gpl10778 platform annotation file with 1042520 probe id was used to calculate their expression values. after relevant analyzes of 9289 probe id identified related to the data of this study, 7918 gene symbol was identified finally. after quality control, trimming and normalizing the rna-seq data in total, the number of ensembl_genes based on which the reading values were calculated by htseq was 27056. after removing ids that had zero readings in all 6 samples, 10855 ids remained. then, these 10855 ensembl id were merged with the annotation file to obtain gene symbol, and finally 9417 common genes were identified between the six samples of adult age. the results of differential expression analysis showed that there were significant differences in the expression of 62 genes (37 increased and 25 decreased) in the muscle tissue between adult and embryonic age. by creating a gene network between differential genes, 15 selected genes were identified, including myh1, actn3, casq1, tmod4, fbp2, slc2a4, mx1, cox4i1, sod2, mfn2, uqcrb, ucp3, prkab2, phkg2, ppp1r3c. the function of these genes has been proven in cell proliferation, protein synthesis, myofibril formation, and lipid metabolism. differential gene enrichment analysis revealed some biological processes such as vasculogenesis, positive regulation of ossification, positive regulation of muscle tissue development, regulation of muscle contraction, contractile fiber part, calcium signaling, calcineurin-nfat signaling cascade and regulation of receptor signaling pathway via jak-stat, the molecular function of regulating cation channel activity and the cellular components of the contractile fiber.conclusion: this study in addition to confirming the accuracy of the integration method of two types of heterogeneous data, provided a general view of the transcriptomic differences of texel sheep muscle tissue at two important age points to be a useful source for biological investigations of genes related to muscle growth and development in sheep.
Keywords data integration ,differential gene expression ,gene network ,gene ontology
 
 

Copyright 2023
Islamic World Science Citation Center
All Rights Reserved