ResidualConv1D: A Deep Learning Approach for Enhancing Splice Site Prediction Across Genomic Contexts
|
Authors
|
Rezvan Mohammad Reza, Ghanbari Sorkhi Ali, Pirgazi Jamshid, Pourhashem Kallehbasti Mohammad Mehdi
|
Source
|
First National Conference on Artificial Intelligence and Futuristic Technologies - 1402 (Solar Hijri) - Volume: 1 - First National Conference on Artificial Intelligence and Futuristic Technologies - Conference code: 03230-86475 - Pages: 0-0
|
Abstract
|
This study addresses the challenge of accurately predicting splice sites, a crucial element in understanding gene expression and protein synthesis. We assume that conventional prediction methods may lack the specificity and adaptability required for diverse genomic contexts. To address this, we present a method that combines two-gram features and one-hot encoding with a deep convolutional neural network model, ResidualConv1D. Our approach begins by applying the two-gram technique to capture dependencies between adjacent nucleotides at splice sites. The resulting two-gram features are then represented with one-hot encoding. The core of our methodology is the ResidualConv1D model, which uses convolutional blocks with residual connections to detect complex sequence patterns. Our results indicate a significant improvement in splice site prediction accuracy. The model performs particularly well on the HS3D acceptor and Arabidopsis thaliana donor datasets, where it outperforms the established EnsembleSplice algorithm. On the HS3D acceptor dataset, the model achieved an accuracy of 94.18% and an F1-score of 94.24%. It also shows competitive performance on a range of metrics across the other datasets, which indicates robustness in different genomic environments. In conclusion, the combination of two-gram features, one-hot encoding, and the ResidualConv1D model substantially improves the accuracy of splice site prediction across diverse species. This improved prediction capability could help advance the understanding of gene splicing mechanisms.
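The pipeline described in the abstract can be illustrated with a minimal pure-Python sketch. It is not the authors' implementation: the abstract does not give kernel widths, channel counts, the number of blocks, or the exact block layout, so the names `two_gram_one_hot`, `conv1d_same`, and `residual_block`, the overlapping two-gram window, the "same" zero padding, and the ReLU placement are all assumptions made for illustration. A real model would use a deep learning framework and learned weights.

```python
from itertools import product

NUCLEOTIDES = "ACGT"
# All 16 ordered dinucleotides (two-grams): "AA", "AC", ..., "TT".
TWO_GRAMS = ["".join(p) for p in product(NUCLEOTIDES, repeat=2)]
TWO_GRAM_INDEX = {g: i for i, g in enumerate(TWO_GRAMS)}


def two_gram_one_hot(seq):
    """Encode overlapping two-grams as a (len(seq) - 1) x 16 one-hot matrix."""
    seq = seq.upper()
    rows = []
    for i in range(len(seq) - 1):
        row = [0.0] * len(TWO_GRAMS)
        idx = TWO_GRAM_INDEX.get(seq[i:i + 2])
        if idx is not None:  # assumption: ambiguous symbols (e.g. N) stay all-zero
            row[idx] = 1.0
        rows.append(row)
    return rows


def conv1d_same(x, kernels, bias):
    """1D convolution with zero padding so output length equals input length.

    x: length x in_ch, kernels: out_ch x width x in_ch (odd width), bias: out_ch.
    """
    width = len(kernels[0])
    pad = width // 2
    length = len(x)
    out = []
    for t in range(length):
        out_row = []
        for kernel, b in zip(kernels, bias):
            acc = b
            for w in range(width):
                pos = t + w - pad
                if 0 <= pos < length:  # positions outside the sequence count as zeros
                    acc += sum(kw * xv for kw, xv in zip(kernel[w], x[pos]))
            out_row.append(acc)
        out.append(out_row)
    return out


def relu(x):
    return [[max(0.0, v) for v in row] for row in x]


def residual_block(x, kernels1, bias1, kernels2, bias2):
    """Assumed layout: y = relu(x + conv(relu(conv(x)))), with out_ch == in_ch."""
    h = relu(conv1d_same(x, kernels1, bias1))
    h = conv1d_same(h, kernels2, bias2)
    # The skip connection adds the block input back onto the convolved signal.
    return relu([[a + b for a, b in zip(rx, rh)] for rx, rh in zip(x, h)])


# Usage: encode a short hypothetical sequence and pass it through one block.
features = two_gram_one_hot("CAGGTAAG")
channels = len(TWO_GRAMS)
zero_kernels = [[[0.0] * channels for _ in range(3)] for _ in range(channels)]
zero_bias = [0.0] * channels
output = residual_block(features, zero_kernels, zero_bias, zero_kernels, zero_bias)
```

With all-zero weights the convolution path contributes nothing, so the block passes its input through unchanged. That identity behaviour is the property residual connections are generally used for: each block only has to learn a correction to its input.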
|
Keywords
|
splice site prediction, two-gram features, ResidualConv1D, genomic contexts, accuracy
|
Address
|
Iran, Iran, Iran, Iran
|
Email
|
pourhashem@mazust.ac.ir