A Novel Method for Splice Sites Prediction Using Sequence Component and Hidden Markov Model


Pashaei E., YILMAZ A., Ozen M., AYDIN N.

38th Annual International Conference of the IEEE-Engineering-in-Medicine-and-Biology-Society (EMBC), Florida, United States Of America, 16 - 20 August 2016, pp.3076-3079 identifier identifier identifier

  • Publication Type: Conference Paper / Full Text
  • Doi Number: 10.1109/embc.2016.7591379
  • City: Florida
  • Country: United States Of America
  • Page Numbers: pp.3076-3079
  • Keywords: Splice site prediction, DNA encoding method, AdaBoost.M1 classifier
  • Yıldız Technical University Affiliated: Yes

Abstract

With increasing growth of DNA sequence data, it has become an urgent demand to develop new methods to accurately predict the genes. The performance of gene detection methods mainly depend on the efficiency of splice site prediction methods. In this paper, a novel method for detecting splice sites is proposed by using a new effective DNA encoding method and AdaBoost.M1 classifier. Our proposed DNA encoding method is based on multi-scale component (MSC) and first order Markov model (MM1). It has been applied to the HS3D dataset with repeated 10 fold cross validation. The experimental results indicate that the new method has increased the classification accuracy and outperformed some current methods such as MM1-SVM, Reduced MM1-SVM, SVM-B, LVMM, DM-SVM, DM2-AdaBoost and MSC+Pos(+APR)-SVM.