earticle

논문검색

Could Decimal-binary Vector be a Representative of DNA Sequence for Classification?

원문정보

초록

영어

In recent years, one of deep learning models called Deep Belief Network (DBN) which formed by stacking restricted Boltzman machine in a greedy fashion has beed widely used for classification and recognition. With an ability to extracting features of high-level abstraction and deal with higher dimensional data structure, this model has ouperformed outstanding result on image and speech recognition. In this research, we assess the applicability of deep learning in dna classification level. Since the training phase of DBN is costly expensive, specially if deals with DNA sequence with thousand of variables, we introduce a new encoding method, using decimal-binary vector to represent the sequence as input to the model, thereafter compare with one- hot-vector encoding in two datasets. We evaluated our proposed model with different contrastive algorithms which achieved significant improvement for the training speed with comparable classification result. This result has shown a potential of using decimal-binary vector on DBN for DNA sequence to solve other sequence problem in bioinformatics.

목차

Abstract
 1. Introduction
 2. Background
  2.1 Deoxyribonucleic Acid (DNA)
  2.2 Restricted Boltzmann Machine
  2.2 Deep Belief Network for Text
 3. Proposed Method
  3.1 Encoding
  3.2 Forming DBN by Training Stack of RBMs and Fine-tuning
 4. Experimental Result and Discussion
  4.1 Dataset
  4.2 General Setup
  4.3 Result and Discussion
 5. Conclusion
 6. Acknowledgement
 References

저자정보

  • Prima Sanjaya Department of Ubiquitous IT, Dongseo University, 47 Jurye-ro, Sasang-gu, Busan 47011, Republic of Korea
  • Dae-Ki Kang Department of Computer & Information Engineering, Dongseo University

참고문헌

자료제공 : 네이버학술정보

    함께 이용한 논문

      ※ 기관로그인 시 무료 이용이 가능합니다.

      • 4,000원

      0개의 논문이 장바구니에 담겼습니다.