earticle

논문검색

Detection of Alternative Ovarian Cancer Biomarker via Word Embedding

초록

영어

Diagnosis of cancer with biomarkers is relatively simple with the use of blood samples, and it can detect cancer at an early stage with expense compared to the other diagnosis methods. We use word embedding to find an alternative biomarker for the early diagnosis of ovarian cancer from the biomedical corpus. Word embedding is a word vector representation with previously proven efficiency in the biomedical domain. First, we derived a low dimensional representation of each biomarker embedding induced from Canonical Correlation Analysis (CCA), which is a powerful and flexible statistical technique for dimensionality reduction. Second, we found a similar pair of biomarkers in the literature by using cosine similarity of biomarker embedding. In order to determine the clinical similarity between the pair of biomarkers, we used the area under the curve (AUC) of the combination of 2 biomarkers used previously. In the experiment, we confirmed that correlation between the high similarity biomarker pair, was highly correlated as the average 0.710 of the actual AUC correlation of the top 10% of the pair.

목차

Abstract
 1. Introduction
 2. Data
 3. Methods
  3.1. Canonical Correlation Analysis (CCA)
  3.2. Inducing Biomarker Embedding
  3.3. Embedding Cosin Similarity
  3.4. AUC Correlation among the Combination of 2 Biomarkers
 4. Results
 5. Conclusion
 Acknowledgments
 References

저자정보

  • Kyeong-Min Nam Department of Convergence Software, Hallym University 1 Hallymdaehak-gil, Chuncheon, Gangwon-do, 200-702 Korea, Bio-IT Research Center, Hallym University, 1 Hallymdaehak-gil, Chuncheon, Gangwon-do, 200-702 Korea
  • Hey-Jung Song Department of Convergence Software, Hallym University 1 Hallymdaehak-gil, Chuncheon, Gangwon-do, 200-702 Korea, Bio-IT Research Center, Hallym University, 1 Hallymdaehak-gil, Chuncheon, Gangwon-do, 200-702 Korea
  • Jong-Dae Kim Department of Convergence Software, Hallym University 1 Hallymdaehak-gil, Chuncheon, Gangwon-do, 200-702 Korea, Bio-IT Research Center, Hallym University, 1 Hallymdaehak-gil, Chuncheon, Gangwon-do, 200-702 Korea
  • Can-Young Park Department of Convergence Software, Hallym University 1 Hallymdaehak-gil, Chuncheon, Gangwon-do, 200-702 Korea, Bio-IT Research Center, Hallym University, 1 Hallymdaehak-gil, Chuncheon, Gangwon-do, 200-702 Korea
  • Yu-Seop Kim Department of Convergence Software, Hallym University 1 Hallymdaehak-gil, Chuncheon, Gangwon-do, 200-702 Korea, Bio-IT Research Center, Hallym University, 1 Hallymdaehak-gil, Chuncheon, Gangwon-do, 200-702 Korea

참고문헌

자료제공 : 네이버학술정보

    함께 이용한 논문

      ※ 원문제공기관과의 협약기간이 종료되어 열람이 제한될 수 있습니다.

      0개의 논문이 장바구니에 담겼습니다.