Design of Distributed Cloud System for Managing large-scale Genomic Data

Seine Jang; Seok-Jae Moon

IT Marketing and Policy

Source Information

국제인공지능학회 (formerly 한국인터넷방송통신학회), International Journal of Internet, Broadcasting and Communication, Vol. 16, No. 2, May 2024, pp. 119-126

Citations: 0 (data provided by Naver Academic)

Abstract

English

The volume of genomic data is constantly increasing in various modern industries and research fields. This growth presents new challenges and opportunities in terms of the quantity and diversity of genetic data. In this paper, we propose a distributed cloud system for integrating and managing large-scale gene databases. By introducing a distributed data storage and processing system based on the Hadoop Distributed File System (HDFS), various formats and sizes of genomic data can be efficiently integrated. Furthermore, by leveraging Spark on YARN, efficient management of distributed cloud computing tasks and optimal resource allocation are achieved. This establishes a foundation for the rapid processing and analysis of large-scale genomic data. Additionally, by utilizing BigQuery ML, machine learning models are developed to support genetic search and prediction, enabling researchers to more effectively utilize data. It is expected that this will contribute to driving innovative advancements in genetic research and applications.
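The pipeline the abstract outlines (data in HDFS, Spark jobs scheduled on YARN, models built with BigQuery ML) can be illustrated with a minimal sketch. It is not the authors' implementation: the script name `variant_stats.py`, the HDFS path, the dataset, table, and model names, and the `phenotype` label column are all hypothetical, and the executor settings are arbitrary placeholders. The sketch only assembles a `spark-submit` command line and a BigQuery ML `CREATE MODEL` statement as strings; it does not contact a cluster or BigQuery.

```python
import shlex


def spark_submit_command(app, hdfs_input, num_executors=4, executor_memory="4g"):
    """Build a spark-submit argv that runs `app` on YARN against an HDFS path.

    All argument values are illustrative assumptions, not values from the paper.
    """
    return [
        "spark-submit",
        "--master", "yarn",            # let YARN allocate cluster resources
        "--deploy-mode", "cluster",    # run the driver inside the cluster
        "--num-executors", str(num_executors),
        "--executor-memory", executor_memory,
        app,                           # hypothetical PySpark script
        hdfs_input,                    # hypothetical HDFS input directory
    ]


def bigquery_ml_create_model(model, source_table, label_col, model_type="LOGISTIC_REG"):
    """Return a BigQuery ML CREATE MODEL statement as a string.

    The model, table, and label names are hypothetical placeholders.
    """
    return (
        f"CREATE OR REPLACE MODEL `{model}`\n"
        f"OPTIONS (model_type = '{model_type}', input_label_cols = ['{label_col}']) AS\n"
        f"SELECT * FROM `{source_table}`"
    )


# Assemble both pieces for a hypothetical variant dataset.
cmd = spark_submit_command("variant_stats.py", "hdfs:///genomics/variants/")
sql = bigquery_ml_create_model(
    "genomics.phenotype_model", "genomics.variant_features", "phenotype"
)

print(shlex.join(cmd))
print(sql)
```

In a real deployment the command would be run on a cluster edge node and the SQL submitted through a BigQuery client; keeping both as plain strings here makes the division of labor between the three components visible without assuming any particular cluster setup.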

Author Information

Seine Jang: Master's course, Graduate School of Smart Convergence, Kwangwoon University, Seoul
Seok-Jae Moon: Professor, Graduate School of Smart Convergence, Kwangwoon University, Seoul, Korea
