의료 데이터 산업을 위한 비정형 데이터 비식별화 정책에 관한 연구

이선진; 박태림; 김소희; 오영은; 이일구

원문정보

A study on the policy of de-identifying unstructured data for the medical data industry

이선진, 박태림, 김소희, 오영은, 이일구

한국융합보안학회 융합보안논문지 제22권 제4호 2022.10 pp.85-97 KCI 등재

피인용수 : 0건 (자료제공 : 네이버학술정보)

초록

영어

With the development of big data technology, data is rapidly entering a hyperconnected intelligent society that accelerates innovative growth in all industries. The convergence industry, which holds and utilizes various high-quality data, is becoming a new growth engine, and big data is fused to various traditional industries. In particular, in the medical field, structured data such as electronic medical record data and unstructured medical data such as CT and MRI are used together to increase the accuracy of disease prediction and diagnosis. Currently, the importance and size of unstructured data are increasing day by day in the medical industry, but conventional data security technologies and policies are structured data-oriented, and considerations for the security and utilization of unstructured data are insufficient. In order for medical treatment using big data to be activated in the future, data diversity and security must be internalized and organically linked at the stage of data construction, distribution, and utilization. In this paper, the current status of domestic and foreign data security systems and technologies is analyzed. After that, it is proposed to add unstructured data-centered de-identification technology to the guidelines for unstructured data and technology application cases in the industry so that unstructured data can be actively used in the medical field, and to establish standards for judging personal information for unstructured data. Furthermore, an object feature-based identification ID that can be used for unstructured data without infringing on personal information is proposed.

한국어

빅데이터 기술이 발전하면서 데이터가 전 산업의 혁신 성장을 가속하는 초연결 지능화 사회로 빠르게 진입하고 있 다. 고품질의 다양한 데이터를 보유하고 활용하는 융복합 산업이 새로운 성장 동력으로 자리매김하고 있으며, 다양한 전통 산업군에 빅데이터가 융합되어 데이터 기반의 혁신을 통해 디지털 전환이 이루어지고 있다. 특히 의료 분야에서는 전자의무기록 데이터와 같은 정형 데이터와 CT, MRI 등의 비정형 의료 데이터를 함께 활용함으로써, 질병 예측 및 진 단의 정확도를 높이고 있다. 현재 의료 산업에서 비정형 데이터의 중요성과 규모는 나날이 증가하고 있지만, 종래의 데 이터 보안 기술과 정책은 정형 데이터 중심이며, 비정형 데이터의 보안성과 활용성에 대한 고려는 미비하다. 향후 빅데 이터를 활용한 진료가 활성화되려면 데이터의 다양성과 보안성이 데이터 구축, 유통, 활용 단계에서 내재화되고 유기적 으로 연계되어야 한다. 본 논문에서는 국내외 데이터 보안 제도와 기술 현황을 분석한다. 이후 의료 분야에서 비정형 데이터가 활발히 사용될 수 있도록 비식별조치 가이드라인에 비정형 데이터 중심의 비식별 기술과 산업에서의 기술 적 용 사례를 추가하고, 비정형 데이터에 대한 개인정보 판단 기준을 수립할 것을 제안한다. 더 나아가 개인정보를 침해하 지 않고, 비정형 데이터에 활용할 수 있는 객체 특징 기반의 식별 ID를 제안한다.

요약
ABSTRACT
1. 서론
2. 종래 데이터 비식별 기술ㆍ정책 분석
2.1 사전 검토 단계
2.2 비식별조치 단계
2.3 데이터 결합 단계
2.4 적정성 평가 및 사후관리 단계
3. 비정형 빅데이터 비식별조치 기술
4. 의료 데이터 산업 활성화를 위한 비정형 빅데이터 비식별조치 정책
4.1 비식별조치 정책 내 비정형 데이터 비식별 기술 및 사례 도입
4.2 비정형 데이터에 대한 개인정보 판단기준 수립
4.3 객체 특징 기반의 식별 ID 활용
5. 결론
참고문헌

키워드

저자정보

이선진 Sun-Jin Lee. 성신여자대학교 미래융합기술공학과
박태림 Tae-Rim Park. 성신여자대학교 미래융합기술공학과
김소희 So-Hui Kim. 신세계아이앤씨 정보보안팀
오영은 Young-Eun Oh. 라인비즈플러스 viz security
이일구 Il-Gu Lee. 성신여자대학교 융합보안공학과 / 미래융합기술공학과

참고문헌

자료제공 : 네이버학술정보

earticle