적대적 생성 신경망을 활용한 비지도 학습 기반의 대기 자료 이상 탐지 알고리즘 연구

양호준; 이선우; 이문형; 김종구; 최정무; 신유미; 이석채; 권장우; 박지훈; 정동희; 신혜정

적대적 생성 신경망을 활용한 비지도 학습 기반의 대기 자료 이상 탐지 알고리즘 연구

원문정보

A Study on Atmospheric Data Anomaly Detection Algorithm based on Unsupervised Learning Using Adversarial Generative Neural Network

양호준, 이선우, 이문형, 김종구, 최정무, 신유미, 이석채, 권장우, 박지훈, 정동희, 신혜정

중소기업융합학회 융합정보논문지(구 중소기업융합학회논문지) 제12권 제4호 2022.04 pp.260-269 KCI 등재

피인용수 : 0건 (자료제공 : 네이버학술정보)

초록

영어

In this paper, We propose an anomaly detection model using deep neural network to automate the identification of outliers of the national air pollution measurement network data that is previously performed by experts. We generated training data by analyzing missing values and outliers of weather data provided by the Institute of Environmental Research and based on the BeatGAN model of the unsupervised learning method, we propose a new model by changing the kernel structure, adding the convolutional filter layer and the transposed convolutional filter layer to improve anomaly detection performance. In addition, by utilizing the generative features of the proposed model to implement and apply a retraining algorithm that generates new data and uses it for training, it was confirmed that the proposed model had the highest performance compared to the original BeatGAN models and other unsupervised learning model like Iforest and One Class SVM. Through this study, it was possible to suggest a method to improve the anomaly detection performance of proposed model while avoiding overfitting without additional cost in situations where training data are insufficient due to various factors such as sensor abnormalities and inspections in actual industrial sites.

한국어

본 논문에서는 기존에 전문가에 의해서 이루어지던 국가 대기오염 측정망 데이터들의 이상 탐지 작업을 인공지능을 통해 자동화하고자 심층 신경망을 이용한 이상 탐지 모델을 제안하였다. 환경과학원에서 제공받은 기상자료 데이터의 결측치 및 이상치를 분석하여 학습데이터를 생성하였으며 비지도 학습 방식의 BeatGAN 모델에 기반하여 커널 구조 변경과 합성곱 필터층 및 전치 합성곱 필터층의 추가를 통해 새로운 모델을 제안하여 이상 탐지 성능을 높이고자 하였다. 또한 제안하는 모델의 생성적 특징을 활용하여 새로운 데이터를 생성하고 이를 학습에 사용하는 재학습 알고리즘을 구현 및 적용하여 기존 BeatGAN 모델뿐 아니라 다른 비지도 학습 모델인 Iforest, One Class SVM과 비교하였을 때 제안모델의 성능이 가장 높았음을 확인할 수 있었다. 본 연구를 통해 실제 산업현장에서 센서의 이상, 점검 등의 여러 요인으로 인해 학습 데이터가 부족한 상황에서 추가 적인 비용없이 과적합을 피하며 제안하는 모델의 이상탐지 성능을 올릴 수 있는 방법을 제시할 수 있었다.

키워드

저자정보

양호준 Ho-Jun Yang. 인하대학교 전기컴퓨터공학과 학생
이선우 Seon-Woo Lee. 인하대학교 전기컴퓨터공학과 학생
이문형 Mun-Hyung Lee. 인하대학교 전기컴퓨터공학과 학생
김종구 Jong-Gu Kim. 인하대학교 전기컴퓨터공학과 학생
최정무 Jung-Mu Choi. 인하대학교 컴퓨터공학과 학생
신유미 Yu-mi Shin. 인하대학교 컴퓨터공학과 학생
이석채 Seok-Chae Lee. 인하대학교 행정학과 학생
권장우 Jang-Woo Kwon. 인하대학교 컴퓨터공학과 교수
박지훈 Ji-Hoon Park. 국립환경과학원 대기환경연구과 환경연구원
정동희 Dong-Hee Jung. 국립환경과학원 대기환경연구과 환경연구원
신혜정 Hye-Jung Shin. 국립환경과학원 대기환경연구과 환경연구원

참고문헌

자료제공 : 네이버학술정보

함께 이용한 논문

※ 기관로그인 시 무료 이용이 가능합니다.

4,000원

0개의 논문이 장바구니에 담겼습니다.

earticle