N-gram을 활용한 DGA 기반의 봇넷 탐지 방안

정일옥; 신덕하; 김수철; 이록석

N-gram을 활용한 DGA 기반의 봇넷 탐지 방안

원문정보

DGA-based Botnet Detection Technology using N-gram

정일옥, 신덕하, 김수철, 이록석

한국융합보안학회 융합보안논문지 제22권 제5호 2022.12 pp.145-154 KCI 등재

피인용수 : 0건 (자료제공 : 네이버학술정보)

초록

영어

Recently, the widespread proliferation and high sophistication of botnets are having serious consequences not o nly for enterprises and users, but also for cyber warfare between countries. Therefore, research to detect botnets is steadily progressing. However, the DGA-based botnet has a high detection rate with the existing signature and st atistics-based technology, but also has a high limit in the false positive rate. Therefore, in this paper, we propose a detection model using text-based n-gram to detect DGA-based botnets. Through the proposed model, the detecti on rate, which is the limit of the existing detection technology, can be increased and the false positive rate can als o be minimized. Through experiments on large-scale domain datasets and normal domains used in various DGA b otnets, it was confirmed that the performance was superior to that of the existing model. It was confirmed that the false positive rate of the proposed model is less than 2 to 4%, and the overall detection accuracy and F1 score are both 97.5%. As such, it is expected that the detection and response capabilities of DGA-based botnets will be improved th rough the model proposed in this paper.

한국어

최근 봇넷의 광범위한 확산과 고도의 정교함은 기업과 사용자뿐만 아니라 국가 간 사이버전에도 심각한 결과를 초래하고 있다. 이 때문에 봇넷을 탐지하고자 하는 연구는 꾸준히 되고 있다. 하지만, DGA 기반의 봇넷은 기존의 시그니처 및 통계 기 반의 기술로는 탐지율은 높지만, 오탐율 또한 높은 한계가 있다. 이에 본 논문에서는 DGA 기반의 봇넷을 탐지하고자 문자 기 반의 n-gram을 활용한 탐지모델을 제안한다. 제안한 모델을 통해 기존의 탐지 기술의 한계인 탐지율을 높이고 오탐율을 최소 화할 수 있다. 다양한 DGA 봇넷에서 사용하는 대규모의 도메인 데이터셋과 정상 도메인에 대한 실험을 통해 기존의 모델보 다 성능이 우수함을 확인하였다. 제안된 모델의 오탐율은 2~4% 미만이며 전체 탐지 정확도와 F1 점수는 모두 97.5%임을 확 인하였다. 이처럼 본 논문에서 제안한 모델을 통해 DGA 기반의 봇넷에 대한 탐지 및 대응 능력이 향상될 것을 기대한다.

요약
ABSTRACT
1. 서론
2. 관련 연구
2.1 봇넷 탐지와 DGA에 관한 연구
2.2 DGA 기반의 봇넷 탐지에 관한 연구
3. 제안된 기법
3.1 제안된 DGA 기반의 봇넷 탐지 프로세스
3.2 문자 단위 bigram 기반의 피처 추출
3.3 DGA 알고리즘
4. 실험 및 평가
4.1 데이터셋
4.2 데이터 전처리
4.3 실험 환경 및 평가 방법
4.4 실험 결과
5. 결론
참고문헌

키워드

저자정보

정일옥 Jung Il Ok. 고려대학교/정보보호학과
신덕하 Shin Deok Ha. 경희대학교/응용수학과
김수철 Kim Su Chul. 숭실대학교/IT정책경영학과
이록석 Lee Rock Seok. 전남대학교/정보보호협동과정

참고문헌

자료제공 : 네이버학술정보

함께 이용한 논문

※ 기관로그인 시 무료 이용이 가능합니다.

4,000원

0개의 논문이 장바구니에 담겼습니다.

earticle