원문정보
초록
영어
Everyday there are millions of domains registered and some of them are related to malicious activities. Recently, domain names have been used to operate malicious networks such as botnet and other types of malicious software (malware). Studies have revealed that it was challenging to keep track of malicious domains by Web content analysis or human observation because of the large number of domains. Legitimate domain names usually consist of English words or other meaningful sequences and can be easy to understand by humans, while malicious domains are generated randomly and do not include meaningful words or are not otherwise readable. Recently, a classification method has been proposed to classify malicious domain names. They used many features from DNS queries, including some textual features. However, it seems difficult to collect and maintain those data. Our contribution is that, by using only domain names we could achieve better classification results, thus showing that domain names themselves contain enough information for classification.
목차
1. Introduction
2. Background and Related Work
2.1. DNS Concept
2.2. DNS Queries
2.3. Related Work
3. Data Sets and Feature Extraction
3.1. Data Collection
3.2. Constructing the Dataset
3.3. Feature Extraction
4. SVM Classifier
4.1. SVM
4.2. SVM light
5. Result and Discussion
6. Conclusion
Acknowledgements
References