Scalable and Accurate Intrusion Detection using n-Gram Augmented Naive Bayes and Generalized k-Truncated Suffix Tree

Dae-Ki Kang (강대기); Gi-Hyun Hwang (황기현)

Journal of the Korea Institute of Information and Communication Engineering (한국정보통신학회 논문지)

Korean Title	N-그램 증강 나이브 베이스 알고리즘과 일반화된 k-절단 서픽스 트리를 이용한 확장 가능하고 정확한 침입 탐지 기법
English Title	Scalable and Accurate Intrusion Detection using n-Gram Augmented Naive Bayes and Generalized k-Truncated Suffix Tree
Author	Dae-Ki Kang (강대기), Gi-Hyun Hwang (황기현)
Citation	Vol. 13, No. 4, pp. 805-812 (April 2009)
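Korean Abstract (translated)	The n-gram approach is used in many intrusion detection systems that apply machine learning. However, the n-gram approach is difficult to scale, and the n-grams obtained from a given sequence overlap one another. To address these problems, this study applies an n-gram augmented naive Bayes algorithm based on the generalized k-truncated suffix tree (k-TST) to the classification of intrusion sequences. To evaluate the proposed system, the standard naive Bayes and support vector machine (SVM) algorithms using n-gram features were compared with the proposed n-gram augmented naive Bayes algorithm on host-based intrusion detection benchmark data. Results on the publicly available host-based intrusion detection benchmark data from the University of New Mexico show that the n-gram augmented method resolves the violation of the independence assumption that arises when n-grams are applied directly to naive Bayes (e.g., standard naive Bayes with n-gram features) and, at the same time, produces more accurate intrusion detectors.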
English Abstract	The n-gram approach has been widely applied in intrusion detection. However, it has shown a few problems, including poor scalability and double counting of features. To address these problems, we applied n-gram augmented Naive Bayes with a k-truncated suffix tree (k-TST) storage mechanism directly to the classification of intrusive sequences, and compared its performance with that of Naive Bayes and Support Vector Machines (SVM) using n-gram features in experiments on host-based intrusion detection benchmark data sets. Experimental results on the University of New Mexico (UNM) benchmark data sets show that the n-gram augmented method, which resolves the violation of the independence assumption that occurs when n-gram features are applied directly to Naive Bayes (i.e., Naive Bayes with n-gram features), yields intrusion detectors with higher accuracy than Naive Bayes with n-gram features and accuracy comparable to SVM with n-gram features. For scalable and efficient counting, we store n-gram features in a k-truncated suffix tree. With this storage mechanism, we tested the classifiers with n-grams up to length 20 (20-grams), which illustrates the scalability and accuracy of n-gram augmented Naive Bayes with k-truncated suffix tree storage.
Keywords	n-gram; naive Bayes algorithm; generalized k-truncated suffix tree; host-based intrusion detection
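
The abstract describes storing n-gram counts in a generalized k-truncated suffix tree so that all n-grams up to length k can be counted in one structure. The following is a minimal sketch of that idea, not the authors' implementation: it uses an uncompressed, depth-limited trie rather than a compressed suffix tree, and the names `TrieNode`, `build_truncated_trie`, and `ngram_count`, as well as the sample system-call traces, are hypothetical.

```python
from collections import defaultdict


class TrieNode:
    """One node of an uncompressed, depth-limited suffix trie (illustrative)."""

    def __init__(self):
        self.count = 0                          # occurrences of the n-gram ending at this node
        self.children = defaultdict(TrieNode)   # next symbol -> child node


def build_truncated_trie(sequences, k):
    """Insert the first k symbols of every suffix of every sequence.

    Handling several sequences in one trie is what makes it "generalized";
    cutting each suffix at depth k is what makes it "k-truncated".
    """
    root = TrieNode()
    for seq in sequences:
        for start in range(len(seq)):
            node = root
            for symbol in seq[start:start + k]:
                node = node.children[symbol]
                node.count += 1
    return root


def ngram_count(root, ngram):
    """Return how often `ngram` (length <= k) occurs across all sequences."""
    node = root
    for symbol in ngram:
        if symbol not in node.children:   # membership test avoids creating nodes
            return 0
        node = node.children[symbol]
    return node.count


# Hypothetical system-call traces, for illustration only.
traces = [
    ["open", "read", "read", "close"],
    ["read", "close"],
]
trie = build_truncated_trie(traces, k=3)
print(ngram_count(trie, ["read"]))
print(ngram_count(trie, ["read", "close"]))
```

Because every path in the trie is at most k symbols deep, the counts for all n-gram lengths from 1 to k are available from a single pass over the data, which is the property the abstract relies on when it reports experiments up to 20-grams.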