The KIPS Transactions: Part B (정보처리학회 논문지 B)
Korean Title |
확장된 벡터 공간 모델을 이용한 한국어 문서 분류 방안 |
English Title |
Korean Document Classification Using Extended Vector Space Model |
Author |
이상곤
Samuel Sangkon Lee
|
Citation |
Vol. 18-B, No. 2, pp. 93-108 (April 2011) |
Korean Abstract |
In this paper, we propose an extended vector space model that uses ambiguous-word and disambiguous-word information to improve the classification precision of Korean documents. The vectors used in the vector space model have one more axis that carries the same degree of weight, but existing methods apply no processing to that axis, so problems arise when vectors are compared with one another. We define a word that forms an axis with the same weight as an ambiguous word, and we determine ambiguous words by calculating the mutual information between a word and a field. We define a word that resolves the ambiguity caused by an ambiguous word as a disambiguous word, and we determine the strength of a disambiguous word by calculating mutual information among the words that appear in the same document as the ambiguous word. We propose a method that improves the precision of document classification by extending the vector dimension with ambiguous and disambiguous words.
|
English Abstract |
We propose an extended vector space model that uses ambiguous words and disambiguous words to improve the precision of Korean document classification. Vectors in the vector space model contain an additional axis that carries the same degree of weight, and conventional classification methods, which do not process this axis, run into problems when vectors are compared. We define a word that forms such an equally weighted axis as an ambiguous word, and we identify ambiguous words by calculating the mutual information between a term and its classification field. We define a word that resolves the ambiguity of an ambiguous word as a disambiguous word, and we determine the strength of a disambiguous word by calculating mutual information among the words that occur in the same document as the ambiguous word. Finally, we propose a new classification method that extends the vector dimension with ambiguous and disambiguous words.
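The abstract does not give the formula used for the mutual information between a term and a classification field, so the following is only a minimal sketch under stated assumptions: it uses pointwise mutual information estimated from document counts, and it treats a term as ambiguous when its scores for two or more fields are nearly equal. The function names, the toy corpus format, and the `tolerance` threshold are all hypothetical and are not taken from the paper.

```python
import math


def pointwise_mutual_information(docs, term, category):
    """Estimate log2(P(term, category) / (P(term) * P(category))).

    docs is a list of (category_label, set_of_terms) pairs; this toy
    format is an assumption made for illustration only.
    Returns -inf when the term never occurs in the category.
    """
    n = len(docs)
    n_term = sum(1 for _, terms in docs if term in terms)
    n_cat = sum(1 for label, _ in docs if label == category)
    n_both = sum(
        1 for label, terms in docs if label == category and term in terms
    )
    if n_term == 0 or n_cat == 0:
        raise ValueError("term and category must each occur at least once")
    if n_both == 0:
        return float("-inf")
    return math.log2((n_both * n) / (n_term * n_cat))


def is_ambiguous(docs, term, categories, tolerance=0.1):
    """Hypothetical criterion: a term is ambiguous when it occurs in at
    least two categories with nearly equal PMI scores."""
    scores = [pointwise_mutual_information(docs, term, c) for c in categories]
    finite = [s for s in scores if s != float("-inf")]
    return len(finite) >= 2 and max(finite) - min(finite) <= tolerance


# Toy corpus: "bank" is spread evenly over both fields, "loan" is not.
docs = [
    ("finance", {"bank", "loan"}),
    ("finance", {"bank", "interest"}),
    ("nature", {"bank", "river"}),
    ("nature", {"bank", "fishing"}),
]
fields = ["finance", "nature"]
print(is_ambiguous(docs, "bank", fields))
print(is_ambiguous(docs, "loan", fields))
```

In this toy corpus "bank" scores the same for both fields and is flagged as ambiguous, while "loan" occurs in only one field and is not. A word such as "loan" or "river" that co-occurs with "bank" could then serve as a candidate disambiguous word, although how the paper scores that strength is not specified in the abstract.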
|
Keyword |
벡터 공간 모델
애매어
해소어
전치 인덱스 방법
상호정보량
문서분류
정보검색
Vector Space Model
Ambiguous Word
Disambiguous Word
Transposed Index Method
Mutual Information
Document Classification
Information Retrieval
|