Çѱ¹ÀÎÅͳÝÁ¤º¸ÇÐȸ ³í¹®Áö
Current Result Document :
ÇѱÛÁ¦¸ñ(Korean Title) |
¿ª»ç°´Ã¼ ±â¹ÝÀÇ ±â°è ÇнÀ ±â¹ýÀ» È°¿ëÇÑ À¥ ¹®¼ÀÇ ½Ã°£Á¤º¸ ÃßÃâ ¹æ¾È Á¦¾È |
¿µ¹®Á¦¸ñ(English Title) |
A Proposal of Methods for Extracting Temporal Information of History-related Web Document based on Historical Objects Using Machine Learning Techniques |
ÀúÀÚ(Author) |
ÀÌ ÁØ
±Ç¿ëÁø
Jun Lee
KWON YongJin
|
¿ø¹®¼ö·Ïó(Citation) |
VOL 16 NO. 04 PP. 0039 ~ 0050 (2015. 08) |
Çѱ۳»¿ë (Korean Abstract) |
ÃÖ±Ù °Ë»ö¿£ÁøÀ» ÅëÇÑ Á¤º¸°Ë»ö °úÁ¤¿¡¼ ƯÁ¤ ½Ã±¸°£ »óȲ¿¡ ´ëÀÀÇÏ´Â ¹®¼¸¦ °Ë»öÇÏ°íÀÚ ÇÏ´Â °æ¿ì°¡ ÀÖ´Ù. ¿¹¸¦ µé¸é, ÀÓÁø¿Ö¶õ ÀÌÀüÀÇ ½Ã´ëÀû »óȲ°ú °ü·ÃµÈ ¹®¼¸¦ °Ë»öÇϱâ À§ÇØ Å°¿öµå¡®ÀÓÁø¿Ö¶õ¡¯À¸·Î °Ë»öÇÏ¸é ½Ã°£¿¡ °ü°è¾øÀÌ ÀÓÁø¿Ö¶õ ´ç½Ã³ª ÀüÈÄÀÇ ¸ðµç ¹®¼°¡ °Ë»öµÇ¾î Ãß°¡ÀûÀÎ ÀÛ¾÷ÀÌ ¿ä±¸ µÈ´Ù. ¶ÇÇÑ, ¿ª»ç °ü·Ã ¹®¼ÀÇ °æ¿ì´Â ¹®¼ ³»¿ë¿¡ ´ëÀÀÇÏ´Â ½Ã°£ Á¤º¸°¡ ¹®¼ »ý¼º½Ã°£°ú ÀÏÄ¡ÇÏÁö ¾Ê´Â °æ¿ì°¡ ´ëºÎºÐÀÌ´Ù. ¸¸¾à À¥ ¹®¼ÀÇ ³»¿ë¿¡ ´ëÀÀÇÏ´Â ½Ã°£ Á¤º¸¸¦ ÃßÃâÇÒ ¼ö ÀÖ´Ù¸é È¿°úÀûÀÎ Á¤º¸ °Ë»öÀº ¹°·Ð ´Ù¾çÇÑ ÀÀ¿ë¿¡ Àû¿ë °¡´ÉÇÒ °ÍÀÌ´Ù. µû¶ó¼ º» ³í¹®Àº ¹®¼ ³»¿ë¿¡ ´ëÀÀÇÏ´Â ½Ã°£ Á¤º¸ ÃßÃâÀ» ¸ñÀûÀ¸·Î, Á¶¼±½Ã´ë¸¦ ´ë»óÀ¸·ÎÇÑ ¿ª»ç ¹®ÇåÀ» È°¿ëÇÏ¿© Á¶¼±½Ã´ë ¿ª»ç °ü·Ã ¹®¼ÀÇ ½Ã°£ÃßÃâ¿¡ ´ëÇÑ ¿¬±¸¸¦ ÁøÇàÇÑ´Ù. ¿ª»ç ¹®Çå°ú À¥À¸·ÎºÎÅÍ ¼öÁýµÈ ¿ª»ç °ü·Ã ¹®¼¸¦ ¹ÙÅÁÀ¸·Î ¿ª»ç °´Ã¼¸¦ Á¤ÀÇÇÏ°í, À̸¦ ±â¹ÝÀ¸·Î ´Ù¾çÇÑ ±â°è ÇнÀ ±â¹ýÀ» È°¿ëÇÏ¿© À¥ ¹®¼ÀÇ ½Ã°£ Á¤º¸ÃßÃâ¿¡ ´ëÇÑ °¡´É¼ºÀ» È®ÀÎÇÑ´Ù. ¶ÇÇÑ ±â°è ÇнÀ°úÁ¤¿¡ ÀÖ¾î¼ °´Ã¼ÀÇ À¯»çµµ¿¡ ±â¹ÝÇÑ ¿©°ú°úÁ¤À» Á¦¾ÈÇÏ°í À̸¦ Àû¿ëÇÑ È¿À²ÀûÀÎ ½Ã°£ Á¤º¸ ÃßÃâ ¹× Á¤È®µµ Çâ»ó¿¡ ´ëÇÑ °á°ú¸¦ ºñ±³ ºÐ¼®ÇÑ´Ù.
|
¿µ¹®³»¿ë (English Abstract) |
In information retrieval process through search engine, some users want to retrieve several documents that are corresponding with specific time period situation. For example, if user wants to search a document that contains the situation before ¡®Japanese invasions of Korea era¡¯, he may use the keyword ¡®Japanese invasions of Korea¡¯ by using searching query. Then, search engine gives all of documents about ¡®Japanese invasions of Korea¡¯ disregarding time period in order. It makes user to do an additional work. In addition, a large percentage of cases which is related to historical documents have different time period between generation date of a document and record time of contents. If time period in document contents can be extracted, it may facilitate effective information for retrieval and various applications. Consequently, we pursue a research extracting time period of Joseon era¡¯s historical documents by using historic literature for Joseon era in order to deduct the time period corresponding with document content in this paper. We define historical objects based on historic literature that was collected from web and confirm a possibility of extracting time period of
web document by machine learning techniques. In addition to the machine learning techniques, we propose and apply the similarity filtering based on the comparison between the historical objects. Finally, we¡¯ll evaluate the result of temporal indexing accuracy and improvement.
|
Å°¿öµå(Keyword) |
½Ã°£Á¤º¸
½Ã°£ÃßÃâ
±â°èÇнÀ
À¯»çµµ¿©°ú¹ý
¿ª»çÁ¤º¸
¿ª»ç°´Ã¼
Temporal information
Termporal Extraction
Machine learning. Similarity filtering
Historical information
Historical Object
|
ÆÄÀÏ÷ºÎ |
PDF ´Ù¿î·Îµå
|