Á¤º¸Ã³¸®ÇÐȸ ³í¹®Áö ¼ÒÇÁÆ®¿þ¾î ¹× µ¥ÀÌÅÍ °øÇÐ
Current Result Document :
ÇѱÛÁ¦¸ñ(Korean Title) |
¼Ò¼È ÅؽºÆ®ÀÇ ÁÖ¿ä Á¤º¸ ÃßÃâÀ» À§ÇÑ ·ÎÁö½ºÆ½ ȸ±Í ¾Ó»óºí ±â¹ý |
¿µ¹®Á¦¸ñ(English Title) |
Logistic Regression Ensemble Method for Extracting Significant Information from Social Texts |
ÀúÀÚ(Author) |
±è¼ÒÇö
±èÇÑÁØ
Kim So Hyeon
Kim Han Joon
|
¿ø¹®¼ö·Ïó(Citation) |
VOL 06 NO. 05 PP. 0279 ~ 0284 (2017. 05) |
Çѱ۳»¿ë (Korean Abstract) |
ºòµ¥ÀÌÅÍ ½Ã´ë¸¦ ¸ÂÀÌÇÏ¿© ÅؽºÆ®¸¶ÀÌ´×°ú ¿ÀÇǴϾð¸¶ÀÌ´×ÀÇ È°¿ëµµ°¡ Ä¿Áö°í ÀÖ´Â ½ÃÁ¡¿¡¼ ¼Ò¼È ³×Æ®¿öÅ© ¼ºñ½º·ÎºÎÅÍ À¯¿ëÇÑ Á¤º¸¸¦ ÃßÃâÇÏ´Â ÀÛ¾÷Àº ¸Å¿ì Áß¿äÇÑ ¿¬±¸ ÁÖÁ¦ Áß ÇϳªÀÌ´Ù. ÀÌ¿¡ º» ³í¹®Àº ºí·Î±× HTML ¹®¼¿¡¼ ÁÖ¿ä º»¹®À» ã´Â ·ÎÁö½ºÆ½ ȸ±Í ¾Ó»óºí ±â¹ýÀ» Á¦¾ÈÇÑ´Ù. ¸ÕÀú, ºí·Î±× HTML ű׿¡¼ ±¸Á¶Àû Ư¡, ÅؽºÆ® Ư¡À» ÃßÃâÇÑ´Ù. ±× ´ÙÀ½, ºí·Î±× HTML ¹®¼¿¡¼ ÃßÃâÇÑ ÅÂ±× Æ¯Â¡¿¡ ·ÎÁö½ºÆ½ ȸ±Í ¹× ¾Ó»óºí ±â¹ýÀ» Àû¿ëÇÏ¿© º»¹®À» Æ÷ÇÔÇϴ ű׸¦ ºÐ·ùÇÏ´Â ¸ðµ¨À» ±¸¼ºÇÑ´Ù. º» ¿¬±¸ÀÇ Áß¿äÇÑ ¹ß°ß Áß Çϳª´Â ű×ÀÇ ±íÀÌ Æ¯Â¡À» ÀÌ¿ëÇÏ¿© ÁÖ¿ä º»¹®À» ãÀ» ¼ö ÀÖ´Ù´Â Á¡ÀÌ´Ù. ´Ù¾çÇÑ ÁÖÁ¦ÀÇ ±¹³» ºí·Î±× µ¥ÀÌÅ͸¦ ÀÌ¿ëÇÑ ½ÇÇè¿¡¼ ÅÂ±× ºÐ·ù Á¤È®µµ°¡ 99%, º»¹®À» ã¾Æ³½ ¹®¼ÀÇ ºñÀ²ÀÌ 80.5%·Î Æò°¡µÇ¾ú´Ù.
|
¿µ¹®³»¿ë (English Abstract) |
Currenty, in the era of big data, text mining and opinion mining have been used in many domains, and one of their most important research issues is to extract significant information from social media. Thus in this paper, we propose a logistic regression ensemble method of finding the main body text from blog HTML. First, we extract structural features and text features from blog HTML tags. Then we construct a classification model with logistic regression and ensemble that can decide whether any given tags involve main body text or not. One of our important findings is that the main body text can be found through ¡®depth¡¯ features extracted from HTML tags. In our experiment using diverse topics of blog data collected from the web, our tag classification model achieved 99% in terms of accuracy,
|
Å°¿öµå(Keyword) |
±â°èÇнÀ
Á¤º¸ ÃßÃâ
¾Ó»óºí
·ÎÁö½ºÆ½ ȸ±Í
¼Ò¼È ³×Æ®¿öÅ© ¼ºñ½º
Machine Learning
Information Extraction
Ensemble
Logistic Regression
Social Media
|
ÆÄÀÏ÷ºÎ |
PDF ´Ù¿î·Îµå
|