• Àüü
  • ÀüÀÚ/Àü±â
  • Åë½Å
  • ÄÄÇ»ÅÍ
´Ý±â

»çÀÌÆ®¸Ê

Loading..

Please wait....

±¹³» ³í¹®Áö

Ȩ Ȩ > ¿¬±¸¹®Çå > ±¹³» ³í¹®Áö > Çѱ¹Á¤º¸°úÇÐȸ ³í¹®Áö > Á¤º¸°úÇÐȸ ³í¹®Áö C : ÄÄÇ»ÆÃÀÇ ½ÇÁ¦

Á¤º¸°úÇÐȸ ³í¹®Áö C : ÄÄÇ»ÆÃÀÇ ½ÇÁ¦

Current Result Document :

ÇѱÛÁ¦¸ñ(Korean Title) ¼Ò¼È ºÏ¸¶Å· ½Ã½ºÅÛÀÇ ½ºÆÐ¸Ó Å½Áö¸¦ À§ÇÑ ±â°èÇнÀ ±â¼úÀÇ ¼º´É ºñ±³
¿µ¹®Á¦¸ñ(English Title) Comparative Study of Machine Learning Techniques for Spammer Detection in Social Bookmarking Systems
ÀúÀÚ(Author) ±èÂùÁÖ   Ȳ±Ô¹é   Chanju Kim   Kyubaek Hwang  
¿ø¹®¼ö·Ïó(Citation) VOL 15 NO. 05 PP. 0345 ~ 0349 (2009. 05)
Çѱ۳»¿ë
(Korean Abstract)
¼Ò¼È ºÏ¸¶Å·(social bookmarking) ½Ã½ºÅÛÀº »ç¿ëÀÚ°¡ ºÏ¸¶Å©¸¦ ÀúÀåÇÏ°í °øÀ¯ÇÒ ¼ö ÀÖ´Â Ç÷§ÆûÀ» Á¦°øÇÏ´Â À¥ ±â¹Ý(web-based) ½Ã½ºÅÛÀ¸·Î Æø¼Ò³ë¹Ì(folksonomy)¸¦ ÀÌ¿ëÇÑ ´ëÇ¥ÀûÀÎ À¥2.0 ¼­ºñ½ºÀÌ´Ù. ¼Ò¼È ºÏ¸¶Å·½Ã½ºÅÛ¿¡¼­ÀÇ ½ºÆиÓ(spammer)¶õ ÀڽŵéÀÇ ÀÌÀÍÀ» À§Çؼ­ ½Ã½ºÅÛÀ» °íÀÇÀûÀ¸·Î ¾Ç¿ëÇÏ´Â »ç¶÷À» ¸»ÇÑ´Ù. ½ºÆиӴ ¸¹Àº ¾çÀÇ À߸øµÈ Á¤º¸¸¦ ½Ã½ºÅÛ¿¡ Æ÷½ºÆÃ(posting)Çϱ⠶§¹®¿¡ Àüü ¼Ò¼È ºÏ¸¶Å· ½Ã½ºÅÛÀÇ ¸®¼Ò½º(resource)¸¦ ¾µ¸ð¾ø°Ô ¸¸µé¾î ¹ö¸°´Ù. µû¶ó¼­, ½ºÆиӸ¦ ºü¸¥ ½Ã°£ ¾È¿¡ ŽÁöÇÏ°í ±×µéÀÇ Á¢±ÙÀ» Â÷´ÜÇÏ´Â °ÍÀº ½Ã½ºÅÛÀÇ ºØ±«¸¦ ¹æÁöÇϱâ À§ÇØ Áß¿äÇÏ´Ù. º» ³í¹®¿¡¼­´Â »ç¿ëÀÚ°¡ »ç¿ëÇÑ Å±׿¡ ´ëÇÑ µ¥ÀÌÅ͸¦ ÃßÃâÇÏ¿©, »ç¿ëÀÚ°¡ ½ºÆиÓÀÎÁö ¾Æ´ÑÁö¸¦ ¿¹ÃøÇÏ´Â ¸ðµ¨À» ±â°èÇнÀÀÇ ´Ù¾çÇÑ ¹æ¹ýÀ» Àû¿ëÇÏ¿© »ý¼ºÇÑ ÈÄ ±× ¼º´ÉÀ» ºñ±³ÇØ º¸¾Ò´Ù. ±¸Ã¼ÀûÀ¸·Î, °áÁ¤Å×À̺í(decision table, DT), °áÁ¤Æ®¸®(decision tree, ID3), ³ªÀÌºê º£ÀÌÁî ºÐ·ù±â(naive Bayes classifier), TAN(tree-augmented naive Bayes) ºÐ·ù±â, Àΰø½Å°æ¸Á(artificial neural network)ÀÇ ¹æ¹ýÀ» ºñ±³ÇÏ¿´´Ù. ±× °á°ú AUC(area under the ROC curve)¿Í ¸ðµ¨ »ý¼º½Ã°£À» °í·ÁÇÏ¿´À» ¶§ ³ªÀÌºê º£ÀÌÁî ºÐ·ù±â°¡ °¡Àå ¸¸Á·ÇÒ ¸¸ÇÑ ¼º´ÉÀ» º¸¿´´Ù. ³ªÀÌºê º£ÀÌÁî ºÐ·ù±âÀÇ ºÐ·ù °á°ú°¡ °¡Àå ÁÁ¾Ò´ø ÀÌÀ¯´Â ¼º´ÉÀ» ºñ±³ÇÏ´Â µ¥ »ç¿ëµÈ AUC°¡ °áÁ¤Æ®¸® °è¿­ÀÇ ¹æ¹ý(ID3 µî)º¸´Ù ³ªÀÌºê º£ÀÌÁî ºÐ·ù±â¿¡¼­ ÀϹÝÀûÀ¸·Î ³ô°Ô ³ª¿À´Â °æÇâÀÌ ÀÖ´Ù´Â °Í°ú, ½ºÆÐ¸Ó Å½Áö ¹®Á¦°¡ ¼±ÇüÀ¸·Î ºÐ¸® °¡´ÉÇÑ °æ¿ì(linearly separable)¿Í À¯»çÇÒ °¡´É¼ºÀÌ ³ô±â ¶§¹®À¸·Î ¿©°ÜÁø´Ù.
¿µ¹®³»¿ë
(English Abstract)
Social bookmarking systems are a typical web 2.0 service based on folksonomy, providing the platform for storing and sharing bookmarking information. Spammers in social bookmarking systems denote the users who abuse the system for their own interests in an improper way. They can make the entire resources in social bookmarking systems useless by posting lots of wrong information. Hence, it is important to detect spammers as early as possible and protect social bookmarking systems from their attack. In this paper, we applied a diverse set of machine learning approaches, i.e., decision tables, decision trees (ID3), nave Bayes classifiers, TAN (tree-augment naive Bayes) classifiers, and artificial neural networks to this task. In our experiments, naive Bayes classifiers performed significantly better than other methods with respect to the AUC (area under the ROC curve) score as well as the model building time. Plausible explanations for this result are as follows. First, naive Bayes classifiers are known to usually perform better than decision trees in terms of the AUC score. Second, the spammer detection problem in our is likely to be linearly separable.
Å°¿öµå(Keyword) À¥2.0   ¼Ò¼È ºÏ¸¶Å·   ½ºÆÐ¸Ó Å½Áö   ±â°èÇнÀ   ³ªÀÌºê º£ÀÌÁî ºÐ·ù±â   web 2.0   social bookmarking   spammer detection   machine learning   nave Bayes classifiers  
ÆÄÀÏ÷ºÎ PDF ´Ù¿î·Îµå