• Àüü
  • ÀüÀÚ/Àü±â
  • Åë½Å
  • ÄÄÇ»ÅÍ
´Ý±â

»çÀÌÆ®¸Ê

Loading..

Please wait....

±¹³» ³í¹®Áö

Ȩ Ȩ > ¿¬±¸¹®Çå > ±¹³» ³í¹®Áö > Çѱ¹Á¤º¸°úÇÐȸ ³í¹®Áö > Á¤º¸°úÇÐȸ ³í¹®Áö C : ÄÄÇ»ÆÃÀÇ ½ÇÁ¦

Á¤º¸°úÇÐȸ ³í¹®Áö C : ÄÄÇ»ÆÃÀÇ ½ÇÁ¦

Current Result Document : 3 / 3

ÇѱÛÁ¦¸ñ(Korean Title) ´ë¿ë·® ¹®¼­ ÁýÇÕ¿¡¼­ À¯»ç ¹®¼­ Ž»öÀ» À§ÇÑ È¿°úÀûÀÎ Àüó¸® ½Ã½ºÅÛÀÇ ¼³°è
¿µ¹®Á¦¸ñ(English Title) An Efficient Preprocessing System for Searching Similar Texts among Massive Document Repository
ÀúÀÚ(Author) ¹Ú¼±¿µ   ±èÁöÈÆ   ±è¼±¿µ   ±èÇüÁØ   Á¶È¯±Ô   Sun-Young Park   Jihun Kim   SeonYeong Kim   HyungJoon Kim   Hwan-Gue Cho  
¿ø¹®¼ö·Ïó(Citation) VOL 16 NO. 05 PP. 0626 ~ 0630 (2010. 05)
Çѱ۳»¿ë
(Korean Abstract)
ÃÖ±Ù ¹®¼­ Ç¥ÀýÀÌ »çȸÀû À̽´°¡ µÇ¸é¼­ ¹®¼­°£ À¯»çµµ¸¦ °Ë»çÇÏ´Â ½Ã½ºÅÛÀÇ Çʿ伺ÀÌ ´ëµÎµÇ¾ú´Ù. ÀÌ¿¡ µû¶ó ¹®¼­ À¯»çµµ °Ë»ç ½Ã½ºÅÛ¿¡¼­ÀÇ Áß¿äÇÑ ¿ä¼ÒÀÎ °Ë»ç ¼Óµµ¿Í Á¤È®µµ¸¦ ÃæÁ·½ÃÅ°±â À§ÇÑ ¿¬±¸°¡ ÁøÇàµÇ°í ÀÖ´Ù. º» ³í¹®¿¡¼­´Â À¯»ç ¹®¼­ Ž»ö ½Ã½ºÅÛ¿¡¼­ÀÇ ¼º´ÉÀ» Çâ»ó½ÃÅ°±â À§ÇØ Àü¿ª »çÀüÀ̶ó´Â ¸ðµ¨À» »ç¿ëÇÑ Àüó¸® ¹æ¹ýÀ» Á¦½ÃÇÑ´Ù. Àü¿ª »çÀüÀ̶õ Ž»ö ´ë»ó ¹®¼­±º¿¡¼­ »ç¿ëµÈ ¸ðµç ´Ü¾îÀÇ Á¤º¸¸¦ Æ÷ÇÔÇÑ °ÍÀ¸·Î, À¯»çÇÑ ¹®¼­°¡ ¾î´À ¹®¼­ÀÎÁö ºü¸£°Ô ÆľÇÇÏ´Â µ¥¿¡ »ç¿ëÇÑ´Ù. ½Ã½ºÅÛ¿¡¼­ ÀÌ ¸ðµ¨À» Àû¿ëÇÏ´Â ¹æ¹ý¿¡ ´ëÇØ ±â¼úÇÏ°í, ½ÇÇèÀ» ÅëÇØ °¢ ¹æ¹ýÀÇ Àüó¸® ¼º´ÉÀ» ºÐ¼®ÇÏ¿© ÃÖÀûÈ­µÈ ¹®¼­ Àüó¸® ¹æ¹ýÀ» ã¾Æ³½´Ù. °á°úÀûÀ¸·Î °Ë»ç ´ë»ó ¹®¼­°¡ 20,000°Ç ÀÌ»óÀÎ °æ¿ì¿¡µµ °Ë»ç ´ë»ó ¹®¼­ÀÇ °³¼ö¸¦ 50°³ ÀÌÇϷΠȹ±âÀûÀ¸·Î ÁÙ¿©¼­ Àüü ½Ã½ºÅÛÀÇ ¼º´ÉÀ» Å©°Ô Çâ»ó½Ãų ¼ö ÀÖ´Ù´Â °ÍÀ» ¾Ë ¼ö ÀÖ¾ú´Ù.
¿µ¹®³»¿ë
(English Abstract)
Since the paper plagiarism has become one of important social issues, it is necessary to develop system for measuring the similarity between papers. The speed and accuracy of the system are very important features. So many researchers are studying the features. In this paper, we propose a preprocessing method using 'Global Dictionary' model to enhance performance of the system. The global dictionary includes information of all words in the document repository. The system uses the model to find similar papers with low computing time. Finally our experiment showed that a set of more than 20,000 documents could be reduced to about 50 documents drastically by our filtering techniques, which proves the excellence of our system.
Å°¿öµå(Keyword) Ç¥Àý   À¯»ç ¹®¼­   Àü󸮠  Àü¿ª »çÀü   Plagiarism   Similar Document   Global Dictionary   Preprocessing  
ÆÄÀÏ÷ºÎ PDF ´Ù¿î·Îµå