• Àüü
  • ÀüÀÚ/Àü±â
  • Åë½Å
  • ÄÄÇ»ÅÍ
´Ý±â

»çÀÌÆ®¸Ê

Loading..

Please wait....

±¹³» ³í¹®Áö

Ȩ Ȩ > ¿¬±¸¹®Çå > ±¹³» ³í¹®Áö > Çѱ¹ÀÎÅͳÝÁ¤º¸ÇÐȸ ³í¹®Áö

Çѱ¹ÀÎÅͳÝÁ¤º¸ÇÐȸ ³í¹®Áö

Current Result Document : 8 / 8 ÀÌÀü°Ç ÀÌÀü°Ç

ÇѱÛÁ¦¸ñ(Korean Title) ·£µå¸¶Å© À©µµ¿ì ±â¹ÝÀÇ ºó¹ß ÆÐÅÏ ¸¶ÀÌ´× ±â¹ýÀÇ ºÐ¼® ¹× ¼º´ÉÆò°¡
¿µ¹®Á¦¸ñ(English Title) Analysis and Evaluation of Frequent Pattern Mining Technique based on Landmark Window
ÀúÀÚ(Author) Æí±¤¹ü   À±ÀºÀÏ   Gwangbum Pyun   Unil Yun  
¿ø¹®¼ö·Ïó(Citation) VOL 15 NO. 03 PP. 0101 ~ 0107 (2014. 06)
Çѱ۳»¿ë
(Korean Abstract)
º» ³í¹®¿¡¼­´Â ·£µå¸¶Å© À©µµ¿ì ±â¹ÝÀÇ ºó¹ß ÆÐÅÏ ¸¶ÀÌ´× ±â¹ýÀ» ºÐ¼®ÇÏ°í ¼º´ÉÀ» Æò°¡ÇÑ´Ù. º» ³í¹®¿¡¼­´Â Lossy counting ¾Ë°í¸®Áò°ú hMiner ¾Ë°í¸®Áò¿¡ ´ëÇÑ ºÐ¼®À» ÁøÇàÇÑ´Ù. ÃÖ½ÅÀÇ ·£µå¸¶Å© ¾Ë°í¸®ÁòÀÎ hMiner´Â Æ®·£Àè¼ÇÀÌ ¹ß»ýÇÒ ¶§ ¸¶´Ù ºó¹ß ÆÐÅÏÀ» ¸¶ÀÌ´× ÇÏ´Â ¹æ¹ýÀÌ´Ù. ±×·¡¼­ hMiner¿Í °°Àº ·£µå¸¶Å© ±â¹ÝÀÇ ºó¹ß ÆÐÅÏ ¸¶ÀÌ´×À» ¿Â¶óÀÎ ¸¶ÀÌ´×À̶ó°í ÇÑ´Ù. º» ³í¹®¿¡¼­´Â ·£µå¸¶Å© À©µµ¿ì ¸¶ÀÌ´×ÀÇ ÃÊ±â ¾Ë°í¸®ÁòÀÎ Lossy counting¿Í ÃֽŠ¾Ë°í¸®ÁòÀÎ hMinerÀÇ ¼º´ÉÀ» Æò°¡ÇÏ°í ºÐ¼®ÇÑ´Ù. ¿ì¸®´Â ¼º´ÉÆò°¡ÀÇ Ã´µµ·Î ¸¶ÀÌ´× ½Ã°£°ú Æ®·£Àè¼Ç ´ç Æò±Õ ó¸® ½Ã°£À» Æò°¡ÇÑ´Ù. ±×¸®°í ¿ì¸®´Â ÀúÀå ±¸Á¶ÀÇ È¿À²¼ºÀ» Æò°¡Çϱâ À§ÇÏ¿© ÃÖ´ë ¸Þ¸ð¸® »ç¿ë·®À» Æò°¡ÇÑ´Ù. ¸¶Áö¸·À¸·Î ¿ì¸®´Â ¾Ë°í¸®ÁòÀÌ ¾ÈÁ¤ÀûÀ¸·Î ¸¶ÀÌ´×ÀÌ °¡´ÉÇÑÁö Æò°¡Çϱâ À§ÇØ µ¥ÀÌÅͺ£À̽ºÀÇ ¾ÆÀÌÅÛ ¼ö¸¦ º¯È­½ÃÅ°¸é¼­ Æò°¡ÇÏ´Â È®À强 Æò°¡¸¦ ¼öÇàÇÑ´Ù. µÎ ¾Ë°í¸®ÁòÀÇ Æò°¡ °á°ú·Î, ·£µå¸¶Å© À©µµ¿ì ±â¹ÝÀÇ ºó¹ß ÆÐÅÏ ¸¶ÀÌ´×Àº ½Ç½Ã°£ ½Ã½ºÅÛ¿¡ ÀûÇÕÇÑ ¸¶ÀÌ´× ¹æ½ÄÀ» °¡Áö°í ÀÖÁö¸¸ ¸Þ¸ð¸®¸¦ ¸¹ÀÌ »ç¿ëÇß´Ù.
¿µ¹®³»¿ë
(English Abstract)
With the development of online service, recent forms of databases have been changed from static database structures to dynamic stream database structures. Previous data mining techniques have been used as tools of decision making such as establishment of marketing strategies and DNA analyses. However, the capability to analyze real-time data more quickly is necessary in the recent interesting areas such as sensor network, robotics, and artificial intelligence. Landmark window-based frequent pattern mining, one of the stream mining approaches, performs mining operations with respect to parts of databases or each transaction of them, instead of all the data. In this paper, we analyze and evaluate the techniques of the well-known landmark window-based frequent pattern mining algorithms, called Lossy counting and hMiner. When Lossy counting mines frequent patterns from a set of new transactions, it performs union operations between the previous and current mining results. hMiner, which is a state-of-the-art algorithm based on the landmark window model, conducts mining operations whenever a new transaction occurs. Since hMiner extracts frequent patterns as soon as a new transaction is entered, we can obtain the latest mining results reflecting real-time information. For this reason, such algorithms are also called online mining approaches. We evaluate and compare the performance of the primitive algorithm, Lossy counting and the latest one, hMiner. As the criteria of our performance analysis, we first consider algorithms¡¯ total runtime and average processing time per transaction. In addition, to compare the efficiency of storage structures between them, their maximum memory usage is also evaluated. Lastly, we show how stably the two algorithms conduct their mining works with respect to the databases that feature gradually increasing items. With respect to the evaluation results of mining time and transaction processing, hMiner has higher speed than that of Lossy counting. Since hMiner stores candidate frequent patterns in a hash method, it can directly access candidate frequent patterns. Meanwhile, Lossy counting stores them in a lattice manner; thus, it has to search for multiple nodes in order to access the candidate frequent patterns. On the other hand, hMiner shows worse performance than that of Lossy counting in terms of maximum memory usage. hMiner should have all of the information for candidate frequent patterns to store them to hash¡¯s buckets, while Lossy counting stores them, reducing their information by using the lattice method. Since the storage of Lossy counting can share items concurrently included in multiple patterns, its memory usage is more efficient than that of hMiner. However, hMiner presents better efficiency than that of Lossy counting with respect to scalability evaluation due to the following reasons. If the number of items is increased, shared items are decreased in contrast; thereby, Lossy counting¡¯s memory efficiency is weakened. Furthermore, if the number of transactions becomes higher, its pruning effect becomes worse. From the experimental results, we can determine that the landmark window-based frequent pattern mining algorithms are suitable for real-time systems although they require a significant amount of memory. Hence, we need to improve their data structures more efficiently in order to utilize them additionally in resource-constrained environments such as WSN(Wireless sensor network).
Å°¿öµå(Keyword) ·£µå¸¶Å© À©µµ¿ì   ºó¹ß ÆÐÅÏ ¸¶ÀÌ´×   ¿Â¶óÀÎ ¸¶ÀÌ´×   ¼º´ÉÆò°¡   È®À强   Landmark Window   Frequent pattern mining   Online mining   Performance evaluation   Scalability  
ÆÄÀÏ÷ºÎ PDF ´Ù¿î·Îµå