• Àüü
  • ÀüÀÚ/Àü±â
  • Åë½Å
  • ÄÄÇ»ÅÍ
´Ý±â

»çÀÌÆ®¸Ê

Loading..

Please wait....

±¹³» ³í¹®Áö

Ȩ Ȩ > ¿¬±¸¹®Çå > ±¹³» ³í¹®Áö > Çѱ¹ÀÎÅͳÝÁ¤º¸ÇÐȸ ³í¹®Áö

Çѱ¹ÀÎÅͳÝÁ¤º¸ÇÐȸ ³í¹®Áö

Current Result Document :

ÇѱÛÁ¦¸ñ(Korean Title) ´ë¿ë·® µ¥ÀÌÅÍ ºÐ¼®À» À§ÇÑ ¸Ê¸®µà½º ±â¹ÝÀÇ ÀÌ»óÄ¡ ŽÁö
¿µ¹®Á¦¸ñ(English Title) Outlier Detection Based on MapReduce for Analyzing Big Data
ÀúÀÚ(Author) È«¿¹Áø   ³ªÀºÈñ   Á¤¿ëȯ   ±è¾ç¿ì   Yejin Hong   Eunhee Na   Yonghwan Jung   Yangwoo Kim  
¿ø¹®¼ö·Ïó(Citation) VOL 18 NO. 01 PP. 0027 ~ 0035 (2017. 02)
Çѱ۳»¿ë
(Korean Abstract)
°¡±î¿î ¹Ì·¡¿¡´Â ºòµ¥ÀÌÅÍÀÇ ¸¹Àº ºÎºÐÀ» IoT µ¥ÀÌÅÍ°¡ Â÷ÁöÇÒ °ÍÀ̶ó´Â Àü¸ÁÀÌ ³ª¿À°í ÀÖ´Ù. ±×¿¡ µû¶ó, IoT µ¥ÀÌÅÍÀÇ ¸¹Àº ºÎºÐÀ» Â÷Ä¡ÇÏ´Â ¼¾¼­ µ¥ÀÌÅÍ¿¡ °üÇÑ °ü½É°ú ¿¬±¸ ¶ÇÇÑ È°¹ßÇÏ°Ô ÁøÇàµÇ°í ÀÖ´Ù. ¿©·¯ ºÐ¾ß¿¡¼­ È°¿ëµÇ°í ÀÖ´Â ¼¾¼­ µ¥ÀÌÅÍ´Â ºÐ¼®ÇÒ ¶§ ½ÇÁ¦¿Í´Â ´Ù¸¥ °ªÀÎ ÀÌ»óÄ¡¸¦ Æ÷ÇÔÇÏ°Ô µÇ¸é Á¤È®ÇÑ ºÐ¼®ÀÌ ¾î·Á¿ì¸ç, ¿Ö°îµÈ °á°ú°¡ µµÃâµÇ¾î È°¿ëÇÒ ¼ö ¾ø´Â °æ¿ì°¡ »ý±ä´Ù. µû¶ó¼­ º» ³í¹®¿¡¼­´Â Á¤È®ÇÑ °á°ú¸¦ µµÃâÇϱâ À§ÇØ ¼öÁýµÈ ¿øÀڷḦ ºÐ¼®Çϱâ Àü¿¡ ÀÌ»óÄ¡ ŽÁö ¹× Á¦°Å¸¦ ÇÏ¿´´Ù. ¶ÇÇÑ, Á¡Á¡ ´Ã¾î³ª°íÀÖ´Â ´ë¿ë·®ÀÇ µ¥ÀÌÅ͸¦ ºü¸£°Ô ó¸®Çϱâ À§ÇØ ¸Þ¸ð¸® Á¢±Ù ¹æ½ÄÀÎ ½ºÆÄÅ©¸¦ »ç¿ëÇÑ ºÐ»êó¸® ȯ°æ¿¡¼­ ó¸®ÇÏ¿´´Ù. ¸Ê¸®µà½º ±â¹ÝÀÇ ÀÌ»óÄ¡ ŽÁö ¹× Á¦°Å´Â ÃÑ 4´Ü°è·Î ³ª´©¾î ±¸ÇöÇÏ¿´À¸¸ç, °¢ ´Ü°è¸¦ ¸ÅÆÛ¿Í ¸®µà½º·Î ±¸ÇöÇÏ¿´´Ù. Á¦¾ÈÇÑ ±â¹ýÀÇ Æò°¡¸¦ À§Çؼ­ 3°¡Áö ȯ°æ¿¡¼­ ºñ±³ÇÏ¿´À¸¸ç, ±× °á°ú ÀÌ»óÄ¡ ŽÁö ¹× Á¦°Å¸¦ ÇÏ°íÀÚ ÇÏ´Â µ¥ÀÌÅÍÀÇ ¿ë·®ÀÌ Ä¿Áú¼ö·Ï ½ºÆÄÅ©¸¦ ÀÌ¿ëÇÑ ºÐ»êó¸® ȯ°æ¿¡¼­ÀÇ Ã³¸®°¡ °¡Àå ºü¸£´Ù´Â °á°ú¸¦ ¾ò¾ú´Ù.
¿µ¹®³»¿ë
(English Abstract)
In near future, IoT data is expected to be a major portion of Big Data. Moreover, sensor data is expected to be major portion of IoT data, and its¡¯ research is actively carried out currently. However, processed results may not be trusted and used if outlier data is included in the processing of sensor data. Therefore, method for detection and deletion of those outlier data before processing is studied in this paper. Moreover, we used Spark which is memory based distributed processing environment for fast processing of big sensor data. The detection and deletion of outlier data consist of four stages, and each stage is implemented with Mapper and Reducer operation. The proposed method is compared in three different processing environments, and it is expected that the outlier detection and deletion performance is best in the distributed Spark environment as data volume is increasing.
Å°¿öµå(Keyword) ºòµ¥ÀÌÅÍ   ÀÌ»óÄ¡   ¸Ê¸®µà½º   ºÐ»ê󸮠  ½ºÆÄÅ©   Big Data   Outlier   MapReduce   Distributed Processing   Spark  
ÆÄÀÏ÷ºÎ PDF ´Ù¿î·Îµå