µ¥ÀÌÅͺ£À̽º ¿¬±¸È¸Áö(SIGDB)
Current Result Document :
ÇѱÛÁ¦¸ñ(Korean Title) |
±¹°¡R&Dº¸°í¼ÀÇ º¸Á¸ ¹× ¼ºñ½º Çâ»óÀ» À§ÇÑ XML ±â¹Ý ÄÜÅÙÃ÷ ÃßÃâ, º¯È¯ ½Ã½ºÅÛ °³¹ß |
¿µ¹®Á¦¸ñ(English Title) |
Development of the XML-based Contents Extraction and Conversion Systems for Enhancing Preservation and Service of National R&D Reports |
ÀúÀÚ(Author) |
ÃÖ±ÔÁø
Â÷½ÂÁØ
À̱Ôö
Gyu-Jin Choi
Seung-Jun Cha
Kyu-Chul Lee
|
¿ø¹®¼ö·Ïó(Citation) |
VOL 30 NO. 01 PP. 0051 ~ 0064 (2014. 04) |
Çѱ۳»¿ë (Korean Abstract) |
ÃÖ±Ù ±¹°¡R&D»ç¾÷¿¡ ´ëÇÑ ÅõÀÚºñ¿ëÀÌ Å©°Ô Áõ°¡ÇÔ¿¡ µû¶ó, ¿¬±¸°³¹ß ¼º°úÀÇ Ã¼°èÀûÀÎ °ü¸®°¡ ÇÊ¿äÇÏ´Ù. À̸¦ À§ÇØ ÇöÀç ±¹°¡Â÷¿ø¿¡¼ ±¹°¡R&Dº¸°í¼ Á¾ÇÕ°ü¸®½Ã½ºÅÛÀ» ¿î¿µÇÏ¿© PDF Çü½ÄÀÇ º¸°í¼ ¿ø¹®À» ¼öÁý ¹× °ü¸®ÇÑ´Ù. ÇÏÁö¸¸ ¿¬±¸°ü¸®Àü¹®±â°ü º°·Î º¸°í¼ÀÇ ÇüÅÂ¿Í ±âÁØÀÌ »óÀÌÇÏ¿© Ç¥ÁØÈµÈ µ¥ÀÌÅͺ£À̽º ±¸ÃàÀÌ ¾î·Æ°í, ÅؽºÆ® ±â¹ÝÀÇ °Ë»ö ¼ºñ½º¸¸À» Á¦°øÇϱ⠶§¹®¿¡ ¼öõ, ¼ö ¸¸°³ ÀÌ»óÀÇ °Ë»ö °á°úµé¿¡¼ »ç¿ëÀÚ°¡ ¿øÇÏ´Â ¹®¼¸¦ ´Ù½Ã ã¾Æ³»¾ß ÇÏ´Â ¹®Á¦Á¡À» °¡Áø´Ù. º» ³í¹®¿¡¼´Â ±¹°¡R&Dº¸°í¼ÀÇ º¸Á¸ ¹× ¼ºñ½º Çâ»óÀ» À§ÇÑ XML ±â¹Ý ÄÜÅÙÃ÷ ÃßÃâ ¹× º¯È¯½Ã½ºÅÛÀ» °³¹ßÇÏ¿´´Ù. À̸¦ À§ÇØ ´Ù¾çÇÑ º¸°í¼¸¦ ¼ö¿ëÇÒ ¼ö ÀÖ´Â XML ½ºÅ°¸¶¸¦ ¼³°èÇÏ°í, º¸°í¼¸¦ ½ºÅ°¸¶¿¡ ¸Â°Ô ±¸Á¶ÈÇÒ ¼ö ÀÖ´Â XML º¯È¯µµ±¸¸¦ °³¹ßÇÏ¿´´Ù. °³¹ßµÈ µµ±¸´Â ¸ÞŸµ¥ÀÌÅÍ ¹× º»¹® ÄÜÅÙÃ÷¸¦ ÀÚµ¿À¸·Î ÃßÃâÇÏ¿© ±¸Á¶È ÇÏ¿© ÀúÀåÇÑ´Ù. ¶ÇÇÑ Ç¥, ±×¸² À̹ÌÁö¸¦ ÃßÃâÇÏ´Â ±â´ÉÀ» °³¹ßÇÏ¿© º¸°í¼¿¡ ÀúÀåµÈ ºñÅؽºÆ® ÄÜÅÙÃ÷µµ ÃßÃâ ÀúÀåÇÑ´Ù. ÀúÀåµÈ XMLÀº ´Ù¾çÇÑ °Ë»ö ±â¹ýÀÇ Àû¿ëÀ» ÅëÇØ ´ë±¹¹Î ¼ºñ½º°¡ Çâ»óµÉ ¼ö ÀÖ´Ù.
|
¿µ¹®³»¿ë (English Abstract) |
Recently as the investment cost of the National R&D Project increases significantly, systematic managements of R&D results are required. The original reports in PDF format that are one of the results of the R&D project are collected and managed by the National R&D reports total management system. However, there are problems that it is different to build common database system as the types and criteria of reports are different from the institution and users determine intended results in more than hundreds or thousands of results as the system only provided text-based retrieval service. In this paper, we developed the XML-based contents extraction and conversion system for preservation and service of National R&D reports. First the XML schema is designed for adopting variety types of reports and XML conversion tool is developed for structuring reports. The metadata and contents of the report are structured and stored by the developed tool. Furthermore, non text contents such as figures or tables are also extracted. The nationwide service is enhanced through applying various retrieval methods to the system.
|
Å°¿öµå(Keyword) |
±¹°¡R&Dº¸°í¼
PDF
XML ½ºÅ°¸¶
¸ÞŸµ¥ÀÌÅÍ ÃßÃâ
º¯È¯µµ±¸
National R&D Report
PDF
XML Schema
Metadata extraction
Conversion System
|
ÆÄÀÏ÷ºÎ |
PDF ´Ù¿î·Îµå
|