• Àüü
  • ÀüÀÚ/Àü±â
  • Åë½Å
  • ÄÄÇ»ÅÍ
´Ý±â

»çÀÌÆ®¸Ê

Loading..

Please wait....

Çмú´ëȸ ÇÁ·Î½Ãµù

Ȩ Ȩ > ¿¬±¸¹®Çå > Çмú´ëȸ ÇÁ·Î½Ãµù > Çѱ¹Á¤º¸°úÇÐȸ Çмú´ëȸ > 2019³â ÄÄÇ»ÅÍÁ¾ÇÕÇмú´ëȸ

2019³â ÄÄÇ»ÅÍÁ¾ÇÕÇмú´ëȸ

Current Result Document :

ÇѱÛÁ¦¸ñ(Korean Title) ¸ÖƼ ½ºÄÉÀÏ Ç»Àü ³×Æ®¿öÅ©¸¦ ÀÌ¿ëÇÑ ¹®¼­¿µ»ó ÀÌÁøÈ­
¿µ¹®Á¦¸ñ(English Title) Document Image Binarization with Multi-scale Fusion Network
ÀúÀÚ(Author) Quang-Vinh Dang   ÀÌ±Í»ó   Guee-Sang Lee  
¿ø¹®¼ö·Ïó(Citation) VOL 46 NO. 01 PP. 0820 ~ 0822 (2019. 06)
Çѱ۳»¿ë
(Korean Abstract)
¿µ¹®³»¿ë
(English Abstract)
Binarization of degraded document images is an important pre-processing step for document image analysis domain. We develop a ladder net with a multi scale structure to learn text-like features from document images itself to classify text and background from degraded document images. Specifically, we consider two properly designed LadderNet architectures: one with deeper architectures, another with shallower architecture. Each structure is trained independently using document image patches. The target of our design is to predict the foreground maps at two different feature levels. Predicted maps have fewer noises in the background from the shallower architecture with larger size of striding window. However, the detail of the text is not clear. On the other hand, predicted maps generated from the deeper architecture with smaller size of striding window have clearer text strokes but contain more background noises. Therefore, a better result is achieved by combining the outputs of two these architectures. The proposed approach achieves state-of-the-art results on DIBCO datasets, revealing the robustness of the presented method.
Å°¿öµå(Keyword)
ÆÄÀÏ÷ºÎ PDF ´Ù¿î·Îµå