A Nom historical document recognition system for digital archiving

设为首页

收藏本站

网站地图 | English | 公务邮箱

远程访问

NSTL服务站

A Nom historical document recognition system for digital archiving

详细信息查看全文

作者：Truyen Van Phan ; Kha Cong Nguyen…
关键词：Nom script ; Historical documents ; Text digitization ; Off ; line character recognition ; Binarization ; Character segmentation ; Recursive X–Y cut ; Area Voronoi diagram ; Document image analysis
刊名：International Journal on Document Analysis and Recognition
出版年：2016
出版时间：March 2016
年：2016
卷：19
期：1
页码：49-64
全文大小：2,712 KB
参考文献：1.Kim, M.S., Jang, M.D., Choi, H.I., Rhee, T.H., Kim, J.H., Kwag, H.K.: Digitalizing scheme of handwritten Hanja historical documents. In: Proceedings of the 1st International Workshop on Document Image Analysis for Libraries, USA, pp. 321–327, Jan. 2004
2.Shih, V.J., Chu, T.L.: The Han Nom Digital Library. In: The International Nom Conference, The National Library of Vietnam, Hanoi, pp. 12–14, Nov. 2004
3.Phan, T.V., Zhu, B., Nakagawa, M.: Development of Nom character segmentation for collecting patterns from historical document pages. In: Proceedings of 1st International Workshop on Historical Document Imaging and Processing, China, pp. 133–139, Sep. 2011
4.Phan, T.V., Zhu, B., Nakagawa, M.: Collecting handwritten Nom character patterns from historical document pages. In: Proceedings of 10th IAPR International Workshop on Document Analysis Systems, Australia, pp. 344–348, Mar. 2012
5.Su, B., Lu, S., Tan, C.L.: Binarization of historical handwritten document images using local maximum and minimum filter. In: Proceedings of the 9th IAPR International Workshop on Document Analysis Systems, USA, pp. 159–165, Jun. 2010
6.Otsu, N.: A threshold selection method from gray-level histograms. IEEE Trans. Syst. Man Cybern. 9(1), 62–66 (1979)
7.Kittler, J., Illingworth, J.: Threshold selection based on a simple image statistics. Comput. Vis. Graphics Image Process. 30, 125–147 (1985)CrossRef
8.Schindelin, J., Arganda-Carreras, I., Frise, E., Kaynig, V., Longair, M., Pietzsch, T., Cardona, A.: Fiji: an open-source platform for biological-image analysis. Nat. Methods. 9(7), 676–682 (2012)CrossRef
9.Tsukumo, J., Tanaka, H.: Classification of handprinted Chinese characters using non-linear normalization and correlation methods. In: Proceedings of the 9th International Conference on Pattern Recognition, Italy, pp. 168–171 (1988)
10.Liu, C.L.: Normalization-cooperated gradient feature extraction for handwritten character recognition. Pattern Anal. Mach. Intell. IEEE Trans. 29(8), 1465–1469 (2007)CrossRef
11.Kawamura, A., Yura, K., Hayama, T., Hidai, Y., Minamikawa, T., Tanaka, A., Masuda, S.: Online recognition of freely handwritten Japanese characters using directional feature densities. In: Proceedings of the 11th International Conference on Pattern Recognition, Netherlands, 2, pp. 183–186 (1992)
12.Fukunaga, K.: Introduction to Statistical Pattern Recognition, 2nd edn. Academic Press, San Diego (1990)MATH
13.Kimura, F., Takashina, K., Tsuruoka, S., Miyake, Y.: Modified quadratic discriminant functions and the application to Chinese character recognition. IEEE Trans. PAMI 9(1), pp. 149–153 (1987)
14.Kohonen, T., Hynninen, J., Kangas, J., Laaksonen, J., Torkkola, K.: LVQ PAK: The learning vector quantization program package. In: Technical Report, Laboratory of Computer and Information Science Rakentajanaukio 2(C), pp. 1991–1992 (1996)
15.Sato, A., Yamada, K.: Generalized learning vector quantization. In: Proceedings of the 1995 Conference on Advances in Neural Information Processing Systems, vol 8, pp 423–429. MIT Press, Cambridge, USA (1996)
16.Juang, B.-H., Katagiri, S.: Discriminative learning for minimum error classification. Signal Process. IEEE Trans. 40(12), 3043–3054 (1992)CrossRef MATH
17.Liu, C.L., Nakagawa, M.: Evaluation of prototype learning algorithms for nearest-neighbor classifier in application to handwritten character recognition. Pattern Recognit. 34(3), 601–615 (2001)CrossRef MATH
18.Fukumoto, T., Wakabayashi, T., Kimura, F., Miyake, Y.: Accuracy improvement of handwritten character recognition by GLVQ. In: Proceedings of the 7th International Workshop on Frontiers in handwriting recognition, pp. 687–692. The Netherlands (2000)
19.Bentley, J.L.: Multidimensional binary search trees used for associative searching. Commun. ACM 18(9), 509–517 (1975)CrossRef MathSciNet MATH
20.Phan, T.V., Nakagawa, M., Baba, H., Watanabe, A.: MokkAnnotator - A system for archiving Mokkan images. In: Proceedings of the 16th Biennial Conference of the International Graphonomics Society, Japan, pp. 54–57, Jun. 2013
21.Nakagawa, M., Matsumoto, K.: Collection of on-line handwritten Japanese character pattern databases and their analysis. Doc. Anal. Recognit. 7(1), 69–81 (2004)
22.Chen, B., Zhu, B., Nakagawa, M.: Effects of generating a large amount of artificial patterns for on-line handwritten Japanese character recognition. In: Proceedings of the 11th International Conference on Document Analysis and Recognition, China, pp. 663–667, Sep. 2011
23.Leung, K.C., Leung, C.H.: Recognition of handwritten Chinese characters by combining regularization, Fisher’s discriminant and transformation sample generation. In: Proceedings of the 10th International Conference of Document Analysis and Recognition, Spain, pp. 1026–1030 (2009)
作者单位：Truyen Van Phan (1)
Kha Cong Nguyen (1)
Masaki Nakagawa (1)

1. Department of Information and Communication Engineering, Tokyo University of Agriculture and Technology, Tokyo, 184-8588, Japan
刊物类别：Computer Science
刊物主题：Image Processing and Computer Vision
Pattern Recognition
出版者：Springer Berlin / Heidelberg
ISSN：1433-2825

文摘

A Nom historical document recognition system is being developed for digital archiving that uses image binarization, character segmentation, and character recognition. It incorporates two versions of off-line character recognition: one for automatic recognition of scanned and segmented character patterns (7660 categories) and the other for user handwritten input (32,695 categories). This separation is used since including less frequently appearing categories in automatic recognition increases the misrecognition rate without reliable statistics on the Nom language. Moreover, a user must be able to check the results and identify the correct categories from an extended set of categories, and a user can input characters by hand. Both versions use the same recognition method, but they are trained using different sets of training patterns. Recursive X–Y cut and Voronoi diagrams are used for segmentation; k–d tree and generalized learning vector quantization are used for coarse classification; and the modified quadratic discriminant function is used for fine classification. The system provides an interface through which a user can check the results, change binarization methods, rectify segmentation, and input correct character categories by hand. Evaluation done using a limited number of Nom historical documents after providing ground truths for them showed that the two stages of recognition along with user checking and correction improved the recognition results significantly.

地址：北京市海淀区学院路29号邮编：100083

电话：办公室：(+86 10)66554848；文献借阅、咨询服务、科技查新：66554700