Accurate Fine-Grained Layout Analysis for the Historical Tibetan Document Based on the Instance Segmentation

9Citations
Citations of this article
8Readers
Mendeley users who have this article in their library.

This article is free to access.

Abstract

Accurate layout analysis without subsequent text-line segmentation remains an ongoing challenge, especially when facing the Kangyur, a kind of historical Tibetan document featuring considerable touching components and mottled background. Aiming at identifying different regions in document images, layout analysis is indispensable for subsequent procedures such as character recognition. However, there was only a little research being carried out to perform line-level layout analysis which failed to deal with the Kangyur. To obtain the optimal results, a fine-grained sub-line level layout analysis approach is presented. Firstly, we introduced an accelerated method to build the dataset which is dynamic and reliable. Secondly, enhancement had been made to the SOLOv2 according to the characteristics of the Kangyur. Then, we fed the enhanced SOLOv2 with the prepared annotation file during the training phase. Once the network is trained, instances of the text line, sentence, and titles can be segmented and identified during the inference stage. The experimental results show that the proposed method delivers a decent 72.7% average precision on our dataset. In general, this preliminary research provides insights into the fine-grained sub-line level layout analysis and testifies the SOLOv2-based approaches. We also believe that the proposed methods can be adopted on other language documents with various layouts.

References Powered by Scopus

Deep residual learning for image recognition

173989Citations
N/AReaders
Get full text

Fully convolutional networks for semantic segmentation

24678Citations
N/AReaders
Get full text

Mask R-CNN

20248Citations
N/AReaders
Get full text

Cited by Powered by Scopus

Character Detection and Segmentation of Historical Uchen Tibetan Documents in Complex Situations

11Citations
N/AReaders
Get full text

Automatic damage identification of Sanskrit palm leaf manuscripts with SegFormer

4Citations
N/AReaders
Get full text

Historical Text Line Segmentation Using Deep Learning Algorithms: Mask-RCNN against U-Net Networks

2Citations
N/AReaders
Get full text

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Cite

CITATION STYLE

APA

Zhao, P., Wang, W., Cai, Z., Zhang, G., & Lu, Y. (2021). Accurate Fine-Grained Layout Analysis for the Historical Tibetan Document Based on the Instance Segmentation. IEEE Access, 9, 154435–154447. https://doi.org/10.1109/ACCESS.2021.3128536

Readers over time

‘21‘22‘23‘2402468

Readers' Seniority

Tooltip

PhD / Post grad / Masters / Doc 1

50%

Researcher 1

50%

Readers' Discipline

Tooltip

Computer Science 2

50%

Agricultural and Biological Sciences 1

25%

Engineering 1

25%

Save time finding and organizing research with Mendeley

Sign up for free
0