Text localization in natural images using stroke feature transform and text covariance descriptors

Abstract

In this paper, we present a new approach for text localization in natural images, by discriminating text and non-text regions at three levels: pixel, component, and text-line levels. Firstly, a powerful low-level filter called the Stroke Feature Transform (SFT) is proposed, which extends the widely used Stroke Width Transform (SWT) by incorporating color cues of text pixels, leading to significantly enhanced performance on inter-component separation and intra-component connection. Secondly, based on the output of SFT, we apply two classifiers, a text-component classifier and a text-line classifier, sequentially to extract text regions, eliminating the heuristic procedures that are commonly used in previous approaches. The two classifiers are built upon two novel Text Covariance Descriptors (TCDs) that encode both the heuristic properties and the statistical characteristics of text strokes. Finally, text regions are located by simply thresholding the text-line confidence map. Our method was evaluated on two benchmark datasets, ICDAR 2005 and ICDAR 2011, and the corresponding F-measure values are 0.72 and 0.73, respectively, surpassing previous methods in accuracy by a large margin. © 2013 IEEE.
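
The final step described in the abstract, thresholding a text-line confidence map, can be illustrated with a minimal sketch. This is not the authors' implementation: the function name `threshold_regions`, the threshold value of 0.5, the use of 4-connectivity to group pixels, and the toy confidence map are all assumptions made for illustration, since the abstract does not specify how regions are delineated after thresholding.

```python
from collections import deque


def threshold_regions(conf_map, threshold=0.5):
    """Return bounding boxes (top, left, bottom, right) of 4-connected
    regions whose confidence is >= threshold.

    Hypothetical helper: the threshold and the connectivity rule are
    illustrative assumptions, not values taken from the paper.
    """
    rows = len(conf_map)
    cols = len(conf_map[0]) if rows else 0
    seen = [[False] * cols for _ in range(rows)]
    boxes = []
    for r in range(rows):
        for c in range(cols):
            if seen[r][c] or conf_map[r][c] < threshold:
                continue
            # Breadth-first flood fill over one above-threshold region.
            queue = deque([(r, c)])
            seen[r][c] = True
            top, left, bottom, right = r, c, r, c
            while queue:
                y, x = queue.popleft()
                top, bottom = min(top, y), max(bottom, y)
                left, right = min(left, x), max(right, x)
                for ny, nx in ((y - 1, x), (y + 1, x), (y, x - 1), (y, x + 1)):
                    if (0 <= ny < rows and 0 <= nx < cols
                            and not seen[ny][nx]
                            and conf_map[ny][nx] >= threshold):
                        seen[ny][nx] = True
                        queue.append((ny, nx))
            boxes.append((top, left, bottom, right))
    return boxes


# Toy confidence map (made-up values, not data from the paper).
toy_map = [
    [0.1, 0.9, 0.8, 0.1],
    [0.1, 0.2, 0.1, 0.1],
    [0.7, 0.1, 0.1, 0.6],
]
boxes = threshold_regions(toy_map, threshold=0.5)
print(boxes)  # [(0, 1, 0, 2), (2, 0, 2, 0), (2, 3, 2, 3)]
```

Raising the threshold trades recall for precision: fewer pixels survive, so fewer and tighter boxes are returned. How the paper's F-measure responds to that choice is not stated in the abstract.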

Huang, W., Lin, Z., Yang, J., & Wang, J. (2013). Text localization in natural images using stroke feature transform and text covariance descriptors. In Proceedings of the IEEE International Conference on Computer Vision (pp. 1241–1248). Institute of Electrical and Electronics Engineers Inc. https://doi.org/10.1109/ICCV.2013.157