Region-aware Arbitrary-shaped Text Detection with Progressive Fusion

Abstract

Segmentation-based text detectors are flexible to capture arbitrary-shaped text regions. Due to large geometry variance, it is necessary to construct effective and robust representations to identify text regions with various shapes and scales. In this paper, we focus on designing effective multi-scale contextual features for locating text instances. Specially, we develop a Region Context Module (RCM) to summarize the semantic response and adaptively extract text-region-aware information in a limited local area. To construct complementary multi-scale contextual representations, multiple RCM branches with different scales are employed and integrated via Progressive Fusion Module (PFM). Our proposed RCM and PFM serve as the plug-and-play modules which can be incorporated into existing scene text detection platforms to further boost detection performance. Extensive experiments show that our methods achieve state-of-the-art performances on Total-Text, SCUT-CTW1500 and MSRA-TD500 datasets.

Published at: IEEE Transactions on Multimedia (TMM), 2022.

Paper

Bibtex

@article{wang2022region,
 title={Region-aware Arbitrary-shaped Text Detection with Progressive Fusion},
 author={Wang, Qitong and Fu, Bin and Li, Ming and He, Junjun and Peng, Xi and Qiao, Yu},
 journal={IEEE Transactions on Multimedia},
 year={2022},
 publisher={IEEE}
}