research-article

Scene Text Detection and Tracking in Video with Background Cues

Authors:

Lan Wang,

Yang Wang,

Susu Shan,

Feng SuAuthors Info & Claims

ICMR '18: Proceedings of the 2018 ACM on International Conference on Multimedia Retrieval

Pages 160 - 168

https://doi.org/10.1145/3206025.3206051

Published: 05 June 2018 Publication History

Get Access

Abstract

To detect scene text in the video is valuable to many content-based video applications. In this paper, we present a novel scene text detection and tracking method for videos, which effectively exploits the cues of the background regions of the text. Specifically, we first extract text candidates and potential background regions of text from the video frame. Then, we exploit the spatial, shape and motional correlations between the text and its background region with a bipartite graph model and the random walk algorithm to refine the text candidates for improved accuracy. We also present an effective tracking framework for text in the video, making use of the temporal correlation of text cues across successive frames, which contributes to enhancing both the precision and the recall of the final text detection result. Experiments on public scene text video datasets demonstrate the state-of-the-art performance of the proposed method.

References

[1]

Katherine L. Bouman, Golnaz Abdollahian, Mireille Boutin, and Edward J. Delp. 2011. A Low Complexity Sign Detection and Text Localization Method for Mobile Applications. IEEE Transactions on Multimedia Vol. 13, 5 (Oct. . 2011), 922--934.

Digital Library

Google Scholar

[2]

Xiangrong Chen and Alan L. Yuille. 2004. Detecting and reading text in natural scenes. In 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR), Vol. Vol. 2. II-366-II-373 Vol.2. opersonChangsong Liu, and Xiaoqing Ding. 2013. A research on Video text tracking and recognition. Proceedings of SPIE Vol. 8664 (2013), 8664-8664-10.

Digital Library

Google Scholar

[3]

Kai Wang, Boris Babenko, and Serge Belongie. 2011. End-to-End Scene Text Recognition. In 2011 International Conference on Computer Vision. 1457--1464.

Digital Library

Google Scholar

[4]

Christian Wolf, Jean-Michel Jolion, and Francoise Chassaing. 2002. Text Localization, Enhancement and Binarization in Multimedia Documents 16th International Conference on Pattern Recognition, Vol. Vol. 2. 1037--1040 vol.2.

Google Scholar

[5]

Liang Wu, Palaiahnakote Shivakumara, Tong Lu, and Chew Lim Tan. 2015. A New Technique for Multi-Oriented Scene Text Line Detection and Tracking in Video. IEEE Transactions on Multimedia Vol. 17, 8 (Aug . 2015), 1137--1152.

Digital Library

Google Scholar

[6]

Hailiang Xu and Feng Su. 2015. Robust Seed Localization and Growing with Deep Convolutional Features for Scene Text Detection. In 2015 5th ACM International Conference on Multimedia Retrieval (ICMR 2015). 387--394.

Digital Library

Google Scholar

[7]

Chun Yang, Xu-Cheng Yin, Wei-Yi Pei, Shu Tian, Ze-Yu Zuo, Chao Zhu, and Junchi Yan. 2017. Tracking Based Multi-Orientation Scene Text Detection: A Unified Framework With Dynamic Programming. IEEE Transactions on Image Processing Vol. 26, 7 (July. 2017), 3235--3248.

Digital Library

Google Scholar

[8]

Xu-Cheng Yin, Xuwang Yin, Kaizhu Huang, and Hong-Wei Hao. 2014. Robust Text Detection in Natural Scene Images. IEEE Transactions on Pattern Analysis and Machine Intelligence Vol. 36, 5 (May. 2014), 970--983.

Google Scholar

[9]

Xu-Cheng Yin, Ze-Yu Zuo, Shu Tian, and Cheng-Lin Liu. 2016. Text Detection, Tracking and Recognition in Video: A Comprehensive Survey. IEEE Transactions on Image Processing Vol. 25, 6 (June. 2016), 2752--2773.

Digital Library

Google Scholar

[10]

Zheng Zhang, Wei Shen, Cong Yao, and Xiang Bai. 2015. Symmetry-based text line detection in natural scenes 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 2558--2567.

Google Scholar

[11]

Xu Zhao, Kai-Hsiang Lin, Yun Fu, Yuxiao Hu, Yuncai Liu, and Thomas S. Huang. 2011. Text From Corners: A Novel Approach to Detect Text and Caption in Videos. IEEE Transactions on Image Processing Vol. 20, 3 (March. 2011), 790--799.

Digital Library

Google Scholar

[12]

Ze-Yu Zuo, Shu Tian, Wei yi Pei, and Xu-Cheng Yin. 2015. Multi-strategy tracking based text detection in scene videos 2015 13th International Conference on Document Analysis and Recognition (ICDAR). 66--70.

Digital Library

Google Scholar

Cited By

View all

XIAO WLIANG LCHEN JWANG T(2024)VTD-FCENet: A Real-Time HD Video Text Detection with Scale-Aware Fourier Contour EmbeddingIEICE Transactions on Information and Systems10.1587/transinf.2023EDL8030E107.D:4(574-578)Online publication date: 1-Apr-2024
https://doi.org/10.1587/transinf.2023EDL8030
Naosekpam VSahu N(2024)Video text rediscovery: Predicting and tracking text across complex scenesComputational Intelligence10.1111/coin.1268640:3Online publication date: 18-Jun-2024
https://doi.org/10.1111/coin.12686
Zhou XWang CWang XLiu W(2024)Video text tracking with transformer-based local searchNeurocomputing10.1016/j.neucom.2024.128420(128420)Online publication date: Aug-2024
https://doi.org/10.1016/j.neucom.2024.128420
Show More Cited By

Index Terms

Scene Text Detection and Tracking in Video with Background Cues
1. Computing methodologies
  1. Artificial intelligence
    1. Computer vision
2. Information systems
  1. Information retrieval

Recommendations

A Robust Approach for Scene Text Detection and Tracking in Video
Advances in Multimedia Information Processing – PCM 2018
Abstract
The detection of scene text in videos is of great value in various content-based video applications such as video analysis and retrieval. In this paper, we present a robust scene text detection and tracking method for videos. We first propose an ...
Tracking Multiple Occluding People by Localizing on Multiple Scene Planes

Occlusion and lack of visibility in crowded and cluttered scenes make it difficult to track individual people correctly and consistently, particularly in a single view. We present a multi-view approach to solving this problem. In our approach we neither ...
Automatic Detection and Localization of Natural Scene Text in Video
ICPR '10: Proceedings of the 2010 20th International Conference on Pattern Recognition

Video scene text contains semantic information and thus can contribute significantly to video indexing and summarization. However, most of the previous approaches to detecting scene text from videos experience difficulties in handling texts with various ...

Comments

Information & Contributors

Information

Published In

ICMR '18: Proceedings of the 2018 ACM on International Conference on Multimedia Retrieval

June 2018

550 pages

ISBN:9781450350464

DOI:10.1145/3206025

Conference Chairs:
Kiyoharu Aizawa
The Univ. of Tokyo, Japan
,
Michael Lew
Leiden Univ., Netherlands
,
Shin'ichi Satoh
National Inst. of Informatics, Japan

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 05 June 2018

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Research-article

Funding Sources

Conference

ICMR '18

Sponsor:

SIGMM

ICMR '18: International Conference on Multimedia Retrieval

June 11 - 14, 2018

Yokohama, Japan

Acceptance Rates

ICMR '18 Paper Acceptance Rate 44 of 136 submissions, 32%;

Overall Acceptance Rate 254 of 830 submissions, 31%

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

26
Total Citations
View Citations
320
Total Downloads

Downloads (Last 12 months)15
Downloads (Last 6 weeks)0

Reflects downloads up to 26 Dec 2024

Other Metrics

View Author Metrics

Citations

Cited By

View all

XIAO WLIANG LCHEN JWANG T(2024)VTD-FCENet: A Real-Time HD Video Text Detection with Scale-Aware Fourier Contour EmbeddingIEICE Transactions on Information and Systems10.1587/transinf.2023EDL8030E107.D:4(574-578)Online publication date: 1-Apr-2024
https://doi.org/10.1587/transinf.2023EDL8030
Naosekpam VSahu N(2024)Video text rediscovery: Predicting and tracking text across complex scenesComputational Intelligence10.1111/coin.1268640:3Online publication date: 18-Jun-2024
https://doi.org/10.1111/coin.12686
Zhou XWang CWang XLiu W(2024)Video text tracking with transformer-based local searchNeurocomputing10.1016/j.neucom.2024.128420(128420)Online publication date: Aug-2024
https://doi.org/10.1016/j.neucom.2024.128420
Goyal SMotwani D(2024)A Study of Text Extraction Algorithms for Natural Scene ImagesSN Computer Science10.1007/s42979-024-03068-w5:6Online publication date: 29-Jul-2024
https://dl.acm.org/doi/10.1007/s42979-024-03068-w
Wu WCai YShen CZhang DFu YZhou HLuo P(2024)End-to-End Video Text Spotting with TransformerInternational Journal of Computer Vision10.1007/s11263-024-02063-1Online publication date: 12-Jul-2024
https://doi.org/10.1007/s11263-024-02063-1
Yu JQian JXin YWang CDong Y(2024)Swin transformer-based traffic video text trackingApplied Intelligence10.1007/s10489-024-05710-9Online publication date: 20-Aug-2024
https://doi.org/10.1007/s10489-024-05710-9
Zu XYu HLi BXue XElkind E(2023)Towards accurate video text spotting with text-wise semantic reasoningProceedings of the Thirty-Second International Joint Conference on Artificial Intelligence10.24963/ijcai.2023/206(1858-1866)Online publication date: 19-Aug-2023
https://dl.acm.org/doi/10.24963/ijcai.2023/206
Qian JJiang XMa JLi JGao ZQin X(2023)Accompany Children's Learning for You: An Intelligent Companion Learning SystemComputer Graphics Forum10.1111/cgf.1486242:6Online publication date: 3-Jul-2023
https://doi.org/10.1111/cgf.14862
Devi MSeetha MVishwanadha Raju S(2023)Natural Scene Text Detection in Video with Hybrid Text Augmentation and Fusion-Transferred LearningIntelligent Computing and Communication10.1007/978-981-99-1588-0_17(183-197)Online publication date: 20-Sep-2023
https://doi.org/10.1007/978-981-99-1588-0_17
Rajeswari RAradhana B(2023)Character Recognition in Scene Images Using MSER and CNNCognition and Recognition10.1007/978-3-031-22405-8_8(99-107)Online publication date: 1-Jan-2023
https://doi.org/10.1007/978-3-031-22405-8_8
Show More Cited By

View Options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Cited By

Index Terms

Recommendations

A Robust Approach for Scene Text Detection and Tracking in Video

Tracking Multiple Occluding People by Localizing on Multiple Scene Planes

Automatic Detection and Localization of Natural Scene Text in Video

Comments

Published In

Sponsors

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Funding Sources

Conference

Acceptance Rates

Other Metrics

Article Metrics

Other Metrics

Cited By

Login options

Full Access

PDF

eReader

Abstract

References

Cited By

Index Terms

Recommendations

A Robust Approach for Scene Text Detection and Tracking in Video

Tracking Multiple Occluding People by Localizing on Multiple Scene Planes

Automatic Detection and Localization of Natural Scene Text in Video

Comments

Information

Published In

Sponsors

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Funding Sources

Conference

Acceptance Rates

Contributors

Other Metrics

Bibliometrics

Article Metrics

Other Metrics

Citations

Cited By

Login options

Full Access

View options

PDF

eReader

Figures

Other

Share

Share this Publication link

Share on social media

Affiliations