PP-OCRv2: Bag of Tricks for Ultra Lightweight OCR System

Optical Character Recognition (OCR) systems have been widely used in various of application scenarios. Designing an OCR system is still a challenging task. In previous work, we proposed a practical ultra lightweight OCR system (PP-OCR) to balance the accuracy against the efficiency. In order to impr...

Full description

Bibliographic Details
Main Authors:	Du, Yuning, Li, Chenxia, Guo, Ruoyu, Cui, Cheng, Liu, Weiwei, Zhou, Jun, Lu, Bin, Yang, Yehua, Liu, Qiwen, Hu, Xiaoguang, Yu, Dianhai, Ma, Yanjun
Format:	Article in Journal/Newspaper
Language:	unknown
Published:	arXiv 2021
Subjects:	Computer Vision and Pattern Recognition cs.CV FOS Computer and information sciences DML
Online Access:	https://dx.doi.org/10.48550/arxiv.2109.03144 https://arxiv.org/abs/2109.03144

id	ftdatacite:10.48550/arxiv.2109.03144
record_format	openpolar
spelling	ftdatacite:10.48550/arxiv.2109.03144 2023-05-15T16:01:56+02:00 PP-OCRv2: Bag of Tricks for Ultra Lightweight OCR System Du, Yuning Li, Chenxia Guo, Ruoyu Cui, Cheng Liu, Weiwei Zhou, Jun Lu, Bin Yang, Yehua Liu, Qiwen Hu, Xiaoguang Yu, Dianhai Ma, Yanjun 2021 https://dx.doi.org/10.48550/arxiv.2109.03144 https://arxiv.org/abs/2109.03144 unknown arXiv arXiv.org perpetual, non-exclusive license http://arxiv.org/licenses/nonexclusive-distrib/1.0/ Computer Vision and Pattern Recognition cs.CV FOS Computer and information sciences Article CreativeWork article Preprint 2021 ftdatacite https://doi.org/10.48550/arxiv.2109.03144 2022-03-10T14:04:53Z Optical Character Recognition (OCR) systems have been widely used in various of application scenarios. Designing an OCR system is still a challenging task. In previous work, we proposed a practical ultra lightweight OCR system (PP-OCR) to balance the accuracy against the efficiency. In order to improve the accuracy of PP-OCR and keep high efficiency, in this paper, we propose a more robust OCR system, i.e. PP-OCRv2. We introduce bag of tricks to train a better text detector and a better text recognizer, which include Collaborative Mutual Learning (CML), CopyPaste, Lightweight CPUNetwork (LCNet), Unified-Deep Mutual Learning (U-DML) and Enhanced CTCLoss. Experiments on real data show that the precision of PP-OCRv2 is 7% higher than PP-OCR under the same inference cost. It is also comparable to the server models of the PP-OCR which uses ResNet series as backbones. All of the above mentioned models are open-sourced and the code is available in the GitHub repository PaddleOCR which is powered by PaddlePaddle. : 8 pages, 9 figures, 5 tables Article in Journal/Newspaper DML DataCite Metadata Store (German National Library of Science and Technology)
institution	Open Polar
collection	DataCite Metadata Store (German National Library of Science and Technology)
op_collection_id	ftdatacite
language	unknown
topic	Computer Vision and Pattern Recognition cs.CV FOS Computer and information sciences
spellingShingle	Computer Vision and Pattern Recognition cs.CV FOS Computer and information sciences Du, Yuning Li, Chenxia Guo, Ruoyu Cui, Cheng Liu, Weiwei Zhou, Jun Lu, Bin Yang, Yehua Liu, Qiwen Hu, Xiaoguang Yu, Dianhai Ma, Yanjun PP-OCRv2: Bag of Tricks for Ultra Lightweight OCR System
topic_facet	Computer Vision and Pattern Recognition cs.CV FOS Computer and information sciences
description	Optical Character Recognition (OCR) systems have been widely used in various of application scenarios. Designing an OCR system is still a challenging task. In previous work, we proposed a practical ultra lightweight OCR system (PP-OCR) to balance the accuracy against the efficiency. In order to improve the accuracy of PP-OCR and keep high efficiency, in this paper, we propose a more robust OCR system, i.e. PP-OCRv2. We introduce bag of tricks to train a better text detector and a better text recognizer, which include Collaborative Mutual Learning (CML), CopyPaste, Lightweight CPUNetwork (LCNet), Unified-Deep Mutual Learning (U-DML) and Enhanced CTCLoss. Experiments on real data show that the precision of PP-OCRv2 is 7% higher than PP-OCR under the same inference cost. It is also comparable to the server models of the PP-OCR which uses ResNet series as backbones. All of the above mentioned models are open-sourced and the code is available in the GitHub repository PaddleOCR which is powered by PaddlePaddle. : 8 pages, 9 figures, 5 tables
format	Article in Journal/Newspaper
author	Du, Yuning Li, Chenxia Guo, Ruoyu Cui, Cheng Liu, Weiwei Zhou, Jun Lu, Bin Yang, Yehua Liu, Qiwen Hu, Xiaoguang Yu, Dianhai Ma, Yanjun
author_facet	Du, Yuning Li, Chenxia Guo, Ruoyu Cui, Cheng Liu, Weiwei Zhou, Jun Lu, Bin Yang, Yehua Liu, Qiwen Hu, Xiaoguang Yu, Dianhai Ma, Yanjun
author_sort	Du, Yuning
title	PP-OCRv2: Bag of Tricks for Ultra Lightweight OCR System
title_short	PP-OCRv2: Bag of Tricks for Ultra Lightweight OCR System
title_full	PP-OCRv2: Bag of Tricks for Ultra Lightweight OCR System
title_fullStr	PP-OCRv2: Bag of Tricks for Ultra Lightweight OCR System
title_full_unstemmed	PP-OCRv2: Bag of Tricks for Ultra Lightweight OCR System
title_sort	pp-ocrv2: bag of tricks for ultra lightweight ocr system
publisher	arXiv
publishDate	2021
url	https://dx.doi.org/10.48550/arxiv.2109.03144 https://arxiv.org/abs/2109.03144
genre	DML
genre_facet	DML
op_rights	arXiv.org perpetual, non-exclusive license http://arxiv.org/licenses/nonexclusive-distrib/1.0/
op_doi	https://doi.org/10.48550/arxiv.2109.03144
_version_	1766397608565145600

PP-OCRv2: Bag of Tricks for Ultra Lightweight OCR System

Similar Items