PP-OCRv2: Bag of Tricks for Ultra Lightweight OCR System

Optical Character Recognition (OCR) systems have been widely used in various of application scenarios. Designing an OCR system is still a challenging task. In previous work, we proposed a practical ultra lightweight OCR system (PP-OCR) to balance the accuracy against the efficiency. In order to impr...

Full description

Bibliographic Details
Main Authors: Du, Yuning, Li, Chenxia, Guo, Ruoyu, Cui, Cheng, Liu, Weiwei, Zhou, Jun, Lu, Bin, Yang, Yehua, Liu, Qiwen, Hu, Xiaoguang, Yu, Dianhai, Ma, Yanjun
Format: Text
Language:unknown
Published: 2021
Subjects:
DML
Online Access:http://arxiv.org/abs/2109.03144
id ftarxivpreprints:oai:arXiv.org:2109.03144
record_format openpolar
spelling ftarxivpreprints:oai:arXiv.org:2109.03144 2023-09-05T13:19:06+02:00 PP-OCRv2: Bag of Tricks for Ultra Lightweight OCR System Du, Yuning Li, Chenxia Guo, Ruoyu Cui, Cheng Liu, Weiwei Zhou, Jun Lu, Bin Yang, Yehua Liu, Qiwen Hu, Xiaoguang Yu, Dianhai Ma, Yanjun 2021-09-07 http://arxiv.org/abs/2109.03144 unknown http://arxiv.org/abs/2109.03144 Computer Science - Computer Vision and Pattern Recognition text 2021 ftarxivpreprints 2023-08-16T16:40:09Z Optical Character Recognition (OCR) systems have been widely used in various of application scenarios. Designing an OCR system is still a challenging task. In previous work, we proposed a practical ultra lightweight OCR system (PP-OCR) to balance the accuracy against the efficiency. In order to improve the accuracy of PP-OCR and keep high efficiency, in this paper, we propose a more robust OCR system, i.e. PP-OCRv2. We introduce bag of tricks to train a better text detector and a better text recognizer, which include Collaborative Mutual Learning (CML), CopyPaste, Lightweight CPUNetwork (LCNet), Unified-Deep Mutual Learning (U-DML) and Enhanced CTCLoss. Experiments on real data show that the precision of PP-OCRv2 is 7% higher than PP-OCR under the same inference cost. It is also comparable to the server models of the PP-OCR which uses ResNet series as backbones. All of the above mentioned models are open-sourced and the code is available in the GitHub repository PaddleOCR which is powered by PaddlePaddle. Comment: 8 pages, 9 figures, 5 tables Text DML ArXiv.org (Cornell University Library)
institution Open Polar
collection ArXiv.org (Cornell University Library)
op_collection_id ftarxivpreprints
language unknown
topic Computer Science - Computer Vision and Pattern Recognition
spellingShingle Computer Science - Computer Vision and Pattern Recognition
Du, Yuning
Li, Chenxia
Guo, Ruoyu
Cui, Cheng
Liu, Weiwei
Zhou, Jun
Lu, Bin
Yang, Yehua
Liu, Qiwen
Hu, Xiaoguang
Yu, Dianhai
Ma, Yanjun
PP-OCRv2: Bag of Tricks for Ultra Lightweight OCR System
topic_facet Computer Science - Computer Vision and Pattern Recognition
description Optical Character Recognition (OCR) systems have been widely used in various of application scenarios. Designing an OCR system is still a challenging task. In previous work, we proposed a practical ultra lightweight OCR system (PP-OCR) to balance the accuracy against the efficiency. In order to improve the accuracy of PP-OCR and keep high efficiency, in this paper, we propose a more robust OCR system, i.e. PP-OCRv2. We introduce bag of tricks to train a better text detector and a better text recognizer, which include Collaborative Mutual Learning (CML), CopyPaste, Lightweight CPUNetwork (LCNet), Unified-Deep Mutual Learning (U-DML) and Enhanced CTCLoss. Experiments on real data show that the precision of PP-OCRv2 is 7% higher than PP-OCR under the same inference cost. It is also comparable to the server models of the PP-OCR which uses ResNet series as backbones. All of the above mentioned models are open-sourced and the code is available in the GitHub repository PaddleOCR which is powered by PaddlePaddle. Comment: 8 pages, 9 figures, 5 tables
format Text
author Du, Yuning
Li, Chenxia
Guo, Ruoyu
Cui, Cheng
Liu, Weiwei
Zhou, Jun
Lu, Bin
Yang, Yehua
Liu, Qiwen
Hu, Xiaoguang
Yu, Dianhai
Ma, Yanjun
author_facet Du, Yuning
Li, Chenxia
Guo, Ruoyu
Cui, Cheng
Liu, Weiwei
Zhou, Jun
Lu, Bin
Yang, Yehua
Liu, Qiwen
Hu, Xiaoguang
Yu, Dianhai
Ma, Yanjun
author_sort Du, Yuning
title PP-OCRv2: Bag of Tricks for Ultra Lightweight OCR System
title_short PP-OCRv2: Bag of Tricks for Ultra Lightweight OCR System
title_full PP-OCRv2: Bag of Tricks for Ultra Lightweight OCR System
title_fullStr PP-OCRv2: Bag of Tricks for Ultra Lightweight OCR System
title_full_unstemmed PP-OCRv2: Bag of Tricks for Ultra Lightweight OCR System
title_sort pp-ocrv2: bag of tricks for ultra lightweight ocr system
publishDate 2021
url http://arxiv.org/abs/2109.03144
genre DML
genre_facet DML
op_relation http://arxiv.org/abs/2109.03144
_version_ 1776199922397216768