Decoupled Multi-task Learning with Cyclical Self-Regulation for Face Parsing

This paper probes intrinsic factors behind typical failure cases (e.g. spatial inconsistency and boundary confusion) produced by the existing state-of-the-art method in face parsing. To tackle these problems, we propose a novel Decoupled Multi-task Learning with Cyclical Self-Regulation (DML-CSR) fo...

Full description

Bibliographic Details
Main Authors: Zheng, Qingping, Deng, Jiankang, Zhu, Zheng, Li, Ying, Zafeiriou, Stefanos
Format: Report
Language:unknown
Published: arXiv 2022
Subjects:
DML
Online Access:https://dx.doi.org/10.48550/arxiv.2203.14448
https://arxiv.org/abs/2203.14448
id ftdatacite:10.48550/arxiv.2203.14448
record_format openpolar
spelling ftdatacite:10.48550/arxiv.2203.14448 2023-05-15T16:01:44+02:00 Decoupled Multi-task Learning with Cyclical Self-Regulation for Face Parsing Zheng, Qingping Deng, Jiankang Zhu, Zheng Li, Ying Zafeiriou, Stefanos 2022 https://dx.doi.org/10.48550/arxiv.2203.14448 https://arxiv.org/abs/2203.14448 unknown arXiv arXiv.org perpetual, non-exclusive license http://arxiv.org/licenses/nonexclusive-distrib/1.0/ Computer Vision and Pattern Recognition cs.CV FOS Computer and information sciences Preprint Article article CreativeWork 2022 ftdatacite https://doi.org/10.48550/arxiv.2203.14448 2022-04-01T18:28:54Z This paper probes intrinsic factors behind typical failure cases (e.g. spatial inconsistency and boundary confusion) produced by the existing state-of-the-art method in face parsing. To tackle these problems, we propose a novel Decoupled Multi-task Learning with Cyclical Self-Regulation (DML-CSR) for face parsing. Specifically, DML-CSR designs a multi-task model which comprises face parsing, binary edge, and category edge detection. These tasks only share low-level encoder weights without high-level interactions between each other, enabling to decouple auxiliary modules from the whole network at the inference stage. To address spatial inconsistency, we develop a dynamic dual graph convolutional network to capture global contextual information without using any extra pooling operation. To handle boundary confusion in both single and multiple face scenarios, we exploit binary and category edge detection to jointly obtain generic geometric structure and fine-grained semantic clues of human faces. Besides, to prevent noisy labels from degrading model generalization during training, cyclical self-regulation is proposed to self-ensemble several model instances to get a new model and the resulting model then is used to self-distill subsequent models, through alternating iterations. Experiments show that our method achieves the new state-of-the-art performance on the Helen, CelebAMask-HQ, and Lapa datasets. The source code is available at https://github.com/deepinsight/insightface/tree/master/parsing/dml_csr. Report DML DataCite Metadata Store (German National Library of Science and Technology) Lapa ENVELOPE(68.633,68.633,-73.200,-73.200)
institution Open Polar
collection DataCite Metadata Store (German National Library of Science and Technology)
op_collection_id ftdatacite
language unknown
topic Computer Vision and Pattern Recognition cs.CV
FOS Computer and information sciences
spellingShingle Computer Vision and Pattern Recognition cs.CV
FOS Computer and information sciences
Zheng, Qingping
Deng, Jiankang
Zhu, Zheng
Li, Ying
Zafeiriou, Stefanos
Decoupled Multi-task Learning with Cyclical Self-Regulation for Face Parsing
topic_facet Computer Vision and Pattern Recognition cs.CV
FOS Computer and information sciences
description This paper probes intrinsic factors behind typical failure cases (e.g. spatial inconsistency and boundary confusion) produced by the existing state-of-the-art method in face parsing. To tackle these problems, we propose a novel Decoupled Multi-task Learning with Cyclical Self-Regulation (DML-CSR) for face parsing. Specifically, DML-CSR designs a multi-task model which comprises face parsing, binary edge, and category edge detection. These tasks only share low-level encoder weights without high-level interactions between each other, enabling to decouple auxiliary modules from the whole network at the inference stage. To address spatial inconsistency, we develop a dynamic dual graph convolutional network to capture global contextual information without using any extra pooling operation. To handle boundary confusion in both single and multiple face scenarios, we exploit binary and category edge detection to jointly obtain generic geometric structure and fine-grained semantic clues of human faces. Besides, to prevent noisy labels from degrading model generalization during training, cyclical self-regulation is proposed to self-ensemble several model instances to get a new model and the resulting model then is used to self-distill subsequent models, through alternating iterations. Experiments show that our method achieves the new state-of-the-art performance on the Helen, CelebAMask-HQ, and Lapa datasets. The source code is available at https://github.com/deepinsight/insightface/tree/master/parsing/dml_csr.
format Report
author Zheng, Qingping
Deng, Jiankang
Zhu, Zheng
Li, Ying
Zafeiriou, Stefanos
author_facet Zheng, Qingping
Deng, Jiankang
Zhu, Zheng
Li, Ying
Zafeiriou, Stefanos
author_sort Zheng, Qingping
title Decoupled Multi-task Learning with Cyclical Self-Regulation for Face Parsing
title_short Decoupled Multi-task Learning with Cyclical Self-Regulation for Face Parsing
title_full Decoupled Multi-task Learning with Cyclical Self-Regulation for Face Parsing
title_fullStr Decoupled Multi-task Learning with Cyclical Self-Regulation for Face Parsing
title_full_unstemmed Decoupled Multi-task Learning with Cyclical Self-Regulation for Face Parsing
title_sort decoupled multi-task learning with cyclical self-regulation for face parsing
publisher arXiv
publishDate 2022
url https://dx.doi.org/10.48550/arxiv.2203.14448
https://arxiv.org/abs/2203.14448
long_lat ENVELOPE(68.633,68.633,-73.200,-73.200)
geographic Lapa
geographic_facet Lapa
genre DML
genre_facet DML
op_rights arXiv.org perpetual, non-exclusive license
http://arxiv.org/licenses/nonexclusive-distrib/1.0/
op_doi https://doi.org/10.48550/arxiv.2203.14448
_version_ 1766397477322227712