Decoupled Multi-task Learning with Cyclical Self-Regulation for Face Parsing

This paper probes intrinsic factors behind typical failure cases (e.g. spatial inconsistency and boundary confusion) produced by the existing state-of-the-art method in face parsing. To tackle these problems, we propose a novel Decoupled Multi-task Learning with Cyclical Self-Regulation (DML-CSR) fo...

Full description

Bibliographic Details
Main Authors:	Zheng, Qingping, Deng, Jiankang, Zhu, Zheng, Li, Ying, Zafeiriou, Stefanos
Format:	Report
Language:	unknown
Published:	arXiv 2022
Subjects:	Computer Vision and Pattern Recognition cs.CV FOS Computer and information sciences Lapa DML
Online Access:	https://dx.doi.org/10.48550/arxiv.2203.14448 https://arxiv.org/abs/2203.14448

id	ftdatacite:10.48550/arxiv.2203.14448
record_format	openpolar
spelling	ftdatacite:10.48550/arxiv.2203.14448 2023-05-15T16:01:44+02:00 Decoupled Multi-task Learning with Cyclical Self-Regulation for Face Parsing Zheng, Qingping Deng, Jiankang Zhu, Zheng Li, Ying Zafeiriou, Stefanos 2022 https://dx.doi.org/10.48550/arxiv.2203.14448 https://arxiv.org/abs/2203.14448 unknown arXiv arXiv.org perpetual, non-exclusive license http://arxiv.org/licenses/nonexclusive-distrib/1.0/ Computer Vision and Pattern Recognition cs.CV FOS Computer and information sciences Preprint Article article CreativeWork 2022 ftdatacite https://doi.org/10.48550/arxiv.2203.14448 2022-04-01T18:28:54Z This paper probes intrinsic factors behind typical failure cases (e.g. spatial inconsistency and boundary confusion) produced by the existing state-of-the-art method in face parsing. To tackle these problems, we propose a novel Decoupled Multi-task Learning with Cyclical Self-Regulation (DML-CSR) for face parsing. Specifically, DML-CSR designs a multi-task model which comprises face parsing, binary edge, and category edge detection. These tasks only share low-level encoder weights without high-level interactions between each other, enabling to decouple auxiliary modules from the whole network at the inference stage. To address spatial inconsistency, we develop a dynamic dual graph convolutional network to capture global contextual information without using any extra pooling operation. To handle boundary confusion in both single and multiple face scenarios, we exploit binary and category edge detection to jointly obtain generic geometric structure and fine-grained semantic clues of human faces. Besides, to prevent noisy labels from degrading model generalization during training, cyclical self-regulation is proposed to self-ensemble several model instances to get a new model and the resulting model then is used to self-distill subsequent models, through alternating iterations. Experiments show that our method achieves the new state-of-the-art performance on the Helen, CelebAMask-HQ, and Lapa datasets. The source code is available at https://github.com/deepinsight/insightface/tree/master/parsing/dml_csr. Report DML DataCite Metadata Store (German National Library of Science and Technology) Lapa ENVELOPE(68.633,68.633,-73.200,-73.200)
institution	Open Polar
collection	DataCite Metadata Store (German National Library of Science and Technology)
op_collection_id	ftdatacite
language	unknown
topic	Computer Vision and Pattern Recognition cs.CV FOS Computer and information sciences
spellingShingle	Computer Vision and Pattern Recognition cs.CV FOS Computer and information sciences Zheng, Qingping Deng, Jiankang Zhu, Zheng Li, Ying Zafeiriou, Stefanos Decoupled Multi-task Learning with Cyclical Self-Regulation for Face Parsing
topic_facet	Computer Vision and Pattern Recognition cs.CV FOS Computer and information sciences
description	This paper probes intrinsic factors behind typical failure cases (e.g. spatial inconsistency and boundary confusion) produced by the existing state-of-the-art method in face parsing. To tackle these problems, we propose a novel Decoupled Multi-task Learning with Cyclical Self-Regulation (DML-CSR) for face parsing. Specifically, DML-CSR designs a multi-task model which comprises face parsing, binary edge, and category edge detection. These tasks only share low-level encoder weights without high-level interactions between each other, enabling to decouple auxiliary modules from the whole network at the inference stage. To address spatial inconsistency, we develop a dynamic dual graph convolutional network to capture global contextual information without using any extra pooling operation. To handle boundary confusion in both single and multiple face scenarios, we exploit binary and category edge detection to jointly obtain generic geometric structure and fine-grained semantic clues of human faces. Besides, to prevent noisy labels from degrading model generalization during training, cyclical self-regulation is proposed to self-ensemble several model instances to get a new model and the resulting model then is used to self-distill subsequent models, through alternating iterations. Experiments show that our method achieves the new state-of-the-art performance on the Helen, CelebAMask-HQ, and Lapa datasets. The source code is available at https://github.com/deepinsight/insightface/tree/master/parsing/dml_csr.
format	Report
author	Zheng, Qingping Deng, Jiankang Zhu, Zheng Li, Ying Zafeiriou, Stefanos
author_facet	Zheng, Qingping Deng, Jiankang Zhu, Zheng Li, Ying Zafeiriou, Stefanos
author_sort	Zheng, Qingping
title	Decoupled Multi-task Learning with Cyclical Self-Regulation for Face Parsing
title_short	Decoupled Multi-task Learning with Cyclical Self-Regulation for Face Parsing
title_full	Decoupled Multi-task Learning with Cyclical Self-Regulation for Face Parsing
title_fullStr	Decoupled Multi-task Learning with Cyclical Self-Regulation for Face Parsing
title_full_unstemmed	Decoupled Multi-task Learning with Cyclical Self-Regulation for Face Parsing
title_sort	decoupled multi-task learning with cyclical self-regulation for face parsing
publisher	arXiv
publishDate	2022
url	https://dx.doi.org/10.48550/arxiv.2203.14448 https://arxiv.org/abs/2203.14448
long_lat	ENVELOPE(68.633,68.633,-73.200,-73.200)
geographic	Lapa
geographic_facet	Lapa
genre	DML
genre_facet	DML
op_rights	arXiv.org perpetual, non-exclusive license http://arxiv.org/licenses/nonexclusive-distrib/1.0/
op_doi	https://doi.org/10.48550/arxiv.2203.14448
_version_	1766397477322227712

Decoupled Multi-task Learning with Cyclical Self-Regulation for Face Parsing

Similar Items