Decoupled Multi-task Learning with Cyclical Self-Regulation for Face Parsing

This paper probes intrinsic factors behind typical failure cases (e.g. spatial inconsistency and boundary confusion) produced by the existing state-of-the-art method in face parsing. To tackle these problems, we propose a novel Decoupled Multi-task Learning with Cyclical Self-Regulation (DML-CSR) fo...

Full description

Bibliographic Details
Main Authors: Zheng, Qingping, Deng, Jiankang, Zhu, Zheng, Li, Ying, Zafeiriou, Stefanos
Format: Text
Language:unknown
Published: 2022
Subjects:
DML
Online Access:http://arxiv.org/abs/2203.14448
id ftarxivpreprints:oai:arXiv.org:2203.14448
record_format openpolar
spelling ftarxivpreprints:oai:arXiv.org:2203.14448 2023-09-05T13:19:06+02:00 Decoupled Multi-task Learning with Cyclical Self-Regulation for Face Parsing Zheng, Qingping Deng, Jiankang Zhu, Zheng Li, Ying Zafeiriou, Stefanos 2022-03-27 http://arxiv.org/abs/2203.14448 unknown http://arxiv.org/abs/2203.14448 Computer Science - Computer Vision and Pattern Recognition text 2022 ftarxivpreprints 2023-08-16T17:00:05Z This paper probes intrinsic factors behind typical failure cases (e.g. spatial inconsistency and boundary confusion) produced by the existing state-of-the-art method in face parsing. To tackle these problems, we propose a novel Decoupled Multi-task Learning with Cyclical Self-Regulation (DML-CSR) for face parsing. Specifically, DML-CSR designs a multi-task model which comprises face parsing, binary edge, and category edge detection. These tasks only share low-level encoder weights without high-level interactions between each other, enabling to decouple auxiliary modules from the whole network at the inference stage. To address spatial inconsistency, we develop a dynamic dual graph convolutional network to capture global contextual information without using any extra pooling operation. To handle boundary confusion in both single and multiple face scenarios, we exploit binary and category edge detection to jointly obtain generic geometric structure and fine-grained semantic clues of human faces. Besides, to prevent noisy labels from degrading model generalization during training, cyclical self-regulation is proposed to self-ensemble several model instances to get a new model and the resulting model then is used to self-distill subsequent models, through alternating iterations. Experiments show that our method achieves the new state-of-the-art performance on the Helen, CelebAMask-HQ, and Lapa datasets. The source code is available at https://github.com/deepinsight/insightface/tree/master/parsing/dml_csr. Text DML ArXiv.org (Cornell University Library) Lapa ENVELOPE(68.633,68.633,-73.200,-73.200)
institution Open Polar
collection ArXiv.org (Cornell University Library)
op_collection_id ftarxivpreprints
language unknown
topic Computer Science - Computer Vision and Pattern Recognition
spellingShingle Computer Science - Computer Vision and Pattern Recognition
Zheng, Qingping
Deng, Jiankang
Zhu, Zheng
Li, Ying
Zafeiriou, Stefanos
Decoupled Multi-task Learning with Cyclical Self-Regulation for Face Parsing
topic_facet Computer Science - Computer Vision and Pattern Recognition
description This paper probes intrinsic factors behind typical failure cases (e.g. spatial inconsistency and boundary confusion) produced by the existing state-of-the-art method in face parsing. To tackle these problems, we propose a novel Decoupled Multi-task Learning with Cyclical Self-Regulation (DML-CSR) for face parsing. Specifically, DML-CSR designs a multi-task model which comprises face parsing, binary edge, and category edge detection. These tasks only share low-level encoder weights without high-level interactions between each other, enabling to decouple auxiliary modules from the whole network at the inference stage. To address spatial inconsistency, we develop a dynamic dual graph convolutional network to capture global contextual information without using any extra pooling operation. To handle boundary confusion in both single and multiple face scenarios, we exploit binary and category edge detection to jointly obtain generic geometric structure and fine-grained semantic clues of human faces. Besides, to prevent noisy labels from degrading model generalization during training, cyclical self-regulation is proposed to self-ensemble several model instances to get a new model and the resulting model then is used to self-distill subsequent models, through alternating iterations. Experiments show that our method achieves the new state-of-the-art performance on the Helen, CelebAMask-HQ, and Lapa datasets. The source code is available at https://github.com/deepinsight/insightface/tree/master/parsing/dml_csr.
format Text
author Zheng, Qingping
Deng, Jiankang
Zhu, Zheng
Li, Ying
Zafeiriou, Stefanos
author_facet Zheng, Qingping
Deng, Jiankang
Zhu, Zheng
Li, Ying
Zafeiriou, Stefanos
author_sort Zheng, Qingping
title Decoupled Multi-task Learning with Cyclical Self-Regulation for Face Parsing
title_short Decoupled Multi-task Learning with Cyclical Self-Regulation for Face Parsing
title_full Decoupled Multi-task Learning with Cyclical Self-Regulation for Face Parsing
title_fullStr Decoupled Multi-task Learning with Cyclical Self-Regulation for Face Parsing
title_full_unstemmed Decoupled Multi-task Learning with Cyclical Self-Regulation for Face Parsing
title_sort decoupled multi-task learning with cyclical self-regulation for face parsing
publishDate 2022
url http://arxiv.org/abs/2203.14448
long_lat ENVELOPE(68.633,68.633,-73.200,-73.200)
geographic Lapa
geographic_facet Lapa
genre DML
genre_facet DML
op_relation http://arxiv.org/abs/2203.14448
_version_ 1776199911865319424