PP-HumanSeg: Connectivity-Aware Portrait Segmentation with a Large-Scale Teleconferencing Video Dataset

Bibliographic details
Title:	PP-HumanSeg: Connectivity-Aware Portrait Segmentation with a Large-Scale Teleconferencing Video Dataset
Authors:	Chu, Lutao; Liu, Yi; Wu, Zewu; Tang, Shiyu; Chen, Guowei; Hao, Yuying; Peng, Juncai; Yu, Zhiliang; Chen, Zeyu; Lai, Baohua; Xiong, Haoyi
Publication year:	2021
Collection:	Computer Science
Subject terms:	Computer Science - Computer Vision and Pattern Recognition, Computer Science - Machine Learning
Description:	As the COVID-19 pandemic rampages across the world, demand for video conferencing has surged. As a result, real-time portrait segmentation has become a popular feature for replacing the backgrounds of conference participants. While feature-rich datasets, models and algorithms have been offered for segmentation that extracts body postures from life scenes, portrait segmentation has not yet been well covered in a video conferencing context. To facilitate progress in this field, we introduce an open-source solution named PP-HumanSeg. This work is the first to construct a large-scale video portrait dataset that contains 291 videos from 23 conference scenes with 14K fine-labeled frames and extensions to multi-camera teleconferencing. Furthermore, we propose Semantic Connectivity-aware Learning (SCL), a novel approach for semantic segmentation that introduces a semantic connectivity-aware loss to improve the quality of segmentation results from the perspective of connectivity. We also propose an ultra-lightweight model with SCL for practical portrait segmentation, which achieves the best trade-off between IoU and inference speed. Extensive evaluations on our dataset demonstrate the superiority of SCL and our model. The source code is available at https://github.com/PaddlePaddle/PaddleSeg. Comment: Accepted by WACV workshop
Document type:	Working Paper
Access URL:	http://arxiv.org/abs/2112.07146
Accession number:	edsarx.2112.07146
Database:	arXiv
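
The description above refers to two ways of judging a portrait mask: IoU and connectivity. The following is a minimal illustrative sketch of why the two can disagree. It is not the SCL loss from the paper and does not use any PaddleSeg API. The helper names `iou` and `count_components`, the choice of 4-connectivity, and the toy masks are all assumptions made for this example.

```python
from collections import deque


def iou(pred, target):
    """Intersection over union of two binary masks (lists of rows of 0/1)."""
    inter = 0
    union = 0
    for pred_row, target_row in zip(pred, target):
        for p, t in zip(pred_row, target_row):
            if p and t:
                inter += 1
            if p or t:
                union += 1
    # Two empty masks are treated as a perfect match (an assumption).
    return inter / union if union else 1.0


def count_components(mask):
    """Count 4-connected foreground regions using breadth-first flood fill."""
    height = len(mask)
    width = len(mask[0]) if height else 0
    seen = [[False] * width for _ in range(height)]
    count = 0
    for r in range(height):
        for c in range(width):
            if not mask[r][c] or seen[r][c]:
                continue
            # Found an unvisited foreground pixel: start a new region.
            count += 1
            seen[r][c] = True
            queue = deque([(r, c)])
            while queue:
                y, x = queue.popleft()
                for ny, nx in ((y - 1, x), (y + 1, x), (y, x - 1), (y, x + 1)):
                    if (0 <= ny < height and 0 <= nx < width
                            and mask[ny][nx] and not seen[ny][nx]):
                        seen[ny][nx] = True
                        queue.append((ny, nx))
    return count


# Toy ground truth: one solid "portrait" region.
target = [
    [0, 1, 1, 0, 0],
    [0, 1, 1, 0, 0],
    [0, 1, 1, 0, 0],
]
# Toy prediction: mostly right, plus one stray foreground pixel.
pred = [
    [0, 1, 1, 0, 1],
    [0, 1, 1, 0, 0],
    [0, 1, 0, 0, 0],
]

print("IoU:", round(iou(pred, target), 3))
print("components in target:", count_components(target))
print("components in prediction:", count_components(pred))
```

In this toy case the prediction overlaps the ground truth reasonably well by IoU, yet it splits the foreground into two regions where the ground truth has one. That kind of isolated fragment is what a connectivity-oriented measure can penalize and a pure overlap measure largely cannot.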
