FairCrowd: Fair Human Face Dataset Sampling via Batch-Level Crowdsourcing Bias Inference

Ziyi Kou, Yang Zhang, Lanyu Shang, Dong Wang

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Human face image is a large category of visual information utilized by various human facial data services (e.g., face recognition, face generation, face attribute prediction). However, the quality of data services (QoDS) on human face datasets is usually biased towards the majority demographic group due to the data imbalance issue. In this paper, we focus on a fair human face dataset sampling problem where the goal is to sample a sub-dataset from the original dataset to reduce its bias by leveraging crowd intelligence to infer the demographic labels of face images (e.g., male or female, old or young). Our problem is motivated by the limitations of current fair data sampling solutions that require pre-annotated demographic labels to sample a fair dataset. Two important challenges exist in solving our problem: 1) it is extremely time-consuming and expensive to assign crowd workers to annotate demographic labels of all images in a large-scale facial dataset; 2) it is not a trivial task to improve the fairness of the sampled sub-dataset (with fewer data samples) without sacrificing the accuracy performance of data services on such dataset. To address the above challenges, we develop FairCrowd, a fair crowdsourcing-based data sampling framework that leverages an efficient batch-level demographic label inference model and a joint fair-accuracy-aware data shuffling method. We evaluate the performance of FairCrowd through a large-scale real-world face image dataset that consists of celebrity faces from a diversified set of demographic groups. The results show that FairCrowd not only reduces demographic bias but also improves the accuracy of data services trained on the sub-dataset generated by FairCrowd, leading to a more desirable QoDS of the application.

Original languageEnglish (US)
Title of host publication2021 IEEE/ACM 29th International Symposium on Quality of Service, IWQOS 2021
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages1-10
Number of pages10
ISBN (Electronic)9781665414944
ISBN (Print)9781665430548
DOIs
StatePublished - Jun 25 2021
Externally publishedYes
Event29th IEEE/ACM International Symposium on Quality of Service, IWQOS 2021 - Virtual, Tokyo, Japan
Duration: Jun 25 2021Jun 28 2021

Publication series

Name2021 IEEE/ACM 29th International Symposium on Quality of Service, IWQOS 2021

Conference

Conference29th IEEE/ACM International Symposium on Quality of Service, IWQOS 2021
Country/TerritoryJapan
CityVirtual, Tokyo
Period6/25/216/28/21

Keywords

  • Crowdsourcing
  • Fair Dataset Sampling
  • Machine Learning for Quality of Service
  • Quality of Data Service

ASJC Scopus subject areas

  • Computer Networks and Communications
  • Information Systems and Management
  • Safety, Risk, Reliability and Quality

Fingerprint

Dive into the research topics of 'FairCrowd: Fair Human Face Dataset Sampling via Batch-Level Crowdsourcing Bias Inference'. Together they form a unique fingerprint.

Cite this