TY - JOUR
T1 - Unsupervised Domain Adaptation via Contrastive Adversarial Domain Mixup
T2 - A Case Study on COVID-19
AU - Zeng, Huimin
AU - Yue, Zhenrui
AU - Shang, Lanyu
AU - Zhang, Yang
AU - Wang, Dong
N1 - This work was supported in part by the National Science Foundation under Grants IIS-2202481, CHE-2105005, IIS-2130263, CNS-1845639, and CNS-1831669.
PY - 2024
Y1 - 2024
N2 - Training large deep learning (DL) models with high performance for natural language downstream tasks usually requires rich-labeled data. However, in a real-world application of COVID-19 information service (e.g., misinformation detection, question answering), a fundamental challenge is the lack of the labeled COVID data to enable supervised end-to-end training of the models for different downstream tasks, especially at the early stage of the pandemic. To address this challenge, we propose an unsupervised domain adaptation framework using contrastive learning and adversarial domain mixup to transfer the knowledge from an existing source data domain to the target COVID-19 data domain. In particular, to bridge the gap between the source domain and the target domain, our method reduces a radial basis function (RBF) based discrepancy between these two domains. Moreover, we leverage the power of domain adversarial examples to establish an intermediate domain mixup, where the latent representations of the input text from both domains could be mixed during the training process. In this paper, we focus on two prevailing downstream tasks in mining COVID-19 text data: COVID-19 misinformation detection and COVID-19 news question answering. Extensive domain adaptation experiments on multiple real-world datasets suggest that our method can effectively adapt misinformation detection and question answering systems to the unseen COVID-19 target domain with significant improvements compared to the state-of-the-art baselines.
AB - Training large deep learning (DL) models with high performance for natural language downstream tasks usually requires rich-labeled data. However, in a real-world application of COVID-19 information service (e.g., misinformation detection, question answering), a fundamental challenge is the lack of the labeled COVID data to enable supervised end-to-end training of the models for different downstream tasks, especially at the early stage of the pandemic. To address this challenge, we propose an unsupervised domain adaptation framework using contrastive learning and adversarial domain mixup to transfer the knowledge from an existing source data domain to the target COVID-19 data domain. In particular, to bridge the gap between the source domain and the target domain, our method reduces a radial basis function (RBF) based discrepancy between these two domains. Moreover, we leverage the power of domain adversarial examples to establish an intermediate domain mixup, where the latent representations of the input text from both domains could be mixed during the training process. In this paper, we focus on two prevailing downstream tasks in mining COVID-19 text data: COVID-19 misinformation detection and COVID-19 news question answering. Extensive domain adaptation experiments on multiple real-world datasets suggest that our method can effectively adapt misinformation detection and question answering systems to the unseen COVID-19 target domain with significant improvements compared to the state-of-the-art baselines.
KW - Domain adaptation
KW - contrastive domain mixup
KW - misinformation detection
KW - question answering
UR - http://www.scopus.com/inward/record.url?scp=85183943869&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85183943869&partnerID=8YFLogxK
U2 - 10.1109/TETC.2024.3354419
DO - 10.1109/TETC.2024.3354419
M3 - Article
AN - SCOPUS:85183943869
SN - 2168-6750
VL - 12
SP - 1105
EP - 1116
JO - IEEE Transactions on Emerging Topics in Computing
JF - IEEE Transactions on Emerging Topics in Computing
IS - 4
ER -