Evaluating and mitigating bias in AI-based medical text generation

Xiuying Chen*, Tairan Wang, Juexiao Zhou, Zirui Song, Xin Gao*, Xiangliang Zhang*

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

1 Scopus citations

Abstract

Artificial intelligence (AI) systems, particularly those based on deep learning models, have increasingly achieved expert-level performance in medical applications. However, there is growing concern that such AI systems may reflect and amplify human bias, reducing the quality of their performance in historically underserved populations. The fairness issue has attracted considerable research interest in the medical imaging classification field, yet it remains understudied in the text-generation domain. In this study, we investigate the fairness problem in text generation within the medical field and observe substantial performance discrepancies across different races, sexes and age groups, including intersectional groups, various model scales and different evaluation metrics. To mitigate this fairness issue, we propose an algorithm that selectively optimizes those underserved groups to reduce bias. Our evaluations across multiple backbones, datasets and modalities demonstrate that our proposed algorithm enhances fairness in text generation without compromising overall performance.

Original languageEnglish (US)
Article number14080
Pages (from-to)388-396
Number of pages9
JournalNature Computational Science
Volume5
Issue number5
DOIs
StateAccepted/In press - 2025

ASJC Scopus subject areas

  • Computer Science (miscellaneous)
  • Computer Science Applications
  • Computer Networks and Communications

Fingerprint

Dive into the research topics of 'Evaluating and mitigating bias in AI-based medical text generation'. Together they form a unique fingerprint.

Cite this