Focusing on Abnormal: Visual Contrastive Classification and Semantic Enhancement for Medical Report Generation

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Automating medical report generation from radiology images is vital for accurate, standardized diagnoses. Abnormal regions in medical images, often small and rare, are challenging to detect, leading models to overlook critical disease features and generate repetitive healthy content. We propose V-C2SE, a novel model that enhances abnormality detection through two key strategies within the visual modality: 1) Visual Contrastive Classification (VC2), which aligns disease-specific features across random samples using contrastive learning, improving the model’s focus on abnormal semantics during encoding; 2) Visual Semantic Enhancement (VSE), which constructs healthy templates to amplify abnormal features in a feature-space augmentation paradigm, ensuring precise report generation. By leveraging contrastive learning and healthy templates, V-C2SE detects subtle abnormalities with high precision and generates clinically relevant reports. Evaluated on IU X-Ray and MIMIC-CXR datasets, V-C2SE achieves competitive results with state-of-the-art methods across natural language generation (NLG) and clinical efficacy (CE) metrics, producing high-quality, semantically accurate reports. Our approach addresses the critical challenge of focusing on rare abnormalities and enhancing diagnostic efficiency.

Original languageEnglish
Title of host publicationNeural Information Processing - 32nd International Conference, ICONIP 2025, Proceedings
EditorsTadahiro Taniguchi, Tadashi Kozuno, Chi Sing Andrew Leung, Junichiro Yoshimoto, Mufti Mahmud, Maryam Doborjeh, Kenji Doya
PublisherSpringer Science and Business Media Deutschland GmbH
Pages132-147
Number of pages16
ISBN (Print)9789819540990
DOIs
Publication statusPublished - 2026
Externally publishedYes
Event32nd International Conference on Neural Information Processing, ICONIP 2025 - Okinawa, Japan
Duration: 20 Nov 202524 Nov 2025

Publication series

NameCommunications in Computer and Information Science
Volume2757
ISSN (Print)1865-0929
ISSN (Electronic)1865-0937

Conference

Conference32nd International Conference on Neural Information Processing, ICONIP 2025
Country/TerritoryJapan
CityOkinawa
Period20/11/2524/11/25

Keywords

  • Contrastive Classification
  • Focus on Abnormal
  • Medical Report Generation
  • Semantic Enhancement

Fingerprint

Dive into the research topics of 'Focusing on Abnormal: Visual Contrastive Classification and Semantic Enhancement for Medical Report Generation'. Together they form a unique fingerprint.

Cite this