Preventing Shortcut Learning in Medical Image Analysis through Intermediate Layer Knowledge Distillation from Specialist Teachers

Boland, Christopher; Tsaftaris, Sotirios A.; Dahdouh, Sonia

doi:https://doi.org/10.59275/j.melba.2025-8888

Preventing Shortcut Learning in Medical Image Analysis through Intermediate Layer Knowledge Distillation from Specialist Teachers

Christopher Boland^1,2, Sotirios A. Tsaftaris², Sonia Dahdouh¹

1: Canon Medical Research Europe Ltd., 2: School of Engineering, University of Edinburgh

Publication date: 2025/11/20

https://doi.org/10.59275/j.melba.2025-8888

PDF · arXiv

Abstract

Deep learning models are prone to learning shortcut solutions to problems using spuriously correlated yet irrelevant features of their training data. In high-risk applications such as medical image analysis, this phenomenon may prevent models from using clinically meaningful features when making predictions, potentially leading to poor robustness and harm to patients. We demonstrate that different types of shortcuts—those that are diffuse and spread throughout the image, as well as those that are localized to specific areas—manifest distinctly across network layers and can, therefore, be more effectively targeted through mitigation strategies that target the intermediate layers. We propose a novel knowledge distillation framework that leverages a teacher network fine-tuned on a small subset of task-relevant data to mitigate shortcut learning in a student network trained on a large dataset corrupted with a bias feature. Through extensive experiments on CheXpert, ISIC 2017, and SiMBA datasets using various architectures (ResNet-18, AlexNet, DenseNet-121, and 3D CNNs), we demonstrate consistent improvements over traditional Empirical Risk Minimization, augmentation-based bias-mitigation, and group-based bias-mitigation approaches. In many cases, we achieve comparable performance with a baseline model trained on bias-free data, even on out-of-distribution test data. Our results demonstrate the practical applicability of our approach to real-world medical imaging scenarios where bias annotations are limited and shortcut features are difficult to identify a priori

Keywords

Algorithmic Bias · Shortcut Learning · Knowledge Distillation · Spurious Correlations

Bibtex

@article{melba:2025:020:boland,
    title = "Preventing Shortcut Learning in Medical Image Analysis through Intermediate Layer Knowledge Distillation from Specialist Teachers",
    author = "Boland, Christopher and Tsaftaris, Sotirios A. and Dahdouh, Sonia",
    journal = "Machine Learning for Biomedical Imaging",
    volume = "3",
    issue = "Special issue on FAIMI",
    year = "2025",
    pages = "447--476",
    issn = "2766-905X",
    doi = "https://doi.org/10.59275/j.melba.2025-8888",
    url = "https://melba-journal.org/2025:020"
}

RIS

TY - JOUR
AU - Boland, Christopher
AU - Tsaftaris, Sotirios A.
AU - Dahdouh, Sonia
PY - 2025
TI - Preventing Shortcut Learning in Medical Image Analysis through Intermediate Layer Knowledge Distillation from Specialist Teachers
T2 - Machine Learning for Biomedical Imaging
VL - 3
IS - Special issue on FAIMI
SP - 447
EP - 476
SN - 2766-905X
DO - https://doi.org/10.59275/j.melba.2025-8888
UR - https://melba-journal.org/2025:020
ER -

Preventing Shortcut Learning in Medical Image Analysis through Intermediate Layer Knowledge Distillation from Specialist Teachers

Christopher Boland1,2, Sotirios A. Tsaftaris2, Sonia Dahdouh1

1: Canon Medical Research Europe Ltd., 2: School of Engineering, University of Edinburgh

Publication date: 2025/11/20

https://doi.org/10.59275/j.melba.2025-8888

Abstract

Christopher Boland^1,2, Sotirios A. Tsaftaris², Sonia Dahdouh¹