CheXVision-DenseNet
CheXVision β Deep Learning & Big Data university project. 14-class chest X-ray pathology detection + binary normal/abnormal classification on the NIH Chest X-ray14 dataset (112,120 images).
Architecture
Fine-Tuning Strategy
Training Pipeline
Training Metrics
- Best validation macro AUC-ROC:
0.8459 - Best validation binary AUC-ROC:
0.7867 - Best validation binary F1:
0.6736 - Best checkpoint epoch:
18
Per-Class AUC-ROC at Best Epoch
| Pathology | AUC-ROC | Visual |
|---|---|---|
| Atelectasis | 0.8334 |
ββββββββββ |
| Cardiomegaly | 0.9010 |
ββββββββββ |
| Effusion | 0.8873 |
ββββββββββ |
| Infiltration | 0.7133 |
ββββββββββ |
| Mass | 0.8756 |
ββββββββββ |
| Nodule | 0.8084 |
ββββββββββ |
| Pneumonia | 0.7397 |
ββββββββββ |
| Pneumothorax | 0.8705 |
ββββββββββ |
| Consolidation | 0.8063 |
ββββββββββ |
| Edema | 0.9255 |
ββββββββββ |
| Emphysema | 0.9107 |
ββββββββββ |
| Fibrosis | 0.8085 |
ββββββββββ |
| Pleural_Thickening | 0.8377 |
ββββββββββ |
| Hernia | 0.9242 |
ββββββββββ |
Training Configuration
- Repository:
HlexNC/chexvision-densenet - Dataset: HlexNC/chest-xray-14-320 Β· revision
44443e6ee968b3c6094b63f14a27698c40b50680 - Architecture: DenseNet-121 transfer learning with a shared feature layer and dual classification heads.
- Platform: Kaggle GPU kernel (NVIDIA T4 / P100)
- Batch size:
24Γ grad_accum4= effective batch96 - AMP (fp16):
enabled - Optimizer: AdamW Β· Scheduler: CosineAnnealingLR
- Epochs configured:
60Β· Early stop patience:15
Intended Use
This model is intended for research and educational work on automated chest X-ray pathology detection. It outputs two predictions per image:
- Multi-label scores β independent sigmoid probability for each of 14 NIH pathologies
- Binary score β sigmoid probability of any abnormality (Normal vs. Abnormal)
Limitations
- Not validated for clinical use. Predictions must not substitute professional medical judgment.
- Trained on NIH Chest X-ray14, which contains noisy radiologist annotations (patient-level labels, not lesion-level).
- Performance degrades on images from equipment, patient populations, or preprocessing pipelines that differ from the NIH training distribution.
- Reported AUC metrics are on the validation split, not the held-out test set.
CheXNet Benchmark Context
CheXNet (Rajpurkar et al., 2017) β the seminal paper establishing DenseNet-121 for chest X-ray classification β reported 0.841 macro AUC-ROC on a comparable split of this dataset. CheXVision-DenseNet matches this benchmark. See the CheXVision demo for live inference.
Citation
@misc{chexvision2026,
title={CheXVision: Dual-Task Chest X-ray Classification with Custom CNN and DenseNet-121},
author={BIG D(ATA) Team},
year={2026},
howpublished={\url{https://huggingface.co/HlexNC/chexvision-densenet}}
}