Pan-Cancer Nuclei Segmentation in Hematoxylin And Eosin Whole Slide Images

USCAP 2024 Annual Meeting

Mar 25, 2024
Oncology
Presentation

Mina Khoshdeli, Rohan Joshi, Bolesław L. Osinski, Luca Lonini, Martin C. Stumpe

Background

 

Nuclei segmentation is a critical stage in characterizing the morphology of cells in Hematoxylin and Eosin (H&E) stained whole slide images (WSIs). Extensive research has been conducted on the application of deep learning models for nuclei segmentation. While individual models have shown promising performance in segmenting nuclei for specific cancer types, a gap remains in the availability of a single model to segment nuclei across cancer types. Here, we created a comprehensive training and validation dataset encompassing a broad spectrum of cancer types and histological subtypes to address this need. 

 

Design

 

A cohort of WSIs was generated including 14 cancer types, with each having up to three histological subtypes (N=410, Table I). From these WSIs, we randomly selected two fields of view (FOV) from the tissue area for annotation at 40x magnification. During the training phase, we acquired one annotation per image, while for the validation phase, we gathered four annotations per image. We trained a HoverNet model initializing using weights from a model previously trained on the CoNSeP dataset. To assess the model performance, we conducted a comparative analysis between the trained model, the initial CoNSeP-trained model, and consensus evaluations of pathologists. To determine consensus among pathologists, we computed majority votes for all possible permutations of three pathologists, with one annotator being excluded for each permutation. We compared the consensus annotations with the model’s output and the annotations of the held-out pathologists. As a result, we obtained four data points for the box plots presented in Figure1 and 2. Statistical testing was performed using a t-test for independent samples and Bonferroni correction.

 

Results

 

We used a cell-based dice score to compare the performance of models and human annotators.  We found comparable performance between the model’s output and the performance of human annotators (mean 0.767, 95% CI 0.760-0.773 for the model; mean 0.794, 95% CI 0.777-0.810 for human annotators, p=0.08 with Bonferroni correction). In prostate, non-small cell lung, pancreatic, tumor of unknown origin, melanoma, gastric, bladder, and biliary cancers we found statistically indistinguishable performance in subgroup analyses.

 

Conclusions

 

In this study, we have established an annotation dataset tailored explicitly to pan-cancer nuclei segmentation and have demonstrated that by utilizing the HoverNet architecture, we have achieved near-human-level performance.

 

Table 1

 

Cancer Type Histology subtypesNumber of images in TrainingNumber of images validation
Biliary Cancercholangiocarcinoma1212
Bladder Cancerurothelial carcinoma1212
Breast Cancerbreast carcinoma1011
Colorectal Cancercolorectal adenocarcinoma1212
Endometrial Cancercarcinosarcoma1212
endometrial serous carcinoma1212
endometrioid carcinoma1112
Esophageal Cancergastroesophageal adenocarcinoma1212
gastroesophageal squamous cell carcinoma1212
gastroesophageal adenocarcinoma1212
Melanomamelanoma1212
Non-Small Cell Lung Cancerlung adenocarcinoma1212
lung squamous cell carcinoma1212
Ovarian Cancerovarian serous carcinoma1211
Pancreatic Cancerpancreatic adenocarcinoma1212
pancreatic neuroendocrine tumor1012
Prostate Cancerprostatic adenocarcinoma1212
Sarcomafibrous sarcoma1112
leiomyosarcoma1212
Tumor of Unknown OriginNA3636

 

Figure 1

 

 

Figure 2