Evaluating Cellularity Estimation Methods Comparin Part2
model itself also incurs a high computational cost to train and may suffer
from hallucinations that change the semantic content of the image, thereby introducing
label noise.
An effective way to improve the robustness of AI models trained on limited data is
data augmentation (DA) [21,22]. DA randomly perturbs the positional information
(rotation, flipping), color, and brightness of the original input images to
increase the diversity of the dataset without changing the semantics of the data (the location,
number, and type of cells). AI models trained on such augmented datasets become more
robust to variations in image quality. In [23], DA was reported to
be more effective than normalization of the input images at improving the robustness of AI
models to staining variations.
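The label-preserving augmentations described above can be sketched as follows. This is a minimal NumPy illustration of the idea, not the actual pipeline used in [26]; the transform set and parameter ranges are assumptions.

```python
import numpy as np

rng = np.random.default_rng(0)

def augment(image):
    """Randomly rotate/flip and jitter brightness. The semantics of the
    tile (number and type of cells; locations up to the same geometric
    transform) are preserved."""
    # Random rotation by a multiple of 90 degrees.
    image = np.rot90(image, k=rng.integers(0, 4))
    # Random horizontal / vertical flips.
    if rng.random() < 0.5:
        image = image[:, ::-1]
    if rng.random() < 0.5:
        image = image[::-1, :]
    # Multiplicative brightness jitter, clipped back to the valid range.
    factor = rng.uniform(0.8, 1.2)
    return np.clip(image * factor, 0.0, 1.0)

tile = rng.random((256, 256, 3))   # dummy RGB tile in [0, 1]
augmented = augment(tile)
print(augmented.shape)  # (256, 256, 3)
```

Geometric augmentations applied to the image must also be applied to the cell coordinates when point annotations are used; color and brightness jitter leave the labels untouched.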
To develop practical AI-powered TCR calculation software, it is essential to perform
cell detection and classification accurately and efficiently. Chen et al. [24] developed a
method for identifying the cell cycle phase of cells from fluorescence images. Their
approach uses k-means clustering and rule-based heuristics to classify cells into different
phases; while this method may be effective for limited datasets, its generalization
performance on datasets from different institutions or acquired using different imaging
devices is likely to be constrained. Ghani et al. [25] proposed a method for accelerating
cell detection using a convolutional neural network embedded on an FPGA device. This
approach offers significant speedups compared to software-based implementations. The
authors demonstrated the effectiveness of their method in detecting different types of
damaged kidney cells with an accuracy of nearly 100%. However, the limited size of their
test dataset raises concerns about the generalizability of their results. Further validation
with a larger and more diverse dataset is recommended.
Addressing these challenges, we leverage a previously developed deep-learning
model [26] for TCR counting and use it in this study to predict the TCR for a cohort (see
Section 2.1) of 41 non-small cell lung cancer patients from four different medical institu-
tions (sites). In other reported experiments on TCR counting [27], the “gold standard”
ground truth is established by first defining a tumor area and then counting all cells within
that area as tumor cells. In contrast, our approach is more fine-grained and classifies each
cell independently as tumor or non-tumor.
In Section 2.2, we first establish a “gold standard” (GS) set of exhaustive cell-level annotations
by three pathologists in regions of interest of the cohort. We also ask 13 pathologists
to visually estimate the TCR on these regions (see Section 2.3). In Section 2.4, we detail
the model architecture, training partition, and data augmentation we use to create our AI
model. Finally, to evaluate the real-world robustness of the AI model, we devised a leave-
one-hospital-out cross-validation scheme, where we test it on images from sites unseen
during training. In Section 3, we report our findings, comparing the TCR predictions by
the AI model to the gold standard and to the pathologists’ visual estimates.
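The leave-one-hospital-out scheme can be sketched as follows. Site names and case identifiers are hypothetical placeholders; the point is that each fold tests on a site entirely unseen during training.

```python
# Leave-one-hospital-out cross-validation: in each fold, one site is
# held out in full for testing and the model is trained on the rest.
sites = {
    "site_A": ["case_01", "case_02"],
    "site_B": ["case_03", "case_04"],
    "site_C": ["case_05"],
    "site_D": ["case_06", "case_07"],
}

def leave_one_site_out(sites):
    for held_out in sites:
        train = [c for s, cases in sites.items() if s != held_out
                 for c in cases]
        test = list(sites[held_out])
        yield held_out, train, test

folds = list(leave_one_site_out(sites))
print(len(folds))  # 4 folds, one per site
```

With four sites this yields four train/test splits, and the reported performance is the aggregate over the held-out sites.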
Immunostained images from the same blocks were used during case selection to
differentiate histological types but were not subsequently used for TCR estimation by
either the pathologists or the AI model. Specimens with a high number of crushed necrotic cells were
excluded. Specimens in the cohort exhibited adenocarcinoma (27) or squamous cell
carcinoma (14). Care was taken that both cancer subtypes and sample extraction methods
were split among sites to avoid site biases.
Figure 1. Region of interest (ROI) selection on a WSI. In (A), a tissue region is selected on the WSI. In
(B), the tissue is tiled and one tile is selected as the ROI. In (C), the cells of the ROI are exhaustively labeled.
In this manner, a representative ROI was selected from each WSI of the 41 cases,
resulting in a total of 41 ROIs.
Three pathologists were then instructed to independently and exhaustively identify the
location of every cell in the ROI and to label each as a tumor cell (tCell), non-tumor
cell (nCell), or indistinguishable cell (iCell). iCells are cells within an ROI that cannot
be definitively assigned to any other category (tCell, nCell, or cell to be excluded from labeling).
Pathologists may also exclude such cells from tumor cell content calculations when DNA
nucleic acids cannot be extracted due to crushing, necrosis, degeneration, or keratinization.
In addition, cells that are not used for tumor cell content calculations by pathologists,
such as red blood cells and cells whose nuclei cannot be recognized due to necrosis or
degeneration, are excluded from the labeling process.
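Under these exclusions, the TCR reduces to a ratio over the labeled cells. This is a small sketch; the assumption that iCells are removed from both numerator and denominator follows the exclusion rules above, but the exact convention is the paper's.

```python
def tumor_cell_ratio(n_tumor, n_nontumor):
    """TCR as the fraction of counted cells that are tumor cells.
    Assumption: iCells and unlabeled cells (red blood cells, cells with
    unrecognizable nuclei) are excluded from numerator and denominator."""
    total = n_tumor + n_nontumor
    return n_tumor / total if total else 0.0

print(tumor_cell_ratio(120, 80))  # 0.6
```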
For each cell, the labels given by the three pathologists were combined into a single
final label using a majority rule. Cells were first matched using a distance-based matching
algorithm, resulting in a 3-cell match (all three pathologists annotated that cell), a 2-cell
match, or a 1-cell match. The final label was then established following the rules shown in
Table 1.
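A majority-rule fusion of this kind might look as follows. Table 1 is not reproduced here, so the tie-breaking rule (falling back to iCell when no strict majority exists) is an assumption for illustration only.

```python
from collections import Counter

def fuse_labels(labels):
    """Combine the labels of the pathologists matched to one cell
    (1-3 labels from {"tCell", "nCell", "iCell"}) into a final label.
    Assumption: a strict majority wins; otherwise fall back to "iCell".
    The paper's actual rules are those of Table 1."""
    (label, votes), *rest = Counter(labels).most_common()
    if not rest or votes > rest[0][1]:
        return label
    return "iCell"

print(fuse_labels(["tCell", "tCell", "nCell"]))  # tCell
print(fuse_labels(["tCell", "nCell"]))           # iCell (tie)
print(fuse_labels(["nCell"]))                    # nCell
```

A 1-cell match always keeps its single label under this rule; only 2-cell matches with disagreeing labels hit the fallback.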