Background Recently, deep learning has rapidly become the methodology of choice in digital pathology image analysis. However, due to the current challenges of digital pathology (color stain variability, large images, etc.), specific pre-processing steps are required to train a reliable deep learning model. Method In this work, there are two main goals: i) present a fully automated pre-processing algorithm for a smart patch selection within histopathological images, and ii) evaluate the impact of the proposed strategy within a deep learning framework for the detection of prostate and breast cancer. The proposed algorithm is specifically designed to extract patches only on informative regions (i.e., high density of nuclei), most likely representative of where cancer can be detected. Results Our strategy was developed and tested on 1000 hematoxylin and eosin (H&E) stained images of prostate and breast tissue. By combining a stain normalization step and a segmentation-driven patch extraction, the proposed approach is capable of increasing the performance of a computer-aided diagnosis (CAD) system for the detection of prostate cancer (18.61% accuracy improvement) and breast cancer (17.72% accuracy improvement). Conclusion We strongly believe that the integration of the proposed pre-processing steps within deep learning frameworks will allow the achievement of robust and reliable CAD systems. Being based on nuclei detection, this strategy can be easily extended to other glandular tissues (e.g., colon, thyroid, pancreas, etc.) or staining methods (e.g., PAS).

Impact of stain normalization and patch selection on the performance of convolutional neural networks in histological breast and prostate cancer classification / Salvi, Massimo; Molinari, Filippo; Acharya, U Rajendra; Molinaro, Luca; Meiburger, Kristen M.. - In: COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE UPDATE. - ISSN 2666-9900. - ELETTRONICO. - 1:(2021). [10.1016/j.cmpbup.2021.100004]

Impact of stain normalization and patch selection on the performance of convolutional neural networks in histological breast and prostate cancer classification

Salvi, Massimo;Molinari, Filippo;Meiburger, Kristen M.
2021

Abstract

Background Recently, deep learning has rapidly become the methodology of choice in digital pathology image analysis. However, due to the current challenges of digital pathology (color stain variability, large images, etc.), specific pre-processing steps are required to train a reliable deep learning model. Method In this work, there are two main goals: i) present a fully automated pre-processing algorithm for a smart patch selection within histopathological images, and ii) evaluate the impact of the proposed strategy within a deep learning framework for the detection of prostate and breast cancer. The proposed algorithm is specifically designed to extract patches only on informative regions (i.e., high density of nuclei), most likely representative of where cancer can be detected. Results Our strategy was developed and tested on 1000 hematoxylin and eosin (H&E) stained images of prostate and breast tissue. By combining a stain normalization step and a segmentation-driven patch extraction, the proposed approach is capable of increasing the performance of a computer-aided diagnosis (CAD) system for the detection of prostate cancer (18.61% accuracy improvement) and breast cancer (17.72% accuracy improvement). Conclusion We strongly believe that the integration of the proposed pre-processing steps within deep learning frameworks will allow the achievement of robust and reliable CAD systems. Being based on nuclei detection, this strategy can be easily extended to other glandular tissues (e.g., colon, thyroid, pancreas, etc.) or staining methods (e.g., PAS).
File in questo prodotto:
File Dimensione Formato  
(2021) paper - Impact preprocessing.pdf

accesso aperto

Tipologia: 2a Post-print versione editoriale / Version of Record
Licenza: Creative commons
Dimensione 2.48 MB
Formato Adobe PDF
2.48 MB Adobe PDF Visualizza/Apri
Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11583/2872534