Cancer is a chaotic disease known as the plague of our age and there are many subtypes of the cancer. Cancer is commonly seen disorder and its mortality rate is very high. Therefore, many researchers have worked/studied on the cancer detection and treatment. To contribute cancer studies according to machine learning, we have presented a new generation convolutional neural network (CNN) termed ShortNeXt in this research. The presented ShortNeXt has inspired by ResNet, ConvNeXt and MobileNet architectures to use the advantages these CNNs together. This model, which aims to extract robust feature map using convolution-based residual blocks, is named ShortNeXt because it incorporates more than one shortcut. The ShortNeXt architecture has four main stages and these stages are: (i) an input/stem, (ii) ShortNeXt, (iii) downsampling, and (iv) output. In this CNN architecture, convolution, batch normalization and the Gaussian Error Linear Unit (GELU) activation functions have been utilized. In this aspect, the implementation of the recommended ShortNeXt is simple. The stem stage uses a 4 × 4 sized convolution with stride 4 like ConvNeXt and Swin Transformer and this operation is named patchify operation. Additionally, a 2 × 2 patchify block has been used in the downsampling block. In the ShortNeXt block, an inverted bottleneck has been used, and both 1 × 1 and 3 × 3 convolution blocks are employed in the expansion phase. The output layer has increased the number of filters from 768 to 1280 by using pixel-wise convolution, drawing inspiration from MobileNetV2 and a final feature map with a length of 1280 has been obtained by deploying global average pooling (GAP). In the classification phase, fully connected and softmax operators have been used. To get comparative results about to the recommended ShortNeXt, a publicly available histopathological image dataset has been used and this dataset contains nine classes, and the proposed ShortNeXt has achieved 97.82% and 97.86% validation and test accuracy, respectively. The obtained results and findings openly showcases that ShortNeXt is an effective deep learning method for histopathological image classification for cancer detection/classification.

ShortNeXt: A novel method for accurate classification of colorectal cancer histopathology images / Barua, Prabal Datta; Tasci, Burak; Baygin, Mehmet; Dogan, Sengul; Tuncer, Turker; Molinari, Filippo; Salvi, Massimo; Acharya, U. Rajendra. - In: COMPUTER VISION AND IMAGE UNDERSTANDING. - ISSN 1077-3142. - 265:(2026). [10.1016/j.cviu.2026.104672]

ShortNeXt: A novel method for accurate classification of colorectal cancer histopathology images

Molinari, Filippo;Massimo, Salvi;
2026

Abstract

Cancer is a chaotic disease known as the plague of our age and there are many subtypes of the cancer. Cancer is commonly seen disorder and its mortality rate is very high. Therefore, many researchers have worked/studied on the cancer detection and treatment. To contribute cancer studies according to machine learning, we have presented a new generation convolutional neural network (CNN) termed ShortNeXt in this research. The presented ShortNeXt has inspired by ResNet, ConvNeXt and MobileNet architectures to use the advantages these CNNs together. This model, which aims to extract robust feature map using convolution-based residual blocks, is named ShortNeXt because it incorporates more than one shortcut. The ShortNeXt architecture has four main stages and these stages are: (i) an input/stem, (ii) ShortNeXt, (iii) downsampling, and (iv) output. In this CNN architecture, convolution, batch normalization and the Gaussian Error Linear Unit (GELU) activation functions have been utilized. In this aspect, the implementation of the recommended ShortNeXt is simple. The stem stage uses a 4 × 4 sized convolution with stride 4 like ConvNeXt and Swin Transformer and this operation is named patchify operation. Additionally, a 2 × 2 patchify block has been used in the downsampling block. In the ShortNeXt block, an inverted bottleneck has been used, and both 1 × 1 and 3 × 3 convolution blocks are employed in the expansion phase. The output layer has increased the number of filters from 768 to 1280 by using pixel-wise convolution, drawing inspiration from MobileNetV2 and a final feature map with a length of 1280 has been obtained by deploying global average pooling (GAP). In the classification phase, fully connected and softmax operators have been used. To get comparative results about to the recommended ShortNeXt, a publicly available histopathological image dataset has been used and this dataset contains nine classes, and the proposed ShortNeXt has achieved 97.82% and 97.86% validation and test accuracy, respectively. The obtained results and findings openly showcases that ShortNeXt is an effective deep learning method for histopathological image classification for cancer detection/classification.
File in questo prodotto:
File Dimensione Formato  
1-s2.0-S1077314226000391-main.pdf

accesso aperto

Tipologia: 2a Post-print versione editoriale / Version of Record
Licenza: Creative commons
Dimensione 2.08 MB
Formato Adobe PDF
2.08 MB Adobe PDF Visualizza/Apri
Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11583/3007417