Comparison of Convmixer Method and Resnet Method in Classification and Detection of Gastrointestinal Diseases Using Kvasir Dataset
Abstract
This research discusses the comparison of ConvMixer and ResNet methods in the classification and detection of gastrointestinal diseases using the Kvasir dataset. Gastrointestinal diseases are often difficult to detect early due to the similarity of visual patterns in endoscopy images, requiring an efficient deep learning-based solution. The purpose of this research is to compare and evaluate the models used. The research used a quantitative approach with an experimental method. Endoscopy image data was processed through augmentation, normalization, and division of the dataset into train, validation, and test. ConvMixer and ResNet were trained with customized hyperparameters, and evaluated using accuracy, precision, recall, and F1-score metrics.
The results showed that ResNet excelled with 86% accuracy, slightly higher than ConvMixer which recorded 84% accuracy. ResNet's residual structure overcomes the problem of vanishing gradients, while ConvMixer offers better training speed. Both models showed high performance, although the challenge of similar patterns between classes was still an obstacle. As a result, ResNet provides better results in detecting gastrointestinal diseases, but ConvMixer is also a promising alternative. Further development with more diverse datasets is needed to improve model performance.
Downloads
References
N. I. Firdausi, “Implementasi Deep Learning Dalam Mendeteksi Penyakit Menggunakan Convolutional Neural Network Dan Tensorflow,” Kaos GL Derg., vol. 8, no. 75, pp. 147–154, 2020.
Y. Pantha, “Enhancing Diagnostic Accuracy in Medicine Using Artificial Intelligence: a Deep Learning Approach,” J. Enhanc. Heat Transf., no. November, 2024.
H. Laçi, K. Sevrani, and S. Iqbal, “Deep learning approaches for classification tasks in medical X-ray, MRI, and ultrasound images: a scoping review,” BMC Med. Imaging, vol. 25, no. 1, 2025.
C. Li, L. Li, and J. Shi, “Gastrointestinal endoscopy in early diagnosis and treatment of gastrointestinal tumors,” Pakistan J. Med. Sci., vol. 36, no. 2, pp. 203–207, 2020.
F. F. Youssef, L. L. Branch, M. Kowalczyk, and T. J. Savides, “Endoscopic Approaches for Managing Small Intestinal Disease,” Annu. Rev. Med., vol. 76, pp. 155–165, 2025.
A. Ali, A. Iqbal, S. Khan, N. Ahmad, and S. Shah, “A two-phase transfer learning framework for gastrointestinal diseases classification,” PeerJ Comput. Sci., vol. 10, pp. 1–31, 2024.
K. Pogorelov et al., “Kvasir: A multi-class image dataset for computer aided gastrointestinal disease detection,” Proc. 8th ACM Multimed. Syst. Conf. MMSys 2017, pp. 164–169, 2017.
Z. Muthiah, O. Triananda, and M. Iqbal, “Deep Learning Based Classification of TB and Normal Chest X-rays Using a Custom CNN with Minimal Epoch Training,” vol. 10, no. 2, pp. 272–279, 2025.
S. A. El-Ghany, M. A. Mahmood, and A. A. Abd El-Aziz, “An Accurate Deep Learning-Based Computer-Aided Diagnosis System for Gastrointestinal Disease Detection Using Wireless Capsule Endoscopy Image Analysis,” Appl. Sci., vol. 14, no. 22, 2024.
A. A. Demirbaş, H. Üzen, and H. Fırat, “Spatial-attention ConvMixer architecture for classification and detection of gastrointestinal diseases using the Kvasir dataset,” Heal. Inf. Sci. Syst., vol. 12, no. 1, 2024.
Copyright (c) 2025 Yuliana, Ivan Septa A P, Riska Veny F

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.















