COMPARATIVE ANALYSIS OF THE EFFECTIVENESS OF TRANSFORMER AND CONVOLUTIONAL NEURAL NETWORK ARCHITECTURES FOR AUTOMATIC CLASSIFICATION OF RICE LEAF DISEASES
https://doi.org/10.55452/1998-6688-2025-22-3-149-160
Abstract
This article presents a comparative analysis of modern neural network architectures, convolutional neural networks (CNNs) and transformers, for the automatic diagnosis of rice leaf diseases. In the experiments, DenseNet121, ResNet, Vision Transformer (ViT), and MaxViT models were trained and tested, followed by their evaluation in terms of accuracy and computational efficiency. The study was conducted on a large-scale dataset containing real images of healthy and diseased rice leaves, which makes the results highly relevant for agricultural science and practice. The experiments included hyperparameter optimization, application of data augmentation techniques, and the use of loss functions and regularization methods to improve the generalization ability of the models. The evaluation metrics comprised classification accuracy, F1-score, as well as computational efficiency indicators such as prediction time and resource consumption. The results showed that transformer-based models, particularly MaxViT, achieve accuracy of up to 94.10%. This is attributed to their ability to effectively capture both local and global image features through attention mechanisms and deep contextualization. At the same time, CNN architectures such as DenseNet121 and ResNet demonstrate high processing speed and robustness under limited computational resources.
About the Authors
D. B. JurayevKazakhstan
Bachelor student
Almaty
I. M. Ualiyeva
Kazakhstan
Cand.Phys.-Math.Sc., Associate Professor
Almaty
A. Zh. Akzhalova
Kazakhstan
PhD, professor
Almaty
References
1. Ahad, M.T., Li, Y., Song, B., Bhuiyan, T. Comparison of CNN-based deep learning architectures for rice diseases classification. Artificial Intelligence in Agriculture, 9, 22–35 (2023). https://doi.org/10.1016/j.aiia.2023.07.001.
2. Sethy, P.K., Barpanda, N.K., Rath, A.K., Behera, S.K. Deep feature based rice leaf disease identification using support vector machine. Computers and Electronics in Agriculture, 175, 105527 (2020). https://doi.org/10.1016/j.compag.2020.105527.
3. Simhadri, C.G., Kondaveeti, H.K., Vatsavayi, V.K., Mitra, A., Ananthachari, P. Deep learning for rice leaf disease detection: A systematic literature review on emerging trends, methodologies and techniques. Information Processing in Agriculture (2024, May). https://doi.org/10.1016/j.inpa.2024.04.006.
4. Dosovitskiy, A., Beyer, L., Kolesnikov, A., Weissenborn, D., Zhai, X., Unterthiner, T. et al. An image is worth 16x16 words: Transformers for image recognition at scale. arXiv preprint (2020). URL: https://arxiv.org/abs/2010.11929.
5. Haque, M.A., Pastor-Escuredo, D., Brigui, I., Kesswani, N., Bordoloi, S., Ray, A. K. Rice disease identification using Vision Transformer (ViT) based network. The Future of Artificial Intelligence and Robotics (Cham: Springer, 2024), pp. 732–741. https://doi.org/10.1007/978-3-031-60935-0_63.
6. Mhaned, A., Salma, M., El Haji, M., Jamal, B. Plant disease detection using vision transformers. International Journal of Electrical and Computer Engineering, 15 (2), 2334–2344 (2025). https://doi.org/10.11591/ijece.v15i2.
7. Zhang, H. Attention-based feature enhancement for rice leaf disease recognition. Proc. 2nd Int. Conf. Artificial Intelligence and Automation in High-Performance Computing (AIAHPC 2022), pp. 41–49. https://doi.org/10.1117/12.2641832.
8. Tharani, P., Baranidharan, B. A hybrid ViT-CNN model premeditated for rice leaf disease identification. International Journal of Computational Methods and Experimental Measurements, 12 (1), 35–43 (2024). https://doi.org/10.18280/ijcmem.120104.
9. Mustofa, S., Munna, M., Emon, Y., Rabbany, G., Ahad, M. A comprehensive review on plant leaf disease detection using deep learning // arXiv preprint (2023). URL: https://arxiv.org/abs/2308.14087.
10. Li, Z., Liu, F., Yang, W., Peng, S., Zhou, J. A survey of convolutional neural networks: Analysis, applications, and prospects. IEEE Transactions on Neural Networks and Learning Systems (2021), pp. 1–21. https://doi.org/10.1109/TNNLS.2021.3084827.
11. Tu, Z., Talebi, H., Zhang, H., Yang, F., Milanfar, P., Bovik, A., Li, Y. MaxViT: Multi-axis vision transformer. arXiv preprint (2022). URL: https://arxiv.org/abs/2204.01697.
12. Ahad, Md & Li, Yan & Song, Bo & Bhuiyan, Touhid. Comparison of CNN-based deep learning architectures for rice diseases classification. Artificial Intelligence in Agriculture, 9, 22–35 (2023). https://doi.org/ 10.1016/j.aiia.2023.07.001.
Review
For citations:
Jurayev D.B., Ualiyeva I.M., Akzhalova A.Zh. COMPARATIVE ANALYSIS OF THE EFFECTIVENESS OF TRANSFORMER AND CONVOLUTIONAL NEURAL NETWORK ARCHITECTURES FOR AUTOMATIC CLASSIFICATION OF RICE LEAF DISEASES. Herald of the Kazakh-British Technical University. 2025;22(3):149-160. (In Russ.) https://doi.org/10.55452/1998-6688-2025-22-3-149-160