Preview

Herald of the Kazakh-British Technical University

Advanced search

CLASSIFICATION OF LUNG CALCIFICATIONS AND CANCER IN LUNGS-RADS SYSTEM BASED ON RADIOLOGICAL FEATURES

https://doi.org/10.55452/1998-6688-2024-21-4-32-44

Abstract

Lung cancer represents a significant health challenge both in Kazakhstan and globally, standing out as one of the most fatal forms of cancer. Diagnosis of lung cancer is challenging as symptoms often remain undetectable in the early stages. Furthermore, lung cancer shares clinical features with various other pulmonary conditions, complicating its accurate identification. Accurate diagnosis typically involves lung puncture for subsequent biopsy, a highly invasive and painful procedure for the patient. Therefore, it is crucial to distinguish false positive cases in the diagnostic stage of computed tomography scans. We conducted a comparative analysis of five machine learning models (Logistic Regression, Decision Tree, Random Forest, SVM, and Naïve Bayes Algorithms) based on radiological features extracted from annotated computed tomography scans. We opted for classical machine learning methods because their decision-making process is easier to control compared to neural networks. We evaluated the models in terms of binary and multi-class classification to determine whether a given nodule is related to calcifications or cancers, as well as its classification according to Lung-RADS, enabling the management of whether further biopsy or only routine monitoring is necessary. We used Precision to evaluate the number of False Positive predictions in the binary classification task. Precision emerged as a pivotal metric in our assessment, offering insights into the number of false positive predictions specifically in the binary classification task. For the multi-class classification aspect, we turned to Quadratic Kappa, a robust measure that accounts for the ordinal nature of the Lung-RADS classes. Our analysis was underpinned by a combination of local Kazakhstani data and the publicly available LIDC-IDRI dataset, underscoring our commitment to leveraging diverse data sources to bolster diagnostic capabilities.

About the Author

D. Nam
Kazakh-British Technical University
Kazakhstan

Master of Tech. Sci., PhD Student

Almaty



References

1. Ferlay J., Ervik M., Lam F., et al. Global Cancer Observatory: Cancer Today. Lyon, France: International Agency for Research on Cancer, 2022. Available from: https://gco.iarc.fr/today.

2. Raza R., Zulfiqar F., Khan M. O., Arif M., Alvi A., Iftikhar M. A., & Alam T. Lung-EffNet: Lung cancer classification using EfficientNet from CT-scan images. Engineering Applications of Artificial Intelligence, 2023, vol. 126, p. 106902.

3. Tan M., & Le Q. Efficientnet: Rethinking model scaling for convolutional neural networks. In International conference on machine learning, 2019, May, pp. 6105–6114. PMLR.

4. Cho J., Kim J., Lee K. J., Nam C.M., Yoon S.H., Song H. ... & Lee K.W. Incidence lung cancer after a negative ct screening in the national lung screening trial: Deep learning-based detection of missed lung cancers. Journal of Clinical Medicine, 2020, vol. 9, no. 12, p. 3908.

5. Huang G., Liu Z., Van Der Maaten L., & Weinberger K.Q. Densely connected convolutional networks. In Proceedings of the IEEE conference on computer vision and pattern recognition, 2017, pp. 4700–4708.

6. Chui K.T., Gupta B.B., Jhaveri R.H., Chi H.R., Arya V., Almomani A., & Nauman A. Multiround transfer learning and modified generative adversarial network for lung cancer detection. International Journal of Intelligent Systems, 2023, pp. 1–14.

7. Tiwari A., Hannan S.A., Pinnamaneni R., Al-Ansari A.R.M., El-Ebiary Y.A.B., Prema S. ... & Vidalón J.L.J. Optimized Ensemble of Hybrid RNN-GAN Models for Accurate and Automated Lung Tumour Detection from CT Images. International Journal of Advanced Computer Science and Applications, 2023, vol. 14, no.7.

8. Götz T.I., Göb S., Sawant S., Erick X.F., Wittenberg T., Schmidkonz C. ... & Ramming A. Number of necessary training examples for Neural Networks with different number of trainable parameters. Journal of Pathology Informatics, 2022, no.13, p. 100114.

9. Goel K., Gu A., Li Y., & Ré C. Model patching: Closing the subgroup performance gap with data augmentation, 2020, arXiv preprint arXiv:2008.06775.

10. Banerjee I., Bhimireddy A.R., Burns J.L., Celi L.A., Chen L.C., Correa R. ... & Gichoya J.W. Reading race: AI recognises patient’s racial identity in medical images, 2021, arXiv preprint arXiv:2107.10356.

11. Gichoya J.W., Banerjee I., Bhimireddy A.R., Burns J.L., Celi L.A., Chen L.C. ... & Zhang H. (2022). AI recognition of patient race in medical imaging: a modelling study. The Lancet Digital Health, vol.4, no. 6, e406-e414.

12. Jayaraj D., & Sathiamoorthy S. Random forest based classification model for lung cancer prediction on computer tomography images. In 2019 International Conference on Smart Systems and Inventive Technology (ICSSIT), 2019, November, pp. 100–104. IEEE.

13. Armato III S.G., McLennan G., Bidaut L., et al. Data From LIDC-IDRI [Data set]. The Cancer Imaging Archive, 2015. https://doi.org/10.7937/K9/TCIA.2015.LO9QL9SX.

14. Beucher S., & Meyer F. The morphological approach to segmentation: the watershed transformation. In Mathematical morphology in image processing, 2018, pp. 433–481).

15. Breiman L. Random forests. Machine learning, 2001, no. 45, pp. 5–32.

16. Paing May Phu, and Somsak Choomchuay. Improved random forest (RF) classifier for imbalanced classification of lung nodules. 2018 International Conference on Engineering, Applied Sciences, and Technology (ICEAST). IEEE, 2018.

17. Kareem H.F., AL-Husieny M.S., Mohsen F.Y., Khalil E.A., & Hassan Z.S. Evaluation of SVM performance in the detection of lung cancer in marked CT scan dataset. Indonesian Journal of Electrical Engineering and Computer Science, 2021, vol. 21, no. 3, p. 1731.

18. Otsu N. A threshold selection method from gray-level histograms. Automatica, 1975, vol. 11, no. 285–296, pp. 23–27.

19. Gabor D. Theory of communication. Part 1: The analysis of information. Journal of the Institution of Electrical Engineers-part III: radio and communication engineering, 1946, vol. 93, no. 26, pp. 429–441.

20. Haralick R.M., Shanmugam K., & Dinstein I.H. Textural features for image classification. IEEE Transactions on systems, man, and cybernetics, 1973, no. 6, pp. 610–621.

21. Cripsy J. Viji, and Divya T. Lung Cancer Disease Prediction and Classification based on Feature Selection method using Bayesian Network, Logistic Regression, J48, Random Forest, and Naïve Bayes Algorithms. 2023 3rd International Conference on Smart Data Intelligence (ICSMDI). IEEE, 2023.

22. Pearson K. VII. Note on regression and inheritance in the case of two parents. proceedings of the royal society of London, 1895, vol. 58, no. 347–352, pp. 240–242.

23. Spearman C. The proof and measurement of association between two things, 1961.

24. Nam D., Panina A., Pak A. Lung cancer segmentation dataset with Lung-RADS class, Mendeley Data, V1, doi: 10.17632/5rr22hgzwr.1


Review

For citations:


Nam D. CLASSIFICATION OF LUNG CALCIFICATIONS AND CANCER IN LUNGS-RADS SYSTEM BASED ON RADIOLOGICAL FEATURES. Herald of the Kazakh-British Technical University. 2024;21(4):32-44. https://doi.org/10.55452/1998-6688-2024-21-4-32-44

Views: 187


Creative Commons License
This work is licensed under a Creative Commons Attribution 4.0 License.


ISSN 1998-6688 (Print)
ISSN 2959-8109 (Online)