APPLICATION OF BLEU AND SARI METRICS IN EVALUATING SIMPLIFIED TEXTS IN KAZAKH: ANALYSIS AND EFFECTIVENESS

S. T. Nursapa; I. M. Ualiyeva

doi:10.55452/1998-6688-2025-22-1-36-43

APPLICATION OF BLEU AND SARI METRICS IN EVALUATING SIMPLIFIED TEXTS IN KAZAKH: ANALYSIS AND EFFECTIVENESS

S. T. Nursapa, I. M. Ualiyeva

https://doi.org/10.55452/1998-6688-2025-22-1-36-43

Full Text:

PDF (Rus)

Generate QR code

Abstract

This article explores the methodology for evaluating the quality of simplified texts in Kazakh using BLEU and SARI metrics. Text simplification is an important aspect for ensuring information accessibility and facilitating the learning process in Kazakh language. The BLEU metric, based on comparing n-grams of translation and reference, is widely used for evaluating the quality of machine translation, but it does not take the context of the input text into account. The SARI metric, specifically designed for evaluating text simplification, considers semantic changes and shows a higher correlation with human judgments. In this study, algorithms for replacing complex words with simple synonyms and for replacing or removing complex phrases were applied. The analysis results showed that the SARI metric is more sensitive to the changes made in simplified texts compared to BLEU. Therefore, the combined use of BLEU and SARI metrics provides a comprehensive and accurate evaluation of the quality of simplified texts in Kazakh.

Keywords

text simplification, Kazakh language, evaluation metrics, BLEU, SARI, text quality, machine translation, automatic text simplification, simplification algorithms

About the Authors

S. T. Nursapa

Al-Farabi Kazakh National University
Kazakhstan

Master’s student

Almaty

I. M. Ualiyeva

Al-Farabi Kazakh National University
Kazakhstan

Cand. Phys.-Math. Sc., Associate Professor

Almaty

References

1. Jiang, Chao, Mounica Maddela, Wuwei Lan, Yang Zhong, Wei Xu. Comput. Res. Repos., 2020. https://doi.org/10.48550/arXiv.2005.02324

2. Lindstrom, Jennifer H. Teaching Exceptional Children, 2019, vol. 51, pp. 189–200. https://doi.org/10.1177/0040059918763712.

3. Xu W., Callison-Burch C., Napoles C. Transactions of the Association for Computational Linguistics, 2015, vol. 3, pp. 283–297. https://doi.org/10.1162/tacl_a_00139.

4. Suha S. Al-Thanyyan, Aqil M. Azmi. ACM Comput. Surv., 2021, 36 p. https://doi.org/10.1145/3442695.

5. Papineni K., Roukos S., Ward T., Zhu W.-J. Proceedings of the 40th Annual Meeting on Association for Computational Linguistics – ACL ’02, 2001, p. 311. https://doi.org/10.3115/1073083.1073135.

6. Xu, Wei, Napoles, Courtney, Pavlick, Ellie, Chen, Quanze, Callison-Burch, Chris. Transactions of the Association for Computational Linguistics, 2016, pp. 401–415. https://doi.org/10.1162/tacl_a_00107.

7. Narayan S., Gardent C. Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics, 2014, vol. 1, pp. 435–445. https://doi.org/10.3115/v1/P14-1041.

8. Sulem E., Abend O., Rappoport A. Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, 2018, pp. 738–744. https://doi.org/10.18653/v1/D18-1081.

9. 1Sulem E., Abend O., Rappoport A. Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018, vol. 1, pp. 685–696. https://doi.org/10.48550/arXiv.1810.05022.

10. Qonaqjailyqqa negızdelgen qazaq ūlttyq ashanasy, soyle.kz, 2024. Available: https://www.soyle.kz/article/view?id=879. [Accessed: 26-Nov- 2024] [in Kazakh]

11. 1Janfada B., Minaei-Bidgoli B. 6th International Conference on Web Research (ICWR), 2020, p. 271. https://doi.org/10.1109/ICWR49608.2020.9122325.

Review

For citations:

Nursapa S.T., Ualiyeva I.M. APPLICATION OF BLEU AND SARI METRICS IN EVALUATING SIMPLIFIED TEXTS IN KAZAKH: ANALYSIS AND EFFECTIVENESS. Herald of the Kazakh-British Technical University. 2025;22(1):36-43. (In Russ.) https://doi.org/10.55452/1998-6688-2025-22-1-36-43

JATS XML

This work is licensed under a Creative Commons Attribution 4.0 License.

ISSN 1998-6688 (Print)
ISSN 2959-8109 (Online)

Username
Password
	Remember me
Not a user? Register with this site Forgot your password?

User

Herald of the Kazakh-British Technical University

APPLICATION OF BLEU AND SARI METRICS IN EVALUATING SIMPLIFIED TEXTS IN KAZAKH: ANALYSIS AND EFFECTIVENESS

Full Text:

Abstract

Keywords

About the Authors

References

Review

For citations:

Cookies policy