AUTOMATIC KAZAKH SPEECH RECOGNITION WITH DNN

O. Mamyrbayev; M. Turdalyuly; N. Mekebayev; T. Turdalykyzy; A. Shayakhmetova

AUTOMATIC KAZAKH SPEECH RECOGNITION WITH DNN

O. Mamyrbayev, M. Turdalyuly, N. Mekebayev, T. Turdalykyzy, A. Shayakhmetova

Full Text:

PDF (Rus) |

Generate QR code

Abstract

This paper describes one of the areas in the field of artificial intelligence speech recognition systems. Comparing the speeches o f Kazakh and other languages, they identified the main problems of automatic recognition of this language. One of the main problems is the lack of speech data, for which work was carried out to collect acoustic data of the Kazakh language. In order to continue the research work related to the Kazakh language, the personal data of the announcers were identified. Algorithms for processing speech signals, learning acoustic and language modeling are described and research and practical work is carried out. Test results of speech recognition using deep neural networks were obtained. Comparisons with the results of traditional models and the best DNN (Deep Neural Network) aspects.

Keywords

Kazakh language speech recognition, speech recognition systems, deep neural networks, DNN speech processing

About the Authors

O. Mamyrbayev

Институт информационных и вычислительных технологий КН МОН РК
Kazakhstan

M. Turdalyuly

Институт информационных и вычислительных технологий КН МОН РК
Kazakhstan

N. Mekebayev

Институт информационных и вычислительных технологий КН МОН РК; Казахский Национальный университет им. аль-Фараби
Kazakhstan

T. Turdalykyzy

Институт информационных и вычислительных технологий КН МОН РК
Kazakhstan

A. Shayakhmetova

Институт информационных и вычислительных технологий КН МОН РК
Kazakhstan

References

1. Stouten F., Duchateau J., Martens J.-P., Wambacq P. Coping with disfluencies in spontaneous speech recognition: acoustic detection and linguistic context manipulation // Speech Communication. 2006. Vol. 48. pp. 1590-1606.

2. Tsiaras V., Panagiotakis C., Stylianou Y. Video and audio based detection of filled hesitation pauses in classroom lectures // Proc. o f the 17th European Signal Processing Conference (EUSIPCO 2009). Glasgow, Scotland, August 24-28, 2009. pp. 834-838.

3. Psutka J., Ircing P., Psutka J. V., Hajic J., Byrne W. J., Mirovsky J. Automatic Transcription of Czech, Russian, and Slovak Spontaneous Speech in the M ALACH Project // Proceedings of Eurospeech. Lisboa. Portugal. Sept. 4-8. 2005. pp. 1349-1352.

4. Young S. et al. The HTK Book (for HTK Version 3.4). Cambridge. UK, 2009. 375 p.

5. Karpov A., Kipyatkova I., Ronzhin A. Very Large Vocabulary A SR for Spoken Russian with Syntactic and Morphemic Analysis. In Proc. INTERSPEECH-2011, Florence, Italy, 2011, pp. 3161-3164.

6. Serizel, R., Giuliani, D.: Vocal tract length normalization approaches to DNN-Based children’s and adults’ speech recognition. IEEE W orkshop on Spoken Language Technology, pp. 135-140. 2014.

7. Behbahani, Yasser Mohseni, Babaali, Bagher, Turdalyuly Mussa Persian sentences to phoneme sequences conversion based on recurrent neural networks // Open Computer Science. - 2016. - Issue-6. - P. 219-225.

8. Dong Yu., Li Deng Automatic Speech Recognition // Shpringer. -2014. P. -315.

Review

For citations:

Mamyrbayev O., Turdalyuly M., Mekebayev N., Turdalykyzy T., Shayakhmetova A. AUTOMATIC KAZAKH SPEECH RECOGNITION WITH DNN. Herald of the Kazakh-British Technical University. 2019;16(2):134-142. (In Russ.)

This work is licensed under a Creative Commons Attribution 4.0 License.

ISSN 1998-6688 (Print)
ISSN 2959-8109 (Online)

Username
Password
	Remember me
Not a user? Register with this site Forgot your password?

User

Herald of the Kazakh-British Technical University

AUTOMATIC KAZAKH SPEECH RECOGNITION WITH DNN

Full Text:

Abstract

Keywords

About the Authors

References

Review

For citations:

Cookies policy