SPEAKER RECOGNITION BY ULTRASHORT UTTERANCES
Medetov B. Nurlankyzy A. Namazbayev T. Akhmediyarova A. Zhetpisbayev K. Zhetpisbayeva A. Kargulova A.
2025Technology Center
Eastern-European Journal of Enterprise Technologies
2025#2Issue 9(134)62 - 69 pp.
The object of this study is the accuracy of announcer identification based on short utterances. To solve the task of speaker identification based on ultrashort speech utterances, a phoneme-by-phoneme approach to constructing voice models has been proposed within the framework of the study. The validity of this approach is based on the fact that short utterances usually contain a limited number of phonemes. In this regard, a hypothesis was put forward assuming that in order to increase the accuracy of announcer identification based on short utterances, it is necessary to analyze the sound of specific phonemes by different announcers. The experiments involved speech recordings of monosyllabic words with corresponding phonemes, on the basis of which, using the ECAPA-TDNN neural network architecture, announcer voice models were constructed. The experimental studies showed that voice models constructed based on the sounds of only one model provide higher announcer identification accuracy compared to generalized models constructed based on all speech sounds. It was also found that different phonemes provide different announcer identification accuracy. For example, with a speech signal duration of 2–3 seconds, the accuracy of announcer identification by the generalized model was 75 %. And the accuracy of announcer identification using a model built on the basis of only one phoneme “E”, with the same input data, was 85 %, which is 10 percentage points higher than that of the generalized model Copyright
announcer recognition , ECAPA-TDNN , phoneme-by-phoneme recognition , phonemes of the Kazakh language , ultra-short utterances
Text of the article Перейти на текст статьи
Department of Electronics, Telecommunications and Space Technologies
Department of Space Engineering, Non-Profit Joint Stock Company “Almaty University of Power Engineering and Telecommunications named after Gumarbek Daukeyev”, Baytursynuli str., 126/1, Almaty, 050013, Kazakhstan
Department of Solid State Physics and Nonlinear Physics, Al-Farabi Kazakh National University, Al-Farabi ave., 71, Almaty, 050040, Kazakhstan
Department of Software Engineering
Department of Radio Engineering, Electronics and Telecommunications, Turan University, Satpayev str., 16A, Almaty, 050013, Kazakhstan
Department of Electric Power Supply, S. Seifullin Kazakh Agrotechnical Research University, Zhenis ave., 62, Astana, 010011, Kazakhstan
Department of Radio Engineering, Electronics and Telecommunications, L.N. Gumilyov Eurasian National University, Satbayev str., 2, Astana, 010008, Kazakhstan
Satbayev University, Satbayev str., 22, Almaty, 050013, Kazakhstan
Department of Electronics
Department of Space Engineering
Department of Solid State Physics and Nonlinear Physics
Department of Software Engineering
Department of Radio Engineering
Department of Electric Power Supply
Department of Radio Engineering
Satbayev University
10 лет помогаем публиковать статьи Международный издатель
Книга Публикация научной статьи Волощук 2026 Book Publication of a scientific article 2026