A comprehensive voice dataset for Hindko digit recognition
Ahmed T. Khan M. Khan K. Syed I. Ullah S.S.
February 2025Elsevier Inc.
Data in Brief
2025#58
Hindko is a language primarily spoken in Northwestern areas of Pakistan. Approximately eight million people speak the Hindko language. According to its native speakers, it is 7th largest language of Pakistan and 2nd largest language of Khyber Pakhtunkhwa. The Hazara region is the cultural hub of Hindko language. About 80% of the population in districts like Haripur, Abbotabad and Mansehra speak Hindko. The spoken content of Hindko covers a wide range of subjects, including religion, education, poetry, politics, theater, and more. Despite all this, Hindko lacks a voice recognition system that could enhance accessibility, preserve the language, and promote digital inclusion for its speakers. This paper presents a voice recognition dataset that consists of 17,597 voice samples, and is accessible to the public for academic and research purposes. The dataset consists of 20 Hindko digits ranging from 1 to 20 and all the voice samples are taken from the students and staff and faculty of Pak-Austria Fachhochschule Institute of Applied Science and Technology.
Artificial intelligence , Machine learning , Natural language processing , Signal processing , Voice recognition
Text of the article Перейти на текст статьи
Pak-Austria Fachhochschule: Institute of Applied Sciences and Technology, Haripur, Pakistan
Software Competence Center Hagenberg, Softwarepark 32a, Hagenberg, 4232, Austria
Department of Computer Science, School of Engineering and Digital Sciences, Nazarbayev University, Kazakhstan
Department of Information & Communication Engineering, Hankuk University of Foreign Studies, Yongin, 17035, South Korea
Department of Information & Communication Technology, University of Agder (UiA), Norway
Pak-Austria Fachhochschule: Institute of Applied Sciences and Technology
Software Competence Center Hagenberg
Department of Computer Science
Department of Information & Communication Engineering
Department of Information & Communication Technology
10 лет помогаем публиковать статьи Международный издатель
Книга Публикация научной статьи Волощук 2026 Book Publication of a scientific article 2026