Specifics of Data Collection and Data Processing during Formation of RailVista Dataset for Machine Learning- and Deep Learning-Based Applications


Abisheva G. Goranin N. Razakhova B. Aidynov T. Satybaldina D.
August 2024Multidisciplinary Digital Publishing Institute (MDPI)

Sensors
2024#24Issue 16

This paper presents the methodology and outcomes of creating the Rail Vista dataset, designed for detecting defects on railway tracks using machine and deep learning techniques. The dataset comprises 200,000 high-resolution images categorized into 19 distinct classes covering various railway infrastructure defects. The data collection involved a meticulous process including complex image capture methods, distortion techniques for data enrichment, and secure storage in a data warehouse using efficient binary file formats. This structured dataset facilitates effective training of machine/deep learning models, enhancing automated defect detection systems in railway safety and maintenance applications. The study underscores the critical role of high-quality datasets in advancing machine learning applications within the railway domain, highlighting future prospects for improving safety and reliability through automated recognition technologies.

data collection , dataset , machine learning , railway , railway track defects

Text of the article Перейти на текст статьи

Department of Artificial Intelligence Technology, Faculty of Information Technologies, L.N. Gumilyov, Eurasian National University, Astana, KZ-010000, Kazakhstan
Department of Information Systems, Faculty of Fundamental Sciences, Vilnius Gediminas Technical University, Vilnius, LT-08412, Lithuania
Department of Information Security, Faculty of Information Technologies, L.N. Gumilyov, Eurasian National University, Astana, KZ-010000, Kazakhstan

Department of Artificial Intelligence Technology
Department of Information Systems
Department of Information Security

10 лет помогаем публиковать статьи Международный издатель

Книга Публикация научной статьи Волощук 2026 Book Publication of a scientific article 2026