Hand-crafted versus learned representations for audio event detection
Küçükbay S.E. Yazıcı A. Kalkan S.
September 2022Springer
Multimedia Tools and Applications
2022#81Issue 2130911 - 30930 pp.
Audio Event Detection (AED) pertains to identifying the types of events in audio signals. AED is essential for applications requiring decisions based on audio signals, which can be critical, for example, for health, surveillance and security applications. Despite the proven benefits of deep learning in obtaining the best representation for solving a problem, AED studies still generally employ hand-crafted representations even when deep learning is used for solving the AED task. Intrigued by this, we investigate whether or not hand-crafted representations (i.e. spectogram, mel spectogram, log mel spectogram and mel frequency cepstral coefficients) are better than a representation learned using a Convolutional Autoencoder (CAE). To the best of our knowledge, our study is the first to ask this question and thoroughly compare feature representations for AED. To this end, we first find the best hop size and window size for each hand-crafted representation and compare the optimized hand-crafted representations with CAE-learned representations. Our extensive analyses on a subset of the AudioSet dataset confirm the common practice in that hand-crafted representations do perform better than learned features by a large margin (∼ 30 AP). Moreover, we show that the commonly used window and hop sizes do not provide the optimal performances for the hand-crafted representations.
Audio event classification , Audio event detection , Deep learning , Log mel spectogram , Mel spectrogram , MFCC , Spectrogram
Text of the article Перейти на текст статьи
Department of Computer Engineering, Middle East Technical University, Ankara, Turkey
Department of Computer Science, SEDS, Nazarbayev University, Nur Sultan, Kazakhstan
Department of Computer Engineering
Department of Computer Science
10 лет помогаем публиковать статьи Международный издатель
Книга Публикация научной статьи Волощук 2026 Book Publication of a scientific article 2026