Mobile QR Code
Title Boosting Principal Frequency based Data Augmentation for Sound Event Detection
Authors 남지원(Jiwon Nam) ; 박상욱(Sangwook Park)
DOI https://doi.org/10.5573/ieie.2024.61.7.77
Page pp.77-83
ISSN 2287-5026
Keywords Boosting principal frequency; Dcase; Sound event detection; Data augmentation; Audio&speech
Abstract Sound event detection aims to identify sounds of interest into the type of sounds and each interval. So, it is a crucial technique of audio-based situation awareness, multimedia analysis, and monitoring systems. To improve detection performance, an acoustic model that reflects the statistic of target sounds, which occur in various situations, is needed. However, it is difficult to consider various situations due to high cost of audio collection and preparing training dataset. To address this challenge, this paper analyzes the effectiveness of conventional data augmentation approaches applicable to training acoustic models efficiently, and then proposes a new data augmentation method. In the experiments based on the testbed provided by DCASE2020 task4, the proposed method shows the best performance in an event-based F1 score of 39.39% compared to other comparison groups such as time shiting, time rolling, masking, filtering, adding noise, and mix-up.