Musical Instrument Tagging Using Data Augmentation and Effective Noisy Data Processing

Damian Koszewski; Bożena Kostek

doi:10.17743/jaes.2019.0050

Musical Instrument Tagging Using Data Augmentation and Effective Noisy Data Processing

Abstrakt

Developing signal processing methods to extract information automatically has potential in several applications, for example searching for multimedia based on its audio content, making context-aware mobile applications (e.g., tuning apps), or pre-processing for an automatic mixing system. However, the last-mentioned application needs a significant amount of research to reliably recognize real musical instruments in recordings. In this paper we primarily focus on how to obtain data for efficiently training, validating, and testing a deep-learning model by using a data augmentation technique. These data are transformed into 2D feature spaces, i.e., mel-scale spectrograms. The Neural Network used in the experiments consists of a single-block DenseNet architecture and a multi-head softmax classifier for efficient learning with the mixup augmentation. For automatic noisy data labeling, the batch-wise loss masking, which is robust to corrupting outliers in data, was applied. To train the models, various audio sample rates and different audio representations were utilized. The method provides promising recognition scores even with real-world recordings that contain noisy data.

Cytowania

7

CrossRef
0

Web of Science
8

Scopus

Autorzy (2)

Cytuj jako

Pełna treść

pobierz publikację

pobrano 294 razy

Wersja publikacji: Accepted albo Published Version
Licencja: Copyright (2020 Audio Eng. Society)

pełna treść artykułu zobacz w serwisie zewnętrznym otwiera się w nowej karcie

Słowa kluczowe

Informacje szczegółowe

Kategoria:

Publikacja w czasopiśmie

Typ:

artykuły w czasopismach

Opublikowano w:

JOURNAL OF THE AUDIO ENGINEERING SOCIETY nr 68, strony 57 - 65,
ISSN: 1549-4950

Język:

angielski

Rok wydania:

2020

Opis bibliograficzny:

Koszewski D., Kostek B.: Musical Instrument Tagging Using Data Augmentation and Effective Noisy Data Processing// JOURNAL OF THE AUDIO ENGINEERING SOCIETY -Vol. 68,iss. 1/2 (2020), s.57-65

DOI:

10.17743/jaes.2019.0050

Źródła finansowania:

Działalność statutowa/subwencja

Weryfikacja:

Politechnika Gdańska

wyświetlono 189 razy

Publikacje, które mogą cię zainteresować

Vehicle Type Recognition Based on Audio Data

D. Kobiela,
M. Hajdasz,
M. Erezman
+ 4 autorów

2025

Analysis of 2D Feature Spaces for Deep Learning-based Speech Recognition

G. Korvel,
P. Treigys,
G. Tamulevicus
+ 2 autorów

2018

Data augmentation for improving deep learning in image classification problem

2018

Categorization of emotions in dog behavior based on the deep neural network

2022

Meta Tagi

Musical Instrument Tagging Using Data Augmentation and Effective Noisy Data Processing

Abstrakt

Cytowania

Autorzy (2)

Damian Koszewski mgr inż.

Bożena Kostek prof. dr hab. inż.

Cytuj jako

Pełna treść

Słowa kluczowe

Informacje szczegółowe

Publikacje, które mogą cię zainteresować

Vehicle Type Recognition Based on Audio Data

Analysis of 2D Feature Spaces for Deep Learning-based Speech Recognition

Data augmentation for improving deep learning in image classification problem

Categorization of emotions in dog behavior based on the deep neural network

Wyszukiwarka

Musical Instrument Tagging Using Data Augmentation and Effective Noisy Data Processing

Abstrakt

Cytowania

Autorzy (2)

Damian Koszewski mgr inż.

Bożena Kostek prof. dr hab. inż.

Cytuj jako

Pełna treść

Słowa kluczowe

Informacje szczegółowe

Publikacje, które mogą cię zainteresować

Vehicle Type Recognition Based on Audio Data

Analysis of 2D Feature Spaces for Deep Learning-based Speech Recognition

Data augmentation for improving deep learning in image classification problem

Categorization of emotions in dog behavior based on the deep neural network