Latent and Adversarial Data Augmentation for Sound Event Detection and Classification

David Perera; Slim Essid; Gaël Richard

Conference paper, Year: 2022


David Perera

Role: Author
PersonId: 1161271
IdHAL: david-perera

Télécom Paris

Département Images, Données, Signal

Signal, Statistique et Apprentissage

Slim Essid

Role: Author
PersonId: 181234
IdHAL: slimessid
ORCID: 0000-0002-0028-327X
IdRef: 11025130X

Télécom Paris

Département Images, Données, Signal

Signal, Statistique et Apprentissage

Gaël Richard

Role: Author
PersonId: 14146
IdHAL: gael-richard
IdRef: 094977208

Télécom Paris

Département Images, Données, Signal

Signal, Statistique et Apprentissage

Abstract

Invariance-based learning is a promising approach in deep learning. Among other benefits, it can mitigate the lack of diversity of available datasets and increase the interpretability of trained models. To this end, practitioners often use a consistency cost penalizing the sensitivity of a model to a set of carefully selected data augmentations. However, there is no consensus about how these augmentations should be selected. In this paper, we study the behavior of several augmentation strategies. We consider the task of sound event detection and classification for our experiments. In particular, we show that transformations operating on the internal layers of a deep neural network are beneficial for this task.
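
As an illustration of the consistency cost mentioned in the abstract, the following is a minimal sketch and not the authors' implementation. It assumes a toy two-layer model, additive Gaussian noise as the latent augmentation, and a mean squared difference as the cost; the abstract specifies none of these choices, and all function names (`encoder`, `classifier`, `latent_noise`, `consistency_cost`) are hypothetical.

```python
import math
import random


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def encoder(x, w_enc):
    """First layer: maps an input vector to a latent vector (tanh units)."""
    return [math.tanh(sum(w * xi for w, xi in zip(row, x))) for row in w_enc]


def classifier(h, w_cls):
    """Second layer: maps a latent vector to per-class probabilities."""
    return [sigmoid(sum(w * hi for w, hi in zip(row, h))) for row in w_cls]


def latent_noise(h, sigma, rng):
    """Assumed latent augmentation: additive Gaussian noise on the hidden layer."""
    return [hi + rng.gauss(0.0, sigma) for hi in h]


def consistency_cost(x, w_enc, w_cls, sigma, rng):
    """Mean squared difference between clean and latent-augmented predictions."""
    h = encoder(x, w_enc)
    p_clean = classifier(h, w_cls)
    p_aug = classifier(latent_noise(h, sigma, rng), w_cls)
    return sum((a - b) ** 2 for a, b in zip(p_clean, p_aug)) / len(p_clean)


# Toy weights and input, chosen arbitrarily for illustration.
rng = random.Random(0)
w_enc = [[rng.uniform(-1, 1) for _ in range(4)] for _ in range(3)]
w_cls = [[rng.uniform(-1, 1) for _ in range(3)] for _ in range(2)]
x = [0.2, -0.5, 0.1, 0.9]

cost_zero = consistency_cost(x, w_enc, w_cls, sigma=0.0, rng=rng)
cost_noisy = consistency_cost(x, w_enc, w_cls, sigma=0.5, rng=rng)
```

With `sigma=0.0` the augmentation is the identity, so the cost vanishes; with a nonzero `sigma` the cost measures how sensitive the predictions are to the latent perturbation. In training, such a term would typically be added to the supervised loss with a weighting factor, which is likewise an assumption here.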

Keywords

sound event detection; data augmentation; adversarial learning

Domains

Machine Learning [cs.LG]

Main file

dcase.pdf (178.96 KB)

Origin: Files produced by the author(s)


https://hal.science/hal-03782827

Submitted on: Wednesday, 21 September 2022, 15:33:18

Last modified on: Monday, 9 October 2023, 12:49:43

Long-term archiving on: Thursday, 22 December 2022, 19:15:03

Dates and versions

hal-03782827, version 1 (21-09-2022)

Identifiers

HAL Id: hal-03782827, version 1

Cite

David Perera, Slim Essid, Gaël Richard. Latent and Adversarial Data Augmentation for Sound Event Detection and Classification. International Workshop on Detection and Classification of Acoustic Scenes and Events (DCASE), Nov 2022, Nancy, France. ⟨hal-03782827⟩


Collections

INSTITUT-TELECOM LTCI IDS S2A IP_PARIS

207 views

156 downloads
