Bioacoustic classification of a small dataset of mammalian vocalisations using deep learning

Manriquez P, R.; Kotz, S. A.; Ravignani, A.; de Boer, B.

doi:10.1080/09524622.2024.2354468

In the last few years, automatic extraction and classification of animal vocalisations has been facilitated by machine learning (ML) and deep learning (DL) methods. Different frameworks allowed researchers to automatically extract features and perform classification tasks, aiding in call identification and species recognition. However, the success of these applications relies on the amount of available data to train these algorithms. The lack of sufficient data can also lead to overfitting and affect generalisation (i.e. poor performance on out-of-sample data). Further, acquiring large data sets is costly and annotating them is time consuming. Thus, how small can a dataset be to still provide useful information by means of ML or DL? Here, we show how convolutional neural network architectures can handle small datasets in a bioacoustic classification task of affective mammalian vocalisations. We explain how these techniques can be used (e.g. pre-training and data augmentation), and emphasise how to implement them in concordance with features of bioacoustic signals. We further discuss whether these networks can generalise the affective quality of vocalisations across different taxa.

Bioacoustic classification of a small dataset of mammalian vocalisations using deep learning / Manriquez P, R.; Kotz, S. A.; Ravignani, A.; de Boer, B.. - In: BIOACOUSTICS. - ISSN 0952-4622. - 33:4(2024), pp. 354-371. [10.1080/09524622.2024.2354468]

Bioacoustic classification of a small dataset of mammalian vocalisations using deep learning

Manriquez P R.;Kotz S. A.;Ravignani A.;de Boer B.

2024

Abstract

In the last few years, automatic extraction and classification of animal vocalisations has been facilitated by machine learning (ML) and deep learning (DL) methods. Different frameworks allowed researchers to automatically extract features and perform classification tasks, aiding in call identification and species recognition. However, the success of these applications relies on the amount of available data to train these algorithms. The lack of sufficient data can also lead to overfitting and affect generalisation (i.e. poor performance on out-of-sample data). Further, acquiring large data sets is costly and annotating them is time consuming. Thus, how small can a dataset be to still provide useful information by means of ML or DL? Here, we show how convolutional neural network architectures can handle small datasets in a bioacoustic classification task of affective mammalian vocalisations. We explain how these techniques can be used (e.g. pre-training and data augmentation), and emphasise how to implement them in concordance with features of bioacoustic signals. We further discuss whether these networks can generalise the affective quality of vocalisations across different taxa.

Scheda breve

Scheda completa

	Anno di pubblicazione
	
				2024
			
	Parole chiave
	
				Artificial intelligence; machine learning; species discrimination; species recognition
			
	Tipologia
	
				01 Pubblicazione su rivista::01a Articolo in rivista
			
	Citazione
	
				Bioacoustic classification of a small dataset of mammalian vocalisations using deep learning / Manriquez P, R.; Kotz, S. A.; Ravignani, A.; de Boer, B.. - In: BIOACOUSTICS. - ISSN 0952-4622. - 33:4(2024), pp. 354-371. [10.1080/09524622.2024.2354468]
			
	Appartiene alla tipologia:
	
				01a Articolo in rivista

File allegati a questo prodotto

File	Dimensione	Formato
Manriquez_etal2024_Bioacoustic classification of a small dataset of mammalian vocalisations using deep learning.pdf accesso aperto Note: Manriquez_Bioacoustic classification_2024 Tipologia: Versione editoriale (versione pubblicata con il layout dell'editore) Licenza: Creative commons Dimensione 3.01 MB Formato Adobe PDF	3.01 MB	Adobe PDF

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11573/1727159

Citazioni

ND

1

1

Nome	Dominio	Durata	Descrizione
s_.*	plu.mx	sessione	recupero grafico citazioni sociali da plumx
A_.*	core.ac.uk	7 giorni	recupero pubblicazioni consigliate per il pannello core-recommander
GS_.*	gstatic.com	richiesta http	visualizza grafico citazioni
CC_.*	creativecommons.org	richiesta http	visualizza licenza bitstream

Catalogo dei prodotti della ricerca