Automatic bird song and syllable segmentation with an open-source deep-learning object detection method – a case study in the Collared Flycatcher (Ficedula albicollis)

The bioacoustic analyses of animal sounds result in an enormous amount of digitized acoustic data, and we need effective automatic processing to extract the information content of the recordings. Our research focuses on the song of Collared Flycatcher (Ficedula albicollis) and we are interested in t...

Full description

Bibliographic Details
Main Authors: Zsebők Sándor, Nagy-Egri Máté Ferenc, Barnaföldi Gergely Gábor, Laczi Miklós, Nagy Gergely, Vaskuti Éva, Garamszegi László Zsolt
Format: Article
Language:English
Published: Sciendo 2019-12-01
Series:Ornis Hungarica
Subjects:
Online Access:https://doi.org/10.2478/orhu-2019-0015
id doaj-c6944b12ff2c477ea12bcf4bde822f53
record_format Article
spelling doaj-c6944b12ff2c477ea12bcf4bde822f532021-09-05T18:16:04ZengSciendoOrnis Hungarica2061-95882019-12-01272596610.2478/orhu-2019-0015orhu-2019-0015Automatic bird song and syllable segmentation with an open-source deep-learning object detection method – a case study in the Collared Flycatcher (Ficedula albicollis)Zsebők Sándor0Nagy-Egri Máté Ferenc1Barnaföldi Gergely Gábor2Laczi Miklós3Nagy Gergely4Vaskuti Éva5Garamszegi László Zsolt6Behavioural Ecology Group, Department of Systematic Zoology and Ecology, Eötvös Loránd University, 1117Budapest, Pázmány Péter sétány 1/C, HungaryWigner Research Centre for Physics, 1121, Budapest, Konkoly-Thege Miklós út 29-33. HungaryWigner Research Centre for Physics, 1121, Budapest, Konkoly-Thege Miklós út 29-33. HungaryBehavioural Ecology Group, Department of Systematic Zoology and Ecology, Eötvös Loránd University, 1117Budapest, Pázmány Péter sétány 1/C, HungaryBehavioural Ecology Group, Department of Systematic Zoology and Ecology, Eötvös Loránd University, 1117Budapest, Pázmány Péter sétány 1/C, HungaryBehavioural Ecology Group, Department of Systematic Zoology and Ecology, Eötvös Loránd University, 1117Budapest, Pázmány Péter sétány 1/C, HungaryBehavioural Ecology Group, Department of Systematic Zoology and Ecology, Eötvös Loránd University, 1117Budapest, Pázmány Péter sétány 1/C, HungaryThe bioacoustic analyses of animal sounds result in an enormous amount of digitized acoustic data, and we need effective automatic processing to extract the information content of the recordings. Our research focuses on the song of Collared Flycatcher (Ficedula albicollis) and we are interested in the evolution of acoustic signals. During the last 20 years, we obtained hundreds of hours of recordings of bird songs collected in natural environment, and there is a permanent need for the automatic process of recordings. In this study, we chose an open-source, deep-learning image detection system to (1) find the species-specific songs of the Collared Flycatcher on the recordings and (2) to detect the small, discrete elements so-called syllables within the song. For these tasks, we first transformed the acoustic data into spectrogram images, then we trained two deep-learning models separately on our manually segmented database. The resulted models detect the songs with an intersection of union higher than 0.8 and the syllables higher than 0.7. This technique anticipates an order of magnitude less human effort in the acoustic processing than the manual method used before. Thanks to the new technique, we are able to address new biological questions that need large amount of acoustic data.https://doi.org/10.2478/orhu-2019-0015bird songdeep-learningobject detectioncollared flycatcherautomatic segmentation
collection DOAJ
language English
format Article
sources DOAJ
author Zsebők Sándor
Nagy-Egri Máté Ferenc
Barnaföldi Gergely Gábor
Laczi Miklós
Nagy Gergely
Vaskuti Éva
Garamszegi László Zsolt
spellingShingle Zsebők Sándor
Nagy-Egri Máté Ferenc
Barnaföldi Gergely Gábor
Laczi Miklós
Nagy Gergely
Vaskuti Éva
Garamszegi László Zsolt
Automatic bird song and syllable segmentation with an open-source deep-learning object detection method – a case study in the Collared Flycatcher (Ficedula albicollis)
Ornis Hungarica
bird song
deep-learning
object detection
collared flycatcher
automatic segmentation
author_facet Zsebők Sándor
Nagy-Egri Máté Ferenc
Barnaföldi Gergely Gábor
Laczi Miklós
Nagy Gergely
Vaskuti Éva
Garamszegi László Zsolt
author_sort Zsebők Sándor
title Automatic bird song and syllable segmentation with an open-source deep-learning object detection method – a case study in the Collared Flycatcher (Ficedula albicollis)
title_short Automatic bird song and syllable segmentation with an open-source deep-learning object detection method – a case study in the Collared Flycatcher (Ficedula albicollis)
title_full Automatic bird song and syllable segmentation with an open-source deep-learning object detection method – a case study in the Collared Flycatcher (Ficedula albicollis)
title_fullStr Automatic bird song and syllable segmentation with an open-source deep-learning object detection method – a case study in the Collared Flycatcher (Ficedula albicollis)
title_full_unstemmed Automatic bird song and syllable segmentation with an open-source deep-learning object detection method – a case study in the Collared Flycatcher (Ficedula albicollis)
title_sort automatic bird song and syllable segmentation with an open-source deep-learning object detection method – a case study in the collared flycatcher (ficedula albicollis)
publisher Sciendo
series Ornis Hungarica
issn 2061-9588
publishDate 2019-12-01
description The bioacoustic analyses of animal sounds result in an enormous amount of digitized acoustic data, and we need effective automatic processing to extract the information content of the recordings. Our research focuses on the song of Collared Flycatcher (Ficedula albicollis) and we are interested in the evolution of acoustic signals. During the last 20 years, we obtained hundreds of hours of recordings of bird songs collected in natural environment, and there is a permanent need for the automatic process of recordings. In this study, we chose an open-source, deep-learning image detection system to (1) find the species-specific songs of the Collared Flycatcher on the recordings and (2) to detect the small, discrete elements so-called syllables within the song. For these tasks, we first transformed the acoustic data into spectrogram images, then we trained two deep-learning models separately on our manually segmented database. The resulted models detect the songs with an intersection of union higher than 0.8 and the syllables higher than 0.7. This technique anticipates an order of magnitude less human effort in the acoustic processing than the manual method used before. Thanks to the new technique, we are able to address new biological questions that need large amount of acoustic data.
topic bird song
deep-learning
object detection
collared flycatcher
automatic segmentation
url https://doi.org/10.2478/orhu-2019-0015
work_keys_str_mv AT zseboksandor automaticbirdsongandsyllablesegmentationwithanopensourcedeeplearningobjectdetectionmethodacasestudyinthecollaredflycatcherficedulaalbicollis
AT nagyegrimateferenc automaticbirdsongandsyllablesegmentationwithanopensourcedeeplearningobjectdetectionmethodacasestudyinthecollaredflycatcherficedulaalbicollis
AT barnafoldigergelygabor automaticbirdsongandsyllablesegmentationwithanopensourcedeeplearningobjectdetectionmethodacasestudyinthecollaredflycatcherficedulaalbicollis
AT laczimiklos automaticbirdsongandsyllablesegmentationwithanopensourcedeeplearningobjectdetectionmethodacasestudyinthecollaredflycatcherficedulaalbicollis
AT nagygergely automaticbirdsongandsyllablesegmentationwithanopensourcedeeplearningobjectdetectionmethodacasestudyinthecollaredflycatcherficedulaalbicollis
AT vaskutieva automaticbirdsongandsyllablesegmentationwithanopensourcedeeplearningobjectdetectionmethodacasestudyinthecollaredflycatcherficedulaalbicollis
AT garamszegilaszlozsolt automaticbirdsongandsyllablesegmentationwithanopensourcedeeplearningobjectdetectionmethodacasestudyinthecollaredflycatcherficedulaalbicollis
_version_ 1717786193424285696