Automatic bird song and syllable segmentation with an open-source deep-learning object detection method – a case study in the Collared Flycatcher (Ficedula albicollis)
The bioacoustic analyses of animal sounds result in an enormous amount of digitized acoustic data, and we need effective automatic processing to extract the information content of the recordings. Our research focuses on the song of Collared Flycatcher (Ficedula albicollis) and we are interested in t...
Main Authors: | , , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Sciendo
2019-12-01
|
Series: | Ornis Hungarica |
Subjects: | |
Online Access: | https://doi.org/10.2478/orhu-2019-0015 |
id |
doaj-c6944b12ff2c477ea12bcf4bde822f53 |
---|---|
record_format |
Article |
spelling |
doaj-c6944b12ff2c477ea12bcf4bde822f532021-09-05T18:16:04ZengSciendoOrnis Hungarica2061-95882019-12-01272596610.2478/orhu-2019-0015orhu-2019-0015Automatic bird song and syllable segmentation with an open-source deep-learning object detection method – a case study in the Collared Flycatcher (Ficedula albicollis)Zsebők Sándor0Nagy-Egri Máté Ferenc1Barnaföldi Gergely Gábor2Laczi Miklós3Nagy Gergely4Vaskuti Éva5Garamszegi László Zsolt6Behavioural Ecology Group, Department of Systematic Zoology and Ecology, Eötvös Loránd University, 1117Budapest, Pázmány Péter sétány 1/C, HungaryWigner Research Centre for Physics, 1121, Budapest, Konkoly-Thege Miklós út 29-33. HungaryWigner Research Centre for Physics, 1121, Budapest, Konkoly-Thege Miklós út 29-33. HungaryBehavioural Ecology Group, Department of Systematic Zoology and Ecology, Eötvös Loránd University, 1117Budapest, Pázmány Péter sétány 1/C, HungaryBehavioural Ecology Group, Department of Systematic Zoology and Ecology, Eötvös Loránd University, 1117Budapest, Pázmány Péter sétány 1/C, HungaryBehavioural Ecology Group, Department of Systematic Zoology and Ecology, Eötvös Loránd University, 1117Budapest, Pázmány Péter sétány 1/C, HungaryBehavioural Ecology Group, Department of Systematic Zoology and Ecology, Eötvös Loránd University, 1117Budapest, Pázmány Péter sétány 1/C, HungaryThe bioacoustic analyses of animal sounds result in an enormous amount of digitized acoustic data, and we need effective automatic processing to extract the information content of the recordings. Our research focuses on the song of Collared Flycatcher (Ficedula albicollis) and we are interested in the evolution of acoustic signals. During the last 20 years, we obtained hundreds of hours of recordings of bird songs collected in natural environment, and there is a permanent need for the automatic process of recordings. In this study, we chose an open-source, deep-learning image detection system to (1) find the species-specific songs of the Collared Flycatcher on the recordings and (2) to detect the small, discrete elements so-called syllables within the song. For these tasks, we first transformed the acoustic data into spectrogram images, then we trained two deep-learning models separately on our manually segmented database. The resulted models detect the songs with an intersection of union higher than 0.8 and the syllables higher than 0.7. This technique anticipates an order of magnitude less human effort in the acoustic processing than the manual method used before. Thanks to the new technique, we are able to address new biological questions that need large amount of acoustic data.https://doi.org/10.2478/orhu-2019-0015bird songdeep-learningobject detectioncollared flycatcherautomatic segmentation |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Zsebők Sándor Nagy-Egri Máté Ferenc Barnaföldi Gergely Gábor Laczi Miklós Nagy Gergely Vaskuti Éva Garamszegi László Zsolt |
spellingShingle |
Zsebők Sándor Nagy-Egri Máté Ferenc Barnaföldi Gergely Gábor Laczi Miklós Nagy Gergely Vaskuti Éva Garamszegi László Zsolt Automatic bird song and syllable segmentation with an open-source deep-learning object detection method – a case study in the Collared Flycatcher (Ficedula albicollis) Ornis Hungarica bird song deep-learning object detection collared flycatcher automatic segmentation |
author_facet |
Zsebők Sándor Nagy-Egri Máté Ferenc Barnaföldi Gergely Gábor Laczi Miklós Nagy Gergely Vaskuti Éva Garamszegi László Zsolt |
author_sort |
Zsebők Sándor |
title |
Automatic bird song and syllable segmentation with an open-source deep-learning object detection method – a case study in the Collared Flycatcher (Ficedula albicollis) |
title_short |
Automatic bird song and syllable segmentation with an open-source deep-learning object detection method – a case study in the Collared Flycatcher (Ficedula albicollis) |
title_full |
Automatic bird song and syllable segmentation with an open-source deep-learning object detection method – a case study in the Collared Flycatcher (Ficedula albicollis) |
title_fullStr |
Automatic bird song and syllable segmentation with an open-source deep-learning object detection method – a case study in the Collared Flycatcher (Ficedula albicollis) |
title_full_unstemmed |
Automatic bird song and syllable segmentation with an open-source deep-learning object detection method – a case study in the Collared Flycatcher (Ficedula albicollis) |
title_sort |
automatic bird song and syllable segmentation with an open-source deep-learning object detection method – a case study in the collared flycatcher (ficedula albicollis) |
publisher |
Sciendo |
series |
Ornis Hungarica |
issn |
2061-9588 |
publishDate |
2019-12-01 |
description |
The bioacoustic analyses of animal sounds result in an enormous amount of digitized acoustic data, and we need effective automatic processing to extract the information content of the recordings. Our research focuses on the song of Collared Flycatcher (Ficedula albicollis) and we are interested in the evolution of acoustic signals. During the last 20 years, we obtained hundreds of hours of recordings of bird songs collected in natural environment, and there is a permanent need for the automatic process of recordings. In this study, we chose an open-source, deep-learning image detection system to (1) find the species-specific songs of the Collared Flycatcher on the recordings and (2) to detect the small, discrete elements so-called syllables within the song. For these tasks, we first transformed the acoustic data into spectrogram images, then we trained two deep-learning models separately on our manually segmented database. The resulted models detect the songs with an intersection of union higher than 0.8 and the syllables higher than 0.7. This technique anticipates an order of magnitude less human effort in the acoustic processing than the manual method used before. Thanks to the new technique, we are able to address new biological questions that need large amount of acoustic data. |
topic |
bird song deep-learning object detection collared flycatcher automatic segmentation |
url |
https://doi.org/10.2478/orhu-2019-0015 |
work_keys_str_mv |
AT zseboksandor automaticbirdsongandsyllablesegmentationwithanopensourcedeeplearningobjectdetectionmethodacasestudyinthecollaredflycatcherficedulaalbicollis AT nagyegrimateferenc automaticbirdsongandsyllablesegmentationwithanopensourcedeeplearningobjectdetectionmethodacasestudyinthecollaredflycatcherficedulaalbicollis AT barnafoldigergelygabor automaticbirdsongandsyllablesegmentationwithanopensourcedeeplearningobjectdetectionmethodacasestudyinthecollaredflycatcherficedulaalbicollis AT laczimiklos automaticbirdsongandsyllablesegmentationwithanopensourcedeeplearningobjectdetectionmethodacasestudyinthecollaredflycatcherficedulaalbicollis AT nagygergely automaticbirdsongandsyllablesegmentationwithanopensourcedeeplearningobjectdetectionmethodacasestudyinthecollaredflycatcherficedulaalbicollis AT vaskutieva automaticbirdsongandsyllablesegmentationwithanopensourcedeeplearningobjectdetectionmethodacasestudyinthecollaredflycatcherficedulaalbicollis AT garamszegilaszlozsolt automaticbirdsongandsyllablesegmentationwithanopensourcedeeplearningobjectdetectionmethodacasestudyinthecollaredflycatcherficedulaalbicollis |
_version_ |
1717786193424285696 |