Automatic Call Sign Detection: Matching Air Surveillance Data with Air Traffic Spoken Communications

Voice communication is the main channel to exchange information between pilots and Air-Traffic Controllers (ATCos). Recently, several projects have explored the employment of speech recognition technology to automatically extract spoken key information such as call signs, commands, and values, which...

Full description

Bibliographic Details
Main Authors: Juan Zuluaga-Gomez, Karel Veselý, Alexander Blatt, Petr Motlicek, Dietrich Klakow, Allan Tart, Igor Szöke, Amrutha Prasad, Saeed Sarfjoo, Pavel Kolčárek, Martin Kocour, Honza Černocký, Claudia Cevenini, Khalid Choukri, Mickael Rigault, Fabian Landis
Format: Article
Language:English
Published: MDPI AG 2020-12-01
Series:Proceedings
Subjects:
Online Access:https://www.mdpi.com/2504-3900/59/1/14
id doaj-8eb322b1dcca45e094efe574283a39af
record_format Article
spelling doaj-8eb322b1dcca45e094efe574283a39af2020-12-04T00:03:14ZengMDPI AGProceedings2504-39002020-12-0159141410.3390/proceedings2020059014Automatic Call Sign Detection: Matching Air Surveillance Data with Air Traffic Spoken CommunicationsJuan Zuluaga-Gomez0Karel Veselý1Alexander Blatt2Petr Motlicek3Dietrich Klakow4Allan Tart5Igor Szöke6Amrutha Prasad7Saeed Sarfjoo8Pavel Kolčárek9Martin Kocour10Honza Černocký11Claudia Cevenini12Khalid Choukri13Mickael Rigault14Fabian Landis15Idiap Research Institute, 1920 Martigny, SwitzerlandSpeech@FIT and IT4I Center of Excellence, Brno University of Technology, 60190 Brno, CzechiaDepartment of Language Science and Technology, Saarland University, 66123 Saarbruecken, GermanyIdiap Research Institute, 1920 Martigny, SwitzerlandDepartment of Language Science and Technology, Saarland University, 66123 Saarbruecken, GermanyOpenSky Network, 3400 Burgdorf, SwitzerlandReplayWell, 61600 Brno, Czech RepublicIdiap Research Institute, 1920 Martigny, SwitzerlandIdiap Research Institute, 1920 Martigny, SwitzerlandHoneywell, 62700 Brno, Czech RepublicSpeech@FIT and IT4I Center of Excellence, Brno University of Technology, 60190 Brno, CzechiaSpeech@FIT and IT4I Center of Excellence, Brno University of Technology, 60190 Brno, CzechiaRomagna Tech, 47121 Forli, ItalyEvaluations and Language Resources Distribution Agency (ELDA), 75013 Paris, FranceEvaluations and Language Resources Distribution Agency (ELDA), 75013 Paris, FranceOpenSky Network, 3400 Burgdorf, SwitzerlandVoice communication is the main channel to exchange information between pilots and Air-Traffic Controllers (ATCos). Recently, several projects have explored the employment of speech recognition technology to automatically extract spoken key information such as call signs, commands, and values, which can be used to reduce ATCos’ workload and increase performance and safety in Air-Traffic Control (ATC)-related activities. Nevertheless, the collection of ATC speech data is very demanding, expensive, and limited to the intrinsic speakers’ characteristics. As a solution, this paper presents ATCO<inline-formula><math display="inline"><semantics><msup><mrow></mrow><mn>2</mn></msup></semantics></math></inline-formula>, a project that aims to develop a unique platform to collect, organize, and pre-process ATC data collected from air space. Initially, the data are gathered directly through publicly accessible radio frequency channels with VHF receivers and LiveATC, which can be considered as an “unlimited-source” of low-quality data. The ATCO<inline-formula><math display="inline"><semantics><msup><mrow></mrow><mn>2</mn></msup></semantics></math></inline-formula> project explores employing context information such as radar and air surveillance data (collected with ADS-B and Mode S) from the OpenSky Network (OSN) to correlate call signs automatically extracted from voice communication with those available from ADS-B channels, to eventually increase the overall call sign detection rates. More specifically, the timestamp and location of the spoken command (issued by the ATCo by voice) are extracted, and a query is sent to the OSN server to retrieve the call sign tags in ICAO format for the airplanes corresponding to the given area. Then, a word sequence provided by an automatic speech recognition system is fed into a Natural Language Processing (NLP) based module together with the set of call signs available from the ADS-B channels. The NLP module extracts the call sign, command, and command arguments from the spoken utterance.https://www.mdpi.com/2504-3900/59/1/14air traffic controlair surveillance dataautomatic speech recognitioncall sign detectionOpenSky Networknamed entity recognition
collection DOAJ
language English
format Article
sources DOAJ
author Juan Zuluaga-Gomez
Karel Veselý
Alexander Blatt
Petr Motlicek
Dietrich Klakow
Allan Tart
Igor Szöke
Amrutha Prasad
Saeed Sarfjoo
Pavel Kolčárek
Martin Kocour
Honza Černocký
Claudia Cevenini
Khalid Choukri
Mickael Rigault
Fabian Landis
spellingShingle Juan Zuluaga-Gomez
Karel Veselý
Alexander Blatt
Petr Motlicek
Dietrich Klakow
Allan Tart
Igor Szöke
Amrutha Prasad
Saeed Sarfjoo
Pavel Kolčárek
Martin Kocour
Honza Černocký
Claudia Cevenini
Khalid Choukri
Mickael Rigault
Fabian Landis
Automatic Call Sign Detection: Matching Air Surveillance Data with Air Traffic Spoken Communications
Proceedings
air traffic control
air surveillance data
automatic speech recognition
call sign detection
OpenSky Network
named entity recognition
author_facet Juan Zuluaga-Gomez
Karel Veselý
Alexander Blatt
Petr Motlicek
Dietrich Klakow
Allan Tart
Igor Szöke
Amrutha Prasad
Saeed Sarfjoo
Pavel Kolčárek
Martin Kocour
Honza Černocký
Claudia Cevenini
Khalid Choukri
Mickael Rigault
Fabian Landis
author_sort Juan Zuluaga-Gomez
title Automatic Call Sign Detection: Matching Air Surveillance Data with Air Traffic Spoken Communications
title_short Automatic Call Sign Detection: Matching Air Surveillance Data with Air Traffic Spoken Communications
title_full Automatic Call Sign Detection: Matching Air Surveillance Data with Air Traffic Spoken Communications
title_fullStr Automatic Call Sign Detection: Matching Air Surveillance Data with Air Traffic Spoken Communications
title_full_unstemmed Automatic Call Sign Detection: Matching Air Surveillance Data with Air Traffic Spoken Communications
title_sort automatic call sign detection: matching air surveillance data with air traffic spoken communications
publisher MDPI AG
series Proceedings
issn 2504-3900
publishDate 2020-12-01
description Voice communication is the main channel to exchange information between pilots and Air-Traffic Controllers (ATCos). Recently, several projects have explored the employment of speech recognition technology to automatically extract spoken key information such as call signs, commands, and values, which can be used to reduce ATCos’ workload and increase performance and safety in Air-Traffic Control (ATC)-related activities. Nevertheless, the collection of ATC speech data is very demanding, expensive, and limited to the intrinsic speakers’ characteristics. As a solution, this paper presents ATCO<inline-formula><math display="inline"><semantics><msup><mrow></mrow><mn>2</mn></msup></semantics></math></inline-formula>, a project that aims to develop a unique platform to collect, organize, and pre-process ATC data collected from air space. Initially, the data are gathered directly through publicly accessible radio frequency channels with VHF receivers and LiveATC, which can be considered as an “unlimited-source” of low-quality data. The ATCO<inline-formula><math display="inline"><semantics><msup><mrow></mrow><mn>2</mn></msup></semantics></math></inline-formula> project explores employing context information such as radar and air surveillance data (collected with ADS-B and Mode S) from the OpenSky Network (OSN) to correlate call signs automatically extracted from voice communication with those available from ADS-B channels, to eventually increase the overall call sign detection rates. More specifically, the timestamp and location of the spoken command (issued by the ATCo by voice) are extracted, and a query is sent to the OSN server to retrieve the call sign tags in ICAO format for the airplanes corresponding to the given area. Then, a word sequence provided by an automatic speech recognition system is fed into a Natural Language Processing (NLP) based module together with the set of call signs available from the ADS-B channels. The NLP module extracts the call sign, command, and command arguments from the spoken utterance.
topic air traffic control
air surveillance data
automatic speech recognition
call sign detection
OpenSky Network
named entity recognition
url https://www.mdpi.com/2504-3900/59/1/14
work_keys_str_mv AT juanzuluagagomez automaticcallsigndetectionmatchingairsurveillancedatawithairtrafficspokencommunications
AT karelvesely automaticcallsigndetectionmatchingairsurveillancedatawithairtrafficspokencommunications
AT alexanderblatt automaticcallsigndetectionmatchingairsurveillancedatawithairtrafficspokencommunications
AT petrmotlicek automaticcallsigndetectionmatchingairsurveillancedatawithairtrafficspokencommunications
AT dietrichklakow automaticcallsigndetectionmatchingairsurveillancedatawithairtrafficspokencommunications
AT allantart automaticcallsigndetectionmatchingairsurveillancedatawithairtrafficspokencommunications
AT igorszoke automaticcallsigndetectionmatchingairsurveillancedatawithairtrafficspokencommunications
AT amruthaprasad automaticcallsigndetectionmatchingairsurveillancedatawithairtrafficspokencommunications
AT saeedsarfjoo automaticcallsigndetectionmatchingairsurveillancedatawithairtrafficspokencommunications
AT pavelkolcarek automaticcallsigndetectionmatchingairsurveillancedatawithairtrafficspokencommunications
AT martinkocour automaticcallsigndetectionmatchingairsurveillancedatawithairtrafficspokencommunications
AT honzacernocky automaticcallsigndetectionmatchingairsurveillancedatawithairtrafficspokencommunications
AT claudiacevenini automaticcallsigndetectionmatchingairsurveillancedatawithairtrafficspokencommunications
AT khalidchoukri automaticcallsigndetectionmatchingairsurveillancedatawithairtrafficspokencommunications
AT mickaelrigault automaticcallsigndetectionmatchingairsurveillancedatawithairtrafficspokencommunications
AT fabianlandis automaticcallsigndetectionmatchingairsurveillancedatawithairtrafficspokencommunications
_version_ 1724400949865218048