Automatic Call Sign Detection: Matching Air Surveillance Data with Air Traffic Spoken Communications
Voice communication is the main channel to exchange information between pilots and Air-Traffic Controllers (ATCos). Recently, several projects have explored the employment of speech recognition technology to automatically extract spoken key information such as call signs, commands, and values, which...
Main Authors: | , , , , , , , , , , , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
MDPI AG
2020-12-01
|
Series: | Proceedings |
Subjects: | |
Online Access: | https://www.mdpi.com/2504-3900/59/1/14 |
id |
doaj-8eb322b1dcca45e094efe574283a39af |
---|---|
record_format |
Article |
spelling |
doaj-8eb322b1dcca45e094efe574283a39af2020-12-04T00:03:14ZengMDPI AGProceedings2504-39002020-12-0159141410.3390/proceedings2020059014Automatic Call Sign Detection: Matching Air Surveillance Data with Air Traffic Spoken CommunicationsJuan Zuluaga-Gomez0Karel Veselý1Alexander Blatt2Petr Motlicek3Dietrich Klakow4Allan Tart5Igor Szöke6Amrutha Prasad7Saeed Sarfjoo8Pavel Kolčárek9Martin Kocour10Honza Černocký11Claudia Cevenini12Khalid Choukri13Mickael Rigault14Fabian Landis15Idiap Research Institute, 1920 Martigny, SwitzerlandSpeech@FIT and IT4I Center of Excellence, Brno University of Technology, 60190 Brno, CzechiaDepartment of Language Science and Technology, Saarland University, 66123 Saarbruecken, GermanyIdiap Research Institute, 1920 Martigny, SwitzerlandDepartment of Language Science and Technology, Saarland University, 66123 Saarbruecken, GermanyOpenSky Network, 3400 Burgdorf, SwitzerlandReplayWell, 61600 Brno, Czech RepublicIdiap Research Institute, 1920 Martigny, SwitzerlandIdiap Research Institute, 1920 Martigny, SwitzerlandHoneywell, 62700 Brno, Czech RepublicSpeech@FIT and IT4I Center of Excellence, Brno University of Technology, 60190 Brno, CzechiaSpeech@FIT and IT4I Center of Excellence, Brno University of Technology, 60190 Brno, CzechiaRomagna Tech, 47121 Forli, ItalyEvaluations and Language Resources Distribution Agency (ELDA), 75013 Paris, FranceEvaluations and Language Resources Distribution Agency (ELDA), 75013 Paris, FranceOpenSky Network, 3400 Burgdorf, SwitzerlandVoice communication is the main channel to exchange information between pilots and Air-Traffic Controllers (ATCos). Recently, several projects have explored the employment of speech recognition technology to automatically extract spoken key information such as call signs, commands, and values, which can be used to reduce ATCos’ workload and increase performance and safety in Air-Traffic Control (ATC)-related activities. Nevertheless, the collection of ATC speech data is very demanding, expensive, and limited to the intrinsic speakers’ characteristics. As a solution, this paper presents ATCO<inline-formula><math display="inline"><semantics><msup><mrow></mrow><mn>2</mn></msup></semantics></math></inline-formula>, a project that aims to develop a unique platform to collect, organize, and pre-process ATC data collected from air space. Initially, the data are gathered directly through publicly accessible radio frequency channels with VHF receivers and LiveATC, which can be considered as an “unlimited-source” of low-quality data. The ATCO<inline-formula><math display="inline"><semantics><msup><mrow></mrow><mn>2</mn></msup></semantics></math></inline-formula> project explores employing context information such as radar and air surveillance data (collected with ADS-B and Mode S) from the OpenSky Network (OSN) to correlate call signs automatically extracted from voice communication with those available from ADS-B channels, to eventually increase the overall call sign detection rates. More specifically, the timestamp and location of the spoken command (issued by the ATCo by voice) are extracted, and a query is sent to the OSN server to retrieve the call sign tags in ICAO format for the airplanes corresponding to the given area. Then, a word sequence provided by an automatic speech recognition system is fed into a Natural Language Processing (NLP) based module together with the set of call signs available from the ADS-B channels. The NLP module extracts the call sign, command, and command arguments from the spoken utterance.https://www.mdpi.com/2504-3900/59/1/14air traffic controlair surveillance dataautomatic speech recognitioncall sign detectionOpenSky Networknamed entity recognition |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Juan Zuluaga-Gomez Karel Veselý Alexander Blatt Petr Motlicek Dietrich Klakow Allan Tart Igor Szöke Amrutha Prasad Saeed Sarfjoo Pavel Kolčárek Martin Kocour Honza Černocký Claudia Cevenini Khalid Choukri Mickael Rigault Fabian Landis |
spellingShingle |
Juan Zuluaga-Gomez Karel Veselý Alexander Blatt Petr Motlicek Dietrich Klakow Allan Tart Igor Szöke Amrutha Prasad Saeed Sarfjoo Pavel Kolčárek Martin Kocour Honza Černocký Claudia Cevenini Khalid Choukri Mickael Rigault Fabian Landis Automatic Call Sign Detection: Matching Air Surveillance Data with Air Traffic Spoken Communications Proceedings air traffic control air surveillance data automatic speech recognition call sign detection OpenSky Network named entity recognition |
author_facet |
Juan Zuluaga-Gomez Karel Veselý Alexander Blatt Petr Motlicek Dietrich Klakow Allan Tart Igor Szöke Amrutha Prasad Saeed Sarfjoo Pavel Kolčárek Martin Kocour Honza Černocký Claudia Cevenini Khalid Choukri Mickael Rigault Fabian Landis |
author_sort |
Juan Zuluaga-Gomez |
title |
Automatic Call Sign Detection: Matching Air Surveillance Data with Air Traffic Spoken Communications |
title_short |
Automatic Call Sign Detection: Matching Air Surveillance Data with Air Traffic Spoken Communications |
title_full |
Automatic Call Sign Detection: Matching Air Surveillance Data with Air Traffic Spoken Communications |
title_fullStr |
Automatic Call Sign Detection: Matching Air Surveillance Data with Air Traffic Spoken Communications |
title_full_unstemmed |
Automatic Call Sign Detection: Matching Air Surveillance Data with Air Traffic Spoken Communications |
title_sort |
automatic call sign detection: matching air surveillance data with air traffic spoken communications |
publisher |
MDPI AG |
series |
Proceedings |
issn |
2504-3900 |
publishDate |
2020-12-01 |
description |
Voice communication is the main channel to exchange information between pilots and Air-Traffic Controllers (ATCos). Recently, several projects have explored the employment of speech recognition technology to automatically extract spoken key information such as call signs, commands, and values, which can be used to reduce ATCos’ workload and increase performance and safety in Air-Traffic Control (ATC)-related activities. Nevertheless, the collection of ATC speech data is very demanding, expensive, and limited to the intrinsic speakers’ characteristics. As a solution, this paper presents ATCO<inline-formula><math display="inline"><semantics><msup><mrow></mrow><mn>2</mn></msup></semantics></math></inline-formula>, a project that aims to develop a unique platform to collect, organize, and pre-process ATC data collected from air space. Initially, the data are gathered directly through publicly accessible radio frequency channels with VHF receivers and LiveATC, which can be considered as an “unlimited-source” of low-quality data. The ATCO<inline-formula><math display="inline"><semantics><msup><mrow></mrow><mn>2</mn></msup></semantics></math></inline-formula> project explores employing context information such as radar and air surveillance data (collected with ADS-B and Mode S) from the OpenSky Network (OSN) to correlate call signs automatically extracted from voice communication with those available from ADS-B channels, to eventually increase the overall call sign detection rates. More specifically, the timestamp and location of the spoken command (issued by the ATCo by voice) are extracted, and a query is sent to the OSN server to retrieve the call sign tags in ICAO format for the airplanes corresponding to the given area. Then, a word sequence provided by an automatic speech recognition system is fed into a Natural Language Processing (NLP) based module together with the set of call signs available from the ADS-B channels. The NLP module extracts the call sign, command, and command arguments from the spoken utterance. |
topic |
air traffic control air surveillance data automatic speech recognition call sign detection OpenSky Network named entity recognition |
url |
https://www.mdpi.com/2504-3900/59/1/14 |
work_keys_str_mv |
AT juanzuluagagomez automaticcallsigndetectionmatchingairsurveillancedatawithairtrafficspokencommunications AT karelvesely automaticcallsigndetectionmatchingairsurveillancedatawithairtrafficspokencommunications AT alexanderblatt automaticcallsigndetectionmatchingairsurveillancedatawithairtrafficspokencommunications AT petrmotlicek automaticcallsigndetectionmatchingairsurveillancedatawithairtrafficspokencommunications AT dietrichklakow automaticcallsigndetectionmatchingairsurveillancedatawithairtrafficspokencommunications AT allantart automaticcallsigndetectionmatchingairsurveillancedatawithairtrafficspokencommunications AT igorszoke automaticcallsigndetectionmatchingairsurveillancedatawithairtrafficspokencommunications AT amruthaprasad automaticcallsigndetectionmatchingairsurveillancedatawithairtrafficspokencommunications AT saeedsarfjoo automaticcallsigndetectionmatchingairsurveillancedatawithairtrafficspokencommunications AT pavelkolcarek automaticcallsigndetectionmatchingairsurveillancedatawithairtrafficspokencommunications AT martinkocour automaticcallsigndetectionmatchingairsurveillancedatawithairtrafficspokencommunications AT honzacernocky automaticcallsigndetectionmatchingairsurveillancedatawithairtrafficspokencommunications AT claudiacevenini automaticcallsigndetectionmatchingairsurveillancedatawithairtrafficspokencommunications AT khalidchoukri automaticcallsigndetectionmatchingairsurveillancedatawithairtrafficspokencommunications AT mickaelrigault automaticcallsigndetectionmatchingairsurveillancedatawithairtrafficspokencommunications AT fabianlandis automaticcallsigndetectionmatchingairsurveillancedatawithairtrafficspokencommunications |
_version_ |
1724400949865218048 |