Synthetic Data for Training and Evaluation of Critical Traffic Scenarios
Modern camera-based vehicle safety systems heavily rely on machine learning and consequently require large amounts of training data to perform reliably. However, collecting and annotating the needed data is an extremely expensive and time-consuming process. In addition, it is exceptionally difficult...
Main Author: | |
---|---|
Format: | Others |
Language: | English |
Published: |
Linköpings universitet, Medie- och Informationsteknik
2021
|
Subjects: | |
Online Access: | http://urn.kb.se/resolve?urn=urn:nbn:se:liu:diva-177779 |
id |
ndltd-UPSALLA1-oai-DiVA.org-liu-177779 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-UPSALLA1-oai-DiVA.org-liu-1777792021-07-06T05:23:32ZSynthetic Data for Training and Evaluation of Critical Traffic ScenariosengCollin, SofieLinköpings universitet, Medie- och InformationsteknikLinköpings universitet, Tekniska fakulteten2021Machine LearningAutonomous DrivingSynthetic DataCritical Traffic ScenariosPedestrian DetectionImage ProcessingRoboticsRobotteknik och automationModern camera-based vehicle safety systems heavily rely on machine learning and consequently require large amounts of training data to perform reliably. However, collecting and annotating the needed data is an extremely expensive and time-consuming process. In addition, it is exceptionally difficult to collect data that covers critical scenarios. This thesis investigates to what extent synthetic data can replace real-world data for these scenarios. Since only a limited amount of data consisting of such real-world scenarios is available, this thesis instead makes use of proxy scenarios, e.g. situations when pedestrians are located closely in front of the vehicle (for example at a crosswalk). The presented approach involves training a detector on real-world data where all samples of these proxy scenarios have been removed and compare it to other detectors trained on data where the removed samples have been replaced with various degrees of synthetic data. A method for generating and automatically and accurately annotating synthetic data, using features in the CARLA simulator, is presented. Also, the domain gap between the synthetic and real-world data is analyzed and methods in domain adaptation and data augmentation are reviewed. The presented experiments show that aligning statistical properties between the synthetic and real-world datasets distinctly mitigates the domain gap. There are also clear indications that synthetic data can help detect pedestrians in critical traffic situations <p>Examensarbetet är utfört vid Institutionen för teknik och naturvetenskap (ITN) vid Tekniska fakulteten, Linköpings universitet</p>Student thesisinfo:eu-repo/semantics/bachelorThesistexthttp://urn.kb.se/resolve?urn=urn:nbn:se:liu:diva-177779application/pdfinfo:eu-repo/semantics/openAccess |
collection |
NDLTD |
language |
English |
format |
Others
|
sources |
NDLTD |
topic |
Machine Learning Autonomous Driving Synthetic Data Critical Traffic Scenarios Pedestrian Detection Image Processing Robotics Robotteknik och automation |
spellingShingle |
Machine Learning Autonomous Driving Synthetic Data Critical Traffic Scenarios Pedestrian Detection Image Processing Robotics Robotteknik och automation Collin, Sofie Synthetic Data for Training and Evaluation of Critical Traffic Scenarios |
description |
Modern camera-based vehicle safety systems heavily rely on machine learning and consequently require large amounts of training data to perform reliably. However, collecting and annotating the needed data is an extremely expensive and time-consuming process. In addition, it is exceptionally difficult to collect data that covers critical scenarios. This thesis investigates to what extent synthetic data can replace real-world data for these scenarios. Since only a limited amount of data consisting of such real-world scenarios is available, this thesis instead makes use of proxy scenarios, e.g. situations when pedestrians are located closely in front of the vehicle (for example at a crosswalk). The presented approach involves training a detector on real-world data where all samples of these proxy scenarios have been removed and compare it to other detectors trained on data where the removed samples have been replaced with various degrees of synthetic data. A method for generating and automatically and accurately annotating synthetic data, using features in the CARLA simulator, is presented. Also, the domain gap between the synthetic and real-world data is analyzed and methods in domain adaptation and data augmentation are reviewed. The presented experiments show that aligning statistical properties between the synthetic and real-world datasets distinctly mitigates the domain gap. There are also clear indications that synthetic data can help detect pedestrians in critical traffic situations === <p>Examensarbetet är utfört vid Institutionen för teknik och naturvetenskap (ITN) vid Tekniska fakulteten, Linköpings universitet</p> |
author |
Collin, Sofie |
author_facet |
Collin, Sofie |
author_sort |
Collin, Sofie |
title |
Synthetic Data for Training and Evaluation of Critical Traffic Scenarios |
title_short |
Synthetic Data for Training and Evaluation of Critical Traffic Scenarios |
title_full |
Synthetic Data for Training and Evaluation of Critical Traffic Scenarios |
title_fullStr |
Synthetic Data for Training and Evaluation of Critical Traffic Scenarios |
title_full_unstemmed |
Synthetic Data for Training and Evaluation of Critical Traffic Scenarios |
title_sort |
synthetic data for training and evaluation of critical traffic scenarios |
publisher |
Linköpings universitet, Medie- och Informationsteknik |
publishDate |
2021 |
url |
http://urn.kb.se/resolve?urn=urn:nbn:se:liu:diva-177779 |
work_keys_str_mv |
AT collinsofie syntheticdatafortrainingandevaluationofcriticaltrafficscenarios |
_version_ |
1719415815759986688 |