The application of synthetic data generation and data-driven modelling in the development of a fraud detection system for fuel bunkering

As industry continues to embrace Industry 4.0, many sectors now seek to automate fraud detection to ensure reduced financial exposure. However, the data-driven models which are commonly used in the development of such ‘digital solutions’ rely on ‘supervised’ learning techniques which require high re...

Full description

Bibliographic Details
Main Authors: Yanfeng Liang, Behzad Nobakht, Gordon Lindsay
Format: Article
Language:English
Published: Elsevier 2021-12-01
Series:Measurement: Sensors
Subjects:
Online Access:http://www.sciencedirect.com/science/article/pii/S2665917421001884
id doaj-0ec94ed033d34802aabd83b54b62e5e1
record_format Article
spelling doaj-0ec94ed033d34802aabd83b54b62e5e12021-09-27T04:29:05ZengElsevierMeasurement: Sensors2665-91742021-12-0118100225The application of synthetic data generation and data-driven modelling in the development of a fraud detection system for fuel bunkeringYanfeng Liang0Behzad Nobakht1Gordon Lindsay2Corresponding author.; TÜV SÜD National Engineering Laboratory, East Kilbride, Scotland, United KingdomTÜV SÜD National Engineering Laboratory, East Kilbride, Scotland, United KingdomTÜV SÜD National Engineering Laboratory, East Kilbride, Scotland, United KingdomAs industry continues to embrace Industry 4.0, many sectors now seek to automate fraud detection to ensure reduced financial exposure. However, the data-driven models which are commonly used in the development of such ‘digital solutions’ rely on ‘supervised’ learning techniques which require high resolution datasets containing labelled instances of the specific fraudulent activity. In reality, applications such as engineering and manufacturing only have limited datasets which contain such information and recreating the physical conditions surrounding the fraudulent activity is often not practical or is illegal. This paper details a collaborative R&D project undertaken for the fuel bunkering industry; whereby data-driven models were designed to detect fraudulent activity during fuel transfer operations. Synthetic data generation was used to build up high resolution datasets based on field data which contained instances of fraud. The results demonstrate successful synthetic data generation and modelling techniques with high predictive accuracies.http://www.sciencedirect.com/science/article/pii/S2665917421001884Synthetic dataData-driven modellingMachine learningCoriolisBunkering
collection DOAJ
language English
format Article
sources DOAJ
author Yanfeng Liang
Behzad Nobakht
Gordon Lindsay
spellingShingle Yanfeng Liang
Behzad Nobakht
Gordon Lindsay
The application of synthetic data generation and data-driven modelling in the development of a fraud detection system for fuel bunkering
Measurement: Sensors
Synthetic data
Data-driven modelling
Machine learning
Coriolis
Bunkering
author_facet Yanfeng Liang
Behzad Nobakht
Gordon Lindsay
author_sort Yanfeng Liang
title The application of synthetic data generation and data-driven modelling in the development of a fraud detection system for fuel bunkering
title_short The application of synthetic data generation and data-driven modelling in the development of a fraud detection system for fuel bunkering
title_full The application of synthetic data generation and data-driven modelling in the development of a fraud detection system for fuel bunkering
title_fullStr The application of synthetic data generation and data-driven modelling in the development of a fraud detection system for fuel bunkering
title_full_unstemmed The application of synthetic data generation and data-driven modelling in the development of a fraud detection system for fuel bunkering
title_sort application of synthetic data generation and data-driven modelling in the development of a fraud detection system for fuel bunkering
publisher Elsevier
series Measurement: Sensors
issn 2665-9174
publishDate 2021-12-01
description As industry continues to embrace Industry 4.0, many sectors now seek to automate fraud detection to ensure reduced financial exposure. However, the data-driven models which are commonly used in the development of such ‘digital solutions’ rely on ‘supervised’ learning techniques which require high resolution datasets containing labelled instances of the specific fraudulent activity. In reality, applications such as engineering and manufacturing only have limited datasets which contain such information and recreating the physical conditions surrounding the fraudulent activity is often not practical or is illegal. This paper details a collaborative R&D project undertaken for the fuel bunkering industry; whereby data-driven models were designed to detect fraudulent activity during fuel transfer operations. Synthetic data generation was used to build up high resolution datasets based on field data which contained instances of fraud. The results demonstrate successful synthetic data generation and modelling techniques with high predictive accuracies.
topic Synthetic data
Data-driven modelling
Machine learning
Coriolis
Bunkering
url http://www.sciencedirect.com/science/article/pii/S2665917421001884
work_keys_str_mv AT yanfengliang theapplicationofsyntheticdatagenerationanddatadrivenmodellinginthedevelopmentofafrauddetectionsystemforfuelbunkering
AT behzadnobakht theapplicationofsyntheticdatagenerationanddatadrivenmodellinginthedevelopmentofafrauddetectionsystemforfuelbunkering
AT gordonlindsay theapplicationofsyntheticdatagenerationanddatadrivenmodellinginthedevelopmentofafrauddetectionsystemforfuelbunkering
AT yanfengliang applicationofsyntheticdatagenerationanddatadrivenmodellinginthedevelopmentofafrauddetectionsystemforfuelbunkering
AT behzadnobakht applicationofsyntheticdatagenerationanddatadrivenmodellinginthedevelopmentofafrauddetectionsystemforfuelbunkering
AT gordonlindsay applicationofsyntheticdatagenerationanddatadrivenmodellinginthedevelopmentofafrauddetectionsystemforfuelbunkering
_version_ 1716867141741838336