ML-Based Analysis of Particle Distributions in High-Intensity Laser Experiments: Role of Binning Strategy

When entering the phase of big data processing and statistical inferences in experimental physics, the efficient use of machine learning methods may require optimal data preprocessing methods and, in particular, optimal balance between details and noise. In experimental studies of strong-field quant...

Full description

Bibliographic Details
Main Authors: Yury Rodimkov, Evgeny Efimenko, Valentin Volokitin, Elena Panova, Alexey Polovinkin, Iosif Meyerov, Arkady Gonoskov
Format: Article
Language:English
Published: MDPI AG 2021-12-01
Series:Entropy
Subjects:
Online Access:https://www.mdpi.com/1099-4300/23/1/21
id doaj-cb881d2f44504ad6bb73907b08ddf97b
record_format Article
spelling doaj-cb881d2f44504ad6bb73907b08ddf97b2020-12-26T00:03:41ZengMDPI AGEntropy1099-43002021-12-0123212110.3390/e23010021ML-Based Analysis of Particle Distributions in High-Intensity Laser Experiments: Role of Binning StrategyYury Rodimkov0Evgeny Efimenko1Valentin Volokitin2Elena Panova3Alexey Polovinkin4Iosif Meyerov5Arkady Gonoskov6Department of Mathematical Software and Supercomputing Technologies, Lobachevsky University, 603950 Nizhni Novgorod, RussiaInstitute of Applied Physics of the Russian Academy of Sciences, 603950 Nizhni Novgorod, RussiaDepartment of Mathematical Software and Supercomputing Technologies, Lobachevsky University, 603950 Nizhni Novgorod, RussiaDepartment of Mathematical Software and Supercomputing Technologies, Lobachevsky University, 603950 Nizhni Novgorod, RussiaAdv Learning Systems, TDAA, Intel, Chandler, AZ 85226, USADepartment of Mathematical Software and Supercomputing Technologies, Lobachevsky University, 603950 Nizhni Novgorod, RussiaDepartment of Mathematical Software and Supercomputing Technologies, Lobachevsky University, 603950 Nizhni Novgorod, RussiaWhen entering the phase of big data processing and statistical inferences in experimental physics, the efficient use of machine learning methods may require optimal data preprocessing methods and, in particular, optimal balance between details and noise. In experimental studies of strong-field quantum electrodynamics with intense lasers, this balance concerns data binning for the observed distributions of particles and photons. Here we analyze the aspect of binning with respect to different machine learning methods (Support Vector Machine (SVM), Gradient Boosting Trees (GBT), Fully-Connected Neural Network (FCNN), Convolutional Neural Network (CNN)) using numerical simulations that mimic expected properties of upcoming experiments. We see that binning can crucially affect the performance of SVM and GBT, and, to a less extent, FCNN and CNN. This can be interpreted as the latter methods being able to effectively learn the optimal binning, discarding unnecessary information. Nevertheless, given limited training sets, the results indicate that the efficiency can be increased by optimizing the binning scale along with other hyperparameters. We present specific measurements of accuracy that can be useful for planning of experiments in the specified research area.https://www.mdpi.com/1099-4300/23/1/21laser physicsartificial neural networksfully-connected neural networks
collection DOAJ
language English
format Article
sources DOAJ
author Yury Rodimkov
Evgeny Efimenko
Valentin Volokitin
Elena Panova
Alexey Polovinkin
Iosif Meyerov
Arkady Gonoskov
spellingShingle Yury Rodimkov
Evgeny Efimenko
Valentin Volokitin
Elena Panova
Alexey Polovinkin
Iosif Meyerov
Arkady Gonoskov
ML-Based Analysis of Particle Distributions in High-Intensity Laser Experiments: Role of Binning Strategy
Entropy
laser physics
artificial neural networks
fully-connected neural networks
author_facet Yury Rodimkov
Evgeny Efimenko
Valentin Volokitin
Elena Panova
Alexey Polovinkin
Iosif Meyerov
Arkady Gonoskov
author_sort Yury Rodimkov
title ML-Based Analysis of Particle Distributions in High-Intensity Laser Experiments: Role of Binning Strategy
title_short ML-Based Analysis of Particle Distributions in High-Intensity Laser Experiments: Role of Binning Strategy
title_full ML-Based Analysis of Particle Distributions in High-Intensity Laser Experiments: Role of Binning Strategy
title_fullStr ML-Based Analysis of Particle Distributions in High-Intensity Laser Experiments: Role of Binning Strategy
title_full_unstemmed ML-Based Analysis of Particle Distributions in High-Intensity Laser Experiments: Role of Binning Strategy
title_sort ml-based analysis of particle distributions in high-intensity laser experiments: role of binning strategy
publisher MDPI AG
series Entropy
issn 1099-4300
publishDate 2021-12-01
description When entering the phase of big data processing and statistical inferences in experimental physics, the efficient use of machine learning methods may require optimal data preprocessing methods and, in particular, optimal balance between details and noise. In experimental studies of strong-field quantum electrodynamics with intense lasers, this balance concerns data binning for the observed distributions of particles and photons. Here we analyze the aspect of binning with respect to different machine learning methods (Support Vector Machine (SVM), Gradient Boosting Trees (GBT), Fully-Connected Neural Network (FCNN), Convolutional Neural Network (CNN)) using numerical simulations that mimic expected properties of upcoming experiments. We see that binning can crucially affect the performance of SVM and GBT, and, to a less extent, FCNN and CNN. This can be interpreted as the latter methods being able to effectively learn the optimal binning, discarding unnecessary information. Nevertheless, given limited training sets, the results indicate that the efficiency can be increased by optimizing the binning scale along with other hyperparameters. We present specific measurements of accuracy that can be useful for planning of experiments in the specified research area.
topic laser physics
artificial neural networks
fully-connected neural networks
url https://www.mdpi.com/1099-4300/23/1/21
work_keys_str_mv AT yuryrodimkov mlbasedanalysisofparticledistributionsinhighintensitylaserexperimentsroleofbinningstrategy
AT evgenyefimenko mlbasedanalysisofparticledistributionsinhighintensitylaserexperimentsroleofbinningstrategy
AT valentinvolokitin mlbasedanalysisofparticledistributionsinhighintensitylaserexperimentsroleofbinningstrategy
AT elenapanova mlbasedanalysisofparticledistributionsinhighintensitylaserexperimentsroleofbinningstrategy
AT alexeypolovinkin mlbasedanalysisofparticledistributionsinhighintensitylaserexperimentsroleofbinningstrategy
AT iosifmeyerov mlbasedanalysisofparticledistributionsinhighintensitylaserexperimentsroleofbinningstrategy
AT arkadygonoskov mlbasedanalysisofparticledistributionsinhighintensitylaserexperimentsroleofbinningstrategy
_version_ 1724370672797351936