ML-Based Analysis of Particle Distributions in High-Intensity Laser Experiments: Role of Binning Strategy
When entering the phase of big data processing and statistical inferences in experimental physics, the efficient use of machine learning methods may require optimal data preprocessing methods and, in particular, optimal balance between details and noise. In experimental studies of strong-field quant...
Main Authors: | , , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
MDPI AG
2021-12-01
|
Series: | Entropy |
Subjects: | |
Online Access: | https://www.mdpi.com/1099-4300/23/1/21 |
id |
doaj-cb881d2f44504ad6bb73907b08ddf97b |
---|---|
record_format |
Article |
spelling |
doaj-cb881d2f44504ad6bb73907b08ddf97b2020-12-26T00:03:41ZengMDPI AGEntropy1099-43002021-12-0123212110.3390/e23010021ML-Based Analysis of Particle Distributions in High-Intensity Laser Experiments: Role of Binning StrategyYury Rodimkov0Evgeny Efimenko1Valentin Volokitin2Elena Panova3Alexey Polovinkin4Iosif Meyerov5Arkady Gonoskov6Department of Mathematical Software and Supercomputing Technologies, Lobachevsky University, 603950 Nizhni Novgorod, RussiaInstitute of Applied Physics of the Russian Academy of Sciences, 603950 Nizhni Novgorod, RussiaDepartment of Mathematical Software and Supercomputing Technologies, Lobachevsky University, 603950 Nizhni Novgorod, RussiaDepartment of Mathematical Software and Supercomputing Technologies, Lobachevsky University, 603950 Nizhni Novgorod, RussiaAdv Learning Systems, TDAA, Intel, Chandler, AZ 85226, USADepartment of Mathematical Software and Supercomputing Technologies, Lobachevsky University, 603950 Nizhni Novgorod, RussiaDepartment of Mathematical Software and Supercomputing Technologies, Lobachevsky University, 603950 Nizhni Novgorod, RussiaWhen entering the phase of big data processing and statistical inferences in experimental physics, the efficient use of machine learning methods may require optimal data preprocessing methods and, in particular, optimal balance between details and noise. In experimental studies of strong-field quantum electrodynamics with intense lasers, this balance concerns data binning for the observed distributions of particles and photons. Here we analyze the aspect of binning with respect to different machine learning methods (Support Vector Machine (SVM), Gradient Boosting Trees (GBT), Fully-Connected Neural Network (FCNN), Convolutional Neural Network (CNN)) using numerical simulations that mimic expected properties of upcoming experiments. We see that binning can crucially affect the performance of SVM and GBT, and, to a less extent, FCNN and CNN. This can be interpreted as the latter methods being able to effectively learn the optimal binning, discarding unnecessary information. Nevertheless, given limited training sets, the results indicate that the efficiency can be increased by optimizing the binning scale along with other hyperparameters. We present specific measurements of accuracy that can be useful for planning of experiments in the specified research area.https://www.mdpi.com/1099-4300/23/1/21laser physicsartificial neural networksfully-connected neural networks |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Yury Rodimkov Evgeny Efimenko Valentin Volokitin Elena Panova Alexey Polovinkin Iosif Meyerov Arkady Gonoskov |
spellingShingle |
Yury Rodimkov Evgeny Efimenko Valentin Volokitin Elena Panova Alexey Polovinkin Iosif Meyerov Arkady Gonoskov ML-Based Analysis of Particle Distributions in High-Intensity Laser Experiments: Role of Binning Strategy Entropy laser physics artificial neural networks fully-connected neural networks |
author_facet |
Yury Rodimkov Evgeny Efimenko Valentin Volokitin Elena Panova Alexey Polovinkin Iosif Meyerov Arkady Gonoskov |
author_sort |
Yury Rodimkov |
title |
ML-Based Analysis of Particle Distributions in High-Intensity Laser Experiments: Role of Binning Strategy |
title_short |
ML-Based Analysis of Particle Distributions in High-Intensity Laser Experiments: Role of Binning Strategy |
title_full |
ML-Based Analysis of Particle Distributions in High-Intensity Laser Experiments: Role of Binning Strategy |
title_fullStr |
ML-Based Analysis of Particle Distributions in High-Intensity Laser Experiments: Role of Binning Strategy |
title_full_unstemmed |
ML-Based Analysis of Particle Distributions in High-Intensity Laser Experiments: Role of Binning Strategy |
title_sort |
ml-based analysis of particle distributions in high-intensity laser experiments: role of binning strategy |
publisher |
MDPI AG |
series |
Entropy |
issn |
1099-4300 |
publishDate |
2021-12-01 |
description |
When entering the phase of big data processing and statistical inferences in experimental physics, the efficient use of machine learning methods may require optimal data preprocessing methods and, in particular, optimal balance between details and noise. In experimental studies of strong-field quantum electrodynamics with intense lasers, this balance concerns data binning for the observed distributions of particles and photons. Here we analyze the aspect of binning with respect to different machine learning methods (Support Vector Machine (SVM), Gradient Boosting Trees (GBT), Fully-Connected Neural Network (FCNN), Convolutional Neural Network (CNN)) using numerical simulations that mimic expected properties of upcoming experiments. We see that binning can crucially affect the performance of SVM and GBT, and, to a less extent, FCNN and CNN. This can be interpreted as the latter methods being able to effectively learn the optimal binning, discarding unnecessary information. Nevertheless, given limited training sets, the results indicate that the efficiency can be increased by optimizing the binning scale along with other hyperparameters. We present specific measurements of accuracy that can be useful for planning of experiments in the specified research area. |
topic |
laser physics artificial neural networks fully-connected neural networks |
url |
https://www.mdpi.com/1099-4300/23/1/21 |
work_keys_str_mv |
AT yuryrodimkov mlbasedanalysisofparticledistributionsinhighintensitylaserexperimentsroleofbinningstrategy AT evgenyefimenko mlbasedanalysisofparticledistributionsinhighintensitylaserexperimentsroleofbinningstrategy AT valentinvolokitin mlbasedanalysisofparticledistributionsinhighintensitylaserexperimentsroleofbinningstrategy AT elenapanova mlbasedanalysisofparticledistributionsinhighintensitylaserexperimentsroleofbinningstrategy AT alexeypolovinkin mlbasedanalysisofparticledistributionsinhighintensitylaserexperimentsroleofbinningstrategy AT iosifmeyerov mlbasedanalysisofparticledistributionsinhighintensitylaserexperimentsroleofbinningstrategy AT arkadygonoskov mlbasedanalysisofparticledistributionsinhighintensitylaserexperimentsroleofbinningstrategy |
_version_ |
1724370672797351936 |