Equation Discovery Using Fast Function Extraction: a Deterministic Symbolic Regression Approach
Advances in machine learning (ML) coupled with increased computational power have enabled identification of patterns in data extracted from complex systems. ML algorithms are actively being sought in recovering physical models or mathematical equations from data. This is a highly valuable technique...
Main Authors: | , |
---|---|
Format: | Article |
Language: | English |
Published: |
MDPI AG
2019-06-01
|
Series: | Fluids |
Subjects: | |
Online Access: | https://www.mdpi.com/2311-5521/4/2/111 |
id |
doaj-91b0c06cdcc449cd96ef16b2fba93202 |
---|---|
record_format |
Article |
spelling |
doaj-91b0c06cdcc449cd96ef16b2fba932022020-11-25T01:52:00ZengMDPI AGFluids2311-55212019-06-014211110.3390/fluids4020111fluids4020111Equation Discovery Using Fast Function Extraction: a Deterministic Symbolic Regression ApproachHarsha Vaddireddy0Omer San1School of Mechanical and Aerospace Engineering, Oklahoma State University, Stillwater, OK 74078, USASchool of Mechanical and Aerospace Engineering, Oklahoma State University, Stillwater, OK 74078, USAAdvances in machine learning (ML) coupled with increased computational power have enabled identification of patterns in data extracted from complex systems. ML algorithms are actively being sought in recovering physical models or mathematical equations from data. This is a highly valuable technique where models cannot be built using physical reasoning alone. In this paper, we investigate the application of fast function extraction (FFX), a fast, scalable, deterministic symbolic regression algorithm to recover partial differential equations (PDEs). FFX identifies active bases among a huge set of candidate basis functions and their corresponding coefficients from recorded snapshot data. This approach uses a sparsity-promoting technique from compressive sensing and sparse optimization called pathwise regularized learning to perform feature selection and parameter estimation. Furthermore, it recovers several models of varying complexity (number of basis terms). FFX finally filters out many identified models using non-dominated sorting and forms a Pareto front consisting of optimal models with respect to minimizing complexity and test accuracy. Numerical experiments are carried out to recover several ubiquitous PDEs such as wave and heat equations among linear PDEs and Burgers, Korteweg−de Vries (KdV), and Kawahara equations among higher-order nonlinear PDEs. Additional simulations are conducted on the same PDEs under noisy conditions to test the robustness of the proposed approach.https://www.mdpi.com/2311-5521/4/2/111deterministic symbolic regressionfast function extractioncompressive sensingpathwise regularized learningnon-dominated sorting |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Harsha Vaddireddy Omer San |
spellingShingle |
Harsha Vaddireddy Omer San Equation Discovery Using Fast Function Extraction: a Deterministic Symbolic Regression Approach Fluids deterministic symbolic regression fast function extraction compressive sensing pathwise regularized learning non-dominated sorting |
author_facet |
Harsha Vaddireddy Omer San |
author_sort |
Harsha Vaddireddy |
title |
Equation Discovery Using Fast Function Extraction: a Deterministic Symbolic Regression Approach |
title_short |
Equation Discovery Using Fast Function Extraction: a Deterministic Symbolic Regression Approach |
title_full |
Equation Discovery Using Fast Function Extraction: a Deterministic Symbolic Regression Approach |
title_fullStr |
Equation Discovery Using Fast Function Extraction: a Deterministic Symbolic Regression Approach |
title_full_unstemmed |
Equation Discovery Using Fast Function Extraction: a Deterministic Symbolic Regression Approach |
title_sort |
equation discovery using fast function extraction: a deterministic symbolic regression approach |
publisher |
MDPI AG |
series |
Fluids |
issn |
2311-5521 |
publishDate |
2019-06-01 |
description |
Advances in machine learning (ML) coupled with increased computational power have enabled identification of patterns in data extracted from complex systems. ML algorithms are actively being sought in recovering physical models or mathematical equations from data. This is a highly valuable technique where models cannot be built using physical reasoning alone. In this paper, we investigate the application of fast function extraction (FFX), a fast, scalable, deterministic symbolic regression algorithm to recover partial differential equations (PDEs). FFX identifies active bases among a huge set of candidate basis functions and their corresponding coefficients from recorded snapshot data. This approach uses a sparsity-promoting technique from compressive sensing and sparse optimization called pathwise regularized learning to perform feature selection and parameter estimation. Furthermore, it recovers several models of varying complexity (number of basis terms). FFX finally filters out many identified models using non-dominated sorting and forms a Pareto front consisting of optimal models with respect to minimizing complexity and test accuracy. Numerical experiments are carried out to recover several ubiquitous PDEs such as wave and heat equations among linear PDEs and Burgers, Korteweg−de Vries (KdV), and Kawahara equations among higher-order nonlinear PDEs. Additional simulations are conducted on the same PDEs under noisy conditions to test the robustness of the proposed approach. |
topic |
deterministic symbolic regression fast function extraction compressive sensing pathwise regularized learning non-dominated sorting |
url |
https://www.mdpi.com/2311-5521/4/2/111 |
work_keys_str_mv |
AT harshavaddireddy equationdiscoveryusingfastfunctionextractionadeterministicsymbolicregressionapproach AT omersan equationdiscoveryusingfastfunctionextractionadeterministicsymbolicregressionapproach |
_version_ |
1724995474079875072 |