Equation Discovery Using Fast Function Extraction: a Deterministic Symbolic Regression Approach

Advances in machine learning (ML) coupled with increased computational power have enabled identification of patterns in data extracted from complex systems. ML algorithms are actively being sought in recovering physical models or mathematical equations from data. This is a highly valuable technique...

Full description

Bibliographic Details
Main Authors: Harsha Vaddireddy, Omer San
Format: Article
Language:English
Published: MDPI AG 2019-06-01
Series:Fluids
Subjects:
Online Access:https://www.mdpi.com/2311-5521/4/2/111
id doaj-91b0c06cdcc449cd96ef16b2fba93202
record_format Article
spelling doaj-91b0c06cdcc449cd96ef16b2fba932022020-11-25T01:52:00ZengMDPI AGFluids2311-55212019-06-014211110.3390/fluids4020111fluids4020111Equation Discovery Using Fast Function Extraction: a Deterministic Symbolic Regression ApproachHarsha Vaddireddy0Omer San1School of Mechanical and Aerospace Engineering, Oklahoma State University, Stillwater, OK 74078, USASchool of Mechanical and Aerospace Engineering, Oklahoma State University, Stillwater, OK 74078, USAAdvances in machine learning (ML) coupled with increased computational power have enabled identification of patterns in data extracted from complex systems. ML algorithms are actively being sought in recovering physical models or mathematical equations from data. This is a highly valuable technique where models cannot be built using physical reasoning alone. In this paper, we investigate the application of fast function extraction (FFX), a fast, scalable, deterministic symbolic regression algorithm to recover partial differential equations (PDEs). FFX identifies active bases among a huge set of candidate basis functions and their corresponding coefficients from recorded snapshot data. This approach uses a sparsity-promoting technique from compressive sensing and sparse optimization called pathwise regularized learning to perform feature selection and parameter estimation. Furthermore, it recovers several models of varying complexity (number of basis terms). FFX finally filters out many identified models using non-dominated sorting and forms a Pareto front consisting of optimal models with respect to minimizing complexity and test accuracy. Numerical experiments are carried out to recover several ubiquitous PDEs such as wave and heat equations among linear PDEs and Burgers, Korteweg−de Vries (KdV), and Kawahara equations among higher-order nonlinear PDEs. Additional simulations are conducted on the same PDEs under noisy conditions to test the robustness of the proposed approach.https://www.mdpi.com/2311-5521/4/2/111deterministic symbolic regressionfast function extractioncompressive sensingpathwise regularized learningnon-dominated sorting
collection DOAJ
language English
format Article
sources DOAJ
author Harsha Vaddireddy
Omer San
spellingShingle Harsha Vaddireddy
Omer San
Equation Discovery Using Fast Function Extraction: a Deterministic Symbolic Regression Approach
Fluids
deterministic symbolic regression
fast function extraction
compressive sensing
pathwise regularized learning
non-dominated sorting
author_facet Harsha Vaddireddy
Omer San
author_sort Harsha Vaddireddy
title Equation Discovery Using Fast Function Extraction: a Deterministic Symbolic Regression Approach
title_short Equation Discovery Using Fast Function Extraction: a Deterministic Symbolic Regression Approach
title_full Equation Discovery Using Fast Function Extraction: a Deterministic Symbolic Regression Approach
title_fullStr Equation Discovery Using Fast Function Extraction: a Deterministic Symbolic Regression Approach
title_full_unstemmed Equation Discovery Using Fast Function Extraction: a Deterministic Symbolic Regression Approach
title_sort equation discovery using fast function extraction: a deterministic symbolic regression approach
publisher MDPI AG
series Fluids
issn 2311-5521
publishDate 2019-06-01
description Advances in machine learning (ML) coupled with increased computational power have enabled identification of patterns in data extracted from complex systems. ML algorithms are actively being sought in recovering physical models or mathematical equations from data. This is a highly valuable technique where models cannot be built using physical reasoning alone. In this paper, we investigate the application of fast function extraction (FFX), a fast, scalable, deterministic symbolic regression algorithm to recover partial differential equations (PDEs). FFX identifies active bases among a huge set of candidate basis functions and their corresponding coefficients from recorded snapshot data. This approach uses a sparsity-promoting technique from compressive sensing and sparse optimization called pathwise regularized learning to perform feature selection and parameter estimation. Furthermore, it recovers several models of varying complexity (number of basis terms). FFX finally filters out many identified models using non-dominated sorting and forms a Pareto front consisting of optimal models with respect to minimizing complexity and test accuracy. Numerical experiments are carried out to recover several ubiquitous PDEs such as wave and heat equations among linear PDEs and Burgers, Korteweg−de Vries (KdV), and Kawahara equations among higher-order nonlinear PDEs. Additional simulations are conducted on the same PDEs under noisy conditions to test the robustness of the proposed approach.
topic deterministic symbolic regression
fast function extraction
compressive sensing
pathwise regularized learning
non-dominated sorting
url https://www.mdpi.com/2311-5521/4/2/111
work_keys_str_mv AT harshavaddireddy equationdiscoveryusingfastfunctionextractionadeterministicsymbolicregressionapproach
AT omersan equationdiscoveryusingfastfunctionextractionadeterministicsymbolicregressionapproach
_version_ 1724995474079875072