Technical note: “Bit by bit”: a practical and general approach for evaluating model computational complexity vs. model performance
<p>One of the main objectives of the scientific enterprise is the development of well-performing yet parsimonious models for all natural phenomena and systems. In the 21st century, scientists usually represent their models, hypotheses, and experimental observations using digital computers. Mea...
Main Authors: | , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Copernicus Publications
2021-03-01
|
Series: | Hydrology and Earth System Sciences |
Online Access: | https://hess.copernicus.org/articles/25/1103/2021/hess-25-1103-2021.pdf |
id |
doaj-70dfa8163a3948f8966ffb7fdd2c34bb |
---|---|
record_format |
Article |
spelling |
doaj-70dfa8163a3948f8966ffb7fdd2c34bb2021-03-03T13:13:45ZengCopernicus PublicationsHydrology and Earth System Sciences1027-56061607-79382021-03-01251103111510.5194/hess-25-1103-2021Technical note: “Bit by bit”: a practical and general approach for evaluating model computational complexity vs. model performanceE. Azmi0U. Ehret1S. V. Weijs2B. L. Ruddell3R. A. P. Perdigão4R. A. P. Perdigão5R. A. P. Perdigão6Steinbuch Centre for Computing, Karlsruhe Institute of Technology (KIT), Karlsruhe, GermanyInstitute of Water Resources and River Basin Management, Karlsruhe Institute of Technology (KIT), Karlsruhe, GermanyDepartment of Civil Engineering, University of British Columbia, Vancouver, CanadaSchool of Informatics, Computing, and Cyber Systems, Northern Arizona University, Flagstaff, USAMeteoceanics Interdisciplinary Centre for Complex System Science, Vienna, AustriaCCIAM, Centre for Ecology, Evolution and Environmental Changes, Universidade de Lisboa, Lisbon, PortugalPhysics of Information and Quantum Technologies Group, Instituto de Telecomunicações, Lisbon, Portugal<p>One of the main objectives of the scientific enterprise is the development of well-performing yet parsimonious models for all natural phenomena and systems. In the 21st century, scientists usually represent their models, hypotheses, and experimental observations using digital computers. Measuring performance and parsimony of computer models is therefore a key theoretical and practical challenge for 21st century science. “Performance” here refers to a model's ability to reduce predictive uncertainty about an object of interest. “Parsimony” (or complexity) comprises two aspects: descriptive complexity – the size of the model itself which can be measured by the disk space it occupies – and computational complexity – the model's effort to provide output. Descriptive complexity is related to inference quality and generality; computational complexity is often a practical and economic concern for limited computing resources.</p> <p>In this context, this paper has two distinct but related goals. The first is to propose a practical method of measuring computational complexity by utility software “Strace”, which counts the total number of memory visits while running a model on a computer. The second goal is to propose the “bit by bit” method, which combines measuring computational complexity by “Strace” and measuring model performance by information loss relative to observations, both in bit. For demonstration, we apply the “bit by bit” method to watershed models representing a wide diversity of modelling strategies (artificial neural network, auto-regressive, process-based, and others). We demonstrate that computational complexity as measured by “Strace” is sensitive to all aspects of a model, such as the size of the model itself, the input data it reads, its numerical scheme, and time stepping. We further demonstrate that for each model, the bit counts for computational complexity exceed those for performance by several orders of magnitude and that the differences among the models for both computational complexity and performance can be explained by their setup and are in accordance with expectations.</p> <p>We conclude that measuring computational complexity by “Strace” is practical, and it is also general in the sense that it can be applied to any model that can be run on a digital computer. We further conclude that the “bit by bit” approach is general in the sense that it measures two key aspects of a model in the single unit of bit. We suggest that it can be enhanced by additionally measuring a model's descriptive complexity – also in bit.</p>https://hess.copernicus.org/articles/25/1103/2021/hess-25-1103-2021.pdf |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
E. Azmi U. Ehret S. V. Weijs B. L. Ruddell R. A. P. Perdigão R. A. P. Perdigão R. A. P. Perdigão |
spellingShingle |
E. Azmi U. Ehret S. V. Weijs B. L. Ruddell R. A. P. Perdigão R. A. P. Perdigão R. A. P. Perdigão Technical note: “Bit by bit”: a practical and general approach for evaluating model computational complexity vs. model performance Hydrology and Earth System Sciences |
author_facet |
E. Azmi U. Ehret S. V. Weijs B. L. Ruddell R. A. P. Perdigão R. A. P. Perdigão R. A. P. Perdigão |
author_sort |
E. Azmi |
title |
Technical note: “Bit by bit”: a practical and general approach for evaluating model computational complexity vs. model performance |
title_short |
Technical note: “Bit by bit”: a practical and general approach for evaluating model computational complexity vs. model performance |
title_full |
Technical note: “Bit by bit”: a practical and general approach for evaluating model computational complexity vs. model performance |
title_fullStr |
Technical note: “Bit by bit”: a practical and general approach for evaluating model computational complexity vs. model performance |
title_full_unstemmed |
Technical note: “Bit by bit”: a practical and general approach for evaluating model computational complexity vs. model performance |
title_sort |
technical note: “bit by bit”: a practical and general approach for evaluating model computational complexity vs. model performance |
publisher |
Copernicus Publications |
series |
Hydrology and Earth System Sciences |
issn |
1027-5606 1607-7938 |
publishDate |
2021-03-01 |
description |
<p>One of the main objectives of the scientific enterprise is the development of well-performing yet parsimonious models for all natural phenomena and systems. In the 21st century, scientists usually represent their models, hypotheses, and experimental observations using digital computers. Measuring performance and parsimony of computer models is therefore a key theoretical and practical challenge for 21st century science. “Performance” here refers to a model's ability to reduce predictive uncertainty about an object of interest. “Parsimony” (or complexity) comprises two aspects: descriptive complexity – the size of the model itself which can be measured by the disk space it occupies – and computational complexity – the model's effort to provide output. Descriptive complexity is related to inference quality and generality; computational complexity is often a practical and economic concern for limited computing resources.</p>
<p>In this context, this paper has two distinct but related goals. The first is to propose a practical method of measuring computational complexity by utility software “Strace”, which counts the total number of memory visits
while running a model on a computer. The second goal is to propose the “bit by bit” method, which combines measuring computational complexity by “Strace” and measuring model performance by information loss relative to
observations, both in bit. For demonstration, we apply the “bit by bit” method to watershed models representing a wide diversity of modelling strategies (artificial neural network, auto-regressive, process-based, and
others). We demonstrate that computational complexity as measured by “Strace” is sensitive to all aspects of a model, such as the size of the model itself, the input data it reads, its numerical scheme, and time stepping. We further demonstrate that for each model, the bit counts for computational complexity exceed those for performance by several orders
of magnitude and that the differences among the models for both computational complexity and performance can be explained by their setup and are in accordance with expectations.</p>
<p>We conclude that measuring computational complexity by “Strace” is practical, and it is also general in the sense that it can be applied to any model that can be run on a digital computer. We further conclude that the “bit by bit” approach is general in the sense that it measures two key aspects of a model in the single unit of bit. We suggest that it can be enhanced by additionally measuring a model's descriptive complexity – also
in bit.</p> |
url |
https://hess.copernicus.org/articles/25/1103/2021/hess-25-1103-2021.pdf |
work_keys_str_mv |
AT eazmi technicalnotebitbybitapracticalandgeneralapproachforevaluatingmodelcomputationalcomplexityvsmodelperformance AT uehret technicalnotebitbybitapracticalandgeneralapproachforevaluatingmodelcomputationalcomplexityvsmodelperformance AT svweijs technicalnotebitbybitapracticalandgeneralapproachforevaluatingmodelcomputationalcomplexityvsmodelperformance AT blruddell technicalnotebitbybitapracticalandgeneralapproachforevaluatingmodelcomputationalcomplexityvsmodelperformance AT rapperdigao technicalnotebitbybitapracticalandgeneralapproachforevaluatingmodelcomputationalcomplexityvsmodelperformance AT rapperdigao technicalnotebitbybitapracticalandgeneralapproachforevaluatingmodelcomputationalcomplexityvsmodelperformance AT rapperdigao technicalnotebitbybitapracticalandgeneralapproachforevaluatingmodelcomputationalcomplexityvsmodelperformance |
_version_ |
1724232895239815168 |