Entropy, Free Energy, and Work of Restricted Boltzmann Machines

A restricted Boltzmann machine is a generative probabilistic graphic network. A probability of finding the network in a certain configuration is given by the Boltzmann distribution. Given training data, its learning is done by optimizing the parameters of the energy function of the network. In this...

Full description

Bibliographic Details
Main Authors: Sangchul Oh, Abdelkader Baggag, Hyunchul Nha
Format: Article
Language:English
Published: MDPI AG 2020-05-01
Series:Entropy
Subjects:
Online Access:https://www.mdpi.com/1099-4300/22/5/538
id doaj-938aed650c8a449fa8b8b673033a91a0
record_format Article
spelling doaj-938aed650c8a449fa8b8b673033a91a02020-11-25T02:04:34ZengMDPI AGEntropy1099-43002020-05-012253853810.3390/e22050538Entropy, Free Energy, and Work of Restricted Boltzmann MachinesSangchul Oh0Abdelkader Baggag1Hyunchul Nha2Qatar Environment and Energy Research Institute, Hamad Bin Khalifa University, Qatar Foundation, Doha 5825, QatarQatar Computing Research Institute, Hamad Bin Khalifa University, Qatar Foundation, Doha 5825, QatarDepartment of Physics, Texas A&M University at Qatar, Education City, Doha 23874, QatarA restricted Boltzmann machine is a generative probabilistic graphic network. A probability of finding the network in a certain configuration is given by the Boltzmann distribution. Given training data, its learning is done by optimizing the parameters of the energy function of the network. In this paper, we analyze the training process of the restricted Boltzmann machine in the context of statistical physics. As an illustration, for small size bar-and-stripe patterns, we calculate thermodynamic quantities such as entropy, free energy, and internal energy as a function of the training epoch. We demonstrate the growth of the correlation between the visible and hidden layers via the subadditivity of entropies as the training proceeds. Using the Monte-Carlo simulation of trajectories of the visible and hidden vectors in the configuration space, we also calculate the distribution of the work done on the restricted Boltzmann machine by switching the parameters of the energy function. We discuss the Jarzynski equality which connects the path average of the exponential function of the work and the difference in free energies before and after training.https://www.mdpi.com/1099-4300/22/5/538restricted Boltzmann machinesentropysubadditivity of entropyJarzynski equalitymachine learning
collection DOAJ
language English
format Article
sources DOAJ
author Sangchul Oh
Abdelkader Baggag
Hyunchul Nha
spellingShingle Sangchul Oh
Abdelkader Baggag
Hyunchul Nha
Entropy, Free Energy, and Work of Restricted Boltzmann Machines
Entropy
restricted Boltzmann machines
entropy
subadditivity of entropy
Jarzynski equality
machine learning
author_facet Sangchul Oh
Abdelkader Baggag
Hyunchul Nha
author_sort Sangchul Oh
title Entropy, Free Energy, and Work of Restricted Boltzmann Machines
title_short Entropy, Free Energy, and Work of Restricted Boltzmann Machines
title_full Entropy, Free Energy, and Work of Restricted Boltzmann Machines
title_fullStr Entropy, Free Energy, and Work of Restricted Boltzmann Machines
title_full_unstemmed Entropy, Free Energy, and Work of Restricted Boltzmann Machines
title_sort entropy, free energy, and work of restricted boltzmann machines
publisher MDPI AG
series Entropy
issn 1099-4300
publishDate 2020-05-01
description A restricted Boltzmann machine is a generative probabilistic graphic network. A probability of finding the network in a certain configuration is given by the Boltzmann distribution. Given training data, its learning is done by optimizing the parameters of the energy function of the network. In this paper, we analyze the training process of the restricted Boltzmann machine in the context of statistical physics. As an illustration, for small size bar-and-stripe patterns, we calculate thermodynamic quantities such as entropy, free energy, and internal energy as a function of the training epoch. We demonstrate the growth of the correlation between the visible and hidden layers via the subadditivity of entropies as the training proceeds. Using the Monte-Carlo simulation of trajectories of the visible and hidden vectors in the configuration space, we also calculate the distribution of the work done on the restricted Boltzmann machine by switching the parameters of the energy function. We discuss the Jarzynski equality which connects the path average of the exponential function of the work and the difference in free energies before and after training.
topic restricted Boltzmann machines
entropy
subadditivity of entropy
Jarzynski equality
machine learning
url https://www.mdpi.com/1099-4300/22/5/538
work_keys_str_mv AT sangchuloh entropyfreeenergyandworkofrestrictedboltzmannmachines
AT abdelkaderbaggag entropyfreeenergyandworkofrestrictedboltzmannmachines
AT hyunchulnha entropyfreeenergyandworkofrestrictedboltzmannmachines
_version_ 1724942444420661248