Entropy, Free Energy, and Work of Restricted Boltzmann Machines
A restricted Boltzmann machine is a generative probabilistic graphic network. A probability of finding the network in a certain configuration is given by the Boltzmann distribution. Given training data, its learning is done by optimizing the parameters of the energy function of the network. In this...
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: |
MDPI AG
2020-05-01
|
Series: | Entropy |
Subjects: | |
Online Access: | https://www.mdpi.com/1099-4300/22/5/538 |
id |
doaj-938aed650c8a449fa8b8b673033a91a0 |
---|---|
record_format |
Article |
spelling |
doaj-938aed650c8a449fa8b8b673033a91a02020-11-25T02:04:34ZengMDPI AGEntropy1099-43002020-05-012253853810.3390/e22050538Entropy, Free Energy, and Work of Restricted Boltzmann MachinesSangchul Oh0Abdelkader Baggag1Hyunchul Nha2Qatar Environment and Energy Research Institute, Hamad Bin Khalifa University, Qatar Foundation, Doha 5825, QatarQatar Computing Research Institute, Hamad Bin Khalifa University, Qatar Foundation, Doha 5825, QatarDepartment of Physics, Texas A&M University at Qatar, Education City, Doha 23874, QatarA restricted Boltzmann machine is a generative probabilistic graphic network. A probability of finding the network in a certain configuration is given by the Boltzmann distribution. Given training data, its learning is done by optimizing the parameters of the energy function of the network. In this paper, we analyze the training process of the restricted Boltzmann machine in the context of statistical physics. As an illustration, for small size bar-and-stripe patterns, we calculate thermodynamic quantities such as entropy, free energy, and internal energy as a function of the training epoch. We demonstrate the growth of the correlation between the visible and hidden layers via the subadditivity of entropies as the training proceeds. Using the Monte-Carlo simulation of trajectories of the visible and hidden vectors in the configuration space, we also calculate the distribution of the work done on the restricted Boltzmann machine by switching the parameters of the energy function. We discuss the Jarzynski equality which connects the path average of the exponential function of the work and the difference in free energies before and after training.https://www.mdpi.com/1099-4300/22/5/538restricted Boltzmann machinesentropysubadditivity of entropyJarzynski equalitymachine learning |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Sangchul Oh Abdelkader Baggag Hyunchul Nha |
spellingShingle |
Sangchul Oh Abdelkader Baggag Hyunchul Nha Entropy, Free Energy, and Work of Restricted Boltzmann Machines Entropy restricted Boltzmann machines entropy subadditivity of entropy Jarzynski equality machine learning |
author_facet |
Sangchul Oh Abdelkader Baggag Hyunchul Nha |
author_sort |
Sangchul Oh |
title |
Entropy, Free Energy, and Work of Restricted Boltzmann Machines |
title_short |
Entropy, Free Energy, and Work of Restricted Boltzmann Machines |
title_full |
Entropy, Free Energy, and Work of Restricted Boltzmann Machines |
title_fullStr |
Entropy, Free Energy, and Work of Restricted Boltzmann Machines |
title_full_unstemmed |
Entropy, Free Energy, and Work of Restricted Boltzmann Machines |
title_sort |
entropy, free energy, and work of restricted boltzmann machines |
publisher |
MDPI AG |
series |
Entropy |
issn |
1099-4300 |
publishDate |
2020-05-01 |
description |
A restricted Boltzmann machine is a generative probabilistic graphic network. A probability of finding the network in a certain configuration is given by the Boltzmann distribution. Given training data, its learning is done by optimizing the parameters of the energy function of the network. In this paper, we analyze the training process of the restricted Boltzmann machine in the context of statistical physics. As an illustration, for small size bar-and-stripe patterns, we calculate thermodynamic quantities such as entropy, free energy, and internal energy as a function of the training epoch. We demonstrate the growth of the correlation between the visible and hidden layers via the subadditivity of entropies as the training proceeds. Using the Monte-Carlo simulation of trajectories of the visible and hidden vectors in the configuration space, we also calculate the distribution of the work done on the restricted Boltzmann machine by switching the parameters of the energy function. We discuss the Jarzynski equality which connects the path average of the exponential function of the work and the difference in free energies before and after training. |
topic |
restricted Boltzmann machines entropy subadditivity of entropy Jarzynski equality machine learning |
url |
https://www.mdpi.com/1099-4300/22/5/538 |
work_keys_str_mv |
AT sangchuloh entropyfreeenergyandworkofrestrictedboltzmannmachines AT abdelkaderbaggag entropyfreeenergyandworkofrestrictedboltzmannmachines AT hyunchulnha entropyfreeenergyandworkofrestrictedboltzmannmachines |
_version_ |
1724942444420661248 |