Unsupervised Learning Universal Critical Behavior via the Intrinsic Dimension

The identification of universal properties from minimally processed data sets is one goal of machine learning techniques applied to statistical physics. Here, we study how the minimum number of variables needed to accurately describe the important features of a data set—the intrinsic dimension (I_{d...

Full description

Bibliographic Details
Main Authors: T. Mendes-Santos, X. Turkeshi, M. Dalmonte, Alex Rodriguez
Format: Article
Language:English
Published: American Physical Society 2021-02-01
Series:Physical Review X
Online Access:http://doi.org/10.1103/PhysRevX.11.011040
id doaj-022d2f65518840b0bedc492738de178e
record_format Article
spelling doaj-022d2f65518840b0bedc492738de178e2021-02-26T15:17:08ZengAmerican Physical SocietyPhysical Review X2160-33082021-02-0111101104010.1103/PhysRevX.11.011040Unsupervised Learning Universal Critical Behavior via the Intrinsic DimensionT. Mendes-SantosX. TurkeshiM. DalmonteAlex RodriguezThe identification of universal properties from minimally processed data sets is one goal of machine learning techniques applied to statistical physics. Here, we study how the minimum number of variables needed to accurately describe the important features of a data set—the intrinsic dimension (I_{d})—behaves in the vicinity of phase transitions. We employ state-of-the-art nearest-neighbors-based I_{d} estimators to compute the I_{d} of raw Monte Carlo thermal configurations across different phase transitions: first-order, second-order, and Berezinskii-Kosterlitz-Thouless. For all the considered cases, we find that the I_{d} uniquely characterizes the transition regime. The finite-size analysis of the I_{d} allows us to not only identify critical points with an accuracy comparable to methods that rely on a priori identification of order parameters but also to determine the corresponding (critical) exponent ν in the case of continuous transitions. For the case of topological transitions, this analysis overcomes the reported limitations affecting other unsupervised learning methods. Our work reveals how raw data sets display unique signatures of universal behavior in the absence of any dimensional reduction scheme and suggest direct parallelism between conventional order parameters in real space and the intrinsic dimension in the data space.http://doi.org/10.1103/PhysRevX.11.011040
collection DOAJ
language English
format Article
sources DOAJ
author T. Mendes-Santos
X. Turkeshi
M. Dalmonte
Alex Rodriguez
spellingShingle T. Mendes-Santos
X. Turkeshi
M. Dalmonte
Alex Rodriguez
Unsupervised Learning Universal Critical Behavior via the Intrinsic Dimension
Physical Review X
author_facet T. Mendes-Santos
X. Turkeshi
M. Dalmonte
Alex Rodriguez
author_sort T. Mendes-Santos
title Unsupervised Learning Universal Critical Behavior via the Intrinsic Dimension
title_short Unsupervised Learning Universal Critical Behavior via the Intrinsic Dimension
title_full Unsupervised Learning Universal Critical Behavior via the Intrinsic Dimension
title_fullStr Unsupervised Learning Universal Critical Behavior via the Intrinsic Dimension
title_full_unstemmed Unsupervised Learning Universal Critical Behavior via the Intrinsic Dimension
title_sort unsupervised learning universal critical behavior via the intrinsic dimension
publisher American Physical Society
series Physical Review X
issn 2160-3308
publishDate 2021-02-01
description The identification of universal properties from minimally processed data sets is one goal of machine learning techniques applied to statistical physics. Here, we study how the minimum number of variables needed to accurately describe the important features of a data set—the intrinsic dimension (I_{d})—behaves in the vicinity of phase transitions. We employ state-of-the-art nearest-neighbors-based I_{d} estimators to compute the I_{d} of raw Monte Carlo thermal configurations across different phase transitions: first-order, second-order, and Berezinskii-Kosterlitz-Thouless. For all the considered cases, we find that the I_{d} uniquely characterizes the transition regime. The finite-size analysis of the I_{d} allows us to not only identify critical points with an accuracy comparable to methods that rely on a priori identification of order parameters but also to determine the corresponding (critical) exponent ν in the case of continuous transitions. For the case of topological transitions, this analysis overcomes the reported limitations affecting other unsupervised learning methods. Our work reveals how raw data sets display unique signatures of universal behavior in the absence of any dimensional reduction scheme and suggest direct parallelism between conventional order parameters in real space and the intrinsic dimension in the data space.
url http://doi.org/10.1103/PhysRevX.11.011040
work_keys_str_mv AT tmendessantos unsupervisedlearninguniversalcriticalbehaviorviatheintrinsicdimension
AT xturkeshi unsupervisedlearninguniversalcriticalbehaviorviatheintrinsicdimension
AT mdalmonte unsupervisedlearninguniversalcriticalbehaviorviatheintrinsicdimension
AT alexrodriguez unsupervisedlearninguniversalcriticalbehaviorviatheintrinsicdimension
_version_ 1724249373100998656