The common patterns of abundance: the log series and Zipf's law [version 1; peer review: 3 approved]

In a language corpus, the probability that a word occurs n times is often proportional to 1/n2. Assigning rank, s, to words according to their abundance, log s vs log n typically has a slope of minus one. That simple Zipf's law pattern also arises in the population sizes of cities, the sizes of...

Full description

Bibliographic Details
Main Author: Steven A. Frank
Format: Article
Language:English
Published: F1000 Research Ltd 2019-03-01
Series:F1000Research
Online Access:https://f1000research.com/articles/8-334/v1
id doaj-a73dc1741eb94fc8bec6f71d5904e99d
record_format Article
spelling doaj-a73dc1741eb94fc8bec6f71d5904e99d2020-11-25T03:14:51ZengF1000 Research LtdF1000Research2046-14022019-03-01810.12688/f1000research.18681.120456The common patterns of abundance: the log series and Zipf's law [version 1; peer review: 3 approved]Steven A. Frank0Department of Ecology and Evolutionary Biology, University of California, Irvine, CA, 92697-2525, USAIn a language corpus, the probability that a word occurs n times is often proportional to 1/n2. Assigning rank, s, to words according to their abundance, log s vs log n typically has a slope of minus one. That simple Zipf's law pattern also arises in the population sizes of cities, the sizes of corporations, and other patterns of abundance. By contrast, for the abundances of different biological species, the probability of a population of size n is typically proportional to 1/n, declining exponentially for larger n, the log series pattern. This article shows that the differing patterns of Zipf's law and the log series arise as the opposing endpoints of a more general theory. The general theory follows from the generic form of all probability patterns as a consequence of conserved average values and the associated invariances of scale. To understand the common patterns of abundance, the generic form of probability distributions plus the conserved average abundance is sufficient. The general theory includes cases that are between the Zipf and log series endpoints, providing a broad framework for analyzing widely observed abundance patterns.https://f1000research.com/articles/8-334/v1
collection DOAJ
language English
format Article
sources DOAJ
author Steven A. Frank
spellingShingle Steven A. Frank
The common patterns of abundance: the log series and Zipf's law [version 1; peer review: 3 approved]
F1000Research
author_facet Steven A. Frank
author_sort Steven A. Frank
title The common patterns of abundance: the log series and Zipf's law [version 1; peer review: 3 approved]
title_short The common patterns of abundance: the log series and Zipf's law [version 1; peer review: 3 approved]
title_full The common patterns of abundance: the log series and Zipf's law [version 1; peer review: 3 approved]
title_fullStr The common patterns of abundance: the log series and Zipf's law [version 1; peer review: 3 approved]
title_full_unstemmed The common patterns of abundance: the log series and Zipf's law [version 1; peer review: 3 approved]
title_sort common patterns of abundance: the log series and zipf's law [version 1; peer review: 3 approved]
publisher F1000 Research Ltd
series F1000Research
issn 2046-1402
publishDate 2019-03-01
description In a language corpus, the probability that a word occurs n times is often proportional to 1/n2. Assigning rank, s, to words according to their abundance, log s vs log n typically has a slope of minus one. That simple Zipf's law pattern also arises in the population sizes of cities, the sizes of corporations, and other patterns of abundance. By contrast, for the abundances of different biological species, the probability of a population of size n is typically proportional to 1/n, declining exponentially for larger n, the log series pattern. This article shows that the differing patterns of Zipf's law and the log series arise as the opposing endpoints of a more general theory. The general theory follows from the generic form of all probability patterns as a consequence of conserved average values and the associated invariances of scale. To understand the common patterns of abundance, the generic form of probability distributions plus the conserved average abundance is sufficient. The general theory includes cases that are between the Zipf and log series endpoints, providing a broad framework for analyzing widely observed abundance patterns.
url https://f1000research.com/articles/8-334/v1
work_keys_str_mv AT stevenafrank thecommonpatternsofabundancethelogseriesandzipfslawversion1peerreview3approved
AT stevenafrank commonpatternsofabundancethelogseriesandzipfslawversion1peerreview3approved
_version_ 1724642072400494592