<?xml version="1.0" encoding="UTF-8"?><!DOCTYPE article  PUBLIC "-//NLM//DTD Journal Publishing DTD v3.0 20080202//EN" "http://dtd.nlm.nih.gov/publishing/3.0/journalpublishing3.dtd"><article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" dtd-version="3.0" xml:lang="en" article-type="research article"><front><journal-meta><journal-id journal-id-type="publisher-id">APM</journal-id><journal-title-group><journal-title>Advances in Pure Mathematics</journal-title></journal-title-group><issn pub-type="epub">2160-0368</issn><publisher><publisher-name>Scientific Research Publishing</publisher-name></publisher></journal-meta><article-meta><article-id pub-id-type="doi">10.4236/apm.2023.133010</article-id><article-id pub-id-type="publisher-id">APM-123798</article-id><article-categories><subj-group subj-group-type="heading"><subject>Articles</subject></subj-group><subj-group subj-group-type="Discipline-v2"><subject>Physics&amp;Mathematics</subject></subj-group></article-categories><title-group><article-title>
 
 
  Zipf’s Law, Benford’s Law, and Pareto Rule
 
</article-title></title-group><contrib-group><contrib contrib-type="author" xlink:type="simple"><name name-style="western"><surname>Oded</surname><given-names>Kafri</given-names></name><xref ref-type="aff" rid="aff1"><sub>1</sub></xref><xref ref-type="corresp" rid="cor1"><sup>*</sup></xref></contrib></contrib-group><aff id="aff1"><label>1</label><addr-line>Kafri Nihul Ltd., Tel Aviv, Israel</addr-line></aff><pub-date pub-type="epub"><day>15</day><month>03</month><year>2023</year></pub-date><volume>13</volume><issue>03</issue><fpage>174</fpage><lpage>180</lpage><history><date date-type="received"><day>23,</day>	<month>February</month>	<year>2023</year></date><date date-type="rev-recd"><day>19,</day>	<month>March</month>	<year>2023</year>	</date><date date-type="accepted"><day>22,</day>	<month>March</month>	<year>2023</year></date></history><permissions><copyright-statement>&#169; Copyright  2014 by authors and Scientific Research Publishing Inc. </copyright-statement><copyright-year>2014</copyright-year><license><license-p>This work is licensed under the Creative Commons Attribution International License (CC BY). http://creativecommons.org/licenses/by/4.0/</license-p></license></permissions><abstract><p>
 
 
  From a basic probabilistic argumentation, the Zipfian distribution and Benford’s law are derived. It is argued that Zipf’s law fits to calculate the rank probabilities of identical indistinguishable objects and that Benford’s distribution fits to calculate the rank probabilities of distinguishable objects. 
  <em>i.e.</em> in the distribution of words in long texts all the words in a given rank are identical, therefore, the rank distribution is Zipfian. In logarithmic tables, the objects with identical 1st digits are distinguishable as there are many different digits in the 2nd, 3rd… places, etc., and therefore the distribution is according to Benford’s law. Pareto 20 - 80 rule is shown to be an outcome of Benford’s distribution as when the number of ranks is about 10 the probability of 20% of the high probability ranks is equal to the probability of the rest of 80% low probability ranks. It is argued that all these distributions, including the central limit theorem, are outcomes of Planck’s law and are the result of the quantization of energy. This argumentation may be considered a physical origin of probability.
 
</p></abstract><kwd-group><kwd>Zipf’s Law</kwd><kwd> Benford’s Law</kwd><kwd> Pareto 20 - 80 Rule</kwd><kwd> Planck’s Law</kwd><kwd> Max Entropy</kwd></kwd-group></article-meta></front><body><sec id="s1"><title>1. Introduction</title><p>Zipf’s law and Benford’s law are long-tail rank distributions appearing in many copious statistical ensembles [<xref ref-type="bibr" rid="scirp.123798-ref1">1</xref>]. Both laws are considered empirical laws. In 1881, Newcomb [<xref ref-type="bibr" rid="scirp.123798-ref2">2</xref>] found that the probability distribution p ( n ) of the decimal</p><p>digits in the 1<sup>st</sup> digits of the logarithmic table obeys p ( n ) = log ( 1 + 1 n ) , where</p><p>1 ≤ ⌊ n ⌋ ≤ 9 . Benford [<xref ref-type="bibr" rid="scirp.123798-ref3">3</xref>] found in 1938 that Newcomb’s distribution applies to many more ensembles and not only to the logarithmic table. Later [<xref ref-type="bibr" rid="scirp.123798-ref4">4</xref>] [<xref ref-type="bibr" rid="scirp.123798-ref5">5</xref>] the law generalized for N ranks to be,</p><p>p B ( n , N ) = ln ( 1 + 1 n ) ln ( N + 1 ) . (1)</p><p>Eleven years later, in 1949, Zipf [<xref ref-type="bibr" rid="scirp.123798-ref6">6</xref>] discovered that in long texts, in several languages, the most frequent word appears twice as much as the second most frequent word, the second most frequent word appears twice as much as the fourth frequent word, and so on. The Zipfian distribution, similarly to Benford’s law, appears in many ensembles, like populations of cities, bestsellers lists, etc. Zipf’s law can be written [<xref ref-type="bibr" rid="scirp.123798-ref7">7</xref>] [<xref ref-type="bibr" rid="scirp.123798-ref8">8</xref>] as,</p><p>p z ( n , N ) = 1 n H N , (2)</p><p>where H N = ∑ n = 1 N 1 n is the N<sup>th</sup> harmonic number.</p><p>Both Zipf’s law and Benford’s law are obtained from the maximum entropy distribution of indistinguishable balls in N distinguishable boxes, where the boxes are the ranks and the number of the balls is much larger than the number of boxes [<xref ref-type="bibr" rid="scirp.123798-ref8">8</xref>]. In <xref ref-type="fig" rid="fig1">Figure 1</xref>, the Benford distribution and Zipf distribution for 10 ranks are plotted. It is seen that Benford’s law and Zipf’s law are similar but not identical.</p><p>Hereafter, we derive both laws using basic probabilistic tools and explain the differences between them. In addition, we derive the Pareto 20 - 80 rule of thumb for Benford’s law and discuss their origin and limitations.</p></sec><sec id="s2"><title>2. Zipf’s Law</title><p>Suppose that there are N identical biscuits and a mouse in a closed space. The mouse eats every day one biscuit. What is the probability of a biscuit being eaten on the d day?</p><p>The maximum survival days n that a biscuit has at the day d is,</p><p>n = N + 1 − d ,</p><p>where 1 ≤ ⌊ d ⌋ ≤ N .</p><p>On the first day, d = 1 , the biscuit has maximum n = N days to survive. Where d = N , the biscuit has only n = 1 day. The probability p of the biscuit to be eaten is inversely proportional to n, namely, p ∝ 1 / n , therefore, the normalized probability distribution is,</p><p>p Z ( n , N ) = 1 n ∑ n = 1 N 1 n = 1 n H N ,</p><p>which is Zipf’s law.</p><p>We see that the probability of a biscuit being eaten on the day n obeys Zipf’s law. This model, which is similar to the coupon collector problem, is identical to the word distribution of long texts. Suppose that one wants to write a text of N</p><p>words. The first word has a probability of 1 N , the second word 1 N − 1 , etc. In the discussion, we explain why the Zipf distribution is so general.</p></sec><sec id="s3"><title>3. Benford’s Law</title><p>Benford’s law is obtained by applying the Riemann sum to Zipf’s law [<xref ref-type="bibr" rid="scirp.123798-ref8">8</xref>] [<xref ref-type="bibr" rid="scirp.123798-ref9">9</xref>]. If we assume that is continuous, then,</p><p>1 n ≈ ∫ n ′ = n n + 1 d n ′ n ′ = ln ( 1 + 1 n ) and H N ≈ ∫ n ′ = 1 N + 1 d n ′ n ′ = ln ( N + 1 )</p><p>Substitute these integrals in Zipf’s law (Equation (2)) and we obtain Benford’s law (Equation (1)).</p><p>Benford’s law seems to approximate the more accurate Zipf’s law. However, under certain conditions, Benford’s law is more accurate than Zipf’s law. For example, suppose that a pig that eats M biscuits per day replaces the mouse in the example above. In this case, a day becomes a rank that contains M biscuits. Since in a day there are M biscuits, the probability of a biscuit m to be eaten in the n day is,</p><p>p Z ( n , m , N M ) = 1 H N M ⋅ 1 n M + m . (3)</p><p>The probability to be eaten in the whole n<sup>th</sup> day is</p><p>p Z ( n + 1 , N M ) = 1 H N M ⋅ ∑ m = 1 M + 1 1 n M + m .</p><p>Since ∑ m = 1 M + 1 1 n M + m = H ( n + 1 ) M − H n M , therefore,</p><p>p Z ( n + 1 , N M ) = H ( n + 1 ) M − H n M H N M .</p><p>for M ≫ 1 , we can use the approximation</p><p>lim M → ∞ H M = ln ( M ) + γ ,</p><p>where γ ≈ 0.577 is the Euler-Mascheroni constant. Therefore,</p><p>p Z ( n + 1 , N M ) ≈ ln [ ( n + 1 ) M ] − ln [ n M ] + γ − γ H N M = 1 H N M ln ( 1 + 1 n ) . (4)</p><p>Equation (4), when renormalized, yields Benford’s law. It is seen that Benford’s law is obtained when there are sub-distributions inside Zipf’s ranks.</p></sec><sec id="s4"><title>4. Pareto 20 - 80 Rule of Thumb</title><p>In 1906, Italian economist Vilfredo Pareto [<xref ref-type="bibr" rid="scirp.123798-ref10">10</xref>] observed that 20% of the people in his country owned 80% of the nation’s wealth. That rule was found to apply with uncanny accuracy to many situations and be useful in many disciplines, including the study of business productivity. Hereafter we show that the Pareto principle can be easily calculated from Benford’s law. To do so we have to find the rank n &#175; which is the sum of the probabilities up to n &#175; is equal to the sum above it. In Benford’s law, the rank n &#175; obeys,</p><p>∑ n = 1 n &#175; ln ( 1 + 1 n ) = ∑ n = n &#175; N + 1 ln ( 1 + 1 n ) ,</p><p>which yields; 2 ln ( n &#175; + 1 ) = ln ( N + 1 ) , or</p><p>n &#175; = N + 1 − 1 . (5)</p><p>The Pareto ratio is simply,</p><p>n &#175; N + 1 : N + 1 − n &#175; N + 1 (6)</p><p>Therefore n &#175; / ( N + 1 ) is the fraction of the ranks that have equal probability to the rest of the ranks and according to the Pareto rule is 0.2.</p><p>Zipf’s law does not fit for Pareto ratio calculation as the distribution within the ranks does not exist and therefore none-integer n &#175; has no meaning. Benford’s law is used for fraud detection of financial reports [<xref ref-type="bibr" rid="scirp.123798-ref11">11</xref>] [<xref ref-type="bibr" rid="scirp.123798-ref12">12</xref>]. However, Benford’s distributions appear in many other statistics, of which a notable one is wealth distribution [<xref ref-type="bibr" rid="scirp.123798-ref13">13</xref>]. Pareto 20 - 80 distribution and Gini inequality index in free economies are in agreement with Benford’s law [<xref ref-type="bibr" rid="scirp.123798-ref14">14</xref>]. However, as was shown Zipf’s law, Benford’s law and Pareto’s rule are sensitive to the number of ranks N. Namely, the same distribution of probabilities yields different ratios between the ranks probabilities when N is changed. In <xref ref-type="fig" rid="fig2">Figure 2</xref>, we see that</p><p>around N ≈ 10 , the ratio 20 - 80 is a pretty good approximation of Benford’s law distribution which fits better for the economy in which the incomes within the ranks are varying.</p></sec><sec id="s5"><title>5. Discussion</title><p>The unequal probability distribution of the power laws is counterintuitive. If all the ranks have an equal probability to have an object, why they don’t have an equal amount of objects? The explanation comes from statistical mechanics, An ensemble of ranks and their probabilities to have indistinguishable objects is analogous to a microcanonical ensemble of N boxes and 〈 n 〉 N balls, where 〈 n 〉 is the average number of balls in a rank. The thermodynamic microcanonical ensemble conserves material, volume, and energy. In the boxes and balls ensemble, the material is the boxes and their number N represents the conservation of volume. The number of balls represents the conservation of energy. According to the second law, in equilibrium, both the probabilities of the boxes to have a ball is equal and, all the microstates’ probabilities are equal. A microstate (a state of the ensemble) is a distinguishable configuration of all the balls in all the boxes [<xref ref-type="bibr" rid="scirp.123798-ref7">7</xref>]. These requirements are an outcome of the second law, which one of its definitions states that in equilibrium the entropy is maximum. Planck calculated the distribution of the balls in the boxes in 1901 [<xref ref-type="bibr" rid="scirp.123798-ref15">15</xref>] [<xref ref-type="bibr" rid="scirp.123798-ref16">16</xref>]. He maximized the entropy of a set of distinguishable oscillators having an average energy k<sub>B</sub>T, and each ball (photon) had an energy hv. Where k<sub>B</sub> is the Boltzmann constant, T is the temperature, h is the Planck constant, and v is the photon’s frequency. The famous Planck result is,</p><p>n = 1 exp ( h v k B T ) − 1 .</p><p>In the Planck equation, n is the occupation number of an oscillator in an ensemble in which the average energy is k<sub>B</sub>T, and each photon has energy hv,</p><p>therefore k B T h v is the average number of photons in an oscillator. If we designate k B T h v = 〈 n 〉 we can write the Planck equation as,</p><p>[ n ] = 1 exp ( 1 〈 n 〉 ) − 1 . (7)</p><p>In equilibrium for a given temperature and frequency all the oscillators should have the same number of photons 〈 n 〉 . Since v and T can have any value, 〈 n 〉 is not necessarily an integer, however, quantum mechanics enables, according to Equation (7), only an integer number of photons [ n ] to exist. Therefore we can calculate the average number of balls 〈 n 〉 = f ( [ n ] ) as a function of the integer number of balls. In the case that 〈 n 〉 ≫ 1 , exp ( 1 / 〈 n 〉 ) ≈ 1 + 1 / 〈 n 〉 , we obtain that n ≈ 〈 n 〉 . This is the classical result in which the occupation number and the number of balls are equal. Thus the probability is given by,</p><p>p ( n ) = 1 〈 n 〉 ≈ 1 n ,</p><p>that when normalized to N boxes, yields Zipf’s law as in Equation (2). In the general case Equation (7) yields</p><p>p ( n ) = 1 〈 n 〉 = ln ( 1 + 1 n ) . (8)</p><p>When Equation (8) is normalized to N boxes it becomes Benford’s law of Equation (1).</p><p>In the case when 〈 n 〉 ≪ 1 , 1 〈 n 〉 ≫ 1 , or exp ( − 1 n ) − 1 ≈ exp ( − 1 n ) , the probability to find n balls, namely</p><p>p ( n ) = 1 〈 n 〉 = 1 exp ( 1 n ) − 1 ≈ exp ( − 1 n ) . (9)</p><p>When normalized Equation (9) yields the canonical distribution namely.</p><p>p ( n , N ) = exp ( − 1 n ) ∑ n = 1 N + 1 exp ( − 1 n ) .</p><p>The normalization factor</p><p>Z = ∑ n = 1 N + 1 exp ( − 1 n ) ≈ ∑ n = 1 ∞ exp ( − 1 n )</p><p>is the canonical partition function, which yields the central limit theorem in the limit of very small 〈 n 〉 [<xref ref-type="bibr" rid="scirp.123798-ref9">9</xref>].</p></sec><sec id="s6"><title>6. Summary</title><p>Zipf’s law, Benford’s law, and Pareto’s 20 - 80 rule are considered empirical laws. We argue that Zipf’s law is the rank distribution of indistinguishable objects, while Benford’s law is the rank distribution in which the objects within the rank are distinguishable. Pareto’s 20 - 80 ratio, was found to be in good agreement with Benford’s law in the vicinity of 10 ranks. It has also been argued that all these distributions, including the central limit theorem, can be derived from Planck’s law and are the result of the quantization of energy. This argumentation may be considered a physical origin of probability.</p></sec><sec id="s7"><title>Acknowledgements</title><p>I thank H. Kafri and E. Fishof for reading the manuscript and for their useful comments.</p></sec><sec id="s8"><title>Conflicts of Interest</title><p>The author declares no conflicts of interest regarding the publication of this paper.</p></sec><sec id="s9"><title>Cite this paper</title><p>Kafri, O. (2023) Zipf’s Law, Benford’s Law, and Pareto Rule. Advances in Pure Mathematics, 13, 174-180. https://doi.org/10.4236/apm.2023.133010</p></sec></body><back><ref-list><title>References</title><ref id="scirp.123798-ref1"><label>1</label><mixed-citation publication-type="other" xlink:type="simple">Tao, T. (2009) Benford’s Law, Zipf’s Law, and the Pareto Distribution. https://terrytao.wordpress.com/2009/07/03</mixed-citation></ref><ref id="scirp.123798-ref2"><label>2</label><mixed-citation publication-type="other" xlink:type="simple">Newcomb, S. (1881) Note on the Frequency of Use of the Different Digits in Natural Numbers. American Journal of Mathematics, 4, 39-40. https://doi.org/10.2307/2369148</mixed-citation></ref><ref id="scirp.123798-ref3"><label>3</label><mixed-citation publication-type="journal" xlink:type="simple"><name name-style="western"><surname>Benford</surname><given-names> F. </given-names></name>,<etal>et al</etal>. (<year>1938</year>)<article-title>The Law of Anomalous Numbers</article-title><source> Proceedings of the American Mathematical Society</source><volume> 78</volume>,<fpage> 551</fpage>-<lpage>572</lpage>.<pub-id pub-id-type="doi"></pub-id></mixed-citation></ref><ref id="scirp.123798-ref4"><label>4</label><mixed-citation publication-type="other" xlink:type="simple">Kafri, O. (2009) Entropy Principle in Direct Derivation of Benford’s Law.</mixed-citation></ref><ref id="scirp.123798-ref5"><label>5</label><mixed-citation publication-type="other" xlink:type="simple">Kafri, O. and Kafri, H. (2013) Entropy-God’s Dice Game. CreateSpace, 208-209.http://www.entropy-book.com/</mixed-citation></ref><ref id="scirp.123798-ref6"><label>6</label><mixed-citation publication-type="other" xlink:type="simple">Zipf, G.K. (1949) Human Behavior and the Principle of Least-Effort. Addison-Wesley.</mixed-citation></ref><ref id="scirp.123798-ref7"><label>7</label><mixed-citation publication-type="other" xlink:type="simple">Powers, D.M.W. (1998) Applications and Explanations of Zipf’s Law. NeMLaP3/ CoNLL98: ACL, 151-160. https://doi.org/10.3115/1603899.1603924</mixed-citation></ref><ref id="scirp.123798-ref8"><label>8</label><mixed-citation publication-type="other" xlink:type="simple">Kafri, O. (2020) Microcanonical Partition Function.</mixed-citation></ref><ref id="scirp.123798-ref9"><label>9</label><mixed-citation publication-type="other" xlink:type="simple">Kafri, O. (2016) A Novel Approach to Probability. Advances in Pure Mathematics, 6, 201-211. https://doi.org/10.4236/apm.2016.64017</mixed-citation></ref><ref id="scirp.123798-ref10"><label>10</label><mixed-citation publication-type="other" xlink:type="simple">Pareto, V. (1964) “Cours d’économie Politique”: Nouvelle édition par G.-H. Bousquet et G. Busino. Librairie Droz, Geneva. https://doi.org/10.3917/droz.paret.1964.01</mixed-citation></ref><ref id="scirp.123798-ref11"><label>11</label><mixed-citation publication-type="journal" xlink:type="simple"><name name-style="western"><surname>Nigrini</surname><given-names> M. </given-names></name>,<etal>et al</etal>. (<year>1996</year>)<article-title>A Taxpayer Compliance Application of Benford’s Law</article-title><source> The Journal of the American Taxation Association</source><volume> 18</volume>,<fpage> 72</fpage>-<lpage>91</lpage>.<pub-id pub-id-type="doi"></pub-id></mixed-citation></ref><ref id="scirp.123798-ref12"><label>12</label><mixed-citation publication-type="other" xlink:type="simple">Kossovsky, A.E. (2014) Benford’s Law: Theory, The General Law of Relative Quantities, and Forensic Fraud Detection Applications., World Scientific Pub. Co. https://doi.org/10.1142/9089</mixed-citation></ref><ref id="scirp.123798-ref13"><label>13</label><mixed-citation publication-type="other" xlink:type="simple">Kafri, O. (2018) Money Physics and Distributive Justice. CreateSpace, 94-95.</mixed-citation></ref><ref id="scirp.123798-ref14"><label>14</label><mixed-citation publication-type="other" xlink:type="simple">Kafri, O. and Fishof, E. (2016) Economic Inequality as a Statistical Outcome. Journal of Economics Bibliography, 3, 570-576.</mixed-citation></ref><ref id="scirp.123798-ref15"><label>15</label><mixed-citation publication-type="other" xlink:type="simple">Planck, M. (1901) On the Law of Distribution of Energy in the Normal Spectrum. Annalen der Physik, 4, 553. https://doi.org/10.1002/andp.19013090310</mixed-citation></ref><ref id="scirp.123798-ref16"><label>16</label><mixed-citation publication-type="other" xlink:type="simple">Kafri, O. and Kafri, H. (2013) Entropy-God’s Dice Game. CreateSpace, 198-201. http://www.entropy-book.com</mixed-citation></ref></ref-list></back></article>