<?xml version="1.0" encoding="UTF-8"?><!DOCTYPE article  PUBLIC "-//NLM//DTD Journal Publishing DTD v3.0 20080202//EN" "http://dtd.nlm.nih.gov/publishing/3.0/journalpublishing3.dtd"><article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" dtd-version="3.0" xml:lang="en" article-type="research article"><front><journal-meta><journal-id journal-id-type="publisher-id">AM</journal-id><journal-title-group><journal-title>Applied Mathematics</journal-title></journal-title-group><issn pub-type="epub">2152-7385</issn><publisher><publisher-name>Scientific Research Publishing</publisher-name></publisher></journal-meta><article-meta><article-id pub-id-type="doi">10.4236/am.2021.127041</article-id><article-id pub-id-type="publisher-id">AM-110462</article-id><article-categories><subj-group subj-group-type="heading"><subject>Articles</subject></subj-group><subj-group subj-group-type="Discipline-v2"><subject>Physics&amp;Mathematics</subject></subj-group></article-categories><title-group><article-title>
 
 
  Origin, Alternative Expressions of Newcomb-Benford Law and Deviations of Digit Frequencies
 
</article-title></title-group><contrib-group><contrib contrib-type="author" xlink:type="simple"><name name-style="western"><surname>Gene</surname><given-names>Whyman</given-names></name><xref ref-type="aff" rid="aff1"><sub>1</sub></xref><xref ref-type="corresp" rid="cor1"><sup>*</sup></xref></contrib></contrib-group><aff id="aff1"><label>1</label><addr-line>Physics Department, Faculty of Natural Sciences, Ariel University, Ariel, Israel</addr-line></aff><pub-date pub-type="epub"><day>05</day><month>07</month><year>2021</year></pub-date><volume>12</volume><issue>07</issue><fpage>576</fpage><lpage>586</lpage><history><date date-type="received"><day>14,</day>	<month>May</month>	<year>2021</year></date><date date-type="rev-recd"><day>9,</day>	<month>July</month>	<year>2021</year>	</date><date date-type="accepted"><day>12,</day>	<month>July</month>	<year>2021</year></date></history><permissions><copyright-statement>&#169; Copyright  2014 by authors and Scientific Research Publishing Inc. </copyright-statement><copyright-year>2014</copyright-year><license><license-p>This work is licensed under the Creative Commons Attribution International License (CC BY). http://creativecommons.org/licenses/by/4.0/</license-p></license></permissions><abstract><p>
 
 
  The Newcomb-Benford law, which describes the uneven distribution of the frequencies of digits in data sets, is by its nature probabilistic. Therefore, the main goal of this work was to derive formulas for the permissible deviations of the above frequencies (confidence intervals). For this, a previously developed method was used, which represents an alternative to the traditional approach. The alternative formula expressing the Newcomb-Benford law is re-derived. As shown in general form, it is numerically equivalent to the original Benford formula. The obtained formulas for confidence intervals for Benford’s law are shown to be useful for checking arrays of numerical data. Consequences for numeral systems with different bases are analyzed. The alternative expression for the frequencies of digits at the second decimal place is deduced together with the corresponding deviation intervals. In general, in this approach, all the presented results are a consequence of the positionality property of digital systems such as decimal, binary, etc.
 
</p></abstract><kwd-group><kwd>Benford’s Law</kwd><kwd> Confidence Intervals in Benford’s Law</kwd><kwd> Alternative Expression of Benford’s Law</kwd><kwd> Benford’s Law for Different Numeral Systems</kwd><kwd> Frequencies of Digits at the Second Decimal Place</kwd></kwd-group></article-meta></front><body><sec id="s1"><title>1. Introduction</title><p>The surprising fact of the uneven distribution of decimal digits over decimal places was noticed by Newcomb [<xref ref-type="bibr" rid="scirp.110462-ref1">1</xref>] in 1881 and then rediscovered in 1938 by Benford [<xref ref-type="bibr" rid="scirp.110462-ref2">2</xref>], who gave the corresponding mathematical expression</p><p>F ( n ) = log ( 1 + 1 n ) , (1)</p><p>where F(n) is the frequency of numbers having the first digit n (the base of the logarithm is 10). Evidently, the sum of all 9 frequencies equals 1.</p><p>Since then, the law has been repeatedly tested and applied in a wide variety of areas [<xref ref-type="bibr" rid="scirp.110462-ref3">3</xref>] - [<xref ref-type="bibr" rid="scirp.110462-ref10">10</xref>] and continues to attract the attention of researchers [<xref ref-type="bibr" rid="scirp.110462-ref11">11</xref>] - [<xref ref-type="bibr" rid="scirp.110462-ref22">22</xref>]. Meanwhile, efforts have been made to justify or derive the above equation [<xref ref-type="bibr" rid="scirp.110462-ref23">23</xref>] [<xref ref-type="bibr" rid="scirp.110462-ref24">24</xref>] [<xref ref-type="bibr" rid="scirp.110462-ref25">25</xref>] [<xref ref-type="bibr" rid="scirp.110462-ref26">26</xref>]. In Ref [<xref ref-type="bibr" rid="scirp.110462-ref26">26</xref>] it was connected with the scaling invariance of physical laws. It was shown that the Benford law is valid for numbers distributed exponentially [<xref ref-type="bibr" rid="scirp.110462-ref27">27</xref>]. Also, there exists the geometrical explanation of the Benford law [<xref ref-type="bibr" rid="scirp.110462-ref28">28</xref>]. Such a variety of explanations for the same law is somewhat unusual in physics and mathematics where, as a rule, there is a single main reason explaining its origin.</p><p>The simplest explanation of the Newcomb-Benford law has been given by the author of the present communication in cooperation with Ed. Bormashenko and E. Shulzinger [<xref ref-type="bibr" rid="scirp.110462-ref29">29</xref>]. We have shown that Benford’s law follows as a consequence of the “positionality” of numeral systems like the decimal one.</p><p>People unacquainted with the literature on the subject refuse to believe that in arrays of unfalsified data almost a third of decimal numbers begin with the digit 1. At the same time, no one is surprised by the fact that in the binary system all the numbers begin with the digit 1.</p><p>The main assumptions of the present work are as follows. 1) Any natural numerical array, which is being analyzed, is bounded from above either essentially or by the number of presented digits (the position of the decimal point is irrelevant). 2) Within this array, the probability of encountering any number is the same for all numbers, but not for digits in each position.</p><p>First, I will give a brief overview of the cited work [<xref ref-type="bibr" rid="scirp.110462-ref29">29</xref>] presenting its results in a more convenient form. An alternative to Equation (1) expression will be deduced in a new way; its relation to Equation (1) will be clarified. The importance of inequalities for frequencies of digits will be emphasized and illustrated by the distribution of the population in Israeli cities and by the results by state of the 2020 presidential elections in the United States. Finally, the same method of extremal frequencies will be applied to the determination of digits frequencies at the second decimal place.</p></sec><sec id="s2"><title>2. Frequencies of Digits as a Consequence of the Structure of the Positional Numeral System</title><p>Benford’s law is often used to check the reliability of various arrays of numerical data. In any case, these arrays are bounded above and below, and the role of these restrictions can be played simply by the given number of digits. For example, the number of votes cast for a particular candidate in some place is limited by the number of those who have the right to vote; the distribution of the population by city is limited by the population of the country, etc. Аs will be clear from what follows, only the upper limit is significant, while the lower one is unimportant.</p><p>Now let us consider the frequency F(n) of numbers with a certain digit n = 1 , 2 , ⋯ , 9 at first place in the set of natural numbers { 1 , 2 , ⋯ , m } ≡ { m } . How this frequency changes with increasing m? In Ref. [<xref ref-type="bibr" rid="scirp.110462-ref29">29</xref>] it has been shown that the frequency F(n) to pass a sequence of alternating local minimums, F min , k ( n ) , and local maximums F max , k ( n ) . For example, for n = 1, the minimums of F(1) are achieved for values of m equal to 9, 99, 999, … due to the maximum of the denominator in the frequency definition while the numerator remains constant. Starting from these values, the frequency begins to rise since the growth of the numerator in percentage is greater than the growth of the denominator. Maximums are attained at values of m equal to 19, 199, 1999, and so on. Another example is n = 7 where local minimums and maximums of the frequency are at m values of 69, 699, 6999, …, and 79, 799, 7999, …, respectively. In general form, these statements are expressed by the formulas for k = 1 , 2 , 3 , ⋯</p><p>m min , k = n ⋅ 10 k − 1 , (2)</p><p>m max , k = ( n + 1 ) ⋅ 10 k − 1. (3)</p><p>Amount of numbers starting with the digit n up to m min , k equals to 10 k − 1 + 10 k − 2 + ⋯ + 1 = ( 10 k − 1 ) / 9 ; analogously, amount of numbers starting with the digit n up to m max , k equals 10 k + 10 k − 1 + ⋯ + 1 = ( 10 k + 1 − 1 ) / 9 . Once more taking into account Equations ((2), (3)), one has for the frequencies at local minimums and maximums:</p><p>F min , k ( n ) = 10 k − 1 9 ( n ⋅ 10 k − 1 ) , (4)</p><p>F max , k ( n ) = 10 k + 1 − 1 9 [ ( n + 1 ) ⋅ 10 k − 1 ] . (5)</p><p>The dependences of these quantities on m and the digit value n are shown in <xref ref-type="fig" rid="fig1">Figure 1</xref>. With the increase of k, both quantities in Equations ((4), (5)) converge</p><p>very rapidly to</p><p>F min ( n ) = lim k → ∞ F min , k ( n ) = 1 9 n , (6)</p><p>F max ( n ) = lim k → ∞ F max , k ( n ) = 10 9 ( n + 1 ) . (7)</p><p>It is seen that the frequencies vary inversely with a digit value that is the qualitative formulation of the Newcomb-Benford law. The limiting values of frequencies (6) and (7) also presented in <xref ref-type="table" rid="table1">Table 1</xref> are very useful in analyzing real numerical arrays because in reality the exact upper bond, m, is unknown. The lowest bond is also unknown, but this is unessential since contribution of the first terms of the set { 1 , 2 , 3 , ⋯ , m } to both numerator and denominator of the frequency definition is negligible compared to contribution of the last terms.</p><p>From the minimal and maximal asymptotical frequencies, some mean values may be constructed like arithmetic, geometric, logarithmic or harmonic one. The geometric mean turned out to be closest to the original Benford distribution in Equation (1). One has from Equations ((6), (7))</p><p>F ( n ) = 0.43077 n ( n + 1 ) (8)</p><p>where a numerical multiplier is the normalizing factor A = [ ∑ 1 9 1 / i ( i + 1 ) ] − 1 . The quantity F(n) is interpreted as the probability for the digit n to occupy the first place in a number of the decimal numeral system. As is seen from <xref ref-type="table" rid="table1">Table 1</xref>, the differences between the results of Benford’s law (1) and Equation (8) are negligible, at least from a practical point of view.</p></sec><sec id="s3"><title>3. Equivalence of Two Formulations of Benford’s Law</title><p>It is possible to prove the equivalence of Equation (8) to the Benford law (1). After the change of variable n</p><p>n = ( e x − 1 ) − 1 ,       ln ( 10 / 9 ) ≤ x ≤ ln 2 (9)</p><p>Equation (8) turns into</p><p>F ( n ) = A 1 n ( n + 1 ) = 2 A sinh ( x 2 ) = 2 A [ x 2 + 1 6 ( x 2 ) 3 + ⋯ ] . (10)</p><p>Passing to approximate formulas we have</p><table-wrap id="table1" ><label><xref ref-type="table" rid="table1">Table 1</xref></label><caption><title> Frequencies of digits in numerical arrays</title></caption><table><tbody><thead><tr><th align="center" valign="middle" >Digit, n</th><th align="center" valign="middle" >1</th><th align="center" valign="middle" >2</th><th align="center" valign="middle" >3</th><th align="center" valign="middle" >4</th><th align="center" valign="middle" >5</th><th align="center" valign="middle" >6</th><th align="center" valign="middle" >7</th><th align="center" valign="middle" >8</th><th align="center" valign="middle" >9</th></tr></thead><tr><td align="center" valign="middle" >Benford’s Equation (1)</td><td align="center" valign="middle" >0.3010</td><td align="center" valign="middle" >0.1761</td><td align="center" valign="middle" >0.1249</td><td align="center" valign="middle" >0.09691</td><td align="center" valign="middle" >0.07918</td><td align="center" valign="middle" >0.06695</td><td align="center" valign="middle" >0.05799</td><td align="center" valign="middle" >0.05115</td><td align="center" valign="middle" >0.04576</td></tr><tr><td align="center" valign="middle" >F(n), Equation (8)</td><td align="center" valign="middle" >0.3046</td><td align="center" valign="middle" >0.1759</td><td align="center" valign="middle" >0.1244</td><td align="center" valign="middle" >0.09632</td><td align="center" valign="middle" >0.07865</td><td align="center" valign="middle" >0.06647</td><td align="center" valign="middle" >0.05756</td><td align="center" valign="middle" >0.05077</td><td align="center" valign="middle" >0.04541</td></tr><tr><td align="center" valign="middle" >F<sub>min</sub>(n), Equation (6)</td><td align="center" valign="middle" >0.111</td><td align="center" valign="middle" >0.0556</td><td align="center" valign="middle" >0.037</td><td align="center" valign="middle" >0.028</td><td align="center" valign="middle" >0.022</td><td align="center" valign="middle" >0.018</td><td align="center" valign="middle" >0.016</td><td align="center" valign="middle" >0.014</td><td align="center" valign="middle" >0.012</td></tr><tr><td align="center" valign="middle" >F<sub>max</sub>(n), Equation (7)</td><td align="center" valign="middle" >0.556</td><td align="center" valign="middle" >0.370</td><td align="center" valign="middle" >0.278</td><td align="center" valign="middle" >0.222</td><td align="center" valign="middle" >0.185</td><td align="center" valign="middle" >0.159</td><td align="center" valign="middle" >0.139</td><td align="center" valign="middle" >0.123</td><td align="center" valign="middle" >0.111</td></tr></tbody></table></table-wrap><p>F ( n ) = A [ x + O ( x 3 24 ) ] ≈ A x . (11)</p><p>The inverse to (9) transformation is</p><p>x = ln n + 1 n (12)</p><p>that gives after substitution of A =0.43077 from Equation (8) and transition to logarithms with a base of 10</p><p>F ( n ) ≈ 0.992 log n + 1 n . (13)</p><p>The error A x 3 / 24 associated with neglecting higher-order terms reaches a maximum of +0.006 for n = 1, but for n = 2 it is already equal to +0.001. The validity of the proof of the equivalence of Benford’s law and Equation (8) is confirmed by numerical results in the table. In conclusion, the question arises whether to consider the formula (8) as an excellent approximation to Benford’s law (1), or, conversely, to consider Benford’s formula as an excellent approximation to the law (8) deduced from the properties of the decimal numeral system?</p></sec><sec id="s4"><title>4. Population of Israeli Cities and by State Results of the 2020 Presidential Elections in the United States</title><p>From Equations ((6) and (7)) the inequalities for F(n) follow</p><p>1 9 n ≤ F ( n ) ≤ 10 9 ( n + 1 ) , (14)</p><p>which is more useful than Equations ((1), (8)) in applications because the exact upper bound of an array considered is unknown. <xref ref-type="fig" rid="fig2">Figure 2</xref> shows an analysis</p><p>using (8) and (14) of the population distribution over 72 cities of Israel in accordance with the results of the 2008 census [<xref ref-type="bibr" rid="scirp.110462-ref30">30</xref>]. Lacking the right software and programming skills, I choose data with a small number of items and performed the calculation manually. For the same reason, as another example, a sampling of 51 items was considered, which represents the results by state of the 2020 US presidential election [<xref ref-type="bibr" rid="scirp.110462-ref31">31</xref>] (<xref ref-type="fig" rid="fig3">Figure 3</xref>).</p><p>In both Figures, there are only a few cases of going beyond the deviation intervals (14), and these goings are small. It should not be forgotten that all consideration is probabilistic. In addition, for the above reason, small samplings were chosen. The Benford law is often used to detect violations and fraud in datasets. From this point of view, the adequacy of the census and the vote count in the presidential 2020 elections successfully pass this test.</p></sec><sec id="s5"><title>5. Other Positional Numeral Systems</title><p>Generalization to other than decimal numeral systems is straightforward. Instead of Equations ((8) and (14)) one has Equations ((15) and (16)) [<xref ref-type="bibr" rid="scirp.110462-ref29">29</xref>]</p><p>F N ( n ) = A N n ( n + 1 ) (15)</p><p>where N is the base of a numeral system, 1 ≤ n ≤ N − 1 , and A N = [ ∑ 1 N − 1 1 / i ( i + 1 ) ] − 1 is the normalizing factor and</p><p>1 ( N − 1 ) n ≤ F N ( n ) ≤ N ( N − 1 ) ( n + 1 ) . (16)</p><p>In particular, in the binary system (N = 2), all the last equations turn into 1 for n = 1 (all the numbers in the binary system begin with 1).</p><p>Normalizing factors for the most popular numeral systems are as follows: A 2 = 1 , A 8 = 0.467469 , A 10 = 0.430773 , A 16 = 0.353036 . For systems with a low base, the probability of finding the digit 1 at the first place of number is high; for example, F 4 ( 1 ) = 0.503626 . This makes them undesirable for digital encryption since the chances are high that the encoded number starts with one. In this regard, coding using high base numeral systems is preferred; for example, in hexadecimal numeral system this probability is two times lower, F 16 ( 1 ) = 0.249634 .</p></sec><sec id="s6"><title>6. Frequencies of Digits at the Second Decimal Place</title><p>The alternation of local minimums and maxima of digit frequencies when expanding a limited set of natural numbers {m} can also be used for the second decimal position. Let us show this by the example of the digits 0 and 2. For n = 0, maxima are attained for the following values of m: 10, 20, …, 90, 109, 209, …, 909, 1099, 2099, … 9099, 10,999, 20,999, …, 90,999, …. Wherein, amount of numbers contained in {m} with 0 at the second place changes, respectively, as follows: 1, 2, …, 9, 19, 29, …, 99, 199, 299, …, 999, 1999, 2999, … 9999, …. In general form, this is expressed with a help of two indices: i ⋅ 10 k + 10 k − 1 − 1 and ( i + 1 ) ⋅ 10 k − 1 − 1 where i runs from 1 to 9. Thus, the corresponding frequency is</p><p>G i , k max ( 0 ) = ( i + 1 ) ⋅ 10 k − 1 − 1 i ⋅ 10 k + 10 k − 1 − 1 . (17)</p><p>Analogously, for the minimal frequencies the values of m and amount of the required numbers in {m} are, respectively, 9, 19, 29, …, 99, 199, 299, …, 999, 1999, 2999, …, 9999, 19,999, 29,999, …, 99,999, … and 0, 1, 2, …, 9, 19, 29, …, 99, 199, 299, …999, 1999, 2999, …, 9999. The minimal frequencies are</p><p>G i , k min ( 0 ) = i ⋅ 10 k − 1 − 1 i ⋅ 10 k − 1 . (18)</p><p>For n = 2, the maximal frequencies are attained for m values: 12, 22, …, 92, 129, 229, …929, 1299, 2299, …, 9299, 12,999, 22,999, …. 92,999, …. Amount of numbers with 2 at the second place in {m} is as follows: 1, 2, …, 9, 19, 29, …, 99, 199, 299, …, 999, 1999, 2999, …, 9999, …. The formula for the frequency:</p><p>G i , k max ( 2 ) = ( i + 1 ) ⋅ 10 k − 1 − 1 i ⋅ 10 k + 3 ⋅ 10 k − 1 − 1 . (19)</p><p>The minimal frequencies are for m: 11, 21, …, 91, 119, 219, …, 919, 1199, 2199, …, 9199, 11,999, 21,999, …, 91,999, … with the same amount of numbers in numerator as in Equation (18):</p><p>G i , k min ( 2 ) = i ⋅ 10 k − 1 − 1 i ⋅ 10 k + 2 ⋅ 10 k − 1 − 1 . (20)</p><p>For 0 ≤ n ≤ 9 :</p><p>G i , k max ( n ) = ( i + 1 ) ⋅ 10 k − 1 − 1 i ⋅ 10 k + ( n + 1 ) ⋅ 10 k − 1 − 1 (21)</p><p>G i , k min ( n ) = i ⋅ 10 k − 1 − 1 i ⋅ 10 k + n ⋅ 10 k − 1 − 1 . (22)</p><p>Both the minimum of Equation (22) (minimum minimorum) and the maximum of Equation (21) are attained at i = 1 . Further, the limits for k going to infinity:</p><p>G max ( n ) = 2 11 + n , (23)</p><p>G min ( n ) = 1 10 + n . (24)</p><p>One of possible estimations of the probability to find the digit n at the second place of a number may be the normalized geometrical mean of the maximal and minimal frequencies (23) and (24):</p><p>G ( n ) = 1.44237 ( 10 + n ) ( 10 + n + 1 ) (25)</p><p>(compare to Equation (15)).Corresponding numerical data are presented in <xref ref-type="table" rid="table2">Table 2</xref>.</p><p>It is seen from <xref ref-type="table" rid="table2">Table 2</xref> that the confidence intervals strongly overlap that may prevent the second digit statistics from analyzing arrays of numbers. Also, the distribution of probabilities is too smooth.</p><p>For the numeral systems with the arbitrary base N, one has from Equation (25):</p><p>G N ( n ) = B N ( N + n ) ( N + n + 1 ) (26)</p><p>where B N = [ ∑ 0 N − 1 1 / ( N + i ) ( N + i + 1 ) ] − 1 (compare to Equation (25)). For low bases, the probabilities may differ more sharply. Thus, for the binary system (N = 2): G 2 ( 0 ) = 2 − 2 ≈ 0.59 while G 2 ( 1 ) = 2 − 1 ≈ 0.41 .</p></sec><sec id="s7"><title>7. Conclusions</title><p>With the expansion of a bounded set of natural numbers, the density of numbers starting with a certain digit experiences quasiperiodic oscillation (see <xref ref-type="fig" rid="fig1">Figure 1</xref>). The maxima and minima of this oscillation quickly stabilize and determine possible deviations from Benford’s law. Formulas for these deviations are useful when analyzing numeric arrays for fraud.</p><p>The geometric mean of the above minimum and maximum decreases with the increase in values of the initial digits of numbers, giving an alternative quantitative</p><table-wrap id="table2" ><label><xref ref-type="table" rid="table2">Table 2</xref></label><caption><title> Probabilities and deviation intervals for digits at the second decimal place</title></caption><table><tbody><thead><tr><th align="center" valign="middle" >Digit, n</th><th align="center" valign="middle" >0</th><th align="center" valign="middle" >1</th><th align="center" valign="middle" >2</th><th align="center" valign="middle" >3</th><th align="center" valign="middle" >4</th><th align="center" valign="middle" >5</th><th align="center" valign="middle" >6</th><th align="center" valign="middle" >7</th><th align="center" valign="middle" >8</th><th align="center" valign="middle" >9</th></tr></thead><tr><td align="center" valign="middle" >G(n), Equation (25)</td><td align="center" valign="middle" >0.1375</td><td align="center" valign="middle" >0.1255</td><td align="center" valign="middle" >0.1155</td><td align="center" valign="middle" >0.1069</td><td align="center" valign="middle" >0.0995</td><td align="center" valign="middle" >0.0931</td><td align="center" valign="middle" >0.0875</td><td align="center" valign="middle" >0.0825</td><td align="center" valign="middle" >0.0780</td><td align="center" valign="middle" >0.0740</td></tr><tr><td align="center" valign="middle" >G min ( n ) , Equation (24)</td><td align="center" valign="middle" >0.1000</td><td align="center" valign="middle" >0.0909</td><td align="center" valign="middle" >0.0833</td><td align="center" valign="middle" >0.0769</td><td align="center" valign="middle" >0.0714</td><td align="center" valign="middle" >0.0667</td><td align="center" valign="middle" >0.0625</td><td align="center" valign="middle" >0.0588</td><td align="center" valign="middle" >0.0556</td><td align="center" valign="middle" >0.0526</td></tr><tr><td align="center" valign="middle" >G max ( n ) , Equation (23)</td><td align="center" valign="middle" >0.1818</td><td align="center" valign="middle" >0.1667</td><td align="center" valign="middle" >0.1538</td><td align="center" valign="middle" >0.1429</td><td align="center" valign="middle" >0.1333</td><td align="center" valign="middle" >0.1250</td><td align="center" valign="middle" >0.1176</td><td align="center" valign="middle" >0.1111</td><td align="center" valign="middle" >0.1053</td><td align="center" valign="middle" >0.1000</td></tr></tbody></table></table-wrap><p>expression (8) for the Newcomb-Benford law. This expression approximately coincides with Benford’s formula (1) up to the second order in some small parameter, which is less than 1.</p><p>The results are generalized to the arbitrary base of positional numeral systems. The systems with the higher base are preferable in digital encryption because for them digit frequencies are close to each other and maximal possible deviations overlap.</p><p>The elaborated method of extremal digital frequencies applies to the second decimal place. In this case, a smooth dependence on the digit value may prevent the method from applying to the check of numerical arrays for fraud.</p><p>Perhaps, taking into account information about the boundaries of the numerical arrays under consideration will narrow the confidence intervals and improve the correspondence of the calculated and measured frequencies in the case of truthful data. Work in this direction is expected to be carried out in the near future.</p></sec><sec id="s8"><title>Acknowledgements</title><p>The author is grateful to Professor Edward Bormashenko who brought his attention to this field.</p></sec><sec id="s9"><title>Conflicts of Interest</title><p>The author declares no conflicts of interest regarding the publication of this paper.</p></sec><sec id="s10"><title>Cite this paper</title><p>Whyman, G. (2021) Origin, Alternative Expressions of Newcomb-Benford Law and Deviations of Digit Frequencies. Applied Mathematics, 12, 576-586. https://doi.org/10.4236/am.2021.127041</p></sec></body><back><ref-list><title>References</title><ref id="scirp.110462-ref1"><label>1</label><mixed-citation publication-type="other" xlink:type="simple">Newcomb, S. (1881) Note on the Frequency of Use of Different Digits in Natural Numbers. American Journal of Mathematics, 4, 39-40. https://doi.org/10.2307/2369148</mixed-citation></ref><ref id="scirp.110462-ref2"><label>2</label><mixed-citation publication-type="journal" xlink:type="simple"><name name-style="western"><surname>Benford</surname><given-names> F. </given-names></name>,<etal>et al</etal>. (<year>1938</year>)<article-title>The Law of Anomalous Numbers</article-title><source> Proceedings of the American Philosophical Society</source><volume> 78</volume>,<fpage> 551</fpage>-<lpage>572</lpage>.<pub-id pub-id-type="doi"></pub-id></mixed-citation></ref><ref id="scirp.110462-ref3"><label>3</label><mixed-citation publication-type="other" xlink:type="simple">Pain, J.-C. (2008) Benford’s Law and Complex Atomic Spectra. Physical Review E, 77, Article ID: 012102. https://doi.org/10.1103/PhysRevE.77.012102</mixed-citation></ref><ref id="scirp.110462-ref4"><label>4</label><mixed-citation publication-type="other" xlink:type="simple">Mir, T.A. (2012) The Law of the Leading Digits and the World Religions. Physica A, 391, 792-798. https://doi.org/10.1016/j.physa.2011.09.001</mixed-citation></ref><ref id="scirp.110462-ref5"><label>5</label><mixed-citation publication-type="other" xlink:type="simple">Sambridge, M. (2010) Benford’s Law in the Natural Sciences. Geophysical Research Letters, 37, L22301. https://doi.org/10.1029/2010GL044830</mixed-citation></ref><ref id="scirp.110462-ref6"><label>6</label><mixed-citation publication-type="journal" xlink:type="simple"><name name-style="western"><surname>Hernandez Caceres</surname><given-names> J.L. </given-names></name>,<etal>et al</etal>. (<year>2008</year>)<article-title>First Digit Distribution in Some Biological Data Sets. Possible Explanations for Departures from Benford’s Law</article-title><source> The Electronic Journal of Biomedicine</source><volume> 1</volume>,<fpage> 27</fpage>-<lpage>35</lpage>.<pub-id pub-id-type="doi"></pub-id></mixed-citation></ref><ref id="scirp.110462-ref7"><label>7</label><mixed-citation publication-type="other" xlink:type="simple">Friar, J.L, Goldman, T. and Pérez-Mercader, J. (2012) Genome Sizes and the Benford Distribution. PLoS ONE, 7, e36624. https://doi.org/10.1371/journal.pone.0036624</mixed-citation></ref><ref id="scirp.110462-ref8"><label>8</label><mixed-citation publication-type="other" xlink:type="simple">Shao, L. and Ma, B.Q. (2010) Empirical Mantissa Distributions of Pulsars. Astroparticle Physics, 33, 255-262. https://doi.org/10.1016/j.astropartphys.2010.02.003</mixed-citation></ref><ref id="scirp.110462-ref9"><label>9</label><mixed-citation publication-type="other" xlink:type="simple">Mir, T.A., Ausloos, M. and Cerqueti, R. (2014) Benford’s Law Predicted Digit Distribution of Aggregated Income Taxes: The Surprising Conformity of Italian Cities and Regions. The European Physical Journal B, 87, 261. https://doi.org/10.1140/epjb/e2014-50525-2</mixed-citation></ref><ref id="scirp.110462-ref10"><label>10</label><mixed-citation publication-type="other" xlink:type="simple">Kaiser, M. (2019) Benford’s Law as an Indicator of Survey Reliability—Can We Trust Our Data? Journal of Economic Surveys, 33, 1602-1618. https://doi.org/10.1111/joes.12338</mixed-citation></ref><ref id="scirp.110462-ref11"><label>11</label><mixed-citation publication-type="other" xlink:type="simple">Alipour, A. and Alipour, S. (2019) Application of Benford’s Law in Analyzing Geotechnical Data. Civil Engineering Infrastructures Journal, 52, 323-334.</mixed-citation></ref><ref id="scirp.110462-ref12"><label>12</label><mixed-citation publication-type="other" xlink:type="simple">da Silva Azevedo, C., Gon&amp;#231;alves, R.F., Gava, V.L. and de MesquitoSpinola, M. (2021) A Benford’s Law Based Methodology for Fraud Detection in Social Welfare Programs: Bolsa Familia Analysis. Physica A: Statistical Mechanics and Its Applications, 567, Article ID: 125626. https://doi.org/10.1016/j.physa.2020.125626</mixed-citation></ref><ref id="scirp.110462-ref13"><label>13</label><mixed-citation publication-type="other" xlink:type="simple">da Silva, A.J., Floquet, S., Santos, D.O.C. and Lima, R.F. (2020) On the Validation of the Newcomb-Benford Law and the Weibull Distribution in Neuromuscular Transmission. Physica A: Statistical Mechanics and Its Applications, 553, Article ID: 124606. https://doi.org/10.1016/j.physa.2020.124606</mixed-citation></ref><ref id="scirp.110462-ref14"><label>14</label><mixed-citation publication-type="other" xlink:type="simple">Tunalioglu, N. and Erdogan, B. (2019) Usability of the Benford’s Law for the Results of Least Square Estimation. Acta Geodaetica et Geophysica, 54, 315-331. https://doi.org/10.1007/s40328-019-00259-3</mixed-citation></ref><ref id="scirp.110462-ref15"><label>15</label><mixed-citation publication-type="book" xlink:type="simple">Whyman, G., Ohtori, N., Shulzinger, E. and Bormashenko, Ed. (2016) Revisiting the Benford Law: When the Benford-Like Distribution of Leading Digits in Sets of Numerical Data Is Expectable? Physica A: Statistical Mechanics and Its Applications, 461, 595-601. https://doi.org/10.1016/j.physa.2016.06.054</mixed-citation></ref><ref id="scirp.110462-ref16"><label>16</label><mixed-citation publication-type="other" xlink:type="simple">Istrate, C. (2019) Detecting Earnings Management Using Benford’s Law: The Case of Romanian Listed Companies. Journal of Accounting and Management Information Systems, 18, 198-223. https://doi.org/10.24818/jamis.2019.02003</mixed-citation></ref><ref id="scirp.110462-ref17"><label>17</label><mixed-citation publication-type="other" xlink:type="simple">Morag, S. and Salmon-Divon, M. (2019) Characterizing Human Cell Types and Tissue Origin Using the Benford Law. Cells, 8, 1004. https://doi.org/10.3390/cells8091004</mixed-citation></ref><ref id="scirp.110462-ref18"><label>18</label><mixed-citation publication-type="other" xlink:type="simple">Yan, X., Yang, S.-G., Kim, B.J. and Minnhagen, P. (2017) Benford’s Law and First Letter of Word. Physica A: Statistical Mechanics and Its Applications, 512, 305-315. https://doi.org/10.1016/j.physa.2018.08.133</mixed-citation></ref><ref id="scirp.110462-ref19"><label>19</label><mixed-citation publication-type="other" xlink:type="simple">Shulzinger, E. and Bormashenko, E. (2017) On the Universal Quantitative Pattern of the Distribution of Initial Characters in General Dictionaries: The Exponential Distribution Is Valid for Various Languages. Journal of Quantitative Linguistics, 24, 273-288. https://doi.org/10.1080/09296174.2017.1304620</mixed-citation></ref><ref id="scirp.110462-ref20"><label>20</label><mixed-citation publication-type="other" xlink:type="simple">Dantuluri, A. and Desai, S. (2018) Do τ Lepton Branching Fractions Obey Benford’s Law? Physica A: Statistical Mechanics and Its Applications, 506, 919-928. https://doi.org/10.1016/j.physa.2018.05.013</mixed-citation></ref><ref id="scirp.110462-ref21"><label>21</label><mixed-citation publication-type="other" xlink:type="simple">Da Silva, S.B. (2020) Limits of Benford’s Law in Experimental Field. International Journal of Applied Mathematics, 33, 685-695. https://doi.org/10.12732/ijam.v33i4.12</mixed-citation></ref><ref id="scirp.110462-ref22"><label>22</label><mixed-citation publication-type="other" xlink:type="simple">Branets, S. (2019) Detecting Money Laundering with Benford’s Law and Machine Learning. Master’s Thesis, University of Tartu, Faculty of Social Sciences, School of Economics and Business Administration, Tartu.</mixed-citation></ref><ref id="scirp.110462-ref23"><label>23</label><mixed-citation publication-type="other" xlink:type="simple">Hill, T.P. (1995) Base-Invariance Implies Benford’s Law. Proceedings of the American Mathematical Society, 123, 887-895. https://doi.org/10.2307/2160815</mixed-citation></ref><ref id="scirp.110462-ref24"><label>24</label><mixed-citation publication-type="other" xlink:type="simple">Hill, T.P. (1995) A Statistical Derivation of the Significant-Digit Law. Statistical Science, 10, 354-363. https://doi.org/10.1214/ss/1177009869</mixed-citation></ref><ref id="scirp.110462-ref25"><label>25</label><mixed-citation publication-type="other" xlink:type="simple">Berger, A. and Hill, T.P. (2011) Benford’s Law Strikes Back: No Simple Explanation in Sight for Mathematical Gem. The Mathematical Intelligencer, 33, 85-91. https://doi.org/10.1007/s00283-010-9182-3</mixed-citation></ref><ref id="scirp.110462-ref26"><label>26</label><mixed-citation publication-type="other" xlink:type="simple">Pietronero, L., Tosatti, E., Tosatti, V. and Vespignani, A. (2001) Explaining the Uneven Distribution of Numbers in Nature: The Laws of Benford and Zipf. Physica A, 29, 297-304. https://doi.org/10.1016/S0378-4371(00)00633-6</mixed-citation></ref><ref id="scirp.110462-ref27"><label>27</label><mixed-citation publication-type="other" xlink:type="simple">Engel, H.A. and Leuenberger, Ch. (2003) Benford’s Law for Exponential Random Variables. Statistics &amp; Probability Letters, 63, 361-365. https://doi.org/10.1016/S0167-7152(03)00101-9</mixed-citation></ref><ref id="scirp.110462-ref28"><label>28</label><mixed-citation publication-type="other" xlink:type="simple">Fewster, R.M. (2009) A Simple Explanation of Benford’s Law. The American Statistician, 63, 26-32. https://doi.org/10.1198/tast.2009.0005</mixed-citation></ref><ref id="scirp.110462-ref29"><label>29</label><mixed-citation publication-type="book" xlink:type="simple">Whyman, G., Shulzinger, E. and Bormashenko, Ed. (2016) Intuitive Considerations Clarifying the Origin and Applicability of the Benford Law. Results in Physics, 6, 3-6. https://doi.org/10.1016/j.rinp.2015.11.010</mixed-citation></ref><ref id="scirp.110462-ref30"><label>30</label><mixed-citation publication-type="other" xlink:type="simple">Israel Central Bureau of Statistics (2008) Profiles by Locality.https://en.wikipedia.org/wiki/List_of_cities_in_Israel#cite_note-26</mixed-citation></ref><ref id="scirp.110462-ref31"><label>31</label><mixed-citation publication-type="other" xlink:type="simple">https://www.politico.com/2020-election/results</mixed-citation></ref></ref-list></back></article>