<?xml version="1.0" encoding="UTF-8"?><!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v3.0 20080202//EN" "http://dtd.nlm.nih.gov/publishing/3.0/journalpublishing3.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" dtd-version="3.0" xml:lang="en" article-type="research article">
 <front>
  <journal-meta>
   <journal-id journal-id-type="publisher-id">
    ojs
   </journal-id>
   <journal-title-group>
    <journal-title>
     Open Journal of Statistics
    </journal-title>
   </journal-title-group>
   <issn pub-type="epub">
    2161-718X
   </issn>
   <issn publication-format="print">
    2161-7198
   </issn>
   <publisher>
    <publisher-name>
     Scientific Research Publishing
    </publisher-name>
   </publisher>
  </journal-meta>
  <article-meta>
   <article-id pub-id-type="doi">
    10.4236/ojs.2025.154020
   </article-id>
   <article-id pub-id-type="publisher-id">
    ojs-144605
   </article-id>
   <article-categories>
    <subj-group subj-group-type="heading">
     <subject>
      Articles
     </subject>
    </subj-group>
    <subj-group subj-group-type="Discipline-v2">
     <subject>
      Physics 
     </subject>
     <subject>
       Mathematics
     </subject>
    </subj-group>
   </article-categories>
   <title-group>
    <article-title>Maximum Entropy Distribution of Correlated Variables: Application to Human Height and Weight</article-title>
   </title-group>
   <contrib-group>
    <contrib contrib-type="author" xlink:type="simple">
     <name name-style="western">
      <surname>
       Silverman
      </surname>
      <given-names>
       Mark P.
      </given-names>
     </name> 
     <xref ref-type="aff" rid="aff1"> 
      <sup>1</sup>
     </xref> 
     <xref ref-type="aff" rid="aff2"> 
      <sup>2</sup>
     </xref>
    </contrib>
   </contrib-group> 
   <aff id="aff1">
    <addr-line>
     G. A. Jarvis Prof. of Physics, Emer., Trinity College, Hartford, USA
    </addr-line> 
   </aff> 
   <aff id="aff2">
    <addr-line>
     Tall Pines Research, Simsbury, USA
    </addr-line> 
   </aff> 
   <pub-date pub-type="epub">
    <day>
     04
    </day> 
    <month>
     08
    </month>
    <year>
     2025
    </year>
   </pub-date> 
   <volume>
    15
   </volume> 
   <issue>
    04
   </issue>
   <fpage>
    371
   </fpage>
   <lpage>
    389
   </lpage>
   <history>
    <date date-type="received">
     <day>
      8
     </day>
     <month>
      July
     </month>
     <year>
      2025
     </year>
    </date>
    <date date-type="published">
     <day>
      3
     </day>
     <month>
      July
     </month>
     <year>
      2025
     </year> 
    </date> 
    <date date-type="accepted">
     <day>
      3
     </day>
     <month>
      August
     </month>
     <year>
      2025
     </year> 
    </date>
   </history>
   <permissions>
    <copyright-statement>
     © Copyright 2025 by authors and Scientific Research Publishing Inc. 
    </copyright-statement>
    <copyright-year>
     2025
    </copyright-year>
    <license>
     <license-p>
      This work is licensed under the Creative Commons Attribution International License (CC BY). http://creativecommons.org/licenses/by/4.0/
     </license-p>
    </license>
   </permissions>
   <abstract>
    Recent investigations have shown that a bivariate lognormal probability density function predicts the statistical moments and correlations of adult human height and weight so extensively and closely as to pose an enigma regarding the underlying reason for such exactness. No genetic or environmental cause currently accounts for this distribution. In this article, it is shown that the Principle of Maximum Entropy (PME), an inferential method drawn exclusively from probability theory, leads to the joint lognormal distribution of height and weight independent of any physical mechanism of biological development. The operation of the PME entails carrying out a variational procedure on a functional comprising the Shannon information entropy subject to constraints posed by known prior information in the form of expectation values. In the case of height and weight, the prior information consists of the means, variances, and linear correlation of the logarithms of the two variables. When applied to a large anthropometric survey, the maximum entropy distribution resulting from this variational procedure is shown to be astronomically more probable than any other distribution consistent with the prior information. Although the PME provides an explanation of the enigma, the possibility is examined that an underlying stochastic mechanism may also lead to the same distribution.
   </abstract>
   <kwd-group> 
    <kwd>
     Information Entropy
    </kwd> 
    <kwd>
      Principle of Maximum Entropy
    </kwd> 
    <kwd>
      Body Mass Index
    </kwd> 
    <kwd>
      Variability of Height
    </kwd> 
    <kwd>
      Variability of Weight
    </kwd> 
    <kwd>
      Correlation of Height and Weight
    </kwd> 
    <kwd>
      Lognormal Distribution
    </kwd> 
    <kwd>
      Kullback-Leibler Divergence
    </kwd>
   </kwd-group>
  </article-meta>
 </front>
 <body>
  <sec id="s1">
   <title>
    1. Introduction: A Statistical Enigma</title>
   <p>In recent publications, the author derived and discussed the exact probability density functions (PDFs) of the body mass index (BMI) <xref ref-type="bibr" rid="scirp.144605-1">
     [1]
    </xref> <xref ref-type="bibr" rid="scirp.144605-2">
     [2]
    </xref> and the joint distribution of human height and weight <xref ref-type="bibr" rid="scirp.144605-3">
     [3]
    </xref>. The BMI is the most widely used medical risk factor for morbidity and mortality related to weight. Categories of risk are set by the World Health Organization (WHO) <xref ref-type="bibr" rid="scirp.144605-4">
     [4]
    </xref> or by national health organizations like the U.S. National Institutes of Health (NIH) <xref ref-type="bibr" rid="scirp.144605-5">
     [5]
    </xref>. As a mathematically defined quantity</p>
   <p>
    <math display="inline" xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
      <mtext>
        BMI 
      </mtext> 
      <mo>
        ≡ 
      </mo> 
      <mrow> 
       <mi>
         W 
       </mi> 
       <mo>
         / 
       </mo> 
       <mrow> 
        <msup> 
         <mi>
           H 
         </mi> 
         <mn>
           2 
         </mn> 
        </msup> 
       </mrow> 
      </mrow> 
     </mrow> 
    </math>, (1)</p>
   <p>in which W is weight in kg and H is height in meters of an individual, the functional form of the BMI PDF can be derived exactly for any and all demographics using analytical methods to transform and combine functions of random variables <xref ref-type="bibr" rid="scirp.144605-6">
     [6]
    </xref>.</p>
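   <p>As a small worked illustration of Equation (1), the following Python sketch computes the BMI and its logarithm. The function names and the sample values (70 kg, 1.75 m) are hypothetical and chosen only for illustration; the logarithmic form ln(BMI) = ln(W) − 2 ln(H) follows directly from the definition.</p>
   <preformat preformat-type="code">
```python
import math


def body_mass_index(weight_kg: float, height_m: float) -> float:
    """Return BMI = W / H**2 (Equation 1), in kg per square meter."""
    if not (weight_kg > 0 and height_m > 0):
        raise ValueError("weight and height must be positive")
    return weight_kg / height_m ** 2


def log_body_mass_index(weight_kg: float, height_m: float) -> float:
    """Return ln(BMI) = ln(W) - 2 ln(H), the same quantity on a log scale."""
    return math.log(weight_kg) - 2.0 * math.log(height_m)


# Hypothetical individual, used only as an example.
bmi = body_mass_index(70.0, 1.75)
log_bmi = log_body_mass_index(70.0, 1.75)
print(round(bmi, 2))                         # 22.86
print(math.isclose(math.exp(log_bmi), bmi))  # True
```
</preformat>
   <p>On the logarithmic scale the ratio in Equation (1) becomes a linear combination of ln(W) and ln(H), which is convenient whenever the logarithms of height and weight are the quantities being modeled.</p>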
   <p>However, to specify the BMI distribution of a particular population, one must know the statistical distribution of height and weight for that particular group. Since the height and weight of individuals result from a complex interplay of genetic and environmental determinants, no theoretical expression corresponding to Equation (1) is known from which to deduce an exact, or even an approximate, PDF. In such cases, there are basically two different ways to proceed:</p>
   <p>1) Hypothesize the form that the sought-for theoretical distribution should take based on prior information, such as acquired from an anthropometric survey of a sampled population.</p>
   <p>2) Apply some fundamental physical or epistemological principle that makes use of prior information and leads by analysis to a unique statistical distribution.</p>
   <p>In arriving at the joint probability density function for human height and weight, the author employed both methods. The first method was recently published <xref ref-type="bibr" rid="scirp.144605-3">
     [3]
    </xref> and led to a bivariate lognormal distribution that, when tested against a large anthropometric survey <xref ref-type="bibr" rid="scirp.144605-7">
     [7]
    </xref>, matched the statistics so closely and so extensively as to pose an enigma regarding the underlying reason. The second method, which is based on the Principle of Maximum Entropy (PME), provides a possible explanation of the apparent exactness of the empirical distribution. This is the subject of the present paper.</p>
   <sec id="s1_1">
    <title>1.1. Inference Based on Empirical Evidence</title>
    <p>In the first approach, full details of which can be found in Ref. <xref ref-type="bibr" rid="scirp.144605-3">
      [3]
      </xref>, various empirical features of the anthropometric data suggested that height and weight together are distributed as a bivariate lognormal random variable with probability density</p>
    <p>
     <math display="inline" xmlns="http://www.w3.org/1998/Math/MathML"> <mtable columnalign="left"> 
       <mtr> 
        <mtd> 
         <msub> 
          <mi>
            p 
          </mi> 
          <mrow> 
           <mrow> 
            <mo>
              ( 
            </mo> 
            <mrow> 
             <mi>
               H 
             </mi> 
             <mo>
               , 
             </mo> 
             <mi>
               W 
             </mi> 
            </mrow> 
            <mo>
              ) 
            </mo> 
           </mrow> 
          </mrow> 
         </msub> 
         <mrow> 
          <mo>
            ( 
          </mo> 
          <mrow> 
           <mi>
             h 
           </mi> 
           <mo>
             , 
           </mo> 
           <mi>
             w 
           </mi> 
          </mrow> 
          <mo>
            ) 
          </mo> 
         </mrow> 
        </mtd> 
       </mtr> 
       <mtr> 
        <mtd> 
         <mo>
           = 
         </mo> 
         <mfrac> 
          <mrow> 
           <mi>
             exp 
           </mi> 
           <mrow> 
            <mo>
              ( 
            </mo> 
            <mrow> 
             <mo>
               − 
             </mo> 
             <mfrac> 
              <mn>
                1 
              </mn> 
              <mrow> 
               <mn>
                 2 
               </mn> 
               <mrow> 
                <mo>
                  ( 
                </mo> 
                <mrow> 
                 <mn>
                   1 
                 </mn> 
                 <mo>
                   − 
                 </mo> 
                 <msup> 
                  <mi>
                    r 
                  </mi> 
                  <mn>
                    2 
                  </mn> 
                 </msup> 
                </mrow> 
                <mo>
                  ) 
                </mo> 
               </mrow> 
              </mrow> 
             </mfrac> 
             <mrow> 
              <mo>
                [ 
              </mo> 
              <mrow> 
               <msup> 
                <mrow> 
                 <mrow> 
                  <mo>
                    ( 
                  </mo> 
                  <mrow> 
                   <mfrac> 
                    <mrow> 
                     <mi>
                       ln 
                     </mi> 
                     <mrow> 
                      <mo>
                        ( 
                      </mo> 
                      <mi>
                        h 
                      </mi> 
                      <mo>
                        ) 
                      </mo> 
                     </mrow> 
                     <mo>
                       − 
                     </mo> 
                     <msub> 
                      <mi>
                        m 
                      </mi> 
                      <mn>
                        1 
                      </mn> 
                     </msub> 
                    </mrow> 
                    <mrow> 
                     <msub> 
                      <mi>
                        s 
                      </mi> 
                      <mn>
                        1 
                      </mn> 
                     </msub> 
                    </mrow> 
                   </mfrac> 
                  </mrow> 
                  <mo>
                    ) 
                  </mo> 
                 </mrow> 
                </mrow> 
                <mn>
                  2 
                </mn> 
               </msup> 
               <mo>
                 − 
               </mo> 
               <mn>
                 2 
               </mn> 
               <mi>
                 r 
               </mi> 
               <mrow> 
                <mo>
                  ( 
                </mo> 
                <mrow> 
                 <mfrac> 
                  <mrow> 
                   <mi>
                     ln 
                   </mi> 
                   <mrow> 
                    <mo>
                      ( 
                    </mo> 
                    <mi>
                      h 
                    </mi> 
                    <mo>
                      ) 
                    </mo> 
                   </mrow> 
                   <mo>
                     − 
                   </mo> 
                   <msub> 
                    <mi>
                      m 
                    </mi> 
                    <mn>
                      1 
                    </mn> 
                   </msub> 
                  </mrow> 
                  <mrow> 
                   <msub> 
                    <mi>
                      s 
                    </mi> 
                    <mn>
                      1 
                    </mn> 
                   </msub> 
                  </mrow> 
                 </mfrac> 
                </mrow> 
                <mo>
                  ) 
                </mo> 
               </mrow> 
               <mrow> 
                <mo>
                  ( 
                </mo> 
                <mrow> 
                 <mfrac> 
                  <mrow> 
                   <mi>
                     ln 
                   </mi> 
                   <mrow> 
                    <mo>
                      ( 
                    </mo> 
                    <mi>
                      w 
                    </mi> 
                    <mo>
                      ) 
                    </mo> 
                   </mrow> 
                   <mo>
                     − 
                   </mo> 
                   <msub> 
                    <mi>
                      m 
                    </mi> 
                    <mn>
                      2 
                    </mn> 
                   </msub> 
                  </mrow> 
                  <mrow> 
                   <msub> 
                    <mi>
                      s 
                    </mi> 
                    <mn>
                      2 
                    </mn> 
                   </msub> 
                  </mrow> 
                 </mfrac> 
                </mrow> 
                <mo>
                  ) 
                </mo> 
               </mrow> 
               <mo>
                 + 
               </mo> 
               <msup> 
                <mrow> 
                 <mrow> 
                  <mo>
                    ( 
                  </mo> 
                  <mrow> 
                   <mfrac> 
                    <mrow> 
                     <mi>
                       ln 
                     </mi> 
                     <mrow> 
                      <mo>
                        ( 
                      </mo> 
                      <mi>
                        w 
                      </mi> 
                      <mo>
                        ) 
                      </mo> 
                     </mrow> 
                     <mo>
                       − 
                     </mo> 
                     <msub> 
                      <mi>
                        m 
                      </mi> 
                      <mn>
                        2 
                      </mn> 
                     </msub> 
                    </mrow> 
                    <mrow> 
                     <msub> 
                      <mi>
                        s 
                      </mi> 
                      <mn>
                        2 
                      </mn> 
                     </msub> 
                    </mrow> 
                   </mfrac> 
                  </mrow> 
                  <mo>
                    ) 
                  </mo> 
                 </mrow> 
                </mrow> 
                <mn>
                  2 
                </mn> 
               </msup> 
              </mrow> 
              <mo>
                ] 
              </mo> 
             </mrow> 
            </mrow> 
            <mo>
              ) 
            </mo> 
           </mrow> 
          </mrow> 
          <mrow> 
           <mn>
             2 
           </mn> 
           <mi>
             π 
           </mi> 
           <msub> 
            <mi>
              s 
            </mi> 
            <mn>
              1 
            </mn> 
           </msub> 
           <msub> 
            <mi>
              s 
            </mi> 
            <mn>
              2 
            </mn> 
           </msub> 
           <mi>
             h 
           </mi> 
           <mi>
             w 
           </mi> 
           <msqrt> 
            <mrow> 
             <mn>
               1 
             </mn> 
             <mo>
               − 
             </mo> 
             <msup> 
              <mi>
                r 
              </mi> 
              <mn>
                2 
              </mn> 
             </msup> 
            </mrow> 
           </msqrt> 
          </mrow> 
         </mfrac> 
        </mtd> 
       </mtr> 
      </mtable> 
     </math> (2)</p>
    <p>in which upper case letters (H, W) represent random variables, lower case letters (h, w) represent realizations of the variables (e.g. outcomes of measurement or inputs to calculation), and the associated parameters are defined as follows:</p>
    <p>
     <math xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <msub> 
        <mi>
          m 
        </mi> 
        <mn>
          1 
        </mn> 
       </msub> 
       <mo>
         = 
       </mo> 
       <mrow> 
        <mo>
          〈 
        </mo> 
        <mrow> 
         <mi>
           ln 
         </mi> 
         <mrow> 
          <mo>
            ( 
          </mo> 
          <mi>
            H 
          </mi> 
          <mo>
            ) 
          </mo> 
         </mrow> 
        </mrow> 
        <mo>
          〉 
        </mo> 
       </mrow> 
       <mtext>
           
       </mtext> 
      </mrow> 
     </math> (3)</p>
    <p>
     <math xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <msubsup> 
        <mi>
          s 
        </mi> 
        <mn>
          1 
        </mn> 
        <mn>
          2 
        </mn> 
       </msubsup> 
       <mo>
         = 
       </mo> 
       <mi>
         var 
       </mi> 
       <mrow> 
        <mo>
          ( 
        </mo> 
        <mrow> 
         <mi>
           ln 
         </mi> 
         <mrow> 
          <mo>
            ( 
          </mo> 
          <mi>
            H 
          </mi> 
          <mo>
            ) 
          </mo> 
         </mrow> 
        </mrow> 
        <mo>
          ) 
        </mo> 
       </mrow> 
       <mo>
         = 
       </mo> 
       <mrow> 
        <mo>
          〈 
        </mo> 
        <mrow> 
         <msup> 
          <mrow> 
           <mrow> 
            <mo>
              ( 
            </mo> 
            <mrow> 
             <mi>
               ln 
             </mi> 
             <mrow> 
              <mo>
                ( 
              </mo> 
              <mi>
                H 
              </mi> 
              <mo>
                ) 
              </mo> 
             </mrow> 
             <mo>
               − 
             </mo> 
              <msub> 
               <mi>
                 m 
               </mi> 
               <mn>
                 1 
               </mn> 
              </msub> 
            </mrow> 
            <mo>
              ) 
            </mo> 
           </mrow> 
          </mrow> 
          <mn>
            2 
          </mn> 
         </msup> 
        </mrow> 
        <mo>
          〉 
        </mo> 
       </mrow> 
      </mrow> 
     </math> (4)</p>
    <p>
     <math display="inline" xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <msub> 
        <mi>
          m 
        </mi> 
        <mn>
          2 
        </mn> 
       </msub> 
       <mo>
         = 
       </mo> 
       <mrow> 
        <mo>
          〈 
        </mo> 
        <mrow> 
         <mi>
           ln 
         </mi> 
         <mrow> 
          <mo>
            ( 
          </mo> 
          <mi>
            W 
          </mi> 
          <mo>
            ) 
          </mo> 
         </mrow> 
        </mrow> 
        <mo>
          〉 
        </mo> 
       </mrow> 
      </mrow> 
     </math> (5)</p>
    <p>
     <math xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <msubsup> 
        <mi>
          s 
        </mi> 
        <mn>
          2 
        </mn> 
        <mn>
          2 
        </mn> 
       </msubsup> 
       <mo>
         = 
       </mo> 
       <mi>
         var 
       </mi> 
       <mrow> 
        <mo>
          ( 
        </mo> 
        <mrow> 
         <mi>
           ln 
         </mi> 
         <mrow> 
          <mo>
            ( 
          </mo> 
          <mi>
            W 
          </mi> 
          <mo>
            ) 
          </mo> 
         </mrow> 
        </mrow> 
        <mo>
          ) 
        </mo> 
       </mrow> 
       <mo>
         = 
       </mo> 
       <mrow> 
        <mo>
          〈 
        </mo> 
        <mrow> 
         <msup> 
          <mrow> 
           <mrow> 
            <mo>
              ( 
            </mo> 
            <mrow> 
             <mi>
               ln 
             </mi> 
             <mrow> 
              <mo>
                ( 
              </mo> 
              <mi>
                W 
              </mi> 
              <mo>
                ) 
              </mo> 
             </mrow> 
             <mo>
               − 
             </mo> 
              <msub> 
               <mi>
                 m 
               </mi> 
               <mn>
                 2 
               </mn> 
              </msub> 
            </mrow> 
            <mo>
              ) 
            </mo> 
           </mrow> 
          </mrow> 
          <mn>
            2 
          </mn> 
         </msup> 
        </mrow> 
        <mo>
          〉 
        </mo> 
       </mrow> 
      </mrow> 
      </math> (6)</p>
    <p>
     <math display="inline" xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <mi>
         r 
       </mi> 
       <mo>
         = 
       </mo> 
       <mrow> 
        <mrow> 
         <mi>
           cov 
         </mi> 
         <mrow> 
          <mo>
            ( 
          </mo> 
          <mrow> 
           <mi>
             ln 
           </mi> 
           <mrow> 
            <mo>
              ( 
            </mo> 
            <mi>
              H 
            </mi> 
            <mo>
              ) 
            </mo> 
           </mrow> 
           <mo>
             , 
           </mo> 
           <mi>
             ln 
           </mi> 
           <mrow> 
            <mo>
              ( 
            </mo> 
            <mi>
              W 
            </mi> 
            <mo>
              ) 
            </mo> 
           </mrow> 
          </mrow> 
          <mo>
            ) 
          </mo> 
         </mrow> 
        </mrow> 
        <mo>
          / 
        </mo> 
        <mrow> 
         <msub> 
          <mi>
            s 
          </mi> 
          <mn>
            1 
          </mn> 
         </msub> 
         <msub> 
          <mi>
            s 
          </mi> 
          <mn>
            2 
          </mn> 
         </msub> 
        </mrow> 
       </mrow> 
       <mo>
         = 
       </mo> 
       <mrow> 
        <mo>
          〈 
        </mo> 
        <mrow> 
         <mrow> 
          <mo>
            ( 
          </mo> 
          <mrow> 
           <mfrac> 
            <mrow> 
             <mi>
               ln 
             </mi> 
             <mrow> 
              <mo>
                ( 
              </mo> 
              <mi>
                H 
              </mi> 
              <mo>
                ) 
              </mo> 
             </mrow> 
             <mo>
               − 
             </mo> 
             <msub> 
              <mi>
                m 
              </mi> 
              <mn>
                1 
              </mn> 
             </msub> 
            </mrow> 
            <mrow> 
             <msub> 
              <mi>
                s 
              </mi> 
              <mn>
                1 
              </mn> 
             </msub> 
            </mrow> 
           </mfrac> 
          </mrow> 
          <mo>
            ) 
          </mo> 
         </mrow> 
         <mrow> 
          <mo>
            ( 
          </mo> 
          <mrow> 
           <mfrac> 
            <mrow> 
             <mi>
               ln 
             </mi> 
             <mrow> 
              <mo>
                ( 
              </mo> 
              <mi>
                W 
              </mi> 
              <mo>
                ) 
              </mo> 
             </mrow> 
             <mo>
               − 
             </mo> 
             <msub> 
              <mi>
                m 
              </mi> 
              <mn>
                2 
              </mn> 
             </msub> 
            </mrow> 
            <mrow> 
             <msub> 
              <mi>
                s 
              </mi> 
              <mn>
                2 
              </mn> 
             </msub> 
            </mrow> 
           </mfrac> 
          </mrow> 
          <mo>
            ) 
          </mo> 
         </mrow> 
        </mrow> 
        <mo>
          〉 
        </mo> 
       </mrow> 
      </mrow> 
     </math>. (7)</p>
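     <p>For readers who wish to evaluate Equation (2) numerically, a minimal Python sketch is given here. It is an illustration, not code from Ref. [3]: the function names are hypothetical, and the parameter values in the usage lines are placeholders rather than fitted survey values. The sketch also checks that, for r = 0, the joint density factors into the product of two univariate lognormal densities.</p>
     <preformat preformat-type="code">
```python
import math


def bivariate_lognormal_pdf(h, w, m1, s1, m2, s2, r):
    """Evaluate the density of Equation (2) at one point (h, w)."""
    if not (s1 > 0 and s2 > 0 and 1 > abs(r)):
        raise ValueError("require s1 > 0, s2 > 0 and |r| strictly below 1")
    if not (h > 0 and w > 0):
        return 0.0  # the density is supported on positive h and w only
    u = (math.log(h) - m1) / s1   # standardized ln(h)
    v = (math.log(w) - m2) / s2   # standardized ln(w)
    one_minus_r2 = 1.0 - r * r
    quad = (u * u - 2.0 * r * u * v + v * v) / one_minus_r2
    norm = 2.0 * math.pi * s1 * s2 * h * w * math.sqrt(one_minus_r2)
    return math.exp(-0.5 * quad) / norm


def lognormal_pdf(x, m, s):
    """Univariate lognormal density, used only for the r = 0 check."""
    z = (math.log(x) - m) / s
    return math.exp(-0.5 * z * z) / (x * s * math.sqrt(2.0 * math.pi))


# Placeholder parameters (hypothetical, for illustration only).
m1, s1, m2, s2 = 0.5, 0.04, 4.4, 0.16
joint = bivariate_lognormal_pdf(1.7, 80.0, m1, s1, m2, s2, r=0.0)
product = lognormal_pdf(1.7, m1, s1) * lognormal_pdf(80.0, m2, s2)
print(math.isclose(joint, product, rel_tol=1e-9))
```
</preformat>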
    <p>Angular brackets 
     <math display="inline" xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <mrow> 
        <mo>
          〈 
        </mo> 
        <mtext>
            
        </mtext> 
        <mo>
          〉 
        </mo> 
       </mrow> 
      </mrow> 
     </math> symbolize the expectation value of any discrete function 
     <math xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <msub> 
        <mi>
          f 
        </mi> 
        <mrow> 
         <mi>
           h 
         </mi> 
         <mo>
           , 
         </mo> 
         <mi>
           w 
         </mi> 
        </mrow> 
       </msub> 
      </mrow> 
     </math> or continuous function 
     <math xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <mi>
         f 
       </mi> 
       <mrow> 
        <mo>
          ( 
        </mo> 
        <mrow> 
         <mi>
           h 
         </mi> 
         <mo>
           , 
         </mo> 
         <mi>
           w 
         </mi> 
        </mrow> 
        <mo>
          ) 
        </mo> 
       </mrow> 
      </mrow> 
     </math></p>
    <p>
     <math xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <mrow> 
        <mo>
          〈 
        </mo> 
        <mi>
          f 
        </mi> 
        <mo>
          〉 
        </mo> 
       </mrow> 
       <mo>
         = 
       </mo> 
       <mrow> 
        <mo>
          { 
        </mo> 
        <mtable columnalign="left"> 
         <mtr> 
          <mtd> 
           <mstyle displaystyle="true"> 
            <munder> 
             <mo>
               ∑ 
             </mo> 
             <mrow> 
              <mi>
                h 
              </mi> 
              <mo>
                , 
              </mo> 
              <mi>
                w 
              </mi> 
             </mrow> 
            </munder> 
            <mrow> 
             <msub> 
              <mi>
                f 
              </mi> 
              <mrow> 
               <mi>
                 h 
               </mi> 
               <mo>
                 , 
               </mo> 
               <mi>
                 w 
               </mi> 
              </mrow> 
             </msub> 
             <msub> 
              <mi>
                p 
              </mi> 
              <mrow> 
               <mi>
                 h 
               </mi> 
               <mo>
                 , 
               </mo> 
               <mi>
                 w 
               </mi> 
              </mrow> 
             </msub> 
            </mrow> 
           </mstyle> 
          </mtd> 
         </mtr> 
         <mtr> 
          <mtd> 
           <mstyle displaystyle="true"> 
            <mrow> 
             <mo>
               ∬ 
             </mo> 
             <mrow> 
              <mi>
                f 
              </mi> 
              <mrow> 
               <mo>
                 ( 
               </mo> 
               <mrow> 
                <mi>
                  h 
                </mi> 
                <mo>
                  , 
                </mo> 
                <mi>
                  w 
                </mi> 
               </mrow> 
               <mo>
                 ) 
               </mo> 
              </mrow> 
              <msub> 
               <mi>
                 p 
               </mi> 
               <mrow> 
                <mi>
                  H 
                </mi> 
                <mo>
                  , 
                </mo> 
                <mi>
                  W 
                </mi> 
               </mrow> 
              </msub> 
              <mrow> 
               <mo>
                 ( 
               </mo> 
               <mrow> 
                <mi>
                  h 
                </mi> 
                <mo>
                  , 
                </mo> 
                <mi>
                  w 
                </mi> 
               </mrow> 
               <mo>
                 ) 
               </mo> 
              </mrow> 
              <mtext>
                d 
              </mtext> 
              <mi>
                h 
              </mi> 
              <mtext>
                  
              </mtext> 
              <mtext>
                d 
              </mtext> 
              <mi>
                w 
              </mi> 
             </mrow> 
            </mrow> 
           </mstyle> 
          </mtd> 
         </mtr> 
        </mtable> 
       </mrow> 
      </mrow> 
     </math> (8)</p>
    <p>with summation or integration over a range of non-negative real numbers. The parameter r in Equation (7) is referred to as the linear correlation coefficient; it is a measure of the first-order covariance of two random variables <xref ref-type="bibr" rid="scirp.144605-8">
      [8]
     </xref>.</p>
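     <p>The discrete form of Equation (8) can be applied to a sample by giving each of the n observations the equal weight 1/n. That weighting is an assumption of the following sketch and is not a procedure taken from the paper. Under it, Equations (3) to (7) reduce to the sample estimates below. The function name and the toy data are hypothetical.</p>
     <preformat preformat-type="code">
```python
import math


def log_moment_estimates(heights, weights):
    """Estimate (m1, s1, m2, s2, r) of Equations (3)-(7) from paired data.

    Each observation is weighted 1/n, so the variances are the
    population (1/n) form rather than the unbiased 1/(n - 1) form.
    """
    n = len(heights)
    if n == 0 or n != len(weights):
        raise ValueError("need two non-empty sequences of equal length")
    x = [math.log(h) for h in heights]   # ln(H) values
    y = [math.log(w) for w in weights]   # ln(W) values
    m1 = sum(x) / n
    m2 = sum(y) / n
    s1 = math.sqrt(sum((xi - m1) ** 2 for xi in x) / n)
    s2 = math.sqrt(sum((yi - m2) ** 2 for yi in y) / n)
    if s1 == 0 or s2 == 0:
        raise ValueError("correlation is undefined for constant data")
    r = sum(((xi - m1) / s1) * ((yi - m2) / s2) for xi, yi in zip(x, y)) / n
    return m1, s1, m2, s2, r


# Toy data (hypothetical): weight exactly proportional to height squared,
# so ln(W) is a linear function of ln(H) and r should equal 1.
toy_heights = [1.5, 1.6, 1.7, 1.8]
toy_weights = [25.0 * h * h for h in toy_heights]
m1, s1, m2, s2, r = log_moment_estimates(toy_heights, toy_weights)
print(round(r, 6), round(s2 / s1, 6))
```
</preformat>
     <p>In the toy data the weight is a fixed multiple of the squared height, so the estimate of r is 1 to within rounding and the estimate of s2 is twice that of s1. Real survey data would give a value of r strictly between these extremes.</p>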
    <p>The features of the sample distributions of height and weight that suggested a bivariate lognormal distribution included (1) marked asymmetry about the mean as discerned graphically or measured quantitatively by the statistic skewness (related to the 3rd statistical moment), (2) a nonzero correlation coefficient of height and weight, and especially (3) the fact that histograms of the natural logarithms of the sampled height and weight passed statistical tests for normal (i.e. Gaussian) distributions, consistent with the definition of a lognormal distribution <xref ref-type="bibr" rid="scirp.144605-9">
      [9]
     </xref>.</p>
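     <p>The three diagnostic features listed above can be illustrated on synthetic data. The following is a minimal sketch, not the analysis performed on the ANSUR sample: it draws pairs whose logarithms are bivariate normal and reports the skewness of the raw variables, their linear correlation, and the skewness and a normality-test p-value of the logarithms. The function names, the sample size, the random seed, and the choice of the D'Agostino-Pearson test (<monospace>scipy.stats.normaltest</monospace>) are assumptions made for illustration; the parameter values are those quoted in this section for the male cohort.</p>
     <preformat preformat-type="code" xml:space="preserve">
```python
import numpy as np
from scipy import stats

# Illustrative log-scale parameters (means, standard deviations, correlation).
M1, S1 = 4.4351, 0.1654
M2, S2 = 0.5624, 0.0390
R = 0.4716

def simulate_lognormal_pair(n, seed=0):
    """Draw n synthetic pairs whose logarithms are bivariate normal."""
    rng = np.random.default_rng(seed)
    cov = [[S1**2, R * S1 * S2], [R * S1 * S2, S2**2]]
    logs = rng.multivariate_normal([M1, M2], cov, size=n)
    return np.exp(logs)

def diagnostics(sample):
    """Return the three features discussed in the text for a two-column sample."""
    logs = np.log(sample)
    return {
        "skew_raw": stats.skew(sample, axis=0),       # feature (1): asymmetry
        "corr_raw": np.corrcoef(sample.T)[0, 1],      # feature (2): correlation
        "skew_log": stats.skew(logs, axis=0),         # feature (3): logs are symmetric
        "normality_p": stats.normaltest(logs, axis=0).pvalue,
    }

sample = simulate_lognormal_pair(200_000)
report = diagnostics(sample)
```
     </preformat>
     <p>For data of this kind the raw columns show positive skewness, whereas the skewness of the logarithms is close to zero, which is the pattern that motivates a lognormal model.</p>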
    <p>The statistical content of Equation (2) and its marginal distributions were tested comprehensively against the extensive database of the Anthropometric Survey of U.S. Army Personnel (ANSUR) <xref ref-type="bibr" rid="scirp.144605-7">
      [7]
      </xref>. The survey covered both genders, 5 age categories ranging from younger than 20 to older than 41, 7 broad categories of race further classified into approximately 30 ethnic subpopulations, 51 US birthplaces (50 states and Washington, DC), and approximately 30 international birthplaces. Because members of the military must pass fitness requirements for acceptance, the sample is presumed representative of a large, diverse group of basically healthy adults. Statistical tests were carried out separately for the male and female cohorts, comprising 4082 and 1986 subjects, respectively. <xref ref-type="fig" rid="fig1">
      Figure 1
     </xref> shows a 3-dimensional plot of the distribution function (2) for the set of parameters that apply to the ANSUR male cohort: 
     <math xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <msub> 
        <mi>
          m 
        </mi> 
        <mn>
          1 
        </mn> 
       </msub> 
       <mo>
         = 
       </mo> 
       <mn>
         4.4351 
       </mn> 
      </mrow> 
     </math>, 
     <math xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <msub> 
        <mi>
          m 
        </mi> 
        <mn>
          2 
        </mn> 
       </msub> 
       <mo>
         = 
       </mo> 
       <mn>
         0.5624 
       </mn> 
      </mrow> 
     </math>, 
     <math xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <msub> 
        <mi>
          s 
        </mi> 
        <mn>
          1 
        </mn> 
       </msub> 
       <mo>
         = 
       </mo> 
       <mn>
         0.1654 
       </mn> 
      </mrow> 
     </math>, 
     <math xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <msub> 
        <mi>
          s 
        </mi> 
        <mn>
          2 
        </mn> 
       </msub> 
       <mo>
         = 
       </mo> 
       <mn>
         0.0390 
       </mn> 
      </mrow> 
     </math>, 
     <math xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <mi>
         r 
       </mi> 
       <mo>
         = 
       </mo> 
       <mn>
         0.4716 
       </mn> 
      </mrow> 
     </math>. The second frame of the figure shows an orientation rotated 180˚ relative to the first frame.</p>
    <p>For a conjectured distribution with the foregoing characteristics, one might expect the probability density in Equation (2) to approximate a hypothetical “true” probability density reasonably well. In fact, the outcome of the tests revealed agreement between theoretical predictions and data to an astonishing degree.</p>
    <fig-group id="fig1" position="float">
     <fig id="fig1" position="float">
      <label>Figure 1</label>
      <caption>
       <title>(a) Plot of the probability density function of the bivariate lognormal distribution of human height and weight for the male ANSUR cohort. Contour lines show lines of variable height for fixed weight and lines of variable weight for fixed height. These two sets of contours are mutually orthogonal and lognormal in form. The views shown in (a) and (b) differ in orientation by 180˚, as indicated by the mirror reflection of the colors in the base plane.</title>
      </caption>
      <graphic mimetype="image" position="float" xlink:type="simple" xlink:href="https://html.scirp.org/file/1241960-rId45.jpeg?20250806111932" />
     </fig>
     <fig id="fig1" position="float">
      <label>Figure 1</label>
      <caption>
       <title>(b) The probability density function of the bivariate lognormal distribution of human height and weight for the male ANSUR cohort, viewed in an orientation rotated by 180˚ relative to (a).</title>
      </caption>
      <graphic mimetype="image" position="float" xlink:type="simple" xlink:href="https://html.scirp.org/file/1241960-rId46.jpeg?20250806111932" />
     </fig>
    </fig-group>
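    <p>A surface of the kind shown in Figure 1 can be evaluated numerically from the five quoted parameters. The sketch below uses the standard textbook form of the bivariate lognormal density, which is assumed here to coincide with Equation (2). The function names and the generic arguments <monospace>u</monospace> and <monospace>v</monospace> are illustrative; which of them denotes height and which weight, and in what units, follows the parameter definitions in Equations (3)-(7) and is not assumed by the code. The second function is a simple Riemann-sum check, on a grid in logarithmic coordinates, that the density integrates to one.</p>
    <preformat preformat-type="code" xml:space="preserve">
```python
import numpy as np

# Parameters quoted in the text for the ANSUR male cohort.
M1, M2 = 4.4351, 0.5624
S1, S2 = 0.1654, 0.0390
R = 0.4716

def bivariate_lognormal_pdf(u, v, m1=M1, s1=S1, m2=M2, s2=S2, r=R):
    """Standard bivariate lognormal density for positive u and v."""
    z1 = (np.log(u) - m1) / s1
    z2 = (np.log(v) - m2) / s2
    quad = (z1**2 - 2.0 * r * z1 * z2 + z2**2) / (2.0 * (1.0 - r**2))
    norm = 2.0 * np.pi * s1 * s2 * np.sqrt(1.0 - r**2) * u * v
    return np.exp(-quad) / norm

def total_probability(n=401, width=8.0):
    """Riemann-sum check that the density integrates to one."""
    x = np.linspace(M1 - width * S1, M1 + width * S1, n)   # grid in ln(u)
    y = np.linspace(M2 - width * S2, M2 + width * S2, n)   # grid in ln(v)
    X, Y = np.meshgrid(x, y, indexing="ij")
    U, V = np.exp(X), np.exp(Y)
    dx, dy = x[1] - x[0], y[1] - y[0]
    # du dv = u v dx dy under the change of variables u = exp(x), v = exp(y)
    return float(np.sum(bivariate_lognormal_pdf(U, V) * U * V) * dx * dy)
```
    </preformat>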
    <p>Upon input of the numerical values of the five parameters of Equations (3)-(7) obtained for each gender from the ANSUR population, the statistical predictions of Equation (2) matched corresponding sample statistics over an extensive hierarchy of higher moments and correlations limited only by statistical uncertainties due to finite sample size. Especially interesting was the fact that the single linear correlation coefficient r, together with the other four parameters, sufficed for correctly predicting all accessible higher-order nonlinear correlations of the data.</p>
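    <p>The reason five parameters suffice can be made concrete: for any bivariate lognormal pair, every mixed moment has a closed form in the log-scale means, standard deviations, and correlation. The sketch below evaluates that standard result; the function names are hypothetical and the code is an illustration rather than the procedure used in Ref. [3]. The arguments are ordered as (m1, s1, m2, s2, r).</p>
    <preformat preformat-type="code" xml:space="preserve">
```python
import numpy as np

def lognormal_moment(j, k, m1, s1, m2, s2, r):
    """E[U**j * V**k] when (ln U, ln V) is bivariate normal with the given parameters."""
    mean_part = j * m1 + k * m2
    var_part = (j * s1) ** 2 + 2.0 * j * k * r * s1 * s2 + (k * s2) ** 2
    return float(np.exp(mean_part + 0.5 * var_part))

def lognormal_correlation(m1, s1, m2, s2, r):
    """Pearson correlation of U and V implied by the five log-scale parameters."""
    p = (m1, s1, m2, s2, r)
    mean_u, mean_v = lognormal_moment(1, 0, *p), lognormal_moment(0, 1, *p)
    cov = lognormal_moment(1, 1, *p) - mean_u * mean_v
    var_u = lognormal_moment(2, 0, *p) - mean_u**2
    var_v = lognormal_moment(0, 2, *p) - mean_v**2
    return float(cov / np.sqrt(var_u * var_v))

# Male-cohort values quoted in the text, in the order (m1, s1, m2, s2, r).
MALE = (4.4351, 0.1654, 0.5624, 0.0390, 0.4716)
rho_linear = lognormal_correlation(*MALE)      # correlation of the untransformed variables
third_order = lognormal_moment(2, 1, *MALE)    # an example higher-order cross moment
```
    </preformat>
    <p>Predictions of this kind can then be compared with the corresponding sample averages, which is the sense in which the single coefficient r fixes the higher-order correlations.</p>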
    <p>The full extent of concordances of lower moments, hyperstatistics, and correlation functions of all four variables 
     <math display="inline" xmlns="http://www.w3.org/1998/Math/MathML"> <mi>
        H 
      </mi> 
     </math>, 
     <math display="inline" xmlns="http://www.w3.org/1998/Math/MathML"> <mi>
        W 
      </mi> 
     </math>, 
     <math display="inline" xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <mi>
         ln 
       </mi> 
       <mrow> 
        <mo>
          ( 
        </mo> 
        <mi>
          H 
        </mi> 
        <mo>
          ) 
        </mo> 
       </mrow> 
      </mrow> 
     </math>, 
     <math display="inline" xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <mi>
         ln 
       </mi> 
       <mrow> 
        <mo>
          ( 
        </mo> 
        <mi>
          W 
        </mi> 
        <mo>
          ) 
        </mo> 
       </mrow> 
      </mrow> 
     </math>, together with exhaustive tests for hidden nonlinear correlations beyond those intrinsic to Equation (2), are discussed in Ref. <xref ref-type="bibr" rid="scirp.144605-3">
      [3]
      </xref>. It seems improbable that such extensive agreement between data and theory is purely coincidental. Consequently, the enigmatic perfection of the bivariate lognormal distribution of human height and weight might be attributable to some fundamental principle. Hence the second approach.</p>
   </sec>
   <sec id="s1_2">
    <title>1.2. Inference From Probability Theory</title>
    <p>The Principle of Maximum Entropy (PME), introduced by Jaynes in the 1950s as a basis for deriving and justifying the fundamental relations of equilibrium statistical mechanics (ESM) <xref ref-type="bibr" rid="scirp.144605-10">
      [10]
     </xref> <xref ref-type="bibr" rid="scirp.144605-11">
      [11]
     </xref>, provides a means of finding the most probable statistical distribution compatible with known prior information. Although the initial motivation was to solve the physics problem of finding a probabilistic, rather than mechanical, explanation of the success of ESM, the PME has since been developed as a general method of inference applicable to problems as diverse as image analysis <xref ref-type="bibr" rid="scirp.144605-12">
      [12]
     </xref>, detection of cheating <xref ref-type="bibr" rid="scirp.144605-13">
      [13]
     </xref>, extraction of information from crowdsourcing <xref ref-type="bibr" rid="scirp.144605-14">
      [14]
     </xref>, and many other applications in science, engineering, and business <xref ref-type="bibr" rid="scirp.144605-15">
      [15]
     </xref>.</p>
    <p>Mathematically, the PME generates a variational procedure by which to maximize the Shannon information entropy function <xref ref-type="bibr" rid="scirp.144605-16">
      [16]
     </xref> augmented by constraints usually posed in the form of expectation values. The virtue of the PME is that it leads to the most objective (i.e. least biased) probability distribution consistent with the given constraints <xref ref-type="bibr" rid="scirp.144605-17">
      [17]
     </xref>. The information entropy, which is equivalent to the entropy function of ESM up to a universal scalar factor (the Boltzmann constant), is defined by an expression of the form</p>
    <p>
     <math display="inline" xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <mi>
         S 
       </mi> 
       <mrow> 
        <mo>
          ( 
        </mo> 
        <mstyle mathvariant="bold" mathsize="normal"> 
         <mi>
           p 
         </mi> 
        </mstyle> 
        <mo>
          ) 
        </mo> 
       </mrow> 
       <mo>
         ≡ 
       </mo> 
       <mi>
         S 
       </mi> 
       <mrow> 
        <mo>
          ( 
        </mo> 
        <mrow> 
         <msub> 
          <mi>
            p 
          </mi> 
          <mn>
            1 
          </mn> 
         </msub> 
         <mo>
           ⋯ 
         </mo> 
         <msub> 
          <mi>
            p 
          </mi> 
          <mi>
            n 
          </mi> 
         </msub> 
        </mrow> 
        <mo>
          ) 
        </mo> 
       </mrow> 
       <mo>
         = 
       </mo> 
       <mo>
         − 
       </mo> 
       <mstyle displaystyle="true"> 
        <munderover> 
         <mo>
           ∑ 
         </mo> 
         <mrow> 
          <mi>
            i 
          </mi> 
          <mo>
            = 
          </mo> 
          <mn>
            1 
          </mn> 
         </mrow> 
         <mi>
           n 
         </mi> 
        </munderover> 
        <mrow> 
         <msub> 
          <mi>
            p 
          </mi> 
          <mi>
            i 
          </mi> 
         </msub> 
         <mi>
           ln 
         </mi> 
         <mrow> 
          <mo>
            ( 
          </mo> 
          <mrow> 
           <msub> 
            <mi>
              p 
            </mi> 
            <mi>
              i 
            </mi> 
           </msub> 
          </mrow> 
          <mo>
            ) 
          </mo> 
         </mrow> 
        </mrow> 
       </mstyle> 
      </mrow> 
     </math> (9)</p>
    <p>in which 
     <math display="inline" xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <msub> 
        <mi>
          p 
        </mi> 
        <mi>
          i 
        </mi> 
       </msub> 
       <mrow> 
        <mo>
          ( 
        </mo> 
        <mrow> 
         <mi>
           i 
         </mi> 
         <mo>
           = 
         </mo> 
         <mn>
           1 
         </mn> 
         <mo>
           , 
         </mo> 
         <mo>
           ⋯ 
         </mo> 
         <mo>
           , 
         </mo> 
         <mi>
           n 
         </mi> 
        </mrow> 
        <mo>
          ) 
        </mo> 
       </mrow> 
      </mrow> 
     </math> is the probability of an outcome 
     <math display="inline" xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <msub> 
        <mi>
          x 
        </mi> 
        <mi>
          i 
        </mi> 
       </msub> 
      </mrow> 
     </math>. Equation (9) can be generalized to apply to the entropy of a continuous variable, an extension that will be discussed later in the paper where the distinction becomes relevant. From Equations (8) and (9) it follows that the information entropy may be thought of as the expectation</p>
    <p>
      <math display="inline" xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
        <mi>
          S 
        </mi> 
        <mrow> 
         <mo>
           ( 
         </mo> 
         <mi>
           p 
         </mi> 
         <mo>
           ) 
         </mo> 
        </mrow> 
        <mo>
          ≡ 
        </mo> 
        <mo>
          − 
        </mo> 
        <mrow> 
         <mo>
           〈 
         </mo> 
         <mrow> 
          <mi>
            ln 
          </mi> 
          <mrow> 
           <mo>
             ( 
           </mo> 
           <mi>
             p 
           </mi> 
           <mo>
             ) 
           </mo> 
          </mrow> 
         </mrow> 
         <mo>
           〉 
         </mo> 
        </mrow> 
       </mrow> 
      </math>. (10)</p>
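    <p>As a small numerical illustration of Equations (9) and (10), the sketch below computes the entropy of a discrete distribution in both forms, as the sum of Equation (9) and as the probability-weighted average of Equation (10). The function names are hypothetical, and the convention that outcomes of zero probability contribute nothing to the sum is an assumption stated in the comments.</p>
    <preformat preformat-type="code" xml:space="preserve">
```python
import numpy as np

def shannon_entropy(p):
    """Equation (9): S(p) = -sum_i p_i ln(p_i), taking 0 ln 0 = 0 by convention."""
    p = np.asarray(p, dtype=float)
    if not (p.min() >= 0.0 and np.isclose(p.sum(), 1.0)):
        raise ValueError("p must be non-negative and sum to one")
    nonzero = p[p > 0]
    return float(-np.sum(nonzero * np.log(nonzero)))

def entropy_as_expectation(p):
    """Equation (10): S = -E[ln p], the p-weighted average of -ln(p_i)."""
    p = np.asarray(p, dtype=float)
    nonzero = p[p > 0]
    return float(np.average(-np.log(nonzero), weights=nonzero))

uniform = np.full(4, 0.25)     # maximally uncertain over four outcomes
certain = [1.0, 0.0, 0.0]      # a single outcome is certain
```
    </preformat>
    <p>For the uniform distribution over four outcomes both functions return ln 4, the largest value attainable with four outcomes, whereas a distribution concentrated on one outcome has zero entropy.</p>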
    <p>In Section 2, the maximum entropy distribution of human height and weight is derived. The statistical implications of maximum entropy are taken up in Section 3. An important matter regarding the information entropy of continuous variables is discussed in Section 4. Section 5 outlines a hypothetical stochastic mechanism of biological development that leads to the bivariate lognormal distribution. Some fine points regarding the maximum entropy explanation of the distribution of human height and weight, together with future steps for further testing, are discussed in Section 6.</p>
    <sec id="s1">
     <title>2. Maximum Entropy Distribution of Height and Weight</title>
     <p>It is common practice in statistics to make a logarithmic transformation of data <xref ref-type="bibr" rid="scirp.144605-18">
       [18]
       </xref>, especially in analyses of physical, biomedical, or economic information comprising positive values that display marked skewness, such as human height and weight. The desired outcome of such a transformation is to “normalize” the data, i.e. to render the distribution more symmetric and closer in form to a Gaussian.</p>
     <p>Consider, therefore, the bivariate random variable</p>
     <p>
      <math xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
        <mrow> 
         <mo>
           ( 
         </mo> 
         <mrow> 
          <mi>
            X 
          </mi> 
          <mo>
            , 
          </mo> 
          <mtext>
              
          </mtext> 
          <mtext>
              
          </mtext> 
          <mi>
            Y 
          </mi> 
         </mrow> 
         <mo>
           ) 
         </mo> 
        </mrow> 
        <mo>
          ≡ 
        </mo> 
        <mrow> 
         <mo>
           ( 
         </mo> 
         <mrow> 
          <mi>
            ln 
          </mi> 
          <mrow> 
           <mo>
             ( 
           </mo> 
           <mi>
             H 
           </mi> 
           <mo>
             ) 
           </mo> 
          </mrow> 
          <mo>
            , 
          </mo> 
          <mtext>
              
          </mtext> 
          <mtext>
              
          </mtext> 
          <mi>
            ln 
          </mi> 
          <mrow> 
           <mo>
             ( 
           </mo> 
           <mi>
             W 
           </mi> 
           <mo>
             ) 
           </mo> 
          </mrow> 
         </mrow> 
         <mo>
           ) 
         </mo> 
        </mrow> 
       </mrow> 
      </math> (11)</p>
     <p>with realizations represented by x, y, respectively. Then the entropy of a data set comprising the logarithms of n individual measurements of H and W takes the form</p>
     <p>
      <math display="inline" xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
        <mi>
          S 
        </mi> 
        <mo>
          = 
        </mo> 
        <mo>
          − 
        </mo> 
        <mstyle displaystyle="true"> 
         <munder> 
          <mo>
            ∑ 
          </mo> 
          <mrow> 
           <mi>
             x 
           </mi> 
           <mo>
             , 
           </mo> 
           <mi>
             y 
           </mi> 
          </mrow> 
         </munder> 
         <mrow> 
          <msub> 
           <mi>
             p 
           </mi> 
           <mrow> 
            <mi>
              x 
            </mi> 
            <mo>
              , 
            </mo> 
            <mi>
              y 
            </mi> 
           </mrow> 
          </msub> 
          <mi>
            ln 
          </mi> 
          <mrow> 
           <mo>
             ( 
           </mo> 
           <mrow> 
            <msub> 
             <mi>
               p 
             </mi> 
             <mrow> 
              <mi>
                x 
              </mi> 
              <mo>
                , 
              </mo> 
              <mi>
                y 
              </mi> 
             </mrow> 
            </msub> 
           </mrow> 
           <mo>
             ) 
           </mo> 
          </mrow> 
         </mrow> 
        </mstyle> 
       </mrow> 
      </math>. (12)</p>
      <p>From the transformed data set one can extract the five statistics defined in Equations (3)-(7) and incorporate this information as constraints in an entropy functional</p>
     <p>
      <math xmlns="http://www.w3.org/1998/Math/MathML"> <mtable columnalign="left"> 
        <mtr> 
         <mtd> 
          <mi>
            H 
          </mi> 
          <mrow> 
           <mo>
             ( 
           </mo> 
           <mstyle mathvariant="bold" mathsize="normal"> 
            <mi>
              p 
            </mi> 
           </mstyle> 
           <mo>
             ) 
           </mo> 
          </mrow> 
          <mo>
            = 
          </mo> 
          <mo>
            − 
          </mo> 
          <mstyle displaystyle="true"> 
           <munder> 
            <mo>
              ∑ 
            </mo> 
            <mrow> 
             <mi>
               x 
             </mi> 
             <mo>
               , 
             </mo> 
             <mi>
               y 
             </mi> 
            </mrow> 
           </munder> 
           <mrow> 
            <msub> 
             <mi>
               p 
             </mi> 
             <mrow> 
              <mi>
                x 
              </mi> 
              <mo>
                , 
              </mo> 
              <mi>
                y 
              </mi> 
             </mrow> 
            </msub> 
            <mi>
              ln 
            </mi> 
            <mrow> 
             <mo>
               ( 
             </mo> 
             <mrow> 
              <msub> 
               <mi>
                 p 
               </mi> 
               <mrow> 
                <mi>
                  x 
                </mi> 
                <mo>
                  , 
                </mo> 
                <mi>
                  y 
                </mi> 
               </mrow> 
              </msub> 
             </mrow> 
             <mo>
               ) 
             </mo> 
            </mrow> 
           </mrow> 
          </mstyle> 
          <mo>
            − 
          </mo> 
          <msub> 
           <mi>
             λ 
           </mi> 
           <mn>
             0 
           </mn> 
          </msub> 
          <mrow> 
           <mo>
             ( 
           </mo> 
           <mrow> 
            <mstyle displaystyle="true"> 
             <munder> 
              <mo>
                ∑ 
              </mo> 
              <mrow> 
               <mi>
                 x 
               </mi> 
               <mo>
                 , 
               </mo> 
               <mi>
                 y 
               </mi> 
              </mrow> 
             </munder> 
             <mrow> 
              <msub> 
               <mi>
                 p 
               </mi> 
               <mrow> 
                <mi>
                  x 
                </mi> 
                <mo>
                  , 
                </mo> 
                <mi>
                  y 
                </mi> 
               </mrow> 
              </msub> 
             </mrow> 
            </mstyle> 
            <mo>
              − 
            </mo> 
            <mn>
              1 
            </mn> 
           </mrow> 
           <mo>
             ) 
           </mo> 
          </mrow> 
          <mo>
            − 
          </mo> 
          <msub> 
           <mi>
             λ 
           </mi> 
           <mn>
             1 
           </mn> 
          </msub> 
          <mrow> 
           <mo>
             ( 
           </mo> 
           <mrow> 
            <mstyle displaystyle="true"> 
             <munder> 
              <mo>
                ∑ 
              </mo> 
              <mrow> 
               <mi>
                 x 
               </mi> 
               <mo>
                 , 
               </mo> 
               <mi>
                 y 
               </mi> 
              </mrow> 
             </munder> 
             <mrow> 
              <msub> 
               <mi>
                 p 
               </mi> 
               <mrow> 
                <mi>
                  x 
                </mi> 
                <mo>
                  , 
                </mo> 
                <mi>
                  y 
                </mi> 
               </mrow> 
              </msub> 
              <msup> 
               <mrow> 
                <mrow> 
                 <mo>
                   ( 
                 </mo> 
                 <mrow> 
                  <mfrac> 
                   <mrow> 
                    <mi>
                      x 
                    </mi> 
                    <mo>
                      − 
                    </mo> 
                    <msub> 
                     <mi>
                       m 
                     </mi> 
                     <mn>
                       1 
                     </mn> 
                    </msub> 
                   </mrow> 
                   <mrow> 
                    <msub> 
                     <mi>
                       s 
                     </mi> 
                     <mn>
                       1 
                     </mn> 
                    </msub> 
                   </mrow> 
                  </mfrac> 
                 </mrow> 
                 <mo>
                   ) 
                 </mo> 
                </mrow> 
               </mrow> 
               <mn>
                 2 
               </mn> 
              </msup> 
             </mrow> 
            </mstyle> 
            <mo>
              − 
            </mo> 
            <mn>
              1 
            </mn> 
           </mrow> 
           <mo>
             ) 
           </mo> 
          </mrow> 
         </mtd> 
        </mtr> 
        <mtr> 
         <mtd> 
          <mspace width="2em" />
          <mo>
            − 
          </mo> 
          <msub> 
           <mi>
             λ 
           </mi> 
           <mn>
             2 
           </mn> 
          </msub> 
          <mrow> 
           <mo>
             ( 
           </mo> 
           <mrow> 
            <mstyle displaystyle="true"> 
             <munder> 
              <mo>
                ∑ 
              </mo> 
              <mrow> 
               <mi>
                 x 
               </mi> 
               <mo>
                 , 
               </mo> 
               <mi>
                 y 
               </mi> 
              </mrow> 
             </munder> 
             <mrow> 
              <msub> 
               <mi>
                 p 
               </mi> 
               <mrow> 
                <mi>
                  x 
                </mi> 
                <mo>
                  , 
                </mo> 
                <mi>
                  y 
                </mi> 
               </mrow> 
              </msub> 
              <msup> 
               <mrow> 
                <mrow> 
                 <mo>
                   ( 
                 </mo> 
                 <mrow> 
                  <mfrac> 
                   <mrow> 
                    <mi>
                      y 
                    </mi> 
                    <mo>
                      − 
                    </mo> 
                    <msub> 
                     <mi>
                       m 
                     </mi> 
                     <mn>
                       2 
                     </mn> 
                    </msub> 
                   </mrow> 
                   <mrow> 
                    <msub> 
                     <mi>
                       s 
                     </mi> 
                     <mn>
                       2 
                     </mn> 
                    </msub> 
                   </mrow> 
                  </mfrac> 
                 </mrow> 
                 <mo>
                   ) 
                 </mo> 
                </mrow> 
               </mrow> 
               <mn>
                 2 
               </mn> 
              </msup> 
             </mrow> 
            </mstyle> 
            <mo>
              − 
            </mo> 
            <mn>
              1 
            </mn> 
           </mrow> 
           <mo>
             ) 
           </mo> 
          </mrow> 
          <mo>
            − 
          </mo> 
          <msub> 
           <mi>
             λ 
           </mi> 
           <mn>
             3 
           </mn> 
          </msub> 
          <mrow> 
           <mo>
             ( 
           </mo> 
           <mrow> 
            <mstyle displaystyle="true"> 
             <munder> 
              <mo>
                ∑ 
              </mo> 
              <mrow> 
               <mi>
                 x 
               </mi> 
               <mo>
                 , 
               </mo> 
               <mi>
                 y 
               </mi> 
              </mrow> 
             </munder> 
             <mrow> 
              <msub> 
               <mi>
                 p 
               </mi> 
               <mrow> 
                <mi>
                  x 
                </mi> 
                <mo>
                  , 
                </mo> 
                <mi>
                  y 
                </mi> 
               </mrow> 
              </msub> 
              <mrow> 
               <mo>
                 ( 
               </mo> 
               <mrow> 
                <mfrac> 
                 <mrow> 
                  <mi>
                    x 
                  </mi> 
                  <mo>
                    − 
                  </mo> 
                  <msub> 
                   <mi>
                     m 
                   </mi> 
                   <mn>
                     1 
                   </mn> 
                  </msub> 
                 </mrow> 
                 <mrow> 
                  <msub> 
                   <mi>
                     s 
                   </mi> 
                   <mn>
                     1 
                   </mn> 
                  </msub> 
                 </mrow> 
                </mfrac> 
               </mrow> 
               <mo>
                 ) 
               </mo> 
              </mrow> 
              <mrow> 
               <mo>
                 ( 
               </mo> 
               <mrow> 
                <mfrac> 
                 <mrow> 
                  <mi>
                    y 
                  </mi> 
                  <mo>
                    − 
                  </mo> 
                  <msub> 
                   <mi>
                     m 
                   </mi> 
                   <mn>
                     2 
                   </mn> 
                  </msub> 
                 </mrow> 
                 <mrow> 
                  <msub> 
                   <mi>
                     s 
                   </mi> 
                   <mn>
                     2 
                   </mn> 
                  </msub> 
                 </mrow> 
                </mfrac> 
               </mrow> 
               <mo>
                 ) 
               </mo> 
              </mrow> 
             </mrow> 
            </mstyle> 
            <mo>
              − 
            </mo> 
            <mi>
              r 
            </mi> 
           </mrow> 
           <mo>
             ) 
           </mo> 
          </mrow> 
         </mtd> 
        </mtr> 
       </mtable> 
      </math> (13)</p>
     <p>by means of the Lagrange multipliers 
      <math display="inline" xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
        <msub> 
         <mi>
           λ 
         </mi> 
         <mi>
           i 
         </mi> 
        </msub> 
        <mrow> 
         <mo>
           ( 
         </mo> 
         <mrow> 
          <mi>
            i 
          </mi> 
          <mo>
            = 
          </mo> 
          <mn>
            0 
          </mn> 
          <mo>
            , 
          </mo> 
          <mn>
            1 
          </mn> 
          <mo>
            , 
          </mo> 
          <mn>
            2 
          </mn> 
          <mo>
            , 
          </mo> 
          <mn>
            3 
          </mn> 
         </mrow> 
         <mo>
           ) 
         </mo> 
        </mrow> 
       </mrow> 
      </math>. In this paper the symbol “S”, as used in physics, will stand for entropy; the symbol “H”, as used in communication theory, will stand for the entropy functional in the variational procedure. (The symbol “H” actually represents an upper-case Greek Eta for Entropy.)</p>
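      <p>The structure of Equation (13) can be checked numerically. The sketch below is an illustration under stated assumptions rather than part of the derivation: it works in the standardized variables u = (x − m1)/s1 and v = (y − m2)/s2, replaces the sum over data by a sum over a uniform grid, and uses hypothetical function names. It evaluates the entropy term and the four constraint terms for a discretized bivariate Gaussian with correlation r. For that distribution every constraint term vanishes, so the functional reduces to the entropy whatever the values of the multipliers.</p>
      <preformat preformat-type="code" xml:space="preserve">
```python
import numpy as np

def gaussian_on_grid(r, half_width=8.0, step=0.1):
    """Discretized bivariate Gaussian with unit variances and correlation r."""
    u = np.arange(-half_width, half_width + step / 2, step)
    U, V = np.meshgrid(u, u, indexing="ij")
    weights = np.exp(-(U**2 - 2.0 * r * U * V + V**2) / (2.0 * (1.0 - r**2)))
    return U, V, weights / weights.sum()

def entropy_functional(p, U, V, lam, r):
    """Equation (13) written in the standardized variables u and v.

    Returns the functional, the entropy term, and the four constraint residuals.
    """
    mask = p > 0
    entropy = -np.sum(p[mask] * np.log(p[mask]))
    constraints = np.array([
        p.sum() - 1.0,               # normalization (multiplier lambda_0)
        np.sum(p * U**2) - 1.0,      # variance of x (multiplier lambda_1)
        np.sum(p * V**2) - 1.0,      # variance of y (multiplier lambda_2)
        np.sum(p * U * V) - r,       # correlation   (multiplier lambda_3)
    ])
    functional = entropy - np.dot(lam, constraints)
    return float(functional), float(entropy), constraints

U, V, p = gaussian_on_grid(r=0.4716)
H, S, residuals = entropy_functional(p, U, V, lam=[0.3, 1.0, 2.0, -0.7], r=0.4716)
```
      </preformat>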
      <p>The choice of what prior information to include in the variational analysis is somewhat arbitrary, as it depends on what prior information is available. The choice is important, for it determines the form of the statistical distribution that emerges from the analysis. This consideration will be discussed in greater detail later in the paper. For the present, suffice it to say that the author’s choice was motivated by the prior knowledge, as reported in Ref. <xref ref-type="bibr" rid="scirp.144605-3">
        [3]
       </xref>, that a bivariate lognormal probability density matched the ANSUR data very closely. Therefore, the initial constraints were chosen to be the five lowest statistical moments that uniquely define a bivariate lognormal distribution. A different set of prior constraints, however, would have resulted in a different distribution, even though those constraints would have come from the same ANSUR data set.</p>
     <p>It is to be noted that Equation (13) does not contain explicit terms for constraints on the two means, Equations (3) and (5). This is not an oversight. Rather, the information provided by those two constraints is already included through the constraints on the two variances. To add to Equation (13) additional terms for constraining the two means would be redundant, and the corresponding Lagrange multipliers would turn out to be zero. It is, in fact, a general characteristic of the maximum entropy method that it recognizes redundant information and eliminates the associated Lagrange multipliers from the PME solution <xref ref-type="bibr" rid="scirp.144605-19">
       [19]
      </xref>.</p>
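      <p>The redundancy of the mean constraints can be illustrated numerically. The sketch below is one possible check and not the method of this paper. It adds explicit constraints on the two standardized means to those of Equation (13) and solves the resulting maximum entropy problem on a grid by minimizing the convex dual function ln Z(λ) + λ·c, where c collects the target expectation values. The use of standardized variables, the grid, the dual formulation, and the BFGS minimizer from SciPy are all assumptions made for illustration. With the sign convention of Equation (13), the solution has the form p ∝ exp(−λ·f), where f collects the constraint functions.</p>
      <preformat preformat-type="code" xml:space="preserve">
```python
import numpy as np
from scipy.optimize import minimize
from scipy.special import logsumexp

R = 0.4716  # correlation coefficient quoted in the text

# Grid of standardized variables u = (x - m1)/s1 and v = (y - m2)/s2.
grid = np.arange(-6.0, 6.0 + 0.1, 0.2)
U, V = (a.ravel() for a in np.meshgrid(grid, grid, indexing="ij"))

# Constraint functions and their target expectation values.
# The first two rows are the (redundant) constraints on the means.
FEATURES = np.vstack([U, V, U**2, V**2, U * V])
TARGETS = np.array([0.0, 0.0, 1.0, 1.0, R])

def dual(lam):
    """Dual function ln Z(lam) + lam . targets and its gradient."""
    log_weights = -lam @ FEATURES              # p is proportional to exp(-lam . f)
    log_z = logsumexp(log_weights)
    p = np.exp(log_weights - log_z)
    gradient = TARGETS - FEATURES @ p          # vanishes when all constraints hold
    return log_z + lam @ TARGETS, gradient

start = np.array([0.0, 0.0, 0.5, 0.5, 0.0])    # independent unit Gaussians
solution = minimize(dual, start, jac=True, method="BFGS")
lam_u, lam_v, lam_uu, lam_vv, lam_uv = solution.x
```
      </preformat>
      <p>In this setting the multipliers attached to the two means come out numerically indistinguishable from zero, while the remaining three approach the coefficients of the quadratic form of a bivariate Gaussian, namely 1/(2(1 − r²)) for the two squared terms and −r/(1 − r²) for the cross term.</p>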
     <p>A question that may arise is how uncertainties in the prior information affect the initial constraints and therefore the resulting solution. The answer is that in applying the PME, one simply treats the expectation values that serve as prior information as given known data. If, for example, the prior information is a set of mean values of the variables, and there is a need to account for the uncertainties in those variables, then that information is also to be included in the PME functional as constraints on the associated variances, such as implemented in Equation (13). Does one then need to consider the uncertainties of the variances, and so on up an unending ladder of higher-order uncertainties? The answer is “No”, as explained later in the paper. The PME procedure itself can indicate to the analyst when more (or different) prior information is required.</p>
     <p>Given Equation (13), the variation with respect to 
      <math xmlns="http://www.w3.org/1998/Math/MathML"> <mi>
         p 
       </mi> 
      </math></p>
     <p>
      <math xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
        <mi>
          δ 
        </mi> 
        <mi>
          H 
        </mi> 
        <mrow> 
         <mo>
           ( 
         </mo> 
         <mi>
           p 
         </mi> 
         <mo>
           ) 
         </mo> 
        </mrow> 
        <mo>
          = 
        </mo> 
        <mn>
          0 
        </mn> 
       </mrow> 
      </math> (14)</p>
     <p>to maximize the entropy subject to the imposed constraints then leads to the equation</p>
     <p>
      <math display="inline" xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
        <mi>
          ln 
        </mi> 
        <mrow> 
         <mo>
           ( 
         </mo> 
         <mrow> 
          <msub> 
           <mi>
             p 
           </mi> 
           <mrow> 
            <mi>
              x 
            </mi> 
            <mo>
              , 
            </mo> 
            <mi>
              y 
            </mi> 
           </mrow> 
          </msub> 
         </mrow> 
         <mo>
           ) 
         </mo> 
        </mrow> 
        <mo>
          = 
        </mo> 
        <mo>
          − 
        </mo> 
        <mrow> 
         <mo>
           ( 
         </mo> 
         <mrow> 
          <mn>
            1 
          </mn> 
          <mo>
            + 
          </mo> 
          <msub> 
           <mi>
             λ 
           </mi> 
           <mn>
             0 
           </mn> 
          </msub> 
         </mrow> 
         <mo>
           ) 
         </mo> 
        </mrow> 
        <mo>
          − 
        </mo> 
        <msub> 
         <mi>
           λ 
         </mi> 
         <mn>
           1 
         </mn> 
        </msub> 
        <mi>
          u 
        </mi> 
        <msup> 
         <mrow> 
          <mrow> 
           <mo>
             ( 
           </mo> 
           <mi>
             x 
           </mi> 
           <mo>
             ) 
           </mo> 
          </mrow> 
         </mrow> 
         <mn>
           2 
         </mn> 
        </msup> 
        <mo>
          − 
        </mo> 
        <msub> 
         <mi>
           λ 
         </mi> 
         <mn>
           2 
         </mn> 
        </msub> 
        <mi>
          v 
        </mi> 
        <msup> 
         <mrow> 
          <mrow> 
           <mo>
             ( 
           </mo> 
           <mi>
             y 
           </mi> 
           <mo>
             ) 
           </mo> 
          </mrow> 
         </mrow> 
         <mn>
           2 
         </mn> 
        </msup> 
        <mo>
          − 
        </mo> 
        <msub> 
         <mi>
           λ 
         </mi> 
         <mn>
           3 
         </mn> 
        </msub> 
        <mi>
          u 
        </mi> 
        <mrow> 
         <mo>
           ( 
         </mo> 
         <mi>
           x 
         </mi> 
         <mo>
           ) 
         </mo> 
        </mrow> 
        <mi>
          v 
        </mi> 
        <mrow> 
         <mo>
           ( 
         </mo> 
         <mi>
           y 
         </mi> 
         <mo>
           ) 
         </mo> 
        </mrow> 
       </mrow> 
      </math> (15)</p>
     <p>in which</p>
     <p>
      <math display="inline" xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
        <mi>
          u 
        </mi> 
        <mrow> 
         <mo>
           ( 
         </mo> 
         <mi>
           x 
         </mi> 
         <mo>
           ) 
         </mo> 
        </mrow> 
        <mo>
          ≡ 
        </mo> 
        <mfrac> 
         <mrow> 
          <mi>
            x 
          </mi> 
          <mo>
            − 
          </mo> 
          <msub> 
           <mi>
             m 
           </mi> 
           <mn>
             1 
           </mn> 
          </msub> 
         </mrow> 
         <mrow> 
          <msub> 
           <mi>
             s 
           </mi> 
           <mn>
             1 
           </mn> 
          </msub> 
         </mrow> 
        </mfrac> 
        <mo>
          , 
        </mo> 
        <mtext>
            
        </mtext> 
        <mtext>
            
        </mtext> 
        <mi>
          v 
        </mi> 
        <mrow> 
         <mo>
           ( 
         </mo> 
         <mi>
           y 
         </mi> 
         <mo>
           ) 
         </mo> 
        </mrow> 
        <mo>
          ≡ 
        </mo> 
        <mfrac> 
         <mrow> 
          <mi>
            y 
          </mi> 
          <mo>
            − 
          </mo> 
          <msub> 
           <mi>
             m 
           </mi> 
           <mn>
             2 
           </mn> 
          </msub> 
         </mrow> 
         <mrow> 
          <msub> 
           <mi>
             s 
           </mi> 
           <mn>
             2 
           </mn> 
          </msub> 
         </mrow> 
        </mfrac> 
       </mrow> 
      </math> (16)</p>
     <p>are recognized as standard normal variables with properties</p>
     <p>
      <math display="inline" xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
        <mrow> 
         <mo>
           〈 
         </mo> 
         <mi>
           u 
         </mi> 
         <mo>
           〉 
         </mo> 
        </mrow> 
        <mo>
          = 
        </mo> 
        <mrow> 
         <mo>
           〈 
         </mo> 
         <mi>
           v 
         </mi> 
         <mo>
           〉 
         </mo> 
        </mrow> 
        <mo>
          = 
        </mo> 
        <mn>
          0 
        </mn> 
       </mrow> 
      </math> (17)</p>
     <p>and</p>
     <p>
      <math display="inline" xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
        <mrow> 
         <mo>
           〈 
         </mo> 
         <mrow> 
          <msup> 
           <mi>
             u 
           </mi> 
           <mn>
             2 
           </mn> 
          </msup> 
         </mrow> 
         <mo>
           〉 
         </mo> 
        </mrow> 
        <mo>
          = 
        </mo> 
        <mrow> 
         <mo>
           〈 
         </mo> 
         <mrow> 
          <msup> 
           <mi>
             v 
           </mi> 
           <mn>
             2 
           </mn> 
          </msup> 
         </mrow> 
         <mo>
           〉 
         </mo> 
        </mrow> 
        <mo>
          = 
        </mo> 
        <mn>
          1 
        </mn> 
       </mrow> 
      </math>. (18)</p>
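     <p>The standardization in Equation (16) and the properties in Equations (17) and (18) can be checked on sample data. The following is a minimal sketch using synthetic numbers generated for illustration only (they are not ANSUR values); the function names standardize and average are hypothetical, and m and s are taken as the sample mean and the population-form standard deviation.</p>
     <preformat>
```python
import math
import random

def standardize(values):
    """Return (u, m, s): standardized values, sample mean, standard deviation."""
    n = len(values)
    m = sum(values) / n
    s = math.sqrt(sum((x - m) ** 2 for x in values) / n)
    return [(x - m) / s for x in values], m, s

def average(values):
    return sum(values) / len(values)

# Synthetic, purely illustrative data (not ANSUR measurements).
rng = random.Random(0)
heights = [rng.gauss(170.0, 7.0) for _ in range(5000)]
weights = [0.9 * (h - 170.0) + rng.gauss(75.0, 10.0) for h in heights]

u, m1, s1 = standardize(heights)
v, m2, s2 = standardize(weights)

mean_u, mean_v = average(u), average(v)       # Equation (17): both near 0
mean_u2 = average([a * a for a in u])         # Equation (18): near 1
mean_v2 = average([b * b for b in v])         # Equation (18): near 1
r = average([a * b for a, b in zip(u, v)])    # sample correlation coefficient
print(mean_u, mean_v, mean_u2, mean_v2, r)
```
     </preformat>
     <p>With this choice of m and s, the first two moments of u and v match Equations (17) and (18) up to rounding error, and the average of the product uv is the sample correlation coefficient.</p>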
     <p>The solution to Equation (15), expressed in terms of the variables u and v, takes the form</p>
     <p>
      <math display="inline" xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
        <mi>
          p 
        </mi> 
        <mrow> 
         <mo>
           ( 
         </mo> 
         <mrow> 
          <mi>
            u 
          </mi> 
          <mo>
            , 
          </mo> 
          <mi>
            v 
          </mi> 
         </mrow> 
         <mo>
           ) 
         </mo> 
        </mrow> 
        <mo>
          = 
        </mo> 
        <mfrac> 
         <mrow> 
          <mi>
            exp 
          </mi> 
          <mrow> 
           <mo>
             ( 
           </mo> 
           <mrow> 
            <mo>
              − 
            </mo> 
            <msub> 
             <mi>
               λ 
             </mi> 
             <mn>
               1 
             </mn> 
            </msub> 
            <msup> 
             <mi>
               u 
             </mi> 
             <mn>
               2 
             </mn> 
            </msup> 
            <mo>
              − 
            </mo> 
            <msub> 
             <mi>
               λ 
             </mi> 
             <mn>
               2 
             </mn> 
            </msub> 
            <msup> 
             <mi>
               v 
             </mi> 
             <mn>
               2 
             </mn> 
            </msup> 
            <mo>
              − 
            </mo> 
            <msub> 
             <mi>
               λ 
             </mi> 
             <mn>
               3 
             </mn> 
            </msub> 
            <mi>
              u 
            </mi> 
            <mi>
              v 
            </mi> 
           </mrow> 
           <mo>
             ) 
           </mo> 
          </mrow> 
         </mrow> 
         <mrow> 
          <mi>
            Z 
          </mi> 
          <mrow> 
           <mo>
             ( 
           </mo> 
           <mrow> 
            <msub> 
             <mi>
               λ 
             </mi> 
             <mn>
               1 
             </mn> 
            </msub> 
            <mo>
              , 
            </mo> 
            <msub> 
             <mi>
               λ 
             </mi> 
             <mn>
               2 
             </mn> 
            </msub> 
            <mo>
              , 
            </mo> 
            <msub> 
             <mi>
               λ 
             </mi> 
             <mn>
               3 
             </mn> 
            </msub> 
           </mrow> 
           <mo>
             ) 
           </mo> 
          </mrow> 
         </mrow> 
        </mfrac> 
       </mrow> 
      </math> (19)</p>
     <p>in which the Lagrange multiplier 
      <math display="inline" xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
        <msub> 
         <mi>
           λ 
         </mi> 
         <mn>
           0 
         </mn> 
        </msub> 
       </mrow> 
      </math> was incorporated into the partition function</p>
     <p>
      <math xmlns="http://www.w3.org/1998/Math/MathML"> <mtable columnalign="left"> 
        <mtr> 
         <mtd> 
          <mi>
            Z 
          </mi> 
          <mrow> 
           <mo>
             ( 
           </mo> 
           <mi>
             λ 
           </mi> 
           <mo>
             ) 
           </mo> 
          </mrow> 
          <mo>
            ≡ 
          </mo> 
          <mi>
            Z 
          </mi> 
          <mrow> 
           <mo>
             ( 
           </mo> 
           <mrow> 
            <msub> 
             <mi>
               λ 
             </mi> 
             <mn>
               1 
             </mn> 
            </msub> 
            <mo>
              , 
            </mo> 
            <msub> 
             <mi>
               λ 
             </mi> 
             <mn>
               2 
             </mn> 
            </msub> 
            <mo>
              , 
            </mo> 
            <msub> 
             <mi>
               λ 
             </mi> 
             <mn>
               3 
             </mn> 
            </msub> 
           </mrow> 
           <mo>
             ) 
           </mo> 
          </mrow> 
          <mtext>
              
          </mtext> 
          <mtext>
              
          </mtext> 
          <mtext>
              
          </mtext> 
          <mo>
            = 
          </mo> 
          <mtext>
              
          </mtext> 
          <mtext>
              
          </mtext> 
          <mtext>
              
          </mtext> 
          <mstyle displaystyle="true"> 
           <mrow> 
            <munderover> 
             <mo>
               ∫ 
             </mo> 
             <mrow> 
              <mo>
                − 
              </mo> 
              <mi>
                ∞ 
              </mi> 
             </mrow> 
             <mi>
               ∞ 
             </mi> 
            </munderover> 
            <mrow> 
             <mstyle displaystyle="true"> 
              <mrow> 
               <munderover> 
                <mo>
                  ∫ 
                </mo> 
                <mrow> 
                 <mo>
                   − 
                 </mo> 
                 <mi>
                   ∞ 
                 </mi> 
                </mrow> 
                <mi>
                  ∞ 
                </mi> 
               </munderover> 
               <mrow> 
                <mi>
                  exp 
                </mi> 
                <mrow> 
                 <mo>
                   ( 
                 </mo> 
                 <mrow> 
                  <mo>
                    − 
                  </mo> 
                  <msub> 
                   <mi>
                     λ 
                   </mi> 
                   <mn>
                     1 
                   </mn> 
                  </msub> 
                  <msup> 
                   <mi>
                     u 
                   </mi> 
                   <mn>
                     2 
                   </mn> 
                  </msup> 
                  <mo>
                    − 
                  </mo> 
                  <msub> 
                   <mi>
                     λ 
                   </mi> 
                   <mn>
                     2 
                   </mn> 
                  </msub> 
                  <msup> 
                   <mi>
                     v 
                   </mi> 
                   <mn>
                     2 
                   </mn> 
                  </msup> 
                  <mo>
                    − 
                  </mo> 
                  <msub> 
                   <mi>
                     λ 
                   </mi> 
                   <mn>
                     3 
                   </mn> 
                  </msub> 
                  <mi>
                    u 
                  </mi> 
                  <mi>
                    v 
                  </mi> 
                 </mrow> 
                 <mo>
                   ) 
                 </mo> 
                </mrow> 
                <mtext>
                  d 
                </mtext> 
                <mi>
                  u 
                </mi> 
                <mtext>
                  d 
                </mtext> 
                <mi>
                  v 
                </mi> 
               </mrow> 
              </mrow> 
             </mstyle> 
            </mrow> 
           </mrow> 
          </mstyle> 
         </mtd> 
        </mtr> 
        <mtr> 
         <mtd> 
          <mspace width="12em"/> 
          <mo>
            = 
          </mo> 
          <mfrac> 
           <mrow> 
            <mn>
              2 
            </mn> 
            <mi>
              π 
            </mi> 
           </mrow> 
           <mrow> 
            <msqrt> 
             <mrow> 
              <mn>
                4 
              </mn> 
              <msub> 
               <mi>
                 λ 
               </mi> 
               <mn>
                 1 
               </mn> 
              </msub> 
              <msub> 
               <mi>
                 λ 
               </mi> 
               <mn>
                 2 
               </mn> 
              </msub> 
              <mo>
                − 
              </mo> 
              <msubsup> 
               <mi>
                 λ 
               </mi> 
               <mn>
                 3 
               </mn> 
               <mn>
                 2 
               </mn> 
              </msubsup> 
             </mrow> 
            </msqrt> 
           </mrow> 
          </mfrac> 
         </mtd> 
        </mtr> 
       </mtable> 
      </math> (20)</p>
     <p>whose explicit evaluation in the second line of Equation (20) takes account of the continuous nature of the variables. Substitution of 
      <math xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
        <mi>
          Z 
        </mi> 
        <mrow> 
         <mo>
           ( 
         </mo> 
         <mi>
           λ 
         </mi> 
         <mo>
           ) 
         </mo> 
        </mrow> 
       </mrow> 
      </math> into Equation (19) leads to the probability function</p>
     <p>
      <math xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
        <mi>
          p 
        </mi> 
        <mrow> 
         <mo>
           ( 
         </mo> 
         <mrow> 
          <mi>
            u 
          </mi> 
          <mo>
            , 
          </mo> 
          <mi>
            v 
          </mi> 
         </mrow> 
         <mo>
           ) 
         </mo> 
        </mrow> 
        <mo>
          = 
        </mo> 
        <mfrac> 
         <mrow> 
          <msqrt> 
           <mrow> 
            <mn>
              4 
            </mn> 
            <msub> 
             <mi>
               λ 
             </mi> 
             <mn>
               1 
             </mn> 
            </msub> 
            <msub> 
             <mi>
               λ 
             </mi> 
             <mn>
               2 
             </mn> 
            </msub> 
            <mo>
              − 
            </mo> 
            <msubsup> 
             <mi>
               λ 
             </mi> 
             <mn>
               3 
             </mn> 
             <mn>
               2 
             </mn> 
            </msubsup> 
           </mrow> 
          </msqrt> 
         </mrow> 
         <mrow> 
          <mn>
            2 
          </mn> 
          <mi>
            π 
          </mi> 
         </mrow> 
        </mfrac> 
        <mi>
          exp 
        </mi> 
        <mrow> 
         <mo>
           ( 
         </mo> 
         <mrow> 
          <mo>
            − 
          </mo> 
          <msub> 
           <mi>
             λ 
           </mi> 
           <mn>
             1 
           </mn> 
          </msub> 
          <msup> 
           <mi>
             u 
           </mi> 
           <mn>
             2 
           </mn> 
          </msup> 
          <mo>
            − 
          </mo> 
          <msub> 
           <mi>
             λ 
           </mi> 
           <mn>
             2 
           </mn> 
          </msub> 
          <msup> 
           <mi>
             v 
           </mi> 
           <mn>
             2 
           </mn> 
          </msup> 
          <mo>
            − 
          </mo> 
          <msub> 
           <mi>
             λ 
           </mi> 
           <mn>
             3 
           </mn> 
          </msub> 
          <mi>
            u 
          </mi> 
          <mi>
            v 
          </mi> 
         </mrow> 
         <mo>
           ) 
         </mo> 
        </mrow> 
       </mrow> 
      </math>. (21)</p>
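     <p>The closed form in the second line of Equation (20) can be compared with a direct numerical evaluation of the double integral in the first line. The sketch below uses a midpoint rule on a truncated square; the multiplier values, the truncation half_width, and the function names are illustrative assumptions, and the Gaussian integral converges only when l1 is positive and 4*l1*l2 - l3**2 is positive.</p>
     <preformat>
```python
import math

def partition_closed_form(l1, l2, l3):
    """Second line of Equation (20); requires l1 > 0 and 4*l1*l2 - l3**2 > 0."""
    return 2.0 * math.pi / math.sqrt(4.0 * l1 * l2 - l3 * l3)

def partition_numeric(l1, l2, l3, half_width=10.0, n=400):
    """First line of Equation (20), midpoint rule on a truncated square."""
    h = 2.0 * half_width / n
    points = [-half_width + (i + 0.5) * h for i in range(n)]
    total = 0.0
    for u in points:
        for v in points:
            total += math.exp(-l1 * u * u - l2 * v * v - l3 * u * v)
    return total * h * h

# Arbitrary admissible multipliers, chosen only for illustration.
l1, l2, l3 = 0.8, 0.6, -0.5
z_exact = partition_closed_form(l1, l2, l3)
z_num = partition_numeric(l1, l2, l3)
print(z_exact, z_num, z_num / z_exact)
```
     </preformat>
     <p>The ratio printed last is the numerical integral of the density in Equation (21) over the truncated square, so a value close to one also serves as a check of its normalization.</p>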
     <p>The partition function, defined in the first line of Equation (20), is seen to serve as the normalization factor ensuring that 
      <math xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
        <mi>
          p 
        </mi> 
        <mrow> 
         <mo>
           ( 
         </mo> 
         <mrow> 
          <mi>
            u 
          </mi> 
          <mo>
            , 
          </mo> 
          <mi>
            v 
          </mi> 
         </mrow> 
         <mo>
           ) 
         </mo> 
        </mrow> 
       </mrow> 
      </math> obeys the completeness requirement of a probability density</p>
     <p>
      <math display="inline" xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
        <mstyle displaystyle="true"> 
         <mrow> 
          <munderover> 
           <mo>
             ∫ 
           </mo> 
           <mrow> 
            <mo>
              − 
            </mo> 
            <mi>
              ∞ 
            </mi> 
           </mrow> 
           <mi>
             ∞ 
           </mi> 
          </munderover> 
          <mrow> 
           <mstyle displaystyle="true"> 
            <mrow> 
             <munderover> 
              <mo>
                ∫ 
              </mo> 
              <mrow> 
               <mo>
                 − 
               </mo> 
               <mi>
                 ∞ 
               </mi> 
              </mrow> 
              <mi>
                ∞ 
              </mi> 
             </munderover> 
             <mrow> 
              <mi>
                p 
              </mi> 
              <mrow> 
               <mo>
                 ( 
               </mo> 
               <mrow> 
                <mi>
                  u 
                </mi> 
                <mo>
                  , 
                </mo> 
                <mi>
                  v 
                </mi> 
               </mrow> 
               <mo>
                 ) 
               </mo> 
              </mrow> 
              <mtext>
                d 
              </mtext> 
              <mi>
                u 
              </mi> 
              <mtext>
                d 
              </mtext> 
              <mi>
                v 
              </mi> 
             </mrow> 
            </mrow> 
           </mstyle> 
          </mrow> 
         </mrow> 
        </mstyle> 
        <mo>
          = 
        </mo> 
        <mn>
          1 
        </mn> 
       </mrow> 
      </math> (22)</p>
     <p>irrespective of the values of the Lagrange multipliers. However, one sees from Equations (8) and (19) that 
      <math xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
        <mi>
          Z 
        </mi> 
        <mrow> 
         <mo>
           ( 
         </mo> 
         <mi>
           λ 
         </mi> 
         <mo>
           ) 
         </mo> 
        </mrow> 
       </mrow> 
      </math> is also the function that relates the values of the Lagrange multipliers to the values of the constraints through the relations</p>
     <p>
      <math xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
        <mo>
          − 
        </mo> 
        <mfrac> 
         <mrow> 
          <mo>
            ∂ 
          </mo> 
          <mi>
            ln 
          </mi> 
          <mrow> 
           <mo>
             ( 
           </mo> 
           <mrow> 
            <mi>
              Z 
            </mi> 
            <mrow> 
             <mo>
               ( 
             </mo> 
             <mi>
               λ 
             </mi> 
             <mo>
               ) 
             </mo> 
            </mrow> 
           </mrow> 
           <mo>
             ) 
           </mo> 
          </mrow> 
         </mrow> 
         <mrow> 
          <mo>
            ∂ 
          </mo> 
          <msub> 
           <mi>
             λ 
           </mi> 
           <mn>
             1 
           </mn> 
          </msub> 
         </mrow> 
        </mfrac> 
        <mtext>
            
        </mtext> 
        <mtext>
            
        </mtext> 
        <mo>
          = 
        </mo> 
        <mtext>
            
        </mtext> 
        <mtext>
            
        </mtext> 
        <mfrac> 
         <mrow> 
          <mn>
            2 
          </mn> 
          <msub> 
           <mi>
             λ 
           </mi> 
           <mn>
             2 
           </mn> 
          </msub> 
         </mrow> 
         <mrow> 
          <mn>
            4 
          </mn> 
          <msub> 
           <mi>
             λ 
           </mi> 
           <mn>
             1 
           </mn> 
          </msub> 
          <msub> 
           <mi>
             λ 
           </mi> 
           <mn>
             2 
           </mn> 
          </msub> 
          <mo>
            − 
          </mo> 
          <msubsup> 
           <mi>
             λ 
           </mi> 
           <mn>
             3 
           </mn> 
           <mn>
             2 
           </mn> 
          </msubsup> 
         </mrow> 
        </mfrac> 
        <mtext>
            
        </mtext> 
        <mtext>
            
        </mtext> 
        <mo>
          = 
        </mo> 
        <mtext>
            
        </mtext> 
        <mtext>
            
        </mtext> 
        <mrow> 
         <mo>
           〈 
         </mo> 
         <mrow> 
          <msup> 
           <mi>
             u 
           </mi> 
           <mn>
             2 
           </mn> 
          </msup> 
         </mrow> 
         <mo>
           〉 
         </mo> 
        </mrow> 
        <mtext>
            
        </mtext> 
        <mtext>
            
        </mtext> 
        <mo>
          = 
        </mo> 
        <mtext>
            
        </mtext> 
        <mtext>
            
        </mtext> 
        <mn>
          1 
        </mn> 
       </mrow> 
      </math> (23)</p>
     <p>
      <math xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
        <mo>
          − 
        </mo> 
        <mfrac> 
         <mrow> 
          <mo>
            ∂ 
          </mo> 
          <mi>
            ln 
          </mi> 
          <mrow> 
           <mo>
             ( 
           </mo> 
           <mrow> 
            <mi>
              Z 
            </mi> 
            <mrow> 
             <mo>
               ( 
             </mo> 
             <mi>
               λ 
             </mi> 
             <mo>
               ) 
             </mo> 
            </mrow> 
           </mrow> 
           <mo>
             ) 
           </mo> 
          </mrow> 
         </mrow> 
         <mrow> 
          <mo>
            ∂ 
          </mo> 
          <msub> 
           <mi>
             λ 
           </mi> 
           <mn>
             2 
           </mn> 
          </msub> 
         </mrow> 
        </mfrac> 
        <mtext>
            
        </mtext> 
        <mtext>
            
        </mtext> 
        <mo>
          = 
        </mo> 
        <mtext>
            
        </mtext> 
        <mtext>
            
        </mtext> 
        <mfrac> 
         <mrow> 
          <mn>
            2 
          </mn> 
          <msub> 
           <mi>
             λ 
           </mi> 
           <mn>
             1 
           </mn> 
          </msub> 
         </mrow> 
         <mrow> 
          <mn>
            4 
          </mn> 
          <msub> 
           <mi>
             λ 
           </mi> 
           <mn>
             1 
           </mn> 
          </msub> 
          <msub> 
           <mi>
             λ 
           </mi> 
           <mn>
             2 
           </mn> 
          </msub> 
          <mo>
            − 
          </mo> 
          <msubsup> 
           <mi>
             λ 
           </mi> 
           <mn>
             3 
           </mn> 
           <mn>
             2 
           </mn> 
          </msubsup> 
         </mrow> 
        </mfrac> 
        <mtext>
            
        </mtext> 
        <mtext>
            
        </mtext> 
        <mo>
          = 
        </mo> 
        <mtext>
            
        </mtext> 
        <mtext>
            
        </mtext> 
        <mrow> 
         <mo>
           〈 
         </mo> 
         <mrow> 
          <msup> 
           <mi>
             v 
           </mi> 
           <mn>
             2 
           </mn> 
          </msup> 
         </mrow> 
         <mo>
           〉 
         </mo> 
        </mrow> 
        <mtext>
            
        </mtext> 
        <mtext>
            
        </mtext> 
        <mo>
          = 
        </mo> 
        <mtext>
            
        </mtext> 
        <mtext>
            
        </mtext> 
        <mn>
          1 
        </mn> 
       </mrow> 
      </math> (24)</p>
     <p>
      <math xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
        <mo>
          − 
        </mo> 
        <mfrac> 
         <mrow> 
          <mo>
            ∂ 
          </mo> 
          <mi>
            ln 
          </mi> 
          <mrow> 
           <mo>
             ( 
           </mo> 
           <mrow> 
            <mi>
              Z 
            </mi> 
            <mrow> 
             <mo>
               ( 
             </mo> 
             <mi>
               λ 
             </mi> 
             <mo>
               ) 
             </mo> 
            </mrow> 
           </mrow> 
           <mo>
             ) 
           </mo> 
          </mrow> 
         </mrow> 
         <mrow> 
          <mo>
            ∂ 
          </mo> 
          <msub> 
           <mi>
             λ 
           </mi> 
           <mn>
             3 
           </mn> 
          </msub> 
         </mrow> 
        </mfrac> 
        <mtext>
            
        </mtext> 
        <mtext>
            
        </mtext> 
        <mo>
          = 
        </mo> 
        <mtext>
            
        </mtext> 
        <mtext>
            
        </mtext> 
        <mfrac> 
         <mrow> 
          <mo>
            − 
          </mo> 
          <msub> 
           <mi>
             λ 
           </mi> 
           <mn>
             3 
           </mn> 
          </msub> 
         </mrow> 
         <mrow> 
          <mn>
            4 
          </mn> 
          <msub> 
           <mi>
             λ 
           </mi> 
           <mn>
             1 
           </mn> 
          </msub> 
          <msub> 
           <mi>
             λ 
           </mi> 
           <mn>
             2 
           </mn> 
          </msub> 
          <mo>
            − 
          </mo> 
          <msubsup> 
           <mi>
             λ 
           </mi> 
           <mn>
             3 
           </mn> 
           <mn>
             2 
           </mn> 
          </msubsup> 
         </mrow> 
        </mfrac> 
        <mtext>
            
        </mtext> 
        <mtext>
            
        </mtext> 
        <mo>
          = 
        </mo> 
        <mtext>
            
        </mtext> 
        <mtext>
            
        </mtext> 
        <mrow> 
         <mo>
           〈 
         </mo> 
         <mrow> 
          <mi>
            u 
          </mi> 
          <mtext>
              
          </mtext> 
          <mi>
            v 
          </mi> 
         </mrow> 
         <mo>
           〉 
         </mo> 
        </mrow> 
        <mtext>
            
        </mtext> 
        <mtext>
            
        </mtext> 
        <mo>
          = 
        </mo> 
        <mtext>
            
        </mtext> 
        <mtext>
            
        </mtext> 
        <mi>
          r 
        </mi> 
       </mrow> 
      </math>. (25)</p>
     <p>From Equations (23) and (24), it follows that 
      <math display="inline" xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
        <msub> 
         <mi>
           λ 
         </mi> 
         <mn>
           1 
         </mn> 
        </msub> 
        <mo>
          = 
        </mo> 
        <msub> 
         <mi>
           λ 
         </mi> 
         <mn>
           2 
         </mn> 
        </msub> 
       </mrow> 
      </math>. Likewise, from Equations (23) and (25), it follows that 
      <math display="inline" xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
        <msub> 
         <mi>
           λ 
         </mi> 
         <mn>
           3 
         </mn> 
        </msub> 
        <mo>
          = 
        </mo> 
        <mo>
          − 
        </mo> 
        <mn>
          2 
        </mn> 
        <mi>
          r 
        </mi> 
        <msub> 
         <mi>
           λ 
         </mi> 
         <mn>
           1 
         </mn> 
        </msub> 
       </mrow> 
      </math>. Substitution of the preceding expressions for 
      <math display="inline" xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
        <msub> 
         <mi>
           λ 
         </mi> 
         <mn>
           2 
         </mn> 
        </msub> 
       </mrow> 
      </math> and 
      <math display="inline" xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
        <msub> 
         <mi>
           λ 
         </mi> 
         <mn>
           3 
         </mn> 
        </msub> 
       </mrow> 
      </math> into Equation (24) determines 
      <math display="inline" xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
        <msub> 
         <mi>
           λ 
         </mi> 
         <mn>
           1 
         </mn> 
        </msub> 
       </mrow> 
      </math>, leading to the solution for all three multipliers</p>
     <p>
      <math xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
        <msub> 
         <mi>
           λ 
         </mi> 
         <mn>
           1 
         </mn> 
        </msub> 
        <mo>
          = 
        </mo> 
        <msub> 
         <mi>
           λ 
         </mi> 
         <mn>
           2 
         </mn> 
        </msub> 
        <mo>
          = 
        </mo> 
        <mfrac> 
         <mn>
           1 
         </mn> 
         <mrow> 
          <mn>
            2 
          </mn> 
          <mrow> 
           <mo>
             ( 
           </mo> 
           <mrow> 
            <mn>
              1 
            </mn> 
            <mo>
              − 
            </mo> 
            <msup> 
             <mi>
               r 
             </mi> 
             <mn>
               2 
             </mn> 
            </msup> 
           </mrow> 
           <mo>
             ) 
           </mo> 
          </mrow> 
         </mrow> 
        </mfrac> 
       </mrow> 
      </math>; 
      <math xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
        <msub> 
         <mi>
           λ 
         </mi> 
         <mn>
           3 
         </mn> 
        </msub> 
        <mo>
          = 
        </mo> 
        <mfrac> 
         <mrow> 
          <mo>
            − 
          </mo> 
          <mi>
            r 
          </mi> 
         </mrow> 
         <mrow> 
          <mn>
            1 
          </mn> 
          <mo>
            − 
          </mo> 
          <msup> 
           <mi>
             r 
           </mi> 
           <mn>
             2 
           </mn> 
          </msup> 
         </mrow> 
        </mfrac> 
       </mrow> 
      </math> (26)</p>
     <p>and the partition function, Equation (20)</p>
     <p>
      <math xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
        <mi>
          Z 
        </mi> 
        <mtext>
            
        </mtext> 
        <mtext>
            
        </mtext> 
        <mo>
          = 
        </mo> 
        <mtext>
            
        </mtext> 
        <mtext>
            
        </mtext> 
        <mn>
          2 
        </mn> 
        <mi>
          π 
        </mi> 
        <msqrt> 
         <mrow> 
          <mn>
            1 
          </mn> 
          <mo>
            − 
          </mo> 
          <msup> 
           <mi>
             r 
           </mi> 
           <mn>
             2 
           </mn> 
          </msup> 
         </mrow> 
        </msqrt> 
       </mrow> 
      </math>. (27)</p>
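     <p>As a numerical cross-check of Equations (26) and (27), the following minimal Python sketch evaluates the constraint derivatives by central finite differences. It assumes that the partition function of Equation (20) has the Gaussian-integral form Z = 2π/√(4λ₁λ₂ − λ₃²), which is consistent with Equation (25). The function names and the step size are illustrative choices and are not part of the original derivation.</p>
     <preformat preformat-type="code">
```python
import math


def log_partition(l1, l2, l3):
    """ln Z for Z = 2*pi / sqrt(4*l1*l2 - l3**2) (assumed form of Equation (20))."""
    disc = 4.0 * l1 * l2 - l3 ** 2  # positive when the Gaussian integral converges
    return math.log(2.0 * math.pi) - 0.5 * math.log(disc)


def multipliers(r):
    """Closed-form Lagrange multipliers of Equation (26) for correlation r."""
    l1 = 1.0 / (2.0 * (1.0 - r ** 2))
    l3 = -r / (1.0 - r ** 2)
    return l1, l1, l3


def constraint_moments(l1, l2, l3, eps=1e-6):
    """Central-difference estimates of -d(ln Z)/d(lambda_i), i = 1, 2, 3."""
    d1 = (log_partition(l1 + eps, l2, l3) - log_partition(l1 - eps, l2, l3)) / (2.0 * eps)
    d2 = (log_partition(l1, l2 + eps, l3) - log_partition(l1, l2 - eps, l3)) / (2.0 * eps)
    d3 = (log_partition(l1, l2, l3 + eps) - log_partition(l1, l2, l3 - eps)) / (2.0 * eps)
    return -d1, -d2, -d3


if __name__ == "__main__":
    r = 0.5  # illustrative correlation coefficient, not a value from the survey
    lam = multipliers(r)
    print(constraint_moments(*lam))       # estimates of the moments of u*u, v*v and u*v
    print(math.exp(log_partition(*lam)))  # compare with 2*pi*sqrt(1 - r**2)
```
     </preformat>
     <p>Under these assumptions, the three finite-difference moments should reproduce the constraint values 1, 1, and r to within the truncation error of the difference scheme, and the exponentiated log-partition function should agree with Equation (27).</p>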
     <p>The exact probability density, Equation (21), is then found to be</p>
     <p>
      <math display="inline" xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
        <mi>
          p 
        </mi> 
        <mrow> 
         <mo>
           ( 
         </mo> 
         <mrow> 
          <mi>
            u 
          </mi> 
          <mo>
            , 
          </mo> 
          <mi>
            v 
          </mi> 
         </mrow> 
         <mo>
           ) 
         </mo> 
        </mrow> 
        <mo>
          = 
        </mo> 
        <mfrac> 
         <mn>
           1 
         </mn> 
         <mrow> 
          <mn>
            2 
          </mn> 
          <mi>
            π 
          </mi> 
          <msqrt> 
           <mrow> 
            <mn>
              1 
            </mn> 
            <mo>
              − 
            </mo> 
            <msup> 
             <mi>
               r 
             </mi> 
             <mn>
               2 
             </mn> 
            </msup> 
           </mrow> 
          </msqrt> 
         </mrow> 
        </mfrac> 
        <mi>
          exp 
        </mi> 
        <mrow> 
         <mo>
           ( 
         </mo> 
         <mrow> 
          <mo>
            − 
          </mo> 
          <mfrac> 
           <mn>
             1 
           </mn> 
           <mrow> 
            <mn>
              2 
            </mn> 
            <mrow> 
             <mo>
               ( 
             </mo> 
             <mrow> 
              <mn>
                1 
              </mn> 
              <mo>
                − 
              </mo> 
              <msup> 
               <mi>
                 r 
               </mi> 
               <mn>
                 2 
               </mn> 
              </msup> 
             </mrow> 
             <mo>
               ) 
             </mo> 
            </mrow> 
           </mrow> 
          </mfrac> 
          <mrow> 
           <mo>
             ( 
           </mo> 
           <mrow> 
            <msup> 
             <mi>
               u 
             </mi> 
             <mn>
               2 
             </mn> 
            </msup> 
            <mo>
              + 
            </mo> 
            <msup> 
             <mi>
               v 
             </mi> 
             <mn>
               2 
             </mn> 
            </msup> 
            <mo>
              − 
            </mo> 
            <mn>
              2 
            </mn> 
            <mi>
              r 
            </mi> 
            <mi>
              u 
            </mi> 
            <mi>
              v 
            </mi> 
           </mrow> 
           <mo>
             ) 
           </mo> 
          </mrow> 
         </mrow> 
         <mo>
           ) 
         </mo> 
        </mrow> 
       </mrow> 
      </math>, (28)</p>
     <p>or</p>
     <p>
      <math display="inline" xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
        <mi>
          p 
        </mi> 
        <mrow> 
         <mo>
           ( 
         </mo> 
         <mrow> 
          <mi>
            x 
          </mi> 
          <mo>
            , 
          </mo> 
          <mi>
            y 
          </mi> 
         </mrow> 
         <mo>
           ) 
         </mo> 
        </mrow> 
        <mtext>
            
        </mtext> 
        <mtext>
            
        </mtext> 
        <mo>
          = 
        </mo> 
        <mtext>
            
        </mtext> 
        <mtext>
            
        </mtext> 
        <mfrac> 
         <mrow> 
          <mi>
            exp 
          </mi> 
          <mrow> 
           <mo>
             ( 
           </mo> 
           <mrow> 
            <mo>
              − 
            </mo> 
            <mfrac> 
             <mn>
               1 
             </mn> 
             <mrow> 
              <mn>
                2 
              </mn> 
              <mrow> 
               <mo>
                 ( 
               </mo> 
               <mrow> 
                <mn>
                  1 
                </mn> 
                <mo>
                  − 
                </mo> 
                <msup> 
                 <mi>
                   r 
                 </mi> 
                 <mn>
                   2 
                 </mn> 
                </msup> 
               </mrow> 
               <mo>
                 ) 
               </mo> 
              </mrow> 
             </mrow> 
            </mfrac> 
            <mrow> 
             <mo>
               ( 
             </mo> 
             <mrow> 
              <msup> 
               <mrow> 
                <mrow> 
                 <mo>
                   ( 
                 </mo> 
                 <mrow> 
                  <mfrac> 
                   <mrow> 
                    <mi>
                      x 
                    </mi> 
                    <mo>
                      − 
                    </mo> 
                    <msub> 
                     <mi>
                       m 
                     </mi> 
                     <mn>
                       1 
                     </mn> 
                    </msub> 
                   </mrow> 
                   <mrow> 
                    <msub> 
                     <mi>
                       s 
                     </mi> 
                     <mn>
                       1 
                     </mn> 
                    </msub> 
                   </mrow> 
                  </mfrac> 
                 </mrow> 
                 <mo>
                   ) 
                 </mo> 
                </mrow> 
               </mrow> 
               <mn>
                 2 
               </mn> 
              </msup> 
              <mo>
                + 
              </mo> 
              <msup> 
               <mrow> 
                <mrow> 
                 <mo>
                   ( 
                 </mo> 
                 <mrow> 
                  <mfrac> 
                   <mrow> 
                    <mi>
                      y 
                    </mi> 
                    <mo>
                      − 
                    </mo> 
                    <msub> 
                     <mi>
                       m 
                     </mi> 
                     <mn>
                       2 
                     </mn> 
                    </msub> 
                   </mrow> 
                   <mrow> 
                    <msub> 
                     <mi>
                       s 
                     </mi> 
                     <mn>
                       2 
                     </mn> 
                    </msub> 
                   </mrow> 
                  </mfrac> 
                 </mrow> 
                 <mo>
                   ) 
                 </mo> 
                </mrow> 
               </mrow> 
               <mn>
                 2 
               </mn> 
              </msup> 
              <mo>
                − 
              </mo> 
              <mn>
                2 
              </mn> 
              <mi>
                r 
              </mi> 
              <mrow> 
               <mo>
                 ( 
               </mo> 
               <mrow> 
                <mfrac> 
                 <mrow> 
                  <mi>
                    x 
                  </mi> 
                  <mo>
                    − 
                  </mo> 
                  <msub> 
                   <mi>
                     m 
                   </mi> 
                   <mn>
                     1 
                   </mn> 
                  </msub> 
                 </mrow> 
                 <mrow> 
                  <msub> 
                   <mi>
                     s 
                   </mi> 
                   <mn>
                     1 
                   </mn> 
                  </msub> 
                 </mrow> 
                </mfrac> 
               </mrow> 
               <mo>
                 ) 
               </mo> 
              </mrow> 
              <mrow> 
               <mo>
                 ( 
               </mo> 
               <mrow> 
                <mfrac> 
                 <mrow> 
                  <mi>
                    y 
                  </mi> 
                  <mo>
                    − 
                  </mo> 
                  <msub> 
                   <mi>
                     m 
                   </mi> 
                   <mn>
                     2 
                   </mn> 
                  </msub> 
                 </mrow> 
                 <mrow> 
                  <msub> 
                   <mi>
                     s 
                   </mi> 
                   <mn>
                     2 
                   </mn> 
                  </msub> 
                 </mrow> 
                </mfrac> 
               </mrow> 
               <mo>
                 ) 
               </mo> 
              </mrow> 
             </mrow> 
             <mo>
               ) 
             </mo> 
            </mrow> 
           </mrow> 
           <mo>
             ) 
           </mo> 
          </mrow> 
         </mrow> 
         <mrow> 
          <mn>
            2 
          </mn> 
          <mi>
            π 
          </mi> 
          <msub> 
           <mi>
             s 
           </mi> 
           <mn>
             1 
           </mn> 
          </msub> 
          <msub> 
           <mi>
             s 
           </mi> 
           <mn>
             2 
           </mn> 
          </msub> 
          <msqrt> 
           <mrow> 
            <mn>
              1 
            </mn> 
            <mo>
              − 
            </mo> 
            <msup> 
             <mi>
               r 
             </mi> 
             <mn>
               2 
             </mn> 
            </msup> 
           </mrow> 
          </msqrt> 
         </mrow> 
        </mfrac> 
       </mrow> 
      </math> (29)</p>
     <p>in terms of the original coordinates 
      <math display="inline" xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
        <mi>
          x 
        </mi> 
        <mo>
          , 
        </mo> 
        <mi>
          y 
        </mi> 
       </mrow> 
      </math>. That the solution 
      <math display="inline" xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
        <mi>
          p 
        </mi> 
        <mrow> 
         <mo>
           ( 
         </mo> 
         <mstyle mathvariant="bold" mathsize="normal"> 
          <mi>
            x 
          </mi> 
         </mstyle> 
         <mo>
           ) 
         </mo> 
        </mrow> 
       </mrow> 
      </math> of the variational Equation (14) actually yields the absolute maximum of the entropy when substituted into expression (9) has been established through general arguments by Jaynes <xref ref-type="bibr" rid="scirp.144605-20">
       [20]
      </xref>.</p>
     <p>Equation (29), which is the maximum entropy solution to the problem of two correlated variables 
      <math xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
        <mrow> 
         <mo>
           ( 
         </mo> 
         <mrow> 
          <mi>
            X 
          </mi> 
          <mo>
            , 
          </mo> 
          <mi>
            Y 
          </mi> 
         </mrow> 
         <mo>
           ) 
         </mo> 
        </mrow> 
       </mrow> 
      </math> with constraints on their means, variances, and correlation, is recognized to be a bivariate normal probability density function. It is therefore the most probable density function of 
      <math xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
        <mrow> 
         <mo>
           ( 
         </mo> 
         <mrow> 
          <mi>
            X 
          </mi> 
          <mo>
            , 
          </mo> 
          <mi>
            Y 
          </mi> 
         </mrow> 
         <mo>
           ) 
         </mo> 
        </mrow> 
       </mrow> 
      </math> that can be constructed subject to the same set of constraints. Upon carrying out the inverse logarithmic transformation to return to the original variables 
      <math xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
        <mrow> 
         <mo>
           ( 
         </mo> 
         <mrow> 
          <mi>
            H 
          </mi> 
          <mo>
            , 
          </mo> 
          <mi>
            W 
          </mi> 
         </mrow> 
         <mo>
           ) 
         </mo> 
        </mrow> 
       </mrow> 
      </math> of height and weight, one obtains Equation (2), the bivariate lognormal probability density. The appearance of the factor 
      <math xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
        <msub> 
         <mi>
           s 
         </mi> 
         <mn>
           1 
         </mn> 
        </msub> 
        <msub> 
         <mi>
           s 
         </mi> 
         <mn>
           2 
         </mn> 
        </msub> 
       </mrow> 
      </math> in the denominator of Equation (29) and 
      <math xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
        <mi>
          h 
        </mi> 
        <mtext>
            
        </mtext> 
        <mi>
          w 
        </mi> 
        <mtext>
            
        </mtext> 
        <msub> 
         <mi>
           s 
         </mi> 
         <mn>
           1 
         </mn> 
        </msub> 
        <msub> 
         <mi>
           s 
         </mi> 
         <mn>
           2 
         </mn> 
        </msub> 
       </mrow> 
      </math> in the denominator of Equation (2) results from the Jacobians of the transformations derived from</p>
     <p>
      <math xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
        <mi>
          p 
        </mi> 
        <mrow> 
         <mo>
           ( 
         </mo> 
         <mrow> 
          <mi>
            h 
          </mi> 
          <mo>
            , 
          </mo> 
          <mi>
            w 
          </mi> 
         </mrow> 
         <mo>
           ) 
         </mo> 
        </mrow> 
        <mtext>
          d 
        </mtext> 
        <mi>
          h 
        </mi> 
        <mtext>
            
        </mtext> 
        <mtext>
          d 
        </mtext> 
        <mi>
          w 
        </mi> 
        <mo>
          = 
        </mo> 
        <mi>
          p 
        </mi> 
        <mrow> 
         <mo>
           ( 
         </mo> 
         <mrow> 
          <mi>
            x 
          </mi> 
          <mo>
            , 
          </mo> 
          <mi>
            y 
          </mi> 
         </mrow> 
         <mo>
           ) 
         </mo> 
        </mrow> 
        <mtext>
          d 
        </mtext> 
        <mi>
          x 
        </mi> 
        <mtext>
            
        </mtext> 
        <mtext>
          d 
        </mtext> 
        <mi>
          y 
        </mi> 
        <mo>
          = 
        </mo> 
        <mi>
          p 
        </mi> 
        <mrow> 
         <mo>
           ( 
         </mo> 
         <mrow> 
          <mi>
            u 
          </mi> 
          <mo>
            , 
          </mo> 
          <mi>
            v 
          </mi> 
         </mrow> 
         <mo>
           ) 
         </mo> 
        </mrow> 
        <mtext>
          d 
        </mtext> 
        <mi>
          u 
        </mi> 
        <mtext>
            
        </mtext> 
        <mtext>
          d 
        </mtext> 
        <mi>
          v 
        </mi> 
       </mrow> 
      </math>. (30)</p>
     <p>Since the logarithmic transformation of coordinates and its inverse transformation do not introduce new empirical information, it follows that Equation (2) is the maximum entropy density in coordinates 
      <math xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
        <mrow> 
         <mo>
           ( 
         </mo> 
         <mrow> 
          <mi>
            h 
          </mi> 
          <mo>
            , 
          </mo> 
          <mi>
            w 
          </mi> 
         </mrow> 
         <mo>
           ) 
         </mo> 
        </mrow> 
       </mrow> 
      </math>, given that Equation (29) is the maximum entropy density in coordinates 
      <math xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
        <mrow> 
         <mo>
           ( 
         </mo> 
         <mrow> 
          <mi>
            x 
          </mi> 
          <mo>
            = 
          </mo> 
          <mi>
            ln 
          </mi> 
          <mrow> 
           <mo>
             ( 
           </mo> 
           <mi>
             h 
           </mi> 
           <mo>
             ) 
           </mo> 
          </mrow> 
          <mo>
            , 
          </mo> 
          <mi>
            y 
          </mi> 
          <mo>
            = 
          </mo> 
          <mi>
            ln 
          </mi> 
          <mrow> 
           <mo>
             ( 
           </mo> 
           <mi>
             w 
           </mi> 
           <mo>
             ) 
           </mo> 
          </mrow> 
         </mrow> 
         <mo>
           ) 
         </mo> 
        </mrow> 
       </mrow> 
      </math>. Although the argument is sound, there is a subtle matter regarding the entropy of continuous distributions that will be taken up in Section 4.</p>
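     <p>The change of variables expressed by Equation (30) can be illustrated with a minimal Python sketch. It codes Equation (29), multiplies by the Jacobian factor 1/(h w) of the transformation x = ln(h), y = ln(w), and checks by a simple midpoint rule that the resulting density in (h, w) is still normalized. The parameter values, the grid resolution, and the function names are hypothetical choices made only for illustration; they are not the survey estimates.</p>
     <preformat preformat-type="code">
```python
import math


def p_xy(x, y, m1, m2, s1, s2, r):
    """Bivariate normal density of Equation (29)."""
    u = (x - m1) / s1
    v = (y - m2) / s2
    q = (u * u + v * v - 2.0 * r * u * v) / (2.0 * (1.0 - r * r))
    return math.exp(-q) / (2.0 * math.pi * s1 * s2 * math.sqrt(1.0 - r * r))


def p_hw(h, w, m1, m2, s1, s2, r):
    """Density in (h, w): p(x, y) times the Jacobian (dx/dh)*(dy/dw) = 1/(h*w)."""
    return p_xy(math.log(h), math.log(w), m1, m2, s1, s2, r) / (h * w)


def integrate_hw(m1, m2, s1, s2, r, cells=400, span=6.0):
    """Midpoint-rule integral of p_hw over a box covering +/- span sigma in log space."""
    h_lo, h_hi = math.exp(m1 - span * s1), math.exp(m1 + span * s1)
    w_lo, w_hi = math.exp(m2 - span * s2), math.exp(m2 + span * s2)
    dh = (h_hi - h_lo) / cells
    dw = (w_hi - w_lo) / cells
    total = 0.0
    for i in range(cells):
        h = h_lo + (i + 0.5) * dh
        for j in range(cells):
            w = w_lo + (j + 0.5) * dw
            total += p_hw(h, w, m1, m2, s1, s2, r)
    return total * dh * dw


if __name__ == "__main__":
    # Hypothetical parameters chosen only to exercise the formulas.
    print(integrate_hw(m1=0.0, m2=0.0, s1=0.25, s2=0.3, r=0.5))
```
     </preformat>
     <p>The integration is carried out directly in the (h, w) coordinates, so a result close to unity reflects the Jacobian factor rather than the normalization of Equation (29) alone.</p>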
    </sec>
   </sec>
   <sec id="s3">
    <title>3. Implications of Maximum Entropy</title>
    <p>The preceding application of the PME has established that the bivariate lognormal distribution of human height and weight, previously deduced on the basis of empirical sampling, is derivable by a theoretical procedure. The only empirical input in this procedure is the prior information serving as constraints. Apart from this information, the resulting probability density (2) incorporates no additional assumptions, explicit or implicit, introduced through any hypothetical model. In this regard, the PME is said to generate the most objective solution.</p>
    <p>To address the question of whether the PME can actually account for the extraordinary match between the theoretical and sampling distributions of human height and weight, one must understand the probabilistic implications of the PME. For illustrative purposes, consider the male cohort (sample size N &gt; 4000) of the Anthropometric Survey of U.S. Army Personnel (ANSUR) <xref ref-type="bibr" rid="scirp.144605-7">
      [7]
     </xref>, which was the sampling distribution used in Reference <xref ref-type="bibr" rid="scirp.144605-3">
      [3]
     </xref> to test the statistical predictions of the hypothesized lognormal distribution of height and weight. For simplicity, divide the x-y plane (with x and y the variables in Equation (29)) into a rectilinear grid of 
     <math display="inline" xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <msup> 
        <mi>
          n 
        </mi> 
        <mn>
          2 
        </mn> 
       </msup> 
      </mrow> 
     </math> cells of equal size labeled by indices ( 
     <math display="inline" xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <mi>
         i 
       </mi> 
       <mo>
         = 
       </mo> 
       <mn>
         1 
       </mn> 
       <mo>
         , 
       </mo> 
       <mo>
         ⋯ 
       </mo> 
       <mo>
         , 
       </mo> 
       <mi>
         n 
       </mi> 
      </mrow> 
     </math>; 
     <math display="inline" xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <mi>
         j 
       </mi> 
       <mo>
         = 
       </mo> 
       <mn>
         1 
       </mn> 
       <mo>
         , 
       </mo> 
       <mo>
         ⋯ 
       </mo> 
       <mo>
         , 
       </mo> 
       <mi>
         n 
       </mi> 
      </mrow> 
     </math>). Each cell represents a specified range of heights and weights into which an observed number 
     <math display="inline" xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <msub> 
        <mi>
          n 
        </mi> 
        <mrow> 
         <mi>
           i 
         </mi> 
         <mo>
           , 
         </mo> 
         <mi>
           j 
         </mi> 
        </mrow> 
       </msub> 
      </mrow> 
     </math> of sampled individuals falls. Conservation of participants requires that</p>
    <p>
     <math display="inline" xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <mstyle displaystyle="true"> 
        <munderover> 
         <mo>
           ∑ 
         </mo> 
         <mrow> 
          <mi>
            i 
          </mi> 
          <mo>
            , 
          </mo> 
          <mi>
            j 
          </mi> 
          <mo>
            = 
          </mo> 
          <mn>
            1 
          </mn> 
         </mrow> 
         <mi>
           n 
         </mi> 
        </munderover> 
        <mrow> 
         <msub> 
          <mi>
            n 
          </mi> 
          <mrow> 
           <mi>
             i 
           </mi> 
           <mo>
             , 
           </mo> 
           <mi>
             j 
           </mi> 
          </mrow> 
         </msub> 
        </mrow> 
       </mstyle> 
       <mo>
         = 
       </mo> 
       <mi>
         N 
       </mi> 
      </mrow> 
     </math>. (31)</p>
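    <p>The bookkeeping behind Equation (31) can be sketched in a few lines of Python. The example below is a minimal illustration that bins synthetic correlated pairs, used here as a hypothetical stand-in for the survey data, into a grid of equal-size cells and confirms that the cell counts sum to the sample size. The grid limits, the sample size, the random seed, and the function names are assumptions made for the example, and the cell indices in the code start at 0 rather than 1.</p>
    <preformat preformat-type="code">
```python
import math
import random


def sample_pairs(count, r, seed=0):
    """Draw standardized pairs (u, v) with correlation r (synthetic stand-in data)."""
    rng = random.Random(seed)
    pairs = []
    for _ in range(count):
        g1, g2 = rng.gauss(0.0, 1.0), rng.gauss(0.0, 1.0)
        pairs.append((g1, r * g1 + math.sqrt(1.0 - r * r) * g2))
    return pairs


def bin_counts(points, n, x_lo, x_hi, y_lo, y_hi):
    """Count points in an n-by-n grid of equal-size cells; counts[i][j] plays the role of n_ij."""
    counts = [[0] * n for _ in range(n)]
    dx = (x_hi - x_lo) / n
    dy = (y_hi - y_lo) / n
    for x, y in points:
        # Clamp the indices so that points on or beyond the boundary land in an edge cell.
        i = min(max(int((x - x_lo) / dx), 0), n - 1)
        j = min(max(int((y - y_lo) / dy), 0), n - 1)
        counts[i][j] += 1
    return counts


if __name__ == "__main__":
    points = sample_pairs(4000, 0.5, seed=1)
    counts = bin_counts(points, 10, -4.0, 4.0, -4.0, 4.0)
    print(sum(sum(row) for row in counts) == len(points))  # conservation of participants
```
    </preformat>
    <p>Because every point is assigned to exactly one cell, the sum over all cells equals the number of sampled pairs regardless of how the grid limits are chosen.</p>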
    <p>In what follows it will be more convenient to relabel cells sequentially by a single index, rather than by a Cartesian double index. This can be accomplished by labeling a Cartesian cell 
     <math display="inline" xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <mrow> 
        <mo>
          ( 
        </mo> 
        <mrow> 
         <msub> 
          <mi>
            x 
          </mi> 
          <mi>
            i 
          </mi> 
         </msub> 
         <mo>
           , 
         </mo> 
         <msub> 
          <mi>
            y 
          </mi> 
          <mi>
            j 
          </mi> 
         </msub> 
        </mrow> 
        <mo>
          ) 
        </mo> 
       </mrow> 
      </mrow> 
     </math> by the index 
     <math display="inline" xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <mi>
         k 
       </mi> 
       <mo>
         = 
       </mo> 
       <mrow> 
        <mo>
          ( 
        </mo> 
        <mrow> 
         <mi>
           j 
         </mi> 
         <mo>
           − 
         </mo> 
         <mn>
           1 
         </mn> 
        </mrow> 
        <mo>
          ) 
        </mo> 
       </mrow> 
       <mi>
         n 
       </mi> 
       <mo>
         + 
       </mo> 
       <mi>
         i 
       </mi> 
      </mrow> 
     </math> in which 
     <math display="inline" xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <mi>
         k 
       </mi> 
       <mo>
         = 
       </mo> 
       <mn>
         1 
       </mn> 
       <mo>
         , 
       </mo> 
       <mn>
         2 
       </mn> 
       <mo>
         , 
       </mo> 
       <mo>
         ⋯ 
       </mo> 
       <mo>
         , 
       </mo> 
       <msup> 
        <mi>
          n 
        </mi> 
        <mn>
          2 
        </mn> 
       </msup> 
      </mrow> 
     </math>. For example, cell 
     <math display="inline" xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <mrow> 
        <mo>
          ( 
        </mo> 
        <mrow> 
         <msub> 
          <mi>
            x 
          </mi> 
          <mn>
            1 
          </mn> 
         </msub> 
         <mo>
           , 
         </mo> 
         <msub> 
          <mi>
            y 
          </mi> 
          <mn>
            1 
          </mn> 
         </msub> 
        </mrow> 
        <mo>
          ) 
        </mo> 
       </mrow> 
      </mrow> 
     </math> becomes cell 
     <math display="inline" xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <mi>
         k 
       </mi> 
       <mo>
         = 
       </mo> 
       <mn>
         1 
       </mn> 
      </mrow> 
     </math>, cell 
     <math display="inline" xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <mrow> 
        <mo>
          ( 
        </mo> 
        <mrow> 
         <msub> 
          <mi>
            x 
          </mi> 
          <mn>
            1 
          </mn> 
         </msub> 
         <mo>
           , 
         </mo> 
         <msub> 
          <mi>
            y 
          </mi> 
          <mi>
            n 
          </mi> 
         </msub> 
        </mrow> 
        <mo>
          ) 
        </mo> 
       </mrow> 
      </mrow> 
     </math> becomes cell 
     <math display="inline" xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <mi>
         k 
       </mi> 
       <mo>
         = 
       </mo> 
       <msup> 
        <mi>
          n 
        </mi> 
        <mn>
          2 
        </mn> 
       </msup> 
       <mo>
         − 
       </mo> 
       <mi>
         n 
       </mi> 
       <mo>
         + 
       </mo> 
       <mn>
         1 
       </mn> 
      </mrow> 
     </math>, cell 
     <math display="inline" xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <mrow> 
        <mo>
          ( 
        </mo> 
        <mrow> 
         <msub> 
          <mi>
            x 
          </mi> 
          <mi>
            n 
          </mi> 
         </msub> 
         <mo>
           , 
         </mo> 
         <msub> 
          <mi>
            y 
          </mi> 
          <mn>
            1 
          </mn> 
         </msub> 
        </mrow> 
        <mo>
          ) 
        </mo> 
       </mrow> 
      </mrow> 
     </math> becomes cell 
     <math display="inline" xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <mi>
         k 
       </mi> 
       <mo>
         = 
       </mo> 
       <mi>
         n 
       </mi> 
      </mrow> 
     </math>, and cell 
     <math display="inline" xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <mrow> 
        <mo>
          ( 
        </mo> 
        <mrow> 
         <msub> 
          <mi>
            x 
          </mi> 
          <mi>
            n 
          </mi> 
         </msub> 
         <mo>
           , 
         </mo> 
         <msub> 
          <mi>
            y 
          </mi> 
          <mi>
            n 
          </mi> 
         </msub> 
        </mrow> 
        <mo>
          ) 
        </mo> 
       </mrow> 
      </mrow> 
     </math> becomes cell 
     <math display="inline" xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <mi>
         k 
       </mi> 
       <mo>
         = 
       </mo> 
       <msup> 
        <mi>
          n 
        </mi> 
        <mn>
          2 
        </mn> 
       </msup> 
      </mrow> 
     </math>. Then 
     <math display="inline" xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <msub> 
        <mi>
          n 
        </mi> 
        <mi>
          k 
        </mi> 
       </msub> 
      </mrow> 
     </math> is the number of individuals falling into the 
     <math display="inline" xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <msup> 
        <mi>
          k 
        </mi> 
        <mrow> 
         <mi>
           t 
         </mi> 
         <mi>
           h 
         </mi> 
        </mrow> 
       </msup> 
      </mrow> 
     </math> cell, and Equation (31) becomes</p>
    <p>
     <math display="inline" xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <mstyle displaystyle="true"> 
        <munderover> 
         <mo>
           ∑ 
         </mo> 
         <mrow> 
          <mi>
            k 
          </mi> 
          <mo>
            = 
          </mo> 
          <mn>
            1 
          </mn> 
         </mrow> 
         <mi>
           K 
         </mi> 
        </munderover> 
        <mrow> 
         <msub> 
          <mi>
            n 
          </mi> 
          <mi>
            k 
          </mi> 
         </msub> 
        </mrow> 
       </mstyle> 
       <mo>
         = 
       </mo> 
       <mi>
         N 
       </mi> 
      </mrow> 
     </math> (32)</p>
    <p>with 
     <math display="inline" xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <mi>
         K 
       </mi> 
       <mo>
         = 
       </mo> 
       <msup> 
        <mi>
          n 
        </mi> 
        <mn>
          2 
        </mn> 
       </msup> 
      </mrow> 
     </math> the maximum number of cells. The number of ways 
     <math xmlns="http://www.w3.org/1998/Math/MathML"> <mi>
        Ω 
      </mi> 
     </math> of distributing the N individuals with occupation numbers 
     <math display="inline" xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <mrow> 
        <mo>
          { 
        </mo> 
        <mrow> 
         <msub> 
          <mi>
            n 
          </mi> 
          <mi>
            k 
          </mi> 
         </msub> 
        </mrow> 
        <mo>
          } 
        </mo> 
       </mrow> 
      </mrow> 
     </math> over the K cells is given by the multinomial coefficient</p>
    <p>
     <math xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <mi>
         Ω 
       </mi> 
       <mtext>
           
       </mtext> 
       <mtext>
           
       </mtext> 
       <mo>
         = 
       </mo> 
       <mfrac> 
        <mrow> 
         <mi>
           N 
         </mi> 
         <mo>
           ! 
         </mo> 
        </mrow> 
        <mrow> 
         <msub> 
          <mi>
            n 
          </mi> 
          <mn>
            1 
          </mn> 
         </msub> 
         <mo>
           ! 
         </mo> 
         <msub> 
          <mi>
            n 
          </mi> 
          <mn>
            2 
          </mn> 
         </msub> 
         <mo>
           ! 
         </mo> 
         <mo>
           ⋯ 
         </mo> 
         <msub> 
          <mi>
            n 
          </mi> 
          <mi>
            K 
          </mi> 
         </msub> 
         <mo>
           ! 
         </mo> 
        </mrow> 
       </mfrac> 
      </mrow> 
     </math>. (33)</p>
    <p>In physics, 
     <math xmlns="http://www.w3.org/1998/Math/MathML"> <mi>
        Ω 
      </mi> 
     </math> is referred to as the volume of phase space. Of the vast number of possible ways to partition 
     <math xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <mi>
         N 
       </mi> 
       <mo>
         ≫ 
       </mo> 
       <mn>
         1 
       </mn> 
      </mrow> 
     </math> values over K cells, the partition most likely to occur is the one that maximizes 
     <math xmlns="http://www.w3.org/1998/Math/MathML"> <mi>
        Ω 
      </mi> 
     </math>. Applying to Equation (33) the simplest form of Stirling’s approximation <xref ref-type="bibr" rid="scirp.144605-21">
      [21]
     </xref> of 
     <math display="inline" xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <mi>
         z 
       </mi> 
       <mo>
         ! 
       </mo> 
      </mrow> 
     </math> for a large positive integer z</p>
    <p>
     <math display="inline" xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <mi>
         ln 
       </mi> 
       <mrow> 
        <mo>
          ( 
        </mo> 
        <mrow> 
         <mi>
           z 
         </mi> 
         <mo>
           ! 
         </mo> 
        </mrow> 
        <mo>
          ) 
        </mo> 
       </mrow> 
       <mo>
         ≈ 
       </mo> 
       <mi>
         z 
       </mi> 
       <mi>
         ln 
       </mi> 
       <mrow> 
        <mo>
          ( 
        </mo> 
        <mi>
          z 
        </mi> 
        <mo>
          ) 
        </mo> 
       </mrow> 
       <mo>
         − 
       </mo> 
       <mi>
         z 
       </mi> 
      </mrow> 
     </math>, (34)</p>
    <p>one can show that</p>
    <p>
     <math xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <mfrac> 
        <mn>
          1 
        </mn> 
        <mi>
          N 
        </mi> 
       </mfrac> 
       <mi>
         ln 
       </mi> 
       <mrow> 
        <mo>
          ( 
        </mo> 
        <mi>
          Ω 
        </mi> 
        <mo>
          ) 
        </mo> 
       </mrow> 
       <mtext>
           
       </mtext> 
       <mtext>
           
       </mtext> 
       <mtext>
           
       </mtext> 
       <mo>
         ≈ 
       </mo> 
       <mtext>
           
       </mtext> 
       <mtext>
           
       </mtext> 
       <mtext>
           
       </mtext> 
       <mo>
         − 
       </mo> 
       <mstyle displaystyle="true"> 
        <munderover> 
         <mo>
           ∑ 
         </mo> 
         <mrow> 
          <mi>
            k 
          </mi> 
          <mo>
            = 
          </mo> 
          <mn>
            1 
          </mn> 
         </mrow> 
         <mi>
           K 
         </mi> 
        </munderover> 
        <mrow> 
         <mfrac> 
          <mrow> 
           <msub> 
            <mi>
              n 
            </mi> 
            <mi>
              k 
            </mi> 
           </msub> 
          </mrow> 
          <mi>
            N 
          </mi> 
         </mfrac> 
         <mi>
           ln 
         </mi> 
         <mrow> 
          <mo>
            ( 
          </mo> 
          <mrow> 
           <mfrac> 
            <mrow> 
             <msub> 
              <mi>
                n 
              </mi> 
              <mi>
                k 
              </mi> 
             </msub> 
            </mrow> 
            <mi>
              N 
            </mi> 
           </mfrac> 
          </mrow> 
          <mo>
            ) 
          </mo> 
         </mrow> 
        </mrow> 
       </mstyle> 
      </mrow> 
     </math>. (35)</p>
    <p>In the limit of large (technically infinite) N, the ratio ( 
     <math display="inline" xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <mrow> 
        <mrow> 
         <msub> 
          <mi>
            n 
          </mi> 
          <mi>
            k 
          </mi> 
         </msub> 
        </mrow> 
        <mo>
          / 
        </mo> 
        <mi>
          N 
        </mi> 
       </mrow> 
      </mrow> 
     </math>) approaches the probability 
     <math display="inline" xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <msub> 
        <mi>
          p 
        </mi> 
        <mi>
          k 
        </mi> 
       </msub> 
      </mrow> 
     </math>, whereupon one has to an excellent approximation the relation</p>
    <p>
     <math xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <mfrac> 
        <mn>
          1 
        </mn> 
        <mi>
          N 
        </mi> 
       </mfrac> 
       <mi>
         ln 
       </mi> 
       <mrow> 
        <mo>
          ( 
        </mo> 
        <mi>
          Ω 
        </mi> 
        <mo>
          ) 
        </mo> 
       </mrow> 
       <mtext>
           
       </mtext> 
       <mtext>
           
       </mtext> 
       <mtext>
           
       </mtext> 
       <mover> 
        <mo>
          → 
        </mo> 
        <mrow> 
         <mi>
           N 
         </mi> 
         <mo>
           ≫ 
         </mo> 
         <mn>
           1 
         </mn> 
        </mrow> 
       </mover> 
       <mtext>
           
       </mtext> 
       <mtext>
           
       </mtext> 
       <mtext>
           
       </mtext> 
       <mi>
         S 
       </mi> 
       <mrow> 
        <mo>
          ( 
        </mo> 
        <mi>
          p 
        </mi> 
        <mo>
          ) 
        </mo> 
       </mrow> 
      </mrow> 
     </math> (36)</p>
    <p>between sampling frequency and entropy. Therefore, finding the distribution of frequencies 
     <math display="inline" xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <mrow> 
        <mo>
          { 
        </mo> 
        <mrow> 
         <msub> 
          <mi>
            n 
          </mi> 
          <mi>
            k 
          </mi> 
         </msub> 
        </mrow> 
        <mo>
          } 
        </mo> 
       </mrow> 
      </mrow> 
     </math> that maximizes 
     <math xmlns="http://www.w3.org/1998/Math/MathML"> <mi>
        Ω 
      </mi> 
     </math>, Equation (33), is equivalent to finding the distribution of probabilities 
     <math xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <mrow> 
        <mo>
          { 
        </mo> 
        <mrow> 
         <msub> 
          <mi>
            p 
          </mi> 
          <mrow> 
           <mi>
             x 
           </mi> 
           <mo>
             , 
           </mo> 
           <mi>
             y 
           </mi> 
          </mrow> 
         </msub> 
        </mrow> 
        <mo>
          } 
        </mo> 
       </mrow> 
      </mrow> 
     </math> that maximizes the information entropy S, Equation (12). One can therefore rearrange Equation (36) to write</p>
    <p>
     <math xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <mi>
         Ω 
       </mi> 
       <mo>
         = 
       </mo> 
       <msup> 
        <mtext>
          e 
        </mtext> 
        <mrow> 
         <mi>
           N 
         </mi> 
         <mtext>
             
         </mtext> 
         <mi>
           S 
         </mi> 
        </mrow> 
       </msup> 
      </mrow> 
     </math>. (37)</p>
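    <p>The passage from Equation (33) to Equations (35)-(37) can be checked numerically. The following is a minimal sketch, not part of the original analysis: it evaluates ln(Ω) exactly with the log-gamma function and compares ln(Ω)/N with the entropy of the frequencies. The function names and the occupation numbers (fixed proportions 1:2:3:4 over four cells) are hypothetical choices made only for illustration.</p>
    <preformat><![CDATA[
```python
import math


def log_multiplicity(counts):
    """Exact ln(Omega) for Omega = N! / (n_1! n_2! ... n_K!), Equation (33)."""
    n_total = sum(counts)
    return math.lgamma(n_total + 1) - sum(math.lgamma(n + 1) for n in counts)


def entropy(counts):
    """Entropy -sum_k p_k ln(p_k) with p_k = n_k / N; empty cells contribute zero."""
    n_total = sum(counts)
    return -sum((n / n_total) * math.log(n / n_total) for n in counts if n > 0)


if __name__ == "__main__":
    # Hypothetical occupation numbers: fixed proportions, increasing sample size N.
    proportions = [1, 2, 3, 4]
    for scale in (10, 1_000, 100_000):
        counts = [scale * w for w in proportions]
        n_total = sum(counts)
        per_sample = log_multiplicity(counts) / n_total
        print(f"N = {n_total:>9d}  ln(Omega)/N = {per_sample:.6f}  S = {entropy(counts):.6f}")
```
]]></preformat>
    <p>Under these assumptions the two printed columns approach each other as N grows, which is the content of Equation (36); the residual difference comes from the terms dropped in the simplest form of Stirling’s approximation, Equation (34).</p>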
    <p>Suppose next that 
     <math xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <msub> 
        <mi>
          Ω 
        </mi> 
        <mrow> 
         <mtext>
           ME 
         </mtext> 
        </mrow> 
       </msub> 
      </mrow> 
     </math> is the phase space volume of frequencies 
     <math display="inline" xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <mrow> 
        <mo>
          { 
        </mo> 
        <mrow> 
         <msub> 
          <mi>
            n 
          </mi> 
          <mi>
            k 
          </mi> 
         </msub> 
        </mrow> 
        <mo>
          } 
        </mo> 
       </mrow> 
      </mrow> 
     </math> with maximum entropy 
     <math display="inline" xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <msub> 
        <mi>
          S 
        </mi> 
        <mrow> 
         <mi>
           max 
         </mi> 
        </mrow> 
       </msub> 
      </mrow> 
     </math>, and 
     <math xmlns="http://www.w3.org/1998/Math/MathML"> <mi>
        Ω 
      </mi> 
     </math> is the phase space volume of some other distribution that satisfies the same constraint, Equation (32), but has lower entropy S. Then from Equation (37) the ratio 
     <math xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <mrow> 
        <mrow> 
         <msub> 
          <mi>
            Ω 
          </mi> 
          <mrow> 
           <mtext>
             ME 
           </mtext> 
          </mrow> 
         </msub> 
        </mrow> 
        <mo>
          / 
        </mo> 
        <mi>
          Ω 
        </mi> 
       </mrow> 
      </mrow> 
     </math> in terms of the entropy difference 
     <math display="inline" xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <mi>
         Δ 
       </mi> 
       <mi>
         S 
       </mi> 
      </mrow> 
     </math> is given by</p>
    <p>
     <math xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <mfrac> 
        <mrow> 
         <msub> 
          <mi>
            Ω 
          </mi> 
          <mrow> 
           <mtext>
             ME 
           </mtext> 
          </mrow> 
         </msub> 
        </mrow> 
        <mi>
          Ω 
        </mi> 
       </mfrac> 
       <mo>
         = 
       </mo> 
       <mi>
         exp 
       </mi> 
       <mrow> 
        <mo>
          ( 
        </mo> 
        <mrow> 
         <mi>
           N 
         </mi> 
         <mrow> 
          <mo>
            ( 
          </mo> 
          <mrow> 
           <msub> 
            <mi>
              S 
            </mi> 
            <mrow> 
             <mtext>
               max 
             </mtext> 
            </mrow> 
           </msub> 
           <mo>
             − 
           </mo> 
           <mi>
             S 
           </mi> 
          </mrow> 
          <mo>
            ) 
          </mo> 
         </mrow> 
        </mrow> 
        <mo>
          ) 
        </mo> 
       </mrow> 
       <mo>
         = 
       </mo> 
       <mi>
         exp 
       </mi> 
       <mrow> 
        <mo>
          ( 
        </mo> 
        <mrow> 
         <mi>
           N 
         </mi> 
         <mi>
           Δ 
         </mi> 
         <mi>
           S 
         </mi> 
        </mrow> 
        <mo>
          ) 
        </mo> 
       </mrow> 
      </mrow> 
     </math>. (38)</p>
    <p>The implications of Equation (38) can be astounding. Consider a sample size of about 4000, comparable to that of the ANSUR male cohort, and two distributions that differ in entropy by only 0.01. Evaluation of Equation (38) then leads to 
     <math xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <mrow> 
        <mo>
          ( 
        </mo> 
        <mrow> 
         <mrow> 
          <mrow> 
           <msub> 
            <mi>
              Ω 
            </mi> 
            <mrow> 
             <mtext>
               ME 
             </mtext> 
            </mrow> 
           </msub> 
          </mrow> 
          <mo>
            / 
          </mo> 
          <mi>
            Ω 
          </mi> 
         </mrow> 
        </mrow> 
        <mo>
          ) 
        </mo> 
       </mrow> 
       <mo>
         &gt; 
       </mo> 
       <msup> 
        <mrow> 
         <mn>
           10 
         </mn> 
        </mrow> 
        <mrow> 
         <mn>
           17 
         </mn> 
        </mrow> 
       </msup> 
      </mrow> 
     </math>. In words, the maximum entropy distribution 
     <math xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <msub> 
        <mi>
          Ω 
        </mi> 
        <mrow> 
         <mtext>
           ME 
         </mtext> 
        </mrow> 
       </msub> 
      </mrow> 
     </math> is one hundred thousand million million times more likely to occur by chance than the distribution 
     <math xmlns="http://www.w3.org/1998/Math/MathML"> <mi>
        Ω 
      </mi> 
     </math>.</p>
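    <p>The arithmetic behind this estimate is short enough to reproduce directly. The sketch below simply evaluates Equation (38) for N = 4000 and ΔS = 0.01; the function name is a hypothetical label chosen for illustration, and the result is reported as a base-10 logarithm to avoid overflow for larger samples.</p>
    <preformat><![CDATA[
```python
import math


def log10_multiplicity_ratio(n_samples, delta_s):
    """log10 of Omega_ME / Omega = exp(N * delta_S), Equation (38)."""
    return n_samples * delta_s / math.log(10)


if __name__ == "__main__":
    exponent = log10_multiplicity_ratio(4000, 0.01)
    print(f"Omega_ME / Omega is about 10^{exponent:.2f}")
```
]]></preformat>
    <p>Here exp(40) is about 2.35 × 10<sup>17</sup>, consistent with the bound quoted above. Because the exponent is linear in N, doubling the sample size squares the ratio.</p>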
    <p>The foregoing estimate is a prediction. If the height and weight of individuals in some group of people are measured, and the only prior information available to an analyst is the set of means, variances, and correlation coefficient of the logarithms of the two variables, then the most likely distribution to account for the sample is the bivariate lognormal distribution, such as was initially found empirically <xref ref-type="bibr" rid="scirp.144605-3">
      [3]
     </xref>. The larger the sample size, the more probable it is that the maximum entropy distribution will be observed in any sampling. As just illustrated, a maximum entropy distribution can be overwhelmingly more probable than any other distribution subject to the same constraints. In such cases, it is no longer a mystery how a probability density with just a few parameters can suffice to predict correctly an extensive array of testable moments and correlations.</p>
   </sec>
   <sec id="s4">
    <title>4. Maximum Entropy of Continuous Random Variables</title>
    <p>In Section 2 it was shown that a bivariate lognormal PDF, Equation (2), is the maximum entropy solution to the variational equation for the distribution of human height and weight. The procedure entailed first showing that a logarithmic transformation of the variables led to a bivariate normal PDF, Equation (29). It was then argued that a deterministic (in contrast to stochastic) coordinate transformation and its inverse do not change the uncertainty in knowledge about the state of a system. From this line of reasoning, it followed that the inverse transformation of the PDF back to the original variables likewise described a maximum entropy distribution.</p>
    <p>The preceding reasoning is not wrong, but there is a subtle underlying issue that should not be glossed over. Since the two distributions, bivariate normal (BN) and bivariate lognormal (BLN), are both maximum entropy distributions of the same system, although expressed in different coordinates, one might have expected that they would have the same value of maximum entropy. There is no fundamental requirement that this be the case, and, in fact, it is not the case. Straightforward calculations using PDFs (29) and (2) lead to the entropy expressions:</p>
    <p>
     <math xmlns="http://www.w3.org/1998/Math/MathML"> <mtable columnalign="left"> 
       <mtr> 
        <mtd> 
         <mtext>
           Bivariate Normal 
         </mtext> 
         <mtext>
             
         </mtext> 
         <mtext>
             
         </mtext> 
         <mtext>
             
         </mtext> 
         <mtext>
             
         </mtext> 
         <mtext>
             
         </mtext> 
         <mtext>
             
         </mtext> 
         <mtext>
             
         </mtext> 
         <mtext>
             
         </mtext> 
         <mtext>
             
         </mtext> 
         <mtext>
             
         </mtext> 
         <mtext>
             
         </mtext> 
         <mtext>
             
         </mtext> 
         <mtext>
             
         </mtext> 
         <mtext>
             
         </mtext> 
         <mtext>
             
         </mtext> 
         <mtext>
             
         </mtext> 
         <mtext>
             
         </mtext> 
         <mtext>
             
         </mtext> 
         <mtext>
             
         </mtext> 
         <msub> 
          <mi>
            S 
          </mi> 
          <mrow> 
           <mtext>
             BN 
           </mtext> 
          </mrow> 
         </msub> 
         <mo>
           = 
         </mo> 
         <mi>
           ln 
         </mi> 
         <mrow> 
          <mo>
            ( 
          </mo> 
          <mrow> 
           <mn>
             2 
           </mn> 
           <mi>
             π 
           </mi> 
           <mi>
             e 
           </mi> 
           <msub> 
            <mi>
              s 
            </mi> 
            <mn>
              1 
            </mn> 
           </msub> 
           <msub> 
            <mi>
              s 
            </mi> 
            <mn>
              2 
            </mn> 
           </msub> 
           <msqrt> 
            <mrow> 
             <mn>
               1 
             </mn> 
             <mo>
               − 
             </mo> 
             <msup> 
              <mi>
                r 
              </mi> 
              <mn>
                2 
              </mn> 
             </msup> 
            </mrow> 
           </msqrt> 
          </mrow> 
          <mo>
            ) 
          </mo> 
         </mrow> 
        </mtd> 
       </mtr> 
       <mtr> 
        <mtd> 
         <mtext>
           Bivariate Lognormal 
         </mtext> 
         <msub> 
          <mi>
            S 
          </mi> 
          <mrow> 
           <mtext>
             BLN 
           </mtext> 
          </mrow> 
         </msub> 
         <mo>
           = 
         </mo> 
         <mi>
           ln 
         </mi> 
         <mrow> 
          <mo>
            ( 
          </mo> 
          <mrow> 
           <mn>
             2 
           </mn> 
           <mi>
             π 
           </mi> 
           <mi>
             e 
           </mi> 
           <msub> 
            <mi>
              s 
            </mi> 
            <mn>
              1 
            </mn> 
           </msub> 
           <msub> 
            <mi>
              s 
            </mi> 
            <mn>
              2 
            </mn> 
           </msub> 
           <msqrt> 
            <mrow> 
             <mn>
               1 
             </mn> 
             <mo>
               − 
             </mo> 
             <msup> 
              <mi>
                r 
              </mi> 
              <mn>
                2 
              </mn> 
             </msup> 
            </mrow> 
           </msqrt> 
           <mtext>
               
           </mtext> 
           <msup> 
            <mtext>
              e 
            </mtext> 
            <mrow> 
             <msub> 
              <mi>
                m 
              </mi> 
              <mn>
                1 
              </mn> 
             </msub> 
             <mo>
               + 
             </mo> 
             <msub> 
              <mi>
                m 
              </mi> 
              <mn>
                2 
              </mn> 
             </msub> 
            </mrow> 
           </msup> 
          </mrow> 
          <mo>
            ) 
          </mo> 
         </mrow> 
        </mtd> 
       </mtr> 
      </mtable> 
     </math> (39)</p>
    <p>where it is seen that the BLN entropy depends on all five parameters, in contrast to the BN entropy, which is independent of the two means.</p>
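    <p>The two expressions in Equation (39) can be checked by a simple Monte Carlo estimate of the entropy as the sample mean of the negative log-density. The sketch below is illustrative only and rests on several assumptions that are not from the text: the parameter values (m1, m2, s1, s2, r) are hypothetical, the names u, v denote the logarithms of the original variables, and the lognormal density is obtained from the normal density by dividing by the product of the original variables.</p>
    <preformat><![CDATA[
```python
import math
import random


def entropy_estimates(m1, m2, s1, s2, r, n_samples=200_000, seed=0):
    """Monte Carlo estimates of (S_BN, S_BLN) as sample means of -ln(density)."""
    rng = random.Random(seed)
    root = math.sqrt(1.0 - r * r)
    log_norm = math.log(2.0 * math.pi * s1 * s2 * root)
    sum_bn = 0.0
    sum_bln = 0.0
    for _ in range(n_samples):
        z1 = rng.gauss(0.0, 1.0)
        z2 = rng.gauss(0.0, 1.0)
        a = z1                    # standardized log-variable (u - m1) / s1
        b = r * z1 + root * z2    # standardized log-variable (v - m2) / s2, correlation r
        u = m1 + s1 * a
        v = m2 + s2 * b
        quad = (a * a - 2.0 * r * a * b + b * b) / (2.0 * (1.0 - r * r))
        neg_log_p_normal = log_norm + quad
        sum_bn += neg_log_p_normal
        # Lognormal density = normal density / (x * y), and ln(x * y) = u + v.
        sum_bln += neg_log_p_normal + u + v
    return sum_bn / n_samples, sum_bln / n_samples


def closed_form(m1, m2, s1, s2, r):
    """The two expressions of Equation (39)."""
    s_bn = math.log(2.0 * math.pi * math.e * s1 * s2 * math.sqrt(1.0 - r * r))
    return s_bn, s_bn + m1 + m2


if __name__ == "__main__":
    params = (5.17, 4.45, 0.04, 0.16, 0.5)  # hypothetical values, not fitted to any data
    print("Monte Carlo:", entropy_estimates(*params))
    print("Equation (39):", closed_form(*params))
```
]]></preformat>
    <p>In this sketch the two estimates differ by the sample mean of u + v, which tends to m1 + m2; this is the extra term that appears in the BLN entropy.</p>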
    <p>Although both expressions in Equation (39) are correct, one can understand the reason for this curious discrepancy by examining the general relation between two entropy expressions connected by a coordinate transformation. For illustrative purposes, consider a coordinate transformation (which can be multidimensional) 
     <math xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <mstyle mathvariant="bold" mathsize="normal"> 
        <mi>
          x 
        </mi> 
       </mstyle> 
       <mo>
         → 
       </mo> 
       <mstyle mathvariant="bold" mathsize="normal"> 
        <mi>
          y 
        </mi> 
       </mstyle> 
       <mrow> 
        <mo>
          ( 
        </mo> 
        <mstyle mathvariant="bold" mathsize="normal"> 
         <mi>
           x 
         </mi> 
        </mstyle> 
        <mo>
          ) 
        </mo> 
       </mrow> 
      </mrow> 
     </math> leading to the two entropy expressions 
     <math xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <msub> 
        <mi>
          S 
        </mi> 
        <mstyle mathvariant="bold" mathsize="normal"> 
         <mi>
           x 
         </mi> 
        </mstyle> 
       </msub> 
      </mrow> 
     </math>, 
     <math xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <msub> 
        <mi>
          S 
        </mi> 
        <mstyle mathvariant="bold" mathsize="normal"> 
         <mi>
           y 
         </mi> 
        </mstyle> 
       </msub> 
      </mrow> 
     </math> defined by</p>
    <p>
     <math display="inline" xmlns="http://www.w3.org/1998/Math/MathML"> <mtable columnalign="left"> 
       <mtr> 
        <mtd> 
         <msub> 
          <mi>
            S 
          </mi> 
          <mstyle mathvariant="bold" mathsize="normal"> 
           <mi>
             x 
           </mi> 
          </mstyle> 
         </msub> 
         <mo>
           = 
         </mo> 
         <mo>
           − 
         </mo> 
         <mstyle displaystyle="true"> 
          <mrow> 
           <mo>
             ∫ 
           </mo> 
           <mrow> 
            <msub> 
             <mi>
               p 
             </mi> 
             <mstyle mathvariant="bold" mathsize="normal"> 
              <mi>
                x 
              </mi> 
             </mstyle> 
            </msub> 
            <mrow> 
             <mo>
               ( 
             </mo> 
             <mstyle mathvariant="bold" mathsize="normal"> 
              <mi>
                x 
              </mi> 
             </mstyle> 
             <mo>
               ) 
             </mo> 
            </mrow> 
            <mi>
              ln 
            </mi> 
            <mrow> 
             <mo>
               ( 
             </mo> 
             <mrow> 
              <msub> 
               <mi>
                 p 
               </mi> 
               <mstyle mathvariant="bold" mathsize="normal"> 
                <mi>
                  x 
                </mi> 
               </mstyle> 
              </msub> 
              <mrow> 
               <mo>
                 ( 
               </mo> 
               <mstyle mathvariant="bold" mathsize="normal"> 
                <mi>
                  x 
                </mi> 
               </mstyle> 
               <mo>
                 ) 
               </mo> 
              </mrow> 
             </mrow> 
             <mo>
               ) 
             </mo> 
            </mrow> 
            <mtext>
              d 
            </mtext> 
            <mstyle mathvariant="bold" mathsize="normal"> 
             <mi>
               x 
             </mi> 
            </mstyle> 
           </mrow> 
          </mrow> 
         </mstyle> 
        </mtd> 
       </mtr> 
       <mtr> 
        <mtd> 
         <msub> 
          <mi>
            S 
          </mi> 
          <mstyle mathvariant="bold" mathsize="normal"> 
           <mi>
             y 
           </mi> 
          </mstyle> 
         </msub> 
         <mo>
           = 
         </mo> 
         <mo>
           − 
         </mo> 
         <mstyle displaystyle="true"> 
          <mrow> 
           <mo>
             ∫ 
           </mo> 
           <mrow> 
            <msub> 
             <mi>
               p 
             </mi> 
             <mstyle mathvariant="bold" mathsize="normal"> 
              <mi>
                y 
              </mi> 
             </mstyle> 
            </msub> 
            <mrow> 
             <mo>
               ( 
             </mo> 
             <mstyle mathvariant="bold" mathsize="normal"> 
              <mi>
                y 
              </mi> 
             </mstyle> 
             <mo>
               ) 
             </mo> 
            </mrow> 
            <mi>
              ln 
            </mi> 
            <mrow> 
             <mo>
               ( 
             </mo> 
             <mrow> 
              <msub> 
               <mi>
                 p 
               </mi> 
               <mstyle mathvariant="bold" mathsize="normal"> 
                <mi>
                  y 
                </mi> 
               </mstyle> 
              </msub> 
              <mrow> 
               <mo>
                 ( 
               </mo> 
               <mstyle mathvariant="bold" mathsize="normal"> 
                <mi>
                  y 
                </mi> 
               </mstyle> 
               <mo>
                 ) 
               </mo> 
              </mrow> 
             </mrow> 
             <mo>
               ) 
             </mo> 
            </mrow> 
            <mtext>
              d 
            </mtext> 
            <mstyle mathvariant="bold" mathsize="normal"> 
             <mi>
               y 
             </mi> 
            </mstyle> 
           </mrow> 
          </mrow> 
         </mstyle> 
        </mtd> 
       </mtr> 
      </mtable> 
     </math> (40)</p>
    <p>where the probability densities satisfy</p>
    <p>
     <math xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <msub> 
        <mi>
          p 
        </mi> 
        <mstyle mathvariant="bold" mathsize="normal"> 
         <mi>
           x 
         </mi> 
        </mstyle> 
       </msub> 
       <mrow> 
        <mo>
          ( 
        </mo> 
        <mstyle mathvariant="bold" mathsize="normal"> 
         <mi>
           x 
         </mi> 
        </mstyle> 
        <mo>
          ) 
        </mo> 
       </mrow> 
       <mtext>
         d 
       </mtext> 
       <mstyle mathvariant="bold" mathsize="normal"> 
        <mi>
          x 
        </mi> 
       </mstyle> 
       <mo>
         = 
       </mo> 
       <msub> 
        <mi>
          p 
        </mi> 
        <mstyle mathvariant="bold" mathsize="normal"> 
         <mi>
           y 
         </mi> 
        </mstyle> 
       </msub> 
       <mrow> 
        <mo>
          ( 
        </mo> 
        <mstyle mathvariant="bold" mathsize="normal"> 
         <mi>
           y 
         </mi> 
        </mstyle> 
        <mo>
          ) 
        </mo> 
       </mrow> 
       <mtext>
         d 
       </mtext> 
       <mstyle mathvariant="bold" mathsize="normal"> 
        <mi>
          y 
        </mi> 
       </mstyle> 
      </mrow> 
     </math> (41)</p>
    <p>and are therefore connected by the transformation</p>
    <p>
     <math xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <msub> 
        <mi>
          p 
        </mi> 
        <mstyle mathvariant="bold" mathsize="normal"> 
         <mi>
           y 
         </mi> 
        </mstyle> 
       </msub> 
       <mrow> 
        <mo>
          ( 
        </mo> 
        <mrow> 
         <mstyle mathvariant="bold" mathsize="normal"> 
          <mi>
            y 
          </mi> 
         </mstyle> 
         <mrow> 
          <mo>
            ( 
          </mo> 
          <mstyle mathvariant="bold" mathsize="normal"> 
           <mi>
             x 
           </mi> 
          </mstyle> 
          <mo>
            ) 
          </mo> 
         </mrow> 
        </mrow> 
        <mo>
          ) 
        </mo> 
       </mrow> 
       <mtext>
           
       </mtext> 
       <mtext>
           
       </mtext> 
       <mo>
         = 
       </mo> 
       <mtext>
           
       </mtext> 
       <mtext>
           
       </mtext> 
       <msub> 
        <mi>
          p 
        </mi> 
        <mstyle mathvariant="bold" mathsize="normal"> 
         <mi>
           x 
         </mi> 
        </mstyle> 
       </msub> 
       <mrow> 
        <mo>
          ( 
        </mo> 
        <mstyle mathvariant="bold" mathsize="normal"> 
         <mi>
           x 
         </mi> 
        </mstyle> 
        <mo>
          ) 
        </mo> 
       </mrow> 
       <mrow> 
        <mo>
          | 
        </mo> 
        <mrow> 
         <mfrac> 
          <mrow> 
           <mo>
             ∂ 
           </mo> 
           <mstyle mathvariant="bold" mathsize="normal"> 
            <mi>
              x 
            </mi> 
           </mstyle> 
          </mrow> 
          <mrow> 
           <mo>
             ∂ 
           </mo> 
           <mstyle mathvariant="bold" mathsize="normal"> 
            <mi>
              y 
            </mi> 
           </mstyle> 
          </mrow> 
         </mfrac> 
        </mrow> 
        <mo>
          | 
        </mo> 
       </mrow> 
       <mtext>
           
       </mtext> 
       <mtext>
           
       </mtext> 
       <mo>
         = 
       </mo> 
       <mtext>
           
       </mtext> 
       <mtext>
           
       </mtext> 
       <mi>
         J 
       </mi> 
       <mtext>
           
       </mtext> 
       <msub> 
        <mi>
          p 
        </mi> 
        <mstyle mathvariant="bold" mathsize="normal"> 
         <mi>
           x 
         </mi> 
        </mstyle> 
       </msub> 
       <mrow> 
        <mo>
          ( 
        </mo> 
        <mstyle mathvariant="bold" mathsize="normal"> 
         <mi>
           x 
         </mi> 
        </mstyle> 
        <mo>
          ) 
        </mo> 
       </mrow> 
      </mrow> 
     </math> (42)</p>
    <p>with Jacobian 
     <math xmlns="http://www.w3.org/1998/Math/MathML"> <mi>
        J 
      </mi> 
     </math>. Substitution of Equation (42) into Equation (40) leads to the relation</p>
    <p>
     <math display="inline" xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <msub> 
        <mi>
          S 
        </mi> 
        <mstyle mathvariant="bold" mathsize="normal"> 
         <mi>
           y 
         </mi> 
        </mstyle> 
       </msub> 
       <mo>
         = 
       </mo> 
       <msub> 
        <mi>
          S 
        </mi> 
        <mstyle mathvariant="bold" mathsize="normal"> 
         <mi>
           x 
         </mi> 
        </mstyle> 
       </msub> 
       <mo>
         − 
       </mo> 
       <mstyle displaystyle="true"> 
        <mrow> 
         <mo>
           ∫ 
         </mo> 
         <mrow> 
          <msub> 
           <mi>
             p 
           </mi> 
           <mstyle mathvariant="bold" mathsize="normal"> 
            <mi>
              x 
            </mi> 
           </mstyle> 
          </msub> 
          <mrow> 
           <mo>
             ( 
           </mo> 
           <mstyle mathvariant="bold" mathsize="normal"> 
            <mi>
              x 
            </mi> 
           </mstyle> 
           <mo>
             ) 
           </mo> 
          </mrow> 
          <mi>
            ln 
          </mi> 
          <mrow> 
           <mo>
             ( 
           </mo> 
           <mrow> 
            <mi>
              J 
            </mi> 
            <mrow> 
             <mo>
               ( 
             </mo> 
             <mstyle mathvariant="bold" mathsize="normal"> 
              <mi>
                x 
              </mi> 
             </mstyle> 
             <mo>
               ) 
             </mo> 
            </mrow> 
           </mrow> 
           <mo>
             ) 
           </mo> 
          </mrow> 
          <mtext>
            d 
          </mtext> 
          <mstyle mathvariant="bold" mathsize="normal"> 
           <mi>
             x 
           </mi> 
          </mstyle> 
         </mrow> 
        </mrow> 
       </mstyle> 
      </mrow> 
     </math> (43)</p>
    <p>or more succinctly,</p>
    <p>
     <math display="inline" xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <msub> 
        <mi>
          S 
        </mi> 
        <mstyle mathvariant="bold" mathsize="normal"> 
         <mi>
           y 
         </mi> 
        </mstyle> 
       </msub> 
       <mo>
         = 
       </mo> 
       <msub> 
        <mi>
          S 
        </mi> 
        <mstyle mathvariant="bold" mathsize="normal"> 
         <mi>
           x 
         </mi> 
        </mstyle> 
       </msub> 
       <mo>
         − 
       </mo> 
       <mrow> 
        <mo>
          〈 
        </mo> 
        <mrow> 
         <mi>
           ln 
         </mi> 
         <mrow> 
          <mo>
            ( 
          </mo> 
          <mi>
            J 
          </mi> 
          <mo>
            ) 
          </mo> 
         </mrow> 
        </mrow> 
        <mo>
          〉 
        </mo> 
       </mrow> 
      </mrow> 
     </math>. (44)</p>
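    <p>As an illustrative check of Equation (44), not taken from the analysis above, consider the special case of a linear map y = Ax applied to a bivariate normal variable x. The Jacobian J = 1/|det A| is then constant, and both entropies are known in closed form. The following Python sketch assumes NumPy is available; the covariance matrix cov_x and the matrix A are hypothetical values chosen only for illustration.</p>
    <preformat preformat-type="code">
```python
import numpy as np


def gaussian_entropy(cov):
    """Differential entropy (in nats) of a multivariate normal with covariance cov."""
    cov = np.atleast_2d(cov)
    n = cov.shape[0]
    _, logdet = np.linalg.slogdet(cov)
    return 0.5 * (n * np.log(2.0 * np.pi * np.e) + logdet)


# Hypothetical covariance of x and hypothetical invertible linear map y = A x
cov_x = np.array([[1.0, 0.6], [0.6, 2.0]])
A = np.array([[2.0, 0.5], [0.0, 3.0]])

# y = A x is again normal, with covariance A cov_x A^T
cov_y = A @ cov_x @ A.T

# J = |det(dx/dy)| = 1/|det A| is constant, so the mean of ln J equals ln J
ln_J = -np.log(abs(np.linalg.det(A)))

S_x = gaussian_entropy(cov_x)
S_y_direct = gaussian_entropy(cov_y)  # entropy computed directly in y
S_y_eq44 = S_x - ln_J                 # entropy predicted by Equation (44)

print(S_y_direct, S_y_eq44)
```
    </preformat>
    <p>In this linear case the two printed values should agree to rounding error, because the expectation of ln J reduces to the constant −ln|det A|.</p>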
    <p>Applied to a bivariate coordinate transformation 
     <math xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <mrow> 
        <mo>
          ( 
        </mo> 
        <mrow> 
         <msub> 
          <mi>
            x 
          </mi> 
          <mn>
            1 
          </mn> 
         </msub> 
         <mo>
           , 
         </mo> 
         <msub> 
          <mi>
            x 
          </mi> 
          <mn>
            2 
          </mn> 
         </msub> 
        </mrow> 
        <mo>
          ) 
        </mo> 
       </mrow> 
       <mo>
         → 
       </mo> 
       <mrow> 
        <mo>
          ( 
        </mo> 
        <mrow> 
         <msub> 
          <mi>
            y 
          </mi> 
          <mn>
            1 
          </mn> 
         </msub> 
         <mo>
           , 
         </mo> 
         <msub> 
          <mi>
            y 
          </mi> 
          <mn>
            2 
          </mn> 
         </msub> 
        </mrow> 
        <mo>
          ) 
        </mo> 
       </mrow> 
      </mrow> 
     </math> as a concrete example, Equations (40), (42), and (43) reduce to the explicit form</p>
    <p>
     <math display="inline" xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <msub> 
        <mi>
          S 
        </mi> 
        <mstyle mathvariant="bold" mathsize="normal"> 
         <mi>
           y 
         </mi> 
        </mstyle> 
       </msub> 
       <mo>
         = 
       </mo> 
       <mtext>
           
       </mtext> 
       <mtext>
           
       </mtext> 
       <mo>
         − 
       </mo> 
       <mstyle displaystyle="true"> 
        <mrow> 
         <mo>
           ∫ 
         </mo> 
         <mrow> 
          <msub> 
           <mi>
             p 
           </mi> 
           <mstyle mathvariant="bold" mathsize="normal"> 
            <mi>
              x 
            </mi> 
           </mstyle> 
          </msub> 
          <mrow> 
           <mo>
             ( 
           </mo> 
           <mrow> 
            <msub> 
             <mi>
               x 
             </mi> 
             <mn>
               1 
             </mn> 
            </msub> 
            <mo>
              , 
            </mo> 
            <msub> 
             <mi>
               x 
             </mi> 
             <mn>
               2 
             </mn> 
            </msub> 
           </mrow> 
           <mo>
             ) 
           </mo> 
          </mrow> 
          <mi>
            ln 
          </mi> 
          <mrow> 
           <mo>
             ( 
           </mo> 
           <mrow> 
            <mfrac> 
             <mrow> 
              <msub> 
               <mi>
                 p 
               </mi> 
               <mstyle mathvariant="bold" mathsize="normal"> 
                <mi>
                  x 
                </mi> 
               </mstyle> 
              </msub> 
              <mrow> 
               <mo>
                 ( 
               </mo> 
               <mrow> 
                <msub> 
                 <mi>
                   x 
                 </mi> 
                 <mn>
                   1 
                 </mn> 
                </msub> 
                <mo>
                  , 
                </mo> 
                <msub> 
                 <mi>
                   x 
                 </mi> 
                 <mn>
                   2 
                 </mn> 
                </msub> 
               </mrow> 
               <mo>
                 ) 
               </mo> 
              </mrow> 
              <mtext>
                d 
              </mtext> 
              <msub> 
               <mi>
                 x 
               </mi> 
               <mn>
                 1 
               </mn> 
              </msub> 
              <mtext>
                d 
              </mtext> 
              <msub> 
               <mi>
                 x 
               </mi> 
               <mn>
                 2 
               </mn> 
              </msub> 
             </mrow> 
             <mrow> 
              <mtext>
                d 
              </mtext> 
              <msub> 
               <mi>
                 y 
               </mi> 
               <mn>
                 1 
               </mn> 
              </msub> 
              <mtext>
                d 
              </mtext> 
              <msub> 
               <mi>
                 y 
               </mi> 
               <mn>
                 2 
               </mn> 
              </msub> 
             </mrow> 
            </mfrac> 
           </mrow> 
           <mo>
             ) 
           </mo> 
          </mrow> 
          <mtext>
            d 
          </mtext> 
          <msub> 
           <mi>
             x 
           </mi> 
           <mn>
             1 
           </mn> 
          </msub> 
          <mtext>
            d 
          </mtext> 
          <msub> 
           <mi>
             x 
           </mi> 
           <mn>
             2 
           </mn> 
          </msub> 
         </mrow> 
        </mrow> 
       </mstyle> 
      </mrow> 
     </math> (45)</p>
    <p>or</p>
    <p>
     <math display="inline" xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <msub> 
        <mi>
          S 
        </mi> 
        <mstyle mathvariant="bold" mathsize="normal"> 
         <mi>
           y 
         </mi> 
        </mstyle> 
       </msub> 
       <mo>
         − 
       </mo> 
       <msub> 
        <mi>
          S 
        </mi> 
        <mstyle mathvariant="bold" mathsize="normal"> 
         <mi>
           x 
         </mi> 
        </mstyle> 
       </msub> 
       <mtext>
           
       </mtext> 
       <mtext>
           
       </mtext> 
       <mo>
         = 
       </mo> 
       <mtext>
           
       </mtext> 
       <mtext>
           
       </mtext> 
       <mo>
         − 
       </mo> 
       <mstyle displaystyle="true"> 
        <mrow> 
         <mo>
           ∫ 
         </mo> 
         <mrow> 
          <msub> 
           <mi>
             p 
           </mi> 
           <mstyle mathvariant="bold" mathsize="normal"> 
            <mi>
              x 
            </mi> 
           </mstyle> 
          </msub> 
          <mrow> 
           <mo>
             ( 
           </mo> 
           <mrow> 
            <msub> 
             <mi>
               x 
             </mi> 
             <mn>
               1 
             </mn> 
            </msub> 
            <mo>
              , 
            </mo> 
            <msub> 
             <mi>
               x 
             </mi> 
             <mn>
               2 
             </mn> 
            </msub> 
           </mrow> 
           <mo>
             ) 
           </mo> 
          </mrow> 
          <mi>
            ln 
          </mi> 
          <mrow> 
           <mo>
             ( 
           </mo> 
           <mrow> 
            <mfrac> 
             <mrow> 
              <mtext>
                d 
              </mtext> 
              <msub> 
               <mi>
                 x 
               </mi> 
               <mn>
                 1 
               </mn> 
              </msub> 
              <mtext>
                d 
              </mtext> 
              <msub> 
               <mi>
                 x 
               </mi> 
               <mn>
                 2 
               </mn> 
              </msub> 
             </mrow> 
             <mrow> 
              <mtext>
                d 
              </mtext> 
              <msub> 
               <mi>
                 y 
               </mi> 
               <mn>
                 1 
               </mn> 
              </msub> 
              <mtext>
                d 
              </mtext> 
              <msub> 
               <mi>
                 y 
               </mi> 
               <mn>
                 2 
               </mn> 
              </msub> 
             </mrow> 
            </mfrac> 
           </mrow> 
           <mo>
             ) 
           </mo> 
          </mrow> 
          <mtext>
            d 
          </mtext> 
          <msub> 
           <mi>
             x 
           </mi> 
           <mn>
             1 
           </mn> 
          </msub> 
          <mtext>
            d 
          </mtext> 
          <msub> 
           <mi>
             x 
           </mi> 
           <mn>
             2 
           </mn> 
          </msub> 
         </mrow> 
        </mrow> 
       </mstyle> 
      </mrow> 
     </math>. (46)</p>
    <p>Apart from a minus sign, the right side of Equation (46) takes the form of a Kullback-Leibler (K-L) divergence <xref ref-type="bibr" rid="scirp.144605-22">
      [22]
     </xref> <xref ref-type="bibr" rid="scirp.144605-23">
      [23]
     </xref>, which quantifies the difference between two probability distributions. It is thus seen that the difference in entropies in Equation (46) is attributable to different prior distributions of the two sets of volume elements. 
     <math xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <msub> 
        <mi>
          S 
        </mi> 
        <mstyle mathvariant="bold" mathsize="normal"> 
         <mi>
           x 
         </mi> 
        </mstyle> 
       </msub> 
      </mrow> 
     </math> is calculated on the presumption that each volume element 
     <math xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <mtext>
         d 
       </mtext> 
       <mstyle mathvariant="bold" mathsize="normal"> 
        <mi>
          x 
        </mi> 
       </mstyle> 
       <mo>
         ≡ 
       </mo> 
       <mtext>
         d 
       </mtext> 
       <msub> 
        <mi>
          x 
        </mi> 
        <mn>
          1 
        </mn> 
       </msub> 
       <mtext>
         d 
       </mtext> 
       <msub> 
        <mi>
          x 
        </mi> 
        <mn>
          2 
        </mn> 
       </msub> 
       <mo>
         ⋯ 
       </mo> 
       <mtext>
         d 
       </mtext> 
       <msub> 
        <mi>
          x 
        </mi> 
        <mi>
          n 
        </mi> 
       </msub> 
      </mrow> 
     </math> (of an n-dimensional system) is equally likely, whereas 
     <math xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <msub> 
        <mi>
          S 
        </mi> 
        <mstyle mathvariant="bold" mathsize="normal"> 
         <mi>
           y 
         </mi> 
        </mstyle> 
       </msub> 
      </mrow> 
     </math> is calculated presuming each volume element 
     <math xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <mtext>
         d 
       </mtext> 
       <mstyle mathvariant="bold" mathsize="normal"> 
        <mi>
          y 
        </mi> 
       </mstyle> 
       <mo>
         ≡ 
       </mo> 
       <mtext>
         d 
       </mtext> 
       <msub> 
        <mi>
          y 
        </mi> 
        <mn>
          1 
        </mn> 
       </msub> 
       <mtext>
         d 
       </mtext> 
       <msub> 
        <mi>
          y 
        </mi> 
        <mn>
          2 
        </mn> 
       </msub> 
       <mo>
         ⋯ 
       </mo> 
       <mtext>
         d 
       </mtext> 
       <msub> 
        <mi>
          y 
        </mi> 
        <mi>
          n 
        </mi> 
       </msub> 
      </mrow> 
     </math> is equally likely. However, since 
     <math xmlns="http://www.w3.org/1998/Math/MathML"> <mstyle mathvariant="bold" mathsize="normal"> 
       <mi>
         y 
       </mi> 
      </mstyle> 
     </math> is a function of 
     <math xmlns="http://www.w3.org/1998/Math/MathML"> <mstyle mathvariant="bold" mathsize="normal"> 
       <mi>
         x 
       </mi> 
      </mstyle> 
     </math>, the volume elements of the two systems of coordinates cannot in general both be uniformly distributed. Hence the coordinate transformation 
     <math xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <mstyle mathvariant="bold" mathsize="normal"> 
        <mi>
          x 
        </mi> 
       </mstyle> 
       <mo>
         → 
       </mo> 
       <mstyle mathvariant="bold" mathsize="normal"> 
        <mi>
          y 
        </mi> 
       </mstyle> 
       <mrow> 
        <mo>
          ( 
        </mo> 
        <mstyle mathvariant="bold" mathsize="normal"> 
         <mi>
           x 
         </mi> 
        </mstyle> 
        <mo>
          ) 
        </mo> 
       </mrow> 
      </mrow> 
     </math> leads to a difference in entropies.</p>
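    <p>A one-dimensional illustration, offered here only as a sketch under stated assumptions, makes the point concrete. Suppose x is uniformly distributed on the interval [1, 2] (a hypothetical choice), so that its entropy is zero, and let y = ln(x). The density of y is then exp(y) on [0, ln 2], which is not uniform, and Equation (43) with J = |dx/dy| = x predicts an entropy of 1 − 2 ln 2, approximately −0.386. The Python code below, which assumes NumPy, evaluates both sides by numerical quadrature.</p>
    <preformat preformat-type="code">
```python
import numpy as np


def integrate(f_vals, grid):
    """Trapezoid rule on a one-dimensional grid."""
    return float(np.sum(0.5 * (f_vals[1:] + f_vals[:-1]) * np.diff(grid)))


# x uniform on [1, 2]: p_x = 1, so S_x = 0
x = np.linspace(1.0, 2.0, 200001)
p_x = np.ones_like(x)
S_x = -integrate(p_x * np.log(p_x), x)

# y = ln(x) on [0, ln 2]; p_y(y) = p_x(x) |dx/dy| = exp(y), which is not uniform
y = np.linspace(0.0, np.log(2.0), 200001)
p_y = np.exp(y)
S_y_direct = -integrate(p_y * np.log(p_y), y)

# Equation (43): S_y = S_x - integral of p_x ln(J) dx, with J = |dx/dy| = x
S_y_eq43 = S_x - integrate(p_x * np.log(x), x)

# Closed-form value of the same integral
S_y_exact = 1.0 - 2.0 * np.log(2.0)

print(S_y_direct, S_y_eq43, S_y_exact)
```
    </preformat>
    <p>The direct integral over y and the Jacobian-corrected integral over x should coincide, which shows that the entropy shift arises entirely from the non-uniform mapping of volume elements.</p>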
    <p>The fact that entropy 
     <math xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <msub> 
        <mi>
          S 
        </mi> 
        <mstyle mathvariant="bold" mathsize="normal"> 
         <mi>
           y 
         </mi> 
        </mstyle> 
       </msub> 
      </mrow> 
     </math> in Equation (43) or (44) differs in value from entropy 
     <math xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <msub> 
        <mi>
          S 
        </mi> 
        <mstyle mathvariant="bold" mathsize="normal"> 
         <mi>
           x 
         </mi> 
        </mstyle> 
       </msub> 
      </mrow> 
     </math> does not mean that variations of the corresponding entropy functionals 
     <math xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <msub> 
        <mi>
          H 
        </mi> 
        <mi>
          x 
        </mi> 
       </msub> 
      </mrow> 
     </math>, 
     <math xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <msub> 
        <mi>
          H 
        </mi> 
        <mi>
          y 
        </mi> 
       </msub> 
      </mrow> 
     </math>—from which the maximum entropy PDFs are obtained—are necessarily different. Consider the logarithmic transformation 
     <math xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <mrow> 
        <mo>
          ( 
        </mo> 
        <mrow> 
         <mi>
           X 
         </mi> 
         <mo>
           = 
         </mo> 
         <mi>
           ln 
         </mi> 
         <mrow> 
          <mo>
            ( 
          </mo> 
          <mi>
            H 
          </mi> 
          <mo>
            ) 
          </mo> 
         </mrow> 
         <mo>
           , 
         </mo> 
         <mi>
           Y 
         </mi> 
         <mo>
           = 
         </mo> 
         <mi>
           ln 
         </mi> 
         <mrow> 
          <mo>
            ( 
          </mo> 
          <mi>
            W 
          </mi> 
          <mo>
            ) 
          </mo> 
         </mrow> 
        </mrow> 
        <mo>
          ) 
        </mo> 
       </mrow> 
      </mrow> 
     </math> of the variables 
     <math xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <mrow> 
        <mo>
          ( 
        </mo> 
        <mrow> 
         <mi>
           H 
         </mi> 
         <mo>
           , 
         </mo> 
         <mi>
           W 
         </mi> 
        </mrow> 
        <mo>
          ) 
        </mo> 
       </mrow> 
      </mrow> 
      </math> and the inverse transformation 
     <math xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <mrow> 
        <mo>
          ( 
        </mo> 
        <mrow> 
         <mi>
           H 
         </mi> 
         <mo>
           = 
         </mo> 
         <msup> 
          <mtext>
            e 
          </mtext> 
          <mi>
            X 
          </mi> 
         </msup> 
         <mo>
           , 
         </mo> 
         <mi>
           W 
         </mi> 
         <mo>
           = 
         </mo> 
         <msup> 
          <mtext>
            e 
          </mtext> 
          <mi>
            Y 
          </mi> 
         </msup> 
        </mrow> 
        <mo>
          ) 
        </mo> 
       </mrow> 
      </mrow> 
     </math>. The Jacobian of the transformation is</p>
    <p>
     <math xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <mi>
         J 
       </mi> 
       <mo>
         = 
       </mo> 
       <mrow> 
        <mo>
          | 
        </mo> 
        <mrow> 
         <mfrac> 
          <mrow> 
           <mo>
             ∂ 
           </mo> 
           <mrow> 
            <mo>
              ( 
            </mo> 
            <mrow> 
             <mi>
               X 
             </mi> 
             <mo>
               , 
             </mo> 
             <mi>
               Y 
             </mi> 
            </mrow> 
            <mo>
              ) 
            </mo> 
           </mrow> 
          </mrow> 
          <mrow> 
           <mo>
             ∂ 
           </mo> 
           <mrow> 
            <mo>
              ( 
            </mo> 
            <mrow> 
             <mi>
               H 
             </mi> 
             <mo>
               , 
             </mo> 
             <mi>
               W 
             </mi> 
            </mrow> 
            <mo>
              ) 
            </mo> 
           </mrow> 
          </mrow> 
         </mfrac> 
        </mrow> 
        <mo>
          | 
        </mo> 
       </mrow> 
       <mo>
         = 
       </mo> 
       <mfrac> 
        <mn>
          1 
        </mn> 
        <mrow> 
         <mi>
           H 
         </mi> 
         <mi>
           W 
         </mi> 
        </mrow> 
       </mfrac> 
       <mo>
         = 
       </mo> 
       <mi>
         exp 
       </mi> 
       <mrow> 
        <mo>
          ( 
        </mo> 
        <mrow> 
         <mo>
           − 
         </mo> 
         <mrow> 
          <mo>
            ( 
          </mo> 
          <mrow> 
           <mi>
             X 
           </mi> 
           <mo>
             + 
           </mo> 
           <mi>
             Y 
           </mi> 
          </mrow> 
          <mo>
            ) 
          </mo> 
         </mrow> 
        </mrow> 
        <mo>
          ) 
        </mo> 
       </mrow> 
      </mrow> 
     </math>. (47)</p>
    <p>Substitution of Equation (47) into Equation (44) leads to the relation</p>
    <p>
     <math display="inline" xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <msub> 
        <mi>
          S 
        </mi> 
        <mrow> 
         <mi>
           H 
         </mi> 
         <mo>
           , 
         </mo> 
         <mi>
           W 
         </mi> 
        </mrow> 
       </msub> 
       <mo>
         = 
       </mo> 
       <mtext>
           
       </mtext> 
       <mtext>
           
       </mtext> 
       <msub> 
        <mi>
          S 
        </mi> 
        <mrow> 
         <mi>
           X 
         </mi> 
         <mo>
           , 
         </mo> 
         <mi>
           Y 
         </mi> 
        </mrow> 
       </msub> 
       <mo>
         − 
       </mo> 
       <mrow> 
        <mo>
          〈 
        </mo> 
        <mrow> 
         <mi>
           ln 
         </mi> 
         <mrow> 
          <mo>
            ( 
          </mo> 
          <mi>
            J 
          </mi> 
          <mo>
            ) 
          </mo> 
         </mrow> 
        </mrow> 
        <mo>
          〉 
        </mo> 
       </mrow> 
       <mtext>
           
       </mtext> 
       <mtext>
           
       </mtext> 
       <mo>
         = 
       </mo> 
       <mtext>
           
       </mtext> 
       <mtext>
           
       </mtext> 
       <msub> 
        <mi>
          S 
        </mi> 
        <mrow> 
         <mi>
           X 
         </mi> 
         <mo>
           , 
         </mo> 
         <mi>
           Y 
         </mi> 
        </mrow> 
       </msub> 
        <mo>
          + 
        </mo> 
        <mrow> 
         <mo>
           〈 
         </mo> 
         <mi>
           X 
         </mi> 
         <mo>
           〉 
         </mo> 
        </mrow> 
        <mo>
          + 
        </mo> 
       <mrow> 
        <mo>
          〈 
        </mo> 
        <mi>
          Y 
        </mi> 
        <mo>
          〉 
        </mo> 
       </mrow> 
      </mrow> 
     </math>. (48)</p>
    <p>Recall that the two expectation values on the right side of Equation (48) are respectively 
     <math xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <msub> 
        <mi>
          m 
        </mi> 
        <mn>
          1 
        </mn> 
       </msub> 
      </mrow> 
     </math> and 
     <math xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <msub> 
        <mi>
          m 
        </mi> 
        <mn>
          2 
        </mn> 
       </msub> 
      </mrow> 
     </math>, which are part of the prior information, Equations (3) and (5). Given the entropy functional 
     <math xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <msub> 
        <mi>
          H 
        </mi> 
        <mrow> 
         <mi>
           X 
         </mi> 
         <mo>
           , 
         </mo> 
         <mi>
           Y 
         </mi> 
        </mrow> 
       </msub> 
      </mrow> 
     </math> of Equation (13), the functional 
     <math xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <msub> 
        <mi>
          H 
        </mi> 
        <mrow> 
         <mi>
           H 
         </mi> 
         <mo>
           , 
         </mo> 
         <mi>
           W 
         </mi> 
        </mrow> 
       </msub> 
      </mrow> 
     </math> associated with entropy 
     <math xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <msub> 
        <mi>
          S 
        </mi> 
        <mrow> 
         <mi>
           H 
         </mi> 
         <mo>
           , 
         </mo> 
         <mi>
           W 
         </mi> 
        </mrow> 
       </msub> 
      </mrow> 
     </math> of Equation (48) would then include two additional constraints and take the form</p>
    <p>
     <math display="inline" xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <msub> 
        <mi>
          H 
        </mi> 
        <mrow> 
         <mi>
           H 
         </mi> 
         <mo>
           , 
         </mo> 
         <mi>
           W 
         </mi> 
        </mrow> 
       </msub> 
       <mo>
         = 
       </mo> 
       <msub> 
        <mi>
          H 
        </mi> 
        <mrow> 
         <mi>
           X 
         </mi> 
         <mo>
           , 
         </mo> 
         <mi>
           Y 
         </mi> 
        </mrow> 
       </msub> 
       <mrow> 
        <mo>
          ( 
        </mo> 
        <mrow> 
         <msub> 
          <mi>
            p 
          </mi> 
          <mrow> 
           <mi>
             x 
           </mi> 
           <mo>
             , 
           </mo> 
           <mi>
             y 
           </mi> 
          </mrow> 
         </msub> 
        </mrow> 
        <mo>
          ) 
        </mo> 
       </mrow> 
       <mo>
         − 
       </mo> 
       <msub> 
        <mi>
          λ 
        </mi> 
        <mn>
          4 
        </mn> 
       </msub> 
       <mstyle displaystyle="true"> 
        <munder> 
         <mo>
           ∑ 
         </mo> 
         <mrow> 
          <mi>
            x 
          </mi> 
          <mo>
            , 
          </mo> 
          <mi>
            y 
          </mi> 
         </mrow> 
        </munder> 
        <mrow> 
         <mi>
           x 
         </mi> 
         <mtext>
             
         </mtext> 
         <msub> 
          <mi>
            p 
          </mi> 
          <mrow> 
           <mi>
             x 
           </mi> 
           <mo>
             , 
           </mo> 
           <mi>
             y 
           </mi> 
          </mrow> 
         </msub> 
        </mrow> 
       </mstyle> 
       <mo>
         − 
       </mo> 
       <msub> 
        <mi>
          λ 
        </mi> 
        <mn>
          5 
        </mn> 
       </msub> 
       <mstyle displaystyle="true"> 
        <munder> 
         <mo>
           ∑ 
         </mo> 
         <mrow> 
          <mi>
            x 
          </mi> 
          <mo>
            , 
          </mo> 
          <mi>
            y 
          </mi> 
         </mrow> 
        </munder> 
        <mrow> 
         <mi>
           y 
         </mi> 
         <mtext>
             
         </mtext> 
         <msub> 
          <mi>
            p 
          </mi> 
          <mrow> 
           <mi>
             x 
           </mi> 
           <mo>
             , 
           </mo> 
           <mi>
             y 
           </mi> 
          </mrow> 
         </msub> 
        </mrow> 
       </mstyle> 
      </mrow> 
     </math> (49)</p>
    <p>where 
     <math display="inline" xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <msub> 
        <mi>
          H 
        </mi> 
        <mrow> 
         <mi>
           X 
         </mi> 
         <mo>
           , 
         </mo> 
         <mi>
           Y 
         </mi> 
        </mrow> 
       </msub> 
       <mrow> 
        <mo>
          ( 
        </mo> 
        <mrow> 
         <msub> 
          <mi>
            p 
          </mi> 
          <mrow> 
           <mi>
             x 
           </mi> 
           <mo>
             , 
           </mo> 
           <mi>
             y 
           </mi> 
          </mrow> 
         </msub> 
        </mrow> 
        <mo>
          ) 
        </mo> 
       </mrow> 
      </mrow> 
     </math> includes the terms with Lagrange multipliers 
     <math xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <msub> 
        <mi>
          λ 
        </mi> 
        <mi>
          j 
        </mi> 
       </msub> 
      </mrow> 
     </math> 
     <math xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <mrow> 
        <mo>
          ( 
        </mo> 
        <mrow> 
         <mi>
           j 
         </mi> 
         <mo>
           = 
         </mo> 
         <mn>
           0 
         </mn> 
         <mo>
           , 
         </mo> 
         <mn>
           1 
         </mn> 
         <mo>
           , 
         </mo> 
         <mn>
           2 
         </mn> 
         <mo>
           , 
         </mo> 
         <mn>
           3 
         </mn> 
        </mrow> 
        <mo>
          ) 
        </mo> 
       </mrow> 
      </mrow> 
     </math>. However, information on the mean values of X and Y is already included in the constraints on the variances of X and Y in 
     <math display="inline" xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <msub> 
        <mi>
          H 
        </mi> 
        <mrow> 
         <mi>
           X 
         </mi> 
         <mo>
           , 
         </mo> 
         <mi>
           Y 
         </mi> 
        </mrow> 
       </msub> 
       <mrow> 
        <mo>
          ( 
        </mo> 
        <mrow> 
         <msub> 
          <mi>
            p 
          </mi> 
          <mrow> 
           <mi>
             x 
           </mi> 
           <mo>
             , 
           </mo> 
           <mi>
             y 
           </mi> 
          </mrow> 
         </msub> 
        </mrow> 
        <mo>
          ) 
        </mo> 
       </mrow> 
      </mrow> 
     </math>. Thus the two additional terms in Equation (49) constitute redundant information, whereupon the Lagrange multipliers 
     <math xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <msub> 
        <mi>
          λ 
        </mi> 
        <mn>
          4 
        </mn> 
       </msub> 
      </mrow> 
     </math> and 
     <math xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <msub> 
        <mi>
          λ 
        </mi> 
        <mn>
          5 
        </mn> 
       </msub> 
      </mrow> 
     </math> would vanish. It is then apparent that the variation 
     <math xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <mi>
         δ 
       </mi> 
       <msub> 
        <mi>
          H 
        </mi> 
        <mrow> 
         <mi>
           H 
         </mi> 
         <mo>
           , 
         </mo> 
         <mi>
           W 
         </mi> 
        </mrow> 
       </msub> 
       <mo>
         = 
       </mo> 
       <mn>
         0 
       </mn> 
      </mrow> 
     </math> since 
     <math xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <mi>
         δ 
       </mi> 
       <msub> 
        <mi>
          H 
        </mi> 
        <mrow> 
         <mi>
           X 
         </mi> 
         <mo>
           , 
         </mo> 
         <mi>
           Y 
         </mi> 
        </mrow> 
       </msub> 
       <mo>
         = 
       </mo> 
       <mn>
         0 
       </mn> 
      </mrow> 
     </math>, and, as expected, the bivariate lognormal PDF is the maximum entropy solution in coordinates 
     <math xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <mrow> 
        <mo>
          ( 
        </mo> 
        <mrow> 
         <mi>
           h 
         </mi> 
         <mo>
           , 
         </mo> 
         <mi>
           w 
         </mi> 
        </mrow> 
        <mo>
          ) 
        </mo> 
       </mrow> 
      </mrow> 
     </math> given that the bivariate normal PDF is the maximum entropy solution in coordinates 
     <math xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <mrow> 
        <mo>
          ( 
        </mo> 
        <mrow> 
         <mi>
           x 
         </mi> 
         <mo>
           = 
         </mo> 
         <mi>
           ln 
         </mi> 
         <mrow> 
          <mo>
            ( 
          </mo> 
          <mi>
            h 
          </mi> 
          <mo>
            ) 
          </mo> 
         </mrow> 
         <mo>
           , 
         </mo> 
         <mi>
           y 
         </mi> 
         <mo>
           = 
         </mo> 
         <mi>
           ln 
         </mi> 
         <mrow> 
          <mo>
            ( 
          </mo> 
          <mi>
            w 
          </mi> 
          <mo>
            ) 
          </mo> 
         </mrow> 
        </mrow> 
        <mo>
          ) 
        </mo> 
       </mrow> 
      </mrow> 
     </math>.</p>
    <p>As a general procedure consistent with Bayesian principles, Jaynes <xref ref-type="bibr" rid="scirp.144605-17">
      [17]
     </xref> proposed a relative entropy of the form</p>
    <p>
     <math display="inline" xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <msub> 
        <mi>
          S 
        </mi> 
        <mrow> 
         <mtext>
           rel 
         </mtext> 
        </mrow> 
       </msub> 
       <mo>
         = 
       </mo> 
       <mrow> 
        <mo>
          { 
        </mo> 
        <mtable columnalign="left"> 
         <mtr> 
          <mtd> 
           <mo>
             − 
           </mo> 
           <mstyle displaystyle="true"> 
            <mrow> 
             <mo>
               ∫ 
             </mo> 
             <mrow> 
              <mi>
                p 
              </mi> 
              <mrow> 
               <mo>
                 ( 
               </mo> 
               <mstyle mathvariant="bold" mathsize="normal"> 
                <mi>
                  x 
                </mi> 
               </mstyle> 
               <mo>
                 ) 
               </mo> 
              </mrow> 
              <mi>
                ln 
              </mi> 
              <mrow> 
               <mo>
                 ( 
               </mo> 
               <mrow> 
                <mrow> 
                 <mrow> 
                  <mi>
                    p 
                  </mi> 
                  <mrow> 
                   <mo>
                     ( 
                   </mo> 
                   <mstyle mathvariant="bold" mathsize="normal"> 
                    <mi>
                      x 
                    </mi> 
                   </mstyle> 
                   <mo>
                     ) 
                   </mo> 
                  </mrow> 
                 </mrow> 
                 <mo>
                   / 
                 </mo> 
                 <mrow> 
                  <mi>
                    m 
                  </mi> 
                  <mrow> 
                   <mo>
                     ( 
                   </mo> 
                   <mstyle mathvariant="bold" mathsize="normal"> 
                    <mi>
                      x 
                    </mi> 
                   </mstyle> 
                   <mo>
                     ) 
                   </mo> 
                  </mrow> 
                 </mrow> 
                </mrow> 
               </mrow> 
               <mo>
                 ) 
               </mo> 
              </mrow> 
              <mtext>
                d 
              </mtext> 
              <mstyle mathvariant="bold" mathsize="normal"> 
               <mi>
                 x 
               </mi> 
              </mstyle> 
             </mrow> 
            </mrow> 
           </mstyle> 
           <mtext>
               
           </mtext> 
           <mtext>
               
           </mtext> 
           <mtext>
               
           </mtext> 
           <mtext>
               
           </mtext> 
           <mtext>
               
           </mtext> 
           <mtext>
               
           </mtext> 
           <mtext>
               
           </mtext> 
           <mrow> 
            <mo>
              ( 
            </mo> 
            <mrow> 
             <mtext>
               continuous 
             </mtext> 
            </mrow> 
            <mo>
              ) 
            </mo> 
           </mrow> 
          </mtd> 
         </mtr> 
         <mtr> 
          <mtd> 
           <mo>
             − 
           </mo> 
           <mstyle displaystyle="true"> 
            <munder> 
             <mo>
               ∑ 
             </mo> 
             <mstyle mathvariant="bold" mathsize="normal"> 
              <mi>
                x 
              </mi> 
             </mstyle> 
            </munder> 
            <mrow> 
             <msub> 
              <mi>
                p 
              </mi> 
              <mstyle mathvariant="bold" mathsize="normal"> 
               <mi>
                 x 
               </mi> 
              </mstyle> 
             </msub> 
             <mi>
               ln 
             </mi> 
             <mrow> 
              <mo>
                ( 
              </mo> 
              <mrow> 
               <mrow> 
                <mrow> 
                 <msub> 
                  <mi>
                    p 
                  </mi> 
                  <mstyle mathvariant="bold" mathsize="normal"> 
                   <mi>
                     x 
                   </mi> 
                  </mstyle> 
                 </msub> 
                </mrow> 
                <mo>
                  / 
                </mo> 
                <mrow> 
                 <msub> 
                  <mi>
                    m 
                  </mi> 
                  <mstyle mathvariant="bold" mathsize="normal"> 
                   <mi>
                     x 
                   </mi> 
                  </mstyle> 
                 </msub> 
                </mrow> 
               </mrow> 
              </mrow> 
              <mo>
                ) 
              </mo> 
             </mrow> 
            </mrow> 
           </mstyle> 
           <mtext>
               
           </mtext> 
           <mtext>
               
           </mtext> 
           <mtext>
               
           </mtext> 
           <mtext>
               
           </mtext> 
           <mtext>
               
           </mtext> 
           <mtext>
               
           </mtext> 
           <mtext>
               
           </mtext> 
           <mtext>
               
           </mtext> 
           <mtext>
               
           </mtext> 
           <mtext>
               
           </mtext> 
           <mtext>
               
           </mtext> 
           <mtext>
               
           </mtext> 
           <mtext>
               
           </mtext> 
           <mtext>
               
           </mtext> 
           <mtext>
               
           </mtext> 
           <mtext>
               
           </mtext> 
           <mtext>
               
           </mtext> 
           <mtext>
               
           </mtext> 
           <mrow> 
            <mo>
              ( 
            </mo> 
            <mrow> 
             <mtext>
               discrete 
             </mtext> 
            </mrow> 
            <mo>
              ) 
            </mo> 
           </mrow> 
          </mtd> 
         </mtr> 
        </mtable> 
       </mrow> 
      </mrow> 
     </math> (50)</p>
    <p>for either continuous or discrete random variables. This again has the form of a K-L divergence, apart from the overall sign. The background distribution 
     <math xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <mi>
         m 
       </mi> 
       <mrow> 
        <mo>
          ( 
        </mo> 
        <mstyle mathvariant="bold" mathsize="normal"> 
         <mi>
           x 
         </mi> 
        </mstyle> 
        <mo>
          ) 
        </mo> 
       </mrow> 
      </mrow> 
     </math> is a measure that transforms under a change of coordinates in the same way as the probability function 
     <math xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <mi>
         p 
       </mi> 
       <mrow> 
        <mo>
          ( 
        </mo> 
        <mstyle mathvariant="bold" mathsize="normal"> 
         <mi>
           x 
         </mi> 
        </mstyle> 
        <mo>
          ) 
        </mo> 
       </mrow> 
      </mrow> 
     </math>, whereupon expression (50) is invariant under a coordinate change.</p>
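    <p>The invariance claimed for expression (50) can be checked numerically. The following is a minimal Python sketch, not part of the original analysis. It assumes a single variable x = ln(h), an illustrative normal density p(x), and a broad Gaussian measure m(x); all numerical parameters and function names are hypothetical choices made for the illustration. The entropy integrals are evaluated in both coordinates, x and h, with the trapezoidal rule.</p>
    <preformat>
```python
import math

def normal_pdf(x, mu, sigma):
    """Normal probability density with mean mu and standard deviation sigma."""
    return math.exp(-0.5 * ((x - mu) / sigma) ** 2) / (sigma * math.sqrt(2.0 * math.pi))

def integrate(f, a, b, steps=20000):
    """Composite trapezoidal rule on the interval [a, b]."""
    dx = (b - a) / steps
    total = 0.5 * (f(a) + f(b))
    for k in range(1, steps):
        total += f(a + k * dx)
    return total * dx

# Hypothetical parameters, chosen only for illustration
MU, SIGMA = 0.5, 0.1         # p(x): normal density of x = ln(h)
MU_M, SIGMA_M = 0.0, 5.0     # m(x): broad Gaussian measure

def p_x(x):
    return normal_pdf(x, MU, SIGMA)

def m_x(x):
    return normal_pdf(x, MU_M, SIGMA_M)

# Under h = exp(x), density and measure both acquire the Jacobian factor 1/h
def p_h(h):
    return p_x(math.log(h)) / h

def m_h(h):
    return m_x(math.log(h)) / h

def entropy_integrand(p, m=None):
    """Return t -> -p ln(p/m); with m omitted, the integrand is -p ln(p)."""
    def integrand(t):
        pt = p(t)
        if pt == 0.0:
            return 0.0
        ratio = pt if m is None else pt / m(t)
        return -pt * math.log(ratio)
    return integrand

# Matching integration ranges in the two coordinates (8 standard deviations)
X_LO, X_HI = MU - 8.0 * SIGMA, MU + 8.0 * SIGMA
H_LO, H_HI = math.exp(X_LO), math.exp(X_HI)

rel_x = integrate(entropy_integrand(p_x, m_x), X_LO, X_HI)   # expression (50) in x
rel_h = integrate(entropy_integrand(p_h, m_h), H_LO, H_HI)   # expression (50) in h
diff_x = integrate(entropy_integrand(p_x), X_LO, X_HI)       # differential entropy in x
diff_h = integrate(entropy_integrand(p_h), H_LO, H_HI)       # differential entropy in h

print("relative entropy:     x =", rel_x, " h =", rel_h)
print("differential entropy: x =", diff_x, " h =", diff_h)
```
    </preformat>
    <p>Under these assumptions the relative entropy computed in x and in h should agree to within quadrature error, whereas the ordinary differential entropy shifts by the mean of x = ln(h), which is the Jacobian contribution that the measure m cancels.</p>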
    <p>As employed in applications of the PME, the function 
     <math xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <mi>
         m 
       </mi> 
       <mrow> 
        <mo>
          ( 
        </mo> 
        <mstyle mathvariant="bold" mathsize="normal"> 
         <mi>
           x 
         </mi> 
        </mstyle> 
        <mo>
          ) 
        </mo> 
       </mrow> 
      </mrow> 
     </math> can be interpreted as a Bayesian prior, i.e. the distribution representing a state of complete ignorance before any empirical information has been acquired <xref ref-type="bibr" rid="scirp.144605-24">
      [24]
     </xref>. How one determines the appropriate measure for an arbitrary statistical problem is, in general, still a work in progress. In some cases, especially for a finite system of discrete random variables (see <xref ref-type="bibr" rid="scirp.144605-13">
      [13]
     </xref> for a worked example), the correct measure may be obvious. In more complex cases involving continuous random variables, Jaynes proposed the use of group theoretical methods to deduce 
     <math xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <mi>
         m 
       </mi> 
       <mrow> 
        <mo>
          ( 
        </mo> 
        <mstyle mathvariant="bold" mathsize="normal"> 
         <mi>
           x 
         </mi> 
        </mstyle> 
        <mo>
          ) 
        </mo> 
       </mrow> 
      </mrow> 
     </math> from the appropriate transformation group that describes the invariances of a statistical problem <xref ref-type="bibr" rid="scirp.144605-24">
      [24]
     </xref>.</p>
    <p>It might be thought that the obvious choice of 
     <math xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <mi>
         m 
       </mi> 
       <mrow> 
        <mo>
          ( 
        </mo> 
        <mstyle mathvariant="bold" mathsize="normal"> 
         <mi>
           x 
         </mi> 
        </mstyle> 
        <mo>
          ) 
        </mo> 
       </mrow> 
      </mrow> 
     </math> would be a uniform distribution over the full range of possible outcomes x, since that certainly depicts a state of total ignorance. However, a uniform distribution is not always suitable or even possible, especially for a system of continuous random variables. First, a uniform distribution is not normalizable if the range of the variables is infinite. And second, being independent of coordinates, a uniform distribution does not change under a coordinate transformation, so that the expression (50) would again not be invariant.</p>
    <p>It is rarely the case, however, that an analyst is in a state of total ignorance prior to data collection. For example, in the problem of the distribution of human height and weight, one can be reasonably assured that no adult human ever had a height exceeding 10 m or a weight exceeding 1000 kg. Thus, a suitable approximation to a uniform distribution could be a Gaussian measure 
     <math xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <mi>
         m 
       </mi> 
       <mrow> 
        <mo>
          ( 
        </mo> 
        <mstyle mathvariant="bold" mathsize="normal"> 
         <mi>
           x 
         </mi> 
        </mstyle> 
        <mo>
          ) 
        </mo> 
       </mrow> 
      </mrow> 
     </math> with a width sufficiently large that its center is irrelevant and the function varies very slowly over the practical range of the variables.</p>
   </sec>
   <sec id="s5">
    <title>5. A Stochastic Model of Growth</title>
    <p>Demonstration that an analysis based on the Principle of Maximum Entropy leads naturally to a bivariate lognormal PDF of height and weight does not mean there can be no stochastic physical mechanism that can also generate this statistical distribution. If future anthropometric surveys of larger cohorts are implemented and continue to sustain the bivariate lognormal as the effectively exact distribution of height and weight, there may well be an underlying physical reason. Here is one possible mechanism by which such a distribution can develop.</p>
    <p>Suppose, for example, a study of the genetic basis of human height were to reveal that the variable H could be represented by the geometric mean</p>
    <p>
     <math xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <mi>
         H 
       </mi> 
       <mo>
         = 
       </mo> 
       <msup> 
        <mrow> 
         <mrow> 
          <mo>
            ( 
          </mo> 
          <mrow> 
           <mstyle displaystyle="true"> 
            <munderover> 
             <mo>
               ∏ 
             </mo> 
             <mrow> 
              <mi>
                i 
              </mi> 
              <mo>
                = 
              </mo> 
              <mn>
                1 
              </mn> 
             </mrow> 
             <mi>
               n 
             </mi> 
            </munderover> 
            <mrow> 
             <msub> 
              <mi>
                X 
              </mi> 
              <mi>
                i 
              </mi> 
             </msub> 
            </mrow> 
           </mstyle> 
          </mrow> 
          <mo>
            ) 
          </mo> 
         </mrow> 
        </mrow> 
        <mrow> 
         <mfrac> 
          <mn>
            1 
          </mn> 
          <mi>
            n 
          </mi> 
         </mfrac> 
        </mrow> 
       </msup> 
      </mrow> 
     </math> (51)</p>
    <p>of independent positive random variables 
     <math xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <mrow> 
        <mo>
          { 
        </mo> 
        <mrow> 
         <msub> 
          <mi>
            X 
          </mi> 
          <mi>
            i 
          </mi> 
         </msub> 
        </mrow> 
        <mo>
          } 
        </mo> 
       </mrow> 
      </mrow> 
     </math> 
     <math xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <mrow> 
        <mo>
          ( 
        </mo> 
        <mrow> 
         <mi>
           i 
         </mi> 
         <mo>
           = 
         </mo> 
         <mn>
           1 
         </mn> 
         <mo>
           , 
         </mo> 
         <mo>
           ⋯ 
         </mo> 
         <mo>
           , 
         </mo> 
         <mi>
           n 
         </mi> 
        </mrow> 
        <mo>
          ) 
        </mo> 
       </mrow> 
      </mrow> 
     </math> of finite non-vanishing means and variances. Then the logarithm of H would take the form of the arithmetic mean of the independent variables 
     <math xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <mrow> 
        <mo>
          { 
        </mo> 
        <mrow> 
         <mi>
           ln 
         </mi> 
         <mrow> 
          <mo>
            ( 
          </mo> 
          <mrow> 
           <msub> 
            <mi>
              X 
            </mi> 
            <mi>
              i 
            </mi> 
           </msub> 
          </mrow> 
          <mo>
            ) 
          </mo> 
         </mrow> 
        </mrow> 
        <mo>
          } 
        </mo> 
       </mrow> 
      </mrow> 
     </math></p>
    <p>
     <math xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <mi>
         ln 
       </mi> 
       <mrow> 
        <mo>
          ( 
        </mo> 
        <mi>
          H 
        </mi> 
        <mo>
          ) 
        </mo> 
       </mrow> 
       <mo>
         = 
       </mo> 
       <mfrac> 
        <mn>
          1 
        </mn> 
        <mi>
          n 
        </mi> 
       </mfrac> 
       <mstyle displaystyle="true"> 
        <munderover> 
         <mo>
           ∑ 
         </mo> 
         <mrow> 
          <mi>
            i 
          </mi> 
          <mo>
            = 
          </mo> 
          <mn>
            1 
          </mn> 
         </mrow> 
         <mi>
           n 
         </mi> 
        </munderover> 
        <mrow> 
         <mi>
           ln 
         </mi> 
         <mrow> 
          <mo>
            ( 
          </mo> 
          <mrow> 
           <msub> 
            <mi>
              X 
            </mi> 
            <mi>
              i 
            </mi> 
           </msub> 
          </mrow> 
          <mo>
            ) 
          </mo> 
         </mrow> 
        </mrow> 
       </mstyle> 
       <mtext>
           
       </mtext> 
       <mtext>
           
       </mtext> 
       <mtext>
           
       </mtext> 
       <mtext>
           
       </mtext> 
       <mtext>
           
       </mtext> 
       <mo>
         → 
       </mo> 
       <mtext>
           
       </mtext> 
       <mtext>
           
       </mtext> 
       <mtext>
           
       </mtext> 
       <mtext>
           
       </mtext> 
       <mtext>
           
       </mtext> 
       <mi>
         N 
       </mi> 
       <mrow> 
        <mo>
          ( 
        </mo> 
        <mrow> 
         <msub> 
          <mi>
            M 
          </mi> 
          <mi>
            H 
          </mi> 
         </msub> 
         <mo>
           , 
         </mo> 
         <msubsup> 
          <mi>
            Σ 
          </mi> 
          <mi>
            H 
          </mi> 
          <mn>
            2 
          </mn> 
         </msubsup> 
        </mrow> 
        <mo>
          ) 
        </mo> 
       </mrow> 
      </mrow> 
     </math>, (52)</p>
    <p>which, under conditions for which the Central Limit Theorem <xref ref-type="bibr" rid="scirp.144605-25">
      [25]
     </xref> is applicable, can be approximated by a normal distribution of resulting mean and variance</p>
    <p>
     <math xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <msub> 
        <mi>
          M 
        </mi> 
        <mi>
          H 
        </mi> 
       </msub> 
       <mo>
         = 
       </mo> 
       <mfrac> 
        <mn>
          1 
        </mn> 
        <mi>
          n 
        </mi> 
       </mfrac> 
       <mstyle displaystyle="true"> 
        <munderover> 
         <mo>
           ∑ 
         </mo> 
         <mrow> 
          <mi>
            i 
          </mi> 
          <mo>
            = 
          </mo> 
          <mn>
            1 
          </mn> 
         </mrow> 
         <mi>
           n 
         </mi> 
        </munderover> 
        <mrow> 
         <mrow> 
          <mo>
            〈 
          </mo> 
          <mrow> 
           <mi>
             ln 
           </mi> 
           <mrow> 
            <mo>
              ( 
            </mo> 
            <mrow> 
             <msub> 
              <mi>
                X 
              </mi> 
              <mi>
                i 
              </mi> 
             </msub> 
            </mrow> 
            <mo>
              ) 
            </mo> 
           </mrow> 
          </mrow> 
          <mo>
            〉 
          </mo> 
         </mrow> 
        </mrow> 
       </mstyle> 
       <mtext>
           
       </mtext> 
       <mo>
         , 
       </mo> 
       <mtext>
           
       </mtext> 
       <mtext>
           
       </mtext> 
       <mtext>
           
       </mtext> 
       <mtext>
           
       </mtext> 
       <mtext>
           
       </mtext> 
       <mtext>
           
       </mtext> 
       <mtext>
           
       </mtext> 
       <mtext>
           
       </mtext> 
       <mtext>
           
       </mtext> 
       <msubsup> 
        <mi>
          Σ 
        </mi> 
        <mi>
          H 
        </mi> 
        <mn>
          2 
        </mn> 
       </msubsup> 
       <mo>
         = 
       </mo> 
       <mi>
         var 
       </mi> 
       <mrow> 
        <mo>
          ( 
        </mo> 
        <mrow> 
          <mi>
            ln 
          </mi> 
          <mrow> 
           <mo>
             ( 
           </mo> 
           <mi>
             H 
           </mi> 
           <mo>
             ) 
           </mo> 
          </mrow> 
        </mrow> 
        <mo>
          ) 
        </mo> 
       </mrow> 
      </mrow> 
     </math> (53)</p>
    <p>as indicated by the arrow in Equation (52). From Equations (52) and (53) it would then follow that the marginal distribution of H is lognormal in form, 
     <math xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <mi>
         H 
       </mi> 
       <mo>
         → 
       </mo> 
       <mi>
         Λ 
       </mi> 
       <mrow> 
        <mo>
          ( 
        </mo> 
        <mrow> 
         <msub> 
          <mi>
            M 
          </mi> 
          <mi>
            H 
          </mi> 
         </msub> 
         <mo>
           , 
         </mo> 
         <msubsup> 
          <mi>
            Σ 
          </mi> 
          <mi>
            H 
          </mi> 
          <mn>
            2 
          </mn> 
         </msubsup> 
        </mrow> 
        <mo>
          ) 
        </mo> 
       </mrow> 
      </mrow> 
     </math>. (Upper-case lambda is symbolic of a lognormal distribution.)</p>
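    <p>The limiting behavior described by Equations (51) to (53) can be illustrated with a small simulation. The sketch below is an illustration under stated assumptions rather than a model of any real biological factor: each factor is taken to be uniformly distributed on a hypothetical interval [0.5, 1.5], the factors are independent and identically distributed, and the number of factors and of simulated individuals are arbitrary. Only the Python standard library is used.</p>
    <preformat>
```python
import math
import random
import statistics

random.seed(2025)    # fixed seed so that the sketch is reproducible

A, B = 0.5, 1.5      # hypothetical range of each positive factor X_i
N_FACTORS = 30       # n in Equation (51)
N_SAMPLES = 10000    # number of simulated individuals

def log_moments_uniform(a, b):
    """Exact mean and variance of ln(X) for X uniform on [a, b], with a > 0."""
    m1 = (b * math.log(b) - a * math.log(a)) / (b - a) - 1.0
    def g(x):
        # antiderivative of (ln x)^2
        return x * (math.log(x) ** 2 - 2.0 * math.log(x) + 2.0)
    m2 = (g(b) - g(a)) / (b - a)
    return m1, m2 - m1 ** 2

def sample_log_h(n):
    """ln(H) for one individual: arithmetic mean of ln(X_i), as in Equation (52)."""
    return sum(math.log(random.uniform(A, B)) for _ in range(n)) / n

log_h = [sample_log_h(N_FACTORS) for _ in range(N_SAMPLES)]

mean_ln_x, var_ln_x = log_moments_uniform(A, B)
M_H = mean_ln_x                    # predicted mean of ln(H)
SIGMA2_H = var_ln_x / N_FACTORS    # predicted variance of ln(H) for i.i.d. factors

sample_mean = statistics.fmean(log_h)
sample_var = statistics.variance(log_h)
sd = math.sqrt(sample_var)
sample_skew = sum(((v - sample_mean) / sd) ** 3 for v in log_h) / N_SAMPLES

# Mean of H itself, compared with the lognormal prediction exp(M + Sigma^2 / 2)
mean_h = statistics.fmean([math.exp(v) for v in log_h])
mean_h_lognormal = math.exp(M_H + 0.5 * SIGMA2_H)

print("mean of ln(H):     sample", sample_mean, " predicted", M_H)
print("variance of ln(H): sample", sample_var, " predicted", SIGMA2_H)
print("skewness of ln(H): sample", sample_skew, " (0 for an exact normal)")
print("mean of H:         sample", mean_h, " lognormal", mean_h_lognormal)
```
    </preformat>
    <p>In this sketch the sample mean and variance of ln(H) should track the predicted values, and the small residual skewness of ln(H) should shrink further as the number of factors increases, consistent with the Central Limit Theorem.</p>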
    <p>Suppose further that the variable W is likewise found to be representable as the geometric mean of independent variables 
     <math xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <mrow> 
        <mo>
          { 
        </mo> 
        <mrow> 
         <msub> 
          <mi>
            Y 
          </mi> 
          <mi>
            i 
          </mi> 
         </msub> 
        </mrow> 
        <mo>
          } 
        </mo> 
       </mrow> 
      </mrow> 
     </math> 
     <math xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <mrow> 
        <mo>
          ( 
        </mo> 
        <mrow> 
         <mi>
           i 
         </mi> 
         <mo>
           = 
         </mo> 
         <mn>
           1 
         </mn> 
         <mo>
           , 
         </mo> 
         <mo>
           ⋯ 
         </mo> 
         <mo>
           , 
         </mo> 
         <mi>
           n 
         </mi> 
        </mrow> 
        <mo>
          ) 
        </mo> 
       </mrow> 
      </mrow> 
     </math> with each factor 
     <math xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <msub> 
        <mi>
          Y 
        </mi> 
        <mi>
          i 
        </mi> 
       </msub> 
      </mrow> 
     </math> correlated with the corresponding variable 
     <math xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <msub> 
        <mi>
          X 
        </mi> 
        <mi>
          i 
        </mi> 
       </msub> 
      </mrow> 
     </math>. (For example, suppose that the genes associated with each pair of variables 
     <math xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <mrow> 
        <mo>
          ( 
        </mo> 
        <mrow> 
         <msub> 
          <mi>
            X 
          </mi> 
          <mi>
            i 
          </mi> 
         </msub> 
         <mo>
           , 
         </mo> 
         <msub> 
          <mi>
            Y 
          </mi> 
          <mi>
            i 
          </mi> 
         </msub> 
        </mrow> 
        <mo>
          ) 
        </mo> 
       </mrow> 
      </mrow> 
     </math> are located on the same chromosomes.) Then, by the previous argument, the marginal distribution of W would also be lognormal in form, and the full statistics of the two correlated variables 
     <math xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <mrow> 
        <mo>
          ( 
        </mo> 
        <mrow> 
         <mi>
           H 
         </mi> 
         <mo>
           , 
         </mo> 
         <mi>
           W 
         </mi> 
        </mrow> 
        <mo>
          ) 
        </mo> 
       </mrow> 
      </mrow> 
     </math> would be described by a bivariate lognormal distribution, as is currently found to be the case.</p>
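    <p>The pairing of correlated factors can likewise be sketched numerically. In the following illustration, which is an assumption-laden toy and not a claim about actual genetics, each weight factor is constructed as the product of the corresponding height factor and an independent factor of the same (hypothetical) uniform distribution. With that construction the logarithms of each factor pair have correlation 1/sqrt(2), and the same correlation should carry over to the logarithms of H and W.</p>
    <preformat>
```python
import math
import random

random.seed(7)       # fixed seed so that the sketch is reproducible

N_FACTORS = 30       # number of factor pairs per individual
N_SAMPLES = 5000     # number of simulated individuals

def sample_pair_of_logs(n):
    """Return (ln H, ln W) built from n correlated factor pairs (X_i, Y_i)."""
    sum_ln_x = 0.0
    sum_ln_y = 0.0
    for _ in range(n):
        x = random.uniform(0.5, 1.5)    # hypothetical height factor X_i
        z = random.uniform(0.5, 1.5)    # independent extra factor (assumption)
        y = x * z                       # weight factor Y_i, correlated with X_i
        sum_ln_x += math.log(x)
        sum_ln_y += math.log(y)
    return sum_ln_x / n, sum_ln_y / n

def pearson(u, v):
    """Sample Pearson correlation coefficient of two equal-length lists."""
    n = len(u)
    mu_u = sum(u) / n
    mu_v = sum(v) / n
    cov = sum((a - mu_u) * (b - mu_v) for a, b in zip(u, v))
    var_u = sum((a - mu_u) ** 2 for a in u)
    var_v = sum((b - mu_v) ** 2 for b in v)
    return cov / math.sqrt(var_u * var_v)

pairs = [sample_pair_of_logs(N_FACTORS) for _ in range(N_SAMPLES)]
ln_h = [pair[0] for pair in pairs]
ln_w = [pair[1] for pair in pairs]

rho_sample = pearson(ln_h, ln_w)
rho_expected = 1.0 / math.sqrt(2.0)   # correlation of ln(X_i) and ln(Y_i) in this toy

print("correlation of ln(H) and ln(W): sample", rho_sample, " expected", rho_expected)
```
    </preformat>
    <p>The design choice here is that the correlation enters only at the level of individual factor pairs; averaging over the pairs leaves the correlation coefficient of the logarithms unchanged while the marginal distributions approach normal form.</p>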
    <p>The foregoing mechanism, entailing both a physical cause (biological growth as a product of independent random factors) and a probabilistic reduction (the Central Limit Theorem), is at present entirely hypothetical. As of this writing, the author is not aware of any published studies examining the influence of genetics and/or environment that could account for the statistical distribution of adult human height and weight.</p>
   </sec>
   <sec id="s6">
    <title>6. Conclusions: Current Status and Future Steps</title>
    <p>The objective of this paper has been to account by means of the Principle of Maximum Entropy (PME) for the extraordinary predictive capacity of the bivariate lognormal probability density function (PDF) of human height and weight <xref ref-type="bibr" rid="scirp.144605-3">
      [3]
     </xref>. Currently, there is no known physical reason rooted in the biology of human development for the exactness of this distribution. However, the maximum entropy distribution, derived in Section 2 and elaborated in Section 3, is the most probable distribution consistent with constraints imposed by prior information. In the case of human height and weight, the prior information comprised the means, variances, and linear correlation of the logarithms of height and weight.</p>
    <p>The PME constitutes a general inferential procedure derived from probability theory and not associated with any specific physical agency. Loosely summarized in words, the PME says that the distribution that maximizes the entropy, subject only to the known information provided at the outset, is the distribution that is most likely to occur in any sampling of the chosen variables. As illustrated in Section 3, the probability of this occurrence can be astronomically greater than the probability of any other distribution subject to the same prior information. Thus, the maximum entropy distribution can appear to be an effectively exact distribution for no discernible physical reason.</p>
    <p>A question might arise as to whether one can improve the predictive capacity of a maximum entropy distribution by supplying a more inclusive set of prior information. In fact, the PME itself informs a user whether more or different information is needed. If the resulting PME solution satisfactorily accounts for a given set of data, then providing more prior information is not likely to yield a statistically significant improvement. However, more or different information is needed if the PME solution does not adequately account for the data. Moreover, if the PME variational procedure leads to no solution, then the prior information either is mutually inconsistent or does not reflect the conditions of the experiment or observations that generated the data. Further scrutiny of the physical nature of the problem in question could then suggest to the user what revisions to make.</p>
    <p>As an illustration of the preceding comments, consider first the case of equilibrium statistical mechanics (ESM), initially derived by J. Willard Gibbs in the 19th century and given a modern PME justification by E. T. Jaynes <xref ref-type="bibr" rid="scirp.144605-10">
      [10]
     </xref>. Application of the PME with prior information comprising just the mean values of the internal energy and the number of particles in the system leads to a probability density function of exponential form. From this PDF one can predict correctly a wide range of macroscopic equilibrium properties of both classical and quantum systems, including the fluctuations (i.e. predictive uncertainties) of the variables. Since ESM is ordinarily applied to systems with an enormous number N of degrees of freedom (e.g. 
     <math xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <mi>
         N 
       </mi> 
       <mo>
         ~ 
       </mo> 
       <msup> 
        <mrow> 
         <mn>
           10 
         </mn> 
        </mrow> 
        <mrow> 
         <mn>
           24 
         </mn> 
        </mrow> 
       </msup> 
      </mrow> 
     </math> for 1 mol of a monatomic gas), no further prior information beyond the expectation of energy and particle number is required, since the variances of the means decrease as 
     <math xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <mrow> 
        <mn>
          1 
        </mn> 
        <mo>
          / 
        </mo> 
        <mi>
          N 
        </mi> 
       </mrow> 
      </mrow> 
     </math>.</p>
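    <p>The statement that a mean-value constraint alone leads to a distribution of exponential form can be made concrete with a toy calculation. The sketch below is a deliberately simplified illustration: it assumes a hypothetical system with six equally spaced discrete energy levels and a single constraint on the mean energy (the particle-number constraint is omitted). The Lagrange multiplier, here called beta, is found by bisection, and the resulting distribution is compared with a second distribution that satisfies the same constraints.</p>
    <preformat>
```python
import math

def boltzmann(energies, beta):
    """Normalized probabilities proportional to exp(-beta * E)."""
    weights = [math.exp(-beta * e) for e in energies]
    z = sum(weights)
    return [w / z for w in weights]

def mean_energy(energies, beta):
    return sum(p * e for p, e in zip(boltzmann(energies, beta), energies))

def solve_beta(energies, target, lo=-50.0, hi=50.0, iterations=200):
    """Bisection for beta; the mean energy decreases monotonically with beta."""
    for _ in range(iterations):
        mid = 0.5 * (lo + hi)
        if mean_energy(energies, mid) > target:
            lo = mid    # mean too high: a larger beta is needed
        else:
            hi = mid
    return 0.5 * (lo + hi)

def entropy(p):
    """Shannon entropy -sum p ln p (natural logarithm)."""
    return -sum(q * math.log(q) for q in p if q > 0.0)

ENERGIES = [0.0, 1.0, 2.0, 3.0, 4.0, 5.0]   # hypothetical energy levels
TARGET = 1.5                                # hypothetical prescribed mean energy

beta = solve_beta(ENERGIES, TARGET)
p_maxent = boltzmann(ENERGIES, beta)

# A competing distribution with the same normalization and the same mean:
# the shift sums to zero and has zero first moment (0.01*0 - 0.02*1 + 0.01*2 = 0).
shift = [0.01, -0.02, 0.01, 0.0, 0.0, 0.0]
p_other = [q + s for q, s in zip(p_maxent, shift)]

print("beta =", beta)
print("entropy of exponential solution:", entropy(p_maxent))
print("entropy of competing solution:  ", entropy(p_other))
```
    </preformat>
    <p>In this toy case the exponential distribution should have the larger entropy of the two, as expected of the constrained maximum; any other admissible perturbation could be substituted for the one chosen here.</p>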
    <p>However, for the problem of the distribution of human height (H) and weight (W), prior information consisting of just the mean values 
     <math xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <mrow> 
        <mo>
          ( 
        </mo> 
        <mrow> 
         <mrow> 
          <mo>
            〈 
          </mo> 
          <mi>
            H 
          </mi> 
          <mo>
            〉 
          </mo> 
         </mrow> 
         <mo>
           , 
         </mo> 
         <mrow> 
          <mo>
            〈 
          </mo> 
          <mi>
            W 
          </mi> 
          <mo>
            〉 
          </mo> 
         </mrow> 
        </mrow> 
        <mo>
          ) 
        </mo> 
       </mrow> 
      </mrow> 
     </math> will not suffice. This would lead to an exponential PDF for 
     <math xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <mrow> 
        <mo>
          ( 
        </mo> 
        <mrow> 
         <mi>
           H 
         </mi> 
         <mo>
           , 
         </mo> 
         <mi>
           W 
         </mi> 
        </mrow> 
        <mo>
          ) 
        </mo> 
       </mrow> 
      </mrow> 
     </math>, which does not match the data. Including in the prior information the variances 
     <math xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <mrow> 
        <mo>
          ( 
        </mo> 
        <mrow> 
         <mi>
           var 
         </mi> 
         <mrow> 
          <mo>
            ( 
          </mo> 
          <mi>
            H 
          </mi> 
          <mo>
            ) 
          </mo> 
         </mrow> 
         <mo>
           , 
         </mo> 
         <mi>
           var 
         </mi> 
         <mrow> 
          <mo>
            ( 
          </mo> 
          <mi>
            W 
          </mi> 
          <mo>
            ) 
          </mo> 
         </mrow> 
        </mrow> 
        <mo>
          ) 
        </mo> 
       </mrow> 
      </mrow> 
     </math> will also not suffice. This would lead to a normal PDF for 
     <math xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <mrow> 
        <mo>
          ( 
        </mo> 
        <mrow> 
         <mi>
           H 
         </mi> 
         <mo>
           , 
         </mo> 
         <mi>
           W 
         </mi> 
        </mrow> 
        <mo>
          ) 
        </mo> 
       </mrow> 
      </mrow> 
     </math>, whereas the sample distribution is not symmetric about the means. Only when the prior information comprises the means, variances, and linear correlation of 
     <math xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <mrow> 
        <mo>
          ( 
        </mo> 
        <mrow> 
         <mi>
           ln 
         </mi> 
         <mrow> 
          <mo>
            ( 
          </mo> 
          <mi>
            H 
          </mi> 
          <mo>
            ) 
          </mo> 
         </mrow> 
         <mo>
           , 
         </mo> 
         <mtext>
             
         </mtext> 
         <mtext>
             
         </mtext> 
         <mi>
           ln 
         </mi> 
         <mrow> 
          <mo>
            ( 
          </mo> 
          <mi>
            W 
          </mi> 
          <mo>
            ) 
          </mo> 
         </mrow> 
        </mrow> 
        <mo>
          ) 
        </mo> 
       </mrow> 
      </mrow> 
     </math> does the resulting bivariate lognormal probability density permit accurate prediction of all testable statistics extractable from the ANSUR data.</p>
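    <p>As a hedged illustration of this point, the following minimal Python sketch evaluates the standard closed-form moments implied by a bivariate lognormal density once the means, variances, and linear correlation of (ln H, ln W) are specified. The function names and all numerical parameter values are hypothetical choices made for illustration only; they are not taken from the ANSUR data or from the analysis reported in this article. The sketch shows that the skewness of H and of W is positive and is fixed by the log-variances alone, without any third-moment constraint.</p>
    <preformat>
```python
import math


def lognormal_moments(mu, sigma):
    """Return (mean, variance, skewness) of X = exp(Y), Y ~ Normal(mu, sigma**2)."""
    e = math.exp(sigma ** 2)
    mean = math.exp(mu + 0.5 * sigma ** 2)
    variance = (e - 1.0) * mean ** 2           # equals (e - 1) * exp(2*mu + sigma**2)
    skewness = (e + 2.0) * math.sqrt(e - 1.0)  # depends on sigma only, never negative
    return mean, variance, skewness


def lognormal_correlation(sigma_1, sigma_2, rho_log):
    """Linear correlation of (X1, X2), given the correlation rho_log of (ln X1, ln X2)."""
    numerator = math.exp(rho_log * sigma_1 * sigma_2) - 1.0
    denominator = math.sqrt(
        (math.exp(sigma_1 ** 2) - 1.0) * (math.exp(sigma_2 ** 2) - 1.0)
    )
    return numerator / denominator


if __name__ == "__main__":
    # Hypothetical log-scale parameters, assumed for illustration (not ANSUR values).
    mu_h, sigma_h = math.log(175.0), 0.04  # height scale in cm
    mu_w, sigma_w = math.log(85.0), 0.16   # weight scale in kg
    rho_log = 0.5                          # assumed correlation of (ln H, ln W)

    for label, mu, sigma in (("H", mu_h, sigma_h), ("W", mu_w, sigma_w)):
        mean, variance, skewness = lognormal_moments(mu, sigma)
        print(f"{label}: mean={mean:.2f} sd={math.sqrt(variance):.2f} skew={skewness:.3f}")
    print(f"corr(H, W) = {lognormal_correlation(sigma_h, sigma_w, rho_log):.3f}")
```
    </preformat>
    <p>In this sketch the five log-scale quantities are the only inputs; every other statistic of (H, W), including the asymmetry about the means, follows from them.</p>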
    <p>It may at first seem strange that so small a set of prior information makes it possible to predict correctly the asymmetries of the 
     <math xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <mrow> 
        <mo>
          ( 
        </mo> 
        <mrow> 
         <mi>
           H 
         </mi> 
         <mo>
           , 
         </mo> 
         <mi>
           W 
         </mi> 
        </mrow> 
        <mo>
          ) 
        </mo> 
       </mrow> 
      </mrow> 
     </math> sample distribution without explicitly including in the entropy functional additional terms proportional to the third moments (skewness) of any of the variables. Including such terms would have greatly complicated the analysis, since the resulting distributions would no longer be lognormal in form or even reducible to a closed-form expression. However, the extra work would have achieved little, if anything, since contributions of skewness to the entropy functional decrease as 
     <math xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <mrow> 
        <mn>
          1 
        </mn> 
        <mo>
          / 
        </mo> 
        <mrow> 
         <msup> 
          <mi>
            N 
          </mi> 
          <mn>
            2 
          </mn> 
         </msup> 
        </mrow> 
       </mrow> 
      </mrow> 
     </math> for a sample size N. Recall that 
     <math xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <mi>
         N 
       </mi> 
       <mo>
         &gt; 
       </mo> 
       <mn>
         4000 
       </mn> 
      </mrow> 
     </math> for the ANSUR male cohort. Thus, for large sample size N, the PME analysis of height and weight needed prior information on skewness no more than the PME analysis of equilibrium statistical mechanics needed prior information on variance.</p>
    <p>Consequently, at the present stage in the statistical analysis of human height and weight, there is no compelling reason to modify the prior information without first having more extensive data. Because the sampling variance of a moment depends on the population moment of twice the order <xref ref-type="bibr" rid="scirp.144605-26">
      [26]
     </xref>, the uncertainties of higher-order statistical moments (hyperstatistics) increase rapidly with order, and testing whether the maximum entropy PDF predicts such statistics will require correspondingly larger sample sizes than the large ANSUR database used in References <xref ref-type="bibr" rid="scirp.144605-2">
      [2]
     </xref> and <xref ref-type="bibr" rid="scirp.144605-3">
      [3]
     </xref>.</p>
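    <p>The dependence of sampling variance on moments of twice the order can be made concrete with a short Python sketch. As a simplifying assumption, it uses moments about the origin, for which the sampling variance of the r-th sample moment of N independent observations is exactly the population moment of order 2r minus the square of the moment of order r, divided by N. The lognormal form, the function names, and the parameter values are illustrative assumptions and are not ANSUR estimates.</p>
    <preformat>
```python
import math


def lognormal_raw_moment(r, mu, sigma):
    """Population moment E[X**r] of X = exp(Y), Y ~ Normal(mu, sigma**2)."""
    return math.exp(r * mu + 0.5 * (r * sigma) ** 2)


def relative_std_error(r, mu, sigma, n):
    """Relative standard error of the r-th sample moment about the origin.

    Uses Var(m_r) = (M_2r - M_r**2) / n for n independent observations,
    where M_k is the population moment of order k.
    """
    m_r = lognormal_raw_moment(r, mu, sigma)
    m_2r = lognormal_raw_moment(2 * r, mu, sigma)
    return math.sqrt((m_2r - m_r ** 2) / n) / m_r


if __name__ == "__main__":
    # Hypothetical values chosen for illustration only.
    mu, sigma, n = math.log(85.0), 0.16, 4000
    for r in range(1, 7):
        print(f"order {r}: relative standard error = {relative_std_error(r, mu, sigma, n):.4f}")
```
    </preformat>
    <p>For a lognormal variable the relative standard error in this sketch reduces to the square root of (exp(r²σ²) − 1)/N, so that holding the relative precision fixed as the order r increases requires a sample size that grows at least as fast as r².</p>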
    <p>From the foregoing comments on prior information, sample size, and PME solutions, one can draw an important epistemological lesson. Expressed loosely, the probability density function does not describe attributes of the real world; rather, it describes one’s state of knowledge about the real world. Thus, the idea that there exists a unique “true” probability density function of human height and weight which may emerge from surveys of ever larger populations is illusory. What can emerge, of course, is a more refined PDF that attains greater predictive capability. Nevertheless, it is conceivable that more than one PDF, each a PME solution for a different set of prior information, can account for the same set of data with comparable success. This does not appear to be the case for human height and weight, although the author has encountered such a situation in his analysis of the probability of infectivity of SARS-CoV-2 (COVID) <xref ref-type="bibr" rid="scirp.144605-27">
      [27]
     </xref>.</p>
   </sec>
   <sec id="s7">
    <title>Author</title>
    <p>Dr. M. P. Silverman is the G. A. Jarvis Professor of Physics Emeritus at Trinity College and senior scientist at Tall Pines Research. His areas of research are in nuclear and medical physics. This article was conceived and written by him alone with no reliance on artificial intelligence.</p>
   </sec>
   <sec id="s8">
    <title>Disclaimer</title>
    <p>The author is not affiliated with any commercial companies or organizations and has received no compensation for the composition of this article.</p>
   </sec>
   <sec id="s9">
    <title>Acknowledgements</title>
    <p>The author expresses his gratitude to Dr. S. B. Brachwitz for numerous discussions on matters relating to biological development. He would also like to thank the reviewers for their helpful comments.</p>
   </sec>
  </sec>
 </body><back>
  <ref-list>
   <title>References</title>
   <ref id="scirp.144605-ref1">
    <label>1</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Silverman, M.P. and Lipscombe, T.C. (2022) Exact Statistical Distribution of the Body Mass Index (BMI): Analysis and Experimental Confirmation. Open Journal of Statistics, 12, 324-356. https://doi.org/10.4236/ojs.2022.123022
    </mixed-citation>
   </ref>
   <ref id="scirp.144605-ref2">
    <label>2</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Silverman, M.P. (2025) Perspective on the Body Mass Index (BMI) and Variability of Human Weight and Height. Journal of Biosciences and Medicines, 13, 309-320. https://doi.org/10.4236/jbm.2025.136026
    </mixed-citation>
   </ref>
   <ref id="scirp.144605-ref3">
    <label>3</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Silverman, M.P. (2022) Exact Statistical Distribution and Correlation of Human Height and Weight: Analysis and Experimental Confirmation. Open Journal of Statistics, 12, 743-787. https://doi.org/10.4236/ojs.2022.125044
    </mixed-citation>
   </ref>
   <ref id="scirp.144605-ref4">
    <label>4</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     World Health Organization, Body Mass Index (BMI). https://www.who.int/data/gho/data/themes/topics/topic-details/GHO/body-mass-index
    </mixed-citation>
   </ref>
   <ref id="scirp.144605-ref5">
    <label>5</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Weir, C. and Jan, A. (2023) BMI Classification and Cut-Off Points. https://www.ncbi.nlm.nih.gov/books/NBK541070/
    </mixed-citation>
   </ref>
   <ref id="scirp.144605-ref6">
    <label>6</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Silverman, M.P. (2014) A Certain Uncertainty: Nature’s Random Ways. Cambridge University Press. https://doi.org/10.1017/cbo9781139507370
    </mixed-citation>
   </ref>
   <ref id="scirp.144605-ref7">
    <label>7</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Gordon, C.C., et al. (2012) Technical Report Natick/TR-15/007, Anthropometric Survey of U.S. Army Personnel: Methods and Summary Statistics. U.S. Army Natick Soldier Research, Development and Engineering Center.
    </mixed-citation>
   </ref>
   <ref id="scirp.144605-ref8">
    <label>8</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Mood, A.M., Graybill, F.A. and Boes, D.C. (1974) Introduction to the Theory of Statistics. 3rd Edition, McGraw-Hill, 155-156.
    </mixed-citation>
   </ref>
   <ref id="scirp.144605-ref9">
    <label>9</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Forbes, C., Evans, M., Hastings, N. and Peacock, B. (2010) Statistical Distributions. Wiley. https://doi.org/10.1002/9780470627242
    </mixed-citation>
   </ref>
   <ref id="scirp.144605-ref10">
    <label>10</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Jaynes, E.T. (1957) Information Theory and Statistical Mechanics. Physical Review, 106, 620-630. https://doi.org/10.1103/physrev.106.620
    </mixed-citation>
   </ref>
   <ref id="scirp.144605-ref11">
    <label>11</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Jaynes, E.T. (1957) Information Theory and Statistical Mechanics. II. Physical Review, 108, 171-190. https://doi.org/10.1103/physrev.108.171
    </mixed-citation>
   </ref>
   <ref id="scirp.144605-ref12">
    <label>12</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Skilling, J. and Bryan, R.K. (1984) Maximum Entropy Image Reconstruction: General Algorithm. Monthly Notices of the Royal Astronomical Society, 211, 111-124. https://doi.org/10.1093/mnras/211.1.111
    </mixed-citation>
   </ref>
   <ref id="scirp.144605-ref13">
    <label>13</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Silverman, M.P. (2015) Cheating or Coincidence? Statistical Method Employing the Principle of Maximum Entropy for Judging Whether a Student Has Committed Plagiarism. Open Journal of Statistics, 05, 143-157. https://doi.org/10.4236/ojs.2015.52018
    </mixed-citation>
   </ref>
   <ref id="scirp.144605-ref14">
    <label>14</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Silverman, M.P. (2019) Extraction of Information from Crowdsourcing: Experimental Test Employing Bayesian, Maximum Likelihood, and Maximum Entropy Methods. Open Journal of Statistics, 9, 571-600. https://doi.org/10.4236/ojs.2019.95038
    </mixed-citation>
   </ref>
   <ref id="scirp.144605-ref15">
    <label>15</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Levine, R.D. and Tribus, M. (1978) The Maximum Entropy Formalism. MIT Press.
    </mixed-citation>
   </ref>
   <ref id="scirp.144605-ref16">
    <label>16</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Shannon, C.E. and Weaver, W. (1964) The Mathematical Theory of Communication. The University of Illinois Press.
    </mixed-citation>
   </ref>
   <ref id="scirp.144605-ref17">
    <label>17</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Jaynes, E.T. (1989) Where Do We Stand on Maximum Entropy? (1978). In: Rosenkrantz, R.D., Ed., E. T. Jaynes: Papers on Probability, Statistics and Statistical Physics, Kluwer Academic Publishers, 210-314. https://doi.org/10.1007/978-94-009-6581-2_10
    </mixed-citation>
   </ref>
   <ref id="scirp.144605-ref18">
    <label>18</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Altman, D.G. (1999) Practical Statistics for Medical Research. Chapman &amp; Hall/CRC, 36-37, 143-146.
    </mixed-citation>
   </ref>
   <ref id="scirp.144605-ref19">
    <label>19</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Jaynes, E.T. (1989) Prior Probabilities (1968). In: Rosenkrantz, R.D., Ed., E. T. Jaynes: Papers on Probability, Statistics and Statistical Physics, Kluwer Academic Publishers, 114-130. https://doi.org/10.1007/978-94-009-6581-2_7
    </mixed-citation>
   </ref>
   <ref id="scirp.144605-ref20">
    <label>20</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Jaynes, E.T. (1963) Information Theory and Statistical Mechanics, in the Brandeis University Summer Institute Lectures in Theoretical Physics. W. A. Benjamin, 188-189.
    </mixed-citation>
   </ref>
   <ref id="scirp.144605-ref21">
    <label>21</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Arfken, G.B. and Weber, H.J. (2005) Mathematical Methods for Physicists. 6th Edition, Elsevier, 489-497.
    </mixed-citation>
   </ref>
   <ref id="scirp.144605-ref22">
    <label>22</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Kullback, S. (1968) Information Theory and Statistics. Dover Publications, 1-31.
    </mixed-citation>
   </ref>
   <ref id="scirp.144605-ref23">
    <label>23</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Lang, N. (2024) What Is the Kullback-Leibler Divergence? https://databasecamp.de/en/statistics/kullback-leibler-divergence
    </mixed-citation>
   </ref>
   <ref id="scirp.144605-ref24">
    <label>24</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Jaynes, E.T. (2003) Probability Theory: The Logic of Science. Cambridge University Press, 374-386.
    </mixed-citation>
   </ref>
   <ref id="scirp.144605-ref25">
    <label>25</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Martin, B.R. (1971) Statistics for Physicists. Academic Press, 42-49.
    </mixed-citation>
   </ref>
   <ref id="scirp.144605-ref26">
    <label>26</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Kendall, M.G. and Stuart, A. (1963) The Advanced Theory of Statistics Vol 1: Distribution Theory. 2nd Edition, Hafner, 234.
    </mixed-citation>
   </ref>
   <ref id="scirp.144605-ref27">
    <label>27</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Silverman, M.P. (2023) Probability Distribution of SARS-CoV-2 (COVID) Infectivity Following Onset of Symptoms: Analysis from First Principles. Open Journal of Statistics, 13, 233-263. https://doi.org/10.4236/ojs.2023.132013
    </mixed-citation>
   </ref>
  </ref-list>
 </back>
</article>