<?xml version="1.0" encoding="UTF-8"?><!DOCTYPE article  PUBLIC "-//NLM//DTD Journal Publishing DTD v3.0 20080202//EN" "http://dtd.nlm.nih.gov/publishing/3.0/journalpublishing3.dtd"><article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" dtd-version="3.0" xml:lang="en" article-type="research article"><front><journal-meta><journal-id journal-id-type="publisher-id">AM</journal-id><journal-title-group><journal-title>Applied Mathematics</journal-title></journal-title-group><issn pub-type="epub">2152-7385</issn><publisher><publisher-name>Scientific Research Publishing</publisher-name></publisher></journal-meta><article-meta><article-id pub-id-type="doi">10.4236/am.2024.155022</article-id><article-id pub-id-type="publisher-id">AM-133525</article-id><article-categories><subj-group subj-group-type="heading"><subject>Articles</subject></subj-group><subj-group subj-group-type="Discipline-v2"><subject>Physics&amp;Mathematics</subject></subj-group></article-categories><title-group><article-title>
 
 
  Comparison of Block Design Nonparametric Subset Selection Rules Based on Alternative Scoring Rules
 
</article-title></title-group><contrib-group><contrib contrib-type="author" xlink:type="simple"><name name-style="western"><surname>Gary</surname><given-names>C. McDonald</given-names></name><xref ref-type="aff" rid="aff1"><sup>1</sup></xref></contrib><contrib contrib-type="author" xlink:type="simple"><name name-style="western"><surname>Sajidah</surname><given-names>Alsaeed</given-names></name><xref ref-type="aff" rid="aff1"><sup>1</sup></xref></contrib></contrib-group><aff id="aff1"><addr-line>Department of Mathematics and Statistics, Oakland University, Rochester, MI, USA</addr-line></aff><pub-date pub-type="epub"><day>07</day><month>05</month><year>2024</year></pub-date><volume>15</volume><issue>05</issue><fpage>355</fpage><lpage>389</lpage><history><date date-type="received"><day>24,</day>	<month>April</month>	<year>2024</year></date><date date-type="rev-recd"><day>27,</day>	<month>May</month>	<year>2024</year>	</date><date date-type="accepted"><day>30,</day>	<month>May</month>	<year>2024</year></date></history><permissions><copyright-statement>&#169; Copyright  2014 by authors and Scientific Research Publishing Inc. </copyright-statement><copyright-year>2014</copyright-year><license><license-p>This work is licensed under the Creative Commons Attribution International License (CC BY). http://creativecommons.org/licenses/by/4.0/</license-p></license></permissions><abstract><p>
 
 
  This article compares the size of selected subsets using nonparametric subset selection rules with two different scoring rules for the observations. The scoring rules are based on the expected values of order statistics of the uniform distribution (yielding rank values) and of the normal distribution (yielding normal score values). The comparison is made using state motor vehicle traffic fatality rates, published in a 2016 article, with fifty-one states (including DC as a state) and over a nineteen-year period (1994 through 2012). The earlier study considered four block design selection rules&amp;#8212;two for choosing a subset to contain the &amp;#8220;best&amp;#8221; population (&lt;i&gt;i.e.&lt;/i&gt;, state with lowest mean fatality rate) and two for the &amp;#8220;worst&amp;#8221; population (&lt;i&gt;i.e.&lt;/i&gt;, highest mean rate) with a probability of correct selection chosen to be 0.90. Two selection rules based on normal scores resulted in selected subset sizes substantially smaller than corresponding rules based on ranks (7 vs. 16 and 3 vs. 12). For two other selection rules, the subsets chosen were very close in size (within one). A comparison is also made using state homicide rates, published in a 2022 article, with fifty states and covering eight years. The results are qualitatively the same as those obtained with the motor vehicle traffic fatality rates.
 
</p></abstract><kwd-group><kwd>Order Statistics</kwd><kwd> Rank Scoring Methods</kwd><kwd> Probability of a Correct Selection</kwd><kwd> Subset Size</kwd><kwd> Motor Vehicle Traffic Fatality Rates</kwd><kwd> Homicide Rates</kwd><kwd> Asymptotic Distributions</kwd></kwd-group></article-meta></front><body><sec id="s1"><title>1. Introduction</title><p>Nonparametric statistical methods are useful for analyzing data that might not satisfy the distributional assumptions of parametric methods (e.g., see Conover [<xref ref-type="bibr" rid="scirp.133525-ref1">1</xref>] . In cases where the research hypothesis entails comparing subjects under different conditions or time points, or comparing two subject samples on an outcome variable, nonparametric rank score tests can be invoked (e.g., LaVange and Koch [<xref ref-type="bibr" rid="scirp.133525-ref2">2</xref>] . McDonald [<xref ref-type="bibr" rid="scirp.133525-ref3">3</xref>] [<xref ref-type="bibr" rid="scirp.133525-ref4">4</xref>] developed a class of nonparametric (distribution-free) subset selection rules for block (two-way) design experimental data. These selection procedures are based on scores, i.e., functions of the rank values of the data. Subsequently, there have been many applications of these procedures based on the raw ranks of the data: McDonald [<xref ref-type="bibr" rid="scirp.133525-ref5">5</xref>] ; Lorenzen and McDonald [<xref ref-type="bibr" rid="scirp.133525-ref6">6</xref>] ; Green, et al. [<xref ref-type="bibr" rid="scirp.133525-ref7">7</xref>] ; Green and McDonald [<xref ref-type="bibr" rid="scirp.133525-ref8">8</xref>] ; McDonald [<xref ref-type="bibr" rid="scirp.133525-ref9">9</xref>] ; Wang and McDonald [<xref ref-type="bibr" rid="scirp.133525-ref10">10</xref>] . Gupta and Panchapakesan [<xref ref-type="bibr" rid="scirp.133525-ref11">11</xref>] provide a thorough review of the class of parametric and nonparametric ranking and selection procedures.</p><p>The purpose of this article is to explore the effect of applying a scoring function of ranks, rather than the raw ranks, to the data and subsequently applying the selection procedure. Specifically, how is the selected subset of populations affected in terms of the size and the content? This will be done with two specific data sets used in earlier publications. The foundations of the subset selection rules, taken from McDonald [<xref ref-type="bibr" rid="scirp.133525-ref3">3</xref>] , are described next.</p><p>Let π 1 , ⋯ , π k be k (≥2) independent populations. Let X i j , j = 1 , ⋯ , n ;   i = 1 , ⋯ , k be independent samples of size n from the k populations. Assume the random variables X<sub>ij</sub> have a continuous cumulative distribution function (CDF) F<sub>j</sub>(x;θ<sub>i</sub>), where θ<sub>i</sub>’s belong to some interval Θ on the real line. Suppose F<sub>j</sub>(x;θ) is a stochastically increasing family of distributions in θ; i.e., if θ<sub>1</sub> &lt; θ<sub>2</sub>, then F<sub>j</sub>(x;θ<sub>1</sub>) and F<sub>j</sub>(x;θ<sub>2</sub>) are distinct and F<sub>j</sub>(x;θ<sub>2</sub>) ≤ F<sub>j</sub>(x;θ<sub>1</sub>) for all x. Examples of such families of distributions are: (1) any location parameter family, i.e., F<sub>j</sub>(x;θ) = F<sub>j</sub>(x-θ); (2) any scale parameter family, i.e., F<sub>j</sub>(x;θ) = F<sub>j</sub>(x/θ), θ &gt; 0, x &gt; 0; any family of distribution functions whose densities possess the monotone likelihood ratio property. <xref ref-type="fig" rid="fig1">Figure 1</xref> illustrates that the normal distribution as a location family with respect</p><p>to the mean parameter is stochastically ordered. Note that the CDFs are stacked from top to bottom in inverse order of the mean values.</p><p>Let R<sub>ij</sub> denote the rank of the observation x<sub>ij</sub> among x 1 j , x 2 j , ⋯ , x k j ; i.e., if there are exactly r of the observations x m j , m = 1 , ⋯ , k less than x<sub>ij</sub> then R<sub>ij</sub> = r + 1. These ranks are well-defined with probability one, since the random variables are assumed to have a continuous distribution, and take integer values from 1 to k inclusive. Now let Z ( 1 ) ≤ Z ( 2 ) ≤ ⋯ ≤ Z ( k ) denote an ordered sample of size k from any continuous distribution G, such that − ∞ &lt; a ( r ) ≡ E [ Z ( r ) | G ] &lt; ∞ , r = 1 , ⋯ , k . With each of the random variables X<sub>ij</sub> associate the number a(R<sub>ij</sub>) and define</p><p>H i = ∑ j = 1 n a ( R i j ) ,   i = 1 , ⋯ , k . (1.1)</p><p>The quantity a(R<sub>ij</sub>) is called the score of X<sub>ij</sub>, and the quantities H<sub>i</sub> will define the procedures for selecting a subset of the k populations. Letting θ<sub>[i]</sub> denote the i<sup>th</sup> smallest unknown parameter, it follows that</p><p>F j ( x ; θ [ 1 ] ) ≥ F j ( x ; θ [ 2 ] ) ≥ ⋯ ≥ F j ( x ; θ [ k ] ) ,   ∀ x . (1.2)</p><p>The population whose associated random variables have the distribution F<sub>j</sub>(x;θ<sub>[k]</sub>) is called the “best” population. In case several populations possess the largest parameter value θ<sub>[k]</sub>, one of these is tagged at random and called the best. A “Correct Selection” (CS) is said to occur if and only if the best population is included in the selected subset. In the subset selection formulation, one wishes to select a subset such that the probability is at least equal to a preassigned constant P<sup>*</sup> ( k − 1 &lt; P ∗ &lt; 1 ) that the selected subset includes the best population. Formally, for a given selection rule R,</p><p>inf Ω P ( C S | R ) ≥ P ∗ , (1.3)</p><p>where</p><p>Ω = { θ = ( θ 1 , ⋯ , θ k ) : θ i ∈ Θ , i = 1 , ⋯ , k } . (1.4)</p><p>The choice of P<sup>*</sup> is specified by the analyst and represents the confidence level that the resultant selected subset will contain the best population. The number of populations in the selected subset is a nondecreasing function of P<sup>*</sup>.</p><p>In a similar fashion, the “worst” population can be defined as that population characterized by the probability distribution F<sub>j</sub>(x;θ[<xref ref-type="bibr" rid="scirp.133525-ref1">1</xref>]). Selection procedures can analogously be defined with P<sup>*</sup> requirements on the selected subset to contain the worst population as noted in the following Section 2. The assignment of “best” and “worst” is problem specific as will be noted in the applications to follow.</p></sec><sec id="s2"><title>2. Nonparametric (Distribution-Free) Subset Selection Procedures</title><p>Four subset selection rules are considered for the analysis of state motor vehicle traffic fatality rates (MVTFRs) as given in McDonald [<xref ref-type="bibr" rid="scirp.133525-ref9">9</xref>] . In this application to be described in Section 3, the populations are states and the blocks are years. Since low (high) fatality rates are good (bad), the “best” (“worst”) state is the one with the smallest (largest) mean fatality rate.</p><p>The two selection rules for choosing a subset containing the worst population are given by:</p><p>R<sub>1</sub>: Select π<sub>i</sub> iff H i ≥ max ( H j , j = 1 , ⋯ , k ) − b 1 .</p><p>R<sub>2</sub>: Select π<sub>i</sub> iff H i &gt; b 2 .</p><p>Similarly, the two selection rules for choosing a subset containing the best population are given by:</p><p>R<sub>3</sub>: Select π<sub>i</sub> iff H i ≤ min ( H j , j = 1 , ⋯ , k ) + b 3 .</p><p>R<sub>4</sub>: Select π<sub>i</sub> iff H i &lt; b 4 .</p><p>The non-negative constants b<sub>1</sub>, b<sub>3</sub>, and b<sub>4</sub> are chosen as small as possible and b<sub>2</sub> is chosen as large as possible preserving the probability P<sup>*</sup> goal. In cases considered here, these constants are calculated assuming the population parameters are equal and, thus, the distribution of the H statistics are distribution free. As derived in McDonald [<xref ref-type="bibr" rid="scirp.133525-ref3">3</xref>] rules R<sub>1</sub> and R<sub>3</sub> are justified over a slippage space, Ω', where all parameters θ<sub>i</sub> are equal with the possible exception of θ<sub>[k]</sub> (θ[<xref ref-type="bibr" rid="scirp.133525-ref1">1</xref>]) in case of rule R<sub>1</sub> (R<sub>3</sub>); and rules R<sub>2</sub> and R<sub>4</sub> are applicable over the entire parameter space, Ω. That is, the probability of a correct selection will be no less than P<sup>*</sup>. If k = 2, the two selection rules R<sub>1</sub> and R<sub>2</sub> are equivalent, as are R<sub>3</sub> and R<sub>4</sub>, since H<sub>1</sub> + H<sub>2</sub> is a constant.</p><sec id="s2_1"><title>2.1. Calculation of Selection Rules Constants, G = Uniform Distribution (0, 1)</title><p>With the choice of G to be the uniform distribution on the interval (0, 1), the expected value of the order statistics a ( r ) ≡ E [ Z ( r ) ] = r / ( n + 1 ) . The selection procedures can then be stated in terms of ranks and rank sums. Thus,</p><p>R<sub>1</sub>: Select π<sub>i</sub> iff T i ≥ max ( T j , j = 1 , ⋯ , k ) − b 1 (2.1)</p><p>R<sub>2</sub>: Select π<sub>i</sub> iff T i &gt; b 2 . (2.2)</p><p>Similarly, the two selection rules for choosing a subset containing the best population are given by:</p><p>R<sub>3</sub>: Select π<sub>i</sub> iff T i ≤ min ( T j , j = 1 , ⋯ , k ) + b 3 (2.3)</p><p>R<sub>4</sub>: Select π<sub>i</sub> iff T i &lt; b 4 , (2.4)</p><p>where T i = ∑ j = 1 n R i j , i = 1 , ⋯ , k .</p><p>The calculation of these constants is treated in McDonald [<xref ref-type="bibr" rid="scirp.133525-ref4">4</xref>] [<xref ref-type="bibr" rid="scirp.133525-ref9">9</xref>] [<xref ref-type="bibr" rid="scirp.133525-ref12">12</xref>] for both small and large samples. For the purposes of this article, the asymptotic values are used. The value of b<sub>1</sub> to meet the P<sup>*</sup> requirement is the solution to</p><p>∫ − ∞ ∞ [ ϕ ( x + c b 1 ) ] k − 1 φ ( x ) d x = P ∗ , (2.5)</p><p>where ϕ ( x ) and φ ( x ) are the cdf and density, respectively, of a standard normal random variable, and</p><p>c = c ( n , k ) = [ 12 / n k ( k + 1 ) ] 1 / 2 . (2.6)</p><p>The value of b<sub>3</sub> = b<sub>1</sub>. The values of b<sub>2</sub> and b<sub>4</sub> are given by</p><p>b 2 = [ n ( k 2 − 1 ) / 12 ] 1 / 2 ϕ − 1 ( 1 − P ∗ ) + n ( k + 1 ) / 2 , (2.7)</p><p>where ϕ − 1 ( ⋅ ) is the inverse standard normal CDF, and</p><p>b 4 = n ( k + 1 ) − b 2 . (2.8)</p><p>The selection rules defined in (2.1) through (2.4) are based on rank sums. These arise from the expected values of the order statistics from a standard uniform distribution. This article addresses the question of how the choice of the distribution of the order statistics affects the performance of the subset selection procedures by choosing an alternate distribution for G as in the next section. Conover [<xref ref-type="bibr" rid="scirp.133525-ref1">1</xref>] compares the nonparametric Kruskal-Wallis test based on rank scores to that based on normal scores. He concludes that the asymptotic relative efficiency may be greater or less than one depending on the particular situation. This further motivates such assessments of performance characteristics for nonparametric subset selection procedures.</p></sec><sec id="s2_2"><title>2.2. Calculation of Selection Rules Constants, G = Standard Normal Distribution</title><p>The term normal score is used with two different meanings in statistics. One of them relates to creating a single value which can be treated as if it had arisen from a standard normal distribution (zero mean, unit variance). The second one relates to assigning alternative values to data points within a dataset, with the broad intention of creating data values than can be interpreted as being approximations for values that might have been observed had the data arisen from a standard normal distribution. It is associated with data values derived from the ranks of the observations within the dataset. A given data point is assigned a value that is either exactly, or an approximation to, the expectation of the order statistic of the same rank in a sample of standard normal random variables of the same size as the observed data set.</p><p>With the choice of G to be the normal distribution with mean = 0 and standard deviation = 1, the score of X<sub>ij</sub>, call it a(R<sub>ij</sub>), is the expected value of the i<sup>th </sup>order statistic drawn from a sample of size k from the standard normal distribution. Extensive tabulations (to 5 dp) of expected values of normal order statistics are given by Harter [<xref ref-type="bibr" rid="scirp.133525-ref13">13</xref>] for sample sizes k = 2(1)100(25)250(50)400. Birnbaum and Dudman [<xref ref-type="bibr" rid="scirp.133525-ref14">14</xref>] also provide tabulations of these expected values along with corresponding calculations from the logistic distribution. The selection procedures can then be stated in terms of scores and score sums. Thus,</p><p>Q<sub>1</sub>: Select π<sub>i</sub> iff S i ≥ max ( S j , j = 1 , ⋯ , k ) − d 1 (2.9)</p><p>Q<sub>2</sub>: Select π<sub>i</sub> iff S i &gt; d 2 . (2.10)</p><p>Similarly, the two selection rules for choosing a subset containing the best population are given by:</p><p>Q<sub>3</sub>: Select π<sub>i</sub> iff S i ≤ min ( S j , j = 1 , ⋯ , k ) + d 3 (2.11)</p><p>Q<sub>4</sub>: Select π<sub>i</sub> iff S i &lt; d 4 , (2.12)</p><p>where S i = ∑ j = 1 n a ( R i j ) , i = 1 , ⋯ , k .</p><p>The calculation of the constants d<sub>1</sub>, …, d<sub>4</sub> follow the same lines of derivation as for the respective constants used with the uniform distribution order statistics in Section 2.1. The asymptotic value of the value of d<sub>1</sub> to meet the P<sup>*</sup> requirement is the solution to</p><p>∫ − ∞ ∞ [ ϕ ( x + h d 1 ) ] k − 1 φ ( x ) d x = P ∗ , (2.13)</p><p>where<inline-formula><inline-graphic xlink:href="/html.scirp.org/file/4-7405264x40.png" xlink:type="simple"/></inline-formula></p><p>h = h ( n , k ) = [ ( k − 1 ) / ( n ⋅ s s q ) ] 1 / 2 , (2.14)</p><p>and s s q = ∑ i = 1 k [ a ( R i j ) ] 2 . The value of d<sub>3</sub> = d<sub>1</sub>. The value of d<sub>2</sub> and d<sub>4</sub> are given by</p><p>d 2 = [ n ⋅ s s q / k ] 1 / 2 ϕ − 1 ( 1 − P ∗ ) , (2.15)</p><p>and</p><p>d 4 = – d 2 . (2.16)</p></sec></sec><sec id="s3"><title>3. Description of State Motor Vehicle Traffic Fatality Rates (MVTFRs)</title><p>The state MVTFRs per year analyzed in McDonald [<xref ref-type="bibr" rid="scirp.133525-ref9">9</xref>] are used here to illustrate the impact that the two rank scoring rules described in Section 2 have on the selected subsets using selection procedures R<sub>1</sub>, …, R<sub>4</sub> and Q<sub>1</sub>, …, Q<sub>4</sub>. The data are given in Appendix A (to 2 dp) of the cited reference and contained in the R-code of Appendix A of this article. The fatality rates are given for 51 states (taking the District of Columbia as a state) for the years 1994, …, 2012. The two letter abbreviation for states is given as the variable “State” and the fatality rates for the respective years are given in the variables “y1994”, …, “y2012” in the order of the states specified in “State”. Thus k = 51 populations (states) and n = 19 blocks (years) comprise the data set. The National Highway Traffic Safety Administration (NHTSA) publishes the MVTFRs for all U.S. states each year in the Fatality Analysis Reporting System (FARS). The data can be accessed through the government website: www-fars.nhtsa.dot.gov. The fatality rate per year for each state is expressed as the number of fatalities per 100 million vehicle miles of travel (VMT).</p><p>The cited [<xref ref-type="bibr" rid="scirp.133525-ref9">9</xref>] reference notes the possibility of interaction between the populations and blocks based on the Tukey [<xref ref-type="bibr" rid="scirp.133525-ref15">15</xref>] one degree-of-freedom test. However, raising the fatality rates to the power 0.3 indicates no significant evidence of interaction, and use of a two-way additive model for the transformed rates is plausible. That is,</p><p>X i j 0.3 = μ + θ i + β j + ϵ i j , (3.1)</p><p>where θ<sub>i</sub> indicates the particular state effect, β<sub>j</sub> indicates the year effect, and ϵ i j the random error. The distribution of the transformed MVTFRs will be stochastically ordered in θ as it is a location parameter. Since the power transformation is a monotone transformation, the ranks of the transformed data are identical to the ranks of the original fatality rates to be used here. The cited reference provides a more detailed discussion of the data and the form of the assumed additive model.</p></sec><sec id="s4"><title>4. Applications to the MVTFRs Data</title><p>To apply the selection rules to the MVTFRs data set, the constants b<sub>1</sub>, …, b<sub>4</sub> and d<sub>1</sub>, …. , d<sub>4</sub> need to be obtained. The values for b<sub>1</sub> and d<sub>1</sub> are based on determining the values of c&#183;b<sub>1</sub> and h&#183;d<sub>1</sub>, based on (2.5), (2.6) and (2.13), (2.14) for given values of k, n, and P<sup>*</sup>. These two products are equal and the constants b<sub>1</sub> and d<sub>1</sub> are obtained by dividing the product by c and h respectively. The common value of c&#183;b<sub>1</sub> and h&#183;d<sub>1</sub>, call it w, is easily obtained by noting that the integral expressing in (2.5) is an increasing function of w and using a R-code such as</p><p>w&lt;-3.5</p><p>fucn&lt;-function(x){(pnorm(x+w))^50*dnorm(x)}</p><p>integrate(fucn,lower = -Inf,upper = Inf)</p><p>and successive interval halving to converge on w = 3.666 for k = 51, n = 19, and P<sup>*</sup> = 0.90. The resultant constants for implementing the eight subset selection procedures are given in <xref ref-type="table" rid="table1">Table 1</xref> (to 2 dp).</p><p>Execution of the R-code in Appendix A yields the state rank sums and the state normal score sums given in <xref ref-type="table" rid="table2">Table 2</xref>. The code uses the R function “rank” to order the state MVTFRs for each of the nineteen years. This function provides six methods for ranking. The one used here is the “random” option. If two states have the same fatality rate and are thus tied for, say, ranks r1 and r2, the allocation of those two ranks to the tied states would be done randomly, i.e., each state would have the same probability of assignment of r1 and r2. Consequently, for each of the years the 51 ranks are the whole numbers 1, 2, …, 51. The “average” option would assign to each of the tied states the average of r1 and r2. With averaging, not all of the states would have whole numbers assigned. The “averaging” option was used in McDonald [<xref ref-type="bibr" rid="scirp.133525-ref9">9</xref>] and so there are slight differences between results given in the Appendix B of that reference and <xref ref-type="table" rid="table2">Table 2</xref> given here.</p><p>With <xref ref-type="table" rid="table1">Table 1</xref> and <xref ref-type="table" rid="table2">Table 2</xref>, the selection rules given in Sections 2.1 and 2.2 can be applied to the state MVTFRs specified in Appendix A. Using P<sup>*</sup> = 0.90, selection rule R<sub>1</sub> can be now stated as</p><p>R<sub>1</sub>: Select π<sub>i</sub> iff T i ≥ max ( T j , j = 1 , ⋯ , k ) − b 1 = 930 − 237.53 = 692.47 , (4.1)</p><p>and 16 states are thus included in the chosen subset. Using R<sub>2</sub>, all states with</p><table-wrap id="table1" ><label><xref ref-type="table" rid="table1">Table 1</xref></label><caption><title> Selection rules constants for the MVTFRs (k = 51, n = 19, and P<sup>*</sup> = 0.90)</title></caption><table><tbody><thead><tr><th align="center" valign="middle" >R<sub>1</sub></th><th align="center" valign="middle" >R<sub>2 </sub></th><th align="center" valign="middle" >R<sub>3</sub></th><th align="center" valign="middle" >R<sub>4</sub></th><th align="center" valign="middle" >Q<sub>1</sub></th><th align="center" valign="middle" >Q<sub>2</sub></th><th align="center" valign="middle" >Q<sub>3</sub></th><th align="center" valign="middle" >Q<sub>4</sub></th></tr></thead><tr><td align="center" valign="middle" >b<sub>1</sub> = 237.53</td><td align="center" valign="middle" >b<sub>2</sub> = 411.77</td><td align="center" valign="middle" >b<sub>3</sub> = 237.53</td><td align="center" valign="middle" >b<sub>4</sub> = 576.23</td><td align="center" valign="middle" >d<sub>1</sub> = 15.72</td><td align="center" valign="middle" >d<sub>2</sub> = −5.44</td><td align="center" valign="middle" >d<sub>3</sub> = 15.72</td><td align="center" valign="middle" >d<sub>4</sub> = 5.44</td></tr></tbody></table></table-wrap><table-wrap id="table2" ><label><xref ref-type="table" rid="table2">Table 2</xref></label><caption><title> Rank Sums and Normal Score Sums for MVTFR data, k = 51, n = 19</title></caption><table><tbody><thead><tr><th align="center" valign="middle" >State</th><th align="center" valign="middle" >Rank Sum</th><th align="center" valign="middle" >State</th><th align="center" valign="middle" >Rank Sum</th><th align="center" valign="middle" >State</th><th align="center" valign="middle" >NS Sum</th><th align="center" valign="middle" >State</th><th align="center" valign="middle" >NS Sum</th></tr></thead><tr><td align="center" valign="middle" >MA</td><td align="center" valign="middle" >23</td><td align="center" valign="middle" >PA</td><td align="center" valign="middle" >488</td><td align="center" valign="middle" >MA</td><td align="center" valign="middle" >−41.30654</td><td align="center" valign="middle" >PA</td><td align="center" valign="middle" >−0.28389</td></tr><tr><td align="center" valign="middle" >CT</td><td align="center" valign="middle" >88</td><td align="center" valign="middle" >IA</td><td align="center" valign="middle" >492</td><td align="center" valign="middle" >RI</td><td align="center" valign="middle" >−28.60640</td><td align="center" valign="middle" >IA</td><td align="center" valign="middle" >−0.13781</td></tr><tr><td align="center" valign="middle" >RI</td><td align="center" valign="middle" >93</td><td align="center" valign="middle" >GA</td><td align="center" valign="middle" >499</td><td align="center" valign="middle" >CT</td><td align="center" valign="middle" >−28.16906</td><td align="center" valign="middle" >GA</td><td align="center" valign="middle" >0.25425</td></tr><tr><td align="center" valign="middle" >NJ</td><td align="center" valign="middle" >104</td><td align="center" valign="middle" >KS</td><td align="center" valign="middle" >605</td><td align="center" valign="middle" >NJ</td><td align="center" valign="middle" >−24.72987</td><td align="center" valign="middle" >KS</td><td align="center" valign="middle" >5.63425</td></tr><tr><td align="center" valign="middle" >MN</td><td align="center" valign="middle" >124</td><td align="center" valign="middle" >MO</td><td align="center" valign="middle" >606</td><td align="center" valign="middle" >MN</td><td align="center" valign="middle" >−24.61870</td><td align="center" valign="middle" >MO</td><td align="center" valign="middle" >5.64446</td></tr><tr><td align="center" valign="middle" >NH</td><td align="center" valign="middle" >135</td><td align="center" valign="middle" >TX</td><td align="center" valign="middle" >612</td><td align="center" valign="middle" >NH</td><td align="center" valign="middle" >−23.45682</td><td align="center" valign="middle" >TX</td><td align="center" valign="middle" >6.00957</td></tr><tr><td align="center" valign="middle" >WA</td><td align="center" valign="middle" >169</td><td align="center" valign="middle" >NC</td><td align="center" valign="middle" >614</td><td align="center" valign="middle" >WA</td><td align="center" valign="middle" >−18.93026</td><td align="center" valign="middle" >NC</td><td align="center" valign="middle" >6.02964</td></tr><tr><td align="center" valign="middle" >NY</td><td align="center" valign="middle" >181</td><td align="center" valign="middle" >OK</td><td align="center" valign="middle" >649</td><td align="center" valign="middle" >NY</td><td align="center" valign="middle" >−17.93579</td><td align="center" valign="middle" >OK</td><td align="center" valign="middle" >8.26094</td></tr><tr><td align="center" valign="middle" >MD</td><td align="center" valign="middle" >221</td><td align="center" valign="middle" >AK</td><td align="center" valign="middle" >652</td><td align="center" valign="middle" >VT</td><td align="center" valign="middle" >−17.14453</td><td align="center" valign="middle" >AK</td><td align="center" valign="middle" >8.68151</td></tr><tr><td align="center" valign="middle" >VT</td><td align="center" valign="middle" >226</td><td align="center" valign="middle" >FL</td><td align="center" valign="middle" >699</td><td align="center" valign="middle" >MD</td><td align="center" valign="middle" >−15.01171</td><td align="center" valign="middle" >FL</td><td align="center" valign="middle" >10.89390</td></tr><tr><td align="center" valign="middle" >VA</td><td align="center" valign="middle" >230</td><td align="center" valign="middle" >ID</td><td align="center" valign="middle" >714</td><td align="center" valign="middle" >VA</td><td align="center" valign="middle" >−14.56338</td><td align="center" valign="middle" >ID</td><td align="center" valign="middle" >11.77507</td></tr><tr><td align="center" valign="middle" >CA</td><td align="center" valign="middle" >258</td><td align="center" valign="middle" >NV</td><td align="center" valign="middle" >721</td><td align="center" valign="middle" >CA</td><td align="center" valign="middle" >−12.79833</td><td align="center" valign="middle" >NV</td><td align="center" valign="middle" >12.90143</td></tr><tr><td align="center" valign="middle" >OH</td><td align="center" valign="middle" >261</td><td align="center" valign="middle" >TN</td><td align="center" valign="middle" >744</td><td align="center" valign="middle" >OH</td><td align="center" valign="middle" >−12.38324</td><td align="center" valign="middle" >TN</td><td align="center" valign="middle" >13.44741</td></tr><tr><td align="center" valign="middle" >MI</td><td align="center" valign="middle" >304</td><td align="center" valign="middle" >AL</td><td align="center" valign="middle" >760</td><td align="center" valign="middle" >DC</td><td align="center" valign="middle" >−11.42801</td><td align="center" valign="middle" >AL</td><td align="center" valign="middle" >14.72405</td></tr><tr><td align="center" valign="middle" >IL</td><td align="center" valign="middle" >305</td><td align="center" valign="middle" >KY</td><td align="center" valign="middle" >769</td><td align="center" valign="middle" >MI</td><td align="center" valign="middle" >−10.20329</td><td align="center" valign="middle" >KY</td><td align="center" valign="middle" >15.43561</td></tr><tr><td align="center" valign="middle" >IN</td><td align="center" valign="middle" >313</td><td align="center" valign="middle" >WY</td><td align="center" valign="middle" >772</td><td align="center" valign="middle" >IL</td><td align="center" valign="middle" >−10.02509</td><td align="center" valign="middle" >NM</td><td align="center" valign="middle" >15.80883</td></tr><tr><td align="center" valign="middle" >WI</td><td align="center" valign="middle" >324</td><td align="center" valign="middle" >NM</td><td align="center" valign="middle" >773</td><td align="center" valign="middle" >IN</td><td align="center" valign="middle" >−9.65541</td><td align="center" valign="middle" >WY</td><td align="center" valign="middle" >16.41764</td></tr><tr><td align="center" valign="middle" >DC</td><td align="center" valign="middle" >325</td><td align="center" valign="middle" >SD</td><td align="center" valign="middle" >786</td><td align="center" valign="middle" >WI</td><td align="center" valign="middle" >−8.87739</td><td align="center" valign="middle" >SD</td><td align="center" valign="middle" >17.81267</td></tr><tr><td align="center" valign="middle" >ME</td><td align="center" valign="middle" >330</td><td align="center" valign="middle" >AZ</td><td align="center" valign="middle" >823</td><td align="center" valign="middle" >ME</td><td align="center" valign="middle" >−8.52842</td><td align="center" valign="middle" >AZ</td><td align="center" valign="middle" >20.20069</td></tr><tr><td align="center" valign="middle" >UT</td><td align="center" valign="middle" >350</td><td align="center" valign="middle" >WV</td><td align="center" valign="middle" >824</td><td align="center" valign="middle" >UT</td><td align="center" valign="middle" >−7.78518</td><td align="center" valign="middle" >WV</td><td align="center" valign="middle" >20.55222</td></tr><tr><td align="center" valign="middle" >OR</td><td align="center" valign="middle" >397</td><td align="center" valign="middle" >AR</td><td align="center" valign="middle" >887</td><td align="center" valign="middle" >OR</td><td align="center" valign="middle" >−5.02838</td><td align="center" valign="middle" >AR</td><td align="center" valign="middle" >25.49252</td></tr><tr><td align="center" valign="middle" >HI</td><td align="center" valign="middle" >445</td><td align="center" valign="middle" >LA</td><td align="center" valign="middle" >899</td><td align="center" valign="middle" >HI</td><td align="center" valign="middle" >−2.53294</td><td align="center" valign="middle" >LA</td><td align="center" valign="middle" >27.43415</td></tr><tr><td align="center" valign="middle" >ND</td><td align="center" valign="middle" >449</td><td align="center" valign="middle" >SC</td><td align="center" valign="middle" >910</td><td align="center" valign="middle" >ND</td><td align="center" valign="middle" >−2.33842</td><td align="center" valign="middle" >SC</td><td align="center" valign="middle" >28.77247</td></tr><tr><td align="center" valign="middle" >DE</td><td align="center" valign="middle" >453</td><td align="center" valign="middle" >MT</td><td align="center" valign="middle" >927</td><td align="center" valign="middle" >DE</td><td align="center" valign="middle" >−2.30543</td><td align="center" valign="middle" >MS</td><td align="center" valign="middle" >34.07669</td></tr><tr><td align="center" valign="middle" >NE</td><td align="center" valign="middle" >462</td><td align="center" valign="middle" >MS</td><td align="center" valign="middle" >930</td><td align="center" valign="middle" >NE</td><td align="center" valign="middle" >−1.60001</td><td align="center" valign="middle" >MT</td><td align="center" valign="middle" >34.48870</td></tr><tr><td align="center" valign="middle" >CO</td><td align="center" valign="middle" >469</td><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td><td align="center" valign="middle" >CO</td><td align="center" valign="middle" >−0.36437</td><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td></tr></tbody></table></table-wrap><table-wrap id="table3" ><label><xref ref-type="table" rid="table3">Table 3</xref></label><caption><title> Number of states chosen by selection rules with P<sup>*</sup> = 0.90, k = 51, and n = 19</title></caption><table><tbody><thead><tr><th align="center" valign="middle" >R<sub>1 </sub></th><th align="center" valign="middle" >R<sub>2</sub></th><th align="center" valign="middle" >R<sub>3</sub></th><th align="center" valign="middle" >R<sub>4</sub></th><th align="center" valign="middle" >Q<sub>1 </sub></th><th align="center" valign="middle" >Q<sub>2</sub></th><th align="center" valign="middle" >Q<sub>3</sub></th><th align="center" valign="middle" >Q<sub>4</sub></th></tr></thead><tr><td align="center" valign="middle" >16</td><td align="center" valign="middle" >30</td><td align="center" valign="middle" >12</td><td align="center" valign="middle" >29</td><td align="center" valign="middle" >7</td><td align="center" valign="middle" >31</td><td align="center" valign="middle" >3</td><td align="center" valign="middle" >29</td></tr></tbody></table></table-wrap><p>rank sums exceeding 411.77 are selected yielding a subset containing 30 states. Following the two examples just given, <xref ref-type="table" rid="table3">Table 3</xref> provides the number of selected states in the subsets chosen by the four rules using rank sums and the four rules using normal scores.</p><p>Clearly the number of populations chosen using the rank sum vs. the normal score sum makes a substantial difference. The subset size using Q<sub>1</sub> is slightly less than half of that using R<sub>1</sub> (7 vs. 16). The subset size using Q<sub>3</sub> is a quarter of that using R<sub>3</sub> (3 vs. 12). However, the subset sizes Q<sub>2</sub> and R2 (Q<sub>4</sub> and R<sub>4</sub>) are within one of each other (are equal).</p><p>The correlation between the rank values and the normal score values is 0.983. The R-code of Appendix B produces <xref ref-type="fig" rid="fig2">Figure 2</xref>. The left displays the normal scores vs. the rank scores along with the least squares regression line (Reg Line). The linear fit looks quite good with the exception of the two or three end points on both sides. A notable difference in the rank values and the normal score values is the spacing between successive values. The spacing between any two successive values of the uniform order statistics is 1/(k + 1), and so the difference in rank values is one, a constant. For the normal scores the spacing for extreme values is much larger than the other spacings. <xref ref-type="fig" rid="fig2">Figure 2</xref> (right) displays the differences between the successive expected values of the order statistics from the standard normal distribution for k = 51, i.e., diff [ x ] = a ( x + 1 ) − a ( x ) , x = 1 , ⋯ , 5 0 (see Appendix B for the R-code). For example, for x = 1, diff [ 1 ] = a ( 2 ) − a ( 1 ) = 0.39307 , the maximum spacing value shared with diff[<xref ref-type="bibr" rid="scirp.133525-ref50">50</xref>]. The minimum spacing value is diff[<xref ref-type="bibr" rid="scirp.133525-ref25">25</xref>] = diff[<xref ref-type="bibr" rid="scirp.133525-ref26">26</xref>] = 0.04896. Thus the more extreme values carry a substantially larger differential weight than the more moderate values, and the spacings are symmetric about the point 25.5 as indicated by the vertical line in the right panel.</p></sec><sec id="s5"><title>5. Applications to the State Homicide Rates (SHRs) Data</title><p>The data set, as analyzed by Wang and McDonald [<xref ref-type="bibr" rid="scirp.133525-ref10">10</xref>] is given in the R-code of</p><p>Appendix C. It consists of state homicide rates (i.e., homicides per 100,000 residents) for the years 2005, 2014-2020. An indicated rate of 0.00 is not actually zero since only 2 decimal points were retained for the data. The SHRs are obtained from the Center for Disease Control and Prevention (CDC) at https://www.cdc.gov/nchs/pressroom/sosmap/homicide_mortality/homicide.htm.The CDC website does not contain data for the years 2006 through 2013.</p><p>The transformation, x<sup>0.4</sup>, applied to the SHRs results in a two-way additive model which, plausibly, lacks interaction between the categorical variables ‘state’ and ‘year’ based on the Tukey one degree-of-freedom test. Since this transformation is monotone, analysis can be done directly with the rates without the power transformation as is done in Section 4. The rank sums and the normal score sums, calculated with the R-code of Appendix C, are given in <xref ref-type="table" rid="table4">Table 4</xref>.</p><table-wrap id="table4" ><label><xref ref-type="table" rid="table4">Table 4</xref></label><caption><title> Rank sums and normal score sums for SHR data, k = 50, n = 8</title></caption><table><tbody><thead><tr><th align="center" valign="middle" >State</th><th align="center" valign="middle" >Rank Sum</th><th align="center" valign="middle" >State</th><th align="center" valign="middle" >Rank Sum</th><th align="center" valign="middle" >State</th><th align="center" valign="middle" >NS Sum</th><th align="center" valign="middle" >State</th><th align="center" valign="middle" >NS Sum</th></tr></thead><tr><td align="center" valign="middle" >VT</td><td align="center" valign="middle" >24</td><td align="center" valign="middle" >WV</td><td align="center" valign="middle" >217</td><td align="center" valign="middle" >NH</td><td align="center" valign="middle" >−14.25764</td><td align="center" valign="middle" >WV</td><td align="center" valign="middle" >0.66234</td></tr><tr><td align="center" valign="middle" >NH</td><td align="center" valign="middle" >25</td><td align="center" valign="middle" >TX</td><td align="center" valign="middle" >230</td><td align="center" valign="middle" >VT</td><td align="center" valign="middle" >−13.84980</td><td align="center" valign="middle" >TX</td><td align="center" valign="middle" >1.31531</td></tr><tr><td align="center" valign="middle" >ME</td><td align="center" valign="middle" >27</td><td align="center" valign="middle" >PA</td><td align="center" valign="middle" >239</td><td align="center" valign="middle" >ME</td><td align="center" valign="middle" >−13.11673</td><td align="center" valign="middle" >PA</td><td align="center" valign="middle" >1.77072</td></tr><tr><td align="center" valign="middle" >UT</td><td align="center" valign="middle" >58</td><td align="center" valign="middle" >IN</td><td align="center" valign="middle" >244</td><td align="center" valign="middle" >ND</td><td align="center" valign="middle" >−10.29260</td><td align="center" valign="middle" >IN</td><td align="center" valign="middle" >1.83492</td></tr><tr><td align="center" valign="middle" >ID</td><td align="center" valign="middle" >61</td><td align="center" valign="middle" >KY</td><td align="center" valign="middle" >245</td><td align="center" valign="middle" >ID</td><td align="center" valign="middle" >−9.15845</td><td align="center" valign="middle" >KY</td><td align="center" valign="middle" >2.12983</td></tr><tr><td align="center" valign="middle" >MA</td><td align="center" valign="middle" >61</td><td align="center" valign="middle" >AZ</td><td align="center" valign="middle" >249</td><td align="center" valign="middle" >RI</td><td align="center" valign="middle" >−9.09363</td><td align="center" valign="middle" >AZ</td><td align="center" valign="middle" >2.64468</td></tr><tr><td align="center" valign="middle" >ND</td><td align="center" valign="middle" >61</td><td align="center" valign="middle" >FL</td><td align="center" valign="middle" >256</td><td align="center" valign="middle" >UT</td><td align="center" valign="middle" >−8.85947</td><td align="center" valign="middle" >FL</td><td align="center" valign="middle" >2.66931</td></tr><tr><td align="center" valign="middle" >RI</td><td align="center" valign="middle" >62</td><td align="center" valign="middle" >OH</td><td align="center" valign="middle" >262</td><td align="center" valign="middle" >MA</td><td align="center" valign="middle" >−8.66497</td><td align="center" valign="middle" >OH</td><td align="center" valign="middle" >3.00196</td></tr><tr><td align="center" valign="middle" >MN</td><td align="center" valign="middle" >69</td><td align="center" valign="middle" >MI</td><td align="center" valign="middle" >266</td><td align="center" valign="middle" >MN</td><td align="center" valign="middle" >−7.98909</td><td align="center" valign="middle" >MI</td><td align="center" valign="middle" >3.22598</td></tr><tr><td align="center" valign="middle" >HI</td><td align="center" valign="middle" >72</td><td align="center" valign="middle" >NC</td><td align="center" valign="middle" >274</td><td align="center" valign="middle" >HI</td><td align="center" valign="middle" >−7.72306</td><td align="center" valign="middle" >NC</td><td align="center" valign="middle" >3.66404</td></tr><tr><td align="center" valign="middle" >IA</td><td align="center" valign="middle" >84</td><td align="center" valign="middle" >NV</td><td align="center" valign="middle" >282</td><td align="center" valign="middle" >WY</td><td align="center" valign="middle" >−7.07594</td><td align="center" valign="middle" >NV</td><td align="center" valign="middle" >4.18924</td></tr><tr><td align="center" valign="middle" >OR</td><td align="center" valign="middle" >98</td><td align="center" valign="middle" >AK</td><td align="center" valign="middle" >285</td><td align="center" valign="middle" >IA</td><td align="center" valign="middle" >−6.88568</td><td align="center" valign="middle" >DE</td><td align="center" valign="middle" >4.33158</td></tr><tr><td align="center" valign="middle" >WY</td><td align="center" valign="middle" >99</td><td align="center" valign="middle" >DE</td><td align="center" valign="middle" >285</td><td align="center" valign="middle" >OR</td><td align="center" valign="middle" >−5.78677</td><td align="center" valign="middle" >AK</td><td align="center" valign="middle" >4.78824</td></tr><tr><td align="center" valign="middle" >NE</td><td align="center" valign="middle" >103</td><td align="center" valign="middle" >OK</td><td align="center" valign="middle" >307</td><td align="center" valign="middle" >NE</td><td align="center" valign="middle" >−5.69598</td><td align="center" valign="middle" >OK</td><td align="center" valign="middle" >5.72751</td></tr><tr><td align="center" valign="middle" >CT</td><td align="center" valign="middle" >116</td><td align="center" valign="middle" >GA</td><td align="center" valign="middle" >314</td><td align="center" valign="middle" >CT</td><td align="center" valign="middle" >−4.66888</td><td align="center" valign="middle" >GA</td><td align="center" valign="middle" >6.03714</td></tr><tr><td align="center" valign="middle" >WA</td><td align="center" valign="middle" >128</td><td align="center" valign="middle" >IL</td><td align="center" valign="middle" >316</td><td align="center" valign="middle" >WA</td><td align="center" valign="middle" >−3.96127</td><td align="center" valign="middle" >IL</td><td align="center" valign="middle" >6.27973</td></tr><tr><td align="center" valign="middle" >NY</td><td align="center" valign="middle" >133</td><td align="center" valign="middle" >TN</td><td align="center" valign="middle" >333</td><td align="center" valign="middle" >NY</td><td align="center" valign="middle" >−3.72165</td><td align="center" valign="middle" >TN</td><td align="center" valign="middle" >7.46995</td></tr><tr><td align="center" valign="middle" >SD</td><td align="center" valign="middle" >149</td><td align="center" valign="middle" >AR</td><td align="center" valign="middle" >346</td><td align="center" valign="middle" >SD</td><td align="center" valign="middle" >−2.85342</td><td align="center" valign="middle" >AR</td><td align="center" valign="middle" >8.60699</td></tr><tr><td align="center" valign="middle" >MT</td><td align="center" valign="middle" >154</td><td align="center" valign="middle" >NM</td><td align="center" valign="middle" >347</td><td align="center" valign="middle" >MT</td><td align="center" valign="middle" >−2.57816</td><td align="center" valign="middle" >NM</td><td align="center" valign="middle" >8.72040</td></tr><tr><td align="center" valign="middle" >NJ</td><td align="center" valign="middle" >156</td><td align="center" valign="middle" >SC</td><td align="center" valign="middle" >356</td><td align="center" valign="middle" >NJ</td><td align="center" valign="middle" >−2.46596</td><td align="center" valign="middle" >SC</td><td align="center" valign="middle" >9.40882</td></tr><tr><td align="center" valign="middle" >CO</td><td align="center" valign="middle" >157</td><td align="center" valign="middle" >MO</td><td align="center" valign="middle" >361</td><td align="center" valign="middle" >CO</td><td align="center" valign="middle" >−2.39117</td><td align="center" valign="middle" >MO</td><td align="center" valign="middle" >10.12647</td></tr><tr><td align="center" valign="middle" >WI</td><td align="center" valign="middle" >158</td><td align="center" valign="middle" >MD</td><td align="center" valign="middle" >362</td><td align="center" valign="middle" >WI</td><td align="center" valign="middle" >−2.34701</td><td align="center" valign="middle" >MD</td><td align="center" valign="middle" >10.37644</td></tr><tr><td align="center" valign="middle" >KS</td><td align="center" valign="middle" >196</td><td align="center" valign="middle" >AL</td><td align="center" valign="middle" >384</td><td align="center" valign="middle" >KS</td><td align="center" valign="middle" >−0.40414</td><td align="center" valign="middle" >AL</td><td align="center" valign="middle" >13.09039</td></tr><tr><td align="center" valign="middle" >VA</td><td align="center" valign="middle" >199</td><td align="center" valign="middle" >MS</td><td align="center" valign="middle" >390</td><td align="center" valign="middle" >VA</td><td align="center" valign="middle" >−0.24508</td><td align="center" valign="middle" >MS</td><td align="center" valign="middle" >14.87734</td></tr><tr><td align="center" valign="middle" >CA</td><td align="center" valign="middle" >202</td><td align="center" valign="middle" >LA</td><td align="center" valign="middle" >398</td><td align="center" valign="middle" >CA</td><td align="center" valign="middle" >−0.06694</td><td align="center" valign="middle" >LA</td><td align="center" valign="middle" >17.20416</td></tr></tbody></table></table-wrap><p>Applying the selection rules to the SHRs data set, the constants b<sub>1</sub>, …, b<sub>4</sub> and d<sub>1</sub>, …, d<sub>4</sub> need to be obtained as in Section 4 and given in <xref ref-type="table" rid="table5">Table 5</xref> (to 2 dp). For this data set, k = 50 states and n = 8 years. The b<sub>1</sub> (and b<sub>3</sub>) are obtained from the R-code given in Appendix D based on 50,000 simulations. The b<sub>2</sub> and b<sub>4</sub> values are obtained using the Appendix E R-code. Similarly, the d<sub>1</sub> (and d<sub>3</sub>) are obtained from Appendix F, and d<sub>2</sub> and d<sub>4</sub> from Appendix G. A simulation approach seems preferable to the asymptotic approach used in Section 5 since n is relatively small. For comparison, the asymptotic values of the selection constants are given in the last row of <xref ref-type="table" rid="table5">Table 5</xref> in italics and are seen to be quite close to the simulated values. The sum of squares for the normal scores, ssq for k = 50 and n = 8, is 47.4217 and is used in the calculations for the asymptotic values.</p><p><xref ref-type="table" rid="table6">Table 6</xref> provides the number of selected states in the subsets chosen by the four rules using rank sums and the four rules using normal scores. As noted for the MVTFRs (<xref ref-type="table" rid="table3">Table 3</xref>), clearly the number of populations chosen using the rank sum vs. the normal score sum makes a substantial difference. The subset size using Q<sub>1</sub> is less than half that using R<sub>1</sub> (9 vs. 19). The subset size using Q<sub>3</sub> is substantially less in comparison to that of R<sub>3</sub> (15 vs. 22). However, the subset sizes Q<sub>2</sub> and R<sub>2</sub> (Q<sub>4</sub> and R<sub>4</sub>) are within one (two) of each other. The results of the comparative analyses of the MVTFRs and the SHRs are very similar.</p></sec><sec id="s6"><title>6. Summary and Conclusions</title><p>As observed here, R<sub>2</sub> chooses substantially more populations in the selected subset than does rule R<sub>1</sub>. This might be expected since R<sub>2</sub> guarantees a probability of correct selection to be no less than P<sup>*</sup> for any configuration of the population θ-parameters, while that guarantee for rule R<sub>1</sub> is proven for slippage configurations of the θ-parameters. However, limited simulation studies do suggest that the stronger unconstrained P<sup>*</sup> guarantee for R<sub>1</sub> may hold for some classes of distributions (e.g., see Lorenzen and McDonald [<xref ref-type="bibr" rid="scirp.133525-ref6">6</xref>] ). In general for the rank sums, n ( k + 1 ) / 2 ≤ max ( T j , j = 1 , ⋯ , k ) ≤ n ⋅ k , so for k = 51 and n = 19, 494 ≤ max (T<sub>j</sub>) ≤ 969. For P<sup>*</sup> = 0.90, b<sub>1</sub> = 237.53, so 256.47 ≤ max ( T j ) − b 1 ≤ 731.47 . With b<sub>1</sub> = 237.53 and max (T<sub>j</sub>) = 930, then rule R<sub>1</sub> selects all states such that T<sub>i</sub> ≥ 692.47. For rule R<sub>2</sub> the determination of b<sub>2</sub> as seen in (2.7) does not depend on the ranks. It depends only on k, n, and P<sup>*</sup>. So here b<sub>2</sub> = 411.77 and thus R<sub>2</sub> chooses all states</p><table-wrap id="table5" ><label><xref ref-type="table" rid="table5">Table 5</xref></label><caption><title> Selection rules constants for the SHRs (k = 50, n = 8, and P<sup>*</sup> = 0.90)</title></caption><table><tbody><thead><tr><th align="center" valign="middle" >R<sub>1 </sub></th><th align="center" valign="middle" >R<sub>2</sub></th><th align="center" valign="middle" >R<sub>3</sub></th><th align="center" valign="middle" >R<sub>4</sub></th><th align="center" valign="middle" >Q<sub>1</sub></th><th align="center" valign="middle" >Q<sub>2</sub></th><th align="center" valign="middle" >Q<sub>3</sub></th><th align="center" valign="middle" >Q<sub>4</sub></th></tr></thead><tr><td align="center" valign="middle" >b<sub>1</sub> = 148</td><td align="center" valign="middle" >b<sub>2</sub> = 151</td><td align="center" valign="middle" >b<sub>3</sub> = 148</td><td align="center" valign="middle" >b<sub>4</sub> = 257</td><td align="center" valign="middle" >d<sub>1</sub> = 10.13</td><td align="center" valign="middle" >d<sub>2</sub> = −3.54</td><td align="center" valign="middle" >d<sub>3</sub> = 10.13</td><td align="center" valign="middle" >d<sub>4</sub> = 3.54</td></tr><tr><td align="center" valign="middle" >150.85</td><td align="center" valign="middle" >151.7</td><td align="center" valign="middle" >150.85</td><td align="center" valign="middle" >256.3</td><td align="center" valign="middle" >10.18</td><td align="center" valign="middle" >−3.53</td><td align="center" valign="middle" >10.18</td><td align="center" valign="middle" >3.53</td></tr></tbody></table></table-wrap><table-wrap id="table6" ><label><xref ref-type="table" rid="table6">Table 6</xref></label><caption><title> Number of states chosen by selection rules with P<sup>*</sup> = 0.90, k = 50, and n = 8</title></caption><table><tbody><thead><tr><th align="center" valign="middle" >R<sub>1 </sub></th><th align="center" valign="middle" >R<sub>2</sub></th><th align="center" valign="middle" >R<sub>3</sub></th><th align="center" valign="middle" >R<sub>4</sub></th><th align="center" valign="middle" >Q<sub>1 </sub></th><th align="center" valign="middle" >Q<sub>2</sub></th><th align="center" valign="middle" >Q<sub>3</sub></th><th align="center" valign="middle" >Q<sub>4</sub></th></tr></thead><tr><td align="center" valign="middle" >19</td><td align="center" valign="middle" >32</td><td align="center" valign="middle" >22</td><td align="center" valign="middle" >32</td><td align="center" valign="middle" >9</td><td align="center" valign="middle" >33</td><td align="center" valign="middle" >15</td><td align="center" valign="middle" >34</td></tr></tbody></table></table-wrap><p>such that T<sub>i</sub> &gt; 411.77. So which rule places more populations in the selected subset depends on max(T<sub>j</sub>). If its value is relatively close to the upper bound n&#183;k, then R<sub>1</sub> chooses fewer populations than R<sub>2</sub>. If its value is relatively close to the lower bound n(k + 1)/2, then R<sub>1</sub> chooses more populations than R<sub>2</sub>.</p><p>With the traffic fatality rates considered here, rule Q<sub>1</sub> placed seven states in the selected subset and rule R<sub>1</sub> placed sixteen states in the selected subset. So which of these two rules to use in practice? From <xref ref-type="fig" rid="fig2">Figure 2</xref>, it appears that Q<sub>1</sub> would be the appropriate choice when it is desired to place relatively greater weight on the extreme three or four observations and the underlying distribution of the data is approximately symmetric. <xref ref-type="fig" rid="fig3">Figure 3</xref> shows the values of the MVTFRs for the year 1994 to be approximately symmetric and normal, a characteristic shared by most of the years. Such a pattern seems to favor the choice of normal scores over the rank scores.</p><p>The same statements would apply to the choice between Q<sub>3</sub> and R<sub>3</sub>. Clearly there is more work to be done in this area of statistical inference. This article compared only two scoring rules based on the expected values of order statistics from two distributions, the uniform distribution and the normal distribution. Substantial differences in the size of selected subsets result from the application of these nonparametric subset selection rules to a study of state motor vehicle traffic fatality rates (state homicide rates) over a nineteen (eight) year period. Is it possible for R<sub>1</sub> to place fewer populations in the selected subset than rule Q<sub>1</sub>?</p><p>While the examples given in Sections 4 and 5 demonstrate that the selected subset size using rule Q<sub>1</sub> (normal scores) can be smaller than that using rule R<sub>1</sub> (rank scores), it should be noted that this is not always so. Consider the case where the probability distributions for each of the populations share support over the same interval. Then it’s possible that within each block any rank order</p><table-wrap id="table7" ><label><xref ref-type="table" rid="table7">Table 7</xref></label><caption><title> Ranked data for k = 7, n = 2, and selected populations noted in red, P<sup>*</sup> = 0.75</title></caption><table><tbody><thead><tr><th align="center" valign="middle" >Population</th><th align="center" valign="middle" >π<sub>1 </sub></th><th align="center" valign="middle" >π<sub>2</sub></th><th align="center" valign="middle" >π<sub>3</sub></th><th align="center" valign="middle" >π<sub>4</sub></th><th align="center" valign="middle" >π<sub>5</sub></th><th align="center" valign="middle" >π<sub>6</sub></th><th align="center" valign="middle" >π<sub>7</sub></th></tr></thead><tr><td align="center" valign="middle" >Block 1 ranks</td><td align="center" valign="middle" >1</td><td align="center" valign="middle" >2</td><td align="center" valign="middle" >3</td><td align="center" valign="middle" >4</td><td align="center" valign="middle" >5</td><td align="center" valign="middle" >6</td><td align="center" valign="middle" >7</td></tr><tr><td align="center" valign="middle" >Block 2 ranks</td><td align="center" valign="middle" >1</td><td align="center" valign="middle" >3</td><td align="center" valign="middle" >2</td><td align="center" valign="middle" >7</td><td align="center" valign="middle" >5</td><td align="center" valign="middle" >6</td><td align="center" valign="middle" >4</td></tr><tr><td align="center" valign="middle" >T<sub>i </sub></td><td align="center" valign="middle" >2</td><td align="center" valign="middle" >5</td><td align="center" valign="middle" >5</td><td align="center" valign="middle" >11</td><td align="center" valign="middle" >10</td><td align="center" valign="middle" >12</td><td align="center" valign="middle" >11<sub> </sub></td></tr><tr><td align="center" valign="middle" >S<sub>i </sub></td><td align="center" valign="middle" >−2.70436</td><td align="center" valign="middle" >−1.11008</td><td align="center" valign="middle" >−1.11008</td><td align="center" valign="middle" >1.35218</td><td align="center" valign="middle" >0.70542</td><td align="center" valign="middle" >1.51474</td><td align="center" valign="middle" >1.35218</td></tr></tbody></table></table-wrap><table-wrap id="table8" ><label><xref ref-type="table" rid="table8">Table 8</xref></label><caption><title> The Number of Configurations (N) for Which the Number of Populations Chosen by R<sub>1</sub> Less the Number Chosen by Q<sub>1</sub> is Equal to ∆ for k = 7, n = 2, and P<sup>*</sup> = 0.7</title></caption><table><tbody><thead><tr><th align="center" valign="middle" >∆</th><th align="center" valign="middle" >−2</th><th align="center" valign="middle" >−1</th><th align="center" valign="middle" >0</th><th align="center" valign="middle" >1</th></tr></thead><tr><td align="center" valign="middle" >N<sub> </sub></td><td align="center" valign="middle" >40,320</td><td align="center" valign="middle" >665,280</td><td align="center" valign="middle" >23,486,400</td><td align="center" valign="middle" >1,209,600</td></tr></tbody></table></table-wrap><p>of the population observations can occur. Using the R-code in Appendix H with k = 7, n = 2, and P<sup>*</sup> = 0.75, it’s determined that b<sub>1</sub> = 6 and d<sub>1</sub> = 2.70436. Assuming the observations yield the ranked values given in <xref ref-type="table" rid="table7">Table 7</xref>, then using the selection rules given in (2.1) and (2.9) rule R<sub>1</sub> chooses four populations and rule, Q<sub>1</sub> chooses six populations.</p><p>With k = 7, there are seven factorial (5040) permutations of possible rank orders for a given block. Thus, with k = 7 and n = 2 there are 5040<sup>2</sup> = 25,401,600 possible rank order configurations for this experimental design with two blocks. The number of these configurations yielding specific differences in number of populations chosen by the two ranking procedures is calculated with the R-code in Appendix H and is given in <xref ref-type="table" rid="table8">Table 8</xref>. Negative ∆-values indicate that subset selections using R<sub>1</sub> result in fewer chosen population that does that using Q<sub>1</sub>. The specific configuration given in <xref ref-type="table" rid="table7">Table 7</xref> is one of the 40,320 given in <xref ref-type="table" rid="table8">Table 8</xref> under ∆ = −2. Of the total number of possible configurations, 705,600 (or approximately 2.8 percent) yield smaller subset sizes chosen by R<sub>1</sub> compared to that of Q<sub>1</sub>.</p><p>Given the findings in this article, what should be done in practice? The state of theoretical development along with observance of outcomes using differing scoring rules, suggests analyses should be carried out with several scoring rules, such as ranks and normal scores, for a fixed value of P*. Then use the results that yield the smaller subset size for the given probability of correct selection criteria.</p></sec><sec id="s7"><title>Conflicts of Interest</title><p>The authors declare no conflicts of interest regarding the publication of this paper.</p></sec><sec id="s8"><title>Cite this paper</title><p>McDonald, G.C. and Alsaeed, S. (2024) Comparison of Block Design Nonparametric Subset Selection Rules Based on Alternative Scoring Rules. Applied Mathematics, 15, 355-389. https://doi.org/10.4236/am.2024.155022</p></sec><sec id="s9"><title>Appendix A</title><p>#MVTFR Ranks 1994_2012</p><p>#Rank sums for the MVTFRs</p><p>#WIREs Comput Stat 2016, 8:222-237, doi: 10.1002/wics.1385</p><p>#k is the number of populations (e.g., states); n is the number of blocks (e.g., years)</p><p>k=51;n=19</p><p>State=c(&quot;AL&quot;,&quot;AK&quot;,&quot;AZ&quot;,&quot;AR&quot;,&quot;CA&quot;,&quot;CO&quot;,&quot;CT&quot;,&quot;DE&quot;,&quot;DC&quot;,&quot;FL&quot;,&quot;GA&quot;,&quot;HI&quot;,&quot;ID&quot;,&quot;IL&quot;,&quot;IN&quot;,&quot;IA&quot;,</p><p>&quot;KS&quot;,&quot;KY&quot;,&quot;LA&quot;,&quot;ME&quot;,&quot;MD&quot;,&quot;MA&quot;,&quot;MI&quot;,&quot;MN&quot;,&quot;MS&quot;,&quot;MO&quot;,&quot;MT&quot;,&quot;NE&quot;,&quot;NV&quot;,&quot;NH&quot;,&quot;NJ&quot;,&quot;NM&quot;,&quot;NY&quot;,</p><p>&quot;NC&quot;,&quot;ND&quot;,&quot;OH&quot;,&quot;OK&quot;,&quot;OR&quot;,&quot;PA&quot;,&quot;RI&quot;,&quot;SC&quot;,&quot;SD&quot;,&quot;TN&quot;,&quot;TX&quot;,&quot;UT&quot;,&quot;VT&quot;,&quot;VA&quot;,&quot;WA&quot;,&quot;WV&quot;,&quot;WI&quot;,&quot;WY&quot;)</p><p>y1994=c(2.21,2.05,2.33,2.44,1.56,1.74,1.14,1.59,2.00,2.2,1.72,1.54,2.15,1.68,1.59,1.86,1.79,</p><p>1.95,2.25,1.51,1.47,0.94,1.67,1.49,2.77,1.9,2.22,1.75,2.26,1.13,1.26,2.18,1.49,1.99,1.39,</p><p>1.4,1.93,1.68,1.56,0.89,2.27,2.02,2.23,1.79,1.9,1.25,1.38,1.35,2.08,1.42,2.15)</p><p>y1995=c(2.2,2.11,2.61,2.37,1.52,1.84,1.13,1.61,1.67,2.19,1.74,1.64,2.13,1.68,1.49,2.03,1.76,</p><p>2.07,2.31,1.49,1.5,0.92,1.79,1.35,2.94,1.87,2.28,1.61,2.24,1.11,1.27,2.29,1.46,1.9,1.13,</p><p>1.35,1.74,1.91,1.57,1.00,2.28,2.06,2.24,1.76,1.73,1.71,1.29,1.33,2.16,1.45,2.41)</p><p>y1996=c(2.23,1.97,2.36,2.21,1.43,1.71,1.1,1.51,1.59,2.12,1.76,1.84,1.99,1.53,1.49,1.73,1.89,</p><p>1.98,2.37,1.32,1.32,0.83,1.67,1.3,2.65,1.88,2.12,1.8,2.18,1.22,1.31,2.25,1.34,1.89,1.26,</p><p>1.35,1.96,1.73,1.52,0.97,2.34,2.24,2.12,2.02,1.64,1.38,1.23,1.44,1.97,1.44,1.94)</p><p>y1997=c(2.23,1.76,2.19,2.35,1.32,1.62,1.19,1.79,1.8,2.08,1.69,1.65,2.01,1.41,1.36,1.67,1.82,</p><p>1.97,2.44,1.45,1.31,0.87,1.58,1.22,2.73,1.89,2.82,1.77,2.13,1.12,1.23,2.21,1.37,1.81,1.47,</p><p>1.39,2.02,1.62,1.59,1.06,2.18,1.86,2.02,1.77,1.79,1.48,1.4,1.32,2.08,1.33,1.81)</p><p>y1998=c(1.94,1.55,2.17,2.2,1.2,1.6,1.12,1.4,1.63,2.05,1.63,1.5,1.97,1.38,1.42,1.55,1.82,1.91,</p><p>2.3,1.42,1.25,0.78,1.46,1.31,2.77,1.81,2.47,1.79,2.19,1.11,1.15,1.91,1.23,1.87,1.25,1.36,1.8,</p><p>1.61,1.48,0.93,2.34,2.04,1.94,1.74,1.65,1.58,1.29,1.27,1.9,1.26,1.92)</p><p>y1999=c(2.03,1.74,2.18,2.07,1.19,1.54,1.01,1.18,1.18,2.06,1.52,1.21,1.99,1.42,1.46,1.68,1.95,</p><p>1.75,2.28,1.28,1.2,0.8,1.44,1.22,2.66,1.64,2.24,1.64,2.01,1.18,1.11,2.05,1.26,1.71,1.64,1.36,</p><p>1.74,1.19,1.52,1.06,2.41,1.82,2.01,1.67,1.63,1.38,1.19,1.21,2.08,1.31,2.42)</p><p>y2000=c(1.76,2.3,2.11,2.24,1.22,1.63,1.11,1.49,1.37,1.99,1.47,1.55,2.04,1.38,1.25,1.51,1.64,</p><p>1.75,2.3,1.19,1.17,0.82,1.41,1.19,2.67,1.72,2.4,1.53,1.83,1.05,1.08,1.9,1.13,1.74,1.19,1.29,</p><p>1.5,1.33,1.49,0.96,2.34,2.05,1.99,1.72,1.65,1.12,1.24,1.18,2.14,1.4,1.88)</p><p>y2001=c(1.75,1.89,2.12,2.08,1.27,1.73,1.03,1.58,1.81,1.77,1.53,1.61,1.84,1.37,1.27,1.49,1.75,</p><p>1.83,2.2,1.33,1.27,0.9,1.34,1.06,2.18,1.62,2.3,1.36,1.72,1.15,1.08,2.00,1.2,1.67,1.45,1.29,</p><p>1.57,1.42,1.49,1.01,2.27,2.00,1.85,1.73,1.24,1.17,1.27,1.21,1.91,1.33,2.16)</p><p>y2002=c(1.8,1.82,2.18,2.13,1.27,1.71,1.04,1.4,1.33,1.76,1.41,1.34,1.86,1.35,1.09,1.31,1.78,</p><p>1.95,2.09,1.47,1.23,0.86,1.28,1.2,2.43,1.77,2.59,1.64,2.12,1.01,1.1,1.97,1.15,1.7,1.32,1.31,</p><p>1.62,1.26,1.54,1.03,2.23,2.12,1.73,1.73,1.34,0.98,1.18,1.2,2.19,1.37,1.95)</p><p>y2003=c(1.71,1.98,2.07,2.09,1.31,1.48,0.95,1.57,1.87,1.71,1.47,1.43,2.05,1.36,1.15,1.42,1.64,</p><p>1.99,2.13,1.39,1.19,0.86,1.27,1.18,2.33,1.81,2.41,1.54,1.91,0.98,1.05,1.92,1.11,1.66,1.41,1.17,</p><p>1.47,1.46,1.48,1.24,2.01,2.38,1.73,1.71,1.29,0.83,1.23,1.09,1.96,1.42,1.79)</p><p>y2004=c(1.95,2.02,2.01,2.22,1.25,1.45,0.93,1.44,1.15,1.65,1.44,1.46,1.77,1.24,1.3,1.23,1.57,</p><p>2.04,2.08,1.3,1.16,0.87,1.12,1.00,2.28,1.64,2.04,1.32,1.95,1.26,0.99,2.18,1.08,1.64,1.32,1.15,</p><p>1.67,1.28,1.38,0.98,2.11,2.24,1.89,1.6,1.2,1.25,1.17,1.02,2.02,1.31,1.77)</p><p>y2005=c(1.92,1.45,1.97,2.05,1.32,1.26,0.88,1.4,1.29,1.75,1.52,1.39,1.85,1.27,1.31,1.45,1.44,</p><p>2.08,2.14,1.13,1.09,0.8,1.09,0.98,2.32,1.83,2.26,1.43,2.06,1.24,1.01,2.04,1.03,1.53,1.62,</p><p>1.2,1.71,1.38,1.5,1.05,2.21,2.22,1.79,1.5,1.12,0.95,1.18,1.17,1.82,1.36,1.88)</p><p>y2006=c(1.99,1.49,2.07,2.01,1.29,1.1,0.98,1.57,1.02,1.65,1.49,1.58,1.76,1.17,1.27,1.4,1.55,</p><p>1.91,2.17,1.25,1.16,0.78,1.04,0.87,2.2,1.59,2.34,1.39,1.97,0.93,1.02,1.88,1.03,1.53,1.41,</p><p>1.11,1.57,1.35,1.41,0.98,2.08,2.08,1.82,1.48,1.11,1.11,1.19,1.12,1.96,1.22,2.07)</p><p>y2007=c(1.81,1.59,1.7,1.96,1.22,1.14,0.92,1.23,1.22,1.56,1.46,1.33,1.6,1.16,1.23,1.43,1.38,</p><p>1.8,2.19,1.22,1.09,0.79,1.04,0.89,2.04,1.43,2.45,1.32,1.68,0.96,0.95,1.54,0.97,1.62,1.42,</p><p>1.13,1.61,1.31,1.37,0.8,2.11,1.62,1.7,1.42,1.11,0.86,1.25,1.0,2.1,1.27,1.6)</p><p>y2008=c(1.63,1.27,1.52,1.81,1.05,1.15,0.95,1.35,0.94,1.5,1.37,1.04,1.52,0.98,1.11,1.34,1.29,</p><p>1.74,2.03,1.06,1.07,0.67,0.96,0.78,1.79,1.41,2.12,1.09,1.56,1.06,0.8,1.39,0.92,1.4,1.33,1.1,</p><p>1.55,1.24,1.36,0.79,1.86,1.35,1.5,1.48,1.06,1.0,1.0,0.94,1.82,1.05,1.68)</p><p>y2009=c(1.38,1.3,1.31,1.8,0.95,1.01,0.71,1.28,0.8,1.3,1.18,1.09,1.46,0.86,0.9,1.19,1.31,1.67,</p><p>1.84,1.1,0.99,0.62,0.9,0.74,1.73,1.27,2.01,1.15,1.19,0.85,0.8,1.39,0.87,1.28,1.72,0.92,1.57,</p><p>1.11,1.22,1.01,1.82,1.48,1.4,1.35,0.93,0.97,0.94,0.87,1.82,0.96,1.4)</p><p>y2010=c(1.34,1.17,1.27,1.7,0.84,1.96,1.02,1.13,0.67,1.25,1.12,1.13,1.32,0.88,1.0,1.24,1.44,</p><p>1.58,1.59,1.11,0.88,0.64,0.97,0.73,1.61,1.16,1.69,0.98,1.16,0.98,0.76,1.38,0.92,1.29,1.27,</p><p>0.97,1.4,0.94,1.32,0.81,1.65,1.58,1.47,1.29,0.95,0.98,0.9,0.8,1.64,0.96,1.66)</p><p>y2011=c(1.38,1.57,1.39,1.67,0.88,0.96,0.71,1.1,0.76,1.25,1.13,0.99,1.05,0.89,0.98,1.15,1.29,</p><p>1.5,1.46,0.95,0.86,0.68,0.94,0.65,1.62,1.14,1.79,0.95,1.02,0.71,0.86,1.36,0.92,1.19,1.62,</p><p>0.91,1.47,0.99,1.3,0.84,1.7,1.23,1.32,1.29,0.93,0.77,0.94,0.8,1.78,0.99,1.46)</p><p>y2012=c(1.33,1.23,1.37,1.65,0.88,1.01,0.75,1.24,0.42,1.27,1.11,1.25,1.13,0.91,0.99,1.16,1.32,</p><p>1.58,1.54,1.16,0.89,0.62,0.99,0.69,1.51,1.21,1.72,1.1,1.07,0.84,0.79,1.43,0.91,1.23,1.69,</p><p>1.0,1.48,1.01,1.32,0.82,1.76,1.46,1.42,1.43,0.82,1.07,0.96,0.78,1.76,1.04,1.33)</p><p>MV&lt;-data.frame(State,y1994,y1995,y1996,y1997,y1998,y1999,y2000,y2001,y2002,y2003,y2004,</p><p>y2005,y2006,y2007,y2008,y2009,y2010,y2011,y2012)</p><p>MV</p><p>#Tied ranks are resolved at random</p><p>x1&lt;-rank(y1994,ties.method=&quot;random&quot;);x2&lt;-rank(y1995,ties.method=&quot;random&quot;)</p><p>x3&lt;-rank(y1996,ties.method=&quot;random&quot;);x4&lt;-rank(y1997,ties.method=&quot;random&quot;)</p><p>x5&lt;-rank(y1998,ties.method=&quot;random&quot;);x6&lt;-rank(y1999,ties.method=&quot;random&quot;)</p><p>x7&lt;-rank(y2000,ties.method=&quot;random&quot;);x8&lt;-rank(y2001,ties.method=&quot;random&quot;)</p><p>x9&lt;-rank(y2002,ties.method=&quot;random&quot;);x10&lt;-rank(y2003,ties.method=&quot;random&quot;)</p><p>x11&lt;-rank(y2004,ties.method=&quot;random&quot;);x12&lt;-rank(y2005,ties.method=&quot;random&quot;)</p><p>x13&lt;-rank(y2006,ties.method=&quot;random&quot;);x14&lt;-rank(y2007,ties.method=&quot;random&quot;)</p><p>x15&lt;-rank(y2008,ties.method=&quot;random&quot;);x16&lt;-rank(y2009,ties.method=&quot;random&quot;)</p><p>x17&lt;-rank(y2010,ties.method=&quot;random&quot;);x18&lt;-rank(y2011,ties.method=&quot;random&quot;)</p><p>x19&lt;-rank(y2012,ties.method=&quot;random&quot;)</p><p>ra&lt;-data.frame(x1,x2,x3,x4,x5,x6,x7,x8,x9,x10,x11,</p><p>x12,x13,x14,x15,x16,x17,x18,x19)</p><p>#ra</p><p>ram&lt;-as.matrix(ra)</p><p>ram</p><p>RkSum&lt;-rep(0,k)</p><p>for (i in 1:k){RkSum[i]&lt;-sum(ram[i,])}</p><p>#RkSum</p><p>StRk&lt;-data.frame(State,RkSum)</p><p>#StRk</p><p>StRkOrd&lt;-StRk[order(StRk$RkSum),]</p><p>#StRkOrd</p><p>NorSc&lt;-rep(0,51)</p><p>#The following are approximations to exact values given by Harter</p><p>#for (i in 1:k){NorSc[i]&lt;-qnorm(i/(k+1))}</p><p>#NorSc&lt;-round(NorSc,4)</p><p>#The following expected values of normal order stats are taken from</p><p>#&quot;Expected Values of Normal Order Statistics,&quot; by H. Leon Harter (1961)</p><p>#If two states are tied and the rank values are r1 and r2 (r1</p><p>#is assigned to the state that is lower in alphabetical order (using the</p><p>#two letter state abbreviation; r2 is assigned to the other state.</p><p>#Similarly if three are more states are tied in their MVTFRs.</p><p>NorSc&lt;-c(-2.25678,-1.86371,-1.63829,-1.47409,-1.34207,-1.23003,-1.13162,</p><p>-1.04312,-0.96213,-0.88701,-0.81661,-0.75004,-0.68666,-0.62592,-0.56742,</p><p>-0.51080,-0.45578,-0.40211,-0.34957,-0.29799,-0.24719,-0.19702,-0.14735,</p><p>-0.09803,-0.04896,0,0.04896,0.09803,0.14735,0.19702,0.24719,0.29799,</p><p>0.34957,0.40211,0.45578,0.51080,0.56742,0.62592,0.68666,0.75004,0.81661,</p><p>0.88701,0.96213,1.04312,1.13162,1.23003,1.34207,1.47409,1.63829,1.86371,</p><p>2.25678)</p><p>sum(NorSc)</p><p>#NorSc</p><p>df94&lt;-data.frame(State,x1)</p><p>ns94&lt;-rep(0,51)</p><p>for (i in 1:51){ns94[i]&lt;-NorSc[x1[i]]}</p><p>df94&lt;-data.frame(State,x1,ns94)</p><p>#df94</p><p>#</p><p>df95&lt;-data.frame(State,x2)</p><p>ns95&lt;-rep(0,51)</p><p>for (i in 1:51){ns95[i]&lt;-NorSc[x2[i]]}</p><p>df95&lt;-data.frame(State,x2,ns95)</p><p>#df95</p><p>#</p><p>df96&lt;-data.frame(State,x3)</p><p>ns96&lt;-rep(0,51)</p><p>for (i in 1:51){ns96[i]&lt;-NorSc[x3[i]]}</p><p>df96&lt;-data.frame(State,x3,ns96)</p><p>#df96</p><p>#</p><p>df97&lt;-data.frame(State,x4)</p><p>ns97&lt;-rep(0,51)</p><p>for (i in 1:51){ns97[i]&lt;-NorSc[x4[i]]}</p><p>df97&lt;-data.frame(State,x4,ns97)</p><p>#df97</p><p>#</p><p>df98&lt;-data.frame(State,x5)</p><p>ns98&lt;-rep(0,51)</p><p>for (i in 1:51){ns98[i]&lt;-NorSc[x5[i]]}</p><p>df98&lt;-data.frame(State,x5,ns98)</p><p>#df98</p><p>#</p><p>df99&lt;-data.frame(State,x6)</p><p>ns99&lt;-rep(0,51)</p><p>for (i in 1:51){ns99[i]&lt;-NorSc[x6[i]]}</p><p>df99&lt;-data.frame(State,x6,ns99)</p><p>#df99</p><p>#</p><p>df00&lt;-data.frame(State,x7)</p><p>ns00&lt;-rep(0,51)</p><p>for (i in 1:51){ns00[i]&lt;-NorSc[x7[i]]}</p><p>df00&lt;-data.frame(State,x7,ns00)</p><p>#df00</p><p>#</p><p>df01&lt;-data.frame(State,x8)</p><p>ns01&lt;-rep(0,51)</p><p>for (i in 1:51){ns01[i]&lt;-NorSc[x8[i]]}</p><p>df01&lt;-data.frame(State,x8,ns01)</p><p>#df01</p><p>#</p><p>df02&lt;-data.frame(State,x9)</p><p>ns02&lt;-rep(0,51)</p><p>for (i in 1:51){ns02[i]&lt;-NorSc[x9[i]]}</p><p>df02&lt;-data.frame(State,x9,ns02)</p><p>#df02</p><p>#</p><p>df03&lt;-data.frame(State,x10)</p><p>ns03&lt;-rep(0,51)</p><p>for (i in 1:51){ns03[i]&lt;-NorSc[x10[i]]}</p><p>df03&lt;-data.frame(State,x10,ns03)</p><p>#df03</p><p>#</p><p>df04&lt;-data.frame(State,x11)</p><p>ns04&lt;-rep(0,51)</p><p>for (i in 1:51){ns04[i]&lt;-NorSc[x11[i]]}</p><p>df04&lt;-data.frame(State,x11,ns04)</p><p>#df04</p><p>#</p><p>df05&lt;-data.frame(State,x12)</p><p>ns05&lt;-rep(0,51)</p><p>for (i in 1:51){ns05[i]&lt;-NorSc[x12[i]]}</p><p>df05&lt;-data.frame(State,x12,ns05)</p><p>#df05</p><p>#</p><p>df06&lt;-data.frame(State,x13)</p><p>ns06&lt;-rep(0,51)</p><p>for (i in 1:51){ns06[i]&lt;-NorSc[x13[i]]}</p><p>df06&lt;-data.frame(State,x13,ns06)</p><p>#df06</p><p>#</p><p>df07&lt;-data.frame(State,x14)</p><p>ns07&lt;-rep(0,51)</p><p>for (i in 1:51){ns07[i]&lt;-NorSc[x14[i]]}</p><p>df07&lt;-data.frame(State,x14,ns07)</p><p>#df07</p><p>#</p><p>df08&lt;-data.frame(State,x15)</p><p>ns08&lt;-rep(0,51)</p><p>for (i in 1:51){ns08[i]&lt;-NorSc[x15[i]]}</p><p>df08&lt;-data.frame(State,x15,ns08)</p><p>#df08</p><p>#</p><p>df09&lt;-data.frame(State,x16)</p><p>ns09&lt;-rep(0,51)</p><p>for (i in 1:51){ns09[i]&lt;-NorSc[x16[i]]}</p><p>df09&lt;-data.frame(State,x16,ns09)</p><p>#df09</p><p>#</p><p>df10&lt;-data.frame(State,x17)</p><p>ns10&lt;-rep(0,51)</p><p>for (i in 1:51){ns10[i]&lt;-NorSc[x17[i]]}</p><p>df10&lt;-data.frame(State,x17,ns10)</p><p>#df10</p><p>#</p><p>df11&lt;-data.frame(State,x18)</p><p>ns11&lt;-rep(0,51)</p><p>for (i in 1:51){ns11[i]&lt;-NorSc[x18[i]]}</p><p>df11&lt;-data.frame(State,x18,ns11)</p><p>#df11</p><p>#</p><p>df12&lt;-data.frame(State,x19)</p><p>ns12&lt;-rep(0,51)</p><p>for (i in 1:51){ns12[i]&lt;-NorSc[x19[i]]}</p><p>df12&lt;-data.frame(State,x19,ns12)</p><p>#df12</p><p>#</p><p>Ns&lt;-data.frame(ns94,ns95,ns96,ns97,ns98,ns99,ns00,ns01,</p><p>ns02,ns03,ns04,ns05,ns06,ns07,ns08,ns09,ns10,ns11,ns12)</p><p>scm&lt;-as.matrix(Ns)</p><p>#scm</p><p>NsSum&lt;-rep(0,k)</p><p>for (i in 1:k){NsSum[i]&lt;-sum(scm[i,])}</p><p>#NsSum</p><p>StRkNs&lt;-data.frame(State,NsSum)</p><p>#StRkNs</p><p>#</p><p>StNs&lt;-data.frame(State,NsSum)</p><p>StNs</p><p>StNsOrd&lt;-StNs[order(StNs$NsSum),]</p><p>StNsOrd</p><p>StRks&lt;-data.frame(State,RkSum)</p><p>StRks</p><p>StRksOrd&lt;-StRks[order(StRks$RkSum),]</p><p>StRksOrd</p><p>#</p><p>Summary&lt;-data.frame(StRkOrd,StNsOrd)</p><p>Summary</p><p>#check on distribution of MVTFRs for one year, 1994</p><p>hist(y1994,col='red',freq=FALSE,</p><p>main=&quot;Histogram of 1994 MVTFRs\n Normal Density Overlay&quot;)</p><p>low&lt;-min(y1994)-0.1;up&lt;-max(y1994)+0.1</p><p>curve(dnorm(x,mean(y1994),sd(y1994),),from=low,to=up,add=TRUE)</p></sec><sec id="s10"><title>Appendix B</title><p>#Regression of exp51 and seq(2:51)</p><p>#Differences in the Expected value of normal order stats, n=51</p><p>exp51&lt;-c(-2.25678,-1.86371,-1.63829,-1.47409,-1.34207,-1.23003,-1.13162,</p><p>-1.04312,-0.96213,-0.88701,-0.81661,-0.75004,-0.68666,-0.62592,-0.56742,</p><p>-0.51080,-0.45578,-0.40211,-0.34957,-0.29799,-0.24719,-0.19702,-0.14735,</p><p>-0.09803,-0.04896,0,0.04896,0.09803,0.14735,0.19702,0.24719,0.29799,</p><p>0.34957,0.40211,0.45578,0.51080,0.56742,0.62592,0.68666,0.75004,0.81661,</p><p>0.88701,0.96213,1.04312,1.13162,1.23003,1.34207,1.47409,1.63829,1.86371,</p><p>2.25678)</p><p>sum(exp51)</p><p>diff&lt;-rep(0,50)</p><p>for (i in 1:50){</p><p>diff[i]&lt;-exp51[i+1]-exp51[i]</p><p>}</p><p>diff</p><p>x&lt;-seq(1:50)</p><p>#plot(x,diff,main=&quot;Difference in expected values of normal order stats\n k = 51&quot;)</p><p>#1st point is E[X(2)]-E[X(1)], 2nd point is E[X(3)]-E[X(2)], etc.</p><p>xx&lt;-seq(1:51)</p><p>par(mfrow=c(1,2))</p><p>model&lt;-lm(exp51~xx)</p><p>plot(xx,exp51,xlab=&quot;Ranks&quot;,ylab=&quot;Normal Scores&quot;,main=&quot;Normal Scores vs.\n Rank Values, k = 51&quot;,</p><p>cex.main=1,pch=16)</p><p>abline(model,col=&quot;red&quot;,lwd=2)</p><p>legend(&quot;topleft&quot;,&quot;Reg Line\nR^2 = 0.967&quot;,col=&quot;red&quot;,lty=1,cex=0.8,bg=&quot;yellow&quot;)</p><p>plot(x,diff,xlab=&quot;Ranks&quot;,ylab=&quot;Difference&quot;,cex.main=1,</p><p>main=&quot;Diff in Expected Values of\n Normal Order Stats, k = 51&quot;,pch=16)</p><p>abline(v=25.5,col=&quot;red&quot;,lty=2)</p></sec><sec id="s11"><title>Appendix C</title><p>#State Homicide Rates 2005,2014-2020</p><p>#Rank sums for the HRs</p><p>#Applied Mathematics,2022,13,585-601</p><p>#https://www.scirp.org/journal/am</p><p>#k is the number of populations (e.g., states); n is the number of blocks (e.g., years)</p><p>k=50;n=8</p><p>State=c(&quot;AK&quot;,&quot;AL&quot;,&quot;AR&quot;,&quot;AZ&quot;,&quot;CA&quot;,&quot;CO&quot;,&quot;CT&quot;,&quot;DE&quot;,&quot;FL&quot;,&quot;GA&quot;,&quot;HI&quot;,&quot;IA&quot;,&quot;ID&quot;,&quot;IL&quot;,&quot;IN&quot;,”KS”,</p><p>&quot;KY&quot;,&quot;LA&quot;,&quot;MA&quot;,&quot;MD&quot;,&quot;ME&quot;,&quot;MI&quot;,&quot;MN&quot;,&quot;MO&quot;,&quot;MS&quot;,&quot;MT&quot;,&quot;NC&quot;,&quot;ND&quot;,&quot;NE&quot;,&quot;NH&quot;,&quot;NJ&quot;,&quot;NM&quot;,&quot;NV&quot;,</p><p>&quot;NY&quot;,&quot;OH&quot;,&quot;OK&quot;,&quot;OR&quot;,&quot;PA&quot;,&quot;RI&quot;,&quot;SC&quot;,&quot;SD&quot;,&quot;TN&quot;,&quot;TX&quot;,&quot;UT&quot;,&quot;VA&quot;,&quot;VT&quot;,&quot;WA&quot;,&quot;WI&quot;,&quot;WV&quot;,&quot;WY&quot;)</p><p>y2005=c(1.93,2.47,2.30,2.41,2.17,1.71,1.59,2.13,2.02,2.19,1.29,1.21,1.59,2.15,2.03,1.72,</p><p>1.96,2.77,1.51,2.55,1.24,2.17,1.49,2.21,2.41,1.63,2.25,0.00,1.44,0.00,1.92,2.29,2.27,</p><p>1.86,1.99,2.06,1.53,2.09,1.57,2.29,1.53,2.33,2.11,1.42,2.10,0.00,1.67,1.79,1.96,0.00)</p><p>y2014=c(1.86,2.31,2.26,1.90,1.84,1.61,1.53,2.13,2.07,2.13,1.37,1.44,1.42,2.07,2.01,1.67,</p><p>1.86,2.67,1.32,2.14,1.32,2.09,1.29,2.24,2.65,1.53,1.99,0.00,1.63,0.00,1.81,2.15,2.09,</p><p>1.63,1.93,2.13,1.42,1.93,1.44,2.25,1.57,2.11,1.93,1.32,1.76,0.00,1.57,1.55,2.03,1.81)</p><p>y2015=c(2.30,2.53,2.23,1.98,1.90,1.69,1.67,2.24,2.09,2.21,1.37,1.44,1.32,2.17,2.05,1.86,</p><p>2.02,2.74,1.35,2.54,1.24,2.10,1.51,2.47,2.64,1.74,2.06,1.57,1.74,0.00,1.83,2.30,2.14,1.63,</p><p>2.05,2.35,1.63,1.99,1.51,2.46,1.78,2.20,1.99,1.32,1.83,0.00,1.63,1.83,1.83,0.00)</p><p>y2016=c(2.21,2.68,2.38,2.09,1.95,1.79,1.49,2.18,2.15,2.29,1.51,1.51,1.29,2.43,2.25,1.95,</p><p>2.20,2.90,1.35,2.52,0.00,2.14,1.42,2.50,2.71,1.79,2.23,0.00,1.61,0.00,1.84,2.45,2.23,1.67,</p><p>2.11,2.36,1.61,2.05,1.40,2.41,1.86,2.39,2.05,1.44,1.98,0.00,1.53,1.87,2.09,0.00)</p><p>y2017=c(2.57,2.78,2.49,2.13,1.92,1.84,1.59,2.17,2.10,2.29,1.44,1.63,1.55,2.41,2.20,2.11,</p><p>2.21,2.91,1.47,2.53,0.00,2.09,1.37,2.64,2.76,1.79,2.17,0.00,1.49,0.00,1.76,2.35,2.25,1.55,</p><p>2.24,2.35,1.57,2.13,0.00,2.44,1.78,2.39,2.02,1.47,1.96,0.00,1.67,1.69,2.11,0.00)</p><p>y2018=c(2.24,2.72,2.42,2.06,1.87,1.86,1.51,2.15,2.13,2.26,1.57,1.49,1.40,2.30,2.23,2.03,</p><p>2.06,2.82,1.40,2.44,0.00,2.11,1.40,2.65,2.82,1.78,2.10,1.44,1.29,1.27,1.69,2.59,2.26,1.59,</p><p>2.15,2.18,1.44,2.10,0.00,2.53,1.72,2.43,1.96,1.37,1.92,0.00,1.69,1.72,2.02,1.76)</p><p>y2019=c(2.59,2.77,2.45,2.03,1.83,1.79,1.57,2.06,2.14,2.31,1.44,1.49,1.24,2.31,2.20,1.89,</p><p>2.03,2.93,1.40,2.51,1.27,2.11,1.51,2.59,2.99,1.69,2.18,1.57,1.57,1.51,1.63,2.68,1.98,1.59,</p><p>2.13,2.39,1.55,2.06,1.44,2.61,1.67,2.43,2.03,1.47,1.95,0.00,1.59,1.78,2.01,1.81)</p><p>y2020=c(2.21,2.89,2.79,2.24,2.06,2.02,1.84,2.50,2.27,2.56,1.61,1.67,1.44,2.63,1.48,2.18,</p><p>2.46,3.31,1.49,2.65,1.21,2.38,1.67,2.87,3.35,2.13,2.36,1.81,1.76,0.00,1.79,2.59,2.21,1.86,</p><p>2.42,2.41,1.71,2.35,1.55,2.76,2.11,2.66,2.25,1.53,2.10,0.00,1.78,2.06,2.18,1.89)</p><p>HR&lt;-data.frame(State,y2005,y2014,y2015,y2016,y2017,y2018,y2019,y2020)</p><p>HR</p><p>#Tied ranks are resolved at random</p><p>x1&lt;-rank(y2005,ties.method=&quot;random&quot;);x2&lt;-rank(y2014,ties.method=&quot;random&quot;)</p><p>x3&lt;-rank(y2015,ties.method=&quot;random&quot;);x4&lt;-rank(y2016,ties.method=&quot;random&quot;)</p><p>x5&lt;-rank(y2017,ties.method=&quot;random&quot;);x6&lt;-rank(y2018,ties.method=&quot;random&quot;)</p><p>x7&lt;-rank(y2019,ties.method=&quot;random&quot;);x8&lt;-rank(y2020,ties.method=&quot;random&quot;)</p><p>ra&lt;-data.frame(x1,x2,x3,x4,x5,x6,x7,x8)</p><p>ram&lt;-as.matrix(ra)</p><p>#ram</p><p>RkSum&lt;-rep(0,k)</p><p>for (i in 1:k){RkSum[i]&lt;-sum(ram[i,])}</p><p>#RkSum</p><p>StRk&lt;-data.frame(State,RkSum)</p><p>#StRk</p><p>StRkOrd&lt;-StRk[order(StRk$RkSum),]</p><p>#StRkOrd</p><p>#NorSc taken from Harter, Biometrika (1961)</p><p>NorSc&lt;-c(-2.24907,-1.85487,-1.62863,-1.46374,-1.33109,</p><p>-1.21846,-1.11948,-1.03042,-0.94887,-0.87321,-0.80225,</p><p>-0.73513,-0.67117,-0.60986,-0.55077,-0.49354,-0.43789,</p><p>-0.38357,-0.33036,-0.27807,-0.22653,-0.17559,-0.12511,</p><p>-0.07494,-0.02496,0.02496,0.07494,0.12511,0.17559,</p><p>0.22653,0.27807,0.33036,0.38357,0.43789,0.49354,</p><p>0.55077,0.60986,0.67117,0.73513,0.80225,0.87321,</p><p>0.94887,1.03042,1.11948,1.21846,1.33109,1.46374,</p><p>1.62863,1.85487,2.24907)</p><p>#Approx to exact NorSc given above</p><p>#NorSc&lt;-rep(0,k)</p><p>#for (i in 1:k){NorSc[i]&lt;-qnorm(i/(k+1))}</p><p>#NorSc&lt;-round(NorSc,4)</p><p>#NorSc</p><p>df05&lt;-data.frame(State,x1)</p><p>df05&lt;-df05[order(df05$x1,decreasing=FALSE),]</p><p>df05&lt;-cbind(df05,NorSc)</p><p>#df05</p><p>df05&lt;-df05[order(df05$State,decreasing=FALSE),]</p><p>#df05</p><p>NS05&lt;-df05$NorSc</p><p>#NS05</p><p>df14&lt;-data.frame(State,x2)</p><p>df14&lt;-df14[order(df14$x2,decreasing=FALSE),]</p><p>df14&lt;-cbind(df14,NorSc)</p><p>#df14</p><p>df14&lt;-df14[order(df14$State,decreasing=FALSE),]</p><p>#df14</p><p>NS14&lt;-df14$NorSc</p><p>#NS14</p><p>df15&lt;-data.frame(State,x3)</p><p>df15&lt;-df15[order(df15$x3,decreasing=FALSE),]</p><p>df15&lt;-cbind(df15,NorSc)</p><p>#df15</p><p>df15&lt;-df15[order(df15$State,decreasing=FALSE),]</p><p>#df15</p><p>NS15&lt;-df15$NorSc</p><p>#NS15</p><p>df16&lt;-data.frame(State,x4)</p><p>df16&lt;-df16[order(df16$x4,decreasing=FALSE),]</p><p>df16&lt;-cbind(df16,NorSc)</p><p>#df16</p><p>df16&lt;-df16[order(df16$State,decreasing=FALSE),]</p><p>#df16</p><p>NS16&lt;-df16$NorSc</p><p>#NS16</p><p>df17&lt;-data.frame(State,x5)</p><p>df17&lt;-df17[order(df17$x5,decreasing=FALSE),]</p><p>df17&lt;-cbind(df17,NorSc)</p><p>#df17</p><p>df17&lt;-df17[order(df17$State,decreasing=FALSE),]</p><p>#df17</p><p>NS17&lt;-df17$NorSc</p><p>#NS17</p><p>df18&lt;-data.frame(State,x6)</p><p>df18&lt;-df18[order(df18$x6,decreasing=FALSE),]</p><p>df18&lt;-cbind(df18,NorSc)</p><p>#df18</p><p>df18&lt;-df18[order(df18$State,decreasing=FALSE),]</p><p>#df18</p><p>NS18&lt;-df18$NorSc</p><p>#NS18</p><p>df19&lt;-data.frame(State,x7)</p><p>df19&lt;-df19[order(df19$x7,decreasing=FALSE),]</p><p>df19&lt;-cbind(df19,NorSc)</p><p>#df19</p><p>df19&lt;-df19[order(df19$State,decreasing=FALSE),]</p><p>#df19</p><p>NS19&lt;-df19$NorSc</p><p>#NS19</p><p>df20&lt;-data.frame(State,x8)</p><p>df20&lt;-df20[order(df20$x8,decreasing=FALSE),]</p><p>df20&lt;-cbind(df20,NorSc)</p><p>#df20</p><p>df20&lt;-df20[order(df20$State,decreasing=FALSE),]</p><p>#df20</p><p>NS20&lt;-df20$NorSc</p><p>#NS20</p><p>Ns&lt;-data.frame(NS05,NS14,NS15,NS16,NS17,NS18,NS19,NS20)</p><p>scm&lt;-as.matrix(Ns)</p><p>#scm</p><p>NsSum&lt;-rep(0,k)</p><p>for (i in 1:k){NsSum[i]&lt;-sum(scm[i,])}</p><p>#NsSum</p><p>StRkNs&lt;-data.frame(State,RkSum,NsSum)</p><p>#StRkNs</p><p>#</p><p>StNs&lt;-data.frame(State,NsSum)</p><p>StNsOrd&lt;-StNs[order(StNs$NsSum),]</p><p>#StNsOrd</p><p>Summary&lt;-data.frame(StRkOrd,StNsOrd)</p><p>Summary</p></sec><sec id="s12"><title>Appendix D (b<sub>1</sub> and b<sub>3</sub>)</title><p>#Nonparametric Block Design Selection Procedure Based on Ranks</p><p>#k=no. of population;n=no. of blocks;w=no. of simulations</p><p>#P=min Prob of Correct Selection</p><p>k&lt;-50;n&lt;-8;w&lt;-50000;P=0.90</p><p>#calculate quantiles of max(T)-Ti for iid populations</p><p>rnk&lt;-seq(1:k);T&lt;-rep(0,k);U&lt;-rep(0,k);Q&lt;-rep(0,w);b1&lt;-0;</p><p>V&lt;-rep(0,k);C&lt;-rep(0,k);W&lt;-rep(0,k)</p><p>for (h in 1:w){</p><p>M&lt;-matrix(0,nrow=n,ncol=k)</p><p>for (j in 1:n){M[j,]&lt;-sample(rnk,size=k,replace=FALSE)}</p><p>M</p><p>for (i in 1:k){T[i]&lt;-sum(M[,i])}</p><p>T</p><p>for (j in 1:k){U[j]&lt;-max(T)-T[j]}</p><p>U</p><p>Q[h]&lt;-U[k]</p><p>}</p><p>message(&quot;k = &quot;,k,&quot;, n = &quot;,n,&quot;, w = &quot;,w,&quot;, P = &quot;,P)</p><p>quan&lt;-c(0.50,0.75,0.90,0.95,0.99)</p><p>Qile&lt;-quantile(Q,quan)</p><p>Qile</p><p>Qile&lt;-unname(Qile)</p><p>Qile</p><p>#table(Q)</p><p>if (P==0.50){b1&lt;-Qile[<xref ref-type="bibr" rid="scirp.133525-ref1">1</xref>]}</p><p>if (P==0.75){b1&lt;-Qile[<xref ref-type="bibr" rid="scirp.133525-ref2">2</xref>]}</p><p>if (P==0.90){b1&lt;-Qile[<xref ref-type="bibr" rid="scirp.133525-ref3">3</xref>]}</p><p>if (P==0.95){b1&lt;-Qile[<xref ref-type="bibr" rid="scirp.133525-ref4">4</xref>]}</p><p>if (P==0.99){b1&lt;-Qile[<xref ref-type="bibr" rid="scirp.133525-ref5">5</xref>]}</p><p>c(P,b1)</p><p>#Select population i iff Ti&gt;=max(T)-b1</p><p>a&lt;-rep(1,k);b&lt;-rep(5,k)</p><p>V&lt;-rep(0,k)</p><p>for (h in 1:w){</p><p>C&lt;-rep(0,k);x&lt;-rep(0,k);rk&lt;-rep(0,k);T&lt;-rep(0,k)</p><p>M&lt;-matrix(0,nrow=n,ncol=k);U&lt;-rep(0,k)</p><p>for (j in 1:n){</p><p>for (i in 1:k){x[i]&lt;-runif(1,a[i],b[i])}</p><p>rk&lt;-rank(x)</p><p>M[j,]&lt;-rk</p><p>}</p><p>M</p><p>for (i in 1:k){T[i]&lt;-sum(M[,i])}</p><p>T</p><p>for (i in 1:k){U[i]&lt;-max(T)-T[i]</p><p>if (U[i]&lt;=b1){C[i]&lt;-1}</p><p>else {C[i]&lt;-0}</p><p>}</p><p>V&lt;-V+C</p><p>}</p><p>message(&quot;The probabilities of population selections are&quot;)</p><p>V/w</p><p>ESS&lt;-sum(V)/w</p><p>message(&quot;The expected subset size is &quot;,round(ESS,3))</p></sec><sec id="s13"><title>Appendix E (b<sub>2</sub> and b<sub>4</sub>)</title><p>#Nonparametric Block Design Selection Procedure Based on Ranks</p><p>#k=no. of population;n=no. of blocks;w=no. of simulations</p><p>#P=min Prob of Correct Selection</p><p>k&lt;-50;n&lt;-8;w&lt;-50000;P=0.90</p><p>#calculate quantiles of Tk for iid populations</p><p>rnk&lt;-seq(1:k);T&lt;-rep(0,k);U&lt;-rep(0,k);Q&lt;-rep(0,w);b1&lt;-0;</p><p>V&lt;-rep(0,k);C&lt;-rep(0,k);W&lt;-rep(0,k)</p><p>for (h in 1:w){</p><p>M&lt;-matrix(0,nrow=n,ncol=k)</p><p>for (j in 1:n){M[j,]&lt;-sample(rnk,size=k,replace=FALSE)}</p><p>for (i in 1:k){T[i]&lt;-sum(M[,i])}</p><p>Q[h]&lt;-T[k]</p><p>}</p><p>message(&quot;k = &quot;,k,&quot;, n = &quot;,n,&quot;, w = &quot;,w,&quot;, P = &quot;,P)</p><p>quan&lt;-c(0.01,0.05,0.10,0.25,0.50)</p><p>Qile&lt;-quantile(Q,quan)</p><p>Qile</p><p>Qile&lt;-unname(Qile)</p><p>Qile</p><p>#table(Q)</p><p>if (P==0.50){b2&lt;-Qile[<xref ref-type="bibr" rid="scirp.133525-ref5">5</xref>]}</p><p>if (P==0.75){b2&lt;-Qile[<xref ref-type="bibr" rid="scirp.133525-ref4">4</xref>]}</p><p>if (P==0.90){b2&lt;-Qile[<xref ref-type="bibr" rid="scirp.133525-ref3">3</xref>]}</p><p>if (P==0.95){b2&lt;-Qile[<xref ref-type="bibr" rid="scirp.133525-ref2">2</xref>]}</p><p>if (P==0.99){b2&lt;-Qile[<xref ref-type="bibr" rid="scirp.133525-ref1">1</xref>]}</p><p>b4&lt;-(n*(k+1))-b2</p><p>message(&quot;P = &quot;,P,&quot;, b2 = &quot;,b2,&quot;, b4 = &quot;,b4)</p><p>#Select population i iff Ti&gt;b2</p><p>a&lt;-rep(1,k);b&lt;-rep(5,k)</p><p>V&lt;-rep(0,k)</p><p>for (h in 1:w){</p><p>C&lt;-rep(0,k);x&lt;-rep(0,k);rk&lt;-rep(0,k);T&lt;-rep(0,k)</p><p>M&lt;-matrix(0,nrow=n,ncol=k);U&lt;-rep(0,k)</p><p>for (j in 1:n){</p><p>for (i in 1:k){x[i]&lt;-runif(1,a[i],b[i])}</p><p>rk&lt;-rank(x)</p><p>M[j,]&lt;-rk</p><p>}</p><p>for (i in 1:k){T[i]&lt;-sum(M[,i])}</p><p>for (i in 1:k){U[i]&lt;-T[i]</p><p>if (U[i]&gt;b2){C[i]&lt;-1}</p><p>else {C[i]&lt;-0}</p><p>}</p><p>V&lt;-V+C</p><p>}</p><p>message(&quot;The probabilities of population selections are&quot;)</p><p>V/w</p><p>ESS&lt;-sum(V)/w</p><p>message(&quot;The expected subset size is &quot;,round(ESS,3))</p></sec><sec id="s14"><title>Appendix F (d1 and d3 Values)</title><p>#Nonparametric Block Design Selection Procedure Based on Normal Scores</p><p>#k=no. of population;n=no. of blocks;w=no. of simulations</p><p>#P=min Prob of Correct Selection</p><p>k&lt;-50;n&lt;-8;w&lt;-50000;P=0.90</p><p>#calculate quantiles of max(T)-Ti for iid populations</p><p>rnk&lt;-seq(1:k);T&lt;-rep(0,k);U&lt;-rep(0,k);Q&lt;-rep(0,w);b1&lt;-0;</p><p>V&lt;-rep(0,k);C&lt;-rep(0,k);W&lt;-rep(0,k);nsc&lt;-rep(0,k)</p><p>#For approx expected values of normal order statistics use:</p><p>#for (i in 1:k){nsc[i]&lt;-qnorm(i/(k+1))}</p><p>#for exact expected values of normal order stats read in:</p><p>#nsc taken from Harter, Biometrika (1961)</p><p>nsc&lt;-c(-2.24907,-1.85487,-1.62863,-1.46374,-1.33109,</p><p>-1.21846,-1.11948,-1.03042,-0.94887,-0.87321,-0.80225,</p><p>-0.73513,-0.67117,-0.60986,-0.55077,-0.49354,-0.43789,</p><p>-0.38357,-0.33036,-0.27807,-0.22653,-0.17559,-0.12511,</p><p>-0.07494,-0.02496,0.02496,0.07494,0.12511,0.17559,</p><p>0.22653,0.27807,0.33036,0.38357,0.43789,0.49354,</p><p>0.55077,0.60986,0.67117,0.73513,0.80225,0.87321,</p><p>0.94887,1.03042,1.11948,1.21846,1.33109,1.46374,</p><p>1.62863,1.85487,2.24907)</p><p>for (h in 1:w){</p><p>M&lt;-matrix(0,nrow=n,ncol=k)</p><p>for (j in 1:n){M[j,]&lt;-sample(nsc,size=k,replace=FALSE)}</p><p>M</p><p>for (i in 1:k){T[i]&lt;-sum(M[,i])}</p><p>T</p><p>for (j in 1:k){U[j]&lt;-max(T)-T[j]}</p><p>U</p><p>Q[h]&lt;-U[k]</p><p>}</p><p>message(&quot;k = &quot;,k,&quot;, n = &quot;,n,&quot;, w = &quot;,w,&quot;, P = &quot;,P)</p><p>quan&lt;-c(0.50,0.75,0.90,0.95,0.99)</p><p>Qile&lt;-quantile(Q,quan)</p><p>Qile</p><p>Qile&lt;-unname(Qile)</p><p>Qile</p><p>#table(Q)</p><p>if (P==0.50){b1&lt;-Qile[<xref ref-type="bibr" rid="scirp.133525-ref1">1</xref>]}</p><p>if (P==0.75){b1&lt;-Qile[<xref ref-type="bibr" rid="scirp.133525-ref2">2</xref>]}</p><p>if (P==0.90){b1&lt;-Qile[<xref ref-type="bibr" rid="scirp.133525-ref3">3</xref>]}</p><p>if (P==0.95){b1&lt;-Qile[<xref ref-type="bibr" rid="scirp.133525-ref4">4</xref>]}</p><p>if (P==0.99){b1&lt;-Qile[<xref ref-type="bibr" rid="scirp.133525-ref5">5</xref>]}</p><p>c(P,b1)</p><p>#Select population i iff Ti&gt;=max(T)-b1</p><p>a&lt;-rep(1,k);b&lt;-rep(2,k)</p><p>V&lt;-rep(0,k)</p><p>for (h in 1:w){</p><p>C&lt;-rep(0,k);W&lt;-rep(0,k);x&lt;-rep(0,k);rk&lt;-rep(0,k);S&lt;-rep(0,k)</p><p>ns&lt;-rep(0,k)</p><p>M&lt;-matrix(0,nrow=n,ncol=k)</p><p>for (j in 1:n){</p><p>for (i in 1:k){x[i]&lt;-runif(1,a[i],b[i])}</p><p>rk&lt;-rank(x)</p><p>for (i in 1:k){ns[i]&lt;-nsc[rk[i]]}</p><p>M[j,]&lt;-ns</p><p>}</p><p>M</p><p>for (i in 1:k){S[i]&lt;-sum(M[,i])}</p><p>S</p><p>for (i in 1:k){W[i]&lt;-max(S)-S[i]</p><p>if (W[i]&lt;=b1){C[i]&lt;-1}</p><p>else {C[i]&lt;-0}</p><p>}</p><p>V&lt;-V+C</p><p>}</p><p>message(&quot;The probabilities of population selections are&quot;)</p><p>V/w</p><p>ESS&lt;-sum(V)/w</p><p>message(&quot;The expected subset size is &quot;,round(ESS,3))</p></sec><sec id="s15"><title>Appendix G (d<sub>2</sub> and d<sub>4</sub> Values)</title><p>#Nonparametric Block Design Selection Procedure Based on Normal Scores</p><p>#k=no. of population;n=no. of blocks;w=no. of simulations</p><p>#P=min Prob of Correct Selection</p><p>k&lt;-50;n&lt;-8;w&lt;-50000;P=0.90</p><p>#calculate quantiles of max(T)-Ti for iid populations</p><p>rnk&lt;-seq(1:k);T&lt;-rep(0,k);U&lt;-rep(0,k);Q&lt;-rep(0,w);b1&lt;-0;</p><p>V&lt;-rep(0,k);C&lt;-rep(0,k);W&lt;-rep(0,k);nsc&lt;-rep(0,k)</p><p>#For approx expected values of normal order statistics use:</p><p>#for (i in 1:k){nsc[i]&lt;-qnorm(i/(k+1))}</p><p>#for exact expected values of normal order stats read in:</p><p>#nsc taken from Harter, Biometrika (1961)</p><p>nsc&lt;-c(-2.24907,-1.85487,-1.62863,-1.46374,-1.33109,</p><p>-1.21846,-1.11948,-1.03042,-0.94887,-0.87321,-0.80225,</p><p>-0.73513,-0.67117,-0.60986,-0.55077,-0.49354,-0.43789,</p><p>-0.38357,-0.33036,-0.27807,-0.22653,-0.17559,-0.12511,</p><p>-0.07494,-0.02496,0.02496,0.07494,0.12511,0.17559,</p><p>0.22653,0.27807,0.33036,0.38357,0.43789,0.49354,</p><p>0.55077,0.60986,0.67117,0.73513,0.80225,0.87321,</p><p>0.94887,1.03042,1.11948,1.21846,1.33109,1.46374,</p><p>1.62863,1.85487,2.24907)</p><p>for (h in 1:w){</p><p>M&lt;-matrix(0,nrow=n,ncol=k)</p><p>for (j in 1:n){M[j,]&lt;-sample(nsc,size=k,replace=FALSE)}</p><p>for (i in 1:k){T[i]&lt;-sum(M[,i])}</p><p>Q[h]&lt;-T[k]</p><p>}</p><p>message(&quot;k = &quot;,k,&quot;, n = &quot;,n,&quot;, w = &quot;,w,&quot;, P = &quot;,P)</p><p>quan&lt;-c(0.01,0.05,0.10,0.25,0.50)</p><p>Qile&lt;-quantile(Q,quan)</p><p>Qile</p><p>Qile&lt;-unname(Qile)</p><p>Qile</p><p>#table(Q)</p><p>if (P==0.50){d2&lt;-Qile[<xref ref-type="bibr" rid="scirp.133525-ref5">5</xref>]}</p><p>if (P==0.75){d2&lt;-Qile[<xref ref-type="bibr" rid="scirp.133525-ref4">4</xref>]}</p><p>if (P==0.90){d2&lt;-Qile[<xref ref-type="bibr" rid="scirp.133525-ref3">3</xref>]}</p><p>if (P==0.95){d2&lt;-Qile[<xref ref-type="bibr" rid="scirp.133525-ref2">2</xref>]}</p><p>if (P==0.99){d2&lt;-Qile[<xref ref-type="bibr" rid="scirp.133525-ref1">1</xref>]}</p><p>d4&lt;--d2</p><p>c(P,d2)</p><p>message(&quot;P = &quot;,P,&quot;, d2 = &quot;,d2,&quot;, d4 = &quot;,d4)</p><p>#Select population i iff Ti&gt;d2</p><p>a&lt;-rep(1,k);b&lt;-rep(5,k)</p><p>V&lt;-rep(0,k)</p><p>for (h in 1:w){</p><p>C&lt;-rep(0,k);W&lt;-rep(0,k);x&lt;-rep(0,k);rk&lt;-rep(0,k);S&lt;-rep(0,k)</p><p>ns&lt;-rep(0,k)</p><p>M&lt;-matrix(0,nrow=n,ncol=k)</p><p>for (j in 1:n){</p><p>for (i in 1:k){x[i]&lt;-runif(1,a[i],b[i])}</p><p>rk&lt;-rank(x)</p><p>for (i in 1:k){ns[i]&lt;-nsc[rk[i]]}</p><p>M[j,]&lt;-ns</p><p>}</p><p>for (i in 1:k){S[i]&lt;-sum(M[,i])}</p><p>for (i in 1:k){W[i]&lt;-S[i]</p><p>if (W[i]&gt;d2){C[i]&lt;-1}</p><p>else {C[i]&lt;-0}</p><p>}</p><p>V&lt;-V+C</p><p>}</p><p>message(&quot;The probabilities of population selections are&quot;)</p><p>V/w</p><p>ESS&lt;-sum(V)/w</p><p>message(&quot;The expected subset size is &quot;,round(ESS,3))</p></sec><sec id="s16"><title>Appendix H</title><p>#k = 7, n = 2</p><p>#generates all the permutations of 1:7</p><p>k&lt;-7;n=2</p><p>f&lt;-factorial(k)</p><p>f2&lt;-f^2</p><p>D&lt;-matrix(0,nrow=f,ncol=k)</p><p>E&lt;-matrix(0,nrow=f,ncol=k)</p><p>C&lt;-matrix(0,nrow=f2,ncol=k)</p><p>F&lt;-matrix(0,nrow=f2,ncol=k)</p><p>permutations &lt;- function(n){</p><p>if(n==1){</p><p>return(matrix(1))</p><p>} else {</p><p>sp &lt;- permutations(n-1)</p><p>p &lt;- nrow(sp)</p><p>A &lt;- matrix(nrow=n*p,ncol=n)</p><p>for(i in 1:n){</p><p>A[(i-1)*p+1:p,] &lt;- cbind(i,sp+(sp&gt;=i))</p><p>}</p><p>return(A)</p><p>}</p><p>}</p><p>D&lt;-permutations(k)</p><p>dim(D)</p><p>head(D,5)</p><p>tail(D,5)</p><p>a&lt;-c(rep(0,f));b&lt;-c(rep(0,f))</p><p>for (i in 1:f){a[i]&lt;-(i-1)*f+1</p><p>b[i]&lt;-i*f</p><p>}</p><p>#for (i in a[<xref ref-type="bibr" rid="scirp.133525-ref1">1</xref>]:b[<xref ref-type="bibr" rid="scirp.133525-ref1">1</xref>]){C[i,]&lt;-D[1,]+D[i,]}</p><p>#for (i in a[<xref ref-type="bibr" rid="scirp.133525-ref2">2</xref>]:b[<xref ref-type="bibr" rid="scirp.133525-ref2">2</xref>]){C[i,]&lt;-D[2,]+D[i-f,]}</p><p>#for (i in a[<xref ref-type="bibr" rid="scirp.133525-ref3">3</xref>]:b[<xref ref-type="bibr" rid="scirp.133525-ref3">3</xref>]){C[i,]&lt;-D[3,]+D[i-2*f,]}</p><p>#for (i in a[<xref ref-type="bibr" rid="scirp.133525-ref4">4</xref>]:b[<xref ref-type="bibr" rid="scirp.133525-ref4">4</xref>]){C[i,]&lt;-D[4,]+D[i-3*f,]}</p><p>#for (i in a[<xref ref-type="bibr" rid="scirp.133525-ref5">5</xref>]:b[<xref ref-type="bibr" rid="scirp.133525-ref5">5</xref>]){C[i,]&lt;-D[5,]+D[i-4*f,]}</p><p>####</p><p>#for (i in a[f]:b[f]){C[i,]&lt;-D[f,]+D[i-(f-1)*f,]}</p><p>####</p><p>for (i in 1:f){</p><p>for (j in 1:f){</p><p>C[(i-1)*f+j,]&lt;-D[i,]+D[j,]</p><p>}</p><p>}</p><p>dim(C)</p><p>head(C,5)</p><p>tail(C,5)</p><p>S&lt;-c(rep(0,f2))</p><p>for(i in 1:f2){S[i]&lt;-max(C[i,])-C[i,1]}</p><p>head(S,5)</p><p>tail(S,5)</p><p>table(S)</p><p>df&lt;-data.frame(table(S))</p><p>df</p><p>Pr&lt;-df$Freq/f2</p><p>Pr&lt;-round(Pr,5)</p><p>CDF&lt;-cumsum(Pr)</p><p>df1&lt;-data.frame(df,Pr,CDF)</p><p>df1</p><p>###########################</p><p>v3&lt;-c(rep(0,f2))</p><p>for(i in 1:f2){v3[i]&lt;-max(C[i,])-6}</p><p>message(&quot;max(Ti)-d for k = 7, n = 2, d = 6 and</p><p>P* = 0.74904&quot;)</p><p>head(v3,5)</p><p>tail(v3,5)</p><p>v2&lt;-c(rep(0,f2))</p><p>for(i in 1:f2){v2[i]&lt;-max(C[i,])-8}</p><p>message(&quot;max(Ti)-d for k = 7, n = 2, d = 8 and</p><p>P* = 0.906&quot;)</p><p>head(v2,5)</p><p>tail(v2,5)</p><p>K&lt;-c(rep(0,f2))</p><p>for (i in 1:f2){</p><p>if (C[i,1]&gt;=v3[i]){K[i]&lt;-1}</p><p>if (C[i,2]&gt;=v3[i]){K[i]&lt;-K[i]+1}</p><p>if (C[i,3]&gt;=v3[i]){K[i]&lt;-K[i]+1}</p><p>if (C[i,4]&gt;=v3[i]){K[i]&lt;-K[i]+1}</p><p>if (C[i,5]&gt;=v3[i]){K[i]&lt;-K[i]+1}</p><p>if (C[i,6]&gt;=v3[i]){K[i]&lt;-K[i]+1}</p><p>if (C[i,7]&gt;=v3[i]){K[i]&lt;-K[i]+1}</p><p>}</p><p>length(K)</p><p>message(&quot;number of pops chosen with k = 7, n = 2,</p><p>d = 6, and P* = 0.74904 for each of the &quot;,f2,&quot; rank sums&quot;)</p><p>head(K,5)</p><p>tail(K,5)</p><p>L&lt;-c(rep(0,f2))</p><p>for (i in 1:f2){</p><p>if (C[i,1]&gt;=v2[i]){L[i]&lt;-1}</p><p>if (C[i,2]&gt;=v2[i]){L[i]&lt;-L[i]+1}</p><p>if (C[i,3]&gt;=v2[i]){L[i]&lt;-L[i]+1}</p><p>if (C[i,4]&gt;=v2[i]){L[i]&lt;-L[i]+1}</p><p>if (C[i,5]&gt;=v2[i]){L[i]&lt;-L[i]+1}</p><p>if (C[i,6]&gt;=v2[i]){L[i]&lt;-L[i]+1}</p><p>if (C[i,7]&gt;=v2[i]){L[i]&lt;-L[i]+1}</p><p>}</p><p>length(L)</p><p>message(&quot;number of pops chosen with k = 7, n = 2,</p><p>d = 8, and P* = 0.906 for each of the &quot;,f2,&quot; rank sums&quot;)</p><p>head(L,5)</p><p>tail(L,5)</p><p>#####################################################</p><p>#Now replace ranks by normal scores (k=7) given by ns</p><p>ns&lt;-c(-1.35218,-0.75737,-0.35271,0,0.35271,0.75737,1.35218)</p><p>sum(ns)</p><p>E&lt;-matrix(0,nrow=f,ncol=k)</p><p>for (i in 1:f){</p><p>for (j in 1:k){</p><p>for (m in 1:k){</p><p>if (D[i,j]==m){E[i,j]&lt;-ns[m]}</p><p>}</p><p>}</p><p>}</p><p>dim(E)</p><p>for (i in 1:f){</p><p>for (j in 1:f){</p><p>F[(i-1)*f+j,]&lt;-E[i,]+E[j,]</p><p>}</p><p>}</p><p>dim(F)</p><p>head(F,5)</p><p>tail(F,5)</p><p>U&lt;-c(rep(0,f2))</p><p>for (i in 1:f2){U[i]&lt;-max(F[i,])-F[i,1]}</p><p>U&lt;-round(U,5)</p><p>length(U)</p><p>head(U,5)</p><p>tail(U,5)</p><p>table(U)</p><p>df3&lt;-data.frame(table(U))</p><p>Pro&lt;-df3$Freq/f2</p><p>Pro&lt;-round(Pro,5)</p><p>CDF1&lt;-cumsum(Pro)</p><p>df4&lt;-data.frame(df3,Pro,CDF1)</p><p>df4[40:length(Pro),]</p><p>################################</p><p>v5&lt;-c(rep(0,f2))</p><p>for (i in 1:f2){v5[i]&lt;-max(F[i,])-2.70436}</p><p>message(&quot;max(Si)-d for k = 7, n = 2, d = 2.70436</p><p>and P* = 0.75324&quot;)</p><p>head(v5,5)</p><p>tail(v5,5)</p><p>v4&lt;-c(rep(0,f2))</p><p>for (i in 1:f2){v4[i]&lt;-max(F[i,])-3.62429}</p><p>message(&quot;max(Si)-d for k = 7, n = 2, d = 3.62429</p><p>and P* = 0.90840&quot;)</p><p>head(v4,5)</p><p>tail(v4,5)</p><p>K1&lt;-c(rep(0,f2))</p><p>for (i in 1:f2){</p><p>if (F[i,1]&gt;=v5[i]){K1[i]&lt;-1}</p><p>if (F[i,2]&gt;=v5[i]){K1[i]&lt;-K1[i]+1}</p><p>if (F[i,3]&gt;=v5[i]){K1[i]&lt;-K1[i]+1}</p><p>if (F[i,4]&gt;=v5[i]){K1[i]&lt;-K1[i]+1}</p><p>if (F[i,5]&gt;=v5[i]){K1[i]&lt;-K1[i]+1}</p><p>if (F[i,6]&gt;=v5[i]){K1[i]&lt;-K1[i]+1}</p><p>if (F[i,7]&gt;=v5[i]){K1[i]&lt;-K1[i]+1}</p><p>}</p><p>length(K1)</p><p>message(&quot;number of pops chosen with k = 7, n = 2,</p><p>d = 6, and P* = 0.74904 for each of the &quot;,f2,&quot; norm scores&quot;)</p><p>head(K1,5)</p><p>tail(K1,5)</p><p>L1&lt;-c(rep(0,f2))</p><p>for (i in 1:f2){</p><p>if (F[i,1]&gt;=v4[i]){L1[i]&lt;-1}</p><p>if (F[i,2]&gt;=v4[i]){L1[i]&lt;-L1[i]+1}</p><p>if (F[i,3]&gt;=v4[i]){L1[i]&lt;-L1[i]+1}</p><p>if (F[i,4]&gt;=v4[i]){L1[i]&lt;-L1[i]+1}</p><p>if (F[i,5]&gt;=v4[i]){L1[i]&lt;-L1[i]+1}</p><p>if (F[i,6]&gt;=v4[i]){L1[i]&lt;-L1[i]+1}</p><p>if (F[i,7]&gt;=v4[i]){L1[i]&lt;-L1[i]+1}</p><p>}</p><p>length(L1)</p><p>message(&quot;number of pops chosen with k = 7, n = 2,</p><p>d = 8, and P* = 0.906 for each of the &quot;,f2,&quot; norm scores&quot;)</p><p>head(L1,5)</p><p>tail(L1,5)</p><p>##################################</p><p>#K-K1 &lt; 0 means fewer pops chosen using rank scores at P* = 0.75</p><p>#L-L1 &lt; 0 means fewer pops chosen using rank scores at P* = 0.90</p><p>Z&lt;-K-K1</p><p>max(Z)</p><p>min(Z)</p><p>W&lt;-L-L1</p><p>max(W)</p><p>min(W)</p><p>table(Z)</p><p>table(W)</p><p>#note that Z[<xref ref-type="bibr" rid="scirp.133525-ref142">142</xref>] = -2</p><p>#C[142,]=c(2,5,5,11,10,12,11)</p><p>#C[142,]=c(1,2,3,4,5,6,7)+c(1,3,2,7,5,6,4)</p><p>#rank sums = c(2,5,5,11,10,12,11)</p><p>#norm scores = c(-2.70436,-1.11008,-1.11008,1.35218,0.70542,</p><p># 1.51474,1.35218)</p><p>#Rank procedure chooses popi if Ti&gt;=max(T)-6=6 so 4 chosen</p><p>#Norm score procedure choose if Si&gt;=max(S)-2.70436=-1.18962 so 6 chosen</p><p>#Rank procedure chooses 2 fewer than Norm score procedure</p></sec></body><back><ref-list><title>References</title><ref id="scirp.133525-ref1"><label>1</label><mixed-citation publication-type="other" xlink:type="simple">Conover, W.J. (1999) Practical Nonparametric Statistic. 3&lt;sup&gt;rd&lt;/sup&gt; Edition, John Wiley &amp; Sons, Inc., New York.</mixed-citation></ref><ref id="scirp.133525-ref2"><label>2</label><mixed-citation publication-type="other" xlink:type="simple">LaVange, L.M. and Koch, G.G. (2006) Rank Score Tests. &lt;i&gt;Circulation&lt;/i&gt;, 114, 2528-2533. &lt;br&gt;https://doi.org/10.1161/CIRCULATIONAHA.106.613638</mixed-citation></ref><ref id="scirp.133525-ref3"><label>3</label><mixed-citation publication-type="journal" xlink:type="simple"><name name-style="western"><surname>McDonald</surname><given-names> G.C. </given-names></name>,<etal>et al</etal>. (<year>1972</year>)<article-title>Some Multiple Comparison Selection Procedures Based on Ranks</article-title><source>&lt;i&gt; &lt;/i&gt;&lt;i&gt;Sankhya&lt;/i&gt;: &lt;i&gt;The Indian Journal of Statistics Series A&lt;/i&gt;</source><volume> 34</volume>,<fpage> 53</fpage>-<lpage>64</lpage>.<pub-id pub-id-type="doi"></pub-id></mixed-citation></ref><ref id="scirp.133525-ref4"><label>4</label><mixed-citation publication-type="journal" xlink:type="simple"><name name-style="western"><surname>McDonald</surname><given-names> G.C. </given-names></name>,<etal>et al</etal>. (<year>1973</year>)<article-title>The Distribution of Some Rank Statistics with Applications in Block Design Selection Problems</article-title><source> &lt;i&gt;Sankhya&lt;/i&gt;: &lt;i&gt;The Indian Journal of Statistics Series&lt;/i&gt; &lt;i&gt;A&lt;/i&gt;</source><volume> 35</volume>,<fpage> 187</fpage>-<lpage>204</lpage>.<pub-id pub-id-type="doi"></pub-id></mixed-citation></ref><ref id="scirp.133525-ref5"><label>5</label><mixed-citation publication-type="other" xlink:type="simple">McDonald, G.C. (1979) Nonparametric Selection Procedures Applied to State Traffic Fatality Rates. &lt;i&gt;Technometrics&lt;/i&gt;, 21, 515-523.&lt;br&gt;https://doi.org/10.1080/00401706.1979.10489822</mixed-citation></ref><ref id="scirp.133525-ref6"><label>6</label><mixed-citation publication-type="book" xlink:type="simple">Lorenzen, T.J. and McDonald, G.C. (1984) A Nonparametric Analysis of Urban, Rural, and Interstate Traffic Fatality Rates. In: Santner, T.J. and Tamhane, A.C., Eds., &lt;i&gt;Design of &lt;/i&gt;&lt;i&gt;Experiments&lt;/i&gt;&lt;i&gt;-&lt;/i&gt;&lt;i&gt;Ranking &lt;/i&gt;&lt;i&gt;and &lt;/i&gt;&lt;i&gt;Selection&lt;/i&gt;, Marcel Dekker, New York, 143-178.</mixed-citation></ref><ref id="scirp.133525-ref7"><label>7</label><mixed-citation publication-type="other" xlink:type="simple">Green, J., McDonald, G.C. and Rao, N. (2006) Using Selection Procedures to Analyze State Traffic Fatality Rates. &lt;i&gt;American Journal of Mathematical and Manag&lt;/i&gt;&lt;i&gt;e&lt;/i&gt;&lt;i&gt;ment Sciences&lt;/i&gt;, 26, 387-416. &lt;br&gt;https://doi.org/10.1080/01966324.2006.10737680</mixed-citation></ref><ref id="scirp.133525-ref8"><label>8</label><mixed-citation publication-type="other" xlink:type="simple">Green, J. and McDonald, G.C. (2009) Nonparametric Subset Selection Procedures: Applications and Properties. &lt;i&gt;American Journal of Mathematical and Management&lt;/i&gt; &lt;i&gt;Sciences&lt;/i&gt;, 29, 413-436. &lt;br&gt;https://doi.org/10.1080/01966324.2009.10737766</mixed-citation></ref><ref id="scirp.133525-ref9"><label>9</label><mixed-citation publication-type="other" xlink:type="simple">McDonald, G.C. (2016) Applications of Subset Selection Procedures and Bayesian Ranking Methods in Analysis of Traffic Fatality Data. &lt;i&gt;WIREs Computational Stati&lt;/i&gt;&lt;i&gt;s&lt;/i&gt;&lt;i&gt;tics&lt;/i&gt;, 8, 222-237. &lt;br&gt;https://doi.org/10.1002/wics.1385</mixed-citation></ref><ref id="scirp.133525-ref10"><label>10</label><mixed-citation publication-type="other" xlink:type="simple">Wang, A.Q. and McDonald, G.C. (2022) Analysis of State Homicide Rates Using Statistical Ranking and Selection Procedures.&lt;i&gt; &lt;/i&gt;&lt;i&gt;Applied Mathematics&lt;/i&gt;, 13, 585-601. &lt;br&gt;https://doi.org/10.4236/am.2022.137037 </mixed-citation></ref><ref id="scirp.133525-ref11"><label>11</label><mixed-citation publication-type="other" xlink:type="simple">Gupta, S.S. and Panchapakesan, S. (1979) Multiple Decision Procedures. John Wiley &amp; Sons, Inc. Republished in the Classics in Applied Mathematics Series, No. 44 (2002), Society for Industrial and Applied Mathematics, Philadelphia.&lt;br&gt;https://epubs.siam.org/doi/pdf/10.1137/1.9780898719161.fm </mixed-citation></ref><ref id="scirp.133525-ref12"><label>12</label><mixed-citation publication-type="other" xlink:type="simple">McDonald, G.C. (2021) Computing Probabilities for Rank Statistics Used with Block Design Nonparametric Subset Selection Rules. &lt;i&gt;American Journal of Mathematical and Management Sciences&lt;/i&gt;, 41, 38-50. &lt;br&gt;https://doi.org/10.1080/01966324.2021.1910885</mixed-citation></ref><ref id="scirp.133525-ref13"><label>13</label><mixed-citation publication-type="other" xlink:type="simple">Harter, H.L. (1961) Expected Values of Normal Order Statistics. &lt;i&gt;Biometrika&lt;/i&gt;, 48, 151-165. &lt;br&gt;https://doi.org/10.1093/biomet/48.1-2.151</mixed-citation></ref><ref id="scirp.133525-ref14"><label>14</label><mixed-citation publication-type="other" xlink:type="simple">Birnbaum, A. and Dudman, J. (1963) Logistic Order Statistics. &lt;i&gt;Annals &lt;/i&gt;&lt;i&gt;o&lt;/i&gt;&lt;i&gt;f Math&lt;/i&gt;&lt;i&gt;e&lt;/i&gt;&lt;i&gt;matical Statistics&lt;/i&gt;, 34, 658-663. &lt;br&gt;https://doi.org/10.1214/aoms/1177704178</mixed-citation></ref><ref id="scirp.133525-ref15"><label>15</label><mixed-citation publication-type="other" xlink:type="simple">Tukey, J.W. (1949) One Degree-of-Freedom for Non-Additivity. &lt;i&gt;Biometrics&lt;/i&gt;, 5, 232-242.&lt;br&gt;https://doi.org/10.2307/3001938</mixed-citation></ref></ref-list></back></article>