<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.4 20241031//EN" "JATS-journalpublishing1-4.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="research-article" dtd-version="1.4" xml:lang="en">
  <front>
    <journal-meta>
      <journal-id journal-id-type="publisher-id">jdaip</journal-id>
      <journal-title-group>
        <journal-title>Journal of Data Analysis and Information Processing</journal-title>
      </journal-title-group>
      <issn pub-type="epub">2327-7203</issn>
      <issn pub-type="ppub">2327-7211</issn>
      <publisher>
        <publisher-name>Scientific Research Publishing</publisher-name>
      </publisher>
    </journal-meta>
    <article-meta>
      <article-id pub-id-type="doi">10.4236/jdaip.2026.141006</article-id>
      <article-id pub-id-type="publisher-id">jdaip-149402</article-id>
      <article-categories>
        <subj-group>
          <subject>Article</subject>
        </subj-group>
        <subj-group>
          <subject>Computer Science</subject>
          <subject>Communications</subject>
          <subject>Physics</subject>
          <subject>Mathematics</subject>
        </subj-group>
      </article-categories>
      <title-group>
        <article-title>Bootstrap Implementation of Mardia’s K² Test and Performance Comparison with Royston’s H Test</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <name name-style="western">
            <surname>Rubia</surname>
            <given-names>José Moral de la</given-names>
          </name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
      </contrib-group>
      <aff id="aff1"><label>1</label> School of Psychology, Universidad Autónoma de Nuevo León, Monterrey, Mexico </aff>
      <author-notes>
        <fn fn-type="conflict" id="fn-conflict">
          <p>The author declares no conflicts of interest regarding the publication of this paper.</p>
        </fn>
      </author-notes>
      <pub-date pub-type="epub">
        <day>01</day>
        <month>02</month>
        <year>2026</year>
      </pub-date>
      <pub-date pub-type="collection">
        <month>02</month>
        <year>2026</year>
      </pub-date>
      <volume>14</volume>
      <issue>01</issue>
      <fpage>83</fpage>
      <lpage>118</lpage>
      <history>
        <date date-type="received">
          <day>07</day>
          <month>01</month>
          <year>2026</year>
        </date>
        <date date-type="accepted">
          <day>02</day>
          <month>02</month>
          <year>2026</year>
        </date>
        <date date-type="published">
          <day>05</day>
          <month>02</month>
          <year>2026</year>
        </date>
      </history>
      <permissions>
        <copyright-statement>© 2026 by the authors and Scientific Research Publishing Inc.</copyright-statement>
        <copyright-year>2026</copyright-year>
        <license license-type="open-access">
          <license-p> This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license ( <ext-link ext-link-type="uri" xlink:href="https://creativecommons.org/licenses/by/4.0/">https://creativecommons.org/licenses/by/4.0/</ext-link> ). </license-p>
        </license>
      </permissions>
      <self-uri content-type="doi" xlink:href="https://doi.org/10.4236/jdaip.2026.141006">https://doi.org/10.4236/jdaip.2026.141006</self-uri>
      <abstract>
        <p>This article has two objectives. The first is to develop an R script that performs Mardia’s K<sup>2</sup> test for assessing multivariate normality (MVN), using both its original asymptotic formulation and a bootstrap method. Unlike previous studies that rely on Monte Carlo simulations to obtain critical values, the bootstrap approach focuses on resampling from sample data while preserving its correlational structure. The second objective is to compare both variants of K<sup>2</sup> test with Royston’s H test in terms of hit rate and statistical power. A total of 200 samples were generated from MVN distributions and another 200 from multivariate t distributions with five degrees of freedom, varying sample size (7 levels), number of variables (5 levels), and homogeneous correlation among variables (5 levels). The script computes the K<sup>2</sup> and H test statistics, their <italic>p</italic>-values, and statistical power, and also provides graphical representations of the sampling distribution of the K<sup>2</sup> statistic. Two worked examples, one based on a randomly generated sample from an MVN distribution and another based on an empirical sample without an MVN distribution, demonstrate the script’s functionality. The sampling distribution of the bootstrap K<sup>2</sup> statistic is less peaked and exhibits a heavier right tail than the distribution implied by the asymptotic approximation, which is defined by a weighted sum of correlated chi-squared variables. When the null hypothesis was retained (<italic>i.e</italic>., under multivariate normality), the H test showed slightly better performance in terms of correct retention. In contrast, when the null hypothesis was violated, both variants of Mardia’s K<sup>2</sup> test outperformed the H test; in this context, the asymptotic variant generally performed better than the bootstrap variant. In samples drawn from a multivariate t distribution, as well as in analyses aggregated across both distributions, the asymptotic K<sup>2</sup> variant achieved the highest mean power. In multivariate normal samples, however, the bootstrap K<sup>2</sup> variant showed superior performance, followed by the asymptotic variant, with the H test performing worst. This new variant is a robust option for assessing MVN, and its use and further exploration are recommended.</p>
      </abstract>
      <kwd-group kwd-group-type="author-generated" xml:lang="en">
        <kwd>Multivariate Normal Distribution</kwd>
        <kwd>Statistical Hypothesis Testing</kwd>
        <kwd>Statistical Power</kwd>
        <kwd>Multivariate Skewness</kwd>
        <kwd>Multivariate Kurtosis</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec1">
      <title>1. Introduction</title>
      <p>In 1970, Mardia [<xref ref-type="bibr" rid="B1">1</xref>] introduced two measures for assessing the shape of multivariate distributions: one for skewness (B<sub>1</sub>) and another for kurtosis (B<sub>2</sub>). He also developed corresponding significance tests for each measure (Q and Z, respectively). In 1974, he proposed a small-sample correction for the significance test (Q<sub>c</sub>) associated with the B₁ skewness measure [<xref ref-type="bibr" rid="B2">2</xref>]. Later, in 1980, he introduced an omnibus test (K<sup>2</sup> = Q + Z<sup>2</sup>), which combines both measures to evaluate multivariate normality [<xref ref-type="bibr" rid="B3">3</xref>]. Years afterward, Urzua [<xref ref-type="bibr" rid="B4">4</xref>] revisited the omnibus K<sup>2</sup> statistic and computed critical values for sample sizes of 10, 20, 50, 100, 200, and ∞ using Monte Carlo simulation. Koizumi <italic>et al</italic>. [<xref ref-type="bibr" rid="B5">5</xref>] also explored this topic through Monte Carlo simulation.</p>
      <p>Mardia [<xref ref-type="bibr" rid="B1">1</xref>][<xref ref-type="bibr" rid="B2">2</xref>] demonstrated that the asymptotic sampling distribution of the test statistics Q and Q<sub>c</sub> follows a chi-square distribution with degrees of freedom given by <italic>k</italic>(<italic>k</italic> + 1)(<italic>k</italic> + 2)/6, while the asymptotic distribution of the Z statistic follows a standard normal distribution. Since these two statistics are asymptotically independent, the asymptotic distribution of the omnibus statistic K<sup>2</sup> is a chi-square distribution with degrees of freedom 1 + <italic>k</italic>(<italic>k</italic> + 1)(<italic>k</italic> + 2)/6.</p>
      <p>The asymptotic approximation improves with larger sample sizes (e.g., <italic>n</italic> ≥ 1000), fewer variables (typically 2 ‑ 4), and low inter-variable correlations (<italic>ρ</italic> ≤ 0.30). Conversely, in small samples with many highly correlated variables, the chi-square approximation becomes inadequate, and the distribution of K<sup>2</sup> more closely resembles a generalized chi-square distribution [<xref ref-type="bibr" rid="B6">6</xref>]. An approximation to this generalized distribution can be obtained using the Bootstrap method [<xref ref-type="bibr" rid="B7">7</xref>].</p>
      <p>The departure of K<sup>2</sup> from a standard chi-square distribution arises because the skewness and kurtosis components of Mardia’s test are not independent in finite samples, especially when the variables are strongly correlated. In such conditions, the joint variability of Q (skewness) and Z (kurtosis) is affected by their non-negligible covariance, producing a weighted sum of correlated quadratic forms rather than the independent sum assumed asymptotically. This dependence structure inflates the variance of K<sup>2</sup> and produces heavier right tails, making its empirical distribution better described by a generalized chi-square variable. As sample size increases or correlations weaken, the covariance between Q and Z diminishes, and the distribution of K<sup>2</sup> converges toward its nominal chi-square form.</p>
      <p>The first objective of this study is to develop an R script [<xref ref-type="bibr" rid="B8">8</xref>] for calculating the omnibus K<sup>2</sup> test using both the asymptotic approach (with a chi-square distribution and 1 + <italic>k</italic>(<italic>k</italic> + 1)(<italic>k</italic> + 2)/6 degrees of freedom, where <italic>k</italic> denotes the number of variables assumed to follow a multivariate normal distribution) and the bootstrap method. This allows for the estimation of critical values, <italic>p</italic>-values, and statistical power without relying on the limited critical value tables available in the literature [<xref ref-type="bibr" rid="B4">4</xref>][<xref ref-type="bibr" rid="B5">5</xref>]. The script also computes asymptotic measures and significance tests for multivariate skewness (B<sub>1</sub> and Q) and multivariate kurtosis (B<sub>2</sub> and Z). The main script is complemented by an auxiliary script that calculates critical values, <italic>p</italic>-values, and statistical power using the well-known MVN H-test [<xref ref-type="bibr" rid="B9">9</xref>]. The implementation of these scripts is illustrated with two examples. The first uses a dataset of 61 observations comprising 3-tuples that represent the D scores from aptitude tests administered to applicants for a master’s program in architecture. The second example uses real-world data collected with four 7-point Likert-type scales. The first dataset is based on a randomly generated sample from an MVN distribution, whereas the second relies on an empirical sample without an MVN distribution. R was chosen because it is one of the most comprehensive statistical software environments, offering free and open access. Moreover, it is continuously developed and reviewed by the statistical and mathematical community [<xref ref-type="bibr" rid="B10">10</xref>][<xref ref-type="bibr" rid="B11">11</xref>].</p>
      <p>The second objective of this statistical-methodological study is to compare the performance of the asymptotic and bootstrap variants of the K<sup>2</sup> test with Royston’s H test in terms of hit rate and statistical power. To this end, 200 random samples are generated from multivariate normal distributions (MVN) and 200 from multivariate Student’s t-distributions (MVT) with five degrees of freedom, varying the sample size (50, 75, 100, 125, 150, 200, 250, and 500), the number of variables (2, 3, 4, 5, and 6), and the homogeneous correlation among variables (0, 0.3, 0.5, 0.7, and 0.9). The study was limited to two distributions for the sake of simplicity: one in which the null hypothesis should be retained (MVN) and one in which it should be rejected (MVT). The multivariate t-distribution with five degrees of freedom was chosen as the primary alternative to normality because its pronounced leptokurtosis, resulting from heavy tails, represents a common and substantial deviation from the multivariate normal distribution. For simplicity, the study focused on only alternative for multivariate non-normality. The three sample configuration variables (sample size, number of variables, and homogeneous correlation) are known determinants of statistical power in multivariate tests [<xref ref-type="bibr" rid="B12">12</xref>]. Royston’s H test was selected as the comparison benchmark because it is one of the most widely used and powerful tests for assessing multivariate normality [<xref ref-type="bibr" rid="B13">13</xref>][<xref ref-type="bibr" rid="B14">14</xref>].</p>
    </sec>
    <sec id="sec2">
      <title>2. Method</title>
      <sec id="sec2dot1">
        <title>2.1. For the First Objective</title>
        <p>The R script consists of two sections, each included in a different appendix, and is available as a Word file (Appendices_with_R_Scripts.docx) in a GitHub repository: <ext-link ext-link-type="uri" xlink:href="https://github.com/josemoraldelarubia579/R_script-for_MVN_K2_and_H_tests.git">https://github.com/josemoraldelarubia579/R_script-for_MVN_K2_and_H_tests.git</ext-link>.</p>
        <p>Appendix 1 presents the script for computing the multivariate skewness coefficient (B<sub>1</sub>) and its asymptotic significance test (Q), the multivariate kurtosis coefficient (B<sub>2</sub>) and its asymptotic significance test (Z), as well as the K<sup>2</sup> statistic and its significance tests using both asymptotic and bootstrap approaches. The asymptotic significance of the K<sup>2</sup> statistic represents the probability of obtaining a value greater than or equal to the observed one under a chi-square distribution with 1 + k(k + 1)(k + 2)/6 degrees of freedom.</p>
        <p>For the bootstrap approach, both the normative bootstrap distribution and the empirical bootstrap distribution are generated. Let us begin with the normative bootstrap distribution. Given a sample of size <italic>n</italic> composed of <italic>k</italic> correlated variables, a normative sample of the same size and dimension is generated to follow a multivariate normal distribution with the same correlational structure. This process begins by creating a square matrix of <italic>k</italic> independent variables with standard normal distribution, each consisting of <italic>n</italic> observations. These values are obtained using the probit (quantile) function of the standard normal distribution. First, equally spaced quantile orders are generated, ranging from 0.5/<italic>n</italic> to 1 − 0.5/<italic>n</italic>. The quantile orders are then randomly permuted for each variable using a fixed seed (123) to ensure reproducibility [<xref ref-type="bibr" rid="B8">8</xref>]. In this way, each of the <italic>k</italic> variables shares the same set of quantile orders in a different sequence, forming mutually independent vectors.</p>
        <p>The resulting matrix of independent Gaussian vectors is multiplied by the lower triangular matrix from the Cholesky decomposition of the original sample’s correlation matrix [<xref ref-type="bibr" rid="B15">15</xref>], yielding a normative sample with the same correlational structure as the original. From this sample, 1000 samples are randomly drawn with replacement by rows—not by columns—in order to preserve the correlational structure. The K<sup>2</sup> statistic is computed for each resample. These 1000 values constitute the normative bootstrap distribution. In this distribution, which replaces the theoretical chi-square distribution with 1 + <italic>k</italic>(<italic>k</italic> + 1)(<italic>k</italic> + 2)/6 degrees of freedom, the probability of observing the original sample’s K<sup>2</sup> statistic is calculated using the right tail. This yields the bootstrap <italic>p</italic>-value. The 95th percentile of the distribution serves as the normative bootstrap critical value, which is then used to estimate statistical power.</p>
        <p>In contrast, the empirical bootstrap distribution is generated by drawing with replacement 1000 samples directly from the original sample, again using row-wise resampling to preserve the correlational structure. The K<sup>2</sup> statistic is calculated for each resample, and these 1000 values form the empirical bootstrap distribution. Using the normative bootstrap critical value (<italic>i.e</italic>., the 95th percentile), the proportion of resamples in which the null hypothesis of multivariate normality is rejected (<italic>i.e</italic>., when the K<sup>2</sup> estimate exceeds the critical value) is calculated. This proportion represents the bootstrap estimate of statistical power at the 5% significance level. If the null hypothesis of multivariate normality holds, power should be below 0.5, preferably under 0.2. Conversely, if the null hypothesis is false, power should exceed 0.5, preferably above 0.8. Otherwise, the result may be considered ambiguous or questionable.</p>
        <p>Finally, the script enables visualization of the sampling distribution of the K² statistic through a figure that displays several components. The empirical bootstrap distribution is presented as a histogram with an overlaid smooth density curve, estimated using kernel density estimation via the density() function in R’s base Stats package. By default, this function employs a Gaussian kernel and determines the bandwidth with the nrd0 method, an adaptation of Silverman’s rule of thumb (Silverman, 1986). The normative bootstrap and chi-square distributions (with 1 + <italic>k</italic>(<italic>k</italic> + 1)(<italic>k</italic> + 2)/6 degrees of freedom) are represented by smooth density curves computed in the same way. In addition, the observed value of the K<sup>2</sup> statistic and the critical values from both the normative bootstrap and chi-square distributions are indicated with vertical lines.</p>
        <p>Appendix 2 provides the R script for computing Royston’s MVN H-test. Using an asymptotic approach, the script calculates both the <italic>p</italic>-value and statistical power at a 5% significance level. This test is included as a comparative or complementary method for assessing multivariate normality. The aim is to evaluate the consistency of results across the asymptotic and bootstrap variants of the K<sup>2</sup> test and the H test in terms of retaining or rejecting the null hypothesis of multivariate normality.</p>
        <p>The operation of the script is illustrated using two examples. The first example is executed with data generated from a multivariate normal distribution; the code used to generate these data is provided in Script 1 (Appendix 1), and the resulting sample is shown in Script 2 (Appendix 2). The second example is executed using the non-normal data provided in Appendix 3.</p>
      </sec>
      <sec id="sec2dot2">
        <title>2.2. For the Second Objective</title>
        <p>A total of 200 samples—each consisting of 50, 75, 100, 125, 150, 200, 250, or 500 observations and 2, 3, 4, 5, or 6 variables with homogeneous correlations of 0, 0.3, 0.5, 0.7, or 0.9—were drawn from a multivariate t-distribution with five degrees of freedom (MVTD). An additional set of 200 samples with identical characteristics was drawn from a multivariate normal distribution (MVND). For both sets, two variants of the K<sup>2</sup> test (K<sup>2</sup>_a and K<sup>2</sup>_b), along with Royston’s H test, were computed. The R package mvtnorm was used to generate the 400 samples [<xref ref-type="bibr" rid="B16">16</xref>]. A single random seed (123) was set at the beginning of each simulation condition to ensure reproducibility [<xref ref-type="bibr" rid="B8">8</xref>]. The random number generator state was allowed to advance naturally across successive data generations, so that the 200 samples correspond to independent realizations rather than identical repetitions. The correlation matrices were specified using a compound symmetry (equicorrelation) structure, with unit variances and a common off-diagonal correlation coefficient <italic>ρ</italic>. Further details can be found in Appendix 4, which is also accessible in the GitHub repository: <ext-link ext-link-type="uri" xlink:href="https://github.com/josemoraldelarubia579/R_script-for_MVN_K2_and_H_tests.git">https://github.com/josemoraldelarubia579/R_script-for_MVN_K2_and_H_tests.git</ext-link>.</p>
        <p>Since the samples were generated using the same seed, the hit rate and statistical power data for the three multivariate normality tests represent repeated measurements. Therefore, statistical tests appropriate for repeated measures were applied.</p>
        <p>Confidence intervals for the hit rates were calculated using the Wilson method with continuity correction [<xref ref-type="bibr" rid="B17">17</xref>]. Differences in hit rates among the five multivariate normality tests were assessed using Cochran’s Q test [<xref ref-type="bibr" rid="B18">18</xref>]. Effect size was measured with the eta-squared coefficient [<xref ref-type="bibr" rid="B19">19</xref>]. Pairwise comparisons of hit rates were conducted using the paired z-test with an estimated common standard error [<xref ref-type="bibr" rid="B20">20</xref>]. Bonferroni correction was applied to control the familywise error rate [<xref ref-type="bibr" rid="B21">21</xref>]. Effect size for pairwise comparisons was calculated using a z-based analogue of Cohen’s d: <italic>d</italic> = <italic>z</italic>/<inline-formula><mml:math display="inline"><mml:mrow><mml:msqrt><mml:mi> n </mml:mi></mml:msqrt></mml:mrow></mml:math></inline-formula> [<xref ref-type="bibr" rid="B20">20</xref>]. The resulting d values were interpreted according to Cohen’s thresholds: [0, 0.2) = trivial, [0.2, 0.5) = small, [0.5, 0.8) = medium, and ≥0.8 = large (Cohen, 1988) [<xref ref-type="bibr" rid="B22">22</xref>]. All omnibus and pairwise comparisons were performed using SPSS version 27 [<xref ref-type="bibr" rid="B23">23</xref>].</p>
        <p>The comparison of mean statistical power (based on the 200 samples from the MVT distribution) and its complement (based on the 200 samples from the MVN distribution) was conducted using repeated measures ANOVA. Sample size, number of variables, and homogeneous correlation among variables were included as covariates in the general linear model (GLM), with distribution type (MVT vs. MVN) as a between-subjects factor. The assumption of sphericity—<italic>i.e</italic>., homogeneity of variances of the differences between all possible pairs of repeated measures—was tested using Mauchly’s test [<xref ref-type="bibr" rid="B24">24</xref>]. When this assumption was violated, a multivariate approach to within-subject effects was employed using Pillai’s trace [<xref ref-type="bibr" rid="B25">25</xref>], as recommended for repeated measures ANOVA [<xref ref-type="bibr" rid="B26">26</xref>]. Effect sizes for each model component were estimated using partial eta-squared coefficients.</p>
        <p>Pairwise comparisons were performed using the paired-samples Student’s t-test with Bonferroni correction. Cohen’s d [<xref ref-type="bibr" rid="B22">22</xref>], with the Hedges–Olkin correction [<xref ref-type="bibr" rid="B27">27</xref>], was used to quantify the effect size. The 95% confidence intervals for the mean statistical power of the three MVN tests, as well as the correlations between statistical power and the three cofactors (in the GLM), were computed using the bias-corrected and accelerated (BCa) percentile method based on 1000 bootstrap samples [<xref ref-type="bibr" rid="B7">7</xref>]. The reported correlations correspond to Pearson product-moment coefficients. These analyses were executed using SPSS version 27 [<xref ref-type="bibr" rid="B23">23</xref>].</p>
        <p>In addition to the violation of the sphericity assumption, the assumption of multivariate normality in the repeated measures component (MVN tests), assessed using Royston’s H test [<xref ref-type="bibr" rid="B9">9</xref>], and the assumption of univariate normality in the distribution of residuals for each repeated measure, assessed using Shapiro–Wilk’s W test [<xref ref-type="bibr" rid="B28">28</xref>][<xref ref-type="bibr" rid="B29">29</xref>], were not satisfied. Therefore, a nonparametric repeated measures ANOVA [<xref ref-type="bibr" rid="B30">30</xref>] was also applied. Pairwise comparisons were conducted using the rank-sum test [<xref ref-type="bibr" rid="B31">31</xref>]. Effect size in the omnibus test was estimated using Kendall’s W coefficient [<xref ref-type="bibr" rid="B32">32</xref>], and in pairwise comparisons using the rank-biserial correlation [<xref ref-type="bibr" rid="B33">33</xref>]. The JASP program version 0.19.2 [<xref ref-type="bibr" rid="B34">34</xref>] was used to perform these power comparisons.</p>
      </sec>
    </sec>
    <sec id="sec3">
      <title>3. Results</title>
      <sec id="sec3dot1">
        <title>3.1. Example with Multivariate Normality</title>
        <p>Three psychometric aptitude scales—assessing mechanical, spatial, and numerical reasoning—were administered to 61 applicants for a Master of Architecture degree. It is assumed that the sample composed of 61 observations of 3-uples follows a multivariate normal distribution. This assumption is tested using both the asymptotic and bootstrap variants of the Mardia’s K<sup>2</sup> test, as well as Royston’s H test.</p>
        <p>The sample was generated using an R script provided in Appendix 1, which also includes the code for running the asymptotic and bootstrap versions of K<sup>2</sup> test. Appendix 2 presents the three scoring vectors and the code for performing H test.</p>
        <p>At a 0.05 significance level, the multivariate distribution—based on 61 observations of 3-tuples—shows symmetry and mesokurtosis, as confirmed by Mardia’s tests. Both versions of the K<sup>2</sup> test support the assumption of multivariate normality, although the bootstrap variant exhibits lower power (ϕ = 0.016 &lt; 0.2) compared to the asymptotic one (ϕ = 0.2516 &lt; 0.5). Royston’s H test also supports multivariate normality, with low power (ϕ = 0.1216 &lt; 0.2). Notably, when the null hypothesis is not rejected, statistical power should fall below 0.5—ideally under 0.2. Otherwise, the result is contradictory or questionable.</p>
        <p><xref ref-type="fig" rid="fig1">Figure 1</xref> shows that the three sampling distributions associated with the K<sup>2</sup> test—namely, the asymptotic (chi-square) distribution in green, the normative bootstrap distribution in red, and the empirical bootstrap distribution in yellow—are similar in overall shape. Among them, the normative bootstrap distribution is the least peaked and exhibits the greatest positive skewness. This divergence indicates that the correlation between multivariate skewness and kurtosis in the empirical data inflates the variability of the K² statistic, causing it to behave more like a generalized chi-square variable with substantial tail weight than like the nominal chi-square distribution.</p>
        <p>Sample size: <italic>n</italic> = 61.</p>
        <p>Number of variables in the original sample: <italic>k</italic> = 3.</p>
        <p>Significance level: <italic>α</italic> = 0.05.</p>
        <p>Average of the correlation among variables: <italic>m</italic>(R<sub>3</sub><sub>×</sub><sub>3</sub>) = 0.6846.</p>
        <p>Mardia’s multivariate skewness and kurtosis tests</p>
        <p>Mardia’s multivariate skewness statistic: B₁ = 4.5598.</p>
        <p>Two-tailed <italic>p</italic>-value for the null hypothesis of multivariate symmetry: <italic>p</italic> = 0.9186.</p>
        <p>Mardia’s multivariate kurtosis statistic: B<sub>2</sub> = −0.5925.</p>
        <p>Two-tailed <italic>p</italic>-value for the null hypothesis of multivariate mesokurtosis:</p>
        <p><italic>p</italic> = 0.5535.</p>
        <p>Asymptotic variant of the MVN K-square test</p>
        <p>Mardia’s multivariate normality statistic: MVN K-square = 4.9108.</p>
        <p>Degrees of freedom: <italic>df</italic> = 11</p>
        <p>Critical value: 0.95χ<sup>2</sup> [<xref ref-type="bibr" rid="B11">11</xref>] = 19.6751</p>
        <fig id="fig1">
          <label>Figure 1</label>
          <graphic xlink:href="https://html.scirp.org/file/2870877-rId19.jpeg?20260205103348" />
        </fig>
        <p><bold>Figure 1.</bold> Histogram (yellow) with density curves for the empirical bootstrap (yellow), normative bootstrap (red), and chi-square (green) distributions. The value of the k-square statistic is indicated by a blue vertical line (K<sup>2</sup> = 4.9108), the critical value of the normative bootstrap distribution by a red vertical line (0.95 quantile: 0.95qboot = 24.0928), and the critical value of the chi-square distribution with 11 degrees of freedom by a green vertical line (0.95χ<sup>2</sup> [<xref ref-type="bibr" rid="B11">11</xref>] = 19.6751).</p>
        <p>Right-tailed <italic>p</italic>-value under a chi-square distribution with 11 degrees of freedom:</p>
        <p><italic>p</italic> = 0.9354.</p>
        <p>H₀ (multivariate normality) is maintained at the 0.05 level based on the asymptotic <italic>p</italic>-value (chi-2 distribution).</p>
        <p>Asymptotic statistical power for the K-square statistic: ϕ = 0.2516 &lt; 0.5.</p>
        <p>If the null hypothesis is rejected, statistical power should exceed 0.5—preferably 0.8 or higher. If the null hypothesis is not rejected, statistical power should be below 0.5—preferably under 0.2. Otherwise, the result is contradictory.</p>
        <p>Bootstrap variant of the MVN K-square test</p>
        <p>Bootstrap critical value (0.95 quantile): 0.95qboot = 24.0928.</p>
        <p>Bootstrap <italic>p</italic>-value: pboot = 0.9790.</p>
        <p>H₀ (multivariate normality) is maintained at the 0.05 level based on the bootstrap <italic>p</italic>-value.</p>
        <p>Bootstrap power: ϕ = 0.016 &lt; 0.2.</p>
        <p>Royston’s multivariate normality H test</p>
        <p>H statistic: <italic>H</italic> = 1.7955.</p>
        <p>Degrees of freedom: <italic>df</italic> = 2.6596.</p>
        <p><italic>p</italic>-value for the null hypothesis of multivariate normality: <italic>p</italic> = 0.5498.</p>
        <p>Statistical power of the H test at a significance level of 0.05: ϕ = 0.1844 &lt; 0.2.</p>
      </sec>
      <sec id="sec3dot2">
        <title>3.2. Example without Multivariate Normality</title>
        <p>Next, an illustrative example is presented using real-world data with characteristics typical of psychology studies. A non-probability sampling method was employed, resulting in an incidental sample of 509 participants, including 300 women (59%) and 209 men (41%). The questionnaire was administered digitally and began with a request for informed consent. The Subjective Happiness Scale [<xref ref-type="bibr" rid="B35">35</xref>], adapted for Mexico and validated in the general Mexican population by Quezada <italic>et al</italic>. [<xref ref-type="bibr" rid="B36">36</xref>], was used as the measurement instrument. Appendix 3 provides the list of variables used in this example, which is available in the GitHub repository at <ext-link ext-link-type="uri" xlink:href="https://github.com/josemoraldelarubia579/R_script-for_MVN_K2_and_H_tests.git">https://github.com/josemoraldelarubia579/R_script-for_MVN_K2_and_H_tests.git</ext-link>. It consists of four Likert-type items, that is, ordinal variables with seven ordered categories. The data frame only needs to be modified to include these four variables —original_data &lt;- data.frame(x1, x2, x3, x4)— and no further changes are required in the scripts presented in Appendices 1 and 2.</p>
        <p>The results clearly reject the null hypothesis of multivariate normality, both for the K<sup>2</sup> test under its asymptotic and bootstrap approaches, and for Royston’s H test under its asymptotic or standard approach, which is the only one considered. <xref ref-type="fig" rid="fig2">Figure 2</xref> displays the histogram with density curves corresponding to the empirical bootstrap, the normative bootstrap, and the chi-square distributions for the K<sup>2</sup> test.</p>
        <fig id="fig2">
          <label>Figure 2</label>
          <graphic xlink:href="https://html.scirp.org/file/2870877-rId21.jpeg?20260205103348" />
        </fig>
        <p><bold>Figure 2.</bold> Histogram (yellow) with density curves for the empirical bootstrap (yellow), normative bootstrap (red), and chi-square (green) distributions. The value of the k-square statistic is indicated by a blue vertical line (k-square = 507.174), the critical value of the normative bootstrap distribution by a red vertical line (0.95qboot = 62.1757), and the critical value of the chi-square distribution with 21 degrees of freedom by a green vertical line (0.95χ<sup>2</sup>[<xref ref-type="bibr" rid="B21">21</xref>] = 32.6706).</p>
        <p>Compared to <xref ref-type="fig" rid="fig1">Figure 1</xref>, which shows the sampling distribution of the K<sup>2</sup> statistic with three variables following a multivariate normal distribution, the empirical distribution is much more leptokurtic and positively skewed. This arises because the four items were negatively asymmetric, and two of them were leptokurtic.</p>
        <p>A further aspect worth noting is the marked difference in scale between <xref ref-type="fig" rid="fig1">Figure 1</xref> and <xref ref-type="fig" rid="fig2">Figure 2</xref>. In <xref ref-type="fig" rid="fig1">Figure 1</xref>, the sampling distribution of the K<sup>2</sup> statistic is concentrated within a narrow range, and the empirical, normative-bootstrap, and chi-square curves overlap closely. By contrast, in <xref ref-type="fig" rid="fig2">Figure 2</xref> the empirical K<sup>2</sup> values extend over a much wider range, with extreme values far exceeding those expected under multivariate normality. This stretching of the horizontal axis reflects the strong departure from normality, while the clear separation between the three density curves shows that neither the chi-square approximation nor the normative bootstrap can capture the heavy-tailed behaviour of the empirical bootstrap.</p>
        <p>Sample size: <italic>n</italic> = 509.</p>
        <p>Number of variables in the original sample: <italic>k</italic> = 4.</p>
        <p>Significance level: <italic>α</italic> = 0.05.</p>
        <p>Average of the correlation among variables: <italic>m</italic>(R) = 0.4281.</p>
        <p>Mardia’s multivariate skewness and kurtosis tests</p>
        <p>Mardia’s multivariate skewness statistic: B1 = 261.5066.</p>
        <p>Two-tailed <italic>p</italic>-value for the null hypothesis of multivariate symmetry: <italic>p</italic> = 0.</p>
        <p>Mardia’s multivariate kurtosis statistic: B2 = 15.6738.</p>
        <p>Two-tailed <italic>p</italic>-value for the null hypothesis of multivariate mesokurtosis: <italic>p</italic> = 0.</p>
        <p>Asymptotic variant of the MVN K-square test</p>
        <p>Mardia’s multivariate normality statistic: MVN K-square = 507.174.</p>
        <p>Degrees of freedom: <italic>df</italic> = 21.</p>
        <p>Critical value: 0.95 _chi-sq[<xref ref-type="bibr" rid="B21">21</xref>] = 32.6706.</p>
        <p>Right-tailed <italic>p</italic>-value under a chi-square distribution with 21 degrees of freedom:</p>
        <p>p_value = 0.</p>
        <p>H_0 (multivariate normality) is rejected at the level 0.05 based on the asymptotic <italic>p</italic>-value (chi-2 distribution).</p>
        <p>Asymptotic statistical power for the K-square statistic: phi = 1.</p>
        <p>If the null hypothesis is rejected, the statistical power should exceed 0.5, preferably 0.8 or higher.</p>
        <p>If the null hypothesis is not rejected, the statistical power should be below 0.5, preferably under 0.2.</p>
        <p>Otherwise, the result is contradictory or questionable.</p>
        <p>Bootstrap variant of the MVN K-square test</p>
        <p>Bootstrap critical value (0.95 quantile): q_boot(0.95) = 62.1757.</p>
        <p>Bootstrap <italic>p</italic>-value: p_boot = 0.</p>
        <p>H_0 (multivariate normality) is rejected at the level 0.05 based on the bootstrap <italic>p</italic>-value.</p>
        <p>Bootstrap power: phi_boot = 1.</p>
        <p>Royston’s multivariate normality H test:</p>
        <p>H statistic: <italic>H</italic> = 281.542.</p>
        <p>Degrees of freedom: <italic>df</italic> = 3.9544.</p>
        <p><italic>p</italic>-value for the null hypothesis of multivariate normality: <italic>p</italic> &lt; 0.0001.</p>
        <p>Statistical power of the H test at a significance level of 0.05: phi = 1.</p>
      </sec>
      <sec id="sec3dot3">
        <title>3.3. Hit Rate Comparisons</title>
        <p>In the 200 samples drawn from the multivariate t distribution, the hit rate (<italic>HR</italic>) of the H test was the lowest, and its confidence interval lay entirely below those of the two variants of the Q test, whose intervals overlapped (<bold>Table 1</bold> and <xref ref-type="fig" rid="fig3">Figure 3</xref>).</p>
        <fig id="fig3">
          <label>Figure 3</label>
          <graphic xlink:href="https://html.scirp.org/file/2870877-rId22.jpeg?20260205103348" />
        </fig>
        <p><bold>Figure 3.</bold> Diagram of the hit rates of the three MVN tests based on 200 samples drawn from multivariate t-distributions, 200 samples drawn from multivariate normal distributions, and a combined sample of 400 observations.</p>
        <p><bold>Table 1.</bold> Hit rates and 95% continuity-corrected Wilson confidence intervals based on 200 samples from multivariate t distributions, 200 samples from multivariate normal distributions, and a combined sample of 400 observations.</p>
        <table-wrap id="tbl1">
          <label>Table 1</label>
          <table>
            <tbody>
              <tr>
                <td rowspan="2">MVNtests</td>
                <td>MTD</td>
                <td>MND</td>
                <td>Combined</td>
              </tr>
              <tr>
                <td>
                  <italic>HR</italic>
                  (
                  <italic>LL</italic>
                  ,
                  <italic>UL</italic>
                  )
                </td>
                <td>
                  <italic>HR</italic>
                  (
                  <italic>LL</italic>
                  ,
                  <italic>UL</italic>
                  )
                </td>
                <td>
                  <italic>HR</italic>
                  (
                  <italic>LL</italic>
                  ,
                  <italic>UL</italic>
                  )
                </td>
              </tr>
              <tr>
                <td>H</td>
                <td>0.915 (0.865, 0.948)</td>
                <td>0.990 (0.961, 0.998)</td>
                <td>0.953 (0.926, 0.970)</td>
              </tr>
              <tr>
                <td>
                  K
                  <sup>2</sup>
                  _a
                </td>
                <td>1 (0.977, 1)</td>
                <td>0.925 (0.877, 0.956)</td>
                <td>0.963 (0.938, 0.978)</td>
              </tr>
              <tr>
                <td>
                  K
                  <sup>2</sup>
                  _b
                </td>
                <td>1 (0.977, 1)</td>
                <td>0.975 (0.939, 0.991)</td>
                <td>0.988 (0.969, 0.995)</td>
              </tr>
            </tbody>
          </table>
        </table-wrap>
        <p>Note. Distribution: MTD = 200 samples drawn from multivariate t distribution, MND = 200 samples drawn from multivariate normal distribution, and combined = 400 samples drawn MTV or MND. MVN test: H = Royston’s H test, K<sup>2</sup>_a = asymptotic variant of the Q test, and K<sup>2</sup>_b = bootstrap variant of Q test. Statistic: <italic>HR</italic> = hit rate, <italic>LL</italic> = lower limit and <italic>UL</italic> = upper limit of a 95% continuity-corrected Wilson confidence interval.</p>
        <p>When comparing HRs among the three MVN tests, a significant difference was found using Cochran’s Q test (Χ<sup>2</sup>[2, <italic>N</italic> = 200] = 34, <italic>p</italic> &lt; 0.001), with a medium effect size (0.06 &lt; <italic>η</italic><sup>2</sup> = 0.085 &lt; 0.14). According to Bonferroni-adjusted paired z tests, the HRs of both Q test variants were significantly higher than that of the H test, although the effect sizes were small (<bold>Table 2</bold>).</p>
        <p>In the combined sample, the HRs of the three MVN tests overlapped (<bold>Table 1</bold> and <xref ref-type="fig" rid="fig3">Figure 3</xref>). However, when comparing HRs among the three MVN tests, a significant difference was found (Χ<sup>2</sup>[2, <italic>N</italic> = 400] = 9.176, p &lt; 0.010), with a small effect size (0.01 &lt; <italic>η</italic><sup>2</sup> = 0.011 &lt; 0.06). After applying Bonferroni’s correction, only one significant difference remained: the HR of the bootstrap variant of the Q test was significantly higher than that of the H test, although the effect size was trivial (<bold>Table 2</bold>).</p>
        <p>The <italic>HR</italic> of the H test was significantly higher for samples drawn from multivariate normal distributions than from multivariate t-distributions (<italic>d</italic> = −0.075, <italic>se</italic> = 0.021, <italic>z</italic> = −3.526, <italic>p</italic> &lt; 0.001). In contrast, the HRs of both the asymptotic variant (<italic>d</italic> = 0.075, <italic>se</italic> = 0.019, <italic>z</italic> = 3.948, <italic>p</italic> &lt; 0.001) and the bootstrap variant (<italic>d</italic> = 0.025, <italic>se</italic> = 0.011, <italic>z</italic> = 2.250, <italic>p</italic> = 0.024) of the K<sup>2</sup> test were significantly higher for samples drawn from MVT than from MVN distributions.</p>
        <p><bold>Table 2.</bold>Pairwise comparisons of hit rates using Bonferroni-adjusted paired z-tests on 200 samples from multivariate t-distributions, 200 samples from multivariate normal distributions, and a combined sample of 400 observations.</p>
        <table-wrap id="tbl2">
          <label>Table 2</label>
          <table>
            <tbody>
              <tr>
                <td>Distribution</td>
                <td>Test1 - Test2</td>
                <td>
                  <italic>md</italic>
                </td>
                <td>
                  <italic>se</italic>
                </td>
                <td>
                  <italic>z</italic>
                </td>
                <td>
                  <italic>p</italic>
                </td>
                <td>
                  <italic>p</italic>
                  <sub>Bonf</sub>
                </td>
                <td>
                  <italic>d</italic>
                </td>
              </tr>
              <tr>
                <td rowspan="3">MTD</td>
                <td>
                  K
                  <sup>2</sup>
                  a - H
                </td>
                <td>0.085</td>
                <td>0.017</td>
                <td>5.050</td>
                <td>&lt;0.001</td>
                <td>&lt;0.001</td>
                <td>0.357</td>
              </tr>
              <tr>
                <td>
                  K
                  <sup>2</sup>
                  b - H
                </td>
                <td>0.085</td>
                <td>0.017</td>
                <td>5.050</td>
                <td>&lt;0.001</td>
                <td>&lt;0.001</td>
                <td>0.357</td>
              </tr>
              <tr>
                <td>
                  K
                  <sup>2</sup>
                  a - K
                  <sup>2</sup>
                  b
                </td>
                <td>0</td>
                <td>0.017</td>
                <td>0</td>
                <td>1</td>
                <td>1</td>
                <td>0</td>
              </tr>
              <tr>
                <td rowspan="3">MND</td>
                <td>
                  K
                  <sup>2</sup>
                  a - H
                </td>
                <td>−0.065</td>
                <td>0.017</td>
                <td>−3.862</td>
                <td>&lt;0.001</td>
                <td>&lt;0.001</td>
                <td>−0.273</td>
              </tr>
              <tr>
                <td>
                  K
                  <sup>2</sup>
                  b - H
                </td>
                <td>−0.015</td>
                <td>0.017</td>
                <td>−0.891</td>
                <td>0.373</td>
                <td>1</td>
                <td>−0.063</td>
              </tr>
              <tr>
                <td>
                  K
                  <sup>2</sup>
                  a - K
                  <sup>2</sup>
                  b
                </td>
                <td>−0.050</td>
                <td>0.017</td>
                <td>−2.970</td>
                <td>0.003</td>
                <td>0.009</td>
                <td>−0.210</td>
              </tr>
              <tr>
                <td rowspan="3">Combined</td>
                <td>
                  K
                  <sup>2</sup>
                  a - H
                </td>
                <td>0.010</td>
                <td>0.012</td>
                <td>0.84</td>
                <td>0.401</td>
                <td>1</td>
                <td>0.042</td>
              </tr>
              <tr>
                <td>
                  K
                  <sup>2</sup>
                  b - H
                </td>
                <td>0.035</td>
                <td>0.012</td>
                <td>2.941</td>
                <td>0.003</td>
                <td>0.010</td>
                <td>0.147</td>
              </tr>
              <tr>
                <td>
                  K
                  <sup>2</sup>
                  a - K
                  <sup>2</sup>
                  b
                </td>
                <td>−0.025</td>
                <td>0.012</td>
                <td>−2.100</td>
                <td>0.036</td>
                <td>0.107</td>
                <td>−0.105</td>
              </tr>
            </tbody>
          </table>
        </table-wrap>
        <p>Note. <italic>md</italic> = mean difference in hit rate between MVN test 1 and MVN test 2, <italic>se</italic> = standard error, <italic>z</italic> = paired z statistic, <italic>p</italic> = two-tailed <italic>p</italic>-value, <italic>p</italic><sub>Bonf</sub> = Bonferroni-adjusted two-tailed <italic>p</italic>-value, <italic>d</italic> = Cohen’s d (effect size measure).</p>
      </sec>
      <sec id="sec3dot4">
        <title>3.4. Mean Statistical Power Comparison</title>
        <p>3.4.1. In Samples Drawn from MVT Distributions</p>
        <p>In the 200 samples drawn from the MVT distributions, the asymptotic variant of the K<sup>2</sup> test exhibited the highest mean statistical power, while the H test showed the lowest. However, the 95% BCa confidence intervals for the mean power overlapped across the three MVN tests (<bold>Table 3</bold> and <xref ref-type="fig" rid="fig4">Figure 4</xref>).</p>
        <p><bold>Table 3.</bold> Mean statistical powers and 95% BCa confidence intervals based on 200 samples from multivariate t distributions, 200 samples from multivariate normal distributions, and a combined sample of 400 observations.</p>
        <table-wrap id="tbl3">
          <label>Table 3</label>
          <table>
            <tbody>
              <tr>
                <td>MVN tests</td>
                <td>MTD</td>
                <td>MND</td>
                <td>Combined</td>
              </tr>
              <tr>
                <td>H</td>
                <td>0.950 (0.906, 1)</td>
                <td>0.732 (0.709, 0.755)</td>
                <td>0.841 (0.814, 0.872)</td>
              </tr>
              <tr>
                <td>
                  K
                  <sup>2</sup>
                  _a
                </td>
                <td>0.994 (0.989, 0.998)</td>
                <td>0.765 (0.731, 0.798)</td>
                <td>0.880 (0.860, 0.898)</td>
              </tr>
              <tr>
                <td>
                  K
                  <sup>2</sup>
                  _b
                </td>
                <td>0.963 (0.951, 0.974)</td>
                <td>0.766 (0.735, 0.799)</td>
                <td>0.864 (0.844, 0.885)</td>
              </tr>
            </tbody>
          </table>
        </table-wrap>
        <p>Note. Point estimates and 95% BCa confidence intervals for the average statistical power (its complement for 200 samples drawn from multivariate normal distributions) based on 1000 simulations.</p>
        <p>The sphericity assumption was strongly violated (Mauchly <italic>W</italic> = 0.075, Χ<sup>2</sup>[2, <italic>N</italic> = 200] = 504.182, <italic>p</italic> &lt; 0.001); therefore, a Huynh-Feldt epsilon correction (ε = 0.528 &lt; 0.7) was applied to adjust the degrees of freedom in the analysis of within-subject effects. Additionally, a multivariate test was included, as it is more appropriate when the sphericity assumption is not met.</p>
        <fig id="fig4">
          <label>Figure 4</label>
          <graphic xlink:href="https://html.scirp.org/file/2870877-rId23.jpeg?20260205103348" />
        </fig>
        <p><bold>Figure 4.</bold> Diagram of the mean statistical power (its complement of the statistical power for 200 samples drawn from multivariate normal distributions) of the three MVN tests.</p>
        <p>The effect of the MVN test type was not significant (<italic>F</italic>[1.056, 206.928] = 1.591, <italic>p</italic> = 0.209, <italic>η</italic><sup>2</sup> = 0.008, ϕ = 0.247), nor were its interactions with sample size (<italic>F</italic>[1.056, 206.928] = 1.4, <italic>p</italic> = 0.240, <italic>η</italic><sup>2</sup> = 0.007, ϕ = 0.223), the number of variables (<italic>F</italic>[1.056, 206.928] = 1.424, <italic>p</italic> = 0.236, <italic>η</italic><sup>2</sup> = 0.007, ϕ = 0.226), or the homogeneous correlation among variables (<italic>F</italic>[1.056, 206.928] = 1.881, <italic>p</italic> = 0.171, <italic>η</italic><sup>2</sup> &lt; 0.010, ϕ = 0.283).</p>
        <p>In contrast to these results, the multivariate test of within-subject effects revealed a significant main effect of the MVN test type (Pillai’s Λ = 0.158; <italic>F</italic>[2, 195] = 18.316, <italic>p</italic> &lt; 0.001; <italic>η</italic><sup>2</sup> = 0.158; ϕ = 1), as were its significant interactions with two of the three cofactors (Pillai’s Λ = 0.072; <italic>F</italic>[2, 195] = 7.610, <italic>p</italic> = 0.001; <italic>η</italic><sup>2</sup> = 0.072; ϕ = 0.944 with sample size and Pillai’s Λ = 0.062; <italic>F</italic>[2, 195] = 6.465, <italic>p</italic> = 0.002; <italic>η</italic><sup>2</sup> = 0.062; ϕ = 0.902 with number of variables). The effect size of the MVN test type on power was large, and the effect sizes of the interactions were medium. However, the interaction between the MVN test type and the homogeneous correlation among variables was not significant (Pillai’s Λ = 0.010; <italic>F</italic>[2, 195] = 0.964, <italic>p</italic> = 0.383; <italic>η</italic>² = 0.010; ϕ = 0.216).</p>
        <p>Based on Pearson’s product-moment correlation, with confidence intervals obtained using the BCa method, the statistical power of the three MVN tests was independent of the homogeneous correlation among variables. However, power was positively correlated with sample size. Additionally, the power of both K² test variants was positively correlated with the number of variables. The strength of association was low for the H test and the asymptotic variant of the K² test, and moderate for the bootstrap variant (see <bold>Table 4</bold>).</p>
        <p>In the pairwise comparisons, only one of the three differences was significant both before and after applying the Bonferroni correction. The asymptotic variant of the K<sup>2</sup> test showed significantly higher mean power than its bootstrap variant, with a small effect size (Hedges-Olkin g = 0.455, 95% asymptotic <italic>CI</italic>: 0.310, 0.600). See <bold>Table 5</bold>.</p>
        <p>Apart from the failure to comply with the assumption of sphericity, the assumption of multivariate normality across the three repeated measures was not met (Royston’s test: <italic>H</italic> = 342.06, <italic>df</italic> = 3.013, <italic>p</italic> &lt; 0.001, ϕ = 1). Additionally, univariate </p>
        <p><bold>Table 4.</bold> Correlations between the mean statistical powers of the three MVN tests and the three cofactors based on 200 samples drawn from multivariate t distributions.</p>
        <table-wrap id="tbl4">
          <label>Table 4</label>
          <table>
            <tbody>
              <tr>
                <td>MVN tests</td>
                <td>
                  <italic>n</italic>
                </td>
                <td>
                  <italic>k</italic>
                </td>
                <td>
                  <italic>r</italic>
                </td>
              </tr>
              <tr>
                <td>H</td>
                <td>0.109* (0.028, 0.333)</td>
                <td>0.114 (−0.060, 0.213)</td>
                <td>−0.095 (−0.303, −0.033)</td>
              </tr>
              <tr>
                <td>
                  K
                  <sup>2</sup>
                  _a
                </td>
                <td>0.196* (0.132, 0.249)</td>
                <td>0.252* (0.160, 0.327)</td>
                <td>
                  −3.71 × 10
                  <sup>−17</sup>
                  (−0.128, 0.150)
                </td>
              </tr>
              <tr>
                <td>
                  K
                  <sup>2</sup>
                  _b
                </td>
                <td>0.300* (0.238, 0.370)</td>
                <td>0.302* (0.225, 0.381)</td>
                <td>
                  2.06 × 10
                  <sup>−17</sup>
                  (−0.135, 0.138)
                </td>
              </tr>
            </tbody>
          </table>
        </table-wrap>
        <p>Note. ϕ = average statistical power (its complement for 200 samples drawn from multivariate normal distributions); <italic>LL</italic> = lower limit, and <italic>UL</italic> = upper limit of the 95% BCa confidence interval based on 1000 simulations.</p>
        <p><bold>Table 5.</bold> Post hoc pairwise comparisons of mean statistical power between MVN tests (Bonferroni-adjusted) based on 200 samples from multivariate t distributions.</p>
        <table-wrap id="tbl5">
          <label>Table 5</label>
          <table>
            <tbody>
              <tr>
                <td>Test1 - Test2</td>
                <td>
                  <italic>md</italic>
                </td>
                <td>
                  <italic>se</italic>
                </td>
                <td>
                  <italic>z</italic>
                </td>
                <td>
                  <italic>p</italic>
                </td>
                <td>
                  <italic>p</italic>
                  <sub>Bonf</sub>
                </td>
              </tr>
              <tr>
                <td>
                  H ‑ K
                  <sup>2</sup>
                  a
                </td>
                <td>−0.044 (−0.113, 0.024)</td>
                <td>0.028</td>
                <td>−1.564</td>
                <td>0.119</td>
                <td>0.358</td>
              </tr>
              <tr>
                <td>
                  H ‑ K
                  <sup>2</sup>
                  b
                </td>
                <td>−0.013 (−0.080, 0.054)</td>
                <td>0.028</td>
                <td>−0.474</td>
                <td>0.636</td>
                <td>1</td>
              </tr>
              <tr>
                <td>
                  K
                  <sup>2</sup>
                  a ‑ K
                  <sup>2</sup>
                  b
                </td>
                <td>0.031 (0.020, 0.042)</td>
                <td>0.005</td>
                <td>6.835</td>
                <td>&lt;0.001</td>
                <td>&lt;0.001</td>
              </tr>
            </tbody>
          </table>
        </table-wrap>
        <p>Note. md = mean difference in statistical power between Test 1 and Test 2 (<italic>LL</italic> = lower limit, and <italic>UL</italic> = upper limit, of the 95% asymptotic confidence interval adjusted using Bonferroni correction to control the Type I error rate for an alpha level of 0.05); se = asymptotic standard error; t[<italic>df</italic>] = paired t-test statistic [degrees of freedom = 196]; <italic>p</italic> = two-tailed asymptotic <italic>p</italic>-value; <italic>p</italic><sub>Bonf</sub> = two-tailed Bonferroni-adjusted asymptotic <italic>p</italic>-value.</p>
        <p>normality was violated in the residuals for each dependent variable (Shapiro-Wilk <italic>W</italic> = 0.312, <italic>p</italic> &lt; 0.001 for H; <italic>W</italic> = 0.472, <italic>p</italic> &lt; 0.001 for K<sup>2</sup>_a; <italic>W</italic> = 0.724, <italic>p</italic> &lt; 0.001 for K<sup>2</sup>_b). Consequently, the result of the analysis of variance should be interpreted with caution.</p>
        <p>The Friedman test revealed a statistically significant difference in the central tendency of statistical power among the three MVN tests (Χ<sup>2</sup>[2, <italic>N</italic> = 200] = 174.162, <italic>p</italic> &lt; 0.001), with a medium effect size for the MVN test type on statistical power (Kendall <italic>W</italic> = 0.435).</p>
        <p><bold>Table 6.</bold> Pairwise comparisons of statistical power using the Conover rank-sum test (Bonferroni-adjusted) based on 200 samples from multivariate t distributions.</p>
        <table-wrap id="tbl6">
          <label>Table 6</label>
          <table>
            <tbody>
              <tr>
                <td>T1 ‑ T2</td>
                <td>
                  <italic>w</italic>
                  <italic>ᵢ</italic>
                </td>
                <td>
                  <italic>w</italic>
                  <italic>ⱼ</italic>
                </td>
                <td>
                  |
                  <italic>t</italic>
                  |
                </td>
                <td>
                  <italic>p</italic>
                </td>
                <td>
                  <italic>p</italic>
                  <sub>Bonf</sub>
                </td>
                <td>
                  |
                  <italic>r</italic>
                  <sub>rb</sub>
                  |
                </td>
              </tr>
              <tr>
                <td>H ‑ K²a</td>
                <td>334</td>
                <td>528.5</td>
                <td>15.308</td>
                <td>&lt;0.001</td>
                <td>&lt;0.001</td>
                <td>0.964</td>
              </tr>
              <tr>
                <td>H ‑ K²b</td>
                <td>334</td>
                <td>337.5</td>
                <td>0.275</td>
                <td>0.783</td>
                <td>1</td>
                <td>0.011</td>
              </tr>
              <tr>
                <td>K²a ‑ K²b</td>
                <td>528.5</td>
                <td>337.5</td>
                <td>15.033</td>
                <td>&lt;0.001</td>
                <td>&lt;0.001</td>
                <td>1</td>
              </tr>
            </tbody>
          </table>
        </table-wrap>
        <p>Note. <italic>w</italic><italic>ᵢ</italic> = sum of ranks for Test 1, <italic>w</italic><italic>ⱼ</italic> = sum of ranks for Test 2, |<italic>t</italic>| = absolute value of the t-test statistic (degrees of freedom: <italic>df</italic> = 398), p = asymptotic two-tailed <italic>p</italic>-value, <italic>p</italic><sub>Bonf</sub> = Bonferroni-adjusted <italic>p</italic>-value for an alpha level of 0.05, |<italic>r</italic><sub>rb</sub>| = absolute value of the rank-biserial correlation, used as a measure of effect size.</p>
        <p>Pairwise comparisons using the Conover rank sum test indicated that two of the three differences were statistically significant, even after applying the Bonferroni correction, with a large effect size based on biserial rank correlation. The central tendency of statistical power for the asymptotic variant of the K<sup>2</sup> test was significantly higher than that of both its bootstrap variant and the H test (<bold>Table 6</bold>).</p>
        <p>3.4.2. In Samples Drawn from MVN Distributions</p>
        <p>In the 200 samples drawn from the MVN distribution, the two variants of the K<sup>2</sup> test exhibited the lowest mean statistical power (<italic>HR</italic> = 0.235, 95% BCa <italic>CI</italic>: 0.202, 0.266 for asymptotic variant and <italic>HR</italic> = 0.234, 95% BCa <italic>CI</italic>: 0.203, 0.265), while H test showed the highest (<italic>HR</italic> = 0.268, 95% BCa <italic>CI</italic>: 0.244, 0.290). Since the null hypothesis should be retained in these 200 tests, the closer the statistical power is to 0, the better the test performs. However, the 95% BCa confidence intervals for the mean statistical power of the three MVT tests overlapped.</p>
        <p>The sphericity assumption was violated (Mauchly <italic>W</italic> = 0.869, Χ<sup>2</sup> [2, <italic>N</italic> = 200] = 27.456, <italic>p</italic> &lt; 0.001); therefore, a Huynh-Feldt epsilon correction (ε = 0.884 &gt; 0.7) was applied to adjust the degrees of freedom in the analysis of within-subject effects. Additionally, the multivariate test was included.</p>
        <p>The effect of the MVN type of test was significant (<italic>F</italic>[1.768, 346.492] = 34.591, <italic>p</italic> &lt; 0.001, <italic>η</italic><sup>2</sup> = 0.150, ϕ = 1), as were its interactions with sample size (<italic>F</italic>[1.768, 346.492] = 11.54, <italic>p</italic> &lt; 0.001, <italic>η</italic><sup>2</sup> = 0.056, ϕ = 0.988) and the number of variables (<italic>F</italic>[1.768, 346.492] = 94.702, <italic>p</italic> &lt; 0.001, <italic>η</italic><sup>2</sup> = 0.326, ϕ = 1). The effect size was large for the main effect and the interaction with the number of variables, while it was small for the interaction with sample size. However, the interaction between the MVN type of test and the homogeneous correlation among variables was not significant (<italic>F</italic>[1.768, 346.492] = 1.843, <italic>p</italic> = 0.165, <italic>η</italic><sup>2</sup> = 0.009, ϕ = 0.360).</p>
        <p>The multivariate test for the analysis of within-subject effects yielded very similar results. The effect of the MVN test type was significant (Pillai’s Λ = 0.350; <italic>F</italic>[2, 195] = 52.423, <italic>p</italic> &lt; 0.001; <italic>η</italic><sup>2</sup> = 0.350; ϕ = 1) as were its interactions with sample size (Pillai’s Λ = 0.091; <italic>F</italic>[2, 195] = 9.706, <italic>p</italic> &lt; 0.001; <italic>η</italic><sup>2</sup> = 0.091; ϕ = 0.981) and number of variables (Pillai’s Λ = 0.588; <italic>F</italic>[2, 195] = 139.404, <italic>p</italic> &lt; 0.001; <italic>η</italic><sup>2</sup> = 0.588; ϕ = 1). The effect size was large for both the main effect and the interaction with number of variables, while it was medium for the interaction with sample size. However, the interaction between the MVN test type and the homogeneous correlation among variables was not significant at the 5% level of significance, though it was significant at the 10% level (Λ of Pillai = 0.029; <italic>F</italic>[2, 394] = 2.871, <italic>p</italic> = 0.059; <italic>η</italic><sup>2</sup> = 0.029; ϕ = 0.557).</p>
        <p>Based on Pearson’s product–moment correlations, with confidence intervals obtained using the BCa method, the statistical power of the two variants of the Q test was independent of the homogeneous correlation among variables. This cofactor was only negatively correlated with the power of the H test, and the strength of this association was small. Sample size was negatively correlated with the power of both the H test and the bootstrap variant of the K<sup>2</sup> test, and positively correlated with the power of the asymptotic variant. In all three cases, the strength of association was small. The number of variables was negatively correlated with the power of both K<sup>2</sup> test variants, showing a strong association with the asymptotic variant and a moderate association with the bootstrap variant; in contrast, it was positively correlated with the power of the H test, with a moderate strength of association (<bold>Table 7</bold>).</p>
        <p><bold>Table 7.</bold> Correlations between the mean statistical powers of the three MVN tests and the three cofactors based on 200 samples drawn from multivariate normal distributions.</p>
        <table-wrap id="tbl7">
          <label>Table 7</label>
          <table>
            <tbody>
              <tr>
                <td>MVN tests</td>
                <td>
                  <italic>n</italic>
                </td>
                <td>
                  <italic>k</italic>
                </td>
                <td>
                  <italic>r</italic>
                </td>
              </tr>
              <tr>
                <td>H</td>
                <td>−0.188* (−0.289, −0.076)</td>
                <td>0.375* (0.257, 0.483)</td>
                <td>−0.184* (−0.327, −0.026)</td>
              </tr>
              <tr>
                <td>
                  K
                  <sup>2</sup>
                  _a
                </td>
                <td>0.146* (−0.028, 0.328)</td>
                <td>−0.795* (−0.829, −0.760)</td>
                <td>0 (−0.162, 0.146)</td>
              </tr>
              <tr>
                <td>
                  K
                  <sup>2</sup>
                  _b
                </td>
                <td>−0.221* (−0.309, −0.121)</td>
                <td>−0.412* (−0.515, −0.307)</td>
                <td>
                  −9.16 × 10
                  <sup>−18</sup>
                  (−0.145, 0.148)
                </td>
              </tr>
            </tbody>
          </table>
        </table-wrap>
        <p>Note. <italic>n</italic> = sample size, <italic>k</italic> = number of variables, and <italic>r</italic> = homogeneous correlation among variables. *An asterisk superscript indicates statistical significance at the 0.05 confidence level, as the confidence interval obtained via the BCa method (based on 1000 simulations) includes zero.</p>
        <p>In the pairwise comparisons, two significant differences were observed before applying the Bonferroni correction. Both variants of the K<sup>2</sup> test showed lower mean statistical power than the H test. However, the effect sizes were trivial and not statistically significant (Hedges-Olkin <italic>g</italic> = 0.098, 95% asymptotic <italic>CI</italic>: −0.041, 0.237 for H - K<sup>2</sup>_a; and <italic>g</italic> = 0.120, 95% asymptotic <italic>CI</italic>: −0.019, 0.259 for H - K<sup>2</sup>_b). After applying the Bonferroni correction, neither of these two differences remained statistically significant (<bold>Table 8</bold>).</p>
        <p><bold>Table 8.</bold> Post hoc pairwise comparisons of mean statistical power between MVN tests (Bonferroni-adjusted) based on 200 samples from multivariate normal distributions.</p>
        <table-wrap id="tbl8">
          <label>Table 8</label>
          <table>
            <tbody>
              <tr>
                <td>T1 ‑ T2</td>
                <td>
                  <italic>md</italic>
                </td>
                <td>
                  <italic>se</italic>
                </td>
                <td>t[196]</td>
                <td>
                  <italic>p</italic>
                </td>
                <td>
                  <italic>p</italic>
                  <sub>Bonf</sub>
                </td>
              </tr>
              <tr>
                <td>
                  H ‑ K
                  <sup>2</sup>
                  a
                </td>
                <td>0.033 (−0.005, 0.072)</td>
                <td>0.016</td>
                <td>2.096</td>
                <td>0.037</td>
                <td>0.112</td>
              </tr>
              <tr>
                <td>
                  H ‑ K
                  <sup>2</sup>
                  b
                </td>
                <td>0.034 (−0.007, 0.074)</td>
                <td>0.017</td>
                <td>2.022</td>
                <td>0.045</td>
                <td>0.134</td>
              </tr>
              <tr>
                <td>
                  K
                  <sup>2</sup>
                  a ‑ K
                  <sup>2</sup>
                  b
                </td>
                <td>0.0003 (−0.050, 0.051)</td>
                <td>0.021</td>
                <td>0.014</td>
                <td>0.989</td>
                <td>1</td>
              </tr>
            </tbody>
          </table>
        </table-wrap>
        <p>Note. md = mean difference in statistical power between Test 1 and Test 2 (<italic>LL</italic> = lower limit, and <italic>UL</italic> = upper limit, of the 95% asymptotic confidence interval adjusted using Bonferroni correction to control the Type I error rate for an alpha level of 0.05); se = asymptotic standard error; t[<italic>df</italic>] = paired t-test statistic [degrees of freedom = 196]; <italic>p</italic> = two-tailed asymptotic <italic>p</italic>-value; <italic>p</italic><sub>Bonf</sub> = two-tailed Bonferroni-adjusted asymptotic <italic>p</italic>-value.</p>
        <p>Apart from the violation of the sphericity assumption, the assumption of multivariate normality across the three repeated measures was not met (Royston’s test: <italic>H</italic> = 132.84, <italic>df</italic> = 2.939, <italic>p</italic> &lt; 0.001, ϕ = 1). Additionally, univariate normality was violated in the residuals for each dependent variable (Shapiro-Wilk: <italic>W</italic> = 0.966, <italic>p</italic> &lt; 0.001 for H; <italic>W</italic> = 0.939, <italic>p</italic> &lt; 0.001 for K<sup>2</sup>_a; <italic>W</italic> = 0.960, <italic>p</italic> &lt; 0.001 for K<sup>2</sup>_b). Although these deviations are smaller than those observed in the general linear model based on 200 samples drawn from multivariate t-distributions, the results of the analysis of variance should still be interpreted with caution.</p>
        <p>The Friedman test revealed a statistically significant difference in the central tendency of statistical power among the three MVN tests (Χ<sup>2</sup>[2, <italic>N</italic> = 200] = 275.16, <italic>p</italic> &lt; 0.001), with a large effect size of test type on power (Kendall <italic>W</italic> = 0.688).</p>
        <p>In pairwise comparisons using the Conover rank sum test, all three differences in central tendency were statistically significant, even after applying the Bonferroni correction. Based on the rank-biserial correlation, the effect size was large in both comparisons involving the H test, and small in the comparison between the two K<sup>2</sup> test variants. The H test had the highest sum of ranks (<bold>Table 9</bold>).</p>
        <p><bold>Table 9.</bold>Pairwise comparisons of statistical power using the Conover rank-sum test (Bonferroni-adjusted) based on 200 samples from multivariate normal distributions.</p>
        <table-wrap id="tbl9">
          <label>Table 9</label>
          <table>
            <tbody>
              <tr>
                <td>T1 ‑ T2</td>
                <td>
                  <italic>w</italic>
                  <italic>ᵢ</italic>
                </td>
                <td>
                  <italic>w</italic>
                  <italic>ⱼ</italic>
                </td>
                <td>
                  |
                  <italic>t</italic>
                  |
                </td>
                <td>
                  <italic>p</italic>
                </td>
                <td>
                  <italic>p</italic>
                  <sub>Bonf</sub>
                </td>
                <td>
                  |
                  <italic>r</italic>
                  <sub>rb</sub>
                  |
                </td>
              </tr>
              <tr>
                <td>
                  H ‑ K
                  <sup>2</sup>
                  a
                </td>
                <td>590</td>
                <td>326</td>
                <td>23.569</td>
                <td>&lt;0.001</td>
                <td>&lt;0.001</td>
                <td>0.995</td>
              </tr>
              <tr>
                <td>
                  H ‑ K
                  <sup>2</sup>
                  b
                </td>
                <td>590</td>
                <td>284</td>
                <td>27.318</td>
                <td>&lt;0.001</td>
                <td>&lt;0.001</td>
                <td>0.996</td>
              </tr>
              <tr>
                <td>
                  K
                  <sup>2</sup>
                  a ‑ K
                  <sup>2</sup>
                  b
                </td>
                <td>326</td>
                <td>284</td>
                <td>3.750</td>
                <td>&lt;0.001</td>
                <td>&lt;0.001</td>
                <td>0.174</td>
              </tr>
            </tbody>
          </table>
        </table-wrap>
        <p>Note. <italic>w</italic><italic>ᵢ</italic> = sum of ranks for Test 1, <italic>w</italic><italic>ⱼ</italic> = sum of ranks for Test 2, |<italic>t</italic>| = absolute value of the t-test statistic (degrees of freedom: <italic>df</italic> = 398), p = asymptotic two-tailed <italic>p</italic>-value, pBonf = Bonferroni-adjusted <italic>p</italic>-value for an alpha level of 0.05, |<italic>r</italic><sub>rb</sub>| = absolute value of the rank-biserial correlation, used as a measure of effect size.</p>
        <p><bold>Table 10.</bold> Critical values for the K<sup>2</sup> test at the 10% and 5% significance levels.</p>
        <table-wrap id="tbl10">
          <label>Table 10</label>
          <table>
            <tbody>
              <tr>
                <td colspan="2">
                  <italic>Sanple</italic>
                </td>
                <td colspan="3">Critical value at 10%</td>
                <td colspan="3">Critical value at 5%</td>
              </tr>
              <tr>
                <td>
                  <italic>k</italic>
                </td>
                <td>
                  <italic>n</italic>
                </td>
                <td>Urzua</td>
                <td>Boot</td>
                <td>Chi-2</td>
                <td>Urzua</td>
                <td>Boot</td>
                <td>Chi-2</td>
              </tr>
              <tr>
                <td>2</td>
                <td>50</td>
                <td>7.83</td>
                <td>6.06</td>
                <td>9.24</td>
                <td>11.57</td>
                <td>7.09</td>
                <td>11.07</td>
              </tr>
              <tr>
                <td>
                </td>
                <td>100</td>
                <td>7.98</td>
                <td>8.10</td>
                <td>
                </td>
                <td>11.20</td>
                <td>9.91</td>
                <td>
                </td>
              </tr>
              <tr>
                <td>
                </td>
                <td>200</td>
                <td>7.88</td>
                <td>11.14</td>
                <td>
                </td>
                <td>10.94</td>
                <td>13.29</td>
                <td>
                </td>
              </tr>
              <tr>
                <td>3</td>
                <td>50</td>
                <td>11.35</td>
                <td>22.02</td>
                <td>17.28</td>
                <td>16.62</td>
                <td>24.14</td>
                <td>19.68</td>
              </tr>
              <tr>
                <td>
                </td>
                <td>100</td>
                <td>11.12</td>
                <td>27.16</td>
                <td>
                </td>
                <td>15.24</td>
                <td>30.15</td>
                <td>
                </td>
              </tr>
              <tr>
                <td>
                </td>
                <td>200</td>
                <td>11.01</td>
                <td>22.41</td>
                <td>
                </td>
                <td>14.37</td>
                <td>24.94</td>
                <td>
                </td>
              </tr>
              <tr>
                <td>4</td>
                <td>50</td>
                <td>14.81</td>
                <td>35.25</td>
                <td>29.62</td>
                <td>20.43</td>
                <td>38.05</td>
                <td>32.67</td>
              </tr>
              <tr>
                <td>
                </td>
                <td>100</td>
                <td>14.69</td>
                <td>45.19</td>
                <td>
                </td>
                <td>19.16</td>
                <td>49.16</td>
                <td>
                </td>
              </tr>
              <tr>
                <td>
                </td>
                <td>200</td>
                <td>14.00</td>
                <td>36.37</td>
                <td>
                </td>
                <td>18.21</td>
                <td>39.46</td>
                <td>
                </td>
              </tr>
              <tr>
                <td>5</td>
                <td>50</td>
                <td>18.06</td>
                <td>61.87</td>
                <td>47.21</td>
                <td>23.95</td>
                <td>65.52</td>
                <td>51</td>
              </tr>
              <tr>
                <td>
                </td>
                <td>100</td>
                <td>17.60</td>
                <td>71.90</td>
                <td>
                </td>
                <td>22.91</td>
                <td>76.47</td>
                <td>
                </td>
              </tr>
              <tr>
                <td>
                </td>
                <td>200</td>
                <td>17.00</td>
                <td>70.94</td>
                <td>
                </td>
                <td>21.25</td>
                <td>78.03</td>
                <td>11.07</td>
              </tr>
            </tbody>
          </table>
        </table-wrap>
        <p>Note. <italic>k</italic> = number of variables; <italic>n</italic> = sample size. Urzua = critical values for the K<sup>2</sup> test at the 10% and 5% significance levels, based on Urzúa (1997, p. 352); Boot = bootstrap critical value (<italic>i.e</italic>., the 0.90 or 0.95 quantile of the normative bootstrap distribution); Chi-2 = 0.90 or 0.95 quantile of the chi-squared distribution with 1 + <italic>k</italic>(<italic>k</italic> + 1))(<italic>k</italic> + 2)/6 degrees of freedom.</p>
        <p>The 10% and 5% critical values reported by Urzua (1997, p. 352) [<xref ref-type="bibr" rid="B3">3</xref>], based on Monte Carlo simulations, were lower than the corresponding critical values from both the chi-square distribution with 1 + <italic>k</italic>(<italic>k</italic> + 1)(<italic>k</italic> + 2)⁄6 degrees of freedom and those generated by the script used in this study (from the normative bootstrap distribution). The latter were the highest.</p>
        <p>The average absolute difference between Urzua’s critical values and those from the chi-square distribution was 12.31, while the difference with the critical values obtained by the script was 10.01 (<bold>Table 10</bold>).</p>
        <p>3.4.3. In the Combined Sample of 400 Observations</p>
        <p>In the combined sample, the asymptotic variant of the K<sup>2</sup> test exhibited the highest mean power, while the H test showed the lowest. However, the 95% BCa confidence intervals for the mean statistical power overlapped among the three tests (<bold>Table 3</bold> and <xref ref-type="fig" rid="fig4">Figure 4</xref>).</p>
        <p>The sphericity assumption was violated (Mauchly <italic>W</italic> = 0.758, Χ<sup>2</sup>[2, <italic>N</italic> = 400] = 109.11, <italic>p</italic> &lt; 0.001); therefore, a Greenhouse-Geisser epsilon factor (ε = 0.805 &gt; 0.7) was applied to adjust the degrees of freedom in the analysis of within-subject effects and the multivariate test was included.</p>
        <p>The effect of the MVN test type was significant (<italic>F</italic>[1.61, 636.127] = 6.652, <italic>p</italic> = 0.003, <italic>η</italic><sup>2</sup> = 0.017, ϕ = 0.864), as were its interactions with sample size (<italic>F</italic>[1.61, 636.127] = 7.062, <italic>p</italic> = 0.002, <italic>η</italic><sup>2</sup> = 0.018, ϕ = 0.884) and number of variables (<italic>F</italic>[1.61, 636.127] = 22.406, <italic>p</italic> &lt; 0.001, <italic>η</italic><sup>2</sup> = 0.054, ϕ = 1). In all three cases, the effect size was small. However, the interaction between MVN test type and homogeneous correlation among variables was not significant (<italic>F</italic>[1.61, 636.127] = 0.055, <italic>p</italic> = 0.914, <italic>η</italic><sup>2</sup> &lt; 0.001, ϕ = 0.058), nor was the interaction with distribution type (<italic>F</italic>[1.61, 636.127] = 0.526, <italic>p</italic> = 0.552, <italic>η</italic><sup>2</sup> = 0.001, ϕ = 0.128).</p>
        <p>The multivariate test for the analysis of within-subject effects yielded the same result. The effect of the MVN test type was significant (Pillai’s Λ = 0.022; <italic>F</italic>[2, 394] = 4.479, <italic>p</italic> = 0.012; <italic>η</italic><sup>2</sup> = 0.022; ϕ = 0.765) as were its interactions with sample size (Pillai’s Λ = 0.052; <italic>F</italic>[2, 394] = 10.700, <italic>p</italic> &lt; 0.001; <italic>η</italic><sup>2</sup> = 0.052; ϕ = 0.99) and number of variables (Pillai’s Λ = 0.082; <italic>F</italic>[2, 394] = 17.639, <italic>p</italic> &lt; 0.001; <italic>η</italic><sup>2</sup> = 0.082; ϕ = 1). All three significant effects were small. The other two interactions involving the MVN test type were not significant: neither with homogeneous correlation among variables (Pillai’s Λ = 0; <italic>F</italic>[2, 394] = 0.037, <italic>p</italic> = 0.963; <italic>η</italic><sup>2</sup> = 0; ϕ = 0.056), nor with distribution type (Pillai’s Λ = 0.005; <italic>F</italic>[2, 394] = 1.026, <italic>p</italic> = 0.359; <italic>η</italic><sup>2</sup> = 0.005; ϕ = 0.229).</p>
        <p>Based on Pearson’s product-moment correlation, with confidence intervals obtained using the BCa method, the statistical power of the three tests was independent of the homogeneous correlation among variables. The statistical power of the H test correlated positively with sample size, with a small strength of association (<italic>r</italic> = 0.114, 95% BCa <italic>CI</italic>: 0.034, 0.270). The statistical power of the asymptotic variant of the K<sup>2</sup> test correlated positively with the number of variables, with a moderate strength of association. The statistical power of the bootstrap variant of the K<sup>2</sup> test correlated positively with both sample size and number of variables, with a small strength of association in each case (<bold>Table 11</bold>).</p>
        <p><bold>Table 11.</bold>Correlations between the statistical powers of the three MVN tests and the three cofactors based on a combined sample of 400 observations.</p>
        <table-wrap id="tbl11">
          <label>Table 11</label>
          <table>
            <tbody>
              <tr>
                <td>MVN tests</td>
                <td>
                  <italic>n</italic>
                </td>
                <td>
                  <italic>k</italic>
                </td>
                <td>
                  <italic>r</italic>
                </td>
              </tr>
              <tr>
                <td>H</td>
                <td>0.114* (0.034, 0.270)</td>
                <td>−0.022 (−0.189, 0.065)</td>
                <td>−0.014 (−0.110, 0.063)</td>
              </tr>
              <tr>
                <td>
                  K
                  <sup>2</sup>
                  _a
                </td>
                <td>−0.069 (−0.204, 0.045)</td>
                <td>0.477* (0.420, 0.539)</td>
                <td>
                  4.55 × 10
                  <sup>−</sup>
                  <sup>16</sup>
                  (−0.104, 0.101)
                </td>
              </tr>
              <tr>
                <td>
                  K
                  <sup>2</sup>
                  _b
                </td>
                <td>0.187*(0.122, 0.245)</td>
                <td>0.296* (0.214, 0.387)</td>
                <td>
                  4.65 × 10
                  <sup>−</sup>
                  <sup>18</sup>
                  (−0.093, 0.094)
                </td>
              </tr>
            </tbody>
          </table>
        </table-wrap>
        <p>Note. <italic>n</italic> = sample size, <italic>k</italic> = number of variables, and <italic>r</italic> = homogeneous correlation among variables. * An asterisk superscript indicates statistical significance at the 0.05 confidence level, as the confidence interval obtained via the BCa method (based on 1000 simulations) includes zero.</p>
        <p>In the pairwise comparisons, only one of the three differences was statistically significant before applying the Bonferroni correction. The asymptotic approximation of the K<sup>2</sup> test showed significantly higher mean power than the H test, with a trivial effect size (Hedges-Olkin <italic>g</italic> = −0.104, 95% asymptotic <italic>CI</italic>: −0.202, −0.006). However, after applying the Bonferroni correction, none of the differences remained statistically significant (see <bold>Table 12</bold>).</p>
        <p><bold>Table 12.</bold> Post hoc pairwise comparisons of mean statistical power between MVN tests (Bonferroni-adjusted) based on a combined sample of 400 observations.</p>
        <table-wrap id="tbl12">
          <label>Table 12</label>
          <table>
            <tbody>
              <tr>
                <td>T1 ‑ T2</td>
                <td>
                  <italic>md</italic>
                </td>
                <td>
                  <italic>se</italic>
                </td>
                <td>
                  <italic>t</italic>
                  [196]
                </td>
                <td>
                  <italic>p</italic>
                </td>
                <td>
                  <italic>p</italic>
                  <sub>Bonf</sub>
                </td>
              </tr>
              <tr>
                <td>
                  H ‑ K
                  <sup>2</sup>
                  a
                </td>
                <td>−0.039 (−0.081, 0.004)</td>
                <td>0.018</td>
                <td>−2.182</td>
                <td>395</td>
                <td>0.030</td>
              </tr>
              <tr>
                <td>
                  H ‑ K
                  <sup>2</sup>
                  b
                </td>
                <td>−0.023 (−0.064, 0.017)</td>
                <td>0.017</td>
                <td>−1.388</td>
                <td>395</td>
                <td>0.166</td>
              </tr>
              <tr>
                <td>
                  K
                  <sup>2</sup>
                  a ‑ K
                  <sup>2</sup>
                  b
                </td>
                <td>0.015 (−0.011, 0.042)</td>
                <td>0.011</td>
                <td>1.382</td>
                <td>395</td>
                <td>0.168</td>
              </tr>
            </tbody>
          </table>
        </table-wrap>
        <p>Note. md = mean difference in statistical power (its complement for 200 samples drawn from multivariate normal distributions) between Test 1 and Test 2 (<italic>LL</italic> = lower limit, and <italic>UL</italic> = upper limit, of the 95% asymptotic confidence interval adjusted using Bonferroni correction to control the Type I error rate for an alpha level of 0.05); se = asymptotic standard error; <italic>t</italic>[<italic>df</italic>] = paired t-test statistic [degrees of freedom = 395]; <italic>p</italic> = two-tailed asymptotic <italic>p</italic>-value; <italic>p</italic><sub>Bonf</sub> = two-tailed Bonferroni-adjusted asymptotic <italic>p</italic>-value.</p>
        <p>It should be noted that the assumption of homogeneity across the matrices of observed covariances of the dependent variables was not met (Box <italic>M</italic> = 987.872, <italic>F</italic>[6, 1147681.811] = 163.3, <italic>p</italic> &lt; 0.001), nor was the assumption of homogeneity of variance between the residuals of two dependent variables (<italic>F</italic>[1, 398] = 30.373, <italic>p</italic> &lt; 0.001 for K<sup>2</sup>_a and <italic>F</italic>[1, 398] = 109.609, <italic>p</italic> &lt; 0.001 for K<sup>2</sup>_b). However, this assumption was met for the H test (<italic>F</italic>[1, 398] = 0.044, <italic>p</italic> = 0.833). Additionally, the assumption of multivariate normality was violated across the three repeated measures (Royston’s test: <italic>H</italic> = 368.72, <italic>df</italic> = 3.003, <italic>p</italic> &lt; 0.001, ϕ = 1), as well as the univariate normality in the residuals for each dependent variable (Shapiro-Wilk <italic>W</italic> = 0.377, <italic>p</italic> &lt; 0.001 for H; <italic>W</italic> = 0.905, <italic>p</italic> &lt; 0.001 for K<sup>2</sup>_a; <italic>W</italic> = 0.947, <italic>p</italic> &lt; 0.001 for K<sup>2</sup>_b). Consequently, the results of the analysis of variance should be interpreted with caution.</p>
        <p>The Friedman test revealed a statistically significant difference in the central tendency of statistical power among the three MVN tests (Χ<sup>2</sup>[2, <italic>N</italic> = 400] = 84.241, <italic>p</italic> &lt; 0.001), with a small effect size of test type on statistical power (Kendall <italic>W</italic> = 0.105).</p>
        <p>In the pairwise comparisons using the Conover rank sum test, all three differences in central tendency were significant, even after applying the Bonferroni correction. The effect size was medium for the comparison between the asymptotic variant of the K<sup>2</sup> test and the H test, and small for the other two comparisons, based on rank-biserial correlation (<bold>Table 13</bold>).</p>
        <p><bold>Table 13.</bold> Pairwise comparisons of statistical power using the Conover rank-sum test (Bonferroni-adjusted) based on a combined sample of 400 observations.</p>
        <table-wrap id="tbl13">
          <label>Table 13</label>
          <table>
            <tbody>
              <tr>
                <td>T1 ‑ T2</td>
                <td>
                  <italic>w</italic>
                  <italic>ᵢ</italic>
                </td>
                <td>
                  <italic>wj</italic>
                </td>
                <td>
                  |
                  <italic>t</italic>
                  |
                </td>
                <td>
                  <italic>p</italic>
                </td>
                <td>
                  <italic>p</italic>
                  <sub>Bonf</sub>
                </td>
                <td>
                  |
                  <italic>r</italic>
                  <sub>rb</sub>
                  |
                </td>
              </tr>
              <tr>
                <td>
                  H ‑ K
                  <sup>2</sup>
                  a
                </td>
                <td>692.5</td>
                <td>929.5</td>
                <td>9.568</td>
                <td>&lt;0.001</td>
                <td>&lt;0.001</td>
                <td>0.388</td>
              </tr>
              <tr>
                <td>
                  H ‑ K
                  <sup>2</sup>
                  b
                </td>
                <td>692.5</td>
                <td>778</td>
                <td>3.452</td>
                <td>&lt;0.001</td>
                <td>0.002</td>
                <td>0.165</td>
              </tr>
              <tr>
                <td>
                  K
                  <sup>2</sup>
                  a ‑ K
                  <sup>2</sup>
                  b
                </td>
                <td>929.5</td>
                <td>778</td>
                <td>6.116</td>
                <td>&lt;0.001</td>
                <td>&lt;0.001</td>
                <td>0.155</td>
              </tr>
            </tbody>
          </table>
        </table-wrap>
        <p>Note. <italic>w</italic><italic>ᵢ</italic> = sum of ranks for Test 1, <italic>w</italic><italic>ⱼ</italic> = sum of ranks for Test 2, |<italic>t</italic>| = absolute value of the t-test statistic (degrees of freedom: <italic>df</italic> = 798), <italic>p</italic> = asymptotic two-tailed <italic>p</italic>-value, <italic>p</italic><sub>Bonf</sub> = Bonferroni-adjusted <italic>p</italic>-value for an alpha level of 0.05, |<italic>r</italic><sub>rb</sub>| = absolute value of the rank-biserial correlation, used as a measure of effect size.</p>
      </sec>
    </sec>
    <sec id="sec4">
      <title>4. Discussion</title>
      <p>The R program successfully met the study’s first objective: automating the computation of Mardia’s K<sup>2</sup> test using both the original asymptotic approach [<xref ref-type="bibr" rid="B3">3</xref>] and a novel bootstrap method, while also incorporating Royston’s H test as a complementary tool for assessing multivariate normality. Running the R script on a dataset of 61 D-scores from three correlated variables generated under an MVN distribution, as well as on an empirical dataset of 509 Likert-type scores from four correlated variables that do not follow an MVN distribution, demonstrated how straightforward it is to interpret the results based on the <italic>p</italic>-value, the critical value, and the statistical power.</p>
      <p>Additionally, the script generates a graph depicting the sampling distribution of the K<sup>2</sup> statistic, estimated using both the asymptotic approach (a chi-square distribution with 1 + <italic>k</italic>(<italic>k</italic> + 1)(<italic>k</italic> + 2)/6 degrees of freedom) and resampling methods (empirical and normative bootstrap distributions). These density curves tend to converge when the original sample is drawn from a multivariate normal distribution, especially when inter-variable correlations are low. The greater the deviation from normality, the more pronounced the discrepancies among the three curves.</p>
      <p>Given that the hypothesis pertains to the distribution (<italic>i.e</italic>., multivariate normality), a normative sampling distribution can be generated to replace the asymptotic approximation (chi-square) for computing the <italic>p</italic>-value. This also allows for the determination of a critical value to evaluate the rejection rate within the empirical bootstrap distribution, thereby estimating statistical power (<italic>i.e</italic>., the proportion of estimates exceeding the critical value). The normative bootstrap distribution is less peaked and exhibits a heavier right tail than the chi-square distribution, resembling a generalized chi-square distribution—typical of quadratic forms involving correlated Gaussian variables [<xref ref-type="bibr" rid="B6">6</xref>], as illustrated in the example. For the K² statistic, this behavior reflects the correlation between multivariate skewness and [<xref ref-type="bibr" rid="B1">1</xref>][<xref ref-type="bibr" rid="B37">37</xref>]. To preserve the original interdependence among variables, the lower triangular matrix from the Cholesky decomposition of the correlation matrix is used to generate the normative sample [<xref ref-type="bibr" rid="B38">38</xref>], and resampling is performed by rows [<xref ref-type="bibr" rid="B39">39</xref>].</p>
      <p>Psychological theory holds that intellectual abilities are normally distributed and highly correlated, forming the basis for a general factor [<xref ref-type="bibr" rid="B40">40</xref>]. Beyond demonstrating the script’s correct execution with an example where all three MVN tests—and the convergence of the sampling distribution curves—supported the multivariate normality of intellectual abilities, the study’s second objective was to compare the performance of the two K<sup>2</sup> test variants with that of Royston’s H test.</p>
      <p>Two hundred samples were simulated under the null hypothesis of multivariate normality, and another 200 under non-normal conditions using a multivariate t-distribution with five degrees of freedom. The number of variables (ranging from 2 to 6), sample size (50, 75, 100, 125, 150, 200, 250, and 500), and the homogeneous correlation among variables were systematically varied. These three factors—number of variables, sample size, and inter-variable correlation—were treated as cofactors, as they are quantitative variables that influence statistical power in hypothesis testing (Jobst <italic>et al</italic>., 2023). To simplify the simulation, homogeneous correlations were used (0, 0.3, 0.5, 0.7, and 0.9). Combining the two sample sets yielded a dataset of 400 observations, with a “distribution type” variable including two categories (MVT and MVN). As both sample sets were generated using the same random seed (123), they were analyzed using repeated measures statistical methods [<xref ref-type="bibr" rid="B31">31</xref>].</p>
      <p>To assess the performance of each MVN test, two indicators were considered: hit rate and statistical power [<xref ref-type="bibr" rid="B41">41</xref>]. When rejecting the null hypothesis, power should exceed 0.5—ideally reaching 0.8 or 0.9—when 200 samples are drawn from a multivariate t-distribution with five degrees of freedom. Conversely, when the null hypothesis is true (<italic>i.e</italic>., the 200 samples are from a multivariate normal distribution), power should remain below 0.5, ideally under 0.2 or 0.1 [<xref ref-type="bibr" rid="B42">42</xref>]. Because of this inverse directionality, when comparing the power of the three MVN tests in the combined sample, the complement of power (<italic>i.e</italic>., the type II error rate) was used for the MVN samples.</p>
      <p>All three MVN tests demonstrated high hit rates, exceeding 0.9. When the null hypothesis was retained, the H test showed the best performance, whereas when it was rejected, the two K<sup>2</sup> variants outperformed the H test. In the combined sample, the bootstrap variant of the K<sup>2</sup> test demonstrated superior performance compared to the H test.</p>
      <p>Mean comparisons were conducted among the MVN tests with respect to statistical power. The distributions of statistical power were non-normal, both univariately and multivariately. Likewise, the residuals from the general linear models violated the assumption of normality. Assumptions of sphericity (<italic>i.e</italic>., equality of variances in pairwise differences) and homogeneity of observed covariance matrices across dependent variables were also violated. In these models, the powers of the MVN tests were treated as correlated or repeated dependent variables, with sample size, number of variables, and homogeneous correlation as cofactors. Additionally, distribution type served as a between-subjects factor defining two independent groups in the combined model.</p>
      <p>As a result, the repeated measures ANOVA results must be interpreted with caution. For this reason, nonparametric tests—specifically the Friedman test and pairwise Conover comparisons—were used. Moreover, Pearson correlations between statistical power and the cofactors were computed, with confidence intervals and significance levels estimated using the bootstrap BCa method. Confidence intervals including zero (<italic>i.e</italic>., with bounds of opposite signs) were interpreted as indicating non-significant correlations [<xref ref-type="bibr" rid="B43">43</xref>].</p>
      <p>In the multivariate t-distribution samples, the mean statistical power of the asymptotic K<sup>2</sup> variant was significantly higher than that of both the bootstrap variant and the H test. In the multivariate normal samples, the bootstrap variant showed the best performance, followed by the asymptotic K<sup>2</sup> variant, with the H test performing worst. In the combined sample, the asymptotic variant exhibited the highest mean power, followed by the bootstrap variant, while the H test had the lowest.</p>
      <p>Although the asymptotic variant of Mardia’s K<sup>2</sup> test consistently achieved higher mean power under non-normal conditions, this advantage must be interpreted in light of its comparatively weaker control of Type I error under multivariate normality. In hypothesis testing, higher power does not necessarily imply superior overall performance if it is accompanied by an inflated rejection rate when the null hypothesis is true [<xref ref-type="bibr" rid="B44">44</xref>]. In this sense, the bootstrap variant represents a more balanced compromise between sensitivity to departures from normality and adherence to the nominal significance level [<xref ref-type="bibr" rid="B7">7</xref>].</p>
      <p>This trade-off reflects a well-known distinction between asymptotic and resampling-based procedures [<xref ref-type="bibr" rid="B45">45</xref>]. Asymptotic tests tend to gain power more rapidly with increasing dimensionality and sample size, but may rely on distributional approximations that are less accurate in finite samples. In contrast, bootstrap calibration adapts the critical values to the empirical sampling distribution, improving Type I error control at the cost of a modest reduction in power under certain alternatives.</p>
      <p>Accordingly, the choice between the asymptotic and bootstrap variants of the K<sup>2</sup> test should be guided by the primary inferential goal. When sensitivity to non-normality is paramount and moderate deviations from the nominal Type I error rate are acceptable, the asymptotic variant may be preferred. Conversely, when strict control of the significance level is required—particularly in confirmatory analyses—the bootstrap variant provides a more robust and reliable option.</p>
      <p>In terms of hit rate (<italic>HR</italic> &gt; 0.90) and statistical power (high for rejection—ϕ ≥ 0.8—and low for retention—ϕ ≤ 0.2), Mardia’s test slightly outperformed the H test, consistent with Anis <italic>et al</italic>. [<xref ref-type="bibr" rid="B13">13</xref>], who emphasized Type II error and power. Specifically, the current power comparison found that the K<sup>2</sup> test outperformed the H test, particularly in its bootstrap variant. However, Moral [<xref ref-type="bibr" rid="B46">46</xref>] found that the asymptotic form of the K<sup>2</sup> test performed worse than both the H test and the chi-square approximation of the Q-test in a study involving a broader range of non-normal distributions and greater variability in sample sizes. Notably, this disadvantage was not observed for a multivariate t-distribution with 100 degrees of freedom, and the bootstrap variant of the K<sup>2</sup> test was not included in that study. Therefore, no inconsistency exists between the two findings.</p>
      <p>Analyses of within-subject effects in the repeated measures ANOVA and correlation results revealed an interaction between sample size and MVN test type. Larger sample sizes increased statistical power, with the bootstrap K<sup>2</sup> variant benefiting most from the increase—although the strength of association was moderate to low. In contrast, the H test benefited least.</p>
      <p>The number of variables also interacted with MVN test type. A larger number of variables increased statistical power, particularly for the asymptotic K² variant. The H test showed the smallest benefit, and its improvement was mainly evident in samples drawn from a multivariate normal distribution.</p>
      <p>Homogeneous correlation did not interact with MVN test type. The statistical power of both K<sup>2</sup> variants was entirely unaffected by homogeneous correlation among variables. Only in multivariate normal samples did higher correlations slightly enhance the H test’s power, with a small effect size.</p>
      <p>The invariance of Mardia’s tests—for multivariate skewness, kurtosis, and their combined K<sup>2</sup> statistic—across varying levels of homogeneous or heterogeneous correlation is due to their reliance on standardized central moments, which remove the effects of scale and correlation, rendering the K² statistic insensitive to correlation magnitude.</p>
      <p>When comparing the critical values obtained in this study (asymptotic and bootstrap) with those derived from Urzua’s Monte Carlo simulation [<xref ref-type="bibr" rid="B4">4</xref>], the latter were consistently lower—even compared to the asymptotic chi-square values (with 1 + <italic>k</italic>(<italic>k</italic> + 1)(<italic>k</italic> + 2)/6 degrees of freedom). This discrepancy arises from differences in simulation procedures. The present study conditioned simulations on both the empirical sample (original data) and a normative sample (perfectly symmetric and normally distributed), each consisting of correlated variables preserving the original correlational structure. This design preserved not only the number of variables and observations but also their distributional and correlational characteristics, thus more closely approximating the sampling distribution of the statistic.</p>
      <p>In contrast, Monte Carlo simulations—where <italic>b</italic> independent samples of size <italic>n</italic> are generated from a MVN distribution with zero correlation—condition the sampling distribution on an idealized scenario. While this minimizes error and leads to lower critical values, it sacrifices representativeness of empirical conditions [<xref ref-type="bibr" rid="B47">47</xref>].</p>
      <p>This study has several limitations related to the scope of a simulation design. Only two generating distributions were considered—a multivariate normal distribution under the null hypothesis and a symmetric but leptokurtic multivariate t distribution with five degrees of freedom—excluding other relevant families such as asymmetric, mesokurtic, or near-normal alternatives.</p>
      <p>In addition, the range of sample sizes was restricted, omitting very small and very large samples. The simulations were based on 400 parameter datasets spanning combinations of sample size, dimensionality, and homogeneous correlation, which allowed performance to be assessed on average but did not permit precise estimation of type-I error or power for specific parameter configurations; this limitation could be addressed in future work by generating multiple samples per condition.</p>
      <p>Finally, all datasets assumed homogeneous correlations, whereas extending the analysis to heterogeneous dependence structures—using, for example, random positive-definite covariance models—would substantially broaden the applicability of the results and provide a more comprehensive evaluation of the robustness of the bootstrap and asymptotic procedures.</p>
    </sec>
    <sec id="sec5">
      <title>5. Conclusions</title>
      <p>The developed R script enables the execution of Mardia’s K<sup>2</sup> test using either an asymptotic or bootstrap approach. Unlike previous studies, the bootstrap method focuses on the sample data while preserving its correlational structure. The script allows for the computation of the test statistic, its <italic>p</italic>-value, and statistical power, as well as the graphical representation of the sampling distribution of the K<sup>2</sup> statistic. Both variants of the Mardia test included in the script are complemented by Royston’s H test.</p>
      <p>A simulation study comparing two multivariate normality tests—one of which includes two variants—under two types of distributions (one favoring the null hypothesis and the other opposing it) revealed that no single MVN test uniformly dominates across all conditions. The H test performs best for retaining normality, whereas Mardia’s K<sup>2</sup> test—particularly its bootstrap variant—shows superior performance for detecting departures from normality. Consequently, the bootstrap K<sup>2</sup> test is recommended when the primary objective is sensitivity to non-normality, while acknowledging that its advantage depends on sample size, dimensionality, and the underlying distribution.</p>
      <p>The sample distribution resulting from the bootstrap approach is less peaked and exhibits a heavier right tail compared to the distribution obtained from the asymptotic approximation. This distribution corresponds to a generalized chi-square, which is typical of sums of squares of correlated normal variables.</p>
    </sec>
    <sec id="sec6">
      <title>6. Suggestions</title>
      <p>It is recommended to use the script (available as a Word file in a GitHub repository at <ext-link ext-link-type="uri" xlink:href="https://github.com/josemoraldelarubia579/R_script-for_MVN_K2_and_H_tests.git">https://github.com/josemoraldelarubia579/R_script-for_MVN_K2_and_H_tests.git</ext-link>) when testing for multivariate normality and to further investigate the bootstrap variant of Mardia’s K<sup>2</sup> test across various distributions, as well as in comparison with other tests, such as the Henze-Zirkler test [<xref ref-type="bibr" rid="B48">48</xref>], which is available in R [<xref ref-type="bibr" rid="B49">49</xref>].</p>
    </sec>
    <sec id="sec7">
      <title>Acknowledgements</title>
      <p>The author wishes to thank the reviewers and the editor for their valuable comments and corrections, which significantly enhanced the quality of the manuscript.</p>
    </sec>
    <sec id="sec8">
      <title>Appendix 1</title>
      <p># Asymptotic and bootstrap variants of Mardia’s MVN K<sup>2</sup> test for samples of at least 20 observations of k-tuples</p>
      <p># Define the multivariate random sample</p>
      <p>library(mvtnorm)</p>
      <p># Parameters</p>
      <p>n &lt;- 61</p>
      <p>k &lt;- 3</p>
      <p># Correlation matrix</p>
      <p>sigma &lt;- matrix(c(1, 0.8, 0.7,</p>
      <p>0.8, 1, 0.6,</p>
      <p>0.7, 0.6, 1),</p>
      <p>nrow = k, byrow = TRUE)</p>
      <p># Data simulation</p>
      <p>set.seed(4567)</p>
      <p>mvn_data &lt;- rmvnorm(n = n, sigma = sigma)</p>
      <p># Scaling and rounding</p>
      <p>x1 &lt;- round(20 * mvn_data[, 1] + 65, 0) # Mechanical reasoning D-scores</p>
      <p>x2 &lt;- round(20 * mvn_data[, 2] + 60, 0) # Spatial reasoning D-scores</p>
      <p>x3 &lt;- round(20 * mvn_data[, 3] + 68, 0) # Numerical reasoning D-scores</p>
      <p># Database</p>
      <p>original_data &lt;- data.frame(x1, x2, x3)</p>
      <p># print(original_data)</p>
      <p># R package required</p>
      <p>library(MVN)</p>
      <p># Sample size, number of variables, and average of the correlation among variables</p>
      <p>n &lt;- length(x1)</p>
      <p>cat("\nSample size: n =", n, ".\n")</p>
      <p>k &lt;- length(original_data)</p>
      <p>cat("Number of variables in the original sample: k = ", k, ".\n")</p>
      <p>alpha &lt;- 0.05</p>
      <p>cat("Significance level: α =", alpha, ".\n")</p>
      <p>COR &lt;- cor(original_data)</p>
      <p>LT_COR &lt;- COR[lower.tri(COR)]</p>
      <p>cat("Average of the correlation among variables: m(R) =", round(mean(LT_COR), 4), ".\n")</p>
      <p># Mardia's multivariate skewness (B1) and multivariate kurtosis (B2) tests</p>
      <p>mardia &lt;- mvn(original_data, subset = NULL, mvnTest = "mardia")</p>
      <p>skew &lt;- as.numeric(as.character(mardia$multivariateNormality$Statistic[<xref ref-type="bibr" rid="B1">1</xref>]))</p>
      <p>kurt &lt;- as.numeric(as.character(mardia$multivariateNormality$Statistic[<xref ref-type="bibr" rid="B2">2</xref>]))</p>
      <p>p_skew &lt;- as.numeric(as.character(mardia$multivariateNormality$p[<xref ref-type="bibr" rid="B1">1</xref>]))</p>
      <p>p_kurt &lt;- as.numeric(as.character(mardia$multivariateNormality$p[<xref ref-type="bibr" rid="B2">2</xref>]))</p>
      <p>cat("\nMardia’s multivariate skewness and kurtosis tests\n")</p>
      <p>cat("Mardia's multivariate skewness statistic: B1 =", round(skew, 4), ".\n")</p>
      <p>cat("Two-tailed p-value for the null hypothesis of multivariate symmetry: p =", round(p_skew, 4), ".\n")</p>
      <p>cat("Mardia's multivariate kurtosis statistic: B2 =", round(kurt, 4), ".\n")</p>
      <p>cat("Two-tailed p-value for the null hypothesis of multivariate mesokurtosis: p =", round(p_kurt, 4), ".\n")</p>
      <p># Mardia's MVN K² test</p>
      <p># Function to calculate K² statistic</p>
      <p>calculate_k2 &lt;- function(data) {</p>
      <p>mardia &lt;- mvn(data, mvnTest = "mardia")</p>
      <p>skew &lt;- as.numeric(as.character(mardia$multivariateNormality$Statistic[<xref ref-type="bibr" rid="B1">1</xref>]))</p>
      <p>kurt &lt;- as.numeric(as.character(mardia$multivariateNormality$Statistic[<xref ref-type="bibr" rid="B2">2</xref>]))</p>
      <p>p_skew &lt;- as.numeric(as.character(mardia$multivariateNormality$p[<xref ref-type="bibr" rid="B1">1</xref>]))</p>
      <p>p_kurt &lt;- as.numeric(as.character(mardia$multivariateNormality$p[<xref ref-type="bibr" rid="B2">2</xref>]))</p>
      <p>k2_stat &lt;- skew + kurt^2</p>
      <p>return(k2_stat)</p>
      <p>}</p>
      <p># Statistic observed in the original sample</p>
      <p>k2_stat &lt;- calculate_k2(original_data)</p>
      <p>df_mardia &lt;- 1 + (k * (k + 1) * (k + 2)) / 6</p>
      <p>p_val &lt;- pchisq(k2_stat, df = df_mardia, lower.tail = FALSE)</p>
      <p>asymp_crit_value &lt;- qchisq(1 - alpha, df = df_mardia, lower.tail = TRUE)</p>
      <p>asymp_power &lt;- 1 - pchisq(asymp_crit_value, df = df_mardia, ncp = k2_stat, lower.tail = TRUE)</p>
      <p># Interpretation from asymptotic approach</p>
      <p>cat("\nAsymptotic variant of the MVN K-square test\n")</p>
      <p>cat("Mardia's multivariate normality statistic: MVN K-square =", round(k2_stat, 4), ".\n")</p>
      <p>cat("Degrees of freedom: df =", df_mardia, ".\n")</p>
      <p>cat("Critical value:", 1-alpha, "_chi-sq[", df_mardia, "] =", round(asymp_crit_value, 4), ".\n")</p>
      <p>cat("Right-tailed p-value under a chi-square distribution with", df_mardia, "degrees of freedom: p_value =", round(p_val, 4), ".\n")</p>
      <p>if (p_val &lt; alpha) {</p>
      <p>cat("H_0 (multivariate normality) is rejected at the level", alpha, "based on the asymptotic p-value (chi-2 distribution).\n")</p>
      <p>} else {cat("H_0 (multivariate normality) is maintained at the level", alpha, " asymptotic p-value (chi-2 distribution).\n")}</p>
      <p>cat("Asymptotic statistical power for the K-square statistic: phi =", round(asymp_power, 4), ".\n")</p>
      <p>cat("\nIf the null hypothesis is rejected, the statistical power should exceed 0.5, preferably 0.8 or higher.\n")</p>
      <p>cat("If the null hypothesis is not rejected, the statistical power should be below 0.5, preferably under 0.2.\n")</p>
      <p>cat("Otherwise, the result is contradictory or questionable.\n")</p>
      <p># Normative sample (with the original correlations)</p>
      <p>p &lt;- seq(0.5 / n, 1 - 0.5 / n, length.out = n)</p>
      <p>independent_data &lt;- matrix(NA, nrow = n, ncol = k)</p>
      <p>set.seed(123)</p>
      <p>for (j in 1:k) {</p>
      <p>independent_data[, j] &lt;- qnorm(sample(p))} # Resampling with replacement by row</p>
      <p>cor_matrix &lt;- cor(original_data)</p>
      <p>chol_matrix &lt;- chol(cor_matrix)</p>
      <p>normative_data &lt;- independent_data %*% chol_matrix</p>
      <p># Bootstrap normative distribution</p>
      <p>B &lt;- 1000</p>
      <p>k2_boot_norm_vals &lt;- replicate(B, {</p>
      <p>indices &lt;- sample(1:n, size = n, replace = TRUE)</p>
      <p>calculate_k2(normative_data[indices, ])})</p>
      <p># Empirical bootstrap distribution. Resampling with replacement by row</p>
      <p>n &lt;- nrow(original_data)</p>
      <p>set.seed(123)</p>
      <p>k2_boot_vals &lt;- replicate(B, {</p>
      <p>indices &lt;- sample(1:n, size = n, replace = TRUE)</p>
      <p>boot_data &lt;- original_data[indices, ]</p>
      <p>calculate_k2(boot_data)})</p>
      <p># Bootstrap critical value, p-value, and power</p>
      <p>boot_crit_value &lt;- quantile(k2_boot_norm_vals, probs = 0.95, type = 8)</p>
      <p>boot_p_value &lt;- mean(k2_boot_norm_vals &gt;= k2_stat)</p>
      <p>boot_power &lt;- mean(k2_boot_vals &gt; boot_crit_value)</p>
      <p># Interpretation from bootstrap approach</p>
      <p>cat("\nBootstrap variant of the MVN K-square test\n")</p>
      <p>cat("Bootstrap critical value (0.95 quantile): q_boot(0.95) =", round(boot_crit_value, 4),".\n")</p>
      <p>cat("Bootstrap p-value: p_boot =", round(boot_p_value, 4), ".\n")</p>
      <p>if (boot_p_value &lt; alpha) {</p>
      <p>cat("H_0 (multivariate normality) is rejected at the level", alpha, "based on the bootstrap p-value.\n")</p>
      <p>} else {cat("H_0 (multivariate normality) is maintained at the level", alpha, "based on the bootstrap p-value.\n")}</p>
      <p>cat("Bootstrap power: phi_boot =", boot_power, ".\n")</p>
      <p># Representation of empirical and normative bootstrap sampling distributions</p>
      <p># Remove the preceding hashtag symbols to save as a JPEG or TIFF file</p>
      <p># jpeg("Hist_dens_curves.jpeg", width = 1200, height = 900, units = "px", res = 300)</p>
      <p># tiff("Hist_dens_curves.tiff", width = 1200, height = 900, units = "px", res = 300)</p>
      <p># par(mar = c(4.5, 4.5, 0.5, 0.5), cex.axis = 0.8)</p>
      <p># Green density curve of the chi-square distribution as the base graph</p>
      <p>x_seq &lt;- seq(0, max(qchisq(0.999, df = df_mardia), max(k2_boot_norm_vals) + 1, k2_stat + 1, max(k2_boot_vals) + 1), length.out = 1000)</p>
      <p>y_chi_sq &lt;- dchisq(x_seq, df = df_mardia)</p>
      <p>plot(x_seq, y_chi_sq, type = "l", lwd = 3, col = "darkgreen",</p>
      <p>main = "", xlab = "K_square_values", ylab = "Density",</p>
      <p>xlim = c(0, max(qchisq(0.999, df = df_mardia), max(k2_boot_norm_vals) + 1, k2_stat + 1, max(k2_boot_vals) + 1)),</p>
      <p>ylim = c(0, max(density(k2_boot_vals)$y, y_chi_sq)))</p>
      <p># Yellow histogram of the empirical bootstrap distribution of the K-square test statistic</p>
      <p>hist(k2_boot_vals, breaks = 20, freq = FALSE, col = rgb(1, 1, 0, 0.5), border = "yellow2", add = TRUE)</p>
      <p># Yellow density curve of the empirical bootstrap distribution of the K-square statistic</p>
      <p>lines(density(k2_boot_vals), col = "yellow", lwd = 2)</p>
      <p># Red density curve of the normative bootstrap distribution of the K-square statistic</p>
      <p>lines(density(k2_boot_norm_vals), col = "red", lwd = 2)</p>
      <p># Blue vertical line for the observed K-square statistic value</p>
      <p>abline(v = k2_stat, col = "darkblue", lwd = 2, lty = 2)</p>
      <p># Red vertical line for the normative bootstrap critical value</p>
      <p>abline(v = boot_crit_value, col = "red", lwd = 2, lty = 2)</p>
      <p># Green vertical line for the asymptotic critical value</p>
      <p>abline(v = qchisq(1-alpha, df = df_mardia), col = "darkgreen", lwd = 2, lty = 2)</p>
      <p># dev.off() # Remove the preceding hashtag symbol to save the figure</p>
      <p>cat("\nFigure. Histogram (yellow) with density curves for the empirical bootstrap (yellow), normative bootstrap (red), and chi-square (green) distributions. The value of the k-square statistic is indicated by a blue vertical line (k-square =", round(k2_stat, 4), "), the critical value of the normative bootstrap distribution by a red vertical line (", 1-alpha, "q_boot =", round(boot_crit_value, 4), "), and the critical value of the chi-square distribution with", df_mardia, "degrees of freedom by a green vertical line (", 1 - alpha, "Chi-square[", df_mardia, "] =", round(asymp_crit_value, 4), ").\n")</p>
    </sec>
    <sec id="sec9">
      <title>Appendix 2</title>
      <p># Royston’s multivariate normality H test for samples size from 10 to 2000 with k-tuples of measurements</p>
      <p># Load required R package</p>
      <p>library(MVN)</p>
      <p># List of original variables</p>
      <p>x1 &lt;- c(47, 97, 72, 112, 63, 79, 26, 66, 104, 68, 90, 85, 49, 64, 71, 73, 69, 96, 64, 65, 57, 91, 108, 45, 37, 45, 38, 71, 75, 46, 48, 61, 80, 93, 35, 70, 73, 72, 71, 111, 77, 56, 54, 43, 40, 67, 59, 29, 83, 64, 46, 72, 49, 44, 59, 65, 94, 48, 53, 66, 71)</p>
      <p>x2 &lt;- c(39, 97, 66, 106, 64, 71, 22, 61, 94, 61, 70, 79, 52, 51, 50, 68, 51, 102, 55, 58, 59, 83, 121, 41, 33, 58, 22, 70, 65, 57, 33, 65, 82, 75, 35, 49, 86, 44, 89, 88, 67, 36, 62, 55, 35, 73, 58, 50, 97, 39, 35, 85, 58, 35, 44, 40, 87, 43, 52, 78, 73)</p>
      <p>x3 &lt;- c(63, 108, 66, 108, 69, 89, 53, 79, 85, 64, 70, 61, 62, 71, 78, 63, 38, 68, 58, 69, 77, 99, 91, 53, 44, 73, 58, 73, 67, 44, 61, 85, 72, 60, 73, 88, 78, 49, 70, 88, 75, 54, 42, 47, 58, 71, 76, 23, 55, 84, 47, 87, 78, 46, 65, 78, 91, 47, 42, 73, 62)</p>
      <p>original_data &lt;- data.frame(x1, x2, x3)</p>
      <p>n &lt;- length(x1)</p>
      <p>k &lt;- length(original_data)</p>
      <p>alpha &lt;- 0.05</p>
      <p>result &lt;- mvn(original_data, subset = NULL, mvnTest = "royston")</p>
      <p># Statistical power for H test</p>
      <p>R &lt;- cor(original_data)</p>
      <p>lambda &lt;- 5</p>
      <p>mu &lt;- 0.715</p>
      <p>ln_n &lt;- log(n)</p>
      <p>v_n &lt;- 0.21364 + 0.015124 * ln_n^2 - 0.0018034 * ln_n^3</p>
      <p>r_ij &lt;- R[lower.tri(R)]</p>
      <p>c_ij &lt;- r_ij^lambda * (1 - (mu / v_n) * (1 - r_ij)^mu)</p>
      <p>c_bar &lt;- mean(c_ij)</p>
      <p>df_H &lt;- k / (1 + (k - 1) * c_bar)</p>
      <p>power_H &lt;- 1 - pchisq(qchisq(1-alpha, df = df_H), df = df_H, ncp = result$multivariateNormality$H, lower.tail = TRUE)</p>
      <p># Display results</p>
      <p>cat("\nSample size: n =", n, ".\n")</p>
      <p>cat("Number of variables in the original sample: k =", k, ".\n")</p>
      <p>cat("Arithmetic mean of the Pearson's product-moment correlation coefficients between the variables: m(R) =", round(mean(r_ij), 4), ".\n")</p>
      <p>cat("\nRoyston's multivariate normality H test:\n")</p>
      <p>cat("H statistic: H =", round(result$multivariateNormality$H, 4), ".\n")</p>
      <p>cat("Degrees of freedom: df =", round(df_H, 4), ".\n")</p>
      <p>cat("p-value for the null hypothesis of multivariate normality: p =", round(result$multivariateNormality$p, 4), ".\n")</p>
      <p>cat("Statistical power of the H test at a significance level of", alpha, ": phi =", round(power_H, 4), ".\n")</p>
      <p>cat("\nIf the null hypothesis is rejected, the statistical power should exceed 0.5, preferably 0.8 or higher.\n")</p>
      <p>cat("If the null hypothesis is not rejected, the statistical power should be below 0.5, preferably under 0.2.\n")</p>
    </sec>
    <sec id="sec10">
      <title>Appendix 3. Dataset for Example 2</title>
      <p># List of original variables</p>
      <p>x1 &lt;- c(4, 3, 4, 5, 5, 4, 3, 6, 5, 5, 5, 5, 7, 6, 6, 6, 6, 5, 6, 6, 7, 7, 6, 7, 6, 6, 6, 6, 7, 6, 6, 7, 7, 7, 7, 6, 7, 5, 7, 7, 7, 7, 6, 7, 6, 7, 7, 3, 1, 4, 6, 4, 4, 5, 3, 3, 3, 4, 4, 3, 5, 4, 4, 4, 4, 5, 5, 4, 4, 5, 5, 5, 5, 4, 5, 5, 5, 5, 5, 4, 5, 5, 4, 6, 5, 5, 6, 6, 4, 6, 6, 6, 6, 5, 6, 6, 6, 6, 6, 5, 5, 6, 6, 5, 6, 6, 6, 6, 5, 5, 6, 6, 5, 5, 6, 6, 5, 7, 6, 6, 6, 6, 6, 6, 6, 6, 6, 6, 6, 6, 6, 5, 6, 7, 7, 7, 6, 6, 6, 5, 6, 6, 6, 6, 7, 6, 7, 6, 6, 6, 6, 5, 5, 6, 7, 7, 6, 6, 6, 6, 6, 6, 6, 6, 6, 6, 7, 7, 7, 6, 6, 6, 7, 6, 7, 7, 6, 7, 6, 6, 7, 7, 7, 4, 4, 4, 5, 5, 6, 5, 6, 6, 7, 6, 6, 6, 6, 6, 7, 7, 7, 7, 7, 6, 4, 4, 5, 5, 5, 6, 1, 5, 5, 3, 6, 7, 7, 7, 6, 7, 6, 6, 6, 6, 7, 7, 7, 4, 5, 6, 7, 6, 7, 7, 3, 6, 7, 6, 7, 7, 6, 3, 3, 5, 6, 7, 5, 5, 3, 6, 6, 6, 7, 2, 5, 6, 6, 4, 5, 6, 6, 4, 6, 7, 7, 7, 7, 7, 6, 6, 6, 6, 3, 3, 5, 6, 6, 3, 4, 6, 6, 7, 5, 6, 6, 6, 4, 5, 7, 7, 7, 4, 4, 7, 4, 5, 3, 4, 7, 6, 5, 6, 5, 4, 4, 5, 6, 5, 5, 6, 7, 7, 2, 2, 5, 5, 5, 6, 6, 6, 4, 4, 4, 5, 5, 3, 3, 5, 7, 6, 3, 5, 6, 7, 4, 4, 4, 5, 6, 5, 6, 6, 6, 6, 6, 6, 6, 5, 6, 6, 6, 7, 1, 6, 5, 6, 5, 6, 6, 7, 7, 5, 4, 4, 5, 5, 6, 6, 7, 4, 5, 4, 5, 5, 5, 5, 5, 4, 4, 6, 5, 6, 6, 5, 6, 6, 6, 6, 6, 5, 6, 7, 7, 6, 6, 5, 7, 5, 6, 6, 6, 6, 6, 6, 5, 7, 6, 6, 6, 6, 7, 7, 7, 6, 7, 7, 6, 6, 6, 7, 6, 6, 7, 7, 7, 7, 7, 7, 7, 6, 7, 7, 7, 7, 7, 6, 7, 7, 7, 7, 7, 7, 7, 7, 6, 7, 6, 4, 4, 5, 5, 5, 7, 5, 6, 6, 6, 7, 6, 5, 6, 6, 5, 7, 7, 6, 6, 5, 6, 7, 5, 7, 6, 5, 5, 4, 6, 4, 6, 5, 6, 5, 6, 5, 7, 6, 5, 6, 7, 6, 5, 4, 6, 7, 6, 2, 5, 3, 5, 5, 6, 5, 3, 5, 3, 6, 4, 6, 6)</p>
      <p>x2 &lt;- c(3, 2, 3, 4, 3, 4, 4, 5, 4, 5, 4, 5, 6, 4, 5, 5, 5, 5, 6, 5, 6, 6, 7, 5, 5, 5, 7, 6, 6, 6, 6, 6, 6, 7, 7, 6, 7, 6, 7, 6, 7, 7, 6, 6, 6, 6, 7, 2, 1, 3, 1, 3, 2, 2, 4, 3, 3, 3, 4, 4, 4, 3, 4, 4, 4, 4, 4, 5, 4, 4, 4, 4, 5, 5, 5, 4, 5, 5, 4, 6, 6, 5, 5, 5, 5, 5, 4, 4, 4, 6, 5, 6, 6, 5, 6, 6, 5, 5, 5, 5, 4, 4, 5, 7, 5, 6, 5, 6, 5, 5, 6, 6, 4, 6, 4, 5, 5, 7, 4, 6, 6, 6, 5, 5, 5, 7, 4, 6, 6, 6, 5, 6, 4, 7, 7, 5, 7, 6, 5, 6, 5, 5, 6, 5, 7, 6, 4, 6, 5, 5, 5, 6, 6, 5, 5, 5, 6, 6, 6, 5, 5, 6, 6, 6, 6, 6, 7, 6, 6, 7, 7, 7, 4, 7, 6, 6, 7, 7, 7, 7, 7, 7, 7, 3, 3, 3, 4, 4, 4, 4, 6, 5, 5, 5, 6, 6, 6, 6, 6, 7, 6, 6, 7, 4, 4, 5, 5, 5, 4, 5, 7, 6, 5, 6, 5, 7, 7, 7, 6, 7, 7, 6, 6, 7, 6, 7, 7, 4, 6, 7, 7, 7, 7, 6, 1, 5, 6, 4, 7, 7, 7, 2, 2, 4, 5, 7, 5, 4, 3, 4, 7, 5, 7, 2, 6, 5, 6, 3, 4, 6, 6, 4, 6, 6, 7, 4, 6, 7, 6, 5, 4, 6, 4, 3, 5, 4, 6, 2, 4, 6, 6, 6, 4, 6, 5, 7, 4, 5, 7, 5, 7, 4, 4, 6, 4, 4, 3, 4, 7, 7, 5, 7, 4, 4, 5, 4, 6, 4, 5, 7, 7, 2, 2, 3, 4, 3, 6, 6, 5, 5, 3, 4, 4, 4, 4, 5, 5, 5, 7, 7, 3, 4, 6, 7, 4, 4, 4, 4, 5, 6, 5, 6, 6, 5, 5, 5, 5, 5, 6, 5, 7, 7, 6, 6, 5, 5, 5, 4, 6, 7, 6, 4, 3, 3, 5, 5, 4, 6, 7, 4, 4, 4, 5, 4, 4, 6, 5, 4, 4, 5, 5, 5, 5, 5, 6, 5, 6, 6, 5, 5, 6, 5, 6, 6, 5, 6, 6, 6, 6, 6, 4, 4, 6, 7, 6, 6, 4, 5, 6, 7, 7, 6, 6, 6, 7, 6, 7, 7, 7, 7, 6, 6, 7, 7, 6, 5, 7, 7, 6, 6, 6, 7, 7, 7, 6, 7, 7, 7, 7, 7, 7, 7, 7, 7, 4, 4, 7, 4, 5, 6, 4, 5, 7, 6, 4, 6, 6, 7, 7, 6, 6, 5, 6, 7, 6, 5, 6, 5, 4, 6, 5, 5, 6, 4, 5, 7, 6, 7, 6, 5, 5, 5, 5, 5, 7, 6, 5, 5, 4, 5, 5, 4, 5, 6, 6, 3, 4, 2, 5, 5, 4, 4, 3, 4, 3, 5, 5, 5, 7)</p>
      <p>x3 &lt;- c(4, 2, 3, 4, 4, 2, 4, 2, 4, 4, 3, 4, 4, 4, 6, 5, 6, 3, 5, 5, 5, 5, 5, 6, 4, 6, 6, 7, 6, 5, 6, 6, 5, 6, 6, 6, 7, 6, 6, 6, 7, 7, 6, 7, 6, 7, 6, 1, 1, 2, 3, 2, 3, 4, 3, 3, 3, 4, 3, 3, 4, 4, 4, 3, 4, 4, 4, 4, 4, 4, 4, 4, 4, 6, 4, 4, 4, 4, 4, 6, 4, 4, 5, 5, 6, 5, 5, 4, 5, 5, 5, 5, 4, 6, 5, 5, 5, 4, 4, 5, 5, 5, 4, 4, 3, 6, 4, 5, 5, 5, 5, 5, 6, 6, 5, 6, 6, 7, 5, 6, 6, 5, 5, 5, 5, 5, 6, 6, 5, 5, 5, 5, 6, 7, 7, 6, 4, 5, 6, 5, 6, 6, 6, 6, 7, 5, 6, 6, 6, 5, 5, 5, 5, 6, 5, 5, 6, 6, 6, 7, 6, 6, 6, 5, 5, 5, 7, 7, 6, 6, 6, 6, 7, 6, 7, 7, 6, 7, 7, 7, 7, 7, 7, 3, 4, 4, 3, 5, 4, 5, 3, 4, 5, 5, 5, 5, 5, 5, 6, 5, 6, 6, 7, 2, 4, 4, 5, 5, 5, 6, 6, 6, 5, 5, 5, 7, 7, 7, 6, 5, 4, 5, 6, 5, 6, 7, 7, 4, 7, 7, 5, 6, 6, 6, 3, 5, 6, 6, 5, 6, 7, 1, 2, 3, 5, 5, 4, 5, 3, 5, 5, 6, 7, 2, 6, 5, 5, 4, 3, 4, 7, 2, 3, 7, 7, 2, 6, 6, 5, 5, 6, 7, 3, 5, 4, 2, 5, 2, 3, 5, 6, 4, 4, 5, 6, 7, 4, 5, 7, 6, 7, 4, 4, 6, 4, 4, 3, 1, 6, 7, 4, 6, 3, 6, 4, 4, 5, 5, 4, 6, 7, 2, 1, 3, 3, 2, 5, 5, 6, 7, 5, 4, 4, 4, 5, 5, 5, 4, 6, 7, 5, 5, 4, 7, 3, 3, 5, 6, 7, 6, 5, 5, 6, 4, 5, 5, 6, 6, 6, 6, 7, 7, 6, 4, 5, 6, 5, 6, 5, 6, 7, 6, 1, 3, 5, 4, 5, 7, 7, 3, 3, 5, 4, 5, 3, 4, 5, 4, 6, 5, 4, 5, 4, 4, 5, 4, 6, 5, 5, 4, 5, 5, 7, 4, 4, 5, 4, 4, 5, 6, 6, 6, 6, 6, 6, 5, 6, 6, 6, 5, 6, 6, 5, 6, 5, 6, 5, 6, 6, 7, 6, 7, 6, 6, 7, 7, 7, 5, 6, 7, 7, 6, 6, 6, 7, 7, 7, 7, 7, 7, 7, 7, 7, 7, 4, 5, 7, 2, 5, 5, 5, 5, 6, 5, 7, 5, 5, 5, 6, 5, 7, 6, 5, 6, 6, 5, 5, 4, 4, 6, 5, 6, 6, 5, 3, 5, 2, 5, 5, 4, 6, 5, 1, 5, 7, 6, 5, 5, 6, 5, 5, 1, 6, 6, 6, 3, 5, 3, 4, 4, 6, 5, 3, 6, 7, 5, 6, 5, 6)</p>
      <p>x4 &lt;- c(2, 7, 5, 3, 4, 6, 5, 4, 4, 4, 6, 4, 1, 5, 2, 3, 2, 6, 2, 3, 1, 1, 1, 1, 4, 3, 1, 1, 1, 3, 2, 1, 3, 1, 1, 4, 1, 5, 2, 3, 1, 1, 5, 3, 6, 5, 7, 2, 7, 2, 1, 2, 3, 1, 3, 4, 4, 3, 3, 5, 2, 4, 4, 5, 4, 3, 3, 4, 5, 4, 4, 4, 4, 3, 4, 5, 4, 4, 5, 2, 4, 5, 5, 3, 3, 4, 4, 5, 6, 2, 3, 3, 4, 4, 3, 3, 4, 5, 5, 5, 6, 5, 6, 5, 7, 3, 6, 4, 6, 6, 4, 4, 6, 4, 6, 4, 5, 1, 7, 4, 4, 5, 6, 6, 6, 4, 6, 4, 5, 5, 6, 6, 6, 1, 1, 4, 5, 5, 5, 6, 6, 6, 5, 6, 2, 6, 6, 5, 6, 7, 7, 7, 7, 7, 7, 7, 6, 6, 6, 6, 7, 6, 6, 7, 7, 7, 4, 5, 6, 6, 6, 6, 7, 6, 6, 6, 7, 6, 7, 7, 7, 7, 7, 3, 3, 5, 4, 3, 4, 5, 4, 5, 4, 5, 6, 6, 6, 7, 7, 7, 7, 7, 7, 2, 4, 4, 2, 3, 5, 3, 6, 4, 6, 7, 5, 1, 1, 2, 6, 5, 7, 7, 6, 7, 7, 7, 7, 2, 6, 6, 7, 7, 7, 6, 1, 3, 7, 7, 6, 7, 2, 1, 1, 3, 3, 4, 4, 2, 3, 4, 6, 7, 7, 1, 6, 7, 5, 5, 3, 4, 4, 3, 6, 6, 5, 3, 6, 6, 5, 6, 6, 2, 3, 4, 3, 1, 6, 1, 4, 6, 7, 4, 5, 3, 6, 6, 4, 7, 7, 7, 6, 4, 4, 7, 4, 4, 2, 4, 2, 6, 2, 2, 3, 3, 4, 4, 3, 6, 7, 3, 4, 1, 1, 2, 3, 5, 2, 1, 3, 5, 2, 4, 4, 4, 3, 5, 5, 4, 2, 6, 5, 2, 3, 2, 3, 3, 3, 2, 1, 3, 4, 3, 3, 6, 6, 6, 6, 7, 5, 6, 7, 7, 6, 4, 6, 5, 2, 6, 6, 7, 7, 5, 4, 5, 3, 6, 5, 1, 1, 3, 3, 4, 3, 4, 6, 3, 3, 6, 5, 3, 5, 4, 5, 6, 3, 5, 2, 3, 4, 6, 3, 3, 1, 5, 6, 5, 4, 6, 5, 4, 6, 6, 5, 4, 6, 5, 7, 7, 6, 6, 4, 5, 6, 6, 5, 5, 6, 6, 6, 4, 7, 7, 6, 6, 6, 7, 5, 7, 7, 7, 7, 7, 7, 7, 7, 7, 6, 7, 7, 7, 7, 7, 7, 7, 7, 7, 7, 7, 5, 3, 5, 4, 1, 5, 4, 6, 6, 6, 4, 3, 6, 6, 1, 1, 7, 3, 6, 7, 4, 7, 7, 6, 6, 4, 5, 6, 3, 6, 5, 4, 3, 6, 2, 4, 1, 6, 5, 3, 4, 4, 3, 4, 4, 4, 3, 2, 3, 6, 4, 6, 6, 5, 4, 2, 7, 6, 4, 6, 6)</p>
      <p># Database</p>
      <p>original_data &lt;- data.frame(x1, x2, x3, x4)</p>
    </sec>
    <sec id="sec11">
      <title>Appendix 4. Generation of MVT and MVN samples</title>
      <p># Sample of 50 observations in two variables, with a correlation coefficient of 0, drawn from a multivariate t-distribution with five degrees of freedom.</p>
      <p>library(mvtnorm)</p>
      <p>n &lt;- 50 # Number of participants</p>
      <p>k &lt;- 2 # Number of variables</p>
      <p>df &lt;- 5 # Degrees of freedom for MVT distribution</p>
      <p>rho &lt;- 0 # Homogeneous correlation</p>
      <p>sigma &lt;- matrix(rho, nrow = k, ncol = k) # Correlation matrix</p>
      <p>diag(sigma) &lt;- 1 # Correlation matrix</p>
      <p>set.seed(123) # Seed is set for reproducibility</p>
      <p>mvt_data_2v &lt;- rmvt(n = n, sigma = sigma, df = df)</p>
      <p>x1 &lt;- mvt_data_2v[,1]</p>
      <p>x2 &lt;- mvt_data_2v[,2]</p>
      <p>original_data &lt;- data.frame(x1, x2) # sample data</p>
      <p>print(original_data)</p>
      <p># Sample of 50 observations in two variables, with a correlation coefficient of 0, drawn from a multivariate standard normal distribution.</p>
      <p>library(mvtnorm)</p>
      <p>n &lt;- 50 # Number of observations</p>
      <p>k &lt;- 2 # Number of variables</p>
      <p>rho &lt;- 0 # Homogeneous correlation</p>
      <p>sigma &lt;- matrix(rho, nrow = k, ncol = k) # Correlation matrix (compound symmetry)</p>
      <p>diag(sigma) &lt;- 1 # Correlation matrix</p>
      <p>set.seed(123) # Seed is set for reproducibility</p>
      <p>mvn_data &lt;- rmvnorm(n = n, sigma = sigma)</p>
      <p>x1 &lt;- mvn_data[,1]</p>
      <p>x2 &lt;- mvn_data[,2]</p>
      <p>original_data &lt;- data.frame(x1, x2)</p>
      <p>print(original_data)</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <title>References</title>
      <ref id="B1">
        <label>1.</label>
        <citation-alternatives>
          <mixed-citation publication-type="other">Mardia, K.V. (1970) Measures of Multivariate Skewness and Kurtosis with Applications. <italic>Biometrika</italic>, 57, 519-530. https://doi.org/10.2307/2334770 <pub-id pub-id-type="doi">10.2307/2334770</pub-id><ext-link ext-link-type="uri" xlink:href="https://doi.org/10.2307/2334770">https://doi.org/10.2307/2334770</ext-link></mixed-citation>
          <element-citation publication-type="other">
            <person-group person-group-type="author">
              <string-name>Mardia, K.V.</string-name>
            </person-group>
            <year>1970</year>
            <article-title>Measures of Multivariate Skewness and Kurtosis with Applications</article-title>
            <source>Biometrika</source>
            <volume>57</volume>
            <pub-id pub-id-type="doi">10.2307/2334770</pub-id>
          </element-citation>
        </citation-alternatives>
      </ref>
      <ref id="B2">
        <label>2.</label>
        <citation-alternatives>
          <mixed-citation publication-type="journal">Mardia, K.V. (1974) Applications of Some Measures of Multivariate Skewness and Kurtosis in Testing Normality and Robustness Studies. <italic>Sankhya</italic>: <italic>The Indian Journal of Statistics</italic>, <italic>Series B</italic> (1960-2002), 36, 115-128. https://www.jstor.org/stable/25051892</mixed-citation>
          <element-citation publication-type="journal">
            <person-group person-group-type="author">
              <string-name>Mardia, K.V.</string-name>
              <string-name>Statistics, S</string-name>
            </person-group>
            <year>1974</year>
            <article-title>Applications of Some Measures of Multivariate Skewness and Kurtosis in Testing Normality and Robustness Studies</article-title>
            <source>Sankhya: The Indian Journal of Statistics</source>
            <volume>36</volume>
          </element-citation>
        </citation-alternatives>
      </ref>
      <ref id="B3">
        <label>3.</label>
        <citation-alternatives>
          <mixed-citation publication-type="journal">Mardia, K.V. (1980) 9 Tests of Unvariate and Multivariate Normality. <italic>Handbook</italic><italic>of</italic><italic>Statistics</italic>, 1, 279-320. https://doi.org/10.1016/s0169-7161(80)01011-5 <pub-id pub-id-type="doi">10.1016/s0169-7161(80)01011-5</pub-id><ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1016/s0169-7161(80)01011-5">https://doi.org/10.1016/s0169-7161(80)01011-5</ext-link></mixed-citation>
          <element-citation publication-type="journal">
            <person-group person-group-type="author">
              <string-name>Mardia, K.V.</string-name>
            </person-group>
            <year>1980</year>
            <article-title>9 Tests of Unvariate and Multivariate Normality</article-title>
            <source>Handbook of Statistics</source>
            <volume>7161</volume>
            <issue>80</issue>
            <pub-id pub-id-type="doi">10.1016/s0169-7161(80)01011-5</pub-id>
          </element-citation>
        </citation-alternatives>
      </ref>
      <ref id="B4">
        <label>4.</label>
        <citation-alternatives>
          <mixed-citation publication-type="journal">Urzua, C.M. (1997) Omnibus Tests for Multivariate Normality Based on a Class of Maximum Entropy Distributions. In: Fomby, T.B. and Hill, R.C., Eds., <italic>Applying</italic><italic>Maximum</italic><italic>Entropy</italic><italic>to</italic><italic>Econometric</italic><italic>Problems</italic>, Emerald Group Publishing Limited, 341-358. https://doi.org/10.1108/s0731-9053(1997)0000012016 <pub-id pub-id-type="doi">10.1108/s0731-9053(1997)0000012016</pub-id><ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1108/s0731-9053(1997)0000012016">https://doi.org/10.1108/s0731-9053(1997)0000012016</ext-link></mixed-citation>
          <element-citation publication-type="journal">
            <person-group person-group-type="author">
              <string-name>Urzua, C.M.</string-name>
              <string-name>Fomby, T.B.</string-name>
              <string-name>Hill, R.C.</string-name>
              <string-name>Problems, E</string-name>
            </person-group>
            <year>1997</year>
            <article-title>Omnibus Tests for Multivariate Normality Based on a Class of Maximum Entropy Distributions</article-title>
            <source>In: Fomby</source>
            <volume>9053</volume>
            <issue>1997</issue>
            <pub-id pub-id-type="doi">10.1108/s0731-9053(1997)0000012016</pub-id>
          </element-citation>
        </citation-alternatives>
      </ref>
      <ref id="B5">
        <label>5.</label>
        <citation-alternatives>
          <mixed-citation publication-type="journal">Koizumi, K., Okamoto, N. and Seo, T. (2009) On Jarque-Bera Tests for Assessing Multivariate Normality. <italic>Journal of Statistics Advances in Theory and Applications</italic>, 1, 207-220.</mixed-citation>
          <element-citation publication-type="journal">
            <person-group person-group-type="author">
              <string-name>Koizumi, K.</string-name>
              <string-name>Okamoto, N.</string-name>
              <string-name>Seo, T.</string-name>
            </person-group>
            <year>2009</year>
            <article-title>On Jarque-Bera Tests for Assessing Multivariate Normality</article-title>
            <source>Journal of Statistics Advances in Theory and Applications</source>
            <volume>1</volume>
          </element-citation>
        </citation-alternatives>
      </ref>
      <ref id="B6">
        <label>6.</label>
        <citation-alternatives>
          <mixed-citation publication-type="journal">Das, A. (2025) New Methods to Compute the Generalized Chi-Square Distribution. <italic>Journal</italic><italic>of</italic><italic>Statistical</italic><italic>Computation</italic><italic>and</italic><italic>Simulation</italic>, 95, 2608-2642. https://doi.org/10.1080/00949655.2025.2501401 <pub-id pub-id-type="doi">10.1080/00949655.2025.2501401</pub-id><ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1080/00949655.2025.2501401">https://doi.org/10.1080/00949655.2025.2501401</ext-link></mixed-citation>
          <element-citation publication-type="journal">
            <person-group person-group-type="author">
              <string-name>Das, A.</string-name>
            </person-group>
            <year>2025</year>
            <article-title>New Methods to Compute the Generalized Chi-Square Distribution</article-title>
            <source>Journal of Statistical Computation and Simulation</source>
            <volume>95</volume>
            <pub-id pub-id-type="doi">10.1080/00949655.2025.2501401</pub-id>
          </element-citation>
        </citation-alternatives>
      </ref>
      <ref id="B7">
        <label>7.</label>
        <citation-alternatives>
          <mixed-citation publication-type="other">Rousselet, G., Pernet, C.R. and Wilcox, R.R. (2023) An Introduction to the Bootstrap: A Versatile Method to Make Inferences by Using Data-Driven Simulations. <italic>Meta-Psychology</italic>, 7, 1-24. https://doi.org/10.15626/mp.2019.2058 <pub-id pub-id-type="doi">10.15626/mp.2019.2058</pub-id><ext-link ext-link-type="uri" xlink:href="https://doi.org/10.15626/mp.2019.2058">https://doi.org/10.15626/mp.2019.2058</ext-link></mixed-citation>
          <element-citation publication-type="other">
            <person-group person-group-type="author">
              <string-name>Rousselet, G.</string-name>
              <string-name>Pernet, C.R.</string-name>
              <string-name>Wilcox, R.R.</string-name>
            </person-group>
            <year>2023</year>
            <article-title>An Introduction to the Bootstrap: A Versatile Method to Make Inferences by Using Data-Driven Simulations</article-title>
            <source>Meta-Psychology</source>
            <volume>7</volume>
            <pub-id pub-id-type="doi">10.15626/mp.2019.2058</pub-id>
          </element-citation>
        </citation-alternatives>
      </ref>
      <ref id="B8">
        <label>8.</label>
        <citation-alternatives>
          <mixed-citation publication-type="web">R Core Team (2025) R: A language and Environment for Statistical Computing. R Foundation for Statistical Computing. https://www.R-project.org/</mixed-citation>
          <element-citation publication-type="web">
            <year>2025</year>
            <article-title>R: A language and Environment for Statistical Computing</article-title>
          </element-citation>
        </citation-alternatives>
      </ref>
      <ref id="B9">
        <label>9.</label>
        <citation-alternatives>
          <mixed-citation publication-type="other">Royston, J.P. (1983) Some Techniques for Assessing Multivarate Normality Based on the Shapiro-Wilk W. <italic>Applied</italic><italic>Statistics</italic>, 32, 121-133. https://doi.org/10.2307/2347291 <pub-id pub-id-type="doi">10.2307/2347291</pub-id><ext-link ext-link-type="uri" xlink:href="https://doi.org/10.2307/2347291">https://doi.org/10.2307/2347291</ext-link></mixed-citation>
          <element-citation publication-type="other">
            <person-group person-group-type="author">
              <string-name>Royston, J.P.</string-name>
            </person-group>
            <year>1983</year>
            <article-title>Some Techniques for Assessing Multivarate Normality Based on the Shapiro-Wilk W</article-title>
            <source>Applied Statistics</source>
            <volume>32</volume>
            <pub-id pub-id-type="doi">10.2307/2347291</pub-id>
          </element-citation>
        </citation-alternatives>
      </ref>
      <ref id="B10">
        <label>10.</label>
        <citation-alternatives>
          <mixed-citation publication-type="book">Braun, W.J. and Murdoch, D.J. (2021) A First Course in Statistical Programming with R. 3rd Edition, Cambridge University Press. https://doi.org/10.1017/9781108993456 <pub-id pub-id-type="doi">10.1017/9781108993456</pub-id><ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1017/9781108993456">https://doi.org/10.1017/9781108993456</ext-link></mixed-citation>
          <element-citation publication-type="book">
            <person-group person-group-type="author">
              <string-name>Braun, W.J.</string-name>
              <string-name>Murdoch, D.J.</string-name>
              <string-name>Edition, C</string-name>
            </person-group>
            <year>2021</year>
            <article-title>A First Course in Statistical Programming with R</article-title>
            <source>3rd Edition</source>
            <pub-id pub-id-type="doi">10.1017/9781108993456</pub-id>
          </element-citation>
        </citation-alternatives>
      </ref>
      <ref id="B11">
        <label>11.</label>
        <citation-alternatives>
          <mixed-citation publication-type="other">Giorgi, F.M., Ceraolo, C. and Mercatelli, D. (2022) The R Language: An Engine for Bioinformatics and Data Science. <italic>Life</italic>, 12, Article 648. https://doi.org/10.3390/life12050648 <pub-id pub-id-type="doi">10.3390/life12050648</pub-id><pub-id pub-id-type="pmid">35629316</pub-id><ext-link ext-link-type="uri" xlink:href="https://doi.org/10.3390/life12050648">https://doi.org/10.3390/life12050648</ext-link></mixed-citation>
          <element-citation publication-type="other">
            <person-group person-group-type="author">
              <string-name>Giorgi, F.M.</string-name>
              <string-name>Ceraolo, C.</string-name>
              <string-name>Mercatelli, D.</string-name>
            </person-group>
            <year>2022</year>
            <article-title>The R Language: An Engine for Bioinformatics and Data Science</article-title>
            <source>Life</source>
            <volume>12</volume>
            <elocation-id>648</elocation-id>
            <pub-id pub-id-type="doi">10.3390/life12050648</pub-id>
            <pub-id pub-id-type="pmid">35629316</pub-id>
          </element-citation>
        </citation-alternatives>
      </ref>
      <ref id="B12">
        <label>12.</label>
        <citation-alternatives>
          <mixed-citation publication-type="other">Jobst, L.J., Bader, M. and Moshagen, M. (2023) A Tutorial on Assessing Statistical Power and Determining Sample Size for Structural Equation Models. <italic>Psychological</italic><italic>Methods</italic>, 28, 207-221. https://doi.org/10.1037/met0000423 <pub-id pub-id-type="doi">10.1037/met0000423</pub-id><pub-id pub-id-type="pmid">34672644</pub-id><ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1037/met0000423">https://doi.org/10.1037/met0000423</ext-link></mixed-citation>
          <element-citation publication-type="other">
            <person-group person-group-type="author">
              <string-name>Jobst, L.J.</string-name>
              <string-name>Bader, M.</string-name>
              <string-name>Moshagen, M.</string-name>
            </person-group>
            <year>2023</year>
            <article-title>A Tutorial on Assessing Statistical Power and Determining Sample Size for Structural Equation Models</article-title>
            <source>Psychological Methods</source>
            <volume>28</volume>
            <pub-id pub-id-type="doi">10.1037/met0000423</pub-id>
            <pub-id pub-id-type="pmid">34672644</pub-id>
          </element-citation>
        </citation-alternatives>
      </ref>
      <ref id="B13">
        <label>13.</label>
        <citation-alternatives>
          <mixed-citation publication-type="other">Anis, W., Kuntoro, K. and Melaniani, S. (2021) Difference of Power Test and Type II Error ( <italic>β</italic>) on Mardia MVN Test, Henze Zikler’s MVN Test, And Royston’s MVN Test Using Multivariate Data Analysis. <italic>Jurnal</italic><italic>Biometrika</italic><italic>dan</italic><italic>Kependudukan</italic>, 10, 153-161. https://doi.org/10.20473/jbk.v10i2.2021.153-161 <pub-id pub-id-type="doi">10.20473/jbk.v10i2.2021.153-161</pub-id><ext-link ext-link-type="uri" xlink:href="https://doi.org/10.20473/jbk.v10i2.2021.153-161">https://doi.org/10.20473/jbk.v10i2.2021.153-161</ext-link></mixed-citation>
          <element-citation publication-type="other">
            <person-group person-group-type="author">
              <string-name>Anis, W.</string-name>
              <string-name>Kuntoro, K.</string-name>
              <string-name>Melaniani, S.</string-name>
              <string-name>Test, H</string-name>
              <string-name>Test, A</string-name>
            </person-group>
            <year>2021</year>
            <article-title>Difference of Power Test and Type II Error (β) on Mardia MVN Test, Henze Zikler’s MVN Test, And Royston’s MVN Test Using Multivariate Data Analysis</article-title>
            <source>Jurnal Biometrika dan Kependudukan</source>
            <volume>10</volume>
            <pub-id pub-id-type="doi">10.20473/jbk.v10i2.2021.153-161</pub-id>
          </element-citation>
        </citation-alternatives>
      </ref>
      <ref id="B14">
        <label>14.</label>
        <citation-alternatives>
          <mixed-citation publication-type="journal">Khatun, N. (2021) Applications of Normality Test in Statistical Analysis. <italic>Open</italic><italic>Journal</italic><italic>of</italic><italic>Statistics</italic>, 11, 113-122. https://doi.org/10.4236/ojs.2021.111006 <pub-id pub-id-type="doi">10.4236/ojs.2021.111006</pub-id><ext-link ext-link-type="uri" xlink:href="https://doi.org/10.4236/ojs.2021.111006">https://doi.org/10.4236/ojs.2021.111006</ext-link></mixed-citation>
          <element-citation publication-type="journal">
            <person-group person-group-type="author">
              <string-name>Khatun, N.</string-name>
            </person-group>
            <year>2021</year>
            <article-title>Applications of Normality Test in Statistical Analysis</article-title>
            <source>Open Journal of Statistics</source>
            <volume>11</volume>
            <pub-id pub-id-type="doi">10.4236/ojs.2021.111006</pub-id>
          </element-citation>
        </citation-alternatives>
      </ref>
      <ref id="B15">
        <label>15.</label>
        <citation-alternatives>
          <mixed-citation publication-type="confproc">Marchev Jr., A. and Marchev, V. (2023) Automated Algorithm for Multi-Variate Data Synthesis with Cholesky Decomposition. <italic>Proceedings</italic><italic>of</italic><italic>the</italic> 7 <italic>th</italic><italic>International</italic><italic>Conference</italic><italic>on</italic><italic>Algorithms</italic>, <italic>Computing</italic><italic>and</italic><italic>Systems</italic>, Larissa, 19-21 October 2023, 1-6. https://doi.org/10.1145/3631908.3631909 <pub-id pub-id-type="doi">10.1145/3631908.3631909</pub-id><ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1145/3631908.3631909">https://doi.org/10.1145/3631908.3631909</ext-link></mixed-citation>
          <element-citation publication-type="confproc">
            <person-group person-group-type="author">
              <string-name>Marchev, V.</string-name>
              <string-name>Algorithms, C</string-name>
              <string-name>Systems, L</string-name>
            </person-group>
            <year>2023</year>
            <article-title>Automated Algorithm for Multi-Variate Data Synthesis with Cholesky Decomposition</article-title>
            <source>Proceedings of the 7th International Conference on Algorithms</source>
            <volume>19</volume>
            <pub-id pub-id-type="doi">10.1145/3631908.3631909</pub-id>
          </element-citation>
        </citation-alternatives>
      </ref>
      <ref id="B16">
        <label>16.</label>
        <citation-alternatives>
          <mixed-citation publication-type="other">Genz, A., Bretz, F., Miwa, T., Mi, X., Leisch, F., Scheipl, F., Bornkamp, B., Maechler, M. and Hothorn, T. (2025) MVTNORM: Multivariate Normal and t Distributions (Version 1.3-3). https://doi.org/10.32614/CRAN.package.mvtnorm <pub-id pub-id-type="doi">10.32614/CRAN.package.mvtnorm</pub-id><ext-link ext-link-type="uri" xlink:href="https://doi.org/10.32614/CRAN.package.mvtnorm">https://doi.org/10.32614/CRAN.package.mvtnorm</ext-link></mixed-citation>
          <element-citation publication-type="other">
            <person-group person-group-type="author">
              <string-name>Genz, A.</string-name>
              <string-name>Bretz, F.</string-name>
              <string-name>Miwa, T.</string-name>
              <string-name>Mi, X.</string-name>
              <string-name>Leisch, F.</string-name>
              <string-name>Scheipl, F.</string-name>
              <string-name>Bornkamp, B.</string-name>
              <string-name>Maechler, M.</string-name>
              <string-name>Hothorn, T.</string-name>
            </person-group>
            <year>2025</year>
            <article-title>MVTNORM: Multivariate Normal and t Distributions (Version 1</article-title>
            <pub-id pub-id-type="doi">10.32614/CRAN.package.mvtnorm</pub-id>
          </element-citation>
        </citation-alternatives>
      </ref>
      <ref id="B17">
        <label>17.</label>
        <citation-alternatives>
          <mixed-citation publication-type="journal">Newcombe, R.G. (1998) Two-Sided Confidence Intervals for the Single Proportion: Comparison of Seven Methods. <italic>Statistics</italic><italic>in</italic><italic>Medicine</italic>, 17, 857-872. https://doi.org/10.1002/(sici)1097-0258(19980430)17:8&lt;857::aid-sim777&gt;3.0.co;2-e <pub-id pub-id-type="doi">10.1002/(sici)1097-0258(19980430)17:8&lt;857::aid-sim777&gt;3.0.co;2-e</pub-id><pub-id pub-id-type="pmid">9595616</pub-id><ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1002/(sici)1097-0258(19980430)17:8%3C857::aid-sim777%3E3.0.co;2-e">https://doi.org/10.1002/(sici)1097-0258(19980430)17:8&lt;857::aid-sim777&gt;3.0.co;2-e</ext-link></mixed-citation>
          <element-citation publication-type="journal">
            <person-group person-group-type="author">
              <string-name>Newcombe, R.G.</string-name>
            </person-group>
            <year>1998</year>
            <article-title>Two-Sided Confidence Intervals for the Single Proportion: Comparison of Seven Methods</article-title>
            <source>Statistics in Medicine</source>
            <volume>17</volume>
            <fpage>8</fpage>
            <pub-id pub-id-type="doi">10.1002/(sici)1097-0258(19980430)17:8&lt;857::aid-sim777&gt;3.0.co;2-e</pub-id>
            <pub-id pub-id-type="pmid">9595616</pub-id>
          </element-citation>
        </citation-alternatives>
      </ref>
      <ref id="B18">
        <label>18.</label>
        <citation-alternatives>
          <mixed-citation publication-type="other">Cochran, W.G. (1950) The Comparison of Percentages in Matched Samples. <italic>Biometrika</italic>, 37, 256-266. https://doi.org/10.1093/biomet/37.3-4.256 <pub-id pub-id-type="doi">10.1093/biomet/37.3-4.256</pub-id><pub-id pub-id-type="pmid">14801052</pub-id><ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1093/biomet/37.3-4.256">https://doi.org/10.1093/biomet/37.3-4.256</ext-link></mixed-citation>
          <element-citation publication-type="other">
            <person-group person-group-type="author">
              <string-name>Cochran, W.G.</string-name>
            </person-group>
            <year>1950</year>
            <article-title>The Comparison of Percentages in Matched Samples</article-title>
            <source>Biometrika</source>
            <volume>37</volume>
            <pub-id pub-id-type="doi">10.1093/biomet/37.3-4.256</pub-id>
            <pub-id pub-id-type="pmid">14801052</pub-id>
          </element-citation>
        </citation-alternatives>
      </ref>
      <ref id="B19">
        <label>19.</label>
        <citation-alternatives>
          <mixed-citation publication-type="other">Serlin, R.C., Carr, J. and Marascuilo, L.A. (1982) A Measure of Association for Selected Nonparametric Procedures. <italic>Psychological</italic><italic>Bulletin</italic>, 92, 786-790. https://doi.org/10.1037/0033-2909.92.3.786 <pub-id pub-id-type="doi">10.1037/0033-2909.92.3.786</pub-id><ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1037/0033-2909.92.3.786">https://doi.org/10.1037/0033-2909.92.3.786</ext-link></mixed-citation>
          <element-citation publication-type="other">
            <person-group person-group-type="author">
              <string-name>Serlin, R.C.</string-name>
              <string-name>Carr, J.</string-name>
              <string-name>Marascuilo, L.A.</string-name>
            </person-group>
            <year>1982</year>
            <article-title>A Measure of Association for Selected Nonparametric Procedures</article-title>
            <source>Psychological Bulletin</source>
            <volume>92</volume>
            <pub-id pub-id-type="doi">10.1037/0033-2909.92.3.786</pub-id>
          </element-citation>
        </citation-alternatives>
      </ref>
      <ref id="B20">
        <label>20.</label>
        <citation-alternatives>
          <mixed-citation publication-type="book">Sheskin, D.J. (2011) Handbook of Parametric and Non-Parametric Statistical Procedures. 5th Edition. Chapman and Hall. https://doi.org/10.1201/9780429186196 <pub-id pub-id-type="doi">10.1201/9780429186196</pub-id><ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1201/9780429186196">https://doi.org/10.1201/9780429186196</ext-link></mixed-citation>
          <element-citation publication-type="book">
            <person-group person-group-type="author">
              <string-name>Sheskin, D.J.</string-name>
            </person-group>
            <year>2011</year>
            <article-title>Handbook of Parametric and Non-Parametric Statistical Procedures</article-title>
            <pub-id pub-id-type="doi">10.1201/9780429186196</pub-id>
          </element-citation>
        </citation-alternatives>
      </ref>
      <ref id="B21">
        <label>21.</label>
        <citation-alternatives>
          <mixed-citation publication-type="other">Barnett, M.J., Doroudgar, S., Khosraviani, V. and Ip, E.J. (2022) Multiple Comparisons: To Compare or Not to Compare, That Is the Question. <italic>Research</italic><italic>in</italic><italic>Social</italic><italic>and</italic><italic>Administrative</italic><italic>Pharmacy</italic>, 18, 2331-2334. https://doi.org/10.1016/j.sapharm.2021.07.006 <pub-id pub-id-type="doi">10.1016/j.sapharm.2021.07.006</pub-id><pub-id pub-id-type="pmid">34274218</pub-id><ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1016/j.sapharm.2021.07.006">https://doi.org/10.1016/j.sapharm.2021.07.006</ext-link></mixed-citation>
          <element-citation publication-type="other">
            <person-group person-group-type="author">
              <string-name>Barnett, M.J.</string-name>
              <string-name>Doroudgar, S.</string-name>
              <string-name>Khosraviani, V.</string-name>
              <string-name>Ip, E.J.</string-name>
              <string-name>Compare, T</string-name>
            </person-group>
            <year>2022</year>
            <article-title>Multiple Comparisons: To Compare or Not to Compare, That Is the Question</article-title>
            <source>Research in Social and Administrative Pharmacy</source>
            <volume>18</volume>
            <pub-id pub-id-type="doi">10.1016/j.sapharm.2021.07.006</pub-id>
            <pub-id pub-id-type="pmid">34274218</pub-id>
          </element-citation>
        </citation-alternatives>
      </ref>
      <ref id="B22">
        <label>22.</label>
        <citation-alternatives>
          <mixed-citation publication-type="book">Cohen, J. (1988) Statistical Power Analysis for Behavioral Sciences. 2nd Edition, Lawrence Erlbaum Associates.</mixed-citation>
          <element-citation publication-type="book">
            <person-group person-group-type="author">
              <string-name>Cohen, J.</string-name>
              <string-name>Edition, L</string-name>
            </person-group>
            <year>1988</year>
            <article-title>Statistical Power Analysis for Behavioral Sciences</article-title>
            <source>2nd Edition</source>
          </element-citation>
        </citation-alternatives>
      </ref>
      <ref id="B23">
        <label>23.</label>
        <citation-alternatives>
          <mixed-citation publication-type="other">IBM Corporation (2022) IBM SPSS Statistics for Windows, Version 27.0. IBM Corporation.</mixed-citation>
          <element-citation publication-type="other">
            <person-group person-group-type="author">
              <string-name>Windows, V</string-name>
            </person-group>
            <year>2022</year>
            <article-title>IBM SPSS Statistics for Windows, Version 27</article-title>
          </element-citation>
        </citation-alternatives>
      </ref>
      <ref id="B24">
        <label>24.</label>
        <citation-alternatives>
          <mixed-citation publication-type="other">Mauchly, J.W. (1940) Significance Test for Sphericity of a Normal <italic>n</italic>-Variate Distribution. <italic>The</italic><italic>Annals</italic><italic>of</italic><italic>Mathematical</italic><italic>Statistics</italic>, 11, 204-209. https://doi.org/10.1214/aoms/1177731915 <pub-id pub-id-type="doi">10.1214/aoms/1177731915</pub-id><ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1214/aoms/1177731915">https://doi.org/10.1214/aoms/1177731915</ext-link></mixed-citation>
          <element-citation publication-type="other">
            <person-group person-group-type="author">
              <string-name>Mauchly, J.W.</string-name>
            </person-group>
            <year>1940</year>
            <article-title>Significance Test for Sphericity of a Normal n-Variate Distribution</article-title>
            <source>The Annals of Mathematical Statistics</source>
            <volume>11</volume>
            <pub-id pub-id-type="doi">10.1214/aoms/1177731915</pub-id>
          </element-citation>
        </citation-alternatives>
      </ref>
      <ref id="B25">
        <label>25.</label>
        <citation-alternatives>
          <mixed-citation publication-type="other">Pillai, K.C.S. (1955) Some New Test Criteria in Multivariate Analysis. <italic>The</italic><italic>Annals</italic><italic>of</italic><italic>Mathematical</italic><italic>Statistics</italic>, 26, 117-121. https://doi.org/10.1214/aoms/1177728599 <pub-id pub-id-type="doi">10.1214/aoms/1177728599</pub-id><ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1214/aoms/1177728599">https://doi.org/10.1214/aoms/1177728599</ext-link></mixed-citation>
          <element-citation publication-type="other">
            <person-group person-group-type="author">
              <string-name>Pillai, K.C.S.</string-name>
            </person-group>
            <year>1955</year>
            <article-title>Some New Test Criteria in Multivariate Analysis</article-title>
            <source>The Annals of Mathematical Statistics</source>
            <volume>26</volume>
            <pub-id pub-id-type="doi">10.1214/aoms/1177728599</pub-id>
          </element-citation>
        </citation-alternatives>
      </ref>
      <ref id="B26">
        <label>26.</label>
        <citation-alternatives>
          <mixed-citation publication-type="journal">Din, I.U. and Hayat, Y. (2021) ANOVA or MANOVA for Correlated Traits in Agricultural Experiments. <italic>Sarhad</italic><italic>Journal</italic><italic>of</italic><italic>Agriculture</italic>, 37, 1250-1259. https://doi.org/10.17582/journal.sja/2021/37.4.1250.1259 <pub-id pub-id-type="doi">10.17582/journal.sja/2021/37.4.1250.1259</pub-id><ext-link ext-link-type="uri" xlink:href="https://doi.org/10.17582/journal.sja/2021/37.4.1250.1259">https://doi.org/10.17582/journal.sja/2021/37.4.1250.1259</ext-link></mixed-citation>
          <element-citation publication-type="journal">
            <person-group person-group-type="author">
              <string-name>Din, I.U.</string-name>
              <string-name>Hayat, Y.</string-name>
            </person-group>
            <year>2021</year>
            <article-title>ANOVA or MANOVA for Correlated Traits in Agricultural Experiments</article-title>
            <source>Sarhad Journal of Agriculture</source>
            <volume>37</volume>
            <pub-id pub-id-type="doi">10.17582/journal.sja/2021/37.4.1250.1259</pub-id>
          </element-citation>
        </citation-alternatives>
      </ref>
      <ref id="B27">
        <label>27.</label>
        <citation-alternatives>
          <mixed-citation publication-type="book">Hedges, L.V. and Olkin, I. (1985) Statistical Methods for Meta-Analysis. Academic Press.</mixed-citation>
          <element-citation publication-type="book">
            <person-group person-group-type="author">
              <string-name>Hedges, L.V.</string-name>
              <string-name>Olkin, I.</string-name>
            </person-group>
            <year>1985</year>
            <article-title>Statistical Methods for Meta-Analysis</article-title>
          </element-citation>
        </citation-alternatives>
      </ref>
      <ref id="B28">
        <label>28.</label>
        <citation-alternatives>
          <mixed-citation publication-type="other">Shapiro, S.S. and Wilk, M.B. (1965) An Analysis of Variance Test for Normality (Complete Samples). <italic>Biometrika</italic>, 52, 591-611. https://doi.org/10.2307/2333709 <pub-id pub-id-type="doi">10.2307/2333709</pub-id><ext-link ext-link-type="uri" xlink:href="https://doi.org/10.2307/2333709">https://doi.org/10.2307/2333709</ext-link></mixed-citation>
          <element-citation publication-type="other">
            <person-group person-group-type="author">
              <string-name>Shapiro, S.S.</string-name>
              <string-name>Wilk, M.B.</string-name>
            </person-group>
            <year>1965</year>
            <article-title>An Analysis of Variance Test for Normality (Complete Samples)</article-title>
            <source>Biometrika</source>
            <volume>52</volume>
            <pub-id pub-id-type="doi">10.2307/2333709</pub-id>
          </element-citation>
        </citation-alternatives>
      </ref>
      <ref id="B29">
        <label>29.</label>
        <citation-alternatives>
          <mixed-citation publication-type="other">Royston, P. (1992) Approximating the Shapiro-Wilk W-Test for Non-normality. <italic>Statistics</italic><italic>and</italic><italic>Computing</italic>, 2, 117-119. https://doi.org/10.1007/bf01891203 <pub-id pub-id-type="doi">10.1007/bf01891203</pub-id><ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1007/bf01891203">https://doi.org/10.1007/bf01891203</ext-link></mixed-citation>
          <element-citation publication-type="other">
            <person-group person-group-type="author">
              <string-name>Royston, P.</string-name>
            </person-group>
            <year>1992</year>
            <article-title>Approximating the Shapiro-Wilk W-Test for Non-normality</article-title>
            <source>Statistics and Computing</source>
            <volume>2</volume>
            <pub-id pub-id-type="doi">10.1007/bf01891203</pub-id>
          </element-citation>
        </citation-alternatives>
      </ref>
      <ref id="B30">
        <label>30.</label>
        <citation-alternatives>
          <mixed-citation publication-type="journal">Friedman, M. (1937) The Use of Ranks to Avoid the Assumption of Normality Implicit in the Analysis of Variance. <italic>Journal</italic><italic>of</italic><italic>the</italic><italic>American</italic><italic>Statistical</italic><italic>Association</italic>, 32, 675-701. https://doi.org/10.1080/01621459.1937.10503522 <pub-id pub-id-type="doi">10.1080/01621459.1937.10503522</pub-id><ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1080/01621459.1937.10503522">https://doi.org/10.1080/01621459.1937.10503522</ext-link></mixed-citation>
          <element-citation publication-type="journal">
            <person-group person-group-type="author">
              <string-name>Friedman, M.</string-name>
            </person-group>
            <year>1937</year>
            <article-title>The Use of Ranks to Avoid the Assumption of Normality Implicit in the Analysis of Variance</article-title>
            <source>Journal of the American Statistical Association</source>
            <volume>32</volume>
            <pub-id pub-id-type="doi">10.1080/01621459.1937.10503522</pub-id>
          </element-citation>
        </citation-alternatives>
      </ref>
      <ref id="B31">
        <label>31.</label>
        <citation-alternatives>
          <mixed-citation publication-type="book">Conover, W.J. (1999) Practical Nonparametric Statistics. 3rd Edition. John Wiley and Sons.</mixed-citation>
          <element-citation publication-type="book">
            <person-group person-group-type="author">
              <string-name>Conover, W.J.</string-name>
            </person-group>
            <year>1999</year>
            <article-title>Practical Nonparametric Statistics</article-title>
          </element-citation>
        </citation-alternatives>
      </ref>
      <ref id="B32">
        <label>32.</label>
        <citation-alternatives>
          <mixed-citation publication-type="other">Kendall, M.G. and Smith, B.B. (1939) The Problem of <italic>m</italic> Rankings. <italic>The</italic><italic>Annals</italic><italic>of</italic><italic>Mathematical</italic><italic>Statistics</italic>, 10, 275-287. https://doi.org/10.1214/aoms/1177732186 <pub-id pub-id-type="doi">10.1214/aoms/1177732186</pub-id><ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1214/aoms/1177732186">https://doi.org/10.1214/aoms/1177732186</ext-link></mixed-citation>
          <element-citation publication-type="other">
            <person-group person-group-type="author">
              <string-name>Kendall, M.G.</string-name>
              <string-name>Smith, B.B.</string-name>
            </person-group>
            <year>1939</year>
            <article-title>The Problem of m Rankings</article-title>
            <source>The Annals of Mathematical Statistics</source>
            <volume>10</volume>
            <pub-id pub-id-type="doi">10.1214/aoms/1177732186</pub-id>
          </element-citation>
        </citation-alternatives>
      </ref>
      <ref id="B33">
        <label>33.</label>
        <citation-alternatives>
          <mixed-citation publication-type="journal">In, J. and Lee, D.K. (2024) Alternatives to the P Value: Connotations of Significance. <italic>Korean</italic><italic>Journal</italic><italic>of</italic><italic>Anesthesiology</italic>, 77, 316-325. https://doi.org/10.4097/kja.23630 <pub-id pub-id-type="doi">10.4097/kja.23630</pub-id><pub-id pub-id-type="pmid">38835136</pub-id><ext-link ext-link-type="uri" xlink:href="https://doi.org/10.4097/kja.23630">https://doi.org/10.4097/kja.23630</ext-link></mixed-citation>
          <element-citation publication-type="journal">
            <person-group person-group-type="author">
              <string-name>In, J.</string-name>
              <string-name>Lee, D.K.</string-name>
            </person-group>
            <year>2024</year>
            <article-title>Alternatives to the P Value: Connotations of Significance</article-title>
            <source>Korean Journal of Anesthesiology</source>
            <volume>77</volume>
            <pub-id pub-id-type="doi">10.4097/kja.23630</pub-id>
            <pub-id pub-id-type="pmid">38835136</pub-id>
          </element-citation>
        </citation-alternatives>
      </ref>
      <ref id="B34">
        <label>34.</label>
        <citation-alternatives>
          <mixed-citation publication-type="web">JASP Team (2024) JASP Version 0.19.2 [Computer Software]. https://jasp-stats.org/download/</mixed-citation>
          <element-citation publication-type="web">
            <year>2024</year>
            <article-title>JASP Version 0</article-title>
          </element-citation>
        </citation-alternatives>
      </ref>
      <ref id="B35">
        <label>35.</label>
        <citation-alternatives>
          <mixed-citation publication-type="journal">Lyubomirsky, S. and Lepper, H.S. (1999) A Measure of Subjective Happiness: Preliminary Reliability and Construct Validation. <italic>Social</italic><italic>Indicators</italic><italic>Research</italic>, 46, 137-155. https://doi.org/10.1023/a:1006824100041 <pub-id pub-id-type="doi">10.1023/a:1006824100041</pub-id><ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1023/a:1006824100041">https://doi.org/10.1023/a:1006824100041</ext-link></mixed-citation>
          <element-citation publication-type="journal">
            <person-group person-group-type="author">
              <string-name>Lyubomirsky, S.</string-name>
              <string-name>Lepper, H.S.</string-name>
            </person-group>
            <year>1999</year>
            <article-title>A Measure of Subjective Happiness: Preliminary Reliability and Construct Validation</article-title>
            <source>Social Indicators Research</source>
            <volume>46</volume>
            <fpage>100682</fpage>
            <pub-id pub-id-type="doi">10.1023/a:1006824100041</pub-id>
          </element-citation>
        </citation-alternatives>
      </ref>
      <ref id="B36">
        <label>36.</label>
        <citation-alternatives>
          <mixed-citation publication-type="journal">Quezada, L., Landero, R. and González, M.T. (2016) A Validity and Reliability Study of the Subjective Happiness Scale in Mexico. <italic>The</italic><italic>Journal</italic><italic>of</italic><italic>Happiness</italic><italic>and</italic><italic>Well-Being</italic>, 4, 90-100.</mixed-citation>
          <element-citation publication-type="journal">
            <person-group person-group-type="author">
              <string-name>Quezada, L.</string-name>
              <string-name>Landero, R.</string-name>
            </person-group>
            <year>2016</year>
            <article-title>A Validity and Reliability Study of the Subjective Happiness Scale in Mexico</article-title>
            <source>The Journal of Happiness and Well-Being</source>
            <volume>4</volume>
          </element-citation>
        </citation-alternatives>
      </ref>
      <ref id="B37">
        <label>37.</label>
        <citation-alternatives>
          <mixed-citation publication-type="other">Jammalamadaka, S.R., Taufer, E. and Terdik, G.H. (2020) On Multivariate Skewness and Kurtosis. <italic>Sankhya</italic><italic>A</italic>, 83, 607-644. https://doi.org/10.1007/s13171-020-00211-6 <pub-id pub-id-type="doi">10.1007/s13171-020-00211-6</pub-id><ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1007/s13171-020-00211-6">https://doi.org/10.1007/s13171-020-00211-6</ext-link></mixed-citation>
          <element-citation publication-type="other">
            <person-group person-group-type="author">
              <string-name>Jammalamadaka, S.R.</string-name>
              <string-name>Taufer, E.</string-name>
              <string-name>Terdik, G.H.</string-name>
            </person-group>
            <year>2020</year>
            <article-title>On Multivariate Skewness and Kurtosis</article-title>
            <source>Sankhya A</source>
            <volume>83</volume>
            <pub-id pub-id-type="doi">10.1007/s13171-020-00211-6</pub-id>
          </element-citation>
        </citation-alternatives>
      </ref>
      <ref id="B38">
        <label>38.</label>
        <citation-alternatives>
          <mixed-citation publication-type="web">Burgess, N. (2022) Correlated Monte Carlo Simulation Using Cholesky Decomposition. Social Science Research Network (SSRN). https://ssrn.com/abstract=4066115</mixed-citation>
          <element-citation publication-type="web">
            <person-group person-group-type="author">
              <string-name>Burgess, N.</string-name>
            </person-group>
            <year>2022</year>
            <article-title>Correlated Monte Carlo Simulation Using Cholesky Decomposition</article-title>
          </element-citation>
        </citation-alternatives>
      </ref>
      <ref id="B39">
        <label>39.</label>
        <citation-alternatives>
          <mixed-citation publication-type="web">Leisch, F. and Kostyshak, S. (2025) Package ‘Bootstrap’. https://cran.r-project.org/web/packages/bootstrap/bootstrap.pdf</mixed-citation>
          <element-citation publication-type="web">
            <person-group person-group-type="author">
              <string-name>Leisch, F.</string-name>
              <string-name>Kostyshak, S.</string-name>
            </person-group>
            <year>2025</year>
            <article-title>Package ‘Bootstrap’</article-title>
          </element-citation>
        </citation-alternatives>
      </ref>
      <ref id="B40">
        <label>40.</label>
        <citation-alternatives>
          <mixed-citation publication-type="other">Korteling, J.E., van de Boer-Visschedijk, G.C., Blankendaal, R.A.M., Boonekamp, R.C. and Eikelboom, A.R. (2021) Human-versus Artificial Intelligence. <italic>Frontiers</italic><italic>in</italic><italic>Artificial</italic><italic>Intelligence</italic>, 4, Article 622364. https://doi.org/10.3389/frai.2021.622364 <pub-id pub-id-type="doi">10.3389/frai.2021.622364</pub-id><pub-id pub-id-type="pmid">33981990</pub-id><ext-link ext-link-type="uri" xlink:href="https://doi.org/10.3389/frai.2021.622364">https://doi.org/10.3389/frai.2021.622364</ext-link></mixed-citation>
          <element-citation publication-type="other">
            <person-group person-group-type="author">
              <string-name>Korteling, J.E.</string-name>
              <string-name>Boer-Visschedijk, G.C.</string-name>
              <string-name>Blankendaal, R.A.M.</string-name>
              <string-name>Boonekamp, R.C.</string-name>
              <string-name>Eikelboom, A.R.</string-name>
            </person-group>
            <year>2021</year>
            <article-title>Human-versus Artificial Intelligence</article-title>
            <source>Frontiers in Artificial Intelligence</source>
            <volume>4</volume>
            <elocation-id>622364</elocation-id>
            <pub-id pub-id-type="doi">10.3389/frai.2021.622364</pub-id>
            <pub-id pub-id-type="pmid">33981990</pub-id>
          </element-citation>
        </citation-alternatives>
      </ref>
      <ref id="B41">
        <label>41.</label>
        <citation-alternatives>
          <mixed-citation publication-type="other">Moral De La Rubia, J. (2025) Refined Normality Test Based on the Parametric Seven-Number Summary. <italic>İstatistik</italic><italic>ve</italic><italic>Uygulamalı</italic><italic>Bilimler</italic><italic>Dergisi</italic>, 11, 1-39. https://doi.org/10.52693/jsas.1670945 <pub-id pub-id-type="doi">10.52693/jsas.1670945</pub-id><ext-link ext-link-type="uri" xlink:href="https://doi.org/10.52693/jsas.1670945">https://doi.org/10.52693/jsas.1670945</ext-link></mixed-citation>
          <element-citation publication-type="other">
            <person-group person-group-type="author">
              <string-name>Rubia, J.</string-name>
            </person-group>
            <year>2025</year>
            <article-title>Refined Normality Test Based on the Parametric Seven-Number Summary</article-title>
            <source>İstatistik ve Uygulamalı Bilimler Dergisi</source>
            <volume>11</volume>
            <pub-id pub-id-type="doi">10.52693/jsas.1670945</pub-id>
          </element-citation>
        </citation-alternatives>
      </ref>
      <ref id="B42">
        <label>42.</label>
        <citation-alternatives>
          <mixed-citation publication-type="other">Quach, N.E., Yang, K., Chen, R., Tu, J., Xu, M., Tu, X.M., et al. (2022) Post-Hoc Power Analysis: A Conceptually Valid Approach for Power Based on Observed Study Data. <italic>General</italic><italic>Psychiatry</italic>, 35, e100764. https://doi.org/10.1136/gpsych-2022-100764 <pub-id pub-id-type="doi">10.1136/gpsych-2022-100764</pub-id><pub-id pub-id-type="pmid">36189182</pub-id><ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1136/gpsych-2022-100764">https://doi.org/10.1136/gpsych-2022-100764</ext-link></mixed-citation>
          <element-citation publication-type="other">
            <person-group person-group-type="author">
              <string-name>Quach, N.E.</string-name>
              <string-name>Yang, K.</string-name>
              <string-name>Chen, R.</string-name>
              <string-name>Tu, J.</string-name>
              <string-name>Xu, M.</string-name>
              <string-name>Tu, X.M.</string-name>
            </person-group>
            <year>2022</year>
            <article-title>Post-Hoc Power Analysis: A Conceptually Valid Approach for Power Based on Observed Study Data</article-title>
            <source>General Psychiatry</source>
            <volume>35</volume>
            <pub-id pub-id-type="doi">10.1136/gpsych-2022-100764</pub-id>
            <pub-id pub-id-type="pmid">36189182</pub-id>
          </element-citation>
        </citation-alternatives>
      </ref>
      <ref id="B43">
        <label>43.</label>
        <citation-alternatives>
          <mixed-citation publication-type="journal">Mokhtar, S.F., Md Yusof, Z. and Sapiri, H. (2023) Confidence Intervals by Bootstrapping Approach: A Significance Review. <italic>Malaysian</italic><italic>Journal</italic><italic>of</italic><italic>Fundamental</italic><italic>and</italic><italic>Applied</italic><italic>Sciences</italic>, 19, 30-42. https://doi.org/10.11113/mjfas.v19n1.2660 <pub-id pub-id-type="doi">10.11113/mjfas.v19n1.2660</pub-id><ext-link ext-link-type="uri" xlink:href="https://doi.org/10.11113/mjfas.v19n1.2660">https://doi.org/10.11113/mjfas.v19n1.2660</ext-link></mixed-citation>
          <element-citation publication-type="journal">
            <person-group person-group-type="author">
              <string-name>Mokhtar, S.F.</string-name>
              <string-name>Yusof, Z.</string-name>
              <string-name>Sapiri, H.</string-name>
            </person-group>
            <year>2023</year>
            <article-title>Confidence Intervals by Bootstrapping Approach: A Significance Review</article-title>
            <source>Malaysian Journal of Fundamental and Applied Sciences</source>
            <volume>19</volume>
            <pub-id pub-id-type="doi">10.11113/mjfas.v19n1.2660</pub-id>
          </element-citation>
        </citation-alternatives>
      </ref>
      <ref id="B44">
        <label>44.</label>
        <citation-alternatives>
          <mixed-citation publication-type="book">Zhang, Z. and Yuan, K.H. (2018) Practical Statistical Power Analysis Using Web Power and R. ISDSA Press. https://doi.org/10.35566/power <pub-id pub-id-type="doi">10.35566/power</pub-id><ext-link ext-link-type="uri" xlink:href="https://doi.org/10.35566/power">https://doi.org/10.35566/power</ext-link></mixed-citation>
          <element-citation publication-type="book">
            <person-group person-group-type="author">
              <string-name>Zhang, Z.</string-name>
              <string-name>Yuan, K.H.</string-name>
            </person-group>
            <year>2018</year>
            <article-title>Practical Statistical Power Analysis Using Web Power and R</article-title>
            <pub-id pub-id-type="doi">10.35566/power</pub-id>
          </element-citation>
        </citation-alternatives>
      </ref>
      <ref id="B45">
        <label>45.</label>
        <citation-alternatives>
          <mixed-citation publication-type="journal">Davidson, R. and MacKinnon, J.G. (2006) The Power of Bootstrap and Asymptotic Tests. <italic>Journal</italic><italic>of</italic><italic>Econometrics</italic>, 133, 421-441. https://doi.org/10.1016/j.jeconom.2005.06.002 <pub-id pub-id-type="doi">10.1016/j.jeconom.2005.06.002</pub-id><ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1016/j.jeconom.2005.06.002">https://doi.org/10.1016/j.jeconom.2005.06.002</ext-link></mixed-citation>
          <element-citation publication-type="journal">
            <person-group person-group-type="author">
              <string-name>Davidson, R.</string-name>
              <string-name>MacKinnon, J.G.</string-name>
            </person-group>
            <year>2006</year>
            <article-title>The Power of Bootstrap and Asymptotic Tests</article-title>
            <source>Journal of Econometrics</source>
            <volume>133</volume>
            <pub-id pub-id-type="doi">10.1016/j.jeconom.2005.06.002</pub-id>
          </element-citation>
        </citation-alternatives>
      </ref>
      <ref id="B46">
        <label>46.</label>
        <citation-alternatives>
          <mixed-citation publication-type="journal">Moral de la Rubia, J. (2023) Proposal and Pilot Study: A Generalization of the W or W’ Statistic for Multivariate Normality. <italic>Open</italic><italic>Journal</italic><italic>of</italic><italic>Statistics</italic>, 13, 119-169. https://doi.org/10.4236/ojs.2023.131008 <pub-id pub-id-type="doi">10.4236/ojs.2023.131008</pub-id><ext-link ext-link-type="uri" xlink:href="https://doi.org/10.4236/ojs.2023.131008">https://doi.org/10.4236/ojs.2023.131008</ext-link></mixed-citation>
          <element-citation publication-type="journal">
            <person-group person-group-type="author">
              <string-name>Rubia, J.</string-name>
            </person-group>
            <year>2023</year>
            <article-title>Proposal and Pilot Study: A Generalization of the W or W’ Statistic for Multivariate Normality</article-title>
            <source>Open Journal of Statistics</source>
            <volume>13</volume>
            <pub-id pub-id-type="doi">10.4236/ojs.2023.131008</pub-id>
          </element-citation>
        </citation-alternatives>
      </ref>
      <ref id="B47">
        <label>47.</label>
        <citation-alternatives>
          <mixed-citation publication-type="book">Manly, B.F. and Navarro-Alberto, J.A. (2021) Randomization, Bootstrap and Monte Carlo Methods in Biology. 4th Edition, Chapman and Hall/CRC. https://doi.org/10.1201/9780429329203 <pub-id pub-id-type="doi">10.1201/9780429329203</pub-id><ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1201/9780429329203">https://doi.org/10.1201/9780429329203</ext-link></mixed-citation>
          <element-citation publication-type="book">
            <person-group person-group-type="author">
              <string-name>Manly, B.F.</string-name>
              <string-name>Navarro-Alberto, J.A.</string-name>
              <string-name>Randomization, B</string-name>
              <string-name>Edition, C</string-name>
            </person-group>
            <year>2021</year>
            <article-title>Randomization, Bootstrap and Monte Carlo Methods in Biology</article-title>
            <source>4th Edition</source>
            <pub-id pub-id-type="doi">10.1201/9780429329203</pub-id>
          </element-citation>
        </citation-alternatives>
      </ref>
      <ref id="B48">
        <label>48.</label>
        <citation-alternatives>
          <mixed-citation publication-type="other">Henze, N. and Zirkler, B. (1990) A Class of Invariant Consistent Tests for Multivariate Normality. <italic>Communications</italic><italic>in</italic><italic>Statistics</italic>— <italic>Theory</italic><italic>and</italic><italic>Methods</italic>, 19, 3595-3617. https://doi.org/10.1080/03610929008830400 <pub-id pub-id-type="doi">10.1080/03610929008830400</pub-id><ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1080/03610929008830400">https://doi.org/10.1080/03610929008830400</ext-link></mixed-citation>
          <element-citation publication-type="other">
            <person-group person-group-type="author">
              <string-name>Henze, N.</string-name>
              <string-name>Zirkler, B.</string-name>
            </person-group>
            <year>1990</year>
            <article-title>A Class of Invariant Consistent Tests for Multivariate Normality</article-title>
            <source>Communications in Statistics—Theory and Methods</source>
            <volume>19</volume>
            <pub-id pub-id-type="doi">10.1080/03610929008830400</pub-id>
          </element-citation>
        </citation-alternatives>
      </ref>
      <ref id="B49">
        <label>49.</label>
        <citation-alternatives>
          <mixed-citation publication-type="web">Korkmaz, S. (2025) Package ‘MVN’. Multivariate Normality Tests. https://cran.r-project.org/web/packages/MVN//MVN.pdf</mixed-citation>
          <element-citation publication-type="web">
            <person-group person-group-type="author">
              <string-name>Korkmaz, S.</string-name>
            </person-group>
            <year>2025</year>
            <article-title>Package ‘MVN’</article-title>
          </element-citation>
        </citation-alternatives>
      </ref>
    </ref-list>
  </back>
</article>