<?xml version="1.0" encoding="UTF-8"?><!DOCTYPE article  PUBLIC "-//NLM//DTD Journal Publishing DTD v3.0 20080202//EN" "http://dtd.nlm.nih.gov/publishing/3.0/journalpublishing3.dtd"><article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" dtd-version="3.0" xml:lang="en" article-type="research article"><front><journal-meta><journal-id journal-id-type="publisher-id">ALAMT</journal-id><journal-title-group><journal-title>Advances in Linear Algebra &amp; Matrix Theory</journal-title></journal-title-group><issn pub-type="epub">2165-333X</issn><publisher><publisher-name>Scientific Research Publishing</publisher-name></publisher></journal-meta><article-meta><article-id pub-id-type="doi">10.4236/alamt.2018.81003</article-id><article-id pub-id-type="publisher-id">ALAMT-82800</article-id><article-categories><subj-group subj-group-type="heading"><subject>Articles</subject></subj-group><subj-group subj-group-type="Discipline-v2"><subject>Physics&amp;Mathematics</subject></subj-group></article-categories><title-group><article-title>
 
 
  High Order Tensor Forms of Growth Curve Models
 
</article-title></title-group><contrib-group><contrib contrib-type="author" xlink:type="simple"><name name-style="western"><surname>Zerong</surname><given-names>Lin</given-names></name><xref ref-type="aff" rid="aff1"><sup>1</sup></xref></contrib><contrib contrib-type="author" xlink:type="simple"><name name-style="western"><surname>Dongzhe</surname><given-names>Liu</given-names></name><xref ref-type="aff" rid="aff2"><sup>2</sup></xref></contrib><contrib contrib-type="author" xlink:type="simple"><name name-style="western"><surname>Xueying</surname><given-names>Liu</given-names></name><xref ref-type="aff" rid="aff3"><sup>3</sup></xref></contrib><contrib contrib-type="author" xlink:type="simple"><name name-style="western"><surname>Lingling</surname><given-names>He</given-names></name><xref ref-type="aff" rid="aff1"><sup>1</sup></xref></contrib><contrib contrib-type="author" xlink:type="simple"><name name-style="western"><surname>Changqing</surname><given-names>Xu</given-names></name><xref ref-type="aff" rid="aff1"><sup>1</sup></xref><xref ref-type="corresp" rid="cor1"><sup>*</sup></xref></contrib></contrib-group><aff id="aff2"><addr-line>School of Energy and Environment, City University of Hong Kong, Hong Kong, China</addr-line></aff><aff id="aff1"><addr-line>School of Mathematics and Physics, Suzhou University of Science and Technology, Suzhou, China</addr-line></aff><aff id="aff3"><addr-line>Zhejiang University of Water Resources and Electric Power, Hang Zhou, China</addr-line></aff><pub-date pub-type="epub"><day>09</day><month>01</month><year>2018</year></pub-date><volume>08</volume><issue>01</issue><fpage>18</fpage><lpage>32</lpage><history><date date-type="received"><day>20,</day>	<month>November</month>	<year>2017</year></date><date date-type="rev-recd"><day>26,</day>	<month>February</month>	<year>2018</year>	</date><date date-type="accepted"><day>1,</day>	<month>March</month>	<year>2018</year></date></history><permissions><copyright-statement>&#169; Copyright  2014 by authors and Scientific Research Publishing Inc. </copyright-statement><copyright-year>2014</copyright-year><license><license-p>This work is licensed under the Creative Commons Attribution International License (CC BY). http://creativecommons.org/licenses/by/4.0/</license-p></license></permissions><abstract><p><html>
 <head></head>
 
  In this paper, we first study the linear regression model 
  <img src="Edit_b8ebabe2-7823-49b1-a4bb-38eec2167754.bmp" alt="" /> and obtain a norm-minimized estimator of the parameter vector 
  <img src="Edit_195b897a-f5e7-4021-bb9c-145e631f82e2.bmp" alt="" /> by using the g-inverse and the singular value decomposition of matrix 
  <em>X</em>. We then investigate the growth curve model (GCM) and extend the GCM to a generalized GCM (GGCM) by using high order tensors. The parameter estimations in GGCMs are also achieved in this way.
 
</html></p></abstract><kwd-group><kwd>Tensor</kwd><kwd> Generalized Linear Model</kwd><kwd> Growth Curve Model</kwd><kwd> Parameter Estimation</kwd><kwd> Generalized Inverse</kwd></kwd-group></article-meta></front><body><sec id="s1"><title>1. Introduction</title><p>Linear regression model, or called linear model (LM), is one of the most widely used models in statistics. There are many kinds of linear models including simple linear models, general linear models, generalized linear models, mixed effects linear models and some other extended forms of linear models [<xref ref-type="bibr" rid="scirp.82800-ref1">1</xref>] [<xref ref-type="bibr" rid="scirp.82800-ref2">2</xref>] [<xref ref-type="bibr" rid="scirp.82800-ref3">3</xref>] [<xref ref-type="bibr" rid="scirp.82800-ref4">4</xref>] [<xref ref-type="bibr" rid="scirp.82800-ref5">5</xref>] . The growth curve model (GCM) is a special kind of general linear models which have applications in many areas such as the psychology data analysis [<xref ref-type="bibr" rid="scirp.82800-ref6">6</xref>] . The GCMs can be used to handle longitudinal data or missing data or even the hierarchical multilevel mixed case [<xref ref-type="bibr" rid="scirp.82800-ref2">2</xref>] [<xref ref-type="bibr" rid="scirp.82800-ref3">3</xref>] [<xref ref-type="bibr" rid="scirp.82800-ref5">5</xref>] [<xref ref-type="bibr" rid="scirp.82800-ref7">7</xref>] - [<xref ref-type="bibr" rid="scirp.82800-ref12">12</xref>] . There are some variations of GCMs such as the latent GCMs which are also very useful. The traditional treatment of the GCMs in the estimation of the parameters in the case of mixed effects for a single response factor is usually to stack all the dependent observations vertically into a very long column vector, usually denoted by y, and all the design matrices (both the fixed effect design matrix and the random effect design matrix), the random errors are accordingly concatenated to fit the size of y. This treatment makes the implementation of the related programming much slow due to the magnificent dimensions of the data (matrices and vectors). Things will get even worse if we encounter a huge dataset (big data) such as the data of genome, web related, image gallery, or social network etc.</p><p>In this paper, we first use the generalized inverse of matrices and the singular value decomposition to obtain the norm-minimized estimation of the parameters in the linear model. Then we introduce some basic knowledge about tensors before we employ tensors to express and extend the multivariate mixed effects linear models. The extended tensor form of the model can be also regarded as a generalization of the GCM.</p><p>Let us first begin with some basic linear regression models. We let y be a response variable and x 1 , ⋯ , x r be independent random variables for explaining y. The most general regression model between y and x 1 , ⋯ , x r is in form</p><p>y = f ( x 1 , ⋯ , x r ) + ε (1.1)</p><p>where ε is the error term, and f is an unknown regression function. In linear regression model, f is assumed to be a linear function, i.e.,</p><p>y i = β 0 + β 1 x 1 + β 2 x 2 + ⋯ + β r x r + ε (1.2)</p><p>where all β i are unknown parameters. Denote x = ( x 1 , ⋯ , x r ) which is called a random vector. Let P = ( y , x ) , an ( r + 1 ) -dimensional random vector, which is called an observable vector. Given N observations of P, say P i = ( y i , x i 1 , x i 2 , ⋯ , x i r ) , i = 1 , 2 , ⋯ , N . Here y i stands for the ith observation of the response variable y, and x i 1 , x i 2 , ⋯ , x i r are the corresponding explanatory observations. The sample model of Equation (1.2) turns out to be</p><p>y i = β 0 + β 1 x i 1 + β 2 x i 2 + ⋯ + β r x i r + ε (1.3)</p><p>or equivalently</p><p>y = X β + ε (1.4)</p><p>where y = ( y 1 , y 2 , ⋯ , y N ) T ∈ ℝ N (here and throughout the paper T stands for the transpose of a matrix/vector) is the sample vector of the response variable y, X = ( x i j ) ∈ ℝ N &#215; ( r + 1 ) is the data matrix or the design matrix each of whose rows corresponding to an observation of x, β = ( β 0 , β 1 , ⋯ , β r ) ∈ ℝ r + 1 is the regression coefficient vector, which is to be estimated, and ε = ( ε 1 , ε 2 , ⋯ , ε N ) T is the random error vector. A general linear regression model, abbreviated GLM, is a LM (1.4) with the error terms ε i satisfying:</p><p>1) Zero-mean: Ε [ ε i ] = 0 , ∀ i ∈ [ N ] , i.e., the expected value of the error term is zero for all the observations.</p><p>2) Homoskedasticity: Var [ ε ] = σ 2 , i.e., all the error term are distributed with the same variance.</p><p>3) Uncorrelation: Cov ( ε i , ε j ) = 0 for all distinct i , j , i.e., distinct error terms are uncorrelated.</p><p>Equations (1)-(3) is called Gauss-Markov assumption [<xref ref-type="bibr" rid="scirp.82800-ref13">13</xref>] . The model (1.4) under the Gauss-Markov assumption is called the Gauss-Markov model. Note that the variance reflects the uncertainty of the model, the zero-mean, homoskedasticity and the uncorrelation of the sample errors form the Gauss-Markov assumption. An alternative form of the Gauss-Markov model is</p><p>Ε [ Y ] = X β ,   Cov ( ε ) = σ 2 I N (1.5)</p><p>where I N is the N &#215; N identity matrix and σ 2 &gt; 0 . In order to investigate the general linear model and extend the properties, we recall some known results concerning the linear combinations of some random variables. Suppose α ∈ ℝ n is a constant vector with the same length as that of y, the random vector under the investigation.</p><p>Let A ∈ ℝ m &#215; n . The g-inverse of A, denoted A g , is a generalized inverse defined as an n &#215; m matrix satisfying [<xref ref-type="bibr" rid="scirp.82800-ref4">4</xref>] A A g A = A . An equivalent definition for g-inverse is that x = A g b is always a solution to equation A x = b whenever b ∈ C ( A ) , the column space of A. A well known result is that all the solutions to A x = b (when compatible) are in form</p><p>x = A g b + ( I − A g A ) ω , ∀ ω ∈ ℝ n . (1.6)</p><p>It is easy to verify that A g = A − 1 is unique when A is invertible. The g-inverse of a matrix (usually not unique) can be calculated by using singular value decomposition (SVD).</p><p>Lemma 1.1. Let X ∈ ℝ N &#215; p with a SVD decomposition X = U D V T such that U ∈ ℝ N &#215; N and V ∈ ℝ p &#215; p are orthogonal matrices, and D ∈ ℝ N &#215; p is in form D = d i a g ( σ 1 , σ 2 , ⋯ , σ r , 0 , ⋯ , 0 ) where r = r a n k ( X ) ≤ min ( N , p ) , and σ 1 ≥ σ 2 ≥ ⋯ ≥ σ r . Then</p><p>A g = V [ D r − 1 * * * ] U T ∈ ℝ n &#215; m (1.7)</p><p>where * denotes any matrix of suitable size and D r = d i a g ( σ 1 , σ 2 , ⋯ , σ r ) .</p><p>The Gauss-Markov Theorem (e.g. Page 51 of [<xref ref-type="bibr" rid="scirp.82800-ref13">13</xref>] ) is stated as:</p><p>Lemma 1.2. Suppose that model (1.4) satisfies the Gauss-Markov assumption and a ∈ ℝ N be a constant vector. Then z = a T y is estimable, and a T β ^ is the best (minimum variance) linear unbiased estimator (BLUE) of a T β , with β ^ = ( X T X ) g X T y .</p><p>Based on Lemma 1.2, we get</p><p>Proposition 1.3. If r a n k ( X ) = r &lt; min ( N , p ) in Equation (1.4) and X satisfies condition in Lemma 1.2. Then the estimator of β with minimal 2-norm is in form</p><p>β ^ = V [ D r − 1 y ˜ 1 0 ] (1.8)</p><p>where y ˜ = U T y = [ y ˜ 1 T , y ˜ 2 T ] T with y ˜ 1 ∈ ℝ r , y ˜ 2 ∈ ℝ N − r .</p><p>Proposition 1.3 tells us that by taking D as a block upper triangle form in the decomposition</p><p>( X T X ) g = V [ G 11 G 12 0 G 22 ] V T .</p><p>We can reach a norm-minimised estimator of β . Now denote H : = ( X T X ) g X . By Gauss-Markov Theorem, we have</p><p>‖ β ⌢ ‖ 2 = 〈 H y , H y 〉 = y T H T H y = y T X ( X T X ) 2 g X T y</p><p>which implies that ‖ β ⌢ ‖ = ‖ ( X X T ) g y ‖ .</p><p>The generalized linear model (GLM) is a generalization of LM [<xref ref-type="bibr" rid="scirp.82800-ref1">1</xref>] . In a GLM model some basic assumptions in linear regression model are relaxed. Also the fitting values of the response variables are no longer directly expressed as a linear combination of parameters, but rather a function which is usually called a link function. A GLM consists of the independent random components y i in exponential distribution, the predictive value X i T β , the system components, and the link function f, strictly monotone differentiable function in GLM η i = f ( X i T β ) . The parameters in a GLM include regression parameters β and the discrete parameters in the covariance matrix, both can be estimated with maximum likelihood method. The estimation of the regression parameter for model (1.4) can be expressed as</p><p>β ( m ) = ( X T W ( m − 1 ) X ) g X T W ( m − 1 ) z ( m − 1 ) ,   W = d i a g ( W 1 , W 2 , ⋯ , W N )</p><p>where W i = w i / { ϕ v ( μ i ) [ g T ( μ i ) ] 2 } with w i being a known priori weight,</p><p>ϕ the dispersion parameter, v ( ⋅ ) a variance function, g a link function, and z ∈ ℝ N the work dependent variable with z i = η i + ( y i − μ i ) g T ( μ i ) . The moment estimation of discrete parameters is</p><p>ϕ ^ = 1 N − k − 1 ∑ i = 1 N w i ( y i − μ ^ i ) 2 v ( μ ^ i ) . (1.9)</p><p>In order to extend the GLMS to more general case, we need some knowledge on tensors. In the next section, we will introduce some basic terminology and operations implemented on tensors, especially on low order tensors.</p></sec><sec id="s2"><title>2. The 3-Order Tensors and Their Applications in GLMs</title><p>A tensor is an extension of a matrix in the case of high order, which is an important tool to study high-dimensional arrays. The origin of tensor can be traced back to early nineteenth century when Cayley studied linear transformation theory and invariant representation. Gauss and Riemann et al. promoted the development of tensor in mathematics. In 1915 Albert Einstein used tensor to describe his general relativity, leading tensor calculus more widely accepted. In the early twentieth century, Ricci and Levi-Civita further developed tensor analysis in absolute differential methods and explored their applications [<xref ref-type="bibr" rid="scirp.82800-ref14">14</xref>] .</p><p>For our convenience we denote [ n ] : = { 1 , 2 , ⋯ , n } and use S ( m , n ) to denote the index set</p><p>S ( m , n ) : = { τ = ( i 1 , i 2 , ⋯ , i m ) : i k ∈ [ n ] , ∀ k ∈ [ m ] } .</p><p>Let I k ( k ∈ [ m ] ) be any positive integer (usually larger than 1). Sometimes we abuse I k as a set [ I k ] . Denote I : = I 1 &#215; I 2 &#215; ⋯ &#215; I m . If I k stands for an index set, then I is a tensor product of I 1 , I 2 , ⋯ , I m . An m-order tensor A = ( A σ ) of size I is an m-array whose entries are denoted by A σ : = A i 1 i 2 ⋯ i m with σ = ( i 1 , i 2 , ⋯ , i m ) ∈ I . Note that a vector is a 1-order tensor and an m &#215; n matrix is a 2-order or second order tensor. An m &#215; n tensor is a tensor with I 1 = I 2 = ⋯ = I m = n . We denote by T m , n the set of all mth order n-dimensional real tensors . An m &#215; n tensor A is called symmetric if A σ is constant under any permutation on its index.</p><p>An mth order n-dimensional real tensors A is always associated with an m-order homogeneous polynomial f A ( x ) which is defined by</p><p>f A ( x ) : = A x m = ∑ i 1 , i 2 , ⋯ , i m A i 1 , i 2 , ⋯ , i m x i 1 x i 2 ⋯ x i m . (2.10)</p><p>A is called positive definite or pd (positive semidefinite or psd) if</p><p>f A ( x ) : = A x m ≥ 0 , ∀ x ∈ ℝ n . (2.11)</p><p>A nonzero psd tensor must be of an even order. Let A be of size m &#215; n &#215; p . Given an r-order tensor A ∈ ℝ n 1 &#215; n 2 &#215; ⋯ &#215; n r and a matrix U = ( u i j ) ∈ ℝ I k &#215; J k where k ∈ [ r ] . The product of A with U along k-mode is defined as the r-order tensor A &#215; k U defined as</p><p>( A &#215; k U ) i 1 , ⋯ , i k − 1 , i k , i k + 1 , ⋯ , i m = ∑ i = 1 n A i 1 , ⋯ , i k − 1 , i k , i k + 1 , ⋯ , i m u i i k . (2.12)</p><p>Note that A &#215; k U is compressed into an ( m − 1 ) -order tensor when U ∈ ℝ I k is a column vector ( J k = 1 ) . There are two kinds of tensor decomposition, i.e., the rank-1 decomposition, also called the CP decomposition, and the Tucker decomposition, or HOSVD. The former is the generalization of matrix rank-1 decomposition and the latter is the matrix singular value decomposition in the high order case. A zero tensor is a tensor with all entries being zero. A diagonal tensor is a tensor whose off-diagonal elements are all zero, i.e., A i 1 , i 2 , ⋯ , i m = 0 if i 1 , i 2 , ⋯ , i m are not identical. Thus an m &#215; n tensor has n diagonal elements. By this way, we can define similarly (and analogous to matrix case) the identity tensor and a scalar tensor.</p><p>For any i ∈ [ n ] , an i-slice of an m-order tensor A = ( A i 1 i 2 ⋯ i m ) along mode k for any given k ∈ [ m ] is an ( m − 1 ) &#215; n tensor B with</p><p>B i 1 , i 2 , ⋯ , i m − 1 = A i 1 , ⋯ , i k − 1 , i , i k + 1 , ⋯ , i m , i = 1 , 2 , ⋯ , n .</p><p>A slice of 3-order tensor A = ( A i j k ) ∈ ℝ m &#215; n &#215; p along mode-3 is an m &#215; n matrix A ( : , : , k ) with k ∈ [ p ] , and a slice of a 4-order tensor is a 3-order tensor.</p><p>Let A ∈ ℝ m &#215; n 1 &#215; p , B ∈ ℝ n 1 &#215; n &#215; p be two tensors of 3-order. The slice-wise product of A , B , denoted by<inline-formula><inline-graphic xlink:href="/html.scirp.org/file/3-2230151x136.png" xlink:type="simple"/></inline-formula>, is defined as <inline-formula><inline-graphic xlink:href="/html.scirp.org/file/3-2230151x137.png" xlink:type="simple"/></inline-formula> where C ( : , : , k ) = A ( : , : , k ) B ( : , : , k ) for all k ∈ [ p ] . This multiplication can be used to build a regression model</p><disp-formula id="scirp.82800-formula29"><label>(2.13)</label><graphic position="anchor" xlink:href="//html.scirp.org/file/3-2230151x140.png"  xlink:type="simple"/></disp-formula><p>where A ( : , : , k ) is the matrix consisting of n sample points of size m in class k and X ( : , : , k ) is the design matrix corresponding to the kth sample (there are n 1 observations in each class in this situation).</p><p>Let k be a positive integer. The k-moment of a random variable x is defined as the expectation of x, i.e., m x ( k ) ( x ) = Ε ( x k ) . The traditional extension of moments to a multivariate case is done by an iterative vectorization imposed on k. This technique is employed not only in the definition of moments but also in other definitions such as that of a characteristic function. By introducing the tensor form into these definitions, we find that the expressions will be much easier to handle than the classical ones. In the next section, we will introduce the tensor form of all these definitions.</p><p>Let x = ( x 1 , x 2 , ⋯ , x n ) T be a random vector. Denote by x m the (symmetric) rank-one m-order tensor with</p><p>x σ m = x i 1 x i 2 ⋯ x i m ,   ∀ σ : = ( i 1 , i 2 , ⋯ , i m ) ∈ S ( m , n ) .</p><p>x m is called a rank-1 tensor generated by x which is also symmetric. It is shown by Comon et al. [<xref ref-type="bibr" rid="scirp.82800-ref15">15</xref>] that a real tensor A (with size I 1 &#215; I 2 &#215; ⋯ &#215; I m ) can always be decomposed into form</p><disp-formula id="scirp.82800-formula30"><label>(2.14)</label><graphic position="anchor" xlink:href="//html.scirp.org/file/3-2230151x152.png"  xlink:type="simple"/></disp-formula><p>where α i ( j ) ∈ ℝ I i for all j ∈ [ r ] , i ∈ [ m ] . The smallest positive integer r is called the rank of A , denoted by r a n k ( A ) . We note that Equation (2.14) can also be used to define the tensor product of two matrices, which will be used in our next work on the covariance of random matrices. Note that the tensor product of two rank-one matrices is</p><p>( α 1 &#215; β 1 ) &#215; ( α 2 &#215; β 2 ) = α 1 &#215; α 2 &#215; β 1 &#215; β 2 .</p><p>Now consider two matrices A ∈ ℝ m &#215; n , B ∈ ℝ p &#215; q . Then write A , B in a rank-1 decomposition, i.e.,</p><p>A = ∑ j = 1 R 1 α 1 ( j ) &#215; β 1 ( j ) ,   B = ∑ k = 1 R 2 α 2 ( k ) &#215; β 2 ( k ) .</p><p>Tucker decomposition decomposes the original tensor into a product of the core tensor and a number of unitary matrices in different directions [<xref ref-type="bibr" rid="scirp.82800-ref15">15</xref>] so A can be decomposed into</p><p>A = S &#215; 1 U 1 &#215; 2 U 2 &#215; 3 ⋯ &#215; N U N (2.15)</p><p>where S is the core tensor, and U 1 , U 2 , ⋯ , U N are unitary matrices.</p><p>Example 2.1. Let <inline-formula><inline-graphic xlink:href="/html.scirp.org/file/3-2230151x165.png" xlink:type="simple"/></inline-formula> be an 2 &#215; 2 &#215; 2 tensor which is defined by</p><p>X ( : , : , 1 ) = [ 1 3 2 4 ] ,   X ( : , : , 2 ) = [ 5 7 6 8 ] .</p><p>Then the unfolded matrices along 1-mode, 2-mode and 3-mode are respectively</p><p>X 1 = [ 1 5 3 7 2 6 4 8 ] ,   X 2 = [ 1 3 5 7 2 4 6 8 ] ,   X 3 = [ 1 2 5 6 3 4 7 8 ] .</p></sec><sec id="s3"><title>3. Application of 3-Order Tensors in GLMs</title><p>The growth curve model (GCM) is one of the GLMs introduced by Wishart in 1938 [<xref ref-type="bibr" rid="scirp.82800-ref16">16</xref>] to study the growth situation of animals and plant between different groups. It is a kind of generalized multivariate variance analysis model, and has been widely used in modern medicine, agriculture and biology etc. GCM originally referred to a wide array of statistical models for repeated measures data [<xref ref-type="bibr" rid="scirp.82800-ref2">2</xref>] [<xref ref-type="bibr" rid="scirp.82800-ref14">14</xref>] . The contemporary use of GCM allows the estimation of inter-object variability such as time trends, time paths, growth curves or latent trajectories, in intra-object patterns of change over time [<xref ref-type="bibr" rid="scirp.82800-ref17">17</xref>] . The trajectories are the primary focus of analysis in most cases, whereas in others, they may represent just one part of a much broader longitudinal model. The most basic GCMs contain fixed and random effects that best capture the collection of individual trajectories over time. In a GCM, the fixed effects represent the mean of the trajectory pooling of all individuals, and the random effects represent the variance of the individual trajectories around these group means. For example, the fixed effects in a linear trajectory are estimates of the mean intercept and mean slope that define the underlying trajectory pooling of the entire sample, and the random effects are estimates of the between-person variability in the individual intercepts and slopes. Smaller random effects imply the more similar parameters that define the trajectory across the sample of individuals; at the extreme situation where the random effects equal 0, all individuals are governed by precisely the same trajectory parameters (i.e., there is a single trajectory shared by all individuals). In contrast, larger random effects imply greater individual differences in the magnitude of the trajectory parameters around the mean values.</p><p>The analysis of a GCM focuses on the functional relationship among ordered responses. Conventional GCM methods apply to growth data and to other analogs such as dose-response data (indexed by dose), location-response data (indexed by distance), or response-surface data (indexed by two or more variables such as latitude and longitude). The GCM methods mainly focus on longitudinal observations on a one-dimensional characteristic even though they may also be used in multidimensional cases [<xref ref-type="bibr" rid="scirp.82800-ref2">2</xref>] .</p><p>A general GCM can be indicated by</p><p>Y = X B T + E (3.16)</p><p>where Y ∈ ℝ N &#215; p is the random response matrix whose rows are mutually independent and columns correspond to the response variables ordered according to d = [ d 1 , d 2 , ⋯ , d p ] T ; X ∈ ℝ N &#215; q is the fixed design matrix with r : = r a n k ( X ) ≤ q ≤ N ; The matrix B ∈ ℝ q &#215; m is a fixed parameter matrix whose entries are the regression coefficients; T ∈ ℝ m &#215; p is a within-subject design matrix each of whose entries is a fixed function of d, and E ∈ ℝ N &#215; p is a random error with matrix normal distribution E ∼ N N , p ( 0 , Σ , I N ) where Σ ∈ ℝ p &#215; p is an unknown symmetric positive definite matrix. Suppose the samplings corresponding to each object are recorded at p different times (moments) d 1 , d 2 , ⋯ , d p . Consider an example of a pattern of the children’s weight. The plotting of the weights against the ages indicates a temporal pattern of growth. A univariate linear model for weight given age could be fitted with a design matrix T expressing the central tendency of the children’s weights as a linear or curvilinear function of age. Here T is an example of a within-subject design matrix. If N &gt; 1 , a separate curve could be fitted for each subject to obtain a separate matrix of regression parameter estimators for each independent sampling units, β ^ i = Y i T ( T T T ) − 1 for i ∈ [ N ] , and a simple average of the N fitted curves is a proper (if not efficient) estimator of the population growth curve, that is,</p><p>B ^ = 1 N ∑ j = 1 N B ^ j .</p><p>The efficient estimator has the form</p><p>B ^ = [ ( X T X ) − 1 X T ] [ T ( T T T ) − 1 ] . (3.17)</p><p>If the subjects are grouped in a balanced way, i.e., The N observations are clustered into m groups, each containing the same number, say n, of observations. For simplicity, we may assume that first n each then X = l N , the all-one vector, is the appropriate choice for computing B ^ . The choice of T defines the functional form of the population growth curve by describing a function relationship between weight and age.</p><p>Example 3.1. We recorded the heights of n boys and n girls whose ages are 2, 3, 4 and 6 years. From the observations we make an assumption that the average height increases linearly with age. Since the observed data is partitioned into two groups (one is for the heights of n boys and another is for the height of n girls), each consisting of n objects, and p = 4 with age vector d = [ 2 , 3 , 4 , 6 ] T . Thus the model for the height vs. age shall be Y = X B T + E where</p><p>X = [ l n 0 0 l n ] , T = [ l p T d T ]</p><p>where l k ∈ ℝ k is an all-ones vector of dimension k.</p><p>Here β 11 , β 12 are respectively the intercept and the slope for girls and β 21 , β 22 are respectively the intercept and the slope for boys. We find that it is not so easy for us to investigate the relationship between the gender, height, weight, and age. In the following we employ tensor expression to deal with this issue.</p><p>Using the notation in tensor theory, we rewrite model (3.16) in form</p><p>Y = X &#215; 2 B &#215; 2 T + E</p><p>or equivalently</p><p>Y = B &#215; 1 X T &#215; 2 T + E (3.18)</p><p>where B is regarded as a second order tensor and X , T as two matrices. Actually, according to Equation (2.12), we have</p><p>( B &#215; 1 X T ) i j = ∑ k = 1 q X i k B k j .</p><p>Similarly we can define B &#215; 2 V . Note that</p><p>B &#215; 1 U &#215; 2 V = B &#215; 2 V &#215; 1 U .</p><p>Now we extend model (3.16) in a more general form as</p><p>A = B &#215; 1 X 1 &#215; 2 X 2 &#215; 3 X 3 + E (3.19)</p><p>where A ∈ ℝ n 1 &#215; n 2 &#215; n 3 , B = ( B i j k ) ∈ ℝ m 1 &#215; m 2 &#215; m 3 is a 3-order tensor, which is usually an unknown constant parameter tensor or the kernel tensor, and X i ∈ ℝ n i &#215; m i for i = 1 , 2 , 3 . Here the tensor-matrix multiplication is defined by Equation (2.12) according to the dimensional coherence along each mode.</p><p>The potential applications of Equation (3.21) are obvious. A HOSVD (high order singular value decomposition) of a 3-order tensor can be regarded as a good example for this model.</p><p>Example 3.2. A sequence of 1000 images extracted from a repository of face images of ten individuals, each with 100 face images. Suppose each face image is of size 256 &#215; 256 . Then these images can be restored in an 256 &#215; 256 &#215; 1000 tensor A . Let A be decomposed as</p><p>A = B &#215; 1 U 1 T &#215; 2 U 2 T &#215; 3 U 3 T (3.20)</p><p>where</p><p>B ∈ ℝ 16 &#215; 16 &#215; 50 , U 1 ∈ ℝ 256 &#215; 16 , U 2 ∈ ℝ 256 &#215; 16 , U 3 ∈ ℝ 1000 &#215; 50 .</p><p>The decomposition Equation (3.20) yields a set of compressed images, each with size 16 &#215; 16 . If each individual can be characterized by five images (this is called a balanced compression), then the kernel tensor B consists of 50 compressed images where each U i is a projection matrix along mode-i (i = 1, 2, 3). Specifically, U 1 and U 2 together play a role of compression of each image into an 16 &#215; 16 image, while U 3 finds the representative elements (here is the 50 images) among a large set of images (the set of 1000 face images).</p><p>Analog to GCM, we let Y i j k be the measured value of Index I k in Class C i at time T j . A tensor Y = ( Y i j k ) ∈ ℝ m &#215; n &#215; p can be used to express m objects, say P 1 , ⋯ , P m , each having p indexes I 1 , ⋯ , I p measured respectively at times t 1 , ⋯ , t n . For each index I k , k ∈ [ p ] , we have GCM form:</p><p>Y k = B k &#215; 1 X &#215; 2 T + E k (3.21)</p><p>where Y k = ( y 1 ( k ) , ⋯ , y n ( k ) ) ∈ ℝ m &#215; n . Suppose each row of Y k stands for a class of individuals, e.g., partitioned by ages. To make things more clear, we consider a concrete example.</p><p>Example 3.3. There are 30 persons under health test, each measured, at time T 1 , ⋯ , T 4 , 10 indexes such as the lower/higher blood pressures, heartbeat rate, urea, cholesterol, bilirubin, etc. We label these indexes respectively by I 1 , ⋯ , I 10 . Suppose that the 30 people are partitioned into three groups (denoted by C 1 , C 2 , C 3 ) with respect to their ages, consisting of 5, 10, 15 individuals respectively. Denote</p><p>X = [ l 5 0 0 0 l 10 0 0 0 l 15 ] , T = [ 1 1 1 1 t 1 t 2 t 3 t 4 t 1 2 t 2 2 t 3 2 t 4 2 ]</p><p>and</p><p>B k = [ β 11 k β 12 k β 1 p k β 21 k β 22 k β 2 p k β 31 k β 32 k β 3 p k ] .</p><p>Denote by Y i j k the measurement of Index I k in group C i at time T j . Set Y ( : , : , k ) = Y k , B ( : , : , k ) = B k for k = 1 , 2 , ⋯ , 10 . Then we have</p><p>Y = B &#215; 1 X T &#215; 2 T T + ε (3.22)</p><p>where Y , ε ∈ ℝ 30 &#215; 4 &#215; 10 , X ∈ ℝ 30 &#215; 3 , T ∈ ℝ 4 &#215; 3 and B ∈ ℝ 3 &#215; 3 &#215; 10 is an unknown constant parameter tensor to be estimated, where B ( : , : , k ) = B k is the parameter matrix corresponding to the k th index model. The model (3.22) can be further promoted to manipulate a balanced linear mixed model when multiple responses are measured for balanced clustered (i.e., there are same number of subjects in each cluster) subjects.</p></sec><sec id="s4"><title>4. Tensor Normal Distributions</title><p>In the multivariate analysis, the correlations between the coordinates of a random vector x = ( x 1 , ⋯ , x n ) T are represented by the covariance matrix Σ : = Σ ( x ) , which is symmetric and positive semidefinite. When the variables are arrayed as a matrix, say X = ( X i j ) ∈ ℝ m &#215; n , which is called a random matrix, the correlation between any pair of entries, say X i 1 j 1 and X i 2 j 2 of matrix X , is represented as an entry of a matrix Σ which is defined as the covariance matrix of the vector. A matrix normal distribution is defined. μ ∈ ℝ m &#215; n , and Σ ∈ ℝ m &#215; m , ϕ ∈ ℝ n &#215; n are two positive definite matrices. A random matrix X ∈ ℝ m &#215; n is said to obey a matrix normal distribution, denoted by X ∼ N m , n ( μ , Σ , ϕ ) , if it satisfies the following the conditions:</p><p>1)<inline-formula><inline-graphic xlink:href="/html.scirp.org/file/3-2230151x256.png" xlink:type="simple"/></inline-formula>, i.e., <inline-formula><inline-graphic xlink:href="/html.scirp.org/file/3-2230151x257.png" xlink:type="simple"/></inline-formula>for each i ∈ [ m ] , j ∈ [ n ] .</p><p>2) Each row X i ⋅ of X obeys normal distribution X i ⋅ ∼ N p ( 0 , ϕ ) for i ∈ [ m ] .</p><p>3) Each column X ⋅ j obeys normal distribution X ⋅ j ∼ N q ( 0 , Σ ) .</p><p>It is easy to show that a matrix normal distribution X ∼ N p . q ( μ , Σ , ϕ ) is equivalent to vec ( X ) ∼ N p q ( vec ( μ ) , Σ ⊗ ϕ ) (see e.g. [<xref ref-type="bibr" rid="scirp.82800-ref8">8</xref>] ).</p><p>We now define the tensor normal distribution. Let A = ( A i 1 i 2 … i m ) ∈ T m be an m-order tensor of size N : = N 1 &#215; N 2 &#215; ⋯ &#215; N m , each of whose entries is a random variable. Let μ = ( μ i 1 i 2 ⋯ i m ) be an m-order tensor of the same size as that of A , and Σ k ( k ∈ [ m ] ) be an N k &#215; N k positive definite matrix. For convenience, we denote by I ( n ) the ( m − 1 ) -tuple ( i 1 , i 2 , ⋯ , i n − 1 , i n + 1 , ⋯ , i m ) with i k ∈ [ N k ] . We denote by A ( I ( n ) ) and μ ( I ( n ) ) , both in ℝ N n respectively the corresponding fibre (vector) of A and μ , indexed by I ( n ) , i.e.,</p><p>A ( I ( n ) ) : = A ( i 1 , i 2 , ⋯ , i n − 1 , : , i n + 1 , ⋯ , i m ) , ∀ i k ∈ N k , k ∈ [ m ] \ { n } .</p><p>A ( I ( n ) ) μ ( I ( n ) ) is called a fibre of A ( μ resp.) along mode-n indexed by I ( n ) . A is said to obey a tensor normal distribution with parameter matrices ( μ , Σ 1 , ⋯ , Σ m ) or denoted by</p><p>A ∼ N T ( μ , Σ 1 , ⋯ , Σ m )</p><p>if for any n ∈ [ m ] , we have</p><p>A ( i 1 , ⋯ , i n − 1 , : , i n + 1 , ⋯ , i m ) ∼ N N n ( μ ( I ( n ) ) Σ n ) .</p><p>A is said to follow a standard tensor normal distribution if all the Σ k ’s are identity matrices. A model (2.13) with a tensor normal distribution is called a general tensor normal (GTN) model.</p><p>To show the application of tensor normal distribution, we consider the 3-order tensor. For our convenience, we use ( i , j ) -value to denote the value related to the ith subject at jth measurement for any ( i , j ) ∈ [ m ] &#215; [ n ] . For example, the kth response observation Y i j k at ( i , j ) represents the kth response value measured on the ith subject at time j. Now we let m , n , p be respectively the number of observed objects, number of measuring times for each subject, and the number of responses for each observation. Denote by Y the response tensor with Y i j k being the kth response at ( i , j ) , and by X the covariate tensor with X i j : ∈ ℝ r being the covariate vector at ( i , j ) for fixed effects, and by U i j : the covariate vector at ( i , j ) for random effects. Further, for each k ∈ [ p ] , we denote by B k ∈ ℝ r the coefficient vector related to the fixed effects corresponding to the kth response Y i j k at ( i , j ) for each pair ( i , j ) ∈ [ m ] &#215; [ n ] , and similarly by C k ∈ ℝ q the coefficient vector related to the random effects. Now let B = [ B 1 , ⋯ , B p ] and γ = [ C 1 , ⋯ , C p ] . Then B ∈ ℝ r &#215; p , γ ∈ ℝ q &#215; p . We call X , U respectively the design matrix for fixed effects and the design matrix for random effects. Then we have</p><disp-formula id="scirp.82800-formula31"><label>(4.23)</label><graphic position="anchor" xlink:href="//html.scirp.org/file/3-2230151x318.png"  xlink:type="simple"/></disp-formula><p>with Y = ( Y i j k ) ∈ ℝ m &#215; n &#215; p , X = ( X i j k ) ∈ ℝ m &#215; n &#215; r , B = ( β i j ) ∈ ℝ r &#215; p , U = ( U i j k ) ∈ ℝ m &#215; n &#215; q , γ = ( γ i j ) ∈ ℝ q &#215; p , ε = ( ε i j k ) ∈ ℝ m &#215; n &#215; p where ε i j k is the error term. Here the tensor-matrix multiplications X B and <inline-formula><inline-graphic xlink:href="/html.scirp.org/file/3-2230151x327.png" xlink:type="simple"/></inline-formula> are defined by</p><p>( X B ) i j k = X ( i , j , : ) B k = ∑ k ′ = 1 r X i j k ′ β k ′ k , ∀ i ∈ [ m ] , j ∈ [ n ] , k ∈ [ p ]</p><p><inline-formula><inline-graphic xlink:href="/html.scirp.org/file/3-2230151x329.png" xlink:type="simple"/></inline-formula>.</p><p>Denote B = [ β 1 , ⋯ , β p ] , γ = [ γ 1 , ⋯ , γ p ] , and</p><p>E i ( 1 ) : = E ( i , : , : ) , E j ( 2 ) : = E ( : , j , : ) , E k ( 3 ) : = E ( : , : , k ) , ∀ i ∈ [ m ] , j ∈ [ n ] , k ∈ [ p ]</p><p>E j k ( 1 ) : = E ( : , j , k ) , E i k ( 2 ) : = E ( i , : , k ) , E i j ( 3 ) : = E ( i , j , : ) , ∀ i ∈ [ m ] , j ∈ [ n ] , k ∈ [ p ]</p><p>where each matrix E l ( s ) is called a slice on mode s, and each vector E l t [ s ] is called a fibre along mode-s. We also use E [ s ] to denote the set consisting of all fibres of ε along mode-s, and use notation E [ s ] ∼ P to express that each element of E [ s ] obeys distribution P where P is a distribution function. For example, E [ 1 ] ∼ N m ( 0 , I m ) means that each 1-mode fibre E j k [ 1 ] (there are n p 1-mode fibres) obeys a standard normal distribution, i.e., E j k [ 1 ] ∼ N m ( 0 , I m ) .</p><p>Now for convenience we let ( n 1 , n 2 , n 3 ) : = ( m , n , p ) . We assume that</p><p>1) γ obeys matrix normal distribution γ ∼ N q , p ( 0 , Σ , ϕ ) .</p><p>2) The random vectors in <inline-formula><inline-graphic xlink:href="/html.scirp.org/file/3-2230151x346.png" xlink:type="simple"/></inline-formula> are independent with <inline-formula><inline-graphic xlink:href="/html.scirp.org/file/3-2230151x347.png" xlink:type="simple"/></inline-formula> for each<inline-formula><inline-graphic xlink:href="/html.scirp.org/file/3-2230151x348.png" xlink:type="simple"/></inline-formula>.</p><p>3) For any <inline-formula><inline-graphic xlink:href="/html.scirp.org/file/3-2230151x349.png" xlink:type="simple"/></inline-formula> with <inline-formula><inline-graphic xlink:href="/html.scirp.org/file/3-2230151x350.png" xlink:type="simple"/></inline-formula> being positive definite of size<inline-formula><inline-graphic xlink:href="/html.scirp.org/file/3-2230151x351.png" xlink:type="simple"/></inline-formula>.</p><p>The model (4.23) with conditions (I, II, III) is called a 3-order general mixed tensor (GMT) model. We will generalise this model to a more general case. In the following we first define the standard normal 3-order tensor distribution:</p><p>Definition 4.1. Let <inline-formula><inline-graphic xlink:href="/html.scirp.org/file/3-2230151x352.png" xlink:type="simple"/></inline-formula> be a random tensor, i.e., each entry of <inline-formula><inline-graphic xlink:href="/html.scirp.org/file/3-2230151x353.png" xlink:type="simple"/></inline-formula> is a random variable. Let <inline-formula><inline-graphic xlink:href="/html.scirp.org/file/3-2230151x353.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x354.png" xlink:type="simple"/></inline-formula> be a constant tensor and <inline-formula><inline-graphic xlink:href="/html.scirp.org/file/3-2230151x353.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x354.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x355.png" xlink:type="simple"/></inline-formula> be a positive definite matrix for each<inline-formula><inline-graphic xlink:href="/html.scirp.org/file/3-2230151x353.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x354.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x355.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x356.png" xlink:type="simple"/></inline-formula>. Then <inline-formula><inline-graphic xlink:href="/html.scirp.org/file/3-2230151x353.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x354.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x355.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x356.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x357.png" xlink:type="simple"/></inline-formula> is said to obey 3-order tensor standard normal (TSN) distribution if <inline-formula><inline-graphic xlink:href="/html.scirp.org/file/3-2230151x353.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x354.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x355.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x356.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x357.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x358.png" xlink:type="simple"/></inline-formula> for all<inline-formula><inline-graphic xlink:href="/html.scirp.org/file/3-2230151x353.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x354.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x355.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x356.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x357.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x358.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x359.png" xlink:type="simple"/></inline-formula>.</p><p>A 3-order random tensor satisfying TSN distribution has the following property:</p><p>Theorem 4.2. Let <inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x360.png" xlink:type="simple"/></inline-formula> be an 3-order random tensor which obeys the tensor standard normal (TSN) distribution. Then each slice of <inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x360.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x361.png" xlink:type="simple"/></inline-formula> shall obey a standard matrix normal distribution. Specifically, we have</p><disp-formula id="scirp.82800-formula32"><label>(4.24)</label><graphic position="anchor" xlink:href="//html.scirp.org/file/3-2230151x362.png"  xlink:type="simple"/></disp-formula><p>Proof. □</p><p>Note that condition (III) is a generalization to the matrix normal distribution, and we denote it by<inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x363.png" xlink:type="simple"/></inline-formula>,</p><p><inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x364.png" xlink:type="simple"/></inline-formula>.</p><p>Note that <inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x365.png" xlink:type="simple"/></inline-formula> is a diagonal matrix since both <inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x365.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x366.png" xlink:type="simple"/></inline-formula> and <inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x365.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x366.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x367.png" xlink:type="simple"/></inline-formula> are diagonal. Write <inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x365.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x366.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x367.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x368.png" xlink:type="simple"/></inline-formula> where <inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x365.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x366.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x367.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x368.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x369.png" xlink:type="simple"/></inline-formula> is the expansion of <inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x365.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x366.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x367.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x368.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x369.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x370.png" xlink:type="simple"/></inline-formula> along the third mode, specifically,</p><disp-formula id="scirp.82800-formula33"><graphic  xlink:href="//html.scirp.org/file/3-2230151x371.png"  xlink:type="simple"/></disp-formula><p>here <inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x372.png" xlink:type="simple"/></inline-formula> is the tensor consisting of n identity matrices of size <inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x372.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x373.png" xlink:type="simple"/></inline-formula> stacking along the third mode, thus <inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x372.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x373.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x374.png" xlink:type="simple"/></inline-formula> and<inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x372.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x373.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x374.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x375.png" xlink:type="simple"/></inline-formula>. Then we have</p><disp-formula id="scirp.82800-formula34"><label>. (4.25)</label><graphic position="anchor" xlink:href="//html.scirp.org/file/3-2230151x376.png"  xlink:type="simple"/></disp-formula><p>Now we unfold <inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x377.png" xlink:type="simple"/></inline-formula> along mode-3 to get matrix <inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x377.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x378.png" xlink:type="simple"/></inline-formula> and <inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x377.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x378.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x379.png" xlink:type="simple"/></inline-formula> respectively. Then Equation (4.25) is equivalent to</p><disp-formula id="scirp.82800-formula35"><label>(4.26)</label><graphic position="anchor" xlink:href="//html.scirp.org/file/3-2230151x380.png"  xlink:type="simple"/></disp-formula><p>where <inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x381.png" xlink:type="simple"/></inline-formula> and <inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x381.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x382.png" xlink:type="simple"/></inline-formula> are generated similarly as<inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x381.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x382.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x383.png" xlink:type="simple"/></inline-formula>.</p><p>The multivariate linear mixed model (4.23) or (4.25) can be transformed into a general linear model through the vectorization of matrices. Recall that the vectorization of a matrix <inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x384.png" xlink:type="simple"/></inline-formula> is a vector of dimension<inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x384.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x385.png" xlink:type="simple"/></inline-formula>, denoted by<inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x384.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x385.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x386.png" xlink:type="simple"/></inline-formula>, formed by vertically stacking the columns of A in order, that is,</p><disp-formula id="scirp.82800-formula36"><graphic  xlink:href="//html.scirp.org/file/3-2230151x387.png"  xlink:type="simple"/></disp-formula><p>where <inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x388.png" xlink:type="simple"/></inline-formula> are the column vectors of A. The vectorization is closely related to Kronecker product of matrices. The following lemma presents some basic properties of the vectorization and Kronecker product. We will use the following lemma (Proposition 1.3.14 on Page 89 of [<xref ref-type="bibr" rid="scirp.82800-ref4">4</xref>] ) to prove our main result:</p><p>Lemma 4.3. Let <inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x389.png" xlink:type="simple"/></inline-formula> be matrices of appropriate sizes such that all the operations defined in the following are valid. Then</p><p>1)<inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x390.png" xlink:type="simple"/></inline-formula>.</p><p>2)<inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x391.png" xlink:type="simple"/></inline-formula>.</p><p>3)<inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x392.png" xlink:type="simple"/></inline-formula>.</p><p>The following property of the multiplication of a tensor with a matrix is shown by Kolda [<xref ref-type="bibr" rid="scirp.82800-ref18">18</xref>] and will be used to prove our main result.</p><p>Lemma 4.4. Let <inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x393.png" xlink:type="simple"/></inline-formula> be a real tensor of size<inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x393.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x394.png" xlink:type="simple"/></inline-formula>, and <inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x393.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x394.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x395.png" xlink:type="simple"/></inline-formula> where<inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x393.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x394.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x395.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x396.png" xlink:type="simple"/></inline-formula>. Then <inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x393.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x394.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x395.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x396.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x397.png" xlink:type="simple"/></inline-formula> if and only if</p><disp-formula id="scirp.82800-formula37"><label>(4.27)</label><graphic position="anchor" xlink:href="//html.scirp.org/file/3-2230151x398.png"  xlink:type="simple"/></disp-formula><p>where <inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x399.png" xlink:type="simple"/></inline-formula> are respectively the flattened matrices of <inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x399.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x400.png" xlink:type="simple"/></inline-formula> and <inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x399.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x400.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x401.png" xlink:type="simple"/></inline-formula> along mode-n.</p><p>Proof. Let<inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x402.png" xlink:type="simple"/></inline-formula>. Then for any<inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x402.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x403.png" xlink:type="simple"/></inline-formula>, we have</p><disp-formula id="scirp.82800-formula38"><label>. (4.28)</label><graphic position="anchor" xlink:href="//html.scirp.org/file/3-2230151x404.png"  xlink:type="simple"/></disp-formula><p>From which the result Equation (4.27) is immediate. □</p><p>Note that our Formula (4.27) is different from that in Section 2.5 in [<xref ref-type="bibr" rid="scirp.82800-ref18">18</xref>] since the definition of tensor-matrix multiplication is different.</p><p>We have the following result for the estimation of the parameter matrix<inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x405.png" xlink:type="simple"/></inline-formula>:</p><p>Theorem 4.5. Suppose <inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x406.png" xlink:type="simple"/></inline-formula> in Equation (4.23). Then the optimal estimation of the parameter matrix <inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x406.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x407.png" xlink:type="simple"/></inline-formula> in Equation (4.23) is</p><disp-formula id="scirp.82800-formula39"><label>. (4.29)</label><graphic position="anchor" xlink:href="//html.scirp.org/file/3-2230151x408.png"  xlink:type="simple"/></disp-formula><p>Proof. We first write Equation (4.25) in a matrix-vector form by vectorization by using the first item in Lemma 4.3,</p><disp-formula id="scirp.82800-formula40"><label>(4.30)</label><graphic position="anchor" xlink:href="//html.scirp.org/file/3-2230151x409.png"  xlink:type="simple"/></disp-formula><p>where <inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x410.png" xlink:type="simple"/></inline-formula> is the sum of two random terms. By the property of the vectorizations (see e.g. [<xref ref-type="bibr" rid="scirp.82800-ref4">4</xref>] ), we know that<inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x410.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x411.png" xlink:type="simple"/></inline-formula>. By the ordinary least square solution method we get</p><disp-formula id="scirp.82800-formula41"><graphic  xlink:href="//html.scirp.org/file/3-2230151x412.png"  xlink:type="simple"/></disp-formula><p>By using (1) of Lemma 4.3 again (this time in the opposite direction), we get result (4.29). □</p><p>For any<inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x413.png" xlink:type="simple"/></inline-formula>, we denote <inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x413.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x414.png" xlink:type="simple"/></inline-formula> when<inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x413.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x414.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x415.png" xlink:type="simple"/></inline-formula>, and <inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x413.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x414.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x415.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x416.png" xlink:type="simple"/></inline-formula> when <inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x413.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x414.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x415.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x416.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x417.png" xlink:type="simple"/></inline-formula> (<inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x413.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x414.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x415.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x416.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x417.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x418.png" xlink:type="simple"/></inline-formula>stands for an g-inverse of a matrix<inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x413.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x414.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x415.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x416.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x417.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x418.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x419.png" xlink:type="simple"/></inline-formula>). Then <inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x413.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x414.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x415.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x416.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x417.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x418.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x419.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x420.png" xlink:type="simple"/></inline-formula> can be regarded as the projection from <inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x413.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x414.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x415.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x416.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x417.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x418.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x419.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x420.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x421.png" xlink:type="simple"/></inline-formula> into <inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x413.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x414.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x415.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x416.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x417.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x418.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x419.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x420.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x421.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x422.png" xlink:type="simple"/></inline-formula> since<inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x413.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x414.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x415.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x416.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x417.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x418.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x419.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x420.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x421.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x422.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x423.png" xlink:type="simple"/></inline-formula>. Furthermore, we have<inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x413.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x414.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x415.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x416.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x417.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x418.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x419.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x420.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x421.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x422.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x423.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x424.png" xlink:type="simple"/></inline-formula>. We now end the paper by presenting the following result as a pre diction model which follows directly from Theorem 4.5.</p><p>Theorem 4.6. Suppose <inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x425.png" xlink:type="simple"/></inline-formula> in Equation (4.23). Then the mean of the response tensor <inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x425.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x426.png" xlink:type="simple"/></inline-formula> in Equation (4.23) is</p><disp-formula id="scirp.82800-formula42"><label>(4.31)</label><graphic position="anchor" xlink:href="//html.scirp.org/file/3-2230151x427.png"  xlink:type="simple"/></disp-formula><p>where<inline-formula><inline-graphic xlink:href="//html.scirp.org/file/3-2230151x428.png" xlink:type="simple"/></inline-formula>.</p><p>Proof. By Theorem 4.5 and Equation (4.26), we have</p><disp-formula id="scirp.82800-formula43"><graphic  xlink:href="//html.scirp.org/file/3-2230151x429.png"  xlink:type="simple"/></disp-formula><p>It follows that</p><disp-formula id="scirp.82800-formula44"><label>. (4.32)</label><graphic position="anchor" xlink:href="//html.scirp.org/file/3-2230151x430.png"  xlink:type="simple"/></disp-formula><p>By employing Lemma 4.4, we get result (4.31). □</p></sec><sec id="s5"><title>Acknowledgements</title><p>This research was partially supported by the Hong Kong Research Grant Council (No. PolyU 15301716) and the graduate innovation funding of USTS. We shall thank the anonymous referees for their patient and elaborate reading and their suggestions which improved the writing of the paper.</p></sec><sec id="s6"><title>Cite this paper</title><p>Lin, Z.R., Liu, D.Z., Liu, X.Y., He, L.L. and Xu, C.Q. (2018) High Order Tensor Forms of Growth Curve Models. Advances in Linear Algebra &amp; Matrix Theory, 8, 18-32. https://doi.org/10.4236/alamt.2018.81003</p></sec></body><back><ref-list><title>References</title><ref id="scirp.82800-ref1"><label>1</label><mixed-citation publication-type="other" xlink:type="simple">Bilodeau, M. and Brenner, D. (1961) Theory of Multivariate Statistics. Springer, New York.</mixed-citation></ref><ref id="scirp.82800-ref2"><label>2</label><mixed-citation publication-type="other" xlink:type="simple">Bollen, K.A. and Curran, P.J. (2006) Latent Curve Models: A Structure Equation Perspective. Wiley, Hoboken.</mixed-citation></ref><ref id="scirp.82800-ref3"><label>3</label><mixed-citation publication-type="other" xlink:type="simple">Bryk, A.S. and Raudenbush, S.W. (1987) Application of Hierarchical Linear Models to Assessing Change. Psychological Bulletin, 101, 147-158. https://doi.org/10.1037/0033-2909.101.1.147</mixed-citation></ref><ref id="scirp.82800-ref4"><label>4</label><mixed-citation publication-type="other" xlink:type="simple">Kollo, T. and Rosen, D. (2005) Advanced Multivariate Statistics with Matrices. Springer, New York. https://doi.org/10.1007/1-4020-3419-9</mixed-citation></ref><ref id="scirp.82800-ref5"><label>5</label><mixed-citation publication-type="other" xlink:type="simple">Raudenbush, S.W. and Bryk, A.S. (2002) Hierarchical Linear Models: Applications and Data Analysis Methods 2. Sage Publications, Thousand Oaks, CA.</mixed-citation></ref><ref id="scirp.82800-ref6"><label>6</label><mixed-citation publication-type="other" xlink:type="simple">Bauer, D.J. (2007) Observations on the Use of Growth Mixture Models in Psychological Research. Multivariate Behavioral Research, 42, 757-786. https://doi.org/10.1080/00273170701710338</mixed-citation></ref><ref id="scirp.82800-ref7"><label>7</label><mixed-citation publication-type="other" xlink:type="simple">Bollen, K.A. and Curran, P.J. (2004) Autoregressive Latent Trajectory (ALT) Models: A Synthesis of Two Traditions. Sociological Methods and Research, 32, 336-383. https://doi.org/10.1177/0049124103260222</mixed-citation></ref><ref id="scirp.82800-ref8"><label>8</label><mixed-citation publication-type="other" xlink:type="simple">Coffman, D.L. and Millsap, R.E. (2006) Evaluating Latent Growth Curve Models Using Individual Fit Statistics. Structural Equation Modeling, 13, 1-27. https://doi.org/10.1207/s15328007sem1301_1</mixed-citation></ref><ref id="scirp.82800-ref9"><label>9</label><mixed-citation publication-type="other" xlink:type="simple">Hedeker, D. and Gibbons, R. (2006) Longitudinal Data Analysis. Wiley Inc., New York.</mixed-citation></ref><ref id="scirp.82800-ref10"><label>10</label><mixed-citation publication-type="other" xlink:type="simple">Little, R.J. and Rubin, D.B. (1987) Statistical Analysis with Missing Data. Wiley Inc., New York.</mixed-citation></ref><ref id="scirp.82800-ref11"><label>11</label><mixed-citation publication-type="other" xlink:type="simple">Singer, J.D. (1998) Using SAS Proc Mixed to Fit Multilevel Models, Hierarchical Models, and Individual Growth Models. Journal of Educational and Behavioral Statistics, 23, 323-355. https://doi.org/10.3102/10769986023004323</mixed-citation></ref><ref id="scirp.82800-ref12"><label>12</label><mixed-citation publication-type="other" xlink:type="simple">Singer, J.D. and Willett, J.B. (2003) Applied Longitudinal Data Analysis: Modeling Change and Event Occurrence. Oxford University Press, New York.https://doi.org/10.1093/acprof:oso/9780195152968.001.0001</mixed-citation></ref><ref id="scirp.82800-ref13"><label>13</label><mixed-citation publication-type="other" xlink:type="simple">Hastie, T., Tibshirani, R. and Friedman, J. (2009) The Elements of Statistical Learning. 2nd Edition, Springer, New York. https://doi.org/10.1007/978-0-387-84858-7</mixed-citation></ref><ref id="scirp.82800-ref14"><label>14</label><mixed-citation publication-type="book" xlink:type="simple">Bollen, K.A. (2007) On the Origins of Latent Curve Models. In: Cudeck, R. and MacCallum, R., Eds., Factor Analysis, at 100, Lawrence Erlbaum Associates, Mahwah, NJ, 79-98.</mixed-citation></ref><ref id="scirp.82800-ref15"><label>15</label><mixed-citation publication-type="other" xlink:type="simple">Comon, P., Golub, G., Lim, L.-H. and Mourrain, B. (2008) Symmetric Tensors and Symmetric Tensor Rank. SIAM Journal on Matrix Analysis and Applications, 30, 1254-1279. https://doi.org/10.1137/060661569</mixed-citation></ref><ref id="scirp.82800-ref16"><label>16</label><mixed-citation publication-type="other" xlink:type="simple">Wishart, J. (1938) Growth Rate Determinations in Nutrition Studies with the Bacon Pig and Their Analysis. Biometrika, 30, 16-28. https://doi.org/10.1093/biomet/30.1-2.16</mixed-citation></ref><ref id="scirp.82800-ref17"><label>17</label><mixed-citation publication-type="other" xlink:type="simple">McArdle, J.J. (2009) Latent Variable Modeling of Differences and Changes with Longitudinal Dynamic Structural Analysis. Annual Review of Psychology, 60, 577-605. https://doi.org/10.1146/annurev.psych.60.110707.163612</mixed-citation></ref><ref id="scirp.82800-ref18"><label>18</label><mixed-citation publication-type="other" xlink:type="simple">Kolda, T.G. and Bader, B.W. (2009) Tensor Decompositions and Applications. SIAM Review, 51, 455-500. https://doi.org/10.1137/07070111X</mixed-citation></ref></ref-list></back></article>