Mehrotra-Type Predictor-Corrector Algorithms for Symmetric Cone Programming in a Wide Neighborhood of the Central Path ()
1. Introduction
Symmetric cone optimization (SCO) problem is a convex optimization problem, minimization of a linear function over the intersection of an affine subspace and a symmetric cone. The SCO problem is considered to be an important class of convex optimization problems, because it includes linear optimization (LO), second-order cone optimization (SOCO), and semidefinite optimization (SDO) as special cases. In recent years, these problems have attracted the attention of many scholars in optimization.
Among various approaches for solving SCO problem, interior-point methods (IPMs) are the most popular algorithms and have been widely used to obtain the best iteration complexity results. The basic idea for solving SCO problems using IPMs was first proposed by Nesterov and Nemirovskii [1]. Faybusovich [2] extended a primal-dual IPM for SDO to SCO using the Euclidean Jordan algebra. Consequently, several efficient IPMs for SCO have been developed; see, e.g., [3]-[10].
Predictor-corrector methods, the most studied variants of interior-point methods, are being used as Mehrotra’s predictor-corrector algorithm [11] in a number of optimization packages. In these methods, by solving linear systems with the same coefficient matrix, three directions are produced: predictor, corrector and centering. The moving direction is determined by a combination of these directions. Zhang and Zhang [12] first established convergence theory and complexity bounds for two Mehrotra-type second-order algorithms. Later, Zhang and Zhang [13] extended the idea of [12] to SCO and showed the complexity bound of
for the Nesterov-Todd search direction. Many primary IPMs are based on the small neighborhood or the negative infinity large neighborhood. Among all existing path-following interior-point algorithms, the theoretical iteration complexity of wide neighborhood algorithms is the worst, but in practice, these algorithms perform better than small neighborhood algorithms. Recently, Ai and Zhang [14] considered an IPM for linear complementarity problem (LCP), which works in a wide neighborhood and has the same complexity as the best known small neighborhood IPMs. Applying an interesting idea, they decomposed the classical Newton search direction into the nonnegative and nonpositive parts corresponding to the nonnegative and the nonpositive parts of the right-hand side. They showed that if these two directions are equipped with different and appropriate step sizes, then the method enjoys the low iteration bound of
. Liu et al. [15] modified the Ai-Zhang’s primal-dual path-following interior-point method of [14] to gain a class of Mehrotra-type predictor-corrector algorithm for monotone LCPs. Here, we are to modify the Ai-Zhang’s primal-dual interior point method to present two Mehrotra-type predictor-corrector algorithms. First, we present a Mehrotra-type predictor-corrector algorithm generalizing the approach of [15] for LCP to SCO. Then, we propose the second Mehrotra-type predictor-corrector algorithm for SCO, by adding two corrector steps to the Ai-Zhang path-following algorithm [14], to improve the performance. Finally, we show that the use of corrector steps does not impair the worst-case complexity of the algorithm.
2. Preliminaries
Here, we recall briefly the basic concept and useful results of Euclidean Jordan algebra. We refer the reader to [16] for details.
Definition 1. Let
be an
-dimensional vector space over the field of real numbers with a multiplication “
” where the map
is bilinear from
to
. The triple
is a Euclidean Jordan algebra if for all
, we have:
1)
;
2)
, where
;
3)
.
We assume that there exists an identity element
such that
, for all
. The degree of
, denoted by
, is the smallest integer
such that
is linearly dependent. The rank of
, denoted by
, is defined as the maximum
over all
.
An element c is idempotent if
and
. An idempotent
is primitive if it is nonzero and can not be expressed by sum of two other nonzero idempotents. Two idempotents
and
are said to be pairwise orthogonal if
. A set of orthogonal primitive idempotents
is called a Jordan frame if
,
, and
. For
, let
be the smallest positive integer such that
is linearly dependent;
is called the degree of
and is denoted by
. The rank of
, denoted by
, is defined by
.
A cone is symmetric if and only if it is the cone of squares of a Euclidean Jordan algebra, which is denoted by
. For a given
, the Lyapunov transformation
is defined by
, for every
. Furthermore, we define the quadratic representation of
in
by
, where
.
The spectral decomposition theorem (Theorem III.1.2 of [16]) of a Euclidean Jordan algebra
states that for any
, there exists a Jordan frame
and real numbers
(the eigenvalues of
) such that
. Some generated functions by the eigenvalues of
are: the Frobenius norm on
is defined by
, and
. Moreover, since
is a symmetric positive definite quadratic form, the inner product can be defined as
. The inverse is
, whenever all
. Accordingly,
, where
, for
. We define
. We call
positive semidefinite (positive definite), denoted by
, if all eigenvalues of
are nonnegative (positive). Moreover,
if and only if
.
3. The SCO Problem and Neighborhood of Central Path
Let
be a Euclidean Jordan algebra of dimension
with rank
, and
be the cone of squares corresponding to
. Consider the following primal and dual problems in the standard forms:
(P)
and
(D)
where
,
,
is a linear operator that maps
into
and
is its adjoint operator such that
, for all
.
The primal-dual feasible set is defined as
and we assume that the interior of the primal-dual feasible set,
is nonempty; that is, the primal-dual pair of the SCO problems satisfies the interior-point condition (IPC). Under the IPC, finding an optimal solution of (P) and (D) is equivalent to solving the following system [17]:
(1)
The basic idea of a path-following interior-point algorithm is to replace the third equation in (1), the so-called complementarity condition, by the parameterized equation
, for
, and
. Thus, we consider the following system:
(2)
Due to the IPC, a unique solution
of the perturbed system (2) exists for each
[17]. This solution is called the
-center for the SCO problem. The set of
-centers provides a path, which is called the central path of the SCO problem. The
-center approximates an optimal solution of the given problem pair as
goes to zero. Applying the Newton method for the perturbed system of (2), we get the following equations:
(3)
It has been shown that the reason for the system (3) not having a unique solution is the fact that
and
do not operator commute, in general; that is,
.
To overcome this defect, we use the scaling scheme based on the following fact (Lemma 28 of [6]). Let
be invertible. Then,
if and only if
.
In our work here, we choose
which gives the Nesterov-Todd (NT) direction. For the NT direction [18], we have
. The system (3) can be rewritten as follows:
(4)
where
,
,
and
.
Most interior point algorithms work in the classical negative infinity large neighborhood, defined by
where
and
. Our proposed algorithms start from a feasible point in a wide neighborhood of the central path due to [14] [19], as follows:
where
.
This neighborhood is even larger than the
neighborhood, since it can be verified that
(5)
4. Mehrotra Predictor-Corrector Algorithms
Here, we describe our algorithms in detail. In accordance with the Ai-Zhang’s original idea [14], we decompose the Newton direction into two separate parts corresponding to the positive and negative parts of the right-hand side of the third equality of the system (4); i.e.,
. Therefore, in our algorithms, we compute the predictor directions
and
respectively by solving the following two systems:
(6)
and
(7)
4.1. First Algorithm and Its Convergence Analysis
We present a Mehrotra predictor-corrector path-following algorithm for SCO problems based on Liu et al.’s idea for LCP [15]. For this, we compute the maximum step size
that ensures
(8)
Using the predictor step obtained by solving (6), we compute the corrector direction
by solving the following system:
(9)
Let
give the step lengths. Then, we define a combination of directions as follows:
Finally, the new iterate is:
(10)
Since
, the duality gap corresponding to the new iterate can be written as
(11)
We now describe the Mehrotra predictor-corrector method for the SCO problem as Algorithm 1.
Algorithm 1. The Mehrotra predictor-corrector method for solving the SCO problem (first method).
Accuracy parameter
; neighborhood parameters
and
; the initial point
with
. Step 0: Set
. Step 1: If
then stop. Step 2: Choose a Nesterov-Todd scaling element
. Step 3: Compute the directions , by solving (6) and by solving (7). Step 4: Find the maximum feasible step length
by (8). Step 5: Compute the direction by solving (9). Step 6: Find the step lengths
giving a sufficient reduction of the duality gap and assuring
. Step 7: Let
,
and
. Step 8: Set
and go to Step 1. |
Next, we present an analysis of Algorithm 1. Some of the obtained results will also be useful for the analysis of the second algorithm.
Lemma 1 (Lemma 2.13 of [20]). Let
with
. Then, we have:
Lemma 2 (Lemma 5.12 of [21]). If
, then we have:
Lemma 3 (Lemma 5.1 of [7]). Let
. Then,
Lemma 4 (Lemma 5.2 of [7]). Let
. Then,
Lemma 5. Let
, and
be the solution of (6). Then, we have:
1)
,
2)
.
Proof. Let
for some Jordan frame
and let the spectral eigenvalues satisfy
(12)
Thus, since
and
are operator commute, we have:
(13)
The third equation of (6) can be written as
Then, using the fact that
, we have:
(14)
By (13) and (14), we have:
(15)
This relationship along with
gives the first part of the lemma.
From (8), we have:
(16)
On the other hand, it is trivial to verify
This equality together with (16) implies:
(17)
which completes the proof. 
Let
have the spectral decomposition
, where
is a Jordan frame. Moreover, we may write
and
. Then, by (15), we have:
(18)
Lemma 6. Let
, and
be the solution of (6). Then, we have:
Proof. Using Lemma 5,
, and (18), we have:
(19)
Moreover, by Lemma 5 and (18), we have:
(20)
Now, using (19) and (20), we have:
which completes the proof. 
Lemma 7. Let
and
. Then, we have:
1)
,
2)
.
Proof. From (13), we obtain:
which implies the first part of the lemma.
Similarly, making use of the proof of (13) and since
by Lemma 3, we have:
where the last inequality follows from
. This completes the proof for the second part. 
Lemma 8. Let
and
. If
, then we have:
Proof. Since
, we have:
where the second inequality follows from Lemmas 6 and 7, the properties of
and the third inequality uses the fact that
. The proof is now complete. 
For simplicity of notations, we define:
(21)
and
(22)
Using the Cauchy-Schwarz inequality and Lemma 3, and the facts that
and
, we have:
(23)
Next, using the definition of
given by (21), we further have the following lemma.
Lemma 9. If we choose
, then we have
.
Proof. From (11) and (21), we obtain:
which completes the proof. 
Lemma 10. Let
be defined as (22). Then, the eigenvalues of
are nonnegative.
Proof. By using (14) and following the proof of (13), we obtain:
(24)
where the second inequality is due to the fact
,
, and the last inequality follows from
,
. Now, it is easy to see that the eigenvalues of
are nonnegative, and the proof is complete. 
Lemma 11. Let
,
and
. If
, then we have:
Proof. From Lemma 9, we have
. This, together with (24) and Lemma 10, gives
(25)
By using (25) and Lemma 3, we have:
which completes the proof. 
The next lemma shows that
, for
Lemma 12. Let
,
and
. If
then we have
.
Proof. It is easy to see
(26)
Using this fact and Lemma 4, we have:
Now, together with (11), we have:
(27)
On the other hand, using Lemmas 8 and 11, we have:
where the third inequality follows from (27). The proof is now complete. 
The following lemma gives an upper bound for the duality gap, which plays an important role in the analysis of Algorithm 1.
Lemma 13. Let
,
and
. If
, then we have:
Proof. From (26), it is easy to see
. Together with (11), we obtain:
(28)
which completes the proof. 
Using Lemma 13, the main result of Algorithm 4.1 is obtained as follows.
Theorem 14. If
and
, then the iteration complexity of Algorithm 1 is:
4.2. Second Algorithm and Its Convergence Analysis
Based on Algorithm 1, a new variant of Mehrotra-type predictor-corrector algorithm for SCO will be described. The predictor step will be the same as in Algorithm 1. The deficiency of Algorithm 1 is that is ignored in the corrector step. To remedy, we add the following system to the corrector step of Algorithm 1:
(29)
After deciding the step lengths
, the iterates are updated as follows:
(30)
where
Since
, we have:
(31)
In most interior point methods, the factors and are ignored; however, here we make use of these factors to reduce the duality gap faster.
Next, we describe the general framework of our second algorithm for the SCO problem.
Algorithm 2. The Mehrotra predictor-corrector method for solving the SCO problem (second method).
Inputs: Accuracy parameter
; neighborhood parameters
and
; the initial point
with
. Step 0: Set
. Step 1: If
then stop. Step 2: Choose a Nesterov-Todd scaling element
. Step 3: Compute the directions by solving (6) and by solving (7). Step 4: Find the maximum feasible step length
by (8). Step 5: Compute the directions by solving (9) and by solving (29). Step 6: Find the step lengths
giving a sufficient reduction of the duality gap and assuring
. Step 7: Let
,
and
. Step 8: Set
and go to Step 1. |
We now proceed to show the complexity of Algorithm 2. Since the proof techniques are the same as in Algorithm 1, we only give a brief outline of the proofs and omit the details.
Lemma 15. Let
be the solution of (7) and
. Then, we have:
Proof. Multiplying the third equation of (7) by
, by Lemmas 1 and 7, we have:
(32)
Similar to the proof of Lemma 2 of [13] and by (32), we have:
which completes the proof. 
Similar to Lemma 8 and by Lemma 15, we give the following lemma, which gives a bound on .
Lemma 16. Let
,
and
. If
, then we have:
Now, for simplicity, we use the following notation:
(33)
Lemma 17. Let
be defined as (33) and
. Then, the eigenvalues of
are nonnegative.
Proof. Since
, we have:
(34)
Using (34), Lemma 16 and similar arguments as in the proof of Lemma 10, we obtain:
(35)
where the third inequality follows from Lemma 3, the fourth inequality follows from the fact that
,
, and the last inequality follows from
. Therefore, we conclude that the eigenvalues of
are nonnegative, and this completes the proof. 
Lemma 18. Let
,
and
. If
, then we have:
Proof. If
, then from (31) and Lemma 9, we have
. Therefore, by (35), (28), Lemmas 2 and 17, we obtain:
where the fourth inequality follows from the fact that
and the last inequality follows from Lemma 3. The proof of lemma is now complete. 
Lemma 19. Let
,
and
. If
, then
.
Proof. From Lemmas 2, 16 and 18, we have:
where the last inequality follows from (27). This implies
, which completes the proof. 
Now, using Lemma 13, since
, we are led to the following iteration complexity of Algorithm 2.
Theorem 20. If
and
, then the iteration complexity of Algorithm 2 is:
5. Numerical Testing
Here we compare our two proposed algorithms (Alg. 1 and Alg. 2) with Algorithm 3 (Alg. 3) of [9] and Algorithm 4 (Alg. 4) of [5], using MATLAB R2017b on an Intel Core i7 (2.20 GHz) with 8 GB RAM. We also solve the numerical examples with SeDuMi [22] and display the results. In order to test the algorithms, we use some SDO problems of [23]. These test problems are random semidefinite programming problems (Table 1 and Table 2), educational testing problems (Table 3 and Table 4), norm minimization problems (Table 5 and Table 6) and max-cut problems (Table 7 and Table 8). The parameters were set to be
,
and
.
In Tables 1-8, we list the number of the constraints (m), the dimension of the blocks (n) and the obtained average number of iterations and the average CPU time in seconds. In each instance, we run the algorithm ten times for the same m and n and the average results are shown in the tables.
Table 1. Average number of iterations on Random problems.
m |
n |
SeDuMi |
Alg. 1 |
Alg. 2 |
Alg. 3 |
Alg. 4 |
30 |
30 |
0.11 |
0.20 |
0.20 |
0.85 |
2.07 |
40 |
45 |
0.19 |
0.77 |
0.83 |
1.26 |
5.51 |
50 |
50 |
0.16 |
1.09 |
0.97 |
2.83 |
6.16 |
70 |
80 |
0.57 |
2.93 |
3.04 |
4.07 |
11.48 |
100 |
100 |
1.16 |
4.39 |
4.84 |
9.82 |
25.58 |
Table 2. Average CPU times on Random problems.
m |
n |
SeDuMi |
Alg. 1 |
Alg.2 |
Alg. 3 |
Alg. 4 |
30 |
30 |
13.0 |
13.4 |
13.3 |
14.8 |
18.4 |
40 |
45 |
12.8 |
13.5 |
13.0 |
15.4 |
23.0 |
50 |
50 |
13.5 |
14.8 |
14.6 |
16.1 |
25.1 |
70 |
80 |
13.9 |
17.1 |
16.7 |
22.5 |
27.8 |
100 |
100 |
14.8 |
21.3 |
19.6 |
28.4 |
34.2 |
Table 3. Average number of iterations on educational test problems.
m |
n |
SeDuMi |
Alg. 1 |
Alg. 2 |
Alg. 3 |
Alg. 4 |
5 |
10 |
12.7 |
14.8 |
14.1 |
15.9 |
22.2 |
20 |
40 |
14.6 |
35.5 |
34.6 |
38.1 |
46.3 |
25 |
50 |
19.0 |
46.7 |
46.5 |
52.4 |
72.6 |
40 |
80 |
23.8 |
63.1 |
60.8 |
86.2 |
95.3 |
50 |
100 |
22.6 |
70.8 |
68.4 |
112.5 |
118.0 |
Table 4. Average CPU times on educational test problems.
m |
n |
SeDuMi |
Alg. 1 |
Alg. 2 |
Alg. 3 |
Alg. 4 |
5 |
10 |
0.03 |
0.74 |
0.83 |
1.72 |
2.83 |
20 |
40 |
0.15 |
3.39 |
3.86 |
4.75 |
15.18 |
25 |
50 |
0.30 |
6.41 |
6.84 |
10.63 |
36.07 |
40 |
80 |
0.50 |
12.70 |
11.53 |
18.24 |
44.62 |
50 |
100 |
0.64 |
18.26 |
18.69 |
30.73 |
58.22 |
Table 5. Average number of iterations on norm minimization problems.
m |
n |
SeDuMi |
Alg. 1 |
Alg. 2 |
Alg. 3 |
Alg. 4 |
30 |
30 |
13.5 |
12.7 |
12.7 |
13.1 |
15.6 |
30 |
40 |
13.7 |
13.5 |
12.8 |
17.1 |
23.7 |
50 |
50 |
12.6 |
14.2 |
13.8 |
19.5 |
27.3 |
80 |
90 |
13.6 |
19.3 |
18.8 |
24.6 |
41.3 |
100 |
100 |
14.2 |
23.4 |
23.4 |
48.3 |
53.6 |
Table 6. Average CPU times on norm minimization problems.
m |
n |
SeDuMi |
Alg. 1 |
Alg. 2 |
Alg. 3 |
Alg. 4 |
30 |
30 |
0.65 |
0.11 |
0.39 |
1.37 |
2.34 |
30 |
40 |
0.33 |
0.19 |
0.55 |
2.13 |
5.19 |
50 |
50 |
0.45 |
0.26 |
0.79 |
4.06 |
6.22 |
80 |
90 |
0.99 |
1.62 |
1.55 |
4.77 |
7.19 |
100 |
100 |
1.37 |
3.52 |
3.48 |
6.56 |
9.62 |
Table 7. Average number of iterations on Max-Cut problems.
m |
n |
SeDuMi |
Alg. 1 |
Alg. 2 |
Alg. 3 |
Alg. 4 |
30 |
30 |
12.4 |
13.3 |
12.8 |
14.1 |
18.1 |
45 |
45 |
13.5 |
14.3 |
13.5 |
15.8 |
22.4 |
50 |
50 |
12.7 |
14.7 |
14.6 |
16.1 |
25.8 |
80 |
80 |
12.6 |
18.8 |
18.6 |
27.5 |
44.1 |
100 |
100 |
13.4 |
22.6 |
20.6 |
31.7 |
44.9 |
Table 8. Average CPU times on Max-Cut problems.
m |
n |
SeDuMi |
Alg. 1 |
Alg. 2 |
Alg. 3 |
Alg. 4 |
30 |
30 |
0.80 |
0.22 |
0.31 |
0.52 |
1.83 |
45 |
45 |
0.33 |
0.71 |
0.90 |
1.63 |
5.03 |
50 |
50 |
0.35 |
0.93 |
1.36 |
2.76 |
6.84 |
80 |
80 |
0.50 |
1.84 |
1.86 |
3.32 |
8.63 |
100 |
100 |
0.44 |
3.62 |
3.47 |
4.16 |
8.84 |
Based on the numerical results, as summarized in the tables, our proposed algorithms show to be more efficient and promise to be encouraging. It can be observed that the average number of iterations needed for Algorithm 2 mostly turns to be less than the ones obtained by the other algorithms, while most often the average CPU time of Algorithm 1 is slightly better than the one obtained by Algorithm 2.
6. Concluding Remarks
We modified the Ai-Zhang primal-dual path-following interior-point method to gain two classes of Mehrotra-type predictor-corrector algorithms for SCO. The Euclidean Jordan algebra was a basic tool in our analysis. We showed that the use of the corrector steps did not cause any loss in the worst-case complexity of the algorithms. Based on the NT direction, our proposed algorithms had an
iteration complexity bound, as the best complexity results available for SCO so far. Numerical experiments showed encouraging evidence for practical efficiency of the proposed algorithms.
Acknowledgements
The first author would like to thank Shahrekord University for financial support. The research of the first author was also partially supported by the Center of Excellence for Mathematics, University of Shahrekord, Shahrekord, Iran. The second author thanks Research Council of Sharif University of Technology for financial support.
Funding
The research of the first author was in part supported by a grant from Shahrekord University (No. 404GRD1M2025).