Mehrotra-Type Predictor-Corrector Algorithms for Symmetric Cone Programming in a Wide Neighborhood of the Central Path

Marzieh Sayadi Shahraki; Nezam Mahdavi-Amiri

doi:10.4236/ajor.2026.163006

American Journal of Operations Research > Vol.16 No.3, May 2026

Mehrotra-Type Predictor-Corrector Algorithms for Symmetric Cone Programming in a Wide Neighborhood of the Central Path

Marzieh Sayadi Shahraki¹, Nezam Mahdavi-Amiri^2*
¹Department of Applied Mathematics, Faculty of Mathematical Sciences, Shahrekord University, Shahrekord, Iran.
²Department of Mathematical Sciences, Sharif University of Technology, Tehran, Iran.
DOI: 10.4236/ajor.2026.163006 PDF HTML XML 5 Downloads 37 Views

Abstract

Two Mehrotra-type predictor-corrector interior point algorithms are proposed for solving symmetric cone optimization (SCO) problems, using the Euclidean Jordan algebra. The algorithms produce sequences of iterates in the wide neighborhood of the central path. We establish $O (\sqrt{r} \log ε^{? 1})$ iteration complexity bound for the Nesterov-Todd (NT) scaling direction. To our knowledge, this is the best complexity result obtained so far for interior-point methods over wide neighborhood. We demonstrate the computational efficiency of the proposed algorithms by numerical test results.

Keywords

Interior-Point Algorithm, Wide Neighborhood, Mehrotra-Type Algorithm, Symmetric Cone Programming, Euclidean Jordan Algebra

Share and Cite:

Sayadi Shahraki, M. and Mahdavi-Amiri, N. (2026) Mehrotra-Type Predictor-Corrector Algorithms for Symmetric Cone Programming in a Wide Neighborhood of the Central Path. American Journal of Operations Research, 16, 119-139. doi: 10.4236/ajor.2026.163006.

1. Introduction

Symmetric cone optimization (SCO) problem is a convex optimization problem, minimization of a linear function over the intersection of an affine subspace and a symmetric cone. The SCO problem is considered to be an important class of convex optimization problems, because it includes linear optimization (LO), second-order cone optimization (SOCO), and semidefinite optimization (SDO) as special cases. In recent years, these problems have attracted the attention of many scholars in optimization.

Among various approaches for solving SCO problem, interior-point methods (IPMs) are the most popular algorithms and have been widely used to obtain the best iteration complexity results. The basic idea for solving SCO problems using IPMs was first proposed by Nesterov and Nemirovskii [1]. Faybusovich [2] extended a primal-dual IPM for SDO to SCO using the Euclidean Jordan algebra. Consequently, several efficient IPMs for SCO have been developed; see, e.g., [3]-[10].

Predictor-corrector methods, the most studied variants of interior-point methods, are being used as Mehrotra’s predictor-corrector algorithm [11] in a number of optimization packages. In these methods, by solving linear systems with the same coefficient matrix, three directions are produced: predictor, corrector and centering. The moving direction is determined by a combination of these directions. Zhang and Zhang [12] first established convergence theory and complexity bounds for two Mehrotra-type second-order algorithms. Later, Zhang and Zhang [13] extended the idea of [12] to SCO and showed the complexity bound of $O (r^{3 / 2} \log ϵ^{- 1})$ for the Nesterov-Todd search direction. Many primary IPMs are based on the small neighborhood or the negative infinity large neighborhood. Among all existing path-following interior-point algorithms, the theoretical iteration complexity of wide neighborhood algorithms is the worst, but in practice, these algorithms perform better than small neighborhood algorithms. Recently, Ai and Zhang [14] considered an IPM for linear complementarity problem (LCP), which works in a wide neighborhood and has the same complexity as the best known small neighborhood IPMs. Applying an interesting idea, they decomposed the classical Newton search direction into the nonnegative and nonpositive parts corresponding to the nonnegative and the nonpositive parts of the right-hand side. They showed that if these two directions are equipped with different and appropriate step sizes, then the method enjoys the low iteration bound of $O (\sqrt{n} \log ϵ^{- 1})$ . Liu et al. [15] modified the Ai-Zhang’s primal-dual path-following interior-point method of [14] to gain a class of Mehrotra-type predictor-corrector algorithm for monotone LCPs. Here, we are to modify the Ai-Zhang’s primal-dual interior point method to present two Mehrotra-type predictor-corrector algorithms. First, we present a Mehrotra-type predictor-corrector algorithm generalizing the approach of [15] for LCP to SCO. Then, we propose the second Mehrotra-type predictor-corrector algorithm for SCO, by adding two corrector steps to the Ai-Zhang path-following algorithm [14], to improve the performance. Finally, we show that the use of corrector steps does not impair the worst-case complexity of the algorithm.

2. Preliminaries

Here, we recall briefly the basic concept and useful results of Euclidean Jordan algebra. We refer the reader to [16] for details.

Definition 1. Let $J$ be an $n$ -dimensional vector space over the field of real numbers with a multiplication “ $\circ$ ” where the map $\circ : (x, y) \mapsto x \circ y$ is bilinear from $J \times J$ to $J$ . The triple $(J, \circ, 〈 \cdot, \cdot 〉)$ is a Euclidean Jordan algebra if for all $x, y, z \in J$ , we have:

1) $x \circ y = y \circ x$ ;

2) $x \circ (x^{2} \circ y) = x^{2} \circ (x \circ y)$ , where $x^{2} : = x \circ x$ ;

3) $〈 x \circ y, z 〉 = 〈 y, x \circ z 〉$ .

We assume that there exists an identity element $e \in J$ such that $e \circ x = x \circ e = x$ , for all $x \in J$ . The degree of $x \in J$ , denoted by $deg (x)$ , is the smallest integer $k$ such that ${e, x, \dots, x^{k}}$ is linearly dependent. The rank of $J$ , denoted by $rank (J)$ , is defined as the maximum $deg (x)$ over all $x \in J$ .

An element c is idempotent if $c \neq 0$ and $c^{2} = c$ . An idempotent $c$ is primitive if it is nonzero and can not be expressed by sum of two other nonzero idempotents. Two idempotents $c_{i}$ and $c_{j}$ are said to be pairwise orthogonal if $c_{i} \circ c_{j} = 0$ . A set of orthogonal primitive idempotents ${c_{1}, c_{2}, \dots, c_{k}}$ is called a Jordan frame if $c_{i} \circ c_{j} = 0$ , $i \neq j$ , and $c_{1} + \dots + c_{k} = e$ . For $x \in J$ , let $k$ be the smallest positive integer such that ${e, x, \dots, x^{k}}$ is linearly dependent; $k$ is called the degree of $x$ and is denoted by $deg (x)$ . The rank of $J$ , denoted by $rank (J)$ , is defined by $r : = max {deg (x) : x \in J}$ .

A cone is symmetric if and only if it is the cone of squares of a Euclidean Jordan algebra, which is denoted by $K : = {x^{2} : x \in J}$ . For a given $x \in J$ , the Lyapunov transformation $L : J \to J$ is defined by $L (x) y : = x \circ y$ , for every $y \in J$ . Furthermore, we define the quadratic representation of $x$ in $J$ by $Q (x) : = 2 L {(x)}^{2} - L (x^{2})$ , where $L {(x)}^{2} = L (x) L (x)$ .

The spectral decomposition theorem (Theorem III.1.2 of [16]) of a Euclidean Jordan algebra $J$ states that for any $x \in J$ , there exists a Jordan frame ${c_{1}, c_{2}, \dots, c_{r}}$ and real numbers ${λ_{1}, \dots, λ_{r}}$ (the eigenvalues of $x$ ) such that $x = λ_{1} c_{1} + \dots + λ_{r} c_{r}$ . Some generated functions by the eigenvalues of $x$ are: the Frobenius norm on $J$ is defined by $‖ x ‖ : = \sqrt{〈 x, x 〉} = {(\sum_{i = 1}^{r} λ_{i}^{2})}^{1 / 2}$ , and $T r (x) = \sum_{i = 1}^{r} λ_{i}$ . Moreover, since $T r (x \circ y)$ is a symmetric positive definite quadratic form, the inner product can be defined as $〈 x, y 〉 : = T r (x \circ y)$ . The inverse is $x^{- 1} : = \sum_{i = 1}^{r} λ_{i}^{- 1} c_{i}$ , whenever all $λ_{i} \neq 0$ . Accordingly, $x^{+} = \sum_{i = 1}^{r} λ_{i}^{+} c_{i}$ , where $λ_{i}^{+} = max {λ_{i}, 0}$ , for $i = 1, 2, \dots, r$ . We define $x^{-} = x - x^{+}$ . We call $x \in J$ positive semidefinite (positive definite), denoted by $x \underline{≻} 0$ $(x ≻ 0)$ , if all eigenvalues of $J$ are nonnegative (positive). Moreover, $x \underline{≻} 0$ $(x ≻ 0)$ if and only if $x \in K$ $(x \in int K)$ .

3. The SCO Problem and Neighborhood of Central Path

Let $J$ be a Euclidean Jordan algebra of dimension $n$ with rank $r$ , and $K$ be the cone of squares corresponding to $J$ . Consider the following primal and dual problems in the standard forms:

(P) $min 〈 c, x 〉 : A x = b, x \in K,$

and

(D) $max 〈 b, y 〉 : A^{*} y + s = c, s \in K, y \in ℝ^{m},$

where $c \in J$ , $b \in ℝ^{m}$ , $A$ is a linear operator that maps $J$ into $ℝ^{m}$ and $A^{*}$ is its adjoint operator such that $〈 x, A^{*} y 〉 = 〈 A x, y 〉$ , for all $x \in J$ .

The primal-dual feasible set is defined as

$ℱ = {(x, y, s) \in K \times ℝ^{m} \times K : A x = b, A^{*} y + s = c},$

and we assume that the interior of the primal-dual feasible set,

$ℱ^{0} = {(x, y, s) \in int K \times ℝ^{m} \times int K : A x = b, A^{*} y + s = c},$

is nonempty; that is, the primal-dual pair of the SCO problems satisfies the interior-point condition (IPC). Under the IPC, finding an optimal solution of (P) and (D) is equivalent to solving the following system [17]:

$\begin{array}{l} A x = b, x \in K, \\ A^{*} y + s = c, s \in K, y \in ℝ^{m}, \\ x \circ s = 0. \end{array}$ (1)

The basic idea of a path-following interior-point algorithm is to replace the third equation in (1), the so-called complementarity condition, by the parameterized equation $x \circ s = τ μ e$ , for $μ = 〈 x, s 〉 / r$ , and $τ \in (0, 1)$ . Thus, we consider the following system:

$\begin{array}{l} A x = b, x \in K, \\ A^{*} y + s = c, s \in K, y \in ℝ^{m}, \\ x \circ s = τ μ e . \end{array}$ (2)

Due to the IPC, a unique solution $(x (μ), y (μ), s (μ))$ of the perturbed system (2) exists for each $μ > 0$ [17]. This solution is called the $μ$ -center for the SCO problem. The set of $μ$ -centers provides a path, which is called the central path of the SCO problem. The $μ$ -center approximates an optimal solution of the given problem pair as $μ$ goes to zero. Applying the Newton method for the perturbed system of (2), we get the following equations:

$\begin{array}{l} A Δ x = 0, \\ A^{*} Δ y + Δ s = 0, \\ x \circ Δ s + s \circ Δ x = τ μ e - x \circ s . \end{array}$ (3)

It has been shown that the reason for the system (3) not having a unique solution is the fact that $x$ and $s$ do not operator commute, in general; that is, $L (x) L (s) \neq L (s) L (x)$ .

To overcome this defect, we use the scaling scheme based on the following fact (Lemma 28 of [6]). Let $u$ be invertible. Then, $x \circ s = μ e$ if and only if $Q (u) x \circ Q (u^{- 1}) s = μ e$ .

In our work here, we choose

$u = {[Q (x^{\frac{1}{2}}) {(Q (x^{\frac{1}{2}}) s)}^{- \frac{1}{2}}]}^{- 1 / 2} = {[Q (s^{- \frac{1}{2}}) {(Q (s^{\frac{1}{2}}) x)}^{\frac{1}{2}}]}^{- 1 / 2},$

which gives the Nesterov-Todd (NT) direction. For the NT direction [18], we have $Q (u) x = Q (u^{- 1}) s$ . The system (3) can be rewritten as follows:

$\begin{array}{l} \tilde{A} Δ \tilde{x} = 0, \\ {\tilde{A}}^{*} Δ y + Δ \tilde{s} = 0, \\ \tilde{s} \circ Δ \tilde{x} + \tilde{x} \circ Δ \tilde{s} = τ μ e - \tilde{x} \circ \tilde{s}, \end{array}$ (4)

where $\tilde{A} = A Q (u^{- 1})$ , $v : = \tilde{x} = Q (u) x = \tilde{s} = Q (u^{- 1}) s$ , $Δ \tilde{x} = Q (u) Δ x$ and $Δ \tilde{s} = Q (u^{- 1}) Δ s$ .

Most interior point algorithms work in the classical negative infinity large neighborhood, defined by

$N_{\infty}^{-} (1 - γ) = {(x, y, s) \in int K \times ℝ^{m} \times int K : λ_{\min} (w) \geq γ μ},$

where $w = Q (x^{1 / 2}) s$ and $γ \in (0, 1)$ . Our proposed algorithms start from a feasible point in a wide neighborhood of the central path due to [14] [19], as follows:

$N (τ, β) = {(x, y, s) \in int K \times ℝ^{m} \times int K : ‖ {(τ μ e - w)}^{+} ‖ \leq β τ μ},$

where $β \in (0, 1)$ .

This neighborhood is even larger than the $N_{\infty}^{-} (1 - γ)$ neighborhood, since it can be verified that

$N_{\infty}^{-} (1 - τ) \subseteq N (τ, β) \subseteq N_{\infty}^{-} (1 - (1 - β) τ) .$ (5)

4. Mehrotra Predictor-Corrector Algorithms

Here, we describe our algorithms in detail. In accordance with the Ai-Zhang’s original idea [14], we decompose the Newton direction into two separate parts corresponding to the positive and negative parts of the right-hand side of the third equality of the system (4); i.e., $τ μ e - \tilde{x} \circ \tilde{s}$ . Therefore, in our algorithms, we compute the predictor directions $(Δ {\tilde{x}}_{-}, Δ y_{-}, Δ {\tilde{s}}_{-})$ and $(Δ {\tilde{x}}_{+}, Δ y_{+}, Δ {\tilde{s}}_{+})$ respectively by solving the following two systems:

$\begin{array}{l} \tilde{A} Δ {\tilde{x}}_{-} = 0, \\ {\tilde{A}}^{*} Δ y_{-} + Δ {\tilde{s}}_{-} = 0, \\ \tilde{s} \circ Δ {\tilde{x}}_{-} + \tilde{x} \circ Δ {\tilde{s}}_{-} = {(τ μ e - \tilde{x} \circ \tilde{s})}^{-}, \end{array}$ (6)

and

$\begin{array}{l} \tilde{A} Δ {\tilde{x}}_{+} = 0, \\ {\tilde{A}}^{*} Δ y_{+} + Δ {\tilde{s}}_{+} = 0, \\ \tilde{s} \circ Δ {\tilde{x}}_{+} + \tilde{x} \circ Δ {\tilde{s}}_{+} = {(τ μ e - \tilde{x} \circ \tilde{s})}^{+} . \end{array}$ (7)

4.1. First Algorithm and Its Convergence Analysis

We present a Mehrotra predictor-corrector path-following algorithm for SCO problems based on Liu et al.’s idea for LCP [15]. For this, we compute the maximum step size $θ \in (0, 1]$ that ensures

$(\tilde{x} + θ Δ {\tilde{x}}_{-}) \circ (\tilde{s} + θ Δ {\tilde{s}}_{-}) \in K .$ (8)

Using the predictor step obtained by solving (6), we compute the corrector direction $(Δ {\tilde{x}}_{-}^{c}, Δ y_{-}, Δ {\tilde{s}}_{-}^{c})$ by solving the following system:

$\begin{array}{l} \tilde{A} Δ {\tilde{x}}_{-}^{c} = 0, \\ {\tilde{A}}^{*} Δ y_{-}^{c} + Δ {\tilde{s}}_{-}^{c} = 0, \\ \tilde{s} \circ Δ {\tilde{x}}_{-}^{c} + \tilde{x} \circ Δ {\tilde{s}}_{-}^{c} = - θ Δ {\tilde{x}}_{-} \circ Δ {\tilde{s}}_{-} . \end{array}$ (9)

Let $α = (α_{1}, α_{2}) \in ℝ_{+}^{2}$ give the step lengths. Then, we define a combination of directions as follows:

$\begin{array}{l} (Δ \tilde{x} (α), Δ y (α), Δ \tilde{s} (α)) \\ : = α_{1} ((Δ {\tilde{x}}_{-}, Δ y_{-}, Δ {\tilde{s}}_{-}) + (Δ {\tilde{x}}_{-}^{c}, Δ y_{-}^{c}, Δ {\tilde{s}}_{-}^{c})) + α_{2} (Δ {\tilde{x}}_{+}, Δ y_{+}, Δ {\tilde{s}}_{+}) . \end{array}$

Finally, the new iterate is:

$(\tilde{x} (α), y (α), \tilde{s} (α)) = (\tilde{x}, y, \tilde{s}) + (Δ \tilde{x} (α), Δ y (α), Δ \tilde{s} (α)) .$ (10)

Since $T r (Δ \tilde{x} (α) \circ Δ \tilde{s} (α)) = 0$ , the duality gap corresponding to the new iterate can be written as

$μ (α) = μ + \frac{α_{1}}{r} T r ({(τ μ e - \tilde{x} \circ \tilde{s})}^{-}) + \frac{α_{2}}{r} T r ({(τ μ e - \tilde{x} \circ \tilde{s})}^{+}) .$ (11)

We now describe the Mehrotra predictor-corrector method for the SCO problem as Algorithm 1.

Algorithm 1. The Mehrotra predictor-corrector method for solving the SCO problem (first method).

Accuracy parameter $ε > 0$ ;

neighborhood parameters $0 < β \leq 1 / 3$ and $0 < τ \leq 1 / 4$ ;

the initial point $(x^{0}, y^{0}, s^{0}) \in N (τ, β)$ with $μ^{0} = 〈 x^{0}, s^{0} 〉 / r$ .

Step 0: Set $k : = 0$ .

Step 1: If $〈 x^{k}, s^{k} 〉 \leq ε$ then stop.

Step 2: Choose a Nesterov-Todd scaling element $u$ .

Step 3: Compute the directions $(Δ {\tilde{x}}_{-}, Δ y_{-}, Δ {\tilde{s}}_{-})$ , by solving (6) and $(Δ {\tilde{x}}_{+}, Δ y_{+}, Δ {\tilde{s}}_{+})$ by solving (7).

Step 4: Find the maximum feasible step length $θ$ by (8).

Step 5: Compute the direction $(Δ {\tilde{x}}_{-}^{c}, Δ y_{-}^{c}, Δ {\tilde{s}}_{-}^{c})$ by solving (9).

Step 6: Find the step lengths $α = (α_{1}, α_{2})$ giving a sufficient reduction of the duality gap and assuring $(\tilde{x} (α), y (α), \tilde{s} (α)) \in N (τ, β)$ .

Step 7: Let $α^{k} : = α$ , $({\tilde{x}}^{k + 1}, y^{k + 1}, {\tilde{s}}^{k + 1}) : = (\tilde{x} (α^{k}), y (α^{k}), \tilde{s} (α^{k}))$ and $μ^{k + 1} : = 〈 x^{k + 1}, s^{k + 1} 〉 / r$ .

Step 8: Set $k : = k + 1$ and go to Step 1.

Next, we present an analysis of Algorithm 1. Some of the obtained results will also be useful for the analysis of the second algorithm.

Lemma 1 (Lemma 2.13 of [20]). Let $x, s \in J$ with $〈 x, s 〉 = 0$ . Then, we have:

$‖ x \circ s ‖ \leq 2^{- 3 / 2} {‖ x + s ‖}^{2} .$

Lemma 2 (Lemma 5.12 of [21]). If $x, y \in J$ , then we have:

$‖ {(x + y)}^{+} ‖ \leq ‖ x^{+} + y^{+} ‖ \leq ‖ x^{+} ‖ + ‖ y^{+} ‖ .$

Lemma 3 (Lemma 5.1 of [7]). Let $(\tilde{x}, y, \tilde{s}) \in N (τ, β)$ . Then,

$‖ {(τ μ e - \tilde{x} \circ \tilde{s})}^{+} ‖ \leq β τ μ .$

Lemma 4 (Lemma 5.2 of [7]). Let $(\tilde{x}, y, \tilde{s}) \in N (τ, β)$ . Then,

$T r ({(τ μ e - \tilde{x} \circ \tilde{s})}^{+}) \leq \sqrt{r} β τ μ .$

Lemma 5. Let $(\tilde{x}, y, \tilde{s}) \in ℱ^{0}$ , and $(Δ {\tilde{x}}_{-}, Δ y_{-}, Δ {\tilde{s}}_{-})$ be the solution of (6). Then, we have:

1) $\frac{1}{4} e - {(L {(v)}^{- 1})}^{2} (θ^{2} Δ {\tilde{x}}_{-} \circ Δ {\tilde{s}}_{-}) \in K$ ,

2) $(1 - θ) e + {(L {(v)}^{- 1})}^{2} (θ^{2} Δ {\tilde{x}}_{-} \circ Δ {\tilde{s}}_{-}) \in K$ .

Proof. Let $\tilde{x} \circ \tilde{s} = \sum_{i = 1}^{r} λ_{i} c_{i}$ for some Jordan frame ${c_{1}, \dots, c_{r}}$ and let the spectral eigenvalues satisfy

$τ μ - λ_{1} \leq \dots \leq τ μ - λ_{k} \leq 0 < τ μ - λ_{k + 1} \leq \dots \leq τ μ - λ_{r} .$ (12)

Thus, since $\tilde{x}$ and $\tilde{s}$ are operator commute, we have:

$\begin{matrix} {(L {(v)}^{- 1} {(τ μ e - \tilde{x} \circ \tilde{s})}^{-})}^{2} = {(L {(v)}^{- 1} \sum_{i = 1}^{k} (τ μ - λ_{i}) c_{i})}^{2} \\ = {(\sum_{i = 1}^{k} (τ μ - λ_{i}) L {(v)}^{- 1} c_{i})}^{2} \\ = {(\sum_{i = 1}^{k} (τ μ - λ_{i}) λ_{i}^{- 1 / 2} c_{i})}^{2} \\ = \sum_{i = 1}^{k} \frac{{(τ μ - λ_{i})}^{2}}{λ_{i}} c_{i} = \sum_{i = 1}^{k} (λ_{i} - τ μ) \frac{(λ_{i} - τ μ)}{λ_{i}} c_{i} \\ ≼ \sum_{i = 1}^{k} (λ_{i} - τ μ) c_{i} ≼ \sum_{i = 1}^{r} λ_{i} c_{i} = v^{2} . \end{matrix}$ (13)

The third equation of (6) can be written as

$Δ {\tilde{x}}_{-} + Δ {\tilde{s}}_{-} = L {(v)}^{- 1} ({(τ μ e - \tilde{x} \circ \tilde{s})}^{-}) .$

Then, using the fact that $Δ {\tilde{x}}_{-} \circ Δ {\tilde{s}}_{-} = \frac{1}{4} ({(Δ {\tilde{x}}_{-} + Δ {\tilde{s}}_{-})}^{2} - {(Δ {\tilde{x}}_{-} - Δ {\tilde{s}}_{-})}^{2})$ , we have:

$\begin{matrix} Δ {\tilde{x}}_{-} \circ Δ {\tilde{s}}_{-} = \frac{1}{4} [{(L {(v)}^{- 1} ({(τ μ e - \tilde{x} \circ \tilde{s})}^{-}))}^{2} - {(Δ {\tilde{x}}_{-} - Δ {\tilde{s}}_{-})}^{2}] \\ ≼ \frac{1}{4} {[L {(v)}^{- 1} ({(τ μ e - \tilde{x} \circ \tilde{s})}^{-})]}^{2} . \end{matrix}$ (14)

By (13) and (14), we have:

$Δ {\tilde{x}}_{-} \circ Δ {\tilde{s}}_{-} ≼ \frac{1}{4} v^{2} .$ (15)

This relationship along with ${(L {(v)}^{- 1})}^{2} v^{2} = e$ gives the first part of the lemma.

From (8), we have:

$\tilde{x} \circ \tilde{s} + θ {(τ μ e - \tilde{x} \circ \tilde{s})}^{-} + θ^{2} Δ {\tilde{x}}_{-} \circ Δ {\tilde{s}}_{-} \in K .$ (16)

On the other hand, it is trivial to verify

$\tilde{x} \circ \tilde{s} + θ {(τ μ e - \tilde{x} \circ \tilde{s})}^{-} ≽ \tilde{x} \circ \tilde{s} + θ {(- \tilde{x} \circ \tilde{s})}^{-} = (1 - θ) v^{2} .$

This equality together with (16) implies:

$(1 - θ) e + {(L {(v)}^{- 1})}^{2} (θ^{2} Δ {\tilde{x}}_{-} \circ Δ {\tilde{s}}_{-}) \in K,$ (17)

which completes the proof.

Let $Δ {\tilde{x}}_{-} \circ Δ {\tilde{s}}_{-}$ have the spectral decomposition $Δ {\tilde{x}}_{-} \circ Δ {\tilde{s}}_{-} = \sum_{i = 1}^{r} σ_{i} d_{i}$ , where ${d_{1}, \dots, d_{r}}$ is a Jordan frame. Moreover, we may write ${(Δ {\tilde{x}}_{-} \circ Δ {\tilde{s}}_{-})}^{-} = \sum_{i = 1}^{t} σ_{i} d_{i}$ and ${(Δ {\tilde{x}}_{-} \circ Δ {\tilde{s}}_{-})}^{+} = \sum_{i = t + 1}^{r} σ_{i} d_{i}$ . Then, by (15), we have:

$\begin{matrix} T r {(Δ {\tilde{x}}_{-} \circ Δ {\tilde{s}}_{-})}^{+} = \sum_{i = t + 1}^{r} σ_{i} \\ = \sum_{i = t + 1}^{r} 〈 Δ {\tilde{x}}_{-} \circ Δ {\tilde{s}}_{-}, d_{i} 〉 \\ = \sum_{i = t + 1}^{r} (〈 \frac{1}{4} v^{2}, d_{i} 〉 - 〈 \frac{1}{4} v^{2} - Δ {\tilde{x}}_{-} \circ Δ {\tilde{s}}_{-}, d_{i} 〉) \\ \leq \frac{1}{4} \sum_{i = t + 1}^{r} 〈 v^{2}, d_{i} 〉 \leq \frac{1}{4} r μ . \end{matrix}$ (18)

Lemma 6. Let $(\tilde{x}, y, \tilde{s}) \in ℱ^{0}$ , and $(Δ {\tilde{x}}_{-}, Δ y_{-}, Δ {\tilde{s}}_{-})$ be the solution of (6). Then, we have:

${‖ L {(v)}^{- 1} (θ Δ {\tilde{x}}_{-} \circ Δ {\tilde{s}}_{-}) ‖}^{2} \leq \frac{1}{4} r μ .$

Proof. Using Lemma 5, $〈 Δ {\tilde{x}}_{-}, Δ {\tilde{s}}_{-} 〉 = 0$ , and (18), we have:

$\begin{array}{l} 〈 {(L {(v)}^{- 1})}^{2} (θ^{2} Δ {\tilde{x}}_{-} \circ Δ {\tilde{s}}_{-}), {(Δ {\tilde{x}}_{-} \circ Δ {\tilde{s}}_{-})}^{-} 〉 \\ = 〈 (1 - θ) e, - {(Δ {\tilde{x}}_{-} \circ Δ {\tilde{s}}_{-})}^{-} 〉 \\ - 〈 (1 - θ) e + {(L {(v)}^{- 1})}^{2} (θ^{2} Δ {\tilde{x}}_{-} \circ Δ {\tilde{s}}_{-}), - {(Δ {\tilde{x}}_{-} \circ Δ {\tilde{s}}_{-})}^{-} 〉 \\ \leq (1 - θ) 〈 e, - {(Δ {\tilde{x}}_{-} \circ Δ {\tilde{s}}_{-})}^{-} 〉 \leq (1 - θ) \frac{1}{4} r μ . \end{array}$ (19)

Moreover, by Lemma 5 and (18), we have:

$\begin{array}{l} θ^{2} 〈 {(L {(v)}^{- 1})}^{2} (Δ {\tilde{x}}_{-} \circ Δ {\tilde{s}}_{-}), {(Δ {\tilde{x}}_{-} \circ Δ {\tilde{s}}_{-})}^{+} 〉 \\ = θ^{2} 〈 \frac{1}{4} e, - {(Δ {\tilde{x}}_{-} \circ Δ {\tilde{s}}_{-})}^{+} 〉 \\ - θ^{2} 〈 \frac{1}{4} e + {(L {(v)}^{- 1})}^{2} (Δ {\tilde{x}}_{-} \circ Δ {\tilde{s}}_{-}), - {(Δ {\tilde{x}}_{-} \circ Δ {\tilde{s}}_{-})}^{+} 〉 \\ \leq θ^{2} 〈 \frac{1}{4} e, - {(Δ {\tilde{x}}_{-} \circ Δ {\tilde{s}}_{-})}^{+} 〉 \leq \frac{1}{4} θ^{2} (\frac{1}{4} r μ) . \end{array}$ (20)

Now, using (19) and (20), we have:

$\begin{array}{l} {‖ L {(v)}^{- 1} (θ Δ {\tilde{x}}_{-} \circ Δ {\tilde{s}}_{-}) ‖}^{2} \\ = 〈 L {(v)}^{- 1} (θ Δ {\tilde{x}}_{-} \circ Δ {\tilde{s}}_{-}), L {(v)}^{- 1} (θ Δ {\tilde{x}}_{-} \circ Δ {\tilde{s}}_{-}) 〉 \\ = 〈 {(L {(v)}^{- 1})}^{2} (θ^{2} Δ {\tilde{x}}_{-} \circ Δ {\tilde{s}}_{-}), Δ {\tilde{x}}_{-} \circ Δ {\tilde{s}}_{-} 〉 \\ = 〈 {(L {(v)}^{- 1})}^{2} (θ^{2} Δ {\tilde{x}}_{-} \circ Δ {\tilde{s}}_{-}), {(Δ {\tilde{x}}_{-} \circ Δ {\tilde{s}}_{-})}^{-} 〉 \\ + θ^{2} 〈 {(L {(v)}^{- 1})}^{2} (Δ {\tilde{x}}_{-} \circ Δ {\tilde{s}}_{-}), {(Δ {\tilde{x}}_{-} \circ Δ {\tilde{s}}_{-})}^{+} 〉 \\ \leq (1 - θ + \frac{1}{4} θ^{2}) (\frac{1}{4} r μ) \leq \frac{1}{4} r μ, \end{array}$

which completes the proof.

Lemma 7. Let $(\tilde{x}, y, \tilde{s}) \in N (τ, β)$ and $β \leq 1 / 3$ . Then, we have:

1) ${‖ L {(v)}^{- 1} {(τ μ e - \tilde{x} \circ \tilde{s})}^{-} ‖}^{2} \leq r μ$ ,

2) ${‖ L {(v)}^{- 1} {(τ μ e - \tilde{x} \circ \tilde{s})}^{+} ‖}^{2} \leq \frac{1}{2} β τ μ$ .

Proof. From (13), we obtain:

${‖ L {(v)}^{- 1} {(τ μ e - \tilde{x} \circ \tilde{s})}^{-} ‖}^{2} \leq r μ,$

which implies the first part of the lemma.

Similarly, making use of the proof of (13) and since $(\tilde{x}, y, \tilde{s}) \in N (τ, β)$ by Lemma 3, we have:

$\begin{matrix} {‖ L {(v)}^{- 1} {(τ μ e - \tilde{x} \circ \tilde{s})}^{+} ‖}^{2} = {‖ \sum_{i = k + 1}^{r} (τ μ - λ_{i}) L {(v)}^{- 1} c_{i} ‖}^{2} \\ = {‖ \sum_{i = k + 1}^{r} (τ μ - λ_{i}) λ_{i}^{- 1 / 2} c_{i} ‖}^{2} = \sum_{i = k + 1}^{r} \frac{{(τ μ - λ_{i})}^{2}}{λ_{i}} \\ \leq \frac{1}{(1 - β) τ μ} \sum_{i = k + 1}^{r} {(τ μ - λ_{i})}^{2} \\ = \frac{1}{(1 - β) τ μ} {‖ {(τ μ e - \tilde{x} \circ \tilde{s})}^{+} ‖}^{2} \\ \leq \frac{{(β τ μ)}^{2}}{(1 - β) τ μ} \leq \frac{1}{2} β τ μ, \end{matrix}$

where the last inequality follows from $β \leq 1 / 3$ . This completes the proof for the second part.

Lemma 8. Let $(\tilde{x}, y, \tilde{s}) \in N (τ, β)$ and $β \leq 1 / 3$ . If $α_{1} \leq \frac{3}{5} α_{2} \sqrt{\frac{β τ}{r}}$ , then we have:

$‖ Δ \tilde{x} (α) \circ Δ \tilde{s} (α) ‖ \leq \frac{4}{5} α_{2} β τ μ .$

Proof. Since $T r (Δ \tilde{x} (α) \circ Δ \tilde{s} (α)) = 0$ , we have:

$\begin{array}{l} ‖ Δ \tilde{x} (α) \circ Δ \tilde{s} (α) ‖ \leq \frac{1}{2} ({‖ Δ \tilde{x} (α) ‖}^{2} + {‖ Δ \tilde{s} (α) ‖}^{2}) = \frac{1}{2} ({‖ Δ \tilde{x} (α) + Δ \tilde{s} (α) ‖}^{2}) \\ = \frac{1}{2} [{‖ L {(v)}^{- 1} (α_{1} {(τ μ e - \tilde{x} \circ \tilde{s})}^{-} + α_{2} {(τ μ e - \tilde{x} \circ \tilde{s})}^{+} - α_{1} θ Δ {\tilde{x}}_{-} \circ Δ {\tilde{s}}_{-}) ‖}^{2}] \\ \leq \frac{1}{2} [‖ L {(v)}^{- 1} (α_{1} {(τ μ e - \tilde{x} \circ \tilde{s})}^{-} + α_{2} {(τ μ e - \tilde{x} \circ \tilde{s})}^{+}) ‖ \\ + {α_{1} ‖ L {(v)}^{- 1} (θ Δ {\tilde{x}}_{-} \circ Δ {\tilde{s}}_{-}) ‖]}^{2} \\ \leq \frac{1}{2} {(\sqrt{α_{1}^{2} r μ + \frac{α_{2}^{2}}{2} β τ μ} + \frac{α_{1}}{2} \sqrt{r μ})}^{2} \\ \leq \frac{1}{2} {(\sqrt{\frac{9 α_{2}^{2}}{25} β τ μ + \frac{α_{2}^{2}}{2} β τ μ} + \frac{3}{10} α_{2} \sqrt{β τ μ})}^{2} \leq \frac{4}{5} α_{2} β τ μ, \end{array}$

where the second inequality follows from Lemmas 6 and 7, the properties of $L (v)$ and the third inequality uses the fact that $α_{1} \leq \frac{3}{5} α_{2} \sqrt{\frac{β τ}{r}}$ . The proof is now complete.

For simplicity of notations, we define:

$δ : = - \frac{T r ({(τ μ e - \tilde{x} \circ \tilde{s})}^{+})}{T r ({(τ μ e - \tilde{x} \circ \tilde{s})}^{-})},$ (21)

and

$ψ (α) : = \tilde{x} \circ \tilde{s} + α_{1} {(τ μ e - \tilde{x} \circ \tilde{s})}^{-} + α_{2} {(τ μ e - \tilde{x} \circ \tilde{s})}^{+} - α_{1} θ (Δ {\tilde{x}}_{-} \circ Δ {\tilde{s}}_{-}) .$ (22)

Using the Cauchy-Schwarz inequality and Lemma 3, and the facts that $β \leq 1 / 3$ and $τ \leq 1 / 4$ , we have:

$\begin{matrix} δ = - \frac{T r ({(τ μ e - \tilde{x} \circ \tilde{s})}^{+})}{T r ({(τ μ e - \tilde{x} \circ \tilde{s})}^{-})} \leq \frac{β τ}{(1 - τ) \sqrt{r}} \\ = \frac{\sqrt{β τ}}{1 - τ} \sqrt{\frac{β τ}{r}} \leq \frac{\sqrt{12}}{9} \sqrt{\frac{β τ}{r}} . \end{matrix}$ (23)

Next, using the definition of $δ$ given by (21), we further have the following lemma.

Lemma 9. If we choose $α_{1} \geq δ$ , then we have $μ (α) \leq μ$ .

Proof. From (11) and (21), we obtain:

$\begin{matrix} μ (α) = μ + \frac{α_{1}}{r} T r ({(τ μ e - \tilde{x} \circ \tilde{s})}^{-}) + \frac{α_{2}}{r} T r ({(τ μ e - \tilde{x} \circ \tilde{s})}^{+}) \\ \leq μ + \frac{δ}{r} T r ({(τ μ e - \tilde{x} \circ \tilde{s})}^{-}) + \frac{1}{r} T r ({(τ μ e - \tilde{x} \circ \tilde{s})}^{+}) \leq μ, \end{matrix}$

which completes the proof.

Lemma 10. Let $ψ (α)$ be defined as (22). Then, the eigenvalues of $ψ (α)$ are nonnegative.

Proof. By using (14) and following the proof of (13), we obtain:

$\begin{matrix} ψ (α) = \tilde{x} \circ \tilde{s} + α_{1} {(τ μ e - \tilde{x} \circ \tilde{s})}^{-} + α_{2} {(τ μ e - \tilde{x} \circ \tilde{s})}^{+} - α_{1} θ (Δ {\tilde{x}}_{-} \circ Δ {\tilde{s}}_{-}) \\ = \sum_{i = 1}^{r} λ_{i} c_{i} + α_{1} \sum_{i = 1}^{k} (τ μ - λ_{i}) c_{i} + \sum_{i = k + 1}^{r} (τ μ - λ_{i}) c_{i} - α_{1} θ (Δ {\tilde{x}}_{-} \circ Δ {\tilde{s}}_{-}) \\ = \sum_{i = 1}^{k} ((1 - α_{1}) λ_{i} + α_{1} τ μ) c_{i} + \sum_{i = k + 1}^{r} (λ_{i} + α_{2} (τ μ - λ_{i})) c_{i} \\ - α_{1} θ (Δ {\tilde{x}}_{-} \circ Δ {\tilde{s}}_{-}) \\ ≽ \sum_{i = 1}^{k} ((1 - α_{1}) λ_{i} + α_{1} τ μ) c_{i} + \sum_{i = k + 1}^{r} (λ_{i} + α_{2} (τ μ - λ_{i})) c_{i} \\ - \frac{α_{1} θ}{4} {(L {(v)}^{- 1} ({(τ μ e - \tilde{x} \circ \tilde{s})}^{-}))}^{2} \\ ≽ \sum_{i = 1}^{k} ((1 - α_{1}) λ_{i} + α_{1} τ μ) c_{i} + \sum_{i = k + 1}^{r} (λ_{i} + α_{2} (τ μ - λ_{i})) c_{i} \\ - \frac{α_{1} θ}{4} \sum_{i = 1}^{k} (τ μ - λ_{i}) c_{i} \\ = \sum_{i = 1}^{k} ((1 - α_{1} - \frac{α_{1} θ}{4}) λ_{i} + (α_{1} + \frac{α_{1} θ}{4}) τ μ) c_{i} \\ + \sum_{i = k + 1}^{r} (λ_{i} + α_{2} (τ μ - λ_{i})) c_{i} \\ ≽ \sum_{i = 1}^{k} τ μ c_{i} + \sum_{i = k + 1}^{r} (λ_{i} + α_{2} (τ μ - λ_{i})) c_{i}, \end{matrix}$ (24)

where the second inequality is due to the fact $λ_{i} - τ μ \geq 0$ , $i = 1, \dots, k$ , and the last inequality follows from $τ μ - λ_{i} \geq 0$ , $i = k + 1, \dots, r$ . Now, it is easy to see that the eigenvalues of $ψ (α)$ are nonnegative, and the proof is complete.

Lemma 11. Let $(\tilde{x}, y, \tilde{s}) \in N (τ, β)$ , $β \leq 1 / 3$ and $τ \leq 1 / 4$ . If $α_{1} \geq δ$ , then we have:

$‖ {(τ μ (α) e - ψ (α))}^{+} ‖ \leq (1 - α_{2}) (β τ μ (α)) .$

Proof. From Lemma 9, we have $\frac{μ (α)}{μ} \leq 1$ . This, together with (24) and Lemma 10, gives

$\begin{matrix} τ μ (α) e - ψ (α) = \frac{μ (α)}{μ} \sum_{i = 1}^{r} [τ μ - \frac{μ}{μ (α)} λ_{i} (ψ (α))] c_{i} \\ ≼ \frac{μ (α)}{μ} \sum_{i = 1}^{r} [τ μ - λ_{i} (ψ (α))] c_{i} \\ ≼ \frac{μ (α)}{μ} [\sum_{i = 1}^{k} (τ μ - τ μ) c_{i} + \sum_{i = k + 1}^{r} (1 - α_{2}) (τ μ - λ_{i}) c_{i}] \\ = \frac{μ (α)}{μ} \sum_{i = k + 1}^{r} (1 - α_{2}) (τ μ - λ_{i}) c_{i} . \end{matrix}$ (25)

By using (25) and Lemma 3, we have:

$\begin{matrix} {‖ {(τ μ (α) e - ψ (α))}^{+} ‖}^{2} \leq {(1 - α_{2})}^{2} {(\frac{μ (α)}{μ})}^{2} {‖ {(τ μ e - \tilde{x} \circ \tilde{s})}^{+} ‖}^{2} \\ \leq {(1 - α_{2})}^{2} {(β τ μ (α))}^{2}, \end{matrix}$

which completes the proof.

The next lemma shows that $(\tilde{x} (α), y (α), \tilde{s} (α)) \in N (τ, β)$ , for

$α_{1} \in [δ, \frac{3}{5} α_{2} \sqrt{\frac{β τ}{r}}] .$

Lemma 12. Let $(\tilde{x}, y, \tilde{s}) \in N (τ, β)$ , $β \leq 1 / 3$ and $τ \leq 1 / 4$ . If

$α_{1} \in [δ, \frac{3}{5} α_{2} \sqrt{\frac{β τ}{r}}],$

then we have $(\tilde{x} (α), y (α), \tilde{s} (α)) \in N (τ, β)$ .

Proof. It is easy to see

$T r ({(τ μ e - \tilde{x} \circ \tilde{s})}^{-}) = - (1 - τ) T r (\tilde{x} \circ \tilde{s}) - T r ({(τ μ e - \tilde{x} \circ \tilde{s})}^{+}) .$ (26)

Using this fact and Lemma 4, we have:

$T r ({(τ μ e - \tilde{x} \circ \tilde{s})}^{-}) \geq - (1 - τ) T r (\tilde{x} \circ \tilde{s}) - \sqrt{r} β τ μ \geq - T r (\tilde{x} \circ \tilde{s}) .$

Now, together with (11), we have:

$μ (α) \geq (1 - α_{1}) μ \geq (1 - \frac{3}{5} α_{2} \sqrt{\frac{β τ}{r}}) μ \geq \frac{41}{50} μ .$ (27)

On the other hand, using Lemmas 8 and 11, we have:

$\begin{array}{l} ‖ {(τ μ (α) e - \tilde{x} (α) \circ \tilde{s} (α))}^{+} ‖ \\ = ‖ {(τ μ (α) e - ψ (α) - Δ \tilde{x} (α) \circ Δ \tilde{s} (α))}^{+} ‖ \\ \leq ‖ {(τ μ (α) e - ψ (α))}^{+} ‖ + ‖ {(- Δ \tilde{x} (α) \circ Δ \tilde{s} (α))}^{+} ‖ \\ \leq (1 - α_{2}) β τ μ (α) + \frac{4}{5} α_{2} β τ μ \\ \leq (1 - α_{2}) β τ μ (α) + \frac{4}{5} α_{2} β τ (\frac{50}{41} μ) \leq β τ μ (α), \end{array}$

where the third inequality follows from (27). The proof is now complete.

The following lemma gives an upper bound for the duality gap, which plays an important role in the analysis of Algorithm 1.

Lemma 13. Let $(\tilde{x}, y, \tilde{s}) \in N (τ, β)$ , $β \leq 1 / 3$ and $τ \leq 1 / 4$ . If $α_{1} = \frac{3}{5} α_{2} \sqrt{\frac{β τ}{r}}$ , then we have:

$μ (α) \leq (1 - \frac{2}{25} α_{2} \sqrt{\frac{β τ}{r}}) μ .$

Proof. From (26), it is easy to see $T r ({(τ μ e - \tilde{x} \circ \tilde{s})}^{-}) \leq - (1 - τ) T r (\tilde{x} \circ \tilde{s})$ . Together with (11), we obtain:

$\begin{matrix} \tilde{μ} (α) \leq μ - α_{1} (1 - τ) μ + \frac{α_{2}}{\sqrt{r}} β τ μ \\ \leq μ - \frac{3}{4} α_{1} μ + \frac{α_{2}}{\sqrt{r}} β τ μ \\ \leq μ - (\frac{3}{4} - \frac{5}{3} \sqrt{β τ}) \frac{3}{5} α_{2} \sqrt{\frac{β τ}{r}} μ \\ \leq (1 - \frac{2}{25} α_{2} \sqrt{\frac{β τ}{r}}) μ, \end{matrix}$ (28)

which completes the proof.

Using Lemma 13, the main result of Algorithm 4.1 is obtained as follows.

Theorem 14. If $β \leq 1 / 3$ and $τ \leq 1 / 4$ , then the iteration complexity of Algorithm 1 is:

$O (\sqrt{r} \log ε^{- 1}) .$

4.2. Second Algorithm and Its Convergence Analysis

Based on Algorithm 1, a new variant of Mehrotra-type predictor-corrector algorithm for SCO will be described. The predictor step will be the same as in Algorithm 1. The deficiency of Algorithm 1 is that $Δ {\tilde{x}}_{+} \circ Δ {\tilde{s}}_{+}$ is ignored in the corrector step. To remedy, we add the following system to the corrector step of Algorithm 1:

$\begin{array}{l} \tilde{A} Δ {\tilde{x}}_{+}^{c} = 0, \\ {\tilde{A}}^{*} Δ y_{+}^{c} + Δ {\tilde{s}}_{+}^{c} = 0, \\ \tilde{s} \circ Δ {\tilde{x}}_{+}^{c} + \tilde{x} \circ Δ {\tilde{s}}_{+}^{c} = - Δ {\tilde{x}}_{+} \circ Δ {\tilde{s}}_{+} . \end{array}$ (29)

After deciding the step lengths $\bar{α} = (α_{1}, α_{2}, α_{3}) \in ℝ_{+}^{3}$ , the iterates are updated as follows:

$(\tilde{x} (\bar{α}), y (\bar{α}), \tilde{s} (\bar{α})) = (\tilde{x}, y, \tilde{s}) + (Δ \tilde{x} (\bar{α}), Δ y (\bar{α}), Δ \tilde{s} (\bar{α})),$ (30)

where

$\begin{matrix} (Δ \tilde{x} (\bar{α}), Δ y (\bar{α}), Δ \tilde{s} (\bar{α})) : = α_{1} ((Δ {\tilde{x}}_{-}, Δ y_{-}, Δ {\tilde{s}}_{-}) + (Δ {\tilde{x}}_{-}^{c}, Δ y_{-}^{c}, Δ {\tilde{s}}_{-}^{c})) \\ + α_{2} (Δ {\tilde{x}}_{+}, Δ y_{+}, Δ {\tilde{s}}_{+}) + α_{3} (Δ {\tilde{x}}_{+}^{c}, Δ y_{+}^{c}, Δ {\tilde{s}}_{+}^{c}) . \end{matrix}$

Since $T r (Δ \tilde{x} (\bar{α}) \circ Δ \tilde{s} (\bar{α}) = 0$ , we have:

$μ (\bar{α}) = μ + \frac{α_{1}}{r} T r ({(τ μ e - \tilde{x} \circ \tilde{s})}^{-}) + \frac{α_{2}}{r} T r ({(τ μ e - \tilde{x} \circ \tilde{s})}^{+}) = μ (α) .$ (31)

In most interior point methods, the factors $Δ {\tilde{x}}_{-} \circ Δ {\tilde{s}}_{-}$ and $Δ {\tilde{x}}_{+} \circ Δ {\tilde{s}}_{+}$ are ignored; however, here we make use of these factors to reduce the duality gap faster.

Next, we describe the general framework of our second algorithm for the SCO problem.

Algorithm 2. The Mehrotra predictor-corrector method for solving the SCO problem (second method).

Inputs: Accuracy parameter $ε > 0$ ;

neighborhood parameters $0 < β \leq 1 / 3$ and $0 < τ \leq 1 / 4$ ;

the initial point $(x^{0}, y^{0}, s^{0}) \in N (τ, β)$ with $μ^{0} = 〈 x^{0}, s^{0} 〉 / r$ .

Step 0: Set $k : = 0$ .

Step 1: If $〈 x^{k}, s^{k} 〉 \leq ε$ then stop.

Step 2: Choose a Nesterov-Todd scaling element $u$ .

Step 3: Compute the directions $(Δ {\tilde{x}}_{-}, Δ y_{-}, Δ {\tilde{s}}_{-})$ by solving (6) and $(Δ {\tilde{x}}_{+}, Δ y_{+}, Δ {\tilde{s}}_{+})$ by solving (7).

Step 4: Find the maximum feasible step length $θ$ by (8).

Step 5: Compute the directions $(Δ {\tilde{x}}_{-}^{c}, Δ y_{-}^{c}, Δ {\tilde{s}}_{-}^{c})$ by solving (9) and $(Δ {\tilde{x}}_{+}^{c}, Δ y_{+}^{c}, Δ {\tilde{s}}_{+}^{c})$ by solving (29).

Step 6: Find the step lengths $\bar{α} = (α_{1}, α_{2}, α_{3})$ giving a sufficient reduction of the duality gap and assuring $(\tilde{x} (\bar{α}), y (\bar{α}), \tilde{s} (\bar{α})) \in N (τ, β)$ .

Step 7: Let ${\bar{α}}^{k} : = \bar{α}$ , $({\tilde{x}}^{k + 1}, y^{k + 1}, {\tilde{s}}^{k + 1}) : = (\tilde{x} ({\bar{α}}^{k}), y ({\bar{α}}^{k}), \tilde{s} ({\bar{α}}^{k}))$ and $μ^{k + 1} : = 〈 x^{k + 1}, s^{k + 1} 〉 / r$ .

Step 8: Set $k : = k + 1$ and go to Step 1.

We now proceed to show the complexity of Algorithm 2. Since the proof techniques are the same as in Algorithm 1, we only give a brief outline of the proofs and omit the details.

Lemma 15. Let $(Δ {\tilde{x}}_{+}, Δ y_{+}, Δ {\tilde{s}}_{+})$ be the solution of (7) and $β \leq 1 / 3$ . Then, we have:

$‖ α_{3} L {(v)}^{- 1} (Δ {\tilde{x}}_{+} \circ Δ {\tilde{s}}_{+}) ‖ \leq \frac{α_{3}}{8} \sqrt{β τ μ} .$

Proof. Multiplying the third equation of (7) by $L {(v)}^{- 1}$ , by Lemmas 1 and 7, we have:

$\begin{matrix} ‖ α_{3} (Δ {\tilde{x}}_{+} \circ Δ {\tilde{s}}_{+}) ‖ \leq \frac{α_{3}}{\sqrt{8}} {‖ L {(v)}^{- 1} ({(τ μ e - \tilde{x} \circ \tilde{s})}^{+}) ‖}^{2} \\ \leq \frac{1}{2 \sqrt{8}} α_{3} β τ μ . \end{matrix}$ (32)

Similar to the proof of Lemma 2 of [13] and by (32), we have:

$‖ α_{3} L {(v)}^{- 1} (Δ {\tilde{x}}_{+} \circ Δ {\tilde{s}}_{+}) ‖ \leq \frac{α_{3}}{8} \sqrt{β τ μ},$

which completes the proof.

Similar to Lemma 8 and by Lemma 15, we give the following lemma, which gives a bound on $‖ Δ \tilde{x} (\bar{α}) \circ Δ \tilde{s} (\bar{α}) ‖$ .

Lemma 16. Let $(\tilde{x}, y, \tilde{s}) \in N (τ, β)$ , $β \leq 1 / 3$ and $τ \leq 1 / 24$ . If

$α_{3} \leq α_{1} \leq \frac{3}{5} α_{2} \sqrt{\frac{β τ}{r}}$ , then we have:

$‖ Δ \tilde{x} (\bar{α}) \circ Δ \tilde{s} (\bar{α}) ‖ \leq \frac{4}{5} α_{2} β τ μ .$

Now, for simplicity, we use the following notation:

$\begin{array}{l} h (\bar{α}) : = \tilde{x} \circ \tilde{s} + α_{1} {(τ μ e - \tilde{x} \circ \tilde{s})}^{-} + {(τ μ e - \tilde{x} \circ \tilde{s})}^{+} \\ - α_{1} θ (Δ {\tilde{x}}_{-} \circ Δ {\tilde{s}}_{-}) - α_{3} (Δ {\tilde{x}}_{+} \circ Δ {\tilde{s}}_{+}) . \end{array}$ (33)

Lemma 17. Let $h (\bar{α})$ be defined as (33) and $α_{3} \leq \frac{3}{5} α_{2} \sqrt{\frac{β τ}{r}}$ . Then, the eigenvalues of $h (\bar{α})$ are nonnegative.

Proof. Since $Δ {\tilde{x}}_{+} \circ Δ {\tilde{s}}_{+} = \frac{1}{4} ({(Δ {\tilde{x}}_{+} + Δ {\tilde{s}}_{+})}^{2} - {(Δ {\tilde{x}}_{+} - Δ {\tilde{s}}_{+})}^{2})$ , we have:

$\begin{matrix} Δ {\tilde{x}}_{+} \circ Δ {\tilde{s}}_{+} = \frac{1}{4} ({(L {(v)}^{- 1} ({(τ μ e - \tilde{x} \circ \tilde{s})}^{+}))}^{2} - {(Δ {\tilde{x}}_{+} - Δ {\tilde{s}}_{+})}^{2}) \\ ≼ \frac{1}{4} {(L {(v)}^{- 1} ({(τ μ e - \tilde{x} \circ \tilde{s})}^{+}))}^{2} . \end{matrix}$ (34)

Using (34), Lemma 16 and similar arguments as in the proof of Lemma 10, we obtain:

$\begin{matrix} h (\bar{α}) ≽ \sum_{i = 1}^{k} ((1 - α_{1}) λ_{i} + α_{1} τ μ) c_{i} + \sum_{i = k + 1}^{r} (λ_{i} + α_{2} (τ μ - λ_{i})) c_{i} \\ - \frac{α_{1} θ}{4} {(L {(v)}^{- 1} ({(τ μ e - \tilde{x} \circ \tilde{s})}^{-}))}^{2} - \frac{α_{3} θ}{4} {(L {(v)}^{- 1} ({(τ μ e - \tilde{x} \circ \tilde{s})}^{+}))}^{2} \\ ≽ \sum_{i = 1}^{k} ((1 - α_{1}) λ_{i} + α_{1} τ μ) c_{i} + \sum_{i = k + 1}^{r} (λ_{i} + α_{2} (τ μ - λ_{i})) c_{i} \\ - \frac{α_{1} θ}{4} \sum_{i = 1}^{k} (τ μ - λ_{i}) c_{i} - \frac{α_{3}}{4} \sum_{i = k + 1}^{r} \frac{{(τ μ - λ_{i})}^{2}}{λ_{i}} c_{i} \\ ≽ \sum_{i = 1}^{k} ((1 - α_{1} - \frac{α_{1} θ}{4}) λ_{i} + (α_{1} + \frac{α_{1} θ}{4}) τ μ) c_{i} \\ + \sum_{i = k + 1}^{r} ((1 - α_{2}) λ_{i} + (α_{2} - \frac{α_{3}}{8} β) τ μ) c_{i} \\ ≽ \sum_{i = 1}^{k} τ μ c_{i} + \sum_{i = k + 1}^{r} ((1 - α_{2}) λ_{i} + (α_{2} - \frac{α_{3}}{8} β) τ μ) c_{i} ≽ 0, \end{matrix}$ (35)

where the third inequality follows from Lemma 3, the fourth inequality follows from the fact that $λ_{i} - τ μ \geq 0$ , $i = 1, \dots, k$ , and the last inequality follows from $δ \leq α_{3} \leq \frac{3}{5} α_{2} \sqrt{\frac{β τ}{r}}$ . Therefore, we conclude that the eigenvalues of $h (α)$ are nonnegative, and this completes the proof.

Lemma 18. Let $(\tilde{x}, y, \tilde{s}) \in N (τ, β)$ , $β \leq 1 / 3$ and $τ \leq 1 / 4$ . If

$δ \leq α_{3} \leq α_{1} = \frac{3}{5} α_{2} \sqrt{\frac{β τ}{r}}$ , then we have:

$‖ {(τ μ (\bar{α}) e - h (\bar{α}))}^{+} ‖ = (1 - α_{2}) β τ μ .$

Proof. If $α_{1} \geq δ$ , then from (31) and Lemma 9, we have $μ (\bar{α}) \leq μ$ . Therefore, by (35), (28), Lemmas 2 and 17, we obtain:

$\begin{array}{l} ‖ {(τ μ (\bar{α}) e - h (\bar{α}))}^{+} ‖ \\ \leq ‖ {(\sum_{i = 1}^{k} (μ (\bar{α}) - μ) τ c_{i})}^{+} ‖ \\ + ‖ {(\sum_{i = k + 1}^{r} [(α_{2} - 1) λ_{i} + (μ (\bar{α}) - α_{2} μ + \frac{α_{3}}{8} β μ) τ] c_{i})}^{+} ‖ \\ \leq ‖ {(\sum_{i = k + 1}^{r} [(α_{2} - 1) λ_{i} + (μ - α_{1} (1 - τ) μ + \frac{α_{2}}{\sqrt{r}} β τ μ - α_{2} μ + \frac{α_{3}}{8} β μ) τ] c_{i})}^{+} ‖ \\ \leq ‖ {(\sum_{i = k + 1}^{r} [(1 - α_{2}) (τ μ - λ_{i}) + (- 1 + τ + \frac{5}{3} \sqrt{β τ} + \frac{β}{8}) α_{1} τ μ] c_{i})}^{+} ‖ \\ \leq ‖ {(\sum_{i = k + 1}^{r} [(1 - α_{2}) (τ μ - λ_{i})] c_{i})}^{+} ‖ \\ = {‖ (1 - α_{2}) {(τ μ e - \tilde{x} \circ \tilde{s})}^{+} ‖}^{2} \leq (1 - α_{2}) β τ μ, \end{array}$

where the fourth inequality follows from the fact that $α_{3} \leq α_{1} = \frac{3}{5} α_{2} \sqrt{\frac{β τ}{r}}$ and the last inequality follows from Lemma 3. The proof of lemma is now complete.

Lemma 19. Let $(\tilde{x}, y, \tilde{s}) \in N (τ, β)$ , $β \leq 1 / 3$ and $τ \leq 1 / 4$ . If

$δ \leq α_{3} \leq α_{1} = \frac{3}{5} α_{2} \sqrt{\frac{β τ}{r}}$ , then $(\tilde{x} (\bar{α}), y (\bar{α}), \tilde{s} (\bar{α})) \in N (τ, β)$ .

Proof. From Lemmas 2, 16 and 18, we have:

$\begin{array}{l} ‖ {(τ μ (\bar{α}) e - \tilde{x} (\bar{α}) \circ \tilde{s} (\bar{α}))}^{+} ‖ \\ = ‖ {(τ μ (\bar{α}) e - h (\bar{α}) - Δ \tilde{x} (\bar{α}) \circ Δ \tilde{s} (\bar{α}))}^{+} ‖ \\ \leq ‖ {(τ μ (\bar{α}) e - h (\bar{α}))}^{+} + {(- Δ \tilde{x} (\bar{α}) \circ Δ \tilde{s} (\bar{α}))}^{+} ‖ \\ \leq (1 - α_{2}) β τ μ + \frac{4}{5} α_{2} β τ μ = (1 - α_{1}) β τ μ + (α_{1} - α_{2} + \frac{4}{5} α_{2}) β τ μ \\ \leq (1 - α_{1}) β τ μ + (\frac{3}{5} \sqrt{\frac{β τ}{r}} - 1 + \frac{4}{5}) α_{2} β τ μ \leq (1 - α_{1}) β τ μ \leq β τ μ (\bar{α}), \end{array}$

where the last inequality follows from (27). This implies $(\tilde{x} (\bar{α}), y (\bar{α}), \tilde{s} (\bar{α})) \in N (τ, β)$ , which completes the proof.

Now, using Lemma 13, since $μ (\bar{α}) = μ (α)$ , we are led to the following iteration complexity of Algorithm 2.

Theorem 20. If $β \leq 1 / 3$ and $τ \leq 1 / 4$ , then the iteration complexity of Algorithm 2 is:

$O (\sqrt{r} \log ε^{- 1}) .$

5. Numerical Testing

Here we compare our two proposed algorithms (Alg. 1 and Alg. 2) with Algorithm 3 (Alg. 3) of [9] and Algorithm 4 (Alg. 4) of [5], using MATLAB R2017b on an Intel Core i7 (2.20 GHz) with 8 GB RAM. We also solve the numerical examples with SeDuMi [22] and display the results. In order to test the algorithms, we use some SDO problems of [23]. These test problems are random semidefinite programming problems (Table 1 and Table 2), educational testing problems (Table 3 and Table 4), norm minimization problems (Table 5 and Table 6) and max-cut problems (Table 7 and Table 8). The parameters were set to be $ε = 10^{- 8}$ , $β = 1 / 3$ and $τ = 1 / 4$ .

In Tables 1-8, we list the number of the constraints (m), the dimension of the blocks (n) and the obtained average number of iterations and the average CPU time in seconds. In each instance, we run the algorithm ten times for the same m and n and the average results are shown in the tables.

Table 1. Average number of iterations on Random problems.

m	n	SeDuMi	Alg. 1	Alg. 2	Alg. 3	Alg. 4
30	30	0.11	0.20	0.20	0.85	2.07
40	45	0.19	0.77	0.83	1.26	5.51
50	50	0.16	1.09	0.97	2.83	6.16
70	80	0.57	2.93	3.04	4.07	11.48
100	100	1.16	4.39	4.84	9.82	25.58

Table 2. Average CPU times on Random problems.

m	n	SeDuMi	Alg. 1	Alg.2	Alg. 3	Alg. 4
30	30	13.0	13.4	13.3	14.8	18.4
40	45	12.8	13.5	13.0	15.4	23.0
50	50	13.5	14.8	14.6	16.1	25.1
70	80	13.9	17.1	16.7	22.5	27.8
100	100	14.8	21.3	19.6	28.4	34.2

Table 3. Average number of iterations on educational test problems.

m	n	SeDuMi	Alg. 1	Alg. 2	Alg. 3	Alg. 4
5	10	12.7	14.8	14.1	15.9	22.2
20	40	14.6	35.5	34.6	38.1	46.3
25	50	19.0	46.7	46.5	52.4	72.6
40	80	23.8	63.1	60.8	86.2	95.3
50	100	22.6	70.8	68.4	112.5	118.0

Table 4. Average CPU times on educational test problems.

m	n	SeDuMi	Alg. 1	Alg. 2	Alg. 3	Alg. 4
5	10	0.03	0.74	0.83	1.72	2.83
20	40	0.15	3.39	3.86	4.75	15.18
25	50	0.30	6.41	6.84	10.63	36.07
40	80	0.50	12.70	11.53	18.24	44.62
50	100	0.64	18.26	18.69	30.73	58.22

Table 5. Average number of iterations on norm minimization problems.

m	n	SeDuMi	Alg. 1	Alg. 2	Alg. 3	Alg. 4
30	30	13.5	12.7	12.7	13.1	15.6
30	40	13.7	13.5	12.8	17.1	23.7
50	50	12.6	14.2	13.8	19.5	27.3
80	90	13.6	19.3	18.8	24.6	41.3
100	100	14.2	23.4	23.4	48.3	53.6

Table 6. Average CPU times on norm minimization problems.

m	n	SeDuMi	Alg. 1	Alg. 2	Alg. 3	Alg. 4
30	30	0.65	0.11	0.39	1.37	2.34
30	40	0.33	0.19	0.55	2.13	5.19
50	50	0.45	0.26	0.79	4.06	6.22
80	90	0.99	1.62	1.55	4.77	7.19
100	100	1.37	3.52	3.48	6.56	9.62

Table 7. Average number of iterations on Max-Cut problems.

m	n	SeDuMi	Alg. 1	Alg. 2	Alg. 3	Alg. 4
30	30	12.4	13.3	12.8	14.1	18.1
45	45	13.5	14.3	13.5	15.8	22.4
50	50	12.7	14.7	14.6	16.1	25.8
80	80	12.6	18.8	18.6	27.5	44.1
100	100	13.4	22.6	20.6	31.7	44.9

Table 8. Average CPU times on Max-Cut problems.

m	n	SeDuMi	Alg. 1	Alg. 2	Alg. 3	Alg. 4
30	30	0.80	0.22	0.31	0.52	1.83
45	45	0.33	0.71	0.90	1.63	5.03
50	50	0.35	0.93	1.36	2.76	6.84
80	80	0.50	1.84	1.86	3.32	8.63
100	100	0.44	3.62	3.47	4.16	8.84

Based on the numerical results, as summarized in the tables, our proposed algorithms show to be more efficient and promise to be encouraging. It can be observed that the average number of iterations needed for Algorithm 2 mostly turns to be less than the ones obtained by the other algorithms, while most often the average CPU time of Algorithm 1 is slightly better than the one obtained by Algorithm 2.

6. Concluding Remarks

We modified the Ai-Zhang primal-dual path-following interior-point method to gain two classes of Mehrotra-type predictor-corrector algorithms for SCO. The Euclidean Jordan algebra was a basic tool in our analysis. We showed that the use of the corrector steps did not cause any loss in the worst-case complexity of the algorithms. Based on the NT direction, our proposed algorithms had an $O (\sqrt{r} \log ε^{- 1})$ iteration complexity bound, as the best complexity results available for SCO so far. Numerical experiments showed encouraging evidence for practical efficiency of the proposed algorithms.

Acknowledgements

The first author would like to thank Shahrekord University for financial support. The research of the first author was also partially supported by the Center of Excellence for Mathematics, University of Shahrekord, Shahrekord, Iran. The second author thanks Research Council of Sharif University of Technology for financial support.

Funding

The research of the first author was in part supported by a grant from Shahrekord University (No. 404GRD1M2025).

Conflicts of Interest

The authors declare no conflicts of interest regarding the publication of this paper.

References

[1]	Nesterov, Y. and Nemirovskii, A. (1994) Interior-Point Polynomial Algorithms in Convex Programming. Society for Industrial and Applied Mathematics.[CrossRef]
[2]	Faybusovich, L. (1997) Euclidean Jordan Algebras and Interior-Point Algorithms. Positivity, 1, 331-357.[CrossRef]
[3]	Asadi, S., Mansouri, H., Darvay, Z., Lesaja, G. and Zangiabadi, M. (2019) A Long-Step Feasible Predictor-Corrector Interior-Point Algorithm for Symmetric Cone Optimization. Optimization Methods and Software, 34, 336-362.[CrossRef]
[4]	Feng, Z. and Fang, L. (2014) A New-Iteration Predictor-Corrector Algorithm with Wide Neighborhood for Semidefinite Programming. Journal of Computational and Applied Mathematics, 256, 65-76.
[5]	Pirhaji, M. and Mansouri, M.H. (2017) An Wide Neighborhood Interior Point Algorithm for Semidefinite Optimization. Journal of Computational and Applied Mathematics, 36, 145-157.
[6]	Schmieta, S.H. and Alizadeh, F. (2003) Extension of Primal-Dual Interior Point Algorithms to Symmetric Cones. Mathematical Programming, 96, 409-438.[CrossRef]
[7]	Liu, H., Yang, X. and Liu, C. (2013) A New Wide Neighborhood Primal-Dual Infeasible-Interior-Point Method for Symmetric Cone Programming. Journal of Optimization Theory and Applications, 158, 796-815.[CrossRef]
[8]	Sayadi Shahraki, M., Mansouri, H. and Zangiabadi, M. (2017) Two Wide Neighborhood Interior-Point Methods for Symmetric Cone Optimization. Computational Optimization and Applications, 68, 29-55.[CrossRef]
[9]	Sayadi Shahraki, M., Mansouri, H., Zangiabadi, M. and Mahdavi-Amiri, N. (2018) A Wide Neighborhood Primal-Dual Predictor-Corrector Interior-Point Method for Symmetric Cone Optimization. Numerical Algorithms, 78, 535-552.[CrossRef]
[10]	Shahraki, M.S., Mansouri, H. and Delavarkhalafi, A. (2022) A Wide Neighbourhood Predictor-Corrector Infeasible-Interior-Point Algorithm for Symmetric Cone Programming. Optimization Methods and Software, 37, 2103-2120.[CrossRef]
[11]	Mehrotra, S. (1992) On the Implementation of a Primal-Dual Interior Point Method. SIAM Journal on Optimization, 2, 575-601.[CrossRef]
[12]	Zhang, Y. and Zhang, D. (1995) On Polynomiality of the Mehrotra-Type Predictor-Corrector Interior-Point Algorithms. Mathematical Programming, 68, 303-318.[CrossRef]
[13]	Zhang, J. and Zhang, K. (2011) Polynomial Complexity of an Interior Point Algorithm with a Second Order Corrector Step for Symmetric Cone Programming. Mathematical Methods of Operations Research, 73, 75-90.[CrossRef]
[14]	Ai, W. and Zhang, S. (2005) An Iteration Primal-Dual Path-Following Method, Based on Wide Neighborhoods and Large Updates, for Monotone LCP. SIAM Journal on Optimization, 16, 400-417.
[15]	Liu, C., Shang, Y. and Liu, H. (2016) An Iteration Mehrotra-Type Predictor-Corrector Algorithm for Monotone Linear Complementarity Problem. Optimization Letters, 10, 619-634.
[16]	Faraut, J. and Korányi, A. (1994) Analysis on Symmetric Cones. Oxford University Press.
[17]	Faybusovich, L. (1997) Linear Systems in Jordan Algebras and Primal-Dual Interior-Point Algorithms. Journal of Computational and Applied Mathematics, 86, 149-175.[CrossRef]
[18]	Nesterov, Y.E. and Todd, M.J. (1998) Primal-Dual Interior-Point Methods for Self-Scaled Cones. SIAM Journal on Optimization, 8, 324-364.[CrossRef]
[19]	Ai, W. (2004) Neighborhood-Following Algorithms for Linear Programming. Science in China Series A, 47, 812-820.
[20]	Gu, G., Zangiabadi, M. and Roos, C. (2011) Full Nesterov-Todd Step Infeasible Interior-Point Method for Symmetric Optimization. European Journal of Operational Research, 214, 473-484.[CrossRef]
[21]	Liu, C. (2012) Study on Complexity of Some Interior-Point Algorithms in Conic Programming. Ph.D. Thesis, Xidian University.
[22]	Sturm, J.F. (1999) Using Sedumi 1.02, a Matlab Toolbox for Optimization over Symmetric Cones. Optimization Methods and Software, 11, 625-653.[CrossRef]
[23]	Todd, M.J., Toh, K.C. and Tütüncü, R.H. (1998) On the Nesterov-Todd Direction in Semidefinite Programming. SIAM Journal on Optimization, 8, 769-796.[CrossRef]

	[email protected]
	+86 18163351462 (WhatsApp)
	1655362766
	SCIRP WeChat

Journals Menu

Home

About SCIRP

Service

Policies