Method of Successive Polynomial Substitutions for Computing Roots of Polynomials

Serdar Beji

doi:10.4236/apm.2026.166023

Advances in Pure Mathematics > Vol.16 No.6, June 2026

Method of Successive Polynomial Substitutions for Computing Roots of Polynomials

Serdar Beji

Faculty of Engineering and Natural Sciences, Department of Computer Engineering, Biruni University, Zeytinburnu, Istanbul, Türkiye.
DOI: 10.4236/apm.2026.166023 PDF HTML XML 9 Downloads 40 Views

Abstract

A recursive technique, termed method of successive polynomial substitutions for computing all the real and complex roots of a polynomial of any given degree, is introduced. The method proceeds by reducing the degree of polynomial by one at each stage of successive polynomial substitutions; extracts a root, and continues until reaching a second-degree polynomial whose roots are obtained analytically. Coefficients of the polynomial may be real or complex; no initial guess is needed and the results are highly accurate. Sample computations for polynomials of various degrees and the code used in computations are given.

Keywords

Real and Complex Roots of Polynomials, Method of Successive Polynomial Substitutions, Accurate and Efficient Computation of Zeros of Polynomials

Share and Cite:

Beji, S. (2026) Method of Successive Polynomial Substitutions for Computing Roots of Polynomials. Advances in Pure Mathematics, 16, 416-430. doi: 10.4236/apm.2026.166023.

1. Introduction

Numerous different techniques such as the bisection method, fixed-point iteration, Newton-Raphson method, are available for obtaining the roots of equations in general. To a lesser extent, there are solution techniques like Laguerre’s method, Müller’s method, and Bairstow’s method, specific to the roots of polynomials (Chapra and Canale [1], pp. 123-202). The Weierstrass-Durand-Kerner method, essentially due to Weierstrass [2] but about 70 years later independently rediscovered by Durand [3] and Kerner [4], is also a well-known root-finding algorithm.

Based on the work of Traub [5], Jenkins and Traub [6] introduced an algorithm for computing zeros of polynomials with complex coefficients, programmed as the CPOLY algorithm [7]. A faster version of the algorithm for real polynomials was also formulated [8] and later presented as the RPOLY algorithm [9]. Currently, RPOLY is regarded as the standard in black-box polynomial root-finders due to its robustness, accuracy, and efficiency.

The present approach is a generalized and preconditioned version of a recursive method first introduced in Beji [10] for obtaining the roots of cubic polynomials. The method proceeds by reducing the degree of the polynomial by one at each stage of successive polynomial substitutions till the second degree. All the roots of a given polynomial, whether real or complex, are obtained in a single run without requiring any initial guess. The entire approach and the routine implementing it are found to be quite robust and accurate for polynomials of any degree with real or complex coefficients. When the implementation simplicity and efficiency of the method are considered, a professional version of the given generic algorithm may be a worthwhile competitor of the Jenkins-Traub algorithm.

2. Method of Successive Polynomial Substitutions

A polynomial of $n^{th}$ -degree can be written as

$P_{n} (x) = \sum_{j = 0}^{n} a_{j} x^{n - j} = a_{0} x^{n} + a_{1} x^{n - 1} + \dots + a_{n - 1} x + a_{n},$ (1)

where $n$ is the degree of polynomial and $a_{j}$ ’s are real or complex coefficients. Introducing a change of variable $x = λ z$ and setting $P_{n} (λ z) = 0$ give

$a_{0} {(λ z)}^{n} + a_{1} {(λ z)}^{n - 1} + \dots + a_{n - 1} λ z + a_{n} = 0,$ (2)

where $λ$ is a real or complex constant to be determined. Dividing the entire equation by $a_{0} λ^{n}$ results in

$z^{n} + \frac{a_{1}}{λ a_{0}} z^{n - 1} + \dots + \frac{a_{n - 1}}{λ^{n - 1} a_{0}} z + \frac{a_{n}}{λ^{n} a_{0}} = 0.$ (3)

Following the approach introduced in Beji [10], we set $a_{n} / λ^{n} a_{0} = 1$ and solve for $λ = {(a_{n} / a_{0})}^{1 / n}$ to transform Equation (3) into

$z^{n} + {(\frac{a_{0}}{a_{n}})}^{1 / n} \frac{a_{1}}{a_{0}} z^{n - 1} + \dots + {(\frac{a_{0}}{a_{n}})}^{1 - 1 / n} \frac{a_{n - 1}}{a_{0}} z + 1 = 0.$ (4)

Redefining the coefficients as $a_{1} \equiv (a_{1} / a_{0}) {(a_{0} / a_{n})}^{1 / n}, \dots, a_{n - 1} \equiv (a_{n - 1} / a_{0}) {(a_{0} / a_{n})}^{1 - 1 / n}$ , Equation (4) is rewritten as

$z^{n} + a_{1} z^{n - 1} + \dots + a_{n - 1} z + 1 = 0.$ (5)

Depending on n being even (+) or odd (−), the multiplication of the roots of (5) is obviously $z_{1} z_{2} \dots z_{n - 1} z_{n} = \pm 1$ ; therefore, Equation (5) has at least one root whose magnitude is less than unity, unless all the roots are equal in magnitude. Leaving the special case aside we can reason as follows. If $z_{i}$ is the root less than unity, then,

$| z_{i} | < 1 \Rightarrow {| z_{i} |}^{n} < {| z_{i} |}^{n - 1} < \dots < {| z_{i} |}^{2} < | z_{i} | .$ (6)

In view of (6), a very rough approximation is possible by neglecting the term with the highest power zⁿ in (5) and writing

$a_{1} z^{n - 1} + \dots + a_{n - 1} z + 1 \approx 0,$ (7)

which reduces the degree of polynomial to be solved by one. But it is possible to do much better by successive polynomial submissions. First, multiply Equation (5) by z and write it as

$z^{n + 1} = - a_{1} z^{n} - a_{2} z^{n - 1} - \dots - a_{n - 1} z^{2} - z .$ (8)

Next, use (5) again to replace $z^{n}$ on the right in (8):

$z^{n + 1} = - a_{1} (- a_{1} z^{n - 1} - a_{2} z^{n - 2} - \dots - a_{n - 1} z - 1) - a_{2} z^{n - 1} - \dots - a_{n - 1} z^{2} - z$ (9a)

$z^{n + 1} = (a_{1}^{2} - a_{2}) z^{n - 1} + (a_{1} a_{2} - a_{3}) z^{n - 2} + \dots + (a_{1} a_{n - 2} - a_{n - 1}) z^{2} + (a_{1} a_{n - 1} - 1) z + a_{1} .$ (9b)

We now have the term $z^{n + 1}$ on the left, and since ${| z |}^{n + 1} < {| z |}^{n}$ for $| z | < 1$ , neglecting $z^{n + 1}$ in (9b) would definitely result in a better approximation compared to (7), where $z^{n}$ is neglected:

$(a_{1}^{2} - a_{2}) z^{n - 1} + (a_{1} a_{2} - a_{3}) z^{n - 2} + \dots + (a_{1} a_{n - 2} - a_{n - 1}) z^{2} + (a_{1} a_{n - 1} - 1) z + a_{1} \approx 0.$ (10)

Repeating the substitutions as many times as desired according to a definite criterion eventually leads to a polynomial one degree less, $(n - 1)$ , than the original polynomial of degree $n$ . By recalling that the sum of roots is equal to the opposite sign value of the coefficient of the second highest power term, the first root eliminated in the reduction process can easily be determined by a simple subtraction as $x_{1} = - c_{1} + d_{2}$ . Here, $c_{1}$ is the coefficient of $x^{n - 1}$ term of the original polynomial of degree $n$ and $d_{2}$ is the coefficient of $x^{n - 2}$ term of the reduced polynomial of degree $(n - 1)$ . Afterwards, the same recursion process is applied to the reduced polynomial to reduce it one degree more and subsequently the second root is computed. The recursive process is repeated until a second-degree polynomial is obtained and the last two roots are computed from the well-known analytical expression, the quadratic formula. Afterwards, roots of the original polynomial can be obtained from $x = λ z$ . But it turns out that this transformation is not necessary at all because the inverse transformation following successive substitutions yields a polynomial which is identical to the one that can be obtained without transformation. Therefore, the change of variable, $x = λ z$ , should be viewed only as a formal justification process of the present approach.

2.1. A Demonstrative Application

To clarify the method further we apply it to a general cubic polynomial $P_{3} (x) = a_{0} x^{3} + a_{1} x^{2} + a_{2} x + a_{3}$ without transformation. First, set $P_{3} (x) = 0$ , divide by $a_{0}$ and redefine its new coefficients as $a_{1} = a_{1} / a_{0}$ , etc. so that

$x^{3} = - a_{1} x^{2} - a_{2} x - a_{3} .$ (11)

Neglecting the $x^{3}$ term results in a zeroth-order approximation with $a_{1} x^{2} + a_{2} x + a_{3} \approx 0$ . For a better approximation multiply Equation (11) by x and replace the $x^{3}$ term by $- a_{1} x^{2} - a_{2} x - a_{3}$ so that

$x^{4} = (a_{1}^{2} - a_{2}) x^{2} + (a_{1} a_{2} - a_{3}) x + a_{1} a_{3} .$ (12)

We get a first-order approximation now if the term $x^{4}$ is neglected,

$(a_{1}^{2} - a_{2}) x^{2} + (a_{1} a_{2} - a_{3}) x + a_{1} a_{3} \approx 0.$ (13)

The following second-degree polynomials listed in Table 1 are obtained for the first three approximations.

Table 1. Quadratic polynomials resulting from successive substitutions of a general cubic polynomial.

Degree Neglected	Corresponding Quadratic Polynomial
3^rd	$a_{1} x^{2} + a_{2} x + a_{3} = 0$
4^th	$(a_{1}^{2} - a_{2}) x^{2} + (a_{1} a_{2} - a_{3}) x + a_{1} a_{3} = 0$
5^th	$[a_{1} (2 a_{2} - a_{1}^{2}) - a_{3}] x^{2} + [a_{2} (a_{2} - a_{1}^{2}) + a_{1} a_{3}] x + a_{3} (a_{2} - a_{1}^{2}) = 0$

Higher-order approximations are obtained in the same manner; however, the coefficients get larger and cause computational inaccuracies. To avoid this problem, the reduced polynomial should be divided by the coefficient of $x^{2}$ term at each step. Note that this coefficient would never become zero as we must have a second-degree polynomial to get two more roots besides the one eliminated through the degree-reduction process. In other words, the fundamental theorem of algebra ensures a non-zero coefficient for the highest term of the reduced polynomial.

As a numerical example, we now consider a particular cubic polynomial $P_{3} (x) = (x - 1) (x - 2) (x - 3) = x^{3} - 6 x^{2} + 11 x - 6$ whose roots are obviously $x_{1} = 1$ , $x_{2} = 2$ , and $x_{3} = 3$ . Following the above outlined method and employing the suggested normalization at each step result in Table 2. Roots obtained from the last quadratic $x^{2} - 2.964 x + 1.964 = 0$ are $x_{1} = 1.000$ and $x_{2} = 1.964$ and the process is clearly leading to $x^{2} - 3.000 x + 2.000 = 0$ with roots $x_{1} = 1.000$ and $x_{2} = 2.000$ .

Table 2. Quadratic polynomials resulting from successive substitutions of a particular cubic polynomial.

Degree Neglected	Corresponding Quadratic Polynomial
3^rd	$x^{2} - 1.833 x + 1.000 = 0$
5^th	$x^{2} - 2.655 x + 1.666 = 0$
10^th	$x^{2} - 2.964 x + 1.964 = 0$

2.2. Formulation of Recursive Scheme

With the guidance of preceding examples, we can formulate the recursive scheme to be employed for determining coefficients of the reduced degree, $x^{n - 1}$ , polynomial. First, a single sweep is performed for introducing a second set of coefficients:

$\begin{array}{l} b_{1} = a_{1}^{2} - a_{2}, \\ b_{i} = (a_{1} a_{i} - a_{i + 1}) / b_{1} for i = 2, \dots, n - 1 \\ b_{n} = a_{1} a_{n} / b_{1}, \end{array}$ (14)

where b_i’s are the normalized coefficients exemplified in Table 2. Then, a succession of sweeps till a definite convergence criterion is met:

$\begin{array}{l} b_{1} = b_{2} - a_{1}, \\ b_{i} = (b_{i + 1} - a_{i}) / b_{1} for i = 2, \dots, n - 1 \\ b_{n} = - a_{n} / b_{1}, \end{array}$ (15)

The scheme above reduces $n^{th}$ -degree polynomial to the ${(n - 1)}^{th}$ -degree polynomial and therefore can be used only for reducing a cubic polynomial $n = 3$ to a quadratic polynomial for obtaining all the roots. For higher degree polynomials $n > 3$ it is necessary to reduce the degree of polynomial more than once till a quadratic polynomial is obtained. The generalized version, which reduces the degrees successively from $n$ to $n - 1, n - 2, \dots, 2$ , is for the initial sweep

$\begin{array}{l} b_{j} = a_{j}^{2} - a_{j + 1}, for j = 1, \dots, n - 2 \\ b_{i} = (a_{i} a_{j} - a_{i + 1}) / b_{j} for i = j + 1, \dots, n - 1 \\ b_{n} = a_{j} a_{n} / b_{j}, \end{array}$ (16)

and for the succession of sweeps till convergence

$\begin{array}{l} b_{j} = b_{j + 1} - a_{j}, for j = 1, \dots, n - 2 \\ b_{i} = (b_{i + 1} - a_{i}) / b_{j} for i = j + 1, \dots, n - 1 \\ b_{n} = - a_{n} / b_{j} . \end{array}$ (17)

Note that the special case formulated by Equations (14) and (15) corresponds to $n = 3$ hence $j = 1$ to $n - 2 = 1$ , just one-degree reduction in the general scheme of (16) and (17). FORTRAN code given in the Appendix basically uses Equations (16) and (17). At any stage of degree-reduction the repeated sweeps continue until the absolute value of the last coefficient of the polynomial, b_n, differs less than 10⁻¹⁴ from the one computed in the previous step. Checking only the last coefficient is sufficient as all the coefficients are computed interrelatedly and the convergence of a coefficient implies the convergence of all the coefficients. Computations are carried out entirely as complex numbers so that roots of polynomials in most general forms, including complex polynomial coefficients, can be computed.

3. Preconditioning

Virtually every numerical technique requires the satisfaction of a definite condition or conditions to prevent its failure. As it is obvious from Equations (8) and (9), for the present methodology this condition is $a_{1} \neq 0$ . By a simple shift, which may be called as the preconditioning procedure, this particular requirement can be handled and the scheme works virtually for all polynomials. Preconditioning is achieved by changing the independent variable, say x, to $z + θ$ and then specifying the shift θ in a way to set a specific coefficient of the polynomial to a desired value. In order to implement such a measure, it is first necessary to introduce a convenient way of formulating the intended shift.

3.1. Transformation of Polynomials via Taylor Series

A change of independent variable, $x = z + θ$ , transforms the general $n^{th}$ degree polynomial $P_{n} (x) = a_{0} x^{n} + a_{1} x^{n - 1} + \dots + a_{n - 1} x + a_{n}$ via the Taylor series expansion, into $P_{n} (z + θ)$ :

$P_{n} (z + θ) = P_{n} (θ) + {P^{'}}_{n} (θ) z + {P^{″}}_{n} (θ) \frac{z^{2}}{2!} + \dots + P_{n}^{(n - 1)} (θ) \frac{z^{n - 1}}{(n - 1)!} + P_{n}^{(n)} (θ) \frac{z^{n}}{n!},$ (18)

where θ is a real or complex constant to be determined and primes indicate differentiation with respect to the independent variable. The above expression for instance is quite practical for implementing the so-called Tschirnhaus transformation [11] which makes the coefficient of $z^{n - 1}$ term zero; specifically, $P_{n}^{(n - 1)} (θ) / (n - 1)! = 0$ . Incidentally, the transformation attempted by Tschirnhaus was more general as he stated in the first sentence of his article: “We have learned from DesCartes’ geometry by what method the second term might reliably be removed from a given equation; but on the question of removing multiple intermediate terms I have seen nothing hitherto in the analytic arts.” but was not as much successful as he claimed. Thus, according to Tschirnhaus himself, DesCartes was the first to introduce the method to remove the second term; that is, $a_{1} x^{n - 1}$ term.

The use of (18) for easily obtaining the roots of quadratic equation $P_{2} (x) = a_{0} x^{2} + a_{1} x + a_{2}$ by removing the second term is now demonstrated. In the transformed expression, $P_{2} (z + θ) = {P^{″}}_{2} (θ) z^{2} / 2! + {P^{'}}_{2} (θ) z + P_{2} (θ)$ , setting ${P^{'}}_{2} (θ) = 2 a_{0} θ + a_{1} = 0$ gives $θ = - a_{1} / 2 a_{0}$ . Then, $P_{2} (θ) = a_{0} \cdot {(- a_{1} / 2 a_{0})}^{2} + a_{1} \cdot (- a_{1} / 2 a_{0}) + a_{2}$ or $P_{2} (θ) = (4 a_{0} a_{2} - a_{1}^{2}) / 4 a_{0}$ . On the other hand, when ${P^{'}}_{2} (θ) = 0$ , the transformed polynomial becomes ${P^{″}}_{2} (θ) z^{2} / 2! + P_{2} (θ) = 0$ hence $z = \pm {[- 2 P_{2} (θ) / {P^{″}}_{2} (θ)]}^{1 / 2}$ . Noting that ${P^{″}}_{2} (θ) = 2 a_{0}$ and that $x = z + θ$ we obtain the roots as $x = θ \pm {[- 2 P_{2} (θ) / {P^{″}}_{2} (θ)]}^{1 / 2}$ , which, in turn, can be shown to be identical with the well-known quadratic formula. Applications of (18) to cubic, quartic, and quintic equations can be found in Beji [12].

3.2. Preconditioning Measure

Clearly, coefficient $a_{1}$ which multiplies $x^{n - 1}$ must be nonzero if the successive polynomial substitutions are to be carried out successfully as Equations (8) and (9) show. Equivalently, when the transformed polynomial is considered, the coefficient of $z^{n - 1}$ must never vanish to prevent a complete failure of the procedure. Thus, for ensuring a nonzero coefficient for $z^{n - 1}$ in the transformed polynomial, $P_{n}^{(n - 1)} (θ) / (n - 1)!$ is set to a constant, $n (1 + 0.5 i)$ , where $i = \sqrt{- 1}$ is the imaginary unit. Although in principle any constant would work, the value $n (1 + 0.5 i)$ is selected based on systematic trial computations with polynomials of differing degrees.

The trials revealed that this particular value typically results in fastest convergence rates and that unequal real and imaginary parts is essential for convergence in special cases such as $a_{0} x^{n} + a_{n} = 0$ , where all the intermediate coefficients are zero. Accordingly,

$\frac{P_{n}^{(n - 1)} (θ)}{(n - 1)!} = \frac{n! a_{0} θ + (n - 1)! a_{1}}{(n - 1)!} = n (1 + 0.5 i) \Rightarrow θ = [(1 + 0.5 i) - a_{1} / n] / a_{0} .$ (19)

Once the shift θ is determined according to (19), all the coefficients of transformed polynomial can easily be computed with the help of (18), and this is precisely the procedure followed for the code given in the Appendix.

4. Sample Computations

The methodology is implemented in the FORTRAN program given in the Appendix. Maximum degree of polynomials to be solved is arbitrarily set to $n \max = 100$ ; likewise, the maximum number of allowed iterations is defined as $i \max = 10000$ . In accord with the programming language and machine capacity, the computational precision for convergence at each degree-reduction is specified as $ϵ = 10^{- 14}$ . The coefficients of polynomial are entered in complex number format in parenthesis as the code is written to accommodate the most general case possible. Results are given in 10 decimal places.

4.1. P₃(x) = 6x³ − 17x² − 5x + 6

Figure 1 shows the execution outcome of the code for a third-degree polynomial constructed as $P_{3} (x) = (3 x + 2) (x - 3) (2 x - 1) = 6 x^{3} - 17 x^{2} - 5 x + 6$ with roots $x_{1} = - 0.6666666667$ , $x_{2} = 3.0000000000$ , and $x_{3} = 0.5000000000$ , which are all real. For this case, only 56 iterations or sweeps are needed to reduce the degree of polynomial from 3^rd to 2^nd with $ϵ = 10^{- 14}$ precision and these 56 iterations are all the needed to compute three roots, which are exact to 10 decimal places. To check the accuracy of each computed root, the polynomial is evaluated too; the results are shown on the right.

Figure 1. Screenshot of code execution for $P_{3} (x) = 6 x^{3} - 17 x^{2} - 5 x + 6$ .

4.2. P₄(x) = 3x⁴ − 2x³ + x² + 4x + 5

A fourth-degree polynomial with arbitrarily selected real coefficients is considered and the code gives two pairs of complex conjugate zeros. Figure 2 shows the results which are quite accurate as revealed by the polynomial evaluations, all zero, given on the right. Reducing the degree of polynomial from 4^th to 3^rd requires 209 iterations and from 3^rd to 2^nd, 245 iterations; hence, altogether, obtaining all the roots takes 454 sweeps.

Figure 2. Screenshot of code execution for $P_{4} (x) = 3 x^{4} - 2 x^{3} + x^{2} + 4 x + 5$ .

4.3. P₅(x) = (−2 + 3i)x⁵ + (5 + 5i)x⁴ + (0 − i)x³ + (7 + 0i)x² + (1 − 2i)x + (−15 + 12i)

A fifth-degree polynomial with arbitrarily selected real and complex coefficients is considered. The computed roots, seen in Figure 3, are all complex but none complex conjugate because the polynomial has complex coefficients. Iteration numbers for computational sequences, from 5^th down to 2^nd degree are 218, 4507, and 60, summing up to 4785 sweeps altogether for the computational precision requirement $ϵ = 10^{- 14}$ . Functional evaluations shown on the right are all zero to 10 decimal places, verifying that the computed roots are quite accurate.

Figure 3. Screenshot of code execution for P₅(x) = (−2 + 3i)x⁵ + (5 + 5i)x⁴ + (0 − i)x³ + (7 + 0i)x² + (1 – 2i)x + (−15 + 12i).

4.4. P₇(x) = x⁷ − 6.01x⁶ + 12.54x⁵ − 8.545x⁴ − 5.505x³ + 12.545x² − 8.035x + 2.01

A particularly difficult case used in Jenkins and Traub [8] is factored as $P_{7} (x) = (x - 0.5 + 0.5 i) (x - 0.5 - 0.5 i) {(x - 1)}^{2} (x + 1) (x - 2) (x - 2.01)$ . Obviously, the repeated roots originating from the factor ${(x - 1)}^{2}$ and the very close magnitude roots from $(x - 2) (x - 2.01)$ may cause computational problems. Figure 4 shows the computed roots and corresponding polynomial evaluations to the 10 decimal places. For the five distinct zeros, the present algorithm works virtually perfectly well and definitely better than the Jenkins-Traub algorithm. Only the coincident roots, $x_{4}$ and $x_{5}$ , resulting from ${(x - 1)}^{2} = 0$ could not be obtained with the accuracy of $ϵ = 10^{- 14}$ despite the allowed maximum 10,000 iterations, simply because the code cannot decide which one of the roots to converge as both are the same. Note however that the close magnitude roots $x_{6} = 2.01$ and $x_{7} = 2$ are computed exactly to 10 decimal places of accuracy. Iteration numbers for computational sequences, from 7^th down to 2^nd degree are 58, 135, 108, 10,000, and 54, summing up to 10,355 sweeps altogether. The greatest portion of iterations, counting to 10,000 due to the repeated roots, could be reduced down to 5,198 by setting $ϵ = 10^{- 8}$ without appreciably affecting the accuracy of the computed roots. Nevertheless, it is clear that repeated roots are the weakest spot of the present scheme and its source may be traced back to the paragraph between Equations (5) and (6).

Figure 4. Screenshot of code execution for P₇(x) = x⁷ − 6.01x⁶ + 12.54x⁵ − 8.545x⁴ − 5.505x³ + 12.545x² − 8.035x + 2.01.

4.5. P₉(x) = (−2 + i)x⁹ + (1 + i)x⁸ + (3 − 2i)x⁷ + (5 + 0i)x⁶ + (−4 + 3i)x⁵ + (7 + 7i)x⁴ + (6 + 0i)x³ + (−3 + 0i)x² + (2 + 2i)x + (10 + 10i)

A ninth-degree polynomial with arbitrarily selected real and complex coefficients is the last example treated. As in §4.3 the roots shown in Figure 5 are all complex but none complex conjugate because the polynomial has complex coefficients. The results are highly accurate as all the functional evaluations are zero to the 10 decimal places. Number of sweeps for sequences from 9^th to 2^nd degree are 369, 314, 875, 133, 108, 440, and 95, adding up to 2334 sweeps for the entire computations.

Figure 5. Screenshot of code execution for P₉(x) = (−2 + i)x⁹ + (1 + i)x⁸ + (3 − 2i)x⁷ + (5 + 0i)x⁶ + (−4 + 3i)x⁵ + (7 + 7i)x⁴ + (6 + 0i)x³ + (−3 + 0i)x² + (2 + 2i)x + (10 + 10i).

4.6. Some Special Cases

The most critical special case that could make the present scheme fail has been identified as $a_{1} = 0$ , and this pitfall is avoided by a shift that sets the corresponding term in the transformed polynomial to a constant. This preconditioning measure avoids quite a number of problems associated with a variety of special cases. In this section, we are going to enumerate some noteworthy special cases for which, despite unusual characteristics of polynomial, the code still performs well.

Polynomials like $a_{0} x^{n} + a_{n} = 0$ may at first sight appear problematic since all the coefficients but the fist and the last, are zero. However, the scheme, in virtue of a transformation to set the coefficient of the second term to a constant, handles all these cases accurately. For instance, for $ϵ = 10^{- 14}$ with 90 iterations, roots of $x^{3} + 1 = 0$ are computed as x₁ = −1.0000000000 + 0.0000000000, x₂ = 0.5000000000 − 0.8660254038i, x₃ = 0.5000000000 + 0.8660254038i, which may be regarded exact. Likewise, it takes 157 + 96 = 253 iterations to compute all the roots of $x^{4} + 1 = 0$ as x₁ = −0.7071067812 − 0.7071067812i, x₂ = −0.7071067812 + 0.7071067812i, x₃ = 0.7071067812 − 0.7071067812i, and x₄ = 0.7071067812 + 0.7071067812i. Higher degree cases $x^{5} + 1 = 0$ , $x^{6} + 1 = 0$ , which are tested to the 10^th degree, are solved with the same ease and accuracy without problem.

For the present formulation, as observed in §4.4 of the Jenkins-Traub test, repeated or coincident roots of the form ${(x + c)}^{n} = 0$ poses a potential problem since the procedure is not able to decide for a root to converge. Unsurprisingly, for ${(x + 1)}^{3} = x^{3} + 3 x^{2} + 3 x + 1 = 0$ the code produces just barely acceptable results x₁ = −1.0001999401 − 0.0000999701i, x₂ = −0.9998500497 − 0.0000499752i, $x_{3} = - 0.9999500102 + 0.0001499453 i$ after the maximum allowed 10,000 iterations. For ${(x + 1)}^{4} = x^{4} + 4 x^{3} + 6 x^{2} + 4 x + 1 = 0$ we obtain similar results after 10,000 + 10,000 = 20,000 iterations. For higher degrees we get acceptable results but clearly the repeated zeros are not easily handled. A remedy to this weakest spot should be implemented in the professional version of the code in an appropriate way.

Quite trivial case of $a_{0} x^{n} = 0$ is problematic; again due to repeated roots, but can still be computed. For instance, $x^{3} = 0$ is solved after the maximum allowed 10,000 iterations as x₁ = −0.0001999412 − 0.0000999698i, x₂ = 0.0001499502 − 0.0000499782i, and $x_{3} = 0.0000499911 + 0.0001499480 i$ . The fourth-degree case $x^{4} = 0$ produces similar near zero values after 10,000 + 10,000 = 20,000 iterations. Nevertheless, relatively low performance of the method for this most primitive form, $a_{0} x^{n} = 0$ , does not imply a serious defect at all since the solution is trivial.

The above screening of main special cases clearly shows that the repeated roots are the only special case that impair the accuracy of computations for the proposed root-finding algorithm. No case has been identified with a complete failure; though, naturally, there may remain some overlooked exceptional cases. Overall however, the methodology works remarkably well and produces quite accurate solutions for polynomials with real or complex coefficients.

5. Concluding Remarks

A new method of computing all the roots of a polynomial of any degree is introduced. The scheme proceeds by reducing the degree of the given polynomial by one at each stage of successive polynomial substitutions till reaching a quadratic. A simple shift via Taylor series expansion is introduced to implement a preconditioning measure for avoiding failure of the scheme due to the vanishing of the second highest-degree term. Computations are performed in the complex domain for the sake of generality and several demonstrative examples are included. Except for repeated roots, the code works virtually perfectly well with high efficiency and accuracy without any observed failure. A professionally restructured form of the present generic algorithm in different programming languages is expected to be very useful for practical uses as its performance proves to be quite competitive even against the Jenkins-Traub algorithm, which is the standard root-finder.

Appendix: FORTRAN Code for Implementing Method of Successive Polynomial Substitutions: SPS

use msimsl

parameter(nmax=100,imax=10000,eps=1.e-14)

double complex a(0:nmax),b(1:nmax),c(0:nmax),x(1:nmax),xb(1:nmax)

double complex dlt,poly,tht

write(*,10)

10 format(3x,31HEnter the degree of polynomial:,/)

read(*,*)n

write(*,20)

20 format(3x,37HStarting from the highest degree term,

&38H enter the coefficients of polynomial:,/)

do i=0,n

write(*,30)i

30 format(3x,1Ha,i2)

read(*,*)a(i)

c(i)=a(i)

enddo

c Divide all coefficients by a(0)

do i=1,n

a(i)=a(i)/a(0)

enddo

a(0)=dcmplx(1.,0.)

c Preconditioning of polynomial by setting a(1) to (n,n/2)

tht=(dcmplx(1.,1./2.)-a(1)/n)/a(0)

c Recompute polynomial coefficients for making a(1)=(n,n/2)

do j=n,1,-1

a(j)=a(j)*dfac(n-j)

do k=j-1,0,-1

a(j)=a(j)+a(k)*(dfac(n-k)/dfac(j-k))*tht**(j-k)

enddo; enddo

c Coefficients of transformed polynomial

do j=n,1,-1

a(j)=a(j)/dfac(n-j)

enddo

c Main Program to solve roots of transformed polynomial

do 50 j=1,n-2

c A single substitution

b(j)=a(j)*a(j)-a(j+1)

do i=j+1,n-1

b(i)=(a(i)*a(j)-a(i+1))/b(j)

enddo

b(n)=a(j)*a(n)/b(j)

c Repeat till convergence

k=0

40 xb(n)=b(n)

k=k+1

b(j)=b(j+1)-a(j)

do i=j+1,n-1

b(i)=(b(i+1)-a(i))/b(j)

enddo

b(n)=-a(n)/b(j)

if(abs(xb(n)-b(n)).gt.eps.and.k.lt.imax) then

goto 40

else; goto 99; endif

99 x(j)=-a(j)+b(j+1)

do i=j+1,n

a(i)=b(i)

enddo

dlt=cdsqrt(a(j+1)*a(j+1)-4.*a(j+2))

x(j+1)=(-a(j+1)+dlt)/2.

x(j+2)=(-a(j+1)-dlt)/2.¹

50 continue

c Write the roots and corresponding values of the polynomial

write(*,100)

100 format(/,40x,4HRoot,30x,19HValue of Polynomial,/)

write(*,200)

200 format(8x,11HRoot Number,13x,4HReal,10x,9HImaginary,18x,4HReal,

&10x,9HImaginary,/)

do j=1,n

c Evaluate polynomial for roots

poly=c(n)

do i=n-1,0,-1

poly=poly+c(i)*(x(j)+tht)**float(n-i)

enddo

write(*,300)j,x(j)+tht,poly

300 format(12x,1Hx,i2,8x,2f16.10,9x,2f16.10)

enddo

write(*,400)

400 format(/)

stop

end

NOTES

¹These three lines may be moved outside the do-loop below 50 continue after setting j = n − 1 to avoid unnecessary computations.

Conflicts of Interest

The author declares no conflicts of interest regarding the publication of this paper.

References

[1]	Chapra, S.C. and Canale, R.P. (2005) Numerical Methods for Engineers. McGraw-Hill Education.
[2]	Weierstrass, K. (1891) Neuer Beweis des Satzes, dass jede ganze rationale Function einer Veranderlichen dargestellt werden kann als ein Product aus linearen Functionen derselben Veranderlichen. Sitzungsberichte der Königlich Preussischen Akademie der Wissenschaften zu Berlin. https://web.archive.org/web/20131102093616/ http://bibliothek.bbaw.de/bibliothek-digital/digitalequellen/schriften/anzeige?band=10-sitz%2F1891-2&seite%3Aint=00000565
[3]	Durand, E. (1960) Equations du type F(x)=0: Racines d’un polynome. In: Solutions Numeriques des Equations Algebriques, Vol. 1.
[4]	Kerner, I.O. (1966) Ein Gesamtschrittverfahren zur Berechnung der Nullstellen von Polynomen. Numerische Mathematik, 8, 290-294.[CrossRef]
[5]	Traub, J.F. (1966) A Class of Globally Convergent Iteration Functions for the Solution of Polynomial Equations. Mathematics of Computation, 20, 113-138.[CrossRef]
[6]	Jenkins, M.A. and Traub, J.F. (1970) A Three-Stage Variable-Shift Iteration for Polynomial Zeros and Its Relation to Generalized Rayleigh Iteration. Numerische Mathematik, 14, 252-263.[CrossRef]
[7]	Jenkins, M.A. and Traub, J.F. (1972) Algorithm 419: Zeros of a Complex Polynomial. Communications of the ACM, 15, 97-99.[CrossRef]
[8]	Jenkins, M.A. and Traub, J.F. (1970) A Three-Stage Algorithm for Real Polynomials Using Quadratic Iteration. SIAM Journal on Numerical Analysis, 7, 545-566.[CrossRef]
[9]	Jenkins, M.A. (1975) Algorithm 493: Zeros of a Real Polynomial. ACM Transactions on Mathematical Software, 1, 178-189.[CrossRef]
[10]	Beji, S. (1992) Investigations on Cubic Polynomials. International Journal of Mathematical Education in Science and Technology, 23, 167-173.[CrossRef]
[11]	Tschirnhaus, E.W. (2003) A Method for Removing All Intermediate Terms from a Given Equation. ACM SIGSAM Bulletin, 37, 204-207. https://sigsam.org/bulletin/issues/
[12]	Beji, S. (2008) A Systematic Approach to the Exact Roots of Polynomials. Mediterranean Journal of Mathematics, 5, 163-172.[CrossRef]

	[email protected]
	+86 18163351462 (WhatsApp)
	1655362766
	SCIRP WeChat

Journals Menu

Home

About SCIRP

Service

Policies