Solving Perturbed Kepler Problem by Quaternion Algebra

Abstract

In this article, we use quaternions together with Kustaanheimo-Stiefel transformation to solve the Kepler problem, first its unperturbed version and then when adding a small perturbing force. We assume that the perturbing force is autonomous, which enables us to derive a set of first-order differential equations for the corresponding orbital elements, while keeping them fully autonomous as well. This is achieved without compromising the resulting accuracy; there is no need for averaging out fast oscillations or involving a fast-angle variable. As a consequence, the equations can be solved to arbitrary accuracy by iterative and routine application of the new formulas.

Share and Cite:

Vrbik, J. (2026) Solving Perturbed Kepler Problem by Quaternion Algebra. Applied Mathematics, 17, 337-353. doi: 10.4236/am.2026.176021.

1. Quaternion Algebra

Quaternions are introduced as an extension of complex numbers where, instead of one imaginary unit i , there are three of them, denoted i, j and k ; we relate them to x , y and z directions of a three-dimensional space respectively. The square of each of these units equals to −1, while any two of them anti-commute, e.g. ij=ji etc.; furthermore ij=k , the small circle indicating quaternion multiplication (an associative operation) [1]. A quaternion is thus a quantity with four components, namely

A:=A+ a 1 i+ a 2 j+ a 3 k=A+a (1)

where A is its scalar part and a its vector part (we use the blackboard type for full quaternions and boldface for vectors). Adding two quaternions is a component-wise operation, while multiplication is carried out (consistently with the previous rules) by

( A+a )( B+b )=ABab+Ab+Ba+a×b (2)

where and × denote the dot and vector product, respectively; quaternion multiplication is associative (as stated already) but not commutative. The proof of both statements is routine; it is based on the following two identities

( a×b )c=a( b×c ) (3)

( a×b )×c=a×( b×c )

Quaternion conjugation is defined (and denoted) by

A ¯ :=Aa (4)

implying that a:= a a ¯ yields the length of vector a. Note that

AB ¯ = B ¯ A ¯ (5)

and that A A ¯ = A ¯ A yields the sum of squares of A ’s four components (we may then drop the symbol and use A A ¯ or A ¯ A to emphasize that the result is a scalar and the order irrelevant). Using the conjugate, we then build the inverse of A by

A 1 := A ¯ A A ¯ (6)

so that A A 1 = A 1 A=1 .

Finally A n , where n is a positive integer, implies A multiplied by itself n times (calling the result the n th power of A ); this means that we can now define a function of a quaternion, based on its Maclaurin expansion. The most useful of these is

exp( A )=exp( A )exp( a )=exp( A )exp( a a ^ )=exp( A )( cosa+ a ^ sina ) (7)

where a ^ := a a is the unit direction of a ; note that powers of a ^ follow the a ^ ,1, a ^ ,1, a ^ , cycle, due to aa= a 2 .

Rotation

Fixed rotation: Let be a quaternion of unit magnitude, i.e. :=R+r r ^ where R 2 + r 2 =1 (equivalent to ¯ =1 ). This further implies that can be uniquely written as

=exp( w 2 ) (8)

where w is a vector whose direction is r ^ and whose magnitude meets cos( w 2 )=R and sin( w 2 )=r . When x represents a point in a three-dimensional (3D) space (using an inertial frame for its coordinates), it is easy to show that

x R :=x ¯ =exp( w 2 )xexp( w 2 ) = x +( cosw+ w ^ sinw ) x = x + x cosw+ w ^ × x sinw (9)

where x R represents x rotated by angle w around the axis whose direction is w ^ . Here x :=( x w ^ ) w ^ is the component of x parallel to w ^ , while x :=x( x w ^ ) w ^ is perpendicular to it. Note that x commutes with w (and therefore with ) while x anti-commutes (implying that x ¯ = x ).

When x( p ) is a parametric description of a 3D object (such as an ellipse), the same operation similarly rotates the whole object, without changing its size or shape.

Uniform rotation: Changing to

=exp( w 2 s ) (10)

(where s is interpreted as time) makes the rotation time dependent; w then becomes the corresponding (constant) angular velocity.

Time-dependent rotation: Assuming that all components of are now arbitrary functions of s (yet preserving the ¯ =1 property), we now find the corresponding instantaneous angular velocity at time s ; it follows from

x R ( s )= x ¯ +x ¯ = ¯ x R ( s )+ x R ( s ) ¯ =2( ¯ )× x R ( s ) (11)

since ¯ = ¯ (showing that ¯ is a vector) and since abba=2a×b ; this implies that 2 ¯ is the resulting velocity.

Kepler frame: Letting the original coordinate frame itself rotate with , and expressing the vector of the instantaneous angular velocity of the rotation in this new (no longer inertial, but very convenient) Kepler frame, we get

Z:=2 ¯ ( ¯ )=2 ¯ (12)

Euler angles: The usual way of parametrizing is done [2] by

=exp( k ϕ 2 )exp( i θ 2 )exp( k ψ 2 ) (13)

i.e. first rotating with respect to the z axis by angle ψ , then rotating with respect to the (original) x axis by angle θ , and finally rotating with respect to the z axis again by angle ϕ ; the three angles are referred to as Euler angles [3].

Using this form of and letting each of the three angles be a function of s , the angular velocity (12), i.e. its Kepler-frame representation, becomes

Z=i( θ cosψ+ ϕ sinψsinθ )+j( ϕ cosψsinθ θ sinψ )+k( ψ + ϕ cosθ ) (14)

We can then express each of the three time derivatives in terms of components of Z by solving the corresponding three equations, thus getting

ϕ = Z 1 sinψ+ Z 2 cosψ sinθ (15)

θ = Z 1 cosψ Z 2 sinψ

ψ = Z 3 ϕ cosθ

2. Kepler Problem

To find the motion of a satellite (of negligible mass) orbiting a primary of gravitational mass μ requires solving the following differential equation

r ¨ +μ r r 3 =εf (16)

where r is a vector with three components ( x , y and z ), each a function of time t , r is the length of r , the double dot implies taking a second derivative of each of the r components with respect to t , and εf is a perturbing force per unit mass of the satellite, assumed to be relatively small (i.e. ε μ r 2 ) [3]; the factor ε then appears in all quantities proportional to ε ; higher powers of ε are thus used to indicate the degree of smallness. When building a solution to (16), it will be important to distinguish between terms of the ε 0 type (solving the equation with no perturbing force), those proportional to ε , and finally terms proportional to second or higher power of ε (ultimately to be ignored, as explained shortly).

The article considers only autonomous perturbing forces, meaning that f is a function of r but not explicitly of time.

To utilize quaternion algebra for solving (16), we introduce a new dependent variable U (a quaternion) and a new independent variable s (a scalar) called modified time [4] and [5]. These are related to the old variables by

r:=Ui U ¯ (17a)

dt ds :=2r a μ (17b)

where a>0 is assumed (by subsequent proof) to be an arbitrary scalar function of s , and r= Ui U ¯ Ui U ¯ =U U ¯ . We also define the following scalar function of s

Γ:= U i U ¯ Ui U ¯ (18)

where   indicates differentiating with respect to s . Note that Γ is thus twice the scalar part of U i U ¯ (as an alternate definition).

One can then show that, using the new variables, (16) reads

2 U 2 U U ¯ 4a r U a a ( U Γ 2r Ui )+ 2Γ r U i+ ( Γ r ) Ui =4r a μ εfUi (19)

Proof:

r ˙ = 2 U i U ¯ Γ 2r a μ = 2Ui U ¯ +Γ 2r a μ (20)

due to (17b). Post-multiplying by 2 a μ Ui results in

2 a μ r ˙ Ui=2 U Γ r Ui (21)

Differentiating each side with respect to s yields

a a a μ r ˙ Ui+2 a μ r ˙ U i+4r a μ ( εfμ r r 3 )Ui =2 U ( Γ r ) Ui Γ r U i (22)

which can be further simplified (note that rUi= rUi U ¯ U r =rU ) to get

a a ( U + Γ 2r Ui ) 2 r U ¯ U U Γ r U i+4r a μ εfUi+ 4a r U =2 U ( Γ r ) Ui Γ r U i (23)

which agrees with (19).

Note that post-multiplying U by exp( iδ ) , where δ is any scalar function of s , does not change the resulting r and r ; this implies that Uexp( iδ ) must solve (19) whenever U does (as can be explicitly verified). Nevertheless, the new solution does change the value of Γ to

Γ+Ui δ i U ¯ Ui( i δ ) U ¯ =Γ2r δ (24)

This implies that we can always find a solution which meets Γ=0 at all values of s by a proper choice of δ (referred to as gauge). Having done that, (19) is thus reduced to

2 U 2 U U ¯ 4a r U a a U =4r a μ εfUi (25)

while meeting Γ=0 . Since (25) and the last condition imply that Γ is also identically equal to 0 (just post-multiply each term of (25) by i U ¯ to show that 2 U i U ¯ has zero scalar part), it is then sufficient to make Γ equal to 0 initially; solving (25) then assures that Γ remains constant, thus maintaining the Γ=0 condition automatically at any future time s .

Unperturbed solution: When f=0 , we choose a to be a positive constant and, assuming the solution to start with Γ=0 , the equation to solve is then

U = U U ¯ 2a r U (26)

where

U U ¯ 2a r :=E= 2a μ ( r ˙ r ˙ 2 μ r ) (27)

is a constant of motion, proportional to the total (kinetic and gravitational) energy of the satellite [3].

Proof: Differentiating the LHS of (27) yields

U U ¯ + U U ¯ r E r r = EU U ¯ + U E U ¯ r E r ( U U ¯ +U U ¯ )=0 (28)

due to (26) and confirming that E' is identically zero. Furthermore

r ˙ r ˙ = U i U ¯ r a μ Ui U ¯ r a μ = μ a U U ¯ r (29)

verifies the rest.

Assuming that E is negative (to make the satellite orbit the primary), we may now choose a so that E=1 . The equation to solve is then

U +U=O (30)

( O denoting a zero quaternion) while

a= U U ¯ +r 2 (31)

Since there is no normal (perpendicular to the plane defined by the initial r and r ˙ directions) force, it is obvious that the motion, and the corresponding solution to (16), must be planar. It is thus sufficient to start with a solution that keeps r in the x-y plane; this solution then can be transformed (by applying an =exp( i θ 2 )exp( k ϕ 2 ) fixed rotation to it) to any other specific attitude (the transformation preserves the Γ=0 condition and the value of E ).

To construct the most general solution to (30) [6], we use the following notation

q:=exp( k( sσ ) ) (32)

and

z:= q 2 =exp( 2k( sσ ) ) (33)

where σ is a constant. From now on, quantities of this type (i.e. having only scalar and k components) are called complex (with k representing the purely-imaginary unit) and can be multiplied using rules of complex algebra (no need for the operation, unless one of the factors is a quaternion); their complex nature is emphasized by using a different font.

The most general form of U that keeps (17a) in the x - y plane and meets (30) is then

U=exp( k ψ 2 )( Aq+B q 1 ) (34)

with A,B,σ and ψ being real constants; this implies that

r= A 2 + B 2 +AB( z+ z 1 ) (35)

U'=kexpkψ2Aq-Bq-1(36)

and

U'Ū'=A2+B2-ABz+z-1(37)

further implying, due to (31), that

a= A 2 + B 2 (38)

It is easy to see that U'iU¯ then has only i and j components, implying that Γ=0 and confirming our original assumption.

The fully general solution is then

U=( Aq+B q 1 ) (39)

where is an arbitrary rotation. This implies that

r= ( Aq+B q 1 ) 2 i ¯ =( A 2 z+ B 2 z 1 +2AB )i ¯ =( ( ( A 2 + B 2 )cos( 2s2σ )+2AB )i+( A 2 B 2 )sin( 2s2σ )j ) ¯ (40)

We already know that A 2 + B 2 =a ; introducing e:= 2AB A 2 + B 2 , the same solution can be written as

r=a( ( cos( 2s2σ )+e )i+ 1 e 2 sin( 2s2σ )j ) ¯ (41)

which is easily recognized as an equation of an ellipse with semi-major axis equal to a and eccentricity equal to e . The remaining parameters (called orbital elements) of the general solution are the three Euler angles of the ellipse’s attitude and σ , the value of s at apocenter ( r with the largest magnitude).

Finally, since

r=a( 1+ecos( 2s2σ ) ) (42)

we get, based on (17b),

t= 2 a 3/2 μ ( 2s2σ+ecos( 2s2σ ) ) (43)

relating real time t to 2s2σ ; the latter being a modified version (after subtracting π ) of so called eccentric anomaly.

3. Perturbed Equation

When εf is nonzero, finding a solution becomes substantially more challenging; we can achieve it only to ε accuracy (when terms proportional to higher powers of ε are ignored, as done from now on). Nevertheless, once such first-order solution is found, we can use the same formulas to build a ε 2 -accurate (and higher) solution, by iterating.

To find ε -accurate solution [6], we must abandon the Γ=0 gauge and return to solving (19) directly; to make the solution unique, a different gauge will offer itself in the process. We must also allow all six orbital elements of the unperturbed solution

U= a 1+ β 2 ( q+β q 1 ):= U 0 (44)

where β:= B A (a more convenient parametrization) to be slowly varying (their derivatives proportional to ε ) functions of s (this implies that their second derivatives are ε 2 -proportional and therefore ignored). The 0 subscript of U 0 indicates that (i) the corresponding quantity is in Kepler’s frame and (ii) it is evaluated to ε 0 (i.e. unperturbed-solution) accuracy. Note that now

q'=kq(1-εσ')(45)

z'=2kz(1-εσ')

(recall that ε multiplies all ε 1 -small quantities, while ε 2 -small and higher order terms are discarded).

Finally, U 0 itself needs to be extended to (and replaced by)

U p := U 0 +ε U D +ε U S i:= a 1+ β 2 ( q+β q 1 +qεD( z )+q εS( z )+kεb 1+βz i ) (46)

where D( z ) is a Laurent-series function of z [7], with missing D 1 z 1 and D 0 z 0 terms, i.e.

D( z ):=+ D 4 z 4 + D 3 z 3 + D 2 z 2 + D 1 z+ D 2 z 2 + D 3 z 3 + D 4 z 4 + (47)

where the D j coefficients are (yet to be solved for) complex quantities.

Similarly, S( z ) is another such complex function which must furthermore meet the S( z )= S( z ) ¯ condition (our new gauge), and is missing S 1 z (consequently S 1 z 1 ) and S 0 terms. The latter one is actually needed to complete our solution, but is kept separate from S (we have denoted it kb , where b is real). Note that U 0 , U D and U S are thus complex quantities as well. The solution we hope to construct should then results in unique formulas for D j , S j , b and for the s derivatives of six orbital elements; we now proceed to find these by substituting the proposed solution (namely U p ) into (19), and matching its two sides.

Once done, (17a) would then convert the solution to r (as a function of s ), while (17b) then relates s to real time t ; this is a routine final step of the procedure.

3.1. Further Simplification

We start by simplifying the solution’s first two s derivatives, getting

( U p ) = U p + U p  =( U p + εZ 2 U 0 ) (48)

and

RUp''=RUp''+εZU0'(49)

We then simplify the resulting Γ= U p i U p ¯ U p i U p ¯ , which turns out to be nonzero and ε -small. More specifically

Γ= U p i U p ¯ U p i U p ¯ +ε Z r 0 + r 0 Z 2 =ε( U 0 U S ¯ U 0 U S ¯ U S U 0 ¯ + U S U 0 ¯ Z 1 +k Z 2 2 U 0 ¯ 2 Z 1 k Z 2 2 U 0 2 ) (50)

where r 0 := U 0 i U 0 ¯ , and a bar now implies complex conjugation. Note that contributions from the U D part of the solution have cancelled out, the individual terms are complex, but the final Γ is real.

Proof:

U p i U p ¯ =( U 0 +ε U D +ε U S i )i( U 0 ¯ +ε U D ¯ iε U S ¯ ) = U 0 i U 0 ¯ +ε U 0 U D i+ε U D U 0 i+ε U 0 U S ¯ ε U S U 0 ¯ (51)

Adding the corresponding complex conjugate removes the first three terms, as U0'iU0¯ contributes the unperturbed Γ (which equals to 0). and conjugating the next two term reverses their sign. The last two terms (plus their conjugates) then yield the first four terms of (50). Similarly

Z r 0 =( ( Z 1 +k Z 2 )i+k Z 3 ) U 0 i U 0 ¯ =( Z 1 +k Z 2 ) U 0 ¯ 2 +j Z 3 U 0 ¯ 2 (52)

Adding its conjugate, namely

( Z 1 k Z 2 ) U 0 2 Z 3 U 0 2 j=( Z 1 k Z 2 ) U 0 2 j Z 3 U 0 ¯ 2 (53)

then verifies the remaining terms of (50).

We are now ready to substitute U p into (19); matching the equation’s two sides then yields formulas for the s -derivatives of orbital elements and for the coefficients of D and S . The resulting set of first-order differential equations for orbital elements (still truly autonomous, implying no need for averaging, and unaffected by one-orbit oscillations) can then be integrated, yielding invaluable insights into their long-range behaviour. On the other hand, D and S and b components of the solution provide parallel and perpendicular (to Kepler frame) distortions, respectively, of a single orbit. Since no approximation is used to build an ε - accurate solution, the same procedure can be used (iteratively) to construct ε 2 -accurate (and higher-order) solutions, thus achieving arbitrary accuracy. The main advantage over similar techniques such as Lie-Deprit transformation [8] and Hamiltonian canonical perturbation theory [9] is in relative simplicity of our (yet to be derived) formulas. Admittedly, this derivation is rather involved (as seen shortly), but once found, the same formulas can be routinely applied to any autonomous perturbation and carried out to any order of ε accuracy. The only mathematical tool needed is already mentioned Laurent-series expansion of complex functions.

To find explicit formulas for all unknowns of the proposed solution, we first pre-multiply each term of (19) by ¯ ; this means that all quantities are transformed into the orbit’s Kepler frame, correspondingly simplifying the U p solution and its two s derivatives (48) and (49), while the RHS becomes

4 r 0 a μ ε f 0 U 0 i (54)

where r 0 and f 0 := ¯ f are evaluated using the unperturbed solution U 0 . We now proceed to de-couple the resulting equation by converting it into two complex equations (easier to deal with, as multiplication becomes commutative).

3.2. Building a Solution

To derive first of these equations, we start with the LHS of the Kepler-frame version of (19), namely (expressed to the ε 1 accuracy, and utilizing the fact that Γ, Γ and a are ε small)

2Up''+εZU0'-2Up'+εZ2U0U¯p'-U¯0εZ2-4arUp

 -εa'aU0'+2εΓr0U0'i+εΓr0'U0i(55)

Post-multiplying by U p ¯ and keeping complex components of each term only, we get (to the same ε 1 accuracy)

2U1''+kεZ3U0'U1¯-2U1'+kεZ32U0U1'¯-kεZ32U0¯+4a-εa'aU0'U0¯(56)

(where U 1 := U 0 +ε U D ), while the RHS becomes

4 r 0 a μ ( ε f 0 r 0 ) cx :=4ε r 0 ( 1+βz )Q( z ) (57)

(the cx subscript implies keeping only scalar and k components), and Q( z ) is correspondingly defined Laurent series. The Mathematica program of Figure 1 completes the task of converting the LHS (short of the a 1+ β 2 factor) into a function of D( z ) and of s derivatives of four orbital elements (highlighted); note that ε 0 terms have cancelled out, as expected. The code is reasonably self-explanatory, even to people not familiar with Mathematica.

Explicit formulas for the corresponding unknowns of the perturbed solution are then found (see Figure 2) by matching coefficients of all powers of z in (57). The program first solves for the D n coefficient of the D( z ) expansion while ignoring the a , β , σ , Z 3 terms (contributing to z 1 , z 0 and z 1 powers only). This requires simultaneously solving for D n , D n and their complex conjugates, while considering them to be algebraically independent (they must and do turn out to be consistent with each other). To build the first of these four equations, we collect (the first line of code) coefficients of z n which D( z ) contributes to the LHS of Figure 1, e.g. D( z ) ¯ contributes D n , 2z2(1+βz)D''(z) contributes 2n( n1 ) D n +2β( n1 )( n2 ) D n1 , etc., and subtract the coefficient of z n after expanding the RHS of (57); the second equation is just the nn counterpart of the first one, further including complex conjugates of both. The four equations are then expanded in powers of β and solved in an iterative manner; a careful choice of the expression multiplying Q( z ) on the RHS of (57) has made the iterative solution terminate upon reaching β 2 -proportional terms (see the first part of the program, ending with a highlighted D n solution). Note that the formula does not solve for D 1 and D 0 , implying that

D( z ):= n= 2 D n z n + n=1 D n z n (58)

Figure 1. (56) further simplified.

Figure 2. Solving (56)=(57).

This makes D( z ) match coefficients of all powers of z , with the exception of z 1 , z 0 and z 1 ; these are then used to solve for a , β , σ and Z 3 ; this is further simplified by solving, separately (yet another helpful de-coupling) the real (yielding Z 3 and σ ) and purely imaginary (yielding a and β ) parts of the corresponding three equations. Note that in both cases the equations are linearly dependent, thus allowing for a unique solution of each pair of unknowns (highlighted at the end of Figure 3).

To find S( z ) , Z 1 , Z 2 and b , we now keep only the i and j terms of (55), thus getting

2US''i+ε(Z1i+Z2j)U0'+2USi+2εΓr0U0'i+εΓr0'U0i

Post-multiplying by i U 0 ¯ results in

Figure 3. LHS of (59).

                           -2εUS''+USU0¯-2εZ1+kZ2U0'U0¯

-2εΓr0U0'U0¯-εΓr0'r0=4kr02aμ(εf0)3:=4εr0W(z)(59)

where W( z ) ¯ =W( z ) (note that each term of the equation has this property). We then proceed to simplify the LHS (short of a a 1+ β 2 factor) of (59) by Mathematica program of Figure 4.

And then to find S n (now much easier, as only one equation needs to be solved), followed by deriving formulas for Z 1 , Z 2 and b (by matching coefficients of z 0 and z 1 ); this is done in Figure 5.

We now have a complete set of formulas needed to find (based on the perturbing force) all ingredients of our solution. Using this ( ε -accurate) solution, we can then evaluate both sides of (19) to the ε 2 accuracy and use the same formulas to construct ε 2 -accurate solution, and so on. Post-multiplying the final solution by exp( iδs ) , where δ is a properly chosen constant, we can then impose the Γ=0 gauge, if desired.

3.3. Oblateness Example

To illustrate how to apply these formulas, we consider the perturbing force experienced by an artificial satellites due to Earth’s oblateness, given by (using Kepler’s frame)

Figure 4. Solving (59).

Figure 5. Orbital-element derivatives under oblateness perturbations (Part 1).

f 0 := εμ R 2 r 0 5 ( 3 2 r 0 15 2 ( r 0 u 0 ) 2 r 0 r 0 2 +3( r 0 u 0 ) u 0 ) (60)

where ε is the second zonal-harmonic coefficient, R is the Earth’s equatorial radius, and u (equal to k , by our choice of coordinates) is the unit direction of the Earth’s axis. To convert u to Kepler’s frame, we post-multiply it by of (13) and pre-multiply by ¯ , thus getting

u 0 =isinθsinψ+jsinθcosψ+kcosθ (61)

The key quantities needed to build the corresponding perturbed solution are

Q( z )= εa R 2 r 0 5 ( 1+βz ) ( 3 2 r 0 2 9 2 ( r 0 u 0 ) 2 3k( r 0 u 0 ) ( u 0 × r 0 ) 3 ) (62)

and

W( z )= εa R 2 r 0 6 3k( r 0 u 0 )cosθ (63)

Converting these into derivatives of the orbital elements is done by Mathematical programs of Figure 5 and Figure 6; while both a and β turn out to be equal to zero, further application of (15) results in the following classical formulas

ϕ = 2ε a 2 ( 1+ β 2 ) 4 ( 1 β 2 ) 4 cosθ (64)

θ =0

ψ = ε a 2 ( 1+ β 2 ) 4 ( 1 β 2 ) 4 ( 5 cos 2 θ1 )

Figure 6. Orbital-element derivatives under oblateness perturbation (Part 2).

which translate into a slow, uniform precession of the orbital plane around the Earth’s axis (with angular speed of ϕ ) and a similar rotation of the orbit’s perigee within the orbital plane with angular speed of ψ , while keeping the inclination angle fixed. Note that the perigee rotation ceases when θ=arcsin 2 5 , called the critical angle; the ε 2 -accurate solutions reveals that, instead of fully stopping, the rotation turns into a libration around the critical angle [10] and [11], but pursuing this goes beyond the scope of this article (consult the two references for further details). The first of these also derives expressions for resulting distortions of the Kepler’s frame r ; we do not quote them here due to their complexity.

4. Conclusions

In this final section we want to mention a few potential extensions of the new technique (excluded from our basic presentation). Firstly, the technique is easily capable of dealing with more than one autonomous force at a time and (at the same time) reaching an arbitrarily accurate solution by iterating; this simply requires attaching a different small parameter ( ε 1 , ε 2 , ) to each individual perturbing force and applying existing formulas to individual terms of the corresponding expansion. A more difficult task (even in a single-force situation) is to establish the largest value of ε which forces the iteration process to converge; this remains an open question. But luckily, in typical applications, perturbing forces are small enough to guarantee fast convergence.

The most obvious modification of our formulas is clearly needed when the perturbing force is no longer autonomous; this has been done in [12]. The solution can no longer avoid introducing fast (i.e. one orbit) oscillations into the resulting differential equations for orbital elements, thus making it more difficult to maintain high accuracy when exploring their long-term (millions of years) behavior. Nevertheless, in these cases, getting the correct qualitative picture is often the main goal, amply met by the extended formulas.

When all parameters of the perturbing force are numerical, it becomes beneficial (when constructing higher-order solution) to modify the iteration process not just to find the next-order solution, but also to update the existing one (till no changes are observed); this has been demonstrated in [13] and offers a way of properly dealing with small divisors.

The final challenge to solving a perturbed Kepler problem is due to so called resonances, happening when a ratio of the satellite’s (used in a generic sense) orbital period to the period of a (cyclic) perturbing force becomes a simple fraction (such as 2:1, 3:2 etc.); the new technique is well equipped to elucidate main features of the corresponding solution, as corroborated by [14] and several related articles.

The new technique has thus a large range of applications (well beyond what we could cover in this article) due to its ability to resolve several potential issues adversely affecting many traditional methods of solution. Yet, further advancement is still possible, but requires a solid background in the technique’s fundamentals; it is hoped that our article has provided it.

Conflicts of Interest

The author declares no conflicts of interest regarding the publication of this paper.

References

[1] Serre, J.P. (1973) A Course in Arithmetic, Graduate Texts in Mathematics. Springer.
[2] Chelnokov, Y.N. (2022) Quaternion Methods and Models of Regular Celestial Mechanics and Astrodynamics. Applied Mathematics and Mechanics, 43, 21-80.[CrossRef]
[3] Goldstein, H. (1980) Classical Mechanics. 2nd Edition, Addison-Wesley.
[4] Kustaanheimo, P., Schinzel, A., Davenport, H. and Stiefel, E. (1965) Perturbation Theory of Kepler Motion Based on Spinor Regularization. Journal für die reine und angewandte Mathematik, 1965, 204-219.[CrossRef]
[5] Stiefel, E.L. and Scheifele, G. (1971) Linear and Regular Celestial Mechanics. Springer-Verlag.
[6] Vrbik, J. (2023) New Methods of Celestial Mechanics. Bantham Science Publishers.
[7] Rudin, W. (1987) Real and Complex Analysis. 3rd Edition, McGraw-Hill.
[8] Cary, J.R. (1981) Lie Transform Perturbation Theory for Hamiltonian Systems. Physics Reports, 79, 129-159.[CrossRef]
[9] Boccaletti, D. and Pucacco, G. (1999) Theory of Orbits. Volume 2: Perturbative and Geometrical Methods. Springer-Verlag.
[10] Vrbik, J. (1997) Oblateness Perturbations to Fourth Order. Monthly Notices of the Royal Astronomical Society, 291, 65-70.[CrossRef]
[11] Vrbik, J. (2009) Second Erratum: Oblateness Perturbations to Fourth Order. Monthly Notices of the Royal Astronomical Society, 399, 1088.[CrossRef]
[12] Vrbik, J. (1995) Perturbed Kepler Problem in Quaternionic Form. Journal of Physics A: Mathematical and General, 28, 6245-6252.[CrossRef]
[13] Vrbik, J. (2001) Quaternionic Processor. Celestial Mechanics and Dynamical Astronomy, 80, 111-118.[CrossRef]
[14] Vrbik, J. (1996) Resonance Formation of Kirkwood Gaps and Asteroid Clusters. Journal of Physics A: Mathematical and General, 29, 3311-3316.[CrossRef]

Copyright © 2026 by authors and Scientific Research Publishing Inc.

Creative Commons License

This work and the related PDF file are licensed under a Creative Commons Attribution 4.0 International License.