Complex quaternion functions
The quaternions were discovered by William Rowan Hamilton in 1843[1]. He had long searched for an algebra that was to three dimensions what the complex numbers are to two dimensions. He sought to multiply and divide these sought after numbers for many years before, in a flash of insight, the solution came to him. The problem was that there are no such numbers in three dimensions, only in four dimensions.
A quaternion Q can be written as where From these, using associativity, it follows that
This was the first non-commutative algebra. In hindsight, non-commutativity is to be expected since rotations about the origin in three dimensions do not in general commute but in two dimensions they do. For Hamilton a,b, c, and d were all real. Define the norm as
It is easily verified that
Since N(Q) is always positive for a non-zero real quaternion, the inverse always exists. This makes the algebra a division algebra. Also, the norm of a product is the product of the norms. This makes the algebra a composition algebra[2].
The real quaternions can be used to do spatial rotations[3] , but not to do Lorentz transformations with a boost. But if a, b, c, and d are allowed to be complex, they can[4][5] . This is what motivated the study of functions of a complex quaternion or biquaternion, such as how they are to be computed, when they are defined, and what their multiplicities are.
Since there are non-zero complex quaternions with zero norm, the inverse does not always exist. So they are not a division algebra. But they almost are. And there are zero divisors, as evidenced by (1 + i I) (1 - i I) = 0. In a way, the need for complex quaternions is not surprising, since in special relativity the Minkowski invariant, which is the norm of a 4-vector, can be any real number, including zero (null rays).
A 4-vector is represented by the complex quaternion , which is called a Minkowski quaternion. Its scalar time-like component is real and its spatial vector component is pure imaginary. This is the convention used by P. A. M. Dirac,[6] which gives the metric . Choosing the time-like component imaginary and the spatial vector component real has also been done and gives the metric .
The basis quaternions I, J, and K can be represented in terms of the Pauli spin matrices as , , and , respectively[7], as one possibility. . These have the same multiplication table. The Pauli spin matrices are used in particle physics for Lorentz transformations of 2-spinors and can do Lorentz transformations of a 4-vector by representing it as a 2x2 matrix, which is obtained from its equivalent Minkowski quaternion by replacing I, J, and K by their Pauli spin matrix representations and adding. The scalar term is replaced by multiplying it by the 2x2 identity matrix. Working with complex quaternions is simpler and more transparent and intuitive than working with matrices. But there is a one-to-one correspondence between the two viewpoints.
The exponential function with a complex quaternion argument is used to generate finite Lorentz transformations and the square root function with a complex quaternion argument is used to express a Lorentz transformation either as a pure boost followed by a spatial rotation or vice versa.
Functions with a complex quaternion argument are treated first, followed by a discussion of their application to Lorentz transformations.
Exp, Sqrt, Log, Trig and Hyperbolic with Inverses
[edit]How to compute, when defined, multiplicities
A technique that will be used frequently in the following discussion is that given a power series in a complex quaternion X = a + b V where V V = -1, we can instead work with X = a + b I and, after evaluating the power series, replace I by V. Since V V is always a complex number, any quaternion whose vector part has non-zero norm can be put in this form. Let V = b I + c J + d K and suppose is non-zero. Then The factor in square brackets has norm -1. The square root of the norm can be pulled out and absorbed into b. A vector quaternion whose norm is -1 will be called basis-like.
A second technique that will be used frequently is to realize that any power series in X = a + b V sums to the form c + d V where c and d are complex numbers, provided that the power series converges.
Exponential function
[edit]The exponential function is well-defined by its power series, which converges over the entire domain, even for complex quaternions. Since the basic circular and hyperbolic functions cos, sin, cosh, sinh are linear combinations of exponential functions, they too are well-defined.
A simple expression for exp(X) is sought. Now exp(a + b I + c J + d K) = exp(a) exp(b I + c J + d K) since the complex number a commutes with the complex quaternions. The vector part V = b I + c J+ d K has the property that its square is the complex scalar -(b2 + c2 + d2), which is the negative of its norm. Let the complex s be the square root of -V V, or, equivalently, the square root of the norm of V.
Any power series in the vector part V can be expressed as a sum of an even power series in s and another even power series in s which multiplies the vector part V. If these two power series are those of common functions, then this gives a simple way to evaluate. For s unequal to zero,
For the case for which the norm is positive real, s can be chosen to be positive real and we can let V=s I without loss of generality and get De Moivre's formula:
The quaternion basis element I acts as a square root of -1.
For the case for which the norm of V is negative, we can take s to be a positive pure imaginary. We then get
For the case for which the vector part V=N, where N has zero norm,
Square root function
[edit]Usually there are four values for the square root of a complex quaternion, but there can be infinitely many or none. We consider the special cases first.
For the special case of the square root of a complex number, any multiple by a quaternion with square +1 is also a solution. This includes -1 and vector quaternions of norm -1 such as i I, i J, i (I+J) / √2 and infinitely many more possibilities.
For the special case of the square root of a complex quaternion with a non-zero vector component N having zero norm, the square root does not exist except for special values
For the special case of a vector quaternion with non-zero norm, we have Multiplying by -1, i I, or -i I give other values.
Having considered the special cases first, consider the complex quaternion a + b I with . Since a can be factored out we only need to consider X = 1 + A I. We find The two square roots in the equation need to be taken on the same branch with a branch cut from 0 to . This means their phases must be between -90° and 90°. Multiplying by -1, i I, -i I give other possible square roots.
Logarithm function
[edit]The logarithm function log X sometimes does not exist, and, when it does, is multi-valued. The logarithm of a complex number is particularly multi-valued. Consider log(1). Some possible values are 2m π i + 2n π I and (2m+1) π i + (2n+1) π I. For I, any vector quaternion of norm +1 may be substituted.
We are to find log(X) such that exp(log(X))=X. We first do the case for which both a and b are non-zero and for which the vector part has a non-zero norm so that it can be scaled to be basis-like with norm +1 and be represented by I. Define the complex number θ by Let α be a complex number. Then
Choose α so that the complex multiplier of a + b I on the right is one. Then a solution is Adding 2m π i + 2n &pi I or adding (2m+1) π i + (2n+1) &pi I also gives a solution.
Next find log(X) where X = a + N and N is a null non-zero vector quaternion. As easily verified Adding 2m π i also gives a solution.
The log of a quaternion that is a null vector quaternion N does not exist. The above equation diverges as
The case of b=0 was discussed in the first paragraph. Lastly, log 0 is undefined.
Circular and hyperbolic functions and their inverses
[edit]The sine and cosine trigonometric functions and the hyperbolic sine and hyperbolic cosine hyperbolic functions are defined in terms of the exp function. Their inverses can be expressed in terms of the log and sqrt functions.
Application to Lorentz Transformations
[edit]The exp and sqrt functions find application here. Using the exp function we find the complex quaternion representations of a pure boost and of a spatial rotation. The ability to perform Lorentz transformations using complex quaternions is not new.[8][6][4][5] It is much easier than working with matrices. The discussion here is simple and concise.
Any Lorentz transformation can be expressed as a pure spatial rotation followed by a pure boost or as a pure boost followed by a pure spatial rotation. Given the complex quaternion representing a Lorentz transformation, the square root function is used to find the complex quaternions representing either of these. All has been verified numerically.[9]
Minkowski quaternions
[edit]A Minkowski 4-vector can be space-like, null, or time-like according to whether its norm is negative, zero, or positive. Real non-zero quaternions always have a real positive norm. Real quaternions can represent spatial rotations but only the complex quaternions have the richness to represent a general Lorentz transformation. The Minkowski quaternion representing this 4-vector has the form and , where are real numbers and is a real vector quaternion of unit norm .
The norm of X is . This has the correct metric signature since there is only the time-like component associated with but there are three spatial components associated with .
A Lorentz transformation is a linear transformation which takes a Minkowski quaternion into another Minkowski quaternion having the same norm. Let q = a + b I + c J + d K be any complex quaternion of unit norm and define q = a* - b* I - c* J - d*K. Here * represents complex conjugation. We call q the conjugate transpose of q. The conjugate transpose of a Minkowski quaternion is itself. Any complex quaternion having this property is necessarily a Minkowski quaternion. Consider
The norm is preserved because the norm of a product is the product of the norms. We have the identities The conjugate transpose of a product is the product of the conjugate transposes in reverse order. Therefore X' equals its conjugate transpose and is a Minkowski quaternion. We will show up to a sign how q is determined by the Lorentz transformation.
Relation of complex quaternions to Pauli spin matrices
[edit]The complex quaternions are closely related to the Pauli spin matrices, which in the particle physics of spin massless fermions act on 2-spinors.[10] The basis quaternions can be represented by
The multiplication table is the same.
A Minkowski quaternion can be represented in terms of the Pauli matrices as Here is the 2x2 identity matrix.
These 2x2 matrices for I, J, and K can be replaced by 4x4 real anti-symmetric matrices by replacing in each imaginary matrix element by or by its transpose and by replacing 1 in each real matrix element by . That is why the term transpose was chosen for the operation
This representation by the Pauli spin matrices is not unique. One can change the signs of any two of them and preserve the multiplication table. So another possible representation is
Also, they can be cyclically permuted and preserve the multiplication table. More generally, any similarity transform of these 2x2 matrices preserves the multiplication table.
In the equations for the Lorentz transformations that will be given, the quaternion basis elements can be replaced by their 2x2 matrix representations.
Finite Lorentz transformations using the exp function
[edit]The complex quaternion function exp can be used to perform Lorentz boosts. A real non-zero quaternion always has positive norm. A 4-vector in special relativity has the form X = t + x i I + y i J + z i K, where t, x, y, and z are real. Its norm is the Minkowski invariant t2 - x2 - y2 - z2.
Let a T superscript denote a "transpose" operation taking I into -I, J into -J, and K into -K. The reason for the "transpose" name is that I, J, and K can be represented as 4x4 real anti-symmetric matrices and that is what the matrix transpose does. The transpose of a product is the product of the transposes in reverse order, as in matrix algebra.
Let q* denote complex conjugation of the quaternion q. This operation does not do anything to I, J, and K. This definition is again reasonable since I, J, K can be expressed as 4x4 anti-symmetric real matrices.
Let an overbar denote the complex conjugate transpose. If q = a + b I + c J + d K, then q = a* - b* I - c* J - d* K. Note that X = X for the Minkowski quaternion X. Let q have norm 1. Consider X' = q X q. If q has norm 1, then since the complex quaternions are a composition algebra, X' and X have the same norm. Also X' has the same form as X with real scalar part and pure imaginary spatial part and so is also a Minkowski quaternion.
Complex quaternions representing boosts and rotations
[edit]In spacetime algebra, a boost in the x direction is done by Note that and Let v be the velocity and define as usual in special relativity
From this we identify and and easily verify that this gives the usual formulas for the Lorentz boost
A spatial rotation about the x axis by angle θ is done by
These are verified by observing that these give the usual Lorentz transformations. We have the relations
Representing a Lorentz transformation as a rotation followed by a boost or vice versa
[edit]One question remains: Suppose q has norm one. What Lorentz transformation is it associated with? It always represents some Lorentz transformation since X' = q X q is a linear transformation of the Minkowski quaternion X into another Minkowski quaternion X' of the same norm.
Any proper Lorentz transformation can be represented either as a rotation followed by a boost or vice versa. Once either of these forms is obtained, we can easily read off what the boost and rotation are.
In the first case we have So
In the second case we have So
See also
[edit]- Biquaternion
- Biquaternion algebra
- Quaternion algebra
- Hypercomplex number
- Hypercomplex analysis
- Lorentz transformation
- Spacetime algebra
References
[edit]- ^ Stillwell, John (2010). Mathematics and Its History Third Edition. Undergraduate Texts in Mathematics. New York Dordrecht Heidelberg London: Springer. p. 422. ISBN 978-1-4419-6053-5.
- ^ Conway, John H; Smith, Derek A (2003). On Quaternions and Octonions. Boca Raton, Florida: CRC Press. ISBN 978-1-56881-134-5.
- ^ Kuipers, Jack B (1999). Quaternions and Rotation Sequences. pp. 127–138.
- ^ a b Shah, Alam M; Sabar, Bauk (June 2011). "Quaternion Lorentz Transformation". Physics Essays. 24 (2): 158–162.
- ^ a b Shah, Alam M; Sabar, Bauk (June 2011). "Quaternion Lorentz Transformation".
- ^ a b Dirac, P. A. M. (November 1945). "Application of Quaternions to Lorentz Transformations". Proceedings of the Royal Irish Academy. Section A: Mathematical and Physical Sciences. 50(1944/1945). Royal Irish Academy: 261–270.
- ^ Stillwell, John (2010). Mathematics and Its History Third Edition. Undergraduate Texts in Mathematics. New York Dordrecht Heidelberg London: Springer. p. 426. ISBN 978-1-4419-6053-5.
- ^ Synge, J. L. (1972). "Quaternions, Lorentz Transformations, and the Conway-Dirac-Eddington Matrices" (PDF).
- ^ Phillion, Donald W (March 10, 2025). "Quaternion Lorentz".
- ^ Zuber, Kai (2020). Neutrino Physics Third Edition. Series in High Energy Physics, Cosmology, and Gravitation. Boca Raton, Florida: CRC Press. ISBN 978-1-138-71889-0.