© 2000, 2007
Clyde M. Davenport
cmdaven@comcast.net
| Hypercomplex Algebra | 4-D Function Theory | Hypercomplex Analysis | Links | |
|
Introduction Below, we shall give a comprehensive sketch of the 4-D commutative hypercomplex
algebra (not quaternions) and its associated function theory and analysis.
The great advantage is a complete, classical 4-D function theory, something that is
impossible with quaternions and other noncommutative or nonassociative systems.
Fortunately, a wide audience should be able to
follow the discussion, because the commutative hypercomplex math is derived directly
from well-known, fundamental concepts, such as groups, rings, calculus,
complex variables, matrices, complex function theory, and vector analysis. For a
discussion of elementary group and ring theory, see any good introductory text on
abstract algebra, such as
Herstein, 1986. There will be no occasion for deep theorems and complicated proofs.
First, some background. It is not widely taught, nor do many people know, that Sir William Rowan Hamilton developed the quaternions (4-D numbers similar to those that we will discuss here, except noncommutative) in the 1830s specifically for field calculations (this was before vector analysis was developed). Scientists and engineers of the time vehemently resisted their use. Something in the mindset at the time simply could not accept the notion that there could be a "fourth dimension," especially if it was claimed to be time. They persisted with a primitive combination of component-by-component calculation and extensive use of geometry. By the late 1800s, their field calculations were accompanied by elaborate geometrical figures that resembled the 2 X 4 framing of a house. To allay their aversions, around 1880, J. W. Gibbs in America and O. W. Heaviside in Britain reformulated
quaternion analysis so that all expressions would be constrained to three dimensions
or less. For example, in the cross product of two three-dimensional vectors they
arbitrarily set i×i=j×j=k×k=i×j×k=0
so that the result would come out as another 3-D vector. The quaternion product of
two three-dimensional vectors is ab= -a·b+a×b, which has a
scalar part and a 3-D vector part (i.e., is 4-D). Therefore, Gibbs and Heaviside
avoided the quaternion product notation, and used only the dot and [modified] cross
product components in what they cleverly renamed as vector analysis.
Scientists and engineers accepted this subterfuge because it
met their prejudice about 3-D being inviolate and it did not have the word
"quaternion" mentioned anywhere. Nevertheless, vector analysis is a form of
quaternion analysis [Crowe,
1967]. Hamilton developed the quaternion algebra by trial and error.
Apparently, he had a
prejudice of his own: that every nonzero element (i.e., having at least one nonzero
component) should have a multiplicative inverse. By adopting this view, he was led
directly to quaternions, because we now know that the quaternions make up the only 4-D
division algebra. What he didn't realize was that the quaternions form a group
ring [i.e., the 1,i,j,k elements and their negatives form a group of order
eight (the quaternion group, of course), and elements of the form
1x+iy+jz+kw, with
x,y,z,w real, form a ring]. He didn't realize it because the notions of group
and ring hadn't been developed at that time. We now know that there are exactly
five distinct groups of order eight upon which group rings of 4-D elements may be
constructed. The fact that we exclusively use the quaternion case (vector analysis)
in science and engineering apparently stems from the fact that it was discovered first
and the others were not examined for potential application when they were eventually
uncovered. For a timeline on the development of quaternion analysis, see Jeff
Biggus' quaternion
history page.
Group algebras, including those mentioned here, were first
studied and described
over one hundred years ago [Peirce, 1881],
[Study, 1889]. No less than
Dedekind published a paper [Dedekind, 1885]
describing algebras that are direct sums of copies of the complex field, including
the commutative hypercomplex algebra that will be described below. Accordingly, I
do not claim original discovery of the commutative hypercomplex algebra,
but do
claim origination of certain of its representations, interpretations, and the
formulation of the function theory and analysis that will be constructed upon it, below.
Commutative Hypercomplex Algebra Basis Group In order to keep this manageable for an Internet reader, I
will merely sketch the
line of reasoning and the main results. For convenience, I will use
We will be aiming our formulation at physics applications, so
we will use the
notation Z=1x+iy+jz+kct,
with x,y,z,ct real, for an element of the algebra
We start by establishing a group upon the basis elements
1,i,j,k. It
must be Abelian because we ultimately want multiplication of elements of
The group identity element is 1. The second line
indicates that every
element has a multiplicative inverse. Associativity is immediately proved if we
can find a real matrix representation, which we shall do below, hence we have a
group. The fact that the group is Abelian assures that the ring that we will
construct upon it will be commutative. With only a little manipulation, one may
verify that the group is C2 X C4,
where Cn is the cyclic group of order n. [ASIDE: Thanks
to Peter Jack, who pointed out that it is not the same as the dihedral group
of symmetries of the square.] [NOTE: There are two other eighth-order commutative
groups but neither has an element of cyclic order 4, which is necessary for
complex-like behavior.]
A matrix representation of the basis elements will prove to
be very useful, but
it is not intuitively obvious how to construct the same. I happened upon the
following while constructing 4-D Cauchy-Riemann conditions by trial and error:
The fact that this is a faithful representation may be verified by simple matrix multiplication and comparison with the multiplication table given earlier. These matrices are orthogonal, with determinant +1. Considering them as unit rotation operators, the application of all four in any order (e.g., 1kji; they are commutative) brings the rotated object back to its original position: 1ijk=1. The basis elements also have a
2 X 2 complex matrix representation; it is:
These are the commutative counterparts of the Pauli spin
matrices of
physics. If one were to recast quantum mechanics using commutative hypercomplex
mathematics (and I have no doubt that it could be done), then these matrices would
play an important role.
Commutative Ring We now have everything that we need to establish a ring over the basis group. The ring elements have the form Z= 1x+iy+jz+kct. Addition and subtraction are performed term-by-term, the same as for vectors. Exactly as for the complex variable case, multiplication of two elements is done by multiplying each term of the second element by every term of the first element, with reduction of the 1,i,j,k cross products by use of the group multiplication table, followed by collection of like terms. The result is:
Multiplication of 4-D elements is commutative because multiplication is commutative in the basis group. Next, we need a definition of multiplicative inverse, or division by, elements of the form Z=1x+iy+jz+kct. It is not intuitively obvious what form it might take, and that might be a serious problem, except that we can construct a matrix representation of the element, then take the inverse of that. We do so as follows: We have a matrix representation of the basis elements 1,i,j,k. We substitute the matrices into the Z=1x+iy+jz+kct form and perform simple matrix addition to telescope the element into the form of a single matrix:
which has the usual matrix inverse, itself expandable into
the vector form. It
is remarkable that the matrix inverse of the typical matrix element of
The reader may verify that, in the vector form, if one multiplies Z by Z-1 (or vice-versa), one obtains a result of unity. As I mentioned earlier: Not intuitively obvious. All of the other conditions for a ring are satisfied, as the reader may easily verify. It is a commutative ring with unity, and fails to be a field only because of the following: The denominator in the vector form inverse is the determinant of the 4-D element in matrix form:
which is zero under the conditions (x=ct,y=-z) or
(x=-ct,y=z.); therefore the ring is not defined under those conditions.
This is probably the point at which Hamilton discarded this particular algebra on
his way ultimately to quaternions. Many readers, upon learning that there are
zero divisors, dismiss this algebra, this simple group ring, as if it is somehow
invalid. However, the zero divisors are not scattered at random in the 4-D space.
They lie in two orthogonal, 4-D hyperplanes (see below) and cause something similar to
analytic branch cuts in classical complex-valued functions. Indeed, the 4-D
quotient is an ordinary analytic function: f(Z)=Z-1, and the fact that it has planar singularities should
surprise no one. We don't dismiss and discard the whole body of complex analytical
functions, for example, because some exhibit such untidy features as essential
singularities and analytic branch cuts.
The fact of zero divisors is not a problem, here; rather,
it will prove to be very useful. To explain, we will need yet another
representation of the algebra. Having a ring, we are justified in rearranging
the typical element into the form
This awkward-looking expression reveals some remarkable properties. If we use the notation
then we have
Consequently, the ring operations can be written as follows:
We shall call this the canonical form of the algebraic
notation because of its
fundamental simplicity. We have decomposed the algebra into two copies of the
classical complex field, just as Dedekind wrote in 1885. However, these are not
just any two copies of the complex plane. They have orientation with respect to the
x,y,z,ct coordinate frame: The reader may verify that although they are each
defined "everywhere" and for all times (each is a function of x,y,z,ct), they
share only the point (0,0,0,0) in common! They are, in fact, mutually orthogonal in
four dimensions; i.e., the reader may verify that
Z1=
1x+iy-jy+kx
is a general position vector in the first noninvertible plane,
Z2=
1x+iy+jy-kx
is the same in the second noninvertible plane, and that
Z1·Z2=0 by the classical dot product rules.
Actually, the above statements need some clarification. They hold as long as we are dealing with true 4-D numbers (i.e., x,y,z,ct all real and nonzero). However, if Z=1x+iy or Z=1x (Z either classical complex or real), then the eigenvalues in canonical notation will be equal. The above operations can be used to show that
the That is not all, concerning the unusual properties of the
canonical form. It
so happens that, in the matrix form, [(x-ct)+i(y+z)], [(x+ct)+i(y-z)], and their complex conjugates are eigenvalues of the typical
element of the ring It is
remarkable that any 4 X 4 real matrix would yield its determinant, eigenvalues,
and eigenvectors by inspection. Even more remarkable, the 4 X 4 matrices are a
faithful representation of the ring elements, meaning that they form a ring.
Therefore, when one multiplies two of them, the result is another 4 X 4 matrix with
the same distinctive structure; ditto, when one takes the inverse of one of them.
Everything that we do here, including functions and operators, could be stated entirely in terms of 4 X 4 real
matrices. That fact puts everything on very sound mathematical
footing, but the canonical form is much more convenient for
calculations. The 4-D vector and the canonical form provide two different interpretations for the 4-D space with which we are working. If t is time and is considered to be uniformly increasing, then the vector form 1x+iy+jz+kct implies that our three-space 1x+iy+jz and everything in it is moving uniformly along the time axis with a speed c. Conversely, the eigenvalues (x-ct)+i(y+z) and (x+ct)+i(y-z) indicate that the 4-D space can be viewed as a pair of moving, orthogonally oriented classical complex planes, one moving in the positive x and one in the negative x direction. Our choice of coordinate frame orientation in space is arbitrary, so in the canonical viewpoint we can express the 4-D space and all actions within it in terms of a pair of complementary actions, one moving radially away from the source and one collapsing radially onto the source position. For example, any kind of wave motion about an infinitesimal element source can be broken down into an outgoing and a complementary incoming wave motion. Something needs to be said about measure and metric on the algebra
as an operation is not defined within the
All operations such as this must conform to the general
definition oper (Z) = oper ( Accordingly, for vector length we define the modulus as:
It has all of the proper classical complex variable properties, but note that it is not a scalar quantity. It does not explicitly return the 4-D length. Nevertheless, the modulus as defined implicitly embodies length information about a vector Z, because
In light of the above, a metric that is representable within
the
Secondly, the hypercomplex conjugate is defined in accordance with the standard operator definition:
This has all the expected classical complex variable properties
except one:
As with the modulus, this operation does not return the 4-D length, alone, unlike for the corresponding operation on the classical complex variables. The ring
u2+v2+
w2+s2=1, are sufficient to cause the 4 X 4 matrix form to be orthogonal.
They also cause the eigenvalues to be of unit magnitude. The resulting
orthogonal transformations can be stated and applied in the matrix form, the
canonical form, or the vector form. The result is the same. [The following
update was added 8/31/03 - CMD] Although the transformation is orthogonal in
four dimensions, it is not always so when viewed in only three dimensions.
A rigid rotation in four dimensions may not appear as a rigid rotation in the
three space dimensions, and vice-versa. However, there is a
frame of reference wherein it will so appear. For the object to be rotated,
let the points be denoted by a four-vector of the following form:
Note that this represents a simple change in the coordinate
frame of reference over our standard form, given earlier. Now, we know from
elementary matrix theory that the trace of a matrix is invariant under orthogonal
transformations. In the present case, the trace is 4ct, hence in this
frame, t is invariant under, unaffected by, does not participate in, the
orthogonal transformation. Consequently, if the remaining spatial three-space is
being orthogonally rotated, it follows that the rotation is that of a rigid body.
We conclude this Section with the assertion that the
The mathematical union
* Elements of O do not have multiplicative inverses in
* If Z is an element of * If Z1Z2=O, this implies that either Z1 or Z2 (or both) is an element of O.
* If a complex-valued function f(z)
is undefined at (0,0), then the corresponding 4-D function
Under this broadened view of the zero element,
Physicists would say that O is a
closed subspace, cut off from the rest of the universe (the
4-D Function Theory How would one define an analytic function of one independent
variable of the form
Z=1x+iy+jz+kct?
It is not obvious how to do so, but this is where the several different representations
of the
The 4-D function f(Z) is analytic if both
f(
For example, if f(Z)=exp(Z)=
exp(
Each component of this, or any other, analytic function so defined
obeys a 4-D Laplace's equation, as we shall explain, below. Analytic functions such
as this represent a gravitation-like distortion, or mapping, of the entire four-space.
Notice that the four function components are very tightly linked. If one changes any
parameter value in one component, all of the other components adjust their values in
lockstep. This is also the behavior of electromagnetism
[see the Electromagnetism
page].
Additionally, 4-D functions make very pretty 3-D fractals; see the julia fractal topic on the POV-Ray site for the mathematical details and Dave Makin's fractal images page for example 3-D images; return here by use of the browser "Back" button. A Web search for "hypercomplex fractals" turns up further examples. As an aside, everything that we do here
properly subsumes and extends
the corresponding classical complex variable concepts. For example, in the 4-D
exponential function, if one sets z=ct=0, one is left with
exp(Z) = exp(x)[cos(y) + i sin(y)], the complex
variable case. Moreover, insamuch as a function of one 4-D variable reduces to
the same function applied to two different
complex variables, no new questions arise about existence, uniqueness, completeness,
internal consistency, or similar requirements, over what is already known for the
complex variable case.
Because of the way that 4-D analytic functions are
defined, they have all the
same properties as for the corresponding complex-valued functions and we can
use all the same notation as for the complex variables. We have truly extended
the complex analysis to treat a 4-D variable. This result is not possible with
noncommutative quaternions, as shown by
Scheffers, 1893.
The only unexpected property is that
there are multiple noninvertible elements; that is, whenever either eigenvalue
( The Cauchy-Riemann conditions are an extremely important part of classical 2-D complex variable theory. The same is true for the 4-D case. Because of the way that a 4-D function is defined as a pair of classical complex functions, the 4-D Cauchy-Riemann equations are immediate [but messy to develop; see Davenport(6), 1991]. Using the notation F(Z)=1U+iV+ jW+kS for an analytic function that has been expanded into the vector form, the result is:
Observe that the two upper left hand equations are the
traditional Cauchy-Riemann conditions. The 4-D relations have many and far-reaching
consequences. We present some of them, below.
Carefully note that the C-R conditions are not a simply set of PDEs that have a specific "solution." Rather, they are a set of conditions that hold for any analytic function. They can be combined with a typical linear or nonlinear PDE, and the combined solution will be analytic. To give proper credit, we note that G. Baley Price published
a text/reference work
[Price, 1991] at the same time that my monograph
was released. In it, he used the long-existing notation of bicomplex
numbers [Segre, 1892] and
developed a function theory and analysis upon them. He showed that the bicomplex
numbers are isomorphic to a ring of 4 X 4 real matrices, precisely the 4 X 4
matrix representation of the elements of the Hypercomplex Analysis Our task here is to define operators such as derivative and
integral for functions
of a 4-D variable. They must be compatible with the function definition that we
already have, and they must be amenable to formulation with the various forms of
notation for the oper (Z) = oper ( For example, if we have a function sin(Z)=sin(
Again, this looks deceptively simple and uninteresting,
but because of the form of
The 4-D Cauchy-Riemann conditions have a number of
interesting consequences that are
extensions of those for the complex variable case
[Davenport(8), 1991]. In the following, we use
the notation F(Z)=1U+iV+jW+kS for an analytic
function that has been expanded into the vector form,
The first of these says that the derivative of a 4-D
analytic function is the same
within a sign in all four coordinate directions. The first two equalities are
the same as for complex variables. These equations can be used to
reduce a partial differential equation in several real, independent variables to an
ordinary differential equation in one 4-D variable. By doing so, we would be
imposing continuity conditions on the PDE, because the Cauchy-Riemann
conditions are a statement of continuity. PDEs are typically derived with the
assumption of continuity, but without its explicit inclusion because
convenient means have not been available. Note carefully that
we are not constraining any potential solution, because the C-R conditions
hold for any and all analytic functions.
The second and third lines indicate that the 4-D gradient of an analytic function is the same within a sign in all four coordinate directions. The fourth line indicates that all four vector components of an analytic function obey a 4-D Laplace's equation, just as the components of a complex-valued function obey a 2-D Laplace's equation. That is the same as saying that the 4-D components each obey a 3-D wave equation, because the unitary transformation x'=x, y'=y, z'=z, ct'=ict, where i is the classical imaginary, transforms each into a wave equation. The last line says that the four-gradient (not the 3-D vector gradient!) of any analytic function is always and everywhere zero. It is just a succinct statement of the Cauchy-Riemann conditions because it follows so directly from them. In fact, all of these relations are extensions of the corresponding complex variable cases. The algebra, function theory, and analysis of the
The following invariants, however, might not be
anticipated: If Z=1x+iy+jz+kct is an element of In conclusion, I believe that I have not just developed a generalization of the complex numbers, but the generalization. I have found an infinite sequence of algebras and systems of analysis that treat independent variables of 1, 2, 4, 8, ... , 2n, ... dimensions, and that obey the same axioms as for the complex variables. It can be completely stated in any of the following forms: 4-D vectors, 4 X 4 real matrices, 2 X 2 complex matrices, eigenvalue/eigenvector (canonical) form, and pairs of classical complex numbers with a certain structure. All of the algebraic properties, functions, analysis,
notation, etc. carry forward.
The fourth-order system can analytically treat the entire four-space, meaning, in my
opinion, that it can be used to describe any physics effects therein. All of physics
and engineering could be recast in commutative hypercomplex notation, and would
enormously benefit from the computational ease and insight that would be afforded.
See the
Electromagnetic Theory and
Special Relativity pages for examples.
Because the hypercomplex math is built so directly upon such solid, elementary math
concepts, it cannot be dismissed without also dismissing elementary group, ring, matrix,
and complex variable theory. I believe that it has great potential usefulness.
Some Useful Links There is a commutative hypercomplex numerical computation module for the MATLAB numerical analysis software package [see www.hypercomplex.us]. As we mentioned, the quaternions are another of the five possible group algebras of 4-D numbers. They are typically written with a notation similar to that used here. However, quaternion multiplication is not commutative, the key difference being that k2= -1 for quaternions and +1 for commutative hypercomplex multiplication. If you wish to study quaternion algebras, go to Eric Weisstein's quaternion page for an overview and list of references. Clifford algebras are a superalgebra extension of the quaternion concept, carrying it to higher dimensions. They, also, are being applied to physics applications. There is an international Clifford algebra working group, with a Web page at www.clifford.org. For some example work, see Ablamowicz, who has written a module for the MAPLE symbolic algebra package that enables Clifford algebra symbolic computations. It is available via his Web page.
© 2000, 2007
|