© 2000, 2007
Clyde M. Davenport
cmdaven@comcast.net
| 2-D Formulation | 4-D Formulation | Elementary Properties | |
| Einstein's Postulates | Relativistic Effects | Unorthodox Effects | Conclusions |
|
Introduction In order to understand the following, the reader should first review the Hypercomplex Math page. It describes a system of commutative 4-D numbers of the form Z=1x+iy+jz+kct [not quaternions] that behaves in all ways like the 2-D classical complex variables. It is based upon elementary group, ring, and matrix principles. We are motivated to attempt a formulation of special
relativity in terms of commutative hypercomplex mathematics by one highly suggestive aspect of the mathematics, alone. The overall four-space is made up of
the In the following, we shall present a brief sketch of
the hypercomplex formulation
for the Internet reader. Refer to
Einstein(1), 1923 or Bergmann,
1942 or any current treatment of elementary special relativity for more details on
the basic subject, and the
Hypercomplex Math page and
Davenport(9), 1991 for more details on the
hypercomplex formulation.
A 2-D Formulation A
The relative motion v is always assumed to be along the +x direction, and any observer is assumed to be at the origin of coordinates. This means that only radially inward or outward motion is considered. In hypercomplex notation, this is
The object T is a coordinate transform, as
indicated in the last line.
Here, it is applied in the vector notation, but it could be written in the 4 X 4
real matrix notation and applied in a traditional vector-matrix equation. However,
the 4-D hypercomplex formulation is far more computationally efficient.
Notice that we applied the transform to only 2-D (x,ct) vectors. Nothing is said about what might be happening to the y and z coordinates that lie transverse to the line of motion. One may review Einstein's original paper on special relativity [ Einstein(2), 1905] to see that he assumed that the transverse coordinates might be affected in magnitude but not direction, and went on to deduce under those circumstances that magnitude, also, was not affected. We could do as he did and set y'=y, z'=z and declare that our hypercomplex formulation is complete. It would be correct according to the current physics paradigm. Note also that the above formulations do not say how a
given situation would actually
appear to the aided or unaided vision of an observer, but state only how any
measurements of objects and events would be affected due to the relative motion. In
particular, nothing is said about the distance from observer to object or about the
geometry and orientation of the object. Moreover, in these restricted-case
formulations the observer is looking end-on at the object, so will "see" very little.
A 4-D Formulation The above 2-D formulation is not completely general because it is not given in terms of a full 4-D motion vector whose line of action can be in any direction and may or may not go through the observer's origin of coordinates. We would like a concise hypercomplex formulation that allows the moving inertial frame (constant, linear motion) to move in any direction with respect to the observer's coordinate frame. It can be constructed as follows. Let
be the 4-D position vector of the origin of the
moving coordinate frame. Let
the following dot notation indicate the ordinary derivative with respect to the
observer's time. Then, the 4-D relative motion vector as measured by the
observer (we can't explain how, here; it is a nontrivial problem) is
The moving coordinate frame is considered to move in a parallel fashion with respect to the stationary frame, but their origins may or may not ever be coincident. The line of action may stand off at some distance from the origin of the stationary coordinates. Now, hold that notation while we do the following. The relativity transform T, above, can be rearranged by use of
Note that, in the last step, we divided both the numerator and the denominator under the radical by the constant, k. This last form visually emphasizes that the relative motion
v is always in the 1 (x-coordinate) direction in the traditional
treatment. We want to allow it to be in any arbitrary direction. Looking at the
above form, we notice that the signs are such that if we wrote both vectors in the
canonical form, then the one in the numerator would be the same as the one in the
denominator, except with the eigenvalues interchanged. We continue this property
while adding in the two missing components for an arbitrary relative motion vector,
as follows:
We have used the notation
Note that the eigenvalues of T (the radical
quantities) are mutual inverses. This will be important, below.
By use of standard hypercomplex math methods, T can be expanded into 4-D vector form, as follows:
Elementary Properties We do not know at this point whether the above
additions are acceptable and will
produce the correct consequences. We do know that the modified transform
contains the traditional 2-D formulation, because if we simply set the y,z
components of the 3-D velocity to zero, then we are back to the 2-D formulation. We
shall proceed by examining the properties and consequences and seeing if they agree
with what is known from experimental physics.
We begin with some very minor and elementary, but nevertheless necessary, properties of the transform T. We cannot take the space to prove the following assertions, here, see Davenport(10), 1991 for the details.
Einstein's Postulates All of the above properties are good, and promising, but now we come to the two most important properties. The transforms T must produce results in accord with Einstein's two postulates of special relativity. Einstein's First Postulate The First Postulate is that the laws of physics are the same in all inertial systems. An inertial system is one that is free of acceleration and external forces, but can have constant, straight-line motion. Agreement with this postulate is proved by showing that the equations for the laws of physics remain unchanged in form under coordinate transforms T. We shall do that here. We know that an orthogonal transformation of coordinates leaves the equations of physics unchanged in form, and we intend to show that our 4-D formulation T is a form of orthogonal transformation. The argument might appear disconnected, but it will make sense in the end. To achieve our objective, we must do some look-ahead, back-and-forth,
longitudinal thinking. We recall that the Lorentz transformation leaves the form
x2+y2+z2
-(ct)2 invariant, and if we
first perform the unitary transformation x'=x, y'=y, z'=z, ct'=ict, where "i" is the
classical imaginary, then the form
x2+y2+z2+(ct)2 is invariant. In the
commutative hypercomplex formulation, we note that the form
x2-y2+z2
-(ct)2 is invariant, and we
take the clue from the above and apply the unitary transformation x'=x, y'= -iy,
z'=z, ct'=ict. The result is that the form
x2+y2+z2
+(ct)2 is invariant.
How does that help for our present purposes? Well,
if we apply the latter unitary transformation (designate it by A) to the coordinate
transformation equation X'=TX, we get:
AX' = ATA-1 AX As shown above, the element T has a vector
representation. Additionally,
it also has a matrix representation, as do all elements of the
Now, if we apply the similarity transform
T ' = ATA-1, then we get
Recall that to find the inverse of any element of
If we use this observation to construct the inverse of
T' (the similarity-transformed T), we get:
Finally, we are at our objective: Notice that the inverse
of T' is merely its
transpose! T' is complex orthogonal. That means that under the right
viewpoint (the proper unitary transformation of coordinates) and by virtue of 4-D
orthogonality arguments, the group
Einstein's Second Postulate The Second Postulate is that the speed of light in a vacuum is constant, the same in all inertial frames, and is independent of the source and any observer. In the traditional approach, it is built into the derivation of the Lorentz transform that the form x2+y2+z2 -(ct)2 is invariant from frame to frame (i.e., under the Lorentz transformation group). Therefore, if a spherical wavefront expands at a rate given by x2+y2+z2 =(ct)2 in the observer's own inertial frame of reference, with c constant, then it does so equally in all other inertial frames. We remarked earlier that an invariant quantity of the group For those uncomfortable with the use of unitary
transformations in our proofs, note that physicists accept as equivalent any two mathematical formulations that are linked by a unitary transformation that transforms one formulation into the other.
It is a widely-used technique, especially in quantum mechanics.
Relativistic Effects Compounding of Velocities As in the traditional special relativity theory, one finds the effect of compounding (adding) two velocities (for example, the velocity of the origin of the moving coordinate frame, plus a second velocity measured with respect to the moving frame) by multiplying their two associated Lorentz transforms, then deducing the single equivalent velocity that would produce the same effect. In the
What we are after is a single velocity expression that will replace the product of velocities in the denominator. We know how to form the products under the radical. They are
Notice that the product numerator and denominator
have the same form as the single-velocity original. The components of the numerator and the denominator correspond term by term, and the j,k signs are negative in the numerator, as in the original. This is a remarkable, fortuitous, and important property. Now that we have verified that the product numerator and denominator have the
requisite form and sign patterns relative to each other, we can focus on the denominator to extract an equivalent relative motion vector.
These products have the units of velocity squared,
but we want simply equivalent velocity expressions. We will have to find a common factor in both numerator and denominator with units of velocity and cancel it from both. Moreover, the k
component for a 4-D relative motion vector is always c (in agreement with Einstein's second postulate), so our factoring/canceling should leave c as the k component in both the numerator and denominator. In effect, we are renormalizing to keep the speed of light constant. Looking ahead toward what we will need to be
compatible with the Einstein law for compounding of velocities, we see that we will have to divide through by some quantity containing a term of the form
then we get a promising-looking result for the denominator:
We shall call this expression the generalized Einstein's addition law for relative velocities. Recall that before renormalization, this was the product of two 4-D vectors, V1 and V2. Now notice the remarkable
symmetry of the renormalized expression: If one sets the subscripted-2 quantities to zero, then the expression reverts to V1, and if one sets the subscripted-1 quantities to zero, the expression reverts to V2. Similar statements apply to the renormalized numerator. Also, the compounded velocity
expression reduces properly to the single velocity case when either 3-D spatial velocity is set to zero.
The acid test of this expression is: Does it reduce to the Einstein addition law for velocities when V1 and V2 are aligned in the same direction, along the +x-axis? It certainly does. If we set all y,z quantities to zero (forcing alignment with the x-axis), then we are left with
and the 1-component is precisely the Einstein
addition law for velocities, the resultant being aligned along the +x-axis, as required.
Length Contraction We can readily provide a formula for this property.
Any 4-D position vector X transforms according to X'=TX. The prime ( ' ) mark refers to a quantity at rest in the moving system, that is measured in the observer's (at rest) system. In all cases below, the unprimed coordinates refer to the (arbitrarily-assumed) stationary frame of reference, and the primed quantities refer to the (arbitrarily-selected) moving frame of reference. An interval vector transforms the same way, with U,R,W,S as given above:
From the traditional treatments of special relativity, when one is discussing length measurements one apparently does not consider how much time might be required to make the measurements. It is as if a measurement is instantaneous ( both primed and unprimed time intervals are zero). In the above, if we take that approach, then we are left with
From this, we can construct the generalized Lorentz contraction law:
The first formula provides the means to calculate the at-rest length Lo of an object in a moving frame from observations made from a stationary frame. This is the reverse of
the traditional Lorentz contraction formula, which states how an object with a known length and in constant, linear motion would appear to a stationary observer (foreshortened in the direction of motion). It might appear that we could go back to the 3-D vector-matrix equation above and solve for the length in terms of the primed-system measurements, but that would do no good - we would get the same result!
That is because neither observer in either frame can tell who is moving and who is stationary; to each, the other appears to be moving. The T transform as constructed by the moving observer would be the same as for the stationary observer. The second formula, above, gives the apparent length as seen by an observer in the stationary frame.
Again, the acid test of this formula is: Does it give the same result as the traditional Lorentz contraction formula under the same conditions? The answer is yes, but it is messy and space-consuming to explain in detail because of the complicated definitions of U,R,W,S. Nevertheless, it is just a matter of algebraic manipulation. We shall simply give the results here. If the dimension to be measured is aligned along the +x-axis, then
Moreover, in the traditional treatment the relative velocity is aligned along the +x-axis. That yields R=W=0 and
We are left with
The delta-x quantity is the length measurement L made by a stationary observer of the moving object whose length is Lo. Therefore, the length as seen by a stationary observer is:
which is precisely the Lorentz contraction formula.
Time Dilation We can dispense with this one rather easily. It is expedient to go back to the 4-D vector transform equation X'=TX and work with the inverse, X=T-1X':
As we pointed out above, because of its peculiar structure we can get the inverse of T by merely changing the signs on its W,S vector components. This is very helpful in calculations. Now, the above transformation indicates that the time component of a 4-D vector transforms according to:
In the second equation, we have merely factored out the delta-t quantity in order to compare it with the corresponding rest-frame quantity. The result is an odd expression at first glance. Recall that the relative motion vector V is incorporated into the U,R,W,S vector components of the relativistic
transform T. Apparently, the derivatives in parentheses represent any additional components of velocity measured with respect to the moving (primed) coordinate frame. We are not interested in adding velocity in the primed frame. A comparison of time effects between the stationary and moving origins of coordinates will be sufficient. We therefore set the additional velocity components
to zero, leaving
We shall call this the generalized time dilation formula. It readily reduces to that from the traditional special relativity analysis. As we saw earlier, in the restricted case of relative velocity along the +x-axis, U reduces to a division by the Lorentz contraction factor:
The effect of this formula is that, from a
stationary observer's perspective, time appears to run more slowy for
a moving body. This has been tested in elementary particle physics experiments and is found to be true.
Unorthodox Effects Finally - we can put it off no longer - we get to the nontraditional aspects of the present formulation. First, in all of the above formulas for relativistic effects, the observer's perceptions change dramatically if there is a standoff of the line of action and the relative velocity line of motion is rotated off of the observer's line of sight. For example, look at the definition of T:
We have already shown that it reduces to the restricted case when the i,j components are set to zero. But look what happens when the 1,i components are set to zero (leaving the remaining velocity turned perpendicular to the line of sight): the transform reduces to unity, no effect! In the latter case, it is assumed that the line of action does not go through the observer's position; otherwise, we have the traditional
restricted case, just oriented along a different direction in space. Of course, with a standoff, the nonlinearly-varying origin-to-origin distance between the frames will cause an apparent deceleration/acceleration effect as the moving frame passes the
point of closest approach, with acceleration dropping to zero only at closest approach. It would be difficult to make measurements under those circumstances. However, the acceleration effects are minimized if the moving frame is standing well off, say at astronomical distances; then when it is near the closest approach, no relativistic effects will be observed. There are many cases of
astronomical objects moving apart at apparently superluminal speeds [Hughes, 1990], which they couldn't do if a transverse view of the action still showed full relativistic effects. But, note: If we were on one of the objects and viewing the other along the line of motion, then we would see all of the traditional relativistic effects. Therefore, relativistic effects must shade out from the traditional ones along the line of motion to none when there is an offset of the line of motion and the moving object passes through the point of closest approach. Secondly, all coordinates are affected equally in the 4-D commutative hypercomplex formulation. In Einstein's treatment, the transverse coordinates were more or less assumed to be unaffected. To see how the hypercomplex formulation differs, go back and look at our 2-D hypercomplex formulation. Recall that it addresses the restricted case of relative motion along the +x-axis. In it, we have a transform T with only 1,k components, which is applied to position vectors X, also with only 1,k components (the latter to maintain agreement with Einstein's formulation). However, even an element T with only 1,k components has a 4 X 4 real matrix representation. It is
This has an attractive symmetry. If we apply it in a coordinate transformation X'=TX, but this time with 4-D position vectors, then we get
In this view, all coordinates are treated
qualitatively the same by the relativity transform. The result has a compelling symmetry. More importantly, it is in keeping with continuity of effects, in the sense of analytic functions,
and it obeys Einstein's two postulates.
Observe that (x',ct') and (x,ct) are a transform pair [i.e., do not involve (y,z)], and similarly for (y',z'),(y,z). Each of these transformations is of the form
with
Therefore, these transformations represent hyperbolic rotations. We are discussing the restricted case of motion along the +x-axis, but similar statements hold for the general transform T; it is just a matter of our perspective. A
similar observation regarding the nature of the Lorentz transform is made by Aharoni, 1965, adding another note of agreement between the two separate formulations. However, an important note of disagreement is that the transverse coordinates will be affected both in direction and magnitude.
Conclusions We have produced a direct, concise formulation of
special relativity with an arbitrary relative motion vector, and have shown that it reduces properly for the restricted case of relative velocity along the +x-axis. We showed that it obeys Einstein's two postulates of special relativity, under the proper viewpoint
(similarity transformation). Recall that Einstein conditioned his formulation on only the two postulates, so inasmuch as our present formulation obeys the two postulates, we can argue for a successful alternative formulation.
The group One will also hear arguments that much of physics is inherently noncommutative. Examples will be given such as successive rotations of a 3-D object around orthogonal axes. One cannot get back to the original orientation by merely reversing the order of operations. However, operations that look noncommutative in three dimensions may well be commutative in four. Note that a rigid rotation in four dimensions does not necessarily result in a rigid rotation of the three-space and vice-versa. For those who will say that they don't understand or
don't believe the strange mathematics, we point out that everything can be stated in terms of 4 X 4 real matrices. One would not normally want to go that route, however, because the canonical form is so much more concise, computationally efficient, and theoretically illuminating. Its formulation upon matrix eigenvalues and with numerous rotational invariants should be attractive to physicists.
© 2000, 2007
|