Gauss's Analysis of Curved Surfaces - The Origin of the Metric Tensor
(Very draught, please tell me if there are any errors).
Suppose there is a line in a space. The length of this line can be expressed in terms of a coordinate system. A short length of line Ds in a two dimensional space may be expressed in terms of Pythagoras' theorem as:
D
s2 = Dx 2 + Dy 2Suppose there is another coordinate system with two axes: x1, x2, how can the length of the line be expressed in terms of these new coordinates? Gauss tackled this problem and his analysis is quite straightforward for two coordinate axes:
Figure 1:

Therefore:
D
Y = Dx1 dY/dx1 + Dx2 dY/dx2D
Y2 = (Dx1 dY/dx1 + Dx2 dY/dx2)2D
Y2 = Dx1 dY/dx1 * Dx1 dY/dx1 + Dx1 dY/dx1 * Dx2 dY/dx2 + Dx1 dY/dx1 * Dx2 dY/dx2 + Dx2 dY/dx2 * Dx2 dY/dx2D
Y2 = Dx1Dx1 dY/dx1 dY/dx1 + Dx1Dx2 dY/dx1 dY/dx2 + Dx1Dx2 dY/dx1 dY/dx2 + Dx2Dx2 dY/dx2 dY/dx2And
D
Z = Dx1 dZ/dx1 + Dx2 dZ/dx2D
Z2 = (Dx1 dZ/dx1 + Dx2 dZ/dx2)2D
Z2 = Dx1 dZ/dx1 * Dx1 dZ/dx1 + Dx1 dZ/dx1 * Dx2 dZ/dx2 + Dx1 dZ/dx1 * Dx2 dZ/dx2 + Dx2 dZ/dx2 * Dx2 dZ/dx2D
Z2 = Dx1Dx1 dZ/dx1 dZ/dx1 + Dx1Dx2 dZ/dx1 dZ/dx2 + Dx1Dx2 dZ/dx1 dZ/dx2 + Dx2Dx2 dZ/dx2 dZ/dx2Therefore:
D
Y2 + DZ2 =(d
Y/dx1dY/dx1 + dZ/dx1dZ/dx1)Dx1 Dx1+ (d
Y/dx2Y/dx1 + dZ/dx2dZ/dx1)Dx2Dx1+ (d
Y/dx1dY/dx2 + dZ/dx1dZ/dx2)Dx1 Dx2+ (d
Y/dx2dY/dx2 + dZ/dx2dZ/dx2)Dx2 Dx2For a flat surface dY= dx2 and dZ= dx1 so dY/dx2 = 1 and dZ/dx1 = 1 also dY/dx1 = 0 and dZ/dx2 = 0.
D
s2 = DY2 + DZ2 =(0 + 1)D
x1 Dx1+ (0 + 0)D
x2Dx1+ (0 + 0)D
x1 Dx2+ (1 + 0)D
x2 Dx2so Ds2 = DY2 + DZ2 = Dx12 + Dx22
Which recovers Pythagoras' theorem. However in the most general case the small intervals may not be related by Pythagoras' theorem:
Suppose
D
s2 = -DY2 + DZ2So, as before:
D
Y = Dx1 dY/dx1 + Dx2 dY/dx2D
Y2 = Dx1Dx1 dY/dx1 dY/dx1 + Dx1Dx2 dY/dx1 dY/dx2 + Dx1Dx2 dY/dx1 dY/dx2 + Dx2Dx2 dY/dx2 dY/dx2D
Z = Dx1 dZ/dx1 + Dx2 dZ/dx2D
Z2 = Dx1Dx1 dZ/dx1 dZ/dx1 + Dx1Dx2 dZ/dx1 dZ/dx2 + Dx1Dx2 dZ/dx1 dZ/dx2 + Dx2Dx2 dZ/dx2 dZ/dx2So:
-D
Y2 + DZ2 =(-(d
Y/dx1dY/dx1) + dZ/dx1dZ/dx1)Dx1 Dx1+ (-(d
Y/dx2Y/dx) + dZ/dx2dZ/dx1)Dx2Dx1+ (-(d
Y/dx1dY/dx2) - dZ/dx1dZ/dx2)Dx1 Dx2+ (-(d
Y/dx2dY/dx2) - dZ/dx2dZ/dx2)Dx2 Dx2This type of analysis can be extended to any number of dimensions. In 3D:
D
s2 = SS (dx/dxi dx/dxk + dy/dxi dy/dxk + dz/dxi dz/dxk) DxiDxk(for i=1 to 3 and k=1 to 3)
and so, using indicial notation (see maths.htm):
D
s2 = gikDxiDxkWhere
gik = (dx/dxi dx/dxk + dy/dxi dy/dxk + dz/dxi dz/dxk)
If the coordinates are not merged then Ds is dependent on both sets of coordinates. In matrix notation:
D
s2 = gDxDxbecomes:
|
Dx1 Dx2 |
Times |
a |
b |
times |
D x1 |
|
|
|
c |
d |
|
D x2 |
Where a, b, c, d stand for the values of gik.
|
Dx1a + Dx2c |
D x1b + Dx2d |
Times |
D x1 |
|
|
|
|
D x2 |
Which is:
(Dx1a + Dx2c) Dx1 + (Dx1b + Dx2d) Dx2 = Dx12a + 2Dx1Dx2(c + b) + Dx22d
So:
D
s2 = Dx12a + 2Dx1Dx2(c + b) + Dx22dD
s2 is a bilinear form that depends on both Dx1 and Dx2. It can be written in matrix notation as:D
s2 = DxT A DxWhere A is the matrix containing the values in gik. This is a special case of the bilinear form known as the quadratic form because the same matrix (Dx) appears twice; in the generalised bilinear form B = xTAy (the matrices x and y are different).
If the surface is a Euclidean plane then the values of gik are:
|
dY/dx1dY/dx1 + dZ/dx1dZ/dx1 |
d Y/dx2Y/dx1 + dZ/dx2dZ/dx1 |
|
d Y/dx2Y/dx1 + dZ/dx2dZ/dx1 |
d Y/dx2dY/dx2 + dZ/dx2dZ/dx2 |
Which become:
|
1 |
0 |
|
0 |
1 |
So the matrix A is the unit matrix I and:
D
s2 = DxT I Dxand:
D
s2 = Dx12 + Dx22Which recovers Pythagoras' theorem.
If the surface is derived from Ds2 = -DY2 + DZ2 then the values of gik are:
|
-(dY/dx1dY/dx1) + dZ/dx1dZ/dx1 |
-(d Y/dx2Y/dx1) + dZ/dx2dZ/dx1 |
|
-(d Y/dx2Y/dx1) + dZ/dx2dZ/dx1 |
-(d Y/dx2dY/dx2) + dZ/dx2dZ/dx2 |
Which becomes:
|
-1 |
0 |
|
0 |
1 |
Which allows the original 'rule' to be recovered ie: Ds2 = -Dx12 + Dx22
The Space-Time Interval
The fundamental assumption of modern relativity theory is that the space-time interval is invariant. The space-time interval is given by the following equation rather than Pythagoras' theorem:
D
s2 = - Dt2 + Dx12 + Dx22 + Dx32The origin of the negative sign in front of Dt is of considerable interest.
Imaginary Time
Suppose that Pythagoras theorem applied to the space-time interval and:
D
s2 = D t 2 + Dx12 + Dx22 + Dx32gik = (dt/dxi dt/dxk + dx/dxi dx/dxk + dy/dxi dy/dxk + dz/dxi dz/dxk)
For a flat surface dt/dx0 = dx/dx1 = dy/dx2 = dz/dx3 = 1 and all other coefficients are zero therefore:
'g' =
|
1 |
0 |
0 |
0 |
|
0 |
1 |
0 |
0 |
|
0 |
0 |
1 |
0 |
|
0 |
0 |
0 |
1 |
Which means that the time interval must be imaginary if the assumption of relativity is to be supported ie: Dt 2 = (DT Ö -1) 2
So that Ds2 = Dt2 + Dx12 + Dx22 + Dx32 becomes Ds2 = - DT2 + Dx12 + Dx22 + Dx32
This form of time is not supported in General Relativity Theory
Real Time
If real time is used then the expressions for each displacement along each coordinate axis remain the same eg:
D
T = Dx1 dT/dx1 + Dx2 dT/dx2 + Dx3 dT/dx3 + Dx4 dT/dx4etc.
But when they are combined the formula Ds2 = - DT2 + Dx12 + Dx22 + Dx32 is used instead of Pythagoras' theorem (see above for a fully worked example in 2D). This results in the following metric tensor:
'g' =
|
-1 |
0 |
0 |
0 |
|
0 |
1 |
0 |
0 |
|
0 |
0 |
1 |
0 |
|
0 |
0 |
0 |
1 |
Where g00 is given by -1 times (dt/dx0) 2.
Mixed Real and Imaginary Time.
There is a third possibility that is not generally discussed. The 'plane' in figure 1 is a plane in the observer's coordinate system and the surface has its own coordinate system. If the time coordinate on the plane were 'imaginary' and that on the plane were real then using Pythagoras' theorem:
D
s2 = Dt2 + Dx12 + Dx22 + Dx32where t equals (kt), k being a constant that is yet to be determined. In flat space-time g00 is given by (dt/dx0) 2. But t is imaginary so g00 equals -1. This then gives exactly the same metric tensor as the assumption of real time.
'g' =
|
-1 |
0 |
0 |
0 |
|
0 |
1 |
0 |
0 |
|
0 |
0 |
1 |
0 |
|
0 |
0 |
0 |
1 |
The Modern Formulation of the Metric Tensor
The modern formulation uses the following mathematical expression for the space-time interval:
s2 = gmn xnxm
where the values of s, x represent tiny displacements in each of the four coordinate axes and 'g' is the metric of the space. This expression becomes
s2 = x12 + x22 + x32 - x42 when expanded as is shown below. In matrix notation this is:
|
T X1 X2 X3 |
-1 |
0 |
0 |
0 |
times |
T |
|
|
0 |
1 |
0 |
0 |
|
X1 |
|
|
0 |
0 |
1 |
0 |
|
X2 |
|
|
0 |
0 |
0 |
1 |
|
X3 |
(Where t is time in metres, ie:c times time in secs). The numbers -1,1,1,1 are the values of the combinations of differential cofficients that were described above.
Evaluating the first matrix multiplication this becomes:
|
-T x1 x2 x3 |
times |
T |
|
|
|
X1 |
|
|
|
X2 |
|
|
|
X3 |
Which resolves to: s2 = x12 + x22 + x32 - t2
Which is the metric of space-time and applies to quite large values of s,x and t in the absence of accelerations and strong gravitational fields. Notice how the computation is more like a squared norm than a simple square and carries with it the physical implication of a product of a vector with its reflection (!).
ds2 = dx12 + dx22 + dx32 - dt2
Or, equivalently:
ds2 = dt2 - dx12 - dx22 - dx32
(The use of the term "Pythagoras' Theorem" to describe the metric comes from Weyl's statement that: "Einstein = Newton + Pythagoras" and from the derivation of the metric tensor from Gauss' analysis based on Pythagoras' theorem.)