Gauss's Analysis of Curved Surfaces - The Origin of the Metric Tensor

 

(Very draught, please tell me if there are any errors).

Suppose there is a line in a space. The length of this line can be expressed in terms of a coordinate system. A short length of line Ds in a two dimensional space may be expressed in terms of Pythagoras' theorem as:

Ds2 = Dx 2 + Dy 2

Suppose there is another coordinate system with two axes: x1, x2, how can the length of the line be expressed in terms of these new coordinates? Gauss tackled this problem and his analysis is quite straightforward for two coordinate axes:

Figure 1:

 

Therefore:

DY = Dx1 dY/dx1 + Dx2 dY/dx2

DY2 = (Dx1 dY/dx1 + Dx2 dY/dx2)2

DY2 = Dx1 dY/dx1 * Dx1 dY/dx1 + Dx1 dY/dx1 * Dx2 dY/dx2 + Dx1 dY/dx1 * Dx2 dY/dx2 + Dx2 dY/dx2 * Dx2 dY/dx2

DY2 = Dx1Dx1 dY/dx1 dY/dx1 + Dx1Dx2 dY/dx1 dY/dx2 + Dx1Dx2 dY/dx1 dY/dx2 + Dx2Dx2 dY/dx2 dY/dx2

And

DZ = Dx1 dZ/dx1 + Dx2 dZ/dx2

DZ2 = (Dx1 dZ/dx1 + Dx2 dZ/dx2)2

DZ2 = Dx1 dZ/dx1 * Dx1 dZ/dx1 + Dx1 dZ/dx1 * Dx2 dZ/dx2 + Dx1 dZ/dx1 * Dx2 dZ/dx2 + Dx2 dZ/dx2 * Dx2 dZ/dx2

DZ2 = Dx1Dx1 dZ/dx1 dZ/dx1 + Dx1Dx2 dZ/dx1 dZ/dx2 + Dx1Dx2 dZ/dx1 dZ/dx2 + Dx2Dx2 dZ/dx2 dZ/dx2

Therefore:

DY2 + DZ2 =

(dY/dx1dY/dx1 + dZ/dx1dZ/dx1)Dx1 Dx1

+ (dY/dx2Y/dx1 + dZ/dx2dZ/dx1)Dx2Dx1

+ (dY/dx1dY/dx2 + dZ/dx1dZ/dx2)Dx1 Dx2

+ (dY/dx2dY/dx2 + dZ/dx2dZ/dx2)Dx2 Dx2

For a flat surface dY= dx2 and dZ= dx1 so dY/dx2 = 1 and dZ/dx1 = 1 also dY/dx1 = 0 and dZ/dx2 = 0.

Ds2 = DY2 + DZ2 =

(0 + 1)Dx1 Dx1

+ (0 + 0)Dx2Dx1

+ (0 + 0)Dx1 Dx2

+ (1 + 0)Dx2 Dx2

so Ds2 = DY2 + DZ2 = Dx12 + Dx22

Which recovers Pythagoras' theorem. However in the most general case the small intervals may not be related by Pythagoras' theorem:

Suppose

Ds2 = -DY2 + DZ2

So, as before:

DY = Dx1 dY/dx1 + Dx2 dY/dx2

DY2 = Dx1Dx1 dY/dx1 dY/dx1 + Dx1Dx2 dY/dx1 dY/dx2 + Dx1Dx2 dY/dx1 dY/dx2 + Dx2Dx2 dY/dx2 dY/dx2

DZ = Dx1 dZ/dx1 + Dx2 dZ/dx2

DZ2 = Dx1Dx1 dZ/dx1 dZ/dx1 + Dx1Dx2 dZ/dx1 dZ/dx2 + Dx1Dx2 dZ/dx1 dZ/dx2 + Dx2Dx2 dZ/dx2 dZ/dx2

So:

-DY2 + DZ2 =

(-(dY/dx1dY/dx1) + dZ/dx1dZ/dx1)Dx1 Dx1

+ (-(dY/dx2Y/dx) + dZ/dx2dZ/dx1)Dx2Dx1

+ (-(dY/dx1dY/dx2) - dZ/dx1dZ/dx2)Dx1 Dx2

+ (-(dY/dx2dY/dx2) - dZ/dx2dZ/dx2)Dx2 Dx2

This type of analysis can be extended to any number of dimensions. In 3D:

Ds2 = SS (dx/dxi dx/dxk + dy/dxi dy/dxk + dz/dxi dz/dxk) DxiDxk

(for i=1 to 3 and k=1 to 3)

and so, using indicial notation (see maths.htm):

Ds2 = gikDxiDxk

Where

gik = (dx/dxi dx/dxk + dy/dxi dy/dxk + dz/dxi dz/dxk)

 

If the coordinates are not merged then Ds is dependent on both sets of coordinates. In matrix notation:

Ds2 = gDxDx

becomes:

Dx1 Dx2

Times

a

b

times

Dx1

 

 

c

d

 

Dx2

  Where a, b, c, d stand for the values of gik.

Dx1a + Dx2c

Dx1b + Dx2d

Times

Dx1

 

 

 

Dx2

  Which is:

(Dx1a + Dx2c) Dx1 + (Dx1b + Dx2d) Dx2 = Dx12a + 2Dx1Dx2(c + b) + Dx22d

So:

Ds2 = Dx12a + 2Dx1Dx2(c + b) + Dx22d

Ds2 is a bilinear form that depends on both Dx1 and Dx2. It can be written in matrix notation as:

Ds2 = DxT A Dx

 Where A is the matrix containing the values in gik. This is a special case of the bilinear form known as the quadratic form because the same matrix (Dx) appears twice; in the generalised bilinear form B = xTAy (the matrices x and y are different).

If the surface is a Euclidean plane then the values of gik are:

dY/dx1dY/dx1 + dZ/dx1dZ/dx1

dY/dx2Y/dx1 + dZ/dx2dZ/dx1

dY/dx2Y/dx1 + dZ/dx2dZ/dx1

dY/dx2dY/dx2 + dZ/dx2dZ/dx2

Which become:

1

0

0

1

So the matrix A is the unit matrix I and:

Ds2 = DxT I Dx

and:

Ds2 = Dx12 + Dx22

Which recovers Pythagoras' theorem.

If the surface is derived from Ds2 = -DY2 + DZ2 then the values of gik are:

-(dY/dx1dY/dx1) + dZ/dx1dZ/dx1

-(dY/dx2Y/dx1) + dZ/dx2dZ/dx1

-(dY/dx2Y/dx1) + dZ/dx2dZ/dx1

-(dY/dx2dY/dx2) + dZ/dx2dZ/dx2

Which becomes:

-1

0

0

1

Which allows the original 'rule' to be recovered ie: Ds2 = -Dx12 + Dx22

 

The Space-Time Interval

The fundamental assumption of modern relativity theory is that the space-time interval is invariant. The space-time interval is given by the following equation rather than Pythagoras' theorem:

Ds2 = - Dt2 + Dx12 + Dx22 + Dx32

The origin of the negative sign in front of Dt is of considerable interest.

Imaginary Time

Suppose that Pythagoras theorem applied to the space-time interval and:

Ds2 = D t 2 + Dx12 + Dx22 + Dx32

gik = (dt/dxi dt/dxk + dx/dxi dx/dxk + dy/dxi dy/dxk + dz/dxi dz/dxk)

For a flat surface dt/dx0 = dx/dx1 = dy/dx2 = dz/dx3 = 1 and all other coefficients are zero therefore:

'g' =

1

0

0

0

0

1

0

0

0

0

1

0

0

0

0

1

 

Which means that the time interval must be imaginary if the assumption of relativity is to be supported ie: Dt 2 = (DT Ö -1) 2

So that Ds2 = Dt2 + Dx12 + Dx22 + Dx32 becomes Ds2 = - DT2 + Dx12 + Dx22 + Dx32

This form of time is not supported in General Relativity Theory

Real Time

If real time is used then the expressions for each displacement along each coordinate axis remain the same eg:

DT = Dx1 dT/dx1 + Dx2 dT/dx2 + Dx3 dT/dx3 + Dx4 dT/dx4

etc.

But when they are combined the formula Ds2 = - DT2 + Dx12 + Dx22 + Dx32 is used instead of Pythagoras' theorem (see above for a fully worked example in 2D). This results in the following metric tensor:

'g' =

-1

0

0

0

0

1

0

0

0

0

1

0

0

0

0

1

 

Where g00 is given by -1 times (dt/dx0) 2.

Mixed Real and Imaginary Time.

There is a third possibility that is not generally discussed. The 'plane' in figure 1 is a plane in the observer's coordinate system and the surface has its own coordinate system. If the time coordinate on the plane were 'imaginary' and that on the plane were real then using Pythagoras' theorem:

Ds2 = Dt2 + Dx12 + Dx22 + Dx32

where t equals (kt), k being a constant that is yet to be determined. In flat space-time g00 is given by (dt/dx0) 2. But t is imaginary so g00 equals -1. This then gives exactly the same metric tensor as the assumption of real time.

'g' =

-1

0

0

0

0

1

0

0

0

0

1

0

0

0

0

1

 

The Modern Formulation of the Metric Tensor

The modern formulation uses the following mathematical expression for the space-time interval:

s2 = gmn xnxm

where the values of s, x represent tiny displacements in each of the four coordinate axes and 'g' is the metric of the space. This expression becomes

s2 = x12 + x22 + x32 - x42 when expanded as is shown below. In matrix notation this is:

T X1 X2 X3

-1

0

0

0

times

T

 

0

1

0

0

 

X1

 

0

0

1

0

 

X2

 

0

0

0

1

 

X3

 

(Where t is time in metres, ie:c times time in secs). The numbers -1,1,1,1 are the values of the combinations of differential cofficients that were described above.

Evaluating the first matrix multiplication this becomes:

-T x1 x2 x3

times

T

 

 

X1

 

 

X2

 

 

X3

 

Which resolves to: s2 = x12 + x22 + x32 - t2

Which is the metric of space-time and applies to quite large values of s,x and t in the absence of accelerations and strong gravitational fields. Notice how the computation is more like a squared norm than a simple square and carries with it the physical implication of a product of a vector with its reflection (!).

ds2 = dx12 + dx22 + dx32 - dt2

Or, equivalently:

ds2 = dt2 - dx12 - dx22 - dx32

(The use of the term "Pythagoras' Theorem" to describe the metric comes from Weyl's statement that: "Einstein = Newton + Pythagoras" and from the derivation of the metric tensor from Gauss' analysis based on Pythagoras' theorem.)