projection

Lorentz transformations in Space Time Algebra (STA)

December 12, 2020 math and physics play , , , , , , , , , , , , , , , , , ,

[If mathjax doesn’t display properly for you, click here for a PDF of this post]

Motivation.

One of the remarkable features of geometric algebra are the complex exponential sandwiches that can be used to encode rotations in any dimension, or rotation like operations like Lorentz transformations in Minkowski spaces. In this post, we show some examples that unpack the geometric algebra expressions for Lorentz transformations operations of this sort. In particular, we will look at the exponential sandwich operations for spatial rotations and Lorentz boosts in the Dirac algebra, known as Space Time Algebra (STA) in geometric algebra circles, and demonstrate that these sandwiches do have the desired effects.

Lorentz transformations.

Theorem 1.1: Lorentz transformation.

The transformation
\begin{equation}\label{eqn:lorentzTransform:580}
x \rightarrow e^{B} x e^{-B} = x’,
\end{equation}
where \( B = a \wedge b \), is an STA 2-blade for any two linearly independent four-vectors \( a, b \), is a norm preserving, that is
\begin{equation}\label{eqn:lorentzTransform:600}
x^2 = {x’}^2.
\end{equation}

Start proof:

The proof is disturbingly trivial in this geometric algebra form
\begin{equation}\label{eqn:lorentzTransform:40}
\begin{aligned}
{x’}^2
&=
e^{B} x e^{-B} e^{B} x e^{-B} \\
&=
e^{B} x x e^{-B} \\
&=
x^2 e^{B} e^{-B} \\
&=
x^2.
\end{aligned}
\end{equation}

End proof.

In particular, observe that we did not need to construct the usual infinitesimal representations of rotation and boost transformation matrices or tensors in order to demonstrate that we have spacetime invariance for the transformations. The rough idea of such a transformation is that the exponential commutes with components of the four-vector that lie off the spacetime plane specified by the bivector \( B \), and anticommutes with components of the four-vector that lie in the plane. The end result is that the sandwich operation simplifies to
\begin{equation}\label{eqn:lorentzTransform:60}
x’ = x_\parallel e^{-B} + x_\perp,
\end{equation}
where \( x = x_\perp + x_\parallel \) and \( x_\perp \cdot B = 0 \), and \( x_\parallel \wedge B = 0 \). In particular, using \( x = x B B^{-1} = \lr{ x \cdot B + x \wedge B } B^{-1} \), we find that
\begin{equation}\label{eqn:lorentzTransform:80}
\begin{aligned}
x_\parallel &= \lr{ x \cdot B } B^{-1} \\
x_\perp &= \lr{ x \wedge B } B^{-1}.
\end{aligned}
\end{equation}
When \( B \) is a spacetime plane \( B = b \wedge \gamma_0 \), then this exponential has a hyperbolic nature, and we end up with a Lorentz boost. When \( B \) is a spatial bivector, we end up with a single complex exponential, encoding our plane old 3D rotation. More general \( B \)’s that encode composite boosts and rotations are also possible, but \( B \) must be invertible (it should have no lightlike factors.) The rough geometry of these projections is illustrated in fig 1, where the spacetime plane is represented by \( B \).

Projection and rejection geometry.

fig 1. Projection and rejection geometry.

 

What is not so obvious is how to pick \( B \)’s that correspond to specific rotation axes or boost directions. Let’s consider each of those cases in turn.

Theorem 1.2: Boost.

The boost along a direction vector \( \vcap \) and rapidity \( \alpha \) is given by
\begin{equation}\label{eqn:lorentzTransform:620}
x’ = e^{-\vcap \alpha/2} x e^{\vcap \alpha/2},
\end{equation}
where \( \vcap = \gamma_{k0} \cos\theta^k \) is an STA bivector representing a spatial direction with direction cosines \( \cos\theta^k \).

Start proof:

We want to demonstrate that this is equivalent to the usual boost formulation. We can start with decomposition of the four-vector \( x \) into components that lie in and off of the spacetime plane \( \vcap \).
\begin{equation}\label{eqn:lorentzTransform:100}
\begin{aligned}
x
&= \lr{ x^0 + \Bx } \gamma_0 \\
&= \lr{ x^0 + \Bx \vcap^2 } \gamma_0 \\
&= \lr{ x^0 + \lr{ \Bx \cdot \vcap} \vcap + \lr{ \Bx \wedge \vcap} \vcap } \gamma_0,
\end{aligned}
\end{equation}
where \( \Bx = x \wedge \gamma_0 \). The first two components lie in the boost plane, whereas the last is the spatial component of the vector that lies perpendicular to the boost plane. Observe that \( \vcap \) anticommutes with the dot product term and commutes with he wedge product term, so we have
\begin{equation}\label{eqn:lorentzTransform:120}
\begin{aligned}
x’
&=
\lr{ x^0 + \lr{ \Bx \cdot \vcap } \vcap } \gamma_0
e^{\vcap \alpha/2 }
e^{\vcap \alpha/2 }
+
\lr{ \Bx \wedge \vcap } \vcap \gamma_0
e^{-\vcap \alpha/2 }
e^{\vcap \alpha/2 } \\
&=
\lr{ x^0 + \lr{ \Bx \cdot \vcap } \vcap } \gamma_0
e^{\vcap \alpha }
+
\lr{ \Bx \wedge \vcap } \vcap \gamma_0.
\end{aligned}
\end{equation}
Noting that \( \vcap^2 = 1 \), we may expand the exponential in hyperbolic functions, and find that the boosted portion of the vector expands as
\begin{equation}\label{eqn:lorentzTransform:260}
\begin{aligned}
\lr{ x^0 + \lr{ \Bx \cdot \vcap} \vcap } \gamma_0 e^{\vcap \alpha}
&=
\lr{ x^0 + \lr{ \Bx \cdot \vcap} \vcap } \gamma_0 \lr{ \cosh\alpha + \vcap \sinh \alpha} \\
&=
\lr{ x^0 + \lr{ \Bx \cdot \vcap} \vcap } \lr{ \cosh\alpha – \vcap \sinh \alpha} \gamma_0 \\
&=
\lr{ x^0 \cosh\alpha – \lr{ \Bx \cdot \vcap} \sinh \alpha} \gamma_0
+
\lr{ -x^0 \sinh \alpha + \lr{ \Bx \cdot \vcap} \cosh \alpha } \vcap \gamma_0.
\end{aligned}
\end{equation}
We are left with
\begin{equation}\label{eqn:lorentzTransform:320}
\begin{aligned}
x’
&=
\lr{ x^0 \cosh\alpha – \lr{ \Bx \cdot \vcap} \sinh \alpha} \gamma_0
+
\lr{ \lr{ \Bx \cdot \vcap} \cosh \alpha -x^0 \sinh \alpha } \vcap \gamma_0
+
\lr{ \Bx \wedge \vcap} \vcap \gamma_0 \\
&=
\begin{bmatrix}
\gamma_0 & \vcap \gamma_0
\end{bmatrix}
\begin{bmatrix}
\cosh\alpha & – \sinh\alpha \\
-\sinh\alpha & \cosh\alpha
\end{bmatrix}
\begin{bmatrix}
x^0 \\
\Bx \cdot \vcap
\end{bmatrix}
+
\lr{ \Bx \wedge \vcap} \vcap \gamma_0,
\end{aligned}
\end{equation}
which has the desired Lorentz boost structure. Of course, this is usually seen with \( \vcap = \gamma_{10} \) so that the components in the coordinate column vector are \( (ct, x) \).

End proof.

Theorem 1.3: Spatial rotation.

Given two linearly independent spatial bivectors \( \Ba = a^k \gamma_{k0}, \Bb = b^k \gamma_{k0} \), a rotation of \(\theta\) radians in the plane of \( \Ba, \Bb \) from \( \Ba \) towards \( \Bb \), is given by
\begin{equation}\label{eqn:lorentzTransform:640}
x’ = e^{-i\theta} x e^{i\theta},
\end{equation}
where \( i = (\Ba \wedge \Bb)/\Abs{\Ba \wedge \Bb} \), is a unit (spatial) bivector.

Start proof:

Without loss of generality, we may pick \( i = \acap \bcap \), where \( \acap^2 = \bcap^2 = 1 \), and \( \acap \cdot \bcap = 0 \). With such an orthonormal basis for the plane, we can decompose our four vector into portions that lie in and off the plane
\begin{equation}\label{eqn:lorentzTransform:400}
\begin{aligned}
x
&= \lr{ x^0 + \Bx } \gamma_0 \\
&= \lr{ x^0 + \Bx i i^{-1} } \gamma_0 \\
&= \lr{ x^0 + \lr{ \Bx \cdot i } i^{-1} + \lr{ \Bx \wedge i } i^{-1} } \gamma_0.
\end{aligned}
\end{equation}
The projective term lies in the plane of rotation, whereas the timelike and spatial rejection term are perpendicular. That is
\begin{equation}\label{eqn:lorentzTransform:420}
\begin{aligned}
x_\parallel &= \lr{ \Bx \cdot i } i^{-1} \gamma_0 \\
x_\perp &= \lr{ x^0 + \lr{ \Bx \wedge i } i^{-1} } \gamma_0,
\end{aligned}
\end{equation}
where \( x_\parallel \wedge i = 0 \), and \( x_\perp \cdot i = 0 \). The plane pseudoscalar \( i \) anticommutes with \( x_\parallel \), and commutes with \( x_\perp \), so
\begin{equation}\label{eqn:lorentzTransform:440}
\begin{aligned}
x’
&= e^{-i\theta/2} \lr{ x_\parallel + x_\perp } e^{i\theta/2} \\
&= x_\parallel e^{i\theta} + x_\perp.
\end{aligned}
\end{equation}
However
\begin{equation}\label{eqn:lorentzTransform:460}
\begin{aligned}
\lr{ \Bx \cdot i } i^{-1}
&=
\lr{ \Bx \cdot \lr{ \acap \wedge \bcap } } \bcap \acap \\
&=
\lr{\Bx \cdot \acap} \bcap \bcap \acap
-\lr{\Bx \cdot \bcap} \acap \bcap \acap \\
&=
\lr{\Bx \cdot \acap} \acap
+\lr{\Bx \cdot \bcap} \bcap,
\end{aligned}
\end{equation}
so
\begin{equation}\label{eqn:lorentzTransform:480}
\begin{aligned}
x_\parallel e^{i\theta}
&=
\lr{
\lr{\Bx \cdot \acap} \acap
+
\lr{\Bx \cdot \bcap} \bcap
}
\gamma_0
\lr{
\cos\theta + \acap \bcap \sin\theta
} \\
&=
\acap \lr{
\lr{\Bx \cdot \acap} \cos\theta

\lr{\Bx \cdot \bcap} \sin\theta
}
\gamma_0
+
\bcap \lr{
\lr{\Bx \cdot \acap} \sin\theta
+
\lr{\Bx \cdot \bcap} \cos\theta
}
\gamma_0,
\end{aligned}
\end{equation}
so
\begin{equation}\label{eqn:lorentzTransform:500}
x’
=
\begin{bmatrix}
\acap & \bcap
\end{bmatrix}
\begin{bmatrix}
\cos\theta & – \sin\theta \\
\sin\theta & \cos\theta
\end{bmatrix}
\begin{bmatrix}
\Bx \cdot \acap \\
\Bx \cdot \bcap \\
\end{bmatrix}
\gamma_0
+
\lr{ x \wedge i} i^{-1} \gamma_0.
\end{equation}
Observe that this rejection term can be explicitly expanded to
\begin{equation}\label{eqn:lorentzTransform:520}
\lr{ \Bx \wedge i} i^{-1} \gamma_0 =
x –
\lr{ \Bx \cdot \acap } \acap \gamma_0

\lr{ \Bx \cdot \acap } \acap \gamma_0.
\end{equation}
This is the timelike component of the vector, plus the spatial component that is normal to the plane. This exponential sandwich transformation rotates only the portion of the vector that lies in the plane, and leaves the rest (timelike and normal) untouched.

End proof.

Problems.

Problem: Verify components relative to boost direction.

In the proof of thm. 1.2, the vector \( x \) was expanded in terms of the spacetime split. An alternate approach, is to expand as
\begin{equation}\label{eqn:lorentzTransform:340}
\begin{aligned}
x
&= x \vcap^2 \\
&= \lr{ x \cdot \vcap + x \wedge \vcap } \vcap \\
&= \lr{ x \cdot \vcap } \vcap + \lr{ x \wedge \vcap } \vcap.
\end{aligned}
\end{equation}
Show that
\begin{equation}\label{eqn:lorentzTransform:360}
\lr{ x \cdot \vcap } \vcap
=
\lr{ x^0 + \lr{ \Bx \cdot \vcap} \vcap } \gamma_0,
\end{equation}
and
\begin{equation}\label{eqn:lorentzTransform:380}
\lr{ x \wedge \vcap } \vcap
=
\lr{ \Bx \wedge \vcap} \vcap \gamma_0.
\end{equation}

Answer

Let \( x = x^\mu \gamma_\mu \), so that
\begin{equation}\label{eqn:lorentzTransform:160}
\begin{aligned}
x \cdot \vcap
&=
\gpgradeone{ x^\mu \gamma_\mu \cos\theta^b \gamma_{b 0} } \\
&=
x^\mu \cos\theta^b \gpgradeone{ \gamma_\mu \gamma_{b 0} }
.
\end{aligned}
\end{equation}
The \( \mu = 0 \) component of this grade selection is
\begin{equation}\label{eqn:lorentzTransform:180}
\gpgradeone{ \gamma_0 \gamma_{b 0} }
=
-\gamma_b,
\end{equation}
and for \( \mu = a \ne 0 \), we have
\begin{equation}\label{eqn:lorentzTransform:200}
\gpgradeone{ \gamma_a \gamma_{b 0} }
=
-\delta_{a b} \gamma_0,
\end{equation}
so we have
\begin{equation}\label{eqn:lorentzTransform:220}
\begin{aligned}
x \cdot \vcap
&=
x^0 \cos\theta^b (-\gamma_b)
+
x^a \cos\theta^b (-\delta_{ab} \gamma_0 ) \\
&=
-x^0 \vcap \gamma_0

x^b \cos\theta^b \gamma_0 \\
&=
– \lr{ x^0 \vcap + \Bx \cdot \vcap } \gamma_0,
\end{aligned}
\end{equation}
where \( \Bx = x \wedge \gamma_0 \) is the spatial portion of the four vector \( x \) relative to the stationary observer frame. Since \( \vcap \) anticommutes with \( \gamma_0 \), the component of \( x \) in the spacetime plane \( \vcap \) is
\begin{equation}\label{eqn:lorentzTransform:240}
\lr{ x \cdot \vcap } \vcap =
\lr{ x^0 + \lr{ \Bx \cdot \vcap} \vcap } \gamma_0,
\end{equation}
as expected.

For the rejection term, we have
\begin{equation}\label{eqn:lorentzTransform:280}
x \wedge \vcap
=
x^\mu \cos\theta^s \gpgradethree{ \gamma_\mu \gamma_{s 0} }.
\end{equation}
The \( \mu = 0 \) term clearly contributes nothing, leaving us with:
\begin{equation}\label{eqn:lorentzTransform:300}
\begin{aligned}
\lr{ x \wedge \vcap } \vcap
&=
\lr{ x \wedge \vcap } \cdot \vcap \\
&=
x^r \cos\theta^s \cos\theta^t \lr{ \lr{ \gamma_r \wedge \gamma_{s}} \gamma_0 } \cdot \lr{ \gamma_{t0} } \\
&=
x^r \cos\theta^s \cos\theta^t \gpgradeone{
\lr{ \gamma_r \wedge \gamma_{s} } \gamma_0 \gamma_{t0}
} \\
&=
-x^r \cos\theta^s \cos\theta^t \lr{ \gamma_r \wedge \gamma_{s}} \cdot \gamma_t \\
&=
-x^r \cos\theta^s \cos\theta^t \lr{ -\gamma_r \delta_{st} + \gamma_s \delta_{rt} } \\
&=
x^r \cos\theta^t \cos\theta^t \gamma_r

x^t \cos\theta^s \cos\theta^t \gamma_s \\
&=
\Bx \gamma_0
– (\Bx \cdot \vcap) \vcap \gamma_0 \\
&=
\lr{ \Bx \wedge \vcap} \vcap \gamma_0,
\end{aligned}
\end{equation}
as expected. Is there a clever way to demonstrate this without resorting to coordinates?

Problem: Rotation transformation components.

Given a unit spatial bivector \( i = \acap \bcap \), where \( \acap \cdot \bcap = 0 \) and \( i^2 = -1 \), show that
\begin{equation}\label{eqn:lorentzTransform:540}
\lr{ x \cdot i } i^{-1}
=
\lr{ \Bx \cdot i } i^{-1} \gamma_0
=
\lr{\Bx \cdot \acap } \acap \gamma_0
+
\lr{\Bx \cdot \bcap } \bcap \gamma_0,
\end{equation}
and
\begin{equation}\label{eqn:lorentzTransform:560}
\lr{ x \wedge i } i^{-1}
=
\lr{ \Bx \wedge i } i^{-1} \gamma_0
=
x –
\lr{\Bx \cdot \acap } \acap \gamma_0

\lr{\Bx \cdot \bcap } \bcap \gamma_0.
\end{equation}
Also show that \( i \) anticommutes with \( \lr{ x \cdot i } i^{-1} \) and commutes with \( \lr{ x \wedge i } i^{-1} \).

Answer

This problem is left for the reader, as I don’t feel like typing out my solution.

The first part of this problem can be done in the tedious coordinate approach used above, but hopefully there is a better way.

For the last (commutation) part of the problem, here is a hint. Let \( x \wedge i = n i \), where \( n \cdot i = 0 \). The result then follows easily.

Fundamental Theorem of Geometric Calculus

September 20, 2016 math and physics play , , , , , , , , , , , , , , , ,

[Click here for a PDF of this post with nicer formatting]

Stokes Theorem

The Fundamental Theorem of (Geometric) Calculus is a generalization of Stokes theorem to multivector integrals. Notationally, it looks like Stokes theorem with all the dot and wedge products removed. It is worth restating Stokes theorem and all the definitions associated with it for reference

Stokes’ Theorem

For blades \(F \in \bigwedge^{s}\), and \(m\) volume element \(d^k \Bx, s < k\), \begin{equation*} \int_V d^k \Bx \cdot (\boldpartial \wedge F) = \oint_{\partial V} d^{k-1} \Bx \cdot F. \end{equation*} This is a loaded and abstract statement, and requires many definitions to make it useful

  • The volume integral is over a \(m\) dimensional surface (manifold).
  • Integration over the boundary of the manifold \(V\) is indicated by \( \partial V \).
  • This manifold is assumed to be spanned by a parameterized vector \( \Bx(u^1, u^2, \cdots, u^k) \).
  • A curvilinear coordinate basis \( \setlr{ \Bx_i } \) can be defined on the manifold by
    \begin{equation}\label{eqn:fundamentalTheoremOfCalculus:40}
    \Bx_i \equiv \PD{u^i}{\Bx} \equiv \partial_i \Bx.
    \end{equation}

  • A dual basis \( \setlr{\Bx^i} \) reciprocal to the tangent vector basis \( \Bx_i \) can be calculated subject to the requirement \( \Bx_i \cdot \Bx^j = \delta_i^j \).
  • The vector derivative \(\boldpartial\), the projection of the gradient onto the tangent space of the manifold, is defined by
    \begin{equation}\label{eqn:fundamentalTheoremOfCalculus:100}
    \boldpartial = \Bx^i \partial_i = \sum_{i=1}^k \Bx_i \PD{u^i}{}.
    \end{equation}

  • The volume element is defined by
    \begin{equation}\label{eqn:fundamentalTheoremOfCalculus:60}
    d^k \Bx = d\Bx_1 \wedge d\Bx_2 \cdots \wedge d\Bx_k,
    \end{equation}

    where

    \begin{equation}\label{eqn:fundamentalTheoremOfCalculus:80}
    d\Bx_k = \Bx_k du^k,\qquad \text{(no sum)}.
    \end{equation}

  • The volume element is non-zero on the manifold, or \( \Bx_1 \wedge \cdots \wedge \Bx_k \ne 0 \).
  • The surface area element \( d^{k-1} \Bx \), is defined by
    \begin{equation}\label{eqn:fundamentalTheoremOfCalculus:120}
    d^{k-1} \Bx = \sum_{i = 1}^k (-1)^{k-i} d\Bx_1 \wedge d\Bx_2 \cdots \widehat{d\Bx_i} \cdots \wedge d\Bx_k,
    \end{equation}

    where \( \widehat{d\Bx_i} \) indicates the omission of \( d\Bx_i \).

  • My proof for this theorem was restricted to a simple “rectangular” volume parameterized by the ranges
    \(
    [u^1(0), u^1(1) ] \otimes
    [u^2(0), u^2(1) ] \otimes \cdots \otimes
    [u^k(0), u^k(1) ] \)

  • The precise meaning that should be given to oriented area integral is
    \begin{equation}\label{eqn:fundamentalTheoremOfCalculus:140}
    \oint_{\partial V} d^{k-1} \Bx \cdot F
    =
    \sum_{i = 1}^k (-1)^{k-i} \int \evalrange{
    \lr{ \lr{ d\Bx_1 \wedge d\Bx_2 \cdots \widehat{d\Bx_i} \cdots \wedge d\Bx_k } \cdot F }
    }{u^i = u^i(0)}{u^i(1)},
    \end{equation}

    where both the a area form and the blade \( F \) are evaluated at the end points of the parameterization range.

After the work of stating exactly what is meant by this theorem, most of the proof follows from the fact that for \( s < k \) the volume curl dot product can be expanded as \begin{equation}\label{eqn:fundamentalTheoremOfCalculus:160} \int_V d^k \Bx \cdot (\boldpartial \wedge F) = \int_V d^k \Bx \cdot (\Bx^i \wedge \partial_i F) = \int_V \lr{ d^k \Bx \cdot \Bx^i } \cdot \partial_i F. \end{equation} Each of the \(du^i\) integrals can be evaluated directly, since each of the remaining \(d\Bx_j = du^j \PDi{u^j}{}, i \ne j \) is calculated with \( u^i \) held fixed. This allows for the integration over a ``rectangular'' parameterization region, proving the theorem for such a volume parameterization. A more general proof requires a triangulation of the volume and surface, but the basic principle of the theorem is evident, without that additional work.

Fundamental Theorem of Calculus

There is a Geometric Algebra generalization of Stokes theorem that does not have the blade grade restriction of Stokes theorem. In [2] this is stated as

\begin{equation}\label{eqn:fundamentalTheoremOfCalculus:180}
\int_V d^k \Bx \boldpartial F = \oint_{\partial V} d^{k-1} \Bx F.
\end{equation}

A similar expression is used in [1] where it is also pointed out there is a variant with the vector derivative acting to the left

\begin{equation}\label{eqn:fundamentalTheoremOfCalculus:200}
\int_V F d^k \Bx \boldpartial = \oint_{\partial V} F d^{k-1} \Bx.
\end{equation}

In [3] it is pointed out that a bidirectional formulation is possible, providing the most general expression of the Fundamental Theorem of (Geometric) Calculus

\begin{equation}\label{eqn:fundamentalTheoremOfCalculus:220}
\boxed{
\int_V F d^k \Bx \boldpartial G = \oint_{\partial V} F d^{k-1} \Bx G.
}
\end{equation}

Here the vector derivative acts both to the left and right on \( F \) and \( G \). The specific action of this operator is
\begin{equation}\label{eqn:fundamentalTheoremOfCalculus:240}
\begin{aligned}
F \boldpartial G
&=
(F \boldpartial) G
+
F (\boldpartial G) \\
&=
(\partial_i F) \Bx^i G
+
F \Bx^i (\partial_i G).
\end{aligned}
\end{equation}

The fundamental theorem can be demonstrated by direct expansion. With the vector derivative \( \boldpartial \) and its partials \( \partial_i \) acting bidirectionally, that is

\begin{equation}\label{eqn:fundamentalTheoremOfCalculus:260}
\begin{aligned}
\int_V F d^k \Bx \boldpartial G
&=
\int_V F d^k \Bx \Bx^i \partial_i G \\
&=
\int_V F \lr{ d^k \Bx \cdot \Bx^i + d^k \Bx \wedge \Bx^i } \partial_i G.
\end{aligned}
\end{equation}

Both the reciprocal frame vectors and the curvilinear basis span the tangent space of the manifold, since we can write any reciprocal frame vector as a set of projections in the curvilinear basis

\begin{equation}\label{eqn:fundamentalTheoremOfCalculus:280}
\Bx^i = \sum_j \lr{ \Bx^i \cdot \Bx^j } \Bx_j,
\end{equation}

so \( \Bx^i \in sectionpan \setlr{ \Bx_j, j \in [1,k] } \).
This means that \( d^k \Bx \wedge \Bx^i = 0 \), and

\begin{equation}\label{eqn:fundamentalTheoremOfCalculus:300}
\begin{aligned}
\int_V F d^k \Bx \boldpartial G
&=
\int_V F \lr{ d^k \Bx \cdot \Bx^i } \partial_i G \\
&=
\sum_{i = 1}^{k}
\int_V
du^1 du^2 \cdots \widehat{ du^i} \cdots du^k
F \lr{
(-1)^{k-i}
\Bx_1 \wedge \Bx_2 \cdots \widehat{\Bx_i} \cdots \wedge \Bx_k } \partial_i G du^i \\
&=
\sum_{i = 1}^{k}
(-1)^{k-i}
\int_{u^1}
\int_{u^2}
\cdots
\int_{u^{i-1}}
\int_{u^{i+1}}
\cdots
\int_{u^k}
\evalrange{ \lr{
F d\Bx_1 \wedge d\Bx_2 \cdots \widehat{d\Bx_i} \cdots \wedge d\Bx_k G
}
}{u^i = u^i(0)}{u^i(1)}.
\end{aligned}
\end{equation}

Adding in the same notational sugar that we used in Stokes theorem, this proves the Fundamental theorem \ref{eqn:fundamentalTheoremOfCalculus:220} for “rectangular” parameterizations. Note that such a parameterization need not actually be rectangular.

Example: Application to Maxwell’s equation

{example:fundamentalTheoremOfCalculus:1}

Maxwell’s equation is an example of a first order gradient equation

\begin{equation}\label{eqn:fundamentalTheoremOfCalculus:320}
\grad F = \inv{\epsilon_0 c} J.
\end{equation}

Integrating over a four-volume (where the vector derivative equals the gradient), and applying the Fundamental theorem, we have

\begin{equation}\label{eqn:fundamentalTheoremOfCalculus:340}
\inv{\epsilon_0 c} \int d^4 x J = \oint d^3 x F.
\end{equation}

Observe that the surface area element product with \( F \) has both vector and trivector terms. This can be demonstrated by considering some examples

\begin{equation}\label{eqn:fundamentalTheoremOfCalculus:360}
\begin{aligned}
\gamma_{012} \gamma_{01} &\propto \gamma_2 \\
\gamma_{012} \gamma_{23} &\propto \gamma_{023}.
\end{aligned}
\end{equation}

On the other hand, the four volume integral of \( J \) has only trivector parts. This means that the integral can be split into a pair of same-grade equations

\begin{equation}\label{eqn:fundamentalTheoremOfCalculus:380}
\begin{aligned}
\inv{\epsilon_0 c} \int d^4 x \cdot J &=
\oint \gpgradethree{ d^3 x F} \\
0 &=
\oint d^3 x \cdot F.
\end{aligned}
\end{equation}

The first can be put into a slightly tidier form using a duality transformation
\begin{equation}\label{eqn:fundamentalTheoremOfCalculus:400}
\begin{aligned}
\gpgradethree{ d^3 x F}
&=
-\gpgradethree{ d^3 x I^2 F} \\
&=
\gpgradethree{ I d^3 x I F} \\
&=
(I d^3 x) \wedge (I F).
\end{aligned}
\end{equation}

Letting \( n \Abs{d^3 x} = I d^3 x \), this gives

\begin{equation}\label{eqn:fundamentalTheoremOfCalculus:420}
\oint \Abs{d^3 x} n \wedge (I F) = \inv{\epsilon_0 c} \int d^4 x \cdot J.
\end{equation}

Note that this normal is normal to a three-volume subspace of the spacetime volume. For example, if one component of that spacetime surface area element is \( \gamma_{012} c dt dx dy \), then the normal to that area component is \( \gamma_3 \).

A second set of duality transformations

\begin{equation}\label{eqn:fundamentalTheoremOfCalculus:440}
\begin{aligned}
n \wedge (IF)
&=
\gpgradethree{ n I F} \\
&=
-\gpgradethree{ I n F} \\
&=
-\gpgradethree{ I (n \cdot F)} \\
&=
-I (n \cdot F),
\end{aligned}
\end{equation}

and
\begin{equation}\label{eqn:fundamentalTheoremOfCalculus:460}
\begin{aligned}
I d^4 x \cdot J
&=
\gpgradeone{ I d^4 x \cdot J } \\
&=
\gpgradeone{ I d^4 x J } \\
&=
\gpgradeone{ (I d^4 x) J } \\
&=
(I d^4 x) J,
\end{aligned}
\end{equation}

can further tidy things up, leaving us with

\begin{equation}\label{eqn:fundamentalTheoremOfCalculus:500}
\boxed{
\begin{aligned}
\oint \Abs{d^3 x} n \cdot F &= \inv{\epsilon_0 c} \int (I d^4 x) J \\
\oint d^3 x \cdot F &= 0.
\end{aligned}
}
\end{equation}

The Fundamental theorem of calculus immediately provides relations between the Faraday bivector \( F \) and the four-current \( J \).

References

[1] C. Doran and A.N. Lasenby. Geometric algebra for physicists. Cambridge University Press New York, Cambridge, UK, 1st edition, 2003.

[2] A. Macdonald. Vector and Geometric Calculus. CreateSpace Independent Publishing Platform, 2012.

[3] Garret Sobczyk and Omar Le\’on S\’anchez. Fundamental theorem of calculus. Advances in Applied Clifford Algebras, 21\penalty0 (1):\penalty0 221–231, 2011. URL https://arxiv.org/abs/0809.4526.

Geometric algebra notes collection split into two volumes

November 10, 2015 math and physics play , , , , , , , , , , , , ,

I’ve now split my (way too big) Exploring physics with Geometric Algebra into two volumes:

Each of these is now a much more manageable size, which should facilitate removing the redundancies in these notes, and making them more properly book like.

Also note I’ve also previously moved “Exploring Geometric Algebra” content related to:

  • Lagrangian’s
  • Hamiltonian’s
  • Noether’s theorem

into my classical mechanics collection (449 pages).

Schwartz inequality in bra-ket notation

July 6, 2015 phy1520 , , , , , ,

[Click here for a PDF of this post with nicer formatting]

Motivation

In [2] the Schwartz inequality

\begin{equation}\label{eqn:qmSchwartz:20}
\boxed{
\braket{a}{a}
\braket{b}{b}
\ge \Abs{\braket{a}{b}}^2,
}
\end{equation}

is used in the derivation of the uncertainty relation. The proof of the Schwartz inequality uses a sneaky substitution that doesn’t seem obvious, and is even less obvious since there is a typo in the value to be substituted. Let’s understand where that sneakiness is coming from.

Without being sneaky

My ancient first year linear algebra text [1] contains a non-sneaky proof, but it only works for real vector spaces. Recast in bra-ket notation, this method examines the bounds of the norms of sums and differences of unit states (i.e. \( \braket{a}{a} = \braket{b}{b} = 1 \).)

\begin{equation}\label{eqn:qmSchwartz:40}
\braket{a – b}{a – b}
= \braket{a}{a} + \braket{b}{b} – \braket{a}{b} – \braket{b}{a}
= 2 – 2 \textrm{Re} \braket{a}{b}
\ge 0,
\end{equation}

so
\begin{equation}\label{eqn:qmSchwartz:60}
1 \ge \textrm{Re} \braket{a}{b}.
\end{equation}

Similarily

\begin{equation}\label{eqn:qmSchwartz:80}
\braket{a + b}{a + b}
= \braket{a}{a} + \braket{b}{b} + \braket{a}{b} + \braket{b}{a}
= 2 + 2 \textrm{Re} \braket{a}{b}
\ge 0,
\end{equation}

so
\begin{equation}\label{eqn:qmSchwartz:100}
\textrm{Re} \braket{a}{b} \ge -1.
\end{equation}

This means that for normalized state vectors

\begin{equation}\label{eqn:qmSchwartz:120}
-1 \le \textrm{Re} \braket{a}{b} \le 1,
\end{equation}

or
\begin{equation}\label{eqn:qmSchwartz:140}
\Abs{\textrm{Re} \braket{a}{b}} \le 1.
\end{equation}

Writing out the unit vectors explicitly, that last inequality is

\begin{equation}\label{eqn:qmSchwartz:180}
\Abs{ \textrm{Re} \braket{ \frac{a}{\sqrt{\braket{a}{a}}} }{ \frac{b}{\sqrt{\braket{b}{b}}} } } \le 1,
\end{equation}

squaring and rearranging gives

\begin{equation}\label{eqn:qmSchwartz:200}
\Abs{\textrm{Re} \braket{a}{b}}^2 \le
\braket{a}{a}
\braket{b}{b}.
\end{equation}

This is similar to, but not identical to the Schwartz inequality. Since \( \Abs{\textrm{Re} \braket{a}{b}} \le \Abs{\braket{a}{b}} \) the Schwartz inequality cannot be demonstrated with this argument. This first year algebra method works nicely for demonstrating the inequality for real vector spaces, so a different argument is required for a complex vector space (i.e. quantum mechanics state space.)

Arguing with projected and rejected components

Notice that the equality condition in the inequality holds when the vectors are colinear, and the largest inequality holds when the vectors are normal to each other. Given those geometrical observations, it seems reasonable to examine the norms of projected or rejected components of a vector. To do so in bra-ket notation, the correct form of a projection operation must be determined. Care is required to get the ordering of the bra-kets right when expressing such a projection.

Suppose we wish to calculation the rejection of \( \ket{a} \) from \( \ket{b} \), that is \( \ket{b – \alpha a}\), such that

\begin{equation}\label{eqn:qmSchwartz:220}
0
= \braket{a}{b – \alpha a}
= \braket{a}{b} – \alpha \braket{a}{a},
\end{equation}

or
\begin{equation}\label{eqn:qmSchwartz:240}
\alpha =
\frac{\braket{a}{b} }{ \braket{a}{a} }.
\end{equation}

Therefore, the projection of \( \ket{b} \) on \( \ket{a} \) is

\begin{equation}\label{eqn:qmSchwartz:260}
\textrm{Proj}_{\ket{a}} \ket{b}
= \frac{\braket{a}{b} }{ \braket{a}{a} } \ket{a}
= \frac{\braket{b}{a}^\conj }{ \braket{a}{a} } \ket{a}.
\end{equation}

The conventional way to write this in QM is in the operator form

\begin{equation}\label{eqn:qmSchwartz:300}
\textrm{Proj}_{\ket{a}} \ket{b}
= \frac{\ket{a}\bra{a}}{\braket{a}{a}} \ket{b}.
\end{equation}

In this form the rejection of \( \ket{a} \) from \( \ket{b} \) can be expressed as

\begin{equation}\label{eqn:qmSchwartz:280}
\textrm{Rej}_{\ket{a}} \ket{b} = \ket{b} – \frac{\ket{a}\bra{a}}{\braket{a}{a}} \ket{b}.
\end{equation}

This state vector is normal to \( \ket{a} \) as desired

\begin{equation}\label{eqn:qmSchwartz:320}
\braket{a}{b – \frac{\braket{a}{b} }{ \braket{a}{a} } a }
=
\braket{a}{ b} – \frac{ \braket{a}{b} }{ \braket{a}{a} } \braket{a}{a}
=
\braket{a}{ b} – \braket{a}{b}
= 0.
\end{equation}

How about it’s length? That is

\begin{equation}\label{eqn:qmSchwartz:340}
\begin{aligned}
\braket{b – \frac{\braket{a}{b} }{ \braket{a}{a} } a}{b – \frac{\braket{a}{b} }{ \braket{a}{a} } a }
&=
\braket{b}{b} – 2 \frac{\Abs{\braket{a}{b}}^2}{\braket{a}{a}} +\frac{\Abs{\braket{a}{b}}^2 }{ \braket{a}{a}^2 } \braket{a}{a} \\
&=
\braket{b}{b} – \frac{\Abs{\braket{a}{b}}^2}{\braket{a}{a}}.
\end{aligned}
\end{equation}

Observe that this must be greater to or equal to zero, so

\begin{equation}\label{eqn:qmSchwartz:360}
\braket{b}{b} \ge \frac{ \Abs{ \braket{a}{b} }^2 }{ \braket{a}{a} }.
\end{equation}

Rearranging this gives \ref{eqn:qmSchwartz:20} as desired. The Schwartz proof in [2] obscures the geometry involved and starts with

\begin{equation}\label{eqn:qmSchwartz:380}
\braket{b + \lambda a}{b + \lambda a} \ge 0,
\end{equation}

where the “proof” is nothing more than a statement that one can “pick” \( \lambda = -\braket{b}{a}/\braket{a}{a} \). The Pythagorean context of the Schwartz inequality is not mentioned, and without thinking about it, one is left wondering what sort of magic hat that \( \lambda \) selection came from.

References

[1] W Keith Nicholson. Elementary linear algebra, with applications. PWS-Kent Publishing Company, 1990.

[2] Jun John Sakurai and Jim J Napolitano. Modern quantum mechanics. Pearson Higher Ed, 2014.

Parallel projection of electromagnetic fields with Geometric Algebra

March 8, 2015 ece1229 , , , ,

[Click here for a PDF of this post with nicer formatting]

When computing the components of a polarized reflecting ray that were parallel or not-parallel to the reflecting surface, it was found that the electric and magnetic fields could be written as

\begin{equation}\label{eqn:gaFieldProjection:280}
\BE = \lr{ \BE \cdot \pcap } \pcap + \lr{ \BE \cdot \qcap } \qcap = E_\parallel \pcap + E_\perp \qcap
\end{equation}
\begin{equation}\label{eqn:gaFieldProjection:300}
\BH = \lr{ \BH \cdot \pcap } \pcap + \lr{ \BH \cdot \qcap } \qcap = H_\parallel \pcap + H_\perp \qcap.
\end{equation}

where a unit vector \( \pcap \) that lies both in the reflecting plane and in the electromagnetic plane (tangential to the wave vector direction) was

\begin{equation}\label{eqn:gaFieldProjection:340}
\pcap = \frac{\kcap \cross \ncap}{\Abs{\kcap \cross \ncap}}
\end{equation}
\begin{equation}\label{eqn:gaFieldProjection:360}
\qcap = \kcap \cross \pcap.
\end{equation}

Here \( \qcap \) is perpendicular to \( \pcap \) but lies in the electromagnetic plane. This logically subdivides the fields into two pairs, one with the electric field parallel to the reflection plane

\begin{equation}\label{eqn:gaFieldProjection:240}
\begin{aligned}
\BE_1 &= \lr{ \BE \cdot \pcap } \pcap = E_\parallel \pcap \\
\BH_1 &= \lr{ \BH \cdot \qcap } \qcap = H_\perp \qcap,
\end{aligned}
\end{equation}

and one with the magnetic field parallel to the reflection plane

\begin{equation}\label{eqn:gaFieldProjection:380}
\begin{aligned}
\BH_2 &= \lr{ \BH \cdot \pcap } \pcap = H_\parallel \pcap \\
\BE_2 &= \lr{ \BE \cdot \qcap } \qcap = E_\perp \qcap.
\end{aligned}
\end{equation}

Expressed in Geometric Algebra form, each of these pairs of fields should be thought of as components of a single multivector field. That is

\begin{equation}\label{eqn:gaFieldProjection:400}
F_1 = \BE_1 + c \mu_0 \BH_1 I
\end{equation}
\begin{equation}\label{eqn:gaFieldProjection:460}
F_2 = \BE_2 + c \mu_0 \BH_2 I
\end{equation}

where the original total field is

\begin{equation}\label{eqn:gaFieldProjection:420}
F = \BE + c \mu_0 \BH I.
\end{equation}

In \ref{eqn:gaFieldProjection:400} we have a composite projection operation, finding the portion of the electric field that lies in the reflection plane, and simultaneously finding the component of the magnetic field that lies perpendicular to that (while still lying in the tangential plane of the electromagnetic field). In \ref{eqn:gaFieldProjection:460} the magnetic field is projected onto the reflection plane and a component of the electric field that lies in the tangential (to the wave vector direction) plane is computed.

If we operate only on the complete multivector field, can we find these composite projection field components in a single operation, instead of working with the individual electric and magnetic fields?

Working towards this goal, it is worthwhile to point out consequences of the assumption that the fields are plane wave (or equivalently far field spherical waves). For such a wave we have

\begin{equation}\label{eqn:gaFieldProjection:480}
\begin{aligned}
\BH
&= \inv{\mu_0} \kcap \cross \BE \\
&= \inv{\mu_0} (-I)\lr{ \kcap \wedge \BE } \\
&= \inv{\mu_0} (-I)\lr{ \kcap \BE – \kcap \cdot \BE} \\
&= -\frac{I}{\mu_0} \kcap \BE,
\end{aligned}
\end{equation}

or

\begin{equation}\label{eqn:gaFieldProjection:520}
\mu_0 \BH I = \kcap \BE.
\end{equation}

This made use of the identity \( \Ba \wedge \Bb = I \lr{\Ba \cross \Bb} \), and the fact that the electric field is perpendicular to the wave vector direction. The total multivector field is

\begin{equation}\label{eqn:gaFieldProjection:500}
\begin{aligned}
F
&= \BE + c \mu_0 \BH I \\
&= \lr{ 1 + c \kcap } \BE.
\end{aligned}
\end{equation}

Expansion of magnetic field component that is perpendicular to the reflection plane gives

\begin{equation}\label{eqn:gaFieldProjection:540}
\begin{aligned}
\mu_0 H_\perp
&= \mu_0 \BH \cdot \qcap \\
&= \gpgradezero{ \lr{-\kcap \BE I} \qcap } \\
&= -\gpgradezero{ \kcap \BE I \lr{ \kcap \cross \pcap} } \\
&= \gpgradezero{ \kcap \BE I I \lr{ \kcap \wedge \pcap} } \\
&= -\gpgradezero{ \kcap \BE \kcap \pcap } \\
&= \gpgradezero{ \kcap \kcap \BE \pcap } \\
&= \BE \cdot \pcap,
\end{aligned}
\end{equation}

so

\begin{equation}\label{eqn:gaFieldProjection:560}
F_1
= (\pcap + c I \qcap ) \BE \cdot \pcap.
\end{equation}

Since \( \qcap \kcap \pcap = I \), the component of the complete multivector field in the \( \pcap \) direction is

\begin{equation}\label{eqn:gaFieldProjection:580}
\begin{aligned}
F_1
&= (\pcap – c \pcap \kcap ) \BE \cdot \pcap \\
&= \pcap (1 – c \kcap ) \BE \cdot \pcap \\
&= (1 + c \kcap ) \pcap \BE \cdot \pcap.
\end{aligned}
\end{equation}

It is reasonable to expect that \( F_2 \) has a similar form, but with \( \pcap \rightarrow \qcap \). This is verified by expansion

\begin{equation}\label{eqn:gaFieldProjection:600}
\begin{aligned}
F_2
&= E_\perp \qcap + c \lr{ \mu_0 H_\parallel } \pcap I \\
&= \lr{\BE \cdot \qcap} \qcap + c \gpgradezero{ – \kcap \BE I \kcap \qcap I } \lr{\kcap \qcap I} I \\
&= \lr{\BE \cdot \qcap} \qcap + c \gpgradezero{ \kcap \BE \kcap \qcap } \kcap \qcap (-1) \\
&= \lr{\BE \cdot \qcap} \qcap + c \gpgradezero{ \kcap \BE (-\qcap \kcap) } \kcap \qcap (-1) \\
&= \lr{\BE \cdot \qcap} \qcap + c \gpgradezero{ \kcap \kcap \BE \qcap } \kcap \qcap \\
&= \lr{ 1 + c \kcap } \qcap \lr{ \BE \cdot \qcap }
\end{aligned}
\end{equation}

This and \ref{eqn:gaFieldProjection:580} before that makes a lot of sense. The original field can be written

\begin{equation}\label{eqn:gaFieldProjection:620}
F = \lr{ \Ecap + c \lr{ \kcap \cross \Ecap } I } \BE \cdot \Ecap,
\end{equation}

where the leading multivector term contains all the directional dependence of the electric and magnetic field components, and the trailing scalar has the magnitude of the field with respect to the reference direction \( \Ecap \).

We have the same structure after projecting \( \BE \) onto either the \( \pcap \), or \( \qcap \) directions respectively

\begin{equation}\label{eqn:gaFieldProjection:660}
F_1 = \lr{ \pcap + c \lr{ \kcap \cross \pcap } I} \BE \cdot \pcap
\end{equation}
\begin{equation}\label{eqn:gaFieldProjection:680}
F_2 = \lr{ \qcap + c \lr{ \kcap \cross \qcap } I} \BE \cdot \qcap.
\end{equation}

The next question is how to achieve this projection operation directly in terms of \( F \) and \( \pcap, \qcap \), without resorting to expression of \( F \) in terms of \( \BE \), and \( \BB \). I’ve not yet been able to determine the structure of that operation.