H = (p-(e/c)A)^2/2m = p^2/2m - (e/2mc)(p.A + A.p) + (e^2/2mc^2) A^2.
For a uniform magnetic field we can use the gauge A = B x r/2, for which H becomes
H = p^2/2m - (e/2mc) L.B + (e^2B^2/8mc^2)(x^2 + y^2).
The term linear in B is of the form -mu.B, where mu = (e/2mc) L is the
magnetic moment,
so gamma = e/2mc is the gyromagnetic ratio for orbital motion.
This term is associated with
paramagnetism, since a system with a permanent dipole moment
can lower its energy by
orienting the magnetic moment parallel to B. (If B is inhomogeneous,
the system will
feel a force pulling it into the region where B is strongest.)
The term quadratic in B is always positive, and is associated with diamagnetism.
(A system with
no permanent magnetic moment will have only positive energy in a magnetic
field, hence will be
repelled from a region of strong field.)
# Let's compare orders of magnitude:
Set L = hbar, and x^2 + y^2 = r^2. Then, up to factors of pi and 2,
diamag. term / paramag. term = B r^2/Phi_0 ~ 10^-6 (Br^2/T-Angstroms^2).
For an atomic scale system at a B of one Tesla, the diamagnetic term
is a million times
smaller than the paramagnetic term.
Comparing the paramagnetic term to the Coulomb energy in an atom on
the other hand, we have
paramag term/ Rydberg = B a^2/Phi_0, where a is the Bohr radius,
which is again 10^-6 for
a one Tesla field. So the paramagnetic term is small for any laboratory
fields. It is still important
however, mainly because it contributes to the Zeeman effect by
the splitting of degenerate levels
according to the value of L_z.
E = -grad phi - (1/c)d_t A, B = curl A,
Conversely, if potentials define E and B in this way, they automatically
satisfy the sourceless Maxwell eqns.
Potentials (A', phi') will define the same E and B fields as (A, phi)
if they are related by a gauge transformation:
A' = A + grad g , phi' = phi - (1/c)d_t g,
where the gauge parameter g(x,t) is an arbitrary function of
space and time.
# To simplify notation here, let us adopt units with hbar=c=1.
# The Hamiltonian H = [p - eA]^2/2m + e phi is not gauge invariant,
but the physics is. That is,
there exists a unitary transformation U such that
(i d_t - H') U = U (i d_t - H) , namely, U = exp(i eg).
In other words, the gauge transformation of the potentials is accompanied
by a transformation of the wave function,
to psi' = exp(ieg) psi. Then if psi satisfies the Schrodinger eqn with
H, psi' will satisfy it with H'.
# The probability density psi*psi is unchanged by a gauge transformation.
The conserved current in the
presence of a background magnetic field is modified in such a way that
it is gauge invariant:
j = Re(psi* v psi), with v = (p - eA)/m.
# The gauge parameter g(x,t) is fixed just once, but since the charge
q appears in the Hamiltonian, the appropriate
U = exp(iqg) for a given particle type depends on q.
# The coupling to electromagnetic fields makes quantum theory invariant
under local phase transformations of the
wave function. In a sense, local phase symmetry implies the existence
of electromagnetism. This symmetry in fact
implies conservation of charge, and it fixes the form of the coupling
between the electromagetic potentials and the
charged particles. The dynamics of the electromagnetic field itself---i.e.
the Maxwell equations with source terms---
is supplemental information. These equations however are the only equations
consistent with Lorentz invariance
and gauge invariance that are second order in derivatives of
the potentials. This is easy to see with Lorentz-covariant
formalism, but I will not go into it here.
# The notion of electromagnetic gauge covariance is the tip of
an iceberg. One way to glimpse the iceberg is to
write (p_i - eA_i) = -i (d_i - ie A_i) =: -i D_i, and
(id_t - ephi) = i (d_t + ie phi) =: iD_t.
The operators D_i and D_t are gauge-covariant derivatives, in
the sense that D'_i psi' = U D_i psi, and
similarly for D_t, i.e. D U = U D. The gauge transformation of D is
cancelled by the derivative of U.
The Schrodinger equation in tems of the gauge covariant derivatives
is iD_t psi = -(1/2m) D_iD_i psi.
A differential eqaution constructed with gauge covariant derivatives
will transofrm covariantly under gauge
transformations, so if satisfied in one gauge it will be satisfied
in all gauges, i.e. it will be gauge invariant.
# In general relativity the role of the gauge transformations
is played by general coordinate transformations,
and the equations are expressed in terms of a derivative operator that
is is covariant with respect to these,
so that the equations are valid for all coordinate systems.
# In 1954 Yang and Mills invented a generalization of electromagnetic
gauge invariance using matrix-valued
potentials and gauge transformations. Their motivation was that various
symmetries among particles had been
suspected, so it seemed natural to group together certain particles
into multiplets, which we can
think of as multicomponent wavefunctions Psi. Yang and Mills
were trying to construct a theory that would be
symmetric under local unitary transformations, Psi -> Psi' = U Psi,
where now U is a unitary matrix which is
an element of some group. A special case of this is electromagnetism,
where U is the 1x1 matrix exp(ieg).
The phases comprise the group called U(1). The group U(n) is
the n x n unitary matrices, while SU(n) is the
subgroup of those with unit determinant. The matrices do not commute,
i.e. the group is non-abelian, which
requires a generalization of the gauge transformation law. Suppose
we want the derivative D = d - ieA to be
gauge-covariant in the sense that D' U = U D. Then
(d - ieA') U = U d - ieA' U + (dU) = U(d - i eA) = U d - ie U
A, which implies that
-ieA' U + (dU) = -i eU A, so
A' = U A U^-1 - (i/e) (dU) U^-1.
If U=exp(ieg) this reduces to the usual electromagnetic gauge transformation
A' = A + e dg,
but for non-Abelian groups it is different. In particular, the gauge
potential A must be matrix-valued.
It can be seen that the matrix i dU U^-1 is hermitian if U is unitary,
so A should be hermitian. If U has
unit determinant, then in addition A should be trace-free. In addition
to the gauge transformation rule,
Yang & Mills worked out the analog of Maxwell's equations, which
turned out to necessarily be
a non-linear equation. The particles corresponding to the potential
fields A are called gauge bosons, and are
analogous to the photon, but due to the non-linearity they interact
directly with each other, unlike photons.
# Initially Yang-Mills theory looked intriguing, but problematic,
since the gauge particles were, like
the photon, necessarily massless, since a mass term in their equation
would violate gauge invariance,
and no extra massless particles were seen in nature.
Nevertheless the idea looked very nice, and eventually it turned out
to be correct.
The standard model of particle physics has gauge group SU(3) x SU(2)
x U(1).
The SU(3) is the QCD color gauge group. Since there are 8 independent
3 x 3 tracefree hermitian matrices,
there are 8 gauge fields, called gluons. The SU(2) x U(1) part
is the electroweak group. The electromagnetic
U(1) is a combination of a U(1) subgroup of SU(2) and the explicit
U(1) factor. The remaining three generators
are the charged W+ and W- and the neutral Z boson.
The gluons are in a sense massless, however
due to the dynamical binding mechanism, called color confinement,
gluons do not propagate freely.
The W and Z bosons on the other hand do propagate freely (until they
decay), but they are massive.
This is possible, without violating gauge invariance, thanks to the
phenomenon of
spontaneous symmetry breaking: one of the fields to which they
couple, the Higgs field, has a vacuum
expectation value, which gives them an effective mass.