Bethe Theory

2.16. Bethe Theory#

part of

MSE672: Introduction to Transmission Electron Microscopy

Spring 2025
by Gerd Duscher

Microscopy Facilities
Institute of Advanced Materials & Manufacturing
Materials Science & Engineering
The University of Tennessee, Knoxville

Background and methods for the analysis and quantification of data acquired with transmission electron microscopes

2.16.1. Import numerical and plotting python packages#

Import the python packages that we will use:

We will use only the basic numerical (numpy) and plotting (pyplot of matplotlib) libraries:

%matplotlib  widget
import matplotlib.pyplot as plt
import numpy as np

2.16.2. Notation#

In the following I will introduce the Bethe diffraction theory in Dirac’s bra–ket notation.

$$|\psi\rangle = \text{ket} = \psi_{1} |1\rangle + \psi_{2} |2\rangle = \begin{pmatrix} \psi_{1} \\ \psi_{2} \end{pmatrix} = \text{column vector}$$

$$\langle\phi| = \text{bra} = \phi_{1}^{*} \langle 1| + \phi_{2}^{*} \langle 2| = (\phi_{1}^{*}, \phi_{2}^{*}) = \text{row vector}$$

Consequently, the Dirac "bracket" is:

$$\langle\phi|\psi\rangle = (\phi_{1}^{*}, \phi_{2}^{*}) \begin{pmatrix} \psi_{1} \\ \psi_{2} \end{pmatrix} = \phi_{1}^{*} \psi_{1} + \phi_{2}^{*} \psi_{2}$$

Similarly,

$$|\psi\rangle\langle\phi| = \text{a matrix}$$

and this will make it easier to follow the equations of many beams in diffraction, which otherwise can become very messy.

Throughout this chapter we will use the notation for the wave vector common in solid state physics and not the one found in many crystallography books ( $k = 1 / λ$ ): \begin{equation} k = \frac{2\pi}{\lambda} = \frac{\sqrt{2meE}}{\hbar} \end{equation} with:

$λ$ : wavelength
$m = γ m_{0}$ : relativistically corrected mass of the electron ( $m_{0}$ : rest mass of the electron)
$γ = 1 + \frac{e E}{2 m_{0} c^{2}}$ : relativistic correction necessary for $E$ above 2 kV
$e$ : electron charge
$E$ : acceleration voltage
$c$ : speed of light in vacuum
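As a quick numerical check of these definitions, the following sketch evaluates the relativistic wavelength and the wave vector for a given acceleration voltage. The function name `electron_wavelength` and the hard-coded SI constants are choices made for this illustration only, and the sketch assumes the usual relativistic factor $γ = 1 + e E / (2 m_{0} c^{2})$.

```python
import numpy as np

# SI constants (CODATA 2018 values), hard-coded to keep the sketch self-contained
H_PLANCK = 6.62607015e-34      # Planck constant in J s
M_0 = 9.1093837015e-31         # electron rest mass in kg
E_CHARGE = 1.602176634e-19     # elementary charge in C
C_LIGHT = 299792458.0          # speed of light in m/s


def electron_wavelength(voltage):
    """Relativistic electron wavelength in m for an acceleration voltage in V."""
    gamma = 1.0 + E_CHARGE * voltage / (2.0 * M_0 * C_LIGHT**2)
    momentum = np.sqrt(2.0 * gamma * M_0 * E_CHARGE * voltage)
    return H_PLANCK / momentum


wavelength = electron_wavelength(100e3)   # 100 kV
k = 2.0 * np.pi / wavelength              # solid state convention k = 2 pi / lambda
print(f"wavelength = {wavelength * 1e12:.3f} pm")
print(f"k = {k * 1e-10:.1f} 1/Angstrom")
```

For 100 kV the wavelength comes out at roughly 3.7 pm, which is the value usually quoted for this voltage.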

This is the short version of quantum mechanics. The axioms of quantum mechanics are:

1. The state of a system is described by its state vector $| ψ ⟩$ .
2. Observables are expressed by Hermitian operators $A$ .
3. The mean value of an observable is given by $⟨ A ⟩ = ⟨ ψ | A | ψ ⟩$ .
4. The time dependence is given by the time-dependent Schrödinger equation: $H | ψ, t ⟩ = i ℏ \frac{\partial}{\partial t} | ψ, t ⟩$
5. If you measure $A$ , the system changes to $| n ⟩$ if $a_{n}$ was measured.

Axioms 2. and 3. give:

If a system is in the state $| ψ ⟩ = \sum_{n} c_{n} | n ⟩$

with $c_{n} = ⟨ n | ψ ⟩$ , where $| n ⟩$ are the eigenstates of $A$ , meaning $A | n ⟩ = a_{n} | n ⟩$ , then the probability to find the value $a_{n}$ when measuring $A$ is given by $| c_{n} |^{2}$ .
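The bra–ket rules translate directly into linear algebra with numpy. The following is a minimal sketch with a made-up two-state system: the state `psi` and the operator `A` are arbitrary numbers chosen for illustration, not anything specific to diffraction.

```python
import numpy as np

# Hypothetical two-state example; the numbers are arbitrary.
psi = np.array([1.0 + 1.0j, 2.0])               # ket |psi> as a column vector
psi = psi / np.sqrt(np.vdot(psi, psi).real)     # normalize so that <psi|psi> = 1

# A Hermitian operator (observable) written as a matrix
A = np.array([[1.0, 1.0j],
              [-1.0j, 2.0]])

# Eigenvalues a_n and eigenstates |n> (the columns of `states`)
a_n, states = np.linalg.eigh(A)

# Expansion coefficients c_n = <n|psi>; the bra is the conjugate transpose of the ket
c_n = states.conj().T @ psi
probabilities = np.abs(c_n)**2

# Mean value, once from the probabilities and once as <psi|A|psi>
mean_from_probabilities = np.sum(probabilities * a_n)
mean_from_bracket = np.vdot(psi, A @ psi).real

print("probabilities:", probabilities, "sum:", probabilities.sum())
print("<A>:", mean_from_probabilities, mean_from_bracket)

# The outer product |psi><phi| is a matrix
phi = np.array([0.0, 1.0])
outer = np.outer(psi, phi.conj())
print("shape of |psi><phi|:", outer.shape)
```

Note that `np.vdot` conjugates its first argument, which is exactly what the bra requires.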

2.16.3. Introduction to Bethe Theory#

The dynamic theory calculates the probability of a transition from an initial state $| i ⟩$ to a final state $| f ⟩$ . In our case, the initial state $| i ⟩$ is the incoming beam, which scatters into a final state $| f ⟩$ , the Bragg reflection. What we want to know is the transition probability from the initial to the final state: \begin{equation} \omega_{i\rightarrow f} = \langle i |T|f\rangle \end{equation} The initial state is the beam with wave vector ${\vec{k}}_{0}$ and the final state is the diffracted beam, whose scattering vector fulfills Bragg’s law $\vec{q} = \vec{g}$ . The above equation now looks like this: \begin{equation} \omega_{0\rightarrow g} = \langle 0 |T|g\rangle \end{equation}

For stationary problems the states $| ψ, t ⟩$ can be expressed as: \begin{equation} |\psi_n,t\rangle = \exp(-iE_nt/\hbar) |\psi_n\rangle \end{equation} and we get the time-independent Schrödinger equation for $| ψ_{n} ⟩$ : \begin{equation} H|\psi_n\rangle = E_n |\psi_n \rangle \end{equation}

For the incoming wave, we get:

\begin{equation} H_0 = { -\frac{h^2}{8\pi^2 m} \nabla^2 } \end{equation} which expresses the kinetic energy.

Within a crystal the Hamiltonian will change to: \begin{equation} H = H_0 + V \end{equation} We have the Schrödinger equation for the incoming wave: \begin{equation} H_0 |\vec{k}\rangle = E_k |\vec{k}\rangle \end{equation}

and we want to solve: \begin{equation} (H_0 +V) |\psi\rangle = E_k |\psi\rangle \end{equation}

which we transform with the Schrödinger equation for the incoming wave above into:

\begin{equation*} |\psi\rangle = |\vec{k}\rangle + \frac{1}{E_k -H_0}V|\psi\rangle \end{equation*}

Effectively, we changed the differential equation into an integral equation.

2.16.3.1. Hamiltonian in Bethe Theory#

For our diffraction experiment it is often better to use a Hamiltonian that contains the wave vector: \begin{equation} H_0 = { -\Delta_\rho -\xi^2 } \end{equation}

Oops, where is the wave vector $\vec{k}$ ?

I replaced it by $ξ$ which is the effective deviation from wavevector $\vec{k}$

We also take into account that our electrons are very fast and distort the space, reducing the problem to two dimensions ( $ρ = (x, y)$ ). The crystal potential V has then to be changed as well to: \begin{equation} V = V(\rho,z) \end{equation} and the full Hamiltonian is changed to: \begin{equation} H = \frac{1}{k_z} \left(H_0 + V(\rho,z) \right) \end{equation}

The time-independent wave equation is then: \begin{equation} \nabla^2 \psi + k^2 \psi = 0 \end{equation} with the plane wave solution: \begin{equation} \psi = \exp(\pm i \vec{k}\cdot\vec{r}) \end{equation}

2.16.4. Schrödinger Equation of Bethe Theory#

The Bethe theory is based on the (time-independent, non-relativistic) Schrödinger equation:

\begin{equation} \label{Schroedinger_equation} \Big[ \underbrace{ -\frac{h^2}{8\pi^2 m} \nabla^2 }_{\text{kinetic energy}} +\underbrace{ \mathcal{V}(\vec{r})}_{\text{potential energy}} \Big] |\psi(\vec{r}) \rangle = \underbrace{\mathcal{E}}_{\text{total energy}} \underbrace{|\psi(\vec{r})\rangle }_{\text{wave function}} \end{equation}

What does that mean for the TEM?

We have an acceleration voltage (electric field potential) $E$ of 100 kV.

We have a charge of the electron $q$ with the value $e$ .

We have a total energy $\mathcal{E} = E \cdot q$ , which is just $E$ in units of $[e V]$ .

We have a crystal with the potential $V (\vec{r})$ , which we declare positive inside the crystal and zero outside.

We have a potential energy $\mathcal{V} (\vec{r}) = q \cdot V (\vec{r})$ .

Now that we have declared all our variables, we can transform the Schrödinger equation we started with to:

\begin{equation} \nabla^{2} |\psi (\vec{r}) \rangle = - \frac{8 \pi^{2} m e}{h^{2}} [E + V (\vec{r})] |\psi (\vec{r}) \rangle \end{equation}

The left-hand side of this equation is related to the momentum of the electron, and the right-hand side consists of a total energy part, which is boring, and a part which originates from the crystal (interesting!).

2.16.5. Bloch Waves in Bethe Theory#

Well, if the potential is periodic, then the solution (wave function) must be periodic, too.

First we make a substitution in case our wave function is complicated: we define it as a linear combination of other waves. That is a useful trick, which makes the mathematics easier, as we’ll see in a bit.

$$|\psi (\vec{r}) \rangle = \sum_{j} |b_{j} \rangle$$

$$|b_{j} \rangle = |b ({\vec{k}}^{(j)}, \vec{r}) \rangle$$

The $| b ({\vec{k}}^{(j)}, \vec{r}) ⟩$ are called Bloch waves and are only defined for specific $\vec{k}$ -vectors, because the $| b ({\vec{k}}^{(j)}, \vec{r}) ⟩$ are plane waves, each traveling in the $k^{(j)}$ direction. For the $| ψ (\vec{r}) ⟩$ we did not and could not have made any assumption like that.

Now, we express the fact that these Bloch waves are indeed plane waves mathematically:

\begin{equation} |b^{(j)}(\vec{r})\rangle = b(\vec{k}^{(j)}, \vec{r})=\mu(\vec{k}^{(j)}, \vec{r})\cdot e^{2\pi i \vec{k}^{(j)} \cdot \vec{r}} = \underbrace{\mu^{(j)}(\vec{r})}_{\text{Bloch function}} e^{2\pi i \vec{k}^{(j)} \cdot \vec{r}} \end{equation}

by dividing it into a plane wave part (the exponential function) and an amplitude part (the Bloch function). Because of the periodicity which we assume for the solution, we expand the Bloch waves into a Fourier series (in the same way as the Fourier expansion of the crystal potential).

\begin{equation} b^{(j)}(\vec{r}) = \sum_g C_g^{(j)} e^{2\pi i (\vec{k}^{(j)} + \vec{g}) \cdot \vec{r}} \end{equation} The sum in this equation goes over all excited (aha!) points in the reciprocal lattice, including the incident direction $g_{1} = 0$ . (The $\vec{g}$ are defined through the Miller indices as (h/a, k/b, l/c), where (a, b, c) are the real space lattice vectors.) Theoretically, there are an infinite number of $\vec{g}$ vectors, but only a few are allowed and only a few have a small excitation error.

So in practice there are only a few $\vec{g}$ vectors to consider.
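To make the Fourier expansion of a Bloch wave concrete, here is a one-dimensional sketch with only five $g$ values. The lattice constant `a`, the wave vector `k`, and the coefficients `C_g` are invented for illustration; real coefficients come out of the eigenvalue problem discussed later.

```python
import numpy as np

a = 0.4                      # lattice constant in nm (hypothetical)
k = 0.3 / a                  # Bloch wave vector in 1/nm (hypothetical)
g = np.arange(-2, 3) / a     # reciprocal lattice points g = n / a
C_g = np.array([0.05, 0.3, 0.9, 0.3, 0.05])   # hypothetical Fourier coefficients


def bloch_wave(x):
    """b(x) = sum_g C_g exp(2 pi i (k + g) x) for an array of positions x."""
    x = np.asarray(x, dtype=float)
    waves = np.exp(2j * np.pi * (k + g)[:, None] * x[None, :])
    return np.sum(C_g[:, None] * waves, axis=0)


x = np.linspace(0.0, a, 50)

# Shifting by one lattice constant only multiplies the Bloch wave by a phase factor
shifted = bloch_wave(x + a)
expected = np.exp(2j * np.pi * k * a) * bloch_wave(x)
print("Bloch condition fulfilled:", np.allclose(shifted, expected))

# The Bloch function mu(x) = b(x) exp(-2 pi i k x) has the periodicity of the lattice
mu = bloch_wave(x) * np.exp(-2j * np.pi * k * x)
mu_shifted = bloch_wave(x + a) * np.exp(-2j * np.pi * k * (x + a))
print("mu is lattice periodic:", np.allclose(mu, mu_shifted))
```

Both checks hold for any choice of coefficients, because every term in the sum contains a reciprocal lattice point $g$ with $g \cdot a$ being an integer.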

2.16.6. Crystal Potential in Bethe Theory#

The crystal potential is periodic and so we also make a Fourier expansion of that potential:

$$V = V (x, y, z) = V (\vec{r}) = \sum_{g} V_{g} \exp (2 \pi i \vec{g} \cdot \vec{r})$$

The Fourier component of the crystal potential (in Volts) consists of several atoms $j$ over which we sum:

$$V_{g} = \frac{h^{2}}{2 \pi m_{0} e} \frac{1}{\Omega} \sum_{j} f_{e_{j}} (\vec{g}) \exp (- 2 \pi i \vec{g} \cdot \vec{r}_{j}) = \frac{2 \pi e a_{0}}{\Omega} \sum_{j} f_{e_{j}} (\vec{g}) \exp (- 2 \pi i \vec{g} \cdot \vec{r}_{j})$$

where:

$f_{e_{j}}$ : atomic form (atomic scattering) factor of the $j$ th atom
$e$ : charge of electron
$m_{0}$ : rest mass of electron
$a_{0}$ : Bohr radius
$Ω$ : Volume of the unit cell
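The sum over the atoms in the unit cell decides which Fourier components $V_{g}$ vanish. The sketch below evaluates only this sum for an fcc unit cell. It assumes a constant placeholder scattering factor `f_e = 1` (real values of $f_{e}(\vec{g})$ depend on the element and on $|\vec{g}|$ and have to be taken from tables) and leaves out the prefactor; `lattice_sum` is a name made up for this example.

```python
import numpy as np

# Fractional atom positions in the fcc unit cell
positions = np.array([[0.0, 0.0, 0.0],
                      [0.5, 0.5, 0.0],
                      [0.5, 0.0, 0.5],
                      [0.0, 0.5, 0.5]])


def lattice_sum(hkl, positions, f_e=1.0):
    """sum_j f_e exp(-2 pi i g.r_j), with g as Miller indices and r_j fractional."""
    phases = np.exp(-2j * np.pi * positions @ np.asarray(hkl, dtype=float))
    return np.sum(f_e * phases)


for hkl in [(1, 0, 0), (1, 1, 0), (1, 1, 1), (2, 0, 0), (2, 2, 0)]:
    print(hkl, np.round(abs(lattice_sum(hkl, positions)), 6))
```

Reflections with mixed even and odd indices give zero, so the corresponding $V_{g}$ do not contribute to the expansion; this is one reason why only a few $\vec{g}$ vectors have to be considered.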

2.16.7. Solution of Bethe Theory#

Now, so far we haven’t done anything but substitute and expand. Let’s put all this into the Schrödinger equation above: \begin{equation} 4\pi^2 \sum_g \left[ K^2 - (\vec{k}_0^{(j)} + \vec{g})^2 +\sum_{h \neq 0} U_h e^{2\pi i \vec{h}\cdot\vec{r} } \right] C_g^{(j)} e^{2\pi i (\vec{k}_0^{(j)} +\vec{g})\cdot \vec{r} }= 0 \end{equation} This can only be zero if all coefficients with the same exponential function simultaneously become zero. After collecting the terms containing the factor $e^{2 \pi i ({\vec{k}}_{0}^{(j)} + \vec{g}) \cdot \vec{r}}$ , this results in a set of equations: \begin{equation} \left[ K^2 -(\vec{k}_0^{(j)} +\vec{g})^2 \right] C_g^{(j)} + \sum_{h\neq 0} U_h C_{g-h}^{(j)}=0; \qquad \vec{g}=\vec{g}_1, \vec{g}_2, \ldots, \vec{g}_n \end{equation}

I made use of an abbreviation: \begin{equation} K=\frac{1}{h}\left[ 2m_0 E (1+\frac{E}{2E_0}) +2m_0 e U_0(1+\frac{E}{E_0}) \right]^{\frac{1}{2}} \end{equation} for the magnitude of the wave vector inside the crystal, which is not identical to the magnitude of the wave vectors of the Bloch waves ${\vec{k}}_{g}^{(j)} = {\vec{k}}_{0}^{(j)} + \vec{g}$ .

Please note that I introduced relativistic corrections (the terms in the round brackets in the equation above), too. It is enough to add these corrections for the energy at this point; it is not necessary to solve the Dirac equation (relativistic Schrödinger equation).

The set of equations above is essential for the understanding of dynamic diffraction. Let’s look at it a little more closely.

We get for each $j$ one equation; and this means we get for each Bloch wave one equation.

The second term in the set of equations (the term with the sum) couples the coefficients $C_{g - h}^{(j)}$ of the Bloch waves. Effectively, we state that the inner potential $U_{h}$ mixes the Bloch waves; this is called dynamical coupling.

In summary:

We separated the problem!

2.16.8. Two Beam Case#

We rewrite the matrix expression for the boundary condition in the two beam case, with the excitation amplitudes $\epsilon^{(j)}$ of the Bloch waves: \begin{equation} \left( \begin{matrix} C_{0}^{(1)} & C_{0}^{(2)} \\ C_{g}^{(1)} & C_{g}^{(2)} \\ \end{matrix} \right) \cdot \left( \begin{matrix} \epsilon^{(1)} \\ \epsilon^{(2)} \\ \end{matrix} \right) = \left( \begin{matrix} \phi_0^{(0)} \\ \phi_g^{(0)} \\ \end{matrix} \right) = \left( \begin{matrix} 1 \\ 0 \\ \end{matrix} \right) \end{equation}

In the kinematic case, the centers M of the various Ewald spheres (for the various incident directions) lie on a sphere of radius $k = \frac{1}{λ}$ around the origin of the reciprocal lattice. At some point the diffracted beam will be more intense than the incident beam, and therefore we have to treat this scattered beam in the same way we have treated the incident beam before: we need an Ewald sphere of radius $k$ for this direction, that is, for this reciprocal lattice point $\vec{g}$ . Now we have two Ewald spheres, one around the origin $0$ and one around the reciprocal lattice point $\vec{g}$ . The two spheres are not allowed to intersect each other; instead, a smooth but rather complicated surface has to be constructed.

The fundamental equations of the dynamic theory for the two beam case are:

\begin{equation} - \gamma^{(j)} C_{0}^{(j)} + \frac{U_{g}}{2 K} C_{g}^{(j)} = 0 \end{equation}

\begin{equation} \frac{U_{g}}{2 K} C_{0}^{(j)} + (- \gamma^{(j)} + s) C_{g}^{(j)} = 0 \end{equation}

Such a homogeneous linear equation system for the $C_{g}^{(j)}$ has a non-zero solution if and only if the determinant of the coefficients is zero:

\begin{equation} \left| \begin{matrix} -\gamma^{(j)} & \frac{U_g}{2K}\\ \frac{U_g}{2K}& (-\gamma^{(j)}+s)\\ \end{matrix} \right| = {\gamma^{(j)}}^2 -s\gamma^{(j)}-\frac{U_g^2}{4K^2} =0 \end{equation}

This is the same as the Howie-Whelan equation (which we will use extensively) with $ξ_{g} = K / U_{g}$ , but now we know that the $γ^{(j)}$ are the eigenvalues of a matrix problem.

Solution:

$$\gamma^{(j)} = \frac{1}{2} \left[s - (- 1)^{j} \sqrt{(U_{g} / K)^{2} + s^{2}}\right] = \frac{1}{2} \left[s - (- 1)^{j} \sqrt{1 / \xi_{g}^{2} + s^{2}}\right] = \frac{1}{2 \xi_{g}} \left[w - (- 1)^{j} \sqrt{1 + w^{2}}\right]$$

We made the substitution $w = s ξ_{g}$ , in which the parameter $w$ characterizes the tilt out of the exact Bragg condition ( $w = 0$ ); remember that the excitation error $s$ is zero in the exact Bragg condition.

The minimum separation (at $s = 0$ ) is $Δ k_{z, min} = γ^{(1)} - γ^{(2)} = \frac{U_{g}}{K} = \frac{1}{ξ_{g}}$ .
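Since the $γ^{(j)}$ are eigenvalues of a matrix problem, we can let numpy do the work and compare with the analytic solution. This is a minimal sketch: the values of `xi_g` and `s` are invented, and the coupling term is written as $U_{g} / (2K) = 1 / (2 ξ_{g})$ , which assumes $ξ_{g} = K / U_{g}$ .

```python
import numpy as np

# Hypothetical input values (xi_g in arbitrary length units, s in the reciprocal units)
xi_g = 4.0                      # extinction distance
s = 0.1                         # excitation error
coupling = 1.0 / (2.0 * xi_g)   # U_g / (2K), assuming xi_g = K / U_g

# The two-beam fundamental equations written as A C = gamma C
A = np.array([[0.0, coupling],
              [coupling, s]])
gamma_numeric, C = np.linalg.eigh(A)      # eigenvalues in ascending order

# Analytic solution with w = s * xi_g for j = 1, 2
w = s * xi_g
j = np.array([1, 2])
gamma_analytic = (w - (-1.0)**j * np.sqrt(1.0 + w**2)) / (2.0 * xi_g)

print("numeric :", np.sort(gamma_numeric))
print("analytic:", np.sort(gamma_analytic))

# At s = 0 the eigenvalues are +coupling and -coupling, so their separation is 1 / xi_g
gamma_bragg = np.linalg.eigvalsh(np.array([[0.0, coupling], [coupling, 0.0]]))
print("separation at s = 0:", gamma_bragg[1] - gamma_bragg[0], " 1/xi_g:", 1.0 / xi_g)
```

The same approach carries over to more than two beams: the matrix simply gets one row and one column per reflection.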

By use of the eigenvalues $γ^{(j)}$ , the linear system of equations can be solved for the $C_{g}^{(j)}$ . For the amplitudes $\epsilon^{(j)} C_{g}^{(j)} = C_{0}^{(j)} C_{g}^{(j)}$ of the four partial waves of the two Bloch waves with the wave vectors ${\vec{k}}_{0}^{(j)} + \vec{g}$ we obtain:

$$C_{0}^{(j)} C_{0}^{(j)} = \frac{1}{2} \left[1 + (- 1)^{j} \frac{w}{\sqrt{1 + w^{2}}}\right] \qquad C_{0}^{(j)} C_{g}^{(j)} = - \frac{1}{2} \left[\frac{(- 1)^{j}}{\sqrt{1 + w^{2}}}\right]$$

We put this into the equation for the scattered wave and substitute the thickness $t$ for the $z$ component of the vector $\vec{r}$ :

$$\psi_{0} (t) = \sum_{j = 1}^{2} C_{0}^{(j)} C_{0}^{(j)} e^{2 \pi i k_{z}^{(j)} t} \qquad \psi_{g} (t) = \sum_{j = 1}^{2} C_{0}^{(j)} C_{g}^{(j)} e^{2 \pi i k_{z}^{(j)} t} e^{2 \pi i g x}$$

and we find (omitting common phase factors):

$$\psi_{0} (t) = \cos \left(\pi \sqrt{1 + w^{2}} \frac{t}{\xi_{g}}\right) - \frac{i w}{\sqrt{1 + w^{2}}} \sin \left(\pi \sqrt{1 + w^{2}} \frac{t}{\xi_{g}}\right) \qquad \psi_{g} (t) = \frac{i}{\sqrt{1 + w^{2}}} \sin \left(\pi \sqrt{1 + w^{2}} \frac{t}{\xi_{g}}\right) e^{2 \pi i g x}$$

The intensities of the transmission $T$ and reflection $R$ become: \begin{equation} \underbrace{\psi_g \psi_g^*}_{R} = \underbrace{1-\psi_0 \psi_0^*}_{1-T} = \frac{1}{1+w^2} \sin^2 \Big(\pi \underbrace{\sqrt{1+w^2}\, \frac{1}{\xi_g}}_{s_{eff}} \cdot t\Big) \end{equation}
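The closed form for $R$ can be cross-checked by summing the Bloch waves numerically. The sketch below is one way to do this under stated assumptions: `xi_g` and `w` are invented, the eigenvectors are taken from `np.linalg.eigh`, the excitation amplitudes are set to $C_{0}^{(j)}$ (which fulfills the boundary condition because the eigenvector matrix is orthogonal), and only the $γ^{(j)}$ part of $k_{z}^{(j)}$ is kept since the common phase factor drops out of the intensities.

```python
import numpy as np

xi_g = 4.0      # extinction distance (hypothetical, relative thickness units)
w = 0.4         # tilt parameter w = s * xi_g (hypothetical)
t = np.linspace(0.0, 8.0, 401)

# Two-beam matrix with U_g / (2K) = 1 / (2 xi_g) and s = w / xi_g
A = np.array([[0.0, 0.5 / xi_g],
              [0.5 / xi_g, w / xi_g]])
gamma, C = np.linalg.eigh(A)        # column C[:, j] holds (C_0^(j), C_g^(j))

# Boundary condition at the entrance surface: excitation amplitudes eps^(j) = C_0^(j)
eps = C[0, :]
phase = np.exp(2j * np.pi * np.outer(gamma, t))          # shape (2, len(t))
psi_0 = np.sum((eps * C[0, :])[:, None] * phase, axis=0)
psi_g = np.sum((eps * C[1, :])[:, None] * phase, axis=0)

T_numeric = np.abs(psi_0)**2
R_numeric = np.abs(psi_g)**2
R_closed = np.sin(np.pi * np.sqrt(1.0 + w**2) * t / xi_g)**2 / (1.0 + w**2)

print("R matches closed form:", np.allclose(R_numeric, R_closed))
print("R + T = 1:", np.allclose(R_numeric + T_numeric, 1.0))
```

The products `eps * C[0, :]` and `eps * C[1, :]` do not depend on the arbitrary sign that `eigh` assigns to each eigenvector, so the result is unambiguous.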

The solution is the Pendellösung of two coupled oscillators (in mechanics: two pendulums connected with a spring).

Even in exact Bragg condition, the intensity oscillates between primary beam and Bragg reflected beam with increasing film thickness. The plot below shows the Pendellösung with a damping term; set `damping = 0` to see it without absorption.

Normally one would want to add an absorption term to reduce the intensity with thickness. This absorption term is better named a damping term and stems from the inelastic scattering to random angles instead of the considered (here two) Bragg angles.

These oscillations of the intensities are commonly called a rocking curve.

# ------ Input ------
xi_g = 4  # extinction distance (in terms of relative thickness)
omega = 0.4  # tilt from Bragg condition, w = s * xi_g
damping = 0.3  # damping ("absorption") coefficient
# --------------------

t = np.linspace(0, 8, 401)

# effective excitation error s_eff = sqrt(1 + w^2) / xi_g
s_eff = np.sqrt(1. + omega**2) / xi_g
reflected = np.sin(np.pi * s_eff * t)**2 / (1. + omega**2)
incident = 1 - reflected

plt.figure()
plt.plot(t, incident * np.exp(-t * damping), label='incident beam')
plt.plot(t, reflected * np.exp(-t * damping), label='reflected beam')
plt.xlabel('relative thickness')
plt.ylabel('intensity')
plt.legend();

2.16.9. Summary of Bethe Theory#

The solution is the Pendellösung of two coupled oscillators.

The periodicity is the extinction length $ξ_{g}$ : in exact Bragg condition, the intensity of a beam vanishes completely once per extinction length.

If we consider some absorption (well, it’s not a real absorption, but inelastic scattering), then we see that the amplitudes decrease slowly.

2.16.10. Using Bethe Theory for Thickness Determination#

We will do this in a lab and it will be your homework.

The accurate thickness of the sample is an important but hard to obtain parameter, and it influences the contrast in all imaging modes.
Be aware that different techniques measure different thicknesses. In any high resolution image and diffraction experiment, you always look at the thickness of the crystalline part of the sample, omitting the contribution of contamination and amorphous surface layers (from sample preparation).
In the Analytic Section of this class we learn how to determine the thickness of the whole sample.

We can observe the above rocking curve in convergent beam electron diffraction patterns (CBED).

But we have to ensure that:

Excitation error is as small as possible
We are in two beam condition

2.16.10.1. Experimental Considerations#

Choose a convergence angle $α$ so that $α < θ_{B}$ , to avoid overlapping of disks in the ZOLZ.
The 000 disk usually contains concentric diffuse fringes, the Kossel-Möllenstedt fringes
If you move the specimen, then you will see that the number of these fringes changes. In fact, the number of fringes increases by one every time the thickness increases by one extinction length.
The foil thickness can be measured precisely at the point where you do your other analysis.

Please be aware that dynamic effects also occur for the HOLZ lines in a CBED pattern.

In practice to simplify the interpretation, we don’t use zone axis conditions, but tilt to two–beam conditions with only one strongly excited Bragg beam.

The CBED disks then contain parallel rather than concentric intensity oscillations as shown in the earlier figure.
In fact, these intensity oscillations are equivalent to the rocking curve intensity oscillations discussed earlier.
It helps if you use an energy filter for this method.

2.16.10.2. Thickness Determination#

Because the oscillations are symmetric in the hkl disk we concentrate the analysis on this disk.

The middle of the hkl disk is bright and originates from the exact Bragg condition ( $\vec{s} = 0$ ).
We measure the distance between the middle (bright fringe) of the $h k l$ disk and the dark lines.

You obtain a deviation $s_{i}$ for each fringe from the equation: \begin{equation} s_i=\lambda\frac{\Delta\theta_i}{2\theta_B d^2} \end{equation} The Bragg angle $θ_{B}$ is known from the separation of two disks, and the lattice spacing $d$ is known from the sample or can be calculated through the camera length.
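As a sketch of how the measured fringe positions turn into deviation parameters, the function below applies $s_{i} = λ Δθ_{i} / (2 θ_{B} d^{2})$ directly. All numbers are invented for illustration, the small-angle form of Bragg’s law $θ_{B} ≈ λ / (2 d)$ is assumed, and `deviation_parameters` is a hypothetical helper name.

```python
import numpy as np


def deviation_parameters(delta_theta, theta_B, d, wavelength):
    """s_i = wavelength * delta_theta_i / (2 * theta_B * d**2)."""
    delta_theta = np.asarray(delta_theta, dtype=float)
    return wavelength * delta_theta / (2.0 * theta_B * d**2)


# Hypothetical measurement
wavelength = 3.70e-3                 # nm, electrons at 100 kV
d = 0.2                              # nm, lattice spacing of the reflection (made up)
theta_B = wavelength / (2.0 * d)     # small-angle Bragg law, in rad

# Only the ratio delta_theta_i / (2 theta_B) matters; it can be read from the pattern
# as (distance of fringe i from the disk center) / (distance between 000 and hkl disk)
ratio = np.array([0.12, 0.21, 0.30])          # made-up values
delta_theta = ratio * 2.0 * theta_B

s_i = deviation_parameters(delta_theta, theta_B, d, wavelength)
print("s_i in 1/nm:", s_i)
```

Because only the ratio of two distances in the same pattern enters, the camera length does not need to be calibrated for this step as long as $d$ is known.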

If the extinction distance $ξ_{g}$ is known you can calculate the foil thickness $t$ with: \begin{equation} \frac{1}{t^2} = \frac{s_i^2}{n_k^2}+\frac{1}{\xi_g^2n_k^2} \end{equation} where $n_{k}$ is an integer.

2.16.10.3. Data Analysis#

assign $n = 1$ to the first fringe $s_{1}$
assign $n = 2$ to the second fringe $s_{2}$ and so on for all other fringes
plot $(s_{i} / n_{k})^{2}$ versus $(1 / n_{k})^{2}$ .
if you get a straight line, then you are finished and you have $k = i + j$ , where $j$ is the largest integer $< (t / ξ_{g})$ .
if not, repeat the same thing with $n = 2$ for $s_{1}$ , $n = 3$ for $s_{2}$ , etc.
repeat this increase by one until you get a straight line
the slope of the line is $- 1 / ξ_{g}^{2}$
the intercept of the line, extrapolated to $1 / n_{k}^{2} = 0$ , is $1 / t^{2}$ .
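The steps above can be sketched as a small fitting routine. This is an illustration under assumptions, not the procedure used in the lab: `fit_thickness` is a made-up helper, "straight line" is judged by the smallest least-squares residual, and the input is synthetic, noise-free data generated from invented values $t = 120$ nm and $ξ_{g} = 50$ nm.

```python
import numpy as np


def fit_thickness(s, max_start=10):
    """Try n = start, start + 1, ... for the fringes and keep the straightest line.

    Returns (start, thickness, extinction_distance).
    """
    s = np.asarray(s, dtype=float)
    best = None
    for start in range(1, max_start + 1):
        n = np.arange(start, start + len(s))
        x = 1.0 / n**2
        y = (s / n)**2
        slope, intercept = np.polyfit(x, y, 1)
        residual = np.sum((y - (slope * x + intercept))**2)
        # a physical solution needs slope = -1/xi_g^2 < 0 and intercept = 1/t^2 > 0
        if slope < 0 and intercept > 0 and (best is None or residual < best[0]):
            best = (residual, start, 1.0 / np.sqrt(intercept), 1.0 / np.sqrt(-slope))
    if best is None:
        raise ValueError("no physically meaningful straight line found")
    return best[1:]


# Synthetic test data (hypothetical values): t = 120 nm, xi_g = 50 nm
t_true, xi_true = 120.0, 50.0
n_true = np.arange(3, 7)     # the first minimum has n = 3 because t / xi_g = 2.4
s_measured = np.sqrt(n_true**2 / t_true**2 - 1.0 / xi_true**2)

start, t_fit, xi_fit = fit_thickness(s_measured)
print(f"first fringe index n = {start}, t = {t_fit:.1f} nm, xi_g = {xi_fit:.1f} nm")
```

With real, noisy measurements the residuals of neighboring index assignments can be close to each other, so it is worth looking at the plots as well instead of trusting the smallest residual blindly.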

The whole procedure is summarized in the figure below.

CBED-thickness

2.16.11. More about Bloch Waves#

We can replace the exponential functions by trigonometric functions and get:

$$A^{(1)} = \cos \frac{\beta}{2} \qquad A^{(2)} = \sin \frac{\beta}{2}$$

Some of the Bloch waves are located (have their maxima) between the atoms. These Bloch waves channel and are more or less undisturbed.

Another set is located on the atomic rows and will cause much more inelastic scattering than the other; these Bloch waves will also travel much faster.

The second set is especially important for Z-contrast imaging, where a small convergent beam is located on the atomic rows. You might consider the atoms like little lenses which keep the beam focused on the column.