Skip to article frontmatterSkip to article content

Bethe Theory

Chapter 2: Diffraction


Bethe Theory

Download

OpenInColab

part of

MSE672: Introduction to Transmission Electron Microscopy

Spring 2025
by Gerd Duscher

Microscopy Facilities
Institute of Advanced Materials & Manufacturing
Materials Science & Engineering
The University of Tennessee, Knoxville

Background and methods to analysis and quantification of data acquired with transmission electron microscopes

Import numerical and plotting python packages

Import the python packages that we will use:

We will use only the basic numerical (numpy) and plotting (pylab of matplotlib) libraries:

%matplotlib  widget
import matplotlib.pyplot as plt
import numpy as np

Notation

In the following I will introduce this Bethe diffraction theory in Dirac’s bra--ket notation.

ψ=ket=ψ11+ψ22=(ψ1ψ2)=column vector|\psi\rangle = {\rm ket} = \psi_1 |1\rangle + \psi_2 |2\rangle = \left( \begin{array}{c} \psi_1 \\\psi_2 \end{array} \right)= {\rm column\ vector}\\

ϕ=bra=ϕ11+ϕ22=(ϕ1,ϕ2)=row vector\langle \phi| = {\rm bra} = \phi^*_1\langle 1| + \phi^*_2 \langle 2| = \left( \phi^*_1 ,\phi^*_2 \right)= {\rm row\ vector}

Consequently the Dirac “bracket” is:

ϕψ=(ϕ1,ϕ2)(ψ1ψ2)=ϕ1ψ1+ϕ2ψ2\langle \phi|\psi\rangle = \left( \phi^*_1 ,\phi^*_2 \right) \left( \begin{array}{c} \psi_1 \\\psi_2 \end{array} \right) = \phi^*_1 \psi_1 +\phi^*_2 \psi_2

Similarly

ψϕ=a matrix|\psi\rangle \langle \phi| = {\rm a\ matrix}

and this will make it easier to follow the equations of many beams in diffraction, which otherwise can become very messy.

Throughout this chapter we will use the notation for the wave vector common in solid state physics and not the one found in many crystallography books (k=1/λk=1/\lambda):

k=2πλ=2meEk = \frac{2\pi}{\lambda} = \frac{\sqrt{2meE}}{\hbar}

with:

  • λ\lambda: wavelength

  • m=γ0m=\gamma_0: relativistically corrected mass of the electron

  • γ=1+e2E22m0c2\gamma = 1+\frac{e^2E^2}{2m_0c^2}: relativistic correction necessary for EE above 2kV

  • ee: electron charge

  • EE: acceleration voltage

  • cc: speed of light in vacuum

This is the short version of quantum mechanics. The Axioms of quantum mechanics are:

  1. The state of a system is described by its state vector ψ|\psi\rangle.

  2. An observables are expressed by hermitic operators AA

  3. The mean value of an observable is given by A\langle A \rangle

  4. The time dependence is given by the time-dependent Schrödinger
    equation: Hψ,t=itψ,t H|\psi,t\rangle = i\hbar \frac{\partial}{\partial t} |\psi,t\rangle

  5. If you measure AA the system changes to n|n\rangle if ana_n was measured

Axioms 2. and 3. give:

If a system is in the state ψ=ncnn|\psi\rangle = \sum_n c_n |n\rangle

with cn=nψc_n = \langle n|\psi\rangle, where n|n\rangle are the Eigen states of AA, meaning An=annA|n\rangle = a_n |n\rangle. The probability to find the value ana_n when measuring AA is given by cn2|c_n|^2.

Introduction to Bethe Theory

The dynamic theory calculates the probability of a transition from an initial state i|i\rangle to a final state f|f\rangle. In our case, the initial state i|i\rangle is the incoming beam which scatters to a final state the Bragg reflection f|f\rangle. What we want to know is the transition probability from the initial to the final state:

ωif=iTf\omega_{i\rightarrow f} = \langle i |T|f\rangle

the initial state is the beam with wave vector k0\vec{k}_0 and the end state is a wave-vector of diffracted beam according to Bragg’s law q=g\vec{q} = \vec{g}. The above equation now looks like this:

ω0g=0Tg\omega_{0\rightarrow g} = \langle 0 |T|g\rangle

For the stationary problems the states ψ,t|\psi,t\rangle can be expressed as:

ψn,t=exp(Ent/)ψn|\psi_n,t\rangle = \exp(-E_nt/\hbar) |\psi_n\rangle

and we get the time-independent Schröinger equation for ψn|\psi_n\rangle:

Hψn=EnψnH|\psi_n\rangle = E_n |\psi_n \rangle

For the incoming wave, we get:

H0=h28π22H_0 = { -\frac{h^2}{8\pi^2} \nabla^2 }

which expresses the kinetic energy.

Within a crystal the Hamiltonian will change to:

H=H0+VH = H_0 + V

We have the Schrödinger equation for the incoming wave:

(H0)k=Ekk(H_0 ) |\vec{k}\rangle = E_k |\vec{k}\rangle

and we want to solve:

(H0+V)ψ=Ekψ(H_0 +V) |\psi\rangle = E_k |\psi\rangle

which we transform with equation \ref{IncomingWave}:

Effectively, we changed in the integral equation.

ψ=k+1EkH0Vψ|\psi\rangle = |\vec{k}\rangle + \frac{1}{E_k -H_0}V|\psi\rangle

Hamiltonian in Bethe Theory

For our diffraction experiment is often better to use a Hamiltonian that contains the wave vector:

H0=Δρξ2H_0 = { -\Delta_\rho -\xi^2 }

OOps, where is the wavevector k\vec{k}?

I replaced it by ξ\xi which is the effective deviation from wavevector k\vec{k}

We also take into account that our electrons are very fast and distort the space, reducing the problem to two dimensions (ρ=(x,y)\rho = (x,y)). The crystal potential V has then to be changed as well to:

V=V(ρ,z)V = V(\rho,z)

and the full Hamiltonian is changed to:

H=1kz(H0+V(ρ,z))H = \frac{1}{k_z} \left(H_0 + V(\rho,z) \right)

The time- independent wave equation is then:

2ψ+k2ψ=0\nabla^2 \psi + k^2 \psi = 0

with the plane wave solution:

ψ=exp(±kr)=0\psi = \exp(\pm\vec{k}\bullet\vec{r}) = 0

Schrödinger Equation of Bethe Theory

The Bethe theory is based on the (time independent, non relativistic) Schrödinger equation:

[h28π22kinetic energy+V(r)pot.energy]ψ(r)=Etotal energyψ(r)wave function\Big[ \underbrace{ -\frac{h^2}{8\pi^2} \nabla^2 }_{{\rm kinetic\ energy}} +\underbrace{ \mathcal{V}(\vec{r})}_{{\rm pot. energy}} \Big]\, |\psi(\vec{r}) \rangle = \underbrace{\mathcal{E}}_{{\rm total\ energy}} \underbrace{|\psi(\vec{r})\rangle }_{{\rm wave\ function}}

What does that mean for the TEM?

We have a acceleration voltage (electric field potential) EE of 100kV

We have a charge of the electron qq with the value ee.

We have a total Energy E=Eq-\mathcal{E} = E\cdot q which is just EE in the units of [eV][eV].

We have a crystal with the potential V(r)V(\vec{r}), which we declare positive inside the crystal and zero outside.

We have a potential Energy V(r)=qV(r)\mathcal{V}(\vec{r}) = q\cdot V(\vec{r}).

Now, that we declared all our variables we can transform the Schrödinger equation we started with to:

2ψ(r)=8πmeh2[E+V]ψ(r)\nabla^2 |\psi(\vec{r})\rangle = -\frac{8\pi m e}{h^2}\, [E+V]\, |\psi(\vec{r})\rangle

The left hand part of this equation is the impulse of the electron and the right hand part consists of a total energy part, which is boring and a part which originates from the crystal (interesting!).

Bloch Waves in Bethe Theory

Well, if the potential is periodic, then the solution (wave function) must be periodic, too.

First we make a substitution in case our wave function is complicated: we define it as a linear combination of other waves. That is a useful trick, which makes the mathematics easier as we’ll see in a bit.

ψ(r)=jbj|\psi(\vec{r})\rangle = \sum_j |b_j\rangle

bj=b(k(j),r)|b_j\rangle = |b(\vec{k}^{(j)}, \vec{r})\rangle

The b(k(j),r) |b(\vec{k}^{(j)}, \vec{r})\rangle are called Bloch waves and are only defined for specific k\vec{k}-vectors, because the b(k(j),r)|b(\vec{k}^{(j)}, \vec{r})\rangle are plane waves, each traveling in k(j)k^{(j)} direction. For the ψ(r)|\psi(\vec{r})\rangle we did not and could not have made any assumption like that.\

Now, we express the fact that these Bloch waves are indeed plane waves mathematically:

b(j)(r)=b(k(j),r)=μ(k(j),r)e2πik(j)r=μ(j)(r)Bloch functione2πik(j)r|b^{(j)}(\vec{r})\rangle = b(\vec{k}^{(j)}, \vec{r})=\mu(\vec{k}^{(j)}, \vec{r})\cdot e^{2\pi i \vec{k}^{(j)} \vec{r}} = \underbrace{\mu^{(j)}(\vec{r})}_{{\rm Bloch\ function}} e^{2\pi i \vec{k}^{(j)} \vec{r}}

by dividing it in a plane wave part (the exponential function) and a amplitude part (the Bloch function). Because of the periodicity which we assume for the solution, we expand the Bloch waves in a into a Fourier series, again (the same as in equation \ref{FourierExpand} for the potential).

b(j)(r)=gCg(j)e2πi(k(j)+g)rb^{(j)}(\vec{r}) = \sum_g C_g^{(j)} e^{2\pi i (\vec{k}^{(j)} + \vec{g}) \vec{r}}

The sum in this equation goes over all excited (aha!) points in the reciprocal lattice, including the incident direction g1=0g_1 = 0. ( g\vec{g} are defined through the Milller indices as (h/a, k/b, l/c), where the (a,b,c) are the real space lattice vectors). | Theoretically, there are an infinite number of g\vec{g} vectors, but only few are allowed and only a few have a small excitation error.

So in practice there are only a few g\vec{g} vectors to consider.

Crystal Potential in Bethe Theory

The crystal potential is periodic and so we also make a Fourier expansion of that potential

V=V(x,y,z)=V(r)=gVgexp(2πigr)V = V(x,y,z) = V(\vec{r}) = \sum_g V_g \exp(2\pi i\vec{g}\cdot\vec{r})

The Fourier component of the crystal potential (in Volts) consists of several atoms jj over which we sum:

Vg=h22πm0e1Ωjfej(g)exp(2πigrj)=2πea0Ωjfej(g)exp(2πigrj)V_g = \frac{h^2}{2 \pi m_0 e} \frac{1}{\Omega} \sum_j f_{e_j}(\vec{g}) \exp(-2\pi i\vec{g}\cdot\vec{r_j})\\ = \frac{2 \pi e a_0}{\Omega} \sum_j f_{e_j}(\vec{g}) \exp(-2\pi i\vec{g}\cdot\vec{r_j})\\

where:

  • fejf_{e_j}: atomic form (atomic scattering) factor of the jjth atom

  • ee: charge of electron

  • m0m_0: rest mass of electron

  • a0a_0: Bohr radius

  • Ω\Omega: Volume of the unit cell

Solution of Bethe Theory

Now, so far we haven’t done anything, but substitute and expand. Let’s put all this into the Schrödinger equation above:

4π[K2(k0(j)+g)2+h0Uhe2πihr]Cg(j)e2πi(ko(j)+g)r=04\pi \left[ K^2 - (k_0^{(j)} + g)^2 +\sum_{h \neq 0} U_h e^{2\pi i \vec{h}\vec{r} } \right] \cdot C_g^{(j)} e^{2\pi i (\vec{k}_o^{(j)} +\vec{g})\cdot \vec{r} }= 0

This can only be zero, if all coefficients with same exponential function simultaneous become zero; this results in a set of equations, after collecting up terms containing the factor e2πi(ko(j)+g)re^{2\pi i (\vec{k}_o^{(j)} +\vec{g})\cdot \vec{r} }:

[K2(ko(j)+g)2]Cg(j)+h0UhCgh(j)=0;g=g1,g2,...,gn\left[ K^2 -(\vec{k}_o^{(j)} +\vec{g})^2 \right] C_g^{(j)} + \sum_{h\neq 0} U_h C_{g-h}^{(j)}=0; \qquad \vec{g}=\vec{g}_1, \vec{g}_2, ..., \vec{g}_n

I made use of an abbreviation:

K=1h[2m0E(1+E2E0)+2m0eU0(1+EE0)]12K=\frac{1}{h}\left[ 2m_0 E (1+\frac{E}{2E_0}) +2m_0 e U_0(1+\frac{E}{E_0}) \right]^{\frac{1}{2}}

for the wave vector inside the the crystal which are not identical to the magnitude of the wave vectors of the Bloch waves kg(j)=ko(j)+g\vec{k}_g^{(j)}=\vec{k}_o^{(j)} +\vec{g}.

Please note, that I introduced relativistic corrections (the terms in the round brackets in the equation above), too. It is enough to add this corrections for the energy at this point; it is not necessary to solve the Dirac equation (relativistic Schrödinger equation).

The set of equations defined in \ref{setEquat} are essential for the understanding of dynamic diffraction. Let’s look at it a little more closely.

We get for each jj one equation; and this means we get for each Bloch wave one equation.

The second term in equation \ref{setEquat} (the term with the sum) mixes the Bloch waves (Cgh(j)C_{g-h}^{(j)}). Effectively, we state that the inner potential UhU_h mixes the Bloch waves; this is called dynamical coupling.

In summation:

We separated the problem!

Two Beam Case

We rewrite the matrix expression for the boundary condition in the two beam case:

(C0(1)C0(2)Cg(1)Cg(2))(γ(1)γ(2))=(ϕ0(0)ϕg(0))=(10)\left( \begin{matrix} C_{0}^{(1)} & C_{0}^{(2)} \\ C_{g}^{(1)} & C_{g}^{(2)} \\ \end{matrix} \right) \cdot \left( \begin{matrix} \gamma^{(1)} \\ \gamma^{(2)} \\ \end{matrix} \right) = \left( \begin{matrix} \phi_0^{(0)} \\ \phi_g^{(0)} \\ \end{matrix} \right) = \left( \begin{matrix} 1 \\ 0\\ \end{matrix} \right)

In the kinematic case, the centers M of the various Ewald spheres (for the various incident directions) lay on a sphere of radius k=1λk=\frac{1}{\lambda} around the origin of the reciprocal lattice. At some point the intensity in the diffracted beam will be more intense than in the incident beam, and therefore, we have to treat this scattered beam now in the same way we have treated the incident beam before: we need an Ewald sphere of radius kk for this direction/ for this reciprocal lattice point g\vec{g}. Now we have two Ewald spheres, one around the origin 0 and one around the reciprocal lattice point g\vec{g}. The two spheres are not allowed to intersect each other, but a smooth kind of complicated surface has to be constructed.

The fundamental equations of the dynamic theory for the two beam case are:

γ(j)C0j+Ug2KCg(j)=0-\gamma^{(j)} C_0^{j} + \frac{U_g}{2K} C_g^{(j)} = 0
Ug2KC0j+(γ(j)+s)Cg(j)=0\frac{U_g}{2K} C_0^{j} + (-\gamma^{(j)}+s) C_g^{(j)} = 0

Such a homogeneous linear equation system for the Cg(j)C_g^{(j)} has a non-zero solution of and only if the determinant of the coefficients is zero:

γ(j)Ug2KUg2K(γ(j)+s)=γ(j)2sγ(j)Ug24K2=0\left| \begin{matrix} -\gamma^{(j)} & \frac{U_g}{2K}\\ \frac{U_g}{2K}& (-\gamma^{(j)}+s)\\ \end{matrix} \right| = {\gamma^{(j)}}^2 -s\gamma^{(j)}-\frac{U_g^2}{4K^2} =0

Which is the same as the Howie-Whelan equation (on which we will use extensivly) with ξg=K2/Ug\xi_g = K^2/U_g, but now we know that the γ(j)\gamma^{(j)} are the Eigenvalues of a matrix problem.

Solution:

γ(j)=12[s(1)j(Ug/K)2+s2]=12[s(1)j(1/ξ2+s2]=12ξg[w(1)j(1+w2]\gamma^{(j)} = \frac{1}{2} \left[ s-(-1)^j \sqrt{(U_g/K)^2 +s^2} \right]\\ = \frac{1}{2} \left[ s-(-1)^j \sqrt{(1/\xi^2 +s^2} \right]\\ = \frac{1}{2\xi_g} \left[ w-(-1)^j \sqrt{(1+w^2} \right]\\

We made the substitution w=sξgw =s\xi_g, in which the parameter ww characterizes the tilt out of the exact Bragg condition (w=0w=0). The excitation error ss is zero in the exact Bragg condition, isn’t it.

The separation is Δkz,min=γ(1)γ(2)=Ug2K=1ξg\Delta k_{z,min} = \gamma^{(1)}-\gamma^{(2)}= \frac{U_g}{2K}=\frac{1}{\xi_g}.

By use of the eigenvalues γ(j)\gamma^{(j)}, the linear systems of equations can be solved for the Cg(j)C_g^{(j)}. For the amplitude ϵ(j)Cg(j)=C0jCg(j)\epsilon^{(j)} C_g^{(j)} = C_0^{j} C_g^{(j)} of the four Bloch waves with the vector k0(j)+g\vec{k}_0^{(j)}+\vec{g} we obtain:

C0jC0(j)=12[1+(1)jw1+w2]C0jCg(j)=12[(1)j1+w2]C_0^{j} C_0^{(j)} = \frac{1}{2}\left[ 1 + (-1)^j \frac{w}{\sqrt{1+w^2}} \right]\\ C_0^{j} C_g^{(j)} = -\frac{1}{2}\left[ \frac{(-1)^j}{\sqrt{1+w^2}} \right]

We put this into the equation for the scattered wave and substitute the thickness tt for the zz component of the vector r\vec{r}:

ψ0(t)=j=12C0jC0(j)e2πikz(j)tψg(t)=j=12C0jCg(j)e2πikz(j)te2πikzgx\psi_0(t) = \sum_{j=1}^2 C_0^{j} C_0^{(j)}e^{2\pi i k_z^{(j)}t}\\ \psi_g(t) = \sum_{j=1}^2 C_0^{j} C_g^{(j)}e^{2\pi i k_z^{(j)}t}\, e^{2\pi i k_z gx}

and we find (omitting common phase factors):

ψ0(t)=cos(π1+w2tξg)iw1+w2sin(π1+w2tξg)ψg(t)=i1+w2sin(π1+w2tξg)e2πikzgx\psi_0(t)= \cos (\pi \sqrt{1+w^2}\frac{t}{\xi_g}) - \frac{iw}{\sqrt{1+w^2}} \sin (\pi \sqrt{1+w^2}\frac{t}{\xi_g})\\ \psi_g(t)= \frac{i}{\sqrt{1+w^2}} \sin (\pi \sqrt{1+w^2}\frac{t}{\xi_g}) \, e^{2\pi i k_z gx}

The intensities of the transmission TT and reflection RR become:

ψgψgR=1ψ0ψ01T=11+w2sin2(π1+w2(1ξg)sefft)\underbrace{\psi_g \psi_g^*}_R = \underbrace{1-\psi_0 \psi_0^*}_{1-T} = \frac{1}{1+w^2} \sin^2 (\pi \sqrt{\sqrt{1+w^2} ( \frac{1}{\xi_g}})_{s_{eff}} \cdot t)

The solution is the Pendellösung of two coupled oscillators (in mechanics: two pendulums connected with a spring).

Even in exact Bragg condition, the intensity oscillates between primary beam and Bragg reflected beam with increasing film thickness. Look at the plot below, the Pendellösung is shown without absorption.

Normally one would want to add an absorption term to reduce the intensity with thickness. This absorption term is better named a damping term and stems form the inelatic scattering to random angles instead of the considered (here two) Bragg angles

These oscillations of the intensities are commonly called rocking curve.

# ------ Input ------
xi_g = 4  # extiction distance (in terms of relative thickness )
omega = 0.4 # tilt from Bragg condition
damping = 0.3
# --------------------

t = np.linspace(0,8,401)


plt.figure()
plt.plot(t, (1-np.sin(np.pi * np.sqrt(np.sqrt(1.+ omega**2)*1/xi_g)*t)**2) * 1/np.exp(t*damping), label='incident beam')
plt.plot(t, np.sin(np.pi * np.sqrt(np.sqrt(1.+ omega**2)*1/xi_g)*t)**2 * 1/np.exp(t*damping), label='reflected beam')

plt.legend();
Loading...

Summary of Bethe Theory

The solution is the Pendellösung of two coupled oscillators.

The periodicity is the extinction length ξ\xi, which tells at which thickness a beam is completely vanished.

Considering some absorption (well it’s not a real absorption, but inelastic scattering) then we see that the amplitudes decrease slowly.

Using Bethe Theory for Thickness Determination

We will do this in a lab and it will be your homework.

  • The accurate thickness of the sample is an important but hard to obtain parameter, but it influences the contrast in all imaging modes.

  • Be aware that with different techniques you perform different thickness measurement. In any high resolution image and diffraction experiment, you always look at the thickness of the crystalline part of the sample, omitting the contribution of contamination and amorphous surface layer (from sample preparation).

  • In the Analytic Section of this class we learn how to the thickness from the whole sample.

We can observe the above rocking curve in convergent beam electron diffraction patterns (CBED).

But we have to ensure that:

  • Excitation error is as small as possible

  • We are in two beam condition

Experimental Considerations

  • Choose a convergence angle α\alpha so that α<θB\alpha < \theta_B, to avoid overlapping of disks in the ZOLZ.

  • The 000 disk usually contains concentric diffuse fringes, the Kossel-Möllenstedt fringes

  • If you move the specimen, then you will see that the number of this fringes changes. In fact the number of each fringes increases by one every time the thickness increases by one extinction length.

  • The foil thickness can be measured precisely at the point where you do your other analysis.

Please be aware that dynamic effects also occur for the HOLZ lines in a CBED pattern.

In practice to simplify the interpretation, we don’t use zone axis conditions, but tilt to two--beam conditions with only one strongly excited Bragg beam.

  • The CBED disks contain then parallel rather than concentric intensity oscillations as shown in the earlier figure.

  • In fact, this intensity oscillations are equivalent to the rocking curve intensity oscillations discussed earlier.

  • It helps, of you use an energy filter for this method.

Thickness Determination

Because the oscillations are symmetric in the hkl disk we concentrate the analysis on this disk.

  • The middle of the hkl disk is bright and originates from the exact Bragg condition (s=0\vec{s}=0).

  • We measure the distance between the midle (bright fringe) of the hklhkl disk and the dark lines.

You obtain a deviation sis_i for each fringe from the equation:

si=λΔθieθBd2s_i=\lambda\frac{\Delta\theta_i}{e\theta_B d^2}

The Bragg angle θB\theta_B is known from the separation of two disks and the lattice spacing dd is known from the sample or can be calculated through the camera length.

If the extinction distance ξg\xi_g is known you can calculate the foil thickness tt with:

1t2=s12nk2+1ξg2nk2\frac{1}{t^2} = \frac{s_1^2}{n_k^2}+\frac{1}{\xi_g^2n_k^2}

where nkn_k is an integer.

Data Analysis

  • assign n=1n=1 to the first fringe s1s_1

  • assign n=2n=2 to the second fringe s2s_2 and so on for all other fringes

  • plot (s1/nk)2(s_1/n_k)^2 versus (1/nk)2(1/n_k)^2.

  • if you get a straight line, then you are finished and you have k=i+jk=i+j, where jj is the largest integer <(t/ξg)<(t/\xi_g).

  • if not repeat the same thing with n=2n=2 for s1s_1, n=3n=3 for s2s_2, etc.

  • repeat this increase by one till you get a straight line

  • the slope of the line is 1/ξg21/\xi_g^2

  • the extrapolated value for 1/nk21/n_k^2 is 1/t21/t^2.

The whole procedure is summarized in the figure below.

CBED-thickness

More about Bloch Waves

We can replace the exponential functions by trigonometric functions and get:

A(1)=cosβ2A(2)=sinβ2A^{(1)} = \cos \frac{\beta}{2}\\ A^{(2)} = \sin \frac{\beta}{2}

Some of the Bloch waves are located (have their maxima) between the atoms. These Bloch waves channel and are more or less undisturbed.

Another set is located on the atomic rows and will cause much more inelastic scattering than the other, also they will travel much faster.

The second set is especially important for Z-contrast image, where a small convergent beam is located at the atomic rows. You might consider the atoms like little lenses which keep the beam focused on the column.