Introduction to Solid State Physics
Frank Gohmann

Lecture notes of the author's Introduction to Theoretical Solid State Physics, held at the University of Wuppertal since 2003.

This script is based on lecture notes prepared for the regular Introduction to Theoretical Solid State Physics at the University of Wuppertal, held by the author in the winter semesters of 2003/04, 2004/05, 2010/11, 2011/12, 2013/14, 2014/15 and in the summer semesters of 2006 and 2020. Due to the prevailing Covid-19 pandemic all teaching at the University of Wuppertal in the summer of 2020 went online. In order to support my students with their home training programme I decided to typeset at least the present part of my lecture notes. In regular semesters I would have delivered 28-30 lectures, 90 minutes each. Since the beginning of our semester was delayed due to the pandemic, the number of lectures was restricted to 23. For this reason I cut out two lectures on the Hartree-Fock approximation and two lectures usually devoted to recalling the formalism of second quantization. I condensed the remaining material to fit into the available 23 time slots. The omitted material, as well as some of the material of the regular lectures on Advanced Theoretical Solid State Physics, may follow on a later occasion, e.g., after the next pandemic. Most of the lectures are complemented with a few intermediate-level homework exercises which are considered an integral part of the course. These exercises were discussed by the participants in a separate online exercise session on a weekly basis.

At the University of Wuppertal the Introduction to Theoretical Solid State Physics is part of the Master course. Students who attend this course are expected to have successfully passed the basic courses in theoretical physics (Mechanics, Electrodynamics, Quantum Mechanics, Statistical Mechanics) and a course on Advanced Quantum Mechanics.

I would like to thank all those participants of my lectures whose attention and whose questions helped to improve this manuscript. Particular thanks are due to Saskia Faulmann and Siegfried Spruck, who read the entire text and pointed out a number of typos and inaccuracies to me. When I started teaching Theoretical Solid State Physics in 2003, I modelled my lecture on the lecture notes of my dear colleague Holger Fehske, whom I would like to thank at this point. My own first lecture notes then gradually evolved over the years and most probably will continue to evolve in the future. I prefer to think of the following typeset version as a snapshot taken in the pandemic year 2020.

What is solid state physics?
• Application of what we have learned heretofore (QM + StatMech) to the description of 'condensed matter', no new fundamental theory.

What are the goals of this lecture?
• Introduction to the basic concepts, meaning that the emphasis is, in the first instance, on the single-particle aspects.
• Service for Experimental Solid State Physics.
• Emphasis on the explanation of concepts and basic ideas, not always quantitative; justification of the use of simplified 'model Hamiltonians'.
• Raise some understanding of why many-body physics is mostly phenomenology.
• Convey the following main idea: (collective) elementary excitations are 'quasiparticles', characterized by their dispersion relation p → ε(p) and by certain quantum numbers like spin and charge. The two most important ones are 'the phonon' (= quantized lattice vibration) and 'the electron' (= quantized charge excitation of the solid, which has as much to do with the electron of elementary particle physics as water waves have to do with water).
The ratio of the earth mass m to the sun mass M , for instance, is about m/M = 1/3 × 10 −5 . Earth and sun exert equal but oppositely directed forces ±F onto each other, MẌ = F = −mẍ, if X and x are the position vectors of sun and earth, respectively. This means that sun experiences a much smaller average acceleration than the earth. Consequentially, as compared to the sun, the earth moves much faster and has a much larger orbit around the center of mass of the sun-earth system. In this case, as the mass ratio is so small, the center of mass lies inside the sun. Hence, to a very good approximation, the earth moves around the sun and follows it along its way through the universe. In a similar way we expect the electrons in a solid to follow the slower motion of the more massive ions. Translated into the language of quantum mechanics we expect that the joint motion of electrons and ions can be approximately described by a product of an ionic wave function times an electronic wave function calculated for fixed positions of the ions. The latter would be interpreted as a conditional probability amplitude for the electrons given the positions of the ions. The product structure would mimic the fact in probability theory that the joint probability p(A ∩ B) of two events A and B (corresponding to the wave function of electrons and ions) is equal to p(A|B)p(B), where p(B) is the probability of B (corresponding to the ions) and p(A|B) is the conditional probability of A given B (corresponding to the electronic wave function for fixed positions of the ions). We shall try to work out this idea more formally. Let us start with the 'electronic eigenvalue problem' which depends parametrically on the positions R of all ions. For every R the eigenstates ϕ n (r|R) n∈N of H el (R) corresponding to the eigenvalues ε n (R) form a basis of the electronic Hilbert space. Hence, every solution Ψ(r, R) of the full eigenvalue problem HΨ = EΨ can be expanded in terms of the ϕ n , Assuming the ϕ n to be known we want to derive an eigenvalue problem for the φ n which will later be interpreted as the ionic wave functions. For this purpose we insert (1.9) into the full eigenvalue problem and write the result as From here we obtain an equation for the φ n upon multiplication by ϕ * n (r|R) and integration over r, If there is no external magnetic field we may assume that the ϕ n are real. Then, using that we see that A nn (R) = 0. Introducing the notation we can therefore rewrite (1.11) in the form This equation still contains the full information about the ion system. The ions are now coupled through their Coulomb interaction (contained in ε n (R)) and through the electrons, whose degrees of freedom have been formally integrated out. Equation (1.15) will turn out to be an appropriate starting point for a perturbative analysis of the ion system with m/M j taken as a small parameter. Classically we have a general idea, how the time scale, e.g. for the motion of a particle of mass m with one degree of freedom in a potential V , depends on the mass. The Lagrangian of such a system is Denoting by x µ (t) a trajectory of a particle with mass µ, we see that is a trajectory of a particle of mass M . The motion slows down if M > m. Heavier particles (of the same energy) move slower. In particular, oscillators oscillate with lower frequency if their mass is enlarged. In quantum mechanics we have to consider the stationary Schrödinger equation, a timeindependent problem. 
Hence, we are rather interested in how the spatial behaviour of the eigenfunctions varies with mass. For the bounded motion around an equilibrium position described by a quadratic minimum of the potential we may recourse to the harmonic oscillator Comparing kinetic and potential energy in a similar scaling argument as in (1.2), we find the intrinsic length scale Comparing the extension L of an eigenfunction of a heavy particle of mass M with the extension of a lighter particle of mass m we obtain a ratio of This means that the larger the mass the more localized becomes the wave function. On the other hand, in a more localized wave function the particle is closer to the origin and the harmonic approximation is better justified. The same conclusions as above can be drawn from the solution of the eigenvalue problem of the harmonic oscillator (2.3). (i) Determine the full width at half height of the ground state wave function of the harmonic oscillator. How does it depend on the mass of the oscillator? (ii) Consider the oscillator with a quartic correction term where K > 0. Show that the correction can be taken into account perturbatively if the mass is large. For this purpose use as defined in (2.5) as a small parameter. Calculate the correction to the ground state energy in first oder perturbation theory. How does it depend on the mass of the oscillator? Let M be a typical ion mass, e.g. the arithmetic average of all ion masses M = {M j } . Then is an intrinsic length parameter for the bounded motion of the ions. We expect that the eigenvalue problem (1.15) has solutions which, on the scale of the electronic wave functions, are strongly localized around certain equilibrium positions R (0) . To take account of this expectation we introduce new coordinates u on the scale of the electronic wave functions and relative to this equilibrium position, setting For the wave functions we shall write φ n (u) = φ n (R) . (2.10) Using the new coordinates (2.9) we can make the κ dependence of the operators in (1.15) explicit. For this purpose we define the rescaled operators which remain finite for κ → 0. With these definitions the eigenvalue problem (1.15) assumes the form Recall that we assume that the wave functions φ n are strongly localized around R (0) , implying that the redefined functions φ n are strongly localized around u = 0. For this reason it makes sense to expand the operators in (2.12) which act on the functions φ n in a Taylor series in κu and to solve the resulting eigenvalue problem perturbatively for small κ. The latter means to look for solutions in the form of formal series in κ, Inserting these perturbation series into (2.12) and performing the Taylor expansion we obtain Here we compare the coefficients in front of the powers of κ order by order. To the order κ 0 we obtain Then (2.16) For n = we conclude for the higher orders in κ that The latter equation means that at order κ 3 ∼ 1/1000 the wave function Ψ(r|R) (1.9) of the coupled electron ion system ceases to be a simple product of two factors. For n = we find at order κ that Then, necessarily, For higher orders of κ we remain with the equation Conceiving this as an equation up to the order κ 2 , we see that the right hand side can be consistently neglected. 
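To get a feeling for the size of the expansion parameter, here is a minimal numerical sketch. It assumes κ = (m/M)^(1/4), as the surrounding discussion and the estimate κ³ ∼ 1/1000 suggest; the chosen elements are merely illustrative and not part of the original notes.

```python
# Minimal sketch: size of the Born-Oppenheimer expansion parameter
# kappa = (m_electron / M_ion)**(1/4) for a few representative ions.
# (Illustrative only; masses in atomic mass units, m_e = 5.485799e-4 u.)

M_E_U = 5.485799e-4  # electron mass in atomic mass units

ions = {"H": 1.008, "C": 12.011, "Si": 28.085, "Fe": 55.845, "Pb": 207.2}

for symbol, mass_u in ions.items():
    kappa = (M_E_U / mass_u) ** 0.25
    print(f"{symbol:>2}: m/M = {M_E_U / mass_u:.2e}, kappa = {kappa:.3f}")

# Output (roughly): kappa ranges from ~0.15 (hydrogen) down to ~0.04 (lead),
# i.e. kappa ~ 1/10 is only moderately small, as remarked below.
```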
(i) Within the scheme of the above perturbation theory the solutions φ n, , E n, of the eigenvalue problem consistently determine the eigenfunctions of the solid in product form up to the fourth order expansion of ε n (R) in κ and up to the zeroth order expansion of B nn (R) in κ. ϕ n (r|R) is interpreted as a conditional probability amplitude and φ n, (R) as an ionic wave function. The product form then means that the electrons follow the motion of the ions. (ii) The approximation (2.22), (2.23) is called the Born-Oppenheimer approximation and originated in molecular physics [2, 3] . Another common name is 'the adiabatic approximation'. (iii) Within the Born-Oppenheimer approximation the equilibrium positions of the ions are determined by the condition In solids the ions arrange themselves in regular lattices. (iv) Consistently up to the second order in κ the eigenvalue problem (2.22) takes the form This is called the harmonic approximation. Since only φ (0) is taken into account in the harmonic approximation, it is consistent to consider the electronic wave function only to lowest order ϕ(r|R (0) ) as well. In this approximation the electrons move in a static lattice determined by the equilibrium positions of the ions (like the earth was approximately moving around a static sun in our entrance example). (v) Notice that, in spite of the ratio of electron to ion mass being very small, our actual expansion parameter κ ∼ 1/10 is only moderately small. (vi) Our perturbative analysis becomes questionable, if the electronic levels are degenerate or close to degenerate (it is well justified for the electronic ground state of an insulator, but problematic for a metal). (vii) Many properties of solids can be understood qualitatively and often also quantitatively within the harmonic approximation. Most of the treatment of solids in this lecture will be based on it. It explains e.g. the scattering of light or neutrons, the propagation of sound and the specific heat. (viii) An example of an effect which cannot be explained within the harmonic approximation is the thermal extension of a solid. It is a higher order effect and therefore small. Still it can be understood within the adiabatic approximation if we proceed to the fourth order expansion of ε n (R). A heavy particle (M ) and a light particle (m M ) move inside an infinitely high potential well of width L. The particles experience an attractive interaction described by the potential W (r − R) = −λ · δ(r − R), where λ > 0 and R and r are the positions of the heavy and light particle, respectively. Calculate the spectrum of the corresponding Hamiltonian in Born-Oppenheimer approximation: (i) In step one transform the Hamiltonian into a dimensionless form and neglect the kinetic energy of the heavy particle. Determine the (unnormalized) eigenfunctions and an equation for the energy eigenvalues ε(R) of the light particle. It may turn out to be useful to distinguish the cases ε < 0, ε = 0 and ε > 0. Recall the relation for the jump of the derivative of the wave function caused by the δ-function. (ii) In step two sketch the energy ε(R) of the light particle for the ground state and for the first excited state as a function of the position of the heavy particle. What is the meaning of ε(R) and how should we include (qualitatively) the energy levels of the heavy particle into our picture? 
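Before moving on, a compact summary of the scheme may be helpful. The displayed equations referred to as (2.22), (2.23) did not survive in this copy; a hedged reconstruction, with ν denoting the ionic quantum numbers and the coupling terms B_nn' neglected, is:

```latex
% Hedged reconstruction of the Born-Oppenheimer scheme summarized above.
% Electronic problem at fixed ion positions R:
\[
  H_{\mathrm{el}}(R)\,\varphi_{n}(r|R) = \varepsilon_{n}(R)\,\varphi_{n}(r|R).
\]
% Product ansatz for the full eigenfunctions (the electrons follow the ions):
\[
  \Psi(r,R) \simeq \varphi_{n}(r|R)\,\phi_{n,\nu}(R).
\]
% Effective ionic eigenvalue problem (natural units of the notes), with the
% electronic energy acting as an effective potential for the ions:
\[
  \Bigl[-\sum_{j}\frac{m}{M_{j}}\,\frac{\Delta_{R_{j}}}{2}
        + \varepsilon_{n}(R)\Bigr]\phi_{n,\nu}(R)
  = E_{n,\nu}\,\phi_{n,\nu}(R).
\]
```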
Within the adiabatic approximation the electrons and ions form bound states, molecules or solids, in which the ions oscillate around the minima R (0) of the effective potentials ε n (R) determined by the mutual Coulomb interaction of the ions and the energy eigenvalues of the electrons for fixed ion positions. We derived this statement in the previous lecture presupposing that such minima exist. The argument would have been more convincing if we would have been able to prove the existence of minima starting with the Hamiltonian (1.1). Such undertaking seems to be out of reach with our current methods. Still, in nature, atoms always form bound states at low enough temperatures (the only notable exception being helium which condenses into a 'super fluid' before solidification). If many atoms are put together in stoichiometric ratios they form periodic structures called crystals. Crystals are formed, since (i) two atoms prefer a certain binding length and (ii) no direction is preferred in the large (this is very much like packing balls into a box and carefully shaking it such that the balls find their equilibrium positions). In this and in the following lecture we introduce the terminology and certain mathematical structures needed for the description of crystal lattices. The most important structural feature of a crystal is that it can be thought of as being generated by periodic repetitions of a finite elementary structure, the unit cell, in three space directions. Let a 1 , a 2 , a 3 ∈ R 3 , a 1 , a 2 × a 3 = det(a 1 , a 2 , a 3 ) = 0. Then In the examples and exercises we will frequently work with d = 1 and d = 2. Remark. We shall develop part of the theory of infinite crystals which is expected to give a realistic description, if the ratio of the number of ions at the surface to the number of ions in the bulk is small. For a typical macroscopic total number of L ∼ 10 24 ions, of the order of L 2/3 ∼ 10 16 of them are at the surface, and the ratio is ∼ 10 −8 . Bravais lattice vectors a j , j = 1, 2, 3, that generate the Bravais lattice as an Abelian group with respect to vector addition are called primitive (lattice) vectors. They are not unique. Lemma 1. Let a j , j = 1, 2, 3, a set of primitive vectors of a Bravais lattice. Then m ij a j , i = 1, 2, 3, primitive ⇔ m ij ∈ Z and | det m| = 1 . Since | det m| = 1 it follows that the volume of the parallelepiped spanned by a set of primitive vectors, is independent of their choice. By definition, the unit cell of a crystal with Bravais lattice B is a simply connected finite volume (of size V u ) which covers R 3 through translation by B. We often use a special unit cell, the Wigner-Seitz cell, which reflects the symmetries of the Bravais lattice, and is defined as Geometrically this is the set of points in R 3 for which the closest Bravais lattice point is the origin. It can be constructed by drawing lines from the origin to the neighbouring sites in the Bravais lattice and erecting the perpendicular bisectors on these lines. Exercise: Draw the Wigner-Seitz cells for 2d Bravais lattices composed of equilateral squares and triangles. The reciprocal lattice which we shall define now is a lattice which, in a sense, is dual to the Bravais lattice. It is one of the most important notions in solid state physics and will accompany us throughout this lecture. 
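As a preview of the construction that follows, here is a minimal numerical sketch of how the reciprocal lattice vectors can be obtained from a set of primitive vectors; the concrete example vectors are illustrative and not taken from the notes.

```python
import numpy as np

# Minimal sketch: primitive vectors b_j of the reciprocal lattice from
# primitive vectors a_j of a Bravais lattice, via <a_i, b_j> = 2*pi*delta_ij.

def reciprocal_vectors(a1, a2, a3):
    """Return b1, b2, b3 with  a_i . b_j = 2*pi*delta_ij."""
    volume = np.dot(a1, np.cross(a2, a3))      # volume of the unit cell
    b1 = 2 * np.pi * np.cross(a2, a3) / volume
    b2 = 2 * np.pi * np.cross(a3, a1) / volume
    b3 = 2 * np.pi * np.cross(a1, a2) / volume
    return b1, b2, b3

# Example: a simple hexagonal lattice with lattice constants a and c.
a, c = 1.0, 1.6
a1 = a * np.array([1.0, 0.0, 0.0])
a2 = a * np.array([-0.5, np.sqrt(3) / 2, 0.0])
a3 = c * np.array([0.0, 0.0, 1.0])

b1, b2, b3 = reciprocal_vectors(a1, a2, a3)

# Check the defining relation <a_i, b_j> = 2*pi*delta_ij.
A = np.array([a1, a2, a3])
B = np.array([b1, b2, b3])
print(np.allclose(A @ B.T, 2 * np.pi * np.eye(3)))   # True
```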
Given a Bravais lattice B and a set of primitive vectors a j , j = 1, 2, 3, generating it, we would like to know how to decompose any x ∈ R 3 with respect to the a j , i.e., we would like to know its coordinates with respect to the basis {a 1 , a 2 , a 3 }. Suppose that b j , j = 1, 2, 3, exist such that is called the reciprocal lattice (associated with the Bravais lattice B). It is easy to solve (3.4) for the b j . For this purpose we rewrite it in matrix form where I 3 is the 3 × 3 unit matrix, and use Cramer's rule, Without restriction of generality we have assumed here that det(a 1 , a 2 , a 3 ) > 0. (i) Involutivity. Equation (3.4) implies that B = B. The reciprocal of the reciprocal lattice is the original Bravais lattice. (ii) Brillouin zone. The Wigner-Seitz cell of the reciprocal lattice is called the (first) Brillouin zone. It plays an important role in solid state physics. (iii) Volume of the Brillouin zone. Fix a set of primitive reciprocal lattice vectors b j and denote the volume of the parallelepiped spanned by these vectors by Taking the determinant on the left and right hand side of the first equation (3.7) we see that is the volume of the unit cells of the reciprocal lattice, which is the same as the volume of the Brillouin zone. (iv) Lattice planes. A lattice plane S ⊂ R 3 associated with a Bravais lattice B is a plane for which ∃ a 1 , a 2 , a 3 ∈ B primitive, ∃ ∈ Z such that x ;m,n = a 1 + ma 2 + na 3 ∈ S , ∀ m, n ∈ Z . (3.11) Thus, for every lattice plane there is a primitive reciprocal lattice vector b 1 ∈ B and an ∈ Z such that (3.12) holds. If we vary in (3.12) we obtain a family of equidistant lattice planes. Conversely, given any primitive vector b 1 ∈ B and any ∈ Z, the plane S defined by (3.12) is a lattice plane. Thus, families of lattice planes are in one-to-one correspondence with primitive vectors of the reciprocal lattice. (v) Miller indices. Lattice planes play an important role in the spectroscopy of solids, We shall see in the course of this lecture that waves impinging on a crystal are reflected as if they were reflected by families of lattice planes. In spectroscopy the families of lattice planes are usually labeled by the so-called Miller indices defined relative to a fixed triple {b 1 , b 2 , b 3 } of primitive vectors of the reciprocal lattice. According to lemma 1 every primitive vector b ∈ B can be uniquely presented as an integer linear combination Let m j = 0, j = 1, 2, 3, the Miller indices of a family of lattice planes. Convince yourself that they indicate in which three points the lines along the a j directions cut the lattice plane with = 1 in (3.12). Note that in the excluded cases with one or two of the m j being equal to zero there is no intersection in the corresponding direction. The lines are parallel to the lattice plane. Many quantities that characterize the state of a crystal have the same periodicity as the crystal itself. For this reason we will often have to deal with periodic functions whose periods are the primitive vectors of a Bravais lattice. These are often conveniently described by their Fourier series. Let us recall Fourier series in one spatial dimension. In this case the unit cell is necessarily an interval [0, a], where a > 0 is called the lattice spacing or lattice constant, and the primitive vector is equal to a. A lattice periodic function f : R → C is a function satisfying A natural basis for the expansion of such functions can be constructed as follows. 
Let ϕ ∈ [0, 2π) and z = e iϕ = e i(ϕ+2π) = e i 2π we see that ab = 2π (3.18) implying that b generates the reciprocal lattice, and that the functions are linear independent and periodic with period a. If the series converges uniformly, f is periodic with period a and continuous, and By means of the latter formula we can associate a sequence of Fourier coefficients (A kn ) n∈Z and a Fourier series (3.20) with every complex valued function f that is integrable on [0, a]. In the following we will understand Fourier series in such a formal sense. But when we will be dealing with concrete examples, we will have in mind that the convergence properties of Fourier series are a delicate matter and that, in general, neither uniform nor pointwise convergence is guaranteed. In order to construct Fourier series that have the periodicity defined by a Bravais lattice B, we fix a set of primitive vectors {a 1 , a 2 , a 3 } and a set of reciprocal vectors and the monomials are linear independent and periodic with all a ∈ B being periods. In analogy with the 1d case we may define k mn = b 1 + mb 2 + nb 3 and the Fourier series where and the integral is over a unit cell U of volume V u . With these remarks on Fourier series we have obtained an interpretation of the reciprocal lattice. The reciprocal lattice is a lattice in Fourier space dual to the real space Bravais lattice. L4 Crystal symmetries 4 We define a (physical) crystal lattice as the set M of the positions (measured as expectation values) of the ions of a solid in its ground state. The translational symmetry of the crystal (lattice) is described by the corresponding Bravais lattice. In general there are several ions in a unit cell of the Bravais lattice. The positions of the ions inside a unit cell determine the so-called lattice basis. If every unit cell contains only one ion, the crystal is called simple. In this case it can be identified with its Bravais lattice. A crystal that is not simple is called a crystal with basis. The Euclidean group is the group of all maps R 3 → R 3 which leave the distance be- The symmetry group R of a crystal M is defined as i.e., as the subgroup of E(3) that leaves M invariant. If we identify b with (id, b) for every b ∈ B, the Bravais lattice of M , we see that B is a subgroup of R (since it leaves M invariant and is a group). Remark. (id, a) ∈ R implies that a ∈ B, but (A, a) ∈ R, A = id does not imply that a ∈ B. In crystals with basis smaller translations may occur which are parts of so-called glide reflections or screw rotations. For an example see Figure 1 . For a crystal M with symmetry group R define Proof Remark. In general, R 0 M = M , i.e. the point group of M does not necessarily leave M invariant. We have seen in the proof of lemma 2 that b ∈ B, (A, a) ∈ R ⇒ Ab ∈ B. Since, A ∈ R 0 , it follows that R 0 B ⊂ B, B is invariant under R 0 . This has two important implications: For point groups of the second kind we distinguish point groups containing i from point groups not containing i. (iii) Since b ∈ B ⇒ −b ∈ B for every Bravais lattice vector, the symmetry groups of the Bravais lattices are point groups of the second kind which do contain i. (iv) Which rotations are possible? The fact that any point group must leave a Bravais lattice invariant restricts the possible rotation angles. 
It means that every rotation D ∈ R 0 must map primitive vectors on vectors in B, Combining (4.7) and (4.8) we conclude that allowed angles ϕ must satisfy the condition 2 cos(ϕ) ∈ Z, or Thus, the only admissible values of ϕ ∈ [0,2π) are (4.10) The corresponding rotation axes are called 6-fold, 4-fold, 3-fold, 2-fold. (v) Point groups do not only contain only rotation axes of finite order, they are also finite groups. Comparison with the known finite subgroups of SO(3) leaves 11 point groups of the first kind compatible with (4.10). From these we can construct altogether 32 point groups. Their number is, in particular, finite. The full symmetry groups R of crystals M are discrete subgroups of E(3) which contain a Bravais lattice B as a normal subgroup. Their number is finite as well. In mathematics (and crystallography) such groups are called space groups. They have been completely classified and can be described by symmetry elements like rotations, reflections, glide reflections and screw rotations. The table gives an overview over the group theoretic classification of the Bravais lattices and crystals. (ii) Show that the reciprocal lattice of the face-centered cubic lattice with lattice constant a is a body-centered cubic lattice with lattice constant 4π/a. By definition the lattice constant a is the edge length of the cube which envelops the unit cell of the face-centered cubic lattice with primitive vectors a 1 = a 2 (e y + e z ) , a 2 = a 2 (e x + e z ) , a 3 = a 2 (e x + e y ) . e x , e y , e z is the canonical orthonormal basis of R 3 . The corresponding primitive vectors for the body-centered cubic lattice with lattice constant a are a 1 = a 2 (−e x + e y + e z ) , a 2 = a 2 (e x − e y + e z ) , a 3 = a 2 (e x + e y − e z ) . (iii) Find the Miller indices (m 1 , m 2 , m 3 ) of that plane of the face-centered cubic lattice which has the highest density of lattice points. Here it may be helpful to use the connection between the density and the reciprocal lattice vector k. For a set of primitive vectors {a 1 , a 2 , a 3 } ∈ B define the corresponding shift operators U a j , acting on a single-particle space of states, by Crystal system Group primitive base-centered body-centered face-centered Cubic O h Figure 2 : The crystal systems and Bravais classes (from Wikipedia, the free encyclopedia). A parallelepiped representing the point-group symmetry of a Bravais lattice has six parameters, three lengths of its edges and three angles. If all edge lengths and all angles are mutually distinct, the symmetry is minimal (only the inversion). This is the triclinic crystal class in the table. Considering all possible degeneracies (two right angles, to equal edge lengths etc.) one runs through all the listed symmetry classes, the crystal systems. Some of them can be realized by several Bravais lattices, giving the different Bravais classes. The second column in the table contains the name of the point group in so-called Schönflies notation. Then For any Bravais lattice vector R = a 1 + ma 2 + na 3 the operator is therefore uniquely defined and naturally acts as on single-particle wave functions. Equation (5.2) implies that the U a j have a joint system of eigenfunctions. If Ψ is such an eigenfunction, then Thus, for every common eigenfunction Ψ of the three generators U a j of lattice translations ∃ k ∈ R 3 such that (5.6) holds for all R ∈ B. 
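The defining relations of the shift operators, referred to above as (5.1)-(5.6), are missing from this copy; a hedged reconstruction (up to sign conventions for the shift) reads:

```latex
% Reconstructed sketch of the shift-operator relations (5.1)-(5.6).
\[
  \bigl(U_{a_j}\psi\bigr)(x) = \psi(x + a_j), \qquad
  U_{R} = U_{a_1}^{\ell}\,U_{a_2}^{m}\,U_{a_3}^{n}
  \quad\text{for } R = \ell a_1 + m a_2 + n a_3 \in B,
\]
% so that (U_R psi)(x) = psi(x + R). Since the U_{a_j} commute, they possess a
% joint system of eigenfunctions, and for every such eigenfunction there is a
% vector k in R^3 with
\[
  U_{R}\,\Psi = \mathrm e^{\,\mathrm i\langle k, R\rangle}\,\Psi
  \qquad\text{for all } R \in B.
\]
```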
The vector k is a triple of quantum numbers characterizing the eigenstates of the lattice translation operators in very much the same manor as the momentum p is a triple of quantum numbers that characterize the eigenstates of the operator of infinitesimal translations, the momentum operator. For this analogy k is called the lattice momentum. Let g ∈ B, R ∈ B. Then g, R = m2π for some m ∈ Z and e i k+g,R = e i k,R . This means that k and k + g characterize the same eigenstate of the lattice translation operator, or that the lattice momentum is defined only modulo reciprocal lattice vectors. For this reason we may restrict the domain of definition of k to any unit cell of the reciprocal lattice. This domain is conventionally taken as the first Brillouin zone, which explains the importance of the latter. In the language of group theory the lattice momenta k label the irreducible representations of the Bravais lattice. Since the Bravais lattice is an Abelian group, all of its irreducible representations must be one-dimensional. They act by multiplication with complex numbers as can be seen in equation (5.6) . The Hamiltonian of the solid (1.1) is invariant under any infinitesimal translation. For this reason the center of mass momentum of the solid is conserved. In nature the translation symmetry is affected by a mechanism called spontaneous symmetry breaking. After separating the center of mass motion the ground state of the Hamiltonian (1.1) is less symmetric than the Hamiltonian itself. Instead of the full translation symmetry it exhibits a discrete translation symmetry with an underlying Bravais lattice B. Effective Hamiltonians describing the dynamics above the ground state again have the reduced symmetry described by a Bravais lattice. The simplest example for a class of such effective Hamiltonians is the Hamiltonian a single electron in a lattice periodic potential. The study of this class of Hamiltonians is called band theory. We will have a closer look at it below. In any case, single-particle Hamiltonians H which are invariant under the action of a Bravais lattice, play an important role in solid state physics. As we have seen, their wave functions can be labeled by the lattice momentum quantum numbers. This is the statement of Bloch's theorem. Theorem 1. Bloch [1] . The eigenfunctions of a single-particle Hamiltonian H, periodic with respect to a Bravais lattice B, can be labeled by lattice momenta k ∈ BZ, where BZ ⊂ B is the Brillouin zone associated with B. An eigenfunction Ψ k of H then has the following properties with respect to translations by Bravais lattice vectors, Let Ψ k,α an eigenstate with lattice momentum k of a lattice periodic Hamiltonian H. Here we denote all other quantum numbers needed to specify the state by α. According to Bloch's theorem u k,α (x) = e −i k,x Ψ k,α (x) is a lattice periodic function, u k,α (x + R) = u k,α (x). This implies the following corollary to Bloch's theorem. Corollary 1. The eigenfunctions of a lattice periodic single-particle Hamiltonian H are of the form where k ∈ BZ is a lattice momentum vector and u k,α is a lattice periodic function. Hence, we may think of the eigenfunctions of a lattice periodic Hamiltonian as of amplitude-modulated plane waves, for which the modulation has the periods of the corresponding Bravais lattice. For the calculation of thermodynamic quantities in the framework of statistical mechanics (in particular) it is necessary to count states. 
For this reason we prefer systems of finite size which have a discrete spectrum. In solid state physics this can be enforced by introducing 'boundaries' (by putting the system into a box). After having counted the states one considers the limit, when the systems size goes to infinity (the thermodynamic limit). In general, boundaries are incompatible with lattice translations induced by a Bravais lattice. They break the translational symmetry and invalidate Bloch's theorem. A way out of this dilemma is by employing periodic boundary conditions. For a d-dimensional systems periodic boundary conditions can be realized by starting with a parallelepiped and identifying opposite faces. This amounts to bending the parallelepiped to a torus in d + 1 dimensions. For this reason periodic boundary conditions are also sometimes called toroidal boundary conditions. Imposing periodic boundary conditions on a 1d system of L sites with lattice constant a we find for a state with lattice momentum k that For this to hold, the lattice momentum must be restricted to the values where m ∈ Z in such a way that k lies in the Brillouin zone. The reciprocal lattice is generated by b = 2π/a. The Brillouin zone is the interval BZ = [−π/a, π/a), and k ∈ BZ ⇔ −L/2 ≤ m < L/2. Thus, for every L ∈ N there are L inequivalent ks in the Brillouin zone. The argument is similar in any number of dimensions. In order to obtain the lattice momentum quantization condition, e.g. in 3d, we expand k in a basis of primitive vectors of the reciprocal lattice, k = v 1 b 1 +v 2 b 2 +v 3 b 3 . Then, for a state Ψ k of lattice momentum k, i.e., there are L 3 inequivalent lattice momenta in the first Brillouin zone. Let us rephrase this statement in the following form. Lemma 4. There are as many lattice momenta in the Brillouin zone that are compatible with periodic boundary conditions as unit cells in the crystal. Remark. In 3d periodic boundary conditions cannot be physically realized. However, the density of states of a macroscopically large system (∼ 10 23 particles) is practically independent of the boundary conditions. As long as we are not interested in the boundaries themselves, periodic boundary conditions are justified. Remark. Mathematically periodic boundary conditions imply that we are dealing with functions that are periodic with periods of the large parallelepiped spanned by La 1 , La 2 , La 3 . Hamiltonians must be defined in a way that is compatible with this periodicity. On the corresponding space of states the generators U a j of translations by primitive vectors turn into generators of the cyclic group of order L, which equivalently might have served as a starting point for introducing periodic boundary conditions. L6 Phonons -spectrum and states In lecture L2 we have discussed the Born-Oppenheimer approximation. We have seen that, in this approximation, the motion of the much heavier ions decouples from the motion of the electrons up to fourth order in an expansion in the deviations u of the ion positions from their equilibrium values R (0) . In most of this lecture we shall be dealing with ideal solids. By definition, the equilibrium positions of the ions R (0) in an ideal solid are the points of a crystal lattice. Solids in nature can be very close to ideal. The idealization of a perfect crystal is a good starting point to describe real solids. Consider a crystal with Bravais lattice B and N ions per unit cell. 
In such a crystal it makes sense to label the coordinates of the vector u as u α (R), where R ∈ B and Here r = 1, . . . , N counts the ions in a unit cell, and j = x, y, z denotes their Cartesian coordinates. For the dimensionless ion masses (cf. Section 2.3) we introduce the notation Then the operator T u , equation (2.11a), of the kinetic energy of the ions takes the form We shall treat the motion of the ions within the harmonic approximation (2.25). This will allow us to develop a rather simple and general theory which nevertheless describes many of the experimental observations quite accurately. Within the harmonic approximation the potential energy of the ions can be written as is the so-called force matrix (see (2.26)). Thus, the Hamiltonian of the lattice vibrations in harmonic approximation is It is a quadratic form in the position operators of the ions and the corresponding derivatives. We will diagonalize this quadratic form. This will reduce the spectral problem of the Hamiltonian to the spectral problem of independent harmonic oscillators describing the quantized normal modes of the ideal harmonic solid. In order to control the number of the normal modes, we will employ periodic boundary conditions as introduced in the previous lecture. The space spanned by such functions is a finite dimensional vector space. For the action of U a j on this space there is an L ∈ N such that U L a j = id, j = 1, 2, 3. Thus, if ω j is an eigenvalue of U a j we must have |ω j | = 1 (because of unitarity) and ω L j = 1. It follows that ω j = e im j 2π L = e i k,a j (6.8) As we have seen in the previous lecture there are altogether L 3 such vectors. It is easy to find the corresponding eigenfunctions of the shift operators. If v k is an eigenfunctions with lattice momentum k, and R = a 1 + ma 2 for all R ∈ B. This determines v k up to normalization. We fix the normalization by setting All joint eigenfunctions of the U a j are of this form, and all these functions are joint eigenfunctions of the U a j . Hence, they form a basis of on the space of complex valued functions on B. Remark. (i) We have just invented the (discrete) Fourier transformation. (ii) With the choice (6.10) the usual Hermitian scalar product of two eigenfunctions takes the values for any two k, q ∈ BZ. We will first of all diagonalize the force matrix K αβ (R, S) defined in (6.5). For this purpose and for later use as well we list its main properties. (i) Symmetry. From its very definition as a second derivative matrix and from the commutativity or the partial derivatives we get at once that (ii) Translation symmetry. Inserting here a = −S we obtain where the second equation is a definition. The meaning is that forces between ions depend only on the relative positions of unit cells. (iii) In crystal lattices with inversion symmetry we have in addition that Combining this with (6.12) we see that the force matrix in crystal lattices with inversion symmetry is symmetric in every unit cell, The force matrix defines an operator on the space of functions with periodic boundary conditions on B, It is easy to see that K αβ commutes with the shift operators U a j , j = 1, 2, 3, Hence, U a j and K αβ possess a joint system of eigenfunctions. 
Since the v k , equation (6.9), form already an orthonormal basis of non-degenerate eigenfunctions, they must be also eigenfunctions of K αβ , In order to block diagonalize the quadratic form (6.4) we expand the displacements u α (R) into their Fourier modes, Inserting (6.20) into (6.4) and making use of (6.19), (6.21) we obtain the block diagonal form of the potential energy. Let us now apply the same transformation to the kinetic energy operator. First of all (6.24) It follows that Here we have used (6.11) in the second equation. We see that the lattice Fourier transformation has diagonalized T u . In the next step we want to completely diagonalize the force matrix while keeping the diagonal form of the kinetic energy operator. To achieve the latter goal, we first rescale the complex coordinates, setting (6.26) Further defining we obtain the following form of the Hamiltonian (6.6), (6.28) Before we can proceed we have to understand the properties of the matrix κ(q). Here we have used the symmetry of the force matrix in the third equation. (ii) κ(q) is non-negative. This follows, since the potential energy V is assumed to have a total minimum for u = 0 with V = 0. Hence (iii) κ(q) and κ(−q) are similar matrices. First of all since the force matrix is real. Then also According to (i) and (ii) the matrix κ(q) can be diagonalized by a unitary transformation and has a non-negative spectrum {ω 2 α (q)} α∈I for every q ∈ BZ. Let {y α (q)} α∈I the set of corresponding orthonormal eigenvectors. Then implying that {ω 2 α (q)} α∈I is the spectrum of κ(−q). Hence, κ(q) and κ(−q) are similar matrices. Since, on the other hand, by definition of the eigenvectors and eigenvalues, the identification (which is one possible choice of indexing the eigenvectors of κ(−q) once the eigenvectors of κ(q) are given) implies that The lectures on lattice vibrations will be accompanied by a set of exercises on the classical harmonic chain with broken translation invariance. We shall study the influence of fixed and open boundary conditions and of a mass defect. There is a good deal to learn from these exercises, namely something about the irrelevance of the boundary conditions as far as bulk thermodynamic properties are concerned, but also something about the typical effects of impurities, such as the appearance of localized states and impurity levels inside the band gap. In this exercise we shall consider the particular case of equal masses m 1 = . . . = m N = m. We want to analyze the harmonic chain by the so-called transfer matrix method. Upon slight modifications, it is possible to treat the different boundary conditions in a similar way. (ii) Show, that, with the substitution ψ(n) = x n , ϕ(n) = ψ(n − 1), the eigenvalue problem in (i) can be reformulated as In the periodic case L n (Ω 2 ) is independent of the site index. Calculate the eigenvalues and the corresponding eigenvectors of L(Ω 2 ). Because L(Ω 2 ) acts like a translation operator, it is useful to write the eigenvalues in the form e ±iκ . Diagonalize the equation How can Ω 2 be expressed in terms of κ? (iii) The periodic boundary conditions turn into ψ(N + 1) = ψ(1) and ϕ(N + 1) = ϕ(1). From this determine all possible eigenfrequencies ω! (iv) Which modification is required for fixed boundaries? Determine all possible eigenfrequencies ω in this case. (v) Show that the modifications necessary for open boundaries lead to the equation and, because of (6.35), has the property that while for the kinetic term Here we have used (7. 3) in the last equation. 
Altogether, we have transformed H into a diagonal quadratic form, The term in the bracket can be interpreted as the Hamiltonian of a 1d harmonic oscillator with 'complex coordinates'. Before going on we have to discuss the question whether the functions ω α (q) can be zero. What we can say is that there are always at least three values of α for which ω α (0) = 0. These special 'modes' are connected with the center of mass motion of the solid. Their existence can be inferred from the translation invariance of the Hamiltonian (6.6) which is inherited from the full Hamiltonian (1.5) of the solid. For (6.6) translation invariance means invariance under the transformation for all j ∈ R and every (r, j) ∈ I. This transformation leaves the kinetic energy (6.3) trivially invariant. For the potential energy (6.4) we infer that for arbitrary u (s, ) (S) ∈ R. Here we have used the symmetry (6.12) of the force matrix in the second equation. Setting all but one of the displacements equal to zero and this one equal to one we obtain the relation for the force matrix, which holds for all (s, ) ∈ I and j = x, y, z. On the other hand Setting α = (r, j), β = (s, ) summing over r and using (7.10) we conclude that for all (s, ) ∈ I and j = x, y, z. Thus, there are three independent linear relations between the rows of the matrix κ(0) which therefore has at least a threefold eigenvalue zero. We may order the spectrum of κ αβ (0) in such a way that the corresponding eigenvectors are , η 0 . Using this notation, the Hamiltonian (7.7) splits into can be interpreted a the kinetic energy of the center of mass motion of the crystal. The center of mass motion of the crystal is unbounded. If there were any other 'zero modes', i.e., a higher than threefold degeneracy of the eigenvalue zero, then there would be another eigenvector y α (q) corresponding to unbounded motion. This would necessarily involve an unbounded relative motion of different parts of the crystal, meaning that the crystal would disintegrate. In the following we shall exclude this possibility and concentrate on stable crystals. We shall also discard the center of mass motion. Then we remain with the Hamiltonian of the proper lattice vibrations which we denote Here we have introduced the notation The subindex 'ph' refers to 'phonon' which is the name of a quantized normal mode of the lattice. To accomplish a complete diagonalization of the Hamiltonian H ph we introduce the operators for all (q, α) ∈ Q. They satisfy the commutation relations (exercise: check it!) The latter equation implies that whenever ω α (q) = 0. Here we have used the evenness of the functions ω α , equation (6.36), and the commutation relations (7.18). Inserting (7.20) into (7.15) and using once more that ω α is an even function of q we arrive at Thus, H ph is decomposed into a sum of independent harmonic oscillators. Going step by step backwards, we express the creation and annihilation operators of the phonons in term of the original displacement variables and their associated momentum operators We obtain Inserting the latter two equations into the definitions (7.17) of the annihilation and creation operators we obtain In this form it is obvious that a α q and a + α q are mutually adjoint operators. Ψ 0 is the ground state, since it is the ground state for every 1d harmonic oscillator in the sum (7.21). 
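For reference, the diagonal form referred to as (7.21), together with the Heisenberg algebra (7.18), presumably reads as follows (a hedged reconstruction in the dimensionless units used here; Q excludes the zero modes of the centre-of-mass motion):

```latex
% Reconstructed sketch of the diagonal phonon Hamiltonian (7.21) and the
% commutation relations (7.18).
\[
  H_{\mathrm{ph}}
  = \sum_{(q,\alpha)\in Q} \omega_{\alpha}(q)
    \Bigl(a^{+}_{\alpha q}\,a_{\alpha q} + \tfrac12\Bigr),
  \qquad
  \bigl[a_{\alpha q},\,a^{+}_{\beta q'}\bigr]
  = \delta_{\alpha\beta}\,\delta_{q q'} .
\]
```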
Comparing (7.26) and (7.15 ) and recalling the original definition (6.4) of the harmonic potential, we obtain the ground state wave function as a function of the displacements u of the ions, which is a natural generalization of the 1d case. A general phonon state is generated by the multiple action of phonon creation operators a + α q on the ground state which, for this reason, is also sometimes called the phonon vacuum (exercise: repeat the construction of excited states for the 1d harmonic oscillator based on the Heisenberg algebra (7.18)). Such states are parameterized by maps Q → N 0 , (q, α) → n α q . Accordingly we shall denote them as It follows from the commutation relations (7.18) that The eigenvectors y α (q) of κ(q), together with the corresponding eigenfrequencies ω α (q), determine the creation operators a + α q , equation (7.25b), since Y β α (q) = (y α ) β . In analogy with the expression for the quantized electro-magnetic field we shall call them polarization vectors. Let us summarize the insight we have gained so far in the following Theorem 2. In order to obtain the spectrum (7.31) and the eigenstates (7.29), (7.30) of the vibrational motion of the ions in a solid in harmonic approximation, it suffices to calculate the dispersion relations ω α (q) and the polarization vectors y α (q). For this purpose one first calculates the matrix and then its eigenvectors y α (q) and eigenvalues ω 2 α (q). The input here is the force matrix. In applications it comes from quantum chemical calculations or from simple heuristic models. Note that in (7.32) every matrix element κ αβ (q) is represented as a (finite) Fourier series (cf. section 3.5) defining it as a periodic function in reciprocal space with periods in B. We expect the forces between ions to decay rapidly with distance and the convergence of the Fourier series (7.32) in the thermodynamic limit to be uniform, implying that the limit function is differentiable in q. To start with we reconsider the harmonic chain within the framework of the general theory. This is a 1d problem with one ion per unit cell, thus no indices α, β are required and µ = 1. Denoting the lattice spacing by a we obtain for the quantized lattice momenta q and the Bravais lattice vectors R. The model force matrix is where we have to keep the periodic boundary conditions in mind. This is a 1 × 1 matrix. The polarization vector is y = 1 and which is called the dispersion relation of the harmonic chain with nearest-neighbour interactions. The classical model for this configuration is given by the equations of motion are the dimensionless masses and α 1 , α 2 dimensionless force constants (see Figure 3 ). The corresponding force matrix is It follows that and As it should be, this is a Hermitian 2 × 2 matrix. We have to calculate its eigenvalues and eigenvectors. For the eigenvalues λ ± of a 2 × 2 matrix A we have the general formula (exercise: For the matrix κ(q) we calculate It follows that Recalling that 4xy ≤ (x + y) 2 ⇔ 0 ≤ (x − y) 2 for all x, y ∈ R we may conclude that as it must be for ω 2 ± (q) to be real. The two branches ω ± of the dispersion relation are sketched in Figure 4 . Note that the lower branch is going to zero linearly as q goes to zero, a . (8.14) This is the limit of long wave lengths. For this reason the branch is called the acoustic branch, v s is the sound velocity. The branch ω + is called the optical branch. 
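The two branches just introduced are easily evaluated numerically. Below is a minimal sketch (illustrative masses and force constants, not taken from the notes) that diagonalizes the 2 × 2 matrix κ(q) of the diatomic chain and extracts the sound velocity and the band gap.

```python
import numpy as np

# Minimal sketch: the two phonon branches of the linear chain with two atoms
# per unit cell, obtained by diagonalizing the 2x2 matrix kappa(q).
# Illustrative parameters; conventions for the off-diagonal phase may differ.

a = 1.0                      # lattice constant
m1, m2 = 1.0, 3.0            # dimensionless masses
alpha1, alpha2 = 1.0, 1.0    # force constants of the alternating springs

def kappa(q):
    """Hermitian 2x2 dynamical matrix of the diatomic chain."""
    off = -(alpha1 + alpha2 * np.exp(-1j * q * a)) / np.sqrt(m1 * m2)
    return np.array([[(alpha1 + alpha2) / m1, off],
                     [np.conj(off), (alpha1 + alpha2) / m2]])

qs = np.linspace(-np.pi / a, np.pi / a, 2001)
branches = np.array([np.sort(np.linalg.eigvalsh(kappa(q))) for q in qs])
omega = np.sqrt(np.clip(branches, 0.0, None))   # omega_-(q), omega_+(q)

# Sound velocity from the acoustic branch at small q, and the band gap
# between the top of the acoustic and the bottom of the optical branch.
i0 = np.argmin(np.abs(qs - 0.01))
v_s = omega[i0, 0] / qs[i0]
gap = omega[:, 1].min() - omega[:, 0].max()
print(f"sound velocity ~ {v_s:.3f}, band gap = {gap:.3f}")
```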
The normal modes at small q in the acoustic branch correspond to motions when the two atoms in the unit cell move in phase, while small values of q in the optical branch correspond to motions when the two atoms move against each other. This can be seen by looking at the polarization vectors, which we leave as an exercise. As we see in Figure 4 the frequencies in the optical branch are higher that in the acoustic branch. In real solids typical optical branches correspond to frequencies in the infrared. The numbers are called the band width of the optical and acoustic branches. They quantify the ranges of available frequencies or bands. In real solids, like in our Figure 4 , optical bands have typically smaller band widths than acoustic bands. The number is called the band gap. It corresponds to a range of 'forbidden frequencies'. As we shall see the existence of bands and band gaps explain many of the characteristic feature of solids observed in experiments. It is interesting to see, how the monoatomic chain of example 1 is recovered for α 1 = α 2 = α and m 1 = m 2 . For this special choice of parameters since |q| ≤ π/a. This can be rewritten as We see that ω + is the same function as ω − , shifted by ± 2π a . The band gaps have vanished, and by shifting ω + on the interval [−π/a,0] by 2π/a to the right and the same function on the interval [0, π/a] by 2π/a to the left, we obtain the function ω − on the doubled Brillouin zone [−2π/a,2π/a]. Thus, we have two equivalent descriptions, two branches of the dispersion relation on the original Brillouin zone [−π/a, π/a] or one branch on [−2π/a,2π/a]. The doubling of the Brillouin zone corresponds to a bisection of the unit cell in the Bravais lattice (exercise: draw the pictures!). (i) There are 3N branches ω α (q) of the dispersion relation for a crystal lattice with N atoms per unit cell. (ii) In the thermodynamic limit the lattice momenta q densely fill the Brillouin zone, and the matrix κ(q) as defined in equation (7.32) becomes a continuously differentiable function of q. Then, due to the implicit function theorem, the functions ω 2 α (q) become differentiable functions of q. As also follows from (7.32) they are naturally extended as periodic functions on the reciprocal lattice B. (iii) There are precisely three acoustic branches for which ω α (0) = 0. All other branches are optical branches with min ω α > 0. Since any continuous functions assumes its extremum on compact sets, every phonon band has finite band width. (iv) Of the three acoustic branches one has longitudinal, the others have transversal polarization. In general the longitudinal acoustic modes are faster than the transversal acoustic modes (reversing force is larger for pressure waves than for shear waves, becomes clear when thinking about transition to fluid). (v) The dispersion relations of the phonons are invariant under the action of the point group R 0 of the crystal, . Then G acts naturally, as a rotation or a rotation followed by an inversion, on the Bravais lattice and on the displacements u (r,j) (R), j = x, y, z, of the individual ions from their equilibrium positions. The latter action combines into the action of a representation D of R 0 on the vectors of displacement u, This transformation leaves the kinetic energy and the potential energy of the Hamiltonian (7.15) of the harmonic crystal separately invariant, as all ions simultaneously undergo the same O(3) transformation, which does not affect their relative displacements. 
The crystal lattice is not necessarily invariant under this transformation, but the effect on the crystal lattice is at most a translation, since any point group operation can be seen as a combination of a space group operation and a translation. This is another way of understanding the point group invariance of the Hamiltonian (7.15). Let us work out the consequences of the invariance for the representation D. First of all, setting R = GR, and hence where we have used the invariance of B under R 0 in the third equation. Form invariance of the kinetic energy now means that Similarly, the potential energy transforms like Then form invariance of this expression implies that Here we have used (8.25 ) and the fact that Gq, GR = q, R in the last equation. Equation (8.26 ) is equivalent to saying that and therefore This means that κ(q) and κ(Gq) are similar matrices which implies our claim. For equal masses the solution of the periodic chain in Exercise 6 leads to the acoustic phonons of the simple one-dimensional lattice. If the masses are different, however, there is, in general, no simple closed solution, not even of the one-dimensional problem. In the following we shall study the influence of a mass defect. We assume that all masses but one are equal to m and that the remaining mass is equal to m(1 + µ). As in Exercise 6 the ansatz of an harmonic time dependence leads to an eigenvalue problem of the form Here ψ(n) and ϕ(n) are defined as in Exercise 6, and Ω 2 n = m n ω 2 /k with m 1 = m(1 + µ) and m j = m for j = 2, . . . , N . (i) Solve the eigenvalue problem (8.30 ) and show that κ in Ω 2 = 4 sin 2 (κ/2) satisfies the transcendental equation or e iκ = ±1 or e iN κ = 1. (ii) In order to solve (8.31) graphically we transform it into a more convenient form. For this purpose set z = e iκ . First prove that and obtain an analogous relation for z −1 j . Here the z j , j = 1, . . . , N , are the N th roots of unity. Then show that (8.31) is equivalent to where the Ω j are the eigenfrequencies of the problem with equal masses. Now discuss (8.33) graphically. Which equations for the frequencies do you obtain for the particular cases when µ = −1 or µ = ∞? (iii) How many solution do you obtain from (8.33)? Where are the missing solutions and what is their interpretation? In exercise 7 the consequence of a mass defect on the spectrum of the harmonic chain with periodic boundaries was analyzed. All eigenfrequencies except the translation mode are either determined by wherein N is the number of masses and Ω 2 j are the eigenvalues of the periodic chain without mass defect, or they agree with one of the Ω 2 j . For a negative mass defect −1 < µ < 0 there is one mode, i.e. one solution of (8.34), outside of the domain [0,4] of the Ω 2 j . (i) Calculate the eigenfrequency of this mode in the thermodynamic limit N → ∞ directly from (8.34). Consider a two-dimensional square lattice composed of identical ions of mass m under periodic boundary condition. Every ion interacts with nearest and next-nearest neighbours. The spring constants of the harmonic potential are given by β 1 for nearest neighbours and by β 2 for next-nearest neighbours. All other interactions are assumed to be negligible. Furthermore all motions of the ions are confined to the lattice plane. Set up the force-matrix K αβ (R, S) and compute κ(q). Diagonalize κ(q). How does the frequency depend on the wave vector q? Plot the dispersion relation in (q,0)and in (q, q)-direction. 
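The recipe of Theorem 2, force matrix → κ(q) → dispersion, is also easy to put on a computer. Here is a minimal sketch for a toy model, a monoatomic chain with nearest- and next-nearest-neighbour springs; the parameters are illustrative and the sign convention of the Fourier sum follows (7.32) only up to an irrelevant complex conjugation.

```python
import numpy as np

# Minimal sketch of the recipe of Theorem 2 for a toy model: a monoatomic
# chain (one ion of mass m per cell) with springs alpha1 to nearest and
# alpha2 to next-nearest neighbours.

a, m = 1.0, 1.0
alpha1, alpha2 = 1.0, 0.3

# Real-space force matrix K(R); here a scalar.  Note sum_R K(R) = 0.
K = {0.0: 2 * (alpha1 + alpha2), a: -alpha1, -a: -alpha1,
     2 * a: -alpha2, -2 * a: -alpha2}

def kappa(q):
    """kappa(q) = (1/m) * sum_R K(R) exp(-i q R), cf. the Fourier sum (7.32)."""
    return sum(KR * np.exp(-1j * q * R) for R, KR in K.items()).real / m

qs = np.linspace(-np.pi / a, np.pi / a, 1001)
omega = np.sqrt(np.maximum(0.0, np.array([kappa(q) for q in qs])))

# Acoustic behaviour: omega(q) vanishes linearly at q = 0.
print("omega at q = 0:", omega[len(qs) // 2])
print("max omega:", omega.max())
```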
L9 Statistical mechanics of the harmonic crystal 9.1 Partition function and free energy The thermodynamic properties of the harmonic crystal are completely determined by the canonical partition function Here F ph is the free energy of the harmonic crystal, and we are using units such that the Boltzmann constant k B = 1. Inserting (7.21) into (9.1) we obtain It follows that Here we have used (9.1) and (7.28). From (9.2) and (9.3) we also conclude that Recalling thatn α q = a + α q a α q (9.5) is the occupation number operator that measures the occupancy of the mode (q, α), we may interpret as a function of T as the average occupancy of the mode (q, α) in a canonical ensemble of temperature T . Seen as a functions of q and α this functions measures the distribution at temperature T of the quanta of vibration energy over the modes. Equation (9.6) defines the famous Bose-Einstein distribution. In physics we call excitations of a 'quantum field', that carry energy and momentum, particles. The particles associated with the quantized lattice vibrations are called phonons. Within the approximation of the harmonic crystal the phonons do not interact. According to (9.6) they form an ideal gas of non-conserved Bosons, very much like the photons which are the quanta of the electro-magnetic field. If the number of phonons would be conserved, a chemical potential that would control their number would appear in (9.6). In the theory of ideal quantum gases the free energy and all derived thermodynamic quantities in the thermodynamic limit are usually written as functionals of the density of states. We would like to briefly recall its definition and its use in approximating sums like in (9.3) by integrals. Usually the latter is done in a two-step procedure. In step one sums over lattice momenta are converted into integrals. In step two integrals over momenta are transformed into integrals over energies by means of the density of states. Recall that the volume of the Brillouin zone is (cf. equation (3.10)) V R = (2π) 3 /V u , where V u is the volume of the unit cell, and that there are L 3 lattice momenta in the Brillouin zone, if we introduce periodic boundary conditions as described in section 5.2. Then the volume of the crystal is V = L 3 V u . The volume per lattice momentum in the Brillouin zone or volume element is Thus, the free energy (9.3) can be approximated as We would like to rewrite the integral on the right hand side of (9.8) as an integral over energies. For this purpose we define the counting function where α ∈ I and Θ is the Heaviside step function. The density of states of the αth phonon branch is equal to the number of states in [ω, ω + ∆ω] divided by V ∆ω or, for ∆ω → 0, (9.10) Introducing the total density of states as we can rewrite the expression (9.8) for the free energy in the form This form will be the starting point for the discussion of the specific heat of the harmonic crystal below. Before entering this discussion we will have a closer look at the density of states g(ω). The integral on the right hand side of (9.10) can be interpreted as representing the 'area' of a surface S(ω) in reciprocal space, where ω is implicitly defined by In the vicinity of this surface we introduce local coordinates where n is a unit vector normal to the surface and t j ∆q 1 , j = 1, 2, are parallel unit vectors. Then and thus This formula can be used to actually calculate the density of states, given the dispersion relations of the phonons. 
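As a concrete illustration of how the density of states is obtained from a dispersion relation, the following sketch samples the dimensionless dispersion Ω(κ) = 2|sin(κ/2)| of the monoatomic chain over the Brillouin zone and histograms the frequencies; for this simple case the result can be compared with the per-site density of states 2/(π√(4 − Ω²)), which is a standard result. The script and its names are mine and only illustrate the counting procedure described above.

```python
import numpy as np

# Dimensionless dispersion of the monoatomic chain, Omega(kappa) = 2|sin(kappa/2)|
N = 200_000
kappa = 2.0 * np.pi * np.arange(N) / N          # lattice momenta in [0, 2*pi)
omega = 2.0 * np.abs(np.sin(kappa / 2.0))

# Density of states per site as a histogram of the sampled frequencies
bins = np.linspace(0.0, 2.0, 101)
hist, edges = np.histogram(omega, bins=bins)
centers = 0.5 * (edges[1:] + edges[:-1])
g_numeric = hist / (N * np.diff(edges))         # normalized: integral of g dOmega = 1

# Analytic per-site density of states, g(Omega) = 2 / (pi * sqrt(4 - Omega^2))
g_exact = 2.0 / (np.pi * np.sqrt(4.0 - centers**2))

for c, gn, ge in zip(centers[::20], g_numeric[::20], g_exact[::20]):
    print(f"Omega = {c:4.2f}   numeric {gn:6.3f}   analytic {ge:6.3f}")
# The divergence of g as Omega -> 2 is the 1d van Hove singularity at the band edge.
```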
The density of states g(ω) exhibits singularities where grad ω = 0. These are called van-Hove singularities. Their character depends on the space dimension and is different for 3d, 2d and 1d lattices. We can understand them by expanding ω in the vicinity of a critical point k 0 , grad ω(k 0 ) = 0, where and Γ is the matrix of the second derivatives of ω at k 0 . Setting x = k − k 0 we have grad ω = Γ(k − k 0 ) = Γx. Hence, for values of ω close to a critical point, The matrix Γ can be diagonalized by an orthogonal transformation. Such a transformation leaves the surface element dS invariant. Hence, we may assume that Γ = diag(γ 1 , γ 2 , γ 3 ). Thus, When discussing this integral, we have distinguish several cases. The critical point may be a minimum, a maximum or a saddle point depending on the signature of Γ which is defined as sign Γ = diag(sign γ 1 , sign γ 2 , sign γ 3 ) . Clearly m corresponds to a minimum, M to a maximum, and S 1 , S 2 to two kinds of saddle points. In 3d the singularities are of square root type in all four cases. The details will be worked out in exercise 10. Theorem 3. Van Hove [14] . In 3d the density of states g has at least one singularity of type S 1 , and one of type S 2 . The derivative at the upper edge of the spectrum is −∞. The density of states per unit volume of the αth phonon branch in a crystal lattice in d dimensions is given by where S α (ω) is the surface ω α = const. in the reciprocal space. The total density of states per unit volume is the sum over all branches. Determine the four distinct types of singularities of the density of states for space dimension d = 3. For this purpose expand ω α close to a critical point ω 0 , and discuss the four different cases associated with different choices of the relative sign of the coefficients γ 1 , γ 2 and γ 3 . Hint: the saddle point cases require the introduction of a cut-off. The dimensionless dispersion relations of the monoatomic linear chain and of the linear chain with alternating masses with 0 ≤ κ < 2π and ratio of masses µ = m/M are given by Ω 2 (κ) = 4 sin 2 (κ/2) and (9.23) Calculate and sketch the densities of states. Which types of singularities do appear? Given the free energy of the phonons as a functional of the density of states (9.12) we can calculate their internal energy, Here S ph in the first equation is the entropy of the harmonic lattice vibrations. The integrand on the right hand side of the last equation has a clear interpretation as the "density of states g(ω) × energy ω × thermal occupation." The quantity that is measured in experiments is the specific heat It is a linear functional of the density of states. Due to (9.11) the specific heat of the phonons is the sum of the contributions from all branches of the dispersion relation. From any model for the force matrix we can calculate the dispersion ω α , then the density of states g α and finally the specific heat by means of the above equation. Many bulk characteristic properties of solids at low and high temperatures are rather universal. A prime example, which we shall consider now, is provided by the contribution of the phonons to the specific heat. In order to understand its low-T behaviour we use (10.1) to present it as The density of states has a Taylor series expansion around ω = 0. Let ω 0 > 0 be its radius of convergence and fix δ such that 0 Using the Taylor expansion of the logarithm we can estimate the integral on the right hand side as where Γ is the gamma function and ζ Riemann's zeta function. 
Inserting (10.5) and (10.4) into (10.3) we obtain the low-T expansion of the specific heat, which holds up to exponentially small corrections in the temperature. Equation (10.6) holds separately for every phonon branch. If we replace g by g_α, we obtain the contribution C_V,α of the phonon branch number α to the specific heat. For every optical branch the density of states at ω = 0 is identically zero, and so are all the coefficients in its Taylor expansion around this point. Thus, the contribution of every optical branch to the low-T specific heat vanishes faster than any power of T. In other words, the low-T specific heat of the phonons is entirely determined by the three acoustic branches. Consider an acoustic phonon branch with (isotropic) sound velocity v. Then ω ≈ v q for small q. The corresponding counting function (9.9) for small ω follows, and hence, for each acoustic branch with sound velocity v_α, the density of states at small ω is given by (10.10). According to (10.6) the contribution to the specific heat follows; here we have used that ζ(4) = π⁴/90. Summing over the three acoustic branches we obtain the total low-T specific heat of the ions in harmonic approximation, where ⟨·⟩_h stands for the harmonic mean of the sound velocities. Equation (10.12) is the T³ law for the specific heat of insulators (in conductors the electrons contribute significantly to C_V). In simple solids it holds up to temperatures of about 10 K. Notice that (10.12) implies that the low-T specific heat can be determined by measuring the sound velocities, or, taking it the other way round, that the average sound velocity can be obtained from a specific heat measurement. Recall the expansion of x/(e^x − 1) in terms of the Bernoulli numbers B_2n, valid for |x| < 2π. This allows us to derive a convergent high-T expansion of the internal energy of the phonon gas, obtained by inserting (10.13) into (10.1). Here N_at = N L³ is the total number of ions and ⟨·⟩_g denotes the average with respect to the probability density defined in (10.15). Thus, ⟨ω^m⟩_g is the mth moment of this probability density. In the derivation of (10.14) we have used an identity which follows from (9.9). Equation (10.14) implies the high-T series representation for the specific heat; here we have used that B_0 = 1 and B_2 = 1/6. The leading order contribution corresponds to the Dulong-Petit law, indicating a constant heat capacity at high temperature, whence the name 'heat capacity'. Note that this high-temperature limit is approached monotonically from below. We have seen that the specific heat of the phonon gas shows universal behaviour at low and high temperatures. Let us look for a simple model interpolating between these two universal regimes. For this purpose we take the low-frequency form of the density of states and cut it off at a frequency ω_D in such a way that the normalization condition on the total number of modes is preserved. This condition fixes the cut-off frequency ω_D and, at the same time, a reciprocal length parameter sometimes called the 'radius of the Brillouin zone'. g_D is called the Debye density of states and the frequency ω_D the Debye frequency. These notions go back to the Dutch-American Nobel laureate Peter Debye [5]. The internal energy for the Debye density of states can be evaluated explicitly. Defining the Debye function, we may recast the internal energy in terms of it, while the corresponding specific heat takes the form known as the Debye formula or the Debye interpolation formula. According to this formula the specific heat of the phonon gas is a universal function of ω_D/T (see Figure 5).
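The universal Debye curve can be evaluated numerically. The sketch below uses the standard form of the Debye specific heat for N ions, C_V = 9 N k_B (T/Θ_D)³ ∫₀^{Θ_D/T} dx x⁴ eˣ/(eˣ − 1)², which is equivalent to the interpolation formula above, and checks both limiting regimes: the Dulong-Petit value 3 N k_B at high T and the T³ law at low T. Units with k_B = 1 are used, and all names are mine.

```python
import numpy as np
from scipy.integrate import quad

def debye_integrand(x):
    # x^4 e^x / (e^x - 1)^2, with the removable limit at x = 0 handled explicitly
    return 0.0 if x == 0.0 else x**4 * np.exp(x) / np.expm1(x)**2

def debye_cv(T, theta_D=1.0, N=1.0):
    """Standard Debye specific heat per N ions, in units with k_B = 1."""
    if T == 0.0:
        return 0.0
    x_D = theta_D / T
    val, _ = quad(debye_integrand, 0.0, x_D)
    return 9.0 * N * (T / theta_D)**3 * val

# High-temperature limit: Dulong-Petit value 3*N*k_B
print("C_V(T = 10 * theta_D) =", debye_cv(10.0), "  (Dulong-Petit: 3)")

# Low-temperature limit: C_V ~ (12 pi^4 / 5) N (T / theta_D)^3
for T in (0.02, 0.04):
    ratio = debye_cv(T) / (12.0 * np.pi**4 / 5.0 * T**3)
    print(f"T = {T}:  C_V / (T^3 law) = {ratio:.4f}")
```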
With respect to the phonon gas contribution to the specific heat different solids are distinguished by a single parameter, the Debye frequency. Because it is simple and at the same time covers the basic features of the temperature dependence of the specific heat, the Debye model is prevailing in experimental solid state physics. Usually the basic tasks in statistical physics are to derive the specific heat and the equation of state, expressing the pressure p = −∂F/∂V as a function of T and V . For the harmonic crystal one can find the statement that the pressure is temperature independent, it follows, for instance, that the thermal expansion coefficient This contradicts our experience that solids usually expand when they are heated. We conclude that the thermal expansion of solids is a feature that must be attributed the higher order corrections to the harmonic crystal. Another anomaly implied by (10.28) is, for instance, C V = C p , the coincidence of the specific heats at constant pressure and constant volume. The simplest model system for which we can verify (10.28) is a classical chain of particles of equal masses m that interact with their nearest neighbours through a pair potential V (r), where r is the distance between the particles. Let us consider N + 1 such particles with coordinates x n , n = 0, . . . , N , and nearest-neighbour distances r n = x n − x n−1 , n = 1, . . . , N . Then the Hamiltonian of the system is where the p n are the momenta canonically conjugate to the x n . Note that this system has no periodic boundary conditions. Instead of applying periodic boundary conditions for the interaction we apply an external mechanic pressure to control the length of the system. This can be achieved by adding a term (x N − x 0 )p to the Hamiltonian (10.31). As a configuration space K for the particles we consider a ring of finite length L on which the 'springs' connecting neighbouring particles can be arbitrarily expanded by moving them several times relative to each other around the ring, but the center of mass of all particles is confined to the ring. This construction is necessary in order to regularize the integral representing the classical partition function. It would be otherwise divergent. By construc- is the average length of the system at given T and p which justifies the interpretation of p as the pressure. It is not difficult to calculate the integral on the right hand side of (10.32). The momentum integration reduces to Gaussian integrals, whereas the remaining integrals over the configuration space can be dealt with after introducing Jacobi coordinates This transformation is linear, and it is easy to see that its Jacobi matrix equals one. Taking this into account we obtain In general the latter expression does depend on T , but if we insert the harmonic potential (i) The neutron is electrically neutral. It penetrates deeply into the solid, can come close to the atomics nuclei and is scattered by nuclear forces. (ii) Because of their relatively large mass the de Broglie wave length of thermal neutrons is of the order of magnitude of atomic distances in solids and fluids. (iii) Their energy is of the order of magnitude of the energy of elementary excitations in solids. If a neutron is scattered inelastically its relative change of energy is generally large enough to be resolved experimentally. Hence, neutrons do not only resolve the structure of solids, but can be also used to measure their excitation spectra, e.g. the dispersion relations of phonons. 
(iv) Neutrons carry a magnetic moment which is sensitive against intra-atomic magnetic fields. Therefore they can be used to measure short-wavelength magnetic structures (antiferromagnetism!) and magnetic excitations. ∆n(ϑ, ϕ, E) = counting rate Figure 6 : Schematic view of a typical scattering experiment. An incident particle current hits a target and is scattered in all directions. The currents of the scattered particles through a sphere centered around the target are measured. They determine the differential cross section. The traditional source of neutrons are nuclear reactors. Note that neutrons which come out of a nuclear reactor have a Maxwell velocity distribution leading to a neutron current per unit time for neutrons with velocity v of where m is the neutron mass and ρ their density (exercise: derive (11.1)). The typical geometry of a scattering experiment is sketched in Figure 6 . A stationary current of incident particles impinges on a target and is scattered. Detectors in equal distance from the target measure the current scattered in direction (ϑ, ϕ), where ϑ, ϕ are spherical coordinates. We denote the counting rate at (ϑ, ϕ) for particles with energies in an infinitesimal interval around E by ∆n(ϑ, ϕ, E). In a stationary situation the counting rate is expected to be proportional to the current density j of the incident current j. Hence, the counting rate normalized by the current density of the incident current, is expected to be independent of the current and to characterize the target. If we further divide by the solid angle ∆Ω = sin(ϑ)∆ϑ∆ϕ and by the width ∆E of the energy interval, we obtain a quantity called the differential cross section, In the limit of infinite energy and angle resolution this function is denoted We shall assume that the incident neutron beam is monochromatic and coherent. Quantum mechanically it is then represented by a plane wave with wave vector k and 'normalization volume V ' (we shall use units such that = 1). The corresponding current is The interaction of the neutrons with the target determines the transition rate P (k, k ) for a transition from an incoming wave with wave vector k to a scattered wave with wave vector k under the influence of the perturbation caused by the target (recall the time-dependent perturbation theory from the quantum mechanics lecture). Since d 3 k (2π) 3 /V is the number of states in the volume d 3 k around k we can express the counting rate as Using (11.6) and (11.7) in (11.3) we have expressed the differential cross section in terms of the transition rate, This formula connects the basic measurable quantity on the left hand side with a quantity depending on the microscopic properties of the target on the right hand side. Let us recall how the transition rate appears in quantum mechanics. It is usually first encountered in the context of time-dependent perturbation theory and Fermi's 'golden rule' which states in our case that is the rate for transitions Ψ i → Ψ f , where Ψ i = Ψ k (x)φ i is a fixed initial state and Ψ f = Ψ k (x)φ f runs through all possible final states with fixed k . Here φ i and φ f denote eigenstates of the ions (recall that the neutrons interact with the ions!). Hence, the energies of initial and final states ε i and ε f are where E i and E f are the energies of the ionic states. 
Introducing the notation ω = (k 2 − k 2 )/2m the energy difference in (11.9) takes the form The potential U that describes the interaction of the neutrons with the ions is of the form where the x r (R) are the position vectors of the ions. In order to simplify the notation we suppress from now on the sum over the ion positions in the unit cell. It will always come with the sum over the Bravais lattice an can be restored at any later stage if necessary. Substituting the explicit form of the potential and of the factorized wave functions we can calculate the matrix elements in (11.9), (11.13) where q = k − k . We see that the integrals factorize into a factor that depends on the interaction potential between the neutrons and the ions and a factors that depends on the crystal structure. Denoting we obtain the following factorized form of the transition rate, Here |b(q)| 2 is called the atomic form factor, since it depends only on the interaction of the individual ions with the neutrons. The sum over f divided by N at is called the dynamic structure factor S(q, ω). It contains the information about the structure and the dynamics of the ions. Inserting the expression (11.15) for the transition rate into (11.8) we obtain a formula for the differential cross section, Note that this formula is very general. (i) With slight modifications it holds for fluids as well. (ii) It solely relies on Fermi's golden rule (= time dependent perturbation theory + Born approximation). Using the 'Fourier inversion formula' (11.17) we obtain the following expression for the dynamic structure factor. Here we have used that where H is an effective Hamiltonian for the ion-ion interaction. The expectation value under the sum on the right hand side of (11.18) is called a dynamical two-point correlation function. The appearance of such a correlation function is generic. As we shall see with further examples below, most spectroscopic experiments and transport experiments in solids measure two-or four-point correlation functions. So far we have assumed that the scattering potentials of all nuclei are equal. This is only true if the solid (the fluid) consists of a single isotope and if all nuclear spins are aligned (nuclear spin ferromagnet) or zero. In reality the latter conditions are almost never satisfied. We typically rather have to deal with mixtures of different isotopes and with nuclear spin paramagnets. This means that we have to modify our above derivation, replacing In particular b R (q) remains under the sum over the Bravais lattice in the expression for the differential cross section, which now reads In order to simplify this expression again, we assume that the potentials u R are randomly distributed. We shall indicate the disorder average by brackets · d . We assume that the distribution underlying the average is such that the mean value of b R (q) is translation invariant and, for this reason, use the notation 3) It further reasonable to assume that potentials at different lattice sites are uncorrelated, for R = S. Let us further introduce the averaged coherent and incoherent atomic form factors Averaging the differential cross section over the disorder then results in So far we have assumed that the ions are in a pure initial state φ i . This is not realistic in experiments on a macroscopic sample. In a macroscopic sample the ions have to be described by a density matrix. 
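The splitting of the disorder-averaged intensity into a coherent part proportional to |⟨b⟩|² and an incoherent part proportional to the variance of b can be illustrated with a small Monte Carlo experiment on a static one-dimensional lattice of random scattering lengths. This is only the equal-time (static) analogue of the formulas above; all names and parameter values are chosen for illustration.

```python
import numpy as np

rng = np.random.default_rng(1)
N, samples = 200, 4000
R = np.arange(N)                      # 1d Bravais lattice, lattice constant 1
q = 0.3 * 2.0 * np.pi                 # momentum transfer, not a reciprocal lattice vector

phases = np.exp(1j * q * R)
intensity = np.zeros(samples)
for s in range(samples):
    b = 1.0 + 0.5 * rng.standard_normal(N)       # random scattering lengths b_R
    intensity[s] = np.abs(np.sum(b * phases))**2

b_mean, b_var = 1.0, 0.25                        # mean and variance of the b_R used above
coherent = b_mean**2 * np.abs(np.sum(phases))**2 # vanishes unless q is a reciprocal lattice vector
incoherent = N * b_var
print("disorder average of |sum_R b_R e^{iqR}|^2 :", intensity.mean())
print("coherent + incoherent prediction          :", coherent + incoherent)
```

For q equal to a reciprocal lattice vector the coherent Bragg term, of order N², would dominate instead; away from it only the incoherent contribution of order N survives, in line with the discussion above.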
If the crystal can exchange energy with its environment it will be the density matrix of the canonical ensemble in which each state φ i is occupied with probability e −E i /T /Z, where Z is the canonical partition function. Performing the canonical ensemble average and denoting the canonical expectation values by · T we obtain the final formula for the theory of neutron scattering. First of all the dynamic structure factor at finite temperature becomes The differential cross section splits into a coherent part and an incoherent part, These formulae are widely used in order to analyze the data of neutron scattering experiments. Before specializing them to crystal structures we would like make a few general comments. (i) In our derivation of (12.7) we have used the letters R and S in x(R), x(S) as mere particle labels, which is the reason why equation (12.7) , if properly read, also defines the dynamic structure factor of a fluid or a glass. When we used the notation R ∈ B we had in mind to apply the formula to a mono-atomic (simple) lattice, but so far B was rather an index set which may refer to any labeling of the ions. Similar formulae hold, in particular, for a lattice with basis. (ii) The factors σ coh and σ inc depend, in the experimentally relevant range of neutron wave lengths of the order of inter-atomic distances only weakly on q, since the potentials u R vary on a scale of the size of the nuclei. For this reason σ coh and σ inc can often be treated as constants. (iii) The most interesting contribution to the differential cross section is the dynamic structure factor S(q, ω). It is completely determined by the properties of the sample and independent of the properties of the neutrons. (iv) The incoherent cross section sums contributions from the same nuclei at different times. It carries no information about the structure of the sample. If all b R (q) are identical, the incoherent part vanishes. Its occurance is a direct consequence of the disorder in the system. What happens if we specialize (12.7) to a mono-atomic crystal is, that the Hamiltonian is then invariant under the action of the Bravais lattice B, implying that Inserting this into (12.7) and also using that, x(R) = R + u(R), where u(R) is the deviation from the equilibrium position at R, we obtain The dynamic structure factor of the ions in a crystal lattice is the spatio-temporal Fourier transform of the dynamical two-point function e −i q,u(0) e i q,u(R,t) T . Similarly, the incoherent part of the differential cross section simplifies to the Fourier transformation in time of the auto correlation function e −i q,u(0) e i q,u(0,t) T . The harmonic approximation brings more simplifications about. If A and B are any two operators that depend linearly on the deviations u(R) of the ions from their equilibrium position and linearly on the conjugate momentum p(R), then where the canonical averages are calculated with the Hamiltonian of the harmonic crystal (see exercise 12.8). Applying this formula to A = −i q, u(0) and B = i q, u(R, t) and denoting 2W (q) = q, u(0) 2 T = q, u(R, t) 2 T , (12.14) we obtain the following formula for the dynamic structure factor of the harmonic crystal, The function W (q) is called the Debye-Waller factor. A harmonic crystal has 3N at eigenstates which can be excited independently. Denote the corresponding occupation numbers n 1 , . . . , n 3Nat . 
Transition matrix elements can be classified according to the differences in occupation numbers between the initial state {n i } and final state {n i }. Elastic processes. n i = n i , all occupation numbers remain unaltered, no exchange of energy between crystal and neutron. Single-phonon processes. ∃ α ∈ {1, . . . , 3N at } such that n i = n i ∀ i = α, n α = n α ± 1, the occupation number of a single mode is changed due to the scattering process. Multi-phonon processes. The occupation numbers of several modes are changed. We shall see below that the nth term in the expansion e q,u(0) q,u(R,t) T = 1 + . . . T + 1 2 . . . can be identified with the n-phonon processes. The contribution of the first term in (12.16) to the dynamic structure factor is Recall that the argument of the delta function is ω = (k 2 − k 2 )/2m. Hence, the condition ω = 0 imposed by the delta function in (12.17) is the condition of energy conservation for the scattered neutrons which justifies the interpretation of the zeroth order term as the elastic scattering term. Since k, k > 0, the zeroth order structure factor (12.17) can only be non-vanishing if k = k . Consequentially, we obtain the expression for the elastic contribution to the coherent differential cross section of the harmonic crystal. The sum over Kronecker deltas on the right hand side means that elastic scattering takes place, if the wave vectors k of the incident neutron beam and k of the scattered beam differ by a reciprocal lattice vector K, This is the famous Bragg condition which also holds in X-ray diffraction experiments. The interpretation of the Bragg condition in reciprocal space is depicted in Figure 7 . The Bragg condition is often formulated as a relation between the distance d of lattice planes in the original Bravais lattice, the angle ϑ between incident and scattered neutron and Figure 7 : Illustration of the Bragg condition (12.19 ) in the reciprocal space. the wave length λ = 2π/k of the incident neutron. Recalling from section 3.3 that the reciprocal lattice vector K corresponds to a family of lattice planes perpendicular to K in such a way that K = K = n · 2π/d for some n ∈ N and taking into account that elastic scattering implies k = k , we conclude that Relative to the Bravais lattice we can interpret the Bragg condition in the following way. (i) The scattering occurs as if it would happen at the lattice planes according to the reflection law of geometric optics. (ii) Waves which are reflected by parallel planes interfere constructively, meaning the difference in optical distance between two 'rays' reflected from consecutive lattice planes must be an integer multiple of the wave length, In the derivation of the formula (12.15) for the dynamic structure factor of the harmonic crystal we have used equation (12.13) . Here we would like to provide a step-by-step derivation (cf. [11] ). where the a i and a + j are annihilation and creation operators with commutation relations where α i , β j , γ k , δ l ∈ C. Denote the canonical ensemble average with Hamiltonian (12.22) by · . Our goal is to prove that The proof will be based on the formula (iii) Show that (formally) lim n→∞ C n = 0 and that the series ∞ k=0 C n has a formal limit. Denoting this limit by S and using that e D = 1, equation ( If we solve equations (7.25) for u α (R) we obtain Let us assume for simplicity that we are dealing with a mono-atomic lattice. Then the indices α, β in (13.1) take values x, y, z, the dimensionless masses reduce to µ α = 1, and L 3 = N at . 
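Returning briefly to the elastic Bragg condition derived above: for a simple cubic lattice with lattice-plane distances d = a/√(h² + k² + l²) the standard first-order relation 2 d sin θ = λ determines the glancing angles at which Bragg peaks occur, the full scattering angle between incident and scattered beam being 2θ. The helper below is a hypothetical illustration with made-up values for the lattice constant and neutron wavelength.

```python
import numpy as np
from itertools import product

def bragg_angles(a=4.0, lam=1.8, hkl_max=2):
    """Glancing angles theta (degrees) with 2 d sin(theta) = lambda for a simple
    cubic lattice; a and lam in the same length unit (here Angstrom)."""
    found = {}
    for h, k, l in product(range(hkl_max + 1), repeat=3):
        if (h, k, l) == (0, 0, 0) or not (h >= k >= l):
            continue                                 # keep one representative per family
        d = a / np.sqrt(h * h + k * k + l * l)       # lattice-plane distance
        s = lam / (2.0 * d)
        if s <= 1.0:                                 # otherwise no Bragg reflection
            found[(h, k, l)] = np.degrees(np.arcsin(s))
    return found

for hkl, theta in sorted(bragg_angles().items(), key=lambda kv: kv[1]):
    print(hkl, f"theta = {theta:5.1f} deg, scattering angle = {2 * theta:5.1f} deg")
```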
The scalar products defining the Debye-Waller factor (12.14) become where the y α (k) are the polarization vectors defined below (7.31). It follows that Here we have used (cf. (9.6)) in the second equation and (6.35) in the third equation. Taking the thermodynamic limit in (13.3) we end up with Here two remarks are in order. (i) In general (13.5) cannot be rewritten by means of the density of states, since the polarization vectors y α (k) depend on k not necessarily through ω α (k). (ii) The Debye-Waller factor describes the weakening of the coherent scattering due to thermal fluctuation and as a function of the change of momentum q, i.e. as a function of the scattering angle. It appears as a prefactor e −2W (q) in the dynamic structure factor. Since W (q) is quadratic in q, the weakening is larger for larger q. In elastic scattering, where q must equal a reciprocal lattice vector K, the scattering is strong in the direction corresponding to minimal K or maximal distance between the associated family of lattice planes. Recall (cf. exercise 4.7) that this family has maximal density of lattice points within a plane. As a function of the temperature the scattered intensity weakens with growing T , since cth as T → ∞, leading to a linear temperature dependence of W (q) and to a suppression of the scattered intensity exponentially in T . For a cubic lattice consisting of N atoms the Debye-Waller factor can be written as (i) Calculate the Debye-Waller factor for a cubic Bravais lattice by using the Einstein model for the phonons. This is a model for an optical phonon branch, where all ions oscillate with the same frequency ω 0 ; thus all N states are located at ω 0 , and the density of states is given by g(ω) = (N/V )δ(ω − ω 0 ). Determine 2W (q) for T ω 0 or T ω 0 , respectively. (ii) What is the Debye-Waller factor for the Debye model? What follows for 2W (q) in the limits T ω D and T ω D ? Let us now consider the contribution of the second term in (12.16 ) to the dynamic structure factor, As we shall see, this term corresponds to the single-phonon contributions. This can be understood from a closer inspection of the two-point function on the right hand side. We infer from (13.1) that Hence, performing a similar calculation as in (13.3), q, u(0) q, u(R, t) T (13.12) Here we have substituted −k for k in the second sum in the last equation and have used that ω α (k) and | q, y α (k) | 2 are even functions of k. Inserting the latter equation into (13.9) we obtain the single-phonon contribution to the dynamic structure factor in the form S 1 (q, ω) = S 1,+ (q, ω) + S 1,− (q, ω) , (13.14) where (13.15) and α=x,y,z | q, y α (q ) | 2 2ω α (q ) δ(ω + ω α (q )) e ωα(q )/T −1 K∈B δ K,q+q . (13.16) These are those contributions to the dynamic structure factor, where precisely one phonon is excited (emitted) or absorbed. We shall denote the corresponding contributions to the differential cross section Like the elastic cross section these cross sections are 'discontinuous' and describe a pattern of bright spots at certain energy and momentum transfers. The delta functions and Kronecker deltas in (13.15 ) and (13.16) force the dispersion relations of the scattered neutron and the phonons to match in the following sense. Emission of a phonon. In S 1,+ the momenta of the neutron and the phonon involved in the scattering process must satisfy where k − k is the momentum transfer to the lattice and q is the momentum taken by the phonon, i.e. 
the phonon takes the momentum transferred to the lattice and reduced to the Brillouin zone. For the energies we have the matching condition where we have used the periodicity of ω α with respect to the reciprocal lattice in the last equation. Since ω α ≥ 0 we see that k ≥ k in this process. The lattice absorbs energy, 'a phonon is emitted.' The temperature dependence of the emission is encoded in the factor 1/(1 − e −ωα(q )/T ) in S 1,+ . This factor decreases as T decreases but stays finite for T → 0+. Absorption of a phonon. For S 1,− the momentum balance is which we interpret such that the neutron takes momentum q − K. The relation for the exchange of energy between the scattered neutron an the phonon involved in the process is Here k ≥ k, and energy is absorbed by the neutron, which is interpreted as a phonon being absorbed in the process. In this case the temperature dependence comes in through a factor 1/(e ωα(q )/T −1) which vanishes as T → 0+. At zero temperature no phonon remains in the system and there is nothing left to be absorbed. Figure 8 : Schematic picture explaining how phonon dispersion relations can be inferred from the differential cross section. Dispersion relations of phonons can be inferred from the differential cross section. Assume that a beam of mono-energetic neutrons impinges on a crystal and the velocity distribution of the neutrons scattered in a fixed direction n is measured . Strong scattering in direction n takes place whenever k = k n satisfies one of the resonance conditions (13.19) or (13.21) for emission or absorption. The situation is then as sketched in Figure 8 . We saw in section L2 that the adiabatic principle implies a decoupling of the lattice and electronic degrees of freedom. To leading order, the dynamics of the electrons is governed by the Hamiltonian where the ions are fixed to their equilibrium positions R k and is the electro-static energy of the ions, the so-called Madelung energy. The latter plays no role for the dynamics of the electrons. Thus, to leading order in the adiabatic approximation, the electrons in a crystal are described by a repulsive Coulomb gas with interaction V elel that is filled into the periodic potential V I generated by the ions sitting on their equilibrium positions. Because of the mutual Coulomb interaction of the electrons, this is still an interacting many-body quantum system, and imperturbable optimism is required to believe that it can ever be solved exactly or numerically with sufficient accuracy. At least at the current stage of our knowledge drastic further approximations are necessary in order to be able to make any quantitative prediction. 14.2 Reduction to a single-particle problem (i) Only the valence electrons (electrons outside closed shells) contribute significantly to the typical properties of solids, because they are responsible for the chemical bonding and are distributed over the solid. For this reason we shall interpret V I as the potential of ions with completely filled shells (Coulomb with reduced charge number Z j ; example sodium, alkali metal, one valence electron, Z = 1). (ii) If the electron-electron interaction V elel could be neglected, the 'electronic problem' would be reduced to the problem of a single particle in the periodic potential V I . 
(iii) Instead of simply neglecting the electron-electron interaction, we shall try to split it into an 'effective single-particle contribution' which modifies the periodic potential V I and a residual 'many-body contribution' which for the understanding of certain quantities can be neglected in many cases, at least in the first instance. Some ideas about the systematic derivation of an effective single-particle description will be explained below. (iv) An intuitive explanation of how the reduction to a single-particle problem is possible is through the notions of 'screening' and 'mean fields'. The full many-body Hamiltonian H el is invariant under the action of the Bravais lattice associated with the equilibrium positions of the ions, [H el , U R ] = 0 ∀ R ∈ B. Hence, following the same reasoning as for the derivation of Bloch's theorem in section 5.1, we see that the lattice momentum k ∈ BZ is a good quantum number for the many-body eigenfunctions and that those transform like for all R ∈ B. The electronic charge density associated with this state, clearly inherits the invariance under translations by Bravais lattice vectors, since where we have used (14.4) in the third equation. We imagine this as the charge density 'screening' the ionic potential V I and combining together with it to an effective periodic potential which would be felt by an additional electron inserted as a probe into the solid. This 'screened periodic potential' provides an intuitive single-particle description for the electron gas in a crystal: a particle moving in the mean field of all other particles. (v) Another way of thinking about the existence of an effective single-particle description of the solid is the following. Consider the excitations of the full many-particle system (14.1). If there are excitations with charge quantum numbers ±e which do have a dispersion, i.e. excitations for which a definite change of (lattice) momentum of the many-body system always comes with one and the same definite change of energy, then we call them quasi-particle excitations or simply particles. It is always possible to define a single-particle Hamiltonian (in momentum representation) that has exactly the same dispersion relation (the same spectrum) as the full many-particle Hamiltonian (14.1) . This Hamiltonian may be seen as an effective single-particle Hamiltonian of the full system. How well it describes the full system depends on the details. Unlike e.g. in the harmonic crystal which realizes an ideal gas of phonons, the two-and multi-particle excitations of the full electronic system are not just superpositions of single-particle excitations. Still, the effective interactions between the quasi-particles may be weak, in which case the effective single-particle description will give a good description of at least the thermodynamic properties of the system. It is beyond the current capabilities of theoretical physics to prove the existence of quasi particles for the Hamiltonian (14.1). But many experiments show that solids generically admit quasi-particle electronic excitations with charge quantum numbers ∓e. These are called 'electrons' and 'holes' (the holes are the solid-state analogues of the positrons). Experiments show in addition that the interaction between several electrons or holes or between the electrons and the holes can often be neglected in the first instance. The above discussion should provide enough motivation to study the general problem of a particle moving in a periodic potential. 
The corresponding Hamiltonian is The study of this Hamiltonian will lead us to the extraordinarily successful band model of solids. Let us start with some general remarks. (i) The one dimensional Kronig-Penney model with potential where a is the lattice constant and V 0 the strength of the interaction, is the only simple but non-trivial model of particles in a periodic potential which admits a closed analytic solution. The next simple case V (x) = −V 0 cos(2πx/a) involves Mathieu functions. Its understanding requires already some mathematical effort. (ii) There is no other choice than trying to understand the general properties of particles in a periodic potential by the common means of mathematics or theoretical physics. As far as the mathematical part is concerned it would be instructive to study Floquet theory as part of the theory of ordinary differential equations. For time limitations we refrain from touching this interesting subject and rather go ahead with typical methods of theoretical physics which are based on perturbation theory. The Kronig-Penney model is a simple one-dimensional model for the understanding of the band structure in solids. Consider an electron of mass m moving in a periodic potential Here V 0 is the strength of the potential and a the lattice constant. Note that either sign of V 0 makes sense. For V 0 > 0 the potential is repulsive, for v 0 < 0 it is attractive. (i) Introduce dimensionless units such that the time-independent Schrödinger equation takes the form Which connections exist between x and y, c and V 0 , and between q 2 and the energy E? (ii) Because of the periodicity of the potential, the Hamiltonian (14.10) commutes with the shift operator defined by T ϕ(y) = ϕ(y + 1). Thus, H and T have a common system of eigenfunctions, i.e., the eigenvalue equations T ϕ(y) = e ik ϕ(y) (14.11b) can be solved simultaneously. Determine the solutions of (14.11) as a function of k and q. Which equation connects k with q? For the calculation it is sufficient to consider the Schrödinger equation with the general solution ϕ(y) = Ae iqy + Be −iqy in the interval [0,1]. Note that q can take real as well as imaginary values. (iii) Discuss the dispersion relation cos(k) = cos(q) + (c/q) sin(q) following from (ii) graphically. Observe that k has to be real in order for ϕ(y) to be bounded and normalizable. Therefore the eigenstates of H and T are restricted on certain energy bands. The qualitative features of the motion of particles can be understood from time-independent perturbation theory. Since V (x) is periodic, it has the Fourier series representation (cf. section 3.5) Here we took the liberty to set V 0 = 0 which just fixes the zero point of the energy. In order to be able to apply perturbation theory we assume that V (x) is a weak periodic potential in the sense that V g = λv g , where |v g | is uniformly bounded in λ and |λ| 1. where u k (x) = u k (x + R) for all R ∈ B and thus has a Fourier series representation Inserting this back into (14.13) we see that If we substitute this representation into the Schrödinger equation for H and use (14.12), we obtain g ∈B λv g u k,g e i g −g ,x = 0 . (14.16) Here we multiply by e i g,x /V u and integrate over x over the unit cell. Then ka ε(k) degeneracies ε 0 (k) ε 2π/a (k) ε −2π/a (k) Figure 9 : The dispersion relation of free particles in 1d, conceived as particles in a periodic potential. The parabolic dispersion relation splits up in branches, obtained by 'back-folding' into the Brillouin zone. 
The picture shows the three lowest lying branches for the dispersion ε(k) = k 2 /2m with m = 4. The red dots denotes degenerate points in the spectrum which belonged to different momenta in the non-periodic picture, but to the same lattice momentum in the periodic picture. This is an eigenvalue problem for the vector u k of the Fourier components u k,g , g ∈ B. We will study it perturbatively in λ. The reference point are free electrons, λ = 0. Then Clearly, the solutions of this eigenvalue problem are for all h ∈ B. They are parameterized by reciprocal lattice vectors. Example: free electrons in 1d. If a > 0 is the lattice constant, then (14.20) It follows that where k ∈ BZ, are the different branches of the dispersion relation. The situation is sketched in Figure 9 . For non-zero potential we assume that the eigenvalues and the Fourier coefficients of the wave functions can be expanded in an asymptotic series in λ, We choose the normalization of u k such that u (1) k,h = 0. Then (14.22 ) in (14.17) for g = h implies −λε whence, for g = h, If we insert this back into (14.23) and use that v −g = v * g we obtain the second order corrections to the energies, As we see, for a weak periodic potential, the energy in second order perturbation theory can be expressed in terms of the Fourier coefficients of the potential. for some g ∈ B, g = 0 and some k ∈ BZ, then the expression (14.27) is singular at that specific value of k. In order to interpret this problem set h = 0. Then (14.28) reduces to If g runs through the nearest-neighbour sites of the origin in B, this equation describes the boundaries of the Brillouin zone. Hence, we expect that the exact solution of the problem exhibits a strong deviation of the dispersion relation from the dispersion relation of free electrons at the boundary of the Brillouin zone. For h = 0 additional singular manifolds appear due to back-folding into the Brillouin zone. For the interpretation of the perturbative result we further recall that the perturbation theory it applied to each individual energy level, in our case for fixed h and k. Equation (14.27 ) is valid for all h, k which do not satisfy (14.28). If (14.28) is satisfied for a pair h, k, we have to modify our calculation, applying the scheme of degenerate perturbation theory instead. Fix k and assume that there are precisely two vectors h, h ∈ B, h = h , such that Then (14.17) has two degenerate zeroth order solutions with energies and Fourier coefficients where a, b ∈ C are to be determined. The space of solutions is two-dimensional. We assume again an asymptotic dependence of the dispersion relations on the interaction parameter λ as in (14.22a). The corresponding ansatz for the Fourier coefficients (14.22b) has to be modified due to the degeneracy, Substituting this into (14.17) for g = h, h and comparing coefficients at order λ we obtain The solvability condition for this homogeneous system implies that ε (1) (k) = ±|v h−h |. Thus, the two degenerate energy levels split into By way of contrast to the non-degenerate case, the corrections to the eigenstates are now of first order in λ. This means that close to the Brillouin zone boundaries the analytic structure of the corrections change. As mentioned above, this makes only sense if we consider a finite system under periodic boundary conditions. Still, we take it as another indication that the strongest effect on the dispersion relation of a free particle by a weak periodic perturbation is at the boundaries of the Brillouin zone. 
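The statement that a weak periodic potential opens a gap of width 2λ|v| at the zone boundary can be checked by diagonalizing the Hamiltonian in a truncated plane-wave basis. The sketch below treats a one-dimensional cosine potential in units ħ = m = a = 1; the basis size, coupling strength and all names are my own choices.

```python
import numpy as np

def bands_1d(k, lam=0.05, v=1.0, n_max=8):
    """Plane-wave diagonalization of H = p^2/2 + 2*lam*v*cos(2*pi*x)
    (units hbar = m = 1, lattice constant a = 1); returns all eigenvalues."""
    g = 2.0 * np.pi * np.arange(-n_max, n_max + 1)
    H = np.diag(0.5 * (k + g)**2)
    for i in range(len(g) - 1):                  # couple plane waves g and g +- 2*pi
        H[i, i + 1] = H[i + 1, i] = lam * v
    return np.linalg.eigvalsh(H)

k_edge = np.pi                                   # Brillouin zone boundary k = pi/a
e = bands_1d(k_edge)
print("gap at k = pi/a       :", e[1] - e[0])
print("perturbative estimate :", 2 * 0.05 * 1.0)
```

The lowest two levels at k = π/a, degenerate for λ = 0, split by approximately 2λ|v|, while all other levels are far away, in agreement with the degenerate perturbation theory above.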
The possible values of the coefficients a, b follow from Remark. We have considered a two-fold degeneracy. At special symmetric points at the boundaries of the Brillouin zone in more than one dimension higher degeneracies (like four or six) may occur. Figure 9 . The blue line represent the deformation of the free-particle dispersion under the influence of a periodic perturbation. (i) Let h = 0 and h = 2π/a. Then a degeneracy occurs at k = π/a, and ε ± π a = 1 2 π a 2 ± λ|v 2π/a | . (15.8) (ii) Let h = 2π/a and h = −2π/a. Then the spectrum is degenerate at k = 0, The two cases are sketched in Figure 10 . We observe the 'opening of band gaps' at the degenerate points in the dispersion relation. Band gaps are most characteristic for the phenomenology of crystalline solids. In order to develop more intuition we discuss the wave functions connected with the first case above. These are Ψ ± (x) = a ± e iπx/a +b ± e −iπx/a +O(λ) = 1 √ 2 e i(πx/a+δ) ± e −i(πx/a+δ) + O(λ) = √ 2 cos(πx/a + δ) i sin(πx/a + δ) + O(λ) . (15.10) It follows that |Ψ ± (x)| 2 = 1 ± cos 2(πx/a + δ) . (15.11) The periodic potential V , on the other hand, has the Fourier series representation V (x) = λ v −2π/a e −i2πx/a +v 2π/a e i2πx/a + higher Fourier modes = 2λ|v 2π/a | cos 2(πx/a + δ) + higher Fourier modes . (15.12) Drawing (15.11) and (15.12) in the same picture and choosing λ < 0 (attractive effective potential near origin) we see that for Ψ + the particle is in the average in the valleys of the potential, whereas it is more on the hills for Ψ − . Accordingly, ε + (π/a) < ε − (π/a) in this case. So far we have studied the formation of energy bands starting from free electrons (plane waves) subject to a periodic perturbation. Now we would like to turn to the opposite extreme. We shall start with atomic wave functions and ask what happens, when the atoms are brought close to each other. For simplicity we assume a mono-atomic lattice with N at atoms. (i) Let φ a (x) an atomic wave function of an electron. In order to satisfy Bloch's theorem we consider the linear combination k ∈ BZ. (ii) Further define where H is the one-particle Hamiltonian (14.7). Since the atomic wave functions decay exponentially fast with the distance from the nucleus, we expect these functions to behave as if the interatomic distance becomes large. It follows that, asymptotically for large interatomic distances, (iii) This highly degenerate level splits under the influence of the mutual perturbations of the atoms, if they come closer to each other. In order to take into account the perturbation we determine the norms and energy expectation value in these states, Note that the sum on the right hand side vanishes for large lattice spacing. For the expectation value of the energy we obtain in a similar way (15.18) where again the sum in the brackets on the right hand side vanishes for large lattice spacing. (iv) Let us now assume that the functions j(R) and h(R) decrease rapidly with increasing distance from the origin in B. Then . . . , (15.19a ) . . . , (15.19b) where 'nn' refers to nearest neighbours to the origin, 'nnn' to next-to-nearest neighbours etc. It follows that This formula describes the so-called 'tight-binding bands' which give a realistic description of bands with 'pronounced atomic character' for which the electrons are close to the atoms. We shall denote We conclude from the inversion symmetry of the Bravais lattice that t(R) = t(−R). For this reason tight-binding bands are sums over cosines. 
(v) Example fcc lattice: The fcc lattice has 12 nearest neighbours to the origin located at and t(R) = t by symmetry. Hence, Note that the band width is proportional to t. This means that small overlaps of the wave function induces narrow bands. The tight-binding model is thus expected to provide a good description of narrow energy bands in solids. (vi) We close this section with two remarks. First, the tight binding bands can be better justified by introducing so-called Wannier orbitals instead of atomic orbitals (see below). Second, perhaps the most important insight we can gain from the above reasoning is the intuitive physical picture. Under the influence of the mutual interaction an N at -fold degenerate energy level splits into a band with N at states. This means that bands can be classified according to the character of the underlying atomic orbitals as s, p, d, f bands. Our calculation above makes sense for an isolated s-orbital. For p, d, f orbitals we cannot start with a single atomic state φ a , but have to take into account a set {φ a (x)} amax a=1 of atomic states. This corresponds to a combination of the LCAO (linear combination of atomic orbitals) method of molecular physics with the tight-binding method. Like in molecular physics hybrid orbitals (like s-d orbitals) may appear in the general case. The tight-binding method works for low-energy narrow bands formed by atomic states in which, in the average, the electrons are close to the ions. A method for the calculation of more realistic band structure is obtained by combining the tight binding method with the method of almost free electrons. This is called the OPW (orthogonalized plane waves) method. We assume that the low-lying states are known. They may be, for instance, sufficiently well described by the tight-binding wave function (15.13) , (16.1) the projector onto the subspace of the full Hilbert space that is spanned by the ϕ ak . In order to determine the remaining part of the spectrum it suffices to consider Here is called a 'pseudo potential'. The pseudo potential is not a potential, since in position representation it is represented by an integral operator. Note that since E > E tb . We interpret this in such a way that W (E, x) includes the effects of screening as discussed above. It is therefore a more appropriate starting point for a perturbation theory for almost free electrons. Within the 'augmented plane wave method' the Schrödinger equation is solved in spheres around the ions and plane waves are fitted into the space between the spheres (were the potential is assumed to be negligible). The KKR (Korringa, Kohn, Rostocker) method is a variant of the augmented plane wave method, where, in a first step, the Green function of the Laplace operator is used in order to transform the Schrödinger equation into an integral equation. (i) For the understanding of many of the electronic properties of solids it suffices to take into account the interaction of the electrons only in so far as they screen the attractive potential of the core ions. The remaining problem is the problem of independent electrons in a periodic potential. (ii) Electrons in a periodic potential are characterized by the branches ε n (k), n ∈ N, of their dispersion relation. As opposed to the spectral problem of phonons the number of branches for the electrons is infinite. The branches are called energy bands, their entirety is called the 'band structure' of the solid. 
(iii) Like for the phonons the ε n (k) become differentiable functions of k in the thermodynamic limit which exhibit the full translation symmetry of the reciprocal lattice and the full point group symmetry of the solid. The Schrödinger equation for a non-relativistic electron of mass M in a 1d periodic potential of period L reads Here the energy and the length are measured in units of 2 2M L 2 and L, respectively. We would like to solve equation (16.6) by means of the WKB-approximation and find a condition which determines the band structure, i.e. all allowed energy values E. we can express the general WKB-solution of the Schrödinger equation for a single potential barrier V (x) (see figure) in the classically accessible regions left and right of the barrier as (ii) The current conservation |A 0 | 2 − |B 0 | 2 = |F 1 | 2 − |G 1 | 2 implies det M = 1. Let T be the transmission coefficient and R = 1 − T the reflection coefficient with the corresponding phase shifts e iµ and e −iν . Verify the representation (iii) A periodic continuation of the potential barrier leads to another representation of the solution in the classically accessible domain b 0 < x < a 1 , (iv) The full solution for the periodic potential stays bounded as long as P has eigenvalues of absolute value 1. Show that this fact implies a condition that determines the band structure, (v) For a potential of the form V (x) = V 0 cos(2πx), V 0 > 0, the Schrödinger equation (16.6) of the non-relativistic particle is equal to the Mathieu equations. The transition coefficient T (E) of a single potential barrier in WKB-approximation follows as where for E > V 0 the integral has to be calculated on the direct line between both imaginary reversal points. Represent φ(E) and W (E) by complete elliptical integrals of the first and second kind K(m) and E(m), with the dimensionless Electrons are Fermions. According to the Pauli principle many-electron wave functions must be totally anti-symmetric under the permutations of any two electrons. Consequentially, in a system of many non-interacting electrons (or holes) no two of them can be in the same single-particle state. When we calculate the grand canonical partition function we therefore have to count every single-particle state as either unoccupied or occupied by just one electron (or hole), (16.12) Here 1 = e 0 stands for an unoccupied state, while e − εn(k)−µ T stands for an occupied state. Every factor on the right hand side of (16.12) represents the partition function corresponding to a single-electron state. Hence, the grand-canonical probability for having this state occupied is As we recall from our lecture on statistical mechanics, this is the Fermi distribution function. As for every ideal gas of spin-1 2 Fermions we can immediately write down the total particle number N , internal energy E and entropy S of the system as sums involving the Fermi function, Here the factor of 2 accounts for the spin degree of freedom. The grand canonical potential is obtained as In a similar way as for the phonon gas we can rewrite the sums over all lattice momenta asymptotically for large volume V first as an integral over the Brillouin zone and then as an integral over all energies. For the second step we need to define an electronic density of states. We proceed as for the phonon gas and first of all introduce the density of states for a single branch of the dispersion relation, where S(ε) is the surface implicitly determined by the equation ε = ε n (k). 
With this the (total) density of states is defined as the sum over all branches. Again a factor of 2 is included to take the spin degrees of freedom into account. Remark. Our statements about van-Hove singularities in section 9.4 remain valid for electrons. Two important notions in solid state physics are the 'Fermi energy' and the 'Fermi surface'. Using the density of states introduced above we may write the particle number as a function of temperature and chemical potential, equation (16.18). This equation can be inverted at any T > 0 to give µ = µ(T, N) (the latter fact follows, of course, also from general arguments on the equivalence of thermodynamic ensembles in the thermodynamic limit). The Fermi energy E_F is defined as the chemical potential at T = 0, equation (16.19). It is clear from (16.18) and (16.19) that the Fermi energy depends only on the particle density N/V and is a monotonically increasing function of the particle density. In a canonical ensemble description, i.e. if we fix the particle number and consider µ as a function of N and T, we obtain the pointwise limit lim_{T→0+} f(ε) = Θ(E_F − ε) of the Fermi function, where Θ is the Heaviside function. This reflects the fact that in the ground state all single-particle states with energies up to E_F are occupied, while those with energies larger than E_F are unoccupied. For the electronic ground state of solids (within the band model) there exist two alternatives, which come with drastically different phenomenologies. In case (i) the Fermi energy lies within a band. In case (ii) it is situated in a band gap (cf. Figure 11). In case (i) excitations of arbitrarily small energy are possible. In case (ii), due to the Pauli principle, the smallest possible excitation energy is equal to the band gap. In case (i) an arbitrarily small electric field causes a current, in case (ii) this does not happen. This is our first explanation, within a single-particle picture, of the difference between conductors and insulators. In conductors the Fermi energy lies in at least one band, and the equations ε_n(k) = E_F, n ∈ N, (16.23) determine a surface in reciprocal space which is called the Fermi surface S_F. The volume enclosed by the Fermi surface is called the Fermi sphere. In general the Fermi surface has a complicated shape and topology (not necessarily simply connected). It is of crucial importance for the understanding of the transport properties of solids, in particular in the presence of a magnetic field. L17 Low-temperature specific heat of the electron gas A first quantitative example, showing that it makes a big difference whether or not the Fermi energy lies within a band, is the thermodynamics of band electrons at low temperature. In order to prepare for the low-T analysis we rewrite the grand canonical potential as in (17.2). Since g(x) is the density of states of an electronic band structure, there are two interlaced sequences (a_n)_{n∈N}, (b_n)_{n∈N} with a_n < b_n < a_{n+1} < b_{n+1}, such that g(x) > 0 for x ∈ (a_n, b_n) and g(x) = 0 for x ∈ (b_n, a_{n+1}). If µ is in one of the band gaps (b_n, a_{n+1}), then the second integral on the right hand side of (17.2) vanishes exponentially fast for T → 0, which is not the case if µ is situated inside a band. Both cases require a separate asymptotic analysis. Let us start with the metallic case µ ∈ (a_n, b_n) for some n ∈ N. We assume that g is analytic in (a_n, b_n). Then it has a Taylor series expansion around ε = µ with some finite radius of convergence δ > 0, valid for |ε − µ| < δ. It follows that the integrals can be evaluated term by term; here we have substituted x = (ε − µ)/T in the second equation. The remaining integral vanishes for symmetry reasons if k is odd.
If k = 2m we obtain where ζ is Riemann's zeta function. Inserting this back into (17.4) we obtain the low-T asymptotic series for the grand canonical potential. The corresponding series for the particle number and entropy in a grand canonical description are The asymptotic series allow us to calculate the low-temperature expansion of the specific heat order by order, using Equation (17.7a) can be used iteratively to obtain µ as a function of N and T . The equation following from (17.7a) at T = 0, determines E F = µ(0, N ) as a function of the density of particles N/V . Since only T 2 enters (17.7a), µ must be even in T , Inserting this into (17.7a) and using (17.9) we can calculate α, Furthermore, using (17.7b) we get at once that Using (17.11) and (17.12a) in (17.8) and recalling that ζ(2) = π 2 /6 we finally arrive at the sought for low-temperature asymptotics of the specific heat of a metal, Let us add a few comments in conclusion. (i) The above low-T asymptotic expansion of the thermodynamic quantities of a Fermi gas goes back to Sommerfeld [13] and is called the Sommerfeld expansion. (ii) Equation (17.13) is an important result stating that the electronic contribution to the specific heat of metals is linear in T and proportional to the density of states at the Fermi energy. (iii) Taking it the other way round, we see that we can experimentally determine the density of states of a metal close to its Fermi energy by measuring its specific heat. (iv) Since typical electronic energies in solids, like band gaps or band widths, are of the order of 1 eV, low-temperature expansions for the electrons in solids are usually valid even above room temperature. Starting once more from equation (17.2) we consider the specific heat of an insulator. By definition a band insulator has g(E F ) = 0. The Fermi energy is located inside a band gap, b n < E F < a n+1 for some n ∈ N. In this situation the closest band below the Fermi energy, the one with index n here, is called 'valence band', the closest band above the Fermi energy, the one with index n + 1, 'conduction band'. It follows from (17.2) that, for b n < µ < a n+1 , which determines µ at small T to be (17.20) In particular, for the low-temperature asymptotic behaviour of the specific heat of a band insulator. Here again a few concluding remarks are in order. (i) The functional form of the low-T specific heat in (17.23) is called 'thermally activated behaviour'. One says that the activation barrier equals half of the band gap. (ii) The size of the electronic contribution to the specific heat of an insulator depends in an extremal way on the band gap. Example: at T ≈ 300K. This makes the difference, as far as the electronic specific heat is concerned, between insulators and 'semi-conductors'. (iii) For an insulator in the low-temperature regime the electrons contribute almost nothing to the specific heat. The specific heat of insulators is mostly determined by the phonons. In conductors, on the other hand, the electrons do contribute to the specific heat and even dominate it at low enough temperatures. For this reason metals have, in general, a larger heat capacity than insulators. We recall that the Hamiltonian (14.1) of the electrons in solids in adiabatic approximation is where V I (x) is the periodic potential of the ions in the crystal, V C (x) = 1/ x is the Coulomb potential, and N is the number of electrons we are taking into account. Our aim in the following lectures is to proceed beyond the single-particle approximation. 
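Before doing so, the Sommerfeld result derived above can be checked by a short numerical experiment. The sketch below is illustrative and not part of the lecture: it assumes a free-electron-like model density of states g(ε) = √ε (arbitrary units, k_B = 1, spin factor absorbed in g), determines µ(T, N) by bisection, obtains the specific heat at fixed particle number from finite differences of the internal energy, and compares C/T with the Sommerfeld coefficient π² g(E_F)/3.

```python
import numpy as np

# Numerical check of the Sommerfeld expansion (sketch, not from the lecture).
# Model density of states g(e) = sqrt(e), spin factor included, k_B = 1.

E_F = 1.0
grid = np.linspace(0.0, 3.0, 60_001)        # energy grid, wide enough at low T
g = np.sqrt(grid)

def fermi(e, mu, T):
    return 1.0 / (np.exp((e - mu) / T) + 1.0)

N0 = np.trapz(np.sqrt(grid[grid <= E_F]), grid[grid <= E_F])   # particle number at T = 0

def mu_of_T(T):
    """Solve N(mu, T) = N0 for the chemical potential by bisection."""
    lo, hi = 0.5 * E_F, 1.5 * E_F
    for _ in range(60):
        mid = 0.5 * (lo + hi)
        N = np.trapz(g * fermi(grid, mid, T), grid)
        lo, hi = (mid, hi) if N < N0 else (lo, mid)
    return 0.5 * (lo + hi)

def energy(T):
    mu = mu_of_T(T)
    return np.trapz(grid * g * fermi(grid, mu, T), grid)

T, dT = 0.02, 0.001
C = (energy(T + dT) - energy(T - dT)) / (2 * dT)    # specific heat at fixed N
print("numerical C/T            :", C / T)
print("Sommerfeld pi^2 g(E_F)/3 :", np.pi**2 / 3 * np.sqrt(E_F))
```

At T = 0.02 E_F the two printed numbers should agree up to corrections of relative order (T/E_F)².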
To begin with, we remark that we can introduce an auxiliary potential V A (x) without changing too much the structure of the Hamiltonian. Let Note that the choice of the auxiliary potential is entirely at our disposal. We may, for instance, choose the mean field potential defined in section 14.2. If we neglect U we are applying a single-particle approximation. Good single-particle approximations are obtained for appropriate choices of V A . A single-particle approximation is good, if the two-particle matrix elements of U (x, y) calculated with eigenstates of the single-particle Hamiltonian are small for single-particle energies close to the Fermi surface. In the following calculation we will not fix the auxiliary potential V A . Our only explicit assumption is that it is periodic. Implicitly we shall also assume that it allows us to take screening into account. Due to the periodicity the eigenfunctions of h are 'Bloch functions' ϕ αk labeled by a band index α ∈ N and a lattice momentum k ∈ BZ, We shall call the ε α (k) the single-particle energies. The Bloch theorem implies with a lattice periodic function u αk . The set {ϕ αk } α∈N,k∈BZ is a single-particle orthonormal basis of the electronic Hilbert space called the 'Bloch basis'. Define where L is the number of unit cells. * Then {φ α (x − R j )|α ∈ N, R j ∈ B} is another orthonormal basis (prove it!) called the Wannier basis. φ α (x) is called a Wannier function. The Wannier functions generalize the atomic orbitals in section 15.3. Bloch basis and Wannier basis are connected via Fourier transformation, Here we have used that the second sum on the right hand side of the second equation equals Lδ k,p . Let c + αk,a the creation operator of a Bloch electron of spin a ∈ {↑, ↓}. The operators are then an alternative set of creation operators, creating electrons in Wannier orbitals. This can be seen by writing down the corresponding field operators, According to the general prescription (see lecture on QM) the Hamiltonian in 'occupation number representation' ('second quantization') can be written as Here implicit summation over the spin indices a, b is implied. The t α ij are called 'transition matrix elements' or 'hopping matrix elements', the U αβγδ ijk are called 'interaction parameters'. Note that: (i) So far the Hamiltonian is only rewritten in 'second quantization'. No other approximation than the adiabatic approximation has been applied. (ii) In this form it is the starting point of the 'theory of strongly correlated electron systems'. (iii) An optimal choice of the Wannier functions (through an optimal choice of V A ) minimizes the strength and the range of the interaction parameters. (iv) Suppose the auxiliary potential V A can be chosen in such a way that the interaction parameters are always small. Then they can be neglected and we are in the realm of band theory. (v) If the Wannier functions can be constructed in such a way that they resemble atomic wave function in the sense that they are localized around the origin and decay sufficiently fast away from it, then the interaction parameters become short-range, and it may be justified to consider only on-site or near-neighbour contributions. The dispersion relation of a free non-relativistic particle of mass m is ε(k) = 2 k 2 /2m. Consider a lattice of N sites and physical length L under periodic boundary conditions. Then only N discrete wave vectors of the form k = 2πn/L, n ∈ Z, are possible inside the Brillouin zone. 
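The counting of allowed wave vectors, and the back-folding into bands described in what follows, can be made explicit in a few lines. The sketch below is purely illustrative and not from the lecture; it assumes units with ħ²/(2m) = 1, a lattice constant a = 1 and N = 8 sites.

```python
import numpy as np

# Sketch: the free dispersion eps(k) = k^2 (units hbar^2/(2m) = 1) written in the
# reduced zone scheme.  With N sites and lattice constant a there are N allowed
# wave vectors per band inside the Brillouin zone; wave vectors outside the zone
# reappear as higher bands eps_m(k) = (k + 2*pi*m/a)^2 with integer band index m.

a, N = 1.0, 8
G = 2 * np.pi / a                                            # primitive reciprocal lattice vector
k_in_BZ = 2 * np.pi * np.arange(-N // 2, N // 2) / (N * a)   # the N allowed k in the zone

for m in (-1, 0, 1):                                         # three of the folded bands
    print(f"band m = {m:+d}:", np.round((k_in_BZ + m * G) ** 2, 2))
```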
All wave vectors outside will be folded back to the Brillouin zone, i.e. new bands with dispersion ε(k + 2πm/a) develop, where a = L/N is the lattice constant and m ∈ Z is the band index. The corresponding Bloch functions are labeled in the following way, 13) where, in the thermodynamic limit, the summation over the first Brillouin zone can be replaced by an integration between the zone boundaries ±π/a. The formalism of second quantization, in which the Hamiltonian of the electrons takes the form (18.11), gives us a more intuitive access to the problem. If the Fermi surface is located within a single conduction band with band index α, say, then the interaction between different bands can be neglected for small excitation energies, and we can suppress the band indices (Greek indices) in (18.11). If, moreover, the 'intra-atomic Coulomb interaction' U iiii is dominant, then a single effective interaction parameter U , say, remains, and H el can be approximated by which defines the so-called (one-band) Hubbard model [7, 8] . (i) The Hubbard model is a 'minimal extension' of the band model in the sense that as few as possible of the interaction parameters of the full Hamiltonian (18.11) are taken into account. (ii) In spite of its apparent simplicity the Hubbard model is a true many-body model. In general it is very hard to deal with by any of the means of modern theoretical physics as e.g. perturbation theory, renormalization group analysis, quantum Monte-Carlo methods. (iii) As simple it is intuitively, the Hubbard model is the starting point for extensions. Basically all models of 'strongly correlated electrons' (everything beyond the band model) are extended Hubbard models which are either obtained by taking interaction parameters of wider range into account (e.g. nearest neighbours, next-to-nearest neighbours) or by taking into account a larger number of bands (e.g. two-band Hubbard model, three-band Hubbard model). (iv) As it stands the Hubbard model is believed to give a realistic description of • the electronic properties of solids with tight bands • band magnetism (iron, copper, nickel) • the interaction-induced metal-insulator transition (Mott transition) (v) If the Hubbard Hamiltonian is supplied with periodic boundary conditions, the number of lattice sites (which is equal to the number of Wannier orbitals) is finite. As we are are also dealing with a finite number of states per site, the model has a finite-dimensional space of states, and can be thought of as a 'fully regularized' quantum field theory. Its Hamiltonian can be represented by a finite Hermitian matrix. This makes the Hubbard model attractive for computer based approaches. (vi) The 1d Hubbard model has the amazing feature of being integrable [10, 12] . Rather much is known about its elementary excitations and its thermodynamics [6] . The assumption that the Wannier functions are strongly localized in the vicinity of the 'lattice sites' R j is compatible with the restriction of the hopping amplitudes t ij to nearest neighbours ij on the lattice, which is called the 'tight-binding approximation'. If we introduce the 'density operators' (local particle-number operators) and apply the tight-binding approximation to (19.1) we obtain Here we have assumed isotropic nearest-neighbour hopping of strength −t and the vanishing of the on-site energies t ii , which for a homogeneous model can always be assumed, since in this case it is equivalent to a redefinition of the chemical potential. 
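Numerically, the hopping term just written down is easy to examine (an illustrative sketch with assumed parameters, not from the lecture): diagonalizing the L × L matrix of nearest-neighbour hopping amplitudes −t on a periodic chain reproduces the familiar tight-binding band ε(k) = −2t cos(ka) at the L allowed lattice momenta, i.e. a single band of width 4t.

```python
import numpy as np

# Sketch: one-band tight-binding hopping on a ring of L sites (periodic boundary
# conditions), hopping strength -t between nearest neighbours, on-site energies zero.
# Illustrative parameters; lattice constant set to one.

L, t = 12, 1.0
T_hop = np.zeros((L, L))
for j in range(L):
    T_hop[j, (j + 1) % L] = -t
    T_hop[(j + 1) % L, j] = -t

eigenvalues = np.sort(np.linalg.eigvalsh(T_hop))

k = 2 * np.pi * np.arange(L) / L          # allowed lattice momenta
dispersion = np.sort(-2 * t * np.cos(k))  # expected single-particle energies

print(np.allclose(eigenvalues, dispersion))   # True: eps(k) = -2t cos(k), band width 4t
```

In the thermodynamic limit the L allowed momenta fill the Brillouin zone densely and ε(k) = −2t cos(ka) becomes the smooth band dispersion of the tight-binding chain.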
For the interaction part we have calculated Let us have a closer look at the Hubbard Hamiltonian. For simplicity we consider the one-dimensional model, is a Wannier state representing N electrons at sites x j with spins a j . The set of all different states of this form, is a basis of Wannier states, since all such states are linear independent and since their number is The operators n j,↑ , n j,↓ are the local particle number operators for electrons of spin ↑ and ↓ at site j. Let us recall why this name is justified. Using the canonical anticommutation relations of the Fermi operators and the fact that the c j,a annihilate the Fock vacuum |0 we conclude that and therefore n j,↑ |x, a = N k=1 δ j,x k δ ↑,a k |x, a (19. 10) and similarly for n j,↓ . Thus, n j,a |x, a = |x, a , if site j is occupied by an electron of spin a, and zero elsewise. A first interpretation of the Hubbard model can be obtained by considering separately the two contributions that make up the Hamiltonian (19.5). For t = 0 or U = 0 it can be diagonalized and understood by elementary means. For t = 0 the Hamiltonian reduces to H = U D, where D = L j=1 n j↑ n j↓ . (19.11) Using (19.10) we can calculate the action of D on a state |x, a , (19.12) Here we used δ ↑,a k δ ↓,a k = 0 in the second equation and the Pauli principle in the third equation. As we learn from (19.12) every state |x, a is an eigenstate of the operator D. Thus, D is diagonal in the Wannier basis. The limit t → 0 of the Hubbard Hamiltonian (19.5) is called the atomic limit, because the eigenstate |x, a describes electrons localized at the sites x 1 , . . . , x N , which are identified with the loci of the atomic orbitals the electrons may occupy. The meaning of the operator D is evident from equation (19.12). D counts the number of double-occupied sites in the state |x, a . The contribution of the term U D to the energy is non-negative for positive U and increases with the number of double-occupied sites. This can be viewed as on-site repulsion among the electrons. Negative U on the other hand, means on-site attraction. Hence, it is natural to refer to D as to the operator of the on-site interaction. In the other extreme, when U = 0, the Hamiltonian (19.5) turns into Thus, acting with the creation operatorsc + k,a on the empty lattice |0 we obtain an alternative basis B B . Let us introduce the row vectors q = (q 1 , . . . , q N ) = (k 1 , . . . , k N )φ and the states It can be shown that these states are eigenstates of a lattice momentum operator with eigenvalue N j=1 q j mod 2π. The set is a basis of H (L) . This basis is sometimes called the Bloch basis. Electrons in Bloch states |q, a are delocalized, but have definite momenta q 1 , . . . , q N . By virtue of (19.17), the analogues of (19.9) and (19.10) are satisfied byñ j,a andc + k,b . It follows that is a measure for the relative contribution of both terms and is the intrinsic, dimensionless coupling constant of the Hubbard model. In this lecture we shall consider the limiting case, when the intra-atomic Coulomb interaction U of the Hubbard model is large compared to the band width t. We define Here we only assume that t jk = t * kj , guaranteeing Hermiticity of T , and that t jj = 0. For fixed particle number the latter setting merely shifts the energy scale and, for this reason, does not imply any restriction to generality. As we shall see, however, this assumption will have several technical advantages in the calculations below. 
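Before carrying out the strong-coupling expansion it may help to see the finite-dimensional structure described above in the smallest nontrivial example. The following sketch (illustrative, not from the lecture) represents the Fermi operators of a two-site Hubbard model as 16 × 16 matrices via a Jordan-Wigner-type tensor-product construction and checks the atomic limit: the operator D has eigenvalues 0, 1, 2 with multiplicities 9, 6, 1, so for t = 0 the spectrum of H = U D is {0, U, 2U}; at U = 0 the half-filled ground-state energy is −2t.

```python
import numpy as np

# Sketch: the two-site Hubbard model as an explicit 4^L = 16 dimensional matrix.
# Fermi operators are built as tensor products with Jordan-Wigner sign strings.
# Parameters are illustrative and not taken from the lecture.

I2 = np.eye(2)
ann = np.array([[0.0, 1.0], [0.0, 0.0]])   # single-mode annihilation operator
Z = np.diag([1.0, -1.0])                   # sign-string factor

def c(mode, n_modes):
    """Annihilation operator for one fermionic mode out of n_modes."""
    factors = [Z] * mode + [ann] + [I2] * (n_modes - mode - 1)
    out = np.array([[1.0]])
    for f in factors:
        out = np.kron(out, f)
    return out

L = 2                                       # two sites, modes ordered (1u, 1d, 2u, 2d)
M = 2 * L
ops = [c(m, M) for m in range(M)]
n = [op.conj().T @ op for op in ops]        # number operators

t, U = 1.0, 8.0
D = n[0] @ n[1] + n[2] @ n[3]               # counts doubly occupied sites
hop = sum(ops[s].conj().T @ ops[2 + s] for s in (0, 1))   # c+_{1,s} c_{2,s}
H = -t * (hop + hop.conj().T) + U * D

vals, counts = np.unique(np.round(np.linalg.eigvalsh(U * D), 8), return_counts=True)
print("atomic limit (t = 0):", dict(zip(vals, counts)))          # {0: 9, U: 6, 2U: 1}
print("minimal energy at U = 0:", np.linalg.eigvalsh(H - U * D).min())   # -2t
```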
Using an appropriate definition of the t jk and an appropriate enumeration of the lattice sites in (20.1), we may write the general Hubbard Hamiltonian (19.1) on any finite lattice, under any kind of boundary conditions and in any dimension in the form (20.2) We shall assume that U > 0. This is natural, since positive U corresponds to the repulsion of electrons in the same Wannier orbital which is expected as a consequence of their mutual Coulomb repulsion. If |t jk | U we can consider T as a small perturbation of U D. As we have seen in (19.12), D counts the number of double-occupied sites. Thus, the eigenvalues of U D are 0, U, 2U, . . . , LU . Their number grows linearly with L, while the number of states in H (L) grows exponentially like 4 L . The eigenvalues of U D are therefore highly degenerate. Let us denote the projection operators onto the corresponding eigenspaces H n by P n , n = 0, 1, . . . , L. Then Consider a Hamiltonian H + λV on a Hilbert space H. Assume that H has the spectral decomposition H = n E n P n (20.5) with mutually distinct eigenvalues E n and orthogonal projectors P n (i.e. P n P m = δ nm P n ), such that dim P n H is not necessarily equal to one. We consider λV , λ ∈ R, as a small perturbation of 'strength λ'. (20. 6) We assume that the eigenvalues and eigenvectors of the perturbed Hamiltonian H + λV are characterized by the quantum numbers n of the unperturbed problem and sets of additional quantum numbers ν n ∈ {1, . . . , dim P n H}. In other words (H + λV )|Ψ n,νn = ε n,νn |Ψ n,νn (20. 7) in such a way that lim λ→0 |Ψ n,νn = |Ψ (0) n,νn ∈ P n H , lim Here the series converges if H is finite dimensional and |λ| is small enough. Otherwise we may have to interpret it as an asymptotic series. Applying, on the other hand, P n to (20.9) we obtain P n V |Ψ n,νn = ∆ n,νn λ |ϕ n,νn . (20.11) Thus, ∞ k=0 P n V R n (λV − ∆ n,νn ) k |ϕ n,νn = ∆ n,νn λ |ϕ n,νn . (20.12) This is a non-linear spectral problem on P n H describing the splitting of the energy level E n under the influence of the perturbation λV . The operator on the left hand side is an effective Hamiltonian on P n H. Up to second order (∆ n,νn = λ∆ (1) n,νn + λ 2 ∆ (2) n,νn + . . . ) it is given by H 2 = P n V P n + P n V R n (λV − ∆ n,νn )P n = P n V P n + λP n V R n V P n = P n V P n + λ m,m =n Thus, up to second order in λ equation (20.12) reduces to a linear spectral problem with an effective Hamiltonian H 2 . Remark. The corresponding eigenstates of the perturbed problem are obtained from (20.10), once the |ϕ n,νn are known. We now apply (20.13) to (20.2). For this purpose we divide by U . Then H/U = D + T /U . Recall that D has eigenvalues 0, 1, . . . , L. Inserting this for n = 0 with V = T , λ = 1/U into (20.13) we obtain (20.14) In order to express the projection operators P m in terms of Fermi operators, we introduce the function (1 − αn j↑ n j↓ ) . where n is the number of double-occupied sites in |x; a . In particular, (1 − n j↑ n j↓ ) = P 0 . (20.17) Moreover, |x; a = |x; a meaning that G(α) is a generating function for the projection operators P n , n = 0, 1, . . . , L. We will now use the explicit construction of the projection operators to express H 2 in terms of Fermions. First of all P 0 n j↑ n j↓ = L k=1 (1 − n k↑ n k↓ ) n j↑ n j↓ = 0 = n j↑ n j↓ P 0 , (20.20) entailing that t jk n j↑ n j↓ c + j,a c k,a P 0 . (20.22) Using the latter equation in (20.14) we arrive at t jk t k c + j,a c k,a n k ↑ n k ↓ c + k ,b c ,b P 0 t jk t k c + j,a c k,a n k↑ n k↓ c + k,b c ,b P 0 . 
(20.23) Here we have used (20.20) and the fact that t jj = 0 in the second equation. This is called the Heisenberg-Hamiltonian with nearest-neighbour exchange interaction. The Heisenberg model is the model for the antiferromagnetism of insulators (which is ubiquitous in nature). The space H 0,L , spanned by the states |a s with a j ∈ {↑, ↓}, j = 1, . . . , L, is 2 L dimensional, hence isomorphic to C 2 ⊗L . This makes it possible to describe the action of H spin directly by certain matrices acting on C 2 ⊗L . The vectors space C 2 ⊗L has the canonical basis vectors |a = e a 1 ⊗ · · · ⊗ e a L , e ↑ = 1 0 , e ↓ = 0 1 . (21.8) We would like to identify these vectors with the vectors |a s . Then, on the one hand, S α j |a s = 1 2 σ α a b c + L,a L . . . c + j+1,a j+1 c + j,a c j,b c + j,a j =c + j,a j c + j,a c j,b +δ b a j c + j,a . . . c + 1,a 1 |0 = 1 2 σ α a a j |(a 1 , . . . , a j−1 , a, a j+1 , . . . , a L ) s , (21. 9) while, on the other hand, Since the operators σ α , α = x, y, z, and I 2 form a basis of End C 2 , the operators S α j , j = 1, . . . , L, as defined in (21.10), together with the identity, generate a basis of End C 2 ⊗L . With the identification (21.11) we can interpret the Heisenberg model on 'spin space' C 2 ⊗L with Hamiltonian (21.7), where now the S α j are the spin matrices defined in (21.10). Remark. Such an identification is not possible for the t-J model. The operator H 2 acts on H 0 which contains H 0,L as a proper subspace. We have e.g. S α j |0 = 0 (where |0 is the vacuum for Fermions). (21. 13) This means that P is a permutation (transposition) matrix. In particular, P | ↑↓ = | ↓↑ . (21.14) Define 'symmetrizer' and 'antisymmetrizer' P ± = 1 2 (P ± 1) . (21.15) They satisfy P ± 2 = 1 4 (P ± 1) 2 = 1 2 (1 ± P ) = ±P ± , (21.16a) This means that ±P ± are orthogonal projectors onto the symmetric and antisymmetric subspaces of C 2 ⊗ C 2 . The subspace P + C 2 ⊗ C 2 has a basis It follows that H spin has eigenvalue 0, −J. The corresponding eigenvectors are |1, s , s = 0, ±1, and |0,0 . If J > 0, then the singlet |0,0 is the ground state. This is called antiferromagnetism, since σ z ⊗ σ z |0,0 = −|0, 0 , ⇒ 0,0|σ z ⊗ σ z |0,0 = −1 . (21.20) One says that the ground state has antiferromagnetic correlations. In the general case is a sum of projectors onto 'local singlets'. An antiferromagnet (J > 0) may therefore be characterized as a system which prefers local singlets. By contrast, a ferromagnet (J < 0) prefers triplets. Local singlets are incompatible with the global SU (2) invariance of the Hamiltonian. For this reason the ground state of a macroscopic Heisenberg antiferromagnet is a complicated 'many-body state'. On the other hand, the state | ↑ . . . ↑ = e ⊗L ↑ (21.22) may be seen as a tensor product of states from local triplets. It is annihilated by H spin and is one of its ground states in the ferromagnetic case. The full ground state subspace can be constructed using the global SU (2) symmetry. In an external electromagnetic field the hopping term in the Hubbard Hamiltonian is modified by so-called Peierls phases t jk → t jk e iλ jk , λ jk ∈ R . (21.23) (cf. section 19.4). The Hubbard interaction U , on the other hand remains, unchanged. It follows that the corresponding Heisenberg model (strong-coupling limit at half-filling) is not modified, since it depends only on |t jk | 2 . In other words, to leading order perturbation theory the Hubbard model at half-filling does not couple to an external field. 
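A minimal numerical check of this strong-coupling picture (a sketch with assumed parameters, not part of the lecture): for the two-site Hubbard model at half filling the lowest singlet and the triplet are split, for t ≪ U, by the familiar exchange energy J = 4t²/U, in accordance with the second-order effective Hamiltonian derived above. In the S^z = 0 sector of the two-electron space, with basis {|↑,↓⟩, |↓,↑⟩, |↑↓,0⟩, |0,↑↓⟩}, the Hamiltonian reduces to the 4 × 4 matrix used below; its spectrum is independent of the fermionic sign conventions entering the off-diagonal elements.

```python
import numpy as np

# Sketch: singlet-triplet splitting of the half-filled two-site Hubbard model.
# Basis of the S^z = 0, N = 2 sector: |up,down>, |down,up>, |updown,0>, |0,updown>.
# Illustrative parameters; signs follow one standard ordering convention.

def gap(t, U):
    H = np.array([[0.0, 0.0, -t,  -t ],
                  [0.0, 0.0,  t,   t ],
                  [-t,   t,   U,  0.0],
                  [-t,   t,  0.0,  U ]])
    E = np.linalg.eigvalsh(H)
    # lowest level = singlet ground state; the S^z = 0 triplet component sits at E = 0
    return 0.0 - E[0]

t = 1.0
for U in (8.0, 16.0, 32.0, 64.0):
    print(f"U = {U:5.1f}:  gap = {gap(t, U):.5f},   4t^2/U = {4 * t**2 / U:.5f}")
```

The exact two-site gap is (sqrt(U² + 16t²) − U)/2, which approaches 4t²/U as U grows, as the printed numbers illustrate.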
The system is an insulator, although it has an odd number of electrons per unit cell, and therefore is a conductor at U = 0. This suggests that there might be an interaction induced metalinsulator transition ('Mott transition') somewhere in between, at a finite value U c of the interaction. It is believed that such a transition occurs indeed in any spatial dimension. A proof only exists in d = 1, where U c = 0. Consider the periodic Heisenberg Hamiltonian (XXX model) on a four-site 1d lattice H = P 12 + P 23 + P 34 + P 41 . (21.24) Here P jk is the transposition, interchanging spin states on sites j and k of the chain. Recall that the space of states of the model is the tensor product H = C 2 ⊗ C 2 ⊗ C 2 ⊗ C 2 . The vectors e ↑ = ( 1 0 ) and e ↓ = ( 0 1 ) form a basis of C 2 . Hence, B = |a 1 a 2 a 3 a 4 = e a 1 ⊗ e a 2 ⊗ e a 3 ⊗ e a 4 ∈ H a 1 , a 2 , a 3 , a 4 =↑, ↓ (21.25) is a basis of H. Denote the embeddings of the Pauli matrices into End(H) by σ α j , j = 1,2,3,4, α = x, y, z. Then the transpositions can be expressed as P jk = 1 2 id +σ α j σ α k . (21.26) The operator of the total spin has components S α = 1 2 (σ α 1 + σ α 2 + σ α 3 + σ α 4 ) which can be used to define the ladder operators S ± = S x ± iS y . The shift operator for the chain of length four is defined asÛ = P 12 P 23 P 34 . for α, β = x, y, z. (ii) Construct a basis of common eigenvectors and obtain the corresponding eigenvalues of the above operators. Sketch the spectrum of H. Hint: Use, for instance, that U 4 = id, and use the angular momentum algebra. In this lecture we study the response of a quantum many-body system, in thermal equilibrium with a heat bath of temperature T , to a small external perturbation. A solid is responding, for instance, with a current to an external voltage, with a thermal current to a temperature gradient, or with a deformation to external mechanic stress. An example which will be considered in some detail is the absorption of microwaves by a spin system in a homogeneous magnetic field (ESR experiment). This will allow us to take a glimpse at our own work [4, 15] . Consider a quantum system with Hamiltonian H possessing a discrete spectrum (E n ) ∞ n=0 with corresponding eigenstates {|n } ∞ n=0 . At time t 0 a time-dependent perturbation V (t) is adiabatically switched on. We are interested in the time evolution of the system, assuming it initially, at times t < t 0 , in an equilibrium state described by the statistical operator We are interested in an approximation to ρ(t) linear in the strength of the perturbation V (t). In order to derive this approximation we define Let us now apply the above formalism to the Heisenberg-Ising (alias XXZ) spin chain in a longitudinal static magnetic field of strength h. The Hamiltonian of this model is defined as s x j−1 s x j + s y j−1 s y j + ∆s z j−1 s z j . (22.14) Here we have switched from Pauli matrices σ α to spin operators s α = σ α /2. We shall assume periodic boundary conditions s α 0 = s α L , α = x, y, z. The real parameter ∆ is called the anisotropy parameter. For ∆ = 1 the Hamiltonian H 0 turns into the Heisenberg Hamiltonian considered in lecture L21. Values of ∆ different from one may be needed for a more accurate modeling of real magnetic materials and are due to a combination of crystal symmetry, spin-orbit interactions and dipole-dipole interactions. We define the operator of the total spin as σ α for α = x, y, z. 
(22.15) If a spin system is exposed to a homogeneous magnetic field h a so-called 'Zeeman term' − h, S must be added to the Hamiltonian. Assuming a field in z-direction our Hamiltonian takes the form H = H 0 − hS z , (22.16) Note that the z-direction is special in that [H 0 , S z ] = 0. We perturb the spin chain by a circular polarized electro-magnetic wave propagating in z-direction. We assume that the wave length is large compared to the length of the spin chain † and idealize this assumption by setting the wave number k = 0. Here we have used that The ability to absorb radiation is a material property. Hence, we generally expect the absorbed energy per unit time to be proportional to the number of constituents of a physical system and to diverge in the thermodynamic limit. In order to define a quantity that truly characterizes the material and is finite in the thermodynamic limit we should therefore normalize by the average intensity A 2 of the incident wave and by the number of lattice sites L. Further averaging the normalized absorption rate over a half-period π/ω of the applied field, we obtain the normalized absorbed intensity and is proportional to the magnetic energy h m(T, h) per lattice site. This case includes the familiar paramagnetic resonance (Zeeman effect) for which the magnetization is known more explicitly, namely, m(T, h) = 1 2 th h 2T for J = 0. In general, an exact calculation of the magnetization of the isotropic Heisenberg chain at any finite temperature is not elementary and requires the machinery of Bethe Ansatz and quantum transfer matrix [9] . Comparing (22.17), (23.3) and (23.5) we interpret the absorption of energy as a resonance between the rotating field of the incident wave and the precessing total spin of the chain. Both are rotating clockwise with angular velocity ω = h. If we deviate from the isotropic point ∆ = 1 of the Hamiltonian (18.15) we expect that energy is transferred form the 'coherent motion of the total spin' to 'other modes', causing a damping of the spin precession and hence a shift and a broadening of the δ-function shaped spectral line (23.5). At the present state of the art dynamical correlation functions, such as χ +− , cannot be calculated exactly, not even for models as simple as the XXZ chain. The interaction induced shift of the resonance frequency and the width of the spectral line, on the other hand, are more simple quantities which need less information to be calculated. For every finite L and for every n ∈ N the integrals dω ω n I(ω) (23. 6) exist. Since I(ω) is everywhere non-negative and since I 0 > 0 (see below), we may interpret I(ω)/I 0 as a probability distribution with moments I n /I 0 . I 1 /I 0 is the mean value of the distribution and I 2 /I 0 − (I 1 /I 0 ) 2 1 2 its variance. The quantities may be used to define the 'resonance frequency' and the line width. Instead of the moments of the normalized absorption intensity we shall use the 'shifted moments' of the dynamical susceptibility, (23.7) (vi) The resonance shift and line width as determined by (23.9) and (23.10) show a simple scaling behaviour as functions of the exchange interaction J. Namely δω/J and ∆ω/J depend on J only through the ratios T /J and h/J. This is true for the statistical operator ρ, hence also for all spin correlation functions, and then by our formulae (23.15), (23.17) for δω/J and ∆ω/J as well. Note also that δω/J and ∆ω/J both vanish proportional to δ as we approach the isotropic point δ = 0. Obtain m 1 and m 2 from (23.14). 
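To close this discussion, here is a small exact-diagonalization sketch (a finite-size illustration with assumed parameters, not part of the lecture) of the resonance just described: for the chain (22.14) in a longitudinal field h, the thermally weighted mean frequency of all transitions induced by S^− equals h exactly at the isotropic point ∆ = 1, and moves away from h once ∆ ≠ 1 — a caricature of the resonance shift. The normalization of the weights below differs from that of I(ω) and of the shifted moments.

```python
import numpy as np

# Sketch: transverse transition frequencies of a short periodic XXZ chain in a field.
# At delta = 1 all spectral weight of S^- sits exactly at omega = h (paramagnetic
# resonance); for delta != 1 the weighted mean frequency is shifted.
# Small chain and illustrative parameters; normalization differs from I(omega).

sx = np.array([[0, 0.5], [0.5, 0]], dtype=complex)
sy = np.array([[0, -0.5j], [0.5j, 0]], dtype=complex)
sz = np.array([[0.5, 0], [0, -0.5]], dtype=complex)

def embed(op, j, L):
    """Place a single-site spin operator at site j of an L-site chain."""
    out = np.array([[1.0 + 0j]])
    for site in range(L):
        out = np.kron(out, op if site == j else np.eye(2))
    return out

def mean_frequency(L=6, J=1.0, delta=1.0, h=0.5, T=1.0):
    H0 = sum(J * (embed(sx, j, L) @ embed(sx, (j + 1) % L, L)
                  + embed(sy, j, L) @ embed(sy, (j + 1) % L, L)
                  + delta * embed(sz, j, L) @ embed(sz, (j + 1) % L, L))
             for j in range(L))
    Sz = sum(embed(sz, j, L) for j in range(L))
    Sm = sum(embed(sx, j, L) - 1j * embed(sy, j, L) for j in range(L))   # S^-
    E, V = np.linalg.eigh(H0 - h * Sz)
    p = np.exp(-(E - E.min()) / T)
    p /= p.sum()                                              # Boltzmann weights
    W = np.abs(V.conj().T @ Sm @ V) ** 2 * p[np.newaxis, :]   # weight of |n> -> |m>
    omega = E[:, np.newaxis] - E[np.newaxis, :]               # transition frequencies
    return (W * omega).sum() / W.sum()

print(mean_frequency(delta=1.0))   # equals h = 0.5 up to numerical noise
print(mean_frequency(delta=1.2))   # shifted away from h for delta != 1
```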
Über die Quantenmechanik der Elektronen in Kristallgittern Zur Quantentheorie der Molekeln Dynamical theory of crystal lattices Theory of microwave absorption by the spin-1/2 Heisenberg-Ising magnet Zur Theorie der spezifischen Wärmen The One-Dimensional Hubbard Model Effect of correlation on ferromagnetism of transition metals Electron correlations in narrow energy bands Thermodynamics of the anisotropic spin-1/2 Heisenberg chain and related quantum chains Absence of Mott transition in an exact solution of the short-range, one-band model in one dimension A short simple evaluation of expressions of the Debye-Waller form Exact integrability of the one-dimensional Hubbard-model Zur Elektronentheorie der Metalle auf Grund der Fermischen Statistik The occurrence of singularities in the elastic frequency distribution of a crystal Anisotropic magnetic interactions and spin dynamics in the spin-chain compound Cu(py) 2 Br 2 : An experimental and theoretical study Here we have neglected contributions that are exponentially smaller than the displayed terms. In order to further simplify the remaining integrals we have to recall that for 3d systems there are square-root type van-Hove singularities at the band edges. This means that there are α v , α c > 0 such that within the respective bandsSubstituting this into the integrals on the right hand side of (17.14) we obtain, for instance,and a similar expression for the other integral. Altogether we end up withwhich is the low-temperature asymptotics of the grand canonical potential for an insulator, when µ ∈ (b n , a n+1 ). Again the formulae for particle number and entropy in the grand canonical ensemble are obtained by taking derivatives (cf. (17.7)),For T → 0+ in (17.18a) we still get that the integral on the right hand side with upper limit E F is equal to the total particle number, but now this equality does not fix E F , since the integral as a function of µ is constant for µ ∈ [b n , a n+1 ], thus non-invertible. Since µ is continuous in a vicinity of T = 0, we see that the integral exactly equals N even for small finite temperatures. Hence, the second term on the right hand side must vanish asymptotically, If interacting electrons (charge −e, mass m) in a periodic potential V (x) are exposed to an external electro-magnetic field, their one-particle Hamiltonian becomeswhere A(x, t) and φ(x, t) are the vector and scalar potentials of the external field. In the lecture we considered the many-body Hamiltonian generated by h relative to the Wannier basis. It was parameterized by hopping matrix elements t jk . Here we would like to find out how the external field modifies the t jk . We start by fixing the gauge such that φ(x, t) = 0.(i) Let λ(x, t) an arbitrary differentiable function. Using that p k = −i∂ k verify the commutator relationthe Wannier orbital at site R j and recall that the hopping matrix elements for vanishing external fields wereShow that in the presence of the external field this has to be modified to becomewhere λ is still arbitrary.(iii) Choosing nowfor an arbitrary fixed point x 0 and redefining φ(, the so-called Peierls substitution, should be valid.(iv) Show that the Peierls substitution leads to a modification of the hopping part of the many body Hamiltonian according to the rule t ij → t ij e iλ ij , whereThis Volterra equation is an appropriate starting point for a perturbation theory. 
Assuming |W (t)| to be small we conclude thatand therefore, to lowest order in W ,This is the statistical operator in so-called Born approximation. In the following t 0 will be sent to −∞. Using (22.8) we can calculate the time evolution of the expectation value of an operator A under the influence of the perturbation. We shall denote the canonical ensemble average by · T = tr{ρ 0 · } and write A(t) = e iHt A e −iHt for the Heisenberg time evolution of A. ThenHere we have used the cyclic invariance of the trace in the third equation and the fact that H commutes with ρ 0 in the fourth equation.A typical example of a perturbation, which will be relevant for our discussion below, is a classical time-dependent field h α (t) coupling linearly to operators X α ,(22.10)In this case The absorbed energy per unit time is As we shall see these quantities are more natural from a theoretical point of view as they can be more easily calculated. Using the binomial formula we can relate them to the moments of the normalized intensity,Let us now defineδω is the 'resonance shift', i.e., the deviation of the resonance frequency from the resonance frequency in the isotropic case. Similarly,is a measure of the line width. Hence, in order to calculate the resonance shift and the line width, we need to know the first four shifted moments m 0 , m 1 , m 2 , m 3 of the dynamic susceptibility χ +− . In the following we shall employ the notation ad X · = [X, · ] for the adjoint action of an operator X. Then, using that ad H = ad H 0 −h ad S z , that [H 0 , S z ] = 0 and that [S z , S + ] = S + , we see that Thus,It follows thatentailing that14)The latter formula shows that the moments m n are static correlation functions whose range and complexity grows with growing n. The first few of them can be easily calculated by hand. The most basic one iswhich is the magnetization per lattice site. The subsequent moments vanish in the isotropic point ∆ = 1. It turns our that they are polynomials inUnlike the magnetization they do not have an immediate interpretation. Using (23.14) they can be calculated one by one. We obtain, for instance,The moments are certain combinations of static short-range correlation functions. This implies, in particular, that they all exist in the thermodynamic limit. Substituting (23.17) in to (23.9) and (23.10) we have expressed our measures for the resonance shift and line width in terms of short-range static (rather than dynamical) correlation functions. We close this subject with a number of comments.(i) Short range static correlation functions of the XXZ chain can be calculated exactly at all values of temperatures and magnetic fields.(ii) Our formulae show that special combinations of short-range static correlation functions can be measured, at least in principle, by macroscopic experiments.(iii) In practice such measurements are difficult due to limitations of the experimental accuracies and limitations to experimental techniques, or due to the fact, that δ in typical spin chain materials is too small.(iv) It follows from the existence of the moments that the shape of the ESR absorption lines cannot be Lorentzian, as is assumed by many experimentalists and in most of the more conventional theoretical approaches.(v) Based on the expressions (23.15), (23.17) for the moments in terms of correlation functions we can prove and specify our claim that I(ω) is generally positive, even for vanishing magnetic field h. 
Using (23.15) and (23.17a) in I_0 = π(J m_1 + h m_0) and noting that ⟨s^+_1 s^−_2⟩_T = 2⟨s^x_1 s^x_2⟩_T we obtain

I_0 = π ⟨h s^z_1 + 2δJ (s^x_1 s^x_2 − s^z_1 s^z_2)⟩_T .   (23.18)

Here the first term in the brackets is positive if h ≠ 0, as the magnetization is positive for positive h and negative for negative h. Note, however, that it becomes extremely small for small temperatures in the massive phase δ > 0. As for the second term, for small enough h and δ > −1 the neighbour correlators are negative, and the zz-correlations are weaker than the xx-correlations for negative δ and stronger than the xx-correlations for positive δ. Hence, the second term is positive even for vanishing h as long as δ is non-zero.