Skip to article frontmatterSkip to article content
Site not loading correctly?

This may be due to an incorrect BASE_URL configuration. See the MyST Documentation for reference.

Time-independent perturbation theory

The basic setup is simple. We consider Hamiltonians of the form

H=H0+ϵH1H = H_0 + \eps H_1

where ϵ1\eps \ll 1. Here we are assuming that the matrix elements of H1H_1 are of the same order or smaller than those of H0H_0. We wish to solve the time-independent Schroedinger equation Hψ=EψH \ket{\psi} = E \ket{\psi} when the solutions for H0H_0 are known. We will also assume that the spectra of H0H_0 are discrete. States in the regime with continuous spectra require a different treatment via scattering theory.

Since the eigenvalues and eigenvectors n\ket{n}, EnE_n should be those of H0H_0 as ϵ0\eps \to 0, we make the following assumptions about the true eigenvalues and eigenvectors.

E=En(0)+ϵEn(1)+ϵ2En(2)+n=n(0)+ϵn(1)+ϵ2n(2)+\begin{align} E & = E^{(0)}_n + \eps E^{(1)}_n + \eps^2 E^{(2)}_n + \ldots\\ \ket{n} & = \ket{n^{(0)}} + \eps \ket{n^{(1)}} + \eps^2 \ket{n^{(2)}} + \ldots \end{align}

where H0n(0)=En(0)H_0 \ket{n^{(0)}} = E^{(0)}_n, and the eigenvectors are taken to be orthonormal. We then write

Hn=(H0+ϵH1)(n(0)+ϵn(1)+ϵ2n(2)+)=En=(En(0)+ϵEn(1)+ϵ2En(2)+)(n(0)+ϵn(1)+ϵ2n(2)+)\begin{align*} H\ket{n} & = (H_0 + \eps H_1) \left( \ket{n^{(0)}} + \eps \ket{n^{(1)}} + \eps^2 \ket{n^{(2)}} + \ldots\right)\\ &= E\ket{n}\\ & = \left( E^{(0)}_n + \eps E^{(1)}_n + \eps^2 E^{(2)}_n + \ldots\right)\left(\ket{n^{(0)}} + \eps \ket{n^{(1)}} + \eps^2 \ket{n^{(2)}} + \ldots\right) \end{align*}

We then demand that this equation be solved order by order in ϵ\eps. The solution at order ϵn\eps^n is known as nnth order perturbation theory.

First order perturbation theory (non-degenerate)

The ϵ0\eps^0 terms cancel by supposition. At O(ϵ)\cO(\eps), the terms are:

H1n(0)+H0n(1)=En1n(0)+En0n(1)H_1 \ket{n^{(0)}} + H_0 \ket{n^{(1)}} = E^1_n \ket{n^{(0)}} + E^0_n \ket{n^{(1)}}

We first take the inner product of this equation with \bra{n^{(0)}, and find

n(0)H1n(0)+En0n(0)n(1)=En1+En0n(0)n(1)\bra{n^{(0)}}H_1\ket{n^{(0)}} + E^0_n \brket{n^{(0)}}{n^{(1)}} = E^1_n + E^0_n \brket{n^{(0)}}{n^{(1)}}

or

En1=n(0)H1n(0)E^1_n = \bra{n^{(0)}}H_1\ket{n^{(0)}}

Next, we take the inner product with m(0)\bra{m^{(0)}} for mnm \neq n. Then we have

m(0)H1n(0)+Em0m(0)n(0)=En0m(0)n(1)\bra{m^{(0)}} H_1 \ket{n^{(0)}} + E^0_m \brket{m^{(0)}}{n^{(0)}} = E^0_n \brket{m^{(0)}}{n^{(1)}}

or

m(0)n(1)=m(0)H1n(0)Em0En0 \brket{m^{(0)}}{n^{(1)}} = \frac{\bra{m^{(0)}} H_1 \ket{n^{(0)}}}{E^0_m - E^0_n}

We can see that this fails if En0E^0_n is a degenerate eigenvalue, as there will be terms in the above expression with vanishing denominator. We will turn to this case later, and instead assume that there is no degeneracy. The techniques we explore here are called non-degenerate perturbation theory.

Now m(0)\ket{m^{(0)}} is a complete basis in the Hilbert space, as H0H_0 is a Hermitian operator. To find n(1)\ket{n^{(1)}}, we will need to know n(0)n(1)\brket{n^{(0)}}{n^{(1)}}. I will demand that n(0)n=1\brket{n^{(0)}}{n} = 1, which is not yet the standard normalization. We can rescale n\ket{n} after our calculation, but computig its norm and then dividing n\ket{n} by it. The upshot is that since n(0)n(0)=1\brket{n^{(0)}}{n^{(0)}} = 1, and n=n(0)+\ket{n} = \ket{n^{(0)}} + \ldots, all corrections to n(0)\ket{n^{(0)}} will be orthogonal to n(0)\ket{n^{(0)}}.

Given this, and that nn(0)n(0)=1\sum_n \ket{n^{(0)}}\bra{n^{(0)}} = 1, we have

n(1)=mnm(0)m(0)H1n(0)En0Em0\ket{n^{(1)}} = \sum_{m\neq n} \frac{\ket{m^{(0)}} \bra{m^{(0)}} H_1 \ket{n^{(0)}}}{E_n^0 - E_m^0}

Note that at this order the norm of the state only differs from 1 by a factor of O(ϵ2)\cO(\eps^2); we will need to go to second order in corrections to k\ket{k} to compute the norm, and to see what is needed to properly normalize the state.

Example: anharmonic oscillator

The classic example is the anharmonic quantum oscillator:

H=p22m+12mω2x2+λ4!x4H = \frac{p^2}{2m} + \half m \omega^2 x^2 + \frac{\lambda}{4!} x^4

We will do perturbation series with the parameter λ\lambda. You should be objecting that this isn’t quite right. λ\lambda has dimensions of (energy)/(length)4(energy)/(length)^4. As a dimensionful paraneter one cannot technically call it “small”. But we can combine this with the parameters of the unperturbed Hamiltonian to make a small parameter. For example, we know that we have a length scale

L=mωL = \sqrt{\frac{\hbar}{m\omega}}

and an energy scale E=ω{\cal E} = \hbar\omega. So by dimensional analysis, the combination

ϵ=λL4E=λm2ω3\eps = \frac{\lambda L^4}{{\cal E}} = \frac{\hbar \lambda}{m^2\omega^3}

is a likely perturbation parameter. We will see that this does in fact control the correction terms in perturbation theory.

The correction to the energy Ek(0)=ω(k+12)E^{(0)}_k = \hbar\omega(k + \half) in this case is taken from (6):

Ek(1)=k(0)λx4k(0)=λL4k(0)(a+a)4k(0)=λL4(6k2+6k+3)\begin{align} E^{(1)}_k & = \bra{k^{(0)}} \lambda x^4 \ket{k^{(0)}}\\ & = \lambda L^4 \bra{k^{(0)}}(a + a^{\dagger})^4 \ket{k^{(0)}}\\ & = \lambda L^4 (6 k^2 + 6k + 3) \end{align}

Now for kO(1)k \sim {\cal O}(1), the ratio of this to the energy gap ω\hbar \omega is k2ϵ\sim k^2 \eps, confirming our point above. furthermore, note that once k21/ϵk^2 \sim 1/\eps, this correction is no longer small. This is a general feature of perturbation theory; it only works for low energies.

Similarly the first order correction to the state (9) gives:

k(1)=kλL4(0)(a+a)4k(0)ω(k)=ϵk(0)(a+a)4k(0)k\begin{align} \ket{k^{(1)}} & = \sum_{\ell \neq k} \frac{\lambda L^4 \ket{\ell^{(0)}} (a + a^{\dagger})^4\ket{k^{(0)}}}{\hbar\omega (k-\ell)}\\ & = \eps \sum_{\ell \neq k} \frac{\ket{\ell^{(0)}} (a + a^{\dagger})^4\ket{k^{(0)}}}{k - \ell} \end{align}

We can see again how the dimensionless parameter ϵ\eps appears. The numerator is straightforward but tedious to evaluate. (a+a)4(a + a^{\dagger})^4 clearly connects the level kk to the levels in the range {k4,k2,k+2,k+4}\ell \in \{k-4, k-2,k+2,k+4\} (you should think about why k1,k3k - 1, k-3 do not appear). So the ratio will scale with k2k^2, and again if ϵk2O(1)\eps k^2 \sim \cO(1), perturbation theory will start to break down.

Second order perturbation theory (non-degenerate)

It is worth going out one more order so that you can see how things develop and how complicated they get. Also, it is often the case that corrections to the energy vanishes at first order. For example, let us say that we perturbed the harmonic oscillator by αx-\alpha x. (In fact this is exactly solvable; but we can still do perturbatiob theory and I’m trying to make a point. By the way, what might you worry about if the perturbation was by gx3g x^3?). The first order energy shift \bra{k^{(0)} x \ket{k^{(0)} = 0 because the states have definite parity but xx has odd parity. Thus xkx\ket{k}, ket{k have opposite parity, and so zero overlap.

The O(ϵ2)\cO(\eps^2) terms in (3) are:

H0n(2)+H1n(1)=En(0)n(2)+En(1)n(1)+En(2)n(0)H_0 \ket{n^{(2)}} + H_1 \ket{n^{(1)}} = E_n^{(0)} \ket{n^{(2)}} + E_n^{(1)}\ket{n^{(1)}} + E_n^{(2)} \ket{n^{(0)}}

Taking the inner product with n(0)\bra{n^{(0)}}, the first terms on each side cancel off of each other. The second term on the RHS canishes by our normalization convention. Thus, we find

En(2)=n(0)H1n(1)=kn(0)H1(0)(0)H1n(0)En0Em0=k(0)H1n(0)2En0Em0\begin{align} E_n^{(2)} & = \bra{n^{(0)}} H_1 \ket{n^{(1)}}\\ & = \sum_{\ell \neq k} \frac{\bra{n^{(0)}} H_1 \ket{\ell^{(0)}} \bra{\ell^{(0)}} H_1 \ket{n^{(0)}}}{E_n^0 - E_m^0}\\ & = \sum_{\ell \neq k} \frac{| \bra{\ell^{(0)}} H_1 \ket{n^{(0)}}|^2}{E_n^0 - E_m^0} \end{align}

We can similarly work out the second order correction to the state, again assuming that it is orthoginal to the zeroth order correction:

n(2)=,mnm(0)m(0)H1(0)(0)H1n(0)(En(0)Em(0))(En(0)E0)mnm(0)m(0)m(0)H1n(0)n(0)H1n(0)(En0Em0)2\begin{align} \ket{n^{(2)}} & = \sum_{\ell,m\neq n} \frac{\ket{m^{(0)}}\bra{m^{(0)}} H_1 \ket{\ell^{(0)}}\bra{\ell^{(0)}} H_1 \ket{n^{(0)}}}{(E_n^{(0)} - E_m^{(0)})(E_n^{(0)} - E_{\ell}^0)}\\ & \qquad\qquad -\sum_{m\neq n} \frac{\bra{m^{(0)}}\ket{m^{(0)}}\bra{m^{(0)}} H_1 \ket{n^{(0)}} \bra{n^{(0)}} H_1 \ket{n^{(0)}}}{(E_n^0 - E_m^0)^2} \end{align}

Now let us work out the normalization of this state. We define

1Z=nn=n(0)n(0)+ϵ2n(1)n(1)=1+ϵ2mnm(0)H1n(0)2(En0Em0)2+O(ϵ3)\begin{align} \frac{1}{Z} & = \brket{n}{n} \\ & = \brket{n^{(0)}}{n^{(0)}} + \eps^2 \brket{n^{(1)}}{n^{(1)}}\\ & = 1 + \eps^2 \sum_{m \neq n} \frac{|\bra{m^{(0)}}H_1 \ket{n^{(0)}}|^2}{(E_n^0 - E_m^0)^2} + \cO(\eps^3) \end{align}

Note that the overlap of the zeroth and second order correction does not appear, by our normalization assumption. Note that at this order, we can write

Z1ϵ2mnm(0)H1n(0)2(En0Em0)2Z \sim 1 - \eps^2 \sum_{m \neq n} \frac{|\bra{m^{(0)}}H_1 \ket{n^{(0)}}|^2}{(E_n^0 - E_m^0)^2}

so that the following state has unit norm

Zn\sqrt{Z} \ket{n}

where both terms are understood to go to O(ϵ2)\cO(\eps^2).

There is a nice expression for ZZ in this case. Note that to O(ϵ2)\cO(\eps^2),

EnEn0=En0(En0+ϵn(1)H1n(1)+ϵ2mnm(0)H1n(0)2(En0Em0))=1ϵ2mnm(0)H1n(0)2(En0Em0)2Z\begin{align} \frac{\del E_n}{\del E_n^0} & = \frac{\del}{\del E_n^0}\left(E_n^0 + \eps \bra{n^{(1)}} H_1 \ket{n^{(1)}} + \eps^2 \sum_{m\neq n}\frac{|\bra{m^{(0)}}H_1 \ket{n^{(0)}}|^2}{(E_n^0 - E_m^0)}\right)\\ & = 1 - \eps^2 \sum_{m\neq n}\frac{|\bra{m^{(0)}}H_1 \ket{n^{(0)}}|^2}{(E_n^0 - E_m^0)^2} \\ & \sim Z \end{align}

In fact, the formula

Z=EnEn0Z = \frac{\del E_n}{\del E_n^0}

holds to all orders in ϵ\eps.

Validity of perturbation theory

Perturbation theory can break down in two ways. First, the corrections can fail to be smaller because the ratio of matrix elements (times the appropriate power of ϵ\eps) to energy differences in Equations (9), (16), (17) fail to be small. For example, we may have a group of states which are degenerate or nearly degenerate, so that the denominators are small compared to the matrix elements. We can often deal with this by using (almost) degenerate perturbation theory.

At sufficiently high energies, however, perturbation theory can fail for a different reason. Consider the anharmonic oscillator. One has ratios like

n±k14!λx4nkω\frac{\bra{n\pm k} \frac{1}{4!} \lambda x^4 \ket{n}}{k\hbar \omega}

where k14k \sim 1-4. But the matrix element in the numerator can be shown to scale as n2\sim n^2. The point is that the unperturbed eigenstates at higher order are more spread out -- you should convince yourself that nx2nnL2=nmω\bra{n} x^2 \ket{n} \sim n L^2= \frac{n \hbar}{m\omega}. At some nn they will extend well into the region where λx4/4!12mω2\lambda x^4/4! \gg \half m \omega^2, and the perturbation is no longer small.

In fact, even for low-energy eigenstates this ends up being a problem. As we can start to see in (16), (17), as we go to higher and higher orders, we start connecting states at higher and higher energies with respect to the ground state. One can show (see for example Bender & Wu, 1971) that if E0=nAnλnE_0 = \sum_n A_n \lambda^n, then for large nn, Ann!A_n \sim n!. Since An/An1nA_n/A_{n-1} \sim n, for nλ1n\lambda \sim 1 the higher orders in perturbation theory will cease to be smaller: the series is an asymptotic series. In fact low orders in perturbation theory ofen give you very accurate answers! But they can only be so accurate.

An essential problem is that there are general terms in the exact answer which scale as e1/λe^{-1/\hbar\lambda}. This is not an analytic function near λ=0\lambda = 0, even though for positive real λ\lambda, it is very small. So the radius of converrgence of pertrubation theory is zero! A sign of this is that if we make λ\lambda negative, even small and negative, the theory becomes unstable and ill-defined (it is not just that the potential is unbounded below, but that it falls off more rapidly than x2-x^2). The radius of convergence of a power series in λ\lambda can be understood by studyying the behavior of the actual funcion in the complex plane. Singular behavior along the negative axis, up to the origin, means that a Taylor series in small λ\lambda will not converge.

That being said, we will continue to study perturbation theory because while it may be accurate only up to a point, that point is still pretty accurate! Quantum Electrodynamics can be treated in perturbation theory with respect to the fine structure constant α=e2c1137\alpha = \frac{e^2}{\hbar c} \sim \frac{1}{137}. This series is asymptotic, but nonetheless perturbation theory has given answers exquisitely confirmed by experiment.

Example: the quadratic Stark effect

A more nontrivial example is the response of the hydrogen atom to an electric field, for which H0H_0 is the usual Coulomb Hamiltonian, and H1=eEzH_1 = - e E z, corresponding to a charged particle in a constant electric field E=Ez^{\vec E} = E {\hat z}.

Problem for the student: Can you give an estimate of the dimensionless parameter that controls the size of the corrections?

We will focus on corrections to the ground state energy -- the perturbing Hamiltonian doesn’t act on spin, so we can ignore that degree of freedom, and otherwise the ground state is nondegenerate.

The first thing to note is that n=1,=0,m=0H1100=0\bra{n = 1, \ell = 0, m = 0} H_1 \ket{100} = 0. This is almost immediately apparent because H1H_1 is odd under zzz \to - z whereas 100\ket{100} is even. Thus z100z\ket{100} most be odd; since parity is implemented by a Hermitian operator, with eigenvalues ±1\pm 1 for even and odd states, the overlap of z100z\ket{100} with 100\bra{100} must vanish.

Another way to see this is to recall that for tensor operators, O,mnm{\cal O}_{\ell,m}\ket{n \ell' m'} will transform in a representation in DDD_{\ell}\otimes D_{\ell'}. Now zz is a tensor operator with =1,m=0\ell = 1, m = 0. This z100z \ket{100} must transform as =1\ell = 1, and so be orthogonal to the =0\ell = 0 state 100\ket{100}.

The result is that the leading behavior of the energy for small EE is quadratic in EE (because we have to go to second order in perturbation theory). As it happens, for excited states, when we have to use degenerate perturbation theory, there can be linear shifts, an effect called the linear Stark effect which we will describe later.

Equation (16) gives

E0(2)=e2E2n0,,mn,,mz1002E0Ek=94E2a03E_0^{(2)} = e^2 E^2 \sum_{n \neq 0,\ell,m} \frac{|\bra{n,\ell,m} z \ket{100}|^2}{E_0 - E_k} = \frac{9}{4} E^2 a_0^3

The last step is nontrivial, I am just asserting it. One simplification is that we know that only m=0,=1m = 0, \ell = 1 terms can enter in. Relatedly, this means for the corrections to the wavefunction (9), we get =1,m=0\ell = 1,m=0 component which means that the wavefunction elongates in the electric field, due to the response of the charges in the atom.

Note that the force of an electric field on an electric dipole is:

E0=dEE_0 = - {\vec d}\cdot{\vec E}

This can describe the present situation if we set

d=94a03Ez^=94a03E{\vec d} = - \frac{9}{4} a_0^3 E {\hat z} = - \frac{9}{4} a_0^3{\vec E}

The electric field, by elongating the electron wavefunction, induces an electric dipole moment in the atom. We can define the polarizability as the coefficient α\alpha in

E0E00=12αE2E_0 - E_0^0 = - \half \alpha |{\vec E}|^2

Here α=92a03\alpha = \frac{9}{2} a_0^3.

(Almost) degenerate perturbation theory

We now want to turn to the case that there are groups of states for which the energy splitting is small compared to the matrix elements of ϵH1\eps H_1. More precisely, consider the diagram of energy levels below with the condition

δϵχϵH1ψΔE\delta \eps \ll \bra{\chi} \eps H_1 \ket{\psi} \ll \Delta E

The unperturbed Hamiltonian will have the block diagonal structure

H0=(H0,11000H0,22000H0,33)H_0 = \begin{pmatrix} H_{0,11} & 0 & 0 & \ldots \\ 0 & H_{0,22} & 0 & \ldots \\ 0 & 0 & H_{0,33} & \ldots \\ \ldots & \ldots & \ldots & \ldots \end{pmatrix}

while the perturbed Hamiltonian will have the structure

H1=(H1,11H1,12H1,13H1,21H1,22H1,23H1,31H1,32H1,33)H_1 = \begin{pmatrix} H_{1,11} & H_{1,12} & H_{1,13} & \ldots \\ H_{1,21} & H_{1,22} & H_{1,23} & \ldots \\ H_{1,31} & H_{1,32} & H_{1,33} & \ldots \\ \ldots & \ldots & \ldots & \ldots \end{pmatrix}

Here the n,mn,m indices in Hk,mnH_{k,mn} refer to the subspaces Hm,Hm\cH_m, \cH_m spanned by the nearly-degenerate unperturbed eigenstates as labelled in the figure. The problems arise when ϵH1,nn\eps H_{1,nn} have nonvanishing matrix elements between the unperturbed eigenstates, which are larger than δϵ\delta \eps.

Degenerate perturbation theory

Let us first consider the case that δϵ=0\delta \eps = 0, so that H0,nn=EN(0)1\cH_{0,nn} = {E_N^{(0)} \bf 1} in this subspace . Since Hm\cH_m is often finite dimensional and H1,nnH_{1,nn} comparatively simple, a solution is to first choose bases in Hn\cH_n that diagonalize H1,nnH_{1,nn}. H0H_0 remains diagonal, since the identity remains the identity in any orthonormal basis. The eigenvalues of ϵH1\eps H_1 then give the first-order corrections to the energy, as expressed in this basis. If this resolves the degeneracy, one can then compute further corrections by using standard perturbation theory: for this, only off-diagonal matrix elements appear, and H1H_1 will only have nonvanishing off-diagonal matrix elements between different subspaces Hn\cH_n.

References
  1. Bender, C. M., & Wu, T. T. (1971). Large-Order Behavior of Perturbation Theory. Phys. Rev. Lett., 27(7), 461–465. 10.1103/PhysRevLett.27.461