Skip to article frontmatterSkip to article content
Site not loading correctly?

This may be due to an incorrect BASE_URL configuration. See the MyST Documentation for reference.

The WKBJ method

The next “non-perturbative” method we will discuss is the Wentzel-Kramers-Brillouin-Jeffreys approximation (sometimes the “J” is not present). This is a technique for solving wave equations, focusing on the case that the relevant energies are “large” in a sense that we will discuss.

This method is quite powerful and has uses well beyond quantum mechanics. It is the underlying scheme by which we justify ray-tracing methods for solving differential equations, such as geometric optics. For a good general discussion, I recommend Bender & Orszag, 1999. (A wonderful book for many subjects).

In the case of quantum mechanics, the other beautiful outcome of the WKB approximation is that we can begin to see the emergence of classical mechanics. We will discuss this fact once we have gone over the basics of the approximation.

While this is not a perturbative method, as it applies even when there is no “nearby” exactly solvable Hamiltonian, it is an asymptotic method. That is, there is a small parameter, and a systematic way of expanding the solution in that small parameter. The lowest-order solution is singular: in particular it is non-analytic in the small parameter. But there is an expression for it that can often be solved even when the underlying Schroedinger equation cannot be.

The short-wavelength limit

Generalities

We are going to focus here on the one-dimensional time-independent Schroedinger equation:

22m2x2ψ(x)+V(x)ψ(x)=Eψ(x)- \frac{\hbar^2}{2m} \frac{\del^2}{\del x^2} \psi(x) + V(x)\psi(x) = E\psi(x)

Crudely speaking, we are interested in highly excited states, in a way we will try to make precise. Consider in particular a bound state problem as illustrated below.

Inside the potential well, at high eneough energy there will be many nodes and the wavfunction will oscillate rapidly. In this region, the approximation works when the number of nodes is 1\gg 1. If the potential is not varying too rapidly, it will be approximately constant over the scale of separation of the nodes. Locally near these nodes, the solution will then look like a plane wave (solution in a constant potential) with wavenumber

k(x)=p(x)=2m(EV(x))\hbar k(x) = p(x) = \sqrt{2 m (E - V(x))}

Similarly, if the energy is far enough below the potential, then the exponential falloff will be rapid compared to the rate of change of the potential and cam be characterized locally by a decay rate

κ(x)=2m(V(x)E)\hbar\kappa(x) = \sqrt{2m(V(x) - E)}

which we can think of as an imaginary wavenumber.

If this approximation is close to true, we can approximate the wavefunction well inside the potential well with

ψ(x)A+exp(ix0xdxp(x))+Aexp(ix0xdxp(x))\psi(x) \sim A_+ \exp\left(\frac{i}{\hbar}\int_{x_0}^x dx' p(x')\right) + A_- \exp\left(- \frac{i}{\hbar}\int_{x_0}^x dx' p(x')\right)

where p(x)=2m(EV(x))p(x) = \sqrt{2m(E - V(x))} is the classical momentum at fixed energy. Here x0x_0 is a choice, that can be absorbed into A±A_{\pm}. Well below the barrier, the solution is:

ψ(x)B+exp(1x0xdxρ(x))+Bexp(1x0xdxρ(x))\psi(x) \sim B_+ \exp\left(\frac{1}{\hbar}\int_{x_0}^x dx' \rho(x')\right) + B_- \exp\left(- \frac{1}{\hbar}\int_{x_0}^x dx' \rho(x')\right)

where ρ(x)=i2m(EV(x))=2m(V(x)E)\rho(x) = - i \sqrt{2m(E - V(x))} = \sqrt{2m(V(x) - E)} is a sort of imaginary momentum.

The approximation can be expected to break down when the energy is roughly equal to the potential energy, so that the variation of the wavefunciton is not rapid. This is the region around the classical turning points. However, in this region, we can approximate the potential as a linear one, and the Schroedinger equation with a linear potential does have a solution in terms of Airy functions. If the WKB approximation starts to be good while the linear approximation is still valid, then we can use this fact to match the Airy functions to the WKB solutions and get a complete solution, including an approximation to the energy eigenvalues.

To get an estimate of when this approximation words, we can ask that the change in the wavelength λ(x)=/p(x)\lambda(x) = \hbar/p(x) over a distance of order the wavelength is small compared to the wavelength. This becomes

Δλλλxλλ=xλ=mV(2m(EV))3/2=mVp3=122mp2Vp=12λxVEkin1\begin{align} \Big| \frac{\Delta\lambda}{\lambda}\Big| & \sim \Big| \frac{\lambda\del_x \lambda}{\lambda}\Big| \\ & = |\del_x \lambda| = \Big|\frac{m\hbar V'}{(2m(E - V))^{3/2}}\Big| \\ & = \Big| \frac{m \hbar V'}{p^3}\Big| \\ & = \half \Big| \frac{2 m}{p^2} \frac{\hbar V'}{p}\Big|\\ & = \half \Big| \frac{\lambda \del_x V}{E_{kin}}\Big| \ll 1 \end{align}

So the change in the potential over a wavelength is small compared to the kinetix energy. We will see how this conditions emerges from a more systematic treatment.

The semiclassical expansion

Let us begin with the ansatz

ψ=eiS/\psi = e^{i S/\hbar}

So far this is just a rewriting, not an approximation. I am using the letter SS to foreshadow the relationship of the WKB approximation to the emergence of classical mechanics.

Putting this into the TISE, we get

(xS)2=2m(EV(x))+ix2S(\del_x S)^2 = 2m(E - V(x)) + i\hbar \del_x^2 S

This is still an exact equation. But the approximation (4) arises if we are allower to ignore the final term. Since this aproximation seems tied up in classical mechanics, and the last term is proportional to \hbar, one often says that we will perform a “small-\hbar” approximation. This is not quite correct. What we will be demanding is essentially that S/1S/\hbar \gg 1.

With this in mind, we can still write a formal expansion

S=Scl+S1+2S2+S = S_{cl} + \hbar S_1 + \hbar^2 S_2 + \ldots

and solve the Schroedinger equation order by order in \hbar. We will still need to check that SkSk1\hbar S_k \ll S_{k-1}. Note that in terms of the wavefunction ψ\psi this is a singular expansion; the lowest-order piece is nonanalytic in \hbar.

Plugging (9) into (8) and keeping only the \hbar-independent term, we get

(xScl)2=2m(EV(x))(xScl)22m+V(x)=E(\del_x S_{cl})^2 = 2m(E - V(x)) \Rightarrow \frac{(\del_x S_{cl})^2}{2m} + V(x) = E

Those who have had a good classical mechanics course will recognize this as the Hamilton-Jacobi equation, where SS is the classical action. We will return to this interpretation below, which is central to ray tracing/geometric optics/the method of characteristics. The solution is

Scl(x)=±x0xdx2m(EV(x))±x0xdxp(x)S_{cl}(x) = \pm \int_{x_0}^x dx' \sqrt{2m(E - V(x'))} \equiv \pm \int_{x_0}^x dx' p(x')

where again, x0x_0 is an integration constant. We have two solutions to SS and therefore for ψ\psi. At this level of approximation we can consider linear combinations of these two solutions. Note that what we have done is true formally whether EVE - V is positive or negative. In the former case, we recover (4). In the latter case, SS becomes imaginary, and we recover (5).

Our next step is to find S1S_1. The O()\cO(\hbar) terms in (8) are:

2xSclxS1=ix2SclxS1=i2x2S0xS0=i2xlnxS02 \del_x S_{cl} \del_x S_1 = i \del_x^2 S_{cl} \Rightarrow \del_x S_1 = \frac{i}{2} \frac{\del_x^2 S_0}{\del_x S_0} = \frac{i}{2} \del_x \ln \del_x S_0

Thus, keeping only terms that are nonvanishing as 0\hbar \to 0, we have

ψ(x)=eiS/=eiScl+iS1=1xSeiScl=1p(x)eiScl\begin{align} \psi(x) & = e^{i S/\hbar} \\ & = e^{\frac{i S_{cl}}{\hbar} + i S_1}\\ & = \frac{1}{\sqrt{\del_x S}} e^{i \frac{S_{cl}}{\hbar}}\\ & = \frac{1}{\sqrt{p(x)}} e^{i \frac{S_{cl}}{\hbar}} \end{align}

where p(x)=2m(EV(x))p(x) = \sqrt{2m(E - V(x))}. This prefactor has a nice semiclassical interpretation, especially for bound states. We know ψ(x)21p(x)\psi(x)|^2 \propto \frac{1}{p(x)} is the probability density for the particle at position xx. If the particle is in a potential well it will oscillate with period TT. The fraction of time the particle will spend in a region dxdx is the time spent in that region, which is

P(x)dx=dxv(x)=mdxp(x)ψ(x)2dxP(x)dx = \frac{dx}{v(x)} = \frac{m dx}{p(x)} \propto |\psi(x)|^2 dx

Finally, we need to check on our approximation. Thius should be done at every order but we will here simply ask that in (8) the last term is small compared to the rest: that is, that

(xScl)2(x2Scl)(\del_x S_{cl})^2 \gg \hbar (\del_x^2 S_{cl})

This means

p(x)22p(x)p(x)=2mV(x)pαλ(x)VEkin1p(x)^2 \gg 2\hbar| p'(x) p(x)| = \Big| \frac{- 2 m V'(x)}{p}\Big| \Rightarrow \alpha \equiv \Big| \frac{\lambda(x) V'}{E_{kin}} \ll 1

which is essentially the condition we already derived from a more hand-waving argument. Here we introduce α\alpha for notational ease in the example below.

Just to get a sense of the nontriviality of these conditions, we can look at a 1d case with the potential V=c/xnV = - c/x^n with n>0n > 0. If we fix the total energy EE, we find

α=2mncxn+1(E+cxn)3/2\alpha = \frac{2 m \hbar \frac{n c}{x^{n+1}}}{\left(E + \frac{c}{x^n}\right)^{3/2}}

For fixed EE this is always valid as xx \to \infty as the particle becomes the free particle for which WKB is exact. As xto0x to 0, when the potential energy dominates is large compared to E|E|, we have:

α2mncxn21 \alpha \sim \frac{2 m n \hbar}{\sqrt{c}} x^{\frac{n}{2} - 1}

Thus WKB works well when n>2n > 2. For n<2n < 2 WKB breaks down near the origin. (Note that for n>2n > 2 the energy eigenstates are unbounded below, with n=2n = 2 being an interesting marginal case, as we have seen in a prior problem set).

You might fairly ask why we chose the scaling we did. Why is the leading singular behavior eiS/e^{i S/\hbar} and not eiS/2e^{i S/\hbar^2} or eiS/1/2e^{i S/\hbar^{1/2}} or some such? Why do we expand SS in powers of \hbar rather than 2\hbar^2, 1/2\hbar^{1/2}? This is determined by the power of 2\hbar^2 that sits in front of the kinetic term. If we demand that all of the terms in the Schroedinger equation appear at the same order, including the energy, you can convince yourself tha this is the right scaling. In general you have to deduce the correct scaling; and a given problem may support various scalings for different energies and momenta. The book Bender & Orszag, 1999 has a good discussion of these issues. This paper of which I was a co-author encounters this issue in a situation where the resolution is nontrivial.

Bound state problems

We now consider particles in some potential well, such that limxV(x)>E\lim_{x\to\infty} V(x) > E. In the figure below, we have broken up the xx axis into three regions. In regions I, II, III the WKB approximation is expected to hold. For large x|x| (in regions I and III), we assume the energy is far enough below the barrier that the length scale giverning the exponential falloff of the wavefunction is small compared to the length scale over which the potential varies. In region II we assume that the energy is large enough that the wavefunction rapidly oscillates, with a space between nodes so small that the potential does not vary appreciably in this range.

The WKB approximation is expected to break down at the classical turning points A,BA,B> Here p(x)0p(x) \to 0, λ(x)\lambda(x) \to \infty, and we can see that the condition (6) breaks down. Generally, however, the potential can be well-approximated by a linear potential; the resulting Schroedinger equation is exactly solvable. If we are fortunate, the regime in which the linear approximation holds will overlap with the regime in which the WKB approximation holds, as illustrated below. In this case we can match our WKB functions to the solutions near the turning points.

Let us first turn to the solutions in the WKB regions. In region I, the solution is

ψI=CI2m(V(x)E)e1xIxdx2m(V(x)E)\psi_I = \frac{C_I}{\sqrt{2m(V(x) - E)}} e^{\frac{1}{\hbar} \int^x_{x_I} dx'\sqrt{2m(V(x') - E)}}

Here xIx_I is arbitrary. This solution will be exponentially decaying as xx \to - \infty: we have set to zero an exponentially growing component.

In region II the WKB solution looks like

ψII(x)=12m(EV(x))(AIIeixIIxdx2m(V(x)E)+BIIeixIIxdx2m(V(x)E))\psi_{II}(x) = \frac{1}{\sqrt{2m(E - V(x))}} \left(A_{II} e^{\frac{i}{\hbar}\int_{x_{II}}^x dx'\sqrt{2m(V(x') - E)}} + B_{II} e^{- \frac{i}{\hbar}\int_{x_{II}}^x dx'\sqrt{2m(V(x') - E)}}\right)

While in Region III the WKB solution looks like

ψI=CIII2m(V(x)E)e1xIIIxdx2m(V(x)E)\psi_I = \frac{C_{III}}{\sqrt{2m(V(x) - E)}} e^{- \frac{1}{\hbar} \int^x_{x_{III}} dx'\sqrt{2m(V(x') - E)}}

which is exponentially decaying.

As with the other bound state problems we have studied, we have 4 complex unknowns, with one complex constraint due to the combination of normalization and choice of overeall phase. We will find four constraints due to matching the WKB solutions to the equations near the turning points.

Let us study the turning point BB. Expanding in a Taylor series,

V(x)=V(xB)+(xxB)V(xB)+12(xxB)2V(xB)+=V(xB)+γ(x=xN)+\begin{align} V(x) & = V(x_B) + (x - x_B) V'(x_B) + \half (x - x_B)^2 V''(x_B) + \ldots\\ & = V(x_B) + \gamma (x = x_N) + \ldots \end{align}

We assume that teh higher order terms are small in a region around xBx_B that overlaps with the region of validity of the WKB approximation. In general, of course, this needs to be checked.

Now we examine the Schrodinger equation. By definition, V(xB)=EV(x_B) = E, so these terms drop out. Furthermore, we define yL(xxB)y \equiv L (x - x_B), with LL to be determined. Nondimensionalizing our equations are almost always a good idea -- we will discover important characteristic scales, and we will reduce the equations to ones that we can look up or plug most easily into a computer. Further multiplyng the equation by 2mL22\frac{2 m L^2}{\hbar^2}, the resulting equation is:

y2ψ+2mγL32y=0- \del_y^2 \psi + \frac{2 m \gamma L^3}{\hbar^2} y = 0

We set L=(2mγ2)1/3L = \left(\frac{2m\gamma}{\hbar^2}\right)^{1/3}, to find

y2ψ+yψ=0- \del_y^2 \psi + y \psi = 0

This differential equation has known solutions in terms of Airy functions, whose properties you can look up in many places such as the Digital Library of Mathematical Functions (see Chapter 9). There are two independent solutions Ai(y),Bi(y)Ai(y), Bi(y), and a general solution has the form

ψ=aAi(y)+bBi(y)\psi = a Ai(y) + b Bi(y)

At large y|y|, the WKB approximation works well for these functions (you can and should deduce this from the conditions we have discussed). Their asymptotic properties are known:

Ai(y){1πy1/4e23y3/2y1πy1/4cos(23y3/2π4)yBi(y){1πy1/4e23y3/2y1πy1/4sin(23y3/2π4)y\begin{align} Ai(y) \sim \begin{cases} \frac{1}{\sqrt{\pi} y^{1/4}} e^{-\frac{2}{3}y^{3/2}} & y \to \infty \\ \frac{1}{\sqrt{\pi} |y|^{1/4}} \cos\left(\frac{2}{3}|y|^{3/2} - \frac{\pi}{4}\right) & y \to - \infty \end{cases} Bi(y) \sim \begin{cases} \frac{1}{\sqrt{\pi} y^{1/4}} e^{\frac{2}{3}y^{3/2}} & y \to \infty \\ \frac{1}{\sqrt{\pi} |y|^{1/4}} \sin\left(\frac{2}{3}|y|^{3/2} - \frac{\pi}{4}\right) & y \to - \infty \end{cases} \end{align}

You can check easily that these result from applying our WKB solutions to regions II,IIIII,III to the linear potential.

Normalizability then demands that we choose solution Ai(y)Ai(y) about the turning point xBx_B. About xAx_A we can map the problem to the one we solved by letting yyy \to - y, so that the proper solution is Ai(y)Ai(-y).

We then need to match the solutions in region II. Performing the expansion of dxp(x)\int dx' p(x') near the turning point,

1xIIxdx2m(EV(x))23(y)3/2+23(yII)3/2\frac{1}{\hbar} \int_{x_{II}}^x dx' \sqrt{2m(E - V(x'))} \sim - \frac{2}{3}(-y)^{3/2} + \frac{2}{3}(-y_{II})^{3/2}

where yII=L(xIIxB)y_{II} = L (x_{II} - x_B). The minus sign appears because we get y-y inside the square root; the upshot is an overall minus sign when we do the integral. We set xII=xBx_{II} = x_B to find that our solution in region II matches AiAi if we adjust AII,BIIA_{II}, B_{II} such that

ψ(x)12m(EV(x))cos(1xxBdx2m(EV(x))π4)\psi(x) \propto \frac{1}{\sqrt{2m(E - V(x))}} \cos\left(\frac{1}{\hbar} \int^{x_B}_x dx' \sqrt{2m(E - V(x'))} - \frac{\pi}{4}\right)

Note the integral is from xx to xBx_B; this takes care of the minus sign we introduced.

On the other hand, the same matching condition applied to the turning point AA gives us

ψ(x)12m(EV(x))cos(1xAxdx2m(EV(x))π4)\psi(x) \propto \frac{1}{\sqrt{2m(E - V(x))}} \cos\left(\frac{1}{\hbar} \int_{x_A}^x dx' \sqrt{2m(E - V(x'))} - \frac{\pi}{4}\right)

These two need to be equal, which means the cosine terms need to be equal. Since the cosine terms are odd, we can write

cos(1xxBdx2m(EV(x))+π4)=cos(1xAxdx2m(EV(x))π4) \cos\left(- \frac{1}{\hbar} \int^{x_B}_x dx' \sqrt{2m(E - V(x'))} + \frac{\pi}{4}\right) = \cos\left(\frac{1}{\hbar} \int_{x_A}^x dx' \sqrt{2m(E - V(x'))} - \frac{\pi}{4}\right)

The arguments now need to be equal up to a factor of πn\pi n (the overall sign difference if nn is odd can be absorbed into the overall relative sign of AI,AIIIA_I, A_{III}), so that

(1xxBdx2m(EV(x))+π4=1xAxdx2m(EV(x))π4πn (- \frac{1}{\hbar} \int^{x_B}_x dx' \sqrt{2m(E - V(x'))} + \frac{\pi}{4} = \frac{1}{\hbar} \int_{x_A}^x dx' \sqrt{2m(E - V(x'))} - \frac{\pi}{4} - \pi n

which becomes

1xAxBdxp(x)=π(n+12)\frac{1}{\hbar} \int_{x_A}^{x_B}dx' p(x') = \pi \hbar(n + \half)

This is the Bohr-Sommerfeld quantization condition. It was generalized to higher-dimensional situations in various works by Einstein, Brillouin, and Keller (only applicable to energies for which classical trajectories live on tori in phase space; a story that will take us too far afield, however).

Example 1: the Simple Harmonic Oscillator

In this case the turning points are at xB=xA=2Emωx_B = - x_A = \sqrt{\frac{2E}{m\omega}}. The quantization condition is

xBxBdx2m(E12mω2(x)2)=π(n+12)\int_{-x_B}^{x_B} dx' \sqrt{2m(E - \half m \omega^2 (x')^2)} = \hbar \pi(n + \half)

Now we can substitute x=y2Emωx = y \sqrt{\frac{2E}{m\omega}} and set y=sinθy = \sin \theta; we find

πEω=(n+12)π\frac{\pi E}{\omega} = (n + \half) \pi \hbar

But this is the exact answer! This is a general feature of Hamiltonians quadratic in their arguments -- the WKB approximation gives exact expressions for the eigenvalues. It also, through the path integral, gives an exact expression for teh propagator, as we will discuss. It does not, however, give the right expressions for the energy eigenstates. This is a longer story; the chapters in Shankar on path integrals and their relation to WKB contains the required pieces of this puzzle.

Example 2: the Linear potential

Consider instead V=gxV = g |x|. If we go through the same exercise, we find

E=12(3g(n+12)π2m12)2/3E = \half \left(\frac{3 g (n + \half) \pi \hbar}{2 m^{\half}}\right)^{2/3}

The wavefunctions, as it happens, are amenable to an exact solution using Airy functions, and the resulting eigenvalues can be computed. For the ground state n=0n = 0, the WKB expression is too high by 10%10\%. for the first excited state, the result is too high by 1%1\%. For the second excited state, the result is low by .5%.5\%. (It is worth thinking about why the n=0,1n = 0,1 WKB answers will always be larger than the exact answer, but at higher nn there is no such restriction).

The Hamilton-Jacobi equation and geometric optics

We found that at lowest order in the WKBJ approximation,

12m(Scl)2+V(x)=E\frac{1}{2m} ({\vec\nabla} S_{cl})^2 + V({\vec x}) = E

Here we have written down the equation for an nn-dimensional nonrelativistic potential problem; the derivation is essentially the same.

As it happens this is a well-known equation in classical mechanics, the Hamilton-Jacobi equation. Since this course teaches classical as well as quantum mechanics, I will break here and discuss the meaning of this equation in classical mechanics.

Hamilton-Jacobi from least action

Let us return to the principle of least action and consider it in phase space form. Starting with S=titfdtL(q,q˙)S = \int_{t_i}^{t_f} dt L(q,{\dot q}), we recall that to first order in variations of qq,

δS=[Lq˙iδqi]ttf+titdt(LqiddtLq˙i)δqi\delta S = \left[ \frac{\del L}{\del {\dot q}^i} \delta q^i\right]\Big|^{t_f}_{t} + \int_{t_i}^{t} dt' \left(\frac{\del L}{\del q^i} - \frac{d}{dt} \frac{\del L}{\del {\dot q}^i}\right)\delta q^i

In the usual Euler-Lagrange equations we fix the initial and final positions. Here we want to understand SS evaluated on classical paths as a function of q(t)q(t), given some fixed initial condition q(ti)q(t_i). Since we are evaluating SS along classical paths, the integral vanishes, the integrand just being the classical erquations of motion, and we are left with δS=Lq˙i(t)δqi(t)\delta S = \frac{\del L}{\del {\dot q}^i}(t) \delta q^i(t), or

Sqi=pi\frac{\del S}{\del q^i} = p_i

where pip_i is the conjugate momentum. Here since q(ti)q(t_i) is fixed and q(t)q(t) given, pip_i is completely determined.

Now by definition, dSdt=L\frac{d S}{d t} = L by definition. On the other hand, using the chain rule,

L=dSdt=St+Sqiq˙i=St+piq˙iL = \frac{d S}{dt} = \frac{\del S}{\del t} + \frac{\del S}{\del q^i}{\dot q^i} = \frac{\del S}{\del t} + p_i {\dot q^i}

Since H=piq˙iLH = p_i {\dot q^i} - L, we have

St=H\frac{\del S}{\del t} = - H

Thus if we consider SS as a function of q(t),tq(t), t, its variation dSdS under infinitesimal variations dqi,dtdq^i, dt of its arguments is

dS=pidqiHdtdS = p_i dq^i - H dt

We have also shown that

St+H(qi,pi=Sqi)=0\frac{\del S}{\del t} + H(q^i, p_i = \frac{\del S}{\del q^i}) = 0

when evaluated along classical trajectories. This is the Hamilton-Jacobi equation; the function SS is called Hamilton’s principle function.

We have shown that by knowing the classical trajectories qi(t),pi(t)q^i(t), p_i(t) given fixed initial conditions, the equations (38) and (40) furnish a solution SS to the Hamilton-Jacobi equations. Furthermore, this solution is the classical action as a function of the endpoint q(t)q(t) given q(ti)q(t_i). Returning to the quantum problem, if we consider the time-dependent Schroedinger equation

itψ=22m2ψ+V(q)ψi\hbar \del_t \psi = - \frac{\hbar^2}{2m} \nabla^2 \psi + V(q) \psi

and write \psi = e^{i\frac{S(q,t)}{\hbar} with S=Scl+S1+S = S_{cl} + \hbar S_1 + \ldots, then the leading order O(0)\cO(\hbar^{0}) equation is the full Hamilton-Jacobi equation.

The equation (36) is consistent with the full Hamilton-Jacobi equations if we set

S=W(qi(t))EtS = W(q^i(t)) - E t

where WW is called Hamilton’s characteristic function; here pi=Wqip_i = \frac{\del W}{\del q^i}, or dW=pidqidW = p_i dq^i. The solution is also consistent with the classical ewuations of motion for the family of trajectories with fixed energy. The functional form will be different from the solution with fixed initial conditions, as the corresponding trajectories will have varying energy (consider the 1d case of force-free motion where we fix tt and vary qq given an initial condition; this will be a family of trajectories with increasing velocity and thus energy).

What we are seeing is that different solutions to the H-J equations with different specified constants yield different families of trajectories. To understand this we consider a different perspective on these equations.

Canonical transformations in phase space

Last semester we briefly touched on canonical transformations as transformations in phase space which preserve the Poisson brackets and the Hamiltonian structure of the problem. That is, start with some system for which the phase space variables are qi,piq^i, p_i, i{1,,n}i \in \{1,\ldots,n\}, such that {qi,qj}={pi,pj}=0\{q^i, q^j\} = \{p_i, p_j\} = 0, and {qi,pj}=δji\{q^i,p_j\} = \delta^i_j. Assume further there is a Hamiltonian H(q,p)H(q,p) such that the system satisfies Hamilton’s equations. A canonical transformation is coordinate transformation in phase space to new coordinates QI(q,p),PI(q,p)Q^I(q,p), P_I(q,p), I{1,,N}I - \{1,\ldots,N\} such that

{QI,QJ}={PI,PJ}=0 ;  {QI,PJ}=δJI\{Q^I, Q^J\} = \{P_I, P_J\} = 0\ ; \ \ \{Q^I, P_J\} = \delta^I_J

and such that there is function K(Q,P)K(Q,P) for which Hamilton’s equations for q,p,Hq,p,H become, under the coordinate transformation, Hamilton’s equations for Q,P,KQ,P,K:

dQIdt=KPIdPIdt=KQI\begin{align} \frac{d Q^I}{d t} & = \frac{\del K}{\del P_I}\\ \frac{d P_I}{d t} & = - \frac{\del K}{\del Q^I} \end{align}

Following the previous discussion, these are consistent with the trajectory being the minimum of the phase space action

S=titdt(PIQ˙IK)S' = \int_{t_i}^t dt' \left(P_I {\dot Q}^I - K\right)

or in differential form, dS=PIdQIKdtdS' = P_I dQ^I - K dt.

The variation of S,SS,S' yields the same equations of motion if dS=dS+dFdS = dS' + dF where FF is a function of coordinates, momenta, and time; integrating this along a trajectory, it will only contribute to the endpoints and thus not appear in the equations of motion. (Note that in principle stationarity works if dS+dF=λdSdS' + dF = \lambda dS for some constant λ\lambda. But we can scale λ\lambda away, by rescaling P,Q,KP,Q,K and rewriting the form of KK, so we will ignore this. (See chapter 10 of Goldstein et al., 2002 for a discussion of this point).

As it happens, however, FF determines the coordinate transformation. That is, if we write

dF=pidqiPIdQIHdt+KdtdF = p_i dq^i - P_I dQ^I - Hdt + Kdt

or equivalently

dFdt=piq˙iPIQ˙I+(KH)\frac{dF}{dt} = p_i {\dot q}^i - P_I {\dot Q}^I + (K - H)

The differential version treats q,Qq, Q as the independent coordinates. Thus, we have

pi=FqiPI=FQIK=H+Ft\begin{align} p_i & = \frac{\del F}{\del q^i}\\ P_I & = - \frac{\del F}{\del Q^I}\\ K & = H + \frac{\del F}{\del t} \end{align}

If we specify F(q,Q,t)F(q, Q, t), then, the equations above give p(q,Q,t)p(q,Q,t), P(q,Q,t)P(q, Q, t), KK. We can then solve for Q,PQ, P in terms of q,pq, p.

On the other hand, writing PdQ=d(PQ)QdPP dQ = d(PQ) - Q d P, we can instead consider the equation

d(F+PIQI)=pidqi+QIdPIHdt+Kdtd(F + P_I Q^I) = p_i dq^i + Q^I dP_I - Hdt + Kdt

If we set Φ=F+PIQI=Φ(qi,PI)\Phi = F + P_I Q^I = \Phi(q_i, P^I) we get a different set of coordinate transformations determined by Φ\Phi, with

pi=ΦqiQI=ΦPIK=H+Φt\begin{align} p_i & = \frac{\del \Phi}{\del q^i}\\ Q^I & = - \frac{\del \Phi}{\del P_I}\\ K & = H + \frac{\del \Phi}{\del t} \end{align}

We can consider similar generating functions that are functions of p,Pp, P or of p,Qp, Q. These are typically classified into four types, dependening on which pair of old and new variables they are explicitly functions of.

Let us therefore consider coordinates QI,PIQ^I, P_I that are constant in time along trajectories. These could be the initial conditions or any other set of 2n2n bits of data that label the trajectories. The corresponding Hamiltonian KK is therefore constant by Hamilton’s equations and can be set to zero. Then we are left with

K=0=H+tF2(q,P,t)K = 0 = H + \del_t F_2(q,P,t)

where we have set S=F2S = F_2 to tie this to the discussion in Section 9.1 of Goldstein et al., 2002. The variables PIP_I appear as explicit constants of motion; we are guaranteed that QI=F2PIQ^I = - \frac{\del F_2}{\del P^I} is also constant. The equation above becomes the Hamilton-Jacobi equation

H(qi,pi=F2qi,t)+F2tH(q_i, p_i = \frac{\del F_2}{\del q^i}, t) + \frac{\del F_2}{\del t}

In the original derivation we specified qi(ti)q^i(t_i). But as long as we have a complete solution with nn independent integration constants, then we can invert the coordinate transformations to find qi(QI,PI,t)q^i(Q^I, P_I, t), that is, we generate a trajectory.

Canonical transformations and the HJ equation

In this context, if the Hamiltonian is time-independent, we can choose a solution of the form F2=S=W(q,E)EtF_2 = S = W(q,E) - Et. EE now becomes one of the constants of motion: the resulting equation

H(qi,pi=Wqi)=EH(q^i, p_i = \frac{\del W}{\del q^i}) = E

A complete solution will require n1n-1 additional constants of motion.

The simplest interesting example is the 1d simple harmonic oscillator. Considering solutions of fixed energy, we have

Sx=2m(E12mω2x2)\frac{\del S}{\del x} = \sqrt{2m(E - \half m \omega^2 x^2)}

We can do an integral to find

S=xdx2m(E12ω2x2)EtS = \int^x dx\sqrt{2m(E - \half \omega^2 x^2)} - E t

up to a constant of integration which we can absorb in to SS without changing the fact that SS solves the H-J equation.

By the above discussion, ES\del_E S is a constant of motion, which we will call β\beta, so that

β=dxm212m(E12ω2x2)t\beta = \int dx \sqrt{\frac{m}{2}}\frac{1}{\sqrt{2m(E - \half \omega^2 x^2)}} - t

is constant; again the constant of integraion unspecified by the indefinite integral can be absorbed into β\beta. The result is

t+β=1ωsin1xmω22Et + \beta = \frac{1}{\omega}\sin^{-1} x \sqrt{\frac{m\omega^2}{2E}}

or

x=2Emω2sin(ω(t+β))x = \sqrt{\frac{2E}{m\omega^2}} \sin(\omega (t + \beta))

This is the general solution; the constants of motion are EE and the time β-\beta at which x=0x = 0. The latter can be shifted away with a shift in time.

The method of characteristics

The flip side of this, relevant for the quantum problem, is that if we have a family of solutions qi(t)q^i(t) to the equations of motion, we have a solution to the Hamilton-Jacobi equation. That is, let us assume we know S(qi(t0),t0)S(q^i(t_0),t_0) (the initial condition) at some time t0t_0. This yields pi(t0)=Sqip_i(t_0) = \frac{\del S}{\del q^i}. At each initial point we can thus generate a trajectory qi(t),pi(t)=Sqi(t)q^i(t), p_i(t) = \frac{\del S}{\del q^i(t)}. Along each trajectory, the partial derivative tS\del_t S becomes a total derivative, so we have dtS+H(qi(t),pi(t))=0d_t S + H(q^i(t), p_i(t)) = 0. HH is a known function and the equations of motion yield known functions qi(t),pi(t)q^i(t), p_i(t), so we have dtS+F(t)=0d_t S + F(t) = 0 which can be integrated, perhaps on a computer. We then know SS along the family of trajectories we have computed. This is a specific example of the method of characteristics. The process of turning the wave equation into an equation of Hamilton-Jacobi type, and then solving the katter via the method of characteristics, is a specific example of ray tracing. A similar procedure for exlectromagnetic waves, for example, moving through a medium with variable index of refraction, yields the geometric optics approximation, where the individual ``rays" are the characteristics of an associated Hamilton-Jacobi-like approximation to the full wave equation.

A simple example is the case of the free particle, for which

tS+12m(xS)2=0\del_t S + \frac{1}{2m}(\del_x S)^2 = 0

The characteristic equations/Hamilton’s equations are x˙=p{\dot x} = p, {\dot p} = 0,solvedby, solved by x(t) = x_0 + p_0 t/m.Letussayweknowthatat. Let us say we know that at t = 0,, S = \half \gamma x_0 ^2.Then. Then p_0 = \gamma x_0,and, and x(t) = (1 + \gamma t/m) x_0$.

Now along this trajectory,

tS+12mp(t)xS=tS+12x˙xS=dtS12mp(t)2=0\begin{align} \del_t S + \frac{1}{2m} p(t) \del_x S & = \del_t S + \frac{1}{2} {\dot x} \del_x S \\ & = d_t S - \frac{1}{2m} p(t)^2 = 0 \end{align}

Now along the trajectory x(t)x(t), p(t)=p(0)=γx0p(t) = p(0) = \gamma x_0, so dtS=γ22mx02d_t S = \frac{\gamma^2}{2m} x_0^2 or

S=12γx02+γ22mx02t=12γx02(1+γmt)=12γx(t)2(1+γt/m)\begin{align} S & = \half \gamma x_0^2 + \frac{\gamma^2}{2m} x_0^2 t \\ & = \half \gamma x_0^2\left(1 + \frac{\gamma}{m} t\right) \\ & = \half \gamma\frac{x(t)^2}{(1 + \gamma t/m)} \end{align}

You can check that this solves the Hamilton-Jacobi equation.

WKBJ as a semiclassical limit

The fact that the classical equations omf motion arise from the WKBJ approximation motivates the term semiclassical limit. You might think this is because it is an 0\hbar \to 0 limit; again, that is not quite right, since (as I keep ranting on and on about) \hbar is a dimensionful parameter.

I will sketch out a couple of ways to think about this limit. These are only sketches. Fleshing out the arguments in detail is non-trivial!

Comment: large quantum numbers

In general the story is that Scl/1S_{cl}/\hbar \gg 1. We can see what this means in terms of the simple harmonic ocillator. If LTVnωL \sim T - V \sim n \hbar\omega, wheer nn is the excitation level of a state, we can guess that SnS \sim n \hbar. So n1n \gg 1 is the limit; that is, we have a limit of large quantum numbers.

Anther way we might see classical mechanics emerge is through Ehrenfest’s theorem. For the potential problem, this states

mddtxi=xiV(x)m \frac{d}{dt} \vev{x^i} = - \vev{\frac{\del}{\del x^i} V(x)}

This is exactly the classical equations of motion for xix^i if

xiV(x)=xiV(xi) \vev{\frac{\del}{\del x^i} V(x)} = \frac{\del}{\del x^i} V(\vev{x^i})

As it happens this is trivially true for the SHO, for which the WKB approximation is known to be exact. More generally, if we write x^=x+δx{\hat x} = \vev{x} + \delta x, we need that xnδxn\vev{x}^n \gg \vev{\delta x^n}. The latter measures the spread of the wavefunction; we need this to be small compared to the range of variation of the position variable. This should be related to the kind of high-energy limit we are discussing in which the wavelengths used are small compared to the range of variation of the potential. We can realize this by constructing coherent states which realize these conditions explicitly. In view of time, however, I will press on.

Comment: the path integral picture

Let us return to the Feynman path integral for solutions to the time-dependent Schroedinger equation:

ψ(x,t)=DxeiS([x(t)])/\psi(x,t) = \int Dx e^{i S([x(t)])/\hbar}

This would yield the WKB approximation if the dominant contribution to the integral was eiScl[x]/e^{i S_{cl}[x]/\hbar}. As it happens, this emerges fairly naturally. I will sketch out a hand-waving description here.

Let us first consider the general 1d integral

I(t)=dxeitf(x)I(t) = \int dx e^{i t f(x)}

As tt \to \infty, if f(x)f(x) varies nontrivially with xx, eg fxO(1)f_x \sim \cO(1), the integrand will oscillate rapidly and the integral will vanish. The leading contributions to this integral occur when ff changes slowly, that is, near fx=0f_x = 0. In this region, we can write f(x)=f(x0)+12(xx0)2f(x0)+f(x) = f(x_0) + \half (x - x_0)^2 f''(x_0) + \ldots. If we insert this expansion into the integrand,

I(t)=dxeit(f(x0)+12(xx0)2f(x0))(1+14!(xx0)4f(iv)(x0)+)=2πtf(x0)(1+O(1t2)+)\begin{align} I(t) & = \int dx e^{i t (f(x_0) + \half (x - x_0)^2 f''(x_0))}\left(1 + \frac{1}{4!}(x - x_0)^4 f^{(iv)}(x_0) + \ldots\right)\\ & = \sqrt{\frac{2 \pi}{t f''(x_0)}} \left(1 + \cO\left(\frac{1}{t^2}\right) + \ldots\right) \end{align}

(we expect the cubic term in the integrand to drop out as it is odd and the exponential even). The resulting leading-order approximation is known as the stationary phase approximation (and is related to the saddle-point approximation for more general integrals in the complex plane).

If we assume that this approximation applies to the Feynman path integral, we should replace the point x0x_0 by the classical trajectory x(t)x(t) about which SS is stationary. We are thus led to

ψ(x,t)eiScl[x(t)]/\psi(x,t) \sim e^{i S_{cl}[x(t)]/\hbar}

times a prefactor. Here SS is the classical action, and we have shown that it solves the Hamilton-Jacobi equation.

In general, this works when Scl/1S_{cl}/\hbar \gg 1. In explicit examples, you can show that this ratio controls the corrections to the semiclassical approximation, whether you work in the path integral formulation or with the Schroedinger equation. Such large actions arise from highly excited states.

References
  1. Bender, C. M., & Orszag, S. A. (1999). Advanced Mathematical Methods for Scientists and Engineers I: Asymptotic Methods and Perturbation Theory. Springer. https://books.google.com/books?id=-yQXwhE6iWMC
  2. Goldstein, H., Poole, C. P., & Safko, J. L. (2002). Classical Mechanics. Addison Wesley.