
Adjoints and inner products

Dual vector spaces

Definition

Let $V$ be a vector space over $\CC$. The dual vector space $V^*$ is the space of all linear maps $f: V \to \CC$.

Properties and notation

  1. $V^*$ is a vector space.

Consider linear maps $f_{1,2} \in V^*$ and scalars $a, b \in \CC$. Then we can define a linear map $a f_1 + b f_2$ by its action on a vector $\ket{v}$:

$$
(a f_1 + b f_2)(\ket{v}) = a f_1(\ket{v}) + b f_2(\ket{v})
$$

One can show that this defines a linear map: for any $c, d \in \CC$ and $\ket{v_{1,2}} \in V$,

$$
(a f_1 + b f_2)(c \ket{v_1} + d \ket{v_2}) = c\, (a f_1 + b f_2)(\ket{v_1}) + d\, (a f_1 + b f_2)(\ket{v_2})
$$

which follows from $f_{1,2}$ being linear maps.

  2. $\dim(V^*) = \dim(V)$. It is instructive to show this. Consider a basis $\ket{i}$ of $V$, $i = 1, \ldots, d = \dim(V)$. A general vector $\ket{v}$ can be expressed as

$$
\ket{v} = \sum_{i = 1}^d c_i \ket{i}
$$

for a unique set of coefficients $c_i \in \CC$. Now

$$
f(\ket{v}) = \sum_{i = 1}^d c_i f(\ket{i})
$$

Thus, the map $f$ is completely specified by the $d$ complex numbers $f(\ket{i}) \in \CC$. If we define $f_i$ by $f_i(\ket{j}) = \delta_{ij}$, we can show that each $f_i$ is a linear map. Furthermore, $f_i$ is linearly independent of the $f_{j \neq i}$ (you should convince yourself of this).

Any $f \in V^*$ can always be written as $f = \sum_{i = 1}^d c_i f_i$ with $c_i = f(\ket{i})$, so this basis is maximal, and the $f_i$ form a complete basis for $V^*$. There are $d$ such independent basis functions, so $\dim(V^*) = d$.

  3. Notation. We can express $f \in V^*$ as a “bra vector” $\bra{f}$. We then call elements $\ket{v} \in V$ “ket vectors”. We can then write

$$
\brket{f}{v} \equiv f(\ket{v})
$$

as a “bra(c)ket”. I didn’t do this; please blame Dirac. Anyhow, the notation is unfortunately standard. With this notation we can express the linear structure of $V^*$ as

$$
\bra{a f_1 + b f_2} = a \bra{f_1} + b \bra{f_2}
$$
  4. Finally, the dual vector space of $V^*$ is $V$ itself, or $(V^*)^* = V$: for any $\ket{v}$, the corresponding map $V^* \to \CC$ is just $\ket{v}: f \mapsto \brket{f}{v}$.

Example

If $V = \CC^3$ is represented as the space of column vectors, we can represent $V^*$ as the set of row vectors. That is, consider a basis

$$
\ket{1} = \begin{pmatrix} 1 \\ 0 \\ 0 \end{pmatrix}\ ;\ \ket{2} = \begin{pmatrix} 0 \\ 1 \\ 0 \end{pmatrix}\ ;\ \ket{3} = \begin{pmatrix} 0 \\ 0 \\ 1 \end{pmatrix}
$$

We can define any linear map $f$ by $f(\ket{i}) = c_i$. Then if $\ket{v} = \sum_i a_i \ket{i}$,

$$
f(\ket{v}) = \sum_i c_i a_i = \begin{pmatrix} c_1 & c_2 & c_3 \end{pmatrix} \begin{pmatrix} a_1 \\ a_2 \\ a_3 \end{pmatrix}
$$
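This row-times-column picture is easy to check numerically. A minimal numpy sketch (the coefficients `c` and `a` are illustrative choices, not from the notes):

```python
import numpy as np

# A dual vector f is fixed by the numbers c_i = f(|i>); as a row vector
# it acts on the column of components a_i by matrix multiplication.
c = np.array([2.0, -1.0, 3.0])   # c_i = f(|i>)  (illustrative values)
a = np.array([1.0, 4.0, 0.5])    # |v> = sum_i a_i |i>

f_of_v = c @ a                   # f(|v>) = sum_i c_i a_i
assert np.isclose(f_of_v, np.sum(c * a))
```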

Adjoint maps

Since $\dim V = \dim V^*$, we expect that there is an isomorphism (a map that is one-to-one and onto) between them. Choosing such a map leads to a choice of “inner product” on $V$ itself: a way of assigning to $\ket{v}$ a number corresponding to some notion of its length.

Definition

Let $V$ be a vector space over $\CC$. An adjoint map is a map ${\cal A}: V \to V^*$, which we denote by ${\cal A}\ket{v} \equiv \bra{f_v}$, with the properties:

  1. Skew symmetry: $\brket{f_w}{v} = \brket{f_v}{w}^*$.

  2. Positive semi-definiteness:

$$
\brket{f_v}{v} \equiv ||v||^2 \geq 0\ ;\quad ||v|| = 0\ \text{iff}\ \ket{v} = 0
$$

In general we write $\bra{f_v} = \bra{v}$.

Properties

  1. Antilinearity. Using skew symmetry, you can show that for $\ket{v_{1,2}} \in V$ and $a, b \in \CC$,

$$
\bra{a v_1 + b v_2} = a^* \bra{v_1} + b^* \bra{v_2}
$$
  2. Schwarz inequality:

$$
|\brket{v}{w}| \leq ||v||\ ||w||
$$
  3. Triangle inequality:

$$
||u + v|| \leq ||u|| + ||v||
$$

Examples

  1. $V = \CC^3$.

$$
{\cal A}\begin{pmatrix} c_1 \\ c_2 \\ c_3 \end{pmatrix} = \begin{pmatrix} c_1^* & c_2^* & c_3^* \end{pmatrix}
$$

If

$$
\ket{v} = \begin{pmatrix} c_1 \\ c_2 \\ c_3 \end{pmatrix}\ ,\quad \ket{w} = \begin{pmatrix} d_1 \\ d_2 \\ d_3 \end{pmatrix}
$$

then

$$
\brket{v}{w} = c_1^* d_1 + c_2^* d_2 + c_3^* d_3
$$
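The defining properties of this inner product are easy to verify numerically. A numpy sketch checking skew symmetry, positivity, and the Schwarz and triangle inequalities (the vectors are illustrative choices):

```python
import numpy as np

def braket(v, w):
    """<v|w> = sum_i c_i^* d_i: conjugate the bra components, then contract."""
    return np.sum(v.conj() * w)

v = np.array([1 + 2j, 0.5, -1j])
w = np.array([2j, 1.0, 3.0 - 1j])

# Skew symmetry: <v|w> = <w|v>^*
assert np.isclose(braket(v, w), np.conj(braket(w, v)))

# Positive semi-definiteness: <v|v> is real and non-negative
norm = lambda u: np.sqrt(braket(u, u).real)
assert abs(braket(v, v).imag) < 1e-12 and braket(v, v).real >= 0

# Schwarz and triangle inequalities
assert abs(braket(v, w)) <= norm(v) * norm(w)
assert norm(v + w) <= norm(v) + norm(w)
```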
  2. $V = M_2(\CC)$.

$$
\brket{M_1}{M_2} = \text{tr} \left((M_1^{*})^T M_2\right) = \sum_{i,j} (M_1)^*_{ij} (M_2)_{ij}
$$
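In matrix language this is the Frobenius inner product, $\text{tr}(M_1^\dagger M_2)$. A quick numerical sanity check that the trace form agrees with the elementwise sum (the matrices are illustrative):

```python
import numpy as np

def mat_braket(M1, M2):
    """<M1|M2> = tr((M1^*)^T M2), the Frobenius inner product."""
    return np.trace(M1.conj().T @ M2)

M1 = np.array([[1.0, 2j], [0.0, 1 - 1j]])
M2 = np.array([[3.0, 1.0], [1j, 2.0]])

# The trace form agrees with sum_ij (M1)_ij^* (M2)_ij
assert np.isclose(mat_braket(M1, M2), np.sum(M1.conj() * M2))
```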
  3. $V = L^2(\CR)$, the space of complex square-integrable functions on the real line, where $\ket{\psi}$ is represented by the function $\psi(x)$. A good inner product, which defines an adjoint map, is

$$
\brket{\chi}{\psi} = \int_{-\infty}^{\infty} dx\, \chi(x)^* \psi(x)
$$

Additional definitions and a comment

  1. $\brket{v}{w}$ is the inner product of $\ket{v}$ and $\ket{w}$.

  2. $||v|| = \sqrt{\brket{v}{v}}$ is called the norm of $\ket{v}$.

  3. $V$ with an adjoint map is called an inner product space.

  4. An inner product space (over $\CC$) is called a Hilbert space if either it is finite-dimensional, or every Cauchy sequence of vectors in $V$ converges to a vector in $V$.

To explain the last possibility, note that $\ket{v_i}$, $i = 1, \ldots, \infty$ is a Cauchy sequence if for any $\eps > 0$, there exists some integer $N$ such that

$$
|| v_n - v_m || < \eps\ \ \forall\ n, m \geq N
$$

The space is complete if every such sequence converges to a vector in $V$.

  5. There is no unique adjoint map.

Actions of operators

Given a linear operator $A$ and $\ket{v} \in V$, $A\ket{v}$ is a vector and $\bra{w} A \ket{v}$ is a complex number. We can therefore define the action of $A$ on the dual vector $\bra{w}$ such that $\left(\bra{w} A\right)\ket{v} = \bra{w} \left(A \ket{v}\right)$.
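In components this is just associativity of matrix multiplication: acting on the bra first or the ket first gives the same number. A quick check with random data (shapes and seed are arbitrary choices):

```python
import numpy as np

rng = np.random.default_rng(0)
A = rng.normal(size=(3, 3)) + 1j * rng.normal(size=(3, 3))
v = rng.normal(size=3) + 1j * rng.normal(size=3)
w = rng.normal(size=3) + 1j * rng.normal(size=3)

bra_w = w.conj()                     # <w| as a row vector
left_first = (bra_w @ A) @ v         # (<w|A)|v>
right_first = bra_w @ (A @ v)        # <w|(A|v>)
assert np.isclose(left_first, right_first)
```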

Orthonormal bases

Definitions

Let $V$ be a vector space over $\CC$.

  1. $\ket{v} \in V$ is a normal vector if $||v||^2 = \brket{v}{v} = 1$.

  2. $\ket{v}, \ket{w} \in V$ are orthogonal if $\brket{v}{w} = 0$.

  3. An orthonormal basis is a basis $\ket{i} \in V$, $i = 1, \ldots, d = \dim V$, such that for ${\cal A}: \ket{i} \to \bra{i}$, $\brket{i}{j} = \delta_{ij}$.

Examples

  1. We can write $\ket{v} = \sum_i v_i \ket{i}$; the antilinearity of the adjoint map means that $\bra{v} = \sum_i \bra{i} v^*_i$. This means that

$$
\brket{v}{v} = \sum_{i,j} v^*_i \brket{i}{j} v_j = \sum_i |v_i|^2
$$

Similarly, for $\ket{w} = \sum_i w_i \ket{i}$,

$$
\brket{w}{v} = \sum_i w^*_i v_i
$$

This works if we identify

$$
\ket{v} \to \begin{pmatrix} v_1 \\ v_2 \\ \vdots \\ v_n \end{pmatrix}
$$

and thus

$$
\bra{v} \to \begin{pmatrix} v_1^* & v_2^* & \ldots & v_n^* \end{pmatrix}
$$

The basis element $\ket{i}$ is a column vector with all zeros except a 1 in the $i$th row.

  2. If $V = M_2(\CC)$, the space of $2\times 2$ complex matrices, a natural inner product is

$$
\brket{m}{n} = \text{tr}\left((m^T)^* n\right)
$$

where $m, n$ are $2\times 2$ matrices. This clearly defines an adjoint map, sending $\ket{n}$ to a linear map $\bra{n}$. An orthonormal basis is:

$$
\ket{1} \to \begin{pmatrix} 1 & 0 \\ 0 & 0 \end{pmatrix}\ ;\ \ \ket{2} \to \begin{pmatrix} 0 & 1 \\ 0 & 0 \end{pmatrix}\ ;\ \ \ket{3} \to \begin{pmatrix} 0 & 0 \\ 1 & 0 \end{pmatrix}\ ;\ \ \ket{4} \to \begin{pmatrix} 0 & 0 \\ 0 & 1 \end{pmatrix}
$$
  3. Consider the vector space of complex functions on the interval $[0, L]$ with Dirichlet boundary conditions. You can convince yourself that the basis

$$
\ket{n} \to \psi_n(x) = \sqrt{\frac{2}{L}} \sin \frac{n\pi x}{L}
$$

is orthonormal with respect to the inner product $\brket{\chi}{\psi} = \int_0^L dx\, \chi(x)^* \psi(x)$.
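One can also check this orthonormality numerically, e.g. with a trapezoid-rule approximation to $\int_0^L \psi_m(x)^* \psi_n(x)\, dx$ (the grid size and the value of $L$ are arbitrary choices):

```python
import numpy as np

L = 2.0
x = np.linspace(0.0, L, 20001)

def psi(n):
    # psi_n(x) = sqrt(2/L) sin(n pi x / L)
    return np.sqrt(2.0 / L) * np.sin(n * np.pi * x / L)

def integrate(f):
    # simple trapezoid rule on the grid x
    return np.sum(0.5 * (f[1:] + f[:-1]) * np.diff(x))

# <m|n> should be delta_{mn}
for m in range(1, 4):
    for n in range(1, 4):
        overlap = integrate(psi(m) * psi(n))
        assert np.isclose(overlap, 1.0 if m == n else 0.0, atol=1e-6)
```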

The Gram-Schmidt machine

Theorem: every finite-dimensional inner product space, or infinite-dimensional inner product space with a countable basis, has an orthonormal basis.

Proof (partial): Given a basis $\ket{v_1}, \ket{v_2}, \ldots, \ket{v_d}$, we can construct an orthonormal basis iteratively. Define

$$
\begin{align} \ket{1} & = \frac{\ket{v_1}}{||v_1||}\\ \ket{2} & = \frac{\ket{v_2} - \brket{1}{v_2}\ket{1}}{\sqrt{||v_2||^2 - |\brket{1}{v_2}|^2}}\\ \ket{k} & = \frac{\ket{v_k} - \sum_{n = 1}^{k-1} \ket{n}\brket{n}{v_k}}{\sqrt{||v_k||^2 - \sum_{n = 1}^{k-1} |\brket{n}{v_k}|^2}} \end{align}
$$
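The iteration above translates directly into code. A numpy sketch (using the numerically friendlier variant that subtracts each projection from the running vector rather than from the original $\ket{v_k}$; the input vectors are illustrative):

```python
import numpy as np

def gram_schmidt(vectors):
    """Orthonormalize linearly independent complex vectors: subtract the
    projections onto the previously built |n>, then normalize."""
    basis = []
    for v in vectors:
        u = v.astype(complex)
        for e in basis:
            u = u - e * (e.conj() @ u)   # remove the |n><n|v_k> component
        basis.append(u / np.sqrt((u.conj() @ u).real))
    return basis

vs = [np.array([1.0, 1.0, 0.0]),
      np.array([1.0, 0.0, 1.0]),
      np.array([0.0, 1.0, 1.0])]
ons = gram_schmidt(vs)

# The output satisfies <i|j> = delta_{ij}
for i, e in enumerate(ons):
    for j, f in enumerate(ons):
        assert np.isclose(e.conj() @ f, 1.0 if i == j else 0.0)
```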

Matrix elements of operators

Since $\ket{i}$ is a basis, we can write the action of operators in this basis: $A\ket{j} = \sum_i A_{ij}\ket{i}$. As notation, we will sometimes write

$$
A = \sum_{i,j} \ket{i} A_{ij} \bra{j}
$$

We understand this to mean

$$
A\ket{v} = \sum_{i,j} \ket{i} A_{ij} \brket{j}{v} = \sum_{i,j} A_{ij} v_j \ket{i}
$$

where $\ket{v} = \sum_i v_i \ket{i}$, and for dual vectors $\bra{v} = \sum_i \bra{i} v_i^*$,

$$
\bra{v} A = \sum_{i,k} \bra{i}\, v^*_k A_{ki}
$$

Thus

$$
\bra{v} A \ket{w} = \sum_{i,j} v^*_i A_{ij} w_j
$$

A particularly important example is the identity operator ${\bf 1}$, for which ${\bf 1}_{ij} = \delta_{ij}$. This can be represented as above by:

$$
{\bf 1} = \sum_i \ket{i}\bra{i}
$$

for any orthonormal basis. This is called a resolution of the identity, associated to a given basis.
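As a numerical illustration, the columns of any unitary matrix form an orthonormal basis, and summing the projectors $\ket{i}\bra{i}$ rebuilds the identity (constructing a random unitary via QR is an illustrative choice, not from the notes):

```python
import numpy as np

rng = np.random.default_rng(1)
# QR decomposition of a random complex matrix yields a unitary Q
Q, _ = np.linalg.qr(rng.normal(size=(4, 4)) + 1j * rng.normal(size=(4, 4)))

# |i><i| is the outer product of the i-th column with its conjugate
resolution = sum(np.outer(Q[:, i], Q[:, i].conj()) for i in range(4))
assert np.allclose(resolution, np.eye(4))
```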

In this basis, an important operation on $A$ is the transpose. That is, given a linear operator $A$, we can define the transpose $A^T$ via its matrix elements

$$
(A^T)_{ij} = A_{ji}
$$

In particular, we can write

$$
\bra{v}A = \sum_{i,k} \bra{i}\, v^*_k A_{ki} = \sum_{i,k} \bra{i}\, (A^T)_{ik} v^*_k
$$

Adjoints of operators

The vector $A\ket{v} \equiv \ket{Av} = \sum_{k,l} A_{lk} v_k \ket{l}$ has a natural adjoint

$$
{\cal A} : A\ket{v} \to \sum_{k,l} \bra{l}\, v_k^* A_{lk}^* = \sum_{k,l} \bra{l}\, v_k^* (A^T)^*_{kl} \equiv \bra{v} A^{\dagger}
$$

which defines the Hermitian conjugate $A^{\dagger}$. We can either define it as ${\cal A}: A\ket{v} \to \bra{v} A^{\dagger}$ or via its matrix elements in an orthonormal basis,

$$
A^{\dagger}_{ij} = (A^T)^*_{ij} = A^*_{ji}
$$
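In numpy, $A^{\dagger}$ is just the conjugate transpose, and the defining relation $\bra{w} A \ket{v} = \left(\bra{v} A^{\dagger} \ket{w}\right)^*$ can be checked directly (random data and seed are arbitrary):

```python
import numpy as np

rng = np.random.default_rng(2)
A = rng.normal(size=(3, 3)) + 1j * rng.normal(size=(3, 3))
A_dag = A.conj().T                     # (A^dagger)_ij = (A_ji)^*

v = rng.normal(size=3) + 1j * rng.normal(size=3)
w = rng.normal(size=3) + 1j * rng.normal(size=3)

# <w|A|v> = (<v|A^dagger|w>)^*
assert np.isclose(w.conj() @ A @ v, np.conj(v.conj() @ A_dag @ w))
```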

Hermitian and unitary operators

  1. Definition. A Hermitian operator is an operator with $A = A^{\dagger}$.

Note that this does not mean the operator has real matrix elements. The following operator on $\CC^2$ is Hermitian:

$$
\sigma_y = \begin{pmatrix} 0 & -i \\ i & 0 \end{pmatrix}
$$
  2. Definition. A unitary operator is an operator $U$ such that $U^{\dagger} = U^{-1}$.

An important property of this operator is that it is norm-preserving:

$$
|| U\ket{v}||^2 = \bra{v} U^{\dagger} U \ket{v} = \bra{v} U^{-1} U \ket{v} = \brket{v}{v} = ||v||^2
$$
  3. An example of a unitary operator acting on $\CC^2$:

$$
U = \begin{pmatrix} \cos\theta & \sin\theta\, e^{i\phi} \\ - \sin\theta\, e^{-i\phi} & \cos\theta \end{pmatrix}
$$

As we will discuss, this implements rotations on the spin components of a spin-$\half$ system.
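Both definitions are easy to test numerically: here we check that $\sigma_y$ (in the standard Pauli convention) is Hermitian, and that the rotation matrix above is unitary and norm-preserving (the values of $\theta$, $\phi$ and the test vector are arbitrary):

```python
import numpy as np

# sigma_y in the standard Pauli convention
sigma_y = np.array([[0, -1j], [1j, 0]])
assert np.allclose(sigma_y, sigma_y.conj().T)      # Hermitian

theta, phi = 0.7, 1.3
U = np.array([[np.cos(theta), np.sin(theta) * np.exp(1j * phi)],
              [-np.sin(theta) * np.exp(-1j * phi), np.cos(theta)]])
assert np.allclose(U.conj().T @ U, np.eye(2))      # unitary: U^dag = U^{-1}

v = np.array([1.0 + 2j, -0.5j])
assert np.isclose(np.linalg.norm(U @ v), np.linalg.norm(v))  # norm-preserving
```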

  4. Two nontrivial Hermitian examples for $L^2(\CR)$:

$$
\begin{align} \bra{\chi} \hat{x} \ket{\psi} & = \int dx\, \chi^* (x \psi(x)) = \int dx\, (x\chi)^* \psi \\ & = \brket{\chi}{x\psi} = \brket{x\chi}{\psi} \end{align}
$$

as expected for a Hermitian operator.

$$
\begin{align} \bra{\chi} {\hat p}\ket{\psi} & = \int_{-\infty}^{\infty} dx\, \chi^* (-i\hbar) \frac{\del \psi}{\del x}\\ & = (-i \hbar) \int_{-\infty}^{\infty} dx\, \frac{\del}{\del x} (\chi^* \psi) + i \hbar \int dx\, \frac{\del \chi^*}{\del x} \psi \\ & = - i \hbar\, \chi^* \psi \Big|_{-\infty}^{\infty} + \int_{-\infty}^{\infty} dx \left(-i\hbar \frac{\del \chi}{\del x}\right)^* \psi\\ & = \bra{\chi} {\hat p}^{\dagger} \ket{\psi} \end{align}
$$

The second line follows from integration by parts, and the boundary term vanishes because $\psi$ is square-integrable. In other words, for every $\ket{\psi}, \ket{\chi}$, $\bra{\chi} {\hat p} \ket{\psi} = \bra{\chi} {\hat p}^{\dagger} \ket{\psi}$. From this we can deduce that ${\hat p} = {\hat p}^{\dagger}$.

The same argument follows for the case of complex functions with periodic boundary conditions. For Dirichlet boundary conditions, ${\hat p}$ fails to be an operator on the Hilbert space, as the derivative of a function with Dirichlet boundary conditions does not in general satisfy Dirichlet boundary conditions. (Similarly for Neumann boundary conditions.)
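The periodic case can be verified numerically: on a periodic grid with central differences for $-i\hbar\, \del_x$ (here $\hbar = 1$; the grid and test functions are illustrative choices), $\brket{\chi}{\hat p\, \psi}$ and $\brket{\hat p\, \chi}{\psi}$ agree:

```python
import numpy as np

# Periodic grid on [0, 2*pi); hbar set to 1 for the check.
N = 4096
x = np.linspace(0.0, 2 * np.pi, N, endpoint=False)
dx = x[1] - x[0]

chi = np.exp(1j * x) + 0.3 * np.cos(2 * x)
psi = np.exp(-2j * x) + 0.5 * np.sin(x)

def p(f):
    # -i d/dx via central differences with periodic wrap-around
    return -1j * (np.roll(f, -1) - np.roll(f, 1)) / (2 * dx)

lhs = np.sum(chi.conj() * p(psi)) * dx   # <chi| p psi>
rhs = np.sum(p(chi).conj() * psi) * dx   # <p chi| psi>
assert np.isclose(lhs, rhs)
```

The discrete analogue of the vanishing boundary term is that the periodic wrap-around makes summation by parts exact, so the two sides agree to machine precision.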