Consider a linear map A : H 1 → H 2 {\displaystyle A:H_{1}\to H_{2}} between Hilbert spaces. Without taking care of any details, the adjoint operator is the (in most cases uniquely defined) linear operator A ∗ : H 2 → H 1 {\displaystyle A^{*}:H_{2}\to H_{1}} fulfilling
where ⟨ ⋅ , ⋅ ⟩ H i {\displaystyle \langle \cdot ,\cdot \rangle _{H_{i}}} is the inner product in the Hilbert space H i {\displaystyle H_{i}} , which is linear in the first coordinate and conjugate linear in the second coordinate. Note the special case where both Hilbert spaces are identical and A {\displaystyle A} is an operator on that Hilbert space.
When one trades the inner product for the dual pairing, one can define the adjoint, also called the transpose, of an operator A : E → F {\displaystyle A:E\to F} , where E , F {\displaystyle E,F} are Banach spaces with corresponding norms ‖ ⋅ ‖ E , ‖ ⋅ ‖ F {\displaystyle \|\cdot \|_{E},\|\cdot \|_{F}} . Here (again not considering any technicalities), its adjoint operator is defined as A ∗ : F ∗ → E ∗ {\displaystyle A^{*}:F^{*}\to E^{*}} with
i.e., ( A ∗ f ) ( u ) = f ( A u ) {\displaystyle \left(A^{*}f\right)(u)=f(Au)} for f ∈ F ∗ , u ∈ E {\displaystyle f\in F^{*},u\in E} .
The above definition in the Hilbert space setting is really just an application of the Banach space case when one identifies a Hilbert space with its dual (via the Riesz representation theorem). Then it is only natural that we can also obtain the adjoint of an operator A : H → E {\displaystyle A:H\to E} , where H {\displaystyle H} is a Hilbert space and E {\displaystyle E} is a Banach space. The dual is then defined as A ∗ : E ∗ → H {\displaystyle A^{*}:E^{*}\to H} with A ∗ f = h f {\displaystyle A^{*}f=h_{f}} such that
Let ( E , ‖ ⋅ ‖ E ) , ( F , ‖ ⋅ ‖ F ) {\displaystyle \left(E,\|\cdot \|_{E}\right),\left(F,\|\cdot \|_{F}\right)} be Banach spaces. Suppose A : D ( A ) → F {\displaystyle A:D(A)\to F} and D ( A ) ⊂ E {\displaystyle D(A)\subset E} , and suppose that A {\displaystyle A} is a (possibly unbounded) linear operator which is densely defined (i.e., D ( A ) {\displaystyle D(A)} is dense in E {\displaystyle E} ). Then its adjoint operator A ∗ {\displaystyle A^{*}} is defined as follows. The domain is
Now for arbitrary but fixed g ∈ D ( A ∗ ) {\displaystyle g\in D(A^{*})} we set f : D ( A ) → R {\displaystyle f:D(A)\to \mathbb {R} } with f ( u ) = g ( A u ) {\displaystyle f(u)=g(Au)} . By choice of g {\displaystyle g} and definition of D ( A ∗ ) {\displaystyle D(A^{*})} , f is (uniformly) continuous on D ( A ) {\displaystyle D(A)} as | f ( u ) | = | g ( A u ) | ≤ c ⋅ ‖ u ‖ E {\displaystyle |f(u)|=|g(Au)|\leq c\cdot \|u\|_{E}} . Then by the Hahn–Banach theorem, or alternatively through extension by continuity, this yields an extension of f {\displaystyle f} , called f ^ {\displaystyle {\hat {f}}} , defined on all of E {\displaystyle E} . This technicality is necessary to later obtain A ∗ {\displaystyle A^{*}} as an operator D ( A ∗ ) → E ∗ {\displaystyle D\left(A^{*}\right)\to E^{*}} instead of D ( A ∗ ) → ( D ( A ) ) ∗ . {\displaystyle D\left(A^{*}\right)\to (D(A))^{*}.} Remark also that this does not mean that A {\displaystyle A} can be extended on all of E {\displaystyle E} but the extension only worked for specific elements g ∈ D ( A ∗ ) {\displaystyle g\in D\left(A^{*}\right)} .
Now, we can define the adjoint of A {\displaystyle A} as
The fundamental defining identity is thus
Suppose H is a complex Hilbert space, with inner product ⟨ ⋅ , ⋅ ⟩ {\displaystyle \langle \cdot ,\cdot \rangle } . Consider a continuous linear operator A : H → H (for linear operators, continuity is equivalent to being a bounded operator). Then the adjoint of A is the continuous linear operator A∗ : H → H satisfying
Existence and uniqueness of this operator follows from the Riesz representation theorem.2
This can be seen as a generalization of the adjoint matrix of a square matrix which has a similar property involving the standard complex inner product.
The following properties of the Hermitian adjoint of bounded operators are immediate:3
If we define the operator norm of A by
then
Moreover,
One says that a norm that satisfies this condition behaves like a "largest value", extrapolating from the case of self-adjoint operators.
The set of bounded linear operators on a complex Hilbert space H together with the adjoint operation and the operator norm form the prototype of a C*-algebra.
Let the inner product ⟨ ⋅ , ⋅ ⟩ {\displaystyle \langle \cdot ,\cdot \rangle } be linear in the first argument. A densely defined operator A from a complex Hilbert space H to itself is a linear operator whose domain D(A) is a dense linear subspace of H and whose values lie in H.6 By definition, the domain D(A∗) of its adjoint A∗ is the set of all y ∈ H for which there is a z ∈ H satisfying
Owing to the density of D ( A ) {\displaystyle D(A)} and Riesz representation theorem, z {\displaystyle z} is uniquely defined, and, by definition, A ∗ y = z . {\displaystyle A^{*}y=z.} 7
Properties 1.–5. hold with appropriate clauses about domains and codomains. For instance, the last property now states that (AB)∗ is an extension of B∗A∗ if A, B and AB are densely defined operators.8
For every y ∈ ker A ∗ , {\displaystyle y\in \ker A^{*},} the linear functional x ↦ ⟨ A x , y ⟩ = ⟨ x , A ∗ y ⟩ {\displaystyle x\mapsto \langle Ax,y\rangle =\langle x,A^{*}y\rangle } is identically zero, and hence y ∈ ( im A ) ⊥ . {\displaystyle y\in (\operatorname {im} A)^{\perp }.}
Conversely, the assumption that y ∈ ( im A ) ⊥ {\displaystyle y\in (\operatorname {im} A)^{\perp }} causes the functional x ↦ ⟨ A x , y ⟩ {\displaystyle x\mapsto \langle Ax,y\rangle } to be identically zero. Since the functional is obviously bounded, the definition of A ∗ {\displaystyle A^{*}} assures that y ∈ D ( A ∗ ) . {\displaystyle y\in D(A^{*}).} The fact that, for every x ∈ D ( A ) , {\displaystyle x\in D(A),} ⟨ A x , y ⟩ = ⟨ x , A ∗ y ⟩ = 0 {\displaystyle \langle Ax,y\rangle =\langle x,A^{*}y\rangle =0} shows that A ∗ y ∈ D ( A ) ⊥ = D ( A ) ¯ ⊥ = { 0 } , {\displaystyle A^{*}y\in D(A)^{\perp }={\overline {D(A)}}^{\perp }=\{0\},} given that D ( A ) {\displaystyle D(A)} is dense.
This property shows that ker A ∗ {\displaystyle \operatorname {ker} A^{*}} is a topologically closed subspace even when D ( A ∗ ) {\displaystyle D(A^{*})} is not.
If H 1 {\displaystyle H_{1}} and H 2 {\displaystyle H_{2}} are Hilbert spaces, then H 1 ⊕ H 2 {\displaystyle H_{1}\oplus H_{2}} is a Hilbert space with the inner product
where a , c ∈ H 1 {\displaystyle a,c\in H_{1}} and b , d ∈ H 2 . {\displaystyle b,d\in H_{2}.}
Let J : H ⊕ H → H ⊕ H {\displaystyle J\colon H\oplus H\to H\oplus H} be the symplectic mapping, i.e. J ( ξ , η ) = ( − η , ξ ) . {\displaystyle J(\xi ,\eta )=(-\eta ,\xi ).} Then the graph
of A ∗ {\displaystyle A^{*}} is the orthogonal complement of J G ( A ) : {\displaystyle JG(A):}
The assertion follows from the equivalences
and
An operator A {\displaystyle A} is closed if the graph G ( A ) {\displaystyle G(A)} is topologically closed in H ⊕ H . {\displaystyle H\oplus H.} The graph G ( A ∗ ) {\displaystyle G(A^{*})} of the adjoint operator A ∗ {\displaystyle A^{*}} is the orthogonal complement of a subspace, and therefore is closed.
An operator A {\displaystyle A} is closable if the topological closure G cl ( A ) ⊆ H ⊕ H {\displaystyle G^{\text{cl}}(A)\subseteq H\oplus H} of the graph G ( A ) {\displaystyle G(A)} is the graph of a function. Since G cl ( A ) {\displaystyle G^{\text{cl}}(A)} is a (closed) linear subspace, the word "function" may be replaced with "linear operator". For the same reason, A {\displaystyle A} is closable if and only if ( 0 , v ) ∉ G cl ( A ) {\displaystyle (0,v)\notin G^{\text{cl}}(A)} unless v = 0. {\displaystyle v=0.}
The adjoint A ∗ {\displaystyle A^{*}} is densely defined if and only if A {\displaystyle A} is closable. This follows from the fact that, for every v ∈ H , {\displaystyle v\in H,}
which, in turn, is proven through the following chain of equivalencies:
The closure A cl {\displaystyle A^{\text{cl}}} of an operator A {\displaystyle A} is the operator whose graph is G cl ( A ) {\displaystyle G^{\text{cl}}(A)} if this graph represents a function. As above, the word "function" may be replaced with "operator". Furthermore, A ∗ ∗ = A cl , {\displaystyle A^{**}=A^{\text{cl}},} meaning that G ( A ∗ ∗ ) = G cl ( A ) . {\displaystyle G(A^{**})=G^{\text{cl}}(A).}
To prove this, observe that J ∗ = − J , {\displaystyle J^{*}=-J,} i.e. ⟨ J x , y ⟩ H ⊕ H = − ⟨ x , J y ⟩ H ⊕ H , {\displaystyle \langle Jx,y\rangle _{H\oplus H}=-\langle x,Jy\rangle _{H\oplus H},} for every x , y ∈ H ⊕ H . {\displaystyle x,y\in H\oplus H.} Indeed,
In particular, for every y ∈ H ⊕ H {\displaystyle y\in H\oplus H} and every subspace V ⊆ H ⊕ H , {\displaystyle V\subseteq H\oplus H,} y ∈ ( J V ) ⊥ {\displaystyle y\in (JV)^{\perp }} if and only if J y ∈ V ⊥ . {\displaystyle Jy\in V^{\perp }.} Thus, J [ ( J V ) ⊥ ] = V ⊥ {\displaystyle J[(JV)^{\perp }]=V^{\perp }} and [ J [ ( J V ) ⊥ ] ] ⊥ = V cl . {\displaystyle [J[(JV)^{\perp }]]^{\perp }=V^{\text{cl}}.} Substituting V = G ( A ) , {\displaystyle V=G(A),} obtain G cl ( A ) = G ( A ∗ ∗ ) . {\displaystyle G^{\text{cl}}(A)=G(A^{**}).}
For a closable operator A , {\displaystyle A,} A ∗ = ( A cl ) ∗ , {\displaystyle A^{*}=\left(A^{\text{cl}}\right)^{*},} meaning that G ( A ∗ ) = G ( ( A cl ) ∗ ) . {\displaystyle G(A^{*})=G\left(\left(A^{\text{cl}}\right)^{*}\right).} Indeed,
Let H = L 2 ( R , l ) , {\displaystyle H=L^{2}(\mathbb {R} ,l),} where l {\displaystyle l} is the linear measure. Select a measurable, bounded, non-identically zero function f ∉ L 2 , {\displaystyle f\notin L^{2},} and pick φ 0 ∈ L 2 ∖ { 0 } . {\displaystyle \varphi _{0}\in L^{2}\setminus \{0\}.} Define
It follows that D ( A ) = { φ ∈ L 2 ∣ ⟨ f , φ ⟩ ≠ ∞ } . {\displaystyle D(A)=\{\varphi \in L^{2}\mid \langle f,\varphi \rangle \neq \infty \}.} The subspace D ( A ) {\displaystyle D(A)} contains all the L 2 {\displaystyle L^{2}} functions with compact support. Since 1 [ − n , n ] ⋅ φ → L 2 φ , {\displaystyle \mathbf {1} _{[-n,n]}\cdot \varphi \ {\stackrel {L^{2}}{\to }}\ \varphi ,} A {\displaystyle A} is densely defined. For every φ ∈ D ( A ) {\displaystyle \varphi \in D(A)} and ψ ∈ D ( A ∗ ) , {\displaystyle \psi \in D(A^{*}),}
Thus, A ∗ ψ = ⟨ φ 0 , ψ ⟩ f . {\displaystyle A^{*}\psi =\langle \varphi _{0},\psi \rangle f.} The definition of adjoint operator requires that Im A ∗ ⊆ H = L 2 . {\displaystyle \mathop {\text{Im}} A^{*}\subseteq H=L^{2}.} Since f ∉ L 2 , {\displaystyle f\notin L^{2},} this is only possible if ⟨ φ 0 , ψ ⟩ = 0. {\displaystyle \langle \varphi _{0},\psi \rangle =0.} For this reason, D ( A ∗ ) = { φ 0 } ⊥ . {\displaystyle D(A^{*})=\{\varphi _{0}\}^{\perp }.} Hence, A ∗ {\displaystyle A^{*}} is not densely defined and is identically zero on D ( A ∗ ) . {\displaystyle D(A^{*}).} As a result, A {\displaystyle A} is not closable and has no second adjoint A ∗ ∗ . {\displaystyle A^{**}.}
A bounded operator A : H → H is called Hermitian or self-adjoint if
which is equivalent to
In some sense, these operators play the role of the real numbers (being equal to their own "complex conjugate") and form a real vector space. They serve as the model of real-valued observables in quantum mechanics. See the article on self-adjoint operators for a full treatment.
For a conjugate-linear operator the definition of adjoint needs to be adjusted in order to compensate for the complex conjugation. An adjoint operator of the conjugate-linear operator A on a complex Hilbert space H is an conjugate-linear operator A∗ : H → H with the property:
The equation
is formally similar to the defining properties of pairs of adjoint functors in category theory, and this is where adjoint functors got their name.
Miller, David A. B. (2008). Quantum Mechanics for Scientists and Engineers. Cambridge University Press. pp. 262, 280. ↩
Reed & Simon 2003, pp. 186–187; Rudin 1991, §12.9 - Reed, Michael; Simon, Barry (2003), Functional Analysis, Elsevier, ISBN 981-4141-65-8 ↩
See unbounded operator for details. /wiki/Unbounded_operator ↩
Reed & Simon 2003, p. 252; Rudin 1991, §13.1 - Reed, Michael; Simon, Barry (2003), Functional Analysis, Elsevier, ISBN 981-4141-65-8 ↩
Rudin 1991, Thm 13.2 - Rudin, Walter (1991). Functional Analysis. International Series in Pure and Applied Mathematics. Vol. 8 (Second ed.). New York, NY: McGraw-Hill Science/Engineering/Math. ISBN 978-0-07-054236-5. OCLC 21163277. https://archive.org/details/functionalanalys00rudi ↩
Reed & Simon 2003, pp. 187; Rudin 1991, §12.11 - Reed, Michael; Simon, Barry (2003), Functional Analysis, Elsevier, ISBN 981-4141-65-8 ↩