Motivating Ladder Operators
August 5, 2024
By Aathreya Kadambi
Recently I’ve been reading about spin and angular momentum operators in quantum mechanics, which both satisfy some very nice commutation relations:
[Ai,Aj]=iℏϵijkAk
for
A=L or
A=S. Somehow, we are repeatedly able to use so called ladder operators to find that an operator can only take on a discrete set of eigenvalues. Apparently the two index cards in the picture above weren’t enough to figure it all out, so I’ve decided to utilize my new ability to make KaTeX+MDX blog posts.
Contents
- Jumbled Thoughts
- A Path to The Essence (Possibly)
- Random Scratch Work
Jumbled Thoughts
I’ll start out with a vague (potentially nonsensical) description of what I think is so interesting about the ladder operators technique, and then try to generalize it. We are able to construct so called “ladder operators” which are such that when you act them on an eigenvector f of A with eigenvalue μ, they have the property that they return a new eigenvector of A with eigenvalue μ±c. Namely,
A3(A±f)=(λ+c)(A±f).
Somehow, with this clever operator, we are able to actually discover that the set of eigenvalues of
A3 are discrete. Here the choice of
3 seems soemwhat arbitrary, it’s just to mimic the way it is done with
Lz or
Sz.
To me, it seems that the crux of the ladder argument is the above, that we can find an operator that “bumps up” an eigenvector to another eigenvector with increased eigenvalue. By doing this, we are actually breaking up R (the set of all possible eigenvalues) into a bunch of little ladders and offsets: cZ+r where r∈[0,c). Denote by Eλ(Ω) the set of all eigenvectors of Ω corresponding to an eigenvalue λ. Then A± give us maps:
A±:Eλ(A3)→Eλ±c(A3).
The other big trick is having an operator A2:=A12+A22+A32 that commutes with A3:
[A3,A2]=[A3,A12]+[A3,A22]=iℏA1A2+iℏA2A1−iℏA2A1−iℏA1A2=0.
This allows us to restrict our attention to just simultaneous eigenvectors of both
A2 and
A3, which allow us to fix an eigenvalue of
A2 and focus on eigenvalues of
A3. Somehow, with this fact, we are able to bound the eigenvalues of
A3, and combined with the previous ladder operators, we are able to actually
discover that the eigenvalues of both
A2 and
A3 are discrete! A big step in this is the fact that:
A∓A±=A2−A32±iℏA3,
which ends up following from the commutation relation and the construction of
A2.
How does it all just work out so perfectly?
If I want to claim that it worked out so perfectly, I should identify what parts seem so coincidental. To me, it seems magical that we can discover that the collection of eigenvalues is discrete in nature from the initial commutation relationship. This leads me to wonder… is there a general use case for these ladder operators? It seems to me that there are a few crucial steps in this process:
- Bound the eigenvalues of the operator in question.
- Relate eigenvalues by the ladder operators A±.
- It must be that A±kf=0 for some k.
- If k is chosen minimally, it must mean that λ±kc=0, so that λ=∓kc with k∈Z.
One thing that particularly intruiges me is the possibility that the discretness comes from the very fact that we can bound the eigenvalues.
To examine this further, we can try to compare this to the discreization of energy for the particle in a box. In that case, the idea was to factor the hamiltonian operator: H∼A†A. Here again, we try to factor L2∼A†A. In the case of the hamiltonian, we were left with something of the form:
H=A†A+21ℏωI=A†A+p(I)
where
p is a polynomial and notice that
[H,I]=0. In the angular momentum case, we were left with
L2=A†A+L32∓iℏL3=A†A+p(L3)
where
p is again a polynomial and notice that
[L2,L3]=0. Very interesting indeed. But this doesn’t shed light on why they increment and decrement the eigenvalues of eigenvectors. Maybe we just find operators that both increment/decrement the eigenvalues
and satisfy the above properties?
We can actually see the “ladderness” of the ladder operators from the commutation relationship they have with the operator in question. In particular,
[H,A±]=±ℏωA±
and for the angular momentum case,
[L2,A±]=±0⋅A±
but we do have that:
[L3,A±]=±ℏA±.
Mhm. Very peculiar indeed.
I think a very integral part of these ladder operators is that they satisfy:
[A,D±]=±cD±
for some c. This is important, because then:
A(D±f)=([A,D±]+D±A)f=(λ±c)(D±f).
where
f∈Eλ(A). This encapsulates the idea that the ladder operator
D will increase of decrease the eigenvalue.
Now I’ll get into what I think may be the essence behind thesse ladder operator methods.
A Path to the Essence (Possibly)
To start, I’ll define ladder operator:
Definition. Consider any Hermitian operator A. We say that an operator L is a ladder operator for A of step size c if
[A,L]=cL.
Theorem. Consider any Hermitian operator A. If L is a ladder operator for A of step size c, then L† is a ladder operator for A of step size −c∗.
Proof.
[A,L†]=AL†−L†A=(LA−AL)†=(−[A,L])†=−c∗L.
Theorem. Suppose A is a Hermitian operator and L is a ladder operator for A of step size c where c∈R. Then [A,LL†]=0.
Proof.
[A,LL†]=L[A,L†]+[A,L]L†=−cLL†+cLL†=0
Theorem. Suppose every eigenvalue of A satisfies λ≤λmax for some eigenvalue λmax. Then if L is a ladder operator for A of step size c∈R+, and q(A)+L†L=B for some operator B which commutes with A where p and q are polynomials, the only possible eigenvalues of A are those in λmax−cN.
Proof.
If λmax is an eigenvalue, consider a corresponding eigenvector v which is also an eigenvector of B with eigenvalue μ (I think you might need that B is self-adjoint for this, although I need to check again later). Then
A(Lv)=(LA+[A,L])v=(LA+cL)v=LAv+cLv=λmaxLv+cLv=(λmax+c)Lv
but
λmax+c cannot be an eigenvalue. Thus, it must be that
Lv=0, or equivalently,
∥Lv∥=0. Now notice that:
∥Lv∥=⟨v,L†Lv⟩
Now notice that
L†Lv=(q(A)−B)v=q(A)v−Bv=q(λmax)v−μv=(q(λmax)−μ)v
so that
0=∥Lv∥=q(λmax)−μ
so
q(λmax)=μ.
Hmm… there is something still missing here. The thing is, in the case of the energy eigenvalues, we can actually find B as the identity operator so μ=1, and λmin is some constant value. In the case of the angular momentum operator, we made A the L2 operator and B the Lz operator. However, this isn’t quite the result I wanted. I was hoping for something more like “boundedness translates into discreteness,” but it looks like somehow, these polynomials end up messing things around.
It feels like beyond this step, the path diverges. Like in the case of the hamiltonian operator for the particle in a box, the fact that Lv=0 can be used to simplify the Shrodinger equation, and in the case of the angular momentum operator, we were able to use the identity of the polynomial in combination with the reverse statement with a λmin to discritize the angular momentum. It seems that things may be more complicated than I thought. One similarity I do see, though, is that we have I=L†L+p(H) and L2=A†A+q(Lz), and I and L2 are similar in that they commute with everything else, L and A are the respective ladder operators of H and Lz, and q and p are polynomials. I feel like there has to be something there.
I’ll have to think about it more, and I’ll make an update in a future post!
Random Scratch Work
We use the well-ordering principle (or equivalently, we could frame this via induction). Let S⊆N be the set of all n≥0 such that the only eigenvalue of A in λmax−c[n,n+1) is λmax−n. Now let Sc=N\S. Suppose for contradiction that Sc is nonempty. By the well-ordering principle, there exists a smallest value m in Sc. Then there must be an eigenvalue in λmax−c[m,m+1) which is not λmax−cm. Denote it by λmax−c(m+r), where r∈[0,1). Denote the corresponding eigenvector by v.
Now, if Lv=0,
A(Lv)=(LA+[A,L])v=(LA+cL)v=LAv+cLv
=(λmax−c(m+r))Lv+cLv=(λmax−c(m+r−1))Lv
but
λmax−c(m−1+r)<λmax−c(m+r).
If m=0, then λmax−c(m−1+r)=λmax+c(1−r)>λmax, which can’t possibly be an eigenvalue because λmax was the largest eigenvalue, which is a contradiction. If m≥1, m−1 is in Sc as well, which is a contradiction of the minimality of m.
So since both cases lead to contradiction, it must be that Lv=0, or equivalently, ∥Lv∥=0. Now notice that:
∥Lv∥=⟨v,L†Lv⟩
Now notice that
L†Lv=(A−p(B))v=Av−p(B)v=(λmax−c(m+r))v−p(μ)v=(λmax−c(m+r)−p(μ))v
so that
0=∥Lv∥=λmax−c(m+r)−p(μ)
In either case, we have achieved a contradiction, so Sc is indeed empty, and so S=N. Thus, for all n, the only eigenvalue of A in λmax−c[n,n+1) is λmax−n, so indeed, the only possible eigenvalues of A are those in λmax−cN
By essentially the same proof but using L† in place of L, we also have that:
Theorem. Suppose every eigenvalue of A satisfies λ≥λmin for some λmin. Then if L is a ladder operator for A of step size c∈R+, the only possible eigenvalues of A are those in λmin+cN.
So indeed, the boundedness of the eigenvalues of an operator translates directly into discreteness! I should specify that by discrete, I mean that there is a minimum distance between two points in the set, or even more strongly, that the set is a subset of cZ for some c. Even more interestingly, this result seems to suggest that there cannot be two ladder operators of step sizes different in magnitude same operator. That does seem magical. (Just kidding, I had an error in this proof)
In other words, to identify whether an operator is discrete, you simply need to
- Find a ladder operator for it with step size c.
- Find the minimum and/or maximum eigenvalues for the operator.
And boom. You have now shown that the eigenvalues of this operator are precisely those in the set:
{λmin+ck:k∈Z,0≤k≤cλmax−λmin}
This actually motivates setting up a “quantum number” for it. We might call it
l, and take
l=cλ. This way, the eigenvalues of the operator are those in the set:
{l∈Z:lmin≤l≤lmax}
where
lmin=cλmin and
lmax=cλmax..
If we then have bounds on the maximum and minimum possible eigenvalues for Ai (here Ai is self-adjoint so that all eigenvalues are real, which is an ordered set), we can actually see that there must be some maximal number of times that we can apply A± to an eigenvector f of A until we obtain zero. Else, we would have discovered a new eigenvector corresponding to a large enough or small enough eigenvalue, which is not possible. So there must be extreme eigenstates of Ai which can be obtained by repeated application of the ladder operators, such that A±f=0, or equivalently, ∥A±f∥=0.
This is where the magical part happens. It just so happens that A+ and A− are adjoints of each other, and so it is possible to write:
∥A±f∥=⟨A±f,A±f⟩=⟨f,A∓A±f⟩.
It just so happens that
f is also an eigenvector of
A∓A± (whad’ya know?) so that now,
∥A±f∥ is equal to the eigenvalue of
f with respect to
A∓A±. In the case of the angular momentum and spin operators, this gives us equations which the extreme states must satisfy. There is one last trick, which is that any two extreme states part of the same chain must differ by an element of
cZ, by the nature of the construction of the ladder operators. If we then factor out the right things, we obtain the fact that
the set of eigenvalues of the operator Ai is discrete.
There is actually another trick that I glossed over, which was the trick of using some other operator that commutes with Ai; we use something like L2 or S2, which gives us a vehicle to do things, because somehow, we can express A∓A± in terms of A2 and Ai.
Comments
Not signed in. Sign in with Google to make comments unanonymously!