News

Happy 2025! I'm back home for winter break! :-)

Blog

[1/7/2025:] SLAHMR and New Years Updates
[11/5/2024:] Satisfying Sip
[9/28/2024:] Fomenko's Art
[9/18/2024:] Truth and Orientation
[9/2/2024:] Ghee and Ethics

Notes

Working on notes on the quantum mechanics, derivatives (AKA tangent spaces vs. algebraic approaches), and uploading my course notes onto this blog!

Projects

Finally started a projects page! I've recently made some nice upgrades to my post component, so it looks pretty clean! ;)

🌊

I'm considering whether or not to continue this project using WebGL or Three.js.

I'm also researching methods for generating the 3D scenes I want for this project automatically.

In the meantime, I've decided to proceed with some preliminary prototypes of the other interactive parts of this project.

Orange Juice

I like orange juice. :)

Mlog


Motivating Ladder Operators

August 5, 2024
By Aathreya Kadambi

Recently I’ve been reading about spin and angular momentum operators in quantum mechanics, which both satisfy some very nice commutation relations:

[Ai,Aj]=iϵijkAk[A_i, A_j] = i\hbar\epsilon_{ijk} A_k
for A=LA = L or A=SA = S. Somehow, we are repeatedly able to use so called ladder operators to find that an operator can only take on a discrete set of eigenvalues. Apparently the two index cards in the picture above weren’t enough to figure it all out, so I’ve decided to utilize my new ability to make KaTeX+MDX blog posts.

Contents

  1. Jumbled Thoughts
  2. A Path to The Essence (Possibly)
  3. Random Scratch Work

Jumbled Thoughts

I’ll start out with a vague (potentially nonsensical) description of what I think is so interesting about the ladder operators technique, and then try to generalize it. We are able to construct so called “ladder operators” which are such that when you act them on an eigenvector ff of AA with eigenvalue μ\mu, they have the property that they return a new eigenvector of AA with eigenvalue μ±c\mu \pm c. Namely,

A3(A±f)=(λ+c)(A±f).A_3 (A_{\pm} f) = (\lambda + c)(A_{\pm} f).
Somehow, with this clever operator, we are able to actually discover that the set of eigenvalues of A3A_3 are discrete. Here the choice of 33 seems soemwhat arbitrary, it’s just to mimic the way it is done with LzL_z or SzS_z.

To me, it seems that the crux of the ladder argument is the above, that we can find an operator that “bumps up” an eigenvector to another eigenvector with increased eigenvalue. By doing this, we are actually breaking up R\R (the set of all possible eigenvalues) into a bunch of little ladders and offsets: cZ+rc\Z + r where r[0,c)r \in [0, c). Denote by Eλ(Ω)E_\lambda(\Omega) the set of all eigenvectors of Ω\Omega corresponding to an eigenvalue λ\lambda. Then A±A_{\pm} give us maps:

A±:Eλ(A3)Eλ±c(A3).A_{\pm} : E_{\lambda}(A_3) \rightarrow E_{\lambda \pm c}(A_3).

The other big trick is having an operator A2:=A12+A22+A32A^2 := A_1^2 + A_2^2 + A_3^2 that commutes with A3A_3:

[A3,A2]=[A3,A12]+[A3,A22]=iA1A2+iA2A1iA2A1iA1A2=0.[A_3, A^2] = [A_3, A_1^2] + [A_3, A_2^2] = i\hbar A_1A_2 + i\hbar A_2A_1 - i\hbar A_2A_1 - i\hbar A_1A_2 = 0.
This allows us to restrict our attention to just simultaneous eigenvectors of both A2A^2 and A3A_3, which allow us to fix an eigenvalue of A2A^2 and focus on eigenvalues of A3A_3. Somehow, with this fact, we are able to bound the eigenvalues of A3A_3, and combined with the previous ladder operators, we are able to actually discover that the eigenvalues of both A2A^2 and A3A_3 are discrete! A big step in this is the fact that:
AA±=A2A32±iA3,A_{\mp}A_{\pm} = A^2 - A_3^2 \pm i\hbar A_3,
which ends up following from the commutation relation and the construction of A2A^2.

How does it all just work out so perfectly?

If I want to claim that it worked out so perfectly, I should identify what parts seem so coincidental. To me, it seems magical that we can discover that the collection of eigenvalues is discrete in nature from the initial commutation relationship. This leads me to wonder… is there a general use case for these ladder operators? It seems to me that there are a few crucial steps in this process:

  1. Bound the eigenvalues of the operator in question.
  2. Relate eigenvalues by the ladder operators A±A_{\pm}.
  3. It must be that A±kf=0A_{\pm}^k f = 0 for some kk.
  4. If kk is chosen minimally, it must mean that λ±kc=0\lambda \pm kc = 0, so that λ=kc\lambda = \mp kc with kZk \in \Z.

One thing that particularly intruiges me is the possibility that the discretness comes from the very fact that we can bound the eigenvalues.

To examine this further, we can try to compare this to the discreization of energy for the particle in a box. In that case, the idea was to factor the hamiltonian operator: HAAH \sim A^\dagger A. Here again, we try to factor L2AAL^2 \sim A^\dagger A. In the case of the hamiltonian, we were left with something of the form:

H=AA+12ωI=AA+p(I)H = A^\dagger A + \frac{1}{2}\hbar\omega I = A^\dagger A + p(I)
where pp is a polynomial and notice that [H,I]=0[H, I] = 0. In the angular momentum case, we were left with
L2=AA+L32iL3=AA+p(L3)L^2 = A^\dagger A + L_3^2 \mp i\hbar L_3 = A^\dagger A + p(L_3)
where pp is again a polynomial and notice that [L2,L3]=0[L^2, L_3] = 0. Very interesting indeed. But this doesn’t shed light on why they increment and decrement the eigenvalues of eigenvectors. Maybe we just find operators that both increment/decrement the eigenvalues and satisfy the above properties?

We can actually see the “ladderness” of the ladder operators from the commutation relationship they have with the operator in question. In particular,

[H,A±]=±ωA±[H, A_{\pm}] = \pm \hbar\omega A_{\pm}
and for the angular momentum case,
[L2,A±]=±0A±[L^2, A_{\pm}] = \pm 0\cdot A_{\pm}
but we do have that:
[L3,A±]=±A±.[L_3, A_{\pm}] = \pm \hbar A_{\pm}.
Mhm. Very peculiar indeed.

I think a very integral part of these ladder operators is that they satisfy: [A,D±]=±cD±[A, D_{\pm}] = \pm c D_{\pm} for some cc. This is important, because then:

A(D±f)=([A,D±]+D±A)f=(λ±c)(D±f).A(D_{\pm}f) = ([A, D_{\pm}] + D_{\pm}A)f = (\lambda \pm c)(D_{\pm}f).
where fEλ(A)f \in E_{\lambda}(A). This encapsulates the idea that the ladder operator DD will increase of decrease the eigenvalue.

Now I’ll get into what I think may be the essence behind thesse ladder operator methods.

A Path to the Essence (Possibly)

To start, I’ll define ladder operator:

Definition. Consider any Hermitian operator AA. We say that an operator LL is a ladder operator for AA of step size cc if

[A,L]=cL.[A, L] = cL.

Theorem. Consider any Hermitian operator AA. If LL is a ladder operator for AA of step size cc, then LL^\dagger is a ladder operator for AA of step size c-c^*.

Proof.

[A,L]=ALLA=(LAAL)=([A,L])=cL.[A, L^\dagger] = AL^\dagger - L^\dagger A = (LA - AL)^\dagger = (-[A, L])^\dagger = -c^*L.
\blacksquare

Theorem. Suppose AA is a Hermitian operator and LL is a ladder operator for AA of step size cc where cRc \in \R. Then [A,LL]=0[A, LL^\dagger] = 0.

Proof.

[A,LL]=L[A,L]+[A,L]L=cLL+cLL=0[A, LL^\dagger] = L[A,L^\dagger] + [A,L]L^\dagger = -cLL^\dagger + cLL^\dagger = 0
\blacksquare

Theorem. Suppose every eigenvalue of AA satisfies λλmax\lambda \le \lambda_{\max} for some eigenvalue λmax\lambda_{\max}. Then if LL is a ladder operator for AA of step size cR+c \in \R^+, and q(A)+LL=Bq(A) + L^\dagger L = B for some operator BB which commutes with AA where pp and qq are polynomials, the only possible eigenvalues of AA are those in λmaxcN\lambda_{\max} - c\mathbb{N}.

Proof.

If λmax\lambda_{\max} is an eigenvalue, consider a corresponding eigenvector vv which is also an eigenvector of BB with eigenvalue μ\mu (I think you might need that BB is self-adjoint for this, although I need to check again later). Then

A(Lv)=(LA+[A,L])v=(LA+cL)v=LAv+cLv=λmaxLv+cLv=(λmax+c)LvA(Lv) = (LA + [A, L])v = (LA + cL)v = LAv + cLv= \lambda_{\max} Lv + cLv = (\lambda_{\max} + c)Lv
but λmax+c\lambda_{\max} + c cannot be an eigenvalue. Thus, it must be that Lv=0Lv = 0, or equivalently, Lv=0\|Lv\| = 0. Now notice that:
Lv=v,LLv\|Lv\| = \langle v, L^\dagger L v\rangle
Now notice that
LLv=(q(A)B)v=q(A)vBv=q(λmax)vμv=(q(λmax)μ)vL^\dagger Lv = (q(A) - B)v = q(A)v - B v = q(\lambda_{\max})v - \mu v = (q(\lambda_{\max}) - \mu) v
so that
0=Lv=q(λmax)μ0 = \|Lv\| = q(\lambda_{\max}) - \mu
so q(λmax)=μq(\lambda_{\max}) = \mu.

Hmm… there is something still missing here. The thing is, in the case of the energy eigenvalues, we can actually find BB as the identity operator so μ=1\mu = 1, and λmin\lambda_{\min} is some constant value. In the case of the angular momentum operator, we made AA the L2L^2 operator and BB the LzL_z operator. However, this isn’t quite the result I wanted. I was hoping for something more like “boundedness translates into discreteness,” but it looks like somehow, these polynomials end up messing things around.

It feels like beyond this step, the path diverges. Like in the case of the hamiltonian operator for the particle in a box, the fact that Lv=0Lv = 0 can be used to simplify the Shrodinger equation, and in the case of the angular momentum operator, we were able to use the identity of the polynomial in combination with the reverse statement with a λmin\lambda_{\min} to discritize the angular momentum. It seems that things may be more complicated than I thought. One similarity I do see, though, is that we have I=LL+p(H)I = L^\dagger L + p(H) and L2=AA+q(Lz)L^2 = A^\dagger A + q(L_z), and II and L2L^2 are similar in that they commute with everything else, LL and AA are the respective ladder operators of HH and LzL_z, and qq and pp are polynomials. I feel like there has to be something there.

I’ll have to think about it more, and I’ll make an update in a future post!

Random Scratch Work

Remark. Warning, the following stuff probably has errors, which is why it is down here. But the errors did help me realize and understand things better.

We use the well-ordering principle (or equivalently, we could frame this via induction). Let SNS \subseteq \N be the set of all n0n \ge 0 such that the only eigenvalue of AA in λmaxc[n,n+1)\lambda_{\max} - c[n, n+1) is λmaxn\lambda_{\max} - n. Now let Sc=N\SS^c = \mathbb{N}\backslash S. Suppose for contradiction that ScS^c is nonempty. By the well-ordering principle, there exists a smallest value mm in ScS^c. Then there must be an eigenvalue in λmaxc[m,m+1)\lambda_{\max} - c[m, m+1) which is not λmaxcm\lambda_{\max} - cm. Denote it by λmaxc(m+r)\lambda_{\max} - c(m+r), where r[0,1)r \in [0,1). Denote the corresponding eigenvector by vv.

Now, if Lv0Lv \neq 0,

A(Lv)=(LA+[A,L])v=(LA+cL)v=LAv+cLvA(Lv) = (LA + [A, L])v = (LA + cL)v = LAv + cLv
=(λmaxc(m+r))Lv+cLv=(λmaxc(m+r1))Lv= (\lambda_{\max} - c(m+r))Lv + cLv = (\lambda_{\max} - c(m+r-1))Lv
but λmaxc(m1+r)<λmaxc(m+r)\lambda_{\max} - c(m-1+r) < \lambda_{\max} - c(m+r).

If m=0m = 0, then λmaxc(m1+r)=λmax+c(1r)>λmax\lambda_{\max} - c(m-1+r) = \lambda_{\max} + c(1-r) > \lambda_{\max}, which can’t possibly be an eigenvalue because λmax\lambda_{\max} was the largest eigenvalue, which is a contradiction. If m1m \ge 1, m1m-1 is in ScS^c as well, which is a contradiction of the minimality of mm.

So since both cases lead to contradiction, it must be that Lv=0Lv = 0, or equivalently, Lv=0\|Lv\| = 0. Now notice that:

Lv=v,LLv\|Lv\| = \langle v, L^\dagger L v\rangle
Now notice that
LLv=(Ap(B))v=Avp(B)v=(λmaxc(m+r))vp(μ)v=(λmaxc(m+r)p(μ))vL^\dagger Lv = (A - p(B))v = Av - p(B) v = (\lambda_{\max} - c(m+r))v - p(\mu)v = (\lambda_{\max} - c(m+r) - p(\mu)) v
so that
0=Lv=λmaxc(m+r)p(μ)0 = \|Lv\| = \lambda_{\max} - c(m+r) - p(\mu)

In either case, we have achieved a contradiction, so ScS^c is indeed empty, and so S=NS = \N. Thus, for all nn, the only eigenvalue of AA in λmaxc[n,n+1)\lambda_{\max} - c[n, n+1) is λmaxn\lambda_{\max} - n, so indeed, the only possible eigenvalues of AA are those in λmaxcN\lambda_{\max} - c\mathbb{N}

\blacksquare

By essentially the same proof but using LL^\dagger in place of LL, we also have that:

Theorem. Suppose every eigenvalue of AA satisfies λλmin\lambda \ge \lambda_{\min} for some λmin\lambda_{\min}. Then if LL is a ladder operator for AA of step size cR+c \in \R^+, the only possible eigenvalues of AA are those in λmin+cN\lambda_{\min} + c\mathbb{N}.

So indeed, the boundedness of the eigenvalues of an operator translates directly into discreteness! I should specify that by discrete, I mean that there is a minimum distance between two points in the set, or even more strongly, that the set is a subset of cZc\Z for some cc. Even more interestingly, this result seems to suggest that there cannot be two ladder operators of step sizes different in magnitude same operator. That does seem magical. (Just kidding, I had an error in this proof)

In other words, to identify whether an operator is discrete, you simply need to

  1. Find a ladder operator for it with step size cc.
  2. Find the minimum and/or maximum eigenvalues for the operator.

And boom. You have now shown that the eigenvalues of this operator are precisely those in the set:

{λmin+ck:kZ,0kλmaxλminc}\{\lambda_{\min} + ck : k\in \Z, 0 \le k \le \frac{\lambda_{\max} - \lambda_{\min}}{c}\}
This actually motivates setting up a “quantum number” for it. We might call it ll, and take l=λcl = \frac{\lambda}{c}. This way, the eigenvalues of the operator are those in the set:
{lZ:lminllmax}\{l \in \Z : l_{\min} \le l \le l_{\max}\}
where lmin=λmincl_{\min} = \frac{\lambda_{\min}}{c} and lmax=λmaxcl_{\max} = \frac{\lambda_{\max}}{c}..

If we then have bounds on the maximum and minimum possible eigenvalues for AiA_i (here AiA_i is self-adjoint so that all eigenvalues are real, which is an ordered set), we can actually see that there must be some maximal number of times that we can apply A±A_{\pm} to an eigenvector ff of AA until we obtain zero. Else, we would have discovered a new eigenvector corresponding to a large enough or small enough eigenvalue, which is not possible. So there must be extreme eigenstates of AiA_i which can be obtained by repeated application of the ladder operators, such that A±f=0A_{\pm} f = 0, or equivalently, A±f=0\|A_{\pm} f\| = 0.

This is where the magical part happens. It just so happens that A+A_+ and AA_- are adjoints of each other, and so it is possible to write:

A±f=A±f,A±f=f,AA±f.\|A_\pm f\| = \langle A_\pm f, A_\pm f\rangle = \langle f, A_{\mp}A_{\pm} f\rangle.
It just so happens that ff is also an eigenvector of AA±A_{\mp}A_{\pm} (whad’ya know?) so that now, A±f\|A_{\pm} f\| is equal to the eigenvalue of ff with respect to AA±A_{\mp}A_{\pm}. In the case of the angular momentum and spin operators, this gives us equations which the extreme states must satisfy. There is one last trick, which is that any two extreme states part of the same chain must differ by an element of cZc\Z, by the nature of the construction of the ladder operators. If we then factor out the right things, we obtain the fact that the set of eigenvalues of the operator AiA_i is discrete.

There is actually another trick that I glossed over, which was the trick of using some other operator that commutes with AiA_i; we use something like L2L^2 or S2S^2, which gives us a vehicle to do things, because somehow, we can express AA±A_{\mp}A_{\pm} in terms of A2A^2 and AiA_i.



As a fun fact, it might seem like this website is flat because you're viewing it on a flat screen, but the curvature of this website actually isn't zero. ;-)

Copyright © 2024, Aathreya Kadambi

Made with Astrojs, React, and Tailwind.