News
It's spring break! I'm relaxing back at home.
[May 13, 2025:] Geometry and Probability
[May 1, 2025:] Words I Pretend to Know
[April 1, 2025:] Langevin on Manifolds
Notes
Working on notes on the quantum mechanics, derivatives (AKA tangent spaces vs. algebraic approaches), and uploading my course notes onto this blog!
Projects
Finally started a projects page! I've recently made some nice upgrades to my post component, so it looks pretty clean! ;)
🌊
I'm considering whether or not to continue this project using WebGL or Three.js.
I'm also researching methods for generating the 3D scenes I want for this project automatically.
In the meantime, I've decided to proceed with some preliminary prototypes of the other interactive parts of this project.
Orange Juice
I like orange juice. :)
Mlog
Geometry and Probability
May 13, 2025
By Aathreya Kadambi
When thinking about what to do for my final project in my Riemannian geometry class, I had two potentialy topics in mind:
- Information geometry, the fisher information metric, etc.
- A review of recent work on optimal transport, based on the material in Cédric Villani’s books.
I became interested in the first topic because it has the word “information” in it. Supposedly this may be a misnomer, but my gut instinct is that it’s not. I was particularly interested because Fisher information is a Hessian, which is essentially curvature. In my mind (especially after studying a tad bit of physics) curvature is all that matters since we usually only care about second-order Taylor expansions and local studies. So this felt like a mix of my favorite things: information and curvature. Not only that, but one of my friends had been learning about Fisher information in his statisics class, so I was a little jealous and curious. After enrolling in Bayesian statistics and Riemannian geometry this semester, I learned a bit more about Fisher information and also the Fisher information metric, and the field of information geometry has been stuck in the back of my mind ever since.
I became interested in the second topic when I was reading about TRELLIS and a method called rectified flow. Supposedly, optimal transport finds amazing applications in computer vision and graphics, and it’s really manifested in some amazing technologies. I wanted to learn more about these. In addition, I noticed that optimal transport seemed to have an extremely rich mathematical theory behind it. I found Cédric Villani’s books on the subject and decided to start reading. Again, it seemed to fit right in with Riemannian geometry, and I loved that it also considered measure theory (it’s always felt very strange to me that there are almost two theories of integration and interpretations of symbols like “dx”).
After watching this talk by Villani, I have a strong gut feeling that there is a much deeper connection between all of these ideas. After all, it seems to me that probability theory and geometry set out to do the same thing: measure the world. I think that in some way, they should yield extremely similar theory and ideas.
In this post, I will give an overview of what I am learning about both subjects, and maybe some small details about why I think they are so interesting and related. My Riemannian geometry professor mentioned RCD spaces, so I’ll focus on a paper from 2009 by John Lott and Cédric Villani, Ricci curvature for metric-measure spaces via optimal transport. In this paper, the authors show how to extend optimal transport theory to metric-measure spaces, and how to develop a notion of Ricci curvature bounded below.
Remark. For transparency, I used an LLM to speedily convert my original LaTeX notes into this blog post in MDX (I was originally going to do it directly in MDX but decided to go with LaTeX for the better PDF convertibility). Once I refine them further, my notes will also be on my professional website! If you notice any errors in this post, however, let me know!
In a next post I’ll consider some of theorems and inequalities that can be generalized to metric spaces based on this work. Somehow I saw these theorems on nonsmooth spaces before I saw them on smooth spaces. :-) But they were theorems I’ve been looking forward to since I first heard about Fisher information and entropy.
For context, I will also discuss some background on optimal transport theory based on Villani’s book, Optimal transport: old and new and a lecture by Luigi Ambrosio. I will borrow notation and ideas from the references above. will denote a complete, separable, measured length space.
Background on Optimal Transport
Definition (Image Measure/Push-Forward). a Borel measure on . If is a Borel map, is the push-forward of by :
Definition (Law of a Random Variable). For a random variable on ,
Definition (Coupling). Let and be probability spaces. A coupling is a collection of two random variables and such that and . Via notation abuse, and are also called couplings. The set of all couplings of and is denoted .
Villani’s Remark (Story of Couplings).
- Uninformative couplings always exist. Take the space (, : the trivial coupling. Here ” does not give any information about the value of ” (Villani 6).
- Really good couplings don’t always exist. When has all information about , we have a deterministic coupling: a measurable function so that .
Optimal Transport. Let be a cost function. The Monge-Kantorovich problem is:
Wasserstein Distance. Let be the space of Borel probability measures on metric space . The square of the Wasserstein distance between two measures is the infimal cost of the Monge-Kantorovich problem with cost .
Ricci Curvature Bounds on Nonsmooth Spaces
The trick to defining nonnegative -Ricci curvature is to consider this notion of weak displacement convexity: convexity along Wasserstein geodesics. Define as equipped with the Wasserstein distance .
Consider a continuous convex function with .
Definition (, ). Consider . We define the following collections of “entropy-type” functions:
For any reference probability measure , can be decomposed as where is singular w.r.t. . Then, the corresponding “entropy-type” functional to is:
where .
Ricci Curvature Bounds on Nonsmooth Spaces. Compact measured length space has nonnegative -Ricci curvature for if for every with support contained in we have weak displacement convexity for every functional: there exists a Wasserstein geodesic connecting and such that
Remark (Eulerian/Lagrangian duality). This definition has a heavy Lagrangian flavor in the sense that we consider convexity along geodesics, whereas the notion in the Riemannian case feels fundamentally more Eulerian and algebraic in nature. Maybe this provides another example of what Ambrosio mentions in his talk?
Theorem (Lott-Villani 0.12 with constant ). For any smooth compact connected Riemannian manifold ,
- for , the space has nonnegative -Ricci curvature if and only if .
- has -Ricci curvature bounded below by if and only if .
Theorem (Lott-Villani 0.13). The properties of having nonnegative -Ricci curvature or -Ricci curvature bounded below by commute with taking Gromov-Hausdorff limits.
Remark (Role of Optimal Transport). The role of optimal transport in all of this is providing a metric on from which we get , where we can formulate displacement convexity more carefully as convexity along Wasserstein geodesics.
RCD() Spaces
Definition (). A space is called if it has Ricci curvature bounded below by and dimension at most .
With the theory above, we can at least define spaces.
Remark (Finsler Spaces). A Finsler space is a differentiable manifold with a Minkowski norm on tangent spaces. We don’t like them because they do not have as nice properties as Riemannian manifolds (“anisotropic” geometry, issues relating to flows, no Bochner inequality).
RCD() spaces exclude Finsler spaces by requiring spaces to be “infinitesimally Hilbertian”:
Definition (). A metric-measure space is called if it is and is a Hilbert space.
Alternatively, in Ambrosio’s talk, he instead uses the definition for infinitesimally Hilbertian that the “Cheeger Dirichlet energy” is quadratic.
Remark (GH Convergence). In a work by Ambrosio, Gigli, and Savaré, they show that spaces are also more stable under GH convergence than spaces. Here they use the alternative definition for infinitesimally Hilbertian spaces.
One other result Lott and Villani show is the following:
Theorem (Lott-Villani 5.31). If a compact measured length space has nonnegative -Ricci curvature for some , then for all and all ,
and from class we had the following Bishop-Gromov result: if is complete and has nonnegative Ricci curvature,
References
- Ambrosio, Luigi. Optimal transport: old and new. 2015.
- Lott, John, and Cédric Villani. Ricci curvature for metric-measure spaces via optimal transport. Annals of Mathematics 169.3 (2009): 903—991.
- “Optimal Transport and Ricci Curvature.” YouTube, uploaded by Rio ICM2018, https://www.youtube.com/watch?v=JCHNQWhcSLs.
- Gigli, Nicola. On the differential structure of metric measure spaces and applications. Memoirs of the American Mathematical Society 236.1113 (2015): 1—91. DOI: https://doi.org/10.1090/memo/1113.
Comments
Not signed in. Sign in with Google to make comments unanonymously!