The duality between two plane trajectories related by a conformal map
I always feel amazed about how 2D physics can often be fascinating due to theorems in complex analysis. This article is about one among such cases.
Theorem. The conformal map transforms the trajectory with energy in potential into the trajectory with energy in potential .
This result is pretty amazing in that it reveals a quite implicit duality between the two potentials, and it looks very symmetric as written.
This theorem, as I know of, was first introduced in the appendix of V. I. Arnold’s book Huygens and Barrow, Newton and Hooke. Part of this article is already covered in the relevant part of the book.
Power-law central-force potentials
Before I show the proof of it, let me first introduce it by a much more well-known example.
As we all know, Bertrand’s theorem states that the only two types of central-force potentials where all bound orbits are closed are (the Kepler problem) and (the harmonic oscillator). How the two potentials are special among all sorts of different central-force potentials makes people wonder if there is any connection between them. Fortunately, there is one, and it is obvious once we notice that the complex squaring transforms any center-at-origin ellipses into focus-at-origin ellipses. Inspired by this, it is easy to see that trajectories in the Kepler problem can be transformed into trajectories of harmonic oscillators under complex squaring.
You may ask, how can we notice complex squaring does the said transformation on ellipses? The observation is noticing the simple algebra which means that the Joukowski transform of a unit circle simply translates under complex squaring. We can then try to generalize this to circles of other radii, whose Joukowski transformations are just ellipses! (If you remember, this is the second time Joukowski transformation appears in my blog. The first time was here.)
Then, are the Kepler problem and the harmonic oscillator the only two central-force potentials whose trajectories can be transformed into each other by a complex function? The answer is no. In fact, for any trajectory in almost any power-law central-force potential, we can take some power of it to get a trajectory in another power-law central-force potential.
This result can be summarized as follows. Taking the th power of a trajectory with energy in the potential () gives a trajectory with energy in the potential , where To prove this, we just need to reparameterize the transformed trajectory in a new time coordinate defined as , where is the complex position of the original trajectory. Then, by some calculation and utilizing the energy conservation, we can show that the parameter equation in terms of the new time coordinate satisfy the equation of motion we expect. I will not show the details here because they would be redundant once I prove the more general case using the same methods.
Corollaries and applications
There is an interesting special case, which is . There is no potential that is dual to . Another interesting case is , which is dual to itself (). It kind of means that the coefficient in the potential is “interchangeable” with the energy, and the trajectories can be derived from each other by taking the complex reciprocal.
We can get some interesting results with , which is just the case of a free particle, whose trajectories are all straight lines. Since in this case we necessary have , we can say that the zero-energy trajectory in any power-law potential is related to a straight line by a power. From this result, we can derive some interesting corollaries. For example, the zero-energy trajectory in the Kepler problem is a parabola (square of a straight line), which is well-known. The zero-energy trajectory in is a circle passing through the origin (reciprocal of a straight line), which is a pretty interesting not-so-well-known result.
Another interesting result is that, the deflection angle of an incident zero-energy particle scattered by the potential is under paraxial limit, if This result can be easily derived by using the conformal transform of the real line (actually, a straight line that approaches the real line). The crucial part here is that cannot take negative integers because we need . The reason is that, when , paraxial zero-energy particles are bound to sink into the origin, and thus no scattering actually happens. This small pitfall indicates that the trajectory in the dual potential is not a two-side infinite straight line, either, in that limit, in contrast to being seemingly a free particle.
Some straightforward proofs
Let’s go back to the theorem I stated at the beginning of this article.
Proof. Consider a new time coordinate defined as . Then, the motion of satisfies Here we need to substitute by the equation of motion for . By computing the real and imaginary parts separately, we can derive that for any holomorphic function , the gradient of expressed as a complex number is . Therefore, the equation of motion for is According to series reversion, we have . Therefore, the equation of motion for can also be written as Substitute this, and we have Substitute the energy conservation of the motion of : and we have which is the equation of motion for that we expect.
To get the energy of the motion of , we calculate which is the energy conservation of the motion of in the potential that we expect.
Noticing that we are only interested in the trajectory, we can just use Maupertuis’ principle to get a simpler proof.
Proof. The abbreviated action is then exactly the same for the motion of and the motion of . Therefore, by Maupertuis’ principle, for any physical trajectory of , the trajectory of is also physical.
Details worth noting
Invertibility of the conformal map
There are two different definitions of a conformal transformation in two dimensions. One is that a function defined on an open subset of is conformal iff it is holomorphic and its derivative is nowhere zero. The other is that a function is conformal iff it is biholomorphic (is bijective and has a holomorphic inverse).
You may think here I have adopted the second definition because when I say , I am implicitly assuming that I can take the inverse of to get the function and then take the derivative of it. However, if that is the case, an immediate problem is that then the duality between the Kepler problem and the harmonic oscillator, from which I introduced the more general result in the first place, would not be actually covered by the “more general” result. This is because is not biholomorphic (because it is not injective).
Then, why did this never become a problem when we were studying the duality between the Kepler problem and the harmonic oscillator? All we have talked about is how we can derive a trajectory in the Kepler problem by squaring the trajectory of a harmonic oscillator, but we have not discussed about how we can reverse this process, as an essential part of the duality. You may think the reverse of the process would be totally natural given how symmetric our theorem is regarding the two potentials. However, the reverse is not actually well-defined since the inverse of squaring, i.e., taking the square root, is not a single-valued function. Nevertheless, it is still well-defined in some sense: starting with whichever branch we like, tracing one point on the trajectory of the Kepler problem, and moving it along this trajectory for two cycles, we will end up with a trajectory of the harmonic oscillator if we take the square root of the position and ensure we always choose the branch so that the mapping is continuously done.
What about other power-law central potentials? In those cases, we have non-closed trajectories, so we cannot just move along the trajectory for two cycles. For example, if we take , then the potential would be . For any non-closed trajectory, we can uniquely map it to a trajectory of the potential . However, we cannot uniquely do the reverse mapping. There would be three different trajectories in the potential that can be mapped to the same trajectory in , and we can in turn map the trajectory in to any of the three trajectories in depending on which branch we choose.
Therefore, to generalize this for more general potentials, we can use similar arguments. Because has non-zero derivative everywhere in our considered region, it is everywhere locally invertible by the Lagrange inversion theorem. We can then bijectively map the trajectories in the two dual potentials locally for every small (and finite) segment and then patch them together to get the global correspondence between the two trajectories. This mapping may not be well-defined globally, but the trajectories can still be considered dual to each other. If the potential also becomes multi-valued due to the mapping being multi-valued, then we should imagine this situation like this: at some point, the potential may be different when the particle visit here for the second time. This case does not happen if we only look at power-law potentials, but it does happen for more general cases.
What makes this sense of duality weaker is that one trajectory can be dual to multiple different trajectories. A case worth noting is that sometimes one trajectory can be mapped to infinitely many different trajectories. This happens when the trajectory runs around a logarithmic branch point. However, we can gain the sense of duality back if we can also consider the case where is multi-valued. The notion of conformal transformation is now too limited to cover this case, a better notion is a global analytic function, which generalizes the notion of analytic function to allow for multiple branches.
Requirements for the potential
Not any potential can be expressed as . How can we determine whether a potential can be expressed in this form?
Theorem. A continuous potential can be expressed in the form of (where is a conformal transformation) iff one of the following conditions is met:
- is zero everywhere, or
- is a harmonic function on the domain of .
Proof. First, prove the necessity.
An obvious requirement is that the potential must be positive everywhere or negative everywhere (or zero everywhere, but that is trivial). The sign is determined by the sign of . Therefore, without loss of generality, we can assume because we can always absorb a factor of into and adjust the overall sign of accordingly.
We can decompose in the polar form where is a real function of . Applying the Cauchy–Riemann equations to gives Equate the real and imaginary parts, and we have Use the symmetry of second derivatives on , and we have In the language of vector analysis, this is just .
Considering the case where is negative everywhere, we have that is a harmonic function.
Then, prove the sufficiency.
The case where is zero everywhere is trivial. Otherwise, because is defined everywhere on the domain of , we must have is non-zero everywhere. Because is continuous, we have is either positive everywhere or negative everywhere.
Without loss of generality, assume is positive everywhere. Let be the harmonic conjugate of . Then, is a holomorphic function. We can then define which is also a holomorphic function.
From now on, we will call this requirement on as being log-harmonic for obvious resons.
We should notice that whether is log-harmonic does not respect that any potential can have an additive constant and still be essentially the same potential. An immediate example is that a function that is positive everywhere may be negative somewhere if we add a constant to it. We may then want to ask whether can be log-harmonic if we allow it to be added an additive constant. It is easy to do this: we can just apply the same test to , and see if there is some that makes it work. To illustrate, solve the equation for , and then see whether it is a constant over the whole complex plane.
A property of log-harmonic functions is that the product of two log-harmonic functions is also log-harmonic.
Trajectories that run out of the domain
Trajectories often run out of the domain of the potential. For example, in the discussions about power-law potentials before, though not emphasized, the origin is outside the domain of the potential because it is either a pole or a zero of (except the trivial case where is simply proportional to ). Another example that is rather overlooked is that unbound trajectories go to infinity while infinity is often not in the domain of the potential, either.
What need to take care of is that, when the trajectories run out of the domain, the trajectory is cut off there, and the rest of the trajectory is never considered (even if it may come back to the domain again later). Take the Kepler problem ane the harmonic oscillator as an example. If a trajectory of the harmonic oscillator passes through the origin, which is outside the domain, the trajectory degrades from a closed ellipse to a segment. If you take the square of a segment passing through the origin, you will get a broken line folded into itself, which looks like a particle in the Coulomb field may sink into the origin and then goes back along the exact path it came along. This would confusing if it were physical.
Arbitrariness in the construction of the conformal map
The construction of is not unique for a given .
Rotation and translation
First, we can observe that the substitution does not change (nor thus ). The real number is a function of in principle, but if we want to be holomorphic on a connected region, then must be a constant (except the trivial case where ).
The dual trajectory does change, though, but the dual potential is also changed, too. Because , we have Therefore, the dual trajectory and the dual potential are also rotated and translated by the same amount.
Scaling
Before introducing scaling, I need to add some words about the unit systems. In the above discussions, I have never mentioned what units or dimensions do have. The natural way of thinking is to let have the dimension of length and let have the dimension of energy. However, this is not the only way of thinking. We will later see that the -space and the -space can have totally different dimensions.
The dimensions or units of variables in a physical formula can be totally different from what they were originally intended to be. For example, when a particle is rotating, its motion needs to satisfy , where is the angular velocity. However, although has the dimension of length when it is first introduced, this formula is satisfied by any rotating vectors. A typical example is that the angular momentum changes according to this formula when a rigid body is doing precession. For another example, in classical mechanics and general relativity, the coordinates used to describe the motion of a particle are often not in the dimension of length, but have all sorts of dimensions. For another example that is less well-known, just because the Berry connection has the same gauge transformation as the electromagnetic potential, a bunch of formulas that are useful in electromagnetic theory can be applied to the Berry connection to define all sorts of interesting quantities with rich physical implications. The units of Berry connection are, however, very unimportant because they are literally arbitrary.
Therefore, what does a unit system actually bring us in a physical theory? The only thing it brings us is the ability to conveniently see in what aspects our theories are invariant under the scaling of some quantities. For example, in classical mechanics, we can scale the mass and the potential of any system with the same factor, and then the system will still behave the same in terms of the time-dependent length-based motion. This is because the part of the dimension of energy that is independent of length and time is to the first power of the dimension of mass. For similar reasons, we can derive another two scaling invariances, one about length-scaling and the other about time-scaling. In quantum mechanics, we suffer one less such scaling invariances because of the existence of ; in special relativity, we suffer one less such scaling invariances because of the existence of ; and in general relativity, we suffer two less such scaling invariances because of the existence of and . This is the incentive of introducing natural units in physics: they give us a more clear image of how our theory can be scaled leaving the physics invariant.
As for dimensional analysis, the essence of it is to find the required form of theory so that it satisfies some sort of scaling invariance. For example, we can use dimensional analysis to derive that the frequency of a harmonic oscillator is proportional to the square root of the ratio of the stiffness to the mass. We know this must be correct because this is the only theory that is consistent with the three scaling invariances that must be satisfied by any theories under the framework of classical mechanics.
Now, consider the scaling in , i.e., for some non-zero real number . The potential can be kept invariant by scaling . However, we cannot change if we want to leave the trajectory of unchanged because it is determined by the energy of the trajectory of . Therefore, the dual potential would be scaled to This means that physics is unchanged if length is scaled by and energy and potential are both scaled by . This corresponds to one of the three scaling invariances in classical mechanics that we talked about before.
What is interesting here is that the length-scaling in the -space is done independently of that in the -space. This means that the length dimension in the two systems are independent of each other, so the two systems can have totally different unit systems.
Canonical transformation of time
The transformation from to seems like a coordinate transformation, which is covered by canonical transformations. However, here we have an additional requirement about the form of the Hamiltonian: where is the transformed Hamiltonian (or called the Kamiltonian in the jargon of canonical transformations). This is not generally true because the transformation in the generalized momentum is restrictively determined when the transformation in the generalized coordinate is already given. From the proof of the original theorem, we can see that a transformation in time is a must, which is given by .
The problem is that the canonical transformations covered in most textbooks usually do not allow for a transformation in time, but only for a transformation in the canonical variables. Therefore, I need to first address the problem of integrating the transformation of time into the theory of canonical transformations. I will not do this for the most general case, but only for the case general enough for the purpose of explaining the case interesting this article.
Change in the time variable in the stationary-action principle
Before diving into the general canonical transformation, let’s first consider the case where the transformation is only in the time variable.
Consider a system with the Lagrangian (not explicitly dependent on time). Then, the action can be expressed as The same integral can be expressed in terms of a new time variable as where is the generalized velocity in the new time variable. The transformed Lagrangian, or what I want to call the Magrangian1, is then For the case that we are concerning, is a positive real function of but does not (explicitly) depend on . The limits satisfy the condition This relation is crucial. When finding the variation , we are fixing . However, we cannot fix both because their difference is dependent on the path . What we can do is to fix and to let have a variation given by where is the derivative (or gradient, in higher dimensions) of as a function of . As can be seen, only if is a constant (i.e., is simply an affine transform of ) does vanish for any .
Using the well-known variation of the action when there is variation in the time coordinate, we have where is the energy (or the Kamiltonian, but as a function of generalized coordinates and velocities) of the system.
A quick check of this variation
Because is fixed, we have Now, calculate the variation of the action: Recall the derivation of the Euler–Lagrange equation. For the second term in the integrand, we can integrate by parts to get Substitute this back into the expression for , and we have the desired result.
If we let the first term in vanish, we would get the well-known Euler–Lagrange equation:
However, that term is not zero because there is another term in . If we want the Euler–Lagrange equation to be satisfied, we need the second term to be zero. This means that either is zero or is zero. The latter case will lead us to the trivial case because we have just derived that is zero only if is a constant. The former case can be satisfied, however. If the Euler–Lagrange equation is satisfied, then is a conserved quantity due to the symmetry of in -translation. Then, if happens to be zero at some point, it will be zero over the whole motion, and the stationary-action principle will be satisfied by the motion between any two points.
We can explicitly show that Equation 2 can be derived from the original Euler–Lagrange equation under the zero-energy condition.
Proof. We need to first derive the condition of zero energy in the old time variable. Take derivatives of Equation 1 with respect to , and we have Therefore, the Kamiltonian is where is the original Hamiltonian. This relation means that the condition is equivalent to the condition .
Then, use Equation 1 to explicitly calculate the lhs of Equation 2:
vs. specifying
SpecifyingWe will see that specifying , which is what we have done in the above discussion, is pretty different from specifying . The latter is much simpler, but the former is the one that is used for the conformal duality between potentials. Although I do not have to discuss what the transformation should look like when we specify instead of , I will still do this because I need to point it out that it is quite different from the case we have discussed.
Recall that the canonical transformation is just a transformation of coordinates in the phase space that preserves the canonical one-form up to a total differential. Adding the idea of time transformation into this has a difficulty that time is not a coordinate in the phase space. Including the time coordinate, the actual one-form that needs to be preserved is which is exactly the total differential of the action. Therefore, we have where are the new canonical variables, is the transformed Hamiltonian, and is called the generating function of the canonical transformation. Assume and are both functions of . Then, we have Compare the coefficients of on both sides, and we have These equations determines . They will satisfy Hamilton’s equation:
An example
Consider the Hamiltonian . The motion is Consider the new time variable and the generating function . With Equation 5 and the expression for and , we have a set of five equations: With the expression for the Kamiltonian , we get the motion of : This is consistent with the motion of as can be verified with calculation.
It seems that specifying is much easier than specifying . We can easily discuss the most general case and perfectly recover the equation of motion without having to impose a bizarre condition like the zero energy. This is because specifying is, in some sense, more general than specifying : we can always find the total derivative of for any form of it, but we cannot always find given the form of because of limitations on the integrability.
The conformal transformation as a canonical transformation
Now, we can discuss the conformal transformation as a canonical transformation. The procedure is pretty analogous to that in the previous section, but this time the conclusion would only be valid under the zero-energy condition.
Denote the real and imaginary parts of as , and the real and imaginary parts of as . The Cauchy–Riemann equations give Here are two real functions defined for convenience. They can either be functions of or functions of , depending on which are more convenient. With , we have The time transformation is given by The original Hamiltonian is (the last term is added because we want it to be zero during the motion). Substitute these into Equation 4, and we have () Then, after some calculations, we have perfectly the expected result The condition specifies the energy of the dual trajectory.
For unknown reasons, the transformed Hamiltonian is called the Kamiltonian just because we often use the symbol to represent it. However, there is not a similar convention for the transformed Lagrangian, so I would like to use the letter and call it the Magrangian. The surname “Lagrange” is originated from the French phrase la grange (meaning “the barn”), and correspondingly “Magrange” may refer to the French phrase ma grange (meaning “my barn”). This pun then can make “Magrangian” kind of mean “my Lagrangian”.↩︎