Regularizing the partition function of a hydrogen atom
Introduction
The unit system
The unit system used in this article is Hartree atomic units: , where is the electron mass.
In this unit system, the Bohr radius is , which is of angstrom order. Therefore, I will use as the order of macroscopic lengths. The Rydberg unit of energy is , which is of electronvolt order. Therefore, I will use as the order of inverse room temperature.
One can adjust the units to get results for the cases of other hydrogen-like atoms: use instead of , where is the atomic number.
In this article, I also assume that the mass of the nucleus is infinite. If you want more accuracy, you can use instead of , where is the mass of the nucleus.
Terminology about temperatures
I will mainly be working with the inverse temperature , where is the temperature. However, I will still use “temperature” often to give some physical intuition. To avoid confusion in the context of using and in appearance of negative temperature, I would avoid using phrases like “high temperature” and “low temperature”. Instead, here are some terminologies that I am going to use:
- “Cold (positive) temperature” means .
- “Hot positive temperature” means .
- “Cold negative temperature” means .
- “Hot negative temperature” means .
The energy levels of a hydrogen atom are (ignoring fine structures etc.) , with each energy level labeled by , and each energy level has degeneracy (ignoring spin degeneracy, which merely contributes to an overall factor of the partition function). The partition function is which diverges for any (of course, normally we can only have , but the point of saying that it diverges for any complex is that there is no way we can analytically continue the function to get a finite result). Does this mean that statistical mechanics breaks down for this system? Not necessarily. Actually, there are multiple ways we can tackle this divergence.
One should notice that, although this article concentrates on regularizing partition functions and that of the hydrogen atom in particular, all the methods are valid for more general divergent sums.
Here is a sentence that is quoted by many literatures on diverging series, so I want to quote it, too:
It translates to “Divergent series are in general deadly, and it is shameful that anyone dare to base any proof on them.”
The physical answer
A physicist always tell you that one should not be afraid of infinities. Instead, one should look at where the infinity comes out from the seemingly physical model, where there is something sneakily unphysical which ultimately leads to this unphysical divergence. In our case, the divergence comes from high energy levels. It is then a good time to question whether those high energy levels are physical.
There is a radius associated with each energy level in the sense of the Bohr model: . When (which happens at ), the orbit is really microscopic now, and the interaction between the electron and the “box” that contains the whole experimental setup is now having significant effects. Or, if there is not a box at all, we can use the size of the universe instead, which is about (). Use the model of particle in a box for energy levels higher than , and we have where is the side length of the box (assuming that the box is cubic). If is very large, we can approximate the second term as a spherically symmetric integral over the first octant to get .
The integral approximation
This is actually the result for Boltzmann ideal gas, so it should be familar, but I still write down the calculation here for completeness.
We can approximate where means . We can then change the integral to spherical coordinates: where the factor of is because we only integrate in the first octant, and the second step utilizes the symmetry of the integrand and redefines the integrated variable. This integral is than a familiar Gaussian integral of order unity. The value of it is not important for later discussion because all the arguments that follow only uses orders of magnitude, but I tell you it is , which can be evaluated by integrating by parts once and utilizing the famous . The final result is .
Is this an overestimation or underestimation? It is actually an overestimation. Draw a picture of to convince yourself of this. We do not need to estimate how large the error is, though, because we will see that we only need an upper bound to get the arguments we need.
For the first term, we need to consider how the magnitude of the summand changes with . The minimum value of the summand is at . At room temperature, we have , so is well between and . Therefore, the largest term is either or . The former is , which is of order , while the latter is , which is of order for the case of the size of the universe. We may then be interested in the term , which is of order . This is much larger than the term but much smaller than the term, so it is second largest term in the sum.
An upper bound of the summation is given by replacing every term except the largest term by the second largest term, which gives Therefore, the term dominates the entire partition function. This means that the hydrogen atom is extremely likely to be in the ground state (despite the seeming divergence of the partition function). This is intuitive. The probability of the system not being in the ground state is of order for the size of the universe and for a typical macroscopic experiment.
More accurate considerations
The usage of the model of particle in a box for energy levels gives good enough arguments and results, but one may want to question whether this is appropriate.
What happens if you actually put a hydrogen atom in a box (for simplicity, make the box spherically symmetric)? More accurately, consider the quantum mechanical problem in spherically symmetric potential such that for small but grows fast and high enough at large so that the partition function for bound states is convergent. This is called a confined hydrogen atom. A book chapter The Confined Hydrogen Atom Revisited discusses this problem in detail and cited several papers that did the calculations about the energy levels.
Cutoff regularization
By analyzing the orders of magnitude, we see that we actually do not lose much if we just simply cut off the sum at . This corresponds to a regularization method called the simple cutoff: it replaces the infinite sum by a finite partial sum. This can be generalized a little by considering a more general cutoff function such that . Then, an infinite sum can be written as The simple cutoff is then the case where and , where is the Heaviside step function. For converging series, this gives the same result as the original sum thanks to the dominated convergence theorem.
For diverging series
For diverging series, this may give a finite result. For example, for , this method gives for any complex and any smooth enough , where is the Dirichlet eta function. Here is a check for the special case (equivalent to the Abel summation). By definition of the polylogarithm, we have Now, substitute , and utilizing the identity , we have the result .
You may wonder what is the case for , which is also a diverging series, and it looks much like the case above. However, the limit at simply does not exist when (i.e., when the series diverges). This is because we have only for , where is the Riemann zeta function, but it is undefined for other values of . If you analytically continue the result, you will get the famous Rieman zeta function.
However, although this series may converge for any positive , the limit as may not exist. If it diverges because grows too fast (or decays too slowly) as , then we should expect that the sum also tends to infinity as . Assume that we can characterize this divergence by a Laurent series: If the limit converge, we would expect to be zero, and then the result is simply . Therefore, we may also want only when the limit does not exist. To pick out , utilize the residue theorem: where the domain of is now analytically continued from to a deleted neighborhood of . Equation 3 is then a generalized version of Equation 2.
Notice that I have been super slippery in math in the discussion. For example, the Laurent series may not exist at all, and the analytic continuation may not be possible at all; even if they exist, the limit may also be different from . However, I may claim that we should be able to select smooth enough for all of these to work, and the results will be independent of the choice of as long as Equation 3 works in this form.
Particularly, one can rigorously prove that for , the sum obtained by this precedure is , where is the Riemann zeta function, as long as has bounded th derivative and the sum converges. This is proven in an interesting blog article.
Alternative forms of cutoff regularization
In some cases, one may discover that is not analytic when so that the Laurent series expansion is not possible. An example is (for ) with no degeneracies (this system also has a diverging partition function for any complex ). In this case, if you try to use the cutoff function , the sum goes like instead of analytically when . Proving this is simple. We have where the last step uses the substitution . Using the binomial theorem, we have where is the binomial coefficient. Note that , where is the th derivative to the Euler Gamma function, so the integral for gives a factor in the limit of . Therefore, where only the term in the sum is retained for the leading contribution as .
However, for any , one can always choose functions so that the sum goes like as . For example, for (equivalent to the Abelian mean or the heat-kernel regularization), we have We can choose so that Therefore, as , we have However, this does not guarantee that the Laurent series expansion exists. This is a good trial, though. My math capacity does not allow me to confirm whether this is the case for the example of .
Regularizing the hydrogen atom
After saying so much about cutoff regularization in general, what does it say about the partition function of a hydrogen atom? Try multiplying the cutoff function to the summand in Equation 1: where the last step utilizes the result for , with which we get rid of the dependence on . The last expression is then identified as .
Now that we get the expression of , we can get some useful things. However, this time we cannot simply use the summand divided by to get the probability of each energy level because that will break the normalization of the probability distribution. What we can do, however, is to find the expectation value of the energy using . On the other hand, we have , so the probability that the system is not in the ground state is bounded above by .
The first check to do is to verify that this result is consistent with the known behavior of the system at cold zero temperature, where the system is almost certainly in the ground state; in other words, . To get for large , we notice that , so , and this leads to as expected.
Now, we may try to estimate for finite but large (e.g., ) and thus give an upper bound for . We can study the asymptotic behavior of for cold positive temperature. It turns out that , which is for . As we can see, without any physical arguments but only with regularization, we get a result that seems sensible and well between the results in the last section for a hydrogen atom confined in a box with a typical macroscopic size or the size of the universe.
Derivation of the asymptotic behavior at cold positive temperature
We have Therefore, We can try to find the asymptotic behavior of the coefficient of each term. We have We also have , of course. Therefore, These power series are then simply exponential functions. Therefore,
Although the asymptotic behavior at cold temperature () looks good, its behavior is very wrong at some regimes. At some temperature, the monoticity of reverts, and then it gets even lower than the ground state energy and heads all the way to at some finite temperature.This is clearly unphysical. This suggests that it is wrong to use the regularized result.
Plots
Here is a plot that shows how starts to decrease with temperature at some point and becomes even lower than the ground state energy:
Here is a plot that shows how goes to infinity at different temperatures:
Here are also plots for and , if you are curious:
The two vertical asymptotes of corresponds to the two zeros of , which are and . It also has a zero, correponding to the zero of at . The point where is , and the point where has a local maximum is .
Another aspect where we can see that this result is wrong is that, if we look at the hot negative temperature limit , although we have as expected, it is approaching from the wrong side. In fact, because while for , we have for , exceeding the supremum of the energy levels, which is unphysical.
Derivation of the hot negative temperature limit
Here is a non-rigorous derivation. We can rewrite the regularized in a similar form as For finite , it has a straight line asymptote as . The envelope of this family of straight lines (parametrized by ) is , which means that as , where “” means that the ratio of the two sides approaches a positive constant. Similarly, we have . Therefore, as .
Another regularization special to the hydrogen atom
Here is a special regularization method for the hydrogen atom which is not applicable to general systems. Consider the second derivative by differentiating the summand twice w.r.t. in Equation 1, and then take twice antiderivative w.r.t. . This gives where are integration constants. The result from the cutoff regularization and the zeta function regularization is simply , . What is interesting about this is that it already determines the asymptotic behavior of at cold temperature, which is (see Equation 5), no matter what are.
Zeta function regularization
For a series , if it diverges, we can instead consider for some whose real part is big enough for the series to converge. Then, we can try to analytically continue to to get a finite result for the original series. This is called the zeta function regularization.
When zeta function regularization fails
For the zeta function regularization to work, the asymptotic behavior of needs to be a non-trivial power law as . Otherwise, the sum may not converge for any . For example, consider (with no degeneracies). The partition function with zeta function regularization is This series is divergent for any complex .
A famous example is , which gives . Generally, for , we have . This is the same as the result for the simple cutoff regularization. This raises the question of whether the results obtained from those two methods are necessary the same whenever they both exist. I do not have a rigorous proof, but a strong argument is that both of them are the result of some analytic continuation, so they should be the same by the uniqueness of analytic continuation.
We can check this with the hydrogen atom. We have, for and real, Analytically continue this result to , and then this gives the same result as Equation 4. The rest will be the same as the last section.
Can we trust this result?
However, can we trust this result, though? Everything is becoming fishy. Probabilities are no longer well-defined because how we normally derive them using is causing a divergent sum of probabilities and thus invalid. Yet somehow we are trying to estimate the bound of the probability of the system not being in the ground state and getting an expected result. You must have been feeling uncomfortable about this.
The first thing to ask is what we mean by “the expectation value” when the probability distribution is not even well-defined. If it means nothing physical, can we still trust its expression? The simple answer is no.
As we already see, although the result at cold temperature is sensible, the result at some regimes is clearly unphysical. We can also see similar problems with other systems. Consider the system that has energy levels (with no degeneracies). We can easily get , and thus there is a absi at . For , converges, and everything looks good. For , the system is so hot that diverges. Previous arguments suggest that, in this region, the regularized is still . However, we then have in this region, which is lower than the ground state energy. This clearly should not be trusted.
In another aspect, we should note that since the estimation for does not depend on the size of the box confining the hydrogen atom, its rough agreement with the result in the last section should be considered a coincidence.
Another thing to note is that the result of the regularizations depend on whether we “flatten” the energy levels. We can “flatten” all the energy levels: pretend no degeneracies exist. For example, suppose a system with and . However, we can rewrite the same system as (or equivalently )1, with no degeneracies. This “re-grouping” of the energy levels can affect the result of regularizations and whether a zeta function regularization exists. For an immediate example, if we flatten the energy levels of the hydrogen atom, the zeta function regularization does not exist. Another simple example is that, for a system with , we can essentially re-group the all-degenerate states to have any positive integer sequence to get very arbitrary results for the partition function.
Abscissa of convergence
Forget about the hydrogen atom, and let us consider a general system with (ever-increasing) energy levels and degeneracies . For a given system, there is an abscissa of convergence , below (hotter than) which the partition function diverges. In other words, converges for and diverges for . For most physical systems, we have , meaning that it can have any positive temperature, which sounds sensible. The hydrogen atom has , and a two-level system has . A system with and no degeneracy has .
The term “abscissa of convergence” is borrowed from the study of general Dirichlet series. The form of is indeed very much like a general Dirichlet series, but a general Dirichlet series requires , which is not true for the hydrogen atom. However, the existence of an abscissa of convergence is still true for the more general case.
What does it mean physically to have an abscissa of convergence ? First, if , then the system is well behaved at any temperature, which is good and does not need further care.
If , normally one should say the system cannot reach a certain temperature: the system can never be in equilibrium with a heat bath hotter than . Thermodynamically, one can say that the system needs to absorb an infinite amount of heat to reach this temperature. One can see this easily by considering any sensible system, which has : for to go below zero means to make the temperature hotter than infinity, which of course needs an infinite amount of heat intuitively. One may want to see whether it is possible to regularize to get a finite result for . A valid claim to make is that, if can be analytically continued to the half real axis to the left of , then any sensible regularization of there will give the same result as the analytic continuation. Actually, the analytic continuation is exactly the zeta function regularization if there is no degeneracy (or regarding degenerate states as different energy levels). However, it is possible that the analytic continuation does not exist. There may be a branch cut or a natural boundary. For example, if with no degeneracy, where is the th prime number, then is the prime zeta function, which has a natural boundary at . Even if such a regularization exists, it should be questioned whether it is physical.
If , then the system is not well behaved at any temperature. This is the case for the hydrogen atom. Physically, this means that the system cannot be in equilibrium with a heat bath at any temperature. The problem with regularization is the same as the case with .
In a previous article about statistical ensembles, when I defined the partition function, I briefly mentioned that it is only defined for those intensive variables ( in the context of this article) such that the partition function converges. I did not talk about what to do with the partition function when it diverges, but what that article implied is that it is simply undefined and that no physical meaning should be assigned to it in principle. The existence of an abscissa of convergence tells us that there is a “hottest possible temperature” for any given system. The hydrogen atom is symply the case where the hottest possible temperature coincides with the coldest possible temperature (which is the absolute zero). For most sensible systems, the hottest possible temperature is just the positive hot limit. For systems such as , the hottest possible temperature is a finite positive temperature, which is at , resulting from . This can be conterintuitive at first, but one should realize that it is not essentially different from the more common case of .