A measure-theoretic formulation of statistical ensembles (part 2)
This article follows part 1.
Introduction
In part 2, I will focus on non-thermal ensembles.
Before I proceed, I need to clarify that almost all ensembles that we actually use in physics are thermal ensembles, including the microcanonical ensemble, the canonical ensemble, and the grand canonical ensemble (the microcanonical ensemble can be considered as a special case of thermal ensemble where is the trivial).
The theory of thermal ensembles is built by letting the system in question be in thermal contact with a bath. Similarly, if we let the system in question be in non-thermal contact with a bath, we can get the theory of non-thermal ensembles. An example of non-thermal ensembles that is actually used in physics is the isoenthalpic–isobaric ensemble, where we let the system in question be in non-thermal contact with a pressure bath.
However, we will see that it is harder to measure-theoretically develop the theory of non-thermal ensembles if we continue to use the same method as in the theory of thermal ensembles.
Introducing non-thermal contact with an example
A thermal contact is a contact between thermal system that conducts heat (while exchanging some extensive quantities). A non-thermal contact is a contact between thermal system that does not conduct heat (while exchanging some extensive quantities). For reversible processes, thermodynamically and mathematically, heat is equivalent to a form of work, where the entropy is the displacement and where the temperature is the force. However, this is not true for non-reversible processes because of the Clausius theorem. This should have something to do with the fact that entropy is different from other extensive quantities (as is illustracted in part 1).
First, I may introduce how we may cope with the reversible processes of two subsystems in non-thermal contact in thermodynamics. As an example, consider a tank of monatomic ideal gas separated into two parts by a thermally non-conductive, massless, incompressible plate in the middle that can move. The two parts can then adiabatically exchange energy () and volume () but not number of particles (). For one of the parts, we have which is good and easy to deal with because it is simply a differential 1-form.
However, this convenience is not possible for non-reversible processes because then we do not have the simple relation . Actually, the pressure is only well-defined for equilibrium states, and it is impossible to define a pressure that makes sense during the whole non-reversible process, which involves non-equilibrium states. Therefore, although it seems that the “thermally non-conductive” condition imposes a stronger restriction on what states can the composite system reach without external sources, it actually does not because the energy exchanged by the subsystems when they exchange volume is actually arbitrary (as long as it does not violate the second law of thermodynamics) if the process is not reversible.
The possible states of the non-thermally composite system then cannot be simply described by a vector subspace of . If we try to use the same approach as constructing the thermally composite system to construct the non-thermally composite system, the attempt will fail.
Continuing with our example of a tank of gas. Although the pressure is not determined in the non-reversible process, there is one thing that is certain: the pressure on the plate by the gas on one side is equal to the pressure on the plate by the gas on the other side. This is because the plate must be massless (otherwise its kinetic energy would be an external source of energy; also, remember that it is incompressible: this means that it cannot be an external source of volume). Therefore, the relation between the volume exchanged and the energy exchanged is determined as long as at least one side of the plate is undergoing a reversible process because then the reversible side has determined pressure, which determines the pressure of the other side.
This is the key idea of formulating the non-thermal ensembles without formulating the non-thermally composite system. In a thermal or non-thermal ensemble, the composite system consists of two subsystems, one of which is the system in question, and the other is the bath which we are in control of. We can let the bath have zero relaxation time (the time for it to reach thermal equilibrium) so that any process of it is reversible. Then, the pressure (or generally, any other intensive quantities that we are in control of times the temperature) is determined (and actually constant), and we can express the non-conductivity restriction as where is the pressure, which is a constant. This is a homogeneous linear equation on (whose vectors are denoted as in our case) which defines a vector subspace of , which we call . The dimension of is that of minus one. The physical meaning of in this example is the hyperplane of fixed enthalpy.
Note that our bath actually has the fixed intensive quantities , we can rewrite the above equation as Wait! What does do here? It is supposed to mean the temperature of the bath, but the temperature of the bath is irrelevant since the contact is non-thermal. Actually, it is. The temperature of the bath serves as an overall constant factor of , which does not affect as long as it is not zero or infinite. So far, this means that the temperature of the bath is not necessarily fixed, so the actual number of fixed intensive quantities is the dimension of minus one, which is the same as the dimension of . Later we will see that anything that is relevant to the temperature of the bath will finally be irrelevant to our problem. This seems magical, but you will see the sense in that after we introduce another way of developing the non-thermal ensembles (that do not involve baths and non-thermal contact) later.
We can define a complement of in as . Then, we have . The space is a one-dimensional vector space.
For convenience, define . The vector space associated with it is a complement of in . To make the notation look more consistent, we can use as an alias of . They are the same vector space, but emphasizes that it is a subspace of , and emphasizes that it is a subspace of . Then, we have . Every point in can be uniquely written as a sum of a point in and a vector in . We can describe the decomposition by a projection .
We will heavily use the “” on the superscripts of symbols. Any symbol labeled with “” is dependent on (but independent on an overall constant factor on ). You can regard those symbols to have an invisible “” in the subscript so that you can keep in mind that they are dependent on .
Example. Suppose we have a tank of gas with three extensive quantities . It is in non-thermal contact with a pressure bath with pressure so that it can exchange and with the bath. Then, the projection projects macrostates with the same enthalpy and number of particles into the same point. Because a complement of a vector subspace is not determined, there are multiple possible ways of constructing the projection. One possible way is Here the fixed intensive quantity is involved. Note that this projection is still valid for different temperatures of the bath, so an overall constant factor of does not affect the projection.
Non-thermal contact with a bath
Now, after introducing non-thermal contact with an example, we can now formulate the non-thermal contact with a bath.
Suppose we have a system . The main approach is constructing a composite system out of the composite system for the -ensemble.
The composite system for the -ensemble was introduced in part 1. We denote the bath that is in contact with our system as .
Consider this projection (where is an affine subspace of and the range of ): To ensure that it is well-defined, we need to guarantee that for any , and this is true.
The two spaces and do not have any direct relation. The only relation between them is that the dimension of is one plus the dimension of (if they are finite-dimensional).
What is good about the projection is that it satisfies . This makes our notation consistent if we construct another composite system out of . Now, consider the composite system of and under the projection . In the notation of the spaces and mappings that are involved in the newly constructed composite system, we write “” in the superscript.
Just like how is a subspace of , is also a subspace of . This means that both and are well-defined. The former maps to another subspace of , and the latter maps to another subspace of .
We can regard the construction of the new composite system as replacing the “plate” between the subsystems in the original composite system from a “thermally conductive plate” to a “thermally non-conductive plate”. Suppose that in the new situation, the intensive quantities “felt” by subsystem 1 is . Then, because the bath is still the same bath in the two situations, we have Therefore, would be a good definition of . However, actually is trivial: This is because 2 shows that , and thus which is the kernel of by definition.
Because is trivial, it is irrelevant to the temperature of the bath because it is zero no matter what temperature the bath is at.
Example. Suppose a system described by is in non-thermal contact with a pressure bath, and they can exchange energy and volume. The projection is Then, the projection can be By choosing a different or a different , we can get a different . They physically mean the same composite system.
The space is four-dimensional, and the space is five-dimensional. We can denote the five degrees of freedom as , where is the total energy, is the total volume, and is the enthalpy of subsystem 1. Then, the projection can be written as We can get by finding the inverse of the projection, where : Because it is parameterized by one real parameter , it is a one-dimensional affine subspace of . Projecting it under and will respectively give us and :
The affine isomorphism is then naturally Its vectoric form is then
Our fixed intensive quantities are , which is defined as . We can then get by This is consistent with Equation 4.
Non-thermal ensembles (bath version)
Now, we can define the non-thermal contact with a bath to be the same as the thermal contact with a bath under . Utilizing this definition, we can define the composite system for non-thermal ensembles.
Definition. A composite system for the non-thermal -ensemble of the system with fixed intensive quantities is the same as the composite system for the thermal -ensemble with fixed intensive quantities (given by Equation 4), where is defined by Equation 1.
This definition looks very neat. Also, just like how we define the domain of fixed intensive quantities of a thermal ensemble, we can define the domain of fixed intensive quantities of a non-thermal ensemble to consist of those values that make the integral in the definition of the partition function converge.
Because we already derived the formula of the partition function in part 1 that does not involve information about the bath anymore, we can drop the “” in the superscripts. The partition function of the non-thermal ensemble is then Here, the is not fixed at the trivial value (I abused the notation here) but actually is an independent variable serving as one of the arguments of the partition function that takes values in (which is not the domain of fixed intensive quantities of the non-thermal ensemble that was mentioned above).
However, the only meaningful information about this non-thermal ensemble is in the behavior of at instead of any arbitrary , but we do not know whether or not. This is then a criterion of judge whether is in the domain of fixed intensive quantities of the non-thermal ensemble or not. To be clear, we define A problem about this formulation is that it is possible to have two ’s that share the same thermal equilibrium state. In that case, the non-thermal ensemble is not defined.
Because , the observed extensive quantities in thermal equilibrium are just and the entropy in thermal equilibrium is just We can cancel the parameter by Equation 5 and 6 to get
What is interesting about Equation 7 is that it actually does not guarantee the intensive variables to be defined in . Physically this means that the temperature is not necessarily defined, unlike the case of thermal ensembles (this is because the thermal contact makes the temperature the same as the bath and thus defined). The thing that is guaranteed is that the intensive variables are defined in and they must be zero. Therefore, whenever the intensive variables are defined in , it must be parallel to (and remains the same if we scale by an arbitrary non-zero factor). Physically, this means that the system must have the same intensive variables as the bath up to different temperatures.
Non-thermal ensembles (non-bath version)
It may seem surprising that we can define non-thermal ensembles without a bath. How is it possible to fix some features about the intensive variables without a bath? The inspiration is looking at Equation 1. We can make a guess here: if we contract the system along , the contraction satisfy the equal a priori probability principle. We make this guess because of the following arguments:
- Mathematically, contraction is a legal new system, so it should also satisfy the axioms that we proposed before.
- Physically, because the temperature of the bath is arbitrary, the different accessible macrostates should not be too different because otherwise the temperature would matter (as appears in the expression of the partition function).
After finding the equilibrium state of the contraction, we can use the contractional pullback to find the equilibrium state of the original system.
If you do it right, you should get the same answer as Equation 7.
Summary
The only axiom that we used is the equal a priori probability principle. Then, we formulated three types of ensembles: microcanonical, thermal, and non-thermal.