In this article, we will find that the inertia matrix naturally appears when we calculate the kinetic energy T or the angular momentum M of a rigid body. Then, we introduce the concept of principal inertia Jpri. We also study how the inertia matrix changes under translations and rotations and how those transformations may lead to conclusions that can help us simplify the calculation of inertia matrices.
This article is translated from a Chinese article on my Zhihu account. The original article was posted at 2019-11-03 20:02 +0800.
Some notations:
a indexes particles. j,k,l are indices of vector components. q,s,t are arbitrary vectors.
I is the unit matrix. Matrix q⊗s:=qsT is the outer product of q and s, and q⊗2:=q⊗q.
A rigid body consists of some particles with distances between each other fixed. μa is the mass of a particle; ra is the position of a particle relative to the center of the rigid body (∑aμara=0); and va is the velocity of a particle.
The total mass of the rigid body is m=∑aμa; the velocity of its mass center is V; and its angular velocity is Ω. θa is the angle between ra and Ω.
m:=mI, J:=∑aμa(ra2I+ra⊗2).
Talk about homogeneous scalar fields of degree 2. Homogeneous scalar fields of degree 2 are function f that satisfies f(λq)=λ2f(q). It has the form f(q)=21j,k∑Aj,kqjqk, where the coefficient matrix A is a symmetric matrix (this is why there is a factor of 21 in front).
Actually, such a function can be written in matrix multiplication as f(q)=2Aq⋅q (note that it is not the same as 2Aq2, but is (2Aq)⋅q instead).
Similarly, homogeneous vector field with degree 1 has the property g(λq)=λg(q), and they have the form g(q)=Aq (now the coefficient matrix A is not necessarily symmetric).
Introduce outer product. If the matrix A:=q⊗s, then Aj,k=qjsk. Specially, if A:=q⊗2, then Aj,k=qjqk. The perfect square expansion of outer product is (q+s)⊗2=q⊗2+q⊗s+s⊗q+s⊗2.
First, according to geometric relations, we can prove a theorem va=V+Ω×ra. Then, we can discuss the kinetic energy T:=a∑2μava2 of the rigid body. We will discover that the kinetic energy of the rigid body can be represented as the sum of a homogeneous function of degree 2 w.r.t. V and a homogeneous function of degree 2 w.r.t. Ω, i.e.
T=Ttra+Trot, where Ttra:=2mV⋅V,Trot:=2JΩ⋅Ω.
The proof is not difficult: T:=a∑2μava2=a∑2μa(V+Ω×ra)2=a∑2μa(V2+2V⋅(Ω×ra)+(Ω×ra)2)=21V2ma∑μa+V⋅Ω×0a∑μara+21a∑μa(Ω×ra)2=Ttra+21a∑μa(Ω×ra)2, while
=(Ω×ra)2=Ω2ra2sin2θa=Ω2ra2−Ω2ra2cos2θa=Ω2ra2−(Ω⋅ra)2=(j∑ra,j2)(j∑Ωj2)−(j∑ra,jΩj)(j∑ra,jΩj)=j,l∑ra,l2Ωj2−j,k∑ra,jra,kΩjΩk=j,l∑ra,l2Ωjk∑Ij,kΩk−j,k∑ra,jra,kΩjΩk=j,k∑ra2(l∑ra,l2)Ij,kΩjΩk−j,k∑ra,jra,kΩjΩk=j,k∑(ra2Ij,k−ra,jra,k)ΩjΩk=(ra2I−ra⊗2)Ω⋅Ω.
Therefore, T=Ttra+21a∑μa(ra2I−ra⊗2)Ω⋅Ω=Ttra+Trot.
Then, we may discuss the momentum of the rigid body P:=a∑μava. We will discover that it is a homogeneous function of degree 1 w.r.t. V, i.e. P=mV.
The proof is easier: P:=a∑μava=a∑μa(V+Ω×ra)=m(a∑μa)V+Ω×0a∑μara=mV.
Finally, we can discuss the angular momentum of the rigid body M:=a∑ra×(μava). We will discover that it is a homogeneous function of degree 1 w.r.t. Ω, i.e. M=JΩ.
The proof is also easy: M:=a∑ra×(μava)=a∑μara×(V+Ω×ra)=0(a∑μara)×V+a∑μara×(Ω×ra)=a∑μa(ra2Ω−ra⋅Ωra) (note the order of computation in the notation
q⋅st: it is (q⋅s)t). Therefore, Mj=a∑μara2∑kIj,kΩkΩj−a∑μa(k∑ra,kΩk)ra,k=a∑μak∑(ra2Ij,k−ra,jra,k)Ωk=k∑a∑μa(ra2Ij,k−ra,jra,k)Ωk=k∑Jj,kΩk,
i.e. M=JΩ.
We discover that, it seems that the kinetic energy is the sum of “translational kinetic energy” and “rotational kinetic energy”, while the momentum only contains “translational momentum”, and the angular momentum only contains “rotational angular momentum”.
We may discover that J seems to represent some inertia related to rotation of the rigid body, so we call it the inertia matrix (or, in tensor language, the inertia tensor). By this means we introduced an important physical quantity. Now we study some of its properties.
Note that J is a real symmetric matrix, so it has real eigenvaluese Jpri (this is a vector containing all the eigenvalues as its components), and its (normalized) eigenvectors epri (this is a matrix, with its columns being the eigenvectors) orthogonal to each other. These eigenvalues are called the principal moments of inertia, and the axes parallel to the eigenvectors and passing through the center of mass are called the principal axes of inertia.
J is diagonalizable, i.e. epri−1Jepri=diagJpri. Because the eigenvectors of J are orthogonal to each other, the matrix epri is an orthogonal matrix, i.e. epriT=epri,j, so qpri=(qTepri)T=epriTq=epri−1q, i.e. epriqpri=q. We may find that
M=JΩ=(epriepri−1)J(epriepri−1)Ω=epri(epri−1Jepri)(epri−1Ω)=epri(diagJpri)Ωpri.
On the other hand, we have M=epriMpri, so Mpri=(diagJpri)Ωpri, i.e. Mpri,j=Jpri,jΩpri,j. This means that the component of the angular momentum along some certain principal axis of inertia is the product of the principal inertia along this axis and the component of the angular velocity along this axis. This can simplify the calculation of the angular momentum a lot when the
principal inertia can be calculated.
Now, we may study the rotational kinetic energy Trot:=2JΩ⋅Ω=21M⋅Ω=21M⋅Ω=21epri(diagJpri)Ωpri⋅Ω=21(diagJpri)(epriΩ)⋅Ωpri=21(diagJpri)Ωpri⋅Ωpri=21j∑Jpri,jΩpri,j2.
This also makes the calculation simplified when the principal inertia can be calculated.
Study how J will change if ra experience some transformations.
First, consider translation. Let ra′:=ra+d,J′:=a∑μa(ra′2I−ra′⊗2) (we may regard J′ as the inertia matrix of the rigid body w.r.t. the point with d distance away from the center of mass, but not (in usual cases) w.r.t. the center of mass), and then we will discover that J can
be calculated from J′ by J=J′−m(d2I−d⊗2). This provides us a useful way to calculate J when J′ is easy to calculate while J is not.
The proof is not difficult: J′:=a∑μa(ra′2I−ra′⊗2)=a∑μa((ra+d)2I−(ra+d)⊗2)=a∑μa((ra2+2ra⋅d+d2)I−(ra⊗2+d⊗ra+ra⊗d+d⊗2))=Ja∑μa(ra2I−ra⊗2)+m(a∑μa)(d2I−d⊗2)+a∑μa(2ra⋅dI−d⊗ra−ra⊗d)=J+m(d2I−d⊗2)+20(a∑μara)⋅dI−d⊗0(a∑μara)−0(a∑μara)⊗d=J+m(d2I−d⊗2), i.e. J=J′−m(d2I−d⊗2).
The special case of it is called the parallel axis theorem in general physics.
Now, consider rotation. Suppose the rotational matrix R represents the rotation around the unit vector u. Let ra′:=Rra,J′:=a∑μa(ra′2I−ra′⊗2), and we will discover
J′=RJR−1.
Now we will prove the conclusion. First, we now that R is an orthogonal matrix, i.e. RT=R−1. By this we can get other properties like (Rq)2=q2,(Rq)⊗2=Rq⊗2R−1. Therefore, J′:=a∑μa(ra′2I−ra′⊗2)=a∑μa(ra2I−Rra⊗2R−1)=R(a∑μa(ra2I−ra⊗2))R−1=RJR−1.
After getting this conclusion, we may be interested in non trivial (i.e. R=I) cases of J′=J. We will prove that in this case u is a principal axis of the rigid body.
By J′=RJR−1, we can get J=RJR−1, i.e. RJ=JR. Multiply u on both sides, and we have RJu=JRu, i.e. R(Ju)=J(Ru). By Euler’s rotation theorem, the only real eigenvalue of nontrivial R is 1, and its corresponding eigenvector is u, i.e. Ru=u. Substitute, and we have
R(Ju)=Ju, i.e. Ju is an eigenvector of R.
Because the only independent (real) eigenvector of notrial R is u, Ju is parallel to u, so u is an eigenvector of J, i.e. u is a principal axis of the rigid body.
This leads to an important conclusion: if the rigid body maintains its original inertia matrix after it rotates around an axis passing its center of mass by a nontrivial angle (an angle that is not a multiple of 2π), then this axis is a principal axis of the rigid body.
By this we can also get another important conclusion: if a rigid body is rotational symmetric around some axis, then this axis is a principal axis of the rigid body. This conclusion simplifies the determination of principal axes of a rigid body with some symmetries.