By using power series, we can prove that the problem of solving linear homogeneous ODE with constant coefficients can be reduced to the problem of solving a polynomial with those coefficients. This article illustrates this point in detail, but it uses a very awful notation…
This article is translated from a Chinese article on my Zhihu account. The original article was posted at 2019-04-06 13:46 +0800.
Notice: Because this article was written very early, there are many mistakes and inappropriate notations.
(I’m going to challenge writing articles without sum symbols!)
In this article, functions all refer to unary functions; both independent and dependent variables are scalars; and differentials refer to ordinary differentials.
Definition 1 (linear differential operator). Let L(D,x) be a differential operator. If for any two functions F1(x),F2(x) and constants C1,C2, the operator satisfies
L(C1F2+C2F2)=C1LF1+C2LF2, then L is a linear differential operator.
Lemma 1. The sufficient and necessary condition for L to be a linear differential operator is that for any tuple of functions F(x) and any tuple of constants C (the dimensions of the two vectors are the same), the operator satisfies L(C⋅F)=C⋅LF.
Brief proof. Directly letting C and F be 2-dimensional vectors and using Definition 1, the sufficiency can be proved; by using mathematical induction on the dimension of C and F, the necessity can be proved. □
Definition 2.ann=0m:=(a0,a1,…,am).
Lemma 2. Suppose the (m+1)-dimensional vector a is independent to x, and {fn(x)}n=0s is a sequence of functions, then Dk(a⋅fn(x)n=0s)=a⋅Dkfn(x)n=0s.
Brief proof. By mathematical induction. □
Lemma 3. Suppose P is a tuple of functions w.r.t. x, and the dimension of P is m+1, then the differential operator L(D,x):=P⋅Dkk=0m is a linear differential operator.
Brief proof. By Definition 1. □
Corollary of Lemma 3. Suppose p is a constant (m+1)-dimensional vector, then the differential operator L(D):=p⋅Dkk=0m(1) is a linear differential operator.
Definition 3 (linear differential operator with constant coefficients). Linear differential operators with form as Equation 1 are called linear differential operators with constant coefficients.
Definition 4 (linear ODE). Suppose L is a linear differential operator. Then the ODE w.r.t. the function y(x)Ly=f is called a linear ODE, where f(x) is a function.
Specially, if f=0, the ODE is called a homogeneous linear ODE. If L is a linear differential operator with constant coefficients, then the ODE is called a linear ODE with constant coefficients.
Definition 5 (generating function). For a sequence {an}n=0∞, the function G(x):=ann=0∞⋅xnn=0∞ is called the (ordinary) generating function (OGF) of the sequence.
Note. here we do not introduce vectors with infinite dimensions. Actually,
G(x):=s→∞limann=0s⋅xnn=0s.
Definition 6 (exponential generating function). For a sequence {an}n=0∞, the OGF of the sequence {n!an}n=0∞ is called the exponential generating function (EGF) of the sequence {an}n=0∞. In other words,
G(x):=n!ann=0∞⋅xnn=0∞.
Lemma 5 (differential of power functions). Suppose n,k∈N, then Dk(xn)=(n−k)!n!xn−k.
Note. it is stipulated that factorials of negative integers are infinity, so Dk(xn)=0 when n<k.
Brief proof. By mathematical induction. □
Lemma 6 (differential of EGF). If G(x) is the EGF of a sequence {an}n=0∞, then DkG is the EGF of {an+k}n=0∞.
Proof.DkG=Dk(n!ann=0∞⋅xnn=0∞)=n!ann=0∞⋅Dk(xn)n=0∞=n!ann=0∞⋅(n−k)!n!xn−kn=0∞=(n−k)!ann=0∞⋅xn−kn=0∞=n!an+kn=0∞⋅xnn=0∞.(Definition 6)(Lemma 2)(Lemma 5)(Lemma 4)(note in Lemma 5) Then the result can be proved by Definition 6. □
Corollary to Lemma 6.Dkk=0m(n!ann=0∞⋅xnn=0∞)=n!an+kn=0∞⋅xnn=0∞k=0∞.
Definition 9 (sequence equation). Let {an}n=0∞ be an unknown sequence. If the function F(n) explicitly depends on terms in the sequence, then the function F(n)=0 is called a sequence equation w.r.t. the sequence {an}n=0∞. For a sequence, if it satisfies the equation for any n, then it is called a special solution of the sequence equation. The set of all special solutions of the sequence equation is called the general solution of the equation.
Definition 10 (linear dependence of sequences). If for a set of sequences (a sequence of tuples) {an}n=0∞ there exists a tuple of constants C which are not all zero (the dimensions of C and an are the same) such that C⋅an=0 for any n, then the set of sequences are called to be linearly dependent. They are otherwise called to be linearly independent.
Lemma 10. The sufficient and necessary condition for a set of m+1 sequences {an}n=0∞ to be linearly dependent is that detan+kk=0m=0.
Proof. First prove the necessity. There exists a tuple of constants C which are not all zero such that C⋅an=0 for any n (Definition 10).
Replace n by n,n+1,n+1,…,n+m respectively, and we have C⋅an+kk=0m=0. Let the (l+1)th component of an be an∗l, i.e. an=an∗ll=0m. Then we have C⋅an∗ll=0mk=0m=0. By Lemma 9, the LHS actually equals an∗lk=0ml=0mC.
Define matrix A:=an∗lk=0ml=0m, then AC=0, and detA=detAT=detan+kk=0m. Prove by contradiction. Assume that the value of the determinant is not 0, i.e. detA=0, then the matrix A is invertible. Multiply the equation AC=0 by A−1 from the left on both sides, and we have C=0, which contradicts with the fact that C is not all zero.
Therefore, detan+kk=0m=0.
(Boohoo! I cannot prove the sufficiency myself.) □
Lemma 11. Suppose the sequence equation p⋅an+kk=0m=0 (where p is a (m+1)-dimensional constant vector and not all zero) has a set of m linearly independent special solutions {an∗ll=1m}, then the general solution of the sequence solution is C⋅an∗ll=1m, where C is a tuple of m constants.
Proof. First prove that {an}, where an:=C⋅an∗ll=1m, must be a special solution of the original sequence equation.
Substitute it into the LHS of the original sequence equation, and we have p⋅C⋅an+k∗ll=1mk=0m=C⋅p⋅an+k∗lk=0ml=1m=C⋅0l=1m=0.(Lemma 7)(Definition 9) By Definition 9, the sequence {an} is a special solution of the original sequence equation.
Then prove that the original sequence equation does not have a special solution {an}, such that there does not exist a set of m constants C such that an=C⋅an∗ll=1m for any n.
Prove by contradiction. Assume there is such a special solution, denoted as {an∗0}. Then by Definition 10, the set of sequences (sequence of tuples) an∗ll=0m are linearly independent. Let matrix A:=an∗ll=0k=0m, then according to Lemma 10, A is invertible.
Because the set of sequences are all special solutions of the original sequence equation, by Definition 9, p⋅an+k∗lk=0ml=0m=0. By Lemma 9, the LHS of the equation is actually Ap. Therefore, Ap=0. Multiply the equation by A−1 from the left on both sides, we have p=0, which contradicts with the fact that p is not all zero.
From all the above, we have proved that the general solution of the sequence equation is C⋅an∗ll=1m. □
Definition 11 (polynomial). Suppose p is a constant vector whose (m+1)th component is not 0, then the function F(x):=p⋅xkk=0m is called a polynomial of degree m w.r.t. x, and p is called the coefficients of the polynomial.
Definition 12 (multiplicity). Suppose F(x) is an m-degree polynomial w.r.t. x, and r is a complex number, then the maximum natural number w≤m such that DqF(r))q=0w−1=0 is called the multiplicity of r in the polynomial. The complex number with non-zero multiplicity is called a root of the polynomial.
Lemma 12 (fundamental theorem of algebra). The sum of multiplicity of roots of a polynomial equals the degree of the polynomial.
Lemma 13. If r is a root with multiplicity w of the polynomial with coefficients p, then for any natural number q<w, we have p⋅(k−q)!k!rk−qk=0m=0.
Brief proof. First use Definition 11 and Definition 12, and then use Lemma 2 and Lemma 5. □
Lemma 16. If r is a root with multiplicity w of the polynomial with coefficients p, then for any natural number q<w, the sequence an:=(n−q)!n!rn−q(2) is a special solution of the sequence equation p⋅an+kk=0m=0.
Proof. Because an+k=(n+k−q)!(n+k)!rn+k−q, we have rn+k−qan+k=(n+k−q)!(n+k)!=(uq)u=0q⋅(n−q+u)!n!(k−u)!k!u=0q=r−k(k−u)!k!u=0q⋅(uq)(n−q+u)!n!ruu=0q,(Lemma 15)(Lemma 4) and thus an+k=rn−q(k−u)!k!rk−uu=0q⋅(uq)(n−q+u)!n!ruu=0q. Because the LHS of the original sequence equation p⋅an+kk=0m=p⋅rn−q(k−u)!k!rk−uu=0q⋅(uq)(n−q+u)!n!ruu=0qk=0m=rn−q(uq)(n−q+u)!n!ruu=0q⋅p⋅(k−u)!k!rk−uu=0qk=0m=rn−q(n−q+u)!n!ruu=0q⋅0k=0m=0,(Lemma 7)(Lemma 13) by Definition 9, {an} is a special solution of the sequence equation. □
Lemma 17. The sequence an:=Cl,qq=0wl−1⋅(n−q)!n!rln−qq=0wl−1l=1o⋅1 is the general solution to the sequence equation p⋅an+kk=0m=0, where rll=1o all different roots of the polynomial with coefficients p, and wll=1o are the corresponding multiplicities of the roots, and Cl,q are arbitrary constants.
Brief proof. By Lemma 16, the root rl brings wl special solutions. All the special solutions brought by all the roots can be proved to be linearly independent. According to Lemma 12, there are m linearly independent special solutions. According to Lemma 11, the result can be proved. □
Lemma 18. The sufficient and necessary condition for the sequence {an} to be a special solution of the sequence equation p⋅an+kk=0m=0 is that its EGF is a special solution of the ODE p⋅Dkk=0my=0.
Proof. First prove the sufficiency. Suppose y is the EGF of {an}, i.e. y=n!ann=0∞⋅xnn=0∞ (Definition 6), then the LHS of the original ODE L(D)y=p⋅Dkk=0m(n!ann=0∞⋅xnn=0∞)=p⋅n!an+kn=0∞⋅xnn=0∞k=0m=p⋅n!an+kk=0mn=0∞⋅xnn=0∞=n!p⋅an+kk=0mn=0∞⋅xnn=0∞.(Lemma 6)(Lemma 7) Therefore, L(D)y is the EGF of the sequence {p⋅an+kk=0m}n=0∞. Because it is a zero function, by Lemma 8, for any n, p⋅an+kk=0m=0. All the steps are reversible, so the necessity is also proved. □
Definition 14 (exponential function). The EGF of the sequence {1} is called the exponential function, i.e. ex:=n!1n=0∞⋅xnn=0∞.
Lemma 19. If y is the EGF of the sequence in Equation 2, then y=xqerx.
Proof.y=n!(n−q)!n!n=0∞⋅xnn=0∞=(n−q)!rn−qn=0∞⋅xnn=0∞=n!rnn=0∞⋅xn+qn=0∞=xqn!1n=0∞⋅(rx)nn=0∞=xqerx.(Definition 6)(note of Lemma 5)(Lemma 4)(Definition 14)□
Corollary of Lemma 18 and 19. The function y=Cl,qq=0wl−1⋅xqq=0wl−1erlxl=1o⋅1 is the general solution of the ODE p⋅Dkk=0my=0, where rll=1o are the different roots of the polynomial with coefficients p, and wll=1o are the corresponding multiplicities of the roots, and Cl,q are arbitrary constants.
Finally, according to all the lemmas above, we now know how to solve the ODE p⋅Dkk=0my=0, where the (m+1)th component of p is not zero. Actually, all we need to do is to solve the polynomial with coefficients p, and use the corollary above, and then we can get the general solution of the original ODE.
The method is identical to that in Advanced Mathematics (notes of translation: this is a popular book in China about calculus), but the derivation is different. Although mine is much more complex, but it is very interesting, because it involves much knowledge in algebra.
(I haven’t used the summation symbol! I’m so good!
The whole article is using the scalar product of vectors as summation, very entertaining. Actually, when examining linear problems, vectors are good. Also, it looks clear if I use the vector notation.)