Taylor Formula, Taylor Theorem, Taylor Series, Taylor Expansion
March 6, 2024
This article introduces four closely related concepts, Taylor's formula, Taylor's theorem, Taylor series, and Taylor expansion, and explains how they are connected.
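If the function $f(x)$ is differentiable at the point $x_0$, i.e.,
$$\lim_{x\to x_0}\frac{f(x)-f(x_0)}{x-x_0}=f'(x_0),$$
which can also be written as
$$\lim_{x\to x_0}\left[\frac{f(x)-f(x_0)}{x-x_0}-f'(x_0)\right]=0,$$
then combining the terms over a common denominator gives
$$\lim_{x\to x_0}\frac{f(x)-f(x_0)-f'(x_0)(x-x_0)}{x-x_0}=0.$$
Thus,
$$f(x)-f(x_0)-f'(x_0)(x-x_0)=o(x-x_0)\quad(x\to x_0).$$
Rearranging terms yields
$$f(x)=f(x_0)+f'(x_0)(x-x_0)+o(x-x_0)\quad(x\to x_0).$$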
This implies that near the point $x_0$ we can approximate $f(x)$ by the first-degree polynomial $f(x_0)+f'(x_0)(x-x_0)$, with an error that is an infinitesimal of higher order than $(x-x_0)$. Such an approximation is sometimes too crude, that is, the error is relatively large. It is therefore natural to ask whether we can approximate $f(x)$ by a polynomial of higher degree $n$ so that the error becomes $o((x-x_0)^n)$.
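As a quick numerical illustration of the first-degree approximation, the following Python sketch checks that the error divided by $|x-x_0|$ shrinks as $x\to x_0$. The helper names `linear_approx` and `error_ratio`, and the choice $f(x)=e^x$ with $x_0=0$, are assumptions made for this example only.

```python
import math

def linear_approx(f, df, x0, x):
    """First-degree approximation f(x0) + f'(x0) * (x - x0)."""
    return f(x0) + df(x0) * (x - x0)

def error_ratio(f, df, x0, h):
    """Return |f(x0 + h) - linear approximation| / |h|.

    If f is differentiable at x0, this ratio tends to 0 as h -> 0,
    which is exactly what the o(x - x0) term expresses.
    """
    x = x0 + h
    return abs(f(x) - linear_approx(f, df, x0, x)) / abs(h)

if __name__ == "__main__":
    # Assumed example: f = exp, whose derivative is again exp, at x0 = 0.
    for h in (1e-1, 1e-2, 1e-3):
        print(h, error_ratio(math.exp, math.exp, 0.0, h))
```

For this example the ratio behaves roughly like $h/2$, so it tends to zero, but only linearly in $h$; this is the "crude" behaviour that motivates higher-degree polynomials.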
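To see what such a polynomial must look like, consider an arbitrary polynomial of degree $n$ written in powers of $(x-x_0)$:
$$f_n(x)=a_0+a_1(x-x_0)+a_2(x-x_0)^2+\cdots+a_n(x-x_0)^n.$$
Differentiating $k$ times and evaluating at $x_0$ gives $f_n^{(k)}(x_0)=k!\,a_k$, that is,
$$a_0=f_n(x_0),\qquad a_k=\frac{f_n^{(k)}(x_0)}{k!}\quad(k=1,2,\dots,n).$$
Thus, the coefficients of the polynomial function $f_n(x)$ are uniquely determined by its derivatives at the point $x_0$. This suggests that for a general function $f(x)$ with derivatives up to order $n$ at $x_0$, these derivatives uniquely determine the $n$th-degree polynomial
$$T_n(x)=f(x_0)+f'(x_0)(x-x_0)+\frac{f''(x_0)}{2!}(x-x_0)^2+\cdots+\frac{f^{(n)}(x_0)}{n!}(x-x_0)^n.$$
This polynomial is called the Taylor polynomial of $f(x)$ at the point $x_0$, and the coefficients of $T_n(x)$,
$$\frac{f^{(k)}(x_0)}{k!}\quad(k=1,2,\dots,n),$$
are called the Taylor coefficients.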
It is evident that $f(x)$ and its Taylor polynomial $T_n(x)$ have the same function value and the same derivatives up to order $n$ at the point $x_0$, i.e.,
$$f^{(k)}(x_0)=T_n^{(k)}(x_0),\quad k=0,1,2,\dots,n.$$
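The Taylor polynomial can be evaluated directly from a list of derivative values at $x_0$. The sketch below is one minimal way to do this; the function name `taylor_polynomial` and the convention that index $k$ of the list holds $f^{(k)}(x_0)$ are assumptions of this example.

```python
import math

def taylor_polynomial(derivs_at_x0, x0, x):
    """Evaluate T_n(x) = sum_{k=0}^{n} f^(k)(x0) / k! * (x - x0)^k.

    derivs_at_x0[k] is assumed to hold f^(k)(x0), with index 0 being f(x0),
    so a list of length n + 1 produces the degree-n Taylor polynomial.
    """
    return sum(d / math.factorial(k) * (x - x0) ** k
               for k, d in enumerate(derivs_at_x0))

if __name__ == "__main__":
    # Assumed example: f = exp at x0 = 0, where every derivative equals 1.
    derivs = [1.0] * 6  # f(0), f'(0), ..., f^(5)(0)
    print(taylor_polynomial(derivs, 0.0, 0.5), math.exp(0.5))
```

With six derivative values (degree 5) the polynomial already agrees with $e^{0.5}$ to about four decimal places.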
Returning to our conjecture, can we prove that $f(x)=T_n(x)+o((x-x_0)^n)$? If this holds, then when $f(x)$ is approximated by the Taylor polynomial $T_n(x)$, the error is as desired: an infinitesimal of higher order than $(x-x_0)^n$.
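Theorem: If the function $f(x)$ has derivatives up to order $n$ at the point $x_0$, then
$$f(x)=T_n(x)+o((x-x_0)^n)\quad(x\to x_0).$$
Proof: Let $R_n(x)=f(x)-T_n(x)$ and $Q_n(x)=(x-x_0)^n$; we need to show that $\lim_{x\to x_0}R_n(x)/Q_n(x)=0$. Since $f^{(k)}(x_0)=T_n^{(k)}(x_0)$ for $k=0,1,\dots,n$,
$$R_n(x_0)=R_n'(x_0)=\cdots=R_n^{(n)}(x_0)=0,$$
and
$$Q_n(x_0)=Q_n'(x_0)=\cdots=Q_n^{(n-1)}(x_0)=0,\qquad Q_n^{(n)}(x_0)=n!.$$
Because $f^{(n)}(x_0)$ exists, $f(x)$ has a derivative of order $n-1$, $f^{(n-1)}(x)$, in some neighborhood $U(x_0)$ of $x_0$.
Thus, for $x$ in the punctured neighborhood $U^{\circ}(x_0)$ and $x\to x_0$, applying L'Hôpital's rule $n-1$ times gives
$$\lim_{x\to x_0}\frac{R_n(x)}{Q_n(x)}=\lim_{x\to x_0}\frac{R_n^{(n-1)}(x)}{Q_n^{(n-1)}(x)}=\lim_{x\to x_0}\frac{f^{(n-1)}(x)-f^{(n-1)}(x_0)-f^{(n)}(x_0)(x-x_0)}{n!\,(x-x_0)}=\frac{1}{n!}\lim_{x\to x_0}\left[\frac{f^{(n-1)}(x)-f^{(n-1)}(x_0)}{x-x_0}-f^{(n)}(x_0)\right]=0.$$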
The expression established by this theorem is called the Taylor formula of $f(x)$ at $x_0$. Its remainder term
$$R_n(x)=f(x)-T_n(x)=o((x-x_0)^n)$$
is called the Peano remainder, so the expression is also referred to as the Taylor formula with Peano remainder.
Note: The Taylor formula (with Peano remainder) is a qualitative statement. Although the identity $f(x)=T_n(x)+R_n(x)$ can be written at every point of the domain of $f(x)$, the estimate $R_n(x)=o((x-x_0)^n)$ only describes the remainder as $x\to x_0$ and gives no bound on its size at any particular $x$. This is a significant limitation.
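The Peano remainder can be observed numerically: the ratio $|R_n(x)|/|x-x_0|^n$ should tend to zero as $x\to x_0$. The following sketch checks this for the assumed example $f(x)=\sin x$, $x_0=0$, $n=3$, where $T_3(x)=x-x^3/6$; the function names are illustrative only.

```python
import math

def taylor_sin_3(x):
    """Degree-3 Taylor polynomial of sin at x0 = 0: x - x^3 / 6."""
    return x - x ** 3 / 6.0

def peano_ratio(h):
    """|R_3(h)| / |h|^3 for f = sin and x0 = 0.

    The Peano form of the remainder says this ratio tends to 0 as h -> 0.
    """
    return abs(math.sin(h) - taylor_sin_3(h)) / abs(h) ** 3

if __name__ == "__main__":
    for h in (0.5, 0.1, 0.01):
        print(h, peano_ratio(h))
```

The ratio shrinks as $h$ decreases, but the computation says nothing about how large the error is at a fixed $x$ far from $x_0$, which is the limitation described in the note above.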
Taylor's Theorem
In order to overcome the drawback of the Taylor formula above, which only allows for a qualitative analysis of functions, we need a more precise quantitative expression to characterize the function f(x). A quantitative expression can more accurately delineate the range of errors when approximating the function f(x) with a polynomial function.
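Theorem (Taylor's Theorem): If $f(x)$ has continuous derivatives up to order $n$ on $[a,b]$ and a derivative of order $n+1$ on $(a,b)$, then for any given $x,x_0\in[a,b]$ there exists at least one point $\xi\in(a,b)$ such that
$$f(x)=f(x_0)+f'(x_0)(x-x_0)+\frac{f''(x_0)}{2!}(x-x_0)^2+\cdots+\frac{f^{(n)}(x_0)}{n!}(x-x_0)^n+\frac{f^{(n+1)}(\xi)}{(n+1)!}(x-x_0)^{n+1}.$$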
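Analysis: Under the hypotheses of the theorem, the mean value theorem can be applied to $f(x)$ and $(x-x_0)^{n+1}$, and to their derivatives up to order $n$, on any subinterval of $[a,b]$. The proof is therefore built on repeated use of the (Cauchy) mean value theorem.
Proof: Let
$$F(x)=f(x)-T_n(x),\qquad G(x)=(x-x_0)^{n+1}.$$
Clearly, $F(x)$ and $G(x)$ both have continuous derivatives up to order $n$ on $[a,b]$ and derivatives of order $n+1$ on $(a,b)$. Also, $F(x_0)=F'(x_0)=\cdots=F^{(n)}(x_0)=0$, $G(x_0)=G'(x_0)=\cdots=G^{(n)}(x_0)=0$, $F^{(n+1)}(x)=f^{(n+1)}(x)$, and $G^{(n+1)}(x)=(n+1)!$. Hence, for $x\neq x_0$, applying the Cauchy mean value theorem $n+1$ times,
$$\frac{F(x)}{G(x)}=\frac{F(x)-F(x_0)}{G(x)-G(x_0)}=\frac{F'(\xi_1)}{G'(\xi_1)}=\frac{F'(\xi_1)-F'(x_0)}{G'(\xi_1)-G'(x_0)}=\frac{F''(\xi_2)}{G''(\xi_2)}=\cdots=\frac{F^{(n+1)}(\xi)}{G^{(n+1)}(\xi)}=\frac{f^{(n+1)}(\xi)}{(n+1)!},$$
where each $\xi_k$, and finally $\xi$, lies strictly between $x_0$ and $x$, hence in $(a,b)$. Multiplying by $G(x)$ gives $f(x)=T_n(x)+\frac{f^{(n+1)}(\xi)}{(n+1)!}(x-x_0)^{n+1}$, which is the claimed formula.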
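Note: Taylor's Theorem can alternatively be stated as follows: if $f(x)$ has derivatives up to order $n+1$ in a neighborhood $U(x_0)$ of $x_0$, then for any point $x$ in this neighborhood,
$$f(x)=\sum_{k=0}^{n}\frac{f^{(k)}(x_0)}{k!}(x-x_0)^k+\frac{f^{(n+1)}(\xi)}{(n+1)!}(x-x_0)^{n+1},$$
where $\xi$ lies between $x_0$ and $x$.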
This alternative description may be easier to remember.
The remainder term
$$R_n(x)=\frac{f^{(n+1)}(\xi)}{(n+1)!}(x-x_0)^{n+1}$$
is called the Lagrange remainder, so Taylor's Theorem is also called the Taylor formula with Lagrange remainder.
Furthermore, from the conditions of Taylor's Theorem, it can be seen that the conditions for its use are much stricter compared to the Taylor formula (which only requires nth order derivatives to exist at a point x0), hence the conclusion obtained is stronger. Taylor's Theorem can be utilized to quantitatively approximate the function f(x) using polynomial functions.
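To illustrate how the Lagrange remainder gives a quantitative error bound, consider the assumed example $f(x)=e^x$, $x_0=0$, $x\ge 0$. Since $f^{(n+1)}(\xi)=e^{\xi}\le e^{x}$ for $0<\xi<x$, the remainder satisfies $|R_n(x)|\le e^{x}x^{n+1}/(n+1)!$. The sketch below compares this bound with the actual error; all function names are illustrative.

```python
import math

def taylor_exp(x, n):
    """Degree-n Taylor polynomial of exp at x0 = 0."""
    return sum(x ** k / math.factorial(k) for k in range(n + 1))

def lagrange_bound_exp(x, n):
    """Upper bound for |R_n(x)| when x >= 0: e^x * x^(n+1) / (n+1)!.

    Uses f^(n+1)(xi) = e^xi <= e^x for 0 < xi < x.
    """
    return math.exp(x) * x ** (n + 1) / math.factorial(n + 1)

def smallest_degree(x, tol):
    """Smallest n whose Lagrange bound at x is below tol."""
    n = 0
    while lagrange_bound_exp(x, n) >= tol:
        n += 1
    return n

if __name__ == "__main__":
    actual = abs(math.exp(1.0) - taylor_exp(1.0, 5))
    print(actual, lagrange_bound_exp(1.0, 5))
    print(smallest_degree(1.0, 1e-6))
```

Unlike the Peano form, this bound can be used in advance to choose a degree $n$ that guarantees a prescribed accuracy at a given $x$.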
Taylor Series
The Taylor series, compared to the two mathematical concepts mentioned earlier, appeared later in mathematical analysis because it requires a foundation in knowledge of series, power series, and function series. Since the knowledge of series and function series is not the focus of this article, interested readers can refer to any textbook on mathematical analysis for further study. Here, we only provide information closely related to the Taylor series, focusing on power series.
A power series is the simplest kind of function series, generated by the sequence of functions $\{a_n(x-x_0)^n\}$:
$$\sum_{n=0}^{\infty}a_n(x-x_0)^n=a_0+a_1(x-x_0)+a_2(x-x_0)^2+\cdots+a_n(x-x_0)^n+\cdots$$
To simplify the notation, we only discuss the case $x_0=0$:
$$\sum_{n=0}^{\infty}a_nx^n=a_0+a_1x+a_2x^2+\cdots+a_nx^n+\cdots$$
Replacing $x$ with $x-x_0$ recovers the general case above. Hence, from here on, "power series" always refers to
$$\sum_{n=0}^{\infty}a_nx^n.$$
When discussing a function series, the first thing we need to know is its domain of convergence. For a power series, its domain of convergence has a special characteristic, as illustrated in the following theorem:
Abel's Theorem: If the power series
$$\sum_{n=0}^{\infty}a_nx^n$$
converges at $x=\bar{x}\neq 0$, then it converges absolutely for every $x$ satisfying $|x|<|\bar{x}|$. If the power series diverges at $x=\bar{x}\neq 0$, then it diverges for every $x$ satisfying $|x|>|\bar{x}|$.
Proof: Assume the series
$$\sum_{n=0}^{\infty}a_n\bar{x}^n$$
converges. By the necessary condition for convergence of a series, the sequence $\{a_n\bar{x}^n\}$ converges to zero and is therefore bounded. Hence, there exists a positive number $M$ such that
$$|a_n\bar{x}^n|<M\quad(n=0,1,2,\dots).$$
Furthermore, for any $x$ satisfying $|x|<|\bar{x}|$, we have $\left|\frac{x}{\bar{x}}\right|<1$. Thus,
$$|a_nx^n|=\left|a_n\bar{x}^n\cdot\frac{x^n}{\bar{x}^n}\right|=|a_n\bar{x}^n|\left|\frac{x}{\bar{x}}\right|^n<Mr^n,$$
where $r=\left|\frac{x}{\bar{x}}\right|<1$. Since the series $\sum_{n=0}^{\infty}Mr^n$ converges, the power series $\sum_{n=0}^{\infty}a_nx^n$ converges absolutely when $|x|<|\bar{x}|$.
For the second statement, suppose the power series diverges at $x=\bar{x}\neq 0$. If there were a point $x_0$ with $|x_0|>|\bar{x}|$ at which $\sum_{n=0}^{\infty}a_nx_0^n$ converges, then by the first part the power series would converge absolutely at $x=\bar{x}$, contradicting the assumption. Thus, for all $x$ satisfying $|x|>|\bar{x}|$, the power series $\sum_{n=0}^{\infty}a_nx^n$ diverges.
In fact, Abel's theorem describes the shape of the domain of convergence of a power series: it must be an interval centered at the origin. If we denote the length of this interval by $2R$, then $R$ is called the radius of convergence of the power series, and $(-R,R)$ is called the interval of convergence of the power series $\sum_{n=0}^{\infty}a_nx^n$.
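Theorem: For the power series
$$\sum_{n=0}^{\infty}a_nx^n,$$
if $\lim_{n\to\infty}\sqrt[n]{|a_n|}=\rho$, then:
(i) when $0<\rho<+\infty$, the radius of convergence is $R=\frac{1}{\rho}$;
(ii) when $\rho=0$, the radius of convergence is $R=+\infty$;
(iii) when $\rho=+\infty$, the radius of convergence is $R=0$.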
Proof: Consider the series
$$\sum_{n=0}^{\infty}|a_nx^n|.$$
Since
$$\lim_{n\to\infty}\sqrt[n]{|a_nx^n|}=\lim_{n\to\infty}\sqrt[n]{|a_n|}\,|x|=\rho|x|,$$
the root test for positive series shows that $\sum_{n=0}^{\infty}|a_nx^n|$ converges when $\rho|x|<1$ and that the power series diverges when $\rho|x|>1$. Therefore, when $0<\rho<+\infty$, the condition $\rho|x|<1$ gives the radius of convergence $R=\frac{1}{\rho}$. When $\rho=0$, $\rho|x|<1$ holds for every $x$, so $R=+\infty$. When $\rho=+\infty$, $\rho|x|>1$ holds for every $x$ except $x=0$, so $R=0$.
Note: If
$$\lim_{n\to\infty}\frac{|a_{n+1}|}{|a_n|}=\rho,$$
then necessarily
$$\lim_{n\to\infty}\sqrt[n]{|a_n|}=\rho.$$
Therefore, the ratio test can also be used to find the radius of convergence. In addition, the proof above worked with the series $\sum_{n=0}^{\infty}|a_nx^n|$ obtained by taking the absolute value of each term of the original power series $\sum_{n=0}^{\infty}a_nx^n$. Since a power series converges absolutely inside its interval of convergence, the two series have exactly the same radius of convergence, so the argument is valid.
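Both formulas can be tried numerically by evaluating them at a single large index. The sketch below does this for the assumed example $a_n=n/3^n$, for which $\rho=1/3$ and $R=3$; the helper names and the choice $n=200$ are illustrative, and a finite $n$ only approximates the limit.

```python
def radius_by_root(a, n):
    """Estimate R as 1 / |a(n)|^(1/n) at a large index n (root formula)."""
    return 1.0 / abs(a(n)) ** (1.0 / n)

def radius_by_ratio(a, n):
    """Estimate R as |a(n)| / |a(n+1)| at a large index n (ratio formula)."""
    return abs(a(n)) / abs(a(n + 1))

def coeff(n):
    """Assumed example coefficients a_n = n / 3^n, with exact radius R = 3."""
    return n / 3.0 ** n

if __name__ == "__main__":
    print(radius_by_root(coeff, 200))
    print(radius_by_ratio(coeff, 200))
```

Both estimates approach 3 as $n$ grows; in this example the ratio estimate converges faster because the root estimate carries an extra factor $n^{1/n}$ that tends to 1 only slowly.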
Of course, this method has a limitation: if the limit $\lim_{n\to\infty}\sqrt[n]{|a_n|}$ does not exist (and is not $+\infty$), it fails. For this reason, mathematical analysis textbooks usually give a more general result for the radius of convergence of a power series $\sum_{n=0}^{\infty}a_nx^n$, based on the upper limit (limit superior):
(Extended) Theorem: For the power series
$$\sum_{n=0}^{\infty}a_nx^n,$$
if
$$\limsup_{n\to\infty}\sqrt[n]{|a_n|}=\rho,$$
then:
(i) when $0<\rho<+\infty$, the radius of convergence is $R=\frac{1}{\rho}$;
(ii) when $\rho=0$, the radius of convergence is $R=+\infty$;
(iii) when $\rho=+\infty$, the radius of convergence is $R=0$.
Since this upper limit always exists (possibly as $+\infty$), the theorem gives the radius of convergence of any power series.
In practice we may encounter power series with "missing terms", such as
$$\sum_{n=1}^{\infty}\frac{x^{2n}}{n-3^{2n}},$$
in which only even powers of $x$ appear. The radius of convergence of such a series can certainly be found with the upper-limit theorem above, but it is often more convenient to apply the root test directly to the terms of the series.
Consider
$$\lim_{n\to\infty}\sqrt[n]{\left|\frac{x^{2n}}{n-3^{2n}}\right|}=\frac{1}{9}\lim_{n\to\infty}\sqrt[n]{\frac{x^{2n}}{1-\frac{n}{3^{2n}}}}=\frac{x^2}{9}.$$
By the root test for positive series, the power series converges when $\frac{x^2}{9}<1$, i.e., $|x|<3$, and diverges when $\frac{x^2}{9}>1$, i.e., $|x|>3$. When $x=\pm 3$, the series becomes
$$\sum_{n=1}^{\infty}\frac{3^{2n}}{n-3^{2n}}.$$
Since
$$\lim_{n\to\infty}\frac{3^{2n}}{n-3^{2n}}=-1\neq 0,$$
this series diverges, so the domain of convergence of the original series is $(-3,3)$.
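The two limits used in this example can be checked numerically. The sketch below evaluates the $n$th root of the general term and the endpoint term for a moderately large $n$; the function names and the sample values of $x$ and $n$ are assumptions of this illustration.

```python
def nth_root_of_term(x, n):
    """n-th root of |x^(2n) / (n - 3^(2n))|.

    The denominator is built with exact Python integers before dividing,
    so the large powers do not lose precision prematurely.
    """
    denom = abs(n - 3 ** (2 * n))
    return (x ** (2 * n) / denom) ** (1.0 / n)

def endpoint_term(n):
    """General term of the series at x = 3 or x = -3: 3^(2n) / (n - 3^(2n))."""
    return 3 ** (2 * n) / (n - 3 ** (2 * n))

if __name__ == "__main__":
    print(nth_root_of_term(2, 50), 2 ** 2 / 9)  # both close to 4/9
    print(endpoint_term(30))                    # close to -1, not 0
```

The first value approaches $x^2/9$, and the second stays near $-1$, so the terms at the endpoints do not tend to zero.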
Compared with general series of functions, power series have two special properties regarding uniform convergence:
Property 1: If the power series
$$\sum_{n=0}^{\infty}a_nx^n$$
has radius of convergence $R\,(>0)$, then it converges uniformly on every closed interval $[a,b]\subset(-R,R)$.
Proof: Let $[a,b]$ be any closed interval within $(-R,R)$, and set
$$\bar{x}=\max\{|a|,|b|\}\in(-R,R).$$
Then for any point $x$ in $[a,b]$, we have
$$|a_nx^n|\le|a_n\bar{x}^n|.$$
Since the power series converges absolutely at the point $\bar{x}$, the Weierstrass M-test shows that the power series
$$\sum_{n=0}^{\infty}a_nx^n$$
converges uniformly on $[a,b]$. Since $[a,b]$ was arbitrary, the power series converges uniformly on every closed subinterval of $(-R,R)$. It need not converge uniformly on the whole open interval: for example, $\sum_{n=0}^{\infty}x^n$ does not converge uniformly on $(-1,1)$.
Property 2: If the power series $\sum_{n=0}^{\infty}a_nx^n$ has radius of convergence $R\,(>0)$ and converges at $x=R$ (or $x=-R$), then it converges uniformly on $[0,R]$ (or $[-R,0]$).
Proof: Suppose the power series converges at $x=R$. For $x\in[0,R]$, we have $\sum_{n=0}^{\infty}a_nx^n=\sum_{n=0}^{\infty}a_nR^n\left(\frac{x}{R}\right)^n$.
Since the series
$$\sum_{n=0}^{\infty}a_nR^n$$
converges, and the function sequence
$$\left\{\left(\frac{x}{R}\right)^n\right\}$$
is decreasing in $n$ and uniformly bounded on $[0,R]$, i.e.,
$$1\ge\frac{x}{R}\ge\left(\frac{x}{R}\right)^2\ge\cdots\ge\left(\frac{x}{R}\right)^n\ge\cdots\ge 0,$$
Abel's test shows that the series
$$\sum_{n=0}^{\infty}a_nx^n$$
converges uniformly on $[0,R]$.
Note: Since the power series $\sum_{n=0}^{\infty}a_nx^n$ also converges uniformly on $[a,0]$ for every $a\in(-R,0)$, we can draw a further conclusion: when the power series converges at $x=R$, it converges uniformly on every closed interval $[a,R]$ with $-R<a<R$.
From these two properties regarding uniform convergence, we can deduce further properties of power series:
(i) The sum function of the power series $\sum_{n=0}^{\infty}a_nx^n$ is continuous on $(-R,R)$.
(ii) If the power series $\sum_{n=0}^{\infty}a_nx^n$ converges at the left (right) endpoint of its interval of convergence, then its sum function is continuous from the right (left) at this endpoint.
Note: Each term of the power series $\sum_{n=0}^{\infty}a_nx^n$ is continuous on $(-R,R)$, and the series converges uniformly on every closed subinterval of $(-R,R)$. Since every point of $(-R,R)$ lies in such a subinterval, property (i) follows from the continuity theorem for uniformly convergent series of functions. If the power series converges at $x=R$ (or $x=-R$), then it converges uniformly on $[0,R]$ (or $[-R,0]$), and the same reasoning gives property (ii).
In the study of series of functions, there are theorems concerning term-by-term integration and differentiation:
(Term-by-term Integration Theorem): If the series of functions
$$\sum_{n=0}^{\infty}u_n(x)$$
converges uniformly on $[a,b]$ and each term $u_n(x)$ is continuous, then
$$\sum_{n=0}^{\infty}\int_a^b u_n(x)\,dx=\int_a^b\sum_{n=0}^{\infty}u_n(x)\,dx.$$
(Term-by-term Differentiation Theorem): If each term of the series of functions
$$\sum_{n=0}^{\infty}u_n(x)$$
has a continuous derivative on $[a,b]$, the point $x_0\in[a,b]$ is a point of convergence of $\sum_{n=0}^{\infty}u_n(x)$, and
$$\sum_{n=0}^{\infty}u_n'(x)$$
converges uniformly on $[a,b]$, then
$$\sum_{n=0}^{\infty}\left(\frac{d}{dx}u_n(x)\right)=\frac{d}{dx}\left(\sum_{n=0}^{\infty}u_n(x)\right).$$
Note: In the term-by-term differentiation theorem, the requirement that $x_0\in[a,b]$ be a point of convergence of $\sum_{n=0}^{\infty}u_n(x)$ is stated for convenience of application. Under the other hypotheses it is equivalent to requiring that $\sum_{n=0}^{\infty}u_n(x)$ converge on $[a,b]$. Furthermore, the interval $[a,b]$ in the conditions can be replaced with a general open interval, and the conclusion still holds.
For the power series
$$\sum_{n=0}^{\infty}a_nx^n,\tag{1}$$
the power series obtained by differentiating each term is
$$\sum_{n=1}^{\infty}na_nx^{n-1},\tag{2}$$
and the power series obtained by integrating each term is
$$\sum_{n=0}^{\infty}\frac{a_n}{n+1}x^{n+1}.\tag{3}$$
For convenience, we refer to these three power series as (1), (2), and (3), respectively.
To discuss term-by-term differentiation and integration of power series, we first introduce the following lemma:
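Lemma: Power series (1), (2), and (3) have the same interval of convergence.
Proof: We only need to prove that (1) and (2) have the same interval of convergence, because differentiating (3) term by term yields (1), so the same argument applies to the pair (3) and (1).
Suppose first that (1) converges only at $x=0$. If (2) converged at some point $x'\neq 0$, then for any $\bar{x}\in(0,|x'|)$, by Abel's theorem, the series
$$\sum_{n=1}^{\infty}na_n\bar{x}^{n-1}$$
would converge absolutely. Now,
$$|a_n\bar{x}^n|=\left|na_n\bar{x}^{n-1}\cdot\frac{\bar{x}}{n}\right|=|na_n\bar{x}^{n-1}|\cdot\frac{\bar{x}}{n}.$$
Since
$$\sum_{n=1}^{\infty}|na_n\bar{x}^{n-1}|$$
converges and the sequence
$$\left\{\frac{\bar{x}}{n}\right\}$$
is monotone and bounded, Abel's test shows that
$$\sum_{n=1}^{\infty}|a_n\bar{x}^n|$$
converges. This contradicts the assumption that (1) converges only at $x=0$. Therefore, (2) also converges only at $x=0$, and the proposition holds in this case.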
Now let the interval of convergence of power series (1) be $(-R,R)$, where $R\neq 0$. Let $x_0$ be any nonzero point in $(-R,R)$. As in the proof of Abel's theorem, there exist positive numbers $M$ and $r$ $(0<r<1)$ such that for all positive integers $n$,
$$|a_nx_0^n|<Mr^n.$$
So,
$$|na_nx_0^{n-1}|=\frac{n}{|x_0|}|a_nx_0^n|<\frac{M}{|x_0|}nr^n.$$
By the ratio test, the series
$$\sum_{n=0}^{\infty}nr^n$$
converges. Therefore, by the comparison test, the series
$$\sum_{n=1}^{\infty}|na_nx_0^{n-1}|$$
converges; that is, power series (2) converges absolutely (and hence converges) at the point $x_0$. Since $x_0$ is an arbitrary point in $(-R,R)$, power series (2) converges on the interval $(-R,R)$.
Next, we prove that power series (2) diverges for all $x$ satisfying $|x|>R$. Suppose (2) converges at a point $x_0$ with $|x_0|>R$; then there exists a number $\bar{x}$ such that $|x_0|>|\bar{x}|>R$. By Abel's theorem, power series (2) converges absolutely at $x=\bar{x}$. However, when $n\ge|\bar{x}|$, we have
$$|na_n\bar{x}^{n-1}|=\frac{n}{|\bar{x}|}|a_n\bar{x}^n|\ge|a_n\bar{x}^n|.$$
By the comparison test, power series (1) converges absolutely at $x=\bar{x}$. This contradicts the fact that the interval of convergence of power series (1) is $(-R,R)$. Thus, power series (2) diverges for all $x$ satisfying $|x|>R$. In summary, power series (1) and (2) have the same interval of convergence.
Regarding term-by-term differentiation and integration of power series, we have the following theorem:
Theorem: Suppose the interval of convergence of series (1) is $(-R,R)$, and denote its sum function on this interval by $f(x)$. If $x$ is any point in $(-R,R)$, then:
(i) $f(x)$ is differentiable at the point $x$, and
$$f'(x)=\sum_{n=1}^{\infty}na_nx^{n-1};$$
(ii) $f(x)$ is integrable over the interval between $0$ and $x$, and
$$\int_0^x f(t)\,dt=\sum_{n=0}^{\infty}\frac{a_n}{n+1}x^{n+1}.$$
Proof: Series (1), (2), and (3) have the same radius of convergence $R$, each term of series (1) has a continuous derivative, and all three power series converge uniformly on every closed subinterval of $(-R,R)$. Given $x\in(-R,R)$, choose a closed interval $[a,b]\subset(-R,R)$ containing both $0$ and $x$; on $[a,b]$ the hypotheses of the term-by-term differentiation and integration theorems for function series are satisfied. Therefore, the theorem holds.
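On the level of coefficients, term-by-term differentiation and integration are simple list operations. The sketch below applies them to a truncated series; the function names, and the use of the first 60 terms of the geometric series $\sum x^n=\frac{1}{1-x}$ as the example, are assumptions of this illustration.

```python
import math

def differentiate(coeffs):
    """Coefficients of sum n * a_n * x^(n-1), given [a_0, a_1, a_2, ...]."""
    return [n * a for n, a in enumerate(coeffs)][1:]

def integrate(coeffs):
    """Coefficients of sum a_n / (n+1) * x^(n+1); the constant term is 0."""
    return [0.0] + [a / (n + 1) for n, a in enumerate(coeffs)]

def evaluate(coeffs, x):
    """Evaluate the truncated power series at x using Horner's scheme."""
    result = 0.0
    for a in reversed(coeffs):
        result = result * x + a
    return result

if __name__ == "__main__":
    geometric = [1.0] * 60  # truncated series for 1 / (1 - x), |x| < 1
    x = 0.5
    print(evaluate(differentiate(geometric), x), 1.0 / (1.0 - x) ** 2)
    print(evaluate(integrate(geometric), x), -math.log(1.0 - x))
```

Inside the interval of convergence the differentiated series reproduces $\frac{1}{(1-x)^2}$ and the integrated series reproduces $-\ln(1-x)$, as the theorem predicts.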
From the above theorem, we can derive the following corollaries:
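Corollary 1: Let $f(x)$ be the sum function of series (1) on the interval of convergence $(-R,R)$. Then $f(x)$ has derivatives of every order on $(-R,R)$, and the series can be differentiated term by term any number of times, i.e.,
$$f'(x)=a_1+2a_2x+3a_3x^2+\cdots+na_nx^{n-1}+\cdots,$$
$$f''(x)=2a_2+3\cdot 2\,a_3x+\cdots+n(n-1)a_nx^{n-2}+\cdots,$$
$$\cdots$$
$$f^{(n)}(x)=n!\,a_n+(n+1)n(n-1)\cdots 2\,a_{n+1}x+\cdots$$
Corollary 2: Let $f(x)$ be the sum function of series (1) on the interval of convergence $(-R,R)$. Then the coefficients of series (1) are determined by the derivatives of $f(x)$ at $x=0$:
$$a_0=f(0),\qquad a_n=\frac{f^{(n)}(0)}{n!}\quad(n=1,2,\dots).$$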
Equality of Power Series: If the power series $\sum_{n=0}^{\infty}a_nx^n$ and $\sum_{n=0}^{\infty}b_nx^n$ have the same sum function in a neighborhood of $x=0$, then the two power series are said to be equal in that neighborhood.
Since the coefficients of a power series are determined by the sum function and its derivatives at $x=0$, it follows from this definition that two power series are equal in a neighborhood of $x=0$ if and only if their coefficients are equal, i.e., $a_n=b_n$ for all $n$.
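Arithmetic Operations on Power Series: Suppose the power series $\sum_{n=0}^{\infty}a_nx^n$ and $\sum_{n=0}^{\infty}b_nx^n$ have radii of convergence $R_a$ and $R_b$, respectively. Then:
$$\lambda\sum_{n=0}^{\infty}a_nx^n=\sum_{n=0}^{\infty}\lambda a_nx^n,\quad|x|<R_a,$$
$$\sum_{n=0}^{\infty}a_nx^n\pm\sum_{n=0}^{\infty}b_nx^n=\sum_{n=0}^{\infty}(a_n\pm b_n)x^n,\quad|x|<R,$$
$$\left(\sum_{n=0}^{\infty}a_nx^n\right)\left(\sum_{n=0}^{\infty}b_nx^n\right)=\sum_{n=0}^{\infty}c_nx^n,\quad|x|<R,$$
where $\lambda$ is a constant, $R=\min\{R_a,R_b\}$, and $c_n=\sum_{k=0}^{n}a_kb_{n-k}$.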
These properties can be derived from the corresponding properties of numerical series.
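The product coefficients $c_n=\sum_{k=0}^{n}a_kb_{n-k}$ form the Cauchy product of the two coefficient sequences. A minimal sketch for truncated coefficient lists follows; the function name `cauchy_product` and the example series are assumptions of this illustration.

```python
def cauchy_product(a, b):
    """Return c_n = sum_{k=0}^{n} a[k] * b[n - k] for n < min(len(a), len(b)).

    Only indices covered by both truncated lists are returned, so every
    reported c_n is exact for the underlying infinite series.
    """
    n_terms = min(len(a), len(b))
    return [sum(a[k] * b[n - k] for k in range(n + 1)) for n in range(n_terms)]

if __name__ == "__main__":
    # Assumed example: 1/(1-x) times 1/(1-x) gives sum (n+1) x^n.
    ones = [1] * 5
    print(cauchy_product(ones, ones))
```

For the geometric series multiplied by itself, the result is the coefficient list of $\frac{1}{(1-x)^2}=\sum_{n=0}^{\infty}(n+1)x^n$.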
The above discussion regarding power series actually serves as a preliminary study for Taylor series. Although it might seem extensive, it's essential for understanding Taylor series thoroughly.
In the section on Taylor's theorem, we saw that if a function $f(x)$ has derivatives up to order $n+1$ in a neighborhood of $x_0$, then in that neighborhood
$$f(x)=T_n(x)+R_n(x),$$
where $R_n(x)$ is the Lagrange remainder. If $f(x)$ has derivatives of all orders at $x_0$, we can formally write down the power series
$$f(x_0)+f'(x_0)(x-x_0)+\frac{f''(x_0)}{2!}(x-x_0)^2+\cdots+\frac{f^{(n)}(x_0)}{n!}(x-x_0)^n+\cdots$$
This series is called the Taylor series of the function $f(x)$ at the point $x_0$. Now we face the question: what is the relationship between $f(x)$ and its Taylor series at $x_0$? If $f(x)$ is defined in a neighborhood $U(x_0,\delta)$ and its Taylor series converges in the interval $U(x_0,R)$, does $f(x)$ equal the sum function of its Taylor series in $U(x_0,\delta)\cap U(x_0,R)$?
Theorem: Suppose $f(x)$ has derivatives of all orders at the point $x_0$. Then $f(x)$ equals the sum function of its Taylor series on the interval $(x_0-r,x_0+r)$ if and only if, for all $x$ satisfying $|x-x_0|<r$,
$$\lim_{n\to\infty}R_n(x)=0.$$
Here, $R_n(x)$ denotes the Lagrange remainder term of $f(x)$ at $x_0$.
Proof: Given that $f(x)$ has derivatives of all orders at $x_0$, by Taylor's theorem, for any $x\in U(x_0,r)$, where $r\le R$, we have $f(x)=T_n(x)+R_n(x)$, and $T_n(x)$ is the $n$th partial sum of the Taylor series.
[Necessity]: If $f(x)=\lim_{n\to\infty}T_n(x)$, then $R_n(x)=f(x)-T_n(x)\to 0$ as $n\to\infty$.
[Sufficiency]: If $\lim_{n\to\infty}R_n(x)=0$, then $f(x)=\lim_{n\to\infty}T_n(x)$.
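As an illustration of the condition $R_n(x)\to 0$, take the assumed example $f(x)=\sin x$ at $x_0=0$. Every derivative of $\sin$ is bounded by 1 in absolute value, so the Lagrange remainder satisfies $|R_n(x)|\le\frac{|x|^{n+1}}{(n+1)!}$, which tends to zero for every fixed $x$. The sketch below evaluates this bound; the function names are illustrative.

```python
import math

def taylor_sin(x, n):
    """Taylor polynomial T_n of sin at x0 = 0 (only odd powers up to n appear)."""
    return sum((-1) ** (k // 2) * x ** k / math.factorial(k)
               for k in range(1, n + 1, 2))

def remainder_bound_sin(x, n):
    """Lagrange bound |R_n(x)| <= |x|^(n+1) / (n+1)!, since |sin^(n+1)| <= 1."""
    return abs(x) ** (n + 1) / math.factorial(n + 1)

if __name__ == "__main__":
    for n in (5, 10, 20):
        actual = abs(math.sin(2.0) - taylor_sin(2.0, n))
        print(n, actual, remainder_bound_sin(2.0, n))
```

Because the bound tends to zero for every real $x$, the theorem implies that $\sin x$ equals the sum of its Taylor series on the whole real line.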
Taylor Expansion
If the function $f(x)$ can be expressed as the sum function of its Taylor series in a neighborhood of the point $x_0$, we say that $f(x)$ can be expanded into a Taylor series in that neighborhood. The right side of the equation
$$f(x)=f(x_0)+f'(x_0)(x-x_0)+\frac{f''(x_0)}{2!}(x-x_0)^2+\cdots+\frac{f^{(n)}(x_0)}{n!}(x-x_0)^n+\cdots$$
is called the Taylor expansion of $f(x)$ at the point $x_0$.
According to Corollary 2 above, if the function $f(x)$ is the sum function of a power series
$$\sum_{n=0}^{\infty}a_nx^n$$
on the interval of convergence $(-R,R)$, then the coefficients of $\sum_{n=0}^{\infty}a_nx^n$ are determined by $f(x)$ and the values of its derivatives at $x=0$, that is,
$$a_0=f(0),\qquad a_n=\frac{f^{(n)}(0)}{n!}\quad(n=1,2,\dots).$$
In other words, $\sum_{n=0}^{\infty}a_nx^n$ is the Taylor expansion of $f(x)$ on $(-R,R)$.
The above is the process of analyzing these four mathematical concepts. In the part of Taylor series, the relevant concepts of power series are elaborated in detail. Here we summarize these four concepts:
Summary: Chronologically, we usually first meet the Taylor formula in the chapter on the mean value theorems of differential calculus in the first volume of a mathematical analysis course. This is the Taylor formula with Peano remainder, which grows out of the finite increment formula for the derivative:
$$f(x)=f(x_0)+f'(x_0)(x-x_0)+o(x-x_0).$$
When the finite increment formula is used for estimation, the error is $o(x-x_0)$, which is quite rough. If the function is known to have a derivative of order $n$ at the point $x_0$, the finite increment formula generalizes to the Taylor formula with Peano remainder, whose estimation error is $o((x-x_0)^n)$. To improve the estimate further and to analyze the size of the error quantitatively, the Taylor formula with Lagrange remainder is introduced. Its hypotheses are stricter: $f(x)$ must have derivatives up to order $n+1$ in a neighborhood of $x_0$, not merely at the point $x_0$. The corresponding error is
$$\frac{f^{(n+1)}(\xi)}{(n+1)!}(x-x_0)^{n+1}.$$
The Lagrange remainder allows a quantitative analysis of the error, which the Peano remainder does not.
Following the order of the textbook, after studying series, function series, and power series in the second volume of mathematical analysis, we can introduce the concept of Taylor series. If the function $f(x)$ has derivatives of every order at some point $x_0$, then we can write down the series
$$\sum_{n=0}^{\infty}\frac{f^{(n)}(x_0)}{n!}(x-x_0)^n.$$
This series is called the Taylor series of $f(x)$ at the point $x_0$. As a special power series, the Taylor series has a domain of convergence, and we gave the necessary and sufficient condition for $f(x)$ to equal the sum function of its Taylor series in a neighborhood $U(x_0,r)$: $\lim_{n\to\infty}R_n(x)=0$ for every $x$ in that neighborhood.
This leads to the concept of Taylor expansion. The Taylor expansion is the power series into which $f(x)$ can be expanded in a certain neighborhood. In this neighborhood, $f(x)$ is completely characterized by the power series. Unlike the polynomial approximations above, this is a genuine equality rather than an approximation. However, power series have a limitation: because the domain of convergence of a power series is often a finite interval, the expansion may represent $f(x)$ only on a limited interval. This limitation is one of the motivations for the study of Fourier series.