Math Refresher: Introduction to integrals

If derivatives enable us to identify the slope at any point on a continuous function, integrals allow us to calculate the total area underneath it.

In previous blogs, I reviewed the foundations of derivatives and some of their advanced properties. In today's blog, I will examine the area under the curve problem and show how definite integrals solve this problem.

The content in today's blog is largely taken from the Wikipedia article on Riemann sums and Edwards and Penney's textbook Calculus and Analytic Geometry.

For purposes of today's blog, I will use the example of the curve f(x) = x³. I will focus on the problem of determining the area bounded by the curve, the x-axis (y=0), and the vertical lines x=0 and x=2. The question at hand is how to find a general method for determining the total area within these constraints.

One approach to this problem is to use a Riemann sum. Riemann sums were first proposed by Bernhard Riemann, and today they are a major part of the foundation of calculus.

Now, let me offer a formal definition of a Riemann sum. If you are not interested in formal definitions, feel free to skip this. I will include a more intuitive summary afterwards.

Definition 1: Partition

Let I = [a,b] be a closed interval. [See Definition 3, here for definition of closed interval]

A set P is said to be a partition of I if P is a set of nonempty subintervals P = { [a,x₁), [x₁, x₂), ... , [x_n-1,b] } where n, the number of subintervals, is a positive integer and a is less than x₁ which is less than x₂ ... which is less than x_n-1 which is less than b.

Definition 2: Riemann sum

Let f(x) be a real-valued function on a closed interval I=[a,b], that is, f: I → R. Let P be a partition of the interval I.

The Riemann sum S of f over I with respect to a partition P is:

S = ∑ (j=1,n) f(c_j)(x_j - x_j-1)

where x_0 = a, x_n = b, and c_j is an arbitrary point in [x_j-1, x_j].

If c_j is a point where f attains its minimum on [x_j-1, x_j] for all j, then S is a lower Riemann sum, which I will denote as L(P).

If c_j is a point where f attains its maximum on [x_j-1, x_j] for all j, then S is an upper Riemann sum, which I will denote as U(P).

The important idea here is that a Riemann sum consists of dividing up the interval under the graph of a function into an arbitrary number of subintervals and summing the areas of the rectangles built on them.

Here's a diagram of an upper Riemann sum U(P) where the rectangle corresponds with the maximum point in the interval.

Here is a diagram of a lower Riemann sum L(P) where the rectangle corresponds with the minimum point in the interval.

Now, it should be clear that as we increase the number of subintervals in a partition, both the L(P) and U(P) get closer to the true area under the curve. In fact, we have hit our answer when U(P) = L(P).
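As a rough numerical illustration of this squeeze, here is a short Python sketch that computes L(P) and U(P) for f(x) = x³ on [0,2] using n equal subintervals. The function names are my own, and the code assumes that f is increasing on the interval, so that the minimum on each subinterval sits at the left endpoint and the maximum at the right endpoint.

```python
def f(x):
    return x ** 3

def lower_upper_sums(f, a, b, n):
    """Return (L, U) for an increasing f on [a, b] with n equal subintervals."""
    dx = (b - a) / n
    xs = [a + i * dx for i in range(n + 1)]
    # Assumption: f is increasing, so min is at the left end and max at the right end.
    lower = sum(f(xs[i]) * dx for i in range(n))
    upper = sum(f(xs[i + 1]) * dx for i in range(n))
    return lower, upper

for n in (4, 16, 64, 256):
    lo, hi = lower_upper_sums(f, 0.0, 2.0, n)
    print(n, lo, hi, hi - lo)
```

For this particular curve, the gap U(P) - L(P) works out to 16/n, so it shrinks steadily as n grows.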

The goal of the rest of this blog is to demonstrate that if we continue to increase the number of subintervals, we eventually get to this point. Then, I use Riemann sums to determine the area under the curve where f(x) = x³.

Lemma 1: For any partition P of [a,b], L(P) ≤ R(P) ≤ U(P), where R(P) denotes any Riemann sum of f over P.

Proof:

(1) Let p_j be a point where f attains its minimum on each interval [x_j-1, x_j]

(2) Let q_j be a point where f attains its maximum on each interval [x_j-1, x_j]

(3) It follows that for all j, f(p_j) ≤ f(q_j).

(4) It therefore follows that for all j, f(p_j)(x_j - x_j-1) ≤ f(q_j)(x_j - x_j-1).

(5) And therefore that ∑ (j=1, n)f(p_j)(x_j - x_j-1) ≤ ∑ (j=1,n)f(q_j)(x_j - x_j-1).

(6) Now let R(P) be any Riemann sum over P with sample points c_j. For all j, f(p_j) ≤ f(c_j) ≤ f(q_j).

(7) Applying the same steps as before (step #4 and step #5), we can conclude that:

L(P) ≤ R(P) ≤ U(P)

QED
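To make Lemma 1 concrete, here is a small Python sketch that builds a lower sum, an upper sum, and a Riemann sum with randomly chosen sample points over an uneven partition of [0,2]. The partition and the helper name are hypothetical choices of mine, and the code again assumes f is increasing (true for x³ on [0,2]) so that the endpoints give the minimum and maximum.

```python
import random

def f(x):
    return x ** 3

def riemann_sums(f, points, rng):
    """Return (L, R, U) for an increasing f over the partition with sorted endpoints `points`."""
    lower = middle = upper = 0.0
    for left, right in zip(points, points[1:]):
        width = right - left
        c = rng.uniform(left, right)   # arbitrary sample point c_j in the subinterval
        lower += f(left) * width       # minimum of an increasing f
        middle += f(c) * width
        upper += f(right) * width      # maximum of an increasing f
    return lower, middle, upper

rng = random.Random(0)
points = [0.0, 0.3, 0.9, 1.0, 1.6, 2.0]   # an uneven partition of [0, 2]
L, R, U = riemann_sums(f, points, rng)
print(L, R, U)
```

Whatever sample points the random generator picks, the middle value stays between the other two.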

What makes Riemann sums interesting is that any partition P can be refined. That is, we can increase the number of subintervals and get an answer that is closer to the true area under the curve.

Definition 3: Refinement of a Partition

A partition P' is said to be a refinement of a partition P if and only if each subinterval of P' is contained in some subinterval of P.

In other words, each interval in P now corresponds to a unique set of one or more intervals in P' and the total number of intervals in P' is greater or equal to the total number of intervals in P.

Lemma 2: If P' is a refinement of P, then L(P) ≤ L(P') ≤ U(P') ≤ U(P)

Proof:

(1) From Lemma 1 above, we know that:

L(P') ≤ U(P') and L(P) ≤ U(P).

(2) Assume that a partition P' is derived from a partition P by dividing up the kth subinterval [x_k-1, x_k] of P into [x_k-1,z] and [z, x_k] by introducing a point z.

(3) From this perspective, the only difference between L(P) and L(P') is that the term f(p_k)*(x_k - x_k-1) is now replaced by:

f(u)*(z - x_k-1) + f(v)*(x_k - z) where u,v are the minimum points on [x_k-1,z] and [z, x_k] respectively.

If we set L'(P) = L(P) - f(p_k)*(x_k - x_k-1), then we have:

L(P) = L'(P) + f(p_k)*(x_k - x_k-1)

L(P') = L'(P) + f(u)*(z - x_k-1) + f(v)*(x_k - z)

(4) Since f(p_k) is the minimum value of f on [x_k-1, x_k], and u, v both lie in that interval, we have:

f(p_k) ≤ f(u)

f(p_k) ≤ f(v)

(5) Hence, it follows that:

f(u)*(z - x_k-1) + f(v)*(x_k - z) ≥ f(p_k)*(z - x_k-1) + f(p_k)*(x_k - z) = f(p_k)*(z - x_k-1 + x_k - z) = f(p_k)*(x_k - x_k-1)

(6) From step #3, we have :

L'(P) + f(u)*(z - x_k-1) + f(v)*(x_k - z) ≥ L'(P) + f(p_k)*(x_k - x_k-1)

So that:

L(P') ≥ L(P)

(7) We can use the same exact reasoning to establish that U(P') ≤ U(P).

(8) So that we have:

L(P) ≤ L(P') [step #6]
L(P') ≤ U(P') [Lemma 1 above]
U(P') ≤ U(P) [Step #7]

QED
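Here is a quick Python check of Lemma 2 on a small example. I take a partition of [0,2] and refine it by adding one point z = 1.5, just as in step #2 of the proof. The partition values and function names are mine, and the code assumes an increasing f so that the endpoints give the minimum and maximum.

```python
def f(x):
    return x ** 3

def lower_upper(f, points):
    """Return (L, U) for an increasing f over the partition with sorted endpoints `points`."""
    pairs = list(zip(points, points[1:]))
    lower = sum(f(left) * (right - left) for left, right in pairs)
    upper = sum(f(right) * (right - left) for left, right in pairs)
    return lower, upper

P = [0.0, 0.5, 1.0, 2.0]
P_refined = sorted(P + [1.5])   # split the subinterval [1, 2] at z = 1.5

L_P, U_P = lower_upper(f, P)
L_Pr, U_Pr = lower_upper(f, P_refined)
print(L_P, L_Pr, U_Pr, U_P)     # 1.0625 2.25 6.25 8.5625
```

The lower sum moves up and the upper sum moves down, which is exactly the chain L(P) ≤ L(P') ≤ U(P') ≤ U(P).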

Lemma 3: Limit to the lower Riemann sum

Let P_n denote the partition of the interval [a,b] that consists of 2ⁿ subintervals of equal length.

Then, there exists a value I = lim (n → ∞) L(P_n)

Proof:

(1) By Definition 3 above, each P_n+1 is a refinement of P_n.

(2) Using Lemma 2 above:

L(P₁) ≤ L(P₂) ≤ ... ≤ L(P_n) ≤ ...

(3) The sequence { L(P_n) } is a nondecreasing monotonic sequence of real numbers. [See Definition 8, here for a definition of nondecreasing monotonic sequence]

(4) The sequence is clearly bounded since:

(a) Let D = the minimum value of f on [a,b] (See Theorem, here for proof that a minimum exists)

(b) Let U = the maximum value of f on [a,b] (See Lemma 3, here for a proof that the maximum exists)

(c) Since L(P_n) = ∑ (i = 1, 2ⁿ) f(p_i)(x_i - x_i-1), D ≤ f(p_i) ≤ U for all i, and ∑ (i = 1, 2ⁿ) (x_i - x_i-1) = b - a, it is clear that:

D*(b - a) ≤ L(P_n) ≤ U*(b - a)

(5) Therefore, I must exist since a bounded monotone sequence of real numbers must converge. [See Lemma 1, here]

QED
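The sequence in Lemma 3 is easy to watch numerically. The Python sketch below computes L(P_n) for f(x) = x³ on [0,2], where P_n has 2ⁿ equal subintervals. The function name is my own, and I am assuming f is increasing so that the left endpoint gives the minimum on each subinterval.

```python
def f(x):
    return x ** 3

def lower_sum_dyadic(f, a, b, n):
    """L(P_n) for an increasing f, where P_n has 2**n equal subintervals."""
    count = 2 ** n
    dx = (b - a) / count
    return sum(f(a + i * dx) * dx for i in range(count))

values = [lower_sum_dyadic(f, 0.0, 2.0, n) for n in range(1, 11)]
for n, value in enumerate(values, start=1):
    print(n, value)
```

The printed values never decrease and never exceed U*(b - a) = 8*2 = 16, so the sequence is monotone and bounded as the lemma requires. For this curve they creep up toward 4.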

Definition 4: mesh(P)

The mesh of a partition P is the length of its longest subinterval, that is, the largest value of x_i - x_i-1.

If all the subintervals that make up the Riemann sum are the same, then mesh(P) = this value. If they are not the same, then mesh(P) = the largest subinterval.
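In code, the mesh is a one-liner. The sketch below uses a hypothetical helper of mine that takes the sorted endpoints of a partition and returns the longest subinterval length.

```python
def mesh(points):
    """Largest subinterval length of the partition with sorted endpoints `points`."""
    return max(right - left for left, right in zip(points, points[1:]))

uniform = [0.0, 0.5, 1.0, 1.5, 2.0]   # all subintervals have length 0.5
uneven = [0.0, 0.25, 1.0, 2.0]        # lengths 0.25, 0.75, 1.0

print(mesh(uniform), mesh(uneven))    # 0.5 1.0
```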

Lemma 4:

For any given real ε greater than 0, there exists a real δ greater than 0 such that if P is a partition of [a,b] with mesh(P) less than δ and P' is a refinement of P, then:

abs(L(P) - U(P)) is less than ε/3 and abs(R(P) - R(P')) is less than ε/3

for any two Riemann sums R(P) associated with P and R(P') associated with P'.

Proof:

(1) Let f be a continuous function on the closed interval [a,b]

(2) Since [a,b] is bounded and closed, [a,b] is compact. [See Heine-Borel Theorem, here]

(3) Since [a,b] is compact, f is uniformly continuous on [a,b]. [See Heine-Cantor Theorem, here]

(4) So, there exists a number δ greater than 0 such that if:

abs(u - v) is less than δ, then abs(f(u) - f(v)) is less than ε/[3(b - a)]. [See Definition 1, here for definition of uniformly continuous functions]

(5) Suppose now that P is a partition of [a,b] with mesh(P) less than δ.

(6) Then, abs(U(P) - L(P)) = ∑ (i=1,n) abs(f(q_i) - f(p_i))Δx_i which is less than ε/[3(b-a)]∑(i=1,n) Δx_i = ε/3 [Since ∑(i=1,n) Δx_i= (b - a)].

This is valid since abs(p_i - q_i) is less than δ since mesh(P) is less than δ and p_i and q_i belong to the same interval.

(7) By Lemma 1 above, L(P) ≤ R(P) ≤ U(P) where R(P) = any Riemann sum and this is also true of L(P') ≤ R(P') ≤ U(P') where P' is a refinement of P.

(8) By Lemma 2 above, L(P) ≤ L(P') ≤ U(P') ≤ U(P).

(9) Combining #7 and #8, we can see that R(P) and R(P') are both within the interval [L(P), U(P)] which means that abs(R(P) - R(P')) is less than ε/3 since abs(U(P) - L(P)) is less than ε/3 [From step #6]

QED
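To see how ε and δ fit together in Lemma 4, here is a Python sketch for f(x) = x³ on [0,2]. It relies on an assumption that is not part of the proof above: the slope of x³ on [0,2] is at most 12, so abs(f(u) - f(v)) ≤ 12*abs(u - v), which lets me pick δ = ε/[3(b - a)*12] explicitly. The names and the value of ε are my own choices.

```python
import math

def f(x):
    return x ** 3

def upper_minus_lower(f, a, b, n):
    """U(P) - L(P) for an increasing f on [a, b] with n equal subintervals."""
    dx = (b - a) / n
    lower = sum(f(a + i * dx) * dx for i in range(n))
    upper = sum(f(a + (i + 1) * dx) * dx for i in range(n))
    return upper - lower

a, b = 0.0, 2.0
eps = 0.3
slope_bound = 12.0                          # assumed bound on the slope of x**3 on [0, 2]
delta = eps / (3 * (b - a) * slope_bound)   # abs(u - v) < delta forces abs(f(u) - f(v)) < eps / (3 * (b - a))
n = math.ceil((b - a) / delta) + 1          # enough equal subintervals that mesh(P) < delta
gap = upper_minus_lower(f, a, b, n)
print(n, (b - a) / n, delta, gap, eps / 3)
```

With the mesh below δ, the gap between the upper and lower sums comes out below ε/3, as step #6 of the proof predicts.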

This leads us to a definition of a definite integral.

Definition 5: Definite Integral (or Riemann Integral)

∫(a,b) f(x)dx = lim (n → ∞) ∑ (j=1,n) f(c_j)(x_j - x_j-1)

I will try to explain each part of the definition since it is very formal. The ∫ is the notation introduced by Leibniz to indicate the sum of a continuous function (as opposed to ∑ which is the sum of a discrete set of values).

The (a,b) gives the closed interval [a,b] over which we integrate. The b should really be at the top of the ∫ and the a at the bottom, but I am stuck with the limitations of html so this is the notation that I use.

dx is Leibniz's notation for Δx. The idea here is that the integral is the sum of rectangles, that is, an infinitely small width (dx) multiplied with the height f(x) as x varies from a to b.

Putting this together just says the "sum of the continuous function in the closed interval [a,b]".

So, we are defining the integral to mean the limit of the Riemann sum as the partition mesh gets closer and closer to 0.

Lemma 5: abs(a + b) ≤ abs(a) + abs(b)

Proof:

(1) If a,b are both nonnegative, then abs(a + b) = a + b = abs(a) + abs(b).

(2) If a,b are both negative, then abs(a + b) = -(a + b) = -a + -b = abs(a) + abs(b)

(3) If a+b is nonnegative and a is negative, then b is positive and abs(a + b) = a + b = abs(b) - abs(a), which is less than abs(b) + abs(a). [We can make a parallel argument if a+b is nonnegative and b is negative]

(4) If a+b is negative and b is nonnegative, then a is negative and abs(a + b) = -(a + b) = -a - b = abs(a) - abs(b), which is at most abs(a) + abs(b). [We can make a parallel argument if a+b is negative and a is nonnegative]

QED

Corollary 5.1: abs(a + b + ....) ≤ abs(a) + abs(b) + ...

Proof:

(1) We know that abs(a₁ + a₂) ≤ abs(a₁) + abs(a₂) from Lemma 5 above.

(2) Assume that it is true for up to some n so that we have:

abs(a₁ + a₂ + ... + a_n) ≤ abs(a₁) + abs(a₂) + ... + abs(a_n)

(3) Let b = a₁ + a₂ + ... + a_n

(4) abs(b + a_n+1) ≤ abs(b) + abs(a_n+1) [From Lemma 5 above]

(5) Replacing b with a₁ + a₂ + ... + a_n gives us:

abs(a₁ + a₂ + ... + a_n + a_n+1) ≤ abs(a₁) + abs(a₂) + ... + abs(a_n) + abs(a_n+1)

QED
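As a small sanity check on Corollary 5.1, the Python sketch below measures how much room the inequality leaves for a list of terms. The helper name is hypothetical, and I use integers so that the arithmetic is exact.

```python
def triangle_gap(terms):
    """abs(a1) + abs(a2) + ... minus abs(a1 + a2 + ...); Corollary 5.1 says this is never negative."""
    return sum(abs(t) for t in terms) - abs(sum(terms))

print(triangle_gap([1, 2, 3]))     # 0: same signs, so equality holds
print(triangle_gap([3, -5, 2]))    # 10: mixed signs cancel inside abs(3 - 5 + 2)
print(triangle_gap([-4, -1, 7]))   # 10
```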

Theorem: For any continuous function, the Riemann sum converges to the area under the curve.

Let I = the area under the curve. Let f be a function continuous on the closed interval [a,b]

For any positive real ε, there exists a positive real δ such that if P is a partition of [a,b] with mesh(P) less than δ, then abs(I - R(P)) is less than ε, where R(P) is any Riemann sum based on the partition P.

Proof:

(1) Let ε be a real greater than 0.

(2) Let δ be a real greater than 0 based on the Lemma 4 above.

(3) By Lemma 3 above, L(P_n) converges to I, so there exists a δ₂ greater than 0 such that if we choose an n with mesh(P_n) less than min(δ,δ₂), then abs(L(P_n) - I) is less than ε/3.

(4) Let P = P_n and let P' be a refinement on P.

(5) Then, abs(R(P) - R(P')) ≤ ε/3. [From Lemma 4 above]

(6) Further, since R(P') is in between L(P') and U(P') [from Lemma 1 above] and L(P) ≤ L(P') ≤ U(P') ≤ U(P) [from Lemma 2 above], it follows that:

abs(L(P) - R(P')) ≤ abs(L(P) - U(P)) ≤ ε/3 [From Lemma 4 above]

(7) abs(I - R(P)) = abs(I - L(P) + L(P) - R(P') + R(P') - R(P))

(8) Using Corollary 5.1 above, we have:

abs(I - L(P) + L(P) - R(P') + R(P') - R(P)) ≤ abs(I - L(P)) + abs(L(P) - R(P')) + abs(R(P') - R(P))

(9) Since abs(I - L(P)) is less than ε/3 [step #3 since P = P_n] and abs(L(P) - R(P')) is less than ε/3 [step #6] and abs(R(P') - R(P)) is less than ε/3 [step #5], we have:

abs(I - R(P)) ≤ abs(I - L(P)) + abs(L(P) - R(P')) + abs(R(P') - R(P)) which is less than ε/3 + ε/3 + ε/3 = ε

QED
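The theorem says the choice of partition and sample points does not matter once the mesh is small. Here is a Python sketch that tests this on f(x) = x³ over [0,2] with randomly placed cut points and random sample points. The helper name is mine, and the comparison value 4 is the area that the worked example in this post arrives at. For an increasing f, the distance from the area is at most U(P) - L(P), which in turn is at most mesh(P)*(f(b) - f(a)) = 8*mesh(P) here.

```python
import random

def f(x):
    return x ** 3

def random_riemann_sum(f, a, b, n, rng):
    """Riemann sum over n random-width subintervals with random sample points; also returns the mesh."""
    cuts = sorted(rng.uniform(a, b) for _ in range(n - 1))
    points = [a] + cuts + [b]
    total = 0.0
    for left, right in zip(points, points[1:]):
        c = rng.uniform(left, right)        # arbitrary sample point in the subinterval
        total += f(c) * (right - left)
    mesh = max(right - left for left, right in zip(points, points[1:]))
    return total, mesh

rng = random.Random(0)
for n in (10, 100, 1000, 10000):
    total, mesh = random_riemann_sum(f, 0.0, 2.0, n, rng)
    print(n, mesh, total, abs(total - 4.0))
```

As the mesh shrinks, the random Riemann sums settle down near 4 even though no two runs use the same partition.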

Example: Using a Riemann sum to estimate the area under f(x) = x³ on the closed interval [0,b]

(1) Using the definition of a Riemann sum, we can now use the Theorem above to get:

∫(0,b) f(x)dx = lim (n → ∞) ∑ (i=1,n) f(c_i)Δx

(2) If we divide up P into n equal subintervals, then we can set Δx = b/n and set c_i = ib/n. This then gives us:

∫(0,b) x³dx=lim (n → ∞) ∑ (i=1,n) (ib/n)³(b/n) = lim (n → ∞) ∑(i=1,n)(i³b⁴/n⁴)

(3) Since b and n do not depend on the summation index i, we can factor them out of the sum:

lim (n → ∞) ∑(i=1,n)(i³b⁴/n⁴) = lim (n → ∞) (b⁴/n⁴) ∑(i=1,n)(i³)

(4) Using Lemma 4, here,

lim (n → ∞) (b⁴/n⁴) ∑(i=1,n)(i³) = lim (n → ∞) (b⁴/n⁴)[(1/4)n⁴ + (1/2)n³ + (1/4)n²]

= lim (n → ∞) (b⁴)[(1/4) + (1/2)/n + (1/4)/n²]

(5) Since n is approaching ∞, this means that:

(1/2)/n and (1/4)/n² approach 0 giving us:

lim (n → ∞) (b⁴)[(1/4) + (1/2)/n + (1/4)/n²] = b⁴/4.
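The steps above can be checked numerically. The Python sketch below computes the Riemann sum from step #2 directly (with Δx = b/n and c_i = ib/n), compares it with the closed form from step #4, and shows both approaching b⁴/4. The function names are my own, and b = 2 is chosen to match the area problem from the start of this post.

```python
def riemann_sum_cubic(b, n):
    """Sum of (i*b/n)**3 * (b/n) for i = 1..n, as in step #2."""
    dx = b / n
    return sum((i * dx) ** 3 * dx for i in range(1, n + 1))

def closed_form(b, n):
    """b**4 * [(1/4) + (1/2)/n + (1/4)/n**2], as in step #4."""
    return b ** 4 * (0.25 + 0.5 / n + 0.25 / n ** 2)

b = 2.0
for n in (10, 100, 1000):
    print(n, riemann_sum_cubic(b, n), closed_form(b, n), b ** 4 / 4)
```

For b = 2 the limit is 2⁴/4 = 4, and the two computed columns agree with each other up to rounding while closing in on that value.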

This example shows the limitation of using Riemann sums to determine the integral. It makes a lot of sense as an explanation of how the integral relates to a limit. It presents problems when it comes down to applying it to a more general set of equations. If a summation formula exists, then the Riemann sum works fine. If a summation formula does not exist, then the Riemann sum, by itself, does not get us to a final result.

A much more powerful method for determining integrals is to view them as antiderivatives. That is, find a function whose derivative is f, and the integral follows from it. How do we know that this works? This very important result is called the Fundamental Theorem of Calculus and I will go over its proof in my next blog.

References

Riemann Sum, Wikipedia
Riemann Sum, PlanetMath.org
Edwards & Penney, Calculus and Analytic Geometry

Tuesday, September 12, 2006
