Math Refresher: 2007-05-06

The content in today's blog is taken from Schneider and Barker's Matrices and Linear Algebra.

If you need to review vectors (see Definition 1 here), vector spaces (see Definition 2, here), linear combinations (see Definition 1, here) or family of vectors (see Definition 4, here), start here.

Algorithm 1:

Given a family of linear independent vectors and a family that spans vectors in V.

Produce a family of linear indepedent vectors that serves as a basic for V.

Here are the steps:

(1) Let X = (x¹, ..., xⁿ) be a linearly independent family of vectors in V.

(2) Let Y = (y¹, ..., yⁿ) be a family of vectors that spans a vector space V.

(3) Let i = 1

(4) Let Z₀ = X

(5) If yⁱ is not an element of [[Z_i-1]], then let Z_i = (Z_i-1,yⁱ).

(6) If yⁱ is an element of [[Z_i-1]], then let Z_i = Z_i-1

(7) Let i = i + 1

(8) If i ≤ n, then go to step #4.

(9) The answer is Z_n

Done.

Theorem 1: Theorem of Exchange

Let V be a vector space. Let X = (x¹, ..., xⁿ) be a linearly independent family of vectors in V. Let Y = (y¹, ..., yⁿ) be a family of vectors that span V.

Then, we can form a new family of vectors Z consisting of all the vectors of X such that Z is a basis for V.

Proof:

(1) Since X = (x¹, ..., xⁿ) is linearly independent, we can conclude that for all i, xⁱ ≠ 0. [See Corollary 2.1, here]

(2) For each i, xⁱ is not a linear combination of its predecessors since X is linearly independent. [See Corollary 3.1, here]

(3) If yⁱ ∈ Z, then the family of predecessors of yⁱ is Z_i-1 and by construction yⁱ is not a linear combination of Z_i-1 (or else yⁱ would not be in Z).

(4) Hence, Z is linearly independent. [See Definition 1, here]

(5) Next, we need to show that Z spans V.

(6) By assumption V ⊆ [[Y]] since Y spans V.

(7) [[Y]] ⊆ [[X,Y]] [See Corollary 1.2, here]

(8) Further, each xⁱ is in Z, so that xⁱ ∈ [[Z]] (Since Z ⊆ [[Z]])

(9) If yⁱ ∈ Z, then again yⁱ ∈ [[Z]].

(10) But if yⁱ is not in Z, then yⁱ ∈ [[Z_i-1]] since y_i is added to Z_i only when yⁱ is not an element of [[Z_i-1]].

(11) [[Z_i-1]] ⊆ [[Z]] [See Corollary 1.2, here] since Z is obtained from Z_i-1 by adjoining vectors.

(12) Hence, in any case yⁱ ∈ [[Z]] since:

(a) Either yⁱ ∈ Z or yⁱ is not in Z.

(b) If yⁱ ∈ Z, then yⁱ ∈ [[Z]]

(c) If yⁱ is not in Z, then yⁱ ∈ [[Z_i-1]]

(d) Since [[Z_i-1]] ⊆ [[Z]], it follows that yⁱ ∈ [[Z]]

(13) It follows that each element of [[X,Y]] belongs to [[Z]] since we have shown that yⁱ ∈ [[X,Y]] implies that yⁱ ∈ [[Z]] [from step #12]

(14) This gives us that [[X,Y]] ⊆ [[Z]] [See Lemma 1, here]

(15) But each element of [[Z]] belongs to V which gives us that:

[[Z]] ⊆ V

(16) We obtain that:

V ⊆ [[Y]] ⊆ [[X,Y]] ⊆ [[Z]] ⊆ V.

(17) Hence, equality holds throughout and [[Z]] = V since we have shown that V ⊆ [[Z]] and [[Z]] ⊆ V.

QED

Corollary 1.1:

Assume that V is nonzero and let (y¹, ..., y^t) span V.

Then there is a basis for V consisting of some of the yⁱ

Proof:

(1) Assume y¹ ≠ 0

(2) Let X = (y¹)

(3) Let Y = (y¹, ..., y^t)

(4) Using Theorem 1 above, we know that it is possible to construct a family of vectors Z such that Z is a basis for V.

(5) From Algorithm 1 above, it is clear that this consists of some of the yⁱ.

QED

Theorem 2:

Let (x¹, .., x^t) be a linearly independent family of vectors, and let (y¹, ..., y^u) = Z span V.

Then t ≤ u.

Proof:

(1) Let Z₀ = Z

(2) Since X = (x¹, ..., x^t) is in V, it follows that x¹ ∈ V.

(3) Using Theorem 1 above, we can construct a family Z₁ = (x¹, y¹, ..., y^p) such that Z₁ is a basis for V.

(4) We can claim that p must be less than t since:

(a) Since x¹ ∈ V and (y¹, y², ..., y^t) spans V, there must exist α_i such that:

x¹ = α₁y¹ + ... + α_ty^t

(b) Therefore (x¹, y¹, ..., y^t) is linearly dependent (see Lemma 4, here)

(c) Therefore, using Algorithm 1 above, we would have removed one of the values of 1 .. t in order to create the set x¹, y¹, ..., y^p

(5) This gives us that p + 1 ≤ t which allows us to conclude that:

|Z₁| ≤ |Z₀|

(6) This proves the base case.

(7) Now, we need only show that the proposition is true for |Z_i-1| then it is also true for |Z_i|.

(8) Assume that we have a basis Z_i-1 = (x¹, ..., x^i-1,y¹, ..., y^r) for V where |Z_i-1| = r + i - 1 ≤ u ≤ |Z₀|.

(9) We can use Algorithm 1 to create a basis Z_i = (x¹, ..., xⁱ, y¹, ..., y^q) where y¹, .., y^q ⊆ y¹, ..., y^r.

(10) Clearly, y¹, ..., y^q cannot consist of all the elements of y¹, ..., y^r since (x¹, ..., x^i-1,y¹, ..., y^r) is a basis for V and xⁱ ∈ V.

(11) Therefore q is less than r so it follows that |Z_i| = i + q ≤ (i - 1) + r = |Z_i-1|

(12) After t such steps, we obtain a basis Z_t = (x¹, ..., x^t, y¹, ..., y^l) where |Z_t| = t + l ≤ |Z_t-1|

(13) Clearly, t ≤ |Z_t| and |Z_t| ≤ |Z_t-1| ≤ ... ≤ |Z₀| = u.

(14) Hence, t ≤ u and the theorem is proved.

QED

References

Hans Schneider, George Philip Barker, Matrices and Linear Algebra, 1989.

Math Refresher

Saturday, May 12, 2007

Theorem of Exchange

About Me

Blog Archive