Skip to main content

Section 31 Vector Spaces

Subsection Application: The Hat Puzzle

In a New York Times article (April 10, 2001) β€œWhy Mathematicians Now Care About Their Hat Color”, the following puzzle is posed.

β€œThree players enter a room and a red or blue hat is placed on each person's head. The color of each hat is determined by a coin toss, with the outcome of one coin toss having no effect on the others. Each person can see the other players' hats but not his own. No communication of any sort is allowed, except for an initial strategy session before the game begins. Once they have had a chance to look at the other hats, the players must simultaneously guess the color of their own hats or pass. The group shares a hypothetical $3 million prize if at least one player guesses correctly and no players guess incorrectly.”

The game can be played with more players, and the problem is to find a strategy for the group that maximizes its chance of winning. One strategy is for a designated player to make a random guess and for the others to pass. This gives a 50% chance of winning. However, there are much better strategies that provide a nearly 100% probability of winning as the number of players increases. One such strategy is based on Hamming codes and subspaces of a particular vector space to implement the most effective approach.

Subsection Introduction

We have previously seen that Rn and the set of fixed size matrices have a nice algebraic structure when endowed with the addition and scalar multiplication operations. In fact, as we will see, there are many other sets of elements that have the same kind of structure with natural addition and scalar multiplication operations. Due to this underlying similar structure, these sets are connected in some way and can all be studied jointly. Mathematicians look for these kinds of connections between seemingly dissimilar objects and, from a mathematical standpoint, it is convenient to study all of these similar structures at once by combining them into a larger collection. This motivates the idea of a vector space that we will investigate in this chapter.

An example of a set that has a structure similar to vectors is a collection of polynomials. Let P1 be the collection of all polynomials of degree less than or equal to 1 with real coefficients. That is,

P1={a0+a1t:a0,a1∈R}.

So, for example, the polynomials 2+t, 5t, βˆ’7, and 12βˆ’Ο€t are in P1, but t is not in P1. Two polynomials a(t)=a0+a1t and b(t)=b0+b1t in P1 are equal if a0=b0 and a1=b1.

We define addition of polynomials in P1 by adding the coefficients of the like degree terms. So if a(t)=a0+a1t and b(t)=b0+b1t, then the polynomial sum of a(t) and b(t) is

a(t)+b(t)=(a0+a1t)+(b0+b1t)=(a0+b0)+(a1+b1)t.

So, for example,

(2+3t)+(βˆ’1+5t)=(2+(βˆ’1))+(3+5)t=1+8t.

We now consider the properties of the addition operation. For example, we can ask if polynomial addition is commutative. That is, if a(t) and b(t) are in P1, must it be the case that

a(t)+b(t) = b(t)+a(t)?

To show that addition is commutative in P1, we choose arbitrary polynomials a(t)=a0+a1t and b(t)=b0+b1t in P1. Then we have

a(t)+b(t)=(a0+b0)+(a1+b1)t=(b0+a0)+(b1+a1)t=b(t)+a(t).

Note that in the middle step, we used the definition of equality of polynomials since a0+b0=b0+a0 and a1+b1=b1+a1 due to the fact that addition of real numbers is commutative. So addition of elements in P1 is a commutative operation.

Preview Activity 31.1.

(a)

Now we investigate other properties of addition in P1.

(i)

To show addition is associative in P1, we need to verify that if a(t)=a0+a1t, b(t)=b0+b1t, and c(t)=c0+c1t are in P1, it must be the case that

(a(t)+b(t))+c(t)=a(t)+(b(t)+c(t)).

Either verify this property by using the definition of two polynomials being equal, or give a counterexample to show the equality fails in that case.

(ii)

Find a polynomial z(t)∈P1 such that

a(t)+z(t)=a(t)

for all a(t)∈P1. This polynomial is called the zero polynomial or the additive identity polynomial in P1.

(iii)

If a(t)=a0+a1t is an element of P1, is there an element p(t)∈P1 such that

a(t)+p(t)=z(t),

where z(t) is the additive identity polynomial you found above? If not, why not? If so, what polynomial is p(t)? Explain.

(b)

We can also define a multiplication of polynomials by scalars (real numbers).

(i)

What element in P1 could be the scalar multiple 12(2+3t)?

(ii)

In general, if k is a scalar and a(t)=a0+a1t is in P1, how do we define the scalar multiple ka(t) in P1?

(iii)

If k is a scalar and a(t)=a0+a1t and b(t)=b0+b1t are elements in P1, is it true that

k(a(t)+b(t))=ka(t)+kb(t)?

If no, explain why. If yes, verify your answer using the definition of two polynomials being equal.

(iv)

If k and m are scalars and a(t)=a0+a1t is an element in P1, is it true that

(k+m)a(t)=ka(t)+ma(t)?

If no, explain why. If yes, verify your answer.

(v)

If k and m are scalars and a(t)=a0+a1t is an element in P1, is it true that

(km)a(t)=k(ma(t))?

If no, explain why. If yes, verify your answer.

(vi)

If a(t)=a0+a1t is an element of P1, is it true that

1a(t)=a(t)?

If no, explain why. If yes, verify your answer.

Subsection Spaces with Similar Structure to Rn

Mathematicians look for patterns and for similarities between mathematical objects. In doing so, mathematicians often consider larger collections of objects that are sorted according to their similarities and then study these collections rather than just the objects themselves. This perspective can be very powerful β€” whatever can be shown to be true about an arbitrary element in a collection will then be true for every specific element in the collection. In this section we study the larger collection of sets that share the algebraic structure of vectors in Rn. These sets are called vector spaces.

In Preview Activity 31.1, we showed that the set P1 of polynomials of degree less than or equal to one with real coefficients, with the operations of addition and scalar multiplication defined by

(a0+a1t)+(b0+b1t)=(a0+b0)+(a1+b1)t    and    k(a0+a1t)=(ka0)+(ka1)t,

has a structure similar to R2.

By structure we mean how the elements in the set relate to each other under addition and multiplication by scalars. That is, if a(t)=a0+a1t, b(t), and c(t) are elements of P1 and k and m are scalars, then

  1. a(t)+b(t) is an element of P1,

  2. a(t)+b(t)=b(t)+a(t),

  3. (a(t)+b(t))+c(t)=a(t)+(b(t)+c(t)),

  4. there is a zero polynomial z(t) (namely, 0+0t) in P1 so that a(t)+z(t)=a(t),

  5. there is an element βˆ’a(t) in P1 (namely, (βˆ’a0)+(βˆ’a1)t) so that a(t)+(βˆ’a(t))=z(t),

  6. ka(t) is an element of P1,

  7. (k+m)a(t)=ka(t)+ma(t),

  8. k(a(t)+b(t))=ka(t)+kb(t),

  9. (km)a(t)=k(ma(t)),

  10. 1a(t)=a(t).

The properties we saw for polynomials in P1 stated above are the same as the properties for vector addition and multiplication by scalars in Rn, as well as matrix addition and multiplication by scalars identified in Section 8. This indicates that polynomials in P1, vectors in Rn, and the set of mΓ—n matrices behave in much the same way as regards their addition and multiplication by scalars. There is an even closer connection between linear polynomials and vectors in R2. An element a(t)=a0+a1t in P1 can be naturally associated with the vector [a0a1] in R2. All the results of polynomial addition and multiplication by scalars then translate to corresponding results of addition and multiplication by scalars of vectors in R2. So for all intents and purposes, as far as addition and multiplication by scalars is concerned, there is no difference between elements in P1 and vectors in R2 β€” the only difference is how we choose to present the elements (as polynomials or as vectors). This sameness of structure of our sets as it relates to addition and multiplication by scalars is the type of similarity mentioned in the introduction. We can study all of the types of objects that exhibit this same structure at one time by studying vector spaces.

Subsection Vector Spaces

We defined vector spaces in the context of subspaces of Rn in Definition 12.3. In general, any set that has the same kind of additive and multiplicative structure as our sets of vectors, matrices, and linear polynomials is called a vector space. As we will see, the ideas that we introduced about subspaces of Rn apply to vector spaces in general, so the material in this chapter should have a familiar feel.

Definition 31.1.

A set V on which an operation of addition and a multiplication by scalars is defined is a vector space if for all u, v, and w in V and all scalars a and b:

  1. u+v is an element of V (we say that V is closed under the addition in V),

  2. u+v=v+u (we say that the addition in V is commutative),

  3. (u+v)+w=u+(v+w) (we say that the addition in V is associative),

  4. there is a zero vector 0 in V so that u+0=u (we say that V contains an additive identity 0),

  5. for each x in V there is an element y in V so that x+y=0 (we say that V contains an additive inverse y for each element x in V),

  6. au is an element of V (we say that V is closed under multiplication by scalars),

  7. (a+b)u=au+bu (we say that multiplication by scalars distributes over scalar addition),

  8. a(u+v)=au+av (we say that multiplication by scalars distributes over addition in V),

  9. (ab)u=a(bu),

  10. 1u=u.

Note.

Unless otherwise stated, in this book the scalars will refer to real numbers. However, we can define vector spaces where scalars are complex numbers, or rational numbers, or integers modulo p where p is a prime number, or, more generally, elements of a field. A field is an algebraic structure which generalizes the structure of real numbers and rational numbers under the addition and multiplication operations. Since we will focus on the real numbers as scalars, the reader is not required to be familiar with the concept of a field.

Because of the similarity of the way elements in vector spaces behave compared to vectors in Rn, we call the elements in a vector space vectors. There are many examples of vectors spaces, which is what makes this idea so powerful.

Example 31.2.

(a)

The space Rn of all vectors with n components is a vector space using the standard vector addition and multiplication by scalars. The zero element is the zero vector 0 whose components are all 0.

(b)

The set P1 of all polynomials of degree less than or equal to 1 with addition and scalar multiplication as defined earlier. Recall that P1 is essentially the same as R2.

(c)

The properties listed in the introduction for P1 are equally true for the collection of all polynomials of degree less than or equal to some fixed number. We label as Pn this set of all polynomials of degree less than or equal to n, with the standard addition and scalar multiplication. Note that Pn is essentially the same as Rn+1. More generally, the space P of all polynomials is also a vector space with standard addition and scalar multiplication.

(d)

As a subspace of Rn, the eigenspace of an nΓ—n matrix corresponding to an eigenvalue Ξ» is a vector space.

(e)

As a subspace of Rn, the null space of an mΓ—n matrix is a vector space.

(f)

As a subspace of Rm, the column space of an mΓ—n matrix is a vector space.

(g)

The span of a set of vectors in Rn is a subspace of Rn, and is therefore a vector space.

(h)

Let V be a vector space and let 0 be the additive identity in V. The set {0} is a vector space in which 0+0=0 and k0=0 for any scalar k. This space is called the trivial vector space.

(i)

The space MmΓ—n (or MmΓ—n(R) when it is important to indicate that the entries of our matrices are real numbers) of all mΓ—n matrices with real entries with the standard addition and multiplication by scalars we have already defined. In this case, MmΓ—n is essentially the same vector space as Rmn.

(j)

The space F of all functions from R to R, where we define the sum of two functions f and g in F as the function f+g satisfying

(f+g)(x)=f(x)+g(x)

for all real numbers x, and the scalar multiple cf of the function f by the scalar c to be the function satisfying

(cf)(x)=cf(x)

for all real numbers x. The verification of the vector space properties for this space is left to the reader.

(k)

The space R∞ of all infinite real sequences (x1,x2,x3,…). We define addition and scalar multiplication termwise:

(x1,x2,x3,…)+(y1,y2,y3,…)=(x1+y1,x2+y2,x3+y3,…),
c(x1,x2,x3,…)=(cx1,cx2,cx3,…)text,

is a vector space. In addition, the set of convergent sequences inside R∞ forms a vector space using this addition and multiplication by scalars (as we did in Rn, we will call this set of convergent sequences a subspace of R∞).

(l)

(For those readers who are familiar with differential equations). The set of solutions to a second order homogeneous differential equation forms a vector space under addition and scalar multiplication defined as in the space F above.

(m)

The set of polynomials of positive degree in P1 is not a vector space using the standard addition and multiplication by scalars in P1 is not a vector space. Notice that t+(βˆ’t) is not a polynomial of positive degree, and so this set is not closed under addition.

(n)

The color space where each color is assigned an RGB (red, green, blue) coordinate between 0 and 255, with addition and scalar multiplication defined component-wise, however, does not define a vector space. The color space is not closed under either operation due to the color coordinates being integers ranging from 0 to 255.

It is important to note that the set of defining properties of a vector space is intended to be a minimum set. Any other properties of a vector space must be verified or proved using the defining properties. For example, in Rn it is clear that the scalar multiple 0v is the zero vector for any vector v in Rn. This might be true in any vector space, but it is not a defining property. Therefore, if this property is true, then we must be able to prove it using just the defining properties. To see how this might work, let v be any vector in a vector space V. We want to show that 0v=0 (the existence of the zero vector is property (4)). Using the fact that 0+0=0 and that scalar multiplication distributes over scalar addition, we can see that

0v=(0+0)v=0v+0v.

Property (5) tells us that V contains an additive inverse for every vector in V, so let u be an additive inverse of the vector 0v in V. Then 0v+u=0 56  and so

0v+u=(0v+0v)+u0=0v+(0v+u)0=0v+0.

Now 0 has the property that 0+w=w+0=w for any vector w in V (by properties (4) and (2)), and so we can conclude that

0=0v.

Activity 31.2.

Another property that will be useful is a cancellation property. In the set of real numbers we know that if a+b=c+b, then a=c, and we verify this by subtracting b from both sides. This is the same as adding the additive inverse of b to both sides, so we ought to be able to make the same argument using additive inverses in a vector space. To see how, let u, v, and w be vectors in a vector space and suppose that

(31.1)u+w=v+w.
(a)

Why does our space contain an additive inverse z of w?

(b)

Now add the vector z to both sides of equation (31.1) to obtain

(31.2)(u+w)+z=(v+w)+z.

Which property of a vector space allows us to state the following equality?

(31.3)u+(w+z)=v+(w+z).
(c)

Now use the properties of additive inverses and the additive identity to explain why u=v. Conclude that we have a cancellation law for addition in any vector space.

We should also note that the definition of a vector space only states the existence of a zero vector and an additive inverse for each vector in the space, and does not say that there cannot be more than one zero vector or more than one additive inverse of a vector in the space. The reason why is that the uniqueness of the zero vector and an additive inverse of a vector can be proved from the defining properties of a vector space, and so we don't list this consequence as a defining property. Similarly, the defining properties of a vector space do not state that the additive inverse of a vector v is the scalar multiple (βˆ’1)v. Verification of these properties are left for the exercises. We summarize the results of this section in the following theorem.

Subsection Subspaces

In Section 12 we saw that Rn contained subsets that we called subspaces that had the same algebraic structure as Rn. The same idea applies to vector spaces in general.

Activity 31.3.

Let H={at:a∈R}. Notice that H is a subset of P1.

(a)

Is H closed under the addition in P1? Verify your answer.

(b)

Does H contain the zero vector from P1? Verify your answer.

(c)

Is H closed under multiplication by scalars? Verify your answer.

(d)

Explain why H satisfies every other property of the definition of a vector space automatically just by being a subset of P1 and using the same operations as in P1. Conclude that H is a vector space.

Activity 31.3 illustrates an important point. There is a fundamental difference in the types of properties that define a vector space. Some of the properties that define a vector space are true for any subset of the vector space because they are properties of the operations (such as the commutative and associative properties). The other properties (closure, the inclusion of the zero vector, and the inclusion of additive inverses) are set properties, not properties of the operations. So these three properties have to be specifically checked to see if a subset of a vector space is also a vector space. This leads to the definition of a subspace, a subset of a vector space which is a vector space itself.

Definition 31.4.

A subset H of a vector space V is a subspace of V if

  1. whenever u and v are in H it is also true that u+v is in H (that is, H is closed under addition),

  2. whenever u is in H and a is a scalar it is also true that au is in H (that is, H is closed under scalar multiplication),

  3. 0 is in H.

Activity 31.4.

Is the given subset H a subspace of the indicated vector space V? Verify your answer.

(a)

V is any vector space and H={0}

(b)

V=M2Γ—2, the vector space of 2Γ—2 matrices and H={[2xy0x]| x and y are scalars }.

(c)

V=P2, the vector space of all polynomials of degree less than or equal to 2 and H={2at2+1∣a is a scalar }.

(d)

V=P2 and H={at∣a is a scalar }βˆͺ{bt2∣b is a scalar }.

There is an interesting subspace relationship between the spaces P1,P2,P3,… and P. For every i, Pi is a subspace of P. Furthermore, P1 is a subspace of P2, P2 is a subspace of P3, and so on. Note however that a similar relationship does NOT hold for Rn, even though Pi looks like Ri+1. For example, R1 is NOT a subspace of R2. Similarly, R2 is NOT a subspace of R3. Since the vectors in different Rn's are of different sizes, none of the Ri's is a subset of another Rn with iβ‰ n, and hence, Ri is not a subspace of Rn when i<n.

Subsection The Subspace Spanned by a Set of Vectors

In Rn we showed that the span of any set of vectors forms a subspace of Rn. The same is true in any vector space. Recall that the span of a set of vectors in Rn is the set of all linear combinations of those vectors. So before we can discuss the span of a set of vectors in a vector space, we need to extend the definition of linear combinations to vector spaces (compare to Definition 4.8 and Definition 4.10).

Definition 31.5.

Let V be a vector space. A linear combination of vectors v1, v2, …, vk in V is a vector of the form

x1v1+x2v2+β‹―+xkvk,

where x1, x2, …, xk are scalars. The span of the vectors v1, v2, …, vk is the collection of all linear combinations of v1, v2, …, vk. That is,

Span{v1,v2,…,vk}={x1v1+x2v2+β‹―+xkvk∣x1,x2,…,xk are scalars }.

The argument that the span of any finite set of vectors in a vector space forms a subspace is the same as we gave for the span of a set of vectors in Rn (see Theorem 12.7). The proof is left for the exercises.

The subspace Span{v1,v2,…,vm} is called the subspace of V spanned by v1,v2,…,vm.

Activity 31.5.
(a)

Let H={a2t2βˆ’a1t:a2 and a1 are real numbers }. Note that H is a subset of P2. Find two vectors v1,v2 in P2 so that H=Span{v1,v2} and hence conclude that H is a subspace of P2. (Note that the vectors v1,v2 are not unique.)

(b)

Let p1(t)=1βˆ’t2 and p2(t)=1+t2, and let S={p1(t),p2(t)} in P2. Is the polynomial q(t)=3βˆ’2t2 in Span S? (Hint: Create a matrix equation of the form Ax=b by setting up an appropriate polynomial equation involving p1(t), p2(t) and q(t). Under what conditions on A is the system Ax=b consistent?)

(c)

With S as in part (b), describe as best you can the subspace Span S of P2.

Given a subspace H, the set S such that H=Span S is called a spanning set of H. In order to determine if a set S={v1,v2,…,vk} is a spanning set for H, all we need to do is to show that for every b in H, the equation

x1v1+x2v2+β‹―+xkvk=b

has a solution. We will see important uses of special spanning sets called bases in the rest of this chapter.

Subsection Examples

What follows are worked examples that use the concepts from this section.

Example 31.7.

Determine if each of the following sets is a vector space.

(a)

V={(x,y,z):x,y,z∈R} with addition and multiplication by scalars defined by

(a,b,c)βŠ•(x,y,z)=(a+x,c+z,b+y)  and  k(x,y,z)=(kx,kz,ky),

where (a,b,c) and (x,y,z) are in V and k∈R

Solution.

We consider the vector space properties in Definition 31.1. Let (a,b,c), (u,v,w), and (x,y,z) be in V and let k,m∈R. By the definition of addition and multiplication by scalars, both (a,b,c)+(x,y,z) and k(x,y,z) are in V. Note also that

(a,b,c)βŠ•(x,y,z)=(a+x,c+z,b+y)=(x+a,z+c,y+b)=(x,y,z)βŠ•(a,b,c),

and so addition is commutative in V. Since

((1,1,0)βŠ•(0,1,1))βŠ•(0,0,1)=(1,1,2)βŠ•(0,0,1)=(1,3,1)

and

(1,1,0)βŠ•((0,1,1)βŠ•(0,0,1))=(1,1,0)βŠ•(0,2,1)=(1,1,3),

we see that addition is not associative and conclude that V is not a vector space. At this point we can stop since we have shown that V is not a vector space.

(b)

V={x∈R:x>0} with addition βŠ• and multiplication by scalars defined by

xβŠ•y=xy  and  kx=xk,

where x and y are in V, k∈R, and xy is the standard product of x and y

Solution.

We consider the vector space properties in Definition 31.1. Let x, y, and z be in V and let k,m∈R. Since x and y are both positive real numbers, we know that xy is a positive real number. Thus, xβŠ•y∈V and V is closed under its addition. Also, xk is a positive real number, so xk∈V as well. Now

xβŠ•y=xy=yx=yβŠ•x

and addition is commutative in V. Also,

(xβŠ•y)βŠ•z=(xy)βŠ•z=(xy)z=x(yz)=xβŠ•(yz)=xβŠ•(yβŠ•z)

and addition is associative in V. Since

1βŠ•x=1x=x,

V contains an additive identity, which is 1. The fact that x is a positive real number implies that 1x is a positive real number. Thus, 1x∈V and

xβŠ•1x=x(1x)=1

and V contains an additive inverse for each of its elements. We have that

(k+m)x=xk+m=xkxm=xkβŠ•xm=k(x)βŠ•m(x),k(xβŠ•+y)=k(xy)=xkyk=xkβŠ•yk=k(x)βŠ•k(y),(km)x=xkm=(xm)k=(m(x))k=k(m(x))1x=x1=x.

So V satisfies all of the properties of a vector space.

(c)

The set W of all 2Γ—2 matrices of the form [a0b0] where a and b are real numbers using the standard addition and multiplication by scalars on matrices.

Solution.

Recall that M2Γ—2 is a vector space using the standard addition and multiplication by scalars on matrices. Any matrix of the form [a0b0] can be written as

[a0b0]=a[1000]+b[0010].

So W=Span{[1000],[0010]} and W is a subspace of M2Γ—2. Thus, W is a vector space.

(d)

The set W of all functions f from R to R such that f(0)β‰₯0 using the standard addition and multiplication by scalars on functions.

Solution.

We will show that W is not a vector space. Let f:Rβ†’R be defined by f(x)=1. Then f(0)β‰₯0 and f∈W. However, if h=(βˆ’1)f, then h(0)=(βˆ’1)f(0)=βˆ’1 and hβˆ‰W. It follows that W is not closed under multiplication by scalars and W is not a vector space.

Example 31.8.

Let V be a vector space and u and v vectors in V. Also, let a and b be scalars. You may use the result of Exercise 5 that c0=0 for any scalar c in any vector space.

(a)

If av=bv and v≠0, must a=b? Use the properties of a vector space or provide a counterexample to justify your answer.

Solution.

We will show that this statement is true. Suppose av=bv and vβ‰ 0. Then 0=avβˆ’bv=(aβˆ’b)v. If a=b, then we are done. So suppose aβ‰ b. Then aβˆ’bβ‰ 0 and 1aβˆ’b is a real number. Then

1aβˆ’b0=1aβˆ’b((aβˆ’b)v)0=(1aβˆ’b(aβˆ’b))v0=v.

But we assumed that v≠0, so we can conclude that a=b as desired.

(b)

If au=av and a≠0, must u=v? Use the properties of a vector space or provide a counterexample to justify your answer.

Solution.

We will show that this statement is true. Suppose au=av and aβ‰ 0. Then 0=auβˆ’av=a(uβˆ’v). Since aβ‰ 0, we know that 1a is a real number. Thus,

1a0=1a(a(uβˆ’v))0=(1aa)(uβˆ’v)0=uβˆ’vu=v.
(c)

If au=bv, must a=b and u=v? Use the properties of a vector space or provide a counterexample to justify your answer.

Solution.

We will demonstrate that this statement is false with a counterexample. Let a=1, b=2, u=[2 0]T and v=[1 0]T in R2. Then

au=1[2 0]T=[2 0]T=2[1 0]T=bv,

but a≠b and u≠v.

Subsection Summary

  • A set V on which an operation of addition and a multiplication by scalars is defined is a vector space if for all u, v, and w in V and all scalars a and b:

    1. u+v is an element of V,

    2. u+v=v+u,

    3. (u+v)+w=u+(v+w),

    4. there is a zero vector 0 in V so that u+0=u,

    5. for each x in V there is an element y in V so that x+y=0,

    6. au is an element of V,

    7. (a+b)u=au+bu,

    8. a(u+v)=au+av,

    9. (ab)u=a(bu),

    10. 1u=u.

  • A subset H of a vector space V is a subspace of V if

    1. whenever u and v are in H it is also true that u+v is in H,

    2. whenever u is in H and a is a scalar it is also true that au is in H,

    3. 0 is in H.

  • A linear combination of vectors v1, v2, …, vk in a vector space V is a vector of the form

    x1v1+x2v2+β‹―+xkvk,

    where x1, x2, …, xk are scalars.

  • The span of the vectors v1, v2, …, vk in a vector space V is the collection of all linear combinations of v1, v2, …, vk. That is,

    Span{v1,v2,…,vk}={x1v1+x2v2+β‹―+xkvk:x1,x2,…,xk are scalars }.
  • The span of any finite set of vectors in a vector space V is always a subspace of V.

  • This concept of vector space is important because there are many different types of sets (e.g., Rn, MmΓ—n, Pn, F) that have similar structure, and we can relate them all as members of this larger collection of vector spaces.

Exercises Exercises

1.

The definition of a vector space only states the existence of a zero vector and does not say how many zero vectors the space might have. In this exercise we show that the zero vector in a vector space is unique. To show that the zero vector is unique, we assume that two vectors 01 and 02 have the zero vector property.

(a)

Using the fact that 01 is a zero vector, what vector is 01+02?

Hint.

Use the fact that 01+v=v for any vector v in our vector space.

(b)

Using the fact that 02 is a zero vector, what vector is 01+02?

Hint.

Same reasoning as in part (a).

(c)

How do we conclude that the zero vector is unique?

Hint.

Use the transitive property of equality.

2.

The definition of a vector space only states the existence of an additive inverse for each vector in the space, but does not say how many additive inverses a vector can have. In this exercise we show that the additive inverse of a vector in a vector space is unique. To show that a vector v has only one additive inverse, we suppose that v has two additive inverses, u and w, and demonstrate that u=w.

(a)

What equations must u and w satisfy if u and w are additive inverses of v?

(b)

Use the equations from part (a) to show that u=w. Clearly identify all vector space properties you use in your argument.

4.

Let V be a vector space and v a vector in V. In all of the vector spaces we have seen to date, the additive inverse of the vector v is equal to the scalar multiple (βˆ’1)v. This seems reasonable, but it is important to note that this result is not stated in the definition of a vector space, so this it is something that we need to verify. To show that (βˆ’1)v is an additive inverse of the vector v, we need to demonstrate that

v+(βˆ’1)v=0.

Verify this equation, explicitly stating which properties you use at each step.

Hint.

Use the fact that βˆ’1+1=0.

5.

It is reasonable to expect that if c is any scalar and 0 is the zero vector in a vector space V, then c0=0. Use the fact that 0+0=0 to prove this statement.

6.

Let W1,W2 be two subspaces of a vector space V. Determine whether W1∩W2 and W1βˆͺW2 are subspaces of V. Justify each answer clearly.

7.

Find three vectors v1,v2,v3 to express

W={[a+2b+cbβˆ’3caβˆ’c]:a,b,c in R}

as Span{v1,v2,v3}. How does this justify why W is a subspace of R3?

8.

Find three vectors v1,v2,v3 to express

W={[a+baβˆ’2c3b+ca+bβˆ’c]:a,b,c in R}

as Span{v1,v2,v3}. How does this justify why W is a subspace of M2Γ—2?

9.

Let F be the set of all functions from R to R, where we define the sum of two functions f and g in F as the function f+g satisfying

(f+g)(x)=f(x)+g(x)

for all real numbers x, and the scalar multiple cf of the function f by the scalar c to be the function satisfying

(cf)(x)=cf(x)

for all real numbers x. Show that F is a vector space using these operations.

11.

Determine if each of the following sets of elements is a vector space or not. If appropriate, you can identify a set as a subspace of another vector space, or as a span of a collection of vectors to shorten your solution.

(a)

A line through the origin in Rn.

(c)

The set of vectors {[a0]:a in Z}.

(d)

The set of all differentiable functions from R to R.

(e)

The set of all functions from R to R which are increasing for every x. (Assume that a function f is increasing if f(a)>f(b) whenever a>b.)

(f)

The set of all functions f from R to R for which f(c)=0 for some fixed c in R.

(g)

The set of polynomials of the form a+bt, where a+b=0.

(h)

The set of all upper triangular 4Γ—4 real matrices.

(i)

The set of complex numbers C where scalar multiplication is defined as multiplication by real numbers.

12.

A reasonable way to extend the idea of the vector space Rn to infinity is to let R∞ be the set of all sequences of real numbers. Define addition and multiplication by scalars on R∞ by

{xn}+{yn}={xn+yn}  and  c{xn}={cxn}

where {xn} denotes the sequence x1,x2,x3,…, {yn} denotes the sequence y1,y2,y3,… and c is a scalar.

(a)

Show that R∞ is a vector space using these operations.

(b)

Is the set of sequences that have infinitely many zeros a subspace of R∞? Verify your answer.

(c)

Is the set of sequences which are eventually zero a subspace of R∞? Verify your answer. (A sequence {xn} is eventually zero if there is an index k0 such that xn=0 whenever nβ‰₯k0.)

(d)

Is the set of decreasing sequences a subspace of R∞? Verify your answer. (A sequence {xn} is decreasing if xn+1≀xn for each n.)

(e)

Is the set of sequences in R∞ that have limits at infinity a subspace of R∞?

(f)

Let β„“2 be the set of all square summable sequences in R∞, that is sequences {xn} so that βˆ‘k=1∞xk2 is finite. So, for example, the sequence {1n} is in β„“2. Show that β„“2 is a subspace of R∞ (the set β„“2 is an example of what is called a Hilbert space by defining the inner product ⟨{xn},{yn}⟩=βˆ‘n=1∞xnyn). (Hint: show that 2u2+2v2βˆ’(u+v)2β‰₯0 for any real numbers u and v.)

13.

Given two subspaces H1,H2 of a vector space V, define

H1+H2={w∣w=u+v where u in H1,v in H2}.

Show that H1+H2 is a subspace of V containing both H1,H2 as subspaces. The space H1+H2 is the sum of the subspaces H1 and H2.

14.

Label each of the following statements as True or False. Provide justification for your response.

(a) True/False.

The intersection of any two subspaces of V is also a subspace.

(b) True/False.

The union of any two subspaces of V is also a subspace.

(c) True/False.

If H is a subspace of a vector space V, then βˆ’H={(βˆ’1)v:v in H} is equal to H.

(d) True/False.

If v is a nonzero vector in H, a subspace of Rn, then H contains the line through the origin and v in Rn.

(e) True/False.

If v1,v2 are nonzero, non-parallel vectors in H, a subspace of Rn, then H contains the plane through the origin, v1 and v2 in Rn.

(f) True/False.

The smallest subspace in Rn containing a vector v is a line through the origin.

(g) True/False.

The largest subspace of V is V.

(h) True/False.

The space P1 is a subspace of Pn for nβ‰₯1.

(i) True/False.

The set of constant functions from R to R is a subspace of F.

(j) True/False.

The set of all polynomial functions with rational coefficients is a subspace of F.

Subsection Project: Hamming Codes and the Hat Puzzle

Recall the hat problem from the beginning of this section. Three players are assigned either a red or blue hat and can only see the colors of the hats of the other players. The goal is to devise a high probability strategy for one player to correctly guess the color of their hat. The players have a 50% chance of winning if one player guesses randomly and all of the others pass. However, the group can do better than 50% with a reasonably simple strategy. There are 2 possibilities for each hat color for a total of 23=8 possible distributions of hat colors. Of these, only red-red-red and blue-blue-blue contain only one hat color, so 6/8 of 3/4 of the possible hat distributions have two hats of one color and one of the other color. So if a player sees two hats of the same color, that player guesses the other color and passes otherwise. This gives a 75% chance of winning. This strategy will only work for three players, though. We want to develop an effective strategy that works for larger groups of players.

There is a strategy, based on Hamming codes that can be utilized when the number of players is of the form 2kβˆ’1 with kβ‰₯2. This strategy will provide a winning probability of

1βˆ’2βˆ’k.

Note that as kβ†’βˆž, this probability has a limit of 1. Note also that if k=2 (so that there are 3 players), then the probability is 34 or 75% β€” the same strategy we came up with earlier.

To understand this strategy, we need to build a slightly different kind of vector space than we have seen until now, one that is based on a binary choice of red or blue. To do so, we identify the hat colors with numbers β€” 0 for red and 1 for blue. So let F={0,1}. Assume there are n=2kβˆ’1 players for some integer kβ‰₯2. We can then view a distribution of hats among the n=2kβˆ’1 players as a vector with n components from F. That is,

Fn={[Ξ±1 Ξ±2 β‹― Ξ±n]T:Ξ±i∈F}.

We can give some structure to both F and Fn by noting that we can define addition and multiplication in F by

30+0=0,0+1=1+0=1,1+1=00β‹…0=0,0β‹…1=1β‹…0=0,1β‹…1=1.

Project Activity 31.6.

Show that F has the same structure as R. That is, show that for all x, y, and z in F, the following properties are satisfied.

(c)

(x+y)+z=x+(y+z) and (xy)z=x(yz)

(d)

There is an element 0 in F such that x+0=x

(e)

There is an element 1 in F such that (1)x=x

(f)

There is an element βˆ’x in F such that x+(βˆ’x)=0

(g)

If x≠0, there is an element 1x in F such that x(1x)=1

Project Activity 31.6 shows that F has the same properties as R β€” that is that F is a field. Until now, we have worked with vector spaces whose scalars come from the set of real numbers, but that is not necessary. None of the results we have discovered so far about vector spaces require our scalars to come from R. In fact, we can replace R with any field and all of the same vector space properties hold. It follows that V=Fn is a vector space over F. As we did in Rn, we define the standard unit vectors e1=[1 0 0 β‹― 0]T, e2=[0 1 0 0 β€¦ 0]T, …, en=[0 0 0 β€¦ 0 1]T in V=Fn.

Now we return to the hat puzzle. We have n=2kβˆ’1 players. Label the players 1, 2, …, n. We can now represent a random placements of hats on heads as a vector v=[Ξ±1 Ξ±2 β‹― Ξ±n]T in V=Fn, where Ξ±i=0 in the ith entry represents a red hat and Ξ±i=1 a blue hat on player i. Since player i can see all of the other hats, from player i's perspective the distribution of hats has the form

v=vi+Ξ²iei,

where Ξ²i is the unknown color of hat on player i's head and

vi=[Ξ±1 Ξ±2 β‹― Ξ±iβˆ’1 0 Ξ±i+1 β‹―Ξ±n]T.

In order to analyze the vectors v from player i's perspective and to devise an effective strategy, we will partition the set V into an appropriate disjoint union of subsets.

To provide a different way to look at players, we will use a subspace of V. Let W be a subspace of V that has a basis of k vectors. The elements of W are the linear combinations of k basis vectors, and each basis vector in a linear combination has 2 possibilities for its weight (from F). Thus, W contains exactly 2k=n+1 vectors. We can then use the n=2kβˆ’1 nonzero vectors in W to represent our players. Each distribution of hats can be seen as a linear combination of the vectors in W. Let w1, w2, …, w2kβˆ’1 be the nonzero vectors in W. We then define a function Ο†:Vβ†’W as

Ο†([Ξ±1 Ξ±2 β‹― Ξ±n]T)=βˆ‘i=1nΞ±iwi

that identifies a distribution of hats with a vector in W. The subspace that we need to devise our strategy is what is called a Hamming code.

Project Activity 31.7.

Let

H={[Ξ±1 Ξ±2 β‹― Ξ±n]T∈V:βˆ‘i=1nΞ±iwi=0}.

Show that H is a subspace of V. (The subspace H is called the (2kβˆ’1,2kβˆ’kβˆ’1) Hamming code (where the first component is the number of elements in a basis for V and the second the number of elements in a basis for H). Hamming codes are examples of linear codes β€” those codes that are subspaces of the larger vector space.)

Now for each i between 0 and n we define Hi=ei+H as

Hi=ei+H={ei+h:h∈H},

where we let e0=0. The sets Hi are called cosets of H.

Project Activity 31.8.

To complete our strategy for the hat puzzle, we need to know some additional information about the sets Hi.

(a)

Show that the sets Hi are disjoint. That is, show that Hi∩Hj=βˆ… if iβ‰ j.

Hint.

If v∈Hi and v∈Hj, what can we say about eiβˆ’ej?

(b)

Since HiβŠ†V for each i, it follows that ⋃i=0nHiβŠ†V. Now we show that V=⋃i=0nHi by demonstrating that ⋃i=0nHi has exactly the same number of elements as V. We will need one fact for our argument. We will see in a later section that H has a basis of nβˆ’k elements, so the number of elements in H is 2nβˆ’k.

(i)

Since the sets Hi are disjoint, the number of elements in ⋃i=1nHi is equal to the sum of the number of elements in each Hi. Show that each Hi has the same number of elements as H.

(ii)

Now use the fact that the number of elements in ⋃i=0nHi is equal to the sum of the number of elements in each Hi to argue that V=⋃i=0nHi.

The useful idea from Project Activity 31.8 is that any hat distribution in V is in exactly one of the sets Hi. Recall that a hat distribution v=[Ξ±1 Ξ±2 β‹― Ξ±n]T in V can be written from player i's perspective as

v=vi+Ξ²iei,

where vi=[Ξ±1 Ξ±2 β‹― Ξ±iβˆ’1 0 Ξ±i+1 β‹― β‹― Ξ±n]T. Our strategy for the hat game can now be revealed.

  • If vi+Ξ²iei is not in H for either choice of Ξ²i, then player i should pass.

  • If vi+Ξ²iei is in H, then player i guesses 1+Ξ²i.

Project Activity 31.9.

Let us analyze this strategy.

(a)

Explain why every player guesses wrong if v is in H.

(b)

Now we see determine that our strategy is a winning strategy for all hat distributions v that are not in H. First we need to know that these two options are the only ones. That is, show that it is not possible for vi+Ξ²iei to be in H for both choices of Ξ²i.

(c)

Now we want to demonstrate that this is a winning strategy if vβˆ‰H. That is, at least one player guesses a correct hat color and no one else guesses incorrectly. So assume vβˆ‰H.

(i)

We know that v∈Hi for some unique choice of i, so let v=ei+h for some h∈H. Explain why player i can correctly choose color 1+αi.

(ii)

Finally, we need to argue that every player except player i must pass. So consider player j, with j≠i. Recall that

v=vj+Ξ±jej.

Analyze our strategy and the conditions under which player j does not pass. Show that this leads to a contradiction.

Project Activity 31.9 completes our analysis of this strategy and shows that our strategy results in a win with probability

1βˆ’|H||V|=1βˆ’22kβˆ’kβˆ’122kβˆ’1=1βˆ’2βˆ’k.
It is very important to keep track of the different kinds of zeros here β€” the boldface zero 0 is the additive identity in the vector space and the non-bold 0 is the scalar zero.