The Matrix-Vector Form of a Linear System

Section 5 The Matrix-Vector Form of a Linear System

Focus Questions

By the end of this section, you should be able to give precise and thorough answers to the questions listed below. You may want to keep these questions in mind to focus your thoughts as you complete the section.

How and when is the matrix-vector product \(A \vx\) defined?
How can a system of linear equations be written in matrix-vector form?
How can we tell if the system \(A \vx = \vb\) is consistent for a given vector \(\vb\text{?}\)
How can we tell if the system \(A \vx = \vb\) is consistent for every vector \(\vb\text{?}\)
What is a homogeneous system? What can we say about the solution set to a homogeneous system?
What must be true about pivots in the coefficient matrix \(A\) in order for the homogeneous system \(A \vx = \vzero\) to have a unique solution?
How are the solutions to the nonhomogeneous system \(A \vx = \vb\) related to the solutions of the corresponding homogeneous system \(A \vx = \vzero\text{?}\)

Subsection Application: Modeling an Economy

An economy is a very complex system. It is not a well-defined object: many factors influence an economy, and it is often unclear how those factors influence each other. Mathematical modeling plays an important role in attempting to understand an economy.

In 1941 Wassily Leontief developed the first empirical model of a national economy. Around 1949 Leontief used data from the U.S. Bureau of Labor Statistics to divide the U.S. economy into 500 sectors. He then set up linear equations for each sector. This system was too large for the computers at the time to solve, so he then aggregated the information into 42 sectors. The Harvard Mark II computer was used to solve this system, one of the first significant uses of computers for mathematical modeling. Leontief won the 1973 Nobel Prize in economics for his work.

With such large models (Leontief's models are called input-output models) it is important to find a shorthand way to represent the resulting systems. In this section we will see how to represent any size system of linear equations in a very convenient way. Later, we will analyze a small economy using input-output models.

Subsection Introduction

There is another useful way to represent a system of linear equations using a matrix-vector product that we investigate in this section. To understand how this product comes about, recall that we can represent the linear system

\begin{alignat*}{5} x_1 \amp {}+{} \amp {4}x_2 \amp {}+{} \amp {2}x_3 \amp {}+{} \amp {4}x_4 \amp {}={} \amp 1\\ {2}x_1 \amp {}-{} \amp {}x_2 \amp {}-{} \amp {5}x_3 \amp {}-{} \amp {}x_4 \amp {}={} \amp 2\\ {3}x_1 \amp {}+{} \amp {7}x_2 \amp {}+{} \amp {}x_3 \amp {}+{} \amp {7}x_4 \amp {}={} \amp 3 \end{alignat*}

as the vector equation

\begin{equation} x_1\left[ \begin{array}{c} 1\\2\\3 \end{array} \right] + x_2\left[ \begin{array}{r} 4\\-1\\7 \end{array} \right] + x_3\left[ \begin{array}{r} 2\\-5\\1 \end{array} \right] + x_4\left[ \begin{array}{r} 4\\-1\\7 \end{array} \right]= \left[ \begin{array}{c} 1\\2\\3 \end{array} \right]\text{.}\tag{5.1} \end{equation}

We can view the left hand side of Equation (5.1) as a matrix-vector product. Specifically, if \(A = \left[ \begin{array}{crrr} 1\amp 4\amp 2\amp 4 \\ 2\amp -1\amp -5\amp -1 \\ 3\amp 7\amp 1\amp 7 \end{array} \right]\) and \(\vx = \left[ \begin{array}{c} x_1 \\x_2 \\x_3 \\x_4 \end{array} \right]\text{,}\) then we define the matrix-vector product \(A\vx\) as the left hand side of Equation (5.1). So the matrix-vector product \(A\vx\) is the linear combination of the columns of \(A\) whose weights are the entries of the vector \(\vx\text{,}\) taken in order.

With this definition, the vector equation in (5.1) can be expressed as the matrix-vector equation

\begin{equation*} \left[ \begin{array}{crrr} 1\amp 4\amp 2\amp 4 \\ 2\amp -1\amp -5\amp -1 \\ 3\amp 7\amp 1\amp 7 \end{array} \right] \left[ \begin{array}{c} x_1 \\x_2 \\x_3\\x_4 \end{array} \right] = \left[ \begin{array}{c} 1\\2\\3 \end{array} \right] \,\text{.} \end{equation*}

We call this representation the matrix-vector form of the system. Note that the matrix \(A\) in this expression is the same as the coefficient matrix that appears in the augmented matrix representation of the system.

We can use the above definition of the matrix-vector product as a linear combination with any matrix and any vector, as long as it is meaningful to use the entries in the vector as weights for the columns of the matrix. For example, if \(A=\left[ \begin{array}{cc} 1\amp 2\\3\amp 1\\1\amp 1 \end{array} \right]\) and \(\vv = \left[ \begin{array}{c} 3\\4 \end{array} \right]\text{,}\) then we can define \(A\vv\) to be the linear combination of the columns of \(A\) with weights 3 and 4:

\begin{equation*} A\vv= 3 \left[ \begin{array}{c} 1\\3\\1 \end{array} \right] + 4 \left[ \begin{array}{c} 2\\1\\1 \end{array} \right] = \left[ \begin{array}{c} 11\\13\\7 \end{array} \right] \,\text{.} \end{equation*}

However, note that if \(\vv\) had three entries, this definition would not make sense since we do not have three columns in \(A\text{.}\) In those cases, we say \(A\vv\) is not defined. We will later see that this definition can be generalized to matrix-matrix products, by treating the vector as a special case of a matrix with one column.

Preview Activity 5.1.

(a)

Write the vector equation

\begin{equation*} x_1 \left[ \begin{array}{c} 1 \\ 1 \\ 2 \end{array} \right] +x_2 \left[ \begin{array}{c} 1\\ 2\\ 3 \end{array} \right] + x_3 \left[ \begin{array}{r} -1 \\ 2 \\ -3 \end{array} \right] = \left[ \begin{array}{c} 4 \\ 3\\ 11 \end{array} \right] \end{equation*}

in matrix-vector form. Note that this is the vector equation whose augmented matrix representation was given in Problem 2 in Preview Activity 2.1. Compare your matrix \(A\) and the right hand side vector to the augmented matrix. Do not solve the system.

(b)

Given the matrix-vector equation

\begin{equation*} \left[ \begin{array}{crr} 1 \amp 2 \amp -1 \\ 0\amp 1\amp 2\\1\amp -2\amp -3 \end{array} \right] \vx = \left[\begin{array}{r} -3\\3\\-7 \end{array} \right] \end{equation*}

represent the system corresponding to this equation. Note that this should correspond to the system (or an equivalent system where an equation might be multiplied by \((-1)\)) in Problem 1 of Preview Activity 2.1.

(c)

Find the indicated matrix-vector products, if possible. Express as one vector.

(i)

\(\left[ \begin{array}{cr} 2\amp -2 \\ 1\amp 2 \end{array} \right] \left[ \begin{array}{r} 1 \\ -1 \end{array} \right]\)

(ii)

\(\left[ \begin{array}{rrr} 1 \amp 0 \amp 2\\ 2 \amp -2 \amp 3 \end{array} \right] \left[ \begin{array}{r} 2\\1 \end{array} \right]\)

(iii)

\(\left[ \begin{array}{rrc} -6 \amp -2 \amp 1\\ 2 \amp -2 \amp 1 \end{array} \right] \left[ \begin{array}{r} 1\\-3 \\1 \end{array} \right]\)

(d)

As you might have noticed, systems with all the constants being 0 are special in that they always have a solution. (Why?) So we might consider grouping systems into two types: Those of the form \(A \vx = \vb\text{,}\) where not all of the entries of the vector \(\vb\) are \(0\text{,}\) and those of the form \(A \vx = \vzero\text{,}\) where \(\vzero\) is the vector of all zeros. Systems like \(A \vx = \vb\text{,}\) where \(\vb\) contains at least one non-zero entry, are called nonhomogeneous systems, and systems of the form \(A \vx = \vzero\) are called homogeneous systems. For every nonhomogeneous system \(A \vx = \vb\) there is a corresponding homogeneous system \(A \vx = \vzero\text{,}\) and there is a useful connection between the solutions to the nonhomogeneous system and the corresponding homogeneous system. For example, consider the nonhomogeneous system

\begin{equation*} A \vx = \vb \end{equation*}

with

\begin{equation} A = \left[ \begin{array}{ccc} 1 \amp 1 \amp 2 \\ 1 \amp 2 \amp 1 \end{array} \right], \ \vx = \left[ \begin{array}{c} x_1 \\ x_2 \\ x_3 \end{array} \right], \ \text{ and } \ \vb = \left[ \begin{array}{r} 0 \\ -2 \end{array} \right]\text{.}\tag{5.2} \end{equation}

The augmented matrix representation of this system is \([A \ | \ \vb]\text{.}\) If we reduce this augmented matrix, we find

\begin{equation*} \left[ \begin{array}{ccr|r} 1 \amp 0 \amp 3 \amp 2 \\ 0 \amp 1 \amp -1 \amp -2 \end{array} \right]\text{.} \end{equation*}

From this RREF, we immediately see that the general solution is that \(x_3\) is free, \(x_2 = x_3-2\text{,}\) and \(x_1 = 2-3x_3\text{.}\) In vector form, we can represent this general solution as

\begin{equation} \left[ \begin{array}{c} x_1 \\ x_2 \\ x_3 \end{array} \right] = \left[ \begin{array}{c} 2-3x_3 \\ x_3-2 \\ x_3 \end{array} \right] = \left[ \begin{array}{r} 2 \\ -2 \\ 0 \end{array} \right] + x_3\left[ \begin{array}{r} -3 \\ 1 \\ 1 \end{array} \right]\text{.}\tag{5.3} \end{equation}

The rightmost expression above is called the parametric vector form of the solution. If we had a system where the general solution involved more than one free variable, then we would write the parametric vector form to include one vector multiplying each free variable. For example, if the general solution of a system were that \(x_2\) and \(x_3\) are free and \(x_1=2+x_2+3x_3\text{,}\) then the parametric vector form would be

\begin{equation*} \vx = \left[ \begin{array}{c} 2+ x_2+3x_3 \\ x_2 \\ x_3 \end{array} \right] = \left[ \begin{array}{c} 2 \\ 0 \\ 0 \end{array} \right] + x_2 \left[ \begin{array}{c} 1 \\ 1 \\ 0 \end{array} \right] + x_3 \left[ \begin{array}{c} 3 \\ 0 \\ 1 \end{array} \right] \,\text{.} \end{equation*}

Note that the parametric vector form expresses the solutions as a linear combination of a number of vectors, depending on the number of free variables, with an added constant vector. This expression helps us in interpreting the solution set geometrically, as we will see in this section.

(i)

Find the general solution to the homogeneous system

\begin{equation*} A \vx = \vzero \end{equation*}

with \(A\) and \(\vx\) as in (5.2) and compare it to the solution to the nonhomogeneous system in (5.3). What do you notice?

(ii)

Find the general solution to the nonhomogeneous system

\begin{equation*} A \vx = \vb \end{equation*}

with

\begin{equation*} A = \left[ \begin{array}{ccr} 1\amp 2\amp -1 \\ 2\amp 4\amp -2 \end{array} \right], \ \vx = \left[ \begin{array}{c} x_1 \\ x_2 \\ x_3 \end{array} \right], \ \text{ and } \ \vb = \left[ \begin{array}{r} -1 \\ -2 \end{array} \right] \end{equation*}

and express it in parametric vector form. Then find the general solution to the corresponding homogeneous system and express it in parametric vector form. How are the two solution sets related?

(iii)

Make a conjecture about the relationship between the solutions to a consistent nonhomogeneous system \(A \vx = \vb\) and the corresponding homogeneous system \(A \vx = \vzero\text{.}\) Be as specific as possible.
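Reading a parametric vector form off a reduced row echelon form, as was done to obtain (5.3), is mechanical enough to sketch in code. The following Python sketch is illustrative only: the function name `parametric_vector_form` and its return format are assumptions made here, not notation from this section, and the function assumes that its input is already the RREF of the augmented matrix of a consistent system. Variables are indexed from 0 in the code, so index 2 corresponds to \(x_3\text{.}\)

```python
from fractions import Fraction


def parametric_vector_form(R):
    """Read the parametric vector form off R, the RREF of an augmented matrix [A | b].

    Returns (p, directions): p is the constant vector, and directions maps the
    0-based index of each free variable to the vector that multiplies it.
    """
    n = len(R[0]) - 1                       # number of unknowns
    pivot_col_of_row = {}
    for i, row in enumerate(R):
        lead = next((j for j in range(n) if row[j] != 0), None)
        if lead is not None:
            pivot_col_of_row[i] = lead      # leading entry of row i sits in column `lead`
        elif row[n] != 0:
            raise ValueError("row of the form 0 = b with b nonzero: inconsistent system")
    pivot_cols = set(pivot_col_of_row.values())
    free_cols = [j for j in range(n) if j not in pivot_cols]

    p = [Fraction(0)] * n
    directions = {f: [Fraction(0)] * n for f in free_cols}
    for f in free_cols:
        directions[f][f] = Fraction(1)      # the free variable equals itself
    for i, lead in pivot_col_of_row.items():
        p[lead] = Fraction(R[i][n])         # constant term for this basic variable
        for f in free_cols:
            # Moving the free-variable term to the right side flips its sign.
            directions[f][lead] = -Fraction(R[i][f])
    return p, directions


# RREF of the augmented matrix for the system in (5.2).
p, directions = parametric_vector_form([[1, 0, 3, 2], [0, 1, -1, -2]])
print([str(entry) for entry in p])               # ['2', '-2', '0']
print([str(entry) for entry in directions[2]])   # ['-3', '1', '1']
```

The output reproduces the two vectors on the right side of (5.3). Exact fractions are used so that the same sketch also works when the RREF contains non-integer entries.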

Subsection The Matrix-Vector Product

The matrix-vector product we defined in Preview Activity 5.1 for a specific example generalizes in a very straightforward manner, and provides a convenient way to represent a system of linear equations of any size using matrices and vectors. In addition to providing us with an algebraic approach to solving systems via matrices and vectors — leading to a powerful geometric relationship between solution sets of homogeneous and non-homogeneous systems — this representation allows us to think of a linear system from a dynamic perspective, as we will see later in the section on matrix transformations.

The matrix-vector product \(A \vx\) is a linear combination of the columns of \(A\) with weights from \(\vx\text{.}\) To define this product in general, we will need a little notation. Recall that a matrix is made of rows and columns — the entries reading from left to right form the rows of the matrix and the entries reading from top to bottom form the columns. For example, the matrix

\begin{equation*} A = \left[ \begin{array}{cccc} 1 \amp 2 \amp 3 \amp 4 \\ 5 \amp 6 \amp 7 \amp 8 \\ 9 \amp 10 \amp 11 \amp 12 \end{array} \right] \end{equation*}

has three rows and four columns. The number of rows and columns of a matrix is called the size of the matrix, so \(A\) is a 3 by 4 matrix (also written as \(3 \times 4\)). We often need to have a way to reference the individual entries of a matrix \(A\text{,}\) and to do so we typically give a label, say \(a_{ij}\) to the entry in the \(i\)th row and \(j\)th column of \(A\text{.}\) So in our example we have \(a_{23}=7\text{.}\) We also write \(A = [a_{ij}]\) to indicate a matrix whose \(i,j\)th entry is \(a_{ij}\text{.}\) At times it is convenient to write a matrix in terms of its rows or columns. If \(A = [a_{ij}]\) is an \(m \times n\) matrix, then we will write

\begin{equation*} A = \left[ \begin{array}{ccccc} a_{11} \amp a_{12} \amp \cdots \amp a_{1,n-1} \amp a_{1n} \\ a_{21} \amp a_{22} \amp \cdots \amp a_{2,n-1} \amp a_{2n} \\ \vdots \amp \amp \ddots \amp \amp \vdots \\ a_{m1} \amp a_{m2} \amp \cdots \amp a_{m,n-1} \amp a_{mn} \end{array} \right] \end{equation*}

or, if we let \(\vr_1, \vr_2, \ldots, \vr_m\) denote the rows of the matrix \(A\text{,}\) then we can write \(A\) as

\begin{equation*} A = \left[ \begin{array}{c} \vr_1 \\ \vr_2 \\ \vdots \\ \vr_m \end{array} \right]\text{.} \end{equation*}

We can also write \(A\) in terms of its columns, \(\vc_1, \vc_2, \ldots, \vc_n\text{,}\) as

\begin{equation*} A = [\vc_1 \ \vc_2 \ \cdots \ \vc_n]\text{.} \end{equation*}

In general, the product of a matrix with a vector is defined as follows.

Definition 5.1.

Let \(A\) be an \(m \times n\) matrix with columns \(\vc_1\text{,}\) \(\vc_2\text{,}\) \(\ldots\text{,}\) \(\vc_n\text{,}\) and let \(\vx = \left[ \begin{array}{c} x_1 \\ x_2 \\ \vdots \\ x_n \end{array} \right]\) be a vector in \(\R^n\text{.}\) The matrix-vector product \(A\vx\) is

\begin{equation*} A \vx = x_1 \vc_1 + x_2 \vc_2 + \cdots + x_n \vc_n\text{.} \end{equation*}

Important Note.

The matrix-vector product \(A \vx\) is defined only when the number of entries of the vector \(\vx\) is equal to the number of columns of the matrix \(A\text{.}\) That is, if \(A\) is an \(m \times n\) matrix, then \(A\vx\) is defined only if \(\vx\) is a column vector with \(n\) entries.
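Definition 5.1 and the note above translate directly into a short computation. The Python sketch below is one possible illustration, not part of the text's development: the function name `mat_vec` and the choice to store a matrix as a list of rows are assumptions made here.

```python
def mat_vec(A, x):
    """Return A x as the linear combination x_1 c_1 + ... + x_n c_n of the columns of A.

    A is a list of m rows, each a list of n numbers; x is a list of numbers.
    """
    m, n = len(A), len(A[0])
    if len(x) != n:
        # The product is defined only when x has one entry per column of A.
        raise ValueError(f"A has {n} columns but x has {len(x)} entries")
    result = [0] * m
    for j in range(n):
        column_j = [A[i][j] for i in range(m)]    # c_j, the j-th column of A
        for i in range(m):
            result[i] += x[j] * column_j[i]       # add the term x_j * c_j
    return result


A = [[1, 2], [3, 1], [1, 1]]
print(mat_vec(A, [3, 4]))   # [11, 13, 7]
```

The printed vector agrees with the product \(A\vv\) computed by hand in the introduction. Calling `mat_vec(A, [1, 2, 3])` raises an error, reflecting the fact that the product is not defined when the vector has three entries and the matrix has only two columns.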

Subsection The Matrix-Vector Form of a Linear System

As we saw in Preview Activity 5.1, the matrix-vector product provides us with a shorthand way of representing a system of linear equations. In general, every linear system can be written in matrix-vector form as follows.

The linear system

\begin{alignat*}{5} {a_{11}}x_1 \amp {}+{} \amp {a_{12}}x_2 \amp {}+{} \amp \cdots \amp {}+{} \amp {a_{1n}}x_n \amp {}={} \amp b_1\\ {a_{21}}x_1 \amp {}+{} \amp {a_{22}}x_2 \amp {}+{} \amp \cdots \amp {}+{} \amp {a_{2n}}x_n \amp {}={} \amp b_2\\ {} \amp {} \amp {} \amp {} \amp \vdots \ \ \amp {} \amp {} \amp {} \amp {}\\ {a_{m1}}x_1 \amp {}+{} \amp {a_{m2}}x_2 \amp {}+{} \amp \cdots \amp {}+{} \amp {a_{mn}}x_n \amp {}={} \amp b_m \end{alignat*}

of \(m\) equations in \(n\) unknowns can be written in matrix-vector form as \(A\vx = \vb\text{,}\) where

\begin{equation*} A = \left[ \begin{array}{cccc} a_{11} \amp a_{12} \amp \cdots \amp a_{1n} \\ a_{21} \amp a_{22} \amp \cdots \amp a_{2n} \\ \vdots \amp \amp \ddots \amp \vdots \\ a_{m1} \amp a_{m2} \amp \cdots \amp a_{mn} \end{array} \right], \ \ \ \vx = \left[ \begin{array}{c} x_1 \\ x_2 \\ \vdots \\ x_n \end{array} \right], \ \ \ \text{ and } \ \ \ \vb = \left[ \begin{array}{c} b_1 \\ b_2 \\ \vdots \\ b_m \end{array} \right]\text{.} \end{equation*}

This general system can also be written in the vector form

\begin{equation*} x_1 \left[ \begin{array}{c} a_{11} \\ a_{21} \\ \vdots \\ a_{m1} \end{array} \right] + x_2 \left[ \begin{array}{c} a_{12} \\ a_{22} \\ \vdots \\ a_{m2} \end{array} \right] + \cdots + x_n \left[ \begin{array}{c} a_{1n} \\ a_{2n} \\ \vdots \\ a_{mn} \end{array} \right] = \left[ \begin{array}{c} b_1 \\ b_2 \\ \vdots \\ b_m \end{array} \right]\text{.} \end{equation*}

With this last representation, we now have four different ways to represent a system of linear equations (as a system of linear equations, as an augmented matrix, in vector equation form, and in matrix-vector equation form), and it is important to be able to translate between them. As an example, the system

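\begin{alignat*}{5} x_1 \amp {}+{} \amp {4}x_2 \amp {}+{} \amp {2}x_3 \amp {}+{} \amp {4}x_4 \amp {}={} \amp 1\\ {2}x_1 \amp {}-{} \amp {}x_2 \amp {}-{} \amp {5}x_3 \amp {}-{} \amp {}x_4 \amp {}={} \amp 2\\ {3}x_1 \amp {}+{} \amp {7}x_2 \amp {}+{} \amp {}x_3 \amp {}+{} \amp {7}x_4 \amp {}={} \amp 3 \end{alignat*}

from the introduction to this section has corresponding augmented matrix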

\begin{equation*} \left[ \begin{array}{crrr|c} 1\amp 4\amp 2\amp 4\amp 1 \\ 2\amp -1\amp -5\amp -1\amp 2 \\ 3\amp 7\amp 1\amp 7\amp 3 \end{array} \right]\text{,} \end{equation*}

is expressed in vector form as

\begin{equation*} x_1\left[ \begin{array}{c} 1\\2\\3 \end{array} \right] + x_2\left[ \begin{array}{r} 4\\-1\\7 \end{array} \right] + x_3\left[ \begin{array}{r} 2\\-5\\1 \end{array} \right] + x_4\left[ \begin{array}{r} 4\\-1\\7 \end{array} \right]= \left[ \begin{array}{c} 1\\2\\3 \end{array} \right]\text{,} \end{equation*}

and has matrix-vector form

\begin{equation*} \left[ \begin{array}{crrr} 1\amp 4\amp 2\amp 4 \\ 2\amp -1\amp -5\amp -1 \\ 3\amp 7\amp 1\amp 7 \end{array} \right] \left[ \begin{array}{c} x_1 \\x_2 \\ x_3 \\ x_4 \end{array} \right] = \left[ \begin{array}{c} 1 \\ 2 \\ 3 \end{array} \right]\text{.} \end{equation*}
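Translating between these representations amounts to regrouping the same numbers. As a minimal sketch in Python (the helper names `columns` and `augmented` are hypothetical, and a matrix is assumed to be stored as a list of rows), the coefficient matrix and right hand side of the matrix-vector form yield the vectors of the vector form and the augmented matrix as follows.

```python
def columns(A):
    """Return the columns of A (given as a list of rows) as a list of lists."""
    return [list(col) for col in zip(*A)]


def augmented(A, b):
    """Return the augmented matrix [A | b] as a list of rows."""
    if len(A) != len(b):
        raise ValueError("b needs one entry per row of A")
    return [row + [b_i] for row, b_i in zip(A, b)]


# Coefficient matrix and right hand side of the system from the introduction.
A = [[1, 4, 2, 4], [2, -1, -5, -1], [3, 7, 1, 7]]
b = [1, 2, 3]

print(columns(A))       # [[1, 2, 3], [4, -1, 7], [2, -5, 1], [4, -1, 7]]
print(augmented(A, b))  # [[1, 4, 2, 4, 1], [2, -1, -5, -1, 2], [3, 7, 1, 7, 3]]
```

The first output lists the vectors that multiply \(x_1, x_2, x_3, x_4\) in the vector form, and the second lists the rows of the augmented matrix.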

Activity 5.2.

In this activity, we will use the equivalence of the different representations of a system to make useful observations about when a system represented as \(A\vx=\vb\) has a solution.

(a)

Consider the system

\begin{equation*} \left[ \begin{array}{ccr} 1\amp 2\amp -1 \\ 2\amp 1\amp 3 \end{array} \right] \left[ \begin{array}{c} x_1 \\ x_2 \\ x_3 \end{array} \right] = \left[ \begin{array}{c} 2 \\ 6 \end{array} \right]\text{.} \end{equation*}

Write the matrix-vector product on the left side of this equation as a linear combination of the columns of the coefficient matrix. Find weights that make the vector \(\left[ \begin{array}{c} 2 \\ 6 \end{array} \right]\) a linear combination of the columns of the coefficient matrix.

(b)

From this point on we consider the general case where \(A\) is an \(m \times n\) matrix. Use the vector equation representation to explain why the system \(A \vx = \vb\) has a solution if and only if \(\vb\) is a linear combination of the columns of \(A\text{.}\) (Note that ‘if and only if’ means that each statement implies the other: if either one is true, then the other must also be true.) (Hint: Compare to what you did in part (a).)

(c)

Use part (b) and the definition of span to explain why the system \(A \vx = \vb\) has a solution if and only if the vector \(\vb\) is in the span of the columns of \(A\text{.}\)

(d)

Use part (c) to explain why the system \(A \vx = \vb\) always has a solution for any vector \(\vb\) in \(\R^m\) if and only if the span of the columns of \(A\) is all of \(\R^m\text{.}\)

(e)

Use the augmented matrix representation and the criterion for a consistent system to explain why the system \(A \vx = \vb\) is consistent for all vectors \(\vb\) if and only if \(A\) has a pivot position in every row.

We summarize our observations from the above activity in the following theorem.

Theorem 5.2.

Let \(A\) be an \(m \times n\) matrix. The following statements are equivalent:

The matrix equation \(A \vx = \vb\) has a solution for every vector \(\vb\) in \(\R^m\text{.}\)
Every vector \(\vb\) in \(\R^m\) can be written as a linear combination of the columns of \(A\text{.}\)
The span of the columns of \(A\) is \(\R^m\text{.}\)
The matrix \(A\) has a pivot position in each row.

In the future, if we need to determine whether a system has a solution for every \(\vb\text{,}\) we can refer to this theorem without having to argue our reasoning from scratch.
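The pivot criterion in Theorem 5.2 can be checked by machine. The sketch below is one way to do so in Python, under assumptions made here: the function names `count_pivots` and `consistent_for_every_b` are hypothetical, the matrix is stored as a list of rows, and exact fractions are used so that no rounding error can hide or create a pivot.

```python
from fractions import Fraction


def count_pivots(A):
    """Row reduce a copy of A to echelon form and return the number of pivot positions."""
    M = [[Fraction(entry) for entry in row] for row in A]
    m, n = len(M), len(M[0])
    pivots = 0                       # also the index of the next row that needs a pivot
    for col in range(n):
        # Look for a nonzero entry in this column at or below row `pivots`.
        pivot_row = next((r for r in range(pivots, m) if M[r][col] != 0), None)
        if pivot_row is None:
            continue                 # no pivot in this column
        M[pivots], M[pivot_row] = M[pivot_row], M[pivots]
        for r in range(pivots + 1, m):
            factor = M[r][col] / M[pivots][col]
            M[r] = [a - factor * p for a, p in zip(M[r], M[pivots])]
        pivots += 1
        if pivots == m:
            break                    # every row already has a pivot
    return pivots


def consistent_for_every_b(A):
    """True exactly when A has a pivot position in every row (Theorem 5.2)."""
    return count_pivots(A) == len(A)


print(consistent_for_every_b([[1, 2, -1], [2, 1, 3]]))      # True
print(consistent_for_every_b([[1, -3], [-3, 9], [-1, 3]]))  # False
```

The first matrix is the coefficient matrix from Activity 5.2 (a). The second has three rows but only one pivot, so there are vectors \(\vb\) in \(\R^3\) for which \(A \vx = \vb\) has no solution.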

Subsection Properties of the Matrix-Vector Product

As before, whenever we introduce a new operation (here, the matrix-vector product), we should ask what properties it has.

Activity 5.3.

In this activity, we consider whether the matrix-vector product distributes over vector addition. In other words: Is \(A(\vu + \vv)\) equal to \(A\vu + A\vv\text{?}\)

We work with arbitrary vectors \(\vu, \vv\) in \(\R^3\) and an arbitrary matrix \(A\) with 3 columns (so that \(A\vu\) and \(A\vv\) are defined) to simplify notation. Let \(A = [\vc_1 \ \vc_2 \ \vc_3]\) (note that each \(\vc_i\) represents a column of \(A\)), \(\vu = \left[ \begin{array}{c} u_1 \\ u_2 \\ u_3 \end{array} \right]\text{,}\) and \(\vv = \left[ \begin{array}{c} v_1 \\ v_2 \\ v_3 \end{array} \right]\text{.}\) Use the definition of the matrix-vector product along with the properties of vector operations to show that

\begin{equation*} A(\vu + \vv)= A\vu + A\vv\text{.} \end{equation*}

Similar arguments using the definition of matrix-vector product along with the properties of vector operations can be used to show the following theorem:

Theorem 5.3.

Let \(A\) be an \(m \times n\) matrix, \(\vu\) and \(\vv\) \(n \times 1\) vectors, and \(c\) a scalar. Then

\(\displaystyle A(\vu + \vv) = A\vu + A\vv\)
\(\displaystyle c(A\vv) = A(c\vv)\)
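A numerical spot check can make the two properties in Theorem 5.3 concrete, although a check on one example is of course not a proof. In the Python sketch below, the function name `mat_vec` and the particular matrix, vectors, and scalar are choices made here purely for illustration.

```python
def mat_vec(A, x):
    """Return A x; entry i of x_1 c_1 + ... + x_n c_n is the sum of a_ij * x_j over j."""
    if len(x) != len(A[0]):
        raise ValueError("x must have one entry per column of A")
    return [sum(a_ij * x_j for a_ij, x_j in zip(row, x)) for row in A]


A = [[1, 2, -1], [2, 1, 3]]
u = [1, 0, 2]
v = [-3, 4, 1]
c = 5

left = mat_vec(A, [u_j + v_j for u_j, v_j in zip(u, v)])          # A(u + v)
right = [s + t for s, t in zip(mat_vec(A, u), mat_vec(A, v))]     # Au + Av
print(left, right)                    # [3, 9] [3, 9]

scaled_after = [c * entry for entry in mat_vec(A, v)]             # c(Av)
scaled_before = mat_vec(A, [c * v_j for v_j in v])                # A(cv)
print(scaled_after, scaled_before)    # [20, 5] [20, 5]
```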

Subsection Homogeneous and Nonhomogeneous Systems

As we saw before, the systems with all the right hand side constants being 0 are special in that they always have a solution. (Why?) So we might consider grouping systems into two types: Those of the form \(A \vx = \vb\text{,}\) where not all of the entries of the vector \(\vb\) are \(0\text{,}\) and those of the form \(A \vx = \vzero\text{,}\) where \(\vzero\) is the vector of all zeros. Systems like \(A \vx = \vb\text{,}\) where \(\vb\) contains at least one non-zero entry, are called nonhomogeneous systems, and systems of the form \(A \vx = \vzero\) are called homogeneous systems. For every nonhomogeneous system \(A \vx = \vb\) there is a corresponding homogeneous system \(A \vx = \vzero\text{.}\) We now investigate the connection between the solutions to the nonhomogeneous system and the corresponding homogeneous system.

Activity 5.4.

In this activity we will consider the relationship between the solution sets of nonhomogeneous systems and those of the corresponding homogeneous systems.

(a)

Find the solution sets of the system

\begin{equation*} A \vx = \vb \end{equation*}

where

\begin{equation*} A = \left[ \begin{array}{ccc} 1 \amp 1 \amp 2 \\ 1 \amp 2 \amp 1 \end{array} \right], \ \vx = \left[ \begin{array}{c} x_1 \\ x_2 \\ x_3 \end{array} \right], \ \text{ and } \ \vb = \left[ \begin{array}{r} 0 \\ -2 \end{array} \right] \end{equation*}

and the corresponding homogeneous system (i.e., where we replace \(\vb\) with \(\vzero\)).

(b)

Find the solution sets of the system

\begin{equation*} A \vx = \vb \end{equation*}

where

and the corresponding homogeneous system.

(c)

What are the similarities/differences between solutions of the nonhomogeneous system and its homogeneous counterpart?

As we saw in the above activity, there is a relationship between the solutions of a nonhomogeneous system and those of the corresponding homogeneous system. Let us formalize this relationship. If the general solution of a system involves free variables, we can represent the solutions in parametric vector form to get a better idea of the geometry of the solution set. Suppose the solution is that \(x_3\) is free, \(x_2 = -2+x_3\text{,}\) and \(x_1 = 2-3x_3\text{.}\) In vector form, we can represent this general solution as

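\begin{equation*} \left[ \begin{array}{c} x_1 \\ x_2 \\ x_3 \end{array} \right] = \left[ \begin{array}{c} 2-3x_3 \\ -2+x_3 \\ x_3 \end{array} \right] = \left[ \begin{array}{r} 2 \\ -2 \\ 0 \end{array} \right] + x_3\left[ \begin{array}{r} -3 \\ 1 \\ 1 \end{array} \right]\text{.} \end{equation*}

From this representation, we see that the solution set is a line through the origin (formed by multiples of \(\left[ \begin{array}{r} -3 \\ 1 \\ 1 \end{array} \right]\)) shifted by the added vector \(\left[ \begin{array}{r} 2 \\ -2 \\ 0 \end{array} \right]\text{.}\) The solution set of the corresponding homogeneous system, on the other hand, does not have the shift.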

Algebraically, we see that every solution to the nonhomogeneous system \(A\vx = \vb\) can be written in the form \(\vp + \vv_h\text{,}\) where \(\vp\) is a particular solution to \(A\vx = \vb\) and \(\vv_h\) is a solution to the corresponding homogeneous system \(A\vx = \vzero\text{.}\)

To understand why this always happens, we will verify the result algebraically for an arbitrary \(A\) and \(\vb\text{.}\) Assuming that \(\vp\) is a particular solution to the nonhomogeneous system \(A\vx=\vb\text{,}\) we need to show that:

if \(\vv\) is an arbitrary solution to the nonhomogeneous system, then \(\vv = \vp + \vv_h\text{,}\) where \(\vv_h\) is some solution to the homogeneous system \(A\vx = \vzero\text{,}\) and
if \(\vv_h\) is an arbitrary solution to the homogeneous system, then \(\vp + \vv_h\) is a solution to the nonhomogeneous system.

To verify the first condition, suppose that \(\vv\) is a solution to the nonhomogeneous system \(A \vx = \vb\text{.}\) Since we want \(\vv=\vp+\vv_h\text{,}\) we need to verify that \(\vv-\vp\) is a solution for the homogeneous system so that we can assign \(\vv_h=\vv-\vp\text{.}\) Note that

\begin{equation*} A(\vv - \vp) = A\vv - A\vp = \vb - \vb = \vzero \,\text{,} \end{equation*}

using the properties of the matrix-vector product from Theorem 5.3. So \(\vv - \vp\) is a solution to the homogeneous system, and hence \(\vv\) is of the form \(\vp + \vv_h\) with \(\vv_h = \vv - \vp\text{.}\)

To verify the second condition, consider a vector of the form \(\vp + \vv_h\text{,}\) where \(\vv_h\) is a homogeneous solution. We have

\begin{equation*} A(\vp + \vv_h) = A\vp + A\vv_h = \vb + \vzero = \vb\text{,} \end{equation*}

and so \(\vp + \vv_h\) is a solution to \(A \vx = \vb\text{.}\)

Our work above proves the following theorem.

Theorem 5.4.

Suppose the equation \(A\vx=\vb\) is consistent for some \(\vb\) and \(\vp\) is a solution. Then the solution set of \(A\vx=\vb\) consists of all vectors of the form \(\vv=\vp+\vv_h\) where \(\vv_h\) is a solution to \(A\vx=\vzero\text{.}\)
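Theorem 5.4 can be illustrated on the system from (5.2), whose general solution appears in (5.3). The Python sketch below is only a spot check for a handful of parameter values, not a substitute for the proof above; the helper name `is_solution` is hypothetical.

```python
def is_solution(A, x, b):
    """Check whether A x = b, computing A x entry by entry."""
    if len(x) != len(A[0]) or len(b) != len(A):
        raise ValueError("sizes of A, x, and b do not match")
    Ax = [sum(a_ij * x_j for a_ij, x_j in zip(row, x)) for row in A]
    return Ax == b


A = [[1, 1, 2], [1, 2, 1]]
b = [0, -2]
p = [2, -2, 0]    # particular solution from (5.3)
d = [-3, 1, 1]    # vector multiplying the free variable x_3 in (5.3)

print(is_solution(A, p, b))        # True: p solves A x = b
print(is_solution(A, d, [0, 0]))   # True: d solves A x = 0

# Every vector p + t d should again solve A x = b.
for t in range(-2, 3):
    x = [p_i + t * d_i for p_i, d_i in zip(p, d)]
    print(t, x, is_solution(A, x, b))
```

Each line printed by the loop ends in `True`, as the theorem predicts for vectors of the form \(\vp + \vv_h\text{.}\)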

Subsection The Geometry of Solutions to the Homogeneous System

The solution set of the homogeneous system \(A \vx = \vzero\) has a simple geometric interpretation based on the number of free variables. By Theorem 5.4, this interpretation also determines the geometry of the solution set of the corresponding nonhomogeneous system \(A \vx = \vb\) (when it is consistent).

Activity 5.5.

In this activity we consider geometric interpretations of the solution sets of homogeneous and nonhomogeneous systems.

(a)

Consider the system \(A \vx = \vb\) where \(A = \left[ \begin{array}{rr} 1 \amp -3 \\ -3 \amp 9 \\ -1 \amp 3 \end{array} \right]\) and \(\vb = \left[ \begin{array}{r} 2 \\ -6 \\ -2 \end{array} \right]\text{.}\) The general solution to this system has the form \(\left[ \begin{array}{c} 2 \\ 0 \end{array} \right] + x_2\left[ \begin{array}{c} 3 \\ 1 \end{array} \right]\text{,}\) where \(x_2\) is any real number.

(i)

Let \(\vv = \left[ \begin{array}{c} 3 \\ 1 \end{array} \right]\text{.}\) What does the set of all vectors of the form \(x_2\vv\) look like geometrically? Draw a picture in \(\R^2\) to illustrate. (Recall that we refer to all the vectors of the form \(x_2\vv\) simply as \(\Span \{\vv\}\text{.}\))

(ii)

Let \(\vp = \left[ \begin{array}{r} 2 \\ 0 \end{array} \right]\text{.}\) What effect does adding the vector \(\vp\) to each vector in \(\Span \{\vv\}\) have on the geometry of \(\Span \{\vv\}\text{?}\) Finally, what does this mean about the geometry of the solution set to the nonhomogeneous system \(A \vx = \vb\text{?}\)

(b)

Consider the system \(A \vx = \vb\) where \(A =\left[ \begin{array}{ccr} 1\amp 2\amp -1 \\ 3\amp 6\amp -3 \end{array} \right]\) and \(\vb = \left[ \begin{array}{r} -2 \\ -6 \end{array} \right]\text{.}\) The general solution to this system has the form \(\left[ \begin{array}{r} -2 \\ 0 \\0 \end{array} \right] + x_2\left[ \begin{array}{r} -2 \\ 1 \\0 \end{array} \right] +x_3\left[ \begin{array}{c} 1 \\ 0 \\1 \end{array} \right]\text{,}\) where \(x_2, x_3\) are any real numbers.

(i)

Let \(\vu = \left[ \begin{array}{r} -2 \\ 1 \\0 \end{array} \right], \vv= \left[ \begin{array}{c} 1 \\ 0 \\1 \end{array} \right]\text{.}\) Use our results from Section 4 to determine the geometric shape of \(\Span \{\vu, \vv\}\text{,}\) the set of all vectors of the form \(x_2\left[ \begin{array}{r} -2 \\ 1 \\0 \end{array} \right] +x_3\left[ \begin{array}{c} 1 \\ 0 \\1 \end{array} \right]\text{,}\) where \(x_2, x_3\) are any real numbers.

(ii)

Let \(\vp = \left[ \begin{array}{r} -2 \\ 0 \\0 \end{array} \right]\text{.}\) What's the geometric effect of adding the vector \(\vp\) to each vector in \(\Span \{\vu, \vv\}\text{?}\) Finally, what does this mean about the geometry of the solution set to the nonhomogeneous system \(A \vx = \vb\text{?}\)

Our work in the above activity shows that the geometric shape of the solution set of a consistent nonhomogeneous system is the same as the geometric shape of the solution set of the corresponding homogeneous system. The only difference between the two solution sets is that one is a shifted version of the other.

Subsection Examples

What follows are worked examples that use the concepts from this section.

Example 5.5.

We now have several different ways to represent a system of linear equations. Rewrite the system in an equivalent form

\begin{alignat*}{6} {11}x_1 \amp {}+{} \amp {4}x_2 \amp {}-{} \amp {5}x_3 \amp {}-{} \amp {2}x_4 \amp = \amp {} \amp 63\amp {}\\ {15}x_1 \amp {}+{} \amp {5}x_2 \amp {}+{} \amp {2}x_3 \amp {}-{} \amp {2}x_4 \amp = \amp {} \amp 68\amp {}\\ {6}x_1 \amp {}+{} \amp {2}x_2 \amp {}+{} \amp {}x_3 \amp {}-{} \amp {}x_4 \amp = \amp {} \amp 26\amp {}\\ {9}x_1 \amp {}+{} \amp {3}x_2 \amp {}+{} \amp {2}x_3 \amp {}-{} \amp {}x_4 \amp = \amp {} \amp 40\amp {.} \end{alignat*}

(a)

as an augmented matrix

Solution.

The augmented matrix for this system is

\begin{equation*} \left[ \begin{array}{ccrr|c} 11\amp 4\amp -5\amp -2\amp 63 \\ 15\amp 5\amp 2\amp -2\amp 68 \\ 6\amp 2\amp 1\amp -1\amp 26 \\ 9\amp 3\amp 2\amp -1\amp 40 \end{array} \right]\text{.} \end{equation*}

(b)

as an equation involving a linear combination of vectors

Solution.

If we make vectors from the columns of the augmented matrix, we can write this system in vector form as

\begin{equation*} x_1 \left[ \begin{array}{c} 11 \\ 15 \\ 6 \\ 9 \end{array} \right] + x_2 \left[ \begin{array}{c} 4 \\ 5 \\ 2 \\ 3 \end{array} \right] + x_3 \left[ \begin{array}{r} -5 \\ 2\\ 1 \\ 2 \end{array} \right] + x_4 \left[ \begin{array}{r} -2\\ -2 \\ -1 \\ -1 \end{array} \right] = \left[ \begin{array}{c} 63 \\ 68 \\ 26 \\ 40 \end{array} \right]\text{.} \end{equation*}

(c)

using a matrix-vector product

Solution.

The coefficient matrix for this system is \(\left[ \begin{array}{ccrr} 11\amp 4\amp -5\amp -2 \\ 15\amp 5\amp 2\amp -2 \\ 6\amp 2\amp 1\amp -1 \\ 9\amp 3\amp 2\amp -1 \end{array} \right]\text{,}\) and the matrix-vector form of the system is

\begin{equation*} \left[ \begin{array}{ccrr} 11\amp 4\amp -5\amp -2 \\ 15\amp 5\amp 2\amp -2 \\ 6\amp 2\amp 1\amp -1 \\ 9\amp 3\amp 2\amp -1 \end{array} \right] \left[ \begin{array}{c} x_1\\x_2\\x_3\\x_4 \end{array} \right] = \left[ \begin{array}{c} 63 \\ 68 \\ 26 \\ 40 \end{array} \right]\text{.} \end{equation*}

(d)

Then solve the system.

Solution.

Using technology, we find that the reduced row echelon form of the augmented matrix for this system is

\begin{equation*} \left[ \begin{array}{cccc|r} 1\amp 0\amp 0\amp 0\amp 3 \\ 0\amp 1\amp 0\amp 0\amp 7 \\ 0\amp 0\amp 1\amp 0\amp -2 \\ 0\amp 0\amp 0\amp 1\amp 4 \end{array} \right]\text{.} \end{equation*}

So the solution to this system is \(x_1 = 3\text{,}\) \(x_2 = 7\text{,}\) \(x_3 = -2\text{,}\) and \(x_4 = 4\text{.}\)
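The phrase "using technology" can mean many things. As one possibility, the following Python sketch row reduces the augmented matrix from part (a) with exact fractions. The function name `rref` is our own choice rather than a reference to any particular software package, and the sketch makes no attempt at efficiency.

```python
from fractions import Fraction


def rref(M):
    """Return the reduced row echelon form of M (a list of rows), using exact fractions."""
    R = [[Fraction(entry) for entry in row] for row in M]
    m, n = len(R), len(R[0])
    row = 0                                    # row where the next pivot will be placed
    for col in range(n):
        if row == m:
            break
        pivot = next((r for r in range(row, m) if R[r][col] != 0), None)
        if pivot is None:
            continue                           # no pivot in this column
        R[row], R[pivot] = R[pivot], R[row]    # interchange rows
        scale = R[row][col]
        R[row] = [entry / scale for entry in R[row]]    # make the pivot entry 1
        for r in range(m):
            if r != row and R[r][col] != 0:
                factor = R[r][col]
                # Replace row r by itself minus a multiple of the pivot row.
                R[r] = [a - factor * p for a, p in zip(R[r], R[row])]
        row += 1
    return R


augmented_matrix = [
    [11, 4, -5, -2, 63],
    [15, 5, 2, -2, 68],
    [6, 2, 1, -1, 26],
    [9, 3, 2, -1, 40],
]
reduced = rref(augmented_matrix)
print([str(row[-1]) for row in reduced])   # ['3', '7', '-2', '4']
```

The last column of the reduced matrix gives the values of \(x_1\) through \(x_4\) found above.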

Example 5.6.

Consider the homogeneous system

\begin{alignat*}{5} {}x_1 \amp {}+{} \amp {8}x_2 \amp {}-{} \amp {}x_3 \amp = \amp {} \amp 0\amp {}\\ {}x_1 \amp {}-{} \amp {7}x_2 \amp {}+{} \amp {2}x_3 \amp = \amp {} \amp 0\amp {}\\ {3}x_1 \amp {}+{} \amp {4}x_2 \amp {}+{} \amp {}x_3 \amp = \amp {} \amp 0\amp {.} \end{alignat*}

(a)

Find the general solution to this homogeneous system and express the solution in parametric vector form.

Solution.

The augmented matrix of the homogeneous system is

\begin{equation*} \left[ \begin{array}{crr|c} 1\amp 8\amp -1\amp 0 \\ 1\amp -7\amp 2\amp 0 \\ 3\amp 4\amp 1\amp 0 \end{array} \right]\text{,} \end{equation*}

and the reduced row echelon form of this augmented matrix is

\begin{equation*} \left[ \begin{array}{ccr|c} 1\amp 0\amp \frac{3}{5}\amp 0 \\ 0\amp 1\amp -\frac{1}{5}\amp 0 \\ 0\amp 0\amp 0\amp 0 \end{array} \right]\text{.} \end{equation*}

Since there is no corresponding equation of the form \(0 = b\) for a nonzero constant \(b\text{,}\) this system is consistent. The third column contains no pivot, so the variable \(x_3\) is free, \(x_2 = \frac{1}{5}x_3\) and \(x_1 = -\frac{3}{5}x_3\text{.}\) In parametric vector form the general solution to the homogeneous system is

\begin{equation*} \left[\begin{array}{c} x_1\\x_2\\x_3 \end{array} \right] = \left[ \begin{array}{c} -\frac{3}{5}x_3\\ \frac{1}{5}x_3\\x_3 \end{array} \right] = x_3 \left[ \begin{array}{r} -\frac{3}{5}\\ \frac{1}{5}\\1 \end{array} \right]\text{.} \end{equation*}

(b)

Let \(A = \left[ \begin{array}{crr} 1\amp 8\amp -1 \\ 1\amp -7\amp 2 \\ 3\amp 4\amp 1 \end{array} \right]\text{,}\) and let \(\vb = \left[ \begin{array}{r} -6\\9\\2 \end{array} \right]\text{.}\) Show that \(\left[ \begin{array}{r} -1\\0\\5 \end{array} \right]\) is a solution to the non-homogeneous system \(A \vx = \vb\text{.}\)

Solution.

Since

\begin{align*} A \left[ \begin{array}{r} -1\\ 0\\ 5 \end{array} \right] \amp = (-1)\left[ \begin{array}{c} 1\\ 1\\ 3 \end{array} \right] + (0)\left[ \begin{array}{r} 8\\ -7\\ 4 \end{array} \right] + (5) \left[ \begin{array}{r} -1\\ 2\\ 1 \end{array} \right]\\ \amp = \left[ \begin{array}{c} -1-5\\ -1+10\\ -3+5 \end{array} \right] = \left[ \begin{array}{r} -6\\ 9\\ 2 \end{array} \right]\text{,} \end{align*}

we conclude that \(\left[ \begin{array}{r} -1\\0\\5 \end{array} \right]\) is a solution to the non-homogeneous system \(A \vx = \vb\text{.}\)

(c)

Use the results from parts (a) and (b) to write the parametric vector form of the general solution to the non-homogeneous system \(A \vx = \vb\text{.}\) (Do this without directly solving the system \(A \vx = \vb\text{.}\))

Solution.

We know that every solution to the non-homogeneous system \(A \vx = \vb\) has the form of the general solution to the homogeneous system plus a particular solution to the non-homogeneous system. Combining the results of (a) and (b) we see that the general solution to the non-homogeneous system \(A \vx = \vb\) is

\begin{equation*} \left[\begin{array}{c} x_1\\x_2\\x_3 \end{array} \right] = \left[\begin{array}{r} -1\\0\\5 \end{array} \right] + x_3 \left[ \begin{array}{r} -\frac{3}{5}\\ \frac{1}{5}\\1 \end{array} \right]\text{,} \end{equation*}

where \(x_3\) can be any real number.
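One way to gain confidence in this formula is to substitute a few values of the parameter and check that \(A \vx = \vb\) each time. The sketch below does this in Python with exact fractions, computing \(A \vx\) as a linear combination of the columns of \(A\text{.}\) The names `matvec` and `general_solution` and the sample parameter values are our own choices for illustration.

```python
from fractions import Fraction

def matvec(columns, x):
    """Compute A x as the linear combination of the columns of A with weights from x."""
    result = [Fraction(0)] * len(columns[0])
    for weight, column in zip(x, columns):
        result = [r + weight * c for r, c in zip(result, column)]
    return result

# Columns of A, the vector b, and the two vectors found in parts (a) and (b).
columns = [[1, 1, 3], [8, -7, 4], [-1, 2, 1]]
b = [-6, 9, 2]
particular = [-1, 0, 5]
homogeneous = [Fraction(-3, 5), Fraction(1, 5), 1]

def general_solution(t):
    """Particular solution plus t times the homogeneous direction vector."""
    return [p + t * h for p, h in zip(particular, homogeneous)]

# Check A x = b for a few arbitrarily chosen values of the parameter.
checks = [matvec(columns, general_solution(t)) == b
          for t in (-5, 0, 1, Fraction(7, 2))]
print(checks)
```

A finite number of checks is not a proof, of course. The general argument is that \(A(\vp + \vv_h) = A\vp + A\vv_h = \vb + \vzero\text{.}\)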

(d)

Describe what the general solution to the homogeneous system \(A \vx = \vzero\) and the general solution to the non-homogeneous system \(A \vx = \vb\) look like geometrically.

Solution.

The solution set of the homogeneous system \(A \vx = \vzero\) is the span of the vector \(\left[ \begin{array}{r} -\frac{3}{5}\\ \frac{1}{5}\\1 \end{array} \right]\text{.}\) Geometrically, this set of points is the line through the origin and the point \((-3, 1, 5)\) in \(\R^3\text{.}\) The solution set of the non-homogeneous system \(A \vx = \vb\) is the translation of the line through the origin and \((-3, 1, 5)\) by the vector \(\left[ \begin{array}{r} -1\\0\\5 \end{array} \right]\text{.}\) In other words, the solution set of the non-homogeneous system \(A \vx = \vb\) is the line in \(\R^3\) through the points \((-1,0,5)\) and \((-4,1,10)\text{.}\)

Subsection Summary

If \(A = [\vc_1 \ \vc_2 \ \cdots \ \vc_n]\) is an \(m \times n\) matrix with columns \(\vc_1\text{,}\) \(\vc_2\text{,}\) \(\ldots\text{,}\) \(\vc_n\text{,}\) and if \(\vx = \left[ \begin{array}{c} x_1\\x_2\\ \vdots \\ x_n \end{array} \right]\) is a vector in \(\R^n\text{,}\) then the matrix-vector product \(A \vx\) is defined to be the linear combination of the columns of \(A\) with corresponding weights from \(\vx\) — that is

\begin{equation*} A \vx = x_1 \vc_1 + x_2 \vc_2 + \cdots + x_n \vc_n\text{.} \end{equation*}
A linear system

\begin{alignat*}{5} {a_{11}}x_1 \amp {}+{} \amp {a_{12}}x_2 \amp {}+{} \amp \cdots \amp {}+{} \amp {a_{1n}}x_n \amp {}={} \amp b_1\\ {a_{21}}x_1 \amp {}+{} \amp {a_{22}}x_2 \amp {}+{} \amp \cdots \amp {}+{} \amp {a_{2n}}x_n \amp {}={} \amp b_2\\ {} \amp {} \amp {} \amp {} \amp \vdots \ \ \amp {} \amp {} \amp {} \amp {}\\ {a_{m1}}x_1 \amp {}+{} \amp {a_{m2}}x_2 \amp {}+{} \amp \cdots \amp {}+{} \amp {a_{mn}}x_n \amp {}={} \amp b_m \end{alignat*}

can be written in matrix form as

\begin{equation*} A \vx = \vb\text{,} \end{equation*}

where

\begin{equation*} A = \left[ \begin{array}{cccc} a_{11} \amp a_{12} \amp \cdots \amp a_{1n} \\ a_{21} \amp a_{22} \amp \cdots \amp a_{2n} \\ \vdots \amp \amp \ddots \amp \vdots \\ a_{m1} \amp a_{m2} \amp \cdots \amp a_{mn} \end{array} \right], \ \ \ \vx = \left[ \begin{array}{c} x_1 \\ x_2 \\ \vdots \\ x_n \end{array} \right], \ \ \ \text{ and } \ \ \ \vb = \left[ \begin{array}{c} b_1 \\ b_2 \\ \vdots \\ b_m \end{array} \right]\text{.} \end{equation*}
The matrix equation \(A \vx = \vb\) has a solution if and only if \(\vb\) is a linear combination of the columns of \(A\text{.}\)
The system \(A \vx = \vb\) is consistent for every vector \(\vb\) in \(\R^m\) if and only if every row of \(A\) contains a pivot position.
A homogeneous system is a system of the form \(A \vx = \vzero\) for some \(m \times n\) matrix \(A\text{.}\) Since the zero vector in \(\R^n\) satisfies \(A \vx = \vzero\text{,}\) a homogeneous system is always consistent.
A homogeneous system can have one or infinitely many different solutions. The homogeneous system \(A \vx = \vzero\) has exactly one solution if and only if each column of \(A\) is a pivot column.
The solutions to the consistent nonhomogeneous system \(A \vx = \vb\) have the form \(\vp + \vv_h\text{,}\) where \(\vp\) is a particular solution to the nonhomogeneous system \(A \vx = \vb\) and \(\vv_h\) is a solution to the homogeneous system \(A \vx = \vzero\text{.}\) In other words, the solution space to a consistent nonhomogeneous system \(A \vx = \vb\) is a translation of the solution space of the homogeneous system \(A \vx = \vzero\) by a particular solution to the nonhomogeneous system.

Finally, we established an important theorem.

Theorem 5.2.

Let \(A\) be an \(m \times n\) matrix. The following statements are equivalent.

The matrix equation \(A \vx = \vb\) has a solution for every vector \(\vb\) in \(\R^m\text{.}\)
Every vector \(\vb\) in \(\R^m\) can be written as a linear combination of the columns of \(A\text{.}\)
The span of the columns of \(A\) is \(\R^m\text{.}\)
The matrix \(A\) has a pivot position in each row.

We will continue to add to this theorem, so it is a good idea for you to begin now to remember the equivalent conditions of this theorem.
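The pivot condition in the last statement is easy to test by machine. The following sketch (our own illustration, with hypothetical function names) uses forward elimination with exact fractions to locate the pivot columns of a matrix, and then checks for a pivot in every row or in every column. It is applied to the \(3 \times 3\) matrix \(A\) from Example 5.6 and to the \(4 \times 4\) coefficient matrix from the example before it.

```python
from fractions import Fraction

def pivot_columns(rows):
    """Row reduce a matrix (a list of rows) to echelon form; return its pivot column indices."""
    m = [[Fraction(v) for v in row] for row in rows]
    pivots = []
    pivot_row = 0
    for col in range(len(m[0])):
        if pivot_row == len(m):
            break
        # Look for a nonzero entry in this column at or below the current pivot row.
        src = next((r for r in range(pivot_row, len(m)) if m[r][col] != 0), None)
        if src is None:
            continue  # this column has no pivot
        m[pivot_row], m[src] = m[src], m[pivot_row]
        # Eliminate the entries below the pivot.
        for r in range(pivot_row + 1, len(m)):
            factor = m[r][col] / m[pivot_row][col]
            m[r] = [a - factor * b for a, b in zip(m[r], m[pivot_row])]
        pivots.append(col)
        pivot_row += 1
    return pivots

def pivot_in_every_row(rows):
    """True when A x = b is consistent for every b (Theorem 5.2)."""
    return len(pivot_columns(rows)) == len(rows)

def pivot_in_every_column(rows):
    """True when A x = 0 has only the trivial solution."""
    return len(pivot_columns(rows)) == len(rows[0])

A_3x3 = [[1, 8, -1], [1, -7, 2], [3, 4, 1]]
A_4x4 = [[11, 4, -5, -2], [15, 5, 2, -2], [6, 2, 1, -1], [9, 3, 2, -1]]

print(pivot_columns(A_3x3), pivot_in_every_row(A_3x3))
print(pivot_columns(A_4x4), pivot_in_every_row(A_4x4))
```

For the \(3 \times 3\) matrix the third column has no pivot, which matches the free variable \(x_3\) found in Example 5.6. Note that the function is applied to the coefficient matrix, not to the augmented matrix.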

Exercises Exercises

1.

Write the system

\begin{alignat*}{5} x_1 \amp {}+{} \amp {2}x_2 \amp {}+{} \amp {2}x_3 \amp {}+{} \amp {}x_4 \amp {}={} \amp {}- \amp 1\\ x_1 \amp {}-{} \amp {8}x_2 \amp {}+{} \amp {3}x_3 \amp {}-{} \amp {9}x_4 \amp {}={} \amp {} \amp 2\\ {}x_1 \amp {}+{} \amp {6}x_2 \amp {}-{} \amp {4}x_3 \amp {}+{} \amp {12}x_4 \amp {}={} \amp {}- \amp 1 \end{alignat*}

in matrix-vector form. Explicitly identify the coefficient matrix and the vector of constants.

2.

Write the linear combination

\begin{equation*} x_1\left[ \begin{array}{c} 1\\5 \end{array} \right] + x_2\left[ \begin{array}{r} -3\\10 \end{array} \right] + x_3\left[ \begin{array}{c} 2\\2 \end{array} \right] \end{equation*}

as a matrix-vector product.

3.

Represent the following matrix-vector equation as a linear system and find its solution.

\begin{equation*} \left[ \begin{array}{crc} 2 \amp 3 \amp 4 \\ 1 \amp -2 \amp 3 \end{array} \right] \left[ \begin{array}{c} x_1 \\ x_2 \\ x_3 \end{array} \right] = \left[ \begin{array}{r} 4\\-6 \end{array} \right] \end{equation*}

4.

Represent the following matrix-vector equation as a linear system and find its solution.

\begin{equation*} \left[ \begin{array}{crr} 1 \amp -2 \amp -1 \\ 2 \amp 2 \amp -2 \\ 3 \amp 1 \amp 1 \end{array} \right] \left[ \begin{array}{c} x_1 \\ x_2 \\ x_3 \end{array} \right] = \left[ \begin{array}{r} 1\\ -4\\8 \end{array} \right] \end{equation*}

5.

Another way of defining the matrix-vector product uses the concept of the scalar product of vectors.¹⁰ Given a \(1 \times n\) matrix \(\vu = [u_1 \ u_2 \ \ldots \ u_n]\)¹¹ and an \(n \times 1\) vector \(\vv=\left[ \begin{array}{c} v_1\\ v_2\\ \vdots \\ v_n \end{array} \right]\text{,}\) we define the scalar product \(\vu \cdot \vv\) as

\begin{equation*} \vu \cdot \vv = u_1v_1 + u_2v_2 + u_3v_3 + \cdots + u_nv_n\text{.} \end{equation*}

We then define the matrix-vector product \(A\vx\) as the vector whose entries are the scalar products of the rows of \(A\) with \(\vx\text{.}\) As an example, if \(A= \left[ \begin{array}{crc} 2 \amp 3 \amp 4 \\ 1 \amp -2 \amp 3 \end{array} \right]\) and \(\vx = \left[ \begin{array}{c} x_1 \\ x_2 \\ x_3 \end{array} \right]\text{,}\) then

\begin{equation*} A \vx = \left[ \begin{array}{c} 2x_1+3x_2 +4x_3 \\ x_1 +(-2)x_2 + 3x_3 \end{array} \right]\text{.} \end{equation*}

Calculate the matrix-vector product \(A\vx\) where \(A=\left[ \begin{array}{cc} a \amp b\\ c\amp d \end{array} \right]\) and \(\vx=\left[ \begin{array}{c} x_1 \\ x_2 \end{array} \right]\) using both methods of finding the matrix-vector product to show that the two definitions are equivalent for size \(2\times 2\) matrices.

6.

Find the value of \(a\) such that

\begin{equation*} \left[ \begin{array}{crc} 1 \amp 2 \amp 2 \\1 \amp -1 \amp 3 \\1 \amp 2 \amp 4 \end{array} \right] \left[ \begin{array}{r} 1 \\ -1 \\ a \end{array} \right] = \left[ \begin{array}{r} * \\ -5 \\ * \end{array} \right] \end{equation*}

where \(*\)'s represent unknown values.

7.

Suppose we have

\begin{equation*} \left[ \begin{array}{rccc} 1 \amp 2 \amp 1 \amp 2\\-1 \amp 2 \amp 3 \amp 1\\2\amp 3 \amp 1 \amp a \end{array} \right] \left[ \begin{array}{r} 1 \\ 2 \\ -2 \\ 3 \end{array} \right] = \left[ \begin{array}{c} b_1 \\ b_2\\ b_3 \end{array} \right] \end{equation*}

where \(b_i\)'s represent unknown values.

(a)

In order to find the value of \(a\text{,}\) which of the \(b_i\)'s do we need to know? Why?

(b)

Suppose the \(b_i\)(s) that we need to know is(are) equal to 9. What is the value of \(a\text{?}\)

8.

Suppose we are given

\begin{equation*} A \vu = \left[ \begin{array}{c} 1 \\ 1 \end{array} \right] \; \text{ and } \; A \vv = \left[ \begin{array}{c} 1 \\ 3 \end{array} \right] \end{equation*}

for an unknown \(A\) and two unknown vectors \(\vu, \vv\) in \(\R^3\text{.}\) Using matrix-vector product properties, evaluate \(A\vw\) where \(\vw=2\vu-3\vv\text{.}\)

9.

Suppose we are given

\begin{equation*} A \left[ \begin{array}{c} 1 \\ 2\\ 1 \end{array} \right] = \left[ \begin{array}{c} 1 \\ 1 \end{array} \right] \; \text{ and } \; A \left[ \begin{array}{c} 1 \\ 0 \\2 \end{array} \right] = \left[ \begin{array}{c} 0 \\ 2 \end{array} \right] \,\text{.} \end{equation*}

After expressing \(\left[ \begin{array}{r} -1 \\ 6 \\ -5 \end{array} \right]\) as a linear combination of \(\left[ \begin{array}{c} 1 \\ 2\\ 1 \end{array} \right]\) and \(\left[ \begin{array}{c} 1 \\ 0 \\2 \end{array} \right]\text{,}\) use the matrix-vector product properties to determine \(A\left[ \begin{array}{r} -1 \\ 6 \\ -5 \end{array} \right]\text{.}\)

10.

(a)

The non-homogeneous system (with unknown constants \(a\) and \(b\))

\begin{alignat*}{4} {}x \amp {}+{} \amp {}y \amp {}-{} \amp {}z \amp {}={} \amp 2\\ {2}x \amp {}+{} \amp {a}y \amp {}+{} \amp {b}z \amp {}={} \amp 4 \end{alignat*}

has a solution which lies on the \(x\)-axis (i.e. \(y=z=0\)). Find this solution.

(b)

If the corresponding homogeneous system

\begin{alignat*}{4} {}x \amp {}+{} \amp {}y \amp {}-{} \amp {}z \amp {}={} \amp 0\\ {2}x \amp {}+{} \amp {a}y \amp {}+{} \amp {b}z \amp {}={} \amp 0 \end{alignat*}

has its general solution expressed in parametric vector form as \(z\cdot \left[ \begin{array}{c} 0\\1\\1 \end{array} \right]\text{,}\) find the general solution for the non-homogeneous system using your answer to part (a).

(c)

Find the conditions on \(a\) and \(b\) that make the system from (a) have the general solution you found in (b).

11.

Find the general solution to the non-homogeneous system

\begin{alignat*}{5} {}x \amp {}-{} \amp {2}y \amp {}+{} \amp {}z \amp {}={} \amp {}\amp 3\amp {}\\ {-2}x \amp {}+{} \amp {4}y \amp {}-{} \amp {2}z \amp {}={} \amp {-}\amp 6\amp {.} \end{alignat*}

Using the parametric vector form of the solutions, determine what the solution set to this non-homogeneous system looks like geometrically. Be as specific as possible. (Include information such as whether the solution set is a point, a line, or a plane, etc.; whether the solution set passes through the origin or is shifted from the origin in a specific direction by a specific number of units; and how the solution is related to the corresponding homogeneous system.)

12.

Come up with an example of a \(3\times 3\) matrix \(A\) for which the solution set of \(A\vx=\vzero\) is a line, and a \(3\times 3\) matrix \(A\) for which the solution set of \(A\vx=\vzero\) is a plane.

13.

Suppose we have three vectors \(\vv_1, \vv_2\) and \(\vv_3\) satisfying \(\vv_3=2\vv_1-\vv_2\text{.}\) Let \(A\) be the matrix with vectors \(\vv_1, \vv_2\) and \(\vv_3\) as the columns in that order. Find a non-zero \(\vx\) such that \(A\vx=\vzero\) using this information.

14.

Label each of the following statements as True or False. Provide justification for your response.

(a) True/False.

If the system \(A\vx = \vzero\) has infinitely many solutions, then so does the system \(A\vx=\vb\) for any right-hand-side \(\vb\text{.}\)

(b) True/False.

If \(\vx_1\) is a solution for \(A\vx=\vb_1\) and \(\vx_2\) is a solution for \(A\vx=\vb_2\text{,}\) then \(\vx_1+\vx_2\) is a solution for \(A\vx=\vb_1+\vb_2\text{.}\)

(c) True/False.

If an \(m \times n\) matrix \(A\) has a pivot in every row, then the equation \(A\vx=\vb\) has a unique solution for every \(\vb\text{.}\)

(d) True/False.

If an \(m \times n\) matrix \(A\) has a pivot in every row, then the equation \(A\vx=\vb\) has a solution for every \(\vb\text{.}\)

(e) True/False.

If \(A\) and \(B\) are row equivalent matrices and the columns of \(A\) span \(\R^m\text{,}\) then so do the columns of \(B\text{.}\)

(f) True/False.

All homogeneous systems have either a unique solution or infinitely many solutions.

(g) True/False.

If a linear system is not homogeneous, then the solution set does not include the origin.

(h) True/False.

If a solution set of a linear system does not include the origin, the system is not homogeneous.

(i) True/False.

If the system \(A\vx=\vb\) has a unique solution for some \(\vb\text{,}\) then the homogeneous system has only the trivial solution.

(j) True/False.

If \(A\) is a \(3 \times 4\) matrix, then the homogeneous equation \(A \vx = \vzero\) has non-trivial solutions.

(k) True/False.

If \(A\) is a \(3 \times 2\) matrix, then the homogeneous equation \(A \vx = \vzero\) has non-trivial solutions.

Subsection Project: Input-Output Models

There are two basic types of input-output models: closed and open. The closed model assumes that all goods produced are consumed within the economy — no trading takes place with outside entities. In the open model, goods produced within the economy can be traded outside the economy.

To work with a closed model, we use an example (from Input-Output Economics by Wassily Leontief). Assume a simple three-sector economy consisting of agriculture (growing wheat), manufacturing (producing cloth), and households (supplying labor). Each sector of the economy relies on goods from the other sectors to operate (e.g., people must eat to work and need to be clothed). To model the interactions between the sectors, we consider how many units of product are needed as input from one sector to another to produce one unit of product in the second sector. For example, assume the following:

to produce one unit (say, one dollar's worth) of agricultural goods requires 25% of a unit of agricultural output, 28% of a unit of manufacturing output, and 27% of a unit of household output;
to produce one unit of manufactured goods requires 20% of a unit of agricultural output, 60% of a unit of manufacturing output, and 60% of a unit of household output;
to produce one unit of household goods requires 55% of a unit of agricultural output, 12% of a unit of manufacturing output, and 13% of a unit of household output.

These assumptions are summarized in Table 5.7.

Table 5.7. Summary of simple three sector economy

into\from	Agriculture	Manufacture	Households
Agriculture	0.25	0.28	0.27
Manufacture	0.20	0.60	0.60
Households	0.55	0.12	0.13

This model is said to be closed because all goods produced are used up within the economy. If there are goods that are not used within the economy, the model is said to be open. Open models will be examined later.

The economist's goal is to determine what level of production in each sector meets the following requirements:

the production from each sector meets the needs of all of the sectors and
there is no overproduction.

Project Activity 5.6.

We can use techniques from linear algebra to determine the levels of production that precisely meet the two goals of the economist.

(a)

Suppose that the agricultural output is \(x_1\) units, the manufacturing output is \(x_2\) units, and the household output is \(x_3\) units. We represent this data as a production vector \(\left[ \begin{array}{c} x_1 \\ x_2 \\ x_3 \end{array} \right]\text{.}\) To produce a unit of agriculture requires \(0.25\) units of agriculture, \(0.28\) units of manufacturing, and \(0.27\) units of household. If \(x_1\) units of agriculture, \(x_2\) units of manufacturing, and \(x_3\) units of household products are produced, then agriculture can produce

\begin{equation*} 0.25x_1 + 0.28x_2 + 0.27x_3 \end{equation*}

units. In order to meet the needs of agriculture and for there to be no overproduction, we must then have

\begin{equation*} 0.25x_1 + 0.28x_2 + 0.27x_3 = x_1\text{.} \end{equation*}

Write similar equations for the manufacturing and household sectors of the economy.

(b)

Find the augmented matrix for the system of linear equations that represent production of the three sectors from part (a), and then solve the system to find the production levels that meet the economist's two goals.

(c)

Suppose the production level of the household sector is 200 million units (dollars). Find the production levels of the agricultural and manufacturing sectors that meet the economist's two goals.

In general, a matrix derived from a table like Table 5.7 is called a consumption matrix, which we will denote as \(C\text{.}\) (In the example discussed here \(C = \left[\begin{array}{ccc} 0.25 \amp 0.28 \amp 0.27 \\ 0.20 \amp 0.60 \amp 0.60 \\ 0.55 \amp 0.12 \amp 0.13 \end{array} \right]\text{.}\)) A consumption matrix \(C = [c_{ij}]\text{,}\) where \(c_{ij}\) represents the proportion of the output of sector \(j\) that is consumed by sector \(i\text{,}\) satisfies two important properties.

Since no sector can consume a negative amount or an amount that exceeds the output of another sector, we must have \(0 \leq c_{ij} \leq 1\) for all \(i\) and \(j\text{.}\)
If there are \(n\) sectors in the economy, the fact that all output is consumed within the economy implies that \(c_{1j}+c_{2j}+ \cdots + c_{nj} = 1\) for each \(j\text{.}\) In other words, the column sums of \(C\) are all 1.

In our example, if we let \(\vx = \left[\begin{array}{c} x_1 \\ x_2 \\ x_3 \end{array} \right]\text{,}\) then we can write the equations that guarantee that the production levels satisfy the economist's two goals in matrix form as

\begin{equation} \vx = C\vx\text{.}\tag{5.5} \end{equation}

Now we can rephrase the question to be answered: which production vectors \(\vx\) satisfy equation (5.5)? When \(C\vx = \vx\text{,}\) the system is in equilibrium; that is, output exactly meets needs. Any production vector \(\vx\) that satisfies (5.5) is called a steady state solution.
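To make the steady state condition concrete, here is a small sketch in Python. The two-sector consumption matrix in it is hypothetical (it is not the matrix from Table 5.7, which is left for the activity that follows), and the function names are our own. The product \(C\vx\) is computed row by row, and exact fractions are used so that the equality test is reliable.

```python
from fractions import Fraction as F

def matvec(matrix, x):
    """Compute the matrix-vector product one row at a time."""
    return [sum(entry * xi for entry, xi in zip(row, x)) for row in matrix]

def column_sums(matrix):
    """Return the sum of the entries in each column."""
    return [sum(column) for column in zip(*matrix)]

def is_steady_state(C, x):
    """Check whether the production vector x satisfies C x = x."""
    return matvec(C, x) == list(x)

# Hypothetical two-sector consumption matrix, chosen only for illustration.
C = [[F(1, 2), F(1, 4)],
     [F(1, 2), F(3, 4)]]

print(column_sums(C))               # each column should sum to 1 in a closed model
print(is_steady_state(C, [1, 2]))   # production levels in the ratio 1 : 2
print(is_steady_state(C, [1, 1]))   # equal production levels
```

In this hypothetical economy the vector with entries 1 and 2 is a steady state solution, while the vector with entries 1 and 1 is not. Any scalar multiple of a steady state solution is again a steady state solution, since \(C(k\vx) = k(C\vx)\text{.}\)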

Project Activity 5.7.

Is there a steady state solution for the closed system of Agriculture, Manufacturing, and Households? If so, find the general steady state solution. If not, explain why.

So far, we considered the case where the economic system was closed. This means that the industries that were part of the system sold products only to each other. However, if we want to represent the demand from other countries, from households, capital building, etc., we need an open model. In an article in Scientific American, Leontief organized the 1958 American economy into 81 sectors. The production of each of these sectors relied on production from all of the sectors. Here we present a small sample from Leontief's 81 sectors, using Petroleum, Textiles, Transportation, and Chemicals as our sectors of the economy. Leontief's model assumed that the production of 1 unit of output of

petroleum requires 0.1 unit of petroleum, 0.2 units of transportation, and 0.4 units of chemicals;
textiles requires 0.4 units of petroleum, 0.1 unit of textiles, 0.15 units of transportation, and 0.3 units of chemicals;
transportation requires 0.6 units of petroleum, 0.1 unit of transportation, and 0.25 units of chemicals;
chemicals requires 0.2 units of petroleum, 0.1 unit of textiles, 0.3 units of transportation, and 0.2 units of chemicals.

A summary of this information is in Table 5.8. Assume the units are measured in dollars.

Table 5.8. Summary of four sector economy

into\from	Petroleum	Textiles	Transportation	Chemicals
Petroleum	0.10	0.00	0.20	0.40
Textiles	0.40	0.10	0.15	0.30
Transportation	0.60	0.00	0.10	0.25
Chemicals	0.20	0.10	0.30	0.20

In the open model, there is another part of the economy, called the open sector, that does not produce goods or services but only consumes them. If this sector (think end consumers, for example) demands/consumes \(d_1\) units of Petroleum, \(d_2\) units of Textiles, \(d_3\) units of Transportation, and \(d_4\) units of Chemicals, we put this into a final demand vector \(\vd = \left[ \begin{array}{c} d_1 \\d_2 \\ d_3 \\ d_4 \end{array} \right]\text{.}\)

An economist would want to find the production level where the demand from the good/service producing sectors of the economy plus the final demand from the open sector exactly matches the output in each of the sectors. Let \(x_1\) represent the number of units of petroleum output, \(x_2\) the number of units of textiles output, \(x_3\) the number of units of transportation output, and \(x_4\) the number of units of chemical output during any time period. Then the production vector is \(\vx = \left[\begin{array}{c} x_1 \\ x_2 \\ x_3 \\x_4 \end{array} \right]\text{.}\) So an economist wants to find the production vectors \(\vx\) such that

\begin{alignat*}{6} {0.10}x_1 \amp {}{} \amp {} \amp {}+{} \amp {0.20}x_3 \amp {}+{} \amp {0.40}x_4 \amp {}+{} \amp {}d_1 \amp = x_1\amp {}\\ {0.40}x_1 \amp {}+{} \amp {0.10}x_2 \amp {}+{} \amp {0.15}x_3 \amp {}+{} \amp {0.30}x_4 \amp {}+{} \amp {}d_2 \amp = x_2\amp {}\\ {0.60}x_1 \amp {}{} \amp {} \amp {}+{} \amp {0.10}x_3 \amp {}+{} \amp {0.25}x_4 \amp {}+{} \amp {}d_3 \amp = x_3\amp {}\\ {0.20}x_1 \amp {}+{} \amp {0.10}x_2 \amp {}+{} \amp {0.30}x_3 \amp {}+{} \amp {0.20}x_4 \amp {}+{} \amp {}d_4 \amp = x_4\amp {,} \end{alignat*}

where \(\vd = \left[ \begin{array}{c} d_1 \\d_2 \\ d_3 \\ d_4 \end{array} \right]\) is the demand vector from the open market. The matrix

\begin{equation*} E = \left[ \begin{array}{cccc} 0.10 \amp 0.00 \amp 0.20 \amp 0.40 \\ 0.40 \amp 0.10 \amp 0.15 \amp 0.30 \\ 0.60 \amp 0.00 \amp 0.10 \amp 0.25 \\ 0.20 \amp 0.10 \amp 0.30 \amp 0.20 \end{array} \right] \end{equation*}

derived from Table 5.8, is called the exchange matrix.
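As a sketch of how this system relates to the forms studied earlier in the section (the rearrangement below is our own illustration), note that the equations above say \(E\vx + \vd = \vx\text{.}\) Subtracting the terms involving \(x_1\text{,}\) \(x_2\text{,}\) \(x_3\text{,}\) and \(x_4\) on the left from both sides of each equation puts all of the unknowns on one side and the demands on the other:

\begin{align*} 0.90x_1 - 0.20x_3 - 0.40x_4 \amp = d_1\\ -0.40x_1 + 0.90x_2 - 0.15x_3 - 0.30x_4 \amp = d_2\\ -0.60x_1 + 0.90x_3 - 0.25x_4 \amp = d_3\\ -0.20x_1 - 0.10x_2 - 0.30x_3 + 0.80x_4 \amp = d_4\text{.} \end{align*}

In this form the coefficients depend only on \(E\) and the right-hand side is the demand vector \(\vd\text{,}\) so questions about consistency can be approached by examining the pivots of the coefficient matrix.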

Project Activity 5.8.

(a)

Suppose the final demand vector in our four sector economy is \(\left[ \begin{array}{c} 500 \\200 \\ 400 \\ 100 \end{array} \right]\text{.}\) Find the production levels that satisfy our system.

(b)

Does this economy defined by the exchange matrix \(E\) have production levels that exactly meet internal and external demands regardless of the external demands? That is, does the system of equations displayed before this activity, which in matrix-vector form reads \(E \vx + \vd = \vx\text{,}\) have a solution regardless of the values of \(d_1\text{,}\) \(d_2\text{,}\) \(d_3\text{,}\) and \(d_4\text{?}\) Explain.

Technically, the rows of \(A\) are made from the entries of the row vectors, but we use this notation as a shorthand.

Note that some authors refer to the scalar product as the dot product.

We can identify a \(1 \times n\) matrix \(\vu = [u_1 \ u_2 \ \ldots \ u_n]\) with the \(n \times 1\) vector \(\vu=\left[ \begin{array}{c} u_1\\ u_2\\ \vdots \\ u_n \end{array} \right]\text{,}\) so we often refer to \([u_1 \ u_2 \ \ldots \ u_n]\) as a vector.