Section 2 The Matrix Representation of a Linear System
Focus Questions
By the end of this section, you should be able to give precise and thorough answers to the questions listed below. You may want to keep these questions in mind to focus your thoughts as you complete the section.
What is a matrix?
How do we associate a matrix to a system of linear equations?
What row operations can we perform on an augmented matrix of a linear system to solve the system of linear equations?
What are pivots, basic variables, and free variables?
How many solutions can a system of linear equations have?
When is a linear system consistent?
When does a linear system have infinitely many solutions? A unique solution?
How can we represent the set of solutions to a consistent system if the system has infinitely many solutions?
Subsection Application: Approximating Area Under a Curve
We know from basic geometry how to find areas of circles and triangles. However, it is much more difficult to find areas of other geometric objects. In fact, it is generally an impossible problem to determine the exact area bounded by a complicated curve. For this reason, approximation methods are used. One such method involves approximating curves using quadratic functions. Unless you have learned some calculus, you have probably never calculated the area under a parabola. In the ancient work Quadrature of the Parabola (3rd century BC), Archimedes determined a method for finding the area of a region bounded by a parabola by using mechanics and then by geometric methods. Once we know how to calculate the area of a region bounded by a parabola, Simpson's Rule uses parabolas to approximate a function, and then approximates the area under the graph of the graph of the function by using the areas under the parabolas. In order to use Simpsons Rule, we need to know how to exactly fit a quadratic function to three points. More details about this process can be found at the end of this section. This idea of fitting a polynomial to a set of data points has uses in other areas as well. For example, two common applications of BΓ©zier curves are font design and drawing tools. When fitting a polynomial to a large set of data points, our systems of equations can become quite large, and can be difficult to solve by hand. In this section we will see how to use matrices to more conveniently represent systems of equations of any size. We also consider how the elimination process works on the matrix representation of a linear system and how we can determine the existence of solutions and the form of solutions of a linear system.Subsection Introduction
When working with a linear system, the labels for the variables are irrelevant to the solution β the only thing that matters is the coefficients of the variables in the equations and the constants on the other side of the equations. For example, given a linear system of the formDefinition 2.1.
A matrix is a rectangular array of quantities or expressions.
Terminology.
There is some important terminology related to matrices.Any number in a matrix is called an entry of the matrix.
The collection of entries in an augmented matrix that corresponds to a given equation (that is reading the entries from left to right, or a horizontal set of entries) is called a row of the matrix. We number the rows from top to bottom in a matrix. For example,
is the first row and is the second row of the coefficient matrix of the system (2.1).The set of entries as we read from top to bottom (or a vertical set of entries that correspond to one fixed variable or the constants on the right hand sides of the equations) is called a column of the matrix. We number the columns from left to right in a matrix. For example,
is the first column and is the third column of the coefficient matrix of the system (2.1).The size of a matrix is given as
where is the number of rows and is the number of columns. The coefficient matrix above is a matrix since it has 3 rows and 3 columns, while the augmented matrix is a matrix as it has 4 columns.
Preview Activity 2.1.
(a)
Write the augmented matrix for the following linear system. If needed, rearrange an equation to ensure that the variables appear in the same order on the left side in each equation with the constants being on the right hand side of each equation.
(b)
Write the linear system in variables
(c)
Consider the three types of elementary operations on systems of equations introduced in Section 1. Each row of an augmented matrix of a system corresponds to an equation, so each elementary operation on equations corresponds to an operation on rows (called row operations).
(i)
Describe the row operation that corresponds to interchanging two equations.
(ii)
Describe the row operation that corresponds to multiplying an equation by a nonzero scalar.
(iii)
Describe the row operation that corresponds to replacing one equation by the sum of that equation and a scalar multiple of another equation.
Subsection Simplifying Linear Systems Represented in Matrix Form
Once we have stored the information about a linear system in an augmented matrix, we can perform the elementary operations directly on the augmented matrix. Recall that the allowable operations on a system of equations are the following:Replacing one equation by the sum of that equation and a scalar multiple of another equation.
Interchanging the positions of two equations.
Replacing an equation by a nonzero scalar multiple of itself.
Replacing one row by the sum of that row and a scalar multiple of another row.
Interchanging two rows.
Replacing a row by a nonzero scalar multiple of itself.
Activity 2.2.
Consider the system
with corresponding augmented matrix
(a)
As a first step in solving our system, we might eliminate
(b)
Now that we have eliminated the
(c)
Now you should have a system in which the last two rows correspond to a system of 2 linear equations in two unknowns. Use a row operation that adds a multiple of the second row to the third row to turn the coefficient of
(d)
Your simplified system and its augment matrix are in row echelon form and this system is solvable using back-substitution (substituting the known variable values into the previous equation to find the value of another variable). Solve the system.
Reflection 2.2.
Do you see how this standard elimination process can be generalized to any linear system with any number of variables to produce a simplified system? Do you see why the process does not change the solutions of the system? If needed, can you modify the standard elimination process to obtain a simplified system in which the last equation contains only the variable
Interchange equations if needed to ensure that the coefficient of
(or, more generally, the first non-zero variable) in the first equation is non-zero.Use the first equation to eliminate
(or, the first non-zero variable) from other equations by adding a multiple of the first equation to the others.After
is eliminated from all equations but the first equation, focus on the rest of the equations. Repeat the process of elimination on these equations to eliminate (or, the next non-zero variable) all but the second equation.Once the process of eliminating variables recursively is finished, solve for the variables in a backwards fashion starting with the last equation and substituting known values in the equations above as they become known.
Subsection Linear Systems with Infinitely Many Solutions
Each of the systems that we solved so far have had a unique (exactly one) solution. The geometric representation of linear systems with two equations in two variables shows that this does not always have to be the case. We also have linear systems with no solution and systems with infinitely many solutions. We now consider the problem of how to represent the set of solutions of a linear system that has infinitely many solutions. (Systems with infinitely many solutions will also be of special interest to us a bit later when we study eigenspaces of a matrix.)Activity 2.3.
Consider the system
(a)
Without explicitly solving the system, check that
(b)
Without explicitly solving the system, show that
(c)
Part (b) shows that our system has infinitely many solutions. We were given solutions in part (b) β but how do we find these solutions and how do we know that these are all of the solutions? We address those questions now. If we apply row operations to the augmented matrix
of this system, we can reduce this system to one with augmented matrix
(i)
What is it about this reduced form of the augmented matrix that indicates that the system has infinitely many solutions?
(ii)
Since the system has infinitely many solutions, we will not be able to explicitly determine values for each of the variables. Instead, at least one of the variables can be chosen arbitrarily. What is it about the reduced form of the augmented matrix that indicates that
(iii)
Letting
(iv)
Use the first row to show that
Definition 2.3.
The first non-zero coefficient (from the left) in an equation in a linear system after elimination is called a pivot. A variable corresponding to a pivot is a basic variable and while a variable not corresponding to a pivot is a free variable.
Activity 2.4.
Each matrix is an augmented matrix for a linear system after elimination. Identify the basic variables (if any) and free variables (if any). Then write the general solution (if there is a solution) expressing all variables in terms of the free variables. Use any symbols you like for the variables.
(a)
(b)
(c)
Reflection 2.4.
Does the existence of a row of 0's always mean a free variable? Can you think of an example where there is a row of 0's but none of the variables is free? How do the numbers of equations and the variables compare in that case?
Subsection Linear Systems with No Solutions
We saw in the previous section that geometrically two parallel and distinct lines represent a linear system with two equations in two unknowns which has no solution. Similarly, two parallel and distinct planes in three dimensions represent a linear system with two equations in three unknowns which has no solution. We can have at least four different geometric configurations of three planes in three dimensions representing a system with no solution. But how do these geometrical configurations manifest themselves algebraically?Activity 2.5.
Consider the linear system
(a)
Apply the elimination process to the augmented matrix of this system. Write the system of equations that corresponds to the final reduced matrix.
(b)
Discuss which feature in the final simplified system makes it easy to determine that the system has no solution. Similarly, what features in the matrix representation makes is easy to see the system has no solution?
Theorem 2.5.
A linear system is consistent if after the elimination process there is no equation of the form
Subsection Examples
What follows are worked examples that use the concepts from this section.Example 2.6.
Consider the linear system
(a)
Set up the augmented matrix for this linear system.
Solution.
The augmented matrix for this system is
(b)
Find all solutions to the system using forward elimination.
Solution.
We apply forward elimination, first making the entries below the 1 in the upper left all 0. We do this by replacing row two with row two minus 2 times row 1, row three with row three minus row 1, and row four with row four minus 4 row one. This produces the augmented matrix
Now we eliminate the leading 5 in the fourth row by replacing row four with row four minus row two to obtain the augmented matrix
When we replace row four with row four minus row three, we wind up with a row of zeros:
We see that there is no pivot in column four, so
The second row tells us that
Finally, the first row gives us
So this system has infinitely many solutions, with
and so this solution satisfies the first equation in our system. You should check to verify that it also satisfies the other three equations.
(c)
Suppose, after forward elimination, the augmented matrix of the system
has the form
For which values of
(i)
No solutions?
Solution.
The system has no solutions when there is an equation of the form
(ii)
A unique solution? Find the solution.
Solution.
When
(iii)
Infinitely many solution? Determine all solutions?
Solution.
When
Example 2.7.
After applying row operations to the augmented matrix of a system of linear equations, each of which describes a plane in 3-space, the following augmented matrix was obtained:
(a)
Describe, algebraically and geometrically, all solutions (if any), to this system when
Solution.
Throughout, we will let the variables
When
This matrix corresponds to the system
There are no equations of the form
(b)
Describe, algebraically and geometrically, all solutions (if any), to this system when
Solution.
Throughout, we will let the variables
When
The last row corresponds to the equation
(c)
Describe, algebraically and geometrically, all solutions (if any), to this system when
Solution.
Throughout, we will let the variables
When
There are no rows that correspond to equations of the form
Subsection Summary
A matrix is just a rectangular array of numbers or objects.
Given a system of linear equations, with the variables listed in the same order in each equation, we represent the system by writing the coefficients of the first equation as the first row of a matrix, the coefficients of the second equation as the second row, and so on. This creates the coefficient matrix of the system. We then augment the coefficient matrix with a column of the constants that appear in the equations. This gives us the augmented matrix of the system.
-
The operations that we can perform on equations translate exactly to row operations that we can perform on an augmented matrix:
Replacing one row by the sum of that row and a scalar multiple of another row.
Interchanging two rows.
Replacing a row by a nonzero scalar multiple of itself.
The forward elimination phase of the elimination method recursively eliminates the variables in a linear system to reach an equivalent but simplified system.
The first non-zero entry in an equation in a linear system after elimination is called a pivot.
A basic variable in a linear system corresponds to a pivot of the system. A free variable is a variable that is not basic.
A linear system can be inconsistent (no solutions), have a unique solution (if consistent and every variable is a basic variable), or have infinitely many solutions (if consistent and there is a free variable).
A linear system has no solutions if, after elimination, there is an equation of the form
where is a nonzero number.A linear system after the elimination method can be solved using back-substitution. The free variables can be chosen arbitrarily and the basic variables can be solved in terms of the free variables through the back-substitution process.
Exercises Exercises
1.
Consider the system of linear equations whose augmented matrix is
where
(a)
a unique solution,
(b)
infinitely many solutions,
(c)
no solution?
2.
Consider the following system:
Check that when
3.
If possible, find a system of three equations (not in reduced form) in three variables whose solution set consists only of the point
4.
What are the possible geometrical descriptions of the solution set of two linear equations in
5.
Two students are talking about when a linear system has infinitely many solutions.
Student 1: So, if we have a linear system whose augmented matrix has a row of zeros, then the system has infinitely many solutions, doesn't it?
Student 2: Well, but what if there is a row of the formwith a non-zero right above the row of 0's?
Student 1: OK, maybe I should ask βIf we have a consistent linear system whose augmented matrix has a row of zeros, then the system has infinitely many solutions, doesn't it?β
Student 2: I don't know. It still doesn't sound enough to me, but I'm not sure why.Is Student 1 right? Or is Student 2's hunch correct? Justify your answer with a specific example if possible.
6.
Label each of the following statements as True or False. Provide justification for your response.
(a) True/False.
A system of linear equations in two unknowns can have exactly five solutions.
(b) True/False.
A system of equations with all the right hand sides equal to 0 has at least one solution.
(c) True/False.
A system of equations where there are fewer equations than the number of unknowns (known as an underdetermined system) cannot have a unique solution.
(d) True/False.
A system of equations where there are more equations than the number of unknowns (known as an overdetermined system) cannot have a unique solution.
(e) True/False.
A consistent system of two equations in three unknowns cannot have a unique solution.
(f) True/False.
If a system with three equations and three unknowns has a solution, then the solution is unique.
(g) True/False.
If a system of equations has two different solutions, then it has infinitely many solutions.
(h) True/False.
If there is a row of zeros in the row echelon form of the augmented matrix of a system of equations, the system has infinitely many solutions.
(i) True/False.
If there is a row of zeros in the row echelon form of the augmented matrix of a system of
(j) True/False.
If a system has no free variables, then the system has a unique solution.
(k) True/False.
If a system has a free variable, then the system has infinitely many solutions.
Subsection Project: Polynomial Interpolation to Approximate the Area Under a Curve
Suppose we want to approximate the area of the region shown in Figure 2.8. As discussed in the introduction, we can approximate the area under a curve by approximating the curve by quadratics. First, we will see how Archimedes approached the problem of finding the area of a quadratic region, then we will determine how to determine a quadratic function that passes through three points, then we put it all together to approximate the area under a curve as in Figure 2.8.Project Activity 2.6.
In this activity we model the function geogebra.org/m/spd4hhbw
. Round all calculations in this activity to the nearest thousandth.
(a)
Set up a system of linear equations to fit a quadratic
(b)
Set up a system of linear equations to fit a quadratic
(c)
Set up a system of linear equations to fit a quadratic
(d)
Use the GeoGebra applet at geogebra.org/m/spd4hhbw
to graph the three quadratics on their intervals on the same axes as the graph of
Project Activity 2.7.
(a)
Set up a linear system that will determine the coefficients
(b)
Explain why the coefficient matrix of the system in Task 2.7.a is
Use these matrices to explain why
(c)
Our goal is to ultimately approximate the area under the curve on the interval
(d)
Now add up all of the area approximations on each subinterval to show that the approximate area under the graph is given by the formula
Project Activity 2.8.
Let