By the end of this section, you should be able to give precise and thorough answers to the questions listed below. You may want to keep these questions in mind to focus your thoughts as you complete the section.
How do we calculate the determinant of an matrix?
What is one important fact the determinant tells us about a matrix?
We could calculate this area, for example, by breaking up the parallelogram into two triangles and a rectangle and finding the area of each. Now consider the problem of calculating the volume of the three-dimensional analog (called a parallelepiped) determined by three vectors ,, and as illustrated at right in Figure 17.1.
It is quite a bit more difficult to break this parallelepiped into subregions whose volumes are easy to compute. However, all of these computations can be made quickly by using determinants. The details are later in this section.
We know that a non-zero vector is an eigenvector of an matrix if for some scalar . Note that this equation can be written as . Until now, we were given eigenvalues of matrices and have used the eigenvalues to find the eigenvectors. In this section we will learn an algebraic technique to find the eigenvalues ourselves. We will also be able to justify why an matrix has at most eigenvalues.
A scalar is an eigenvalue of if has a non-trivial solution , which happens if and only if is not invertible. In this section we will find a scalar whose value will tell us when a matrix is invertible and when it is not, and use this scalar to find the eigenvalues of a matrix.
One of the eigenvalues of is . Recall that we can rewrite the matrix equation in the form . What must be true about in order for 4 to be an eigenvalue of ? How does this relate to ?
To find the eigenvalues of the matrix , we rewrite the equation as . The coefficient matrix of this last system has the form . The determinant of this matrix is a quadratic expression in . Since the eigenvalues will occur when the determinant is 0, we need to solve a quadratic equation. Find the resulting eigenvalues. (Note: One of the eigenvalues is 2.)
Around 1900 or so determinants were deemed much more important than they are today. In fact, determinants were used even before matrices. According to Tucker 33 determinants (not matrices) developed out of the study of coefficients of systems of linear equations and were used by Leibniz 150 years before the term matrix was coined by J. J. Sylvester in 1848. Even though determinants are not as important as they once were, the determinant of a matrix is still a useful quantity. We saw in Preview Activity 17.1 that the determinant of a matrix tells us if the matrix is invertible and how it can help us find eigenvalues. In this section, we will see how to find the determinant of any size matrix and how to use this determinant to find the eigenvalues.
The determinant of a matrix is . The matrix is invertible if and only if . We will use a recursive approach to find the determinants of larger size matrices building from the determinants. We present the result in the case here — a more detailed analysis can be found at the end of this section.
This sum is called a cofactor expansion of the determinant of . The smaller matrices in this expansion are obtained by deleting certain rows and columns of the matrix . In general, when finding the determinant of an matrix, we find determinants of matrices, which we can again reduce to smaller matrices to calculate.
as an example in illustrating the cofactor expansion method in general.
We first pick a row or column of . We will pick the first row of for this example.
For each entry in the row (or column) we choose, in this case the first row, we will calculate the determinant of a smaller matrix obtained by removing the row and the column the entry is in. Let be the smaller matrix found by deleting the th row and th column of . For entry , we find the matrix obtained by removing first row and first column:
.
For entry , we find
.
Finally, for entry , we find
.
Notice that in the determinant formula in (17.1) above, the middle term had a (-) sign. The signs of the terms in the cofactor expansion alternate within each row and each column. More specifically, the sign of a term in the th row and th column is . We then obtain the following pattern of the signs within each row and column:
In particular, the sign factor for is , for is , and for is .
For each entry in the row (or column) of we chose, we multiply the entry by the determinant of and the sign . In this case, we obtain the following numbers
Note that in the last calculation, since , we did not have to evaluate the rest of the terms.
Finally, we find the determinant by adding all these values:
We will now define the determinant of a general matrix in terms of a cofactor expansion as we did in the case. To do so, we need some notation and terminology.
We let be the submatrix of found by deleting the th row and th column of . The determinant of is called the th minor of or the minor corresponding to the entry .
Notice that in the case, we used the opposite of the 1,2 minor in the sum. It will be the case that the terms in the cofactor expansion will alternate in sign. We can make the signs in the sum alternate by taking to an appropriate power. As a result, we define the th cofactor of as
This method for computing determinants is called the cofactor expansion or Laplace expansion of along the 1st row. The cofactor expansion reduces the computation of the determinant of an matrix to computations of determinants of matrices. These smaller matrices can be reduced again using cofactor expansions, so it can be a long and grueling process for large matrices. It turns out that we can actually take this expansion along any row or column of the matrix (a proof of this fact is given in Section 22). For example, the cofactor expansion along the 2nd row is
Which determinant property can be used to calculate the determinant in part (c)? Explain how. (Determinant properties are included below for easy reference.)
Consider the matrix . Let be the matrix which results when times row 1 is added to row 2 of . Evaluate the determinant of by hand to check that it is equal to the determinant of , which verifies one other determinant property (in a specific case).
As with any new idea, like the determinant, we must ask what properties are satisfied. We state the following theorem without proof for the time being. For the interested reader, the proof of many of these properties is given in Section 22 and others in the exercises.
If is upper/lower triangular, then is the product of the entries on the diagonal.
The determinant of a matrix is the product of the eigenvalues, with each eigenvalue repeated as many times as its multiplicity.
Effect of row operations:
Adding a multiple of a row to another does NOT change the determinant of the matrix.
Multiplying a row by a constant multiplies the determinant by the same constant.
Row swapping multiplies the determinant by .
If the row echelon form of is obtained by adding multiples of one row to another, and row swapping, then is equal to multiplied by where is the number of row swappings done during the row reduction.
Note that if we were to find the determinant of a matrix using the cofactor method, we will calculate determinants of 4 matrices of size , each of which will require 3 determinant calculations again. So, we will need a total of 12 calculations of determinants of matrices. That is a lot of calculations. There are other, more efficient, methods for calculating determinants. For example, we can row reduce the matrix, keeping track of the effect that each row operation has on the determinant.
In this case, we can see that the inverse of the matrix will be defined if and only if . So, in the case the determinant of will be given by the value of in Equation (17.2). What remains is for us to see how this is related to determinants of sub-matrices of .
This makes the product of the diagonal elements and minus the product of the off-diagonal elements and . We can visualize this in an array by drawing arrows across the diagonal and off-diagonal, with a plus sign on the diagonal arrow indicting that we add the product of the diagonal elements and a minus sign on the off-diagonal arrow indicating that we subtract the product of the off-diagonal elements as shown in Figure 17.4.
We can do a similar thing for the determinant of a matrix. In this case, we extend the array to a array by adjoining the first two columns onto the matrix. We then add the products along the diagonals going from left to right and subtract the products along the diagonals going from right to left as indicated in Figure 17.5.
With a matrix, we will find the sub-matrices ,,, and . We see that
.
To calculate the th cofactor , we need to calculate the determinants of the . Using the device for calculating the determinant of a matrix we have that
The approach we will take to connecting area (volume) to the determinant will help shed light on properties of the determinant that we will discuss from an algebraic perspective in a later section. First, we mention some basic properties of area (we focus on area for now, but these same properties are valid for volumes as well). volume). As a shorthand, we denote the area of a region by Area.
Area cannot be negative.
If two regions and don't overlap, then the area of the union of the regions is equal to the sum of the areas of the regions. That is, if , then AreaAreaArea.
Area is invariant under translation. That is, if we move a geometric region by the same amount uniformly in a given direction, the area of the original region and the area of the transformed region are the same. A translation of a region is done by just adding a fixed vector to each vector in the region. That is, a translation by a vector is a function such that the image of a region is defined as
.
Since area is translation invariant, AreaArea.
The area of a one-dimensional object like a line segment is .
Since area is translation and rotation invariant, we can translate our parallelogram by to place its basepoint at the origin, then rotate by an angle (as shown at left in Figure 17.8. This transforms the vector to a vector and the vector to a vector as shown at right in Figure 17.8. With this in mind we can always assume that our parallelograms have one vertex at the origin, with along the -axis, and in standard position. Now we can investigate how to calculate the area of a parallelogram.
There are two situations to consider when we want to find the area of a parallelogram determined by vectors and , both shown in Figure 17.9. The parallelogram will be determined by the lengths of these vectors.
The result of Project Activity 17.3 is that the area of is given by , where is the height of the parallelogram determined by dropping a perpendicular from the terminal point of to the line determined by the vector .
If is any scalar, then either stretches or compresses . Use this idea, and the result of Project Activity 17.3, to explain why
AreaAreaArea(17.4)
for any real number . A representative picture of this situation is shown at left in Figure 17.9 for a value of . You will also need to consider what happens when .
Let . Recall that our alternate representation of allows us to write
AreaArea.
This should seem very suggestive. We are essentially applying the process of Gaussian elimination to our parallelogram matrix to reduce it to a diagonal matrix. From there, we can calculate the area. The matrix form should indicate the next step — applying an operation to eliminate the entry in the first row and second column. To do this, we need to consider what happens if and if .
Now we consider the case when . Complete the process as in part (a), using properties (17.4) and (17.5) (compare to Gaussian elimination) to continue to reduce the problem of calculating Area to one of calculating Area. Use this process to conclude that
We can apply the same arguments as above using rotations, translations, shearings, and scalings to show that the properties of area given above work in any dimension. Given vectors ,,, in , we let
If , then is the parallelogram determined by and with basepoint . If , then is the parallelepiped with basepoint determined by ,, and . In higher dimensions the sets are called parallelotopes, and we use the notation Vol for their volume. The -dimensional volumes of these paralleotopes satisfy the following properties:
We now show that Vol is the absolute value of the determinant of . For easier notation, let ,, and . As we argued in the 2-dimensional case, we can assume that all terms that we need to be nonzero are nonzero, and we can do so without verification.
Just as we saw in the 2-dimensional case, we can proceed to use the diagonal entries to eliminate the entries above the diagonal without changing the volume to see that
VolVol.
Complete the process, applying appropriate properties to explain why
VolVol
for some constant . Find the constant and, as a result, find a specific expression for Vol involving a determinant.
Properties (17.6), (17.7), and (17.8) involve the analogs of row operations on matrices, and we will prove algebraically that the determinant exhibits the same properties. In fact, the determinant can be uniquely defined by these properties. So in a sense, the determinant is an area or volume function.
Tucker, Alan. (1993). The Growing Importance of Linear Algebra in Undergraduate Mathematics. The College Mathematics Journal, 1, 3-9.