Skip to main content

Section 17 The Determinant

Subsection Application: Area and Volume

Consider the problem of finding the area of a parallelogram determined by two vectors \(\vu\) and \(\vv\text{,}\) as illustrated at left in Figure 17.1.

Figure 17.1. A parallelogram and a parallelepiped.

We could calculate this area, for example, by breaking up the parallelogram into two triangles and a rectangle and finding the area of each. Now consider the problem of calculating the volume of the three-dimensional analog (called a parallelepiped) determined by three vectors \(\vu\text{,}\) \(\vv\text{,}\) and \(\vw\) as illustrated at right in Figure 17.1.

It is quite a bit more difficult to break this parallelepiped into subregions whose volumes are easy to compute. However, all of these computations can be made quickly by using determinants. The details are later in this section.

Subsection Introduction

We know that a non-zero vector \(\vx\) is an eigenvector of an \(n \times n\) matrix \(A\) if \(A \vx = \lambda \vx\) for some scalar \(\lambda\text{.}\) Note that this equation can be written as \((A-\lambda I_n)\vx=\vzero\text{.}\) Until now, we were given eigenvalues of matrices and have used the eigenvalues to find the eigenvectors. In this section we will learn an algebraic technique to find the eigenvalues ourselves. We will also be able to justify why an \(n\times n\) matrix has at most \(n\) eigenvalues.

A scalar \(\lambda\) is an eigenvalue of \(A\) if \((A - \lambda I_n)\vx=\vzero\) has a non-trivial solution \(\vx\text{,}\) which happens if and only if \(A-\lambda I_n\) is not invertible. In this section we will find a scalar whose value will tell us when a matrix is invertible and when it is not, and use this scalar to find the eigenvalues of a matrix.

Preview Activity 17.1.

In this activity, we will focus on \(2\times 2\) matrices. Let \(A = \left[ \begin{array}{cc} a\amp b \\ c\amp d \end{array} \right]\) be a \(2\times 2\) matrix. To see if \(A\) is invertible, we row reduce \(A\) by replacing row 2 with \(a\cdot\)(row 2) \(- c \cdot\)(row 1):

\begin{equation*} \left[ \begin{array}{cc} a\amp b \\ 0\amp ad-bc \end{array} \right]\text{.} \end{equation*}

So the only way \(A\) can be reduced \(I_2\) is if \(ad - bc \neq 0\text{.}\) We call this quantity \(ad-bc\) the determinant of \(A\text{,}\) and denote the determinant of \(A\) as \(\det(A)\) or \(|A|\text{.}\) When \(\det(A)\neq 0\text{,}\) we know that

\begin{equation*} A^{-1} = \frac{1}{ad-bc} \left[ \begin{array}{rr} d\amp -b \\ -c\amp a \end{array} \right]\text{.} \end{equation*}

We now consider how we can use the determinant to find eigenvalues and other information about the invertibility of a matrix.

(a)

Let \(A = \left[ \begin{array}{cc} 1\amp 2 \\ 2\amp 4 \end{array} \right]\text{.}\) Find \(\det(A)\) by hand. What does this mean about the matrix \(A\text{?}\) Can you confirm this with other methods?

(b)

One of the eigenvalues of \(A=\left[ \begin{array}{cc} 1\amp 3 \\ 2\amp 2 \end{array} \right]\) is \(\lambda=4\text{.}\) Recall that we can rewrite the matrix equation \(A\vx=4\vx\) in the form \((A-4I_2) \vx = \vzero\text{.}\) What must be true about \(A-4I_2\) in order for 4 to be an eigenvalue of \(A\text{?}\) How does this relate to \(\det(A-4I_2)\text{?}\)

(c)

Another eigenvalue of \(A=\left[ \begin{array}{cc} 1\amp 3 \\ 2\amp 2 \end{array} \right]\) is \(\lambda=-1\text{.}\) What must be true about \(A+I_2\) in order for \(-1\) to be an eigenvalue of \(A\text{?}\) How does this relate to \(\det(A+I_2)\text{?}\)

(d)

To find the eigenvalues of the matrix \(A=\left[ \begin{array}{cc} 3\amp 2\\2\amp 6 \end{array} \right]\text{,}\) we rewrite the equation \(A \vx = \lambda \vx\) as \((A - \lambda I_2) \vx = \vzero\text{.}\) The coefficient matrix of this last system has the form \(A-\lambda I_2 = \left[ \begin{array}{cc} 3-\lambda \amp 2 \\ 2\amp 6-\lambda \end{array} \right]\text{.}\) The determinant of this matrix is a quadratic expression in \(\lambda\text{.}\) Since the eigenvalues will occur when the determinant is 0, we need to solve a quadratic equation. Find the resulting eigenvalues. (Note: One of the eigenvalues is 2.)

(e)

Can you explain why a \(2\times 2\) matrix can have at most two eigenvalues?

Subsection The Determinant of a Square Matrix

Around 1900 or so determinants were deemed much more important than they are today. In fact, determinants were used even before matrices. According to Tucker 33  determinants (not matrices) developed out of the study of coefficients of systems of linear equations and were used by Leibniz 150 years before the term matrix was coined by J. J. Sylvester in 1848. Even though determinants are not as important as they once were, the determinant of a matrix is still a useful quantity. We saw in Preview Activity 17.1 that the determinant of a matrix tells us if the matrix is invertible and how it can help us find eigenvalues. In this section, we will see how to find the determinant of any size matrix and how to use this determinant to find the eigenvalues.

The determinant of a \(2 \times 2\) matrix \(A = \left[ \begin{array}{cc} a\amp b \\ c\amp d \end{array} \right]\) is \(\det(A)=ad-bc\text{.}\) The matrix \(A\) is invertible if and only if \(\det(A) \neq 0\text{.}\) We will use a recursive approach to find the determinants of larger size matrices building from the \(2\times 2\) determinants. We present the result in the \(3 \times 3\) case here — a more detailed analysis can be found at the end of this section.

To find the determinant of a \(3 \times 3\) matrix \(A = \left[ \begin{array}{ccc} a_{11}\amp a_{12}\amp a_{13} \\ a_{21}\amp a_{22}\amp a_{23}\\ a_{31}\amp a_{32}\amp a_{33} \end{array} \right]\text{,}\) we will use the determinants of three \(2\times 2\) matrices. More specifically, the determinant of \(A\text{,}\) denoted \(\det(A)\) is the quantity

\begin{equation} a_{11} \det\left(\left[ \begin{array}{cc} a_{22}\amp a_{23} \\ a_{32}\amp a_{33} \end{array} \right] \right)- a_{12} \det \left(\left[ \begin{array}{cc} a_{21}\amp a_{23} \\ a_{31}\amp a_{33} \end{array} \right] \right) + a_{13} \det \left(\left[ \begin{array}{cc} a_{21}\amp a_{22} \\ a_{31}\amp a_{32} \end{array} \right] \right)\text{.}\tag{17.1} \end{equation}

This sum is called a cofactor expansion of the determinant of \(A\text{.}\) The smaller matrices in this expansion are obtained by deleting certain rows and columns of the matrix \(A\text{.}\) In general, when finding the determinant of an \(n\times n\) matrix, we find determinants of \((n-1)\times (n-1)\) matrices, which we can again reduce to smaller matrices to calculate.

We will use the specific matrix

\begin{equation*} A = \left[ \begin{array}{ccc} 1 \amp 2 \amp 0 \\ 1 \amp 4 \amp 3 \\ 2 \amp 2 \amp 1 \end{array} \right] \end{equation*}

as an example in illustrating the cofactor expansion method in general.

  • We first pick a row or column of \(A\text{.}\) We will pick the first row of \(A\) for this example.

  • For each entry in the row (or column) we choose, in this case the first row, we will calculate the determinant of a smaller matrix obtained by removing the row and the column the entry is in. Let \(A_{ij}\) be the smaller matrix found by deleting the \(i\)th row and \(j\)th column of \(A\text{.}\) For entry \(a_{11}\text{,}\) we find the matrix \(A_{11}\) obtained by removing first row and first column:

    \begin{equation*} A_{11} = \left[ \begin{array}{cc} 4 \amp 3 \\ 2 \amp 1 \end{array} \right]\,\text{.} \end{equation*}

    For entry \(a_{12}\text{,}\) we find

    \begin{equation*} A_{12} = \left[ \begin{array}{cc} 1 \amp 3 \\ 2 \amp 1 \end{array} \right]\,\text{.} \end{equation*}

    Finally, for entry \(a_{13}\text{,}\) we find

    \begin{equation*} A_{13} = \left[ \begin{array}{cc} 1 \amp 4 \\ 2 \amp 2 \end{array} \right] \,\text{.} \end{equation*}
  • Notice that in the \(3 \times 3\) determinant formula in (17.1) above, the middle term had a (-) sign. The signs of the terms in the cofactor expansion alternate within each row and each column. More specifically, the sign of a term in the \(i\)th row and \(j\)th column is \((-1)^{i+j}\text{.}\) We then obtain the following pattern of the signs within each row and column:

    \begin{equation*} \left[ \begin{array}{cccc} + \amp - \amp + \amp \cdots \\ - \amp + \amp - \amp \cdots \\ + \amp - \amp + \amp \cdots \\ \vdots \amp \amp \amp \end{array} \right] \end{equation*}

    In particular, the sign factor for \(a_{11}\) is \((-1)^{1+1}=1\text{,}\) for \(a_{12}\) is \((-1)^{1+2}=-1\text{,}\) and for \(a_{13}\) is \((-1)^{1+3}=1\text{.}\)

  • For each entry \(a_{ij}\) in the row (or column) of \(A\) we chose, we multiply the entry \(a_{ij}\) by the determinant of \(A_{ij}\) and the sign \((-1)^{i+j}\text{.}\) In this case, we obtain the following numbers

    \begin{equation*} a_{11} (-1)^{1+1} \det(A_{11}) = 1 \det \left[ \begin{array}{cc} 4 \amp 3 \\ 2 \amp 1 \end{array} \right] = 1(4-6)=-2 \end{equation*}
    \begin{equation*} a_{12} (-1)^{1+2} \det(A_{12}) = -2 \det \left[ \begin{array}{cc} 1 \amp 3 \\ 2 \amp 1 \end{array} \right] = -2(1-6)=10 \end{equation*}
    \begin{equation*} a_{13} (-1)^{1+3} \det(A_{13}) = 0 \end{equation*}

    Note that in the last calculation, since \(a_{13}=0\text{,}\) we did not have to evaluate the rest of the terms.

  • Finally, we find the determinant by adding all these values:

    \begin{align*} \det(A) \amp = a_{11} (-1)^{1+1} \det(A_{11}) + a_{12} (-1)^{1+2} \det(A_{12})\\ \amp \qquad + a_{13} (-1)^{1+3} \det(A_{13})\\ \amp = 8\text{.} \end{align*}

Subsection Cofactors

We will now define the determinant of a general \(n \times n\) matrix \(A\) in terms of a cofactor expansion as we did in the \(3 \times 3\) case. To do so, we need some notation and terminology.

  • We let \(A_{ij}\) be the submatrix of \(A = [a_{ij}]\) found by deleting the \(i\)th row and \(j\)th column of \(A\text{.}\) The determinant of \(A_{ij}\) is called the \(ij\)th minor of \(A\) or the minor corresponding to the entry \(a_{ij}\text{.}\)

  • Notice that in the \(3 \times 3\) case, we used the opposite of the 1,2 minor in the sum. It will be the case that the terms in the cofactor expansion will alternate in sign. We can make the signs in the sum alternate by taking \(-1\) to an appropriate power. As a result, we define the \(ij\)th cofactor \(C_{ij}\) of \(A\) as

    \begin{equation*} C_{ij} = (-1)^{i+j} \det\left(A_{ij}\right)\text{.} \end{equation*}
  • Finally, we define the determinant of \(A\text{.}\)

Definition 17.2.

If \(A=[a_{ij}]\) is an \(n \times n\) matrix, the determinant of \(A\) is the scalar

\begin{equation*} \det(A) = a_{11}C_{11} + a_{12}C_{12} + a_{13}C_{13} + \cdots + a_{1n}C_{1n} \end{equation*}

where \(C_{ij}= (-1)^{i+j} \det(A_{ij})\) is the \(ij\)-cofactor of \(A\) and \(A_{ij}\) is the matrix obtained by removing row \(i\) and column \(j\) of matrix \(A\text{.}\)

This method for computing determinants is called the cofactor expansion or Laplace expansion of \(A\) along the 1st row. The cofactor expansion reduces the computation of the determinant of an \(n \times n\) matrix to \(n\) computations of determinants of \((n-1) \times (n-1)\) matrices. These smaller matrices can be reduced again using cofactor expansions, so it can be a long and grueling process for large matrices. It turns out that we can actually take this expansion along any row or column of the matrix (a proof of this fact is given in Section 22). For example, the cofactor expansion along the 2nd row is

\begin{equation*} \det(A) = a_{21}C_{21} + a_{22}C_{22} + \cdots + a_{2n}C_{2n} \end{equation*}

and along the 3rd column the formula is

\begin{equation*} \det(A) = a_{13}C_{13} + a_{23}C_{23} + \cdots + a_{n3}C_{n3}\text{.} \end{equation*}

Note that when finding a cofactor expansion, choosing a row or column with many zeros makes calculations easier.

Activity 17.2.

(a)

Let \(A = \left[ \begin{array}{rcr} 1\amp 2\amp -1 \\ -2\amp 0\amp 4 \\ 6\amp 3\amp 0 \end{array} \right]\text{.}\) Use the cofactor expansion along the first row to calculate the determinant of \(A\) by hand.

(b)

Calculate \(\det(A)\) by using a cofactor expansion along the second row where \(A = \left[ \begin{array}{ccc} 1 \amp 4 \amp 2 \\ 0 \amp 2 \amp 0 \\ 2 \amp 5 \amp 3 \end{array} \right]\text{.}\)

(c)

Calculate the determinant of \(\left[ \begin{array}{crr} 1\amp -2\amp 3 \\ 0\amp 4\amp -3 \\ 0\amp 0\amp 8 \end{array} \right]\text{.}\)

(d)

Which determinant property can be used to calculate the determinant in part (c)? Explain how. (Determinant properties are included below for easy reference.)

(e)

Consider the matrix \(A=\left[ \begin{array}{ccc} 1 \amp 1 \amp 2 \\ 0 \amp 2 \amp 1 \\ 1 \amp 2 \amp 2 \end{array} \right]\text{.}\) Let \(B\) be the matrix which results when \(c\) times row 1 is added to row 2 of \(A\text{.}\) Evaluate the determinant of \(B\) by hand to check that it is equal to the determinant of \(A\text{,}\) which verifies one other determinant property (in a specific case).

As with any new idea, like the determinant, we must ask what properties are satisfied. We state the following theorem without proof for the time being. For the interested reader, the proof of many of these properties is given in Section 22 and others in the exercises.

Note that if we were to find the determinant of a \(4\times 4\) matrix using the cofactor method, we will calculate determinants of 4 matrices of size \(3\times 3\text{,}\) each of which will require 3 determinant calculations again. So, we will need a total of 12 calculations of determinants of \(2\times 2\) matrices. That is a lot of calculations. There are other, more efficient, methods for calculating determinants. For example, we can row reduce the matrix, keeping track of the effect that each row operation has on the determinant.

Subsection The Determinant of a \(3 \times 3\) Matrix

Earlier we defined the determinant of a \(3 \times 3\) matrix. In this section we endeavor to understand the motivation behind that definition.

We will repeat the process we went through in the \(2 \times 2\) case to see how to define the determinant of a \(3 \times 3\) matrix. Let

\begin{equation*} A = \left[ \begin{array}{ccc} a_{11}\amp a_{12}\amp a_{13} \\ a_{21}\amp a_{22}\amp a_{23}\\ a_{31}\amp a_{32}\amp a_{33} \end{array} \right]\text{.} \end{equation*}

To find the inverse of \(A\) we augment \(A\) by the \(3 \times 3\) identity matrix

\begin{equation*} [A \ | \ I_3] = \left[ \begin{array}{cccccc} a_{11}\amp a_{12}\amp a_{13}\amp 1\amp 0\amp 0 \\ a_{21}\amp a_{22}\amp a_{23}\amp 0\amp 1\amp 0 \\ a_{31}\amp a_{32}\amp a_{33}\amp 0\amp 0\amp 1 \end{array} \right] \end{equation*}

and row reduce the matrix (using appropriate technology) to obtain

\begin{equation*} \left[ \begin{array}{rrrrrr} 1\amp 0\amp 0\amp \ds \frac{a_{33}a_{22}-a_{32}a_{23}}{d}\amp \ds -\frac{a_{33}a_{12}-a_{32}a_{13}}{d}\amp \ds \frac{-a_{13}a_{22}+a_{12}a_{23}}{d} \\ 0\amp 1\amp 0\amp \ds -\frac{a_{33}a_{21}-a_{31}a_{23}}{d}\amp \ds \frac{a_{33}a_{11}-a_{31}a_{13}}{d}\amp \ds -\frac{a_{23}a_{11}-a_{21}a_{13}}{d} \\ 0\amp 0\amp 1\amp \ds \frac{-a_{31}a_{22}+a_{32}a_{21}}{d}\amp \ds -\frac{a_{32}a_{11}-a_{31}a_{12}}{d}\amp \ds \frac{a_{22}a_{11}-a_{21}a_{12}}{d} \end{array} \right]\text{,} \end{equation*}

where

\begin{equation} \begin{aligned}d = a_{33}a_{11}a_{22}\amp -a_{33}a_{21}a_{12}-a_{31}a_{13}a_{22} \\ \amp -a_{32}a_{11}a_{23}+a_{32}a_{21}a_{13}+a_{31}a_{12}a_{23}. \end{aligned}\tag{17.2} \end{equation}

In this case, we can see that the inverse of the \(3 \times 3\) matrix \(A\) will be defined if and only if \(d \neq 0\text{.}\) So, in the \(3 \times 3\) case the determinant of \(A\) will be given by the value of \(d\) in Equation (17.2). What remains is for us to see how this is related to determinants of \(2 \times 2\) sub-matrices of \(A\text{.}\)

To start, we collect all terms involving \(a_{11}\) in \(d\text{.}\) A little algebra shows that

\begin{equation*} \det(A) = a_{11} \left( a_{33} a_{22} - a_{32} a_{23} \right) - a_{33} a_{21} a_{12} - a_{31} a_{13} a_{22} + a_{32}a_{21}a_{13} + a_{31} a_{12} a_{23}\text{.} \end{equation*}

Now let's collect the remaining terms involving \(a_{12}\text{:}\)

\begin{equation*} \det(A) = a_{11} \left( a_{33} a_{22} - a_{32} a_{23} \right) - a_{12} \left(a_{33} a_{21} - a_{31} a_{23} \right) - a_{31} a_{13} a_{22} + a_{32}a_{21}a_{13}\text{.} \end{equation*}

Finally, we collect the terms involving \(a_{13}\text{:}\)

\begin{equation*} \det(A) = a_{11} \left( a_{33} a_{22} - a_{32} a_{23} \right) - a_{12} \left(a_{33} a_{21} - a_{31} a_{23} \right) + a_{13} \left(a_{32} a_{21} - a_{31} a_{22} \right)\text{.} \end{equation*}

Now we can connect the determinant of \(A\) to determinants of \(2 \times 2\) sub-matrices of \(A\text{.}\)

  • Notice that

    \begin{equation*} a_{33} a_{22} - a_{32} a_{23} \end{equation*}

    is the determinant of the \(2 \times 2\) matrix \(\left[ \begin{array}{cc} a_{22}\amp a_{23} \\ a_{32}\amp a_{33} \end{array} \right]\) obtained from \(A\) by deleting the first row and first column.

  • Similarly, the expression

    \begin{equation*} a_{33} a_{21} - a_{31} a_{23} \end{equation*}

    is the determinant of the \(2 \times 2\) matrix \(\left[ \begin{array}{cc} a_{21}\amp a_{23} \\ a_{31}\amp a_{33} \end{array} \right]\) obtained from \(A\) by deleting the first row and second column.

  • Finally, the expression

    \begin{equation*} a_{32} a_{21} - a_{31} a_{22} \end{equation*}

    is the determinant of the \(2 \times 2\) matrix \(\left[ \begin{array}{cc} a_{21}\amp a_{22} \\ a_{31}\amp a_{32} \end{array} \right]\) obtained from \(A\) by deleting the first row and third column.

Putting this all together gives us formula (17.1) for the determinant of a \(3 \times 3\) matrix as we defined earlier.

Subsection Two Devices for Remembering Determinants

There are useful ways to remember how to calculate the formulas for determinants of \(2 \times 2\) and \(3 \times 3\) matrices. In the \(2 \times 2\) case of \(A = \left[ \begin{array}{cc} a_{11}\amp a_{12} \\ a_{21}\amp a_{22} \end{array} \right]\text{,}\) we saw that

\begin{equation*} |A| = a_{11}a_{22} - a_{21}a_{22}\text{.} \end{equation*}

This makes \(|A|\) the product of the diagonal elements \(a_{11}\) and \(a_{22}\) minus the product of the off-diagonal elements \(a_{12}\) and \(a_{21}\text{.}\) We can visualize this in an array by drawing arrows across the diagonal and off-diagonal, with a plus sign on the diagonal arrow indicting that we add the product of the diagonal elements and a minus sign on the off-diagonal arrow indicating that we subtract the product of the off-diagonal elements as shown in Figure 17.4.

Figure 17.4. A diagram to remember the \(2 \times 2\) determinant.

We can do a similar thing for the determinant of a \(3 \times 3\) matrix. In this case, we extend the \(3 \times 3\) array to a \(3 \times 5\) array by adjoining the first two columns onto the matrix. We then add the products along the diagonals going from left to right and subtract the products along the diagonals going from right to left as indicated in Figure 17.5.

Figure 17.5. A diagram to remember the \(3 \times 3\) determinant.

Subsection Examples

What follows are worked examples that use the concepts from this section.

Example 17.6.

For each of the following

  • Identify the sub-matrices \(A_{1,j}\)

  • Determine the cofactors \(C_{1,j}\text{.}\)

  • Use the cofactor expansion to calculate the determinant.

(a)

\(A = \left[ \begin{array}{ccr} 3\amp 6\amp 2 \\ 0\amp 4\amp -1 \\ 5\amp 0\amp 1 \end{array} \right]\)

Solution.

With a \(3 \times 3\) matrix, we will find the sub-matrices \(A_{11}\text{,}\) \(A_{12}\text{,}\) and \(A_{13}\text{.}\) Recall that \(A_{ij}\) is the sub-matrix of \(A\) obtained by deleting the \(i\)th row and \(j\)th column of \(A\text{.}\) Thus,

\begin{equation*} A_{11} = \left[ \begin{array}{cr} 4\amp -1 \\ 0\amp 1 \end{array} \right] \ A_{12} = \left[ \begin{array}{cr} 0\amp -1 \\ 5\amp 1 \end{array} \right] \ \text{ and } A_{13} = \left[ \begin{array}{cc} 0\amp 4 \\ 5\amp 0 \end{array} \right]\text{.} \end{equation*}

The \(ij\)th cofactor is \(C_{ij} = (-1)^{i+j}\det(A_{ij})\text{,}\) so

\begin{align*} C_{11} \amp = (-1)^2 \left[ \begin{array}{cr} 4\amp -1\\ 0\amp 1 \end{array} \right] = 4\\ C_{12} \amp = (-1)^3 \left[ \begin{array}{cr} 0\amp -1\\ 5\amp 1 \end{array} \right] = -5\\ C_{13} \amp = (-1)^4 \left[ \begin{array}{cc} 0\amp 4\\ 5\amp 0 \end{array} \right] = -20\text{.} \end{align*}

Then

\begin{equation*} \det(A) = a_{11}C_{11} + a_{12}C_{12} + a_{13}C_{13} = (3)(4) +(6)(-5) +(2)(-20) = -58\text{.} \end{equation*}
(b)

\(A = \left[ \begin{array}{rrcr} 3\amp 0\amp 1\amp 1 \\ 2\amp 1\amp 2\amp 1 \\ 1\amp -2\amp 2\amp -1 \\ -3\amp 2\amp 3\amp 1 \end{array} \right]\)

Solution.

With a \(4 \times 4\) matrix, we will find the sub-matrices \(A_{11}\text{,}\) \(A_{12}\text{,}\) \(A_{13}\text{,}\) and \(A_{14}\text{.}\) We see that

\begin{align*} A_{11} \amp = \left[ \begin{array}{rcr} 1\amp 2\amp 1\\ -2\amp 2\amp -1\\ 2\amp 3\amp 1 \end{array} \right]\\ A_{12} \amp = \left[ \begin{array}{rcr} 2\amp 2\amp 1\\ 1\amp 2\amp -1\\ -3\amp 3\amp 1 \end{array} \right]\\ A_{13} \amp = \left[ \begin{array}{rrr} 2\amp 1\amp 1\\ 1\amp -2\amp -1\\ -3\amp 2\amp 1 \end{array} \right]\\ A_{14} \amp = \left[ \begin{array}{rrc} 2\amp 1\amp 2\\ 1\amp -2\amp 2\\ -3\amp 2\amp 3 \end{array} \right]\text{.} \end{align*}

To calculate the \(ij\)th cofactor \(C_{ij} = (-1)^{i+j}\det(A_{ij})\text{,}\) we need to calculate the determinants of the \(A_{1j}\text{.}\) Using the device for calculating the determinant of a \(3 \times 3\) matrix we have that

\begin{align*} \det(A_{11}) \amp =\det\left( \left[ \begin{array}{rrr} 1\amp 2\amp 1\\ -2\amp 2\amp -1\\ 2\amp 3\amp 1 \end{array} \right] \right)\\ \amp = (1)(2)(1)+(2)(-1)(2)+(1)(-2)(3)\\ \amp \qquad - (1)(2)(2)-(1)(-1)(3)-(2)(-2)(1)\\ \amp = -5\text{,} \end{align*}
\begin{align*} \det(A_{12}) \amp = \det\left(\left[ \begin{array}{rcr} 2\amp 2\amp 1\\ 1\amp 2\amp -1\\ -3\amp 3\amp 1 \end{array} \right] \right)\\ \amp = (2)(2)(1)+(2)(-1)(-3)+(1)(1)(3)\\ \amp \qquad - (1)(2)(-3)-(2)(-1)(3)-(2)(1)(1)\\ \amp = 23\text{,} \end{align*}
\begin{align*} \det(A_{13}) \amp = \det\left(\left[ \begin{array}{rrr} 2\amp 1\amp 1\\ 1\amp -2\amp -1\\ -3\amp 2\amp 1 \end{array} \right] \right)\\ \amp = (2)(-2)(1)+(1)(-1)(-3)+(1)(1)(2)\\ \amp \qquad - (1)(-2)(-3)-(2)(-1)(2)-(1)(1)(1)\\ \amp = -2\text{,} \end{align*}

and

\begin{align*} \det(A_{14}) \amp = \det\left(\left[ \begin{array}{rrc} 2\amp 1\amp 2\\ 1\amp -2\amp 2\\ -3\amp 2\amp 3 \end{array} \right] \right)\\ \amp = (2)(-2)(3)+(1)(2)(-3)+(2)(1)(2)\\ \amp \qquad - (2)(-2)(-3)-(2)(2)(2)-(1)(1)(3)\\ \amp = -37\text{.} \end{align*}

Then

\begin{align*} C_{11} \amp = (-1)^2 \det(A_{11}) = -5\\ C_{12} \amp = (-1)^3 \det(A_{12})= -23\\ C_{13} \amp = (-1)^4 \det(A_{13}) = -2\\ C_{14} \amp = (-1)^5 \det(A_{13}) = 37 \end{align*}

and so

\begin{align*} \det(B) \amp = b_{11}C_{11} + b_{12}C_{12} + b_{13}C_{13} + b_{14}C_{14}\\ \amp = (3)(-5) +(0)(-23) +(1)(-2)+ (1)(37)\\ \amp = 20\text{.} \end{align*}

Example 17.7.

Show that for any \(2 \times 2\) matrices \(A\) and \(B\text{,}\)

\begin{equation*} \det(AB) = \det(A) \det(B)\text{.} \end{equation*}

Solution.

Let \(A = \left[ \begin{array}{cc} a_{11}\amp a_{12} \\ a_{21}\amp a_{22} \end{array} \right]\) and \(B = \left[ \begin{array}{cc} b_{11}\amp b_{12} \\ b_{21}\amp b_{22} \end{array} \right]\text{.}\) Then

\begin{equation*} AB = \left[ \begin{array}{cc} a_{11}b_{11}+a_{12}b_{21}\amp a_{11}b_{12}+a_{12}b_{22} \\ a_{21}b_{11}+a_{22}b_{21}\amp a_{21}b_{12}+a_{22}b_{22} \end{array} \right]\text{.} \end{equation*}

So

\begin{align*} \det(AB) \amp = (a_{11}b_{11}+a_{12}b_{21})(a_{21}b_{12}+a_{22}b_{22})\\ \amp \qquad - (a_{11}b_{12}+a_{12}b_{22})(a_{21}b_{11}+a_{22}b_{21})\\ \amp = (a_{11}b_{11}a_{21}b_{12} + a_{11}b_{11}a_{22}b_{22} + a_{12}b_{21}a_{21}b_{12} + a_{12}b_{21}a_{22}b_{22})\\ \amp \qquad - (a_{11}b_{12}a_{21}b_{11} + a_{11}b_{12}a_{22}b_{21} + a_{12}b_{22}a_{21}b_{11} + a_{12}b_{22}a_{22}b_{21})\\ \amp = a_{11}b_{11}a_{22}b_{22} + a_{12}b_{21}a_{21}b_{12} - a_{11}b_{12}a_{22}b_{21} - a_{12}b_{22}a_{21}b_{11}\text{.} \end{align*}

Also,

\begin{align*} \det(A) \det(B) \amp = (a_{11}a_{22}-a_{12}a_{21})(b_{11}b_{22}-b_{12}b_{21})\\ \amp = a_{11}a_{22}b_{11}b_{22} - a_{11}a_{22}b_{12}b_{21} - a_{12}a_{21}b_{11}b_{22} + a_{12}a_{21}b_{12}b_{21}\text{.} \end{align*}

We conclude that \(\det(AB) = \det(A) \det(B)\) if \(A\) and \(B\) are \(2 \times 2\) matrices.

Subsection Summary

  • The determinant of an \(n \times n\) matrix \(A = [a_{ij}]\) is found by taking the cofactor expansion of \(A\) along the first row. That is

    \begin{equation*} \det(A) = a_{11}C_{11} + a_{12}C_{12} + a_{13}C_{13} + \cdots + a_{1n}C_{1n}\text{,} \end{equation*}

    where

    • \(A_{ij}\) is the sub-matrix of \(A\) found by deleting the \(i\)th row and \(j\)th column of \(A\text{.}\)

    • \(C_{ij} = (-1)^{i+j} \det\left(A_{ij}\right)\) is the \(ij\)th cofactor of \(A\text{.}\)

  • The matrix \(A\) is invertible if and only if \(\det(A) \neq 0\text{.}\)

Exercises Exercises

1.

Use the cofactor expansion to explain why multiplying each of the entries of a \(3\times 3\) matrix \(A\) by 2 multiplies the determinant of \(A\) by 8.

2.

Use the determinant criterion to determine for which \(c\) the matrix \(A=\left[ \begin{array}{crc} 1\amp 1\amp 2\\ 1\amp 0\amp c\\ 2\amp -1\amp 2 \end{array} \right]\) is invertible.

3.

Let \(A\) be a square matrix.

(a)

Explain why \(\det(A^2) = [\det(A)]^2\)

(b)

Expand on the argument from (a) to explain why \(\det(A^k) = [\det(A)]^k\) for any positive integer \(k\text{.}\)

(c)

Suppose that \(A\) is an invertible matrix and \(k\) is a positive integer. Must \(A^k\) be an invertible matrix? Why or why not?

4.

Let \(A\) be an invertible matrix. Explain why \(\det(A^{-1})= \dfrac{1}{\det(A)}\) using determinant properties.

5.

Simplify the following determinant expression using determinant properties:

\begin{equation*} \det(PA^4P^{-1}A^T(A^{-1})^3) \end{equation*}

6.

Find the eigenvalues of the following matrices. Find a basis for and the dimension of each eigenspace.

(a)

\(A=\left[ \begin{array}{ccc} 1\amp 1\amp 1\\1\amp 1\amp 1\\1\amp 1\amp 1 \end{array} \right]\)

(b)

\(A=\left[ \begin{array}{ccc} 2\amp 0\amp 3\\0\amp 1\amp 0\\0\amp 1\amp 2 \end{array} \right]\)

7.

Label each of the following statements as True or False. Provide justification for your response.

(a) True/False.

For any two \(n\times n\) matrices \(A\) and \(B\text{,}\) \(\det (A+B) = \det A + \det B\text{.}\)

(b) True/False.

For any square matrix \(A\text{,}\) \(\det(-A)= -\det(A)\text{.}\)

(c) True/False.

For any square matrix \(A\text{,}\) \(\det(-A)= \det(A)\text{.}\)

(d) True/False.

The determinant of a square matrix with all non-zero entries is non-zero.

(e) True/False.

If the determinant of \(A\) is non-zero, then so is the determinant of \(A^2\text{.}\)

(f) True/False.

If the determinant of a matrix \(A\) is 0, then one of the rows of \(A\) is a linear combination of the other rows.

(g) True/False.

For any square matrix \(A\text{,}\) \(\det(A^2)>\det(A)\text{.}\)

(h) True/False.

If \(A\) and \(B\) are \(n \times n\) matrices and \(AB\) is invertible, then \(A\) and \(B\) are invertible.

(i) True/False.

If \(A^2\) is the zero matrix, then the only eigenvalue of \(A\) is 0.

(j) True/False.

If 0 is an eigenvalue of \(A\text{,}\) then 0 is an eigenvalue of \(AB\) for any \(B\) of the same size as \(A\text{.}\)

(k) True/False.

Suppose \(A\) is a \(3 \times 3\) matrix. Then any three eigenvectors of \(A\) will form a basis of \(\R^3\text{.}\)

Subsection Project: Area and Volume Using Determinants

The approach we will take to connecting area (volume) to the determinant will help shed light on properties of the determinant that we will discuss from an algebraic perspective in a later section. First, we mention some basic properties of area (we focus on area for now, but these same properties are valid for volumes as well). volume). As a shorthand, we denote the area of a region \(R\) by \(\Area(R)\text{.}\)

  • Area cannot be negative.

  • If two regions \(R_1\) and \(R_2\) don't overlap, then the area of the union of the regions is equal to the sum of the areas of the regions. That is, if \(R_1 \cap R_2 = \emptyset\text{,}\) then \(\Area(R_1 \cup R_2) = \Area(R_1) + \Area(R_2)\text{.}\)

  • Area is invariant under translation. That is, if we move a geometric region by the same amount uniformly in a given direction, the area of the original region and the area of the transformed region are the same. A translation of a region is done by just adding a fixed vector to each vector in the region. That is, a translation by a vector \(\vv\) is a function \(T_{\vv}\) such that the image \(T_{\vv}(R)\) of a region \(R\) is defined as

    \begin{equation*} T_{\vv}(R) = \{\vr+\vv : \vr \in R\}\text{.} \end{equation*}

    Since area is translation invariant, \(\Area(T_{\vv}(R)) = \Area(R)\text{.}\)

  • The area of a one-dimensional object like a line segment is \(0\text{.}\)

Now we turn our attention to areas of parallelograms. Let \(\vu\) and \(\vv\) be vectors in \(\R^2\text{.}\) The parallelogram \(P(\vu,\vv)\) defined by \(\vu\) and \(\vv\) with point \(Q\) as basepoint is the set

\begin{equation*} P(\vu, \vv) = \{\overrightarrow{OQ}+r \vu + s \vv : 0 \leq r, s \leq 1\}\text{.} \end{equation*}

An illustration of such a parallelogram is shown at left in Figure 17.8.

Figure 17.8. A parallelogram and a translated, rotated parallelogram.

If \(\vu = [u_1 \ u_2]^{\tr}\) and \(\vv = [v_1 \ v_2]^{\tr}\text{,}\) then we will also represent \(P(\vu,\vv)\) as \(P\left( \left[ \begin{array}{cc}u_1\amp u_2 \\ v_1\amp v_2 \end{array} \right] \right)\text{.}\)

Since area is translation and rotation invariant, we can translate our parallelogram by \(-\overrightarrow{OQ}\) to place its basepoint at the origin, then rotate by an angle \(\theta\) (as shown at left in Figure 17.8. This transforms the vector \(\vv\) to a vector \(\vv'\) and the vector \(\vu\) to a vector \(\vu'\) as shown at right in Figure 17.8. With this in mind we can always assume that our parallelograms have one vertex at the origin, with \(\vu\) along the \(x\)-axis, and \(\vv\) in standard position. Now we can investigate how to calculate the area of a parallelogram.

Project Activity 17.3.

There are two situations to consider when we want to find the area of a parallelogram determined by vectors \(\vu\) and \(\vv\text{,}\) both shown in Figure 17.9. The parallelogram will be determined by the lengths of these vectors.

Figure 17.9. Parallelograms formed by \(\vu\) and \(\vv\)
(a)

In the situation depicted at left in Figure 17.9, use geometry to explain why \(\Area(P(\vu,\vv)) = h |\vu|\text{.}\)

Hint.

What can we say about the triangles \(ODB\) and \(EAC\text{?}\)

(b)

In the situation depicted at right in Figure 17.9, use geometry to again explain why \(\Area(P(\vu,\vv)) = h |\vu|\text{.}\) (Hint: What can we say about \(\Area(AEC)\) and \(\Area(ODB)\text{?}\))

The result of Project Activity 17.3 is that the area of \(P(\vu,\vv)\) is given by \(h |\vu|\text{,}\) where \(h\) is the height of the parallelogram determined by dropping a perpendicular from the terminal point of \(\vv\) to the line determined by the vector \(\vu\text{.}\)

Now we turn to the question of how the determinant is related to area of a parallelogram. Our approach will use some properties of the area of \(P(\vu, \vv)\text{.}\)

Project Activity 17.4.

Let \(\vu\) and \(\vv\) be vectors that determine a parallelogram in \(\R^2\text{.}\)

Figure 17.10. Parallelograms formed by \(k\vu\) and \(\vv\) and by \(\vu\) and \(\vv+k\vu\text{.}\)
(a)

Explain why

\begin{equation} \Area(P(\vu,\vv)) = \Area(P(\vv,\vu))\tag{17.3} \end{equation}
(b)

If \(k\) is any scalar, then \(k\vu\) either stretches or compresses \(\vu\text{.}\) Use this idea, and the result of Project Activity 17.3, to explain why

\begin{equation} \Area(P(k\vu,\vv)) = \Area(P(\vu,k\vv)) = |k| \Area(P(\vu,\vv))\tag{17.4} \end{equation}

for any real number \(k\text{.}\) A representative picture of this situation is shown at left in Figure 17.9 for a value of \(k > 1\text{.}\) You will also need to consider what happens when \(k \lt 0\text{.}\)

(c)

Finally, use the result of Project Activity 17.3 to explain why

\begin{equation} \Area(P(\vu+k\vv,\vv)) = \Area(P(\vu,\vv+k\vu)) = \Area(P(\vu,\vv))\tag{17.5} \end{equation}

for any real number \(k\text{.}\) A representative picture is shown at right in Figure 17.10.

Properties (17.4) and (17.5) will allow us to calculate the area of the parallelogram determined by vectors \(\vu\) and \(\vv\text{.}\)

Project Activity 17.5.

Let \(\vu = [u_1 \ u_2]^{\tr}\) and \(\vv = [v_1 \ v_2]^{\tr}\text{.}\) We will now demonstrate that

\begin{equation*} \Area(P(\vu,\vv)) = \left|\det\left(\left| \begin{array}{cc} u_1\amp u_2\\v_1\amp v_2 \end{array} \right] \right)\right|\text{.} \end{equation*}

Before we begin, note that if both \(u_1\) and \(v_1\) are \(0\text{,}\) then \(\vu\) and \(\vv\) are parallel. This makes \(P(\vu, \vv)\) a line segment and so \(\Area(P(\vu,\vv)) = 0\text{.}\) But if \(u_1 = v_1 = 0\text{,}\) it is also the case that

\begin{equation*} \det\left(\left| \begin{array}{cc} u_1\amp u_2\\v_1\amp v_2 \end{array} \right] \right) = u_1v_2-u_2v_1 = 0 \end{equation*}

as well. So we can assume that at least one of \(u_1\text{,}\) \(v_1\) is not \(0\text{.}\) Since \(P(\vu, \vv) = P(\vv, \vu)\text{,}\) we can assume without loss of generality that \(u_1 \neq 0\text{.}\)

(a)

Explain using properties (17.4) and (17.5) as appropriate why

\begin{equation*} \Area(P(\vu,\vv)) =\Area\left(P\left(\vu, \left[0 \ v_2-\frac{v_1}{u_1}u_2\right] \right) \right)\text{.} \end{equation*}
(b)

Let \(\vv_1 = \left[0 \ v_2-\frac{v_1}{u_1}u_2\right]^{\tr}\text{.}\) Recall that our alternate representation of \(P(\vu,\vv))\) allows us to write

\begin{equation*} \Area(P(\vu, \vv_1)) = \Area\left( P\left( \left[ \begin{array}{cc} u_1\amp u_2 \\ 0\amp v_2-\frac{v_1}{u_1}u_2 \end{array} \right] \right) \right)\text{.} \end{equation*}

This should seem very suggestive. We are essentially applying the process of Gaussian elimination to our parallelogram matrix to reduce it to a diagonal matrix. From there, we can calculate the area. The matrix form should indicate the next step — applying an operation to eliminate the entry in the first row and second column. To do this, we need to consider what happens if \(v_2-\frac{v_1}{u_1}u_2 = 0\) and if \(v_2-\frac{v_1}{u_1}u_2 \neq 0\text{.}\)

(i)

Assume that \(v_2-\frac{v_1}{u_1}u_2 = 0\text{.}\) Explain why \(\Area(P(\vu,\vv)) = 0\text{.}\) Then explain why \(\Area(P(\vu,\vv)) = 0 = \det\left(\left[ \begin{array}{cc} u_1\amp u_2\\v_1\amp v_2 \end{array} \right]\right)\text{.}\)

(ii)

Now we consider the case when \(v_2-\frac{v_1}{u_1}u_2 \neq 0\text{.}\) Complete the process as in part (a), using properties (17.4) and (17.5) (compare to Gaussian elimination) to continue to reduce the problem of calculating \(\Area(P(\vu,\vv))\) to one of calculating \(\Area(P(\ve_1, \ve_2))\text{.}\) Use this process to conclude that

\begin{equation*} \Area(P(\vu,\vv)) = \left| \det\left(\left[ \begin{array}{cc} u_1\amp u_2\\v_1\amp v_2 \end{array} \right]\right)\right|\text{.} \end{equation*}

We can apply the same arguments as above using rotations, translations, shearings, and scalings to show that the properties of area given above work in any dimension. Given vectors \(\vu_1\text{,}\) \(\vu_2\text{,}\) \(\ldots\text{,}\) \(\vu_n\) in \(\R^n\text{,}\) we let

\begin{equation*} P(\vu_1, \vu_2, \ldots, \vu_n) = \{\overrightarrow{OQ}+x_1 \vu_1 + x_2 \vu_2 + \cdots + x_n \vu_n : 0 \leq x_i \leq 1 \text{ for each } i\}\text{.} \end{equation*}

If \(n = 2\text{,}\) then \(P(\vu_1,\vu_2)\) is the parallelogram determined by \(\vu_1\) and \(\vu_2\) with basepoint \(Q\text{.}\) If \(n = 3\text{,}\) then \(P(\vu_1, \vu_2, \vu_3)\) is the parallelepiped with basepoint \(Q\) determined by \(\vu_1\text{,}\) \(\vu_2\text{,}\) and \(\vu_3\text{.}\) In higher dimensions the sets \(P(\vu_1, \vu_2, \ldots,\vu_n)\) are called parallelotopes, and we use the notation \(\Vol(P(\vu_1, \vu_2, \ldots,\vu_n))\) for their volume. The \(n\)-dimensional volumes of these paralleotopes satisfy the following properties:

\begin{align} \Vol\amp (P(\vu_1, \vu_2, \ldots, \vu_{i-1}, \vu_i, \vu_{i+1}, \ldots, \vu_{j-1}, \vu_j, \vu_{j+1}, \ldots,\vu_n))\notag\\ \amp = \Vol(P(\vu_1, \vu_2, \ldots, \vu_{i-1}, \vu_j, \vu_{i+1}, \ldots, \vu_{j-1}, \vu_i, \vu_{j+1}, \ldots,\vu_n))\tag{17.6} \end{align}

for any \(i\) and \(j\text{.}\)

\begin{equation} \Vol(P(\vu_1,\vu_2, \ldots, \vu_{i-1}, k\vu_i, \vu_{i+1}, \ldots, \vu_n)) = |k| \Vol(P(\vu_1,\vu_2, \ldots, \vu_n))\tag{17.7} \end{equation}

for any real number \(k\) and any \(i\text{.}\)

\begin{equation} \Vol(P(\vu_1,\vu_2, \ldots, \vu_{i-1}, \vu_{i}+k\vu_j, \vu_{i+1}, \ldots, \vu_n)) = \Vol(P(\vu_1,\vu_2, \ldots, \vu_n))\tag{17.8} \end{equation}

for any real number \(k\) and any distinct \(i\) and \(j\text{.}\)

Project Activity 17.6.

We now show that \(\Vol(P(\vu_1, \vu_2, \vu_3))\) is the absolute value of the determinant of \(\left[ \begin{array}{c} \vu_1 \\ \vu_2 \\ \vu_3 \end{array} \right]\text{.}\) For easier notation, let \(\vu = [u_1 \ u_2 \ u_3]^{\tr}\text{,}\) \(\vv = [v_1 \ v_2 \ v_3]^{\tr}\text{,}\) and \(\vw = [w_1 \ w_2 \ w_3]^{\tr}\text{.}\) As we argued in the 2-dimensional case, we can assume that all terms that we need to be nonzero are nonzero, and we can do so without verification.

(a)

Explain how property (17.7) shows that \(\Vol(P(\vu, \vv, \vw))\) is equal to

\begin{equation*} \Vol\left(P\left( \left[ \begin{array}{ccc} u_{1}\amp u_{2}\amp u_{3} \\ 0\amp \frac{1}{u_1}(v_{2}u_1-v_{1}u_{2})\amp \frac{1}{u_1}(v_{3}u_1-v_{1}u_{3})\\ 0\amp \frac{1}{u_1}(w_{2}u_1-w_{1}u_{2})\amp \frac{1}{u_1}(w_{3}u_1-w_{1}u_{3}) \end{array} \right] \right)\right)\text{.} \end{equation*}
Hint.

Think about how these properties are related to row operations.

(b)

Now let

\begin{equation*} \vv_1 = \left[ 0 \ \frac{1}{u_1}(v_{2}u_1-v_{1}u_{2}) \ \frac{1}{u_1}(v_{3}u_1-v_{1}u_{3})\right]^{\tr} \end{equation*}

and

\begin{equation*} \vw_1 = \left[ 0 \ \frac{1}{u_1}(w_{2}u_1-w_{1}u_{2}) \ \frac{1}{u_1}(w_{3}u_1-w_{1}u_{3})\right]^{\tr}\text{.} \end{equation*}

Explain how property (17.7) shows that \(\Vol(P(\vu, \vv, \vw))\) is equal to

\begin{equation*} \Vol\left(P\left( \left[ \begin{array}{ccc} u_{1}\amp u_{2}\amp u_{3} \\ 0\amp \frac{1}{u_1}(v_{2}u_1-v_{1}u_{2})\amp \frac{1}{u_1}(v_{3}u_1-v_{1}u_{3})\\ 0\amp 0\amp d \end{array} \right] \right)\right)\text{,} \end{equation*}

where

\begin{equation*} d = \frac{1}{u_1v_2-u_2v_1}(u_1(v_2w_3-v_3w_2)-u_2(v_1w_3-v_{3}w_1)+u_3(v_1w_2-v_2w_1))\text{.} \end{equation*}
(c)

Just as we saw in the 2-dimensional case, we can proceed to use the diagonal entries to eliminate the entries above the diagonal without changing the volume to see that

\begin{equation*} \Vol(P(\vu, \vv, \vw)) = \Vol\left(P\left( \left[ \begin{array}{ccc} u_{1}\amp 0\amp 0 \\ 0\amp \frac{1}{u_1}(v_{2}u_1-v_{1}u_{2})\amp 0\\ 0\amp 0\amp d \end{array} \right] \right)\right)\text{.} \end{equation*}

Complete the process, applying appropriate properties to explain why

\begin{equation*} \Vol(P(\vu, \vv, \vw)) = x \Vol(P(\ve_1, \ve_2, \ve_3)) \end{equation*}

for some constant \(x\text{.}\) Find the constant and, as a result, find a specific expression for \(\Vol(P(\vu, \vv, \vw))\) involving a determinant.

Properties (17.6), (17.7), and (17.8) involve the analogs of row operations on matrices, and we will prove algebraically that the determinant exhibits the same properties. In fact, the determinant can be uniquely defined by these properties. So in a sense, the determinant is an area or volume function.

Tucker, Alan. (1993). The Growing Importance of Linear Algebra in Undergraduate Mathematics. The College Mathematics Journal, 1, 3-9.