
Section 37 Linear Transformations

Subsection Application: Fractals

Sierpinski triangles and Koch curves have become common phrases in many mathematics departments across the country. These objects are examples of what are called fractals, beautiful geometric constructions that exhibit self-similarity. Fractals are applied in a variety of ways: they help make smaller antennas (e.g., for cell phones) and are used in fiber-optic cables, among other things. In addition, fractals can be used to model realistic objects, such as the Black Spleenwort fern depicted as a fractal image in Figure 37.1. As we will see later in this section, one way to construct a fractal is with an Iterated Function System (IFS).

Figure 37.1. An approximation of the Black Spleenwort fern.

Subsection Introduction

We have encountered functions throughout our study of mathematics — we explore graphs of functions in algebra and differentiate and integrate functions in calculus. In linear algebra we have investigated special types of functions, e.g., matrix and coordinate transformations, that preserve the vector space structure. Any function that has the same properties as matrix and coordinate transformations is a linear transformation. Linear transformations are important in linear algebra in that we can study similarities and connections between vector spaces by examining transformations between them. Linear transformations model or approximately model certain real-life processes (like discrete dynamical systems, geometrical transformations, Google PageRank, etc.). Also, we can determine the behavior of an entire linear transformation by knowing how it acts on just a basis.

Definition 37.2.

A linear transformation from a vector space \(V\) to a vector space \(W\) is a function \(T: V \to W\) such that

  1. \(T(\vu + \vv) = T(\vu) + T(\vv)\) and

  2. \(\displaystyle T(c\vv) = cT(\vv)\)

for all \(\vu, \vv\) in \(V\) and all scalars \(c\text{.}\)

These transformations are called linear because they respect linear combinations. We can combine both parts of this definition into one statement and say that a mapping \(T\) from a vector space \(V\) to a vector space \(W\) is a linear transformation if

\begin{equation*} T(a\vu + b \vv) = aT(\vu) + bT(\vv) \end{equation*}

for all vectors \(\vu, \vv\) in \(V\) and all scalars \(a\) and \(b\text{.}\) We can extend this property of a linear transformation (by mathematical induction) to any finite linear combination of vectors. That is, if \(\vv_1\text{,}\) \(\vv_2\text{,}\) \(\ldots\text{,}\) \(\vv_k\) are any vectors in the domain of a linear transformation \(T\) and \(c_1\text{,}\) \(c_2\text{,}\) \(\ldots\text{,}\) \(c_k\) are any scalars, then

\begin{equation*} T(c_1 \vv_1 + c_2 \vv_2 + \cdots + c_k \vv_k) = c_1 T(\vv_1) + c_2 T(\vv_2) + \cdots + c_k T(\vv_k)\text{.} \end{equation*}

This is the property that \(T\) respects linear combinations.
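
To see this property concretely, here is a minimal numerical sketch in Python (using NumPy) that compares \(T(a\vu + b\vv)\) with \(aT(\vu) + bT(\vv)\) for a matrix transformation. The matrix, vectors, and scalars are arbitrary illustrative choices, not examples from the text.

    import numpy as np

    # Numerically compare T(a u + b v) with a T(u) + b T(v) for the
    # matrix transformation T(x) = A x.  A, u, v, a, b are arbitrary.
    rng = np.random.default_rng(0)
    A = rng.standard_normal((2, 3))      # T : R^3 -> R^2
    u = rng.standard_normal(3)
    v = rng.standard_normal(3)
    a, b = 2.0, -3.5

    lhs = A @ (a * u + b * v)            # T(a u + b v)
    rhs = a * (A @ u) + b * (A @ v)      # a T(u) + b T(v)
    print(np.allclose(lhs, rhs))         # True (up to rounding error)

Of course, a numerical check on a few inputs is evidence, not a proof; the proof is the two-line calculation with the definition.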

Preview Activity 37.1.

(a)

Consider the transformation \(T: \R^3 \to \R^2\) defined by

\begin{equation*} T\left( \left[ \begin{array}{c} x\\y\\z \end{array} \right] \right) = \left[ \begin{array}{c} x+y\\z+3 \end{array} \right]\text{.} \end{equation*}

Check that \(T\) is not linear by finding two vectors \(\vu, \vv\) which violate the additive property of linear transformations.
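
If you would like to experiment before writing down specific vectors, a short computation can suggest where additivity breaks down. In the following Python sketch the function T is our own encoding of the map above; watch how the constant 3 enters the two sides differently.

    import numpy as np

    # A quick experiment (not a proof): compare T(u + v) with
    # T(u) + T(v) for randomly chosen u and v.
    def T(w):
        x, y, z = w
        return np.array([x + y, z + 3])

    rng = np.random.default_rng(0)
    u = rng.standard_normal(3)
    v = rng.standard_normal(3)
    print(T(u + v))        # the +3 enters once here ...
    print(T(u) + T(v))     # ... but twice here, so the two sides disagree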

(b)

Consider the transformation \(T : \pol_2 \to \pol_3\) defined by

\begin{equation*} T(a_0+a_1t+a_2t^2) = (a_0+a_1) + a_2t + a_1t^3 \end{equation*}

Check that \(T\) is a linear transformation.

(c)

Let \(\D\) be the set of all differentiable functions from \(\R\) to \(\R\text{.}\) Since \(\frac{d}{dx}(0) = 0\text{,}\) \((f+g)'(x) = f'(x)+g'(x)\) and \((cf)'(x) = cf'(x)\) for any differentiable functions \(f\) and \(g\) and any scalar \(c\text{,}\) it follows that \(\D\) is a subspace of \(\F\text{,}\) the vector space of all functions from \(\R\) to \(\R\text{.}\) Let \(T: \D \to \F\) be defined by \(T(f) = f'\text{.}\) Check that \(T\) is a linear transformation.

(d)

Every matrix transformation is a linear transformation, so we might expect that general linear transformations share some of the properties of matrix transformations. Let \(T\) be a linear transformation from a vector space \(V\) to a vector space \(W\text{.}\) Use the linearity properties to show that \(T(\vzero_V) = \vzero_W\text{,}\) where \(\vzero_V\) is the additive identity in \(V\) and \(\vzero_W\) is the additive identity in \(W\text{.}\)

Hint.

\(\vzero_V + \vzero_V = \vzero_V\text{.}\)

Subsection Onto and One-to-One Transformations

Recall that in Section 7 we expressed existence and uniqueness questions for matrix equations in terms of one-to-one and onto properties of matrix transformations. The question about the existence of a solution to the matrix equation \(A \vx = \vb\) for any vector \(\vb\text{,}\) where \(A\) is an \(m \times n\) matrix, is also a question about the existence of a vector \(\vx\) so that \(T(\vx) = \vb\text{,}\) where \(T(\vx) = A \vx\text{.}\) If, for each \(\vb\) in \(\R^m\) there is at least one \(\vx\) with \(T(\vx) = \vb\text{,}\) then \(T\) is an onto transformation. We can make a similar definition for any linear transformation.

Definition 37.3.

A linear transformation \(T\) from a vector space \(V\) to a vector space \(W\) is onto if each \(\vb\) in \(W\) is the image of at least one \(\vx\) in \(V\text{.}\)

Similarly, the uniqueness of a solution to \(A \vx = \vb\) for any \(\vb\) in \(\Col A\) is the same as saying that for any \(\vb\) in \(\R^m\text{,}\) there is at most one \(\vx\) in \(\R^n\) such that \(T(\vx) = \vb\text{.}\) A matrix transformation with this property is one-to-one, and we can make a similar definition for any linear transformation.

Definition 37.4.

A linear transformation \(T\) from a vector space \(V\) to a vector space \(W\) is one-to-one if each \(\vb\) in \(W\) is the image of at most one \(\vx\) in \(V\text{.}\)

With matrix transformations we saw that there are easy pivot criteria for determining whether a matrix transformation is one-to-one (a pivot in each column) or onto (a pivot in each row). If a linear transformation can be represented as a matrix transformation, then we can use these ideas. However, not every general linear transformation can be easily viewed as a matrix transformation, and in those cases we might have to resort to applying the definitions directly.
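
When a transformation is given by a matrix, the pivot criteria are easy to apply with software. The following sketch (Python with SymPy; the matrix is an arbitrary example, not one from the text) counts pivots in the reduced row echelon form.

    import sympy as sp

    # Apply the pivot criteria to the matrix transformation T(x) = A x.
    A = sp.Matrix([[1, 2, 0],
                   [0, 1, 1]])           # T : R^3 -> R^2
    _, pivots = A.rref()                 # pivots = tuple of pivot columns

    print(len(pivots) == A.cols)         # pivot in every column? False: not one-to-one
    print(len(pivots) == A.rows)         # pivot in every row?    True:  onto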

Activity 37.2.

For each of the following transformations, determine if \(T\) is one-to-one and/or onto.

(a)

\(T: \pol_2 \to \pol_1\) defined by \(T(a_0+a_1t+a_2t^2) = a_0+(a_1+a_2)t\text{.}\)

(b)

\(T: \D \to \F\) defined by \(T(f) = f'\text{.}\)

Subsection The Kernel and Range of a Linear Transformation

As we saw in Preview Activity 37.1, any linear transformation sends the additive identity to the additive identity. If \(T\) is a matrix transformation defined by \(T(\vx) = A \vx\) for some \(m \times n\) matrix \(A\text{,}\) then we have seen that the set of vectors that \(T\) maps to the zero vector is \(\Nul A = \{\vx : A \vx = \vzero\}\text{,}\) which is also \(\Ker(T) = \{\vx : T(\vx) = \vzero\}\text{.}\) We can extend this idea of the kernel of a matrix transformation to any linear transformation \(T\text{.}\)

Definition 37.5.

Let \(T : V \to W\) be a linear transformation from the vector space \(V\) to the vector space \(W\text{.}\) The kernel of \(T\) is the set

\begin{equation*} \Ker(T) = \{\vx \in V : T(\vx) = \vzero_W\}\text{,} \end{equation*}

where \(\vzero_W\) is the additive identity in \(W\text{.}\)

Just as the null space of an \(m \times n\) matrix \(A\) is a subspace of \(\R^n\text{,}\) the kernel of a linear transformation from a vector space \(V\) to a vector space \(W\) is a subspace of \(V\text{.}\) The proof is left to the exercises.

The kernel of a linear transformation provides a convenient way to determine if the linear transformation is one-to-one. If \(T\) is one-to-one, then the only solution to \(T(\vx) = \vzero_W\) is \(\vzero_V\text{,}\) so \(\Ker(T)\) contains only the zero vector. If \(T\) is not one-to-one (and the domain of \(T\) is not just \(\{\vzero_V\}\)), then \(T(\vx) = \vzero_W\) has infinitely many solutions and \(\Ker(T)\) contains more than just the zero vector. We formalize this idea in the next theorem. (Compare to Theorem 13.3.) The formal proof is left for the exercises.

Theorem 37.7.

Let \(T\) be a linear transformation from a vector space \(V\) to a vector space \(W\text{.}\) Then \(T\) is one-to-one if and only if \(\Ker(T) = \{\vzero_V\}\text{.}\)
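
When \(T\) can be represented as a matrix transformation, Theorem 37.7 turns the one-to-one question into a null space computation. Here is a minimal Python sketch using SymPy; the matrix is an arbitrary example, not one from the text.

    import sympy as sp

    # If T(x) = A x, then Ker(T) = Nul A, which SymPy computes directly.
    A = sp.Matrix([[1, 2, 3],
                   [0, 1, 1]])
    kernel_basis = A.nullspace()         # a basis for Ker(T)
    print(kernel_basis)                  # [Matrix([[-1], [-1], [1]])]
    print(len(kernel_basis) == 0)        # False: Ker(T) != {0}, so T is not one-to-one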

Activity 37.3.

(a)

Let \(T: \pol_1 \to \pol_2\) be defined by \(T(a_0+a_1t) = a_1t^2\text{.}\) Find \(\Ker(T)\text{.}\) Is \(T\) one-to-one? Explain.

(b)

Let \(T: \D \to \F\) be defined by \(T(f) = f'\text{.}\) Find \(\Ker(T)\text{.}\)

Recall that the matrix-vector product \(A \vx\) is a linear combination of the columns of \(A\) and the set of all vectors of the form \(A \vx\) is the column space of \(A\text{.}\) For the matrix transformation \(T\) defined by \(T(\vx) = A\vx\text{,}\) the set of all vectors of the form \(A \vx\) is also the range of the transformation \(T\text{.}\) We can extend this idea to arbitrary linear transformations to define the range of a transformation.

Definition 37.8.

Let \(T : V \to W\) be a linear transformation from the vector space \(V\) to the vector space \(W\text{.}\) The range of \(T\) is the set

\begin{equation*} \Range(T) = \{T(\vx) : \vx \text{ is in } V\}\text{.} \end{equation*}

We also call the range of a transformation \(T\) the image of the transformation and use the notation \(\Image(T)\text{.}\)

If \(T(\vx) = A\vx\) for some \(m \times n\) matrix \(A\text{,}\) we know that \(\Col A\text{,}\) the span of the columns of \(A\text{,}\) is a subspace of \(\R^m\text{.}\) Consequently, the range of \(T\text{,}\) which is also the column space of \(A\text{,}\) is a subspace of \(\R^m\text{.}\) In general, the range of a linear transformation \(T\) from \(V\) to \(W\) is a subspace of \(W\text{.}\) The fact that the range of a transformation is a subspace of the codomain is a consequence of the more general result in the following theorem.

Theorem 37.9.

Let \(T : V \to W\) be a linear transformation from a vector space \(V\) to a vector space \(W\text{,}\) and let \(U\) be a subspace of \(V\text{.}\) Then \(T(U) = \{T(\vu) : \vu \in U\}\) is a subspace of \(W\text{.}\)

Proof.

Let \(T : V \to W\) be a linear transformation from a vector space \(V\) to a vector space \(W\text{,}\) and let \(U\) be a subspace of \(V\text{.}\) Let \(\vzero_V\) and \(\vzero_W\) be the additive identities in \(V\) and \(W\text{,}\) respectively. We have already shown that \(T(\vzero_V) = \vzero_W\text{,}\) so \(\vzero_W\) is in \(T(U)\text{.}\) To show that \(T(U)\) is a subspace of \(W\) we must also demonstrate that \(\vw+\vz\) is in \(T(U)\) whenever \(\vw\) and \(\vz\) are in \(T(U)\) and that \(a\vw\) is in \(T(U)\) whenever \(a\) is a scalar and \(\vw\) is in \(T(U)\text{.}\) Let \(\vw\) and \(\vz\) be in \(T(U)\text{.}\) Then \(T(\vx) = \vw\) and \(T(\vy) = \vz\) for some vectors \(\vx\) and \(\vy\) in \(U\text{.}\) Since \(U\) is a subspace of \(V\text{,}\) we know that \(\vx+\vy\) is in \(U\text{.}\) The fact that \(T\) is a linear transformation means that

\begin{equation*} T(\vx + \vy) = T(\vx) + T(\vy) = \vw + \vz\text{.} \end{equation*}

So \(\vw+\vz\) is in \(T(U)\text{.}\)

Finally, let \(a\) be a scalar. Since \(U\) is a subspace of \(V\text{,}\) we know that \(a \vx\) is in \(U\text{.}\) The linearity of \(T\) gives us

\begin{equation*} T(a\vx) = aT(\vx) = a\vw\text{,} \end{equation*}

so \(a \vw\) is in \(T(U)\text{.}\) We conclude that \(T(U)\) is a subspace of \(W\text{.}\)

Letting \(U = V\) shows that \(\Range(T)\) is a subspace of \(W\text{.}\)

The subspace \(\Range(T)\) provides us with a convenient criterion for the transformation \(T\) being onto. The transformation \(T\) is onto if for each \(\vb\) in \(W\) there is at least one \(\vx\) for which \(T(\vx)=\vb\text{,}\) that is, if every \(\vb\) in \(W\) belongs to \(\Range(T)\text{.}\) So \(T\) is onto if and only if \(\Range(T) = W\text{.}\)
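
For a matrix transformation this criterion is again computable: \(\Range(T) = \Col A\text{,}\) so \(T\) is onto exactly when a basis for the column space contains as many vectors as the dimension of the codomain. Here is a sketch with the same illustrative matrix as before.

    import sympy as sp

    # For T(x) = A x, Range(T) = Col A.
    A = sp.Matrix([[1, 2, 3],
                   [0, 1, 1]])           # T : R^3 -> R^2
    range_basis = A.columnspace()        # a basis for Range(T)
    print(len(range_basis) == A.rows)    # True: Range(T) = R^2, so T is onto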

Activity 37.4.

Let \(T: \pol_1 \to \pol_2\) be defined by \(T(a_0+a_1t) = a_1t^2\) as in Activity 37.3. Describe the vectors in \(\Range(T)\text{.}\) Is \(T\) onto? Explain.

Subsection Isomorphisms

If \(V\) is a vector space with a basis \(\CB = \{\vv_1, \vv_2, \ldots, \vv_n\}\text{,}\) we have seen that the coordinate transformation \(T : V \to \R^n\) defined by \(T(\vx) = [\vx]_{\CB}\) is a linear transformation that is both one-to-one and onto. This allows us to uniquely identify any vector in \(V\) with a vector in \(\R^n\) so that the vector space structure is preserved. In other words, the vector space \(V\) is for all intents and purposes the same as the vector space \(\R^n\text{,}\) except for the way we represent the vectors. This is a very powerful idea in that it shows that any vector space of dimension \(n\) is essentially the same as \(\R^n\text{.}\) When this happens we say that the vector spaces are isomorphic and call the coordinate transformation an isomorphism.

Definition 37.11.

An isomorphism from a vector space \(V\) to a vector space \(W\) is a linear transformation \(T : V \to W\) that is one-to-one and onto.

With this terminology, we summarize the discussion in the following theorem.

Theorem 37.12.

Let \(V\) be a vector space of dimension \(n\) with basis \(\CB\text{.}\) Then the coordinate transformation \(T : V \to \R^n\) defined by \(T(\vx) = [\vx]_{\CB}\) is an isomorphism, and so any vector space of dimension \(n\) is isomorphic to \(\R^n\text{.}\)

It is left for the exercises to show that if \(T: V \to W\) is an isomorphism, then \(T^{-1}: W \to V\) is also an isomorphism. So if there is an isomorphism from a vector space \(V\) to a vector space \(W\text{,}\) then there is an isomorphism from \(W\) to \(V\text{.}\) As a result, we say that \(V\) and \(W\) are isomorphic vector spaces. It is also true that any vector space is isomorphic to itself, and that if \(V\) is isomorphic to \(W\) and \(W\) is isomorphic to \(U\text{,}\) then \(V\) and \(U\) are also isomorphic. The proof of this result is left to the exercises. From this last property we can conclude that any two vector spaces of dimension \(n\) are isomorphic vector spaces. This validates the statement we made earlier that the vector space \(\pol_n\) is for all intents and purposes the same as \(\R^{n+1}\text{,}\) since the two spaces are isomorphic. So to understand all finite dimensional vector spaces, it is enough to understand \(\R^n\text{.}\) The next activity provides a little practice with isomorphisms.

Activity 37.5.

Assume that each of the following maps is a linear transformation. Which, if any, is an isomorphism? Justify your reasoning.

(a)

\(T: \pol_1 \to \R^2\) defined by \(T(a_0+a_1t) = \left[ \begin{array}{c} a_1\\a_1+a_0 \end{array} \right]\)

(b)

\(T: \M_{2 \times 2} \to \pol_3\) defined by \(T\left(\left[ \begin{array}{cc} a\amp b\\ c\amp d \end{array} \right] \right) = a+bt+ct^2+dt^3\text{.}\)

(c)

\(T: \R^2 \to \pol_2\) defined by \(T\left(\left[ \begin{array}{c} a\\ b \end{array} \right] \right) = a+bt+at^2\text{.}\)

Subsection Examples

What follows are worked examples that use the concepts from this section.

Example 37.13.

Let \(T : \pol_2 \to \pol_3\) be defined by \(T(p(t)) = tp(t)+p(0)\text{.}\)

(a)

Show that \(T\) is a linear transformation.

Solution.

To show that \(T\) is a linear transformation we must show that \(T(p(t)+q(t)) = T(p(t)) + T(q(t))\) and \(T(cp(t)) = cT(p(t))\) for every \(p(t)\text{,}\) \(q(t)\) in \(\pol_2\) and any scalar \(c\text{.}\) Let \(p(t)\) and \(q(t)\) be in \(\pol_2\text{.}\) Then

\begin{align*} T(p(t)+q(t)) \amp = t(p(t)+q(t)) + (p(0)+q(0))\\ \amp = \left(tp(t) + p(0)\right) + \left( tq(t) + q(0) \right)\\ \amp = T(p(t)) + T(q(t)) \end{align*}

and

\begin{equation*} T(cp(t)) = tcp(t) + cp(0) = c\left(tp(t)+p(0)\right) = cT(p(t))\text{.} \end{equation*}

Therefore, \(T\) is a linear transformation.

(b)

Is \(T\) one-to-one? Justify your answer.

Solution.

To determine if \(T\) is one-to-one, we find \(\Ker(T)\text{.}\) Suppose \(p(t) = a_0+a_1t+a_2t^2 \in \Ker(T)\text{.}\) Then

\begin{equation*} 0 = T(p(t)) = tp(t) + p(0) = a_0t+a_1t^2 +a_2t^3 + a_0\text{.} \end{equation*}

Equating coefficients on like powers of \(t\) shows that \(a_0 = a_1 = a_2 = 0\text{.}\) Therefore, \(p(t) = 0\text{,}\) so \(\Ker(T) = \{0\}\) and \(T\) is one-to-one.

(c)
(i)

Find three different polynomials in \(\Range(T)\text{.}\)

Solution.

We can find three polynomials in \(\Range(T)\) by applying \(T\) to three different polynomials in \(\pol_2\text{.}\) So three polynomials in \(\Range(T)\) are

\begin{align*} T(1) \amp = t+1\\ T(t) \amp = t^2\\ T\left(t^2\right) \amp = t^3\text{.} \end{align*}
(ii)

Find, if possible, a polynomial that is not in \(\Range(T)\text{.}\)

Solution.

A polynomial \(q(t) = b_0+b_1t+b_2t^2+b_3t^3\) is in \(\Range(T)\) if \(q(t) = T(p(t))\) for some polynomial \(p(t) = a_0+a_1t+a_2t^2\) in \(\pol_2\text{.}\) This would require that

\begin{equation*} b_0+b_1t+b_2t^2+b_3t^3 = a_0t+a_1t^2+a_2t^3+a_0\text{.} \end{equation*}

But this would mean that \(b_0 = a_0 = b_1\text{.}\) So the polynomial \(1+2t+t^2+t^3\) is not in \(\Range(T)\text{.}\)

(iii)

Describe \(\Range(T)\text{.}\) What is \(\dim(\Range(T))\text{?}\) Is \(T\) an onto transformation? Explain.

Solution.

Let \(q(t)\) be in \(\Range(T)\text{.}\) Then \(q(t) = T(p(t))\) for some polynomial \(p(t) = a_0+a_1t+a_2t^2 \in \pol_2\text{.}\) Thus,

\begin{align*} q(t) \amp = T(p(t))\\ \amp = tp(t) + p(0)\\ \amp = a_0t+a_1t^2+a_2t^3 + a_0\\ \amp = a_0(t+1) + a_1t^2 +a_2t^3\text{.} \end{align*}

Therefore, \(\Range(T) = \Span\{t+1, t^2, t^3\}\text{.}\) Since the reduced row echelon form of \(\left[ \begin{array}{ccc} 1\amp 0\amp 0\\1\amp 0\amp 0\\0\amp 1\amp 0\\0\amp 0\amp 1 \end{array} \right]\) is \(\left[ \begin{array}{ccc} 1\amp 0\amp 0\\0\amp 1\amp 0\\0\amp 0\amp 1\\0\amp 0\amp 0 \end{array} \right]\text{,}\) we conclude that the set \(\{t+1, t^2, t^3\}\) is linearly independent. Thus, \(\dim(\Range(T)) = 3\text{.}\) Since \(\Range(T)\) is a three-dimensional subspace of the four-dimensional space \(\pol_3\text{,}\) it follows that \(T\) is not onto.
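
As a check on this example, we can represent \(T\) by a matrix with respect to the standard bases \(\{1, t, t^2\}\) of \(\pol_2\) and \(\{1, t, t^2, t^3\}\) of \(\pol_3\) and let software confirm both conclusions. The columns of the matrix in this SymPy sketch are the coordinate vectors of \(T(1) = t+1\text{,}\) \(T(t) = t^2\text{,}\) and \(T(t^2) = t^3\) computed above.

    import sympy as sp

    # Matrix of T with respect to the standard bases of P_2 and P_3.
    M = sp.Matrix([[1, 0, 0],
                   [1, 0, 0],
                   [0, 1, 0],
                   [0, 0, 1]])
    print(M.nullspace() == [])           # True: Ker(T) = {0}, T is one-to-one
    print(len(M.columnspace()))          # 3: dim(Range(T)) = 3 < 4, T is not onto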

Example 37.14.

Let \(T: \M_{2 \times 2} \to \pol_3\) be defined by

\begin{equation*} T\left( \left[ \begin{array}{cc} a\amp b\\c\amp d \end{array} \right] \right) = a+bt+ct^2+dt^3\text{.} \end{equation*}

Show that \(T\) is an isomorphism from \(\M_{2 \times 2}\) to \(\pol_3\text{.}\)

Solution.

We need to show that \(T\) is a linear transformation, and that \(T\) is both one-to-one and onto. We start by demonstrating that \(T\) is a linear transformation. Let \(A = [a_{ij}]\) and \(B = [b_{ij}]\) be in \(\M_{2 \times 2}\text{.}\) Then

\begin{align*} T(A+B) \amp = T\left([a_{ij}+b_{ij}]\right)\\ \amp = \left(a_{11}+b_{11}\right) + \left(a_{12}+b_{12}\right)t + \left(a_{21}+b_{21}\right)t^2 + \left(a_{22}+b_{22}\right)t^3\\ \amp = \left(a_{11}+a_{12}t+a_{21}t^2+a_{22}t^3\right) + \left(b_{11}+b_{12}t+b_{21}t^2+b_{22}t^3\right)\\ \amp = T(A) + T(B)\text{.} \end{align*}

Now let \(c\) be a scalar. Then

\begin{align*} T(cA) \amp = T\left([ca_{ij}]\right)\\ \amp = \left(ca_{11}\right)+\left(ca_{12}\right)t+\left(ca_{21}\right)t^2+\left(ca_{22}\right)t^3\\ \amp = c\left(a_{11}+a_{12}t+a_{21}t^2+a_{22}t^3\right)\\ \amp = cT(A)\text{.} \end{align*}

Therefore, \(T\) is a linear transformation.

Next we determine \(\Ker(T)\text{.}\) Suppose \(A \in \Ker(T)\text{.}\) Then

\begin{equation*} 0 = T(A) = a_{11}+a_{12}t+a_{21}t^2+a_{22}t^3\text{.} \end{equation*}

Equating coefficients of like powers of \(t\) shows that \(a_{ij} = 0\) for each \(i\) and \(j\text{.}\) Thus, \(A = 0\) and \(\Ker(T) = \{0\}\text{.}\) It follows that \(T\) is one-to-one.

Finally, we show that \(\Range(T) = \pol_3\text{.}\) If \(q(t) = a+bt+ct^2+dt^3 \in \pol_3\text{,}\) then \(T\left( \left[ \begin{array}{cc} a\amp b\\c\amp d \end{array} \right] \right) = q(t)\text{.}\) So every polynomial in \(\pol_3\) is the image of some matrix in \(\M_{2 \times 2}\text{.}\) It follows that \(T\) is onto and also that \(T\) is an isomorphism.
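
The following Python sketch encodes this isomorphism and a candidate inverse directly (the names T and S and the sample matrix are our own choices) and verifies the round trip on an example.

    import sympy as sp

    t = sp.symbols('t')

    # T sends a 2x2 matrix to a cubic; S reads the coefficients back off.
    def T(A):
        return A[0, 0] + A[0, 1]*t + A[1, 0]*t**2 + A[1, 1]*t**3

    def S(q):
        c = sp.Poly(q, t).all_coeffs()[::-1]     # [a, b, c, d], low degree first
        c = c + [0]*(4 - len(c))                 # pad if degree < 3
        return sp.Matrix([[c[0], c[1]], [c[2], c[3]]])

    A = sp.Matrix([[1, -2], [0, 5]])
    print(sp.expand(T(A)))               # 5*t**3 - 2*t + 1
    print(S(T(A)) == A)                  # True: S undoes T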

Subsection Summary

Let \(T\) be a linear transformation from a vector space \(V\) to a vector space \(W\text{.}\)

  • A function \(T\) from a vector space \(V\) to a vector space \(W\) is a linear transformation if

    1. \(T(\vu + \vv) = T(\vu) + T(\vv)\) for all \(\vu\) and \(\vv\) in \(V\) and

    2. \(T(c\vu) = cT(\vu)\) for all \(\vu\) in \(V\) and all scalars \(c\text{.}\)

  • Let \(T\) be a linear transformation from a vector space \(V\) to a vector space \(W\text{.}\) The kernel of \(T\) is the set

    \begin{equation*} \Ker(T) = \{\vx \in V : T(\vx) = \vzero\}\text{.} \end{equation*}

    The kernel of \(T\) is a subspace of \(V\text{.}\)

  • Let \(T\) be a linear transformation from a vector space \(V\) to a vector space \(W\text{.}\) The transformation \(T\) is one-to-one if every vector in \(W\) is the image under \(T\) of at most one vector in \(V\text{.}\) A linear transformation \(T\) is one-to-one if and only if \(\Ker(T) = \{\vzero\}\text{.}\)

  • Let \(T\) be a linear transformation from a vector space \(V\) to a vector space \(W\text{.}\) The range of \(T\) is the set

    \begin{equation*} \Range(T) = \{T(\vv) : \vv \in V\}\text{.} \end{equation*}

    The range of \(T\) is a subspace of \(W\text{.}\)

  • Let \(T\) be a linear transformation from a vector space \(V\) to a vector space \(W\text{.}\) The linear transformation \(T\) is onto if every vector in \(W\) is the image under \(T\) of at least one vector in \(V\text{.}\) The transformation \(T\) is onto if and only if \(\Range(T) = W\text{.}\)

  • An isomorphism from a vector space \(V\) to a vector space \(W\) is a linear transformation \(T : V \to W\) that is one-to-one and onto.

Exercises Exercises

1.

We have seen that \(C[a,b]\text{,}\) the set of all continuous functions on the interval \([a,b]\) is a vector space. Let \(T : C[a,b] \to \R\) be defined by \(T(f) = \int_a^b f(x) \, dx\text{.}\) Show that \(T\) is a linear transformation.

Hint.

Use properties of the definite integral from calculus.

2.

If \(V\) is a vector space, prove that the identity map \(I:V \to V\) defined by \(I(\vx) = \vx\) for all \(\vx \in V\) is an isomorphism.

3.

Let \(V\) and \(W\) be vector spaces and let \(T : V \to W\) be an isomorphism. Since \(T\) is a bijection, \(T\) has an inverse function. (Recall that two functions \(f\) and \(g\) are inverses if \((f \circ g)(x) = x\) for all \(x\) in the domain of \(g\) and \((g \circ f)(x) = x\) for all \(x\) in the domain of \(f\text{.}\))

(a)

Let \(T: \pol_1 \to \R^2\) be defined by \(T(a_0+a_1t) = \left[ \begin{array}{c} a_0\\a_0+a_1 \end{array} \right]\text{.}\) Assume that \(T\) is an isomorphism. Show that \(S: \R^2 \to \pol_1\) defined by \(S([a \ b]^{\tr}) = a + (b-a)t\) is the inverse of \(T\text{.}\)

Hint.

Show that \((S \circ T)(\vx) = \vx\) for all \(\vx\) in \(V\) and that \((T \circ S)(\vw) = \vw\) for all \(\vw\) in \(W\text{.}\)

(b)

Let \(T: \M_{2 \times 2} \to \pol_3\) be defined by \(T\left(\left[ \begin{array}{cc} a\amp b\\ c\amp d \end{array} \right] \right) = a+bt+ct^2+dt^3\text{.}\) Assume that \(T\) is an isomorphism. Find the inverse of \(T\text{.}\) Make sure to verify that your conjectured function really is the inverse of \(T\text{.}\)

Hint.

Define \(S : \pol_3 \to \M_{2 \times 2}\) by

\begin{equation*} S(a+bt+ct^2+dt^3) = \left[ \begin{array}{cc} a\amp b\\ c\amp d \end{array} \right]\text{.} \end{equation*}
(c)

Now assume that \(T\) is an arbitrary isomorphism from a vector space \(V\) to a vector space \(W\text{.}\) We know that the inverse \(T^{-1}\) of \(T\) satisfies the property that \(T^{-1}(\vw) = \vv\) whenever \(T(\vv) = \vw\text{.}\) From previous courses you have probably shown that the inverse of a bijection is a bijection. You may assume this as fact. Show that \(T^{-1}\) is a linear transformation and, consequently, an isomorphism.

Hint.

The property that \(T^{-1}(\vw) = \vv\) whenever \(T(\vv) = \vw\) is the key to this problem.

4.

Let \(U\text{,}\) \(V\text{,}\) and \(W\) be vector spaces, and let \(S : U \to V\) and \(T : V \to W\) be linear transformations. Determine which of the following is true. If a statement is true, verify it. If a statement is false, give a counterexample. (The result of this exercise, along with Exercise 2 and Exercise 3, shows that the isomorphism relation is reflexive, symmetric, and transitive. Thus, isomorphism is an equivalence relation.)

(a)

\(T \circ S\) is a linear transformation

(b)

\(S \circ T\) is a linear transformation

(c)

If \(S\) and \(T\) are one-to-one, then \(T \circ S\) is one-to-one.

(d)

If \(S\) and \(T\) are onto, then \(T \circ S\) is onto.

(e)

If \(S\) and \(T\) are isomorphisms, then \(T \circ S\) is an isomorphism.

5.

For each of the following maps, determine which is a linear transformation. If a mapping is not a linear transformation, provide a specific example to show that a property of a linear transformation is not satisfied. If a mapping is a linear transformation, verify that fact and then determine if the mapping is one-to-one and/or onto. Throughout, let \(V\) and \(W\) be vector spaces.

(a)

\(T: V \to V\) defined by \(T(\vx) = -\vx\)

(b)

\(T: V \to V\) defined by \(T(\vx) = 2\vx\)

6.

Let \(V\) be a vector space. In this exercise, we will investigate mappings \(T: V \to V\) of the form \(T(\vx) = k\vx\text{,}\) where \(k\) is a scalar.

(a)

For which values, if any, of \(k\) is \(T\) a linear transformation?

(b)

For which values of \(k\text{,}\) if any, is \(T\) an isomorphism?

7.

Let \(V\) be a finite-dimensional vector space and \(W\) a vector space. Show that if \(T: V \to W\) is an isomorphism, and \(\CB = \{\vv_1, \vv_2, \ldots, \vv_n\}\) is a basis for \(V\text{,}\) then \(\CC = \{T(\vv_1)\text{,}\) \(T(\vv_2)\text{,}\) \(\ldots\text{,}\) \(T(\vv_n)\}\) is a basis for \(W\text{.}\) Hence, if \(V\) is a vector space of dimension \(n\) and \(W\) is isomorphic to \(V\text{,}\) then \(\dim(W) = n\) as well.

Hint.

Use the fact that \(T\) is one-to-one to show that \(\CC\) is linearly independent, and that \(T\) is onto to show that \(\CC\) spans \(W\text{.}\)

8.

Let \(W = \left\{ \left[ \begin{array}{rr} x \amp -x \\ -x \amp x \end{array} \right] : x \text{ is a scalar } \right\}\text{.}\)

(a)

Show that \(W\) is a subspace of \(\M_{2 \times 2}\text{.}\)

(b)

The space \(W\) is isomorphic to \(\R^k\) for some \(k\text{.}\) Determine the value of \(k\) and explain why \(W\) is isomorphic to \(\R^k\text{.}\)

9.

Is it possible for a vector space to be isomorphic to one of its proper subspaces? Justify your answer.

11.

Prove Theorem 37.7.

Hint.

Show that \(T(\vv) = \vw\) has at most one solution for each \(\vw\) in \(W\text{.}\)

12.

Let \(V\) and \(W\) be finite dimensional vector spaces, and let \(T : V \to W\) be a linear transformation. Show that \(T\) is uniquely determined by its action on a basis for \(V\text{.}\) That is, if \(\CB = \{\vv_1, \vv_2, \ldots, \vv_n\}\) is a basis for \(V\text{,}\) and we know \(T(\vv_1)\text{,}\) \(T(\vv_2)\text{,}\) \(\ldots\text{,}\) \(T(\vv_n)\text{,}\) then we know exactly which vector \(T(\vu)\) is for any \(\vu\) in \(V\text{.}\)

13.

Let \(V\) and \(W\) be vectors spaces and let \(T: V \to W\) be a linear transformation. If \(V'\) is a subspace of \(V\text{,}\) let

\begin{equation*} T(V') = \{T(\vv) : \vv \text{ is in } V'\}\text{.} \end{equation*}
(a)

Show that \(T(V')\) is a subspace of \(W\text{.}\)

Hint.

Think of \(T(V')\) as the range of a suitable restriction of \(T\text{.}\)

(b)

Suppose \(\dim(V') = n\text{.}\) Show that \(\dim(T(V')) \leq n\text{.}\) Can \(\dim(T(V')) = n\) even if \(T\) is not an isomorphism? Justify your answer.

Hint.

Use Exercise 12. It is possible.

14.

Let \(V\) and \(W\) be vector spaces and define \(L(V,W)\) to be the set of all linear transformations from \(V\) to \(W\text{.}\) If \(S\) and \(T\) are in \(L(V,W)\) and \(c\) is a scalar, then define \(S+T\) and \(cT\) as follows:

\begin{equation*} (S+T)(\vv) = S(\vv) + T(\vv) \ \ \text{ and } \ \ (cT)(\vv) = cT(\vv) \end{equation*}

for all \(\vv\) in \(V\text{.}\) Prove that \(L(V,W)\) is a vector space.

15.

The Rank-Nullity Theorem (Theorem 15.5) states that the rank plus the nullity of a matrix equals the number of columns of the matrix. There is a similar result for linear transformations that we prove in this exercise. Let \(T : V \to W\) be a linear transformation from an \(n\) dimensional vector space \(V\) to an \(m\) dimensional vector space \(W\text{.}\) Show that \(\dim(\Ker(T)) + \dim(\Range(T)) = n\text{.}\) (Hint: Let \(\{\vv_1, \vv_2, \ldots, \vv_k\}\) be a basis for \(\Ker(T)\text{.}\) Extend this basis to a basis \(\{\vv_1\text{,}\) \(\vv_2\text{,}\) \(\ldots\text{,}\) \(\vv_k\text{,}\) \(\vv_{k+1}\text{,}\) \(\ldots\text{,}\) \(\vv_n\}\) of \(V\text{.}\) Use this basis to find a basis for \(\Range(T)\text{.}\))

16.

Let \(V\) and \(W\) be vector spaces with \(\dim(V) = n = \dim(W)\text{,}\) and let \(T: V \to W\) be a linear transformation.

(a)

Prove or disprove: If \(T\) is one-to-one, then \(T\) is also onto.

(b)

Prove or disprove: If \(T\) is onto, then \(T\) is also one-to-one.

17.

Suppose \(\CB_1\) is a basis of an \(n\)-dimensional vector space \(V\text{,}\) and \(\CB_2\) is a basis of an \(n\)-dimensional vector space \(W\text{.}\) Show that the map \(T_{VW}\) which sends every \(\vx\) in \(V\) to the vector \(\vy\) in \(W\) such that \([\vx]_{\CB_1} = [\vy]_{\CB_2}\) is a linear transformation. (Combined with Exercise 16 in Section 16, we can conclude that \(T_{VW}\) is an isomorphism. Thus, any two vector spaces of dimension \(n\) are isomorphic.)

18.

There is an important characterization of linear functionals that we examine in this problem. A linear functional is a linear transformation from a vector space \(V\) to \(\R\text{.}\) Let \(V\) be a finite-dimensional inner product space, and let \(T: V \to \R\) be a linear functional. We will show that there is a vector \(\vx\) in \(V\) such that

\begin{equation*} T(\vv) = \langle \vv, \vx \rangle \end{equation*}

for every vector \(\vv\) in \(V\text{.}\) That is, every linear functional on an inner product space can be represented by an inner product with a fixed vector. Let \(n = \dim(V)\text{.}\)

(a)

Suppose \(\Range(T) = \{\vzero\}\text{.}\) What vector could we use for \(\vx\text{?}\)

(b)

Now assume that \(\dim(\Range(T)) > 0\text{.}\) Note that \(\Range(T)\) is a subspace of \(\R\text{.}\) Explain why the only subspaces of \(\R\) are \(\{0\}\) and \(\R\text{.}\) Conclude that \(\Range(T) = \R\text{.}\)

(c)

Explain why \(\dim(\Ker(T)) = n-1\text{.}\)

(d)

Let \(\{\vv_1, \vv_2, \ldots, \vv_{n-1}\}\) be an orthonormal basis for \(\Ker(T)\text{.}\) Explain why there is a vector \(\vv_n\) in \(V\) so that \(\{\vv_1, \vv_2, \ldots, \vv_{n-1}, \vv_n\}\) is an orthonormal basis for \(V\text{.}\)

(e)

Let \(\vv\) be in \(V\text{.}\) Explain why

\begin{equation*} \vv = \langle \vv, \vv_1 \rangle \vv_1 + \langle \vv, \vv_2 \rangle \vv_2 + \cdots + \langle \vv, \vv_n \rangle \vv_n\text{.} \end{equation*}
(f)

Now explain why

\begin{equation*} T(\vv) = \langle \vv, \vv_n \rangle T(\vv_n)\text{.} \end{equation*}
(g)

Finally, explain why \(T(\vv_n)\vv_n\) is the vector \(\vx\) we are looking for.

(h)

As an example, let \(T: \pol_2 \to \R\) be defined by \(T(p(t)) = p(0)\text{.}\) Consider \(\pol_2\) as an inner product space with the inner product

\begin{equation*} \langle p(t), q(t) \rangle = \int_{-1}^1 p(t)q(t) \, dt\text{.} \end{equation*}

Find a polynomial \(h(t)\) so that

\begin{equation*} T(p(t)) = \int_{-1}^1 p(t)h(t) \, dt \end{equation*}

for every \(p(t)\) in \(\pol_2\text{.}\)

19.

Label each of the following statements as True or False. Provide justification for your response.

(a) True/False.

The mapping \(T: \M_{2 \times 2} \to \R\) defined by \(T(A) = \det(A)\) is a linear transformation.

(b) True/False.

The mapping \(T: \M_{2 \times 2} \to \M_{2 \times 2}\) defined by \(T(A) = A^{-1}\) if \(A\) is invertible and \(T(A) = 0\) (the zero matrix) otherwise is a linear transformation.

(c) True/False.

Let \(V\) and \(W\) be vector spaces. The mapping \(T: V \to W\) defined by \(T(\vx) = \vzero\) is a linear transformation.

(d) True/False.

If \(T: V \to W\) is a linear transformation, and if \(\{\vv_1, \vv_2\}\) is a linearly independent set in \(V\text{,}\) then the set \(\{T(\vv_1), T(\vv_2)\}\) is a linearly independent set in \(W\text{.}\)

(e) True/False.

A one-to-one transformation is a transformation where each input has a unique output.

(f) True/False.

A one-to-one linear transformation is a transformation where each output can only come from a unique input.

(g) True/False.

Let \(V\) and \(W\) be vector spaces with \(\dim(V) = 2\) and \(\dim(W) = 3\text{.}\) A linear transformation from \(V\) to \(W\) cannot be onto.

(h) True/False.

Let \(V\) and \(W\) be vector spaces with \(\dim(V) = 3\) and \(\dim(W) = 2\text{.}\) A linear transformation from \(V\) to \(W\) cannot be onto.

(i) True/False.

Let \(V\) and \(W\) be vector spaces with \(\dim(V) = 3\) and \(\dim(W) = 2\text{.}\) A linear transformation from \(V\) to \(W\) cannot be one-to-one.

(j) True/False.

If \(\vw\) is in the range of a linear transformation \(T\text{,}\) then there is an \(\vx\) in the domain of \(T\) such that \(T(\vx)=\vw\text{.}\)

(k) True/False.

If \(T\) is a one-to-one linear transformation, then \(T(\vx)=\vzero\) has a non-trivial solution.

Subsection Project: Fractals via Iterated Function Systems

In this project we will see how to use linear transformations to create iterated function systems to generate fractals. We illustrate the idea with the Sierpinski triangle, an approximate picture of which is shown at left in Figure 37.15.

Figure 37.15. Left: An approximation of the Sierpinski triangle. Right: A triangle.

To make this figure, we need to identify linear transformations that can be put together to produce the Sierpinski triangle.

Project Activity 37.6.

Let \(\vv_1 = [0 \ 0]^{\tr}\text{,}\) \(\vv_2 = [1 \ 0]^{\tr}\text{,}\) and \(\vv_3 = \left[ \frac{1}{2} \ \frac{\sqrt{3}}{4}\right]^{\tr}\) be three vectors in the plane whose endpoints form the vertices of a triangle \(P_0\) as shown at right in Figure 37.15. Let \(T : \R^2 \to \R^2\) be the linear transformation defined by \(T(\vx) = A \vx\text{,}\) where \(A = \frac{1}{2} I_2 = \frac{1}{2} \left[ \begin{array}{cc} 1\amp 0 \\ 0\amp 1 \end{array} \right]\text{.}\)

(a)

What are \(T(\vv_1)\text{,}\) \(T(\vv_2)\text{,}\) and \(T(\vv_3)\text{?}\) Draw a picture of the figure \(T(P_0)\) whose vertices are the endpoints of these three vectors. How is \(T(P_0)\) related to \(P_0\text{?}\)

(b)

Since the transformation \(T\) shrinks objects, we call \(T\) (or \(A\)) a contraction mapping. Notice that when we apply \(T\) to \(P_0\) it creates a smaller version of \(P_0\text{.}\) The next step is to use \(T\) to make three smaller copies of \(P_0\text{,}\) and then translate these copies to the vertices of \(P_0\text{.}\) A translation can be performed by adding a vector to \(T(P_0)\text{.}\) Find \(C_1\) so that \(C_1(P_0)\) is a copy of \(P_0\) half the size and translated so that \(C_1(\vv_1) = \vv_1\text{.}\) Then find \(C_2\) so that \(C_2(P_0)\) is a copy of \(P_0\) half the size and translated so that \(C_2(\vv_2) = \vv_2\text{.}\) Finally, find \(C_3\) so that \(C_3(P_0)\) is a copy of \(P_0\) half the size and translated so that \(C_3(\vv_3) = \vv_3\text{.}\) Draw pictures of each to illustrate.

Project Activity 37.6 contains the information we need to create an iterated function system to produce the Sierpinski triangle. One more step should help us understand the general process.

Project Activity 37.7.

Using the results of Project Activity 37.6, define \(P_{1,i}\) to be \(C_i(P_0)\) for each \(i\text{.}\) That is, \(P_{1,1} = C_1(P_0)\text{,}\) \(P_{1,2} = C_2(P_0)\text{,}\) and \(P_{1,3} = C_3(P_0)\text{.}\) So \(P_{1,i}\) is a triangle half the size of the original translated to the \(i\)th vertex of the original. Let \(P_1 = \bigcup_{i=1}^3 P_{1,i}\text{.}\) That is, \(P_1\) is the union of the shaded triangles in Figure 37.16.

Figure 37.16. \(C_1(P_0)\text{,}\) \(C_2(P_0)\text{,}\) and \(C_3(P_0)\text{.}\)
(a)

Apply \(C_1\) from Project Activity 37.6 to \(P_1\text{.}\) What is the resulting figure? Draw a picture to illustrate.

(b)

Apply \(C_2\) from Project Activity 37.6 to \(P_1\text{.}\) What is the resulting figure? Draw a picture to illustrate.

(c)

Apply \(C_3\) from Project Activity 37.6 to \(P_1\text{.}\) What is the resulting figure? Draw a picture to illustrate.

The procedures from Project Activity 37.6 and Project Activity 37.7 can be continued, replacing \(P_0\) with \(P_1\text{,}\) then \(P_2\text{,}\) and so on. In other words, for \(i\) = 1, 2, and 3, let \(P_{2,i} = C_i(P_1)\text{.}\) Then let \(P_2 = \bigcup_{i=1}^3 P_{2,i}\text{.}\) A picture of \(P_2\) is shown at left in Figure 37.17. We can continue this procedure, each time replacing \(P_{j-1}\) with \(P_j\text{.}\) A picture of \(P_9\) is shown at right in Figure 37.17.

Figure 37.17. Left: \(P_2\text{.}\) Right: \(P_9\text{.}\)

If we continue this process, taking the limit as \(j\) approaches infinity, the resulting sequence of sets converges to a fixed set, in this case the famous Sierpinski triangle. So the picture of \(P_9\) in Figure 37.17 is a close approximation of the Sierpinski triangle. This algorithm for building the Sierpinski triangle is called the deterministic algorithm. A Sage cell to illustrate this algorithm for producing approximations to the Sierpinski triangle can be found at faculty.gvsu.edu/schlicks/STriangle_Sage.html.
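
A minimal Python sketch of the deterministic algorithm (using NumPy and Matplotlib) follows. It iterates the contractions from Project Activity 37.6, written here as \(C_i(\vx) = \frac{1}{2}\vx + \frac{1}{2}\vv_i\text{;}\) the random starting set and the subsampling that keeps the point set small are implementation shortcuts, so the picture approximates \(P_8\text{.}\)

    import numpy as np
    import matplotlib.pyplot as plt

    # Translations b_i = v_i / 2, using the vertices from Project Activity 37.6.
    shifts = [np.array([0.0, 0.0]),            # v_1 / 2
              np.array([0.5, 0.0]),            # v_2 / 2
              np.array([0.25, np.sqrt(3)/8])]  # v_3 / 2

    rng = np.random.default_rng(0)
    points = rng.random((500, 2))              # any starting set S_0 works
    for _ in range(8):                         # build P_1, ..., P_8
        points = np.vstack([0.5*points + b for b in shifts])
        points = points[rng.choice(len(points), 2000)]   # subsample to stay small

    plt.scatter(points[:, 0], points[:, 1], s=0.1)
    plt.gca().set_aspect('equal')
    plt.show()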

In general, an iterated function system (IFS) is a finite set \(\{f_1, f_2, \ldots, f_m\}\) of contraction mappings from \(\R^2\) to \(\R^2\text{.}\) If we start with a set \(S_0\text{,}\) and let \(S_1 = \bigcup f_i(S_0)\text{,}\) \(S_2 = \bigcup f_i(S_1)\text{,}\) and so on, then in “nice” cases (we won't concern ourselves with what “nice” means here), the sequence \(S_0\text{,}\) \(S_1\text{,}\) \(S_2\text{,}\) \(\ldots\) converges in some sense to a fixed set. That fixed set is called the attractor of the iterated function system. It is that attractor that is the fractal. It is fascinating to note that our starting set does not matter: the same attractor is obtained no matter which set we use as \(S_0\text{.}\)

One aspect of a fractal is that fractals are self-similar, that is, they are made up of constituent pieces that look just like the whole. More specifically, a subset \(S\) of \(\R^2\) is self-similar if it can be expressed in the form

\begin{equation*} S = S_1 \cup S_2 \cup \cdots \cup S_k \end{equation*}

for non-overlapping sets \(S_1\text{,}\) \(S_2\text{,}\) \(\ldots\text{,}\) \(S_k\text{,}\) each of which is congruent to \(S\) scaled by the same factor. So, for example, the Sierpinski triangle is self-similar, made up of three copies of the whole, each contracted by a factor of \(2\text{.}\)

If we have an IFS, then we can determine the attractor by drawing the sequence of sets that the IFS generates. A more interesting problem is, given a self-similar figure, whether we can construct an IFS that has that figure as its attractor.

Project Activity 37.8.

A picture of an emerging Sierpinski carpet is shown at left in Figure 37.18. A Sage cell to illustrate this algorithm for producing approximations to the Sierpinski carpet can be found at faculty.gvsu.edu/schlicks/SCarpet_Sage.html. In this activity we will see how to find an iterated function system that will generate this fractal.

Figure 37.18. A Sierpinski carpet.
(a)

To create an IFS to generate this fractal, we need to understand how many self-similar pieces make up this figure. Use the image at right in Figure 37.18 to determine how many pieces we need.

(b)

For each of the self-similar pieces identified in part (a), find a linear transformation and a translation that maps the entire figure to the self-similar piece.

Hint.

You could assume that the carpet is embedded in the unit square.

(c)

Test your IFS to make sure that it actually generates the Sierpinski carpet. There are many websites that allow you to do this, one of which is cs.lmu.edu/~ray/notes/ifs/. In this program, the mapping \(f\) defined by

\begin{equation*} f(\vx) = \left[ \begin{array}{cc} a\amp b\\c\amp d \end{array} \right] \vx + \left[ \begin{array}{c} e\\f \end{array} \right] \end{equation*}

is represented as the string

\begin{equation*} a \ b \ c \ d \ e \ f\text{.} \end{equation*}

Most programs use a different algorithm to create the attractor, plotting points instead of sets. In this algorithm, each contraction mapping is assigned a probability as well (the larger parts of the figure are usually given higher probabilities), so you will enter each contraction mapping in the form

\begin{equation*} a \ b \ c \ d \ e \ f \ p \end{equation*}

where \(p\) is the probability attached to that contraction mapping. As an example, the IFS code for the Sierpinski triangle is

\begin{equation*} \begin{array}{ccccccc} 0.5 \amp 0 \amp 0 \amp 0.5 \amp 0 \amp 0 \amp 0.33 \\ 0.5 \amp 0 \amp 0 \amp 0.5 \amp 1 \amp 0 \amp 0.33 \\ 0.5 \amp 0 \amp 0 \amp 0.5 \amp 0.5 \amp 0.43 \amp 0.33 \end{array} \end{equation*}
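
Here is a rough Python sketch of this random point-plotting algorithm applied to the Sierpinski triangle IFS code above. Since the listed probabilities sum to 0.99, the sketch nudges one of them to 0.34 so that they sum to exactly 1.

    import numpy as np
    import matplotlib.pyplot as plt

    # Each row is (a, b, c, d, e, f, p) for the map x -> [[a,b],[c,d]] x + [e,f].
    rows = [(0.5, 0, 0, 0.5, 0.0, 0.0,  0.34),
            (0.5, 0, 0, 0.5, 1.0, 0.0,  0.33),
            (0.5, 0, 0, 0.5, 0.5, 0.43, 0.33)]
    probs = [r[6] for r in rows]

    rng = np.random.default_rng(0)
    x = np.array([0.0, 0.0])
    pts = []
    for _ in range(20000):
        # Pick one contraction mapping at random (weighted by p) and apply it.
        a, b, c, d, e, f, _ = rows[rng.choice(len(rows), p=probs)]
        x = np.array([a*x[0] + b*x[1] + e, c*x[0] + d*x[1] + f])
        pts.append(x)

    pts = np.array(pts[100:])                  # discard the transient start
    plt.scatter(pts[:, 0], pts[:, 1], s=0.1)
    plt.gca().set_aspect('equal')
    plt.show()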

The contraction mappings we have used so far only involve contractions and translations. But we do not have to restrict ourselves to just these types of contraction mappings.

Project Activity 37.9.

Consider the fractal represented in Figure 37.19. Find an IFS that generates this fractal. Explain your reasoning.

Check with a fractal generator to ensure that you have an appropriate IFS.

Figure 37.19. A fractal.
Hint.

Two reflections are involved.

We conclude our construction of fractals with one more example. The contraction mappings in an iterated function system can also involve rotations.

Project Activity 37.10.

Consider the Lévy Dragon fractal shown at left in Figure 37.20. Find an IFS that generates this fractal. Explain your reasoning.

Check with a fractal generator to ensure that you have an appropriate IFS.

Figure 37.20. The Lévy Dragon.
Hint.

Two rotations are involved — think of the fractal as contained in a blue triangle as shown at right in Figure 37.20.

We end this project with a short discussion of fractal dimension. The fractals we have seen are very strange in many ways, one of which is dimension. We have studied the dimensions of subspaces of \(\R^n\) — each subspace has an integer dimension that is determined by the number of elements in a basis. Fractals also have a dimension, but the dimension of a fractal is generally not an integer. To try to understand fractal dimension, notice that a line segment is self-similar. We can break up a line segment into 2 non-overlapping line segments of the same length, or 3, or 4, or any number of non-overlapping segments of the same length. A square is slightly different. We can partition a square into 4 non-overlapping squares, or 9 non-overlapping squares, or \(n^2\) non-overlapping squares for any positive integer \(n\) as shown in Figure 37.21.

Figure 37.21. Self-similar squares.

Similarly, we can break up a cube into \(n^3\) non-overlapping congruent cubes. A line segment lies in a one-dimensional space, a square in a two-dimensional space, and a cube in the three-dimensional space. Notice that in each case the dimension is the exponent in the number of self-similar pieces into which we can partition the object at scaling factor \(n\text{.}\) We can use this idea of dimension in a way that we can apply to fractals. Let \(d(\text{ object } )\) be the dimension of the object. We can partition a square into \(n^2\) non-overlapping squares, so

\begin{equation*} d(\text{ square } ) = 2 = 2\frac{\ln(n)}{\ln(n)} = \frac{\ln(n^2)}{\ln(n)} = \frac{\ln(\text{ number of self-similar pieces } )}{\ln(\text{ scaling factor } )}\text{.} \end{equation*}

Similarly, for the cube we have

\begin{equation*} d(\text{ cube } ) = 3 = 3\frac{\ln(n)}{\ln(n)} = \frac{\ln(n^3)}{\ln(n)} = \frac{\ln(\text{ number of self-similar pieces } )}{\ln(\text{ scaling factor } )}\text{.} \end{equation*}

We can then take this as our definition of the dimension of a self-similar object, when the scaling factors are all the same (the fern fractal in Figure 37.1 is an example of a fractal generated by an iterated function system in which the scaling factors are not all the same).

Definition 37.22.

The fractal or Hausdorff dimension \(h\) of a self-similar set \(S\) is

\begin{equation*} h(S) = \frac{\ln(\text{ number of self-similar pieces } )}{\ln(\text{ scaling factor } )}\text{.} \end{equation*}
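
As a sanity check of this definition, the following short Python helper recovers the integer dimensions of the square and the cube discussed above; the same helper applies in Project Activity 37.11.

    from math import log

    # Fractal (Hausdorff) dimension for a self-similar set with a
    # single scaling factor.
    def hausdorff_dim(num_pieces, scaling_factor):
        return log(num_pieces) / log(scaling_factor)

    print(hausdorff_dim(9, 3))     # 2.0  (square: 9 pieces at scaling factor 3)
    print(hausdorff_dim(8, 2))     # 3.0  (cube: 8 pieces at scaling factor 2)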

Project Activity 37.11.

Find the fractal dimensions of the Sierpinski triangle and the Sierpinski carpet. These are well-known and you can look them up to check your result. Then find the fractal dimension of the fractal with IFS

\begin{equation*} \begin{array}{ccccccc} 0.38 \amp 0 \amp 0 \amp 0.38 \amp -0.59 \amp 0.81 \amp 0.2 \\ 0.38 \amp 0 \amp 0 \amp 0.38 \amp -0.95 \amp -0.31 \amp 0.2 \\ 0.38 \amp 0 \amp 0 \amp 0.38 \amp 0 \amp -1 \amp 0.2 \\ 0.38 \amp 0 \amp 0 \amp 0.38 \amp 0.95 \amp -0.31 \amp 0.2 \\ 0.38 \amp 0 \amp 0 \amp 0.38 \amp 0.59 \amp 0.81 \amp 0.2 \end{array}\text{.} \end{equation*}

You might want to draw this fractal using an online generator.