Skip to main content

Section 6.5 Inverse Functions

For this section, we will use the concept of Cartesian product of two sets \(A\) and \(B\text{,}\) denoted by \(A \times B\text{,}\) which is the set of all ordered pairs \((x, y)\) where \(x \in A\) and \(y \in B\text{.}\) That is, \(A \times B = \left\{ { {\left( {x,y} \right)} \mid x \in A\text{ and } y \in B} \right\}\text{.}\) See Beginning Activity 2 in Section 5.4 for a more thorough discussion of this concept.

Beginning Activity Beginning Activity 1: Functions and Sets of Ordered Pairs

When we graph a real function, we plot ordered pairs in the Cartesian plane where the first coordinate is the input of the function and the second coordinate is the output of the function. For example, if \(g\x \mathbb{R} \to \mathbb{R}\text{,}\) then every point on the graph of \(g\) is an ordered pair \(( {x, y} )\) of real numbers where \(y = g( x )\text{.}\) This shows how we can generate ordered pairs from a function. It happens that we can do this with any function. For example, let

\begin{equation*} A = \left\{ {1, 2, 3} \right\} \text{ and } B = \left\{ {a, b} \right\}\text{.} \end{equation*}

Define the function \(F\x A \to B\) by

\begin{equation*} F( 1 ) = a, F( 2 ) = b, \text{ and } F( 3 ) = b\text{.} \end{equation*}

We can convert each of these to an ordered pair in \(A \times B\) by using the input as the first coordinate and the output as the second coordinate. For example, \(F( 1 ) = a\) is converted to \(( 1, a )\text{,}\) \(F( 2 ) = b\) is converted to \(( 2, b )\text{,}\) and \(F( 3 ) = b\) is converted to \(( 3, b )\text{.}\) So we can think of this function as a set of ordered pairs, which is a subset of \(A \times B\text{,}\) and write

\begin{equation*} F = \left\{ {( {1, a} ), ( {2, b} ), ( {3, b} )} \right\}\!\text{.} \end{equation*}

Note: Since \(F\) is the name of the function, it is customary to use \(F\) as the name for the set of ordered pairs.

1.

Let \(A = \left\{ {1, 2, 3} \right\}\) and let \(C = \left\{ {a, b, c, d} \right\}\text{.}\) Define the function \(g\x A \to C\) by \(g( 1 ) = a\text{,}\) \(g( 2 ) = b\text{,}\) and \(g( 3 ) = d\text{.}\) Write the function \(g\) as a set of ordered pairs in \(A \times C\text{.}\)

For another example, if we have a real function, such as \(g\x \mathbb{R} \to \mathbb{R}\) by \(g( x ) = x^2 - 2\text{,}\) then we can think of \(g\) as the following infinite subset of \(\mathbb{R} \times \mathbb{R}\text{:}\)

\begin{equation*} g = \left\{ { {( {x, y} ) \in \mathbb{R} \times \mathbb{R}} \mid y = x^2 - 2} \right\}\!\text{.} \end{equation*}

We can also write this as \(g = \left\{ {( {x, x^2 - 2} )} \mid x \in \mathbb{R} \right\}\text{.}\)

2.

Let \(f\x \Z \to \Z\) be defined by \(f(m) = 3m +5\text{,}\) for all \(m \in \Z\text{.}\) Use set builder notation to write the function \(f\) as a set of ordered pairs, and then use the roster method to write the function \(f\) as a set of ordered pairs.

So any function \(f\x A \to B\) can be thought of as a set of ordered pairs that is a subset of \(A \times B\text{.}\) This subset is

\begin{equation*} f = \left\{ { {( {a, f( a )} ) } \mid a \in A} \right\} \text{ or } f = \left\{ {( {a, b} ) \in A \times B \mid b = f( a )} \right\}\!\text{.} \end{equation*}

On the other hand, if we started with \(A = \left\{ {1, 2, 3} \right\}\text{,}\) \(B = \left\{ {a, b} \right\}\text{,}\) and

\begin{equation*} G = \left\{ {( {1, a} ), ( {2, a} ), ( {3, b} )} \right\} \subseteq A \times B\text{,} \end{equation*}

then we could think of \(G\) as a function from \(A\) to \(B\) with \(G( 1 ) = a\text{,}\) \(G( 2 ) = a\text{,}\) and \(G( 3 ) = b\text{.}\) The idea is to use the first coordinate of each ordered pair as the input, and the second coordinate as the output. However, not every subset of \(A \times B\) can be used to define a function from \(A\) to \(B\text{.}\) This is explored in the following questions.

3.

Let \(f = \left\{ {( {1, a} ), ( {2, a} ), ( {3, a} ), ( {1, b} )} \right\}\text{.}\) Could this set of ordered pairs be used to define a function from \(A\) to \(B\text{?}\) Explain.

4.

Let \(g = \left\{ {( {1, a} ), ( {2, b} ), ( {3, a} )} \right\}\text{.}\) Could this set of ordered pairs be used to define a function from \(A\) to \(B\text{?}\) Explain.

5.

Let \(h = \left\{ {( {1, a} ), ( {2, b} )} \right\}\text{.}\) Could this set of ordered pairs be used to define a function from \(A\) to \(B\text{?}\) Explain.

Beginning Activity Beginning Activity 2: A Composition of Two Specific Functions

Let \(A = \left\{ {a, b, c, d} \right\}\) and let \(B = \left\{ {p, q, r, s} \right\}\text{.}\)

1.

Construct an example of a function \(f\x A \to B\) that is a bijection. Draw an arrow diagram for this function.

2.

On your arrow diagram, draw an arrow from each element of \(B\) back to its corresponding element in \(A\text{.}\) Explain why this defines a function from \(B\) to \(A\text{.}\)

3.

If the name of the function in Exercise 2 is \(g\text{,}\) so that \(g\x B \to A\text{,}\) what are \(g( p )\text{,}\) \(g( q )\text{,}\) \(g( r )\text{,}\) and \(g( s )\text{?}\)

4.

Construct a table of values for each of the functions \(g \circ f\x A \to A\) and \(f \circ g\x B \to B\text{.}\) What do you observe about these tables of values?

Subsection The Ordered Pair Representation of a Function

In Beginning Activity 1, we observed that if we have a function \(f\x A \to B\text{,}\) we can generate a set of ordered pairs \(f\) that is a subset of \(A \times B\) as follows:

\begin{equation*} f = \left\{ { {\left( {a, f( a )} \right)} \mid a \in A} \right\} \text{ or } f = \left\{ {( {a, b} ) \in A \times B \mid b = f( a )} \right\}\!\text{.} \end{equation*}

However, we also learned that some sets of ordered pairs cannot be used to define a function. We now wish to explore under what conditions a set of ordered pairs can be used to define a function. Starting with a function \(f\x A \to B\text{,}\) since \(\text{ dom} ( f ) = A\text{,}\) we know that

\begin{equation} \text{ For every } a \in A, \text{ there exists a } b \in B \text{ such that } ( {a, b} ) \in f\text{.}\tag{55} \end{equation}

Specifically, we use \(b = f( a )\text{.}\) This says that every element of \(A\) can be used as an input. In addition, to be a function, each input can produce only one output. In terms of ordered pairs, this means that there will never be two ordered pairs \(( {a, b} )\) and \(( {a, c} )\) in the function \(f\) where \(a \in A\text{,}\) \(b, c \in B\text{,}\) and \(b \ne c\text{.}\) We can formulate this as a conditional statement as follows:

\begin{align} \amp \text{ For every } a \in A \text{ and every } b, c \in B, \amp \amp\notag\\ \amp \text{ if } ( {a, b} ) \in f \text{ and } ( {a, c} ) \in f, \text{ then } b = c\tag{56} \end{align}

This also means that if we start with a subset \(f\) of \(A \times B\) that satisfies conditions (55) and (56), then we can consider \(f\) to be a function from \(A\) to \(B\) by using \(b = f( a )\) whenever \(( {a, b} )\) is in \(f\text{.}\) This proves the following theorem.

A Note about Theorem 6.31. The first condition in Theorem 6.31 means that every element of \(A\) is an input, and the second condition ensures that every input has exactly one output. Many texts will use Theorem 6.31 as the definition of a function. Many mathematicians believe that this ordered pair representation of a function is the most rigorous definition of a function. It allows us to use set theory to work with and compare functions. For example, equality of functions becomes a question of equality of sets. Therefore, many textbooks will use the ordered pair representation of a function as the definition of a function.

Progress Check 6.32. Sets of Ordered Pairs that Are Not Functions.

Let \(A = \left\{ {1, 2, 3} \right\}\) and let \(B = \left\{ {a, b} \right\}\text{.}\) Explain why each of the following subsets of \(A \times B\) cannot be used to define a function from \(A\) to \(B\text{.}\)

(a)

\(F = \left\{ {( {1, a} ), ( {2, a} )} \right\}\text{.}\)

Solution.

The set \(F\) does not satisfy the first condition of Theorem 6.31.

(b)

\(G = \left\{ {( {1, a} ), ( {2, b} ), ( {3, c} ), ( {2, c} )} \right\}\text{.}\)

Solution.

The set \(G\) does not satisfy the second condition of Theorem 6.31.

Subsection The Inverse of a Function

In previous mathematics courses, we learned that the exponential function (with base \(e\)) and the natural logarithm function are inverses of each other. This was often expressed as follows:

\begin{align*} \amp \text{ For each } x \in \R \text{ with } x > 0 \text{ and for each } y \in \R,\\ \amp y = \ln x \text{ if and only if } x = e^y\text{.} \end{align*}

Notice that this means that \(x\) is the input and \(y\) is the output for the natural logarithm function if and only if \(y\) is the input and \(x\) is the output for the exponential function. In essence, the inverse function (in this case, the exponential function) reverses the action of the original function (in this case, the natural logarithm function). In terms of ordered pairs (input-output pairs), this means that if \(( {x, y} )\) is an ordered pair for a function, then \(( {y, x} )\) is an ordered pair for its inverse. This idea of reversing the roles of the first and second coordinates is the basis for our definition of the inverse of a function.

Definition.

Let \(f\x A \to B\) be a function. The inverse of \(\boldsymbol{f}\), denoted by \(f^{ - 1}\text{,}\) is the set of ordered pairs \(\left\{ { {( {b, a} ) \in B \times A} \mid f( a ) = b} \right\}\text{.}\) That is,

\begin{equation*} f^{ - 1} = \left\{ { {( {b, a} ) \in B \times A} \mid f( a ) = b} \right\}\!\text{.} \end{equation*}

If we use the ordered pair representation for \(f\text{,}\) we could also write

\begin{equation*} f^{ - 1} = \left\{ { {( {b, a} ) \in B \times A} \mid ( {a, b} ) \in f} \right\}\!\text{.} \end{equation*}

Notice that this definition does not state that \(f^{ - 1}\) is a function. It is simply a subset of \(B \times A\text{.}\) After we study the material in Chapter 7, we will say that this means that \(f^{ - 1}\) is a relation from \(B\) to \(A\text{.}\) This fact, however, is not important to us now. We are mainly interested in the following question:

Under what conditions will the inverse of the function \(f\x A \to B\) be a function from \(B\) to \(A\text{?}\)

Progress Check 6.33. Exploring the Inverse of a Function.

Let \(A = \left\{ {a, b, c} \right\}\text{,}\) \(B = \left\{ {a,b,c,d} \right\}\text{,}\) and \(C = \left\{ {p, q, r} \right\}\text{.}\) Define

\(f\x A \to C\) by \(g\x A \to C\) by \(h\x B \to C\) by
\(f( a ) = r\) \(g( a ) = p\) \(h( a ) = p\)
\(f( b ) = p\) \(g( b ) = q\) \(h( b ) = q\)
\(f( c ) = q\) \(g( c ) = p\) \(h( c ) = r\)
\(h( d ) = q\)
(a)

Draw an arrow diagram for each function.

(b)

Determine the inverse of each function as a set of ordered pairs.

Solution.

\(f^{ - 1} = \left\{ {( {r, a} ), ( {p, b} ), ( {q, c} )} \right\}\)

\(g^{ - 1} = \left\{ {( {p, a} ), ( {q, b} ), ( {p, c} )} \right\}\)

\(h^{ - 1} = \left\{ {( {p, a} ), ( {q, b} ), ( {r, c} ), ( {q, d} )} \right\}\)

(c)

Explain each of the following.

(i)

Is \(f^{ - 1}\) a function from \(C\) to \(A\text{?}\) Explain.

Solution.

\(f^{ - 1}\) is a function from \(C\) to \(A\text{.}\)

(ii)

Is \(g^{ - 1}\) a function from \(C\) to \(A\text{?}\) Explain.

Solution.

\(g^{ - 1}\) is not a function from \(C\) to \(A\) since \(( {p, a} ) \in g^{ - 1}\)and \(( {p, c} ) \in g^{ - 1}\text{.}\)

(iii)

Is \(h^{ - 1}\) a function from \(C\) to \(B\text{?}\) Explain.

Solution.

\(h^{ - 1}\) is not a function from \(C\) to \(B\) since \(( {q, b} ) \in h^{ - 1}\) and \(( {q, d} ) \in h^{ - 1}\text{.}\)

(d)

Draw an arrow diagram for each inverse from Task 6.33.c that is a function. Use your existing arrow diagram from Task 6.33.a to draw this arrow diagram.

(e)

Make a conjecture about what conditions on a function \(F\x S \to T\) will ensure that its inverse is a function from \(T\) to \(S\text{.}\)

Solution.

In order for the inverse of a function \(F:S \to T\) to be a function from \(T\) to \(S\text{,}\) the function \(F\) must be a bijection.

We will now consider a general argument suggested by the explorations in Progress Check 6.33. By definition, if \(f\x A \to B\) is a function, then \(f^{ - 1}\) is a subset of \(B \times A\text{.}\) However, \(f^{ - 1}\) may or may not be a function from \(B\) to \(A\text{.}\) For example, suppose that \(s, t \in A\) with \(s \ne t\) and \(f( s ) = f( t )\text{.}\) This is represented in Figure 6.34.

Figure 6.34. The Inverse Is Not a Function

In this case, if we try to reverse the arrows, we will not get a function from \(B\) to \(A\text{.}\) This is because \(( {y, s} ) \in f^{ - 1}\) and \(( {y, t} ) \in f^{ - 1}\) with \(s \ne t\text{.}\) Consequently, \(f^{ - 1}\) is not a function. This suggests that when \(f\) is not an injection, then \(f^{ - 1}\) is not a function.

Also, if \(f\) is not a surjection, then there exists a \(z \in B\) such that \(f( a ) \ne z\) for all \(a \in A\text{,}\) as in the diagram in Figure 6.34. In other words, there is no ordered pair in \(f\) with \(z\) as the second coordinate. This means that there would be no ordered pair in \(f^{ - 1}\) with \(z\) as a first coordinate. Consequently, \(f^{ - 1}\) cannot be a function from \(B\) to \(A\text{.}\)

This motivates the statement in Theorem 6.35. In the proof of this theorem, we will frequently change back and forth from the input-output representation of a function and the ordered pair representation of a function. The idea is that if \(G\x S \to T\) is a function, then for \(s \in S\) and \(t \in T\text{,}\)

\begin{equation*} G( s ) = t \text{ if and only if } ( {s, t} ) \in G\text{.} \end{equation*}

When we use the ordered pair representation of a function, we will also use the ordered pair representation of its inverse. In this case, we know that

\begin{equation*} ( {s, t} ) \in G\text{ if and only if } ( {t, s} ) \in G^{ - 1}\text{.} \end{equation*}

Proof.

Let \(A\) and \(B\) be nonempty sets and let \(f\x A \to B\text{.}\) We will first assume that \(f\) is a bijection and prove that \(f^{ - 1}\) is a function from \(B\) to \(A\text{.}\) To do this, we will show that \(f^{ - 1}\) satisfies the two conditions of Theorem 6.31.

We first choose \(b \in B\text{.}\) Since the function \(f\) is a surjection, there exists an \(a \in A\) such that \(f( a ) = b\text{.}\) This implies that \(( {a, b} ) \in f\) and hence that \(( {b, a} ) \in f^{ - 1}\text{.}\) Thus, each element of \(B\) is the first coordinate of an ordered pair in \(f^{ - 1}\text{,}\) and hence \(f^{ - 1}\) satisfies the first condition of Theorem 6.31.

To prove that \(f^{ - 1}\) satisfies the second condition of Theorem 6.31, we must show that each element of \(B\) is the first coordinate of exactly one ordered pair in \(f^{ - 1}\text{.}\) So let \(b \in B\text{,}\) \(a_1 , a_2 \in A\) and assume that

\begin{equation*} ( {b, a_1 } ) \in f^{ - 1} \text{ and } ( {b, a_2 } ) \in f^{ - 1}\text{.} \end{equation*}

This means that \(( {a_1 , b} ) \in f\) and \(( {a_2 , b} ) \in f\text{.}\) We can then conclude that

\begin{equation*} f( {a_1 } ) = b\text{ and } f( {a_2 } ) = b\text{.} \end{equation*}

But this means that \(f( {a_1 } ) = f( {a_2 } )\text{.}\) Since \(f\) is a bijection, it is an injection, and we can conclude that \(a_1 = a_2\text{.}\) This proves that \(b\) is the first element of only one ordered pair in \(f^{ - 1}\text{.}\) Consequently, we have proved that \(f^{ - 1}\) satisfies both conditions of Theorem 6.31 and hence that \(f^{ - 1}\) is a function from \(B\) to \(A\text{.}\)

We now assume that \(f^{ - 1}\) is a function from \(B\) to \(A\) and prove that \(f\) is a bijection. First, to prove that \(f\) is an injection, we assume that \(a_1 , a_2 \in A\) and that \(f( {a_1 } ) = f( {a_2 } )\text{.}\) We wish to show that \(a_1 = a_2\text{.}\) If we let \(b = f( {a_1 } ) = f( {a_2 } )\text{,}\) we can conclude that

\begin{equation*} ( {a_1 , b} ) \in f \text{ and } ( {a_2 , b} ) \in f\text{.} \end{equation*}

But this means that

\begin{equation*} ( {b, a_1 } ) \in f^{ - 1} \text{ and } ( {b, a_2 } ) \in f^{ - 1}\text{.} \end{equation*}

Since we have assumed that \(f^{ - 1}\) is a function, we can conclude that \(a_1 = a_2\text{.}\) Hence, \(f\) is an injection.

Now to prove that \(f\) is a surjection, we choose \(b \in B\) and will show that there exists an \(a \in A\) such that \(f( a ) = b\text{.}\) Since \(f^{ - 1}\) is a function, \(b\) must be the first coordinate of some ordered pair in \(f^{ - 1}\text{.}\) Consequently, there exists an \(a \in A\) such that

\begin{equation*} ( {b, a} ) \in f^{ - 1}\text{.} \end{equation*}

Now this implies that \(( {a, b} ) \in f\) and hence that \(f( a ) = b\text{.}\) This proves that \(f\) is a surjection. Since we have also proved that \(f\) is an injection, we conclude that \(f\) is a bijection.

Subsection Inverse Function Notation

In the situation where \(f\x A \to B\) is a bijection and \(f^{ - 1}\) is a function from \(B\) to \(A\text{,}\) we can write \(f^{ - 1} \x B \to A\text{.}\) In this case, we frequently say that \(f\) is an invertible function, and we usually do not use the ordered pair representation for either \(f\) or \(f^{ - 1}\text{.}\) Instead of writing \(( {a, b} ) \in f\text{,}\) we write \(f( a ) = b\text{,}\) and instead of writing \(( {b, a} ) \in f^{ - 1}\text{,}\) we write \(f^{ - 1} ( b ) = a\text{.}\) Using the fact that \(( {a, b} ) \in f\) if and only if \(( {b, a} ) \in f^{ - 1}\text{,}\) we can now write \(f( a ) = b\) if and only if \(f^{ - 1} ( b ) = a\text{.}\) We summarize this in Theorem 6.36.

Example 6.37. Inverse Function Notation.

For an example of the use of the notation in Theorem 6.36, let \(\R^ + = \left\{ { {x \in \R} \mid x > 0} \right\}\text{.}\) Define

\begin{equation*} f\x \R \to \R \text{ by } f( x ) = x^3; \text{ and } g\x \R \to \R^ + \text{ by }g( x ) = e^x\text{.} \end{equation*}

Notice that \(\R^+\) is the codomain of \(g\text{.}\) We can then say that both \(f\) and \(g\) are bijections. Consequently, the inverses of these functions are also functions. In fact, \(f^{ - 1} \x \R \to \R\) by \(f^{ - 1} ( y ) = \sqrt[3]{y}\text{;}\) and \(g^{ - 1} \x \R^ + \to \R\) by \(g^{ - 1} ( y ) = \ln y\text{.}\) For each function (and its inverse), we can write the result of Theorem 6.36 as follows:

Theorem 6.36 Translates to:
For \(x, y \in \R\text{,}\) \(f( x ) = y\)
if and only if \(f^{ - 1} ( y ) = x\text{.}\)
For \(x, y \in \R\text{,}\) \(x^3 = y\)
if and only if \(\sqrt[3]{y} = x\text{.}\)
For \(x \in \R, y \in \R^ +\text{,}\) \(g( x ) = y\)
if and only if \(g^{ - 1} ( y ) = x\text{.}\)
For \(x \in \R, y \in \R^ +\text{,}\) \(e^x = y\)
if and only if \(\ln y = x\text{.}\)

Subsection Theorems about Inverse Functions

The next two results in this section are two important theorems about inverse functions. The first is actually a corollary of Theorem 6.36.

Proof.

Let \(A\) and \(B\) be nonempty sets and assume that \(f\x A \to B\) is a bijection. So let \(x \in A\) and let \(f( x ) = y\text{.}\) By Theorem 6.36, we can conclude that \(f^{ - 1} ( y ) = x\text{.}\) Therefore,

\begin{align*} \left( {f^{ - 1} \circ f} \right)( x ) \amp = f^{ - 1} \! \left( {f( x )} \right)\\ \amp = f^{-1}( y )\\ \amp = x\text{.} \end{align*}

Hence, for each \(x \in A\text{,}\) \(\left( {f^{ - 1} \circ f} \right)( x ) = x\text{.}\)

The proof that for each \(y\) in \(B\text{,}\) \(\left( f \circ f^{ - 1}\right)(y) = y\) is Exercise 4.

Example 6.39.

This example is a continuation of Example 6.37.

For the cubing function and the cube root function, we have seen that For \(x, y \in \R\text{,}\) \(x^3 = y\) if and only if \(\sqrt[3]{y} = x\text{.}\) Notice that

  • If we substitute \(x^3 = y\) into the equation \(\sqrt[3]{y} = x\text{,}\) we obtain \(\sqrt[3]{{x^3 }} = x\text{.}\)

  • If we substitute \(\sqrt[3]{y} = x\) into the equation \(x^3 = y\text{,}\) we obtain \(\left( {\sqrt[3]{y}} \right)^3 = y\text{.}\)

This is an illustration of Corollary 6.38. We can see this by using \(f\x \R \to \R\) defined by \(f( x ) = x^3\) and \(f^{ - 1} \x \R \to \R\) defined by \(f^{ - 1} ( y ) = \sqrt[3]{y}\text{.}\) Then \(f^{ - 1} \circ f\x \R \to \R\) and \(f^{ - 1} \circ f = I_\R\text{.}\) So for each \(x \in \R\text{,}\)

\begin{align*} \left( {f^{ - 1} \circ f} \right)( x ) \amp = x\\ f^{ - 1} \! \left( {f( x )} \right) \amp = x\\ f^{ - 1} \!\left( {x^3 } \right) \amp = x\\ \sqrt[3]{{x^3 }} \amp = x \end{align*}

Similarly, the equation \(\left( {\sqrt[3]{y}} \right)^3 = y\) for each \(y \in \R\) can be obtained from the fact that for each \(y \in \R\text{,}\) \((f \circ f^{ - 1})(y) = y\text{.}\)

We will now consider the case where \(f\x A \to B\) and \(g\x B \to C\) are both bijections. In this case, \(f^{ - 1} \x B \to A\) and \(g^{ - 1} \x C \to B\text{.}\) Figure 6.40 can be used to illustrate this situation.

Figure 6.40. Composition of Two Bijections

By Theorem 6.29, \(g \circ f\x A \to C\) is also a bijection. Hence, by Theorem 6.35, \(( {g \circ f} )^{ - 1}\) is a function and, in fact, \(( {g \circ f} )^{ - 1} \x C \to A\text{.}\) Notice that we can also form the composition of \(g^{ - 1}\) followed by \(f^{ - 1}\) to get \(f^{ - 1} \circ g^{ - 1} \x C \to A\text{.}\) Figure 6.40 helps illustrate the result of the next theorem.

Proof.

Let \(f\x A \to B\) and \(g\x B \to C\) be bijections. Then \(f^{ - 1} \x B \to A\) and \(g^{ - 1} \x C \to B\text{.}\) Hence, \(f^{ - 1} \circ g^{ - 1} \x C \to A\text{.}\) Also, by Theorem 6.29, \(g \circ f\x A \to C\) is a bijection, and hence \(( {g \circ f} )^{ - 1} \x C \to A\text{.}\) We will now prove that for each \(z \in C\text{,}\) \(( {g \circ f} )^{ - 1} ( z ) = ( {f^{ - 1} \circ g^{ - 1} } )( z )\text{.}\)

Let \(z \in C\text{.}\) Since the function \(g\) is a surjection, there exists a \(y \in B\) such that

\begin{equation} g( y ) = z\text{.}\tag{57} \end{equation}

Also, since \(f\) is a surjection, there exists an \(x \in A\) such that

\begin{equation} f( x ) = y\text{.}\tag{58} \end{equation}

Now these two equations can be written in terms of the respective inverse functions as

\begin{align} g^{ - 1} ( z ) \amp = y; \text{ and }\tag{59}\\ f^{ - 1} ( y ) \amp = x\tag{60} \end{align}

Using equation (59) and equation (60), we see that

\begin{align} \left( {f^{ - 1} \circ g^{ - 1} } \right)( z ) \amp = f^{ - 1} \left( {g^{ - 1} ( z )} \right)\notag\\ \amp = f^{ - 1} ( y )\notag\\ \amp = x\text{.}\tag{61} \end{align}

Using equation (57) and equation (58) again, we see that \(( {g \circ f} )( x ) = z\text{.}\) However, in terms of the inverse function, this means that

\begin{equation} ( {g \circ f} )^{ - 1} ( z ) = x\text{.}\tag{62} \end{equation}

Comparing equation (61) and equation (62), we have shown that for all \(z \in C\text{,}\)

\(( {g \circ f} )^{ - 1} ( z ) = ( {f^{ - 1} \circ g^{ - 1} } )( z )\text{.}\) This proves that \(( {g \circ f} )^{ - 1} = f^{ - 1} \circ g^{ - 1}\text{.}\)

Exercises Exercises

1.

Let \(A = \left\{ {1, 2, 3} \right\}\) and \(B = \left\{ {a, b, c} \right\}\text{.}\)

(a)

Construct an example of a function \(f\x A \to B\) that is not a bijection. Write the inverse of this function as a set of ordered pairs. Is the inverse of \(f\) a function? Explain. If so, draw an arrow diagram for \(f\) and \(f^{ - 1}\text{.}\)

(b)

Construct an example of a function \(g\x A \to B\) that is a bijection. Write the inverse of this function as a set of ordered pairs. Is the inverse of \(g\) a function? Explain. If so, draw an arrow diagram for \(g\) and \(g^{ - 1}\text{.}\)

2.

Let \(S = \left\{ {a, b, c, d} \right\}\text{.}\) Define \(f\x S \to S\) by defining \(f\) to be the following set of ordered pairs.

\begin{equation*} f = \left\{ {( {a, c} ), ( {b, b} ), ( {c, d} ), ( {d, a} )} \right\} \end{equation*}
(a)

Draw an arrow diagram to represent the function \(f\text{.}\) Is the function \(f\) a bijection?

(b)

Write the inverse of \(f\) as a set of ordered pairs. Is \(f^{ - 1}\) a function? Explain.

Answer.

\(f^{ - 1} = \left\{ {( {c,a} ),( {b,b} ),( {d,c} ),( {a,d} )} \right\}\)

(c)

Draw an arrow diagram for \(f^{ - 1}\) using the arrow diagram from Task 2.a.

(d)

Compute \(\left( {f^{ - 1} \circ f} \right)( x )\) and \(\left( {f \circ f^{ - 1} } \right)( x )\) for each \(x\) in \(S\text{.}\) What theorem does this illustrate?

Answer.

For each \(x \in S\text{,}\) \(\left( {f^{ - 1} \circ f} \right)( x ) = x = ( {f \circ f^{ - 1} } )( x )\text{.}\) This illustrates Corollary 6.38.

3.

Inverse functions can be used to help solve certain equations. The idea is to use an inverse function to undo the function.

(a)

Since the cube root function and the cubing function are inverses of each other, we can often use the cube root function to help solve an equation involving a cube. For example, the main step in solving the equation

\begin{equation*} ( {2t - 1} )^3 = 20 \end{equation*}

is to take the cube root of each side of the equation. This gives

\begin{align*} \sqrt[3]{{( {2t - 1} )^3 }} \amp = \sqrt[3]{{20}}\\ 2t - 1 \amp = \sqrt[3]{{20}}\text{.} \end{align*}

Explain how this step in solving the equation is a use of Corollary 6.38.

Answer.

This is a use of Corollary 6.38 since the cube root function and the cubing function are inverse functions of each other and consequently, the composition of the cubing function with the cube root function is the identity function.

(b)

A main step in solving the equation \(e^{2t - 1} = 20\) is to take the natural logarithm of both sides of this equation. Explain how this step is a use of Corollary 6.38, and then solve the resulting equation to obtain a solution for \(t\) in terms of the natural logarithm function.

Answer.

This is a use of Corollary 6.38 since the natural logarithm function and the exponential function with base \(e\) are inverse functions of each other and consequently, the composition of the natural logarithm function with the exponential function with base \(e\) is the identity function.

(c)

How are the methods of solving the equations in Task 3.a and Task 3.b similar?

Answer.

They are similar because they both use the concept of an inverse function to “undo” one side of the equation.

4.

Prove Item 2 of Corollary 6.38. Let \(A\) and \(B\) be nonempty sets and let \(f\x A \to B\) be a bijection. Then for every \(y\) in \(B\text{,}\) \(\left( f \circ f^{ - 1}\right)(y) = y\text{.}\)

Answer.

Using the notation from Corollary 6.38, if \(y = f(x)\) and \(x = f^{-1}(y)\text{,}\) then

\begin{align*} \left(f \circ f^{-1} \right) (y) \amp = f ( f^{-1} ( y ) )\\ \amp = f ( x )\\ \amp = y \end{align*}

5.

In Progress Check 6.11, we defined the identity function on a set. The identity function on the set \(\boldsymbol{T}\), denoted by \(I_T\) , is the function \(I_T \x T \to T\) defined by \(I_T ( t ) = t\) for every \(t\) in \(T\text{.}\) Explain how Corollary 6.38 can be stated using the concept of equality of functions and the identity functions on the sets \(A\) and \(B\text{.}\)

6.

Let \(f\x A \to B\) and \(g\x B \to A\text{.}\) Let \(I_A\) and \(I_B\) be the identity functions on the sets \(A\) and \(B\text{,}\) respectively. Prove each of the following:

(a)

If \(g \circ f = I_A\text{,}\) then \(f\) is an injection.

Answer.

Let \(x, y \in A\) and assume that \(f ( x ) = f ( y )\text{.}\) Apply \(g\) to both sides of this equation to prove that \(( g \circ f ) ( x ) = ( g \circ f ) ( y )\text{.}\) Since \(g \circ f = I_A\text{,}\) this implies that \(x = y\) and hence that \(f\) is an injection.

(b)

If \(f \circ g = I_B\text{,}\) then \(f\) is a surjection.

Answer.

Start by assuming that \(f \circ g = I_B\text{,}\) and then let \(y \in B\text{.}\) You need to prove there exists an \(x \in A\) such that \(f ( x ) = y\text{.}\)

(c)

If \(g \circ f = I_A\) and \(f \circ g = I_B\text{,}\) then \(f\) and \(g\) are bijections and \(g = f^{ - 1}\text{.}\)

7.

Justify your conclusions for the following.

(a)

Define \(f\x \mathbb{R} \to \mathbb{R}\) by \(f( x ) = e^{ - x^2 }\text{.}\) Is the inverse of \(f\) a function?

Answer.

\(f:\mathbb{R} \to \mathbb{R}\) is defined by \(f\left( x \right) = e^{ - x^2 }\text{.}\) Since this function is not an injection, the inverse of \(f\) is not a function.

(b)

Let \(\mathbb{R}^* = \left\{ { {x \in \mathbb{R} } \mid x \geq 0} \right\}\text{.}\) Define \(g\x \mathbb{R}^* \to ( {0, 1} ]\) by \(g( x ) = e^{ - x^2 }\text{.}\) Is the inverse of \(g\) a function?

Answer.

\(g:\mathbb{R}^* \to \left( {0, 1} \right]\) is defined by \(g\left( x \right) = e^{ - x^2 }\text{.}\) In this case, \(g\) is a bijection and hence, the inverse of \(g\) is a function. To see that \(g\) is an injection, assume that \(x, y \in \mathbb{R}^*\) and that \(e^{-x^2} = e^{-y^2}\text{.}\) Then, \(x^2 = y^2\) and since \(x, y \geq 0\text{,}\) we see that \(x = y\text{.}\) To see that \(g\) is a surjection, let \(y \in \left( 0, 1 \right]\text{.}\) Then, \(\ln y \lt 0\) and \(- \ln y > 0\text{,}\) and \(g \left( \sqrt{-\ln y} \right) = y\text{.}\)

8.

Complete the following.

(a)

Let \(f\x \mathbb{R} \to \mathbb{R}\) be defined by \(f( x ) = x^2\text{.}\) Explain why the inverse of \(f\) is not a function.

(b)

Let \(\R^* = \left \{ t \in \R \mid t \geq 0 \right\}\text{.}\) Define \(g \x \R^* \to \R^*\) by \(g(x) = x^2\text{.}\) Explain why this squaring function (with a restricted domain and codomain) is a bijection.

(c)

Explain how to define the square root function as the inverse of the function in Task 8.b.

(d)

True or false: \(\left( \sqrt{x} \right)^2 = x\) for all \(x \in \mathbb{R}\) such that \(x \geq 0\text{.}\)

(e)

True or false: \(\sqrt{x^2} = x\) for all \(x \in \mathbb{R}\text{.}\)

9.

Prove the following:

If \(f\x A \to B\) is a bijection, then \(f^{ - 1} \x B \to A\) is also a bijection.

10.

For each natural number \(k\text{,}\) let \(A_k\) be a set, and for each natural number \(n\text{,}\) let \(f_n \x A_n \to A_{n + 1}\text{.}\) For example, \(f_1 \x A_1 \to A_2\text{,}\) \(f_2 \x A_2 \to A_3\text{,}\) \(f_3 \x A_3 \to A_4\text{,}\) and so on. Use mathematical induction to prove that for each natural number \(n\) with \(n \geq 2\text{,}\) if \(f_1 , f_2 , \ldots , f_n\) are all bijections, then \(f_n \circ f_{n - 1} \circ \cdots \circ f_2 \circ f_1\) is a bijection and

\begin{equation*} ( {f_n \circ f_{n - 1} \circ \cdots \circ f_2 \circ f_1 } )^{ - 1} = f_1^{ - 1} \circ f_2^{ - 1} \circ \cdots \circ f_{n - 1}^{ - 1} \circ f_n^{ - 1}\text{.} \end{equation*}

Note: This is an extension of Theorem 6.41. In fact, Theorem 6.41 is the basis step of this proof for \(n = 2\text{.}\)

11.

Complete the following.

(a)

Define \(f\x \R \to \R\) by \(f(x) = x^2 - 4\) for all \(x \in \R\text{.}\) Explain why the inverse of the function \(f\) is not a function.

(b)

Let \(\R^* = \left\{ x \in \R \mid x \geq 0 \right\}\) and let \(T = \left\{ y \in \R \mid y \geq -4 \right\}\text{.}\) Define \(F\x \R^* \to T\) by \(F(x) = x^2 - 4\) for all \(x \in \R^*\text{.}\) Explain why the inverse of the function \(F\) is a function and find a formula for \(F^{-1}(y)\text{,}\) where \(y \in T\text{.}\)

12.

Let \(R_5 = \left\{ 0, 1, 2, 3, 4 \right\}\text{.}\)

(a)

Define \(f\x R_5 \to R_5\) by \(f(x) = x^2 + 4 \pmod 5\) for all \(x \in R_5\text{.}\) Write the inverse of \(f\) as a set of ordered pairs and explain why \(f^{-1}\) is not a function.

(b)

Define \(g\x R_5 \to R_5\) by \(g(x) = x^3 + 4 \pmod 5\) for all \(x \in R_5\text{.}\) Write the inverse of \(g\) as a set of ordered pairs and explain why \(g^{-1}\) is a function.

(c)

Is it possible to write a formula for \(g^{-1}(y)\text{,}\) where \(y \in R_5\text{?}\) The answer to this question depends on whether or not it is possible to define a cube root of elements of \(R_5\text{.}\) Recall that for a real number \(x\text{,}\) we define the cube root of \(x\) to be the real number \(y\) such that \(y^3 = x\text{.}\) That is,

\begin{equation*} y = \sqrt[3]{x} \text{ if and only if } y^3 = x\text{.} \end{equation*}

Using this idea, is it possible to define the cube root of each number in \(R_5\text{?}\) If so, what are \(\sqrt[3]{0}\text{,}\) \(\sqrt[3]{1}\text{,}\) \(\sqrt[3]{2}\text{,}\) \(\sqrt[3]{3}\text{,}\) and \(\sqrt[3]{4}\text{?}\)

(d)

Now answer the question posed at the beginning of Task 12.c If possible, determine a formula for \(g^{-1}(y)\) where \(g^{-1}\x R_5 \to R_5\text{.}\)

Activity 40. Constructing an Inverse Function.

If \(f\x A \to B\) is a bijection, then we know that its inverse is a function. If we are given a formula for the function \(f\text{,}\) it may be desirable to determine a formula for the function \(f^{ - 1}\text{.}\) This can sometimes be done, while at other times it is very difficult or even impossible. Let \(f\x \R \to \R\) be defined by \(f( x ) = 2x^3 - 7\text{.}\) A graph of this function would suggest that this function is a bijection.

(a)

Prove that the function \(f\) is an injection and a surjection.

(b)

Let \(y \in \R\text{.}\) One way to prove that \(f\) is a surjection is to set \(y = f( x )\) and solve for \(x\text{.}\) If this can be done, then we would know that there exists an \(x \in \R\) such that \(f(x) = y\text{.}\) For the function \(f\text{,}\) we are using \(x\) for the input and \(y\) for the output. By solving for \(x\) in terms of \(y\text{,}\) we are attempting to write a formula where \(y\) is the input and \(x\) is the output. This formula represents the inverse function.

Solve the equation \(y = 2x^3 - 7\) for \(x\text{.}\) Use this to write a formula for \(f^{ - 1} ( y )\text{,}\) where \(f^{ - 1} \x \R \to \R\text{.}\)

(c)

Use the result of Task 40.b to verify that for each \(x \in \R\text{,}\) \(f^{ - 1} \!\left( {f( x )} \right) = x\) and for each \(y \in \R\text{,}\) \(f \!\left( {f^{ - 1} ( y )} \right) = y\text{.}\)

(d)

Now let \(\R^+ = \left\{ { {y \in \mathbb{R}} \mid y > 0} \right\}\text{.}\) Define \(g \x \mathbb{R} \to \R^+\) by \(g( x ) = e^{2x - 1}\text{.}\)

Set \(y = e^{2x - 1}\) and solve for \(x\) in terms of \(y\text{.}\)

(e)

Use your work in Task 40.d to define a function \(h\x \mathbb{R}^ + \to \mathbb{R}\text{.}\)

(f)

For each \(x \in \mathbb{R}\text{,}\) determine \(( {h \circ g} )( x )\) and for each \(y \in \mathbb{R}^ +\text{,}\) determine \(( {g \circ h} )( y )\text{.}\)

(g)

Use Exercise 6 to explain why \(h = g^{ - 1}\text{.}\)

Activity 41. The Inverse Sine Function.

We have seen that in order to obtain an inverse function, it is sometimes necessary to restrict the domain (or the codomain) of a function.

(a)

Let \(f\x \R \to \R\) be defined by \(f( x ) = \sin x\text{.}\) Explain why the inverse of the function \(f\) is not a function. (A graph may be helpful.)

(b)

Notice that if we use the ordered pair representation, then the sine function can be represented as

\begin{equation*} f = \left\{ { {( {x, y} ) \in \R \times \R } \mid y = \sin x} \right\}\!\text{.} \end{equation*}

If we denote the inverse of the sine function by \(\sin ^{ - 1}\text{,}\) then

\begin{equation*} f^{ - 1} = \left\{ { {( {y, x} ) \in \R \times \R } \mid y = \sin x} \right\}\!\text{.} \end{equation*}

Task 41.a proves that \(f^{ - 1}\) is not a function. However, in previous mathematics courses, we frequently used the “inverse sine function.” This is not really the inverse of the sine function as defined in Task 41.a but, rather, it is the inverse of the sine function restricted to the domain \(\left[ {-\dfrac{{\pi }}{2}, \dfrac{\pi }{2}} \right]\text{.}\)

Explain why the function \(F\x \left[ {- \dfrac{{\pi }}{2}, \dfrac{\pi }{2}} \right] \to \left[ { - 1, 1} \right]\) defined by \(F( x ) = \sin x\) is a bijection.

(c)

The inverse of the function in Task 41.b is itself a function and is called the inverse sine function (or sometimes the arcsine function).

What is the domain of the inverse sine function? What are the range and codomain of the inverse sine function?

(d)

Let us now use \(F ( x ) = \text{ Sin } ( x )\) to represent the restricted sine function in Task 41.b. Therefore, \(F^{-1} ( x ) = \text{ Sin } ^{ - 1} ( x )\) can be used to represent the inverse sine function. Observe that

\begin{equation*} F\x \left[ -{\frac{{\pi }}{2}, \frac{\pi }{2}} \right] \to \left[ { - 1, 1} \right] \text{ and } F^{ - 1} \x \left[ { - 1, 1} \right] \to \left[ -{\frac{{\pi }}{2}, \frac{\pi } {2}} \right]\text{.} \end{equation*}

Using this notation, explain why

\(\text{ Sin } ^{ - 1} y = x \text{ if and only if } \left[ {y = \sin x\text{ and } -\dfrac{{\pi }}{2} \leq x \leq \dfrac{\pi } {2}} \right]\text{;}\)
\(\text{ Sin } \!\left( \text{ Sin } ^{-1} ( y ) \right) = y\) for all \(y \in \left[ -1, 1 \right]\text{;}\) and
\(\text{ Sin } ^{-1} \!\left( \text{ Sin } ( x ) \right) = x\) for all \(x \in \left[ - \dfrac{\pi}{2}, \dfrac{\pi}{2} \right]\text{.}\)