Properties of Matrix Transformations
Composition, Invertibility, and Equivalence
Suppose \(T_{A}\) maps from \(R^{n}\) to \(R^{k}\) and \(T_{B}\) maps from \(R^{k}\) to \(R^{m}\).
The composition \(T_{B}\circ T_{A}\) maps \(\mathbf{x}\) in \(R^{n}\) to \(T_{B}(T_{A}(\mathbf{x}))\) in \(R^{m}\).
\[ (T_{B}\circ T_{A})(\mathbf{x}) = T_{B}(T_{A}(\mathbf{x})) \tag{1} \]
This composition is itself a matrix transformation:
\[ (T_{B}\circ T_{A})(\mathbf{x}) = B(A\mathbf{x}) = (B A)\mathbf{x} \] Thus, the standard matrix for \(T_{B}\circ T_{A}\) is \(BA\).
\[ T_{B}\circ T_{A} = T_{B A} \tag{2} \]
Note
The transformation \(T_A\) is performed first, then \(T_B\).
This is analogous to function composition \((f \circ g)(x) = f(g(x))\).
Compositions can extend to finite successions of matrix transformations. For \(T_{A}\colon R^{n}\to R^{k}\), \(T_{B}\colon R^{k}\to R^{l}\), and \(T_{C}\colon R^{l}\to R^{m}\):
\[ (T_{C}\circ T_{B}\circ T_{A})(\mathbf{x}) = T_{C}(T_{B}(T_{A}(\mathbf{x}))) \] The standard matrix for this composition is \(CBA\):
\[ T_{C}\circ T_{B}\circ T_{A} = T_{C B A} \tag{3} \]
We denote the standard matrix for a transformation \(T\) as \([T]\).
So, Formula (2) can be restated as:
\[ [T_{2} \circ T_{1}] = [T_{2}][T_{1}] \tag{4} \] And Formula (3) as:
\[ [T_{3} \circ T_{2} \circ T_{1}] = [T_{3}][T_{2}][T_{1}] \tag{5} \]
Warning
WARNING: In general, \(T_{B} \circ T_{A} \neq T_{A} \circ T_{B}\). Order matters!
Let \(T_{1}\colon R^{2} \to R^{2}\) be reflection about \(y=x\).
Let \(T_{2}\colon R^{2} \to R^{2}\) be orthogonal projection onto the \(y\)-axis.
Standard matrices:
\([T_{1}] = \left[ \begin{array}{l l}{0} & {1}\\ {1} & {0} \end{array} \right]\)
\([T_{2}] = \left[ \begin{array}{l l}{0} & {0}\\ {0} & {1} \end{array} \right]\)
\[ [T_{1}\circ T_{2}] = [T_{1}][T_{2}] = \left[ \begin{array}{l l}{0} & {1}\\ {1} & {0} \end{array} \right]\left[ \begin{array}{l l}{0} & {0}\\ {0} & {1} \end{array} \right] = \left[ \begin{array}{l l}{0} & {1}\\ {0} & {0} \end{array} \right] \]
\[ [T_{2}\circ T_{1}] = [T_{2}][T_{1}] = \left[ \begin{array}{l l}{0} & {0}\\ {0} & {1} \end{array} \right]\left[ \begin{array}{l l}{0} & {1}\\ {1} & {0} \end{array} \right] = \left[ \begin{array}{l l}{0} & {0}\\ {1} & {0} \end{array} \right] \] Clearly, \([T_{2} \circ T_{1}] \neq [T_{1} \circ T_{2}]\).
Let’s see the effect of \(T_1 \circ T_2\) vs \(T_2 \circ T_1\) on a vector \(\mathbf{x} = (1, 1)\).
\(T_1 \circ T_2\) (Projection then Reflection)
Final result: \((1,0)\)
\(T_2 \circ T_1\) (Reflection then Projection)
Final result: \((0,1)\)
Figure 4.10.2: Visual demonstration of non-commutative composition.
Let \(T_{1}, T_{2}\) be rotations about the origin by \(\theta_{1}\) and \(\theta_{2}\). Standard matrices:
\[ [T_{1}] = \left[ \begin{array}{cc}\cos \theta_{1} & -\sin \theta_{1} \\ \sin \theta_{1} & \cos \theta_{1} \end{array} \right], \quad [T_{2}] = \left[ \begin{array}{cc}\cos \theta_{2} & -\sin \theta_{2} \\ \sin \theta_{2} & \cos \theta_{2} \end{array} \right] \]
The composition \(T_{2} \circ T_{1}\) rotates by \(\theta_{1} + \theta_{2}\). Its standard matrix should be:
\[ [T_{2} \circ T_{1}] = \left[ \begin{array}{cc}\cos (\theta_{1} + \theta_{2}) & -\sin (\theta_{1} + \theta_{2}) \\ \sin (\theta_{1} + \theta_{2}) & \cos (\theta_{1} + \theta_{2}) \end{array} \right] \]
Verify this with matrix multiplication and trigonometric identities:
\[ \begin{array}{r l}{[T_{2}][T_{1}] = \left[ \begin{array}{c c}{\cos \theta_{2}} & {-\sin \theta_{2}}\\ {\sin \theta_{2}} & {\cos \theta_{2}} \end{array} \right]\left[ \begin{array}{c c}{\cos \theta_{1}} & {-\sin \theta_{1}}\\ {\sin \theta_{1}} & {\cos \theta_{1}} \end{array} \right]} & {}\\ {= \left[ \begin{array}{c c}{\cos \theta_{2}\cos \theta_{1} - \sin \theta_{2}\sin \theta_{1}} & {-(\cos \theta_{2}\sin \theta_{1} + \sin \theta_{2}\cos \theta_{1})}\\ {\sin \theta_{2}\cos \theta_{1} + \cos \theta_{2}\sin \theta_{1}} & {-\sin \theta_{2}\sin \theta_{1} + \cos \theta_{2}\cos \theta_{1}} \end{array} \right]} & {}\\ {= \left[ \begin{array}{c c}{\cos (\theta_{1} + \theta_{2})} & {-\sin (\theta_{1} + \theta_{2})}\\ {\sin (\theta_{1} + \theta_{2})} & {\cos (\theta_{1} + \theta_{2})} \end{array} \right]} & {}\\ {= [T_{2}\circ T_{1}]} & {} \end{array} \]
This implies \(R_{\theta_{1}}R_{\theta_{2}} = R_{\theta_{1} + \theta_{2}}\).
Figure 4.10.3: Interactive demonstration of commutative rotations.
Let \(T_{1}\colon R^{2}\to R^{2}\) be reflection about the \(y\)-axis.
Let \(T_{2}\colon R^{2}\to R^{2}\) be reflection about the \(x\)-axis.
Standard matrices:
\([T_{1}] = \left[ \begin{array}{c c}{-1} & 0\\ 0 & 1 \end{array} \right]\)
\([T_{2}] = \left[ \begin{array}{c c}{1} & 0\\ 0 & {-1} \end{array} \right]\)
\[ [T_{1}\circ T_{2}] = [T_{1}][T_{2}] = \left[ \begin{array}{c c}{-1} & 0\\ 0 & 1 \end{array} \right]\left[ \begin{array}{c c}{1} & 0\\ 0 & {-1} \end{array} \right] = \left[ \begin{array}{c c}{-1} & 0\\ 0 & {-1} \end{array} \right] \]
\[
[T_{2}\circ T_{1}] = [T_{2}][T_{1}] = \left[ \begin{array}{c c}{1} & 0\\ 0 & {-1} \end{array} \right]\left[ \begin{array}{c c}{-1} & 0\\ 0 & 1 \end{array} \right] = \left[ \begin{array}{c c}{-1} & 0\\ 0 & {-1} \end{array} \right]
\] In this special case, \(T_{1}\circ T_{2} = T_{2}\circ T_{1}\). Both result in reflection about the origin.
The operator \(T(\mathbf{x}) = - \mathbf{x}\) is reflection about the origin.
Find the standard matrix for \(T\colon R^{3}\to R^{3}\) that:
The composition is \(T = T_{3}\circ T_{2}\circ T_{1}\).
Standard matrices for individual transformations:
\[ [T_{1}] = \left[ \begin{array}{ccc}\cos \theta & -\sin \theta & 0 \\ \sin \theta & \cos \theta & 0 \\ 0 & 0 & 1 \end{array} \right] \quad \text{(Rotation about z-axis)} \]
\[ [T_{2}] = \left[ \begin{array}{ccc} - 1 & 0 & 0 \\ 0 & 1 & 0 \\ 0 & 0 & 1 \end{array} \right] \quad \text{(Reflection about yz-plane)} \]
\[ [T_{3}] = \left[ \begin{array}{ccc}1 & 0 & 0 \\ 0 & 1 & 0 \\ 0 & 0 & 0 \end{array} \right] \quad \text{(Projection onto xy-plane)} \]
Now, we multiply the matrices in the correct order:
\[ [T] = [T_{3}][T_{2}][T_{1}] \]
\[ [T] = \left[ \begin{array}{ccc}1 & 0 & 0 \\ 0 & 1 & 0 \\ 0 & 0 & 0 \end{array} \right]\left[ \begin{array}{ccc} - 1 & 0 & 0 \\ 0 & 1 & 0 \\ 0 & 0 & 1 \end{array} \right]\left[ \begin{array}{ccc}\cos \theta & -\sin \theta & 0 \\ \sin \theta & \cos \theta & 0 \\ 0 & 0 & 1 \end{array} \right] \]
First, calculate \([T_2][T_1]\): \[ [T_{2}][T_{1}] = \left[ \begin{array}{ccc} - 1 & 0 & 0 \\ 0 & 1 & 0 \\ 0 & 0 & 1 \end{array} \right]\left[ \begin{array}{ccc}\cos \theta & -\sin \theta & 0 \\ \sin \theta & \cos \theta & 0 \\ 0 & 0 & 1 \end{array} \right] = \left[ \begin{array}{ccc} - \cos \theta & \sin \theta & 0 \\ \sin \theta & \cos \theta & 0 \\ 0 & 0 & 1 \end{array} \right] \]
Then, multiply by \([T_3]\): \[ [T] = \left[ \begin{array}{ccc}1 & 0 & 0 \\ 0 & 1 & 0 \\ 0 & 0 & 0 \end{array} \right]\left[ \begin{array}{ccc} - \cos \theta & \sin \theta & 0 \\ \sin \theta & \cos \theta & 0 \\ 0 & 0 & 1 \end{array} \right] \]
\[ [T] = \left[ \begin{array}{ccc} - \cos \theta & \sin \theta & 0 \\ \sin \theta & \cos \theta & 0 \\ 0 & 0 & 0 \end{array} \right] \]
Important
DEFINITION 1: A matrix transformation \(T_{A} \colon R^{n} \to R^{m}\) is said to be one-to-one if \(T_{A}\) maps distinct vectors (points) in \(R^{n}\) into distinct vectors (points) in \(R^{m}\).
Alternative expressions of Definition 1:
Rotation operators on \(R^{2}\) are one-to-one because distinct vectors rotated through the same angle have distinct images.
In contrast, orthogonal projection of \(R^{2}\) onto the \(x\)-axis is not one-to-one. It maps distinct points on the same vertical line into the same point on the \(x\)-axis.
Tip
DEFINITION 2: If \(T_{A}\colon R^{n}\to R^{m}\) is a matrix transformation:
In brief:
\[ \ker (T_{A}) = \mathrm{null~space~of~}A \tag{6} \]
\[ R(T_{A}) = \mathrm{column~space~of~}A \tag{7} \]
The key to solving mathematical problems often lies in adopting the right point of view.
Subspace of \(R^{n}\) (Kernel/Null Space)
Subspace of \(R^{m}\) (Range/Column Space)
If \(A\) is an \(n\times n\) matrix and \(T_{A}\colon R^{n}\to R^{n}\) is the corresponding matrix operator, then the following statements are equivalent:
Important
This theorem establishes fundamental relationships between the invertibility of a matrix \(A\) and properties of its corresponding matrix transformation \(T_A\).
Proof of \((b)\Rightarrow (c)\):
Assume the kernel of \(T_{A}\) is \(\{\mathbf{0}\}\).
By Formula (6), the null space of \(A\) is \(\{\mathbf{0}\}\).
This implies that the nullity of \(A\) is 0.
By the Rank-Nullity Theorem (\(n = \mathrm{rank}(A) + \mathrm{nullity}(A)\)), if nullity is 0, then \(\mathrm{rank}(A) = n\).
Since \(A\) is an \(n \times n\) matrix and its rank is \(n\), its column space spans \(R^{n}\) (i.e., the column space is all of \(R^{n}\)).
By Formula (7), the range of \(T_{A}\) is \(R^{n}\).
As illustrated in Figure 4.10.6, the operator \(T\colon R^{2}\to R^{2}\) that rotates vectors through an angle \(\theta\) is one-to-one.
In accordance with parts (a) and (d) of Theorem 4.10.1, its standard matrix should be invertible.
The standard matrix for \(T\) is:
\[ [T] = \left[ \begin{array}{cc}\cos \theta & -\sin \theta \\ \sin \theta & \cos \theta \end{array} \right] \tag{10} \]
This matrix is invertible because its determinant is nonzero:
\[ \operatorname *{det}[T] = \left| \begin{array}{cc}\cos \theta & -\sin \theta \\ \sin \theta & \cos \theta \end{array} \right| = \cos^{2}\theta +\sin^{2}\theta = 1\neq 0 \]
As illustrated in Figure 4.10.7, the operator \(T\colon R^{2}\to R^{2}\) that projects onto the \(x\)-axis in the \(xy\)-plane is not one-to-one.
In accordance with parts (a) and (d) of Theorem 4.10.1, its standard matrix should not be invertible.
The standard matrix for \(T\) is:
\[ [T] = \left[ \begin{array}{cc}1 & 0 \\ 0 & 0 \end{array} \right] \]
Since \(\operatorname *{det}[T] = (1)(0) - (0)(0) = 0\), the operator \(T\) is not one-to-one.
If \(T_{A}\colon R^{n}\to R^{n}\) is a one-to-one matrix operator, then \(A\) is invertible (by Theorem 4.10.1).
The matrix operator \(T_{A^{-1}}\colon R^{n}\to R^{n}\) corresponding to \(A^{-1}\) is called the inverse operator of \(T_{A}\).
This terminology is appropriate because \(T_{A}\) and \(T_{A^{-1}}\) cancel each other’s effects:
\[ \begin{array}{r}T_{A}(T_{A^{-1}}(\mathbf{x})) = A A^{-1}\mathbf{x} = I\mathbf{x} = \mathbf{x} \\ T_{A^{-1}}(T_{A}(\mathbf{x})) = A^{-1}A\mathbf{x} = I\mathbf{x} = \mathbf{x} \end{array} \]
Equivalently:
\[ \begin{array}{c}T_{A}\circ T_{A^{-1}} = T_{A A^{-1}} = T_{I} \\ T_{A^{-1}}\circ T_{A} = T_{A^{-1}A} = T_{I} \end{array} \]
Notational matters:
The standard matrices for these operators are related by:
\[ [T_{A^{-1}}] = [T_{A}]^{-1} \tag{8} \] Or, more generally:
\[ [T^{-1}] = [T]^{-1} \tag{9} \]
Let \(T\colon R^{2}\to R^{2}\) be the operator that rotates each vector in \(R^{2}\) through the angle \(\theta\).
Its standard matrix is \([T] = \left[ \begin{array}{cc}\cos \theta & -\sin \theta \\ \sin \theta & \cos \theta \end{array} \right]\).
Geometrically, to undo the effect of \(T\), one must rotate each vector in \(R^{2}\) through the angle \(-\theta\).
The standard matrix for a rotation by \(-\theta\) is:
\[ \left[ \begin{array}{cc}\cos (-\theta) & -\sin (-\theta) \\ \sin (-\theta) & \cos (-\theta) \end{array} \right] = \left[ \begin{array}{cc}\cos \theta & \sin \theta \\ -\sin \theta & \cos \theta \end{array} \right] \]
Let’s compute \([T]^{-1}\): For a \(2 \times 2\) matrix \(\left[ \begin{array}{cc}a & b \\ c & d \end{array} \right]\), its inverse is \(\frac{1}{ad-bc}\left[ \begin{array}{cc}d & -b \\ -c & a \end{array} \right]\).
\[ [T]^{-1} = \frac{1}{\cos^{2}\theta - (-\sin^{2}\theta)}\left[ \begin{array}{cc}\cos \theta & \sin \theta \\ -\sin \theta & \cos \theta \end{array} \right] = \frac{1}{1}\left[ \begin{array}{cc}\cos \theta & \sin \theta \\ -\sin \theta & \cos \theta \end{array} \right] \]
\[ [T]^{-1} = \left[ \begin{array}{cc}\cos \theta & \sin \theta \\ -\sin \theta & \cos \theta \end{array} \right] \] This confirms \([T^{-1}] = \left[ \begin{array}{cc}\cos (-\theta) & -\sin (-\theta) \\ \sin (-\theta) & \cos (-\theta) \end{array} \right]\).
Let’s rotate a vector and then apply its inverse.
Show that the operator \(T\colon R^{2}\to R^{2}\) defined by the equations:
\[ \begin{array}{r}w_{1} = 2x_{1} + x_{2} \\ w_{2} = 3x_{1} + 4x_{2} \end{array} \] is one-to-one, and find \(T^{- 1}(w_{1},w_{2})\).
Solution: The matrix form of these equations is:
\[ \left[ \begin{array}{l}w_{1} \\ w_{2} \end{array} \right] = \left[ \begin{array}{ll}2 & 1 \\ 3 & 4 \end{array} \right]\left[ \begin{array}{l}x_{1} \\ x_{2} \end{array} \right] \] So the standard matrix for \(T\) is:
\[ [T] = \left[ \begin{array}{ll}2 & 1 \\ 3 & 4 \end{array} \right] \] First, check if \(T\) is one-to-one by checking if \([T]\) is invertible (i.e., \(\operatorname *{det}[T] \neq 0\)). \(\operatorname *{det}[T] = (2)(4) - (1)(3) = 8 - 3 = 5 \neq 0\). Since the determinant is non-zero, \([T]\) is invertible, and thus \(T\) is one-to-one.
Now, find the standard matrix for \(T^{-1}\):
\[ [T^{-1}] = [T]^{-1} = \frac{1}{5}\left[ \begin{array}{cc}4 & -1 \\ -3 & 2 \end{array} \right] = \left[ \begin{array}{cc}\frac{4}{5} & -\frac{1}{5} \\ -\frac{3}{5} & \frac{2}{5} \end{array} \right] \]
Thus, to find \(T^{-1}(w_1, w_2)\), we multiply the inverse matrix by the vector \(\left[ \begin{array}{l}w_{1} \\ w_{2} \end{array} \right]\):
\[ [T^{-1}]\left[ \begin{array}{c}w_{1} \\ w_{2} \end{array} \right] = \left[ \begin{array}{cc}\frac{4}{5} & -\frac{1}{5} \\ -\frac{3}{5} & \frac{2}{5} \end{array} \right]\left[ \begin{array}{c}w_{1} \\ w_{2} \end{array} \right] = \left[ \begin{array}{c}\frac{4}{5} w_{1} - \frac{1}{5} w_{2} \\ -\frac{3}{5} w_{1} + \frac{2}{5} w_{2} \end{array} \right] \]
From this, we conclude that:
\[ T^{-1}(w_{1},w_{2}) = \left(\frac{4}{5} w_{1} - \frac{1}{5} w_{2}, - \frac{3}{5} w_{1} + \frac{2}{5} w_{2}\right) \]
This theorem compiles many equivalent statements regarding an \(n \times n\) matrix \(A\). If \(A\) is an \(n \times n\) matrix, then the following statements are equivalent:
Tip
This comprehensive theorem is a cornerstone of linear algebra, connecting concepts from systems of equations, matrix properties, vector spaces, and transformations. Understanding these equivalences deepens your comprehension of matrix behavior.