Section 8.2 The matrix transpose
Definition 8.2.1.
The transpose of the \(m\times n\) matrix \(A\) is the \(n\times m\) matrix \(A^T\) whose rows are the columns of \(A\text{.}\)
Example 8.2.2.
If \(A=\begin{bmatrix}
4 \amp -3 \amp 0 \amp 5 \\
-1 \amp 2 \amp 1 \amp 3 \\
\end{bmatrix}
\text{,}\) then \(A^T=\begin{bmatrix}
4 \amp -1 \\
-3 \amp 2 \\
0 \amp 1 \\
5 \amp 3 \\
\end{bmatrix}\)
Activity 8.2.1.
This activity illustrates how multiplying a vector by \(A^T\) is related to computing dot products with the columns of \(A\text{.}\) You’ll develop a better understanding of this relationship if you compute the dot products and matrix products in this activity without using technology.
If \(B =
\begin{bmatrix}
3 \amp 4 \\
-1 \amp 2 \\
0 \amp -2 \\
\end{bmatrix}
\text{,}\) write the matrix \(B^T\text{.}\)
Suppose that
\begin{equation*}
\vvec_1=\threevec20{-2},\hspace{24pt}
\vvec_2=\threevec112,\hspace{24pt}
\wvec=\threevec{-2}23\text{.}
\end{equation*}
Find the dot products \(\vvec_1\cdot\wvec\) and \(\vvec_2\cdot\wvec\text{.}\)
Now write the matrix \(A = \begin{bmatrix} \vvec_1 \amp \vvec_2 \end{bmatrix}\) and its transpose \(A^T\text{.}\) Find the product \(A^T\wvec\) and describe how this product computes both dot products \(\vvec_1\cdot\wvec\) and \(\vvec_2\cdot\wvec\text{.}\)
Suppose that \(\xvec\) is a vector that is orthogonal to both \(\vvec_1\) and \(\vvec_2\text{.}\) What does this say about the dot products \(\vvec_1\cdot\xvec\) and \(\vvec_2\cdot\xvec\text{?}\) What does this say about the product \(A^T\xvec\text{?}\)
Use the matrix \(A^T\) to give a parametric description of all the vectors \(\xvec\) that are orthogonal to \(\vvec_1\) and \(\vvec_2\text{.}\)
Remember that \(\nul(A^T)\text{,}\) the null space of \(A^T\text{,}\) is the solution set of the equation \(A^T\xvec=\zerovec\text{.}\) If \(\xvec\) is a vector in \(\nul(A^T)\text{,}\) explain why \(\xvec\) must be orthogonal to both \(\vvec_1\) and \(\vvec_2\text{.}\)
Remember that \(\col(A)\text{,}\) the column space of \(A\text{,}\) is the set of linear combinations of the columns of \(A\text{.}\) Therefore, any vector in \(\col(A)\) can be written as \(c_1\vvec_1+c_2\vvec_2\text{.}\) If \(\xvec\) is a vector in \(\nul(A^T)\text{,}\) explain why \(\xvec\) is orthogonal to every vector in \(\col(A)\text{.}\)
Solution.
\(\displaystyle A^T = \begin{bmatrix}
3 \amp -1 \amp 0 \\
4 \amp 2 \amp -2
\end{bmatrix}\)
\begin{align*}
\vvec_1\cdot\wvec \amp{}={} 2(-2) + 0(2) + (-2)3
\amp{}={} -10\\
\vvec_2\cdot\wvec \amp{}={} 1(-2) + 1(2) + 2(3)
\amp{}={} 6
\end{align*}
\begin{equation*}
A^T\wvec =
\twovec{2(-2)+0(2)+(-2)3}{1(-2)+1(2)+2(3)} =
\twovec{-10}{6}
=\twovec{\vvec_1\cdot\wvec}{\vvec_2\cdot\wvec}
\end{equation*}
Both dot products are 0 so we have \(A^T\xvec =
\zerovec\text{.}\)
We need to solve the equation \(A^T\xvec =
\zerovec\) so we find the reduced row echelon form
\begin{equation*}
A^T \sim \begin{bmatrix}
1 \amp 0 \amp -1 \\
0 \amp 1 \amp 3\\
\end{bmatrix}\text{.}
\end{equation*}
The vectors orthogonal to both \(\vvec_1\) and \(\vvec_2\) have the form \(\xvec=x_3\threevec1{-3}1\text{.}\)
\(A^T\xvec =
\twovec{\vvec_1\cdot\xvec}{\vvec_2\cdot\xvec} =
\twovec00\) tells us that \(\vvec_1\cdot\xvec=0\) and \(\vvec_2\cdot\xvec=0\text{.}\)
Since \(\xvec\) is orthogonal to both \(\vvec_1\) and \(\vvec_2\text{,}\) we have
\begin{equation*}
(c_1\vvec_1+c_2\vvec_2)\cdot\xvec =
c_1\vvec_1\cdot\xvec + c_2\vvec_2\cdot\xvec = 0
\end{equation*}
The previous activity demonstrates an important connection between the matrix transpose and dot products. More specifically, the components of the product \(A^T\xvec\) are simply the dot products of the columns of \(A\) with \(\xvec\text{.}\) We will make frequent use of this observation so let’s record it as a proposition.
Proposition 8.2.3.
If \(A\) is the matrix whose columns are \(\vvec_1,\vvec_2,\ldots,\vvec_n\text{,}\) then
\begin{equation*}
A^T\xvec =
\begin{bmatrix}
\vvec_1\cdot\xvec \\
\vvec_2\cdot\xvec \\
\vdots \\
\vvec_n\cdot\xvec \\
\end{bmatrix}
\end{equation*}