Linear transformations

Linear Transformations

Definition. A map $T:V\to W$ from one vector space $V$ to another $W$ is called linear if for all $x, y\in V$ and all $a, b\in\mathbb{F}$ one has $T(ax+by) = aT(x)+bT(y)$.

Synonym: a linear map is also called a linear transformation or a linear operator.

In this definition it is assumed that $V$ and $W$ are vector spaces over the same field $\mathbb{F}$. If the two vector spaces are defined over different fields then the concept of linear map from $V$ to $W$ is not defined.

Sometimes it is more convenient to split the task of verifying that a certain map $T:V\to W$ is linear into two steps:

  1. show that $T(x+y) = T(x)+T(y)$ for all $x, y\in V$ (additivity), and
  2. show that $T(ax) = aT(x)$ for all $x\in V$ and all $a\in\mathbb{F}$ (homogeneity).

Problem. Show that the definition implies that if $T:V\to W$ is linear, then $T(0_V)=0_W$.
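
One possible line of argument (a sketch, using the definition with $x=y=0_V$ and $a=b=0$):

$$T(0_V) = T(0\cdot 0_V + 0\cdot 0_V) = 0\cdot T(0_V) + 0\cdot T(0_V) = 0_W.$$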

Examples from geometry

Rotations in the plane. Let $V=W=\mathbb{R}^2$ and $\mathbb{F}=\mathbb{R}$, and consider the map $R:\mathbb{R}^2\to\mathbb{R}^2$ defined by counterclockwise rotation by an angle of $\theta$ radians. Show that $R$ is linear. Find a formula for $R\begin{pmatrix} x_1 \\ x_2 \end{pmatrix}$.
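
For reference, the formula one should arrive at (it anticipates the rotation matrix that reappears in the section on composition and matrix multiplication) is

$$R\begin{pmatrix} x_1 \\ x_2 \end{pmatrix} = \begin{pmatrix} x_1\cos\theta - x_2\sin\theta \\ x_1\sin\theta + x_2\cos\theta \end{pmatrix}.$$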

Reflections in the plane. Again consider $V=W=\mathbb{R}^2$, and let $\ell\subset\mathbb{R}^2$ be some line through the origin. Define $S(x)$ to be the reflection of $x$ in the line $\ell$.

Show that $S:\mathbb{R}^2\to\mathbb{R}^2$ is linear, and find a formula for $S\begin{pmatrix} x_1 \\ x_2 \end{pmatrix}$ in the case where $\ell$ is the diagonal $x_1=x_2$.
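
A sketch of the answer for the diagonal: reflecting in the line $x_1=x_2$ simply interchanges the two coordinates, so the formula one should find is

$$S\begin{pmatrix} x_1 \\ x_2 \end{pmatrix} = \begin{pmatrix} x_2 \\ x_1 \end{pmatrix}.$$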

Projection onto a line. Let $V=\mathbb{R}^2$ and let $W$ be a line through the origin in $\mathbb{R}^2$. Consider the map $P:V\to W$ for which $Px$ is the orthogonal projection of $x$ onto $W$. Show that $P$ is a linear transformation.
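
One convenient way to write the projection (introducing a unit vector $u$, which is not part of the problem statement): if $u\in\mathbb{R}^2$ is a unit vector spanning the line $W$, then

$$Px = (x\cdot u)\,u,$$

and linearity of $P$ follows because the dot product $x\cdot u$ depends linearly on $x$.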

Rigid rotation in $\mathbb{R}^3$. In this example, let $V=W=\mathbb{R}^3$, and let $T:\mathbb{R}^3\to\mathbb{R}^3$ be the map defined by first rotating around the $z$-axis over $30^\circ$ and then rotating around the $x$-axis over $45^\circ$. Show that $T:\mathbb{R}^3\to\mathbb{R}^3$ is linear.

We postpone finding a formula for $T\left(\begin{smallmatrix} x_1 \\ x_2 \\x_3 \end{smallmatrix}\right)$ to the next chapter on matrices.

Examples from algebra

Let $a,b,c,d\in\mathbb{F}$ be given numbers in the field $\mathbb{F}$ and consider the map $T:\mathbb{F}^2 \to \mathbb{F}^2$ given by

$$T\begin{pmatrix} x_1 \\ x_2 \end{pmatrix} =\begin{pmatrix} ax_1+bx_2 \\ cx_1+dx_2 \end{pmatrix}$$

for all $\begin{pmatrix} x_1 \\ x_2 \end{pmatrix}\in\mathbb{F}^2$. Show that $T$ is linear. Visualize the map $T$ you get if $a=2$, $b=c=0$, $d=\frac12$.
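
A sketch of the linearity check (using scalars $\lambda,\mu$ to avoid a clash with the given numbers $a,b,c,d$):

$$T\left(\lambda\begin{pmatrix} x_1\\x_2\end{pmatrix}+\mu\begin{pmatrix} y_1\\y_2\end{pmatrix}\right) = \begin{pmatrix} a(\lambda x_1+\mu y_1)+b(\lambda x_2+\mu y_2) \\ c(\lambda x_1+\mu y_1)+d(\lambda x_2+\mu y_2) \end{pmatrix} = \lambda\, T\begin{pmatrix} x_1\\x_2\end{pmatrix}+\mu\, T\begin{pmatrix} y_1\\y_2\end{pmatrix}.$$

For $a=2$, $b=c=0$, $d=\frac12$ the map stretches vectors by a factor $2$ in the $x_1$-direction and compresses them by a factor $\frac12$ in the $x_2$-direction.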

Examples from differential equations

Let $W=\mathcal{F}(\mathbb{R},\mathbb{R})$, and let $V\subset W$ be the subspace of functions that are differentiable. Consider the map $D:V\to W$ given by

$$(Df)(x) = f'(x).$$

Thus $D(e^x) = e^x$, $D(\sin x) = \cos x$, $D(x^2)=2x$, etc.

Verify that $D:V\to W$ is linear.
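
A sketch of the verification, using the sum and constant-multiple rules for derivatives: for differentiable $f,g$ and $a,b\in\mathbb{R}$,

$$\bigl(D(af+bg)\bigr)(x) = (af+bg)'(x) = af'(x)+bg'(x) = a(Df)(x)+b(Dg)(x),$$

so $D(af+bg)=a\,Df+b\,Dg$.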

Null space and range; rank

Definition. If $T:V\to W$ is a linear map, then the null space of $T$ is the set $N(T)=\{x\in V : T(x)=0_W\}$, and the range of $T$ is the set $R(T)=\{T(x) : x\in V\}\subset W$.

The null space is sometimes also called the kernel of the map, and the notation $N(T) = \mathrm{ker}\,T$ is sometimes used.

Theorem. The null space of a linear transformation $T:V\to W$ is a linear subspace of $V$; the range of $T$ is a linear subspace of $W$.

Proof

$N(T)$ is a linear subspace:

Since $T(0_V)=0_W$ we have $0_V\in N(T)$. Moreover, if $x,y\in N(T)$ and $a,b\in\mathbb{F}$, then $T(ax+by)=aT(x)+bT(y)=0$, so $ax+by\in N(T)$. It follows that $N(T)$ is not empty, and closed under addition and scalar multiplication, so that $N(T)$ is a linear subspace.

$R(T)\subset W$ is a linear subspace:

We have $0_W=T(0_V)\in R(T)$. Moreover, if $y_1=T(x_1)$ and $y_2=T(x_2)$ belong to $R(T)$ and $a,b\in\mathbb{F}$, then $ay_1+by_2=T(ax_1+bx_2)\in R(T)$. It follows that $R(T)$ is not empty, and closed under addition and scalar multiplication, so that $R(T)$ is a linear subspace.

Definition. The rank of $T$ is the dimension of the range of $T$.

Injectivity Theorem. A linear map $T:V\to W$ is injective if and only if $N(T)=\{0\}$.

Proof

First we show that $N(T)=\{0\}$ implies that $T$ is injective.

Suppose $N(T)=\{0\}$. To prove that $T$ is injective, we have to show for all $x,y\in V$ that $Tx=Ty$ implies $x=y$.

So let $x,y\in V$ be given with $Tx=Ty$. Then $T(x-y) = Tx - Ty = 0$. Therefore $x-y\in N(T)$. Since $N(T)=\{0\}$ it follows that $x-y=0$, which implies that $x=y$.

Next we show that if $T$ is injective then $N(T)=\{0\}$.

Assume $T$ is injective. We must show that $N(T)$ only contains the zero vector. Suppose $x\in N(T)$. Then $Tx=0$. It is always true that $T(0)=0$. Since $T$ is injective and since $T(x) = T(0)$ we conclude that $x=0$. So $N(T)$ only contains the zero vector.

Rank+Nullity Theorem. If $T:V\to W$ is linear, and if $V$ is finite dimensional, then

$$\dim N(T) + \dim R(T) = \dim V .$$

Proof

Choose a basis $\{v_1, \dots, v_r\}$ of the null space $N(T)$. Then choose vectors $v_{r+1}, \dots, v_n\in V$ so that $\{v_1, \dots, v_r, v_{r+1}, \dots, v_n\}$ is a basis for $V$, and write $w_i = Tv_i$ for $i=r+1,\dots,n$. We will show that $\beta = \{w_{r+1}, \dots, w_n\}$ is a basis for $R(T)$. The rank+nullity formula then follows because we will have shown that $\dim V=n$, $\dim N(T)=r$, and $\dim R(T)=n-r$.

$\beta$ spans $R(T)$: if $y\in R(T)$ then there is an $x\in V$ with $y=Tx$. We can write $x=x_1v_1+\cdots+x_nv_n$. Since $Tv_1=\cdots=Tv_r=0$ we have

$$\begin{aligned} y=Tx&=T(x_1v_1+\cdots+x_nv_n)\\ &=x_{r+1}Tv_{r+1}+\cdots+x_nTv_n \\ &=x_{r+1}w_{r+1} + \cdots +x_nw_n\\ &\in\mathrm{span}(w_{r+1}, \dots, w_n). \end{aligned}$$

$\beta$ is linearly independent: Suppose $c_{r+1}w_{r+1} + \cdots + c_nw_n=0$ for certain $c_{r+1}, \dots, c_n\in\mathbb{F}$. Then

$$T\bigl(c_{r+1}v_{r+1} + \cdots + c_nv_n\bigr) = 0,$$

which implies $c_{r+1}v_{r+1} + \cdots + c_nv_n \in N(T)$. It follows that there are numbers $c_1, \dots, c_r$ such that

$$c_{r+1}v_{r+1} + \cdots + c_nv_n = c_1v_1+\cdots+c_rv_r.$$

Since $\{v_1, \dots, v_n\}$ is a basis for $V$ we conclude that $c_1=\cdots=c_n=0$; in particular $c_{r+1}=\cdots=c_n=0$. Hence $\beta$ is linearly independent.
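
A quick sanity check of the formula: for the projection $P:\mathbb{R}^2\to\mathbb{R}^2$ given by $P(x_1,x_2)=(x_1,0)$, the null space is the $x_2$-axis and the range is the $x_1$-axis, so

$$\dim N(P) + \dim R(P) = 1 + 1 = 2 = \dim \mathbb{R}^2.$$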

Bijectivity Theorem. If $V$ and $W$ are finite dimensional vector spaces with the same dimension, and if $T:V\to W$ is a linear transformation, then the following are equivalent:

  1. $T$ is injective (one-to-one)
  2. $N(T)=\{0\}$
  3. $\mathrm{rank}\,T = \dim V$
  4. $T$ is surjective (onto)

A very important special case is when $V=W$ and $V$ is finite dimensional.

Proof

$1\Longleftrightarrow 2$: this is what the Injectivity Theorem says.

$2\Longleftrightarrow 3$: If $N(T)=\{0\}$ then $\dim N(T)=0$ and the rank+nullity theorem says $\dim V = \dim R(T)+\dim N(T) = \dim R(T) = \mathrm{rank}\, T$. Conversely, if $\mathrm{rank}\,T=\dim V$ then the rank+nullity theorem says that $\dim N(T)=0$, i.e. $N(T)=\{0\}$.

$3\implies 4$: If $\mathrm{rank}\,T=\dim V$ then $R(T)$ is a subspace of $W$ with $\dim R(T)=\dim V=\dim W$. This implies $R(T)=W$, i.e. $T$ is onto.

$4\implies 3$: If $T$ is onto then $R(T)=W$, and thus $\mathrm{rank}\,T=\dim R(T)=\dim W=\dim V$.

Solving linear equations

Let $T:V\to W$ be a linear transformation between vector spaces, and consider the equation

$$Tx=y.$$

Here $y\in W$ is given and $x\in V$ is the unknown. The standard questions are: is there a solution, and if there is a solution, how many? Linear algebra provides the following answers.

Does $Tx=y$ have a solution?

$Tx=y$ has a solution if and only if $y\in R(T)$. This is just the definition of the range of a linear transformation. What does this tell us? It doesn't say we can always solve $Tx=y$, but the set of $y$ for which there are solutions has a nice property: $R(T)$ is a linear subspace of $W$. Therefore, if we can find a solution to $Tx_1=y_1$ and $Tx_2=y_2$, then we can also find a solution to $Tz=c_1y_1+c_2y_2$ (one solution is $z=c_1x_1+c_2x_2$; there might be others).

What is the form of the general solution to $Tx=y$?

Suppose that $x, x'\in V$ both are solutions, i.e. $T(x)=T(x')=y$. Then $T(x-x')=0$, so $x-x'\in N(T)$: the difference between any two solutions lies in the null space of $T$.

Conversely, suppose that $x$ is a solution, i.e. $T(x)=y$, and suppose $u\in N(T)$. Then $T(x+u)=T(x)+T(u)=y+0=y$, i.e. $x+u$ is also a solution. Hence, given a solution $x$ to $T(x)=y$, one can get another solution by adding any vector $u$ in the null space to $x$.

Particular solutions and the homogeneous equation. If $x_p\in V$ is a solution of $T(x)=y$, then every solution of $T(x)=y$ is given by

$$x = x_p+x_h\quad\text{with }x_h\in N(T).$$

In this context the following terminology is very often used: $x_p$ is called a particular solution of $T(x)=y$; the equation $T(x)=0$ is called the homogeneous equation; and any $x_h\in N(T)$ is called a solution of the homogeneous equation.

If $r\stackrel{\rm def}{=}\dim N(T)<\infty$, and if we know a basis $\{u_1, \dots, u_r\}$ for the null space $N(T)$, then every vector $x_h$ in the null space is given by

$$x_h = c_1u_1+\cdots+c_ru_r$$

for certain $c_1, \dots, c_r\in\mathbb{F}$. If we also know a particular solution $x_p$ of $T(x)=y$, then the general solution to the equation $T(x)=y$ (i.e. every solution) is given by

$$x = x_p+c_1u_1+\cdots+c_ru_r$$

where $c_1, \dots, c_r\in\mathbb{F}$ are arbitrary constants.
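
As a small illustration (with made-up numbers): let $T:\mathbb{R}^2\to\mathbb{R}^2$ be given by $T(x_1,x_2)=(x_1-x_2,\,2x_1-2x_2)$ and take $y=(1,2)$. Then $x_p=(1,0)$ is a particular solution, $N(T)$ is spanned by $u_1=(1,1)$, and the general solution is

$$x = \begin{pmatrix}1\\0\end{pmatrix} + c_1\begin{pmatrix}1\\1\end{pmatrix}, \qquad c_1\in\mathbb{R}.$$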

Systems of equations

Consider the linear transformation $A:\mathbb{F}^n\to\mathbb{F}^m$ given by

$$A\begin{bmatrix} x_1\\ \vdots \\x_n \end{bmatrix} \stackrel{\rm def}{=} \begin{bmatrix} a_{11}x_1+\cdots+a_{1n}x_n \\ \vdots \\ a_{m1}x_1+\cdots+a_{mn}x_n \end{bmatrix}$$

For any vector $y = \left[\begin{smallmatrix} y_1\\\vdots\\y_m \end{smallmatrix}\right] \in \mathbb{F}^m$ the equation $Ax=y$ is then equivalent to the system of linear equations for the unknowns $x_1, \dots, x_n$ given by

$$\begin{aligned} a_{11}x_1 + \cdots + a_{1n}x_n \,&= y_1 \\ \vdots\quad& \\ a_{m1}x_1 + \cdots + a_{mn}x_n &= y_m \end{aligned}$$

In other words, we are considering $m$ linear equations with $n$ unknowns.

In this setting the rank+nullity theorem says that $\dim N(A)+\dim R(A)=\dim V$, i.e. $\dim N(A) + \dim R(A) = n$.

More equations than unknowns, i.e. $m>n$

It follows from $\dim N(A) + \dim R(A) = n$ that $\dim R(A) = n-\dim N(A) \leq n < m$. So in this case $R(A)$ is always a proper subspace of $\mathbb{F}^m$: the equation $Ax=y$ does not have a solution for most $y$.

For those $y\in \mathbb{F}^m$ for which the system does have a solution, the general solution contains $r$ arbitrary constants, where $r=\dim N(A) = n-\dim R(A)$.

More unknowns than equations, i.e. $m<n$

Since $R(A)\subset \mathbb{F}^m$, we have $\dim R(A)\leq m$. By the rank+nullity theorem,

$$\dim N(A) = \dim V-\dim R(A) = n-\dim R(A)\geq n-m > 0.$$

So in this case the dimension of the null space is always positive, i.e. there are nonzero solutions to $Ax=0$. For any $y\in \mathbb{F}^m$ one of the following occurs: either $y\notin R(A)$ and the system has no solution at all, or $y\in R(A)$ and the general solution contains $\dim N(A)\geq n-m>0$ arbitrary constants.
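
For instance (one equation in two unknowns, so $m=1<n=2$, over $\mathbb{R}$): the equation $x_1+2x_2=y_1$ is solvable for every $y_1$, the null space of $A(x_1,x_2)=x_1+2x_2$ is spanned by $(-2,1)$, and the general solution is

$$\begin{pmatrix}x_1\\x_2\end{pmatrix} = \begin{pmatrix}y_1\\0\end{pmatrix} + c\begin{pmatrix}-2\\1\end{pmatrix},$$

which contains $1 = n-m$ arbitrary constant.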

As many equations as unknowns, i.e. $m = n$

We again have $\dim N(A) = n-\dim R(A)$.

If $A$ is injective then $N(A)=\{0\}$, and hence $\dim N(A)=0$, so that $\dim R(A) = n$. Since $R(A)\subset \mathbb{F}^n$, this implies that when $A$ is injective, $A$ is also surjective.

If on the other hand $A$ is not injective, then $\dim N(A)>0$ and thus $\dim R(A)<n$. In this situation $R(A)$ is a proper subspace of $\mathbb{F}^n$, and therefore $Ax=y$ does not have a solution for all $y$.

The components of a vector with respect to a basis

Definition. An ordered basis of a vector space $V$ is an ordered list of vectors $\beta = (v_1, \dots, v_n)$ such that $\{v_1, \dots, v_n\}$ is a basis of $V$.

If $\beta = (v_1, \dots, v_n)$ is an ordered basis of $V$ and $x\in V$ is any vector, then there exist $x_1, \dots, x_n\in \mathbb{F}$ such that

$$x = x_1v_1 + \cdots + x_nv_n \,.$$

The numbers $x_1, \dots, x_n$ are called the components of $x$ with respect to the basis $\beta$. These components determine a column vector. In the notation of the textbook:

$$[x]_\beta= \begin{pmatrix} x_1 \\ \vdots \\ x_n \end{pmatrix} \in\mathbb{F}^n.$$

Instead of components, the $x_i$ are sometimes also called the coordinates of $x$.
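
For example, if $\beta=\left(\begin{pmatrix}1\\1\end{pmatrix},\begin{pmatrix}1\\-1\end{pmatrix}\right)$ is an ordered basis of $\mathbb{R}^2$ and $x=\begin{pmatrix}3\\1\end{pmatrix}$, then $x = 2\begin{pmatrix}1\\1\end{pmatrix} + 1\begin{pmatrix}1\\-1\end{pmatrix}$, so

$$[x]_\beta = \begin{pmatrix}2\\1\end{pmatrix}.$$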

Matrix representation of a linear transformation

Let $T:V\to W$ be a linear transformation, and let $\beta = \{v_1, \dots, v_n\}$ be an ordered basis for $V$ and $\gamma = \{w_1, \dots, w_m\}$ an ordered basis for $W$. Each vector $Tv_i$ can be written as a linear combination of $w_1, \dots, w_m$, i.e. there exist numbers $a_{ij}\in\mathbb{F}$ such that

$$Tv_i = a_{1i}w_1 + \cdots + a_{mi}w_m \qquad (i=1, 2, \dots, n).$$

Linearity of $T$ allows us to compute $T(x)$ if we know the coefficients $a_{ij}$ and the components $x_j$ of $x$ in the basis $v_1, \dots, v_n$. Namely, one has:

$$\begin{aligned} Tx &= T(x_1v_1+\cdots +x_nv_n) \\ &=x_1 T(v_1) + \cdots + x_n T(v_n) \\ &=x_1 \left(a_{11}w_1 + \cdots + a_{m1}w_m\right) + \cdots + x_n \left(a_{1n}w_1 + \cdots + a_{mn}w_m\right) \\ &\qquad\text{(rearrange terms)} \\ &=\left(a_{11}x_1+\cdots+a_{1n}x_n\right) w_1 + \cdots + \left(a_{m1}x_1+\cdots+a_{mn}x_n\right) w_m \end{aligned}$$

Thus

$$[Tx]_\gamma = \begin{pmatrix} a_{11}x_1+\cdots+a_{1n}x_n \\ \vdots \\ a_{m1}x_1+\cdots+a_{mn}x_n \end{pmatrix}$$

The coefficients $a_{ij}$ of the linear transformation $T$ with respect to the ordered bases $\beta$ and $\gamma$ form a matrix

$$[T]_\beta^\gamma = \begin{pmatrix} a_{11} & \cdots & a_{1n} \\ \vdots & & \vdots \\ a_{m1} & \cdots & a_{mn} \end{pmatrix}$$

which is called the matrix representation of $T$ in the bases $\beta, \gamma$.

Since it turns out to be easy to confuse rows and columns, the following observation may be helpful: the first column $\left(\begin{smallmatrix} a_{11} \\ \vdots \\ a_{m1} \end{smallmatrix}\right)$ of the matrix $[T]_\beta^\gamma$ contains the components of $Tv_1$ expressed in the basis $\{w_1, \dots, w_m\}$.

Example. If $m=3$ and $n=2$, so that $W$ is three dimensional with basis $\gamma=\{w_1, w_2, w_3\}$ and $V$ is two dimensional with basis $\beta=\{v_1, v_2\}$, and if

$$Tv_1 = w_1-3w_2+5w_3, \qquad Tv_2=-w_1,$$

then the matrix of $T$ in these bases is

$$[T]_\beta^\gamma = \begin{pmatrix} 1 & -1 \\ -3 & 0 \\ 5 & 0 \end{pmatrix}.$$

Special case: $A:\mathbb{F}^n\to\mathbb{F}^m$

The vector space $\mathbb{F}^n$ has the standard basis $\{e_1, \dots, e_n\}$. If the matrix of a linear transformation $A:\mathbb{F}^n\to\mathbb{F}^m$ with respect to the standard bases is given by $(a_{ij})$, then $A$ is given by matrix multiplication:

$$Ax = \begin{pmatrix} a_{11} & \cdots & a_{1n} \\ \vdots && \vdots \\ a_{m1} & \cdots & a_{mn} \end{pmatrix} \begin{pmatrix} x_1 \\ \vdots \\ x_n \end{pmatrix}$$

Composition of transformations and matrix multiplication

If we have three vector spaces $U, V, W$ and linear transformations $A:V\to W$ and $B:U\to V$, then we can define the composition $AB:U\to W$ by

$$(AB)(x) \stackrel{\rm def}{=} A\bigl(B(x)\bigr) \text{ for all } x\in U.$$

Theorem. If $A:V\to W$ and $B:U\to V$ are linear transformations of vector spaces $U,V,W$, then the composition $AB:U\to W$ is also linear.

The proof is a homework problem.

If we have ordered bases $\alpha=\{u_1, \dots, u_l\}$ for $U$, $\beta=\{v_1,\dots, v_m\}$ for $V$, and $\gamma=\{w_1,\dots, w_n\}$ for $W$, then the matrices of $A$ and $B$ with respect to these bases are

$$[A]_\beta^\gamma = \begin{pmatrix} a_{11} & \cdots & a_{1m} \\ \vdots & & \vdots \\ a_{n1} & \cdots & a_{nm} \end{pmatrix}, \qquad [B]_\alpha^\beta = \begin{pmatrix} b_{11} & \cdots & b_{1l} \\ \vdots & & \vdots \\ b_{m1} & \cdots & b_{ml} \end{pmatrix}$$

where

$$Av_j = a_{1j}w_1+\cdots+a_{nj}w_n,\qquad Bu_k= b_{1k}v_1+\cdots+b_{mk}v_m.$$

To find the matrix of $AB$ with respect to the bases $\alpha, \gamma$ we express $AB(u_k)$ in terms of $\{w_1, \dots, w_n\}$:

$$\begin{aligned} AB(u_k) =&\; A\bigl(Bu_k\bigr) \\ =&\; A\left(b_{1k}v_1+\cdots+b_{mk}v_m \right) \\ =&\; b_{1k}Av_1+\cdots+b_{mk}Av_m \\ =&\; b_{1k}\left\{a_{11}w_1+\cdots+a_{n1}w_n\right\}+\\ &\; b_{2k}\left\{a_{12}w_1+\cdots+a_{n2}w_n\right\}+\\ &\;\quad\vdots\quad+\\ &\; b_{mk}\left\{a_{1m}w_1+\cdots+a_{nm}w_n\right\} \\ =&\; \left\{a_{11}b_{1k} + \cdots + a_{1m}b_{mk}\right\} w_1 + \cdots + \left\{a_{n1}b_{1k} + \cdots + a_{nm}b_{mk}\right\} w_n \end{aligned}$$

This shows that the $k^{\rm th}$ column of the matrix $[AB]_\alpha^\gamma$ is given by

$$\begin{pmatrix} a_{11}b_{1k} + \cdots + a_{1m}b_{mk} \\ \vdots \\ a_{n1}b_{1k} + \cdots + a_{nm}b_{mk} \end{pmatrix}$$

Definition. If $\mathcal A = (a_{ij})$ is an $n\times l$ matrix and $\mathcal B = (b_{jk})$ is an $l\times m$ matrix, then the matrix product $\mathcal A\mathcal B$ is defined to be the $n\times m$ matrix $\mathcal C=(c_{ik})$ whose entries are given by

$$c_{ik} = a_{i1}b_{1k}+\cdots+a_{il}b_{lk} = \sum_{j=1}^l a_{ij}b_{jk} .$$
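
A small numerical instance of this definition (with made-up entries, $n=2$, $l=3$, $m=2$):

$$\begin{pmatrix}1&2&0\\0&1&3\end{pmatrix}\begin{pmatrix}1&0\\2&1\\0&1\end{pmatrix} = \begin{pmatrix}1\cdot1+2\cdot2+0\cdot0 & 1\cdot0+2\cdot1+0\cdot1\\ 0\cdot1+1\cdot2+3\cdot0 & 0\cdot0+1\cdot1+3\cdot1\end{pmatrix} = \begin{pmatrix}5&2\\2&4\end{pmatrix}.$$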

With this definition we have just shown the following

Theorem. $[AB]_\alpha^\gamma = [A]_\beta^\gamma\, [B]_\alpha^\beta$

Example. Let $R(\theta):\mathbb{R}^2\to\mathbb{R}^2$ be rotation through an angle $\theta$. Then the matrix of $R(\theta)$ is given by

$$\mathcal{R} (\theta) = \begin{pmatrix} \cos \theta & -\sin \theta \\ \sin\theta & \cos \theta \end{pmatrix}$$

If we first rotate by $\theta$ and then by $\phi$ we achieve the same as rotating by $\theta+\phi$. This implies

$$\mathcal{R}(\theta+\phi) = \mathcal{R}(\theta)\mathcal{R}(\phi).$$

I.e.

$$\begin{aligned} &\begin{pmatrix} \cos (\theta+\phi) & -\sin (\theta+\phi) \\ \sin(\theta+\phi) & \cos (\theta+\phi) \end{pmatrix}\\ &\qquad= \begin{pmatrix} \cos \theta & -\sin \theta \\ \sin\theta & \cos \theta \end{pmatrix} \begin{pmatrix} \cos \phi & -\sin \phi \\ \sin\phi & \cos \phi \end{pmatrix}\\ &\qquad= \begin{pmatrix} \cos \theta \cos\phi - \sin\theta\sin\phi & -\sin \theta\cos\phi-\cos\theta\sin\phi \\ \sin\theta\cos\phi+\cos\theta\sin\phi & \cos \theta \cos\phi - \sin\theta\sin\phi \end{pmatrix} \end{aligned}$$

Thus we recover the addition formulas for sine and cosine:

$$\begin{aligned} \cos(\theta+\phi) &= \cos \theta \cos\phi - \sin\theta \sin\phi \\ \sin(\theta+\phi) &= \sin\theta\cos\phi+\cos\theta\sin\phi \end{aligned}$$

The vector space $\mathcal{L}(V,W)$

The set of all linear transformations from one vector space $V$ to another $W$ is itself a vector space over the same field $\mathbb{F}$. Addition is defined by saying that for any two linear maps $T,S:V\to W$ one has

$$(T+S)(x) \stackrel{\rm def}{=} T(x)+S(x) \text{ for all } x\in V,$$

and for any linear map $T:V\to W$ and any number $a\in\mathbb{F}$ one has

$$(aT)(x) \stackrel{\rm def}{=} a\bigl(T(x)\bigr) \text{ for all } x\in V.$$

Notation. $\mathcal{L}(V,W)=\bigl\{T \mid T:V\to W \text{ is a linear transformation}\bigr\}$

If $V=W$ then one writes $\mathcal{L}(V)$ instead of $\mathcal{L}(V, W)$.

Theorem. If $T,S:V\to W$ are linear and if $a\in\mathbb{F}$, then the maps $T+S:V\to W$ and $aT:V\to W$ are linear. The set $\mathcal{L}(V, W)$ of linear maps $T:V\to W$ is a vector space.

Proof

To show that $T+S$ is linear, consider

$$\begin{aligned} (T+S)(x+y) &= T(x+y)+S(x+y) \\ &= Tx + Ty+Sx+Sy\\ &= Tx + Sx+ Ty+Sy\\ &= (T+S)(x) + (T+S)(y). \end{aligned}$$

A similar computation shows that $(T+S)(tx) = t(T+S)(x)$ for all $x\in V$ and $t\in\mathbb{F}$. This proves that $T+S$ is linear, i.e. $T+S\in\mathcal{L}(V,W)$.

The same kind of computation shows that $aT\in\mathcal{L}(V,W)$.

Yet more routine computations prove that $\mathcal{L}(V,W)$ satisfies the vector space axioms.

The case in which $V=W$ is special because if $T,S:V\to V$ then not only are $T+S$ and $aT$ linear transformations $V\to V$, but the compositions $ST$ and $TS$ are also linear transformations from $V$ to itself.

Inverses and other powers of a linear transformation

Definition. A linear transformation $T:V\to W$ is called invertible if $T$ is both injective and surjective.

Theorem. If $T:V\to W$ is linear and invertible, then $T^{-1}:W\to V$ is also linear.

Proof

By definition $T^{-1}(y)=x \iff y=T(x)$ for all $x\in V$, $y\in W$. To show that $T^{-1}$ is additive, let $y_1, y_2\in W$ be given, and define $x_1,x_2\in V$ by $x_1=T^{-1}y_1$, $x_2=T^{-1}y_2$. Then

$$\begin{aligned} T(x_1+x_2) = Tx_1 + Tx_2 = y_1+y_2 &\implies x_1+x_2 = T^{-1}(y_1+y_2) \\ &\implies T^{-1}y_1+T^{-1}y_2 = T^{-1}(y_1+y_2). \end{aligned}$$

So $T^{-1}$ is indeed additive.

Similar arguments show that $T^{-1}(ay)=aT^{-1}(y)$ for all $a\in \mathbb{F}$, $y\in W$.

Theorem. If $T:V\to W$ is invertible and $\dim V< \infty$, then $\dim W=\dim V$.

Definition. If $T:V\to V$ is linear, then $T^k$, the $k^{\rm th}$ power of $T$, is defined by

$$T^k = \overbrace{T\cdot T\cdot T\cdots T}^{k\text{ factors}}$$

if $k$ is a positive integer. If $T$ is invertible, then one also defines $T^{-k} = \bigl(T^{-1}\bigr)^k$.

Theorem. $T^{k+l} = T^k T^l$ for all $k,l\in\mathbb{N}$ and all $T\in \mathcal{L}(V)$.

Solving linear equations

If $A:V\to W$ is linear and invertible, then the equation

$$Ax=y$$

has a unique solution $x\in V$ for each $y\in W$. The solution is

$$x=A^{-1}y.$$

Example: compute the matrix of the inverse

Let $A:\mathbb{R}^2\to\mathbb{R}^2$ be given by

$$A\begin{pmatrix} x_1\\x_2\end{pmatrix} = \begin{pmatrix} 1 & 2\\ 2& 5 \end{pmatrix} \begin{pmatrix} x_1 \\ x_2 \end{pmatrix}$$

Is $A$ invertible, and if it is, compute the matrix of $A^{-1}$.
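
A sketch of one possible solution: solving $Ax=y$ by elimination gives $x_2 = y_2-2y_1$ and then $x_1 = y_1-2x_2 = 5y_1-2y_2$, so $A$ is invertible with

$$A^{-1} = \begin{pmatrix} 5 & -2 \\ -2 & 1\end{pmatrix},$$

which can be checked by verifying that $\begin{pmatrix}1&2\\2&5\end{pmatrix}\begin{pmatrix}5&-2\\-2&1\end{pmatrix} = \begin{pmatrix}1&0\\0&1\end{pmatrix}$.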