14
Jun 18

Matrix inversion: doing some housekeeping at elementary level

Matrix inversion: doing some housekeeping at elementary level

Any axiomatic treatment of the mathematical material should be preceded by a concrete, I would say naive, exploration.

P.S. Alexandrov (1896-1982)

What candidate would you suggest for the identity matrix?

Think simple, take a 2D case. Some students, even those who have taken matrix algebra, suggest

I=\left(\begin{array}{cc}1&1\\1&1\end{array}\right).

I suggest analyzing the defining property AI=A, that is

\left(\begin{array}{cc}a_{11}&a_{12}\\a_{21}&a_{22}\end{array}\right)\left(\begin{array}{cc}  i_{11}&i_{12}\\i_{21}&i_{22}\end{array}\right)=\left(\begin{array}{cc}a_{11}&a_{12}\\a_{21}&a_{22}\end{array}\right).

Upon multiplication this becomes

\left(\begin{array}{cc}a_{11}i_{11}+a_{12}i_{21}&a_{11}i_{12}+a_{12}i_{22}\\  a_{21}i_{11}+a_{22}i_{21}&a_{21}i_{12}+a_{22}i_{22}\end{array}\right)=\left(\begin{array}{cc}  a_{11}&a_{12}\\a_{21}&a_{22}\end{array}\right).

This should be true for ANY a_{ij}. Selecting a_{11}=0, a_{12}=1 and comparing elements in the upper left corner we see that i_{21}=0. Similarly, equating the elements in the lower right corner, we get i_{12}=0. The equation simplifies to

\left(\begin{array}{cc}a_{11}i_{11}&a_{12}i_{22}\\a_{21}i_{11}&a_{22}i_{22}\end{array}  \right) =\left(\begin{array}{cc}a_{11}&a_{12}\\a_{21}&a_{22}\end{array}\right).

This implies i_{11}=i_{22}=1. Thus,

I=\left(\begin{array}{cc}1&0\\0&1\end{array}  \right) .

This easily generalizes to higher dimensions:

Definition. The identity matrix is a square matrix with unities on the main diagonal and zeros outside it. The above exercise does not free us from the necessity of proving AI=A for all A, which I leave to the reader.

Invertibility of matrices

Definition. Let A be a square matrix. Its inverse, denoted A^{-1}, satisfies by definition

(1) AA^{-1}=A^{-1}A=I.

We say that A is invertible if its inverse A^{-1} exists.

Remark. (1) includes two equations: AA^{-1}=I (right inverse) and A^{-1}A=I (left inverse). We need both because of absence of commutativity. In fact, the existence of the right inverse implies the existence of the left inverse, and vice versa. There are many facts equivalent to invertibility.

Surprise #2. The condition A\neq 0 is not sufficient for A to be invertible.

Proof. Take

A=\left(\begin{array}{cc}1&0\\0&0\end{array}\right)\neq 0.

Suppose it's inverse exists and denote it

B=\left(\begin{array}{cc}b_{11}&b_{12}\\b_{21}&b_{22}\end{array}\right).

From the equation AB=I we have

\left(\begin{array}{cc}1&0\\0&0\end{array}\right)\left(\begin{array}{cc}b_{11}&b_{12}\\  b_{21}&b_{22}\end{array}\right)=\left(\begin{array}{cc}1&0\\0&1\end{array}\right).

The element in the lower right corner gives 0=1, which is impossible.

How to use definition (1)

Rephrasing (1): whenever you have a matrix B such that AB=BA=I, you can conclude that A^{-1}=B.

Example 1 (analog of (\frac{1}{a})^{-1}=a, inverse of an inverse(A^{-1})^{-1}=A. This is because by (1) B=A satisfies BA^{-1}=A^{-1}B=I.

Example 2 (surprise #3, inverse of a product). If A,B are invertible, then (AB)^{-1}=B^{-1}A^{-1}. To prove this, it suffices to check that the product B^{-1}A^{-1} inverts AB. We do this just for the right inverse:

(AB)(B^{-1}A^{-1})               (by associativity)

=A(BB^{-1})A^{-1}             (by (1))

=AIA^{-1}                         (the identity matrix can be omitted)

=AA^{-1}=I                    (again by (1)) .

Leave a Reply

You must be logged in to post a comment.