invertible matrix: Information and Much More from Answers.com

  • ️Wed Jul 01 2015

In linear algebra, an n-by-n (square) matrix A is called invertible or non-singular if there exists an n-by-n matrix B such that

AB = BA = I_n

where I_n denotes the n-by-n identity matrix and the multiplication used is ordinary matrix multiplication. If this is the case, then the matrix B is uniquely determined by A and is called the inverse of A, denoted by A^{-1}. It follows from the theory of matrices that if

AB = I

for square matrices A and B, then also

BA = I.

While the most common case is that of matrices over the real or complex numbers, all these definitions can be given for matrices over any ring.

A square matrix that is not invertible is called singular or degenerate. A square matrix is singular if and only if its determinant is 0.

Over the field of real numbers, the set of singular n-by-n matrices, considered as a subset of R^{n×n}, is a null set, i.e., has Lebesgue measure zero. (This is true because the singular matrices are exactly the zeros of the polynomial function given by the determinant.) This can be interpreted as saying that almost all n-by-n matrices are invertible. Intuitively, this means that if you pick a random square matrix over the reals, the probability that it will be singular is zero. In practice, however, one may encounter non-invertible matrices. In numerical calculations, matrices that are invertible but close to a non-invertible matrix can still be problematic; such matrices are said to be ill-conditioned.

Matrix inversion is the process of finding the matrix B that satisfies the prior equation for a given invertible matrix A.

Properties of invertible matrices

Let A be a square n-by-n matrix over a field K (for example the field R of real numbers). Then the following statements are equivalent:

  • A is invertible.
  • A is row-equivalent to the n-by-n identity matrix I_n.
  • A is column-equivalent to the n-by-n identity matrix I_n.
  • A has n pivot positions.
  • det A ≠ 0.
  • rank A = n.
  • The equation Ax = 0 has only the trivial solution x = 0 (i.e., Null A = {0}).
  • The equation Ax = b has exactly one solution for each b in K^n.
  • The columns of A are linearly independent.
  • The columns of A span K^n (i.e., Col A = K^n).
  • The columns of A form a basis of K^n.
  • The linear transformation mapping x to Ax is a bijection from K^n to K^n.
  • There is an n-by-n matrix B such that AB = I_n.
  • The transpose A^T is an invertible matrix.
  • The product of the transpose and the matrix, A^T A, is an invertible matrix.
  • The number 0 is not an eigenvalue of A.
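
Two of the conditions above, rank A = n and det A ≠ 0, are easy to check side by side. The following is a minimal sketch, not a reference implementation: the function name `rank_and_det` is introduced here for illustration, and exact rational arithmetic (Python's `fractions` module) is assumed so that no rounding affects the test for zero.

```python
from fractions import Fraction


def rank_and_det(matrix):
    """Row-reduce a square matrix exactly; return (rank, determinant)."""
    a = [[Fraction(x) for x in row] for row in matrix]
    n = len(a)
    det = Fraction(1)
    rank = 0
    for col in range(n):
        # Look for a row at or below position `rank` with a non-zero entry.
        pivot = next((r for r in range(rank, n) if a[r][col] != 0), None)
        if pivot is None:
            det = Fraction(0)  # a column without a pivot means A is singular
            continue
        if pivot != rank:
            a[rank], a[pivot] = a[pivot], a[rank]
            det = -det  # a row swap flips the sign of the determinant
        det *= a[rank][col]
        # Eliminate the entries below the pivot.
        for r in range(rank + 1, n):
            factor = a[r][col] / a[rank][col]
            a[r] = [x - factor * y for x, y in zip(a[r], a[rank])]
        rank += 1
    return rank, det


# An invertible matrix: full rank and non-zero determinant.
print(rank_and_det([[2, 1], [1, 1]]))
# A singular matrix: the second row is twice the first.
print(rank_and_det([[1, 2], [2, 4]]))
```

For the first matrix the sketch reports rank 2 and determinant 1; for the second it reports rank 1 and determinant 0, in agreement with the equivalence.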

In general, a square matrix over a commutative ring is invertible if and only if its determinant is a unit in that ring.

The inverse of an invertible matrix A is itself invertible, with

\left(A^{-1}\right)^{-1} = A.

Multiplying an invertible matrix A by a non-zero scalar k gives an invertible matrix whose inverse is the product of the inverses of the scalar and the matrix:

\left(kA\right)^{-1} = k^{-1}A^{-1}.

For an invertible matrix A, the transpose of the inverse is the inverse of the transpose:

(A^\mathrm{T})^{-1} = (A^{-1})^\mathrm{T} \,

The product of two invertible matrices A and B of the same size is again invertible, with the inverse given by

\left(AB\right)^{-1} = B^{-1}A^{-1}

(note that the order of the factors is reversed). As a consequence, the set of invertible n-by-n matrices forms a group, known as the general linear group GL(n).

Proof for matrix product rule

If A_1, A_2, ..., A_n are nonsingular square matrices over a field, then

(A_1A_2\cdots A_n)^{-1} = A_n^{-1}A_{n-1}^{-1}\cdots A_1^{-1}

It becomes evident why this is the case if one attempts to find an inverse for the product of the A_i from first principles, that is, if we wish to determine B such that

(A_1A_2\cdots A_n)B=I

where B is the inverse matrix of the product. To remove A_1 from the product, we multiply both sides on the left by A_1^{-1} and write

A_1^{-1}(A_1A_2\cdots A_n)B=A_1^{-1}I

which would reduce the equation to

(A_2A_3\cdots A_n)B=A_1^{-1}I

Likewise, multiplying both sides on the left by A_2^{-1} gives

A_2^{-1}(A_2A_3\cdots A_n)B=A_2^{-1}A_1^{-1}I

which simplifies to

(A_3A_4\cdots A_n)B=A_2^{-1}A_1^{-1}I

If one repeats the process up to A_n, the equation becomes

B=A_n^{-1}A_{n-1}^{-1}\cdots A_2^{-1}A_1^{-1}I
B=A_n^{-1}A_{n-1}^{-1}\cdots A_2^{-1}A_1^{-1}

But B is the inverse matrix, i.e. B = (A_1A_2\cdots A_n)^{-1}, so the property is established.

Methods of matrix inversion

Gaussian elimination

Gaussian elimination is an algorithm that can be used to determine whether a given matrix is invertible and to find the inverse. An alternative is the LU decomposition, which generates upper and lower triangular matrices that are easier to invert. For special purposes, it may be convenient to invert matrices by treating mn-by-mn matrices as m-by-m matrices of n-by-n matrices and applying one or another formula recursively (matrices of other sizes can be padded out with dummy rows and columns). For other purposes, a variant of Newton's method may be convenient, particularly when dealing with families of related matrices, so that inverses of earlier matrices can be used to seed the iteration for later ones.
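
The elimination approach can be sketched as Gauss-Jordan reduction of the augmented matrix [A | I]: once the left half has been reduced to the identity, the right half holds the inverse. The code below is an illustrative sketch under stated assumptions, not a production routine. The name `invert` is introduced here, and exact rational arithmetic is assumed, so any non-zero pivot is acceptable (a floating-point version would normally choose the largest available pivot instead).

```python
from fractions import Fraction


def invert(matrix):
    """Invert a square matrix by Gauss-Jordan elimination on [A | I]."""
    n = len(matrix)
    # Augment each row of A with the matching row of the identity matrix.
    aug = [
        [Fraction(x) for x in row] + [Fraction(int(i == j)) for j in range(n)]
        for i, row in enumerate(matrix)
    ]
    for col in range(n):
        # Pick a row at or below the diagonal with a non-zero entry.
        pivot = next((r for r in range(col, n) if aug[r][col] != 0), None)
        if pivot is None:
            raise ValueError("matrix is singular")
        aug[col], aug[pivot] = aug[pivot], aug[col]
        # Scale the pivot row so that the pivot becomes 1.
        p = aug[col][col]
        aug[col] = [x / p for x in aug[col]]
        # Clear this column in every other row.
        for r in range(n):
            if r != col and aug[r][col] != 0:
                factor = aug[r][col]
                aug[r] = [x - factor * y for x, y in zip(aug[r], aug[col])]
    # The right half of the augmented matrix is now the inverse.
    return [row[n:] for row in aug]


print(invert([[2, 1], [1, 1]]))
```

A singular input is detected when some column has no usable pivot, in which case the sketch raises `ValueError` rather than returning a result.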

Analytic solution

Writing the transpose of the matrix of cofactors, known as the adjugate matrix, can also be an efficient way to calculate the inverse of small matrices (since this method is essentially recursive, it becomes inefficient for large matrices). To determine the inverse, we calculate a matrix of cofactors:

A^{-1}={1 \over \begin{vmatrix}A\end{vmatrix}}\left(C_{ij}\right)^{T}={1 \over \begin{vmatrix}A\end{vmatrix}}\left(C_{ji}\right)={1 \over \begin{vmatrix}A\end{vmatrix}} \begin{pmatrix} C_{11} & C_{21} & \cdots & C_{n1} \\ C_{12} & C_{22} & \cdots & C_{n2} \\ \vdots & \vdots & \ddots & \vdots \\ C_{1n} & C_{2n} & \cdots & C_{nn} \\ \end{pmatrix}

where |A| is the determinant of A, C_{ij} is the (i, j) cofactor of A, and the superscript T denotes the matrix transpose.
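
The cofactor formula translates almost directly into code. The sketch below is for illustration only: the helper names `minor`, `det` and `inverse_by_adjugate` are introduced here, and the determinant is computed by naive cofactor expansion, which is exactly the recursive cost that makes the method impractical for large matrices.

```python
from fractions import Fraction


def minor(m, i, j):
    """Return m with row i and column j removed."""
    return [row[:j] + row[j + 1:] for k, row in enumerate(m) if k != i]


def det(m):
    """Determinant by cofactor expansion along the first row."""
    if len(m) == 0:
        return 1  # the determinant of the empty matrix is 1
    return sum((-1) ** j * m[0][j] * det(minor(m, 0, j)) for j in range(len(m)))


def inverse_by_adjugate(m):
    """Inverse as the transposed cofactor matrix divided by the determinant."""
    n = len(m)
    d = Fraction(det(m))
    if d == 0:
        raise ValueError("matrix is singular")
    # Entry (i, j) of the inverse is the cofactor C_ji divided by det(A).
    return [
        [(-1) ** (i + j) * det(minor(m, j, i)) / d for j in range(n)]
        for i in range(n)
    ]


print(inverse_by_adjugate([[0, 1, 2], [1, 0, 3], [4, -3, 8]]))
```

Note the swapped indices in `minor(m, j, i)`: they implement the transpose in the formula.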

For most practical applications, it is not necessary to invert a matrix to solve a system of linear equations; however, for a unique solution, it is necessary that the matrix involved be invertible.

Decomposition techniques such as LU decomposition are much faster than inversion, and various fast algorithms for special classes of linear systems have also been developed.
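
To illustrate solving a system without forming the inverse, here is a minimal sketch of LU decomposition with row exchanges followed by forward and back substitution. The function names `lu_decompose` and `lu_solve` are introduced here for illustration, and exact rational arithmetic is assumed. The factorization is computed once and can then be reused for any number of right-hand sides b.

```python
from fractions import Fraction


def lu_decompose(matrix):
    """Return (perm, lower, upper) such that row perm[i] of A is row i of lower*upper."""
    n = len(matrix)
    upper = [[Fraction(x) for x in row] for row in matrix]
    lower = [[Fraction(0)] * n for _ in range(n)]
    perm = list(range(n))
    for col in range(n):
        pivot = next((r for r in range(col, n) if upper[r][col] != 0), None)
        if pivot is None:
            raise ValueError("matrix is singular")
        # Apply the same row exchange to upper, to the built part of lower, and to perm.
        upper[col], upper[pivot] = upper[pivot], upper[col]
        lower[col], lower[pivot] = lower[pivot], lower[col]
        perm[col], perm[pivot] = perm[pivot], perm[col]
        lower[col][col] = Fraction(1)
        for r in range(col + 1, n):
            lower[r][col] = upper[r][col] / upper[col][col]
            upper[r] = [x - lower[r][col] * y for x, y in zip(upper[r], upper[col])]
    return perm, lower, upper


def lu_solve(perm, lower, upper, b):
    """Solve A x = b using a factorization returned by lu_decompose."""
    n = len(b)
    # Forward substitution: lower * y = (b reordered by perm).
    y = []
    for i in range(n):
        y.append(Fraction(b[perm[i]]) - sum(lower[i][k] * y[k] for k in range(i)))
    # Back substitution: upper * x = y.
    x = [Fraction(0)] * n
    for i in reversed(range(n)):
        tail = sum(upper[i][k] * x[k] for k in range(i + 1, n))
        x[i] = (y[i] - tail) / upper[i][i]
    return x


factors = lu_decompose([[0, 1, 2], [1, 0, 3], [4, -3, 8]])
print(lu_solve(*factors, [8, 10, 22]))
```

For this matrix and right-hand side the solution is x = (1, 2, 3), which can be confirmed by substituting back into Ax = b.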

Inversion of 2×2 matrices

The cofactor equation listed above yields the following result for 2×2 matrices. Inversion of these matrices can be done easily as follows: [1]

A^{-1} = \begin{bmatrix} a & b \\ c & d \\ \end{bmatrix}^{-1} = \frac{1}{ad - bc} \begin{bmatrix} d & -b \\ -c & a \\ \end{bmatrix}
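
The 2×2 formula is short enough to implement directly. This is a minimal sketch; the function name `inverse_2x2` is introduced here, and exact rational arithmetic is assumed so that the result can be compared without rounding.

```python
from fractions import Fraction


def inverse_2x2(m):
    """Invert [[a, b], [c, d]] using the closed-form 2x2 formula."""
    (a, b), (c, d) = m
    det = Fraction(a * d - b * c)
    if det == 0:
        raise ValueError("matrix is singular: ad - bc = 0")
    # Swap the diagonal entries, negate the off-diagonal ones, divide by ad - bc.
    return [[d / det, -b / det], [-c / det, a / det]]


print(inverse_2x2([[1, 2], [3, 4]]))
```

For the matrix with rows (1, 2) and (3, 4), the determinant is -2 and the inverse has rows (-2, 1) and (3/2, -1/2).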

Blockwise inversion

Matrices can also be inverted blockwise by using the following analytic inversion formula:

\begin{bmatrix} A & B \\ C & D \end{bmatrix}^{-1} = \begin{bmatrix} A^{-1}+A^{-1}B(D-CA^{-1}B)^{-1}CA^{-1} & -A^{-1}B(D-CA^{-1}B)^{-1} \\ -(D-CA^{-1}B)^{-1}CA^{-1} & (D-CA^{-1}B)^{-1} \end{bmatrix} \qquad (1)

where A, B, C and D are matrix sub-blocks of arbitrary size. This strategy is particularly advantageous if A is diagonal and D - CA^{-1}B (the Schur complement of A) is a small matrix, since they are the only matrices requiring inversion. This technique was invented by Volker Strassen, who also invented the Strassen algorithm for faster matrix multiplication.
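
As a sanity check on the blockwise formula, consider the smallest case, in which every block is 1×1. The worked example below assumes scalar blocks A = a, B = b, C = c, D = d with a ≠ 0 and ad - bc ≠ 0; the lowercase names and the symbol s for the Schur complement of a are introduced here only for illustration. The blocks of the inverse, A^{-1} + A^{-1}B s^{-1} C A^{-1}, -A^{-1}B s^{-1}, -s^{-1} C A^{-1} and s^{-1}, then collapse to the familiar 2×2 formula:

```latex
s = d - c a^{-1} b = \frac{ad - bc}{a}, \qquad s^{-1} = \frac{a}{ad - bc}

a^{-1} + a^{-1} b \, s^{-1} c \, a^{-1} = \frac{1}{a} + \frac{bc}{a(ad - bc)} = \frac{d}{ad - bc}

-a^{-1} b \, s^{-1} = \frac{-b}{ad - bc}, \qquad -s^{-1} c \, a^{-1} = \frac{-c}{ad - bc}

\begin{bmatrix} a & b \\ c & d \end{bmatrix}^{-1} = \frac{1}{ad - bc} \begin{bmatrix} d & -b \\ -c & a \end{bmatrix}
```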

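The inversion procedure that led to Equation (1) performed matrix block operations that operated on C and D first. Instead, if A and B are operated on first, the result is

\begin{bmatrix} A & B \\ C & D \end{bmatrix}^{-1} = \begin{bmatrix} (A-BD^{-1}C)^{-1} & -(A-BD^{-1}C)^{-1}BD^{-1} \\ -D^{-1}C(A-BD^{-1}C)^{-1} & D^{-1}+D^{-1}C(A-BD^{-1}C)^{-1}BD^{-1} \end{bmatrix} \qquad (2)

Equating Equations (1) and (2) leads to

(A-BD^{-1}C)^{-1} = A^{-1}+A^{-1}B(D-CA^{-1}B)^{-1}CA^{-1} \qquad (3)
(A-BD^{-1}C)^{-1}BD^{-1} = A^{-1}B(D-CA^{-1}B)^{-1}\,
D^{-1}C(A-BD^{-1}C)^{-1} = (D-CA^{-1}B)^{-1}CA^{-1}\,
D^{-1}+D^{-1}C(A-BD^{-1}C)^{-1}BD^{-1} = (D-CA^{-1}B)^{-1}\,

where Equation (3) is the matrix inversion lemma, which is equivalent to the binomial inverse theorem.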

Proof (slightly different than the binomial inverse theorem proof, but more intuitive)

First multiply the RHS of (3) by the inverse of the LHS, that is, by A - BD^{-1}C, to get

I=I-BD^{-1}CA^{-1}+\left(A-BD^{-1}C\right)\left(A^{-1}B\right)\left(D-CA^{-1}B\right)^{-1}CA^{-1}

Note that

\left(A-BD^{-1}C\right)\left(A^{-1}B\right)\left(D-CA^{-1}B\right)^{-1}=\left(B-BD^{-1}CA^{-1}B\right)\left(D-CA^{-1}B\right)^{-1}=B\underbrace{\left(I-D^{-1}CA^{-1}B\right)\left(D-CA^{-1}B\right)^{-1}}_P

If we can show that P = D^{-1}, then the BD^{-1}CA^{-1} terms cancel out. This is accomplished by noting

\left(I-D^{-1}CA^{-1}B\right)=D^{-1}\left(D-CA^{-1}B\right)

Hence we have shown that P is indeed equal to D^{-1}. After the BD^{-1}CA^{-1} terms cancel, all that is left is the identity matrix, and the proof is complete.

The derivative of the matrix inverse

Suppose that the matrix A depends on a parameter t. Then the derivative of the inverse of A with respect to t is given by

\frac{\mathrm{d}A^{-1}}{\mathrm{d}t} = - A^{-1} \frac{\mathrm{d}A}{\mathrm{d}t} A^{-1}.

This formula can be found by differentiating the identity

A^{-1}A = I.\,
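
The intermediate steps can be written out as follows. This is a short derivation that assumes A(t) is differentiable and invertible at the value of t in question; it uses only the product rule and the fact that the identity matrix does not depend on t.

```latex
\frac{\mathrm{d}}{\mathrm{d}t}\left(A^{-1}A\right) = \frac{\mathrm{d}A^{-1}}{\mathrm{d}t} A + A^{-1} \frac{\mathrm{d}A}{\mathrm{d}t} = \frac{\mathrm{d}I}{\mathrm{d}t} = 0

\frac{\mathrm{d}A^{-1}}{\mathrm{d}t} A = - A^{-1} \frac{\mathrm{d}A}{\mathrm{d}t}

\frac{\mathrm{d}A^{-1}}{\mathrm{d}t} = - A^{-1} \frac{\mathrm{d}A}{\mathrm{d}t} A^{-1}
```

The last line follows by multiplying the second line on the right by A^{-1}. The order of the factors matters, because A and its derivative need not commute.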

The Moore-Penrose pseudoinverse

Some of the properties of inverse matrices are shared by (Moore-Penrose) pseudoinverses, which can be defined for any m-by-n matrix.

Matrix inverses in real-time simulations

Matrix inversion plays a significant role in computer graphics, particularly in 3D graphics rendering and 3D simulations. Examples include screen-to-world ray casting, world-to-subspace-to-world object transformations, and physical simulations. The problem there is usually the numerical complexity of calculating the inverses of 3×3 and 4×4 matrices. Compared to matrix multiplication or creation of rotation matrices, inversion is several orders of magnitude slower. There are existing solutions that use hand-crafted assembly routines and SIMD processor extensions (SSE, SSE2, AltiVec) to address this problem and achieve a performance improvement of as much as a factor of 5.

References

  1. ^ Strang, Gilbert (2006). Linear Algebra and Its Applications. Thomson Brooks/Cole, p. 46. ISBN 0-03-010567-6.

This entry is from Wikipedia, the leading user-contributed encyclopedia. It may not have been reviewed by professional editors (see full disclaimer)