
Appendix A
Summary of Matrix Algebra

A.1  Vector and Matrix Multiplication

We consider a vector $v$ of length $J$ (in an abstract vector space of dimension $J$) to be an ordered sequence of $J$ numbers. The vector can be displayed either as a column
\[
v = \begin{pmatrix} v_1 \\ v_2 \\ \vdots \\ v_J \end{pmatrix}
\tag{A.1}
\]
or as a row, which we regard as the transpose, denoted T, of the column vector:
\[
v^T = (v_1, v_2, \ldots, v_J).
\tag{A.2}
\]
Vectors of the same dimension can be added together, so that the $j$th entry of $u+v$ is $u_j+v_j$.
The scalar product of two vectors $u$, $v$ is indicated in vector notation by a dot, but in matrix notation the dot is usually omitted. Instead we write it
\[
u^T v = \sum_{j=1}^{J} u_j v_j.
\tag{A.3}
\]
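As a concrete illustration of eq. (A.3), the following minimal sketch (assuming NumPy is available; the vectors are arbitrary example values, not taken from the text) computes the scalar product both as an explicit sum and with the matrix product operator.

```python
import numpy as np

# Example vectors of length J = 3 (arbitrary illustrative values).
u = np.array([1.0, 2.0, 3.0])
v = np.array([4.0, 5.0, 6.0])

# Scalar product as an explicit sum over j, as in eq. (A.3).
explicit = sum(u[j] * v[j] for j in range(len(u)))

# The same product written as u^T v with the matrix product operator.
matrix_form = u @ v

print(explicit, matrix_form)  # both give 32.0
```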
If we have a set of $K$ column vectors $v_k$, for $k=1,\ldots,K$, the $j$th element of the $k$th vector can be written $V_{jk}$, and they can be arrayed compactly one after the other as
\[
V = \begin{pmatrix}
V_{11} & V_{12} & \cdots & V_{1K} \\
V_{21} & V_{22} & \cdots & V_{2K} \\
\vdots & \vdots & \ddots & \vdots \\
V_{J1} & V_{J2} & \cdots & V_{JK}
\end{pmatrix}.
\tag{A.4}
\]
This is a matrix. We can consider matrix multiplication to be a generalization of the scalar product. Premultiplying a $J\times K$ matrix $V$ by a row vector $u^T$ of length $J$ gives a new row vector of length $K$:
\[
u^T V = \left( \sum_{j=1}^{J} u_j V_{j1},\ \sum_{j=1}^{J} u_j V_{j2},\ \ldots,\ \sum_{j=1}^{J} u_j V_{jK} \right).
\tag{A.5}
\]
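The row-vector-times-matrix product of eq. (A.5) can be checked numerically; the sketch below (assuming NumPy, with arbitrary example sizes and values) builds the result column by column and compares it with the library product.

```python
import numpy as np

# A length-J row vector and a J x K matrix (arbitrary illustrative values).
u = np.array([1.0, 2.0, 3.0])          # length J = 3
V = np.arange(12.0).reshape(3, 4)      # J x K = 3 x 4

# Premultiplication u^T V as in eq. (A.5): each entry is a scalar product
# of u with one column of V.
row = u @ V

# The same result assembled column by column from eq. (A.5).
by_columns = np.array([u @ V[:, k] for k in range(V.shape[1])])

print(np.allclose(row, by_columns))  # True
```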
If we further have a set of M row vectors, we can display them as a matrix
\[
U = \begin{pmatrix}
U_{11} & U_{12} & \cdots & U_{1J} \\
U_{21} & U_{22} & \cdots & U_{2J} \\
\vdots & \vdots & \ddots & \vdots \\
U_{M1} & U_{M2} & \cdots & U_{MJ}
\end{pmatrix}
\tag{A.6}
\]
(dispensing with the transpose notation for brevity and consistency). Multiplication of the matrices $U$ ($M\times J$) and $V$ ($J\times K$) can then be considered to give an $M\times K$ matrix:
\[
UV = \begin{pmatrix}
\sum_{j=1}^{J} U_{1j}V_{j1} & \sum_{j=1}^{J} U_{1j}V_{j2} & \cdots & \sum_{j=1}^{J} U_{1j}V_{jK} \\
\sum_{j=1}^{J} U_{2j}V_{j1} & \sum_{j=1}^{J} U_{2j}V_{j2} & \cdots & \sum_{j=1}^{J} U_{2j}V_{jK} \\
\vdots & \vdots & \ddots & \vdots \\
\sum_{j=1}^{J} U_{Mj}V_{j1} & \sum_{j=1}^{J} U_{Mj}V_{j2} & \cdots & \sum_{j=1}^{J} U_{Mj}V_{jK}
\end{pmatrix}.
\tag{A.7}
\]
This is the definition of matrix multiplication. A matrix (or vector) can also be multiplied by a single number: a scalar, $\lambda$ (say). The $(jk)$th element of $\lambda V$ is $\lambda V_{jk}$.
The transpose of a matrix $A=(A_{ij})$ is simply the matrix formed by reversing the order of suffixes: $A^T=(A^T_{ij})=(A_{ji})$. The transpose of a product of two matrices is therefore the reversed product of the transposes:
\[
(AB)^T = B^T A^T.
\tag{A.8}
\]
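A quick numerical check of eqs. (A.7) and (A.8), assuming NumPy; the sizes $M$, $J$, $K$ and the random matrices below are arbitrary choices for illustration.

```python
import numpy as np

rng = np.random.default_rng(0)
M, J, K = 2, 3, 4
U = rng.standard_normal((M, J))   # M x J
V = rng.standard_normal((J, K))   # J x K

# Matrix product as in eq. (A.7): an M x K array of scalar products.
UV = U @ V
print(UV.shape)                   # (2, 4)

# Entry (m, k) is the sum over j of U[m, j] * V[j, k].
m, k = 1, 2
print(np.isclose(UV[m, k], sum(U[m, j] * V[j, k] for j in range(J))))  # True

# Transpose of a product, eq. (A.8): (UV)^T = V^T U^T.
print(np.allclose(UV.T, V.T @ U.T))  # True
```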

A.2  Determinants

The determinant of a square matrix is a single scalar that is an important measure of its character. Determinants may be defined inductively. Suppose we know the definition of determinants of matrices of size $(M-1)\times(M-1)$. Define the determinant of an $M\times M$ matrix $A$, whose $ij$th entry is $A_{ij}$, as the expression
\[
\det(A)=|A| = \sum_{j=1}^{M} A_{1j}\,\mathrm{Co}_{1j}(A),
\tag{A.9}
\]
where $\mathrm{Co}_{ij}(A)$ is the $ij$th cofactor of the matrix $A$. The $ij$th cofactor of an $M\times M$ matrix is $(-1)^{i+j}$ times the determinant of the $(M-1)\times(M-1)$ matrix obtained by removing the $i$th row and the $j$th column of the original matrix:
\[
\mathrm{Co}_{ij}(A)=(-1)^{i+j}
\begin{vmatrix}
A_{11} & \cdots & A_{1,j-1} & A_{1,j+1} & \cdots & A_{1M} \\
\vdots & & \vdots & \vdots & & \vdots \\
A_{i-1,1} & \cdots & A_{i-1,j-1} & A_{i-1,j+1} & \cdots & A_{i-1,M} \\
A_{i+1,1} & \cdots & A_{i+1,j-1} & A_{i+1,j+1} & \cdots & A_{i+1,M} \\
\vdots & & \vdots & \vdots & & \vdots \\
A_{M1} & \cdots & A_{M,j-1} & A_{M,j+1} & \cdots & A_{MM}
\end{vmatrix}.
\tag{A.10}
\]
The inductive definition is completed by defining the determinant of a $1\times1$ matrix to be equal to its single element. The determinant of a $2\times2$ matrix is then $A_{11}A_{22}-A_{12}A_{21}$, and of a $3\times3$ matrix is $A_{11}(A_{22}A_{33}-A_{23}A_{32})+A_{12}(A_{23}A_{31}-A_{21}A_{33})+A_{13}(A_{21}A_{32}-A_{22}A_{31})$.
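The inductive definition of eqs. (A.9) and (A.10) translates directly into a recursive routine. The sketch below is purely illustrative (cofactor expansion is far too slow for large matrices; practical codes use LU factorization), assuming NumPy for array handling.

```python
import numpy as np

def det_by_cofactors(A):
    """Determinant by expansion along the first row, eqs. (A.9)-(A.10)."""
    A = np.asarray(A, dtype=float)
    M = A.shape[0]
    if M == 1:                      # base case of the induction
        return A[0, 0]
    total = 0.0
    for j in range(M):
        # Minor: remove row 0 and column j; the sign (-1)**j here equals
        # (-1)**(1+j) of eq. (A.10) with 1-based indices.
        minor = np.delete(np.delete(A, 0, axis=0), j, axis=1)
        total += (-1) ** j * A[0, j] * det_by_cofactors(minor)
    return total

A = np.array([[2.0, 1.0, 0.0],
              [1.0, 3.0, 1.0],
              [0.0, 1.0, 2.0]])
print(det_by_cofactors(A), np.linalg.det(A))  # both 8.0 (up to rounding)
```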
The determinant of an $M\times M$ matrix may equivalently be defined as the sum, over all the $M!$ possible permutations $P$ of the integers $1,\ldots,M$, of the product of the entries $\prod_i A_{i,P(i)}$ times the signum of $P$ (plus or minus 1 according to whether $P$ is even or odd):
\[
|A| = \sum_{P} \mathrm{sgn}(P)\, A_{1,P(1)} A_{2,P(2)} \cdots A_{M,P(M)}.
\tag{A.11}
\]
This expression shows that there is nothing special about the first row in eq. (A.9). One could equally well have used any row $i$, giving $|A| = \sum_{j=1}^{M} A_{ij}\,\mathrm{Co}_{ij}(A)$; or one could have used any column $j$, giving $|A| = \sum_{i=1}^{M} A_{ij}\,\mathrm{Co}_{ij}(A)$. All the results are the same.
The determinant of the transpose of a matrix $A$ is equal to its determinant: $|A^T|=|A|$. The determinant of a product of two matrices is the product of the determinants: $|AB| = |A|\,|B|$. A matrix is said to be singular if its determinant is zero; otherwise it is nonsingular. If a matrix has two identical (or proportional, i.e. dependent) rows or two identical columns, then its determinant is zero and it is singular.
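These properties are easy to check numerically. The sketch below (assuming NumPy; the matrices are arbitrary random examples) verifies $|A^T|=|A|$ and $|AB|=|A||B|$, and shows that a matrix with two proportional rows has zero determinant.

```python
import numpy as np

rng = np.random.default_rng(1)
A = rng.standard_normal((4, 4))
B = rng.standard_normal((4, 4))

print(np.isclose(np.linalg.det(A.T), np.linalg.det(A)))                  # True
print(np.isclose(np.linalg.det(A @ B),
                 np.linalg.det(A) * np.linalg.det(B)))                   # True

# A singular matrix: its second row is twice its first row.
S = A.copy()
S[1] = 2.0 * S[0]
print(np.isclose(np.linalg.det(S), 0.0))                                 # True
```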

A.3  Inverses

The unit matrix is square,
\[
I=(\delta_{ij}) =
\begin{pmatrix}
1 & 0 & \cdots & 0 \\
0 & 1 & \cdots & 0 \\
\vdots & \vdots & \ddots & \vdots \\
0 & 0 & \cdots & 1
\end{pmatrix}
\tag{A.12}
\]
with ones on the diagonal and zeroes elsewhere. It may be of any size $N$ and, if need be, is then denoted $I_N$. For any $M\times N$ matrix $A$,
\[
I_M A = A \qquad\text{and}\qquad A I_N = A.
\tag{A.13}
\]
The inverse of a square matrix $A$, if it exists, is another matrix, written $A^{-1}$, such that
\[
A^{-1}A = AA^{-1} = I.
\tag{A.14}
\]
A nonsingular square matrix possesses an inverse. A singular matrix does not.
The inverse of a matrix may be identified by considering the identity
\[
\sum_{j=1}^{M} A_{ij}\,\mathrm{Co}_{kj}(A) = \delta_{ik}\,|A|.
\tag{A.15}
\]
For $i=k$, this equality arises as the expansion of the determinant by row $i$. For $i \neq k$, the sum represents the determinant, expanded by row $k$, of a matrix in which row $k$ has been replaced by a copy of row $i$. The modified matrix has two identical rows, so its determinant is zero, as is $\delta_{ik}$ for $i \neq k$. Now regard $\mathrm{Co}(A)$ as a matrix consisting of all the cofactors. Then we can consider $\sum_{j=1}^{M} A_{ij}\,\mathrm{Co}_{kj}(A)$ as the matrix product of $A$ with the transpose of the cofactor matrix, $A\,\mathrm{Co}(A)^T$. So if $|A|$ is nonzero we may divide (A.15) through by it and find
\[
A\left[\mathrm{Co}(A)^T/|A|\right] = I.
\tag{A.16}
\]
This equality shows that
\[
A^{-1} = \mathrm{Co}(A)^T/|A|.
\tag{A.17}
\]
Consequently the solution of the nonsingular matrix equation $Ax=b$ is
\[
x = \frac{\mathrm{Co}(A)^T b}{|A|},
\tag{A.18}
\]
which for column vectors $x$ and $b$ is Cramer's rule.
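A minimal numerical sketch of eqs. (A.17) and (A.18), assuming NumPy; the matrix and right-hand side are arbitrary examples, and library determinants are used for the minors (in practice one would simply call numpy.linalg.inv or numpy.linalg.solve).

```python
import numpy as np

def cofactor_matrix(A):
    """Matrix of cofactors Co_ij(A), eq. (A.10), using library determinants."""
    A = np.asarray(A, dtype=float)
    M = A.shape[0]
    Co = np.empty((M, M))
    for i in range(M):
        for j in range(M):
            minor = np.delete(np.delete(A, i, axis=0), j, axis=1)
            Co[i, j] = (-1) ** (i + j) * np.linalg.det(minor)
    return Co

A = np.array([[2.0, 1.0, 0.0],
              [1.0, 3.0, 1.0],
              [0.0, 1.0, 2.0]])
b = np.array([1.0, 2.0, 3.0])

detA = np.linalg.det(A)
A_inv = cofactor_matrix(A).T / detA            # eq. (A.17)
print(np.allclose(A_inv, np.linalg.inv(A)))    # True

x = cofactor_matrix(A).T @ b / detA            # Cramer's rule, eq. (A.18)
print(np.allclose(A @ x, b))                   # True
```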
The inverse of the product of two nonsingular matrices is the reversed product of their inverses:
\[
(AB)^{-1}=B^{-1}A^{-1}.
\tag{A.19}
\]

A.4  Eigenanalysis

A square matrix $A$ maps the linear space of column vectors onto itself via $Ax = y$, with $y$ the vector onto which $x$ is mapped. An eigenvector is a vector which is mapped onto a multiple of itself. That is,
\[
Ax = \lambda x,
\tag{A.20}
\]
where $\lambda$ is a scalar called the eigenvalue. In general a square matrix of dimension $N$ has $N$ different eigenvectors. Obviously an eigenvector multiplied by any scalar is still an eigenvector, but it is not considered to be a different one.
Since eq. (A.20), which is $(A-\lambda I)x=0$, is a homogeneous equation for the elements of $x$, in order for there to be a non-zero solution $x$, the determinant of the coefficients must be zero:
\[
|A-\lambda I|=0.
\tag{A.21}
\]
For an $N\times N$ matrix, this determinant gives a polynomial of order $N$ for $\lambda$, whose $N$ roots are the $N$ eigenvalues.
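For a small matrix the characteristic polynomial of eq. (A.21) can be formed explicitly and its roots compared with a library eigenvalue routine. A sketch, assuming NumPy (np.poly returns the characteristic-polynomial coefficients of a square array); the matrix is an arbitrary example.

```python
import numpy as np

A = np.array([[2.0, 1.0],
              [1.0, 3.0]])

# Coefficients of |A - lambda I| as a polynomial in lambda, highest power first.
coeffs = np.poly(A)            # here: lambda**2 - 5*lambda + 5
roots = np.sort(np.roots(coeffs))

# Compare with the eigenvalues computed directly.
eigvals = np.sort(np.linalg.eigvals(A))
print(np.allclose(roots, eigvals))   # True
```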
If $A$ is symmetric, that is if $A^T=A$, then the eigenvectors corresponding to different eigenvalues are orthogonal, that is, their scalar product is zero. See this by considering two eigenvectors $e_1$ and $e_2$, corresponding to different eigenvalues $\lambda_1$, $\lambda_2$, and using the respective versions of eq. (A.20) and the properties of the transpose:
\[
e_2^T A e_1 = \lambda_1 e_2^T e_1, \qquad
e_2^T A^T e_1 = (e_1^T A e_2)^T = (\lambda_2 e_1^T e_2)^T = \lambda_2 e_2^T e_1.
\tag{A.22}
\]
So by subtraction,
\[
0 = e_2^T (A - A^T) e_1 = (\lambda_1-\lambda_2)\, e_2^T e_1.
\tag{A.23}
\]
If there are multiple independent eigenvectors with identical eigenvalues, they can be chosen to be orthogonal. In that standard case, the eigenvectors are all orthogonal: $e_i^T e_j=0$ for $i \neq j$.
If we then take the eigenvectors also to be normalized, such that $e_j^T e_j=1$, we can construct a square matrix $U$ whose columns are equal to these eigenvectors (as in eq. (A.4)). A matrix $U$ whose columns are orthonormal is said to be an orthonormal matrix (sometimes just called orthogonal). The inverse of $U$ is its transpose: $U^{-1} = U^T$. This $U$ is a unitary basis transformation which diagonalizes $A$. This fact follows from the observation that $AU=UD$, where $D$ is the diagonal matrix constructed from the eigenvalues:
\[
D = \begin{pmatrix}
\lambda_1 & 0 & \cdots & 0 \\
0 & \lambda_2 & \cdots & 0 \\
\vdots & \vdots & \ddots & \vdots \\
0 & 0 & \cdots & \lambda_N
\end{pmatrix}.
\tag{A.24}
\]
Therefore
\[
U^T A U = U^T U D = D.
\tag{A.25}
\]
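A numerical illustration of the diagonalization of a symmetric matrix, eqs. (A.24) and (A.25), assuming NumPy; np.linalg.eigh returns the eigenvalues and an orthonormal matrix of eigenvector columns for a symmetric input, and the matrix below is an arbitrary example.

```python
import numpy as np

# A symmetric matrix (arbitrary illustrative values).
A = np.array([[2.0, 1.0, 0.0],
              [1.0, 3.0, 1.0],
              [0.0, 1.0, 2.0]])

# Eigenvalues and orthonormal eigenvector columns: A U = U D.
eigvals, U = np.linalg.eigh(A)
D = np.diag(eigvals)

print(np.allclose(A @ U, U @ D))          # eq. (A.20), column by column: True
print(np.allclose(U.T @ U, np.eye(3)))    # U^{-1} = U^T: True
print(np.allclose(U.T @ A @ U, D))        # eq. (A.25): True
```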
