ma1506chap6

Upload: minh-tieu

Post on 02-Jun-2018

215 views

Category:

Documents


0 download

TRANSCRIPT

  • 8/11/2019 MA1506CHAP6

    1/58

    CHAPTER 6

    LINEAR TRANSFORMATIONS

    6.1 WHAT IS A LINEAR TRANSFORMA-

    TION?

    You know what a function is - its a RULE which

    turns NUMBERS INTO OTHER NUMBERS: f(x) =

    x2 means please turn 3 into 9, 12 into 144 and so

    on.

    Similarly a TRANSFORMATION is a rule whichturns VECTORS into other VECTORS. For exam-

    ple, please rotate all 3-dimensional vectors through

    an angle of 90 clockwise around thez-axis. A LIN-

    EAR TRANSFORMATION T is one that ALSOsatisfies these rules: ifc is any scalar, and

    u and

    v

    are vectors, then

    T(cu ) =cT(

    u ) and T(

    u +

    v ) =T(

    u ) + T(

    v ).

    1

  • 8/11/2019 MA1506CHAP6

    2/58

    EXAMPLE: Let I be the rule Iu =

    u for all

    u .

    You can check that I is linear! Called IDENTITY

    Linear Transformation.

    EXAMPLE : Let D be the rule Du = 2

    u for all

    u .

    D(cu ) = 2(c

    u ) =c(2

    u ) =cD

    u

    D(u +

    v ) = 2(

    u +

    v ) = 2

    u + 2

    v = D

    u +D

    v

    LINEAR!

    Note: Usually we write D(u ) as just D

    u .

    6.2. THE BASIC BOX, AND THE MATRIX

    OF A LINEAR TRANSFORMATION

    The usual vectors i andj define a square:

    Lets call this the BASIC BOX in two dimensions.

    2

  • 8/11/2019 MA1506CHAP6

    3/58

    Similarly, i, j, and k define the BASIC BOX in 3

    dimensions.

    Now let T be any linear transformation. You know

    that any 2-dimensional vector can be written asai +

    bj, for some numbers aand b. So for any vector, we

    have

    T(ai + bj) =aTi + bTj.

    This formula tells us something very important: IF

    I KNOW WHAT T DOES TO i and j, THEN I

    KNOW EVERYTHING ABOUT T - because now I

    can tell you what Tdoes to ANY vector.

    EXAMPLE: Suppose I know that T(i) = i+ 14j

    and T(j) = 14 i +j. Then what is T(2i + 3j)?

    Answer: T(2i+ 3j) = 2Ti+ 3Tj = 2

    i + 14j

    +

    314 i +

    j

    = 2i + 12j+ 34 i + 3

    j = 114 i + 72

    j.

    Since Ti and Tj tell me everything I need to know,

    3

  • 8/11/2019 MA1506CHAP6

    4/58

    this means that I can tell you everything about T by

    telling you WHAT IT DOES TO THE BASIC BOX.

    EXAMPLE: Let Tbe the same transformation as

    above, T(i) = i + 14j and T(j) = 14 i +

    j.

    The basic box has been squashed a bit! Pictures

    of WHAT T DOES TO THE BASIC BOX tell us

    everything about T!

    EXAMPLE: IfD is the transformation Du = 2

    u ,

    then the Basic Box just gets stretched:

    So every LT can be pictured by

    seeing what it does to the Basic Box.

    There is another way!

    Let Ti =

    a

    c

    and Tj =

    b

    d

    . Then we DEFINE

    4

  • 8/11/2019 MA1506CHAP6

    5/58

    THE MATRIX OF TRELATIVE TO i, jas

    a b

    c d

    ,

    that is, the first COLUMN tells us what happened

    to i, and the second column tells us what happened

    toj.

    EXAMPLE: Let Ibe the identity transformation.

    Then Ii=

    i=

    10

    , Ij =

    j =

    01

    , so the matrix of

    the identity transformation relative to ij is

    1 00 1

    .

    EXAMPLE: If Du = 2

    u , then Di =

    20

    and

    Dj =

    02

    so the matrix of D relative to i, j is2 00 2

    .

    EXAMPLE: IfTi= i+ 14j and Tj = 14 i+j, then

    the matrix is

    1

    1

    414 1

    .

    EXAMPLE: If Ti =j and Tj = i, the matrix is0 11 0

    . Basic box is REFLECTED

    5

  • 8/11/2019 MA1506CHAP6

    6/58

    EXAMPLE: Suppose in 3 dimensions Ti= i+4j +

    7k, Tj = 2i+ 5j+ 8k, Tk = 3i+ 6j+ 9k, then the

    matrix is

    1 2 34 5 67 8 9

    , relative to ijk.

    EXAMPLE: Suppose Ti = i+j + 2k and Tj =

    i

    3k. This is an example of a LT that eats 2-

    dimensional vectors but PRODUCES 3-dimensional

    vectors. But it still has a matrix,

    1 11 0

    2 3

    . Its

    just that this matrix is not a SQUARE MATRIX,

    that is, it is not 2 by 2 or 3 by 3. Instead it is 3 by

    2.

    We shall say that a linear transformation is a 2-

    dimensional L.T. if it eats 2-dimensional vectors AND

    produces 2-dimensional vectors. A 2-dimensional

    L.T. has a square, 2 by 2 matrix relative to i, j. Sim-

    ilarly a 3-dimensional linear transformation has a 3

    by 3 matrix. In Engineering applications, most lin-

    6

  • 8/11/2019 MA1506CHAP6

    7/58

    ear transformations are 2-dimensional or 3-dimensional,

    so we are mainly interested in these two cases.

    EXAMPLE: Suppose T is a linear transformation

    that eats 3-dimensional vectors and produces

    2-dimensional vectors according to the rule Ti= 2i,

    Tj = i + j, Tk= i

    j. What is its matrix?

    Answer:

    2 1 10 1 1

    , a 2 by 3 matrix.

    EXAMPLE: Suppose, in solid mechanics, you take

    a flat square of rubber and SHEAR it, as shown.

    In other words, you dontchange its volume,

    you just push it like a pack

    of cards. The base stays fixed but the top moves

    a distance tan(). (The height remains the same, 1unit.) Clearly the shearing transformationSsatisfies

    Si = i, Sj = i tan +j, so the matrix ofS relative

    to i, j is

    1 tan 0 1

    .

    7

  • 8/11/2019 MA1506CHAP6

    8/58

    EXAMPLE: Suppose T i = i+j and Tj = i+

    j. Matrix is

    1 11 1

    and basic box is SQUASHED

    FLAT!

    EXAMPLE: Rotations in the plane. Suppose you

    ROTATE the whole plane through an angle (anti-

    clockwise). Then simple trigonometry shows you

    that

    Ri= cos i + sin j

    Rj = sin i + cos jSo the rotation matrix is

    R() =

    cos sin sin cos

    .

    Application: Suppose an object is moving on a cir-

    cle at constant angular speed . What is its accel-

    8

  • 8/11/2019 MA1506CHAP6

    9/58

    eration?

    Answer: Let its position vector at t = 0 b e

    r0.Because the object is moving on a circle, its position

    at a later time t is given by rotatingr0 by an angle

    (t). So

    r (t) =

    cos sin sin cos

    r0

    Differentiate

    d

    rdt

    =

    sin cos cos

    sin

    r0by the chain rule. Here

    is actually , so

    d

    rdt

    =

    sin cos cos sin

    r0. Differentiate again,

    d2

    rdt2

    = cos sin sin cos

    2r0

    = 2

    cos sin sin cos

    r0

    .

    9

  • 8/11/2019 MA1506CHAP6

    10/58

    Substitute the equation for

    r (t),

    d

    2

    rdt2 = 2

    r ,

    which is formula you know from physics.

    6.3. COMPOSITE TRANSFORMATIONS

    AND MATRIX MULTIPLICATION.

    You know what it means to take the COMPOSITE

    of two functions: if f(u) = sin(u), and u(x) = x2,

    then f

    umeans: please do uFIRST, THEN f, so

    f u(x) = sin(x2). NOTE THE ORDER!!u f(x) = sin2(x), NOT the same!

    Similarly ifAand Bare linear transformations, then

    AB means do B FIRST, then A.

    NOTE: BE CAREFUL! According to our definition,

    AandB both eat vectors and both produce vectors.

    But then you have to take care that Acan eat what

    10

  • 8/11/2019 MA1506CHAP6

    11/58

    B produces!

    EXAMPLE: Suppose A eats and produces 2-dimensionalvectors, andB eats and produces 3-dimensional vec-

    tors. Then AB would not make sense!

    EXAMPLE: Suppose B eats 2-d vectors and pro-

    duces 3-d vectors (so its matrix relative to ijk looks

    like this:

    b11 b12b21 b22

    b31 b32

    , a 3 by 2 matrix) and suppose

    A eats 3-d vectors and produces 2-d vectors. Then

    AB DOES make sense, because A can eat what B

    produces. (In this case, BA also makes sense.).

    IMPORTANT FACT: Suppose aij is the matrix

    of a linear transformationArelative to ijk, and sup-

    pose bij is the matrix of the Linear Transformation

    B relative to ijk. Suppose that AB makes sense.

    Then the matrix of AB relative to ij or ijk is just

    11

  • 8/11/2019 MA1506CHAP6

    12/58

    the matrix product ofaij and bij .

    EXAMPLE: What happens to the vector 12 if

    we shear 45 parallel to the x axis and then rotate

    90 anticlockwise? What if we do the same in the

    reverse order?

    Answer: Shear

    1 tan 0 1

    so in this case it is

    1 10 1

    . A rotation through

    has matrix

    cos sin sin cos

    , so here it is

    0 11 0

    Hence

    SHEAR, THEN ROTATE

    0

    1

    1 0 1 1

    0 1

    = 0

    11 1

    ROTATE, THEN SHEAR

    1 10 1

    0 11 0

    =

    1 11 0

    .

    12

  • 8/11/2019 MA1506CHAP6

    13/58

    So shear, then rotate

    1

    2

    0 11 1

    1

    2=

    23 .

    Rotate, then shear12

    1 11 0

    12

    =

    11

    Very different!

    EXAMPLE: Suppose B is a LT with matrix 1 00 1

    1 1

    andAis a LT with matrix

    0 1 11 1 0

    .

    What is the matrix ofAB? OfBA?

    Answer: 0 1 11 1 0

    1 00 11 1

    =

    1 01 1

    =AB

    2 by 3 3 by 2 2 by 2 1 00 1

    1 1

    0 1 11 1 0

    =

    0 1 11 1 0

    1 0 1

    3 by 2 2 by 3 3 by 3

    13

  • 8/11/2019 MA1506CHAP6

    14/58

    EXAMPLE: Suppose you take a piece of rubber in

    2 dimensions and shear it parallel to the x axis by

    degrees, and then shear it again by degrees. What

    happens?

    1 tan 0 1

    1 tan 0 1

    = 1 tan + tan

    0 1

    which is also a shear, but NOT through + !

    The shear angles dont add up, since tan +tan =tan(+ ).

    EXAMPLE: Rotate 90 aroundz-axis, then rotate

    90 around x-axis in 3 dimensions. [Always anti-

    clockwise unless otherwise stated.] Is it the same

    if we reverse the order? Rotate about z axisi becomes j, j becomesi, k stays the same, so 0 1 01 0 0

    0 0 1

    . Rotate aboutxaxis,istays the same,

    14

  • 8/11/2019 MA1506CHAP6

    15/58

    j becomes k, k becomesj, so

    1 0 00 0 10 1 0

    , and

    1 0 00 0 1

    0 1 0

    0 1 01 0 0

    0 0 1

    = 0

    1 0

    1 0 00 0 1

    1 0 00 0 10 1 0

    ,

    so the answer is NO!

    6.4 DETERMINANTS

    You probably know that the AREA of the box de-

    fined by two vectors is

    |

    u

    v |, magnitudeof the vector product.

    If you dont know it, you can easily check it, since

    the area of any parallelogram is given by

    15

  • 8/11/2019 MA1506CHAP6

    16/58

    AREA=HEIGHT Base= |

    v

    |sin |

    u

    |= |u | |v | sin = |u v |.

    Similarly, the VOLUME of a three-dimensional par-

    allelogram [called a PARALLELOPIPED!] is given

    by

    VOLUME = (AREA OF BASE) HEIGHT.

    If you take any 3 vectors in 3 dimensions, sayu ,

    v ,

    w,

    then they define a 3-dimensional parallelogram. The

    area of the base is|u v |,height is

    |w

    | |sin

    2 |

    where is the angle between

    u v and w, so VOLUMEdefined by

    u ,

    v ,

    w is just

    16

  • 8/11/2019 MA1506CHAP6

    17/58

    |u v | |w| | sin

    2

    |

    =

    |u

    v

    | |w

    | |cos

    |=|u v w|.

    [Check: Volume of Basic Box defined by ijk is

    |i j k| = |k k| = 1, correct!

    Now let T be any linear transformation in two di-

    mensions. [This means that it acts on vectors in the

    xyplane and turns them into other vectors in the xy

    plane.]

    We let Tact on the Basic Box, as usual.

    Now Ti and Tj still lie in the same plane as i and

    j, s o (Ti) (Tj) must be perpendicular to thatplane. Hence it must be some multiple of k. We

    17

  • 8/11/2019 MA1506CHAP6

    18/58

    define the DETERMINANT ofT to be that multi-

    ple, that is, by definition, det(T) is the number given

    [STRICTLY IN 2 DIMENSIONS] by

    (Ti) (Tj) = det(T)k.

    EXAMPLE: IfI= identity, then

    Ii I

    j =

    i

    j =

    k= 1

    k

    so det(I) = 1.

    EXAMPLE: Du = 2

    u

    Di

    Dj = 4i

    j = 4k

    det(D) = 4

    EXAMPLE: Ti= i + 14j, Tj = 14 i +

    j,

    Ti Tj =

    i +1

    4j

    1

    4i + j

    = i j+ 1

    16j i

    = 1516i j = 1516 k det T = 1516 .

    EXAMPLE: Ti=j, Tj = i,

    Ti Tj =j i= k det T = 1

    18

  • 8/11/2019 MA1506CHAP6

    19/58

    EXAMPLE: Shear, Si= i, Sj = i tan + j,

    Si Sj =k det S= 1.

    EXAMPLE: Ti= i + j =Tj,

    Ti Tj = 0 det T = 0.

    EXAMPLE: Rotation

    Ri Rj = (cos i + sin j) ( sin i + cos j)= (cos2 sin2 )k=k det(R) = 1.

    The area of the Basic Box is initially|i j| = 1.After we let Tact on it, the area becomes

    |Ti T

    j| = |det T| |

    k| = | det T|.

    So

    Final Area of Basic Box

    Initial Area of Basic Box =

    | det T|1

    = | det T|

    19

  • 8/11/2019 MA1506CHAP6

    20/58

    so | det T| TELLS YOU THE AMOUNT BY WHICHAREAS ARE CHANGED BY T. So det T =

    1

    means that the area is UNCHANGED (Shears, ro-

    tations, reflections) while det T= 0 means that the

    Basic Box is squashed FLAT, zero area.

    Take a general 2 by 2 matrix M =

    a bc d

    . We

    know that this means Mi = ai+cj, Mj = bi+dj.

    Hence Mi Mj =

    ai + cj

    bi + dj

    = (ad bc)k, so

    det

    a bc d

    =ad bc.

    Check: det

    2 00 2

    = 4, det

    1 tan 0 1

    = 1,

    det

    cos sin sin cos

    = 1, det

    1 11 1

    = 0.

    IN THREE dimensions there is a similar gadget.

    20

  • 8/11/2019 MA1506CHAP6

    21/58

    The Basic Box is defined by ijk, and we can let any

    3-dimensional L.T. act on it, to get a new box de-

    fined by Ti, Tj, Tk. We define

    det T =

    Ti

    Tj

    Tk

    where the dot is the scalar product, as usual. Since

    |TiTj Tk| is the volume of the 3-dimensionalparallelogram defined by Ti, Tj, Tk, we see that

    | det T| = Final Volume of Basic BoxInitial Volume of Basic Box

    ,

    that is,|

    det T| tells you how much T changes vol-

    umes. IfTsquashes the Basic Box flat, then

    det T = 0.

    Just as det a b

    c d = ad

    bc, there is a formula

    for the determinant of a 3 by 3 matrix. The usual

    notation is this. We DEFINE

    a b

    c d

    = det

    a b

    c d

    =ad bc.

    21

  • 8/11/2019 MA1506CHAP6

    22/58

    Similarly

    a11 a12 a13a21 a22 a23a31 a32 a33

    is the determinant of

    a11 a12 a13a21 a22 a23

    a31 a32 a33

    and there is a formula for it, as

    follows:

    a11 a12 a13a21 a22 a23a31 a32 a33

    =a11

    a22 a23a32 a33 a12

    a21 a23a31 a33+ a13

    a21 a22a31 a32 .

    In other words, we can compute a three-dimensional

    determinant if we know how to work out 2-dimensional

    determinants.

    COMMENTS:

    [a] We worked along the top row. Actually, a THE-

    OREM says that you can use ANY ROW OR ANYCOLUMN!

    [b] How did I know that a12had to multiply the par-

    ticular 2-dimensional determinant

    a21 a23a31 a33

    ? Easy:

    22

  • 8/11/2019 MA1506CHAP6

    23/58

    I just struck out EVERYTHING IN THE SAME

    ROW AND COLUMN as a12 :

    a21

    a23

    a31 a33

    and

    just kept the survivors!

    This is the pattern, for example if you expand along

    the second row you will get

    a21

    a12 a13 a32 a33

    + a22

    a11 a13

    a31 a33

    a23

    a11 a12 a31 a32

    [c] What is the pattern of the + and signs? It is

    an (Ang Moh ) CHESSBOARD, starting with a +

    in the top left corner:+ + + + +

    [d] You can do exactly the same thing in FOUR di-

    23

  • 8/11/2019 MA1506CHAP6

    24/58

    mensions, following this pattern, using

    + + + ++

    +

    + +because now you know how to work out 3-dimensional

    determinants. And so on!

    Example:

    1 1 01 1 12 0 0

    = 1 1 10 0

    1 1 12 0

    + 0 1 12 0

    = 0 + 2 + 0 = 2

    (expanding along the top row) or, if you use the sec-

    ond row,

    1 1 01 1 12 0 0

    = 1

    1 00 0+ 1

    1 02 0 1

    1 12 0

    = 0 + 0 + 2 = 2

    24

  • 8/11/2019 MA1506CHAP6

    25/58

    or

    1

    1 0

    1 1 12 0 0

    = 1

    1 10 0 1

    1 00 0+ 2

    1 01 1

    = 0 + 0 + 2 = 2

    (expanding down the first column).

    Important Properties of Determinants

    [a] Let S and Tbe two linear transformations such

    that det Sand det T are defined. Then

    det ST= det T S= (det S) (det T).

    Therefore, det[ST U] = det[U ST] = det[T U S] and

    so on: det doesnt care about the order. Remember

    however that this DOES NOT mean that ST U =

    U ST etc etc.

    25

  • 8/11/2019 MA1506CHAP6

    26/58

    [b] IfM is a square matrix, then

    det MT = det M.

    [c] Ifcis a number and M is an nbynmatrix, then

    det(cM) =cn det M.

    EXAMPLE: Remember from Section 2[g] of Chap-

    ter 5 that an ORTHOGONAL matrix satisfies M MT =

    I. So det(M MT) = det I = 1. But det(M MT) =

    det(M)det(MT) = det(M) det(M) = (det M)2,thus

    det M= 1

    for any orthogonal matrix.

    6.5. INVERSES.

    If I give you a 3-dimensional vector

    u and a 3-

    dimensional linear transformation T, then T sends

    26

  • 8/11/2019 MA1506CHAP6

    27/58

    u to a particular vector, it never sends

    u to two

    DIFFERENT VECTORS! So this picture is impos-

    sible:

    Tu

    u

    Tu

    But what about this picture:u

    Tu =T

    v

    v

    Can T send TWO DIFFERENT VECTORS TO

    ONE? Yes!

    1 0 00 0 00 0 0

    010

    = 000

    = 1 0 00 0 00 0 0

    001

    So it can happen! Notice that this transformation

    destroys j (and also k). In fact if

    u = v and

    27

  • 8/11/2019 MA1506CHAP6

    28/58

    Tu = T

    v , then T(

    u v ) = 0, that is, Tw = 0

    wherew IS NOT THE ZERO VECTOR. So if this

    happens, T destroys everything in the w direction.

    That is, T SQUASHES 3-dimensional space down

    to two or even less dimensions. This means that

    TLOSES INFORMATION it throws away all ofthe information stored in the w direction. ClearlyT

    squashes the basic box down to zero volume, so

    det T = 0

    and we say T is SINGULAR.

    SUMMARY: A SINGULAR LINEAR TRANSFOR-

    MATION

    [a] Maps two different vectors to one vector

    [b] Destroys all of the vectors in at least one direction

    [c] Loses all information associated with those direc-

    tions

    [d] Satisfies det T = 0.

    28

  • 8/11/2019 MA1506CHAP6

    29/58

    Conversely, a NON-SINGULAR transformation never

    maps 2 vectors to one,

    u Tuv Tv

    Different

    Therefore if I give you Tu , THERE IS EXACTLY

    ONEu . The transformation that takes you fromT

    u

    back tou is called the INVERSE OF T. The idea is

    that since a NON-SINGULAR linear transformation

    does NOT destroy information, we can re-constructu if we are givenT

    u . ClearlyTHAS AN INVERSE,

    CALLED T1, if and only if det T= 0. All thisworks in other dimensions too.

    EXAMPLE:

    1 11 1

    34

    =

    77

    ,

    1 11 1

    43

    =

    77

    ,

    29

  • 8/11/2019 MA1506CHAP6

    30/58

    two different vectors to one!

    1 11 1

    11 = 1 1

    1 1 22

    =

    1 11 1

    13.5913.59

    =

    00

    It destroys everything

    in that direction!Finally det

    1 11 1

    = 0.

    So it is SINGULAR and

    has NO inverse.

    11

    EXAMPLE: Take

    0 11 0

    and suppose it acts on

    and

    a

    b

    and sends them to the same vector,

    so

    0 11 0

    = 0 11 0

    a

    b

    .

    Then

    =

    ba

    =a

    =b

    =

    a

    b

    30

  • 8/11/2019 MA1506CHAP6

    31/58

    so

    and

    a

    b

    are the same this transforma-

    tion never maps different vectors to the same vector.

    No vector is destroyed, no information is lost, noth-

    ing gets squashed! And det

    0 11 0

    = 1, NON-

    SINGULAR.

    How to FIND THE INVERSE.

    By definition, T1 sends Tu to

    u , i.e.

    T1(T(u )) =

    u =T(T1(

    u )).

    Butu =I

    u (identity) so T1 satisfies

    T1T =T T1 =I.

    So to find the inverse of 0 11 0 we just have to

    find a matrix

    a b

    c d

    such that

    a b

    c d

    0 11 0

    =

    1 00 1

    , b= 1, a= 0, d= 0, c= 1 so answer is

    31

  • 8/11/2019 MA1506CHAP6

    32/58

    0 11 0

    . In fact its easy to show that

    a bc d

    1

    = 1ad bc

    d bc a

    .

    For example, when we needed to find the matrix S

    in Section 4 of Chapter 5, we needed to find a way

    of solvingS

    0.7 0.40.5 0.7

    =I.

    This just means that we need to inverse of

    0.7 0.40.5 0.7

    ,

    and the above formula does the job for us.

    For bigger square matrices there are many tricks

    for finding inverses. A general [BUT NOT VERY

    PRACTICAL] method is as follows:

    [a] Work out the matrix of COFACTORS. [A cofac-

    tor is what you get when you work out the smaller

    determinant obtained by striking out a row and a

    column, for example the cofactor of 6 in

    1 2 34 5 67 8 9

    32

  • 8/11/2019 MA1506CHAP6

    33/58

    is

    1 27 8

    =6. You can do this for each elementin a given matrix, to obtain a new matrix of the

    same size. For example, the matrix of cofactors of 1 0 10 1 0

    0 0 1

    is

    1 0 00 1 0

    1 0 1

    .

    [b] Keep or reverse the signs of every element accord-

    ing to+ + + + +

    (you get

    1 0 00 1 0

    1 0 1

    above.)

    [c] Take the TRANSPOSE,

    1 0 10 1 0

    0 0 1

    .

    [d] Divide by the determinant of the original ma-

    trix. THE RESULT IS THE DESIRED INVERSE.

    1 0 10 1 00 0 1

    in this example. Check:

    1 0 10 1 0

    0 0 1

    1 0 10 1 0

    0 0 1

    =

    1 0 00 1 0

    0 0 1

    .

    33

  • 8/11/2019 MA1506CHAP6

    34/58

    INVERSE OF A PRODUCT:

    (AB)1 =B1A1

    Note the order! Easily checked:

    (AB)1

    AB =B1

    (A1

    A)B=B1

    IB =I.

    APPLICATION: SOLVING LINEAR SYSTEMS.

    Suppose you want to solve

    x + 2y+ 3z = 14x + 5y+ 6z = 27x + 8y+ 9z= 4.

    One way is to write it as

    1 2 34 5 6

    7 8 9

    xy

    z

    =

    12

    4

    .

    34

  • 8/11/2019 MA1506CHAP6

    35/58

    Then all you have to do is find

    1 2 34 5 6

    7 8 9

    1

    and

    multiply it on both sides, so you getxy

    z

    =

    1 2 34 5 6

    7 8 9

    1 12

    4

    = answer.

    So this is a systematic way of solving such problems!

    Now actually det

    1 2 34 5 6

    7 8 9

    = 0, and you can see

    why:

    1 2 34 5 6

    7 8 9

    1

    21

    =

    00

    0

    .So this transformation destroys everything in the di-

    rection of

    12

    1

    . In fact it squashes 3-dimensional

    space down to a certain 2-dimensional space. [We

    say that the matrix has RANK 2. If it had squashed

    everything down to a 1-dimensional space, we would

    say that it had RANK 1.] Now actually

    12

    4

    . DOES

    35

  • 8/11/2019 MA1506CHAP6

    36/58

    NOT lie in that two-dimensional space. Since

    1 2 34 5 67 8 9

    squashes EVERYTHING into that two-dimensional

    space, it is IMPOSSIBLE for

    1 2 34 5 6

    7 8 9

    xy

    z

    to be

    equal to

    124

    . Hence the system has NO solutions.

    If we change

    12

    4

    to

    12

    3

    , this vector DOES lie

    in the special 2-dimensional space, and the system

    1 2 34 5 67 8 9

    xyz

    = 123

    DOES have a solutionin fact it has infinitely many!

    SUMMARY:

    Any system of linear equations can be written as

    Mr =

    a

    where M is a matrix,

    r = the vector of variables,

    36

  • 8/11/2019 MA1506CHAP6

    37/58

    anda is a given vector. Suppose M is square.

    [a] If det M= 0, there is exactly one solution,

    r =M1

    a .

    [b] If det M= 0, there is probably no solution. But

    if there is one, then there will be many.

    PRACTICAL ENGINEERING PERSPECTIVE:

    In the REAL world, NOTHING IS EVER EXACTLY

    EQUAL TO ZERO! So if det M= 0, either [a] you

    have made a mistake, OR [b] you are pretending that

    your data are more accurate than they really are!

    1 2 34 5 67 8 9

    REALLY means

    1.01 2.08 3.033.99 4.97 6.027.01 7.96 8.98

    and of course the determinant of THIS is non-zero!

    Actually, det = 0.597835!

    37

  • 8/11/2019 MA1506CHAP6

    38/58

    6.6 EIGENVECTORS AND EIGENVALUES.

    Remember we said that a linear transformation USU-

    ALLY changes the direction of a vector. But there

    may be some special vectors which DONT have their

    direction changed!

    EXAMPLE:

    1 22 2

    clearly DOES change the

    direction of i and j, since

    12

    is not parallel to i

    and 22 is not parallel toj. BUT

    1 22 2

    21

    =

    42

    = 2

    21

    which IS parallel to 21.

    In general if a transformationTdoes not change the

    direction of a vectoru , that is

    Tu =

    u

    38

  • 8/11/2019 MA1506CHAP6

    39/58

    for some (SCALAR), thenu is called an EIGEN-

    VECTOR of T. The scalar is called the EIGEN-

    VALUE ofu .

    6.7 FINDING EIGENVALUES AND EIGEN-

    VECTORS.

    There is a systematic way of doing this. Take the

    equation

    Tu =

    u

    and write u =Iu , I = identity. Then

    (T I)u = 0

    Lets supposeu= 0[of course, 0is always an eigen-

    vector, that is boring]. So the equation says that

    T I SQUASHES everything in the u direction.Hence

    det(T I) = 0.

    39

  • 8/11/2019 MA1506CHAP6

    40/58

    This is an equation which can be SOLVED to find

    .

    EXAMPLE: Find the eigenvalues of

    1 22 2

    :

    det

    1 22 2

    1 00 1

    = 0

    det 1

    2

    2 2 = 0 (1 )(2 + ) 4 = 0 = 2 OR 3

    So there are TWO answers for a 2 by 2 matrix. Sim-

    ilarly, in general there are three answers for 3 by 3

    matrices, etc.

    What are the eigenvectors for = 2, = 3?

    IMPORTANT POINT: Let

    u be an eigenvectorofT. Then 2

    u is also an eigenvector with the same

    eigenvalue!

    T(2u ) = 2T

    u = 2

    u = (2u ).

    40

  • 8/11/2019 MA1506CHAP6

    41/58

    Similarly 3u , 13.59

    u etc are all eigenvectors! SO

    YOU MUST NOT EXPECT A UNIQUE ANSWER!

    OK, with that in mind, lets find an eigenvector for

    = 2. Lets call an eigenvector

    . Then

    (T I)u = 0

    1 22 2

    = 0

    1 2

    2 4

    = 0

    + 2= 0

    2 4= 0But these equations are actually the SAME, so we

    really only have ONE equation for 2 unknowns. We

    arent surprised, because we did not expect a unique

    answer anyway! We can just CHOOSE = 1 (or13.59 or whatever) and then solve for . Clearly

    = 12 , so an eigenvector corresponding to = 2

    is

    112

    . But if you said

    21

    or

    10050

    that is also

    41

  • 8/11/2019 MA1506CHAP6

    42/58

    correct!

    What about =

    3?

    4 22 1

    = 0

    4 + 2= 0

    2 + = 0

    Again we can set = 1, then =2, so an eigen-vector corresponding to =3 is

    12

    or

    24

    or 10

    20 etc.

    EXAMPLE: Find the eigenvalues, and correspond-

    ing eigenvectors, of

    0 11 0

    .

    Answer: We have det

    11

    = 02 + 1 =

    0 = i, i= 1.Eigenvector for i: we set

    i 11 i

    1

    = 0

    42

  • 8/11/2019 MA1506CHAP6

    43/58

    i = 0=i so an eigenvector for i is 1

    i

    . For =

    i we have

    i 11 i

    1

    = 0

    i = 0 = i so an eigenvector foriis 1

    i. Note that a REAL matrix can have COM-

    PLEX eigenvalues and eigenvectors! This is hap-

    pening simply because

    0 11 0

    is a ROTATION

    through 90, and of course such a transformation

    leaves NO [real] vectors direction unchanged (apart

    from the zero vector).

    6.8. DIAGONAL FORM OF A LINEAR TRANS-

    FORMATION.

    Remember that we defined the matrix of a linear

    transformation T WITH RESPECT TO i, j by let-

    tingTact on iandj and then putting the results in

    43

  • 8/11/2019 MA1506CHAP6

    44/58

    the columns. So to say that T has matrix

    a b

    c d

    with respect to i, j means that

    Ti=ai + cj

    Tj =bi + dj.

    Whats so special about the two vectors i and j?

    Nothing, except that EVERY vector in two dimen-

    sions can be written as i + j for some , .

    Now actually we only really use iandj for CONVE-

    NIENCE. In fact, we can do this with ANY pair of

    vectors u , v in two dimensions,

    PROVIDED that they are not parallel.

    That is, any vectorw can be

    expressed as

    w =u + v

    for some scalars , . You can see this from the

    diagramby stretching u to u and v to v , wecan make their sum equal to

    w.

    44

  • 8/11/2019 MA1506CHAP6

    45/58

    We callu ,

    v a BASIS for 2-dimensional vectors. Let

    u =P11i + P21j = P11

    P21

    v =P12i + P22j =

    P12P22

    Then the transformation that takes

    i, j

    to (u ,

    v )

    has matrixP11 P12

    P21 P22

    =P. In order for

    u ,

    v to be

    a basis, Pmust not squash the volume of the Basic

    Box down to zero, since otherwiseu and

    v will be

    parallel. So we must have

    det P= 0.

    The same idea works in 3 dimensions: ANY set of

    3 vectors forms a basis PROVIDED that the matrix

    of components satisfies det P

    = 0.

    EXAMPLE: The pair of vectors

    u =

    10

    ,v =

    11

    forms a basis, because det

    1 10 1

    = 1 = 0.

    45

  • 8/11/2019 MA1506CHAP6

    46/58

    Now of course the COMPONENTS of a vector will

    change if you choose a different basis. For example,12

    = 1i + 2j BUT

    12

    = 1u + 2v .

    Instead,

    12

    = u+ 2v , so the components of this

    vector relative tou ,

    v are

    12

    (

    u,

    v )

    . Where did I

    get these numbers?

    As usual, set u = Pi, v = Pj where P =

    1 10 1

    .

    We want to find , such that

    12

    =

    u +

    v .

    Substitutingu = Pi,

    v = Pj into this equation we

    get

    12

    =Pi + Pj =P[i + j] =P

    .

    46

  • 8/11/2019 MA1506CHAP6

    47/58

    We knowPis not singular, so we can take Pover to

    the left side by multiplying both sides of this equa-

    tion by the inverse ofP. So we get

    =P1

    12

    and this is our answer: this is how we find

    and

    !So to get and we just have to work out

    P1

    12

    =

    1 10 1

    12

    =

    12

    ,

    that is, the components of this vector relative to u , v

    are found as

    12

    (

    u,

    v )

    =P1

    12

    (i,j)

    THE COMPONENTS RELATIVE TO

    u ,v ARE

    OBTAINED BY MULTIPLYING P1 INTO THE

    COMPONENTS RELATIVE TO i, j. Similarly for

    47

  • 8/11/2019 MA1506CHAP6

    48/58

    linear transformationsif a certain linear transfor-mation T has matrix

    1 20

    1

    ij

    relative to i, j it

    will have a DIFFERENT matrix relative to

    u ,v .

    We have

    1 20 1

    (i,j)

    12

    (i,j)

    =

    52

    (i,j)

    That is, the matrix ofTrelative to i, j sends

    12

    (ij)

    to

    52

    (ij)

    . In the same way, the matrix ofT rela-

    tive to (u ,

    v ), which we dont know and want to find,

    sends

    12

    (

    u,

    v )

    to

    72

    (

    u,

    v )

    , because these are

    the components of these two vectors relative to u , v ,

    as you can show by multiplyingP1 into

    12

    (ij)

    and

    52

    (ij)

    respectively.

    So the unknown matrix we want satisfies

    ? ?? ?

    (

    u,

    v )

    12

    (

    u,

    v )

    =

    72

    (

    u,

    v )

    48

  • 8/11/2019 MA1506CHAP6

    49/58

    But we know

    1

    2(

    u,

    v ) =P1 1

    2(i,j) and

    72

    (

    u,

    v )

    =P1

    52

    (i,j)

    so

    ? ?? ?

    (

    u,

    v )

    P1

    12

    (ij)

    =P1

    52

    (ij)

    .

    Multiply both sides by P and get

    P ? ?

    ? ?(

    u,

    v )P1 1

    2(ij)

    = 52

    (ij)

    Compare this with

    1 20

    1

    (i,j)

    12

    (i,j)

    =

    5

    2

    (i,j)

    1 20 1

    (i,j)

    =P

    ? ?? ?

    (

    u,

    v )

    P1

    ? ?? ?

    (

    u,

    v )

    =P1

    1 20 1

    (i,j)

    P.

    49

  • 8/11/2019 MA1506CHAP6

    50/58

    [In the last step, we multiplied both sides on the

    LEFT by P1, and on the RIGHT by P.]

    We conclude that THE MATRIX OF T REL-

    ATIVE TOu ,

    v , IS OBTAINED BY MULTIPLY-

    INGP1 ON THE LEFT ANDPON THE RIGHT

    INTO THE MATRIX OF TRELATIVE TO i, j.

    In this example,

    ? ?? ?

    (

    u,

    v )

    =

    1 10 1

    1 20 1

    1 10 1

    = 1 1

    0 1 1 3

    0 1 = 1 4

    0 1 .

    So now we know how to work out the matrix of any

    linear transformation relative to ANY basis.

    Now let Tbe a linear transformation in 2 dimensions,

    with eigenvectors e1,

    e2, eigenvalues 1, 2. Now

    e1

    ande2may or may not give a basis for 2-dimensional

    space. But suppose they do.

    50

  • 8/11/2019 MA1506CHAP6

    51/58

    QUESTION: What is the matrix of T relative toe1,

    e2?

    ANSWER: As always, we see what T does to e1

    ande2, and put the results into the columns!

    By definition of eigenvectors and eigenvalues,

    Te1=1

    e1 =1

    e1+ 0

    e2

    Te2=2

    e2 = 0

    e1+ 2

    e2

    So the matrix is

    1 00 2

    (

    e1,

    e2)

    .

    We say that a matrix of the form

    a 00 d

    or

    0 00 00 0

    is DIAGONAL. So we see that THE MATRIX OF

    A TRANSFORMATION RELATIVE TO ITS OWN

    EIGENVECTORS (assuming that these form a ba-sis) is DIAGONAL.

    EXAMPLE: We know that the eigenvectors of1 22 2

    are

    112

    and

    12

    . So here P =

    1 112 2

    ,

    51

  • 8/11/2019 MA1506CHAP6

    52/58

    P1 = 25 2 1

    12 1

    ,

    P1

    1 22 2

    P = 2

    5

    2 1 12 1

    1 22 2

    1 112 2

    = 25

    2 1 12 1

    2 31 6

    = 2

    5

    5 0

    0 152

    =

    2 00 3

    as expected since the

    eigenvalues are 2 and -3.

    EXAMPLE: The shear matrix

    1 tan 0 1

    .

    Eigenvalues: det

    1 tan

    0 1

    = 0 (1)2 =

    0

    = 1. Only one eigenvector, namely

    1

    0, so

    the eigenvectors DO NOT give us a basis in this case

    NOT possible to diagonalize this matrix!

    EXAMPLE: Suppose you want to do a reflection

    52

  • 8/11/2019 MA1506CHAP6

    53/58

    of the entire 2-dimensional plane around a straight

    line that passes through the origin and makes an

    angle of with the x-axis. Then the vector

    cossin

    lies along this line, so it is left unchanged by the

    reflection; in other words, it is an eigenvector of

    with eigenvalue 1. On the other hand, the vector sincos is perpendicular to the first vector [checkthat their scalar product is zero] so it is reflected into

    its own negative by . That is, it is an eigenvector

    with eigenvalue 1.

    So has a matrix with these eigenvectors and

    eigenvalues. The P matrix in this case is

    P =

    cos sin sin cos

    ,

    and clearly P1 =

    cos sin sin cos

    , and since the

    eigenvalues are1, we have

    =

    cos sin sin cos

    1 00 1

    cos sin sin cos

    .

    53

  • 8/11/2019 MA1506CHAP6

    54/58

    Doing the matrix multiplication and using the trigono-

    metric identities for cos2 and sin2, you will find

    that

    =

    cos2 sin2sin2 cos2

    .

    Notice that the determinant is 1, as is typi-cal for a reflection. Check that this gives the right

    answer for reflections around the 45 degree diagonal

    and around the x-axis.

    6.9 APPLICATION MARKOV CHAINS.We saw back in Section 3 of Chapter 5 that to predict

    the weather 4 days from now, we needed the 4th

    power of the matrix

    0.6 0.30.4 0.7

    .

    But suppose I want the weather 30 days from now

    I need M30! There is an easy way to work this

    54

  • 8/11/2019 MA1506CHAP6

    55/58

    out using eigenvalues.

    Suppose I can diagonalize M, that is, I can write

    P1M P =D=

    1 00 2

    for some matrixP. Then

    M=P DP1

    M2

    = (P DP1

    )(P DP1

    )

    =P DP1P DP1 =P D2P1

    M3 =M M2 =P DP1P D2P1 =P D3P1 etc

    M30 =P D30P1.

    But D30 is very easy to work out it is just

    301 00 302

    .

    Lets see how this works!

    Eigenvectors and eigenvalues of

    0.6 0.30.4 0.7

    are

    1

    1

    (eigenvalue 0.3) and

    143

    (eigenvalue 1) so

    P =

    1 11 43

    , D=

    0.3 00 1

    , P1 =

    47 3737

    37

    55

  • 8/11/2019 MA1506CHAP6

    56/58

    D30 = (0.3)30 0

    0 1

    2

    1016 0

    0 1

    so

    M30 =

    1 11 43

    2 1016 0

    0 1

    47 3737

    37

    = 1

    7

    3 + 8 1016 3 6 10164 8 1016 4 + 6 1016

    37

    37

    47

    47

    So if it is rainy today, the probability of rain tomor-

    row is 60%, but the probability of rain 30 days from

    now is only 3

    7 43%. As we go forward in time, thefact that it rained today becomes less and less im-

    portant! The probability of rain in 31 days is almost

    the same as the probability of rain in 30 days!

    6.10 APPLICATION: THE TRACE OF A

    DIAGONALIZABLE MATRIX IS THE SUM

    OF ITS EIGENVALUES.

    56

  • 8/11/2019 MA1506CHAP6

    57/58

    Let M be any square matrix. Then the TRACE

    of M, denoted T rM, is defined as the sum of the

    diagonal entries: T r

    1 00 1

    = 2, T r

    1 2 34 5 67 8 9

    =

    15, T r

    1 5 167 2 15

    11 9 8

    = 11, etc.

    In general it is NOT true that T r(M N) =TrM T rN

    BUT it is true that T rMN=T rNM.

    Proof: T rM=i

    Mii so

    TrMN=i

    j

    MijNji =j

    i

    NjiMij =T rNM.

    Hence T r(P1AP) = T r(AP P1) = T rA so THE

    TRACE OF A MATRIX IS ALWAYS THE SAME

    NO MATTER WHICH BASIS YOU USE! This iswhy the trace is interesting: it doesnt care which

    basis you use. In particular, if A is diagonalizable,

    T rA =T r

    1 00 2

    =1+ 2.

    57

  • 8/11/2019 MA1506CHAP6

    58/58

    So the trace is equal to the sum of the eigen-

    values. This gives a quick check that you have not

    made a mistake in working out the eigenvalues: they

    have to add up to the same number as the trace of

    the original matrix. Check this for the examples in

    this chapter.