3404 General Case Eigenvalues and Eigenvectors
We are now ready to tackle the general case of a linear system:
where the are constants. We will be following exactly the strategy that we laid out in the previous note. These are key concepts for the rest of the unit, and you should take the time to absorb them.
We want to learn to write the system efficiently in matrix form. So, throughout the derivation, we will give the expanded matrix form of our manipulations on the left, and the abridged form on the right. For example, our system is:
Following the method in the previous note, we look for solutions to our system having the form
where , and are unknown constants. We substitute this into the system to determine these unknown constants. Since , we arrive at
We can cancel the factor from both sides, getting
As it stands, we cannot combine the two sides by subtraction, since the scalar cannot be subtracted from the square matrix on the right. As in the previously worked example, the trick is to replace the scalar by the diagonal matrix . This gives
If we now proceed as we did in the example, subtracting the left side from the right one and using the distributive law for matrix addition and multiplication, we get a homogeneous linear system of equations:
Written out without using matrices, the equations are
According to the theorem on square homogeneous systems this system has a non-zero solution for the 's if and only if the determinant of the coefficients is zero, i.e.,
Evaluating the determinant we get a quadratic equation in :
Definition. This is called the characteristic equation of the matrix
and if often denoted . Its roots and $$ are called the eigenvalues or characteristic values of the matrix .
Remark. In calculating the characteristic equation notice that
Using this, the characteristic equation for a matrix can be written as
In this form, the characteristic equation of can be written down by inspection; you don't need the intermediate step of writing down =0.
Remark. Abridged vs. expanded notation
In the manipulations above, the matrix notation on the right is compact to write, which makes the derivation look simpler. On the other hand, its chief disadvantage for beginners is that it is very compressed. Practice writing the sequence of matrix equations so you get some skill in using the notation. Until you acquire some confidence, keep referring to the writtenout form on the left, so you are sure you understand what the abridged form is actually saying.
There are now various cases to consider, according to whether the eigenvalues of the matrix are: 1. two distinct real numbers, 2. a single repeated real number, 3. a pair of conjugate complex numbers.
We begin with the first case: for the rest of this note, the eigenvalues are two distinct real numbers and .
Real distinct eigenvalues
To complete our work, we have to find the solutions to the system corresponding to the eigenvalues and . Formally, the systems become
The solutions to these two systems are column vectors, for which we will typically use .
Definition. The respective solutions and to the systems are called eigenvectors (or characteristic vectors) corresponding to the eigenvalues and .
Remarks.
1. If the work has been done correctly, in each of the two systems in , the two equations will be dependent, i.e., one will be a constant multiple of the other. Why? The two values of have been selected so that in each case the coefficient determinant will be zero, which means the equations will be dependent.
2. The solution is determined only up to an arbitrary non-zero constant factor: if is an eigenvector for , then so is , for any real constant ; because of this, the line through is sometimes called an eigenline. A convenient way of finding the eigenvector is to assign the value 1 to one of the , then use the equation to solve for the corresponding value of the other . (First try ; if that does not work, then will.)
Once the eigenvalues and their corresponding eigenvectors have been found, we have two independent solutions to the system . They are
Definition. In science and engineering applications, these are usually called the normal modes.
Using superposition, the general solution to the system is
Remarks.
1. The normal nodes often have physical interpretations; this means that it is sometimes possible to find them just by inspection of the physical problem.
2. In the compact notation, the definitions and derivations are valid for square systems of any size. Thus, for instance, you know how to solve a system, if its eigenvalues turn out to be real and distinct. We won't consider any such systems in these notes, though.