#13 (03/05/2025)

Solving simultaneous equations

A x = c,

(1)

⎛
⎜
⎜
⎜
⎜
⎜
⎝

a₁₁

a₁₂

…

a_1n

a₂₁

a₂₂

…

a_2n

a_n1

a_n2

…

a_nn

⎞
⎟
⎟
⎟
⎟
⎟
⎠

⎛
⎜
⎜
⎜
⎜
⎜
⎝

x₁

x₂

x_n

⎞
⎟
⎟
⎟
⎟
⎟
⎠

⎛
⎜
⎜
⎜
⎜
⎜
⎝

c₁

c₂

c_n

⎞
⎟
⎟
⎟
⎟
⎟
⎠

(2)

⎧
⎪
⎪
⎪
⎨
⎪
⎪
⎪
⎩

a₁₁ x₁ + a₁₂ x₂ + a₁₃ x₃ + …+ a_1nx_n = c₁

a₂₁ x₁ + a₂₂ x₂ + a₂₃ x₃ + …+ a_2nx_n = c₂

a₃₁ x₁ + a₃₂ x₂ + a₃₃ x₃ + …+ a_3nx_n = c₃

……

a_n1 x₁ + a_n2 x₂ + a_n3 x₃ + …+ a_nnx_n = c_n

(3)

For many problems in engineering/science, the size of the matrix A can easily go up to n=1,000,00.

Determinant

2 ×2 matrices

⎢
⎢
⎢
⎢

a₁₁
a₁₂

a₂₁
a₂₂.
⎢
⎢
⎢
⎢ ≡ a₁₁ a₂₂− a₁₂ a₂₁
(4)
3 ×3 matrices

⎢
⎢
⎢
⎢
⎢

a₁₁
a₁₂
a₁₃

a₂₁
a₂₂
a₂₃

a₃₁
a₃₂
a₃₃
⎢
⎢
⎢
⎢
⎢ ≡ a₁₁ a₂₂ a₃₃ + a₁₂ a₂₃ a₃₁+ a₂₁ a₃₂ a₁₃− a₁₃ a₂₂a₃₁−a₁₂ a₂₁ a₃₃ − a₂₃ a₃₂ a₁₁
(5)
The determinant of n ×n (n > 3) matrices can be expressed similar to the above. It consists of n! terms each of which is a product of n variables.

Some properties of determinants

| A B | = | A | | B |
| I | = 1 where I is the identity matrix.
| A |=0 ⇒ A is singular.

Determinant/Cramer's rule

Solving

a₁₁ x₁ + a₁₂ x₂

c₁

a₂₁ x₁ + a₂₂ x₂

c₂

(6)

yields

x₁

c₁ a₂₂−c₂ a₁₂

a₁₁ a₂₂− a₁₂ a₂₁

⎢
⎢
⎢
⎢

c₁

a₁₂

c₂

a₂₂

⎢
⎢
⎢
⎢

a₁₁

a₁₂

a₂₁

a₂₂

⎢
⎢
⎢
⎢

(7)

x₂

c₂ a₁₁−c₁ a₂₁

a₁₁ a₂₂− a₁₂ a₂₁

⎢
⎢
⎢
⎢

a₁₁

c₁

a₂₁

c₂

⎢
⎢
⎢
⎢

a₁₁

a₁₂

a₂₁

a₂₂

⎢
⎢
⎢
⎢

(8)

Similarly, for three simultaneous equations,

a₁₁ x₁ + a₁₂ x₂ + a₁₃ x₃

c₁

a₂₁ x₁ + a₂₂ x₂ + a₂₃ x₃

c₂

a₃₁ x₁ + a₃₂ x₂ + a₃₃ x₃

c₃

(9)

the solution is expressed as

x₁

⎢
⎢
⎢
⎢
⎢

c₁

a₁₂

a₁₃

c₂

a₂₂

a₂₃

c₃

a₃₂

a₃₃

⎢
⎢
⎢
⎢
⎢

a₁₁

a₁₂

a₁₃

a₂₁

a₂₂

a₂₃

a₃₁

a₃₂

a₃₃

⎢
⎢
⎢
⎢
⎢

(10)

x₂

⎢
⎢
⎢
⎢
⎢

a₁₁

c₁

a₁₃

a₂₁

c₂

a₂₃

a₃₁

c₃

a₃₃

⎢
⎢
⎢
⎢
⎢

a₁₁

a₁₂

a₁₃

a₂₁

a₂₂

a₂₃

a₃₁

a₃₂

a₃₃

⎢
⎢
⎢
⎢
⎢

(11)

x₃

⎢
⎢
⎢
⎢
⎢

a₁₁

a₁₂

c₁

a₂₁

a₂₂

c₂

a₃₁

a₃₂

c₃

⎢
⎢
⎢
⎢
⎢

a₁₁

a₁₂

a₁₃

a₂₁

a₂₂

a₂₃

a₃₁

a₃₂

a₃₃

⎢
⎢
⎢
⎢
⎢

(12)

The above formulas are called Cramer's rule and in general can be extended to higher order matrices. However, it's not practical to use Cramer's rule for a set of more than 4 simultaneous equations. The number of multiplications required for Cramer's rule for n simultaneous equations is n! (n−1) (n+1). For n=10, this amounts to 3,628,800.

Approximate computational time for n simultaneous equations with a 100 MFLOPS computer (an old PC) using Cramer's rule is estimated as shown in Table .¹

Table 1: Time required for Cramer's rule

n	10	12	14	16	18	20
Time	0.4 sec.	1 min.	3.6 hrs.	41 days	38 years	16,000 years

Gauss-Jordan elimination method

The linearity principle:

a₁₁ x₁ + a₁₂x₂ + …+a_1nx_n

c₁

a₂₁ x₁ + a₂₂x₂ + …+a_2nx_n

c₂,

(13)

(λ₁ a₁₁ + λ₂ a₁₂)x₁+(λ₁ a₂₁ + λ₂ a₂₂)x₂+…+(λ₁ a_1n + λ₂ a_2n)x_n = λ₁ c₁ + λ₂ c₂.

(14)

Example

2x+3y+4z = 6

3x+5y+2z = 5

4x+3y+30z=32.

⎫
⎪
⎬
⎪
⎭

(15)

⇒

1x+0y+0z = ?

0x+1y+0z = ?

0x+0y+1z = ?.

⎫
⎪
⎬
⎪
⎭

(16)


Ref. line number	x	y	z	=	Comment
(1)	2	3	4	6
(2)	3	5	2	5
(3)	4	3	30	32

(4)	1	1.5	2	3	(1)÷2
(5)	3	5	2	5
(6)	4	3	30	32

(7)	1	1.5	2	3
(8)	0	0.5	-4	-4	(5)−(4)×3
(9)	0	-3	22	20	(6)−(4)×4

(10)	1	1.5	2	3
(11)	0	1	-8	-8	(8)÷0.5
(12)	0	-3	22	20

(13)	1	0	14	15	(10)−(11)×1.5
(14)	0	1	-8	-8
(15)	0	0	-2	-4	(12)−(11)×(−3)

(16)	1	0	14	15
(17)	0	1	-8	-8
(18)	0	0	1	2	(15)÷(−2)

(19)	1	0	0	-13	(16)−(18)×14
(20)	0	1	0	8	(17)− (18)×(−8)
(21)	0	0	1	2

Finally we obtain

x = −13, y=8, z=2.

The number of multiplications required for the Gauss-Jordan elimination method is about n³. For n=10, this amounts to 1,000. Compare this number with the one required for Cramer's rule.

Inverse matrix by the Gauss-Jordan elimination method (optional)

Apply the Gauss-Jordan elimination method for

( A | I)

(17)

i.e.

⎛
⎜
⎜
⎜
⎝

⎞
⎟
⎟
⎟
⎠

→

⎛
⎜
⎜
⎜
⎝

2/3

1/2

−4

−3/2

−3

−2

⎞
⎟
⎟
⎟
⎠

→

⎛
⎜
⎜
⎜
⎝

−3

−8

−3

−2

−11

⎞
⎟
⎟
⎟
⎠

→

⎛
⎜
⎜
⎜
⎝

−72

−22

−4

11/2

−3

−1/2

⎞
⎟
⎟
⎟
⎠

(18)

We thus obtain

A⁻¹ =

⎛
⎜
⎜
⎜
⎝

−72

−22

−4

11/2

−3

−1/2

⎞
⎟
⎟
⎟
⎠

(19)

Implementation

#include <stdio.h>
#define N 3
int main()
{
 double a[N][N+1]={{2, 3, 4, 6},{3, 5, 2, 5},{4, 3, 30, 32}};
  double pivot,d;
  int i,j,k;

for(k=0; k<N; k++)
{
 pivot=a[k][k];

 for(j=k; j<N+1; j++) a[k][j]=a[k][j]/pivot;
 for(i=0; i<N;  i++)
  {
    if(i != k)
     {
       d=a[i][k]; 
        for(j=k; j<N+1; j++) a[i][j]=a[i][j]-d*a[k][j];
     }
   }
}

for(i=0; i<N; i++) printf("x[%d]=%lf\n", i+1, a[i][N]);
return 0;
}

LU decomposition (optional)

Any matrix, A, can be uniquely factorized as

A = L U,

(20)

where

L =

⎛
⎜
⎜
⎜
⎜
⎜
⎜
⎝

…

l₂₁

…

l₃₁

l₃₂

…

^··_·

l_n1

l_n2

l_n3

…

⎞
⎟
⎟
⎟
⎟
⎟
⎟
⎠

, U =

⎛
⎜
⎜
⎜
⎜
⎜
⎜
⎝

u₁₁

u₁₂

u₁₃

…

u_1n

u₂₂

u₂₃

…

u_2n

u₃₃

…

u_3n

^··_·

…

u_nn

⎞
⎟
⎟
⎟
⎟
⎟
⎟
⎠

(21)

The matrix, L, is a lower triangular matrix and the matrix, U, is an upper triangular matrix. Note that the diagonal elements of L are set to be 1. This decomposition is called LU decomposition (or LU factorization) and provides an effective way of solving simultaneous equations which is more efficient than the Gauss-Jordan elimination method.

The decomposition above is unique as it is possible to find L and U directly as

⎛
⎜
⎜
⎜
⎜
⎜
⎝

l₂₁

l₃₁

l₃₂

l₄₁

l₄₂

l₄₃

⎞
⎟
⎟
⎟
⎟
⎟
⎠

⎛
⎜
⎜
⎜
⎜
⎜
⎝

u₁₁

u₁₂

u₁₃

u₁₄

u₂₂

u₂₃

u₂₄

u₃₃

u₃₄

u₄₄

⎞
⎟
⎟
⎟
⎟
⎟
⎠

⎛
⎜
⎜
⎜
⎜
⎜
⎝

a₁₁

a₁₂

a₁₃

a₁₄

a₂₁

a₂₂

a₂₃

a₂₄

a₃₁

a₃₂

a₃₃

a₃₄

a₄₁

a₄₂

a₄₃

a₄₄

⎞
⎟
⎟
⎟
⎟
⎟
⎠

(22)

Writing each element explicitly gives

⎧
⎪
⎪
⎨
⎪
⎪
⎩

u₁₁ = a₁₁,

u₁₂=a₁₂,

u₁₃=a₁₃,

u₁₄=a₁₄

l₂₁ u₁₁ = a₂₁,

l₂₁ u₁₂+u₂₂ = a₂₂,

l₂₁ u₁₃+u₂₃ = a₂₃,

l₂₁ u₁₄+u₂₄ = a₂₄

l₃₁u₁₁ = a₃₁,

l₃₁u₁₂+l₃₂u₂₂ = a₃₂,

l₃₁u₁₃+l₃₂u₂₃+u₃₃ = a₃₃,

l₃₁u₁₄+l₃₂u₂₄+u₃₄ = a₃₄

l₄₁u₁₁ = a₄₁,

l₄₁u₁₂+l₄₂u₂₂ = a₄₂,

l₄₁u₁₃+l₄₂u₂₃+l₄₃u₃₃ = a₄₃,

l₄₁u₁₄+l₄₂u₂₄+l₄₃u₃₄+u₄₄ = a₄₄,

(23)

from which one can solve the unknowns, l_ij and u_ij, as

⎧
⎪
⎪
⎪
⎪
⎨
⎪
⎪
⎪
⎪
⎩

u₁₁=a₁₁,

u₁₂=a₁₂,

u₁₃=a₁₃,

u₁₄=a₁₄

l₂₁=

a₂₁

u₁₁

u₂₂ = a₂₂− l₂₁u₁₂,

u₂₃=a₂₃− l₂₁ u₁₃,

u₂₄=a₂₄−l₂₁u₁₄

l₃₁=

a₃₁

u₁₁

l₃₂=

a₃₂− l₃₁u₁₂

u₂₂

u₃₃=a₃₃− l₃₁ u₁₃− l₃₂ u₂₃,

u₃₄=a₃₄− l₃₁ u₁₄− l₃₂ u₂₄

l₄₁=

a₄₁

u₁₁

l₄₂ =

a₄₂−l₄₁u₁₂

u₂₂

l₄₃=

a₄₃−l₄₁u₁₃−l₄₂u₂₃

u₃₃

u₄₄=a₄₄−l₄₁u₁₄−l₄₂u₂₄−l₄₃u₃₄.

(24)

Note that the immediate results for u_ij and l_ij must be used to solve for the next result.

To solve

A x = L U x = c,

(25)

Solve L y = c first.
Then, solve U x = y.

Finding L and U from A requires less effort than finding A⁻¹ and can accomplish the same goal as finding A⁻¹.

⎛
⎜
⎜
⎜
⎜
⎜
⎝

l₂₁

l₃₁

l₃₂

l₄₁

l₄₂

l₄₃

⎞
⎟
⎟
⎟
⎟
⎟
⎠

⎛
⎜
⎜
⎜
⎜
⎜
⎝

y₁

y₂

y₃

y₄

⎞
⎟
⎟
⎟
⎟
⎟
⎠

⎛
⎜
⎜
⎜
⎜
⎜
⎝

c₁

c₂

c₃

c₄

⎞
⎟
⎟
⎟
⎟
⎟
⎠

(26)

This can be solved easily as

⎧
⎪
⎪
⎨
⎪
⎪
⎩

y₁ = c₁

l₂₁ y₁ + y₂ = c₂

l₃₁ y₁ + l₃₂y₂ + y₃ = c₃

l₄₁ y₁ +l₄₂ y₂ + l₄₃y₃ + y₄ = c₄

(27)

from which one obtains

⎧
⎪
⎪
⎨
⎪
⎪
⎩

y₁ = c₁

y₂ = c₂ −l₂₁ y₁

y₃ = c₃− l₃₁ y₁ − l₃₂y₂

y₄ = c₄ − l₄₁ y₁−l₄₂ y₂ − l₄₃y₃

(28)

For U x=y,

⎛
⎜
⎜
⎜
⎜
⎜
⎝

u₁₁

u₁₂

u₁₃

u₁₄

u₂₂

u₂₃

u₂₄

u₃₃

u₃₄

u₄₄

⎞
⎟
⎟
⎟
⎟
⎟
⎠

⎛
⎜
⎜
⎜
⎜
⎜
⎝

x₁

x₂

x₃

x₄

⎞
⎟
⎟
⎟
⎟
⎟
⎠

⎛
⎜
⎜
⎜
⎜
⎜
⎝

y₁

y₂

y₃

y₄

⎞
⎟
⎟
⎟
⎟
⎟
⎠

(29)

First

⎧
⎪
⎪
⎨
⎪
⎪
⎩

u₁₁ x₁ + u₁₂ x₂ + u₁₃ x₃ + u₁₄ x₄ = y₁

u₂₂ x₂+u₂₃x₃ + u₂₄x₄ = y₂

u₃₃ x₃ + u₃₄x₄ = y₃

u₄₄ x₄=y₄

(30)

from which one obtains

⎧
⎪
⎪
⎪
⎪
⎪
⎨
⎪
⎪
⎪
⎪
⎪
⎩

x₄ =

y₄

u₄₄

x₃ =

y₃− u₃₄x₄

u₃₃

x₂ =

y₂−u₂₃x₃ − u₂₄x₄

u₂₂

x₁ =

y₁−u₁₁ x₁ − u₁₂ x₂ − u₁₃ x₃

u₁₁

(31)

This procedure is called backward substitution.

It can be shown that the number of operations (multiplications, divisions) to decompose A into A = LU is about n³/3 and to obtain x from Ly=c and Ux=y, n² is required so the total number of operations is about n³/3 which is 1/3 of the number required for the Gauss-Jordan elimination method.

Notes

If A is a symmetric matrix (a_ij=a_ji), LU-decomposition is called the Cholesky decomposition. The number of operations required for the Cholesky decomposition is n³/6.
If A is a sparse matrix, so are L and U.
If A is a triangular diagonal matrix (typical of the finite element method), so are L and U.

Gauss-Seidel method (Important)

If the diagonal elements of the matrix, A, are generally larger than non-diagonal elements, the linear equation, A x = c, can be solved by an iterative method called the Gauss-Seidel method.

Example:

⎧
⎪
⎨
⎪
⎩

7x+y+2z

x+8y+3z

2x+3y+9z

(32)

⎧
⎪
⎨
⎪
⎩

(10−y−2z)/7

(8−x−3z)/8

(6−2x−3y)/9.

(33)

⎧
⎪
⎨
⎪
⎩

x_n+1

(10−y_n−2z_n)/7

y_n+1

(8−x_n−3z_n)/8

z_n+1

(6−2x_n−3y_n)/9,

(34)

with x₀ = y₀ = z₀ = 0 as the initial approximation. This scheme is also called the Jacobi method. A slightly better iteration scheme is to use the most immediate values for the next approximation, i.e.

⎧
⎪
⎨
⎪
⎩

x_n+1

(10−y_n−2z_n)/7

y_n+1

(8−x_n+1−3z_n)/8

z_n+1

(6−2x_n+1−3y_n+1)/9,

(35)

This scheme is called the Gauss-Seidel iterative method. It is easy to implement this method:

#include <stdio.h>
int main()
{
double x, y, z;
int i,n;

x=y=z=0.0;

printf("Enter # of iteration = ");
scanf("%d", &n);

for (i=0;i<n;i++)
{
x = (10-y-2*z)/7;
y = (8-x-3*z)/8.0;
z = (6-2*x-3*y)/9.0;
}

printf("x = %lf, y= %lf, z=%lf\n", x,y,z);
return 0;
}

Footnotes:

¹Dahmen and Reusken, Numerik für Ingenieure und Naturwissenschaftler, Springer, 2006.

File translated from T_EX by T_TH, version 4.03.
On 04 Mar 2025, 20:24.