#06 (09/11/2023)

Jacobian

Question: Consider the polar coordinate system, (r, θ), defined by
x = r cos θ,     y = r sin θ.
It follows that
∂x/∂r = cos θ.
On the other hand, the above relationship can be inverted as
r = √(x² + y²),     θ = arctan(y/x).
Thus,
∂r/∂x = x/√(x² + y²) = (r cos θ)/r = cos θ.
Therefore, it was found that
∂x/∂r = ∂r/∂x,
which looks odd, as you would expect that
∂x/∂r = 1/(∂r/∂x).
So what gives?
Answer: If the transformation involves functions of one variable, i.e., x = x(u), it always follows that
dx/du = 1/(du/dx).    (1)
However, this relationship does not hold for transformations of multi-variable functions. Consider the transformation from (x, y) to (u, v) defined as
x = x(u, v),     y = y(u, v).
(2)
Take the total derivative of Equation (2) to obtain
dx = (∂x/∂u) du + (∂x/∂v) dv,
dy = (∂y/∂u) du + (∂y/∂v) dv,
or
    [ dx ]   [ ∂x/∂u  ∂x/∂v ] [ du ]     [ du ]
    [ dy ] = [ ∂y/∂u  ∂y/∂v ] [ dv ] = J [ dv ],    (3)
where
        [ ∂x/∂u  ∂x/∂v ]
    J ≡ [ ∂y/∂u  ∂y/∂v ]    (4)
is called the Jacobian matrix. As was noted in class, the Jacobian matrix is the generalization of the derivative to multi-variable functions.
The Jacobian matrix is also denoted as
∂(x, y)/∂(u, v).    (5)
Differentiating Equation (2) with respect to x and y, one gets
1 = (∂x/∂u)(∂u/∂x) + (∂x/∂v)(∂v/∂x),
0 = (∂x/∂u)(∂u/∂y) + (∂x/∂v)(∂v/∂y),
0 = (∂y/∂u)(∂u/∂x) + (∂y/∂v)(∂v/∂x),
1 = (∂y/∂u)(∂u/∂y) + (∂y/∂v)(∂v/∂y),
or
    [ 1  0 ]   [ xu  xv ] [ ux  uy ]
    [ 0  1 ] = [ yu  yv ] [ vx  vy ] = [∂(x, y)/∂(u, v)] [∂(u, v)/∂(x, y)].    (6)
Therefore, it follows that
∂(x, y)/∂(u, v) = [∂(u, v)/∂(x, y)]⁻¹.    (7)
This is the relationship equivalent to Equation (1) for two-variable functions.
For the polar coordinate system, Equation (7) is expressed as
    [ cos θ  −r sin θ ]   [  cos θ     sin θ  ]⁻¹
    [ sin θ   r cos θ ] = [ −sin θ/r  cos θ/r ]  .

Examples of Jacobian matrices

  1. Multiple integrals
    In single integrals, integration by substitution (u-substitution) is stated as
    ∫_a^b f(x) dx = ∫_α^β f(x(u)) (dx/du) du.    (8)
    In multiple integrals, the above formula can be extended to
    ∫∫_D f(x, y) dx dy = ∫∫_D′ f(x(u, v), y(u, v)) |∂(x, y)/∂(u, v)| du dv,    (9)
    where the determinant of the Jacobian matrix,
    |J| ≡ |∂(x, y)/∂(u, v)|,
    is called the Jacobian determinant, or simply the Jacobian. The Jacobian is hence interpreted as the scaling factor of an area element from one coordinate system to another.
    (Proof) Consider the area spanned by the two vectors, AB and AC, below:
    [Figure: parallelogram with vertex A and adjacent vertices B and C, spanned by vectors AB and AC]
    When the coordinates of point A are denoted as (x, y), the coordinates of points B and C are expressed as (x + (∂x/∂u) du, y + (∂y/∂u) du) and (x + (∂x/∂v) dv, y + (∂y/∂v) dv), respectively, so the components of the vectors AB and AC are


    AB = ((∂x/∂u) du, (∂y/∂u) du),     AC = ((∂x/∂v) dv, (∂y/∂v) dv).
    Therefore, the area of the parallelogram spanned by the two vectors is¹
         | ∂x/∂u  ∂x/∂v |
    dS = | ∂y/∂u  ∂y/∂v | du dv = |∂(x, y)/∂(u, v)| du dv = |J| du dv.
    Exercise: Find |J| for the polar coordinate system:
    x = r cos θ,     y = r sin θ.
    (Answer)
          | ∂x/∂r  ∂x/∂θ |   | cos θ  −r sin θ |
    |J| = | ∂y/∂r  ∂y/∂θ | = | sin θ   r cos θ | = r cos² θ + r sin² θ = r.
    Exercise: Evaluate
    I ≡ ∫_{−∞}^{∞} e^{−x²} dx.
    (Answer)
    I² = ∫_{−∞}^{∞} e^{−x²} dx ∫_{−∞}^{∞} e^{−y²} dy
       = ∫_{−∞}^{∞} ∫_{−∞}^{∞} e^{−(x² + y²)} dx dy
       = ∫_0^{2π} ∫_0^{∞} e^{−r²} r dr dθ
       = 2π ∫_0^{∞} e^{−r²} r dr = 2π × (1/2) = π,
    so
    I = √π,
    where the substitution r² = t was used in evaluating ∫_0^{∞} r e^{−r²} dr.
  2. Newton-Raphson method (single variable)
    The formula for the Newton-Raphson method can be derived by retaining the first two terms of the Taylor series of f(x):
    f(x) = f(x0) + f′(x0) (x − x0) + (higher-order terms).    (10)
    With f(x) = 0, the above equation can be solved for x as
    x = x0 − f(x0)/f′(x0),    (11)
    which can be used as the iteration scheme
    xn+1 = xn − f(xn)/f′(xn).
    For example, √2 can be computed by solving f(x) ≡ x² − 2 = 0, with
    f(x) = x² − 2,     f′(x) = 2x.
    Start with x1 = 2.0 (initial guess); then
    x1 = 2,
    x2 = 2 − f(2)/f′(2) = 2 − 2/4 = 1.5,
    x3 = 1.5 − f(1.5)/f′(1.5) = 1.5 − 0.25/3 = 1.41667,
    x4 = 1.4167 − f(1.4167)/f′(1.4167) = … = 1.4142.
    Convergence is attained after only 4 iterations.
    More examples:
    1. Square root of a (= √a)
      Let f(x) = x² − a. It follows that
      xn+1 = xn − (xn² − a)/(2xn) = (1/2)(xn + a/xn).    (12)
      Example (√3 = 1.732…):
      x1 = 1,     x2 = (1/2)(1 + 3/1) = 2,     x3 = (1/2)(2 + 3/2) = 1.75,     x4 = (1/2)(1.75 + 3/1.75) = 1.732.
    2. Inverse of a (= 1/a)
      Let f(x) = 1/x − a. It follows that
      xn+1 = xn − (1/xn − a)/(−1/xn²) = xn (2 − a xn).    (13)
      Example (1/6 = 0.16667…):
      x1 = 0.2,     x2 = 0.2 (2 − 6×0.2) = 0.16,     x3 = 0.16 (2 − 6×0.16) = 0.1664,
      x4 = 0.1664 (2 − 6×0.1664) = 0.166666.
    3. Inverse square root of a (= 1/√a)
      Let f(x) = 1/x² − a. It follows that
      xn+1 = xn − (1/xn² − a)/(−2/xn³) = xn (3 − a xn²)/2.    (14)
  3. Newton-Raphson method (multiple variables)
    We want to solve a set of (non-linear) simultaneous equations of the form
    f1(x1, x2, x3, …, xn) = 0,
    f2(x1, x2, x3, …, xn) = 0,
    ⋮
    fn(x1, x2, x3, …, xn) = 0.
    By expanding each of the above in a Taylor series about x0, one obtains
    f1(x) ≈ f1(x0) + (∂f1/∂x1)|x0 (x1 − x10) + (∂f1/∂x2)|x0 (x2 − x20) + … + (∂f1/∂xn)|x0 (xn − xn0),
    f2(x) ≈ f2(x0) + (∂f2/∂x1)|x0 (x1 − x10) + (∂f2/∂x2)|x0 (x2 − x20) + … + (∂f2/∂xn)|x0 (xn − xn0),
    ⋮
    fn(x) ≈ fn(x0) + (∂fn/∂x1)|x0 (x1 − x10) + (∂fn/∂x2)|x0 (x2 − x20) + … + (∂fn/∂xn)|x0 (xn − xn0).
    If x satisfies
    f1(x) = 0,     f2(x) = 0,     …,     fn(x) = 0,
    the above equations can be written as
    [ 0 ]   [ f1 ]   [ ∂f1/∂x1  ∂f1/∂x2  …  ∂f1/∂xn ] [ x1 − x10 ]
    [ 0 ] = [ f2 ] + [ ∂f2/∂x1  ∂f2/∂x2  …  ∂f2/∂xn ] [ x2 − x20 ]
    [ ⋮ ]   [ ⋮  ]   [    ⋮        ⋮           ⋮    ] [     ⋮    ]
    [ 0 ]   [ fn ]   [ ∂fn/∂x1  ∂fn/∂x2  …  ∂fn/∂xn ] [ xn − xn0 ],    (15)
    or
    0 = f(x0) + J (x − x0),    (16)
    where J is the Jacobian matrix defined as
        [ ∂f1/∂x1  ∂f1/∂x2  …  ∂f1/∂xn ]
    J ≡ [ ∂f2/∂x1  ∂f2/∂x2  …  ∂f2/∂xn ] = ∂(f1, f2, …, fn)/∂(x1, x2, …, xn).    (17)
        [    ⋮        ⋮           ⋮    ]
        [ ∂fn/∂x1  ∂fn/∂x2  …  ∂fn/∂xn ]
    Equation (16) can be solved for x as
    x = x0 − J⁻¹ f(x0),    (18)
    where J⁻¹ is the inverse matrix of J.
    Example:
    Solve numerically
    x³ + y² = 1,     x y = 1/2.    (19)
    (Solution) Let
    f1 ≡ x³ + y² − 1,     f2 ≡ x y − 1/2.
    It follows that
    J = ∂(f1, f2)/∂(x, y) = [ 3x²  2y ]
                            [  y    x ],
    and
    J⁻¹ = 1/(3x³ − 2y²) [  x   −2y ]
                        [ −y   3x² ],
    so
    J⁻¹ f = 1/(3x³ − 2y²) [  x   −2y ] [ x³ + y² − 1 ]
                          [ −y   3x² ] [  x y − 1/2  ]
          = [ (x⁴ − x(y² + 1) + y)/(3x³ − 2y²)    ]
            [ (4x³y − 3x² − 2y³ + 2y)/(6x³ − 4y²) ].    (20)
    Hence the iteration scheme is
    xn+1 = xn − (xn⁴ − xn(yn² + 1) + yn)/(3xn³ − 2yn²),
    yn+1 = yn − (4xn³yn − 3xn² − 2yn³ + 2yn)/(6xn³ − 4yn²).    (21)
    Sample C code:
    #include <stdio.h>
    #include <math.h>
    int main()
    {
    double x=1.0, y=1.0, xo;
    int i;
    
    for (i=0;i<=10;i++)
      { xo = x;  /* keep the old x so that both updates use the same point */
        x = x - (pow(xo,4) + y - xo*(1 + y*y))/(3*pow(xo,3) - 2*y*y);
        y = y - (-3*xo*xo + 2*y + 4*pow(xo,3)*y - 2*pow(y,3))/(6*pow(xo,3) - 4*y*y);
      }
    
    printf("%f %f\n", x, y);
    return 0;
    }
    
    
    Sample Matlab/Octave code:
    x=1; y=1;
    for i=0:10
     eqs=[ x*x*x+y*y-1; x*y - 1/2];
     jacob=[3*x*x 2*y; y x];
     right=inv(jacob)*eqs;
     x=x-right(1);
     y=y-right(2);
    end;
    
    fprintf('%f %f\n', x, y);
    
    
    Sample Mathematica code:
    FindRoot[{x^3 + y^2 == 1, x y == 1/2}, {{x, 1}, {y, 1}}]
    
    
    Starting from (x, y) = (1.0, 1.0), convergence is reached at (x, y) = (0.877275, 0.569947) after only 4 iterations. Note that this is just one of multiple roots; it is necessary to try different initial guesses to obtain the other roots.


Footnotes:

1 The area of the parallelogram spanned by vectors a and b is
S = a b sin θ
  = √(a² b² (1 − cos² θ))
  = √((a·a)(b·b) − (a·b)²)
  = √((a1² + a2²)(b1² + b2²) − (a1 b1 + a2 b2)²)
  = √((a1 b2 − a2 b1)²)
  = |a1 b2 − a2 b1|,
which is the absolute value of the determinant of the 2×2 matrix with rows (a1, a2) and (b1, b2).



File translated from TEX by TTH, version 4.03.
On 09 Sep 2023, 21:51.