Indirect method

We introduce the pseudo-Hamiltonian

\[ h(x,p,p^0,u) = p^0 x + p u.\]

For simplicity, we assume that the BC-extremal associated with the solution of the problem is normal, so we fix $p^0 = -1$. According to the Pontryagin maximum principle, the maximizing control is given by $u(x,p) = \mathrm{sign}(p)$. This function is not differentiable at $p = 0$ (it is not even continuous there), which may lead to numerical issues.
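
As a quick sanity check of the maximization condition, the sketch below maximizes the pseudo-Hamiltonian by brute force over a grid of controls. It assumes that the control is constrained to $[-1, 1]$, which is what makes $\mathrm{sign}(p)$ the maximizer. The helper name `argmax_u`, the argument name `pzero` (for $p^0$) and the grid size are illustrative choices, not part of OptimalControl.

```julia
# Pseudo-Hamiltonian of the text; pzero stands for p⁰
h(x, p, pzero, u) = pzero * x + p * u

# Brute-force maximizer over a grid of admissible controls
# (assumption: the control is constrained to [-1, 1])
function argmax_u(x, p; grid = range(-1, 1, length = 201))
    values = [h(x, p, -1, u) for u in grid]   # normal case: p⁰ = -1
    return grid[argmax(values)]
end

println(argmax_u(0.3, 2.0))    # p > 0: the maximizer is the upper bound
println(argmax_u(0.3, -0.5))   # p < 0: the maximizer is the lower bound
```

The maximizer does not depend on $x$, since the term $p^0 x$ does not involve $u$.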

Let's start by defining the problem.

using OptimalControl
using Plots
using ForwardDiff
using DifferentialEquations
using MINPACK

t0 = 0                                                      # initial time
x0 = 0                                                      # initial state
tf = 5                                                      # final time
xf = 0                                                      # final state

@def ocp begin                                              # problem definition

    t ∈ [ t0, tf ], time
    x ∈ R, state
    u ∈ R, control

    x(t0) == x0
    x(tf) == xf

    ẋ(t) == u(t)

    ∫( x(t) ) → min

end

Thanks to the control-toolbox, the flow $\varphi$ of the (true) Hamiltonian

\[ H(x,p) = h(x,p,-1, u(x,p)) = -x + \lvert p \rvert \]

is given by the function $\texttt{Flow}$. The shooting function $S \colon \mathbb{R} \to \mathbb{R}$ is defined by

\[ S(p_0) = \pi \big( \varphi(t_0, x_0, p_0, t_f) \big) - x_f\]

where $\pi (x,p) = x$ is the classical $x$-space projection.

ϕ = Flow(ocp, (x,p) -> sign(p))                             # flow with maximizing control
π((x,p)) = x;                                               # projection on state space

S(p0) = π( ϕ(t0, x0, p0, tf) ) - xf;                        # shooting function
nle = p0 -> [S(p0[1])]                                      # auxiliary function

# Plot
plot(range(-7, 2, 500), S, xlim = [-7, 2])
plot!([-7,2], [0,0], color = :black)
plot!(xlabel = "p0", ylabel = "S", legend=false)
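
For this particular problem the shooting function can also be written by hand, which gives a cheap check on the plot. The sketch below assumes the extremal equations $\dot x = \mathrm{sign}(p)$, $\dot p = 1$, so that $p(t) = p_0 + (t - t_0)$ and the control switches at most once. `S_exact` is a hypothetical helper name, not part of the toolbox.

```julia
# Closed-form shooting function for ẋ = sign(p), ṗ = 1 (illustrative helper)
function S_exact(p0; t0 = 0.0, x0 = 0.0, tf = 5.0, xf = 0.0)
    T = tf - t0
    τ = clamp(-p0, 0.0, T)        # time spent with u = -1 (while p < 0)
    x_tf = x0 - τ + (T - τ)       # u = -1 during τ, then u = +1 during T - τ
    return x_tf - xf
end

println(S_exact(-1.0))   # value at the initial guess used by the solver
println(S_exact(-2.5))   # zero of the shooting function
println(S_exact(-7.0))   # saturated: u = -1 on the whole interval
println(S_exact(2.0))    # saturated: u = +1 on the whole interval
```

On $(-(t_f - t_0), 0)$ this function is affine with slope 2, and it is constant outside this interval.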

Finite difference method

The main goal is now to find a zero of $S$. To this end, we use the numerical solver $\texttt{hybrd1}$ from the package $\texttt{MINPACK.jl}$. If we do not provide the Jacobian $J_S$ of $S$ to the solver, it approximates it by finite differences.

ξ = [-1.0]                                                  # initial guess
S!(s, ξ) = (s[:] .= S(ξ[1]); nothing)                       # auxiliary function
p0_sol = fsolve(S!, ξ, show_trace = true)                   # solve
println(p0_sol)

Iter     f(x) inf-norm    Step 2-norm      Step time
------   --------------   --------------   --------------
     1     3.000000e+00     0.000000e+00         0.241687
     2     6.261430e-03     2.240618e+00         0.005824
     3     3.191915e-08     9.801378e-06         0.000619
     4     4.155130e-08     2.547106e-16         0.000563
     5     1.187702e-09     8.146872e-17         0.000605
Results of Nonlinear Solver Algorithm
 * Algorithm: Modified Powell
 * Starting Point: [-1.0]
 * Zero: [-2.500000004924035]
 * Inf-norm of residuals: 0.000000
 * Convergence: true
 * Message: algorithm estimates that the relative error between x and the solution is at most tol
 * Total time: 0.249311 seconds
 * Function Calls: 5
 * Jacobian Calls (df/dx): 1

sol = ϕ((t0, tf), x0, p0_sol.x)                             # get the optimal trajectory
plot(sol)                                                   # plot
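
To see why a finite-difference Jacobian works well here, the following sketch applies a forward difference to a hand-written version of the shooting function. It assumes $t_0 = 0$, $t_f = 5$, $x_0 = x_f = 0$ and the extremal equations $\dot x = \mathrm{sign}(p)$, $\dot p = 1$, which give $S(p_0) = 5 - 2\,\mathrm{clamp}(-p_0, 0, 5)$. The names `S_closed` and `fd_derivative` are illustrative, and the step size is an arbitrary choice, not the one used inside MINPACK.

```julia
# Hand-written shooting function for t0 = 0, tf = 5, x0 = xf = 0 (assumption)
S_closed(p0) = 5 - 2 * clamp(-p0, 0, 5)

# Forward finite-difference approximation of the derivative
fd_derivative(f, p; ε = 1e-6) = (f(p + ε) - f(p)) / ε

slope = fd_derivative(S_closed, -1.0)
println(slope)   # close to 2, the true slope of S at p0 = -1
```

Because the finite difference compares two values of $S$ itself, it sees the effect of the switching time on the final state.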

Automatic differentiation (wrong way)

We now want to provide $J_S$ to the solver, using the $\texttt{ForwardDiff.jl}$ package. The Jacobian is then computed with the variational equations, which gives a wrong result in our case.

Details.

Denoting by $z_0 = (x_0,p_0)$ the initial state-costate pair, we have

$\varphi(t_0, z_0, t_f) = z_0 + \int_{t_0}^{t_f} \vec H\big(\varphi(t_0, z_0, t)\big) \,\mathrm dt.$

If we assume that $z_0 \mapsto \varphi(t_0, z_0, t_f)$ is differentiable, we have

$\frac{\partial \varphi}{\partial z_0}(t_0, z_0, t_f)\cdot \delta z_0 = \delta z_0 + \int_{t_0}^{t_f} \vec H'\big(\varphi(t_0, z_0, t)\big)\cdot \left( \frac{\partial \varphi}{\partial z_0}(t_0, z_0, t) \cdot \delta z_0 \right) \,\mathrm dt,$

and so $t \mapsto \frac{\partial \varphi}{\partial z_0}(t_0, z_0, t)\cdot \delta z_0$ is the solution of the variational equations

$\frac{\partial \delta z}{\partial t}(t) = \vec H'\big(\varphi(t_0, z_0, t)\big) \cdot \delta z(t), \qquad \delta z(t_0) = \delta z_0.$

In the studied optimal control problem, we have

$\vec H(x,p) = (\mathrm{sign}(p), 1)$

and so, we have $\vec H'(z) = 0_2$ almost everywhere, which implies

$\frac{\partial \varphi}{\partial z_0}(t_0, z_0, t_f) \cdot \delta z_0 = \mathrm{exp}\big((t_f-t_0) 0_2 \big)\cdot \delta z_0 = \delta z_0.$

The Jacobian of the shooting function is then given by

$S'(p_0) = \pi \left( \frac{\partial \varphi}{\partial p_0}(t_0, x_0, p_0, t_f) \right) = \pi \left( \frac{\partial \varphi}{\partial z_0}(t_0, z_0, t_f) \cdot (0,1) \right) = \pi(0,1) = 0.$

ξ = [-1.0]                                                  # initial guess
JS(ξ) = ForwardDiff.jacobian(p -> [S(p[1])], ξ)             # compute jacobian by forward differentiation
println("ξ = ", ξ[1])
println("JS(ξ) : ", JS(ξ)[1])

ξ = -1.0
JS(ξ) : 0.0
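
The vanishing derivative can be reproduced without the flow machinery. The sketch below propagates derivatives with respect to $p_0$ through an explicit Euler scheme for $\dot x = \mathrm{sign}(p)$, $\dot p = 1$, using a toy dual number type. The `Dual` struct and `euler_dual` are hypothetical stand-ins written for this illustration; they are not the types used by ForwardDiff, and the fixed-step Euler scheme is not the integrator used by the toolbox.

```julia
# Toy forward-mode dual number (illustration only, not ForwardDiff.Dual)
struct Dual
    val::Float64   # value
    der::Float64   # derivative with respect to p0
end

Base.:+(a::Dual, b::Dual) = Dual(a.val + b.val, a.der + b.der)
Base.:*(c::Float64, a::Dual) = Dual(c * a.val, c * a.der)
# sign is piecewise constant, so its derivative is 0 away from p = 0
Base.sign(a::Dual) = Dual(sign(a.val), 0.0)

# Explicit Euler on ẋ = sign(p), ṗ = 1, carrying derivatives w.r.t. p0
function euler_dual(p0::Float64; t0 = 0.0, tf = 5.0, n = 1000)
    h = (tf - t0) / n
    x = Dual(0.0, 0.0)      # x(t0) = 0 does not depend on p0
    p = Dual(p0, 1.0)       # ∂p0/∂p0 = 1
    for _ in 1:n
        x = x + h * sign(p)
        p = p + Dual(h, 0.0)
    end
    return x, p
end

x_tf, p_tf = euler_dual(-1.0)
println("x(tf) ≈ ", x_tf.val)
println("∂x(tf)/∂p0 = ", x_tf.der)   # 0.0: the switch is invisible to the derivative
println("∂p(tf)/∂p0 = ", p_tf.der)   # 1.0
```

The value of $x(t_f)$ is approximately right, but its derivative is exactly zero because no step of the scheme carries any dependence of the switching time on $p_0$.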

However, the solver $\texttt{hybrd1}$ updates the Jacobian with rank-one approximations instead of recomputing it at each iteration, which implies that it still converges to the solution even though the supplied Jacobian is completely wrong.

JS!(js, ξ) = (js[:] .= JS(ξ); nothing)                      # auxiliary function
p0_sol = fsolve(S!, JS!, ξ, show_trace = true)              # solve
println(p0_sol)

Iter     f(x) inf-norm    Step 2-norm      Step time
------   --------------   --------------   --------------
     1     3.000000e+00     0.000000e+00         0.023997
     2     5.000000e+00     1.000000e+04         0.062400
     3     5.000000e+00     3.906250e+03         0.000143
     4     5.000000e+00     3.906250e+03         0.000894
     5     5.000000e+00     3.906250e+03         0.000079
     6     5.000000e+00     5.493164e+02         0.000072
     7     5.000000e+00     7.724762e+01         0.000074
     8     9.550781e-01     1.086295e+01         0.000625
     9     3.984897e-08     2.280436e-01         0.000568
    10     7.810507e-10     3.969852e-16         0.000582
    11     5.208866e-10     1.467028e-19         0.000578
    12     2.842273e-15     5.880714e-19         0.000580
Results of Nonlinear Solver Algorithm
 * Algorithm: Modified Powell (User Jac, Expert)
 * Starting Point: [-1.0]
 * Zero: [-2.499999992669224]
 * Inf-norm of residuals: 0.000000
 * Convergence: true
 * Message: algorithm estimates that the relative error between x and the solution is at most tol
 * Total time: 0.090606 seconds
 * Function Calls: 12
 * Jacobian Calls (df/dx): 2

sol = ϕ((t0, tf), x0, p0_sol.x)                             # get the optimal trajectory
plt = plot(sol)                                             # plot
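
The recovery from a wrong Jacobian can be mimicked with a scalar secant update, which is the one-dimensional analogue of a rank-one update. This is only a sketch of the idea, not the algorithm implemented in $\texttt{hybrd1}$. It assumes the hand-written shooting function $S(p_0) = 5 - 2\,\mathrm{clamp}(-p_0, 0, 5)$ (valid for $t_0 = 0$, $t_f = 5$, $x_0 = x_f = 0$), and it starts from a deliberately wrong but nonzero slope, since this naive iteration cannot divide by an exact zero. The names `S_closed` and `secant_solve` are illustrative.

```julia
# Hand-written shooting function for t0 = 0, tf = 5, x0 = xf = 0 (assumption)
S_closed(p0) = 5 - 2 * clamp(-p0, 0, 5)

# Quasi-Newton iteration in 1D: the slope B is never recomputed from a
# derivative, it is only corrected from the last two function values
function secant_solve(f, p, B; tol = 1e-12, maxiter = 50)
    fp = f(p)
    for k in 1:maxiter
        abs(fp) < tol && return p, k - 1
        p_new = p - fp / B                  # quasi-Newton step with current slope
        f_new = f(p_new)
        B = (f_new - fp) / (p_new - p)      # secant (rank-one) update of the slope
        p, fp = p_new, f_new
    end
    return p, maxiter
end

# Initial slope 0.5 is wrong on purpose (the true slope near the zero is 2)
p_sol, iters = secant_solve(S_closed, -1.0, 0.5)
println("zero ≈ ", p_sol, " after ", iters, " iterations")
```

The first steps are poor because of the wrong slope, but each update pulls the slope estimate toward the observed variation of $S$, and the iteration ends near $p_0 = -2.5$.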

Automatic differentiation (good way)

The goal is to provide the true Jacobian of $S$ using the $\texttt{ForwardDiff}$ package, so we need to tell the ODE solver that the dynamics of the system change when $p = 0$.

To understand why we need to give this information to the solver, see the following details.

Details.

The problem is that the Hamiltonian $H$ is not differentiable everywhere because of the maximizing control. This control is bang-bang (it takes the values $u = 1$ and $u = -1$).

Let us now construct the two smooth Hamiltonians associated with these two controls

$H^+(x,p) = h(x,p,-1,1) = -x + p \qquad \text{and} \qquad H^-(x,p) = h(x,p,-1,-1) = -x - p.$

Their associated vector fields are given by

$\vec H^+(x,p) = (1,1) \qquad \text{and} \qquad \vec H^-(x,p) = (-1, 1),$

and their associated flows are given by

$\varphi^+(t_0, z_0, t_f) = z_0 + \left( \begin{array}{c} 1 \\ 1 \end{array} \right) (t_f -t_0) \qquad \text{and} \qquad \varphi^-(t_0, z_0, t_f) = z_0 + \left( \begin{array}{c} -1 \\ \phantom{-} 1 \end{array} \right) (t_f -t_0).$

If we assume that the optimal structure of the problem is a negative bang followed by a positive bang, then the associated flow is defined by

$\varphi(t_0, z_0, t_f) = \varphi^+ \big( t_1(z_0), \varphi^-\big(t_0, z_0, t_1(z_0)\big), t_f \big),$

with the following condition

$\pi_p \big( \varphi^-(t_0, z_0, t_1(z_0)) \big) = 0,$

where $\pi_p(x,p) = p$ is the classical $p$-space projection. Expanding this last condition gives the explicit form of the function $t_1(\cdot)$:

$t_1(x_0, p_0) = t_0 - p_0.$

Finally, we have

$\begin{align*} \frac{\partial \varphi}{\partial z_0} &= \frac{\partial \varphi^+}{\partial t_0} \frac{\partial t_1}{\partial z_0} + \frac{\partial \varphi^+}{\partial z_0} \left( \frac{\partial \varphi^-}{\partial z_0} + \frac{\partial \varphi^-}{\partial t_f} \frac{\partial t_1}{\partial z_0} \right) \\ &= \left( \begin{array}{c} -1 \\ -1 \end{array} \right) \left( \begin{array}{cc}0 & -1 \end{array} \right) + \left( \begin{array}{cc} 1 & 0 \\ 0 & 1 \end{array} \right) \left[ \left( \begin{array}{cc} 1 & 0 \\ 0 & 1 \end{array} \right) + \left( \begin{array}{c} -1 \\ \phantom - 1 \end{array} \right) \left( \begin{array}{cc}0 & -1 \end{array} \right) \right] \\ &= \left( \begin{array}{cc} 0 & 1 \\ 0 & 1 \end{array} \right) + \left( \begin{array}{cc} 1 & 0 \\ 0 & 1 \end{array} \right) + \left( \begin{array}{cc} 0 & \phantom -1 \\ 0 & -1 \end{array} \right) \\ &= \left( \begin{array}{cc} 1 & 2 \\ 0 & 1 \end{array} \right) \end{align*}$

and so, we have that

$S'(p_0) = \pi \left( \frac{\partial \varphi}{\partial p_0}(t_0, x_0, p_0, t_f) \right) = \pi \left( \frac{\partial \varphi}{\partial z_0}(t_0, z_0, t_f) \cdot (0,1) \right) = \pi(2,1) = 2.$
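
The Jacobian obtained above can be checked numerically. The sketch below composes the two explicit flows with the switching time $t_1(x_0, p_0) = t_0 - p_0$ and differentiates the result by central finite differences. All function names (`flow_minus`, `flow_plus`, `switch_time`, `flow_switch`, `fd_jacobian`) are hypothetical helpers written for this check; the values $t_0 = 0$, $t_f = 5$ and $z_0 = (0, -1)$ are those used in the text.

```julia
# Flows of the two smooth Hamiltonians H⁻ (u = -1) and H⁺ (u = +1); z = [x, p]
flow_minus(t0, z, tf) = z .+ [-1.0, 1.0] .* (tf - t0)
flow_plus(t0, z, tf)  = z .+ [ 1.0, 1.0] .* (tf - t0)

# Switching time: the costate of flow_minus vanishes at t1 = t0 - p0
switch_time(t0, z) = t0 - z[2]

# Composite flow: negative bang on [t0, t1], positive bang on [t1, tf]
function flow_switch(t0, z, tf)
    t1 = switch_time(t0, z)
    return flow_plus(t1, flow_minus(t0, z, t1), tf)
end

# Central finite-difference Jacobian (illustrative helper)
function fd_jacobian(f, z; ε = 1e-6)
    J = zeros(length(f(z)), length(z))
    for j in eachindex(z)
        e = zeros(length(z)); e[j] = ε
        J[:, j] = (f(z .+ e) .- f(z .- e)) ./ (2ε)
    end
    return J
end

J = fd_jacobian(z -> flow_switch(0.0, z, 5.0), [0.0, -1.0])
println(J)         # close to [1 2; 0 1]
println(J[1, 2])   # derivative of x(tf) with respect to p0, close to 2
```

The entry `J[1, 2]` is the derivative $S'(p_0)$, and it only comes out as 2 because the switching time is allowed to move with $p_0$.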

To provide this change of dynamics to the solver, we use a callback during the integration that executes the function $\texttt{affect!}$ when $\texttt{condition(x,p)} = 0$.

For us, the condition is given by $(x,p) \mapsto p$. For the $\texttt{affect!}$ function, we use a global parameter $\alpha$. This parameter is set to $\pm 1$ at the beginning of the integration, and its sign is flipped by the $\texttt{affect!}$ function.

Thanks to the $\texttt{control-toolbox}$ package, the callback can easily be passed to the integrator through the $\texttt{Flow}$ function.

global α                                                    # parameter: ẋ(t) = α with α = ±1

function condition(z, t, integrator)                        # event when condition(x,p) == 0
    x,p = z
    return p
end

function affect!(integrator)                                # action when condition == 0
    global α = -α
    nothing
end

cb = ContinuousCallback(condition, affect!)                 # callback

φ_ = Flow(ocp, (x,p) -> α, callback = cb)                   # intermediate flow

function φ(t0, x0, p0, tf; kwargs...)                       # flow
    global α = sign(p0)
    return φ_(t0, x0, p0, tf; kwargs...)
end

function φ((t0, tf), x0, p0; kwargs...)                     # flow for plot
    global α = sign(p0)
    return φ_((t0, tf), x0, p0; kwargs...)
end

Shoot(p0) =  π( φ(t0, x0, p0, tf) ) - xf                    # shooting function

ξ = [-1.0]                                                  # initial guess
JShoot(ξ) = ForwardDiff.jacobian(p -> [Shoot(p[1])], ξ)     # compute jacobian by forward differentiation
println("ξ = ", ξ[1])
println("JS(ξ) : ", JShoot(ξ)[1])
Shoot!(shoot, ξ) = (shoot[:] .= Shoot(ξ[1]); nothing)       # auxiliary function
JShoot!(jshoot, ξ) = (jshoot[:] .= JShoot(ξ); nothing)      # auxiliary function

p0_sol = fsolve(Shoot!, JShoot!, ξ, show_trace = true)      # solve
println(p0_sol)

ξ = -1.0
JS(ξ) : 1.9999999999999958
Iter     f(x) inf-norm    Step 2-norm      Step time
------   --------------   --------------   --------------
     1     3.000000e+00     0.000000e+00         3.666874
     2     3.330669e-16     2.250000e+00         0.009451
     3     3.330669e-16     0.000000e+00         0.000159
     4     3.000000e+00     2.250000e+00         0.000123
     5     3.330669e-16     2.250000e+00         0.000249
     6     7.500000e-01     1.406250e-01         0.000117
     7     3.330669e-16     1.406250e-01         0.000101
     8     1.875000e-01     8.789062e-03         0.000099
     9     3.330669e-16     8.789062e-03         0.000098
    10     4.687500e-02     5.493164e-04         0.000099
    11     3.330669e-16     5.493164e-04         0.000096
    12     1.171875e-02     3.433228e-05         0.000095
Results of Nonlinear Solver Algorithm
 * Algorithm: Modified Powell (User Jac, Expert)
 * Starting Point: [-1.0]
 * Zero: [-2.500000000000003]
 * Inf-norm of residuals: 0.000000
 * Convergence: true
 * Message: iteration is not making good progress, measured by improvement from last 10 iterations
 * Total time: 3.677572 seconds
 * Function Calls: 12
 * Jacobian Calls (df/dx): 2

# get optimal trajectory
sol = φ((t0, tf), x0, p0_sol.x[1], saveat=range(t0, tf, 500))

# plot
t = time_grid(sol)
x = state(sol)
p = costate(sol)
u = sign ∘ p

plt_x = plot(t, x, label = "x")
plt_p = plot(t, p, label = "p")
plt_u = plot(t, u, label = "u")

plt_xp = plot(plt_x, plt_p, layout=(1, 2))
plot(plt_xp, plt_u, layout = (2, 1))
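
The plotted trajectory can be compared with the closed-form extremal. The sketch below assumes $p_0 = -2.5$ (the zero found by the solver, rounded), $t_0 = 0$, $t_f = 5$, $x_0 = 0$ and the extremal equations $\dot x = \mathrm{sign}(p)$, $\dot p = 1$. The names `p_opt`, `u_opt`, `x_opt` are illustrative, and the cost is evaluated with a simple trapezoidal rule rather than with any function of the toolbox.

```julia
# Closed-form extremal for p0 = -2.5 on [0, 5] (values taken from the text)
p0_opt = -2.5
t_start, t_end = 0.0, 5.0
t_switch = t_start - p0_opt                      # costate vanishes here

p_opt(t) = p0_opt + (t - t_start)                # ṗ = 1
u_opt(t) = sign(p_opt(t))                        # bang-bang control
x_opt(t) = t <= t_switch ? -(t - t_start) : -(t_switch - t_start) + (t - t_switch)

# Trapezoidal approximation of the cost ∫ x(t) dt on a uniform grid
ts = range(t_start, t_end, length = 501)
xs = x_opt.(ts)
cost = step(ts) * (sum(xs) - (xs[1] + xs[end]) / 2)

println("x(tf) = ", x_opt(t_end))                # final condition x(tf) = 0
println("switching time = ", t_switch)
println("cost ≈ ", cost)
```

The state decreases with slope $-1$ up to the switching time and then increases with slope $+1$ back to the target, which is the shape expected in the plots.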