on pulse optimal control of linear systems with aftereffect

8
ISSN 0005-1179, Automation and Remote Control, 2013, Vol. 74, No. 11, pp. 1802–1809. © Pleiades Publishing, Ltd., 2013. Original Russian Text © N.I. Zhelonkina, A.B. Lozhnikov, and A.N. Sesekin, 2013, published in Avtomatika i Telemekhanika, 2013, No. 11, pp. 39–48. NONLINEAR SYSTEMS On Pulse Optimal Control of Linear Systems with Aftereffect N. I. Zhelonkina , A. B. Lozhnikov ,∗∗ , and A. N. Sesekin ,∗∗ Ural Federal University, Yekaterinburg, Russia ∗∗ Institute of Mathematics and Mechanics, Ural Branch, Russian Academy of Sciences, Yekaterinburg, Russia Received February 4, 2013 Abstract—The paper deals with the problem of pulse optimal control of a linear dynamic system with aftereffect. As a functional, the degenerate quadratic functional of the most common kind is considered. The absence of control in the functional leads to the fact that the optimal control contains a pulse component. Sufficient conditions of existence of the pulse optimal control are obtained and the equations describing coefficients in the optimal control are worked out. Sufficient conditions that make it possible to integrate equations and to find the coefficients in an explicit form for the optimal control are established. A model example is considered. DOI: 10.1134/S0005117913110039 1. INTRODUCTION The degenerate linear-quadratic optimization problems that are considered both at the finite and at the infinite time intervals are of great applied importance [1]. This is true both for the systems described by ordinary differential equations and for the systems with aftereffect. The degenerate linear-quadratic problems for the two above-mentioned classes of systems display such a feature that in the class of measurable controls these problems are unsolvable [2–5], and so to afford the existence of optimal control, it is necessary to extend the set of admissible controls, allowing for pulse controls. In this work we consider the programmed problem of optimal stabilization with a degenerate in- tegral quadratic functional of a rather common form. The summands with aftereffect are contained both in the right side of the system and under the integral in the functional. The problem with a similar dynamics of the system and with a structure of the functional at the finite time interval was dealt with in [4, 5]. The equations describing the coefficients in optimal control are derived. The obtained system is fairly complex, and the way of integrating it is not simple. The suggestion is made to simplify the system in the same way as in [6], defining the matrices in the functional in a special way, which makes it possible to obtain in the explicit form the coefficients in the control. 2. STATEMENT OF THE PROBLEM AND ITS REDUCTION It is necessary to develop the control action v that affords the asymptotic stability of a zero solution of the system ˙ x(t)= Ax(t)+ A τ x(t τ )+ 0 τ G(θ)x(t + θ)+ B ˙ v(t) (1) with the initial condition x(t)= ϕ(t), t 0 τ t t 0 , 1802

Upload: a-n

Post on 23-Dec-2016

212 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: On pulse optimal control of linear systems with aftereffect

ISSN 0005-1179, Automation and Remote Control, 2013, Vol. 74, No. 11, pp. 1802–1809. © Pleiades Publishing, Ltd., 2013.Original Russian Text © N.I. Zhelonkina, A.B. Lozhnikov, and A.N. Sesekin, 2013, published in Avtomatika i Telemekhanika, 2013, No. 11, pp. 39–48.

NONLINEAR SYSTEMS

On Pulse Optimal Control of Linear Systems with Aftereffect

N. I. Zhelonkina∗, A. B. Lozhnikov∗,∗∗, and A. N. Sesekin∗,∗∗

∗Ural Federal University, Yekaterinburg, Russia∗∗Institute of Mathematics and Mechanics, Ural Branch, Russian Academy of Sciences,

Yekaterinburg, Russia

Received February 4, 2013

Abstract—The paper deals with the problem of pulse optimal control of a linear dynamic systemwith aftereffect. As a functional, the degenerate quadratic functional of the most common kindis considered. The absence of control in the functional leads to the fact that the optimalcontrol contains a pulse component. Sufficient conditions of existence of the pulse optimalcontrol are obtained and the equations describing coefficients in the optimal control are workedout. Sufficient conditions that make it possible to integrate equations and to find the coefficientsin an explicit form for the optimal control are established. A model example is considered.

DOI: 10.1134/S0005117913110039

1. INTRODUCTION

The degenerate linear-quadratic optimization problems that are considered both at the finite andat the infinite time intervals are of great applied importance [1]. This is true both for the systemsdescribed by ordinary differential equations and for the systems with aftereffect. The degeneratelinear-quadratic problems for the two above-mentioned classes of systems display such a featurethat in the class of measurable controls these problems are unsolvable [2–5], and so to afford theexistence of optimal control, it is necessary to extend the set of admissible controls, allowing forpulse controls.

In this work we consider the programmed problem of optimal stabilization with a degenerate in-tegral quadratic functional of a rather common form. The summands with aftereffect are containedboth in the right side of the system and under the integral in the functional. The problem witha similar dynamics of the system and with a structure of the functional at the finite time intervalwas dealt with in [4, 5]. The equations describing the coefficients in optimal control are derived.The obtained system is fairly complex, and the way of integrating it is not simple. The suggestionis made to simplify the system in the same way as in [6], defining the matrices in the functional ina special way, which makes it possible to obtain in the explicit form the coefficients in the control.

2. STATEMENT OF THE PROBLEM AND ITS REDUCTION

It is necessary to develop the control action v that affords the asymptotic stability of a zerosolution of the system

x(t) = Ax(t) +Aτx(t− τ) +

0∫

−τ

G(θ)x(t+ θ)dθ +Bv(t) (1)

with the initial condition

x(t) = ϕ(t), t0 − τ � t � t0,

1802

Page 2: On pulse optimal control of linear systems with aftereffect

ON PULSE OPTIMAL CONTROL OF LINEAR SYSTEMS WITH AFTEREFFECT 1803

and minimizes the functional

J [v(·)] =∞∫

t0

⎡⎣xT(t)Φ0x(t) + xT(t)

0∫

−τ

Φ1(θ)x(t+ θ)dθ

+

0∫

−τ

xT(t+ θ)ΦT1 (θ)dθ x(t) +

0∫

−τ

xT(t+ s)Φ2(s)x(t+ s)ds

+

0∫

−τ

0∫

−τ

xT(t+ θ)Φ3(θ, ρ)x(t+ ρ)dθdρ+ xT(t− τ)Φ4x(t− τ)

⎤⎦ dt, (2)

where Φ0, Φ1, Φ2, Φ3, Φ4 are symmetric (apart from Φ1 and Φ3) continuous (by the aggregate ofvariables) matrix-valued functions of the dimension n × n, x(t), ϕ(t) are vector functions of thedimension n, v(t) is the vector function of the dimension m, A, Aτ are constant matrices of thedimension n×n, G(θ) is the matrix-valued function of the dimension n×n with piecewise continuouselements, B is the constant matrix of the dimension n×m.

The given problem is degenerate [1] and is not solvable in the class of absolutely continuousfunctions. To obtain an optimal solution, we will extend the problem by introducing pulse controls.Further we will assume that v(t) and thus x(t) are the functions of a limited variation, the derivativesof which are understood in the generalized sense [9]. The initial function ϕ(t) will also be assumedthe function of a limited variation.

As a solution of Eq. (1) we will take the solution of the appropriate integral equation,

x(t) = ϕ(t0) +

t∫

t0

A(s)x(s)ds +

t∫

t0

Aτ (s)x(s − τ)ds

+

t∫

t0

0∫

−τ

G(θ)x(s+ θ)dθds+

t∫

t0

B dv(s),

where the integrals are understood in the Riemann–Stieltjes sense.

On replacing the variable

y(t) = x(t)−Bv(t) (3)

we will set up an auxiliary problem in the same way as in [4]. The function y(t) will satisfy thesystem of equations

y(t) = Ay(t) +Aτ

(y(t− τ) +Bv(t− τ)

)

+

0∫

−τ

G(θ)(y(t+ θ) +Bv(t+ θ1)

)dθ +ABv(t). (4)

In this system, in view of (3), we have

y(t− τ) +Bv(t− τ) = ϕ(t− τ) (5)

at t ∈ [t0, t0 + τ ]. Therefore, the solution of the system (4), we consideration for (5), will coincidewith the solution of the system (4) at the initial condition

y(t) = ϕ(t), v(t) = 0, t0 − τ � t � t0.

AUTOMATION AND REMOTE CONTROL Vol. 74 No. 11 2013

Page 3: On pulse optimal control of linear systems with aftereffect

1804 ZHELONKINA et al.

On replacing (3), the functional (2) takes the form

J∗[v(·)] =∞∫

t0

W [y, yτ (·), vτ (·)] dt, (6)

where

W [y, yτ (·), vτ (·)] =(y(t) +Bv(t)

)TΦ0

(y(t) +Bv(t)

)

+(y(t) +Bv(t)

)T 0∫

−τ

Φ1(θ)(y(t+ θ) +Bv(t+ θ)

)dθ

+

0∫

−τ

(y(t+ θ) +Bv(t+ θ)

)TΦT1 (θ)dθ

(y(t) +Bv(t)

)

+

0∫

−τ

(y(t+ s) +Bv(t+ s)

)TΦ2(s)

(y(t+ s) +Bv(t+ s)

)ds

+

0∫

−τ

0∫

−τ

(y(t+ θ) +Bv(t+ θ)

)TΦ3(θ, ρ)

(y(t+ ρ) +Bv(t+ ρ)

)dθdρ

+(y(t− τ) +Bv(t− τ)

)TΦ4

(y(t− τ) +Bv(t− τ)

).

3. SOLUTION OF THE AUXILIARY PROBLEM

The method of dynamic programming is used for the solution of an auxiliary problem. Thecontrol action will be sought in the form

v(t) = W0y(t) +

0∫

−τ

W1(θ)y(t+ θ) dθ +

0∫

−τ

W2(θ1)v(t+ θ1) dθ1. (7)

Note that if the control action takes the form (7), the trajectory of a controllable system with thecontrol (7) will satisfy the system

y(t) = Ay(t) +Aτ (y(t− τ) +Bv(t− τ))

+

0∫

−τ

G(θ)(y(t+ θ) +Bv(t+ θ1)

)dθ +ABv(t),

v(t) = W0

⎡⎣Ay(t) +Aτ

(y(t− τ) +Bv(t− τ)

)

+

0∫

−τ

G(θ)(y(t+ θ) +Bv(t+ θ)

)dθ +ABv(t)

⎤⎦

+W1(0)y(t) −W1(−τ)y(t− τ)−0∫

−τ

dW1(θ)

dθy(t+ θ) dθ

+W2(0)v(t) −W2(−τ)v(t− τ)−0∫

−τ

dW2(θ)

dθv(t+ θ) dθ.

AUTOMATION AND REMOTE CONTROL Vol. 74 No. 11 2013

Page 4: On pulse optimal control of linear systems with aftereffect

ON PULSE OPTIMAL CONTROL OF LINEAR SYSTEMS WITH AFTEREFFECT 1805

For an optimal value of the functional (6), at the position of {y, yτ (·), vτ (·)} (Bellman functional),we will introduce the designation

I(y(t), yτ (·), vτ (·)) = minv(·)

J∗[v(·)].

The Bellman functional will be sought in the form

I(y(t), yτ (·), vτ (·)) = yT(t)Py(t) + 2yT(t)

0∫

−τ

Q(s)y(s) ds

+

0∫

−τ

0∫

−τ

yT(s)R(s, ρ)y(ρ) dsdρ + 2yT(t)

0∫

−τ

P1(θ1)v(θ1)dθ1

+2

0∫

−τ

0∫

−τ

yT(θ)P2(θ, θ1)y(θ1) dθdθ1 +

0∫

−τ

0∫

−τ

vt(θ)P3(θ, θ1)v(θ1) dθdθ1

+

0∫

−τ

yT(θ)P4(θ)y(θ) dθ + 2yT(t− τ)

0∫

−τ

P5(p)v(p) dp +

0∫

−τ

vT(r)P6(r)v(r) dr.

Theorem 1. Let for t ∈ [t0,∞) the following conditions be fulfilled:

(a) the matrices P, Q(s), R(s, θ), P4(s) are the solution of the system

Q(0) +QT(0) + PTA+ATP + P4(0) + Φ0 = FT0 H−1F0, (8)

− d

dθQ(θ) +R(0, θ) + PG(θ) +ATQ(θ) + Φ1(θ) = FT

0 H−1F1(θ), (9)

−(

∂θ+

∂ρ

)R(θ, ρ) +GT(θ)Q(ρ) +QT(θ)G(ρ)

+Φ3(θ, ρ) = FT1 (θ)H−1F1(ρ), (10)

− d

dsP4(s) + Φ2(s) = 0 (11)

under boundary conditions

ATτ P = QT(−τ), AT

τ Q(θ) +QT(θ)Aτ = RT(θ,−τ) +R(−τ, θ),

Φ4 = P4(−τ)(12)

for −τ � θ, ρ, s � 0; for the functions P1(s), P2(s, θ), P3(s, θ), P5(s), P6(s) the following presenta-tions are valid:

P1(θ1) = Q(θ1)B, P3(θ1, θ2) = BTR(θ1, θ2)B,

P2(θ, θ1) = R(θ, θ1)B, P5(p) = P4(p)B, P6(r) = BTP4(r)B;

(b) the matrix BTΦ0B + P6(0) is positive definite;

(c) the matrix ( 1τΦ0 Φ1(θ)

ΦT1 (θ) Φ2(θ)

)

is positive semidefinite for θ ∈ [−τ, 0];

AUTOMATION AND REMOTE CONTROL Vol. 74 No. 11 2013

Page 5: On pulse optimal control of linear systems with aftereffect

1806 ZHELONKINA et al.

(d) the matrix Φ3(θ, ρ) has the structure

Φ3(θ, ρ) = ΦT3 (θ) Φ3(ρ),

where Φ3(·) is some n× n matrix with piecewise continuous elements;

(e) the matrix Φ4 is positive semidefinite;

(f) the Bellman functional I(y(t), yτ (·), vτ (·)) is positive definite.

Then the auxiliary problem of optimal stabilization has a solution and the optimal stabilizingcontrol has the form

v(t) = W0y(t) +

0∫

−τ

W1(θ)y(t+ θ) dθ +

0∫

−τ

W2(θ1)v(t+ θ1) dθ1, (13)

where

W0 = −H−1 F0, W1(θ) = −H−1 F1(θ), W2(θ1) = −H−1 F2(θ1),

H = BTΦ0B + P6(0),

F0 = PT1 (0) +BTATP + PT

5 (0) +BTΦ0,

F1(θ) = BTATQ(θ) + PT2 (θ, 0) +BTΦ1(θ),

F2(θ1) = BTATP1(θ1) + PT3 (θ1, 0) +BTΦ1(θ1)B = F1(θ1)B.

The proof of Theorem 1 is given in the Appendix.

Remark. Let

Φ1(θ) = FT0 H−1F1(θ)−R(0, θ)− PG(θ),

Φ3(θ, ρ) = FT1 (θ)H−1F1(ρ)−GT(θ)Q(ρ)−QT(θ)G(ρ).

Then Eq. (9) takes the formd

dθQ(θ) = ATQ(θ),

and Eq. (10) becomes

−(

∂θ+

∂ρ

)R(θ, ρ) = 0.

Equations (8) and (11) will remain as before, and also the boundary conditions (12) for them.

4. DEVELOPMENT OF OPTIMAL PROGRAMMED CONTROLFOR THE INITIAL PROBLEM

For the optimal control of the initial problem the following theorem is valid:

Theorem 2. The optimal programmed control in the initial problem has the form

v(t) = Δv(t0, ϕ(·))δ(t − t0) + vr(t),

where

vr(t) =(W0A+W1(0)

)x(t) +

(W0Aτ −W1(−τ)

)x(t− τ)

+

0∫

−τ

(G(0) − dW1(θ)

)x(t+ θ) dθ (14)

AUTOMATION AND REMOTE CONTROL Vol. 74 No. 11 2013

Page 6: On pulse optimal control of linear systems with aftereffect

ON PULSE OPTIMAL CONTROL OF LINEAR SYSTEMS WITH AFTEREFFECT 1807

is the integrable function,

Δv(t0, ϕ(·)) = W0ϕ(t0) +

0∫

−τ

W1(θ)ϕ(t0 + θ) dθ. (15)

It is easy to make sure that Theorem 2 is valid by differentiating the control v(t) from (13) inthe generalized sense [9], considering that v(t) = 0 at t < t0.

5. EXAMPLE

Consider the controllable system

x(t) = x(t)− 1

5x(t− τ) + v(t). (16)

The initial function ϕ(t) is preset by the condition ϕ(t) = 1 at t ∈ [−1; 0]. Note that in the absenceof control the system (16) is unstable.

As a functional (2) we will take

J [v(·)] =∞∫

0

⎡⎢⎣x2(t) + x(t)

0∫

−1

x(t+ θ)dθ +

0∫

−1

x2(t+ θ)dθ +1

12

⎛⎝

0∫

−1

x(t+ θ)dθ

⎞⎠

2

+ x2(t− 1)

⎤⎥⎦ dt.

A trajectory jump caused by the pulse action Δv(t0, ϕ(·))δ(t) at the initial instant of time,according (15), is preset by the quantity −7

6 . Therefore, under the action of the initial pulseΔv(t0, ϕ(·))δ(t) the phase point moves from the position x = 1 to the position x = −1

6 . The motionis then carried out under the action of the regular control (14) vr(t) = −7

6x(t) +1130x(t − 1). The

modeling results are shown in figure.

1.2

1.0

0.6

0.8

0.4

0.2

0

–0.2

x

(

t

)

–1.0 –0.5 0 0.5 1.51.0 2.0 2.5

tFigure.

For the numerical solution of the equation use was made of the algorithm of the Runge–Kutta–Felberg method of the fourth order with a variable step and the interpolation of previous history bymeans of degenerate cubic splines. The realization of the given algorithm is a part of the packageof Time-Delay System Toolbox programs, which is described in [10].

AUTOMATION AND REMOTE CONTROL Vol. 74 No. 11 2013

Page 7: On pulse optimal control of linear systems with aftereffect

1808 ZHELONKINA et al.

ACKNOWLEDGMENTS

This work was supported by the Russian Foundation for Basic Research, project no. 13-01-00304, and the Goal-Oriented Program of the Ural Branch and the Siberian Branch of the RussianAcademy of Sciences “Quality Theory and Numerical Methods for Problems of Dynamics, Controland Optimization” (project no. 12-C-1-1017).

APPENDIX

Proof of Theorem 1. For an optimal value of the functional (6) at the position {y, yτ (·), vτ (·)}we will introduce the designation

I0(y(t), yτ (·), vτ (·)) = minv(·)

J∗[v(·)].

The Bellman equation for the problem considered will have the form

minv(t)

{dI0(y(t), yτ (·), vτ (·))

dt+W [y(t), yτ (·), vτ (·)]

}= 0.

Using the method from [7, 8], it is possible to establish that the matrices P , Q(s), R(s, θ), P1(s),P2(s, θ), P3(s, θ), P4(s), P5(s), P6(s), defining the optimal control and an optimal value of thefunctional, must satisfy the system

Q(0) +QT(0) + PTA+ATP + P4(0) + Φ0 = FT0 H−1F0,

− d

dθQ(θ) +R(0, θ) + PG(θ) +ATQ(θ) + Φ1(θ) = FT

0 H−1F1(θ), (A.1)

−(

∂θ+

∂ρ

)R(θ, ρ) +GT(θ)Q(ρ) +QT(θ)G(ρ) + Φ3(θ, ρ) = FT

1 (θ)H−1F1(ρ), (A.2)

− d

dθ1P1(θ1) + P2(0, θ1) +ATP1(θ1) + PG(θ1)B +Φ1(θ1)B = FT

0 H−1F2(θ1), (A.3)

−(

∂θ+

∂θ1

)P2(θ, θ1) +QT(θ)G(θ1)B +GT(θ)P1(θ1)

+Φ3(θ, θ1)B = FT1 (θ)H−1F2(θ1),

−(

∂θ1+

∂θ2

)P3(θ1, θ2) +BTGT(θ1)P1(θ2) +BTΦ3(θ1, θ2)B = FT

2 (θ1)H−1F2(θ2), (A.4)

− d

dsP4(s) + Φ2(s) = 0,

− d

dpP5(p) + Φ2(p)B = 0,

− d

drP6(r) +BTΦ2(r)B = 0

AUTOMATION AND REMOTE CONTROL Vol. 74 No. 11 2013

Page 8: On pulse optimal control of linear systems with aftereffect

ON PULSE OPTIMAL CONTROL OF LINEAR SYSTEMS WITH AFTEREFFECT 1809

under boundary conditions

PAτB = P1(−τ), BTATτ Q(θ) = PT

2 (θ,−τ), ATτ P = QT(−τ),

ATτ Q(θ) +QT(θ)Aτ = RT(θ,−τ) +R(−τ, θ),

ATτ P1(θ1) = P2(−τ, θ1),

BTATτ P1(θ1) + PT

1 (θ1)AτB = PT3 (θ1,−τ) + P3(−τ, θ1),

Φ4 = P4(−τ), Φ4B = P5(−τ),

BTΦ4(t)B = P6(−τ) (A.5)

for −τ � θ, θ1, θ2, ρ, p, r, s � 0.

Comparing Eqs. (A.1) and (A.3) with consideration for the boundary conditions (A.5), we makesure of the validity of the equality P1(θ1) = Q(θ1)B. From the comparison of Eqs. (A.2) and (A.4)we verify the validity of the equality P3(θ1, θ2) = BTR(θ1, θ2)B.

Condition (b) appears because the control in the Bellman equation is sought as a result of thefulfilment of the minimization operation in v(t).

Condition (f) ensures stabilizability of the optimal trajectory, while conditions (b), (d) and (e)are necessary for the fulfilment of condition (f).

REFERENCES

1. Gurman, V.I., Printsip rasshireniya v zadachakh upravleniya (The Extension Principle in Control Prob-lems), Moscow: Nauka, 1985.

2. Zavalishchin, S.T. and Sesekin, A.N., On Particular Solutions in Optimization Problems of DynamicSystems with the Quadratic Quality Criterion, Differ. Uravn., 1975, no. 4, pp. 665–671.

3. Zavalishchin, S.T., Sesekin, A.N., and Drozdenko, S.E., Dinamicheskie sistemy s impul’snoi strukturoi(Dynamic Systems with the Pulse Structure), Sverdlovsk: Sredn.-Ural. Knizhnoe Izd., 1983.

4. Andreeva, I.Yu. and Sesekin, A.N., Degenerate Linear-Quadratic Optimization with Time Delay, Autom.Remote Control, 1997, vol. 58, no. 7, part 1, pp. 1101–1109.

5. Sesekin, A.N. and Fetisova, Yu.V., The Order of Singularity of Pulse Optimal Controls in DegenerateLinear-Quadratic Optimization Problems with Time Delay, Autom. Remote Control, 2009, vol. 70, no. 4,pp 589–597.

6. Kim, A.V. and Lozhnikov, A.B., A Linear-Quadratic Control Problem for State-Delay Systems. ExactSolutions for the Riccati Equations, Autom. Remote Control, 2000, vol. 61, no. 7, part 1, pp. 1076–1090.

7. Kolmanovskii, V.B. and Maizenberg, T.L., Optimal Estimates of System States and Some Problems ofEquation Control with Aftereffect, Prikl. Mat. Mekh., 1977, vol. 41, no. 3, pp. 446–456.

8. Yanushevskii, R.T., Upravlenie ob”ektami s zapasdyvaniem (Control of Objects with Delay), Moscow:Nauka, 1978.

9. Vladimirov, V.S., Obobshchennye funktsii v matematicheskoi fizike (Generalized Functions in Mathe-matical Physics), Moscow: Nauka, 1976.

10. Kim, A.V. and Pimenov, V.G., I-Gladkii analiz i chislennye metody resheniya funktsional’no-differentsial’nykh uravnenii (I-Smooth Analysis and Numerical Methods of Solving Functional Differen-tial Equations), Moscow–Izhevsk: NITs “Regulyarnaya i Khaoticheskaya Dinamika,” 2004.

This paper was recommended for publication by E.Ya. Rubinovich, a member of the EditorialBoard

AUTOMATION AND REMOTE CONTROL Vol. 74 No. 11 2013