robust lqr and lqi control with actuator failure of a...

10
Int. J. Appl. Math. Comput. Sci., 2016, Vol. 26, No. 2, 325–334 DOI: 10.1515/amcs-2016-0023 ROBUST LQR AND LQI CONTROL WITH ACTUATOR FAILURE OF A 2DOF UNMANNED BICYCLE ROBOT STABILIZED BY AN INERTIAL WHEEL ADAM OWCZARKOWSKI a, ,DARIUSZ HORLA a a Institute of Control and Information Engineering Pozna´ n University of Technology, ul. Piotrowo 3a, 60-965 Pozna´ n, Poland e-mail: [email protected] , [email protected] Essential ingredients for robust control are the ability to cope with different types of system behavior following modeling imperfections and the ability to assure a certain performance level. In this paper, we propose to use an actuator fault-tolerant control law to govern, during experiments, the stabilization of a bicycle robot with an inertial wheel in order to take into account unmodeled uncertainty introduced by using a linearized model in an LQR fashion. Our proposal is illustrated by signal plots and the values of performance indices obtained from a set of experiments. Keywords: unmanned bicycle robot, LQR/LQI control, robustness, actuator failure. 1. Introduction The problem of stabilization, or robust control, is widely discussed in the literature. Some papers are related to performance measures as a quadratic cost for uncertain systems, as the works of Chang and Peng (1972) or Petersen and McFarlane (1992) for the case of continuous-time control systems. In the case of discrete-time control systems, the guaranteed-cost control has been addressed by, e.g., Xie and Soh (1995). Control algorithms, when implemented, are prone to failures. Thus it is important to use controllers that can tolerate these failures, to guarantee stability and a certain level of performance. This paper adopts the approach presented by Yang et al. (2000b) to apply LQR/LQI control strategies to stabilize a bicycle robot in an unstable equilibrium point, based on its linearized model. The actuator failure feature of the controller is used to introduce the uncertainty that originates from adopting a linearized model to control a nonlinear system that can be modeled by nonlinear state-space equations. Appropriate results are shown on the basis of a set of performed experiments with a bicycle robot. The state-feedback control law is used to guarantee the cost in the case of actuator failure or, in other words, to assure that the lack of consistency between the linearized Corresponding author model and the “true” model of the plant is compensated, projecting the “robustness area” on the static characteristic of the nonlinearity present in the control system or an unknown static characteristic of the nonlinear system. The principle and basic introduction to inertial wheel stabilization can be found in the work of Owczarkowski et al. (2014). The LQR approach to stabilize an unmanned bicycle is presented by Yang et al. (2011) or Smerpitak et al. (2012), but only as a result of simulations and no robustness introduced to the control system. The dynamic characteristics, i.e., the response of the closed-loop system to the initial conditions presented in this paper, are similar to the ones presented in the above-cited works, but they can take the uncertainty of the linearized model of the robot into account. 2. Experimental setup: An unmanned bicycle robot Figure 1 shows a real robot. It has two degrees of freedom (the deflection angle from the vertical position and the angle of the reaction wheel) and one actuator (electric motor). The main goal of the control system is to stabilize the robot at an unstable equilibrium point using as little energy as possible. The DC brushed electric motor is fitted inside the construction and accelerates or decelerates the rotating mass (the steering wheel, the reaction wheel).

Upload: others

Post on 16-Jun-2021

0 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: ROBUST LQR AND LQI CONTROL WITH ACTUATOR FAILURE OF A …pldml.icm.edu.pl/pldml/element/bwmeta1.element.bwn... · 2016. 6. 30. · dariusz.horla@put.poznan.pl Essential ingredients

Int. J. Appl. Math. Comput. Sci., 2016, Vol. 26, No. 2, 325–334DOI: 10.1515/amcs-2016-0023

ROBUST LQR AND LQI CONTROL WITH ACTUATOR FAILURE OF A 2DOFUNMANNED BICYCLE ROBOT STABILIZED BY AN INERTIAL WHEEL

ADAM OWCZARKOWSKI a,∗, DARIUSZ HORLA a

aInstitute of Control and Information EngineeringPoznan University of Technology, ul. Piotrowo 3a, 60-965 Poznan, Polande-mail: [email protected],

[email protected]

Essential ingredients for robust control are the ability to cope with different types of system behavior following modelingimperfections and the ability to assure a certain performance level. In this paper, we propose to use an actuator fault-tolerantcontrol law to govern, during experiments, the stabilization of a bicycle robot with an inertial wheel in order to take intoaccount unmodeled uncertainty introduced by using a linearized model in an LQR fashion. Our proposal is illustrated bysignal plots and the values of performance indices obtained from a set of experiments.

Keywords: unmanned bicycle robot, LQR/LQI control, robustness, actuator failure.

1. Introduction

The problem of stabilization, or robust control, iswidely discussed in the literature. Some papers arerelated to performance measures as a quadratic cost foruncertain systems, as the works of Chang and Peng(1972) or Petersen and McFarlane (1992) for the caseof continuous-time control systems. In the case ofdiscrete-time control systems, the guaranteed-cost controlhas been addressed by, e.g., Xie and Soh (1995).

Control algorithms, when implemented, are proneto failures. Thus it is important to use controllers thatcan tolerate these failures, to guarantee stability and acertain level of performance. This paper adopts theapproach presented by Yang et al. (2000b) to applyLQR/LQI control strategies to stabilize a bicycle robotin an unstable equilibrium point, based on its linearizedmodel. The actuator failure feature of the controller isused to introduce the uncertainty that originates fromadopting a linearized model to control a nonlinear systemthat can be modeled by nonlinear state-space equations.Appropriate results are shown on the basis of a set ofperformed experiments with a bicycle robot.

The state-feedback control law is used to guaranteethe cost in the case of actuator failure or, in other words, toassure that the lack of consistency between the linearized

∗Corresponding author

model and the “true” model of the plant is compensated,projecting the “robustness area” on the static characteristicof the nonlinearity present in the control system or anunknown static characteristic of the nonlinear system.

The principle and basic introduction to inertial wheelstabilization can be found in the work of Owczarkowskiet al. (2014). The LQR approach to stabilize an unmannedbicycle is presented by Yang et al. (2011) or Smerpitaket al. (2012), but only as a result of simulations and norobustness introduced to the control system. The dynamiccharacteristics, i.e., the response of the closed-loop systemto the initial conditions presented in this paper, are similarto the ones presented in the above-cited works, but theycan take the uncertainty of the linearized model of therobot into account.

2. Experimental setup: An unmannedbicycle robot

Figure 1 shows a real robot. It has two degrees of freedom(the deflection angle from the vertical position and theangle of the reaction wheel) and one actuator (electricmotor). The main goal of the control system is to stabilizethe robot at an unstable equilibrium point using as littleenergy as possible. The DC brushed electric motor is fittedinside the construction and accelerates or decelerates therotating mass (the steering wheel, the reaction wheel).

Page 2: ROBUST LQR AND LQI CONTROL WITH ACTUATOR FAILURE OF A …pldml.icm.edu.pl/pldml/element/bwmeta1.element.bwn... · 2016. 6. 30. · dariusz.horla@put.poznan.pl Essential ingredients

326 A. Owczarkowski and D. Horla

Fig. 1. Picture of a real robot.

The motor shaft torque creates a reaction which isused for stabilization and is the core principle of operationof this robot. The only control signal is the electric motorcurrent proportional to the torque, which makes the directtorque control absolutely vital and fundamental in thiscase. The measurement system consists of three sensors:a one-axis accelerometer, a one-axis gyroscope and anencoder. Raw measurement values would be uselesswithout a specially prepared filtering and estimationprocess. To obtain a reliable state vector, we decided touse a combination of Kalman and FIR filters. The studiespresented by Drapikowski et al. (2012) prove that thismeasurement system is sufficient for this kind of robot.Every calculation is made by an embedded system with ahigh performance 32-bit STM32 microcontroller.

There are some constraints in the system that must betaken into account, namely, the maximum angle from thevertical position, the maximum velocity of the reactionwheel and the maximum control value. In other words,every electric motor has its own limits, i.e., the maximumvelocity and the maximum torque. If one of these valuessaturates, the system becomes much more difficult tocontrol. In such a situation, the actuator is uncontrollable.This article shows how to cope with actuator failures usingmodified LQR control.

3. Model of the robot

3.1. Continuous-time 2DOF model of the plantand the solution procedure for nonlinear differ-ence equations. The full mathematical model of therobot is defined by the following non-linear differentialequations (time index omitted for brevity):

x1(t) = x2(t) , (1)

x2(t) =g hr mr sin(x1(t))

Irg− br x2(t)

Irg(2)

− bI x4(t)

Irg+

km u(t)

Irg,

x3(t) = x4(t) , (3)

x4(t) =km u(t)

II + Imr− bI x4(t)

II + Imr, (4)

where (see Fig. 2 for the kinematic scheme of the object)x is the state vector, x1 is the angle of the robot fromthe vertical, x2 is the angular velocity of the robot, x3 isthe rotation angle of the reaction wheel, x4 is the angularvelocity of the reaction wheel, u stands for control, ui isthe current of the i-th motor, mr is the weight of the robot,II is the moment of inertia of the reaction wheel, Imr1 isthe moment of inertia of the rotor of the motor 1, Irg isthe moment of inertia of the robot relative to the ground,hr is the distance from the ground to the center of massof the robot, g is the gravity of the Earth, kmi is the i-thmotor constant, br is the coefficient of friction in the robotrotation, bI is the coefficient of friction in the rotation ofthe reaction wheel, P1, P2 are contact points of the wheelswith the ground, C1 is the center of the rear wheel, C2 isthe center of the front wheel, COM stands for the centerof mass.

The above model takes centrifugal force, gravitationforce and also the reaction momentum from an inertialdrive into account. It has been derived with referenceto the rules described by Block et al. (2007), and yet,apart from taking the listed phenomena into account, itis a simplification of reality to some extent.

Ib

1u

II

3x

x

z

1P

2P

1C 2

C

rm

1mrI

rh

(a)

1x

rbrg

I

y

z

(b)

Fig. 2. Kinematic scheme of a bicycle robot: main view (a), sideview (b).

The system can be linearized by a Jacobian matrix to

Page 3: ROBUST LQR AND LQI CONTROL WITH ACTUATOR FAILURE OF A …pldml.icm.edu.pl/pldml/element/bwmeta1.element.bwn... · 2016. 6. 30. · dariusz.horla@put.poznan.pl Essential ingredients

Robust LQR and LQI control with actuator failure of a 2DOF unmanned bicycle robot. . . 327

the form x(t) = Alx(t) + b lu(t), where

Al =

⎡⎢⎢⎢⎢⎢⎣

0 1 0 0ghrmr

Irg− brIrg

0 − bIIrg

0 0 0 1

0 0 0 − bIII + Imr

⎤⎥⎥⎥⎥⎥⎦, (5)

b l =

⎡⎢⎢⎢⎢⎢⎣

0kmIrg0km

II + Imr

⎤⎥⎥⎥⎥⎥⎦, (6)

with the linealization point (units have been omittedhere) xl = [0.5, 0, 0, 0]T . The physical modelof the robot has the following parameters: mr =3.96 kg, hr = 0.13m, II = 0.0094 kg ·m2, Imr1 =0.0008 kg ·m2, Irg = 0.0931 kg ·m2, rw = 0.03m, g =9.80665m · s2, λ = π/2 rad, km1 = 0.421N ·m · A−1,br = 0.0013N ·m · s, bI = 0.0003N ·m · s, identifiedby means of previous experiments or appropriatemeasurements.

3.2. Discrete-time model of the plant. In orderto apply the controller on hardware, the followingmultivariable model of the linearized plant must beintroduced:

x t+1 = Ax t + but , (7)

yt= Cx t , (8)

where the appropriate matrices and vectors have knownsizes and are the result of step-invariant discretization ofthe continuous-time model of the plant (5), (6), with achosen sample period TS . The output vector y ∈ R

p, theconstrained control signal u, and the state vector x ∈ R

n

are in the discrete-time domain (denoted henceforth by thesubscript t).

4. Actuator failure models

The LQR-type approach to control a bicycle robotassuring some degree of robustness to actuator failuresis applied in the paper. In the proposed algorithm, thecontrol performance index is defined as in the standardLQR control (Kwakernaak and Sivan, 1972),

J =

∞∑t=0

(xT

tQx t +Ru2t

), (9)

with the weighing matrix Q ≥ 0, and R ≥ 0. In the mostgeneral case, one can assume the actuator failure model asin the work of Zuo et al. (2010), i.e.,

ukt = (1− ρkt )sat(vt; α), k = 1, . . . , gF , (10)

where ρkt is an unknown constant from the span that willbe defined in a further part of the text, index k denotesthe k-th failure model, and gF is the total number offailure models. The symbol uk

t refers to the constrainedcontrol signal, assuming that an actuator failure takesplace (in the other case, uk

t = vt). For any actuator failuremodel, including the situation for constraints imposed onthe control signal, the constant ρkt lies in ρk−,t ≤ ρkt ≤ρk+,t, and the function sat defines the method of applyingconstraints (e.g., a cut-off constraint).

Having taken a single model of failure into account,(10) can be transformed (Yang et al., 2000b; 2000a; Horlaand Królikowski, 2011) to

uFt = �vt , (11)

where

0 ≤ �− ≤ � ≤ �+, (12)

with �− ≤ 1 and �+ ≥ 1. This enables us tomimic uncertainty introduced by the linearized modelwith a kind of actuator failure—the control signal appliedto the system causes different behavior, as expected.Furthermore, for the robot considered, current constraintsintroduce additional failure-like behavior (inability togenerate computed, i.e., required, torques).

With reference to (10), �− = �+ means that there areno active constraints imposed on the control signal or nofailures have taken place, and uF

t = vt. The case �− > 0corresponds to a partial failure, and �− = 0 to the outagecase.

In order to streamline the further presentation, thefollowing notation has been adopted from Yang et al.(2000a) as well as Horla and Królikowski (2011):

ut = uFt . (13)

5. Control strategies considered

Firstly, the LQR control law of the form

vt = kTx t (14)

is considered, where x t is the sampled state vector fromthe robot, measured by means of a suitable combination offiltering and estimation of signals from the accelerometer,the gyroscope and the encoder. It refers to the equations ofthe nonlinear model of the plant (1)–(4). The latter controllaw is called reliable and assures that a specified value ofthe performance index (9) is not exceeded for the givenplant model (referring to the real-world control system), ifit is connected to a certain matrix P , the system (7), (8),and if P satisfies the inequality (Yang et al., 2000a; Horlaand Królikowski, 2011)

(A+ �bkT

)T

P(A+ �bkT

)

− P +R�2kkT +Q ≤ 0 . (15)

Page 4: ROBUST LQR AND LQI CONTROL WITH ACTUATOR FAILURE OF A …pldml.icm.edu.pl/pldml/element/bwmeta1.element.bwn... · 2016. 6. 30. · dariusz.horla@put.poznan.pl Essential ingredients

328 A. Owczarkowski and D. Horla

The closed-loop model of the system

x t+1 =(A+ �bkT

)x t (16)

is then stable, and the performance index over an infinitehorizon satisfies

J =∞∑t=0

xTt

(Q+R�2kkT

)x t ≤ xT

0Px 0 . (17)

If no robustness against the actuator failure is takeninto consideration, the optimal state-feedback vector k forthe control law (14) is derived as a solution of the set ofequations

kT = −(bTP b +R

)−1

bTPA , (18)

P = QATPA

−ATP b(bTP b+R

)−1

bTPA , (19)

with an optimal value Jk of the performance index (9),based on deriving k according to (18) and (19), being atthe same time the upper boundary of (17), given as

Jk = xT0Px 0 . (20)

Secondly, the LQI control law is also taken intoaccount,

v t = kTx t + kTI x I,t , (21)

where x t is the measured state vector of the robot, as inthe latter case, and x I,t is an appropriate integral of thestate vector, whereas kI for the described problem is azero vector with the third element with the value a �= 0(compensation of the gyroscopic drift in x3).

6. Derivation of an optimal state-feedbackvector in the case of an actuator failure

The following algorithm (Yang et al., 2000a) enablesderivation of the optimal state-feedback vector k toincrease the robustness of the system against actuatorfailure:

1. solve (19) with respect to P (mark the result as P ∗),and choose an arbitrary R0, satisfying

R0 ≤(bTP ∗b +R

)−1

; (22)

2. solve

P = Q+ATPA−ATP J0PA (23)

with respect to the stabilizing P and check thecondition

R0 ≤(bTP b+R

)−1

, (24)

where

J0 = b(1− γ2

0

) ((bTP b+R

) (1− γ2

0

)

+R−10 γ2

0

)−1bT , (25)

(γ0 will be defined in due course);

3. if the inequality (22) is satisfied for R0 and P , increaseR0 and proceed to Step 2; otherwise, decrease R0 andproceed to Step 2, checking if Step 4 is satisfied;

4. if the inequality (22) is satisfied for R0 and P , thestabilizing matrix P satisfies Eqn. (23), and there isno positive-definite solution for the pair R0 and P forarbitrary R′

0, where

R0 ≤ R′0 ≤

(bTP ∗b+R

)−1

,

then stop the algorithm; in this case, the state-feedbackvector is given as

kT = −γ−1(1−(

X−1−R0

)((1−γ2

0

)(26)

+ γ20R

−10 X−1

)−1γ20R

−10

)X−1bTPA ,

where X = bTP b+R.

The following notation has been adopted in the abovealgorithm (Yang et al., 2000a):

γ =�+ + �−

2, (27)

γ0 =�+ − �−�+ + �−

. (28)

The matrix P satisfying the equation

P = QATPA

−ATP b(bTP b+R

)−1

bTPA (29)

is called the stabilizing Ricatti solution to the Ricattiequation, and all eigenvalues of the matrix A−b

(bTP b+

R)−1

bTPA are inside the unit circle.In addition, we have (Yang et al., 2000b)

� = (1 + �0) γ , (30)

|�0| ≤ γ0 ≤ 1 , (31)

where

�0 =�− γ

γ.

Page 5: ROBUST LQR AND LQI CONTROL WITH ACTUATOR FAILURE OF A …pldml.icm.edu.pl/pldml/element/bwmeta1.element.bwn... · 2016. 6. 30. · dariusz.horla@put.poznan.pl Essential ingredients

Robust LQR and LQI control with actuator failure of a 2DOF unmanned bicycle robot. . . 329

7. Actuator failure vs. active constraintsand undermodeling errors

When the input signal of the actuator saturates, inamplitude-constrained control, it may be treated as aspecial case of actuator failure. In such a situation, it isassumed that γ = α and the constrained control signalbecomes (Horla and Królikowski, 2011)

uFt = sat

(kTx t;α

), (32)

where sat is a function that imposes constraints on thecontrol signal in the span of ±α.

A single actuator failure on the basis of (11) and (32)is presented in Fig. 3. The assumed failure model can beput in the form

uFt = sat (�vt;α) , (33)

as a compilation of the models (32) and (11) (the dashedarea in Fig. 3).

vt

uF

t

α

−α

−αtanϕ− = �−

tanϕ+ = �+

α

Fig. 3. Actuator failure and amplitude constraints.

The dashed cones in Fig. 3 might also depict theuncertainty introduced by using a linearized model insteadof a true nonlinear model of the plant, e.g., due to theexistence of the nonlinear static characteristic. Whenevercontrol signals computed for the linearized case failto result in expected changes in output signals due toundermodeling, one can treat this case as an actuatorfailure and use algorithms cited in this paper.

Figure 4 presents robustness areas for differentvalues of �+, �−, making it possible to visualize the“robustness area” of the control law applied and itsanalogy to the actuator failure case in the controller. Ashas been already mentioned, the more different the valuesof �+ and �−, the larger the cones.

8. Experimental comparison of controlstrategies

8.1. Experiment conditions. Figure 5 presents therobot during the experiment. As can be seen, there is a

vt

uF

t

(a)

vt

uFt

(b)

vt

uFt

(c)

vt

uFt

(d)

Fig. 4. Robust action areas for ρ+ = 1 + δ, ρ− = 1 − δ withδ = 0.2 (a), δ = 0.4 (b), δ = 0.6 (c), δ = 0.8 (d).

small block on the left which supports the constructionevery time right before the trigger. When the triggerappears, it immediately turns on the control system andthe data acquisition one. For the next few seconds, themachine restores its upright position and is trying to reachthe equilibrium point. All important signal values are

Page 6: ROBUST LQR AND LQI CONTROL WITH ACTUATOR FAILURE OF A …pldml.icm.edu.pl/pldml/element/bwmeta1.element.bwn... · 2016. 6. 30. · dariusz.horla@put.poznan.pl Essential ingredients

330 A. Owczarkowski and D. Horla

Fig. 5. Real robot during the experiment.

stored in the SD card with a frequency of 200 Hz. Theinitial deflection angle in every experiment is the same(x1,0 = −0.06 rad).

The controllers considered work in discrete-timewith their output fed to zero-order hold and based on thediscretised model of the linearized plant (5), (6).

During the experiment, the followingparameters have been fixed: TS = 0.02 s, Q =diag {1, 1, 0.01, 0.4}, R = 10, 0 ≤ δ ≤ 0.8, ρ+ = 1+δ,ρ− = 1 − δ, α = 2.1A, R0 = 0.1, kI = [0, 0, 0.2, 0]T ,x 0 = [−0.06 rad, 0 rad · s−1, 0 rad, 0 rad · s−1]T . Foreach of 18 values of δ, a series of 5 experiments wascarried out for both LQR and LQI controllers, producingtwo sets of 90 experiments. The results presentedin subsequent sections are mean values from everyexperiment series.

It is to be stressed that Q and R have beenchosen to assure that optimal k computed via a standardLQR procedure provides acceptable dynamics of theclosed-loop system, and guarantees that the control signaldoes not saturate frequently and the mechanics of themachinery are far from resonant-line behavior.

8.2. Performance indices. In order to verify thebehavior of the control system, the following twoperformance indexes were introduced:

Jxi =

N∑t=0

x2i,t (i = 1, . . . , 4) , (34)

Ju =N∑t=0

u2t , (35)

where N denotes the number of samples collected at eachmultiplicity of TS from the sampled-data control system.The experiment time was equal in all cases and set to 7.5 s.The first index is related to the energy carried by xi,t. In

the case of x1, which has to be stabilized at zero (uprightposition, unstable equilibrium point); x2 being the angularvelocity of the robot, x3 meaning the rotation angle of thereaction wheel (partially related to energy consumption ina steady-state) that also represents the gyroscopic drift andits velocity x4, small values of the corresponding indicesrepresent a smooth response to initial conditions.

8.3. Results. In Fig. 6, a series of plots is presented,derived from a set of 90 LQR experiments, whereas inFig. 7 the results for 90 LQI experiments are depicted.As can be seen, the implemented controllers allowstabilization of the nonlinear plant, i.e., an unmannedbicycle robot, in an unstable equilibrium point. It mustbe noticed that for the LQR controller the mean value ofthe control signal after t > 2 s is nonzero, which causesboth energy consumption and forces the reaction wheelto rotate slowly to compensate the gyroscopic drift seenin the plot of x3. The larger the value of δ, the greaterthe uncertainty information passed to the control system,which causes the control signal to change rapidly (thereis a 25 times increase in the mean variance of a set of 5experiments between δ = 0 and δ = 0.8, both for t > 2 s),and corresponds to the action of the controller to conformto the upper boundary (20). On the contrary, in the case ofthe LQI controller, this increase is 4 times only.

For the LQI controller, by integrating x3 and usingthis information in the control law, it is assured that themean value of the control signal in the time horizonas above is virtually zero, which results, for longerobservation, in no angular change in the rotation of thereaction wheel. For a shorter period of time, swayingbetween the integral of x3 and x1 visible. Nevertheless,especially in the case of the gyroscopic drift or its initialoffset, states x1 and x2 drift away from zero in longertime horizons in the LQR case (see Fig. 8), causing x3

to increase also; this is not the case for the LQI strategy,where no drift in x3 can be observed.

Mean performance indices for LQR and LQIcontrollers are presented in Fig. 9. The initial values,i.e. for δ = 0, related to plain LQR or LQI, are givenin Table 1 and reflect the assumption that the linearizedmodel (7), (8) is true. In comparison of these valueswith plots depicted in Fig. 9, it becomes obvious that byincorporating robustness against actuator failure, whichcan be treated as a discrepancy between the expectedbehavior of the robot with respect to its linearized modeland its true response to the control signal, the performanceindices are decreased in a certain span of δ. It is noticeablethat for approximately. δ = 0.1 the decrease in Jx1 isnearly 10% of the value for plain LQR, whereas in thecase of energy consumption it is reduced by over 30%.The same phenomenon has been observed in simulationsof the robot model in a sampled-data control systemwith the same sampling period, as reported by Horla and

Page 7: ROBUST LQR AND LQI CONTROL WITH ACTUATOR FAILURE OF A …pldml.icm.edu.pl/pldml/element/bwmeta1.element.bwn... · 2016. 6. 30. · dariusz.horla@put.poznan.pl Essential ingredients

Robust LQR and LQI control with actuator failure of a 2DOF unmanned bicycle robot. . . 331

0 2 4 6−0.1

−0.08

−0.06

−0.04

−0.02

0

0.02

0.04

t [s]

x 1(t)

[rad

]

0 2 4 6−0.2

−0.1

0

0.1

0.2

0.3

0.4

0.5

0.6

t [s]

x 2(t)

[rad

/s]

0 2 4 6−6

−4

−2

0

2

4

t [s]

x 3(t)

[rad

]

0 2 4 6−5

0

5

10

t [s]

x 4(t)

[rad

/s]

0 2 4 6−3

−2

−1

0

1

2

3

t [s]

u(t)

[A]

Fig. 6. Data set derived from a set of 90 LQR experiments.

0 2 4 6−0.1

−0.08

−0.06

−0.04

−0.02

0

0.02

0.04

t [s]

x 1(t)

[rad

]

0 2 4 6−0.2

−0.1

0

0.1

0.2

0.3

0.4

0.5

0.6

t [s]

x 2(t)

[rad

/s]

0 2 4 6−6

−4

−2

0

2

4

t [s]

x 3(t)

[rad

]

0 2 4 6−5

0

5

10

t [s]

x 4(t)

[rad

/s]

0 2 4 6−3

−2

−1

0

1

2

3

t [s]

u(t)

[A]

Fig. 7. Data set derived from a set of 90 LQI experiments.

Page 8: ROBUST LQR AND LQI CONTROL WITH ACTUATOR FAILURE OF A …pldml.icm.edu.pl/pldml/element/bwmeta1.element.bwn... · 2016. 6. 30. · dariusz.horla@put.poznan.pl Essential ingredients

332 A. Owczarkowski and D. Horla

0 10 20 30−0.1

−0.08

−0.06

−0.04

−0.02

0

0.02

0.04

t [s]

x 1(t)

[rad

]

0 10 20 30−0.2

−0.1

0

0.1

0.2

0.3

0.4

0.5

0.6

t [s]

x 2(t)

[rad

/s]

0 10 20 30−20

0

20

40

60

80

100

120

t [s]

x 3(t)

[rad

]

0 10 20 30−5

0

5

10

t [s]

x 4(t)

[rad

/s]

0 10 20 30−3

−2

−1

0

1

2

3

t [s]

u(t)

[A]

Fig. 8. Gyroscopic drift in the case of its initial offset and LQR control.

0 0.2 0.4 0.6 0.80.125

0.13

0.135

0.14

0.145

0.15

0.155

Jx

1

δ0 0.2 0.4 0.6 0.8

4.5

5

5.5

6

Jx

2

δ

0 0.2 0.4 0.6 0.81000

2000

3000

4000

5000

6000

Jx

3

δ

0 0.2 0.4 0.6 0.82000

2500

3000

3500

4000

4500

Jx

4

δ0 0.2 0.4 0.6 0.8

4000

5000

6000

7000

8000

9000

10000

Ju

δ0 0.2 0.4 0.6 0.8

−200

−150

−100

−50

0

50

100

δ

k1

k2

1000k3

100k4

LQRLQI

Fig. 9. Mean performance indices for the control laws considered (units have been omitted).

Page 9: ROBUST LQR AND LQI CONTROL WITH ACTUATOR FAILURE OF A …pldml.icm.edu.pl/pldml/element/bwmeta1.element.bwn... · 2016. 6. 30. · dariusz.horla@put.poznan.pl Essential ingredients

Robust LQR and LQI control with actuator failure of a 2DOF unmanned bicycle robot. . . 333

Owczarkowski (2015), with a similar increase in otherperformance indices in the case of LQI with respect toLQR control.

From the plot of Ju it can be observed that there isa range of values of δ for which the performance indexis small with respect to its value for δ = 0, whereas forlarge values of δ it increases and causes abrupt changes insignals as reported above, resulting in an increase in otherperformance indices. This is related to the inability of thecontrol law to encompass such a degree of uncertaintywhich is ‘fed’ to the control law. The rapid changesin signals for large δs are a result of an increase incomponents of k, as in the last plot in Fig. 9.

Table 1. Values of performance indices for δ = 0.control law Jx1 Jx2 Jx3 Jx4 Ju

LQR 0.1418 5.4680 3047 2944 5998LQI 0.1341 5.6187 2926 3716 6648

9. Conclusions

By introduction of an actuator failure model into LQRand LQI control laws it is possible to reduce performanceindices, especially energy consumption. This reductionis mainly due to using a linearized robot model in asampled-data controller and by using the notion of theactuator failure. Since it might be treated as a means tomimic modeling imperfections, as well as the presenceof nonlinearities in the control system, by increasingthe robustness of the control system we obtain slowertransients and the bicycle robot stabilizes at an unstableequilibrium point. In addition, by introducing an integralterm to LQR control, the gyroscopic drift is eliminated,causing again smaller energy consumption in a longertime horizon of the experiment.

In further research, it should be verified if a similarapproach enables us to improve LQR controller resultsin the case of a 4DOF robot, enabling us to modify thesteering wheel angular change, as well as the linear speedof the robot.

The presented approach is computationally simpleand requires only necessary calculations of k off-line andstoring them for future use, which makes it applicable toreal-time control systems.

ReferencesBlock, D.J., Åström, K.I. and Spong, M.W. (2007).

The reaction wheel pendulum, Synthesis Lectureson Controls and Mechatronics 1(1): 1–105, DOI:10.2200/S00085ED1V01Y200702CRM001.

Chang, S. and Peng, T. (1972). Adaptive guaranteed costcontrol of systems with uncertain parameters, IEEE Trans-actions on Automatic Control 17(4): 474–483, DOI:10.1109/TAC.1972.1100037.

Drapikowski, P., Goslinski, J. and Owczarkowski, A. (2012).Control and model parameters identification of inertiawheel pendulum, Proceedings of the 9th InternationalConference on Informatics in Control (ICINCO), Rome,Italy, pp. 574–579.

Horla, D. and Królikowski, A. (2011). Discrete-time LQGcontrol with actuator failure, Proceedings of the 8th Inter-national Conference on Informatics in Control, Automa-tion and Robotics, Noordwijkerhout, The Netherlands, (onCD–ROM).

Horla, D. and Owczarkowski, A. (2015). Robust LQRwith actuator failure control strategies for 4DoF modelof unmanned bicycle robot stabilised by inertial wheel,2015 International Conference on Industrial Engineer-ing and Systems Management (IESM), Seville, Spain, pp.998–1003, DOI: 10.1109/IESM.2015.7380276.

Kwakernaak, H. and Sivan, R. (1972). Linear Optimal ControlSystems, Wiley-Interscience, Hoboken, NJ.

Owczarkowski, A., Lis, M. and Kozierski, P. (2014). Trackingcontrol of an inertial wheel pendulum by LQR regulation,Proceedings of the 19th International Conference on Meth-ods and Models in Automation and Robotics, Miedzyz-droje, Poland, (on CD-ROM).

Petersen, I. and McFarlane, D. (1992). Optimizing theguaranteed cost in the control of uncertain systems, in M.Mansour et al. (Eds.), Robustness of Dynamical Systemswith Parameter Uncertainties, Brikhäuser, Boston, MA,pp. 241–250.

Smerpitak, K., Ukakimparn, P., Trisuwananwat, T. andTrakoonkootaworn, S. (2012). An unmanned bicycleversus linear quadratic optimal controls, Proceedings of the12th International Conference on Control, Automation andSystems, JeJu Island, Korea, pp. 1337–1341.

Xie, L. and Soh, Y. (1995). Guaranteed cost control of uncertaindiscrete-time systems, Control Theory and Advanced Tech-nology 10(4): 1235–1251.

Yang, J., Lee, S., Kim, S., Lee, Y. and Kwon, O. (2011).Linear controller design for circular motion of unmannedbicycle, Proceedings of the 11th International Conferenceon Control, Automation and Systems, Gyeonggi-do, Korea,pp. 893–897.

Yang, Y., Wang, J. and Soh, Y. (2000a). Reliable LQG controlwith sensor failures, IEE Proceedings: Control Theory andApplications 147(4): 433–439.

Yang, Y., Yang, G. and Soh, Y. (2000b). Reliable control ofdiscrete-time systems with actuator failures, IEE Proceed-ings: Control Theory and Applications 147(4): 428–432.

Zuo, Z., Ho, D. and Wang, Y. (2010). Fault tolerant controlfor singular systems with actuator saturation and nonlinearperturbation, Automatica 46(3): 569–576.

Page 10: ROBUST LQR AND LQI CONTROL WITH ACTUATOR FAILURE OF A …pldml.icm.edu.pl/pldml/element/bwmeta1.element.bwn... · 2016. 6. 30. · dariusz.horla@put.poznan.pl Essential ingredients

334 A. Owczarkowski and D. Horla

Adam Owczarkowski is finishing his Ph.D.studies at Poznan University of Technology, Fac-ulty of Electrical Engineering, Institute of Con-trol and Information Engineering. He holds anM.Sc. degree in control from the Poznan Univer-sity of Technology. He focuses his work on con-trol strategies for underactuated robots and is alsointerested in linear quadratic control, non-linearcontrol, object modeling, embedded system pro-gramming and machine intelligence.

Dariusz Horla is with the Institute of Controland Information Engineering of the Poznan Uni-versity of Technology. He received his M.Sc. in2002, his Ph.D. in 2005 and his D.Sc. in 2013,all from the Faculty of Electrical Engineering ofthe Poznan University of Technology. His re-search interests include anti-windup compensa-tion, optimal control, linear matrix inequalities,optimization theory, and adaptive control.

Received: 30 March 2015Revised: 8 July 2015Accepted: 9 October 2015