You are on page 1of 12

Automatica. Vol.23, No. 2, pp. 149 160.1987 0005 1098/87$3.00+0.

00
Printed in Great Britain. PergamonJournalsLtd.
© 1987InternationalFederationof AutomaticControl

Generalized Predictive Control Part II.


Extensions and Interpretations*
D. W. CLARKFt, C. MOHTADIt and P. S. TUFFS~

By relating a novel predictive controller to LQ designs stability theorems are deduced,


and extensions to Generalized Predictive Control give model following, state-dead-beat
and pole-placement control objectives.

Key Words--Self-tuning control; predictive control; model-reference control; LQ control; dead-beat


control; nonminimum-phase plant; variable dead-time.

Abstract--The original GMV self-tuner was later extended example, in process control it is generally found
to provide a general framework which included feedforward that energetic control signals are undesirable, a
compensation and user-chosen polynomials with detuned mod-
el-reference, optimal Smith predictor and load-disturbance tail- slowly responding loop being prefered, and plant
oring objectives. This paper adds similar refinements to the models are poorly specified in terms of dead-
GPC algorithm which are illustrated by a set of simulations. time and order as well as their transfer function
The relationship between GPC and LQ designs is investigated
to show the computational advantage of the new approach. The parameters. Emphasis is therefore placed on robust
roles of the output and control horizons are explored for and consistent performance despite variations in
processes with nonminimum-phase, unstable and variable dead- quantities such as dead-time and despite sustained
time models. The robustness of the GPC approach to model
over- and under-parameterization and to fast sampling rates is load-disturbances. High-performance electromech-
demonstrated by further simulations. An appendix derives anical systems tend to have well-understood models
stability results showing that certain choices of control and though often with lightly-damped poles, and the
output horizons in GPC lead to cheap LQ, "mean-lever', state-
dead-beat and pole-placement controllers. control requirement is for fast response, accepting
the fact that the actuation might saturate. It is
1. I N T R O D U C T I O N doubtful whether a single criterion as in (1) can
THE BASIC GPC method developed in Part I is a deal with such a wide range of problems, so to
natural successor to the GMV algorithm of Clarke create an effective general-purpose self-tuner it is
and Gawthrop (1975) in which a cost-function of essential to be able to adapt the basic approach by
the form: the incorporation of "tuning-knobs".
The GMV design was developed into a useful
JoMv = E{(y(t + k) -- w(t))2 + 2uZ(t)[t} (1) self-tuning algorithm by the addition of user-chosen
transfer functions P(q-1), Q(q-1) and an observer
was first defined and minimized. The cost of (1) is polynomial T(q-1). These time-domain perfor-
single-stage and so it is found that effective control mance-oriented "knobs" allow the engineer to tac-
depends on knowledge of the dead-time k of the kle different control problems within the same
plant and for nonminimum-phase plant stability overall scheme. For example, Gawthrop (1977) and
requires a nonzero value of 2. The use of long- Clarke and Gawthrop (1979) showed that model-
range prediction and a multi-stage cost in GPC following, detuned model-following, and optimal
overcomes the problem of stabilizing a nonmini- Smith prediction were interpretations which could
mum-phase plant with unknown or variable dead- be invoked as well as the original control weighting
time. concept of (1). Clarke (1982) and Tufts (1984) give
The relative importance of controller perfor- further examples of the use of these polynomials in
mance criteria varies with the application area. For practice.
This paper introduces similar polynomials to
* Received 2 March 1985; revised 7 July 1985; revised 3 March GPC for specifying a desired closed-loop model
1986; revised 22 September 1986. The original version of this and for tailoring the controlled responses to load
paper was not presented at any IFAC meeting. This paper was
recommended for publication in revised form by Associate disturbances, and the derivation of GPC is
Editor M. Gevers under the direction of Editor P. C. Parks. expanded to include the more general CARIMA
"t Department of Engineering Science, Parks Road, Oxford model. The properties of these extensions are ver-
OX1 3PJ, U.K.
:~Aluminum Company of America, Alcoa Technical Center, ified by simulations, which also show the robustness
Pittsburgh, PA 15069, U.S.A. of the method to a range of practical problems such
149
150 D.W. CLARKEet al.

as model over- and under-parameterization. An Consider the Diophantine identity:


appendix relates GPC with a state-space LQ con-
trol law and derives stability results for two uses _~_
of GPC: large output and control horizons N 2 and Pn = EsAA + q iF: j = 1,2, (3)
Pd Yd ....
N U leading to a cheap LQ controller, and N U = 1
giving a "mean-level" controller for stable plants
Multiplying (2b) by qJEjA and following the same
which need not be minimum-phase. Moreover,
route as in Part I, we obtain:
the relations between state-dead-beat control and
deterministic pole-assignment with GPC are exam-
ined. ~(t + jlt) = GjAu(t + j - 1)
+ F y ( t ) / P d ( q - x) (4)
where Gj = EjB.
2. EXTENSIONS TO GENERALIZED PREDICTIVE
CONTROL
The Diophantine recursion equations developed
2.1. Model-following: P(q- x)
earlier are identical to those involved here but
Generalized Predicitive Control as described in simply the starting point is different:
Part I is based on minimization of a set of predicted
system errors based on available input-output
Pn(O)
data, with some constraints placed on the projected E1 - Pd(O)' F1 = q(Pn - E1A), andA = AAPd.
control signals. It is possible to use an auxiliary
function of the output as in the GMV development
and consider the predicted system errors associated The transfer function P(q- 1) has two distinct inter-
with this pseudo-output and the projected set- pretations depending on the particular application
points. and on the control strategy envisaged. For process
Consider the auxiliary output: control, when dealing with a "simple" plant the
primary design "knobs" are N 2 and N U . If the
t~(t) = P(q-1)y(t) where process output has a large overshoot to set-point
p(q- 1) = Pn(q- 1)/Pd(q- 1). and load changes, P(q-1) can be used to penalize
this overshoot. In high-performance applications
p(q-1) is a transfer-function given by polynomials N U is chosen larger than unity by an amount
Pn and Pd in the backward shift operator q- 1 with depending on the complexity of the plant. P(q- 1)
P(1) set to unity to ensure offset-free control. The can then be interpreted as the "approximate inverse
cost that the controller minimizes is the expectation closed-loop model" and when 2 = 0 and N 2
subject to data available at time t of: = N U >~ k the relationship is exact. For the noise-
free case this implies that the closed-loop response
to changes in w(t) is given by:
tJ = N 1

+ ~ 2(j)[Au(t + j -- 1)] 2 } 1
y(t) ~- -pW(t - k) = M ( q - 1)w(t - k)

where M ( q - 1) is the user-chosen closed-loop model.


where:
The model-following properties are detuned in
cases where N U < N2, but for large N U the change
O(t) is P(q- 1).~t);
in performance is slight without the associated
N1 is the minimum costing horizon;
problem of nonminimum-phase zeros in the esti-
N2 is the maximum costing horizon, and
;~(j) mated plant which destabilize the usual model-
is a control-weighting sequence.
reference controllers. It is possible to achieve model-
The prediction equations given in Part I must following (as in many MRAC approaches) by using
therefore be modified to forecast @(t + j) instead of a prefilter M ( q - 1), giving an intermediate set-point
3~t +j). w'(t) = M(q-1)w(t), and with a minimum-variance
The locally-linearized plant model to be consid- control regulating 3~t) about w'(t). However, the
ered is based on an ARIMA representation of the disturbance rejection properties and the overall
disturbances: robustness are not then affected by the model as it
simply operates on the set-point which is independ-
A(q-1)y(t) = B(q-1)u(t - 1) + C(q-1)~(t)/A (2b) ent of variations within the loop. GPC in effect has
an inverse series model whose output is affected by
and again for simplicity of derivation it is assumed both w(t) and the disturbances and so is a more
that C(q-~) = 1. practical approach.
Generalized predictive control--Part II 151

2.2. Coloured noise C(q-1) and the design recursion equation outlined in Appendix A. Combi-
polynomial T(q - 1) ning (6) and (7) gives:
Most practical processes have more than one
disturbance or noise source acting on them to give j)(t +jlt) = GjAu(t + j - 1) + FjAuY(t - 1) (8)
an effective plant model: + Fyf(t).

B C1 C. The minimization procedure to provide the optimal


y(t) = u(t - k) + ~-A~l(t) + ... + ~--~¢.(t).
control sequence is then as given in Part I.
The choice of T(q -1) follows the procedure
The noise components can be combined into a adopted in GMV designs (Clarke, 1982; Tufts,
C 1984). If a constrained minimum-variance solution
single random sequence-~-A¢(t ) where the noise-
is required C(q -1) must be estimated and T put
colouring polynomial C(q -1) has all of its roots equal to C, but for most practical applications T
within the unit circle provided that at least one can be taken as a fixed first-order polynomial where
noise element has nonzero-mean and is persistently 1/Tis a low-pass filter.
exciting. Note that C(q- 1) is a time-invariant poly-
nomial only if the individual noise variances tr2 3. RELATIONOF GPC WITH STATE-SPACE LQ
remain constant. However, with a typical industrial DESIGN
process this will rarely hold in practice, so successful Any linear controller may be implemented in a
identification of C(q- 1) is unlikely. If the structure state-space framework by the appropriate state
of the variations cannot be estimated on-line, a transformations and if the control scheme mini-
design polynomial T(q- 1) can be used to represent mizes a quadratic cost Riccati iteration may be
prior knowledge about the process noise. employed. Details of such a controller based on a
One interpretation of T(q- 1) is as a fixed observer CARIMA model representation are given elsewhere
for the prediction of future (pseudo-) outputs (Clarke et al., 1985). In order to consider stability
(Astr6m and Wittenmark, 1984). In particular, if and numerical properties, however, inclusion of
T(q-1) = C(q-1) and the stochastic model of (2b) disturbances is not necessary, but the insights based
is valid, then the predictions are asymptotically on well-known state-space controller design can be
optimal (minimum-variance) and the controller will applied to the GPC method. This is explored in
minimize the variance of the output subject to the detail in Appendix B, where in particular it is shown
prespecified constraints on the input and output how various choices of control and output horizons
sequences. The following demonstrates the inclu- lead to cheap L Q, "mean-level', state-dead-beat
sion of T(q- 1) or C(q- i) in the GPC control scheme and pole-placement controllers.
with the full CARIMA model. Consider the plant:
As before, defining a Diophantine identity:
AAy(t) = BAu(t - 1).
T(q-1) = EjAA + q-JFj (5)
Its state-space representation in observable canon-
and proceeding in the usual manner we obtain: ical form may be written as:

T(q-1)P(t +jlt) = GjAu(t + j - I) + Fy(t) x(t + 1) = Ax(t) + bAu(t) + if(t) (9)


y(t) = eTx(t) + W(t).
or:
The matrix A and the vectors b, e are defined in
~(t + j[t) = GiAuf(t + j - 1) + F yY(t) (6) Appendix B. The vector ff is associated with the
set-point (see Clarke et al., 1985). For this model
where "Y" denotes a quantity filtered by 1/T(q-1). the multi-stage cost of G P C becomes:
The constraints and the cost are in terms of
Au(t + j) for j = 0, 1.... rather than Au y and there- J = x(t + N2)Tx(t + N2)
fore the prediction equation must be modified. N2+t- 1
Consider the following: + ~ [x(i)rQx(i) + 2(i)Au(i)2]. (lO)
i=t

G~(q -~) = G'j~q-')T(q-') + q-~F~(q-'). (7)


The control giving minimum cost is therefore:
The coefficients of G' are those of G where the
initial identity of (5) has T = 1. These coefficients Au(t) = kT~(t[t)
together with those of Fj can be found by the k T = (2(0 + bTp(t)b) - ~bTp(t)A
152 [). w. CLARKF et al.

where P(t) is given by the backwards iteration of 4. S I M U L A T I O N EX A MP LES


the following Riccati equation starting from the Simulations were performed to demonstrate the
terminal covariance Q: effect of the design features of G P C using a self-
tuning control package FAUST (Tufts and Clarke,
P*(i) = P(i + 1) - P(i + 1)b().(i) 1985). Two principal types were undertaken: in one
case the plant was constant and the exercise was
+ bTP(i + 1)b) lbTP(i + 1) (11)
intended to show the effect of changing one of the
P(i) = ATP*(i)A + Q
design "knobs" on the transient response, whilst
Q = [1,0,0 ..... 0]T[1,0,0 ..... o]. (12) the second set involved a time-varying plant and
the objective was to show the robustness of the
For the cases where N U < N 2 (i.e. when some adaptive use of GPC. Some simulations used con-
of the control increments in the future are assumed tinuous-time models to illustrate the effect of sam-
to be zero), the value of 2(i) is time-varying, as pling, though in all cases the estimated model
fixing the control signal is equivalent to employing used in the self-tuned version of G P C was in the
a very large penalty on the particular control equivalent discrete-time form.
increment. In this way all combinations of G P C The parameters of the A and B polynomials were
may be implemented in a state-space framework. estimated by a standard UDU version of RLS
Recall, however, that because the states are not (Bierman, 1977) using the incremental model:
accessible a state-observer or state-reconstruction
scheme must be employed. Lain (1980) uses a y:(t) = yY(t - 1) + q(1 - A(q-~))Ay:(t)
transmittance matrix and his approach was + B(q- ~)Auf(t) + D(q 1)AvY(t)+ e,(t)
employed by Clarke et al. (1985) in their LQ self-
tuner. Three points are of relevance. where "f" denotes signals filtered by 1/T(q-~), if
(i) The stability properties of G P C and the used. For simplicity of exposition a fixed forgetting-
deterministic LQ method with a finite horizon of factor was adopted whose value was normally one
predictions are identical. Appendix B examines the (no forgetting), unless otherwise stated. The signal
stability properties for special cases of the G P C v(t) is a measured disturbance signal (feedforward)
settings. which, as with GMV, can readily be added to GPC.
(ii) It is known that numerical properties of LQ The parameter estimates were initialized in the
in the state-space formulation are good though its simulations with 6o equal to one and the rest equal
drawback is in execution time. The G P C approach to zero and with the covariance matrix set to
requires the inversion of (GTG + 21) which can be diag{10}.
done using U D U factorization. Vectors f and w The figures consist of two sets of graphs covering
need never be formed since the multiplication of the behaviour over 400 or 800 samples, one showing
GT(w -- f) can also be implemented recursively at the set-point w(t) together with the plant output
the same time as that of the G and F parameters. }~t), and the other showing the control signal u(t)
In the LQ case the measurement-update equation and possibly a feedforward signal v(t). The scales
(11) can be done using U D U and (12) can be done for each graph are shown on the axes; in all
via a modified weighted G r a m - S c h m i d t algorithm. examples the control was limited to lie in the
For large N U both methods required N U iterations range [ - 100,100]. Load-disturbances were of two
of a U D U algorithm. Assuming the time-updates principal types, one (called "dcu") consisted of steps
of (12) are equivalent to calculating Gs and Fs in in d(t) for the model:
the G P C approach, the burden of computing the
feedback gains will be approximately equal. How-
d(q- l).~(t)= B(q-l)u(t - 1) + d(t)
ever, with G P C the top row of (GVG + 21) ~ is
simply multiplied by the vector GT(w - fl. In the
and the other ("dcy') of steps in d(t) in the model:
LQ case typically three calls to the transmittance
matrix routines are required to obtain state esti-
mates which are avoided by GPC. For NU = 1 A(q 1)y(t)-- B(q 1)u(t- 1) + A(q ~)d(t).
inversion is a scalar calculation whilst for LQ
matrix operations in addition to state estimation The dcy disturbance, equivalent to an additive step
are required. Hence the "one shot" algorithm of on the plant output, is a particularly severe test of
G P C is computationally less demanding. an adaptive algorithm.
(iii) The Riccati equation implementation implic-
itly assumes that there are no rate or amplitude 4.1. The effect of P(q- 1)
limits on the control signal. With G P C it is possible These simulations were designed to demonstrate
to perform a constrained optimization which the uses of P: penalizing overshoot (process control)
includes these limits. and model-following (high-performance role).
Generalized predictive c o n t r o l - - P a r t II 153

IOO% Set-point (W) Output ( Y )

L_A . . . . . L,, /'L


~ ~ n "v'- Vv-- -
P changing

0 °/* I ! !
0 400

I00 *A Control signal and feed-forward

05

Feed-forward signor

O *A I I I I . • , I I
400

FIG. 1. The effectof the P transfer-functionon closed-loopperformance.

The first plant simulated was the third-order hand, was unaffected by the change made in P.
oscillator: For the second example shown in Fig. 2, consider
the double-oscillator plant whose transfer function
(1 + sX1 + s2)Xt) = u(t) + d(t) + v(t), is given by:

d (1 + s2X1 + 1.5s2)~(t)= u(t).


where s is the differential operator ~-, sampled at
1 s intervals. Also, 3 A(q- 1), 3 B(q- 1) and 3 D(q- 1) The model chosen had Pn(q -1) = ( 1 - 0.5q-l) 2,
(feed-forward) parameters were estimated, with an and Pd(q -1) set to 0.2778(1 - 0 . 1 q - 1 ) . A constant
assumed delay of unity. The horizon N2 was chosen control signal of one unit was applied for the first
as 10 samples and NU = 1. The control signal was 10 samples after which it was set to zero for
fixed at 20 units for the first 30 samples while the 70 samples. The estimator was enabled from the
estimator was enabled. Two set-point changes of begining of the run, whereas the controller was
20 units each were applied at intervals of 30 samples. switched in the loop at the 80th sample and the
Unmeasurable step load-disturbances dcu of + 10 set-point was also changed to 20 units at that
units were added at t = 150, 180 and t = 210, 240. instant. NU was set to two at the start of the
Measurable step load-disturbances v(t) of the same simulation and as seen this choice did not give
size were added at t = 90, 120 and t = 280, 310. good model-following characteristics although the
P(q-~) was initially set to unity. As seen in Fig. 1, closed-loop was stable. NU was set to four at the
the set-point response and the load-disturbance downward set-point change to 20 units and at each
rejection although stable, had excessive overshoot, subsequent downward set-point change to 20 units
but once the feed-forward parameters were tuned NU was incremented by two. Clearly the control
the feed-forward disturbance rejection was almost ringing increased with the increase in NU but the
exact. Note that in order to include feed-forward designed model-following closed-loop performance
in the prediction equations a model of the form: remained almost the same for N U > 4 (the number
of plant poles). If desired, the control modes can
A(q- 1)Ay(t) = B(q- l)Au(t - 1) be damped using a nonzero value of 2 to give "fine-
+ D(q-1)Av(t - I) + ~(t) tuning" as in GMV.

is assumed where Av(t + j) = 0 for N2 > j > 0. 4.2. The effect of T(q -1)
For the second half of the simulation P(q-1) A second-order plant with time-delay was simu-
was set to ( 1 - 0.8q-1)/0.2 at t = 190. Both the lated where:
disturbance rejection and the set-point responses
were thereby detuned and the overshoot removed (1 + 15s + 50s2)3~t) = e-2Su(t) + lOd(t)
altogether. Feed-forward rejection, on the other + (1 + 15s + 50s2)dc~t) + (1 + 15s + 50s2)~(t)
AUT 2 3 / 2 - S
154 D . W . CLARKE et al.

60 °/* Set-point (W)

~ A A ^ . , A A ^ . ~
-10%
, O v " v , VV'w . . . . l I
400

iooo/.,,.illControt signor

0* I
I

/
J
_100°/o ] I •
0 400

FIG. 2. (Detuned) model-following using P(q J).

in which ~(t) was an uncorrelated random sequence whether G P C suffers from this problem:
with zero mean and RMS value of two units for
210 < t < 240. A step-disturbance dcy of three units (1 + 10s)3j~t) = u(t).
was added to the output at 150 < t < 180. Another
step-disturbance dcu, exciting all of the modes of Five A and five B parameters were estimated; N1
the plant, of + 10 units was added at 120 < t < 150 was chosen to be 1 and N 2 was initially set to 10
and 270 < t < 300. Two step changes in set-point but doubled at every upward-going step in w(t) to
at the end of a period of regulation were employed 50. A sampling time of 1 s was chosen; note that
to see the effect of disturbances on the servo the settling time of the plant is about 160s. Figure
performance of the controller, and three A and four 5 shows that the initial control was stable but had
B parameters were estimated with an assumed delay a small ringing mode and attained the imposed
of unity. saturation limits. When an output horizon of 20
The initial set-point response and subsequent samples was chosen this mode was removed. At
load-disturbance rejection were good, as seen in N 2 = 40 (the rise-time of the plant) the control was
Fig. 3. The rejection of dey, on the other hand, was much smoother. Increasing the horizon of output
very active initially and inconsistent in the second prediction to the settling time of the plant
change of load. This was due to dynamic parameter (N2 = 160) caused the speed of the closed-loop
changes caused by not estimating parameters asso- under this condition to be almost the same as that
ciated with the noise structure. Subsequent behavi- of the open-loop, verifying the "mean-level" theory
our based on the poor model was not very good of Appendix B. In all cases, then, the responses
as the control was far too active. were smooth despite the rapid sampling.
In the second simulation shown in Fig. 4, T(q - 1)
was chosen to be (1 - 0.8q-1). Note that although 4.4. Over-parameterization
T improved the disturbance rejection of the closed- One of the problems with many adaptive control
loop it had no effect on the set-point response. In schemes is that an exact knowledge of the model
addition, since the parameter estimator was better order is required; of particular interest is the ability
conditioned in the second case, the final set-point to over-parameterize the plant parameter estimator
responses were almost identical to the initial ones. in order to model the plant well in case of dynamic
changes.
4.3. The effect of N2 and the sampling period A first-order plant was simulated in discrete time:
One of the major criticisms of digital controllers
is that most designs only work well if the sampling (1 - 0.9q 1)34t) = u(t - 1).
period is chosen carefully (approximately 1/4 to
1/10 of the settling-time of the plant). A slow plant Estimation was disabled and the parameters were
with three real poles was chosen to investigate fixed a priori to the required values given below.
Generalized predictive c o n t r o l - - P a r t II 155

iO0 *A Output ( Y )
Set-point (WI

!50°/

0O/o I i I i I
0 40O

Control signal

_,oo
ii1
ioo'A

- t
i,-
I Pi
! I I I I I
0 400

FIG. 3. The control of a plant with additive disturbances (without the Tpolynomial).

I 0 0 */.
Output (Y)

Set-point ( W )

\ /k_
50 "/ W-'v =

0 °/o f, ! ! - I i I I I
400

Control signal

0 400

FIG, 4. The control of a plant with additive disturbances (with the Tpolynomial).

Initially a c o m m o n factor of (1 + 2q-1) was set repeated. Figure 6 shows that in all cases the control
between the estimated A and B polynomials, giving: performance of G P C was unaffected by the c o m m o n
factor.
.~(q-1) = 1 + 1.1q -1 -- 1.8q -2
/~(q - 1) = 1 + 2q - 1.
4.5. Under-parameterization
Most industrial processes are nonlinear and
NU was set to one at the set-point change from 0 therefore may only be approximated by high-order
to 20, to two for the change 20-40, four for the linear models. A good choice of sample-rate, on the
change 4 0 - 2 0 and finally NU = 10 for the change other hand, enables the designer to use low-order
20-0. The c o m m o n root was then moved to - 0 . 5 , models for control: slow sampling masks the high-
0.5 and 2 in succession and the transient test order fast dynamics.
156 D . W . CLARKE et al,

I 0 0 °,~
Set-point (W) Output ( Y }

50 */

0 */* | I I I I I I ,I |
800

Control signal
I00

~-]r F L__/---I_
-IO0
I[I,II. ,Y. I I I
800

FIG. 5. The effect of fast sampling and the prediction horizon.

50 *A

0 */,
J Ltpul lI,J I
I
40C

Control signal
2 5 */*

J,. ;
ii 11 II-II
0 */

-25*/. I t I I I ! I
0 400

FIG. 6. The effect of common factors in the estimated parameters.

Consider the fourth-order plant: Fig. 7, offset-free control was achieved. The overall
performance was good despite the wrong parame-
(1 + s)2(1 + 3S)2)~t) = u(t) + d(t). terization; the overshoot could have been reduced
using P(q-1) as shown in the previous sections.
A second-order model was assumed and the plant
4.6. Unknown or variable time-delay
was sampled at 1 s intervals; note that the sampling
The G M V design is sensitive to choice of dead-
process was not masking the slightly faster poles.
time; G P C is however robust provided that 6B is
In this case two A and three B parameters were
chosen to absorb any change in the time-delay.
estimated and the assumed time-delay was unity.
Consider the plant:
N2 was set to 10 and N U was set to one. The set-
point sequence was a square wave with a period of (1 - 1.1q- 1))~t) = - ( 0 . 1 + 0 . 2 q - l)u(t - k)
40 samples. Load-disturbances of 10 and 20 units
were added at the marked times and as shown in where k = 1, 2, 3, 4, 5 at different stages in the trial.
Generalized predictive control--Part II 157

IO0%
Set-point (W) Output { Y )

5OO/

0 "/o
O 4o0

Control signal

0% " IL. _ .. __

I00 / I I I I I I

FIG. 7. The effect of under-parameterization.

The value of k was changed at the downward- applied to a varying dead-time plant which is a
going steps in set-point increasing initially from requirement of GMV designs.
one to five and then decreasing from five back to The simulations show that GPC can cope with
one again. The adaptive controller estimated two the control of complex processes under realistic
A and six B parameters and a scalar forgetting- conditions. As it is relatively insensitive to basic
factor of 0.9 was employed to enable tracking of assumptions (model order, etc.) about the process,
variations in the dead-time. N 2 was set to 10 and GPC can be easily applied in practice without a
N U to one. The performance of GPC shown in prolonged design phase. These features ensure that
Fig. 8 is good; note that the plant was both the method provides an effective approach to the
nonminimum-phase and open-loop unstable with adaptive control of an industrial plant.
variable dead-time, yet stable control was achieved
with the default settings of this algorithm.
REFERENCES
5. CONCLUSIONS /~strrm, K. J. and B. Wittenmark, (1984). Computer Controlled
This paper has shown that GPC can be equipped Systems--Theory and Design. Prentice-Hall, Englewood
Cliffs, NJ.
with the design features of the well-known GMV Bierman, G. J. (1977). F actorization Methods for Discrete System
approach and given a wide range of possible Estimation. Academic Press, New York.
control objectives, which can be interpreted by its Clarke, D. W. (1982). The application of self-tuning control.
Trans. Inst. M.C., 5, 59-69.
relationship with LQ algorithms based on state- Clarke, D. W. and P. J. Gawthrop, (1975). Self-tuningcontroller.
space models. These results are summarized in Proc. IEE, 122, 929-934.
Table 1. It might seem that there are many possible Clarke, D. W. and P. J. Gawthrop, (1979). Self-tuning control.
Proc. IEE, 126, 633-640.
choices of design parameters in GPC, but the Clarke, D. W., P. P. Kanjilal and C. Mohtadi, (1985). A
table shows that many combinations lead to well- generalised LQG approach to self-tuning control. Int. J.
understood control laws. In practice not all this Control, 41, 1509-1544.
Gawthrop, P. J. (1977). Some interpretations of the self-tuning
flexibility would be required and many processes controller. Proc. lEE, 124, 889-894.
can be effectively controlled using default settings. Kwakernaak, H. and R. Sivan, (1972). Linear Optimal Control
Closer inspection of Table 1 shows that a "large" Systems. Wiley, New York.
Lam, K. P. (1980). Implicit and explicit self-tuning controllers.
value of N 2 is generally recommended and that D. Phil Thesis, Oxford University.
N U and P can then be chosen according to the Peterka, V. (1984). Predictor-based self-tuning control. Automa-
control philosophy appropriate for the plant and tica, 20, 39-50.
Tufts, P. S. 0984). Self-tuning control: algorithms and applica-
the computing power available. Hence the "knobs" tions. D. Phil. Thesis, Oxford University.
can be used to tailor an adaptive controller to Tufts, P. S. and D. W. Clarke, (1985). FAUST: a software
precise specifications, which is of great value in the package for self-tuning control. IEE Conf. "'Control 85",
Cambridge.
high-performance role. In particular, the method Wellstead, P. E., D. Prager, and P. Zanker, (1979). Pole
no longer needs to employ control weighting when assignment self-tuning regulator. Proc. IEE, 126, 781-787.
158 D.W. CLARKE et al.

ioo*/o 7 ~ - •

50 *A

O */*
0 800

Control signoL
too

• ' I t I I I , •

-io¢
800

FIG. 8. The control of a variable dead-time plant.

TABLE 1. SPECIAL CASESOF G P C SETTlNGS and, for i = 1 to max(fB, fTJ:

NU N~ N2 P 2 Plant Controller
?o+ 1~i = Yoi + e~bi + gj+ ltl (A.5)

1 1 10 1 0 s,d "Default"
where ?u~ denotes the ith coefficient of the polynomial F~
1 1 ---,oc 1 0 s,d "Mean-level"
associated with q - 1 Note that the coefficients of B or T with
N2 1 />k P 0 mp Exact model-
indices greater than their respective degrees are zero.
following P = I / M
<N 2 1 ~>k P 0,2 "Detuned" model-
following A P P E N D I X B. S O M E STABILITY RESULTS F O R
N2 1 - , oe 1 >0 s,d LQ infinite-stage L I M I T I N G CASES O F G P C
N 2 -- n + 1 1 -,o~ 1 0 s,d Cheap LQ Consider the plant given in shift-operator form by:
n n >~2n - 1 1 0 o,c State-dead-beat
n n />2n - 1 P 0 o,c Pole-assignment
A(q- l)Ay(t) = B(q - l)Au(t -- 1). (B.I)
2 s,d "Detuned" pole-
assignment
A state-space model of this plant can be written as:
s: stabilizable; d: detectable; o: observable; c: controllable; mp:
minimum-phase. x(t + 1) = Ax(t) + hAu(t) (B.2)

y(t) = erx(t) (B.3)

where A is the state transition matrix which we take to be in


observable canonical form, b is the vector of B parameters and
AA polynomials (Clarke et al., 1985). Since disturbances do not
A P P E N D I X A. R E C U R S I O N O F T H E P O L Y N O M I A L affect the stability properties of the controller, only deterministic
G'(q- l) elements are considered in the following section. Defining the
Consider the successive Diophantine identities: augmented polynomial ,~ to be:

Gj = G j T + q-JFj (A.1) ~(q t) = AA = 1 + t]lq I + ~2q-2 q_ ....4_ ti,q-"

Gj+ l = G'j+IT+ q - J - l F j ÷ r (A.2) then the matrices and vectors in the state-space form are:

Note that Gj = EjB, where: A = --al 1 0 ... 0

-a2 0 1 ... 0

Ej(q l ) = e o + e l q l +...+ei_lq-J+l. 0

Subtracting equations (A.I) from (A.2) we obtain:


-- t~ n

q-JejB = q - J g j + l T + q - J ( q - l F j + l - Fj). (A.3)


b = [bo, b l ..... b. 1] T
Hence the update equations become:
where ai = 0 for i > deg(~) and
gj÷ t = 1/to(ejbo + ?O~o) (A.4) b i ~ 0 for i > deg(B).
Generalized predictive control--Part II 159

The cost-function in the state-space formulation can be where 2 i are the eigenvalues and IAil < 1 for all i # ! and 2~ = 1
written as: and ql and ri are right and left eigenvectors associated with the
particular eigenvalue of A.
N2 The right and left eigenvectors associated with the eigenvalue
J = 3-'~ [x(t + i - 1)TQx(t + i - 1) at 1 are given by:

+ 2(t + i -- l)Au(t + i - 1)2] (B.4) qlr=[l,1 +fit, I +a I +'~2,...]


r~ = I-l, 1, 1, 1..... 1].
where Q = [1,0,0 ..... o]T[1,O,O ..... 0]. The set-point has been
omitted for simplicity because the stability properties are inde- Hence, the matrix
pendent of inputs.
The solution is obtained by iterating the equations below.
A = - - , qtr~ as m-~ oo.

Measurement update: Note, however, that the choice N U = 1 implies that there will
P*(i) = P(i + 1) - P(i + l)b(2(i) be N2 - 1 time-updates followed by a single full update at the
+ hrp(i + 1)h)-1hTp(i + 1). (B.5) last iteration, giving:

P(t + 1) = Q + ATQA + A2TQA 2 + AaTQA 3 + ....


Time update:
P(i) = Q + ArP*(i)A (B.6)
In the limit as N2 ~ ~ the matrix P(t + 1) will satisfy:
Au(t) = - ( 2 ( 0 + bTp(t)b) - thTP(t)Ax(t). (B.7)
P(t + l ) / m ~ q l r r Q q , r X a s m -~ o¢,
P is called the "covariance matrix" (from duality with the
i.e. P(t + 1 ) / m --* rlqtr[l,0 ..... 0]TEl,0, ... ,0]qlrtT
estimation equations). N2 iterations are performed backwards
starting from Q, the terminal covariance matrix. Note that since
or:
both the "one-shot" (GPC) method of cost minimization and
the dynamic p r o g r a m m i n g approach of state-space (LQ) succeed
in minimizing the same cost under certain conditions (i.e. linear P(t + 1)/m --* rlr~. (B.9)
plant and no constraints on the control signal), the resulting
control law must be the same because there is only one m i n i m u m This implies that:
and so their stability characteristics must be identical.
Note that fixing the projected control signal in the future is hXP(t + 1)h/m ~ (Zbl) 2
equivalent to employing a large penalty on the appropriate
increment (i.e. 2(0---, oo). This means that the measurement and finally substituting the asymptotic value of P(t + 1) from
update need not be performed for the particular value of i. Three (B.9) into (B.7) gives for 2 = 0:
special cases of G P C are considered below.
Au(t) = -(Ebb)-111, 1, 1..... 1Ix(t). (B.10)
Theorem 1. The closed-loop system is stable if the system (A, b, ¢)
is stabilizable and detectable and if: Recall the formulation is that of the observable canonical form
so that in a deterministic environment the states m a y be written
(i) N 2 --* or3, N U = N 2 and 2(0 > 0 o r
as follows:
(ii) N 2 --* ~ , NU --* oo where NU ~< N 2 - n + 1 and
x.(t) = -~.y(t - 1) + b . _ l A u ( t - 1)
2(0 = O.
x._ 1 = -tin_ ly(t - 1) - ti.y{t - 2)
P r o o f . Part (i) is easily proven from the stability conditions of
+ b._ tAu(t - 2) + b . _ 2 A u ( t - 1)
the state-space LQ controller. The cost of (B.4) tends to the
infinite stage cost and for convergence to the algebraic Riccati
equation (ARE) solution. Q can be positive semi-definite if 2 is
xdt ) = -8ly(t - 1) - ... + b o A u ( t - 1) + ... (B.I1)
positive definite for all terminal covariances (Kwakernaak and
Sivan, 1972). For part OiL note that for the first n - 1 iterations
and
of the Riccati equation only the time-updates (B.6) are necessary
(2(0 -~ ~). Note, moreover, that the terminal covariance Q is
~t) = xdt).
of rank one. Each of the time-updates increases the rank of
the matrix P by one and after n - 1 iterations the matrix
Combining (B.11) with the expression for the control signal
P(N2 - n + 1) is of full rank. It is seen that P(N2 - n + 1) is
(B.10) yields the parametric representation of the controller:
ZA~reerA~--the observability G r a m m i a n which is guaranteed
positive definite for the structure assumed. Then as N U ~ oo
(Y, b 3 A u ( t ) = - y ( t ) - (1 + f i t ) y ( t - 1) ... ( E b , - bo)
the iterations (B.5, B.6) converge to the ARE solution for all
values 2(0 = 2/> 0. Au(t - 1) - (Xbl - bo - bt)Au(t - 2) - ... (B.12)

R e m a r k 1. Part (ii) is a special case of the stabilizing controller or:


of Peterka (1984), using 2(0 = tos where 0 < ~os < oo.
G(q - l)Au(t) = - A ( q - 1)/B(I )y(t)
Remark 2. For 2 --* 0 the G P C laws above are equivalent to the
constrained minimum-variance regulator derived by spectral where G is the solution of the Diophantine identity
factorization for noise models of regression type.
G(q-1)A(q-')A + q-lA(q-l)B(q-t)/B(l) = A(q-l). (B.13)
Theorem 2. For open-loop stable processes the closed-loop is
stable and the control tends to a mean-level law for N U = 1 This can be verified simply by comparing the coefficients of G
and 2(i) = 0 as N~ - , oo. with those derived in (B.12). These are the equations of pole-
assignment placing the closed-loop poles at the open-loop
P r o o f . For simplicity assume that the matrix A has distinct positions. Hence for open-loop stable processes the closed-loop
eigenvalues and can therefore be written as: is stable as N 2 ~ oo, and because the closed-loop poles are
placed in exactly the same locations as those of the open-loop
A = ~.2iqir ~ (B.8) poles, this controller is a mean-level controller.
160 D.W. CLARKE et al.

Remark 1. A mean-level controller provides a step in control Clearly if the matrix G is of full rank then Au(t) is unique.
following a step in the set-point which will drive the plant
output exactly to the set-point and hence provide the same
closed-loop dynamics as of the open-loop. Note that steps in
load-disturbance are, however, rejected since the controller
includes an integrator.

Remark 2. The spectral decomposition of the matrix A shows


that a value of N2 equal to the settling time of the plant is
equivalent for control purposes to N2 - , oo. By assumption 1 the controller minimizing the cost above is
unique because G is full rank.
Remark 3. For cases where there are multiple eigenvalues the
As the system is assumed controllable there exists a unique
spectral factorization (B.8) is not valid and the Jordan canonical
feedback gain k such that ( A - bkr)"x(t)= 0, independent of
form must be employed. However, because all plant eigenvalues x(t). Such a controller is the state-dead-beat controller and is
are assumed to lie inside the unit circle the value of A" as m -+ stable, independent of the pole-zero locations of the system.
oo remains the same as for the distinct eigenvalue case and the Note that the cost incurred by choosing k as the dead-
rest of the argument follows. beat controller is exactly zero (in the deterministic case)--the
Theorem 3. The closed-loop system is equivalent to a stable minimum achievable cost. Recall however, that the controller
state-dead-beat controller if minimizing the cost was shown to be unique. Therefore, the
controller derived from iterations of the Riccati equation or
(1) the system (A, h,e) is observable and controllable and solving the set of simultaneous equations is the dead-beat
controller and hence stable. Note that since the cost incurred is
(2) N 1 = n, N 2 >i 2n - 1, N U = n and 2 = 0, where n is the
zero, increasing the horizon N 2 does not affect the minimum
number of states of the plant.
nor the solution of the optimization problem.
Proof. Initially the case of N 2 = 2n - I is considered. The cost
minimized by the- controller proposed above is Remark 1. Note that the state-dead-beat control is equivalent
N2 + t to placing all of the closed-loop poles at the origin.
d = j~=t x(i)rQ(i)x(i) (B.14)
Remark 2. As seen in the assumption (1), the condition of
where Q ( i ) = 0 for i < t + N 1 and Q ( i ) = e c T for i > / t + N v minimal realization is necessary for the control calculation and
The Kalman control gains can therefore be calculated using the therefore the choice of horizons above must in practice be used
iterations (B.5, B.6, B.7). For the first n - 1 iterations of the with caution. This is the same condition required when solving
Riccati equation only (B.6) is used with Q = cC, hence obtaining the Diophantine identity for a pole-assignment controller. Note
the observability Grammian in Theorem (1). For the next n that 2 may be used to improve the condition of the G matrix.
iterations (B.5, B.6) are used employing Q = 0. Note that these
iterations yield a unique Kalman control gain with
Remark 3. If instead of (B.1) the augmented plant:
P(t + N2) = cc T.
From the predictive point of view, calculation of the control
signal Au(t) is tantamount to solving the set of simultaneous A(q- ')AP(q- 1)y(t) = B(q ')AP(q- l)u(t - I)
equations:

is considered and dead-beat control is performed on ip(t) = P~(t)


g, g, - 1 ... g1 instead of ~(t), the closed-loop poles will be located at the zeros
of P(q- 1). This is because the closed-loop poles of the augmented
plant (w --, ~(t)) are all at the origin and therefore the closed-
g2. 2 g2.-1 ..- g. 1 loop poles of the actual plant are at the zeros of P(q- t). The
predictions fit + j) can be obtained from (6) replacing ¢/(t) for
3~t) and the G-parameters from the algorithm given in Appendix
Au(t+l) = w-J(t+n+l) .
A. The signal PAu(t) and subsequently u(t) are calculated from
the equations given above. This is then equivalent to the
Au(t + n - I) [ w -J~t + 2n - I)] standard pole-placement controller.

You might also like