• Sonuç bulunamadı

Optimal control for a class of partially observed bilinear stochastic systems

N/A
N/A
Protected

Academic year: 2021

Share "Optimal control for a class of partially observed bilinear stochastic systems"

Copied!
2
0
0

Yükleniyor.... (view fulltext now)

Tam metin

(1)

OPTIMAL CONTROL FOR A CLASS O F PARTIALLY OBSERVED BILINEAR STOCHASTIC SYSTEMS

Tayel E. Dabbous

Bilkent University, P.O. Box 8 06572saltepe Ankara, Turke;

Dep. of Elec. & Eletcronic En

ABSTRACT

An alternative formulation is resented for a class of partially observed bilinear stochastic controf problem which is described by three sets of stochastic differential equations: one for the s stem to be controlled one for the observer, and one for the controT pro- cess which is dr!ven by the observation process. With this formu- lation the stochastic control problem is converted to an equivalent deterihnistic identification problem of control gain matrices. Usin standard variation arguments, we obtain the necessary conditions

05

optimality on the basis of which the optimal control parameters can be determined.

I. 1MTR.ODUCTION

The Hamilton-Jacobi-Bellman (HJB) equation arising from the application of Bellman's principle of optimality as well as Ito's for- mula t o controlled stochastic systems has been the major tool for determining optimal control laws (cf. [3,8]). With this approach, one is required t o solve a nonlinear partial differential eaquation (of parabolic type) on the state space Rn. This has, so far, posed a major stumbling block in its application to en ineering problems. It seems almost impossible to avoid solving the 5JB-equation if one is interested in determinin optimal controls for nonlinear, bilinear or even linear problems wit% control constraints.

Recently, Teo, Ahmed and Fisher 111 and Ahmed [2] proposed a new formulation for stochastic control problem for partially ob- served linear systems governed by stochastic differential equations driven by point processes [I] or general martingales [2]. With this formulation the stochastic control problem can be converted into an equivalent (deterministic) identification problem the necessary conditions of which can be obtained by use of standard variation arguments.

In this paper we consider the optimal control roblem for a class of partially observed bilinear stochastic systems Liven by standard Wiener processes. Using similar control structure as that of [I] or [2], we convert the original stochastic control problem into an equiv- alent identification problem for control gain matrices. Further, using standard varitional arguments (cf. [4,5,6]), we derive the necessary conditions of optima& on the basis of which the required optimal control parameters can be determined.

11. PROBLEM STATEMENT Consider the following (bilinear) stochastic system

M W t ) = A(t)z(t)dt t B(t)du(t) t I ( 0 ) = I o , t E I 3 [O,T], u , ( t ) x ( t ) d W $ ( t ) , t=l (1) where A E Rnxn, B E R"", U , E P X " ; 1 5 i 5 M , and W G

{Wi;l

5

i

5 M )

is standard Wiener process with values in RM. The initial state IO is a random variable independent of W. The control process u ( t ) ; t

2

0 will be defined shortly. Along with the system (l), let the observed process be governed by

N d Y ( t ) = Hl(t)Z(t)dt

+

HZ(t)Y(t)dt i- CZi,(t)Y(t)dK(t), Y ( 0 ) = 0, (2) t E I , ,=l where H I E R m x n , Hz E R m x m , Z, E R m x m and V {V,;l

5 a

5 N } is an RN-valued standard Wiener process independent of zo and W . Assuming that all the random vectors and processes described above are defined on some complete probability space ( Q , B , p ) , we wish to design a control system having the structure

du(t) = Kl(t)Y(t)dt f K2(t)dy(t), t E I ,

u(0) = 0, (3)

where the control parameters K P { K l , X Z } are rhosen from some admissible class P C L , ( I , R V X m ) x L,(I,RPXm) so that a given objective functional is minimized. As indicated in [2], one may con- sider several possible objective functionals related to the observed process y .

CH2917-3/90/0000-1416$1.00

@

1990

IEEE

1416

Proceedings of the 291h Conference on Decision and Control Honolulu. Hawaii 9 December 1990

where c(t) E E { y ( t ) } , (.,.) denotes the scalar product in Rm, and Q is any continuous positive semidefinite symmetric matrix-valued function and Yd is the desired output.

In

this paper we shall only deal with the objective functional 51. The rest can be dealt with in a similar manner.

AssumDtions

(Al) all the matrix-valued functions A , B , u i , H l ,

H2,

and

Zi

are

(A2) the control parameter K {A'1,Kz} E P where P is a continuous.

compact convex subset of L , ( I , R r X m ) x L m ( I , Rrxm). 111. FORMULATION O F IDENTIFICATION PROBLEM Consider the following control problem

Find a K E P such that (SP1)

subject t o the constraints (1)-(3).

Here Q is a continuous positive semidefinite symmetric matrix valued function. In this section we shall show how the problem (SP1) can be converted into an equivalent (deterministic) identification problem. First using (3) in ( l ) , we obtain

d z ( t ) = ( A @ )

+

B ( t ) K z ( t ) H l ( t ) ) I ( t ) d t

+

( B ( t ) K i ( t )

+

B(t)Kz(t)H~(t))Y(t)dl

I(0) = IO, t E 1.

Defining

dynamics (2) can jointly be written as

=

(z,y)(, the system (5) together with the observation

M N

d t ( t ) = A(t, K)E(t)dt

+

Ci(d)E(t)dWi(t) t

C

Di(t, K ) t ( t ) d V i ( t ) i=l i = l

E(0) = Eo, t E I , (6)

where

For the solution of the problem (SPl), we shall need the following result.

Lemma 1

Consider the system (6) and suppose E0,W and V are statisti- cally independent. Then for each K-E 'P, the mean $(t) 5 E{E(t)}

and the covariance P ( t ) E{(E(t)-[(t))(E(t)-$(t))'}, t

2

0, satisfy the following system of differential equations

l ( 0 ) = $0 I (PO,O)', and

(2)

M

Remarkl

can be partitioned as

The proof follows from standard computations.

For each K E

P,

the covariance matrix P ( t ) L P ( t , K ) , t

2

0 ,

where

Pii(t)

9 E { ( z ( t )

-

Z ( t ) ) ( z ( t ) - Z(t)Y), P22(t) 3 E{(Y(t) - ixt))(Y(t) -

5(t))'L

PlZ(t) = PZl(t)

=

E { ( z ( t )

-

% ( t ) ) ( ? / ( t ) -

5W').

Using the result of Lemma 1, we can convert the problem (SPl) into equivalent (deterministic) identification problem. Let

A ( t ) 5 P ( t ) t t ( t ) ? ( t ) , t 2 0. (11) Then using (9) and (lo), one can easily verify that A satisfies the following (matrix) differential equation

M

d

- A ( t ) = d(t, K ) A ( t ) t A(t)A'(t, 9) t C C ; ( t ) A ( t ) C : ( t ) dt

i = l

Defining Q =

(:

:)

,

it follows that

(13) Using (13) it is clear that the problem (SPl) can be restated as follows

(DP1) Find a K E P such that

J I ( K ) 3 l t r ( & ( t ) A ( t ) ) d t = min.

subject to the dynamic constraint (12).

We close this section by indicating that one can show, using similar arguments as those of Ahmed [2-61, that the problem (DP1) has a solution.

s

In this section we utilize standard variation arguments to de- rive the necessary conditions of optimality for the problem (DPl). Let K" E

P

be the optimal parameter for the problem (DP1) and let A o ( t ) 5 A ( t , K o ) , t 2 0, be the solution of (12) corresponding to K O . Suppose the parameter set P is convex and let x ( t ) i ( t , K O ; K

-

K O ) , t

2

0, denote the Gateaux differential of A at K O in the direction (K

-

KO). For the derivation of the necessary conditions of optimality we shall need the following result the proof of which follows from standard computations.

LemmaS

Consider the problem (DP1) and suppose that the parameter set

P

is convex. Then for each pair K, KO 6

P,

the Gateaux differential of .TI, at KO in the direction (K

-

KO), exists and is given by

J ; ( K O ; K - K O ) 3 / t r ( Q ( t ) x ( t ) ) d t

L

0. (14) denotes the Gateaux differential of A satisfying

I Here M $ x ( t ) = d(t, K o ) x ( t ) t x(t)d'(t, KO) t C;(t)i(t)C;(t) N i = l t x D i ( t , K%(t)D:(t, KO) t z ( t ) A o ( t ) t h " ( t ) X ( t ) i = l N

t C[Di(t, Ko)Ao(t)Ei(t) t E;(t)A"(t)D;(t, K O ) ] (15) i = l

X(o)

= 0, t E I,

where

3

and

Ei

denote the Gateaux differential of A and

D;.

necessary conditions of optimality. With the help of the above Lemma, we present the following Theorem

3

Consider the problem (DP1) and suppose Lemma 2 hold. Then the optimal parameter KO E p can be determined by the simulta- neous solution of the differential equation

Proof

In order that J1, as defined by (13), attains its minimum

2

K O E p , it is necessary that (14) holds for all K E P, where A denotes the Gateaux differential of A as defined by Lemma 2. The inequality (14) can be further simplified by introducing the adjoint variable I'o which is the solution of the (matrix) differential equation (17). Using (14), (15) and (17), one can easily verify that

l t r ( Q ( t ) x ( t ) ) d t = 2Jtr(r0(t)&t)A0(t) N

t rO(t)D;(t, K o ) A o ( t ) E ; ( t ) ) d t .

i = l

Now the inequality (18) follows from (14) and (19). Remark 2

optimal parameter KO using any of the algorithms proposed in [3 or

71. V. CONCLUSION

aper we have considered the o timal control problem for a class ofpartially observed bilinear stocgastic systems. Assum- ing that the control process is governed by a (stochastic) differential equation, driven by the output process, with unknown arameters, we have converted the original stochastic control probfem into an equivalent identification problem for control parameters. Using stan- dard variational arguments, we derived the corresponding necessary conditions of optimality on the basis of which optimal parameters and hence optimal control can be determined.

REFERENCES

[I]

Teo, K.L., Ahmed, N.U., and Fisher, M.E., "Optimal Feedback Control for Linear Stochastic Systems Driven by Counting Pro- cesses", to appear in J. of Engineering Optimization. [2] Ahmed, N.U., "Computer Aided Design of Optimal Feedback

Controls for Partially Observed Stochastic S stems Driven b a

Martingale Process" 14th IFIP on System bodelling and d p - timization. Leipzig, h l y , 1989.

[3] Ahmed, N.U., "Elements of Finite Dimmsional Systems and Control Theory", Longman Scientific and Technical, Essex, New York. 1988.

[4] Ahmed, N.U.,"Optimization, Identification of system Governed b Evolution E uations on Banach Space" Pitman Research dotes in Math. ?ki Vol. 184, Longman SciLntific & Technical Essex, England, C'dpublished with John Wiley & Sons, Ned York, 1988.

[5] Ahmed, N.U, "Identification of Linear Operators in Differential Equations on Banach Space", in Operator Methods for Opti- mal Control Problems (ed. S.J. Lee), Lect. Notes in Pure and Applied Math., Vol. 108, P. 1-29, Marcel Dekker, 1988. [6] Ahmed, N.U.,"Identification of Operators in Systems Goverened

by Evolution Equations on Banach Space", in Control of Partial Differential Equations (ed. A. Berudez), Lect. Notes in Control and Information Sciences, Vol. 114, P. 73-83, 1989.

[7] ?'eo, K.L. and Wu, Z.S., "Computational Methods for Optimiz- ing Distributed Systems", Appendix V, Academic Press, 1984. 181 WO%, H.W. and Ahmed, N.U., "Optimal Precession Control of Randomly Perturbed S in Stabilized Satellite", IEEE naris. on Aerospace & Electr. lystems, Vol. AES-13, No.2, p. 172- 178, 1977.

Based on the above necessary conditions one can compute t h e ,

In this

Referanslar

Benzer Belgeler

But now that power has largely passed into the hands of the people at large through democratic forms of government, the danger is that the majority denies liberty to

Compared to the jagged partitioning (JP) model, the proposed hypergraph par- titioning (HP) model results in both better load balancing in the rendering phase and less total volume

When Said (1979) asserts “the Orient is Orientalized”, the East and the Eastern subject is produced within this discourse, because it is the West that have the competence in

Our approach was to approximate system dynamics around the limit-cycle as a linear time-periodic (LTP) sys- tem, enabling us to address the input–output system iden- tification

The patriarchal berâts should be considered as documents that not only secured the rights of the patriarchs vis-à-vis the Ottoman state or Ottoman officers, but also as

Thus, limiting the scope of research to those names who witnessed the Great War – and commented on it- without diverging from the Greenian version of British Idealist

In this work, we research the exact and numerical solutions of such a FOCP that system dynamics are defined by a space–time fractional differential equation in terms of the Caputo and

Bu sonuçlar, Aşar’ın (2006) “Avrupa Birliğine Tam Üyelik Sürecinde Avrupa Birliği Eğitim ve Gençlik Programlarının İncelenmesi ve Türk Eğitim Sisteminin