The separation principle is one of the fundamental principles of stochastic control theory, which states that the problems of optimal control and state estimation can be decoupled under certain conditions. In its most basic formulation it deals with a linear stochastic system
with a state process x {\displaystyle x} , an output process y {\displaystyle y} and a control u {\displaystyle u} , where w {\displaystyle w} is a vector-valued Wiener process, x ( 0 ) {\displaystyle x(0)} is a zero-mean Gaussian random vector independent of w {\displaystyle w} , y ( 0 ) = 0 {\displaystyle y(0)=0} , and A {\displaystyle A} , B 1 {\displaystyle B_{1}} , B 2 {\displaystyle B_{2}} , C {\displaystyle C} , D {\displaystyle D} are matrix-valued functions which generally are taken to be continuous of bounded variation. Moreover, D D ′ {\displaystyle DD'} is nonsingular on some interval [ 0 , T ] {\displaystyle [0,T]} . The problem is to design an output feedback law π : y ↦ u {\displaystyle \pi :\,y\mapsto u} which maps the observed process y {\displaystyle y} to the control input u {\displaystyle u} in a nonanticipatory manner so as to minimize the functional
where E {\displaystyle \mathbb {E} } denotes expected value, prime ( ′ {\displaystyle '} ) denotes transpose. and Q {\displaystyle Q} and R {\displaystyle R} are continuous matrix functions of bounded variation, Q ( t ) {\displaystyle Q(t)} is positive semi-definite and R ( t ) {\displaystyle R(t)} is positive definite for all t {\displaystyle t} . Under suitable conditions, which need to be properly stated, the optimal policy π {\displaystyle \pi } can be chosen in the form
where x ^ ( t ) {\displaystyle {\hat {x}}(t)} is the linear least-squares estimate of the state vector x ( t ) {\displaystyle x(t)} obtained from the Kalman filter
where K {\displaystyle K} is the gain of the optimal linear-quadratic regulator obtained by taking B 2 = D = 0 {\displaystyle B_{2}=D=0} and x ( 0 ) {\displaystyle x(0)} deterministic, and where L {\displaystyle L} is the Kalman gain. There is also a non-Gaussian version of this problem (to be discussed below) where the Wiener process w {\displaystyle w} is replaced by a more general square-integrable martingale with possible jumps. In this case, the Kalman filter needs to be replaced by a nonlinear filter providing an estimate of the (strict sense) conditional mean
where
is the filtration generated by the output process; i.e., the family of increasing sigma fields representing the data as it is produced.
In the early literature on the separation principle it was common to allow as admissible controls u {\displaystyle u} all processes that are adapted to the filtration { Y t , 0 ≤ t ≤ T } {\displaystyle \{{\cal {Y}}_{t},\,0\leq t\leq T\}} . This is equivalent to allowing all non-anticipatory Borel functions as feedback laws, which raises the question of existence of a unique solution to the equations of the feedback loop. Moreover, one needs to exclude the possibility that a nonlinear controller extracts more information from the data than what is possible with a linear control law.