The previous section derived an EM algorithm for the case of deterministic
channel coefficients, using an impulse function for
. In this section
we analyze a similar situation, where no information about
is available,
but here we use a uniform distribution for
. Although impulse and uniform
distributions for
do not seem similar, they are similar in this
application because the location of the impulse (mean of
) is unknown in
the previous section.
If
is uniform over some finite interval, then computation of
and
could be performed numerically using
from (12).
But if we let the domain of
be infinite, then we can compute
and
analytically. In this case
is improper,
since it would have to be zero for the area under it to be 1. Nevertheless,
assuming that
is uniform and of infinite extent is useful, since it is
practically equivalent to letting
be uniform over some range of several
standard deviations for which the exponential term in (12) is significant.
Outside of some finite region, the exponential term in (12) is practically zero.
So we propose that letting
in (12) is reasonable,
which reduces (12) to:
To put this into a more useful form,
expand the norm-squared term from (24):
Now, if we let
,
and if
has full rank so that
,
then (25) reduces to:
If does not have full rank then some simplification of (25) into a form
similar to (26) may be performed, but that is not pursued here.
In the case of data values which do not contain 0's, e.g. (-1,1) data values,
will always
have full rank if aj = 0 for
,
due to the 0's in the first L-1 rows which make the columns of
linearly independent.
Therefore, for the case of uniformly distributed channel coefficients, the
EM algorithm is almost the same as for deterministic channel coefficients,
except that the Viterbi algorithm incremental cost function from (18),
which includes , is used instead of (23), and
is given by (28).
To initialize , we can initialize
and
, then use the Viterbi algorithm to
estimate
, then set
to this estimate of
.
The initial estimate of
depends on the set of transmitted sequence values and initial channel
conditions. For example, in the case of (-1,1) transmitted data values,
with aj = 0 for
,
diag
may be used, based on just the diagonal elements of
.