Communication requires at least three components.
A
transmitter sends a signal over a communications
channel to a
receiver
that collects the signal for further use.
The signal consists of a series of symbols,
which convey some average amount of information, R,
measured in bits per symbol [Shannon, 1948].
We follow Shannon and other early workers
[Shannon, 1948,Brillouin, 1951,Tribus & McIrvine, 1971,Rothstein, 1951]
and take this to be
the uncertainty of the receiver before receiving
symbols minus the uncertainty after reception:
To find the maximum information from equation (43),
the symbols appearing at the receiver must be equally likely,
(so that
)
and every symbol must be exactly identified (no uncertainty left
after reception,
Hafter = 0).
Under these circumstances
the information is
.
If there is any noise (
),
or the symbols are not equally likely
(
Hbefore < Hequal)
then this simple formula must not be used
and
R < Rmaximum.
If the symbols are sent at a rate of Wssymbols per second, then the channel carries WsR bits per second.
Shannon defined the ``channel capacity'' of a communications system
and showed that it is:
Shannon proved a remarkable theorem about the channel capacity.
One part of the theorem says that
we cannot send information at a rate
faster than the channel capacity.
If we try to do this (i.e., WsR > C),
a quantity of noise will be received
that limits the rate to C.
The other part of the theorem is surprising:
if we transmit at
any rate less than or equal to the channel capacity (
),
then the transmission is possible
with as low an error rate as we may desire.
There is a price to be paid to get a low error rate: we must carefully encode the signal before transmission and then carefully decode it afterward. Although both steps require a delay, the overall transmission rate can approach C. Unfortunately the derivation of (45) and the proof of the theorem do not tell us how to make codes which allow transmission at rates close to C. Nevertheless, the formula is useful for understanding and designing communication systems, and methods have been found for creating ``good'' codes [Gilbert, 1966,Hamming, 1986,Sourlas, 1989].