Channel Capacity:
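Consider a discrete memoryless channel (DMC) having an input alphabet X = {x_0, x_1, ..., x_{q-1}} and an output alphabet Y = {y_0, y_1, ..., y_{Q-1}}. Let us denote the set of channel transition probabilities by p_{ji} = P(y_i | x_j). The average mutual information provided by the output Y about the input X is given by

$$I(X; Y) = \sum_{j=0}^{q-1} \sum_{i=0}^{Q-1} P(x_j)\, P(y_i \mid x_j) \log \frac{P(y_i \mid x_j)}{P(y_i)}$$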

The maximum value of the average mutual information I(X; Y) over the set of input symbol probabilities P(x_j) is a quantity that depends only on the channel transition probabilities P(y_i | x_j). This quantity is called the capacity of the channel, C:

$$C = \max_{P(x_j)} I(X; Y)$$
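
To make the maximization concrete, here is a minimal Python sketch. It computes I(X; Y) in bits from the input probabilities and the transition probabilities, then approximates the capacity of a two-input channel by a simple grid search over P(x_0). The function names, the grid-search approach, and the example channel (a binary symmetric channel with crossover probability 0.1) are illustrative assumptions and do not come from the text above.

```python
import math

def mutual_information(p_x, p_y_given_x):
    """Average mutual information I(X; Y) in bits.

    p_x[j]            : P(x_j), input symbol probabilities
    p_y_given_x[j][i] : P(y_i | x_j), channel transition probabilities
    """
    n_out = len(p_y_given_x[0])
    # Output probabilities: P(y_i) = sum_j P(x_j) P(y_i | x_j)
    p_y = [sum(p_x[j] * p_y_given_x[j][i] for j in range(len(p_x)))
           for i in range(n_out)]
    total = 0.0
    for j, pxj in enumerate(p_x):
        for i in range(n_out):
            pji = p_y_given_x[j][i]
            if pxj > 0 and pji > 0:  # zero-probability terms contribute 0
                total += pxj * pji * math.log2(pji / p_y[i])
    return total

def capacity_binary_input(p_y_given_x, steps=1000):
    """Approximate C = max over P(x) of I(X; Y) for a two-input channel.

    Grid search over a = P(x_0); a sketch only, not a general-purpose solver.
    """
    best_c, best_a = 0.0, 0.0
    for k in range(steps + 1):
        a = k / steps
        c = mutual_information([a, 1 - a], p_y_given_x)
        if c > best_c:
            best_c, best_a = c, a
    return best_c, best_a

# Assumed example channel: binary symmetric, crossover probability 0.1
bsc = [[0.9, 0.1],
       [0.1, 0.9]]
c, a = capacity_binary_input(bsc)
print(f"C is about {c:.4f} bits per channel use at P(x_0) = {a:.2f}")
```

For this symmetric example the maximum is reached with equiprobable inputs. The transition matrix alone determines the result, which is the sense in which capacity depends only on P(y_i | x_j).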

The unit of channel capacity is bits per channel use when the logarithm is taken to base 2 (and nats per channel use when the natural logarithm is used).
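
As a small illustration of how the choice of logarithm base sets the unit, the sketch below evaluates the capacity of a noiseless binary channel, which equals log(2) in whatever base is chosen. The noiseless channel and the function name are assumptions made for this example only.

```python
import math

def noiseless_binary_capacity(log_base):
    """Capacity of a noiseless binary channel: log(2) in the given base.

    With equiprobable inputs, I(X; Y) = H(X) = log(2) for this channel.
    """
    return math.log(2, log_base)

c_bits = noiseless_binary_capacity(2)       # base 2 -> bits per channel use
c_nats = noiseless_binary_capacity(math.e)  # base e -> nats per channel use
print(c_bits, c_nats)
# Both values describe the same capacity: c_nats = c_bits * ln(2)
```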
Statlearner