Definition
These are common discrete probability distributions that appear frequently in randomized algorithms and probability.
Bernoulli distribution
A random variable has a Bernoulli distribution with parameter , written
if it has only two outcomes:
- success with probability
- failure with probability
Usually,
So the PMF is
The expected value is
and the variance is
An indicator variable is Bernoulli-distributed.
Binomial distribution
A random variable has a Binomial distribution with parameters , written
if counts the number of successes in independent Bernoulli trials with success probability .
Then
Reason:
- choose which trials are successful: possibilities
- each such pattern has probability
The expected value is
and the variance is
You can think of a binomial variable as a sum of independent Bernoulli variables:
Poisson distribution
A random variable has a Poisson distribution with parameter , written
if
Its expected value and variance are both
Balls and bins view
Throw balls independently and uniformly at random into bins, and let be the number of balls in one fixed bin. Then
so
As , this binomial distribution converges to
More generally, if
then for fixed and large ,
So the Poisson distribution is a good model for the number of rare, independent events in a fixed time or space interval.
Geometric distribution
A random variable has a Geometric distribution with parameter , written
if is the number of independent Bernoulli trials until the first success.
Here the support is
To have the first success on trial , we need:
- failures first
- then one success
So
The expected value is
and the variance is
Important
The geometric distribution is memoryless:
So after a long wait without success, the remaining waiting time has the same distribution as at the start.
Negative binomial distribution
A random variable has a Negative Binomial distribution with parameters if is the number of independent Bernoulli trials until the -th success.
This generalizes the geometric distribution, which is the special case .
The support is
To have the -th success on trial :
- the -th trial must be a success
- among the first trials, exactly must be successes
Therefore
The expected value is
and the variance is
Relationship
- Bernoulli: one success/failure experiment
- Binomial: number of successes in Bernoulli trials
- Poisson: approximation for rare-event counts
- Geometric: waiting time until the first success
- Negative Binomial: waiting time until the -th success