Definition

These are common discrete probability distributions that appear frequently in randomized algorithms and probability.

Bernoulli distribution

A random variable has a Bernoulli distribution with parameter , written

if it has only two outcomes:

  • success with probability
  • failure with probability

Usually,

So the PMF is

The expected value is

and the variance is

An indicator variable is Bernoulli-distributed.

Binomial distribution

A random variable has a Binomial distribution with parameters , written

if counts the number of successes in independent Bernoulli trials with success probability .

Then

Reason:

  • choose which trials are successful: possibilities
  • each such pattern has probability

The expected value is

and the variance is

You can think of a binomial variable as a sum of independent Bernoulli variables:

Poisson distribution

A random variable has a Poisson distribution with parameter , written

if

Its expected value and variance are both

Balls and bins view

Throw balls independently and uniformly at random into bins, and let be the number of balls in one fixed bin. Then

so

As , this binomial distribution converges to

More generally, if

then for fixed and large ,

So the Poisson distribution is a good model for the number of rare, independent events in a fixed time or space interval.

Geometric distribution

A random variable has a Geometric distribution with parameter , written

if is the number of independent Bernoulli trials until the first success.

Here the support is

To have the first success on trial , we need:

  • failures first
  • then one success

So

The expected value is

and the variance is

Important

The geometric distribution is memoryless:

So after a long wait without success, the remaining waiting time has the same distribution as at the start.

Negative binomial distribution

A random variable has a Negative Binomial distribution with parameters if is the number of independent Bernoulli trials until the -th success.

This generalizes the geometric distribution, which is the special case .

The support is

To have the -th success on trial :

  • the -th trial must be a success
  • among the first trials, exactly must be successes

Therefore

The expected value is

and the variance is

Relationship

  • Bernoulli: one success/failure experiment
  • Binomial: number of successes in Bernoulli trials
  • Poisson: approximation for rare-event counts
  • Geometric: waiting time until the first success
  • Negative Binomial: waiting time until the -th success