where means that the element s belongs to the set S 随机变量X将S样本空间中的抽象的概念用数字表现出来。
Probability mass function (pmf)
Let X be a discrete random variable and be the values that X can take on. The pmf p(x) is a function that satisfies the following properties: 1. 2. 3. , for any event A. pmf指概率质量函数。在概率论中,概率质量函数是离散随机变量在各特定取值上的概率 例如,投掷一枚硬币,令:那么,其PMF为
Cumulative distribution function (cdf)
We call the function defined bythe cumulative distribution function and abbreviate it as cdf
Let be the pmf of the random variable , and be the values that can take on. Then for any
For any ,
is a nondecreasing function, i.e. for .
同样是上面的硬币问题,其cdf为
Properties of cdf
The cdf is right continuous, i.e. for any where is the right limit of at
For any ,where is the left limit of at
For ,
Mean and variance
Definition of mean
If is a discrete random variable having a probability mass function , then the mean, or the expectation, of , denoted by , is defined byiIf the values that can take on are , thenIn words, the mean of is a weighted average of the possible values that can take on, each value being weighted by the probability that assumes it.
Expectation of a function of a random variable
If X is a discrete random variable that takes on the values , with the pmf p(x), then for any real-valued function Remarkis a actually a new random variable. However, to compute the mean of , i.e. , it is not necessary to find the distribution of once the distribution of is given.
Definition of variance
The variance of a random variable , denoted by , is defined as the mean of the function of with , i.e. In the practice, it is more convenient to compute the variance vir the following equivalent formula
The variance is always nonnegative
The square root of , i.e. , is called the standard deviation of
Properties of mean and variance
Let be a random variable, , and be constants. Consider as a linear function of . Then
The formula can be proved using the above properties
Let be random variables. The mean of a sum of random variables equals the sum of the mean of each random variable, i.e.Moreover, let be constant. Then
A Bernoulli experiment is a random experiment with two outcomes, modeled with the sample space
Let be a random variable associated with a Bernoulli experiment with for some . The pmf of can be written as
We say that has a Bernoulli distribution, and
Binomial distribution
A Bernoulli experiment is performed times independently, and let the random variable be the number of times when the outcome is 1 in the trials
The sample space of is
The pmf of is
X is said to have a binomial distribution, which is denoted by the symbol . The constants and are called the parameters of the binomial distribution
A Bernoulli distribution is just a binomial distribution with parameters
If a random variable has a binomial distribution with parameters , then The mean and variance can be computed in two ways
Direct computation with series:
For , let be the outcome of the i-th trial. Then has a Bernoulli distribution () and Therefore,
Geometric distribution
Motivation: we are interested in the number of independent trials needed in order to get the first success. The probability of success in each trial is constantly
The sample space
A random variable is said to have a geometric distribution if the pmf of is defined bywhere