MTH-102复习2
随机变量上半部分,包括基础内容和离散随机变量
Lec5
Random variables
Definition
Given a random experiment with a sample space S, a function X that assigns one and only one real number X(s) = x to each element s in S is called a random variable. The space of X is the set of real numbers
wheremeans that the element s belongs to the set S
随机变量X将S样本空间中的抽象的概念用数字表现出来。Probability mass function (pmf)
Let X be a discrete random variable andbe the values that X can take on. The pmf p(x) is a function that satisfies the following properties:
1.
2.
3., for any event A.
pmf指概率质量函数。在概率论中,概率质量函数是离散随机变量在各特定取值上的概率
例如,投掷一枚硬币,令:那么,其PMF为 Cumulative distribution function (cdf)
- We call the function defined by
the cumulative distribution function and abbreviate it as cdf - Let
be the pmf of the random variable , and be the values that can take on. Then for any - For any
, is a nondecreasing function, i.e. for .- 同样是上面的硬币问题,其cdf为
Properties of cdf
- The cdf
is right continuous, i.e. for any where is the right limit of at - For any
, where is the left limit of at - For
,
- The cdf
- We call the function defined by
Mean and variance
Definition of mean
If is a discrete random variable having a probability mass function , then the mean, or the expectation, of , denoted by , is defined by iIf the values that can take on are , then In words, the mean of is a weighted average of the possible values that can take on, each value being weighted by the probability that assumes it.Expectation of a function of a random variable
If X is a discrete random variable that takes on the values , with the pmf p(x), then for any real-valued function Remark is a actually a new random variable. However, to compute the mean of , i.e. , it is not necessary to find the distribution of once the distribution of is given.Definition of variance
The variance of a random variable , denoted by , is defined as the mean of the function of with , i.e. In the practice, it is more convenient to compute the variance vir the following equivalent formula- The variance is always nonnegative
- The square root of
, i.e. , is called the standard deviation of
Properties of mean and variance
Let be a random variable, , and be constants. Consider as a linear function of . Then can be proved using the above properties- Let
be random variables. The mean of a sum of random variables equals the sum of the mean of each random variable, i.e. Moreover, let be constant. Then
Lec6
离散随机变量(Discrete random)指的是一些值取得不连续的随机变量,上述所有的定义和概念都建立在随机变量是离散随机变量的情况下。
下面会介绍一些简单的离散随机变量
Bernoulli distribution
- A Bernoulli experiment is a random experiment with two outcomes, modeled with the sample space
- Let
be a random variable associated with a Bernoulli experiment with for some . The pmf of can be written as - We say that
has a Bernoulli distribution, and
- A Bernoulli experiment is a random experiment with two outcomes, modeled with the sample space
Binomial distribution
- A Bernoulli experiment is performed
times independently, and let the random variable be the number of times when the outcome is 1 in the trials - The sample space of
is - The pmf of
is - X is said to have a binomial distribution, which is denoted by the symbol
. The constants and are called the parameters of the binomial distribution - A Bernoulli distribution is just a binomial distribution with parameters
- If a random variable
has a binomial distribution with parameters , then The mean and variance can be computed in two ways- Direct computation with series:
- For
, let be the outcome of the i-th trial. Then has a Bernoulli distribution ( ) and Therefore,
- Direct computation with series:
- A Bernoulli experiment is performed
Geometric distribution
- Motivation: we are interested in the number of independent trials needed in order to get the first success. The probability of success in each trial is constantly
- The sample space
- A random variable
is said to have a geometric distribution if the pmf of is defined by where - The cdf of a geometric random variable
is - The mean of
is - The variance of
is
- Motivation: we are interested in the number of independent trials needed in order to get the first success. The probability of success in each trial is constantly
Poisson distribution
- The pmf
where - The mean of
is - The variance of
is 通常为数据的均值- 泊松分布是由二项分布推导得来。但二项分布的p很小时,两者比较接近
- The pmf