7-1 random variables and discrete probability distributions chapter 7

92
7-1 Random Variables and Discrete probability Distributions Chapter 7

Post on 20-Dec-2015

242 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: 7-1 Random Variables and Discrete probability Distributions Chapter 7

7-1

Random Variables and Discrete probability Distributions

Random Variables and Discrete probability Distributions

Chapter 7

Page 2: 7-1 Random Variables and Discrete probability Distributions Chapter 7

7-22007會計資訊系統計學 ( 一 )上課投影片

Random Variables A random variable(隨機變數) is a function or

rule that assigns a number to each outcome of an experiment.

Alternatively, the value of a random variable is a numerical event.

A random variable reflects the aspect of a random experiment that is of interest for us.

Instead of talking about the coin flipping event as {heads, tails} think of it as

“the number of heads when flipping a coin” {1, 0} (numerical events)

Page 3: 7-1 Random Variables and Discrete probability Distributions Chapter 7

7-32007會計資訊系統計學 ( 一 )上課投影片

Two Types of Random Variables There are two types of random variables:

Discrete random variable Continuous random variable.

Discrete Random Variable(間斷、離散型隨機變數)– one that takes on a countable number of values– E.g. values on the roll of dice: 2, 3, 4, …, 12

Continuous Random Variable (連續型隨機變數)– one whose values are not discrete, not countable– E.g. time (30.1 minutes? 30.10000001 minutes?)

Analogy: Integers are Discrete, while Real Numbers are Continuous

Page 4: 7-1 Random Variables and Discrete probability Distributions Chapter 7

7-42007會計資訊系統計學 ( 一 )上課投影片

A random variable is discrete if it can assume a countable(可數) number of values.

A random variable is continuous if it can assume an uncountable (不可數) number of values.

0 11/21/41/16

Continuous random variableAfter the first value is definedthe second value, and any valuethereafter are known.

Therefore, the number of values is countable

After the first value is defined,any number can be the next one

Discrete random variable

Therefore, the number of values is uncountable

0 1 2 3 ...

Two Types of Random Variables

Page 5: 7-1 Random Variables and Discrete probability Distributions Chapter 7

7-52007會計資訊系統計學 ( 一 )上課投影片

Probability Distributions A probability distribution(機率分配) is a

table, formula, or graph that describes the values of a random variable and the probability associated with these values.

Since we’re describing a random variable (which can be discrete or continuous) we have two types of probability distributions:– Discrete Probability Distribution, (this chapter) and– Continuous Probability Distribution (Chapter 8)

Page 6: 7-1 Random Variables and Discrete probability Distributions Chapter 7

7-62007會計資訊系統計學 ( 一 )上課投影片

Probability Notation An upper-case letter will represent the name of the

random variable, usually X.

Its lower-case counterpart will represent the value of the random variable.

The probability that the random variable X will equal x is: P(X = x)

or more simply P(x)

Page 7: 7-1 Random Variables and Discrete probability Distributions Chapter 7

7-72007會計資訊系統計學 ( 一 )上課投影片

A table, formula, or graph that lists all possible values a discrete random variable can assume, together with associated probabilities, is called a discrete probability distribution.

To calculate the probability that the random variable X assumes the value x, P(X = x), add the probabilities of all the simple events for which X

is equal to x, or use probability calculation tools (tree diagram), apply probability definitions

Discrete Probability Distributions

Page 8: 7-1 Random Variables and Discrete probability Distributions Chapter 7

7-82007會計資訊系統計學 ( 一 )上課投影片

Requirements for a Discrete Distribution The probabilities of the values of a discrete random

variable may be derived by means of probability tools such as tree diagrams or by applying one of the definitions of probability, so long as these two conditions apply:

1)x(P 2.

x all for 1)x(P0 1.

x all

Page 9: 7-1 Random Variables and Discrete probability Distributions Chapter 7

7-92007會計資訊系統計學 ( 一 )上課投影片

In practice, probability distributions can be estimated from relative frequencies.

Example 7.1 A survey reveals the following frequencies (1,000s) for the

number of color TVs per household.1,218 ÷ 101,501 = 0.012

e.g. P(X=4) = P(4) = 7,714÷101,501 = 0.076 = 7.6%

Discrete Probability Distributions

Page 10: 7-1 Random Variables and Discrete probability Distributions Chapter 7

7-102007會計資訊系統計學 ( 一 )上課投影片

Example 7.1 The probability distribution can be used to calculate

the probability of different events Calculate the probability of the following events: P(The number of color TVs is 3) = P(X=3) =.191 P(The number of color TVs is two or more)= P(X2) = P(X=2)+P(X=3)+P(X=4)+P(X=5) =.374 +.191 +.076 +.028 = .669

Page 11: 7-1 Random Variables and Discrete probability Distributions Chapter 7

7-112007會計資訊系統計學 ( 一 )上課投影片

Example 7.1 E.g. what is the probability there is at least one television but

no more than three in any given household? P(The number of color TVs is at least one but no more than

three ) = P(≤X≤ = P(X=1)+P(X=2)+P(X=3) =.319 +.374 +.191 = .884

Page 12: 7-1 Random Variables and Discrete probability Distributions Chapter 7

7-122007會計資訊系統計學 ( 一 )上課投影片

Developing a Probability Distribution

Probability calculation techniques can be used to develop probability distributions.

ExampleFind the probability distribution of the random variable describing the number of heads that face-up when a coin is flipped twice.

Sample space x ProbabilityHH 2 1/4HT 1 1/4TH 1 1/4TT 0 1/4

x p(x)0 1/41 1/22 1/4

Page 13: 7-1 Random Variables and Discrete probability Distributions Chapter 7

7-132007會計資訊系統計學 ( 一 )上課投影片

Example 7.2 A mutual fund sales person knows that there is 20%

chance of closing a sale on each call she makes. What is the probability distribution of the number of

sales if she plans to call three customers?

Let S denote success, i.e. closing a sale P(S)=.20 Thus SC is not closing a sale, and P(SC)=.80 Let X denote the number of sales , X=0,1,2,3.

Page 14: 7-1 Random Variables and Discrete probability Distributions Chapter 7

7-142007會計資訊系統計學 ( 一 )上課投影片

Solution Use probability rules and trees Define event S = {A sale is made}.

P(S)=.2

P(SC)=.8P(S)=.2

P(S)=.2

P(S)=.2

P(S)=.2

P(SC)=.8

P(SC)=.8

P(SC)=.8

P(SC)=.8

S S S

S S SC

S SC S

S SC SC

SC S S

SC S SC

SC SC S

SC SC SC

P(S)=.2

P(SC)=.8

P(SC)=.8

P(S)=.2

X P(x)3 .23 = .0084 3(.032)=.0965 3(.128)=.3840 .83 = .512

(.2)(.2)(.8)= .032

Example 7.2

Sales Call 1

Sales Call 2

Sales Call 3

Page 15: 7-1 Random Variables and Discrete probability Distributions Chapter 7

7-152007會計資訊系統計學 ( 一 )上課投影片

Additional Example The number of cars a dealer is selling daily were recorded in the

last 100 days. This data was summarized in the table below. Estimate the probability distribution, and determine the probability

of selling more than 2 cars a day.

Daily sales Frequency0 51 152 353 254 20

100

Developing Probability Distribution and Finding the Probability of an Event

Page 16: 7-1 Random Variables and Discrete probability Distributions Chapter 7

7-162007會計資訊系統計學 ( 一 )上課投影片

0 1 2 3 4

Solution From the table of frequencies we can calculate the

relative frequencies, which becomes our estimated probability distribution

Daily sales Relative Frequency0 5/100=.051 15/100=.152 35/100=.353 25/100=.254 20/100=.20

1.00

.05

.15

.35.25

.20

X

P(X>2) = P(X=3) + P(X=4) =.25 + .20 = .45

Developing Probability Distribution and Finding the Probability of an Event

Page 17: 7-1 Random Variables and Discrete probability Distributions Chapter 7

7-172007會計資訊系統計學 ( 一 )上課投影片

Population/Probability Distribution The discrete probability distribution represents a

population(母體)• Example 7.1 the population of number of TVs per household• Example 7.2 the population of sales call outcomes

Since we have populations, we can describe them by computing various parameters(參數) .

E.g. the population mean and population variance.

Page 18: 7-1 Random Variables and Discrete probability Distributions Chapter 7

7-182007會計資訊系統計學 ( 一 )上課投影片

Population Mean (Expected Value) The population mean(母體平均數) is the

weighted average(加權平均) of all of its values. The weights are the probabilities.

This parameter is also called the expected value(期望值) of X and is represented by E(X) or µ.

Recall:

x all

)x(xP)X(E

N

XN

1ii

Page 19: 7-1 Random Variables and Discrete probability Distributions Chapter 7

7-192007會計資訊系統計學 ( 一 )上課投影片

Population Variance The population variance(母體變異數) is calculated

similarly. It is the weighted average of the squared deviations from the mean.

Recall:

The standard deviation(標準差) is the same as before: 2

2

x all

2

x all

22 )x(Px)x(P)x()X(V

N

)X(N

1i

2i

2

Page 20: 7-1 Random Variables and Discrete probability Distributions Chapter 7

7-202007會計資訊系統計學 ( 一 )上課投影片

Example 7.3 Find the mean, variance, and standard deviation for the

population of the number of color televisions per household (from Example 7.1)

= 2.084

)028(.5)076(.4)191(.3)374(.2)319(.1)012(.0

)5(P5)1(P1)0(P0)x(xP)X(Ex all

Page 21: 7-1 Random Variables and Discrete probability Distributions Chapter 7

7-212007會計資訊系統計學 ( 一 )上課投影片

Example 7.3 Find the mean, variance, and standard deviation for the

population of the number of color televisions per household (from Example 7.1)

= 1.107

)028(.)084.25()319(.)084.21()012(.)084.20(

)x(P)x()X(V

222

x all

22

Page 22: 7-1 Random Variables and Discrete probability Distributions Chapter 7

7-222007會計資訊系統計學 ( 一 )上課投影片

Example 7.3 Find the mean, variance, and standard deviation for the

population of the number of color televisions per household (from Example 7.1)

052.1107.12

Page 23: 7-1 Random Variables and Discrete probability Distributions Chapter 7

7-232007會計資訊系統計學 ( 一 )上課投影片

Laws of Expected Value E(c) = c E(X + c) = E(X) + c E(cX) = cE(X)

Laws of Variance V(c) = 0 V(X + c) = V(X) V(cX) = c2V(X)

Laws of Expected Value and Variance

where X is a random variable c is a constant

Page 24: 7-1 Random Variables and Discrete probability Distributions Chapter 7

7-242007會計資訊系統計學 ( 一 )上課投影片

Laws of Expected Value 1. E(c) = c

The expected value of a constant (c) is just the value of the constant.

2. E(X + c) = E(X) + c

3. E(cX) = cE(X) We can “pull” a constant out of the expected value

expression (either as part of a sum with a random variable X or as a coefficient of random variable X).

Page 25: 7-1 Random Variables and Discrete probability Distributions Chapter 7

7-252007會計資訊系統計學 ( 一 )上課投影片

Example 7.4 The monthly sales at a computer store have a mean of

$25,000 and a standard deviation of $4,000. Profits are 30% of the sales less fixed costs of $6,000.

Find the mean monthly profit.

Describe the problem statement in algebraic terms: sales have a mean of $25,000 E(Sales) = 25,000 profits are calculated by

Profit = .30(Sales) – 6,000

Page 26: 7-1 Random Variables and Discrete probability Distributions Chapter 7

7-262007會計資訊系統計學 ( 一 )上課投影片

Example 7.4 The monthly sales at a computer store have a mean of

$25,000 and a standard deviation of $4,000. Profits are 30% of the sales less fixed costs of $6,000.

Find the mean monthly profit.

E(Profit) =E[.30(Sales) – 6,000] =E[.30(Sales)] – 6,000[by rule #2. E(X+c)=E(X)+c] =.30E(Sales) – 6,000 [by rule #3. E(cX)=cE(X)] =.30(25,000) – 6,000 = 1,500

Thus, the mean monthly profit is $1,500

Page 27: 7-1 Random Variables and Discrete probability Distributions Chapter 7

7-272007會計資訊系統計學 ( 一 )上課投影片

Laws of Variance 1. V(c) = 0

The variance of a constant (c) is zero.

2. V(X + c) = V(X) The variance of a random variable and a constant is just the

variance of the random variable.

3. V(cX) = c2V(X) The variance of a random variable and a constant

coefficient is the coefficient squared times the variance of the random variable.

Page 28: 7-1 Random Variables and Discrete probability Distributions Chapter 7

7-282007會計資訊系統計學 ( 一 )上課投影片

Example 7.4 The monthly sales at a computer store have a mean of

$25,000 and a standard deviation of $4,000. Profits are 30% of the sales less fixed costs of $6,000.

Find the standard deviation of monthly profit.

Describe the problem statement in algebraic terms: sales have a standard deviation of $4,000

V(Sales) = 4,0002 = 16,000,000 (remember the relationship between standard deviation and

variance ) profits are calculated by… Profit = .30(Sales) – 6,000

2

Page 29: 7-1 Random Variables and Discrete probability Distributions Chapter 7

7-292007會計資訊系統計學 ( 一 )上課投影片

Example 7.4The monthly sales at a computer store have a mean of $25,000 and a standard deviation of $4,000. Profits are 30% of the sales less fixed costs of $6,000.

Find the standard deviation of monthly profit.

The variance of profit = V(Profit)=V[.30(Sales) – 6,000]=V[.30(Sales)] [by rule #2. V(X+c)=V(X)]=(.30)2V(Sales) [by rule #3. V(cX)=c2V(X)]=(.30)2(16,000,000) = 1,440,000

Again, standard deviation is the square root of variance,so standard deviation of Profit = (1,440,000)1/2 = $1,200

Page 30: 7-1 Random Variables and Discrete probability Distributions Chapter 7

7-302007會計資訊系統計學 ( 一 )上課投影片

Example 7.4 (summary) The monthly sales at a computer store have a mean of

$25,000 and a standard deviation of $4,000. Profits are 30% of the sales less fixed costs of $6,000. Find the mean and standard deviation of monthly

profits.

The mean monthly profit is $1,500

The standard deviation of monthly profit is $1,200

Page 31: 7-1 Random Variables and Discrete probability Distributions Chapter 7

7-312007會計資訊系統計學 ( 一 )上課投影片

V(X + c) = V(X)

V(cX) = c2V(X)

E(cX) = cE(X)

E(X + c) = E(X) + c

Laws of Expected Value and Variance Solution

V(Profit) = V(.30(Sales) – 6,000] = V[(.30)(Sales)] = (.30)2V(Sales) = 1,440,000

Profit = .30(Sales) – 6,000

E(Profit) = E[.30(Sales) – 6,000] = E[.30(Sales)] – 6,000 = .30E(Sales) – 6,000 = .(30)(25,000) – 6,000 = 1,500

= [1,440,000]1/2 = 1,200

Page 32: 7-1 Random Variables and Discrete probability Distributions Chapter 7

7-322007會計資訊系統計學 ( 一 )上課投影片

Additional Example The total number of cars to be sold next week

is described by the following probability distribution

Determine the expected value and standard deviation of X, the number of cars sold.

x 0 1 2 3 4p(x) .05 .15 .35 .25 .20

Laws of Expected Value and Variance

Page 33: 7-1 Random Variables and Discrete probability Distributions Chapter 7

7-332007會計資訊系統計學 ( 一 )上課投影片

40.2

)20.0(4)25.0(3)35.0(2)15.0(1)05.0(0

)x(xp)X(E4

0x

x 0 1 2 3 4p(x) .05 .15 .35 .25 .20

Laws of Expected Value and Variance

Solution

11.124.1

24.1)20(.)4.24()25(.)4.23(

)35(.)4.22()15(.)4.21()05(.)4.20(

)x(p)4.2x()X(V

22

222

4

0x

22

Page 34: 7-1 Random Variables and Discrete probability Distributions Chapter 7

7-342007會計資訊系統計學 ( 一 )上課投影片

Additional example - continued With the probability distribution of cars sold per week,

assume a salesman earns a fixed weekly wages of $150 plus $200 commission for each car sold.

What is his expected wages and the variance of the wages for the week?

Solution The weekly wages is Y = 200X + 150 E(Y) = E(200X+150) = 200E(X)+150= 200(2.4)+150=630$. V(Y) = V(200X+150) = 2002V(X) = 2002(1.24) = 49,600$2

Laws of Expected Value and Variance

Page 35: 7-1 Random Variables and Discrete probability Distributions Chapter 7

7-352007會計資訊系統計學 ( 一 )上課投影片

Up to now, we have looked at univariate distributions, i.e. probability distributions in one variable.

As you might guess, bivariate distributions(二元分配) are probabilities of combinations of two variables.

The bivariate (or joint) distribution is used when the relationship between two random variables is studied.

A joint probability distribution of X and Y is a table or formula that lists the joint probabilities for all pairs of values x and y, and is denoted P(x,y).

The probability that X assumes the value x, and Y assumes the value y is denoted

7.2 Bivariate Distributions

)yYxX(P)y,x(P

Page 36: 7-1 Random Variables and Discrete probability Distributions Chapter 7

7-362007會計資訊系統計學 ( 一 )上課投影片

Discrete Bivariate Distribution As you might expect, the requirements for a bivariate

distribution are similar to a univariate distribution, with only minor changes to the notation:

1)y,x(P 2.

y)(x, pairs all for 1)y,x(P0 1.

y allx all

Page 37: 7-1 Random Variables and Discrete probability Distributions Chapter 7

7-372007會計資訊系統計學 ( 一 )上課投影片

Example 7.5 Xavier and Yvette are real estate agents; let’s use X

and Y to denote the number of houses each sells in a month. The following joint probabilities are based on past sales performance:

We interpret these joint probabilities as before.E.g the probability that Xavier sells 0 houses and Yvette sells 1 house in the month is P(0, 1) = .21

Page 38: 7-1 Random Variables and Discrete probability Distributions Chapter 7

7-382007會計資訊系統計學 ( 一 )上課投影片

p(x,y)

Bivariate Distributions

X

YX=0 X=2X=1

y=1

y=2

y=0

0.42

0.120.21

0.070.06

0.02

0.06

0.03

0.01

Example 7.5 – continued

Page 39: 7-1 Random Variables and Discrete probability Distributions Chapter 7

7-392007會計資訊系統計學 ( 一 )上課投影片

Marginal Probabilities

Example 7.5 – continued Sum across rows and down columns

XY 0 1 2 p(y)0 .12 .42 .06 .601 .21 .06 .03 .302 .07 .02 .01 .10p(x) .40 .50 .10 1.00

p(0,0)p(0,1)p(0,2)

The marginal probability P(X=0)

P(Y=1),The marginal probability.

Page 40: 7-1 Random Variables and Discrete probability Distributions Chapter 7

7-402007會計資訊系統計學 ( 一 )上課投影片

Marginal Probabilities As before, we can calculate the marginal

probabilities by summing across rows and down columns to determine the probabilities of X and Y individually:

E.g the probability that Xavier sells 1 house = P(X=1) =0.40

Page 41: 7-1 Random Variables and Discrete probability Distributions Chapter 7

7-412007會計資訊系統計學 ( 一 )上課投影片

Describing the Bivariate Distribution The joint distribution can be described by the mean,

variance, and standard deviation of each variable. This is done using the marginal distributions.

same formulae as for univariate distributions

Page 42: 7-1 Random Variables and Discrete probability Distributions Chapter 7

7-422007會計資訊系統計學 ( 一 )上課投影片

Describing the Bivariate Distribution

To describe the relationship between the two variables we compute the covariance and the coefficient of correlation Covariance(共變數)

Coefficient of Correlation(相關係數)

To describe the relationship between the two variables we compute the covariance and the coefficient of correlation Covariance(共變數)

Coefficient of Correlation(相關係數)

YXy allx all

y allYX

x all

)y,x(xyP

)y,x(P)y)(x()Y,X(COV

YXXY

)Y,X(COV

Page 43: 7-1 Random Variables and Discrete probability Distributions Chapter 7

7-432007會計資訊系統計學 ( 一 )上課投影片

Example 7.6 Compute the covariance and the coefficient of

correlation between the numbers of houses sold by Xavier and Yvette.

There is a weak, negative relationship between the two variables.

15.0

)01)(.5.2)(7.2(

)42)(.5.0)(7.1()12)(.5.0)(7.0(

)y,x(P)y)(x()Y,X(COVy all

YXx all

35.0)67)(.64(.

15.0)Y,X(COV

YXXY

Page 44: 7-1 Random Variables and Discrete probability Distributions Chapter 7

7-442007會計資訊系統計學 ( 一 )上課投影片

1.30.

03.

)1Y(P

)1Y2X(P)1Y|2X(P

2.30.

06.

)1Y(P

)1Y1X(P)1Y|1X(P

7.30.

21.

)1Y(P

)1Y0X(P)1Y|0X(P

Example 7.5 - continued

The sum is equal to 1.0

Conditional Probability (Optional)

)yY(P

)yYxX(P)yYxX(P

Page 45: 7-1 Random Variables and Discrete probability Distributions Chapter 7

7-452007會計資訊系統計學 ( 一 )上課投影片

Two random variables are said to be independent(獨立) when

Example 7.5 - continued Since P(X=0|Y=1)=.7 P(X=0)=.4, The variables X and Y

are not independent.

Conditions for Independence (optional)

y)x)P(YP(Xy)YxP(X or

y)P(Y)xXyP(Y or

)xX(P)yYxX(P

Page 46: 7-1 Random Variables and Discrete probability Distributions Chapter 7

7-462007會計資訊系統計學 ( 一 )上課投影片

Sum of Two Variables The bivariate distribution allows us to develop the

probability distribution of any combination of the two variables, of particular interest is the sum of two variables.

to answer questions like “what is the probability that two houses are sold”?

P(X+Y=2) = P(0,2) + P(1,1) + P(2,0) = .07 + .06 + .06 = .19 If we consider our example of Xavier and Yvette selling

houses, we can create a probability distribution.

Page 47: 7-1 Random Variables and Discrete probability Distributions Chapter 7

7-472007會計資訊系統計學 ( 一 )上課投影片

P(X+Y=0) = P(X=0 Y=0) = .12P(X+Y=1) = P(X=0 Y=1)+ P(X=1 Y=0) =.21 + .42

= .63P(X+Y=2) = P(X=0 Y=2)+ P(X=1 Y=1)+ P(X=2 Y=0)

= .07 + .06 + .06 = .19

The probabilities P(X+Y)=3 and P(X+Y) =4 are calculated the same way. The distribution follows

XY 0 1 2 p(y)0 .12 .42 .06 .601 .21 .06 .03 .302 .07 .02 .01 .10p(x) .40 .50 .10 1.00

The Probability Distribution of X+Y

Page 48: 7-1 Random Variables and Discrete probability Distributions Chapter 7

7-482007會計資訊系統計學 ( 一 )上課投影片

Sum of Two Variables The distribution of X+Y

Likewise, we can compute the expected value, variance, and standard deviation of X+Y in the usual way.

E(X + Y) = 0(.12) + 1(.63) + 2(.19) + 3(.05) + 4(.01) = 1.2

V(X + Y) = (0 – 1.2)2(.12) + … + (4 – 1.2)2(.01) = .56

75.56.)YX(VYX

x + y 0 1 2 3 4p(x+y) .12 .63 .19 .05 .01

Page 49: 7-1 Random Variables and Discrete probability Distributions Chapter 7

7-492007會計資訊系統計學 ( 一 )上課投影片

The Expected Value and Variance of X+Y We can derive laws of expected value and variance

for the sum of two variables as follows:

E(X + Y) = E(X) + E(Y) V(X + Y) = V(X) + V(Y) + 2COV(X, Y) If X and Y are independent, COV(X, Y) = 0 and thus

V(X + Y) = V(X) + V(Y)

Generally, for constant a and b, E(aX + bY) = aE(X) + bE(Y) V(aX + bY) = a2V(X) + b2V(Y) + 2abCOV(X, Y)

Page 50: 7-1 Random Variables and Discrete probability Distributions Chapter 7

7-502007會計資訊系統計學 ( 一 )上課投影片

7.4 Binomial Distribution

The binomial distribution (二項分配) is the probability distribution that results from doing a “binomial experiment(二項實驗)” . Binomial experiments have the following properties:

Fixed number of trials, represented as n. Each trial has two possible outcomes, a “success” and a

“failure”. P(success)=p (and thus: P(failure)=1–p), for all trials. The trials are independent, which means that the outcome

of one trial does not affect the outcomes of any other trials.

Page 51: 7-1 Random Variables and Discrete probability Distributions Chapter 7

7-512007會計資訊系統計學 ( 一 )上課投影片

Success and Failure …are just labels for a binomial experiment, there is no value

judgment implied.

For example a coin flip will result in either heads or tails. If we define “heads” as success then necessarily “tails” is considered a failure (inasmuch as we attempting to have the coin lands heads up).

Other binomial experiment notions: A coin flipped results in heads or tails An election candidate wins or loses An employee is male or female A car uses 87octane gasoline, or another gasoline.

Page 52: 7-1 Random Variables and Discrete probability Distributions Chapter 7

7-522007會計資訊系統計學 ( 一 )上課投影片

Binomial Random Variable The random variable of a binomial experiment is defined as

the number of successes in the n trials, and is called the binomial random variable(二項隨機變數) .

E.g. flip a fair coin 10 times.1) Fixed number of trials n=102) Each trial has two possible outcomes {heads (success), tails

(failure)} 3) P(success)= 0.50; P(failure)=1–0.50 = 0.50 4) The trials are independent (i.e. the outcome of heads on the first

flip will have no impact on subsequent coin flips).

Hence flipping a coin ten times is a binomial experiment since all conditions were met.

Page 53: 7-1 Random Variables and Discrete probability Distributions Chapter 7

7-532007會計資訊系統計學 ( 一 )上課投影片

Binomial Random Variable The binomial random variable counts the number of

successes in n trials of the binomial experiment. It can take on values from 0, 1, 2, …, n. Thus, its a discrete random variable.

Page 54: 7-1 Random Variables and Discrete probability Distributions Chapter 7

7-542007會計資訊系統計學 ( 一 )上課投影片

S1

S2

S2

F2

F1

F2

S3

S3

S3

S3

F3

F3

F3

F3

P(SSS)=p3

P(SSF)=p2(1-p)

P(SFS)=p(1-p)p

P(SFF)=p(1-p)2

P(FSS)=(1-p)p2

P(FSF)=(1-p)p(1-p)P(FFS)=(1-p)2p

P(FFF)=(1-p)3

P(S 1)=p

P(S2|S1)

P(F1 )=1-p

P(F2 |S

1)

P(S2|F1)

P(F2 |F

1)

P(S2)=p

P(F2)=1-p

P(S2)=p

P(F2 )=1-p

P(S3|S2,S1)

P(F3|S2,S1)

P(S3|F2,S1)

P(F3|F2,S1)

P(S3|S2,F1)

P(S3|F2,F1)

P(F3|F2,F1)

P(F3|S2,F1)

P(S3)=p

P(S3)=p

P(S3)=p

P(S3)=p

P(F3)=1-p

P(F3)=1-p

P(F3)=1-p

P(F3)=1-p

Since the outcome of each trial is independent of the previous outcomes, we can replace the conditional probabilities with the marginal probabilities.

Since the outcome of each trial is independent of the previous outcomes, we can replace the conditional probabilities with the marginal probabilities.

P(S2|S1)

Developing the Binomial Probability Distribution (n = 3)

Page 55: 7-1 Random Variables and Discrete probability Distributions Chapter 7

7-552007會計資訊系統計學 ( 一 )上課投影片

P(SSS)=p3

P(SSF)=p2(1-p)

P(SFS)=p(1-p)p

P(SFF)=p(1-p)2

P(FSS)=(1-p)p2

P(FSF)=(1-p)p(1-p)P(FFS)=(1-p)2p

P(FFF)=(1-p)3

Let X be the number of successes in three trials. Then,

X = 3

X =2

X = 1

X = 0

P(X = 3) = p3

P(X = 2) = 3p2(1-p)

P(X = 1) = 3p(1-p)2

P(X = 0) = (1- p)3

This multiplier is calculated in the following formula

SSS

SS

S S

SS

Developing the Binomial Probability Distribution (n = 3)

Page 56: 7-1 Random Variables and Discrete probability Distributions Chapter 7

7-562007會計資訊系統計學 ( 一 )上課投影片

Calculating the Binomial Probability

In general, The binomial probability is calculated by:

n)1n(21!n

)!xn(!x

!nC where

n,,1,0x

)p1(pC)x(p)xX(P

nx

xnxnx

Page 57: 7-1 Random Variables and Discrete probability Distributions Chapter 7

7-572007會計資訊系統計學 ( 一 )上課投影片

Example 7.9 & 7.10 Pat Statsdud is registered in a statistics course and intends

to rely on luck to pass the next quiz. The quiz consists on 10 multiple choice questions with 5

possible choices for each question, only one of which is the correct answer.

Pat will guess the answer to each question Find the following probabilities

• Pat gets no answer correct• Pat gets two answer correct• Pat fails the quiz

Calculating the Binomial Probability

Page 58: 7-1 Random Variables and Discrete probability Distributions Chapter 7

7-582007會計資訊系統計學 ( 一 )上課投影片

Pat Statsdud Pat Statsdud is a student taking a statistics course. Pat’s exam

strategy is to rely on luck for the next quiz. The quiz consists of 10 multiple-choice questions. Each question has five possible answers, only one of which is correct. Pat plans to guess the answer to each question.

Is this a binomial experiment? Check the conditions: There is a fixed finite number of trials (n=10). An answer can be either correct or incorrect (two possible

outcomes). The probability of a correct answer (P(success)=.20) does not

change from question to question. Each answer is independent of the others.

Page 59: 7-1 Random Variables and Discrete probability Distributions Chapter 7

7-592007會計資訊系統計學 ( 一 )上課投影片

Pat Statsdud n=10, and P(success) = .20

What is the probability that Pat gets no answers correct?

I.e. # success, x, = 0; hence we want to know P(x=0)

Pat has about an 11% chance of getting no answers correct using the guessing strategy.

1074.)8)(.1(1

)2.1()2(.)!010(!0

!10)0(P

10

0100

Page 60: 7-1 Random Variables and Discrete probability Distributions Chapter 7

7-602007會計資訊系統計學 ( 一 )上課投影片

Pat Statsdud n=10, and P(success) = .20

What is the probability that Pat gets two answers correct?

I.e. # success, x, = 2; hence we want to know P(x=2)

Pat has about a 30% chance of getting exactly two answers correct using the guessing strategy.

3020.)1678)(.04(.45

)2.1()2(.)!210(!2

!10)2(P 2102

Page 61: 7-1 Random Variables and Discrete probability Distributions Chapter 7

7-612007會計資訊系統計學 ( 一 )上課投影片

Cumulative Probability Thus far, we have been using the binomial probability

distribution to find probabilities for individual values of x. To answer the question:

“Find the probability that Pat fails the quiz”

If a grade on the quiz is less than 50% (i.e. 5 questions out of 10), that’s considered a failed quiz.

Thus, we want to know what is: P(X ≤ 4) to answer

requires a cumulative probability(累積機率) , that is, P(X ≤ x)

Page 62: 7-1 Random Variables and Discrete probability Distributions Chapter 7

7-622007會計資訊系統計學 ( 一 )上課投影片

Pat Statsdud P(X ≤ 4) = P(0) + P(1) + P(2) + P(3) + P(4)

We already know P(0) = .1074 and P(2) = .3020. Using the binomial formula to calculate the others:

P(1) = .2684 , P(3) = .2013, and P(4) = .0881

We have P(X ≤ 4) = .1074 + .2684 + … + .0881 = .9672

Thus, its about 97% probable that Pat will fail the test using the luck strategy and guessing at answers.

Page 63: 7-1 Random Variables and Discrete probability Distributions Chapter 7

7-632007會計資訊系統計學 ( 一 )上課投影片

Mean and Variance of Binomial Variable

As you might expect, statisticians have developed general formulas for the mean, variance, and standard deviation of a binomial random variable. They are:

)p1(np

)p1(np)X(V

np)X(E2

Page 64: 7-1 Random Variables and Discrete probability Distributions Chapter 7

7-642007會計資訊系統計學 ( 一 )上課投影片

Example 7.11 If all the students in Pat’s class intend to guess

the answers to the quiz, what is the mean and the standard deviation of the quiz mark?

Solution

Mean and Variance of Binomial Variable

26.16.1)p1(np

6.1)8)(.2(.10)p1(np)X(V

2)2(.10np)X(E2

Page 65: 7-1 Random Variables and Discrete probability Distributions Chapter 7

7-652007會計資訊系統計學 ( 一 )上課投影片

Binomial Table Calculating binomial probabilities by hand is tedious

and error prone. There is an easier way. Refer to Table 1 in Appendix B. For the Pat Statsdud example, n=10, so the first important step is to get the correct table!

Page 66: 7-1 Random Variables and Discrete probability Distributions Chapter 7

7-662007會計資訊系統計學 ( 一 )上課投影片

Binomial Table The probabilities listed in the tables are cumulative, i.e. P(X ≤ k) – k is the row index; the columns of the

table are organized by P(success) = p

Page 67: 7-1 Random Variables and Discrete probability Distributions Chapter 7

7-672007會計資訊系統計學 ( 一 )上課投影片

Binomial Table What is the probability that Pat fails the quiz?i.e. what is P(X ≤ 4), given P(success) = .20 and n=10 ?

P(X ≤ 4) = .967

Page 68: 7-1 Random Variables and Discrete probability Distributions Chapter 7

7-682007會計資訊系統計學 ( 一 )上課投影片

Binomial Table The binomial table gives cumulative probabilities for P(X ≤ k), but as we’ve seen in the last example,

P(X = k) = P(X ≤ k) – P(X ≤ [k–1])

Likewise, for probabilities given as P(X ≥ k), we have: P(X ≥ k) = 1 – P(X ≤ [k–1])

Page 69: 7-1 Random Variables and Discrete probability Distributions Chapter 7

7-692007會計資訊系統計學 ( 一 )上課投影片

Binomial Table What is the probability that Pat gets no answers correct?i.e. what is P(X = 0), given P(success) = .20 and n=10 ?

P(X = 0) = P(X ≤ 0) = .107

Page 70: 7-1 Random Variables and Discrete probability Distributions Chapter 7

7-702007會計資訊系統計學 ( 一 )上課投影片

Binomial Table What is the probability that Pat gets two answers correct?i.e. what is P(X = 2), given P(success) = .20 and n=10 ?

P(X = 2) = P(X≤2) – P(X≤1) = .678 – .376 = .302remember, the table shows cumulative probabilities.

Page 71: 7-1 Random Variables and Discrete probability Distributions Chapter 7

7-712007會計資訊系統計學 ( 一 )上課投影片

=BINOMDIST() Excel Function There is a binomial distribution function in Excel that can also be used

to calculate these probabilities. For example: What is the probability that Pat gets two answers correct?

# successes

# trials

P(success)

cumulative(i.e. P(X≤x)?)

P(X=2)=.3020

Page 72: 7-1 Random Variables and Discrete probability Distributions Chapter 7

7-722007會計資訊系統計學 ( 一 )上課投影片

=BINOMDIST() Excel Function There is a binomial distribution function in Excel that can also be used

to calculate these probabilities. For example: What is the probability that Pat fails the quiz?

# successes

# trials

P(success)

cumulative(i.e. P(X≤x)?)

P(X≤4)=.9672

Page 73: 7-1 Random Variables and Discrete probability Distributions Chapter 7

7-732007會計資訊系統計學 ( 一 )上課投影片

Binomial Distribution

Additional Example Records show that 30% of the customers in a shoe store

make their payments using a credit card. This morning 20 customers purchased shoes. Use Table 1 of Appendix B to answer some questions

stated in the next slide. This is a binomial experiment with n=20 and p=.30. Let X be the number of customers who use a credit

card.

Page 74: 7-1 Random Variables and Discrete probability Distributions Chapter 7

7-742007會計資訊系統計學 ( 一 )上課投影片

pk .01……….. .300..

11

What is the probability that at least 12 customers used a credit card? (n=20 and p=.30)

.995

P(At least 12 used credit card) = P(X≥12)=1 - P(X≤11)= 1-.995 = .005

Binomial Distribution

Page 75: 7-1 Random Variables and Discrete probability Distributions Chapter 7

7-752007會計資訊系統計學 ( 一 )上課投影片

pk .01……….. .30

What is the probability that at least 3 but not more than 6 customers used a credit card? (n=20 and p=.30)

.608

P(3 ≤ X ≤ 6)=P(X=3 or 4 or 5 or 6)=P(X ≤ 6) - P(X ≤ 2)=.608 - .035 = .573

02.6

.035

Binomial Distribution

Page 76: 7-1 Random Variables and Discrete probability Distributions Chapter 7

7-762007會計資訊系統計學 ( 一 )上課投影片

What is the expected number of customers who used a credit card?

E(X) = np = 20(.30) = 6

Find the probability that exactly 14 customers did not use a credit card.

Let Y be the number of customers who did not use a credit card.P(Y=14) = P(X=6) = P(X6) - P(x5) = .608 - .416 = .192

Find the probability that at least 9 customers did not use a credit card. P(Y9) = P(X11) = .995

Binomial Distribution

Page 77: 7-1 Random Variables and Discrete probability Distributions Chapter 7

7-772007會計資訊系統計學 ( 一 )上課投影片

The Poisson distribution, named for Simeon Poisson, is a discrete probability distribution

It refers to the number of events (a.k.a. successes) within a specific time period or a specific region of space.

7.5 Poisson Distribution

Page 78: 7-1 Random Variables and Discrete probability Distributions Chapter 7

7-782007會計資訊系統計學 ( 一 )上課投影片

The number of errors a typist makes per page. The number of flaws in a bolt of cloth. The number of telephone calls received by a

switchboard in an hour. The number of customers entering a service station

per day. The number of cars arriving at a service station in a

week. The number of accidents in 1 day on a particular

stretch of highway. (The interval is defined by both time, 1 day, and space, the particular stretch of highway.)

Examples of Poisson Distribution

Page 79: 7-1 Random Variables and Discrete probability Distributions Chapter 7

7-792007會計資訊系統計學 ( 一 )上課投影片

Like a binomial experiment, a Poisson experiment has four defining characteristic properties: The number of successes (events) that occur in any

interval is independent of the number of successes that occur in any other interval.

The probability of a success in an interval is• the same for all intervals of the same size,• proportional to the size of the interval.

The probability that two or more successes will occur in an interval approaches 0 as the interval becomes smaller.

Properties of the Poisson Experiment

Page 80: 7-1 Random Variables and Discrete probability Distributions Chapter 7

7-802007會計資訊系統計學 ( 一 )上課投影片

The Poisson random variable is the number of successes that occur in a period of time or an interval of space in a Poisson experiment.

E.g. On average, 96 trucks arrive at a border crossing every hour.

E.g. The number of typographic errors in a new textbook edition averages 1.5 per 100 pages.

Poisson Distribution

successes

time period

successes (?!) interval

Page 81: 7-1 Random Variables and Discrete probability Distributions Chapter 7

7-812007會計資訊系統計學 ( 一 )上課投影片

The Poisson Random Variable indicates the number of successes that occur during a given time interval or in a specific region in a Poisson experiment

Probability Distribution of the Poisson Random Variable.

The Poisson Variable and Distribution

V(X)E(X)

base logarithm natural is e and

interval the in successesof number mean the is μ where

0,1,2,x for !x

e)x(p)xX(P

x

Page 82: 7-1 Random Variables and Discrete probability Distributions Chapter 7

7-822007會計資訊系統計學 ( 一 )上課投影片

As mentioned on the Poisson experiment slide:

The probability of a success is proportional to the size of the interval

Thus, knowing an error rate of 1.5 typos per 100 pages, we can determine a mean value for a 400 page book as:

=1.5(4) = 6 typos / 400 pages.

Poisson Distribution

Page 83: 7-1 Random Variables and Discrete probability Distributions Chapter 7

7-832007會計資訊系統計學 ( 一 )上課投影片

Poisson Distributions (Graphs)

3678.e!01e

)0(p)0X(P 101

3678.e!11e

)1(p)1X(P 111

1839.2

e!21e

)2(p)2X(P121

0613.6

e!31e

)3(p)3X(P131

0

0.1

0.2

0.3

0.4

1 2 3 4 5 6 7 8 9 10 11 0 1 2 3 4 5

Page 84: 7-1 Random Variables and Discrete probability Distributions Chapter 7

7-842007會計資訊系統計學 ( 一 )上課投影片

Poisson Distributions (Graphs)

0

0.05

0.1

0.15

0.2

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16

0

0.05

0.1

0.15

0.2

1 2 3 4 5 6 7 8 9 10 11

00.050.1

0.150.2

0.250.3

1 2 3 4 5 6 7 8 9 10 11

Poisson probability distribution with =2

Poisson probability distribution with =5

Poisson probability distribution with =7

0 1 2 3 4 5 6 7

0 1 2 3 4 5 6 7 8 9 10

0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15

Page 85: 7-1 Random Variables and Discrete probability Distributions Chapter 7

7-852007會計資訊系統計學 ( 一 )上課投影片

Poisson Distribution

Example 7.12 The number of Typographical errors in new editions of

textbooks is Poisson distributed with a mean of 1.5 per 100 pages.

100 pages of a new book are randomly selected. What is the probability that there are no typos? That is, what is P(X=0) given that =1.5?

Solution.2231

!0

5.1e

!x

e)0X(P

0-1.5x

“ There is about a 22% chance of finding zero errors”

Page 86: 7-1 Random Variables and Discrete probability Distributions Chapter 7

7-862007會計資訊系統計學 ( 一 )上課投影片

Example 7.13 For a 400 page book, what is the probability that there

are no typos?

Solution

“ there is a very small chance there are no typos”

.002479!0

6e

!x

e)0X(P

0-6x

Important!A mean of 1.5 typos per100 pages, is equivalent to 6 typosper 400 pages.

Page 87: 7-1 Random Variables and Discrete probability Distributions Chapter 7

7-872007會計資訊系統計學 ( 一 )上課投影片

For a 400 page book, what is the probability that there are five or less typos?

This is rather tedious to solve manually. A better alternative is to refer to Table 2 in Appendix B.

k=5, μ =6, and P(X ≤ k) = .446

Example 7.13

“ there is about a 45% chance there are 5 or less typos”

.446!5

6e

!1

6e

!0

6e

)5(P)1(P)0(P)5X(P561606-

Page 88: 7-1 Random Variables and Discrete probability Distributions Chapter 7

7-882007會計資訊系統計學 ( 一 )上課投影片

Example 7.13 Excel is an even better alternative:

Page 89: 7-1 Random Variables and Discrete probability Distributions Chapter 7

7-892007會計資訊系統計學 ( 一 )上課投影片

Additional Example Cars arrive at a tollbooth at a rate of 360 cars per hour. What is the probability that only two cars will arrive during

a specified one-minute period? (Use the formula) Solution

The probability distribution of arriving cars for any one-minute period is Poisson with = 360/60 = 6 cars per minute. Let X denote the number of arrivals during a one-minute period.

0446.!2

6e)2X(P

26

Poisson Distribution

Page 90: 7-1 Random Variables and Discrete probability Distributions Chapter 7

7-902007會計資訊系統計學 ( 一 )上課投影片

What is the probability that only two cars will arrive during a specified one-minute period? (Use Table 2, Appendix B.)

P(X = 2) = P(X2) - P(X1) = .062 - .017 = .045

K 0.1 …………… 3 …………. 6 …….0 0.905 0.05 0.0021 0.995 0.199 0.0172 1 0.423 0.062

selectedafor)kX(PprobabiltytheprovidestableThe

Poisson Distribution - Using a Table

Page 91: 7-1 Random Variables and Discrete probability Distributions Chapter 7

7-912007會計資訊系統計學 ( 一 )上課投影片

What is the probability that at least four cars will arrive during a one-minute period? (Use table 2 , Appendix B)

P(X4) = 1 - P(X3) = 1 - .151 = .849

Poisson Distribution - Using a Table

Page 92: 7-1 Random Variables and Discrete probability Distributions Chapter 7

7-922007會計資訊系統計學 ( 一 )上課投影片

Example A warehouse has a policy of examining 50 sunglasses

from each incoming lot, and accepting the lot only if there are no more than two defective pairs.

What is the probability of a lot being accepted if, in fact, 2% of the sunglasses are defective?

Solution This is a binomial experiment with n = 50, p = .02. Tables for n = 50 are not available; p<.05; thus, a Poisson approximation is appropriate [ =

(50)(.02) =1] P(Xpoisson2) = .920 (true binomial probability = .922)

Poisson Approximation of the Binomial (Optional)