unit8partii????? 4.0 june19probability-proportional-to-size:pps uses a monetary unit as a sampling...

1

1

การประยกตใชวชาสถตในการตรวจสอบ

Statistics and Sampling

19 มถนายน 2554

Manit Panichakul, CIA, CISA, CISM

2

Unit 8: Statistics and Sampling8.1 Probability and Distributions8.2 Statistics8.3 Hypothesis Testing8.4 Sampling Fundamentals8.5 Attribute Sampling8.6 Variables Sampling8.7 Probability-Proportional-to-Size (PPS) Sampling

8.8 Statistical Quality Control8.9 Summary

2

3

Audit Sampling and ProbabilityResource limitations require IA to apply sampling techniquesStatistical methods permit a quantitative assessment of the accuracy and reliability of the sample resultsIA can evaluate their hypothesis and reduce uncertainty

4

ProbabilityProbability is the numerical measure of the likelihood that an event will occurValue is between 0 and 1Sum of the probabilities ofall mutually exclusive and collectively exhaustive events is 1Normal Distribution: 99% lies with

+/-3 SD (Mean=0, Variance=1)Small sample size (n<30) given SD is known

T-distribution provide reasonable estimate of the population mean

Certain

Impossible

.5

1

0

3

5

•In statistics, each observation needs to be mutually exclusive in order for them to be properly differentiated and organized into separate categories, (such as maleand female).•A property of a set of categories such that an individual or object is included in only one category. •Essentially, something being mutually exclusive serves to prevent it from being counted more than once, being true or false.•Two mutually exclusive events cannot both occur. Mutually exclusive events(joint prob): Pr(A ∩ B) = 0. For example, the result "1" and "2" from the roll of a die are mutually exclusive - it cannot be a 1 and a 2.

6

ขอแตกตางระหวางประชากรและตวอยาง

เปนสวนยอยของประชากร มความจาเปนเพอลดคาใชจายจากการศกษาประชากร >>>>>Statistic

คอ ทกหนวยในเรองทสนใจศกษา

ในงาน ควรจะแยกประชากรในแตละเรอง ถงแมจะเปนองคกรเดยวกนเพอความชดเจนของผลทได>>>>Parameter

ตวอยาง (Sample)ประชากร (Population)

Nonparametric, or distribution-free (rank order is known – qualitative data).

Central Limit Theorem: regardless of distribution, shape of the sampling distribution of the mean approaches the Normal Distribution as the sample sizeis increased.

4

7

•Precision: interval about an estimate of a population parameter•Statistical sampling allows quantitative assessment

(precision and reliability)•Sampling risk (Attribute sampling: test of controls):

•Risk of assessing control too low (type II error)•Risk of assessing control too high (type I error)

•Sampling risk (Variable sampling: substantive test/dollar limit test)•Risk of incorrect acceptance (type II error)•Risk of incorrect rejection (type I error)

•Sampling size: population size, acceptable risk, variability in the population and the acceptable misstatement or deviation rateProbability-Proportional-to-Size:PPS uses a monetary unit as a sampling unit to systematically selects every nth unit•Statistical control charts are graphic aids for monitoring of any process subject to acceptable or unacceptable variations

8

Expected ValueApplying basic probability conceptsMultiplying the prob of each outcome by its pay-offRepresents the long-term average pay-off (mean) for repeated trials

Type of probObjective prob: logic or actual experience

Pr: 1/6 for dice roll, ½ for coin toss, 1/54 for card drawSubjective prob: estimates based on judgment and past experience of the likelihood of future event

Joint prob: 60% of students are male Pr(m)=6/10, if 1/6 male students have a B average, Pr(B/m)=1/6, prob that any given std (m or f) selected as random. Joint prob (both m and B grade) Pr(m/B)= Pr(m)xPr(b/m)=6/10x1/6=1/10

Ex Pr that either one or both of two events will occur= Sum of their separate Pr>>Two fair coins are thrown, Pr at least one will come up heads is Pr(#1 H)

+Pr(#2 H) - Pr(#1 H&#2 H)= (.5)+(.5)-(.5x.5)= .75

5

9

Expected Value

Ex from earlier ex, 1/3 students have B average: Pr(B average) is 1/3), Pr(B/m)=P(B)xPr(male)= 1/3x6/10=2/10 Q;Pr (male or B average) = Pr(male) + P(B average) – Pr(B ∩ male)

= 6/10 + 1/3 – 2/10 = .7333Two coins One Die#1 #2 Pr Outcome Chances DecimalH H .25 1 Dot 1/6 0.166667H T .25 2 Dot 1/6 0.166667T H .25 3 Dot 1/6 0.166667T T .25 4 Dot 1/6 0.166667

1.0(Certainty) 5 Dot 1/6 0.1666676 Dot 1/6 0.166667

1.00 (Certainty)Uniform distribution: Pr = 1/N

10

Random Variable

Random variableOutcomes of an experiment expressed numericallye.g.: Toss a die twice; Count the number of times the number 4 appears (0, 1 or 2 times)

6

11

Discrete Random Variable

Discrete random variableObtained by counting (1, 2, 3, etc.)Usually a finite number of different valuese.g.: Toss a coin five times; Count the number of tails (0, 1, 2, 3, 4, or 5 times)

12

Summary Measures

Example of expected value (the mean):Toss two coins, count the number of

tails, compute expected value

( )( )( ) ( )( ) ( )( ) 0 2.5 1 .5 2 .25 1

j jj

X P Xμ =

= + + =

∑

continued

7

13

Summary Measures

VarianceWeight average squared deviation about the mean Example of variance:

Toss two coins, count number of tails, compute variance

(continued)

( ) ( ) ( )222j jE X X P Xσ μ μ⎡ ⎤= − = −⎣ ⎦ ∑

( ) ( )( ) ( ) ( ) ( ) ( ) ( )

22

2 2 2 0 1 .25 1 1 .5 2 1 .25 .5

j jX P Xσ μ= −

= − + − + − =

∑

14

Important Discrete Probability Distributions

Discrete Probability Distributions

Binomial Hypergeometric Poisson

8

15

Binomial Probability Distribution

“n” Identical trialse.g.: 15 tosses of a coin; 10 light bulbs taken from a warehouse

Two mutually exclusive outcomes on each trial (success or failure)

e.g.: Heads or tails in each toss of a coin; defective or not defective light bulb

Trials are independentThe outcome of one trial does not affect the outcome of the other

การแจกแจงแบบทวนาม

16

Binomial Probability Distribution

Constant probability for each triale.g.: Probability of getting a tail is the same each time a coin is tossed

Two sampling methodsInfinite population without replacementFinite population with replacement

>>>apply for Stat Quality Control

(continued)

9

17

ให X เปนจานวนครงทเกดเหตการณทสนใจจากการทดลอง n ครงทอสระกน เรยก X วาตวแปรสมทวนาม และเรยกการแจกแจงความนาจะเปนของ X วา การแจกแจงแบบทวนาม ซงสามารถพจารณาหารปแบบการแจกแจงความนาจะเปนได ดงนนยาม จากการทดลองทวนามซงมการทดลองอยางอสระกน n ครง แตละครงของการ ทดลองมโอกาสทจะเกดเหตการณทสนใจเปน p และโอกาสทจะเกดเหตการณทไมสนใจเปน q ให X เปนตวแปรสมทวนามแลว f(x) เปนฟงกชนการแจกแจงความนาจะเปนของตวแปรสมทวนาม เขยนไดดงน P(X) = nCxp x q n-x

การใชสตรในการคานวณการแจกแจงทวนามจะยงยากเมอ n มขนาดใหญขน ดงนน การแจกแจงแบบปกตสามารถนามาใชประมาณคาความนาจะเปนแบบทวนามได (np>5)

18

10

19

Binomial formulan! x pr(1-p)n-r

r! (n - r)!

Dining room table to be balance for male and female. The tables have only 6, 10 or 16 seatsPopulation are male 50% and female 50%>>Pr(3m,3f) = 6! x(.5)2x(1-.5)6-3=.3125

3!(6-3)!

20

Poisson Distribution

Poisson processDiscrete events in an “interval”The probability of success isindependent from interval to intervale.g.: The number of customers arriving in 15 minutesWhen large sample size& is small (<7)>>Approaches the Binomial, = np

PX xx

x( |!= λ

λλe-

11

21

การแจกแจงแบบปวสซอง (Poisson Distribution)การทดลองทเกดขนในชวงเวลาหนง หรอขอบเขตหนงห x แทนตวแปรของเหตการณทสนใจในชวงเวลาหนงหรอในขอบเขตหนง โดยทคาของตวแปร x มจานวนไมจากด เชนx แทนจานวนลกคาทเขามาซอของในหางสรรพสนคา ในชวงเวลา 12.00 – 15.00 น.x แทนจานวนรอยตาหนบนผา 1 ผนจะไดวา x อาจจะมคาเปน 0, 1, 2, 3,…………จะเหนไดวา คา x ตางๆ ทกลาวมา จะมจานวนครงทไมแนนอน จงใชการแจกแจงแบบปวสซองเขามาชวย แกปญหาในการหาคาความนาจะเปนดงกลาว

22

ฟงกชนความนาจะเปนแบบปวสซอง

โดยท x = 0 , 1 , 2 , …………….e = คาคงท 2.71828.

= คาเฉลยของเหตการณหรอการทดลองทสนใจ= mean/variance

Truck, average 2 trucks are involved in an accident each month= 2 >The Pr(0 craches) = 1e(-2) =e(-2) =.135

0! Pr(4 craches) = (4) x e(- ) = 2 (4) x e(-2) = .09

4! 4!

12

23

Hypergeometric Distribution

“n” Trials in a sample taken from a finite population of size NSample taken without replacementTrials are dependentConcerned with finding the probability of “X”successes in the sample where there are “A”successes in the population

24

Continuous Probability Distributions

Continuous random variableValues from interval of numbersAbsence of gaps

Continuous probability distributionDistribution of continuous random variable

Most important continuous probability distribution

The normal distribution

13

25

The Normal Distribution

“Bell shaped”SymmetricalMean, median and mode are equalInterquartile rangeequals 1.33 σRandom variablehas infinite range

Mean Median Mode

X

f(X)

μ

26

α and 1 – α

34% 34%

95.5%68%

1 SD = 68%1.64SD = 90%1.96 SD= 95%2 SD = 95.5%2.57 SD= 99%

14

27

Many Normal Distributions

By varying the parameters σ and μ, we obtain different normal distributions

There are an infinite number of normal distributions

Kurtosis

28

Which Table to Use?

An infinite number of normal distributions means an infinite number of tables to look up!

15

29

The t-distribution (Student’s distribution)

Usually fewer than 30, with unknown population varianceFor large sample sizes (n>30), is almost identical to the standard normal distributionFor small sample sizes (n<30), only for sample standard deviation is known, can provide a reasonable estimate for test of the population mean if population is normally distributedUseful because large sample sizes are often too expensive

30

Chi-square distribution

Test of Fit between actual data and theoretical distribution (Goodness of Fit) based on comparison of sample var and population varEx Canning machine exhibited a long-term standard deviation of .4 onces (σ = .4)A new machine is tested, only 15 cans are examined with results: standard deviation (s)=.311>> χ2 = (n-1)s2/ σ2 =(15−1).3112/.42 =8.463 compare with table error.05, 14 df=23.68

Conclusion: critical value is much higher than sample statistic then the hypothesis cannot be rejected.

16

31

สถต (Statistics) คออะไร

สถต คอตวเลขตางๆ ทแสดงใหทราบถงปรากฎการณอยางใดอยางหนง หรอ ขอเทจจรงตางๆ ตลอดจนความสมพนธระหวางตวเลขทรวบรวมมา เชน กรมบญชกลาง ป 2552 มพนกงาน …..คน เขาใหม ….คน ยายเขา….คน ยายออก…..คน มอตราวาง …..อตรา อตราสวนพนกงานชายตอพนกงานหญงคดเปน ....ตอ ….วชาสถต เปนวธการทางคณตศาสตร หรอ กระบวนการของการรวบรวม การบรรยาย การจดระเบยบ การวเคราะห การนาเสนอ และการตความหมาย ของขอมลทเปนตวเลข สถต จงเปนเครองมอในการวดผล และการวจย

32

ขนตอนสถต การรวบรวมขอมล (Data Collection): เกบขอมลขอเทจจรงทเปนตวเลขหรอสามารถแปลงคาใหเปนตวเลข

การวเคราะหขอมล (Data Analysis): แจกแจง จดหมวดหม คาณวนคาสถต เพอใหความหมายการนาเสนอขอมล (Data Presentation): จดใหอยในรปทเหมาะสม เชน ตาราง กราฟ หรอ รปภาพการตความขอมล (Data Interpretation): ตความเปนภาษา ความหมายทผอนเขาใจ เพอนาไปสขอสรป

17

33

ประเภทของวชาสถต ตามลกษณะของขอมลไดเปนสองประเภทคอ

3.1 สถตเชงอนมาน(Inferential Statistics) หมายถง สถตทใชจด

กระทากบขอมลทไดมาเพยงบางสวนของขอมลทงหมด >Draw conclusions about a population base on a sample

3.2 สถตเชงบรรยาย(Descriptive Statistics) หมายถง สถตทใชจด

กระทากบขอมลทไดมาเฉพาะเรองใดเรองหนง

>Ways to summarize large amount of raw data

34

Descriptive StatisticsMeasures of central tendency

Mean, median, mode, geometric mean, midrange

Measure of variationRange, Interquartile range, variance and standard deviation, coefficient of variation

ShapeSymmetric, skewed, using box-and-whisker plots

•Coefficient of correlation•Pitfalls in numerical descriptive measures and ethical considerations

18

35

Summary Measures

Central Tendency

MeanMedian

Mode

Quartile

Summary Measures

Variation

Variance

Standard Deviation

Coefficient of Variation

Range

36

Measures of Central Tendency

Central Tendency

Average Median Mode

1

1

n

ii

N

ii

XX

n

X

Nμ

=

=

=

=

∑

∑

19

37

Mean (Arithmetic Mean)

Mean (arithmetic mean) of data valuesSample mean

Population mean

1 1 2

n

ii n

XX X XX

n n= + + +

= =∑ L

1 1 2

N

ii N

XX X X

N Nμ = + + +

= =∑ L

Sample Size

Population Size

38

Mean (Arithmetic Mean)

The most common measure of central tendencyAffected by extreme values (outliers)

(continued)

0 1 2 3 4 5 6 7 8 9 10 0 1 2 3 4 5 6 7 8 9 10 12 14

Mean = 5 Mean = 6

20

39

Median

Robust measure of central tendencyNot affected by extreme values

In an ordered array, the median is the “middle”number

If n or N is odd, the median is the middle numberIf n or N is even, the median is the average of the two middle numbers

0 1 2 3 4 5 6 7 8 9 10 0 1 2 3 4 5 6 7 8 9 10 12 14

Median = 5 Median = 5

40

Mode

A measure of central tendencyValue that occurs most oftenNot affected by extreme valuesUsed for either numerical or categorical dataThere may be no modeThere may be several modes

0 1 2 3 4 5 6 7 8 9 10 11 12 13 14

Mode = 9

0 1 2 3 4 5 6

No Mode

21

41

Measures of Dispersion/Variation

Variation

Variance Standard Deviation Coefficient of Variation

PopulationVariance

Sample Variance

PopulationStandardDeviation

Sample Standard Deviation

Range

Interquartile Range

42

Range

Measure of variationDifference between the largest and the smallest observations:

Ignores the way in which data are distributed

Largest SmallestRange X X= −

7 8 9 10 11 12

Range = 12 - 7 = 5

7 8 9 10 11 12

Range = 12 - 7 = 5

22

43

Measure of variationAlso known as midspread

Spread in the middle 50%Difference between the first and third quartiles

Not affected by extreme values3 1Interquartile Range 17.5 12.5 5Q Q= − = − =

Interquartile Range

Data in Ordered Array: 11 12 13 16 16 17 17 18 21

44

( )2

2 1

N

ii

X

N

μσ =

−=

∑

Important measure of variationShows variation about the mean

Sample variance:

Population variance:

( )2

2 1

1

n

ii

X XS

n=

−=

−

∑

Variance

23

45

Standard Deviation

Most important measure of variationShows variation about the meanHas the same units as the original data

Sample standard deviation:

Population standard deviation:

( )2

1

1

n

ii

X XS

n=

−=

−

∑

( )2

1

N

ii

X

N

μσ =

−=

∑

46

Comparing Standard Deviations

Mean = 15.5s = 3.33811 12 13 14 15 16 17 18 19 20 21

11 12 13 14 15 16 17 18 19 20 21

Data B

Data A

Mean = 15.5s = .9258

11 12 13 14 15 16 17 18 19 20 21

Mean = 15.5s = 4.57

Data C

คะแนนมาตรฐาน (Z-Score): ผลตางของคะแนนดบกบ คาเฉลยเปนกเทาของ SD (normal distribution)

24

47

Coefficient of Variationสปส. แหงความแปรผน

Measures relative variation

Always in percentage (%)

Shows variation relative to mean

Is used to compare two or more sets of data measured in different units

100%SCVX

⎛ ⎞= ⎜ ⎟⎝ ⎠

48

Shape of a Distribution

Describes how data is distributedMeasures of shape

Symmetric or skewed

Mean = Median =ModeMean < Median < Mode Mode < Median < MeanRight-SkewedLeft-Skewed Symmetric

25

49

Exploratory Data Analysis

Box-and-whisker plotGraphical display of data using 5-number summary

Median( )

4 6 8 10 12

XlargestXsmallest 1Q 3Q2Q

50

Distribution Shape and Box-and-Whisker Plot

Right-SkewedLeft-Skewed Symmetric

1Q 1Q 1Q2Q 2Q 2Q3Q 3Q3Q

26

51

ขนตอนการใชสถตในงาน IA1. ระบวตถประสงค (State the Objectives of the Audit Test)

• ยอมรบใหเกด error ในประชากรไมเกน 5%

2. ระบลกษณะทตองการวด (Define the Attributes of Interest)

• วดในรปตวเงนมลคา (Variable Sampling/Dollar Limit Sampling)

• วดในรปเปอรเซนตของ error ทเกด (Attribute Sampling)

3. ระบประชากร (Define the Population)

• ใบเสรจรบเงนของป 2553 บญชลกหน

• จานวนสนคาในคลง คาใชจายในการดแลรกษา เครองจกร

52

ขนตอนการใชสถตในงาน IA4. กาหนดแผนการเลอกตวอยาง (Determine the Sampling Plan)

5. ระบหนวยตวอยาง (Define the Sampling Unit)

1 sampling unit = ใบเสรจ 1 ใบ

1 sampling unit = ใบเสรจ 1 เลม

6. กาหนดขนาดตวอยาง (Determine the Sample Size)

ขนาดตวอยางจะขนอยกบ

• การกระจายของขอมล อตราการเกด error สงสดทยอมรบได

• ระดบความเชอมน (ความเชอถอได : reliability)

• อตราความผดพลาดทยอมใหเกด

27

53

ขนตอนการใชสถตในงาน IA

7. เลอกตวอยาง (Select the Sample)

เลอกตวอยางตามแผนทเลอกไวจนครบจานวนตวอยางทกาหนด

8. ทาการตรวจสอบ (Performing of Audit)

• ตรวจสอบเอกสารตางๆ

• ตรวจนบจานวนสนคา

• ฯลฯ

54

การทดสอบสมมตฐานทางสถต (Statistical Testing Hypothesis)

สมมตฐาน คอ ความเชอของบคคลใดบคคลหนงหรอกลมบคคลหรอ สงทคาดวาจะเกด ซงอาจจะเปนจรงหรอไมกได เชน เชอวา

• เปอรเซนตความผดพลาดไมเกน 5%

• เปอรเซนตของสนคามปญหา ไมเกน 5%

• ประสทธภาพการทางานฝายผลตเพมขน

28

55

การทดสอบสมมตฐานทางสถต (Statistical Testing Hypothesis)

สมมตฐานทางสถต จะตองประกอบดวย• สมมตฐานวาง (Null Hypothesis) ใชสญลกษณ Ho

>>>>always contains “=“, “≤“ “≥“ sign• สมมตฐานแยง (Alternative Hypothesis) ใชสญลกษณ

H1หรอ Ha

H0หรอ H1 ตองอยในทศทางตรงขามเสมอHo : เปอรเซนตของสนคาทมปญหาเทากบ 5%H1 : เปอรเซนตของสนคาทมปญหาไมเทากบ 5%

56

ความเสยงทเกดจากผลการทดสอบสมมตฐาน

ความเสยงหรอความผดพลาดของผลการทดสอบม 2 ประเภท คอ

1. ความผดพลาดประเภทท I (Type I Error : α)

ความเสยงของผผลต

P (Type I Error) = α= P (reject H0/ H0 True)

หมายความวา ในประชากรม error ไมเกน 5% แตจากขอมลตวอยางทาใหเราสรปวา ในประชากรม error เกน 5% จงอาจจะ หาสาเหตโดยตรวจสอบเพม หรอ ตรวจสอบ 100% จะทาใหเสยคาใชจายเพม

29

57

ความเสยงทเกดจากผลการทดสอบสมมตฐาน

2. ความผดพลาดประเภทท II (Type II Error : )

ความเสยงของผบรโภค

P (Type II Error) =

= P(Accept H0 / H0 False)

หมายความวาในความเปนจรงประชากรม error มากกวา 5%

แตจากขอมลตวอยางทาใหเราสรปวาประชากรม error ไมเกน 5%

ในกรณนเราจะไมตรวจสอบเพม ซงจะกอใหเกดความเสยหายอยางมาก

58

ความผดพลาดในการทดสอบสมมตฐานทางสถต

1. ความผดพลาดประเภทท 1 (Type I error) เปนความผดพลาดเนองจากการปฏเสธ Ho เมอ H0 เปนจรง ใชสญลกษณ

=P ( ปฏเสธ H0 / H0 จรง )

เรยก วาระดบนยสาคญ (Level of Significance)2.ความผดพลาดประเภทท II (Type II error) เปนความผดพลาดเนองจากการ

ยอมรบ H0 เมอ H1 จรง(H0 ไมจรง)=P ( ยอมรบ H0 / H0 ไมจรง )

การลด จะทาให เพมขนการลด จะทาให เพมขนถาตองการลดทงคา และ ควรเพมขนาดตวอยาง

αα

α

αβ

ββ

α

β

30

59

Confidence level: 1-α

i.e., Confidence level: 1-α = 95% Level of significance of the test: α =5%

Power of the test: probability of correctly reject a false null hypothesis: 1-β

60

วธการเลอกตวอยาง

แบงเปน 2 วธใหญๆ คอ

1. การเลอกตวอยางโดยทราบความนาจะเปน

(Probability Sampling or Statistical Sampling)

2. การเลอกตวอยางทไมทราบความนาจะเปน

(Non-Probability Sampling or Non-Statistical Sampling)

31

61

สงทจาเปนของการเลอกตวอยางทใชความนาจะเปนการเลอกตวอยางทใชความนาจะเปนจาเปนตองประกอบดวย

กรอบตวอยาง (Sampling Frame) หมายถง รายชอทกหนวยในประชากร พรอมทอยทสามารถตดตอได ในเรอง IA จะหมายถง เลขทเอกสารตางๆ

ตองทราบขนาดประชากร N/ตองทราบขนาดตวอยาง n

สามารถคานวณโอกาสทแตละหนวยในประชากรจะถกเลอกเปนตวอยาง

ทาใหสามารถคานวณความคลาดเคลอนจากการใชตวอยาง(sampling risk)

62

เทคนคการเลอกตวอยางททราบความนาจะเปน

1. การเลอกตวอยางสมแบบงาย (Simple Random Sampling)

2. การเลอกตวอยางแบบเปนระบบ (Systematic Sampling)

3. การเลอกตวอยางแบบแบงชนภม (Stratified Sampling)

4. การเลอกตวอยางแบบกลม (Cluster Sampling)

5. การเลอกตวอยางแบบหลายขนตอน (Multi-stage Sampling)

6. การเลอกตวอยางแบบตามลาดบ (Sequential Sampling)

ฯลฯ

32

63

1. การเลอกตวอยางสมแบบงาย : SRS(Simple Random Sampling)วธนจะกาหนดใหแตละหนวยในประชากรมโอกาสถกเลอกเทาๆกน ในแตละครงของการเลอก

วธการเลอก

1. การจบสลาก

2. การใชตารางเลขสม

3. การใชโปรแกรมสาเรจรป เชน SPSS SAS หรอ เขยนโปรแกรมเอง

64

2. การเลอกตวอยางแบบมระบบ(Systematic Sampling)

หลกเกณฑ

1. ใหหมายเลขแกแตละหนวยในประชากร คอ หมายเลข 1-N

2. เลอกหนวยตวอยางหนวยแรก โดยทให k = N/n แลวเลอกเลขสมทมคาตงแต 1 ถง k เชน ไดเลข r หมายถงหนวยท r เปนหนวยตวอยางหนวยแรก

3. เลอกหนวยถดไปใหหางจากหนวยทเพงถกเลอกเปนชวง เทาๆ กน นนคอ หนวยท i จะหางจากหนวยท i + 1 และ หนวยท i - 1 เปนระยะเทาๆกน จนไดครบ n หนวย

33

65

ตวอยางการเลอกตวอยางแบบเปนระบบ

ถา N = 2,500 (เลขทเอกสาร 652 - 3,151) n = 125k = N/n = 2,500/125 = 20

สมเลขสมทมคาระหวาง 1 - 20 เชน ไดเลข 9ตวอยางท 1 : เอกสารเลขท 651 + 9 = 660ตวอยางท 2 : เอกสารเลขท 660 + 20 = 680ตวอยางท 3 : เอกสารเลขท 680 + 20 = 700

... … …ตวอยางท 125 : เอกสารเลขท 3,140

66

ขอดและขอเสยของ Systematic Sampling

อาจเกด bias ขน เนองจากเมอเลอกหนวยแรกแลว หนวยตอไปจะถกเลอกโดยอตโนมต

งาย และไดตวอยางกระจายอยทวประชากร

ขอเสยขอด

34

67

การแกขอเสยของ Systematic Samplingจะใชวธทเรยกวา Multiple Start โดยการคณระยะหางโดยคาคงทเพอใหไดระยะหางใหม เชน ใหคาคงท = 4

ระยะหางเดม = 20 ระยะหางใหม = 20 x 4 = 80

เลอกเลขสมเพอหาตวอยางแรก โดยเลอกเลขสมมา 4 ตว มคาตงแต 0 - 79 เชนไดเลข 0 , 21 , 46 , 64ตวอยางท 1 : เอกสารหมายเลข 652 + 0 = 652ตวอยางท 2 : เอกสารหมายเลข 652 + 21 = 673ตวอยางท 3 : เอกสารหมายเลข 652 + 46 = 698ตวอยางท 4 : เอกสารหมายเลข 652 + 64 = 716ตวอยางท 5 : เอกสารหมายเลข 652 + 80 = 732ตวอยางท 6 : เอกสารหมายเลข 673 + 80 = 753

68

การเลอกตวอยางแบบแบงชนภม (Stratified Sampling)• เปนการแบงประชากรออกเปนกลม โดยจะเรยกแตละกลมวา ชน

ภม (Stratum) ซงจะใหหนวยตางๆ ทอยในชนภมเดยวกนม

ลกษณะทสนใจศกษาเหมอนกนหรอใกลเคยงกนมากทสด แต

หนวยทอยตางชนภมกนจะมลกษณะทสนใจศกษาแตกตางกน

มากทสด แตละหนวยจะตองอยในชนภมใดชนภมหนงเพยงชน

ภมเดยวเทานน

• การเลอกตวอยางจากแตละชนภมจะเปนอสระกน

• ขนาดตวอยางทงหมดจะเปนผลรวมของขนาดตวอยางจากแตละ

ชนภม และตองเลอกตวอยางจากทกชนภม

35

69

t t

tttt t

1

1

hhn nn n

70

ปญหาเกยวกบการสรางชนภม

• ควรใชตวแปรใดเปนตวแปรแบงชนภม

• ตวแปรทใชแบงชนภมควรเปนตวแปรเพยงตวเดยวหรอมากกวาห นงตว

• จะสรางหลกเกณฑในการกาหนดชวงของคาตวแปรทกาหนดชนภมอยางไร

• ควรมจานวนชนภมกชนภม

36

71

การเลอกตวอยางแบบแบงชนภมในงาน IA

เปาหมายตองการแบงให

งานสวนทม error มาก ( > 20%) อยดวยกน เพอทจะสมตวอยางมาตรวจสอบมาก หรอ ตรวจสอบ 100%

งานสวนทม error ปานกลาง (5 - 20%) อยดวยกน เพอทจะสมตวอยางมาตรวจสอบปานกลาง

งานสวนทม error นอย ( < 5%) อยดวยกน เพอทจะสมตวอยางมาตรวจสอบ นอย

ซงปจจยทใชในการแบงชนภมอาจพจารณาจากขอมลในอดตทเคย ตรวจสอบมาแลว หรอจากประสบการณ

72

การแบงชนภมอาจแบงโดยพจารณา ดงน ใชมลคาของรายการตางๆ เชน มลคาของสนคาคงคลง อาจแบงเปน

ชนภมท 1 สนคาทมมลคามากกวา 100,000 บาท

ชนภมท 2 สนคาทมมลคาระหวาง 50,000 ถง 100,000 บาท

ชนภมท 3 สนคาทมมลคานอยกวา 50,000 บาท

อาจกาหนดใหขนาดตวอยางจากชนภมท 1 มากกวาจากชนภมท 2 และ 3

หรอแบงตามยอดสนเชอ ตามประเภทหลกทรพย ตามระยะเวลาหน ฯลฯ

หรอสายการบน อาจแบงโดยเสนทางการบน ยโรป อเมรกา เอเชย ฯลฯ

37

73

ไดครบทกลกษณะ

สามารถประมาณคาของแตละชนภมได

ในแตละชนภมสามารถเลอกใชแผนการเลอกตวอยางทเหมาะสม

การกาหนดขนาดตวอยางของชนภม

1. ใหแตละชนภมมขนาดตวอยางเทากน

โดยท L = จานวนชนภม

2. ใหขนาดตวอยางของชนภมเปนสดสวนกบขนาดของชนภม

3. ใหขนาดตวอยางของชนภมเปนสดสวนกบการกระจายของขอมลใน ชนภม

ขอดของการเลอกตวอยางแบบแบงชนภม

74

Probability – Proportional - to - Size Sampling (PPS or Dollar-Unit Sampling or Monetary-Unit Sampling (MUS))เปนการใชหลกเกณฑของการเลอกตวอยางแบบแบงชนภม

เชน ประชากร คอ บญชลกหนคางจายทมมลคารวม 10 ลานบาท โดยมจานวนบญช 2000 บญช ในทนจะถอวา ประชากรม 10 ลานหนวย (1 หนวย คอ 1 บาท)

รายการ มลคา (บาท)

มลคาสะสม Dollar-Unit Dollar Select

1 50,000 50,000 1 - 50,000 -

2 100,000 150,000 50,001 - 150,000 100,000

3 80,000 230,000 150,001 - 230,000 -

4 200,000 430,000 230,001 - 430,000 300,000

5 300,000 730,000 430,001 - 730,000 500,000 700,000

38

75

4. การเลอกตวอยางแบบกลม(Cluster Sampling)

• หมายถง การเลอกตวอยางทหนวยตวอยางไมใชหนวยทใหขอมลโดยตรง

เพยงระดบเดยว หรออาจกลาวไดวาจะแบงประชากรออกเปนกลม โดยให

ในแตละกลมมลกษณะทแตกตางกน

• เชน การศกษาเรองรายได หรอ เรองความคดเหนในเรองตางๆ และ

ประชากร คอ กทม. อาจแบง กทม. ออกเปนเขต เนองจากในแตละเขตม

ทกระดบของรายได และมความคดเหนทแตกตางกน หรอในการ

ตรวจสอบ อาจแบงกลมตามประเภทสนทรพย อาชพของผก หรอตาม

มลคา

76

n t1 5h 1 n

n t1 5h 1 n

n t1 5h 1 n

n t1 5h 1 n

39

77

การเลอกตวอยางแบบกลม (Cluster Sampling)

• ไมจาเปนตองเลอกตวอยางจากทกกลม

• เลอกตวอยางเพยงบางกลมกพอเพยง เพราะมครบทกลกษณะ

• ในงาน IA เรยกวา “Block Sampling”

• 1 กลม หมายถง 1 ชวงเวลา เชน 1 เดอน หรอ 3 เดอน โดยตรวจทกรายการเกดขนในชวงเวลาทถกเลอก ใน 1 ป ม 12 กลม อาจเลอกมาเพยง 3 กลม

• 1 กลม หมายถง 1 แฟม หรอ 1 ชน หรอ 1 หนา

78

การเลอกตวอยางกลมขนเดยว(One-Stage Cluster Sampling)

เปนการเกบขอมลจากทกหนวยในกลมทถกเลอกเปนตวอยาง

การเลอกกลมยอย อาจใหแตละแตละกลมมโอกาสถกเลอกเทาๆ กน

เชน เมอเลอกเดอนมนาคม และเดอนสงหาคม จะไดทกรายการทเกดในสองเดอนขางตนตกเปนตวอยาง

40

79

การเลอกตวอยางกลมแบบสองขน(Two-Stage Cluster Sampling)

เปนการเกบรวบรวมขอมล 2 ขนตอน

ขนท 1 : เลอกหนวยใหญ

ขนท 2 : เลอกหนวยยอยจากหนวยใหญทถกเลอก เชน

ประชากร : ครวเรอนในประเทศไทย

ขนท 1 : เลอกภาค 3 ภาค

ขนท 2 : เลอกจงหวดจากแตละภาคทถกเลอก แลวสอบถามจากทกครวเรอนในจงหวดทถกเลอก

80

เหนอ อสาน ใต ตอ. ตต.กลาง

อยธยา

กทม.

กาญจนบร ตรง กระบ สพรรณ

การเลอกตวอยางกลมแบบสองขน(Two-Stage Cluster Sampling)

41

81

ตวอยางการเลอกหลายขนตอนในงาน IA

2 ขนตอน เชน

• สม เดอน และวนภายในเดอนทถกสม

• สมตรวจสนคาคงคลงตามราคา และตามชนดของสนคาทมราคาอยในชวงเดยวกน

• ฯลฯ

82

การเลอกตวอยางแบบตามลาดบ >>one of attribute sampling (Sequential or Stop-or-Go Sampling)

เปนการเลอกตวอยางทไมกาหนดขนาดทแนนอนไวลวงหนา

มกจะใชในงานการตรวจสอบคณภาพของสนคา

ในงาน IA มกจะใชเมอคาดวาเปอรเซนตการเกด error ตา

การตดสนใจจะพจารณาจากผลการตรวจตวอยาง เชน สมตวอยางมา กลมละ 10 หนวย และตงเกณฑไววา ในการตรวจกลมแรก

ถาตรวจพบวามปญหาไมเกน 1 หนวย จะหยดการสมตวอยาง แลวสรปวา ทงประชากรไมมปญหา

ถาตรวจพบวามปญหามากกวา 1 หนวย แตไมเกน 4 หนวย จะตองสมกลมตอไป

แตถาตรวจพบวามปญหาเกน 4 หนวย จะหยดการสมตวอยาง และสรปวา ทงประชากรมปญหา

42

83

สม 1 กลม (10 หนวย) มาตรวจสอบ

สรปวาไมมปญหา

สมตวอยางกลมถดไป ( 10 หนวย)

สมตวอยางกลมถดไป ( 10 หนวย)

สรปวามปญหามปญหาD1 หนวย

สรปวาไมมปญหา สรปวามปญหามปญหาD2 หนวย

1 1D <= 1 5D >

621 ≤+ DD 621 >+ DD

84

เทคนคการเลอกตวอยางทไมทราบความนาจะเปน(Non Statistical Sampling)

1. การเลอกตวอยางตามความสะดวก(Convenient Sampling or Haphazard selection)

2. การเลอกตวอยางโดยใชวจารณญาณ(Judgement Sampling)

3. การเลอกตวอยางโดยใชโควตา(Quota Sampling)

4. การเลอกตวอยางโดยบงเอญ(Accident Sampling)

5. Block Selection

43

85

Type of Statistical Sampling Plan

เนองจากงาน IA จะตองตรวจทงสวนทเกยวกบตวเงนและสวนทไมเกยวกบตวเงน จงแบงเปน 3 ประเภท ดงน

Audit Approach Sampling Plan

• Tests of controls • Attribute Sampling

• Substantive tests • PPS Sampling • Variable Sampling

• Fraud Detection • Discovery Sampling

86

Tests of Controls (Qualitative Characteristics)

An auditor performs tests of control and access whether they were functioning properly during the period being audited.

Objective: Drawing conclusions about the effectiveness of control procedure.

Sampling Plan:

1. Fixed Sample-size Sampling Plan

2. Sequential Sampling Plan

44

87

Factors that Impact Sample Size(Attribute Sampling)

1. Population Size

2. Risk of Accessing Control Too Low (beta)

3. Tolerable Deviation Rate

4. Expected Population Deviation Rate

88

Sampling RiskReliability Risk

= Confidence Level

= P (Correct decision)

= P (Making right conclusions about the operating effectiveness of a control procedure)

1. Risk of incorrect acceptance (beta) : Audit Effectiveness = Risk of Assessing Control Risk too low.

2. Risk of incorrect rejection (alpha) : Audit Efficiency

= Risk of Assessing Control Risk too high

45

89

Tolerable Deviation Rate(Occurrence Rate)

The auditor must specify the maximum number of deviations from the prescribed control procedures that can be tolerated without affecting acceptance of the procedures as effective.

90

Expected Population Deviation Rate

Expected Population Deviation Rate is the anticipated error.

Depend on:

prior year’s audit

pilot sample selected

initial understanding phase of the internal control review evaluation.

46

91

Discovery Sampling (Attribute sampling)

Two conditions for Discovery Sampling :1. When the auditor’s best judgment of the population

occurrence rate is zero or near zero percent.2. When the auditor is looking for a very critical

characteristics that, if discovered, might be indicative of more widespread irregularities or serious errors in the financial statements.

92

Summary of Attribute Sampling

Attribute Model Application

Characteristic Fixed Sample size

Sequential Discovery

1. Typical sample size

Medium Low High

2. Types of testing Tests of Controls

Tests of Controls

Special study and substantive tests

3. Expected control deviations

Low Zero or very low

Zero or very low

47

93

Factors that Impact Sample Size(Attribute Sampling)

DirectExpected Population Deviation Rate

InverseTolerance Deviation Rate

InverseRisk of Assessing Control Risk too low

DirectPopulation Size

Impact on Sample Size

Factors

94

Substantive Testing

Objective: To evaluate the fair presentation of dollar amounts that appear in the financial statements.

Sampling techniques:

1. PPS Sampling – Probability-proportional-to-size or DUS (Dollar-unit sampling)>>using Poisson distribution to approximate the Binomial distribution

2. Variables Sampling

48

95

Variables Sampling

Objective: To estimate a total dollar amount of transactions or balances.There are 3 alternative approaches:

1. Mean per Unit Estimate>>precision is determined using mean and SD of the sample

2. Difference Estimation>>precision is calculated using the mean and SD of the differences

3. Ratio Estimation

96

1. Calculate sample mean

2. Calculate the estimated Value:

EV = N.

Mean per Unit Estimation

X

X

49

97

N = ขนาดประชากรn = ขนาดตวอยาง σ2 = คาแปรปรวนของประชากร

E = ความคลาดเคลอนในการประมาณคา μ ดวยนนคอ E = ⏐ −μ⏐

Z = คาปกตมาตรฐานทไดจากตารางการแจกแจงแบบปกตมาตรฐาน ซงขนอยกบระดบความเชอมน ทกาหนด

= (x - μ)/ σ or ( - μ)/ σ /root n

xx

x

98

1. A graphic aids for monitoring the status of any process subject to acceptable or unacceptable variations during repeated operationse.g., unit cost, direct labor hours used

2. UCL – Upper Control Limit3. LCL – Lower Control Limit4. P Chart- based on attribute (acceptable/nonacceptable)5. C Chart- attributs control chart shows defect per item6. R Chart – shows range of dispersion from the mean (size or weight)7. X-bar Chart – shows sample mean for variable from the average

range

Statistical control chart

50

99

1. Most accounting distributions tend to be:A) Symmetrically skewedB) Positively skewedC) Negatively skewedD) Not skewed

100

2. A Major distinction between statistical and judgmental sampling is that:

A) Judgmental sampling allows the quantification of sampling errors.B) Statistical sampling virtually eliminates sampling riskC) Statistical sampling provides an objective means of determining sample sizeD) Judgmental sampling results in smaller sample sizes.

51

101

3. A Simple random sampling requires that:A) The population is unbiased.B) Every item in the population has an equal chance of being selected.C) The distribution of original data is approximately normal.D) The expected error rate is low (Less than 5%)

102

4. A form of attribute sampling that is intended to reduce sample size is:

A) Stop-or-go samplingB) Difference estimationC) Discovery samplingD) Ratio estimation

52

103

5. A Type 1, or alpha error, is:A) An incorrect acceptance of a valueB) An incorrect rejection of a valueC) An assessment of control risk that is too lowD) More serious than a Type II error.

104

6. The measure of variability of a statistical sample that serves as an estimate of the population variability is the:

A) Basic precisionB) RangeC) Standard deviationD) Confidence interval

unit8partii????? 4.0 june19probability-proportional-to-size:pps uses a monetary unit as a sampling...

Documents