overview of basic and advanced statistical · pdf fileoverview of basic and advanced...

OVERVIEW OF BASIC AND

ADVANCED STATISTICAL

METHODS Professor Dr Noor Azina Ismail

Department of Applied Statistics

Faculty of Economics and Administration

University of Malaya

nazina@um.edu.my

Types of Data

• Why important?

• The type of data defines:

• The summary measures used

• Mean, Standard deviation for continuous data

• Proportions for discrete data

• Statistics used for analysis:

• Examples:

• T-test for normally distributed continuous

• Wilcoxon Rank Sum for non-normally distributed continuous

Types of Data

• Discrete Data-limited number of choices

• Binary: two choices (yes/no)

• Dead or alive

• Disease-free or not

• Categorical: more than two choices, not ordered

• Race

• Ordinal: more than two choices, ordered

• Stages of a cancer

• Likert scale for response

• E.G. strongly agree, agree, neither agree or disagree, etc.

• Continuous data – interval or ratio

Descriptive Statistics

• Characterize data set

• Graphical presentation

• Histograms

• Frequency distribution

• Box and whiskers plot

• Numerical

• Measures of central tendency of data • Mean

• Median

• Mode

• Measures of variability of data • Standard Deviation

• Interquartile range

Histogram Continuous Data

No segmentation of data into groups

Frequency Distribution

Segmentation of data into groups

Discrete or continuous data

Box and Whisker Plots

Useful for presenting comparative data graphically

Goals and statistical approaches:

To investigate data for a single quantitative variable

• Compute descriptive statistics

• Make any of the following graphs: stem & leaf, histogram, dot plot, box plot

• Form a confidence interval using a one-sample t

• Alternative: Wilcoxon signed rank procedure, or the Sign procedure

To investigate data for a single categorical variable

• Make a table of frequencies and percentages

• Construct a bar chart or pie chart

• Make a confidence interval or do a test for a proportion of interest

To compare two or more groups on the basis of

quantitative variable/To investigate the relationship

between one categorical and one quantitative variable

• Compute descriptive statistics on the quantitative variable for each group

• Construct side-by-side box plots or side-by-side dot plots

• For 2 groups, carry out a 2-sample t-test or construct the related confidence interval

• For more than two-groups, carry out an ANOVA

• Alternatives: Mann-Whitney procedure (Wilcoxon rank sum) for two groups; Kruskal Wallis for more than two groups.

To compare two or more groups on the basis of a categorical variable/ To

investigate the relationship between two categorical variables

• Make a two-way table (cross-tabulation) of the frequencies

and percentages

• Illustrate these frequencies with a clustered bar chart

• Perform the chi-squared test

To investigate the relationship between two quantitative

variables

• Compute the correlation coefficient and/or linear regression

equation

• Make a scatter plot of the data

• Run the correlation test, or regression slope test

• Alternative: Spearman’s correlation

• Note: Please refer to the summary of inferential methods

Inferential Methods GOAL METHOD

Test the mean of one population One-sample t-test

Estimate the mean of one population Confidence interval based on the one-sample t

Test a proportion for one population Test for a proportion

Estimate a proportion for one population Confidence interval for a proportion

Compare the means of two populations Two-sample t-test

Estimate the difference between the means of two

populations

Confidence interval based on the two-sample t

Compare the means of several populations One-Way ANOVA & multiple comparisons

Investigate the association between two categorical

variables

Chi-Square test

Investigate the relationship between two quantitative

variables

Correlation/Regression

Statistical Tests

• Parametric tests • Continuous data normally distributed

• Non-parametric tests • Continuous data not normally distributed

• Categorical or Ordinal data

• Most non-parametric tests are based on ranks or other non- value related methods

Regression

• Based on fitting a line to data • Provides a regression coefficient, which is the slope of the line

• Y = ax + b

• Use to predict a dependent variable’s value based on the value of an independent variable.

• Very helpful- In analysis of height and weight, for a known height, one can predict weight.

• Much more useful than correlation • Allows prediction of values of Y rather than just whether there is a relationship between two

variable.

Why Multivariate Analysis

• In published papers, the multivariable models are more

powerful than univariable models

• Theoretical reasons:

• Real-world is multidimensional and multicausal

• ie multiple IVs (predictors) and DVs (outcomes)

• Statistical reasons

• Examine large data sets in a single analysis

Multivariate Data Analysis

• Analysis of dependence

• Attempts to explain or predict the dependent variable(s) on the basis of two or more

independent variables

• The goal can either be:

• specifying a relationship between one dependent variable and several independent variables

• Forecasting the dependent variable on the basis of numerous independent variables

• Examples: multiple regression analysis, multiple discriminant analysis, multivariate analysis

of variance and canonical correlation analysis

• Analysis of Interdependence

• The goal is to give meaning to a set of variables or seek to group things together.

• No one variable or variable subset is to be predicted from the others or explained by them

• Look at relationships among variables, objects or cases

• Examples: factor analysis, cluster analysis and multidimensional scaling

Analysis of dependence

Dependent Variable Multivariate Technique

Numerical Multiple Linear Regression

Nominal - binary Binary Logistic Regression

Nominal – more than two

overview of basic and advanced statistical · pdf fileoverview of basic and advanced...

Documents

chapter 5 basic statistical analysis

advanced writing skills - · pdf filemaster basic writing...

part ii. statistical nlp advanced artificial intelligence...

upgrade manual for latent gold choice 5.0: basic, advanced,...

introduction to percolation : basic concept and something...

anintroductionto statisticalsignalprocessing - stanford...

basic advanced pro -...

advanced lighting control systems: bench testing ·...

(statistical process control) · seven basic statistical...

advanced computer graphics · advanced computer graphics...

ÖzgeÇmİ ve bİlİmsel eserlerİ - bezmialem ·...

- tips form basic to advanced - drbl -...

git: basic to advanced

an introduction to statistical signal...

basic life support advanced life...

azƏrbaycanin demoqrafİk gÖstƏrİcİlƏrİ demographic...

no.1 korea solidworks reseller -...

ee565 advanced image processing copyright xin li 20081...

robotics manualedell'operatore … · 2019-02-26 ·...

メタアナリシス meta-analysis basic assumptions...