Dr. Ji Son

Confidence Intervals & Hypothesis Testing for the Difference of Two Paired Means

Slide Duration:

Table of Contents

Section 1: Introduction

Descriptive Statistics vs. Inferential Statistics

25m 31s

Intro

0:00

Roadmap

0:10

Roadmap

0:11

Statistics

0:35

Statistics

0:36

Let's Think About High School Science

1:12

Measurement and Find Patterns (Mathematical Formula)

1:13

Statistics = Math of Distributions

4:58

Distributions

4:59

Problematic… but also GREAT

5:58

Statistics

7:33

How is It Different from Other Specializations in Mathematics?

7:34

Statistics is Fundamental in Natural and Social Sciences

7:53

Two Skills of Statistics

8:20

Description (Exploration)

8:21

Inference

9:13

Descriptive Statistics vs. Inferential Statistics: Apply to Distributions

9:58

Descriptive Statistics

9:59

Inferential Statistics

11:05

Populations vs. Samples

12:19

Populations vs. Samples: Is it the Truth?

12:20

Populations vs. Samples: Pros & Cons

13:36

Populations vs. Samples: Descriptive Values

16:12

Putting Together Descriptive/Inferential Stats & Populations/Samples

17:10

Putting Together Descriptive/Inferential Stats & Populations/Samples

17:11

Example 1: Descriptive Statistics vs. Inferential Statistics

19:09

Example 2: Descriptive Statistics vs. Inferential Statistics

20:47

Example 3: Sample, Parameter, Population, and Statistic

21:40

Example 4: Sample, Parameter, Population, and Statistic

23:28

Section 2: About Samples: Cases, Variables, Measurements

About Samples: Cases, Variables, Measurements

32m 14s

Intro

0:00

Data

0:09

Data, Cases, Variables, and Values

0:10

Rows, Columns, and Cells

2:03

Example: Aircrafts

3:52

How Do We Get Data?

5:38

Research: Question and Hypothesis

5:39

Research Design

7:11

Measurement

7:29

Research Analysis

8:33

Research Conclusion

9:30

Types of Variables

10:03

Discrete Variables

10:04

Continuous Variables

12:07

Types of Measurements

14:17

Types of Measurements

14:18

Types of Measurements (Scales)

17:22

Nominal

17:23

Ordinal

19:11

Interval

21:33

Ratio

24:24

Example 1: Cases, Variables, Measurements

25:20

Example 2: Which Scale of Measurement is Used?

26:55

Example 3: What Kind of a Scale of Measurement is This?

27:26

Example 4: Discrete vs. Continuous Variables.

30:31

Section 3: Visualizing Distributions

Introduction to Excel

8m 9s

Intro

0:00

Before Visualizing Distribution

0:10

Excel

0:11

Excel: Organization

0:45

Workbook

0:46

Column x Rows

1:50

Tools: Menu Bar, Standard Toolbar, and Formula Bar

3:00

Excel + Data

6:07

Exce and Data

6:08

Frequency Distributions in Excel

39m 10s

Intro

0:00

Roadmap

0:08

Data in Excel and Frequency Distributions

0:09

Raw Data to Frequency Tables

0:42

Raw Data to Frequency Tables

0:43

Frequency Tables: Using Formulas and Pivot Tables

1:28

Example 1: Number of Births

7:17

Example 2: Age Distribution

20:41

Example 3: Height Distribution

27:45

Example 4: Height Distribution of Males

32:19

Frequency Distributions and Features

25m 29s

Intro

0:00

Roadmap

0:10

Data in Excel, Frequency Distributions, and Features of Frequency Distributions

0:11

Example #1

1:35

Uniform

1:36

Example #2

2:58

Unimodal, Skewed Right, and Asymmetric

2:59

Example #3

6:29

Bimodal

6:30

Example #4a

8:29

Symmetric, Unimodal, and Normal

8:30

Point of Inflection and Standard Deviation

11:13

Example #4b

12:43

Normal Distribution

12:44

Summary

13:56

Uniform, Skewed, Bimodal, and Normal

13:57

Sketch Problem 1: Driver's License

17:34

Sketch Problem 2: Life Expectancy

20:01

Sketch Problem 3: Telephone Numbers

22:01

Sketch Problem 4: Length of Time Used to Complete a Final Exam

23:43

Dotplots and Histograms in Excel

42m 42s

Intro

0:00

Roadmap

0:06

Roadmap

0:07

Previously

1:02

Data, Frequency Table, and visualization

1:03

Dotplots

1:22

Dotplots Excel Example

1:23

Dotplots: Pros and Cons

7:22

Pros and Cons of Dotplots

7:23

Dotplots Excel Example Cont.

9:07

Histograms

12:47

Histograms Overview

12:48

Example of Histograms

15:29

Histograms: Pros and Cons

31:39

Pros

31:40

Cons

32:31

Frequency vs. Relative Frequency

32:53

Frequency

32:54

Relative Frequency

33:36

Example 1: Dotplots vs. Histograms

34:36

Example 2: Age of Pennies Dotplot

36:21

Example 3: Histogram of Mammal Speeds

38:27

Example 4: Histogram of Life Expectancy

40:30

Stemplots

12m 23s

Intro

0:00

Roadmap

0:05

Roadmap

0:06

What Sets Stemplots Apart?

0:46

Data Sets, Dotplots, Histograms, and Stemplots

0:47

Example 1: What Do Stemplots Look Like?

1:58

Example 2: Back-to-Back Stemplots

5:00

Example 3: Quiz Grade Stemplot

7:46

Example 4: Quiz Grade & Afterschool Tutoring Stemplot

9:56

Bar Graphs

22m 49s

Intro

0:00

Roadmap

0:05

Roadmap

0:08

Review of Frequency Distributions

0:44

Y-axis and X-axis

0:45

Types of Frequency Visualizations Covered so Far

2:16

Introduction to Bar Graphs

4:07

Example 1: Bar Graph

5:32

Example 1: Bar Graph

5:33

Do Shapes, Center, and Spread of Distributions Apply to Bar Graphs?

11:07

Do Shapes, Center, and Spread of Distributions Apply to Bar Graphs?

11:08

Example 2: Create a Frequency Visualization for Gender

14:02

Example 3: Cases, Variables, and Frequency Visualization

16:34

Example 4: What Kind of Graphs are Shown Below?

19:29

Section 4: Summarizing Distributions

Central Tendency: Mean, Median, Mode

38m 50s

Intro

0:00

Roadmap

0:07

Roadmap

0:08

Central Tendency 1

0:56

Way to Summarize a Distribution of Scores

0:57

Mode

1:32

Median

2:02

Mean

2:36

Central Tendency 2

3:47

Mode

3:48

Median

4:20

Mean

5:25

Summation Symbol

6:11

Summation Symbol

6:12

Population vs. Sample

10:46

Population vs. Sample

10:47

Excel Examples

15:08

Finding Mode, Median, and Mean in Excel

15:09

Median vs. Mean

21:45

Effect of Outliers

21:46

Relationship Between Parameter and Statistic

22:44

Type of Measurements

24:00

Which Distributions to Use With

24:55

Example 1: Mean

25:30

Example 2: Using Summation Symbol

29:50

Example 3: Average Calorie Count

32:50

Example 4: Creating an Example Set

35:46

Variability

42m 40s

Intro

0:00

Roadmap

0:05

Roadmap

0:06

Variability (or Spread)

0:45

Variability (or Spread)

0:46

Things to Think About

5:45

Things to Think About

5:46

Range, Quartiles and Interquartile Range

6:37

Range

6:38

Interquartile Range

8:42

Interquartile Range Example

10:58

Interquartile Range Example

10:59

Variance and Standard Deviation

12:27

Deviations

12:28

Sum of Squares

14:35

Variance

16:55

Standard Deviation

17:44

Sum of Squares (SS)

18:34

Sum of Squares (SS)

18:35

Population vs. Sample SD

22:00

Population vs. Sample SD

22:01

Population vs. Sample

23:20

Mean

23:21

23:51

Example 1: Find the Mean and Standard Deviation of the Variable Friends in the Excel File

27:21

Example 2: Find the Mean and Standard Deviation of the Tagged Photos in the Excel File

35:25

Example 3: Sum of Squares

38:58

Example 4: Standard Deviation

41:48

Five Number Summary & Boxplots

57m 15s

Intro

0:00

Roadmap

0:06

Roadmap

0:07

Summarizing Distributions

0:37

Shape, Center, and Spread

0:38

5 Number Summary

1:14

Boxplot: Visualizing 5 Number Summary

3:37

Boxplot: Visualizing 5 Number Summary

3:38

Boxplots on Excel

9:01

Using 'Stocks' and Using Stacked Columns

9:02

Boxplots on Excel Example

10:14

When are Boxplots Useful?

32:14

Pros

32:15

Cons

32:59

How to Determine Outlier Status

33:24

Rule of Thumb: Upper Limit

33:25

Rule of Thumb: Lower Limit

34:16

Signal Outliers in an Excel Data File Using Conditional Formatting

34:52

Modified Boxplot

48:38

Modified Boxplot

48:39

Example 1: Percentage Values & Lower and Upper Whisker

49:10

Example 2: Boxplot

50:10

Example 3: Estimating IQR From Boxplot

53:46

Example 4: Boxplot and Missing Whisker

54:35

Shape: Calculating Skewness & Kurtosis

41m 51s

Intro

0:00

Roadmap

0:16

Roadmap

0:17

Skewness Concept

1:09

Skewness Concept

1:10

Calculating Skewness

3:26

Calculating Skewness

3:27

Interpreting Skewness

7:36

Interpreting Skewness

7:37

Excel Example

8:49

Kurtosis Concept

20:29

Kurtosis Concept

20:30

Calculating Kurtosis

24:17

Calculating Kurtosis

24:18

Interpreting Kurtosis

29:01

Leptokurtic

29:35

Mesokurtic

30:10

Platykurtic

31:06

Excel Example

32:04

Example 1: Shape of Distribution

38:28

Example 2: Shape of Distribution

39:29

Example 3: Shape of Distribution

40:14

Example 4: Kurtosis

41:10

Normal Distribution

34m 33s

Intro

0:00

Roadmap

0:13

Roadmap

0:14

What is a Normal Distribution

0:44

The Normal Distribution As a Theoretical Model

0:45

Possible Range of Probabilities

3:05

Possible Range of Probabilities

3:06

What is a Normal Distribution

5:07

Can Be Described By

5:08

Properties

5:49

'Same' Shape: Illusion of Different Shape!

7:35

'Same' Shape: Illusion of Different Shape!

7:36

Types of Problems

13:45

Example: Distribution of SAT Scores

13:46

Shape Analogy

19:48

Shape Analogy

19:49

Example 1: The Standard Normal Distribution and Z-Scores

22:34

Example 2: The Standard Normal Distribution and Z-Scores

25:54

Example 3: Sketching and Normal Distribution

28:55

Example 4: Sketching and Normal Distribution

32:32

Standard Normal Distributions & Z-Scores

41m 44s

Intro

0:00

Roadmap

0:06

Roadmap

0:07

A Family of Distributions

0:28

Infinite Set of Distributions

0:29

Transforming Normal Distributions to 'Standard' Normal Distribution

1:04

Normal Distribution vs. Standard Normal Distribution

2:58

Normal Distribution vs. Standard Normal Distribution

2:59

Z-Score, Raw Score, Mean, & SD

4:08

Z-Score, Raw Score, Mean, & SD

4:09

Weird Z-Scores

9:40

Weird Z-Scores

9:41

Excel

16:45

For Normal Distributions

16:46

For Standard Normal Distributions

19:11

Excel Example

20:24

Types of Problems

25:18

Percentage Problem: P(x)

25:19

Raw Score and Z-Score Problems

26:28

Standard Deviation Problems

27:01

Shape Analogy

27:44

Shape Analogy

27:45

Example 1: Deaths Due to Heart Disease vs. Deaths Due to Cancer

28:24

Example 2: Heights of Male College Students

33:15

Example 3: Mean and Standard Deviation

37:14

Example 4: Finding Percentage of Values in a Standard Normal Distribution

37:49

Normal Distribution: PDF vs. CDF

55m 44s

Intro

0:00

Roadmap

0:15

Roadmap

0:16

Frequency vs. Cumulative Frequency

0:56

Frequency vs. Cumulative Frequency

0:57

Frequency vs. Cumulative Frequency

4:32

Frequency vs. Cumulative Frequency Cont.

4:33

Calculus in Brief

6:21

Derivative-Integral Continuum

6:22

PDF

10:08

PDF for Standard Normal Distribution

10:09

PDF for Normal Distribution

14:32

Integral of PDF = CDF

21:27

Integral of PDF = CDF

21:28

Example 1: Cumulative Frequency Graph

23:31

Example 2: Mean, Standard Deviation, and Probability

24:43

Example 3: Mean and Standard Deviation

35:50

Example 4: Age of Cars

49:32

Section 5: Linear Regression

Scatterplots

47m 19s

Intro

0:00

Roadmap

0:04

Roadmap

0:05

Previous Visualizations

0:30

Frequency Distributions

0:31

Compare & Contrast

2:26

Frequency Distributions Vs. Scatterplots

2:27

Summary Values

4:53

Shape

4:54

Center & Trend

6:41

Spread & Strength

8:22

Univariate & Bivariate

10:25

Example Scatterplot

10:48

Shape, Trend, and Strength

10:49

Positive and Negative Association

14:05

Positive and Negative Association

14:06

Linearity, Strength, and Consistency

18:30

Linearity

18:31

Strength

19:14

Consistency

20:40

Summarizing a Scatterplot

22:58

Summarizing a Scatterplot

22:59

Example 1: Gapminder.org, Income x Life Expectancy

26:32

Example 2: Gapminder.org, Income x Infant Mortality

36:12

Example 3: Trend and Strength of Variables

40:14

Example 4: Trend, Strength and Shape for Scatterplots

43:27

Regression

32m 2s

Intro

0:00

Roadmap

0:05

Roadmap

0:06

Linear Equations

0:34

Linear Equations: y = mx + b

0:35

Rough Line

5:16

Rough Line

5:17

Regression - A 'Center' Line

7:41

Reasons for Summarizing with a Regression Line

7:42

Predictor and Response Variable

10:04

Goal of Regression

12:29

Goal of Regression

12:30

Prediction

14:50

Example: Servings of Mile Per Year Shown By Age

14:51

Intrapolation

17:06

Extrapolation

17:58

Error in Prediction

20:34

Prediction Error

20:35

Residual

21:40

Example 1: Residual

23:34

Example 2: Large and Negative Residual

26:30

Example 3: Positive Residual

28:13

Example 4: Interpret Regression Line & Extrapolate

29:40

Least Squares Regression

56m 36s

Intro

0:00

Roadmap

0:13

Roadmap

0:14

Best Fit

0:47

Best Fit

0:48

Sum of Squared Errors (SSE)

1:50

Sum of Squared Errors (SSE)

1:51

Why Squared?

3:38

Why Squared?

3:39

Quantitative Properties of Regression Line

4:51

Quantitative Properties of Regression Line

4:52

So How do we Find Such a Line?

6:49

SSEs of Different Line Equations & Lowest SSE

6:50

Carl Gauss' Method

8:01

How Do We Find Slope (b1)

11:00

How Do We Find Slope (b1)

11:01

Hoe Do We Find Intercept

15:11

Hoe Do We Find Intercept

15:12

Example 1: Which of These Equations Fit the Above Data Best?

17:18

Example 2: Find the Regression Line for These Data Points and Interpret It

26:31

Example 3: Summarize the Scatterplot and Find the Regression Line.

34:31

Example 4: Examine the Mean of Residuals

43:52

Correlation

43m 58s

Intro

0:00

Roadmap

0:05

Roadmap

0:06

Summarizing a Scatterplot Quantitatively

0:47

Shape

0:48

Trend

1:11

Strength: Correlation ®

1:45

Correlation Coefficient ( r )

2:30

Correlation Coefficient ( r )

2:31

Trees vs. Forest

11:59

Trees vs. Forest

12:00

Calculating r

15:07

Average Product of z-scores for x and y

15:08

Relationship between Correlation and Slope

21:10

Relationship between Correlation and Slope

21:11

Example 1: Find the Correlation between Grams of Fat and Cost

24:11

Example 2: Relationship between r and b1

30:24

Example 3: Find the Regression Line

33:35

Example 4: Find the Correlation Coefficient for this Set of Data

37:37

Correlation: r vs. r-squared

52m 52s

Intro

0:00

Roadmap

0:07

Roadmap

0:08

R-squared

0:44

What is the Meaning of It? Why Squared?

0:45

Parsing Sum of Squared (Parsing Variability)

2:25

SST = SSR + SSE

2:26

What is SST and SSE?

7:46

What is SST and SSE?

7:47

r-squared

18:33

Coefficient of Determination

18:34

If the Correlation is Strong…

20:25

If the Correlation is Strong…

20:26

If the Correlation is Weak…

22:36

If the Correlation is Weak…

22:37

Example 1: Find r-squared for this Set of Data

23:56

Example 2: What Does it Mean that the Simple Linear Regression is a 'Model' of Variance?

33:54

Example 3: Why Does r-squared Only Range from 0 to 1

37:29

Example 4: Find the r-squared for This Set of Data

39:55

Transformations of Data

27m 8s

Intro

0:00

Roadmap

0:05

Roadmap

0:06

Why Transform?

0:26

Why Transform?

0:27

Shape-preserving vs. Shape-changing Transformations

5:14

Shape-preserving = Linear Transformations

5:15

Shape-changing Transformations = Non-linear Transformations

6:20

Common Shape-Preserving Transformations

7:08

Common Shape-Preserving Transformations

7:09

Common Shape-Changing Transformations

8:59

Powers

9:00

Logarithms

9:39

Change Just One Variable? Both?

10:38

Log-log Transformations

10:39

Log Transformations

14:38

Example 1: Create, Graph, and Transform the Data Set

15:19

Example 2: Create, Graph, and Transform the Data Set

20:08

Example 3: What Kind of Model would You Choose for this Data?

22:44

Example 4: Transformation of Data

25:46

Section 6: Collecting Data in an Experiment

Sampling & Bias

54m 44s

Intro

0:00

Roadmap

0:05

Roadmap

0:06

Descriptive vs. Inferential Statistics

1:04

Descriptive Statistics: Data Exploration

1:05

Example

2:03

To tackle Generalization…

4:31

Generalization

4:32

Sampling

6:06

'Good' Sample

6:40

Defining Samples and Populations

8:55

Population

8:56

Sample

11:16

Why Use Sampling?

13:09

Why Use Sampling?

13:10

Goal of Sampling: Avoiding Bias

15:04

What is Bias?

15:05

Where does Bias Come from: Sampling Bias

17:53

Where does Bias Come from: Response Bias

18:27

Sampling Bias: Bias from Bas Sampling Methods

19:34

Size Bias

19:35

Voluntary Response Bias

21:13

Convenience Sample

22:22

Judgment Sample

23:58

Inadequate Sample Frame

25:40

Response Bias: Bias from 'Bad' Data Collection Methods

28:00

Nonresponse Bias

29:31

Questionnaire Bias

31:10

Incorrect Response or Measurement Bias

37:32

Example 1: What Kind of Biases?

40:29

Example 2: What Biases Might Arise?

44:46

Example 3: What Kind of Biases?

48:34

Example 4: What Kind of Biases?

51:43

Sampling Methods

14m 25s

Intro

0:00

Roadmap

0:05

Roadmap

0:06

Biased vs. Unbiased Sampling Methods

0:32

Biased Sampling

0:33

Unbiased Sampling

1:13

Probability Sampling Methods

2:31

Simple Random

2:54

Stratified Random Sampling

4:06

Cluster Sampling

5:24

Two-staged Sampling

6:22

Systematic Sampling

7:25

Example 1: Which Type(s) of Sampling was this?

8:33

Example 2: Describe How to Take a Two-Stage Sample from this Book

10:16

Example 3: Sampling Methods

11:58

Example 4: Cluster Sample Plan

12:48

Research Design

53m 54s

Intro

0:00

Roadmap

0:06

Roadmap

0:07

Descriptive vs. Inferential Statistics

0:51

Descriptive Statistics: Data Exploration

0:52

Inferential Statistics

1:02

Variables and Relationships

1:44

Variables

1:45

Relationships

2:49

Not Every Type of Study is an Experiment…

4:16

Category I - Descriptive Study

4:54

Category II - Correlational Study

5:50

Category III - Experimental, Quasi-experimental, Non-experimental

6:33

Category III

7:42

Experimental, Quasi-experimental, and Non-experimental

7:43

Why CAN'T the Other Strategies Determine Causation?

10:18

Third-variable Problem

10:19

Directionality Problem

15:49

What Makes Experiments Special?

17:54

Manipulation

17:55

Control (and Comparison)

21:58

Methods of Control

26:38

Holding Constant

26:39

Matching

29:11

Random Assignment

31:48

Experiment Terminology

34:09

'true' Experiment vs. Study

34:10

Independent Variable (IV)

35:16

Dependent Variable (DV)

35:45

Factors

36:07

Treatment Conditions

36:23

Levels

37:43

Confounds or Extraneous Variables

38:04

Blind

38:38

Blind Experiments

38:39

Double-blind Experiments

39:29

How Categories Relate to Statistics

41:35

Category I - Descriptive Study

41:36

Category II - Correlational Study

42:05

Category III - Experimental, Quasi-experimental, Non-experimental

42:43

Example 1: Research Design

43:50

Example 2: Research Design

47:37

Example 3: Research Design

50:12

Example 4: Research Design

52:00

Between and Within Treatment Variability

41m 31s

Intro

0:00

Roadmap

0:06

Roadmap

0:07

Experimental Designs

0:51

Experimental Designs: Manipulation & Control

0:52

Two Types of Variability

2:09

Between Treatment Variability

2:10

Within Treatment Variability

3:31

Updated Goal of Experimental Design

5:47

Updated Goal of Experimental Design

5:48

Example: Drugs and Driving

6:56

Example: Drugs and Driving

6:57

Different Types of Random Assignment

11:27

All Experiments

11:28

Completely Random Design

12:02

Randomized Block Design

13:19

Randomized Block Design

15:48

Matched Pairs Design

15:49

Repeated Measures Design

19:47

Between-subject Variable vs. Within-subject Variable

22:43

Completely Randomized Design

22:44

Repeated Measures Design

25:03

Example 1: Design a Completely Random, Matched Pair, and Repeated Measures Experiment

26:16

Example 2: Block Design

31:41

Example 3: Completely Randomized Designs

35:11

Example 4: Completely Random, Matched Pairs, or Repeated Measures Experiments?

39:01

Section 7: Review of Probability Axioms

Sample Spaces

37m 52s

Intro

0:00

Roadmap

0:07

Roadmap

0:08

Why is Probability Involved in Statistics

0:48

Probability

0:49

Can People Tell the Difference between Cheap and Gourmet Coffee?

2:08

Taste Test with Coffee Drinkers

3:37

If No One can Actually Taste the Difference

3:38

If Everyone can Actually Taste the Difference

5:36

Creating a Probability Model

7:09

Creating a Probability Model

7:10

D'Alembert vs. Necker

9:41

D'Alembert vs. Necker

9:42

Problem with D'Alembert's Model

13:29

Problem with D'Alembert's Model

13:30

Covering Entire Sample Space

15:08

Fundamental Principle of Counting

15:09

Where Do Probabilities Come From?

22:54

Observed Data, Symmetry, and Subjective Estimates

22:55

Checking whether Model Matches Real World

24:27

Law of Large Numbers

24:28

Example 1: Law of Large Numbers

27:46

Example 2: Possible Outcomes

30:43

Example 3: Brands of Coffee and Taste

33:25

Example 4: How Many Different Treatments are there?

35:33

Addition Rule for Disjoint Events

20m 29s

Intro

0:00

Roadmap

0:08

Roadmap

0:09

Disjoint Events

0:41

Disjoint Events

0:42

Meaning of 'or'

2:39

In Regular Life

2:40

In Math/Statistics/Computer Science

3:10

Addition Rule for Disjoin Events

3:55

If A and B are Disjoint: P (A and B)

3:56

If A and B are Disjoint: P (A or B)

5:15

General Addition Rule

5:41

General Addition Rule

5:42

Generalized Addition Rule

8:31

If A and B are not Disjoint: P (A or B)

8:32

Example 1: Which of These are Mutually Exclusive?

10:50

Example 2: What is the Probability that You will Have a Combination of One Heads and Two Tails?

12:57

Example 3: Engagement Party

15:17

Example 4: Home Owner's Insurance

18:30

Conditional Probability

57m 19s

Intro

0:00

Roadmap

0:05

Roadmap

0:06

'or' vs. 'and' vs. Conditional Probability

1:07

'or' vs. 'and' vs. Conditional Probability

1:08

'and' vs. Conditional Probability

5:57

P (M or L)

5:58

P (M and L)

8:41

P (M|L)

11:04

P (L|M)

12:24

Tree Diagram

15:02

Tree Diagram

15:03

Defining Conditional Probability

22:42

Defining Conditional Probability

22:43

Common Contexts for Conditional Probability

30:56

Medical Testing: Positive Predictive Value

30:57

Medical Testing: Sensitivity

33:03

Statistical Tests

34:27

Example 1: Drug and Disease

36:41

Example 2: Marbles and Conditional Probability

40:04

Example 3: Cards and Conditional Probability

45:59

Example 4: Votes and Conditional Probability

50:21

Independent Events

24m 27s

Intro

0:00

Roadmap

0:05

Roadmap

0:06

Independent Events & Conditional Probability

0:26

Non-independent Events

0:27

Independent Events

2:00

Non-independent and Independent Events

3:08

Non-independent and Independent Events

3:09

Defining Independent Events

5:52

Defining Independent Events

5:53

Multiplication Rule

7:29

Previously…

7:30

But with Independent Evens

8:53

Example 1: Which of These Pairs of Events are Independent?

11:12

Example 2: Health Insurance and Probability

15:12

Example 3: Independent Events

17:42

Example 4: Independent Events

20:03

Section 8: Probability Distributions

Introduction to Probability Distributions

56m 45s

Intro

0:00

Roadmap

0:08

Roadmap

0:09

Sampling vs. Probability

0:57

Sampling

0:58

Missing

1:30

What is Missing?

3:06

Insight: Probability Distributions

5:26

Insight: Probability Distributions

5:27

What is a Probability Distribution?

7:29

From Sample Spaces to Probability Distributions

8:44

Sample Space

8:45

Probability Distribution of the Sum of Two Die

11:16

The Random Variable

17:43

The Random Variable

17:44

Expected Value

21:52

Expected Value

21:53

Example 1: Probability Distributions

28:45

Example 2: Probability Distributions

35:30

Example 3: Probability Distributions

43:37

Example 4: Probability Distributions

47:20

Expected Value & Variance of Probability Distributions

53m 41s

Intro

0:00

Roadmap

0:06

Roadmap

0:07

Discrete vs. Continuous Random Variables

1:04

Discrete vs. Continuous Random Variables

1:05

Mean and Variance Review

4:44

Mean: Sample, Population, and Probability Distribution

4:45

Variance: Sample, Population, and Probability Distribution

9:12

Example Situation

14:10

Example Situation

14:11

Some Special Cases…

16:13

Some Special Cases…

16:14

Linear Transformations

19:22

Linear Transformations

19:23

What Happens to Mean and Variance of the Probability Distribution?

20:12

n Independent Values of X

25:38

n Independent Values of X

25:39

Compare These Two Situations

30:56

Compare These Two Situations

30:57

Two Random Variables, X and Y

32:02

Two Random Variables, X and Y

32:03

Example 1: Expected Value & Variance of Probability Distributions

35:35

Example 2: Expected Values & Standard Deviation

44:17

Example 3: Expected Winnings and Standard Deviation

48:18

Binomial Distribution

55m 15s

Intro

0:00

Roadmap

0:05

Roadmap

0:06

Discrete Probability Distributions

1:42

Discrete Probability Distributions

1:43

Binomial Distribution

2:36

Binomial Distribution

2:37

Multiplicative Rule Review

6:54

Multiplicative Rule Review

6:55

How Many Outcomes with k 'Successes'

10:23

Adults and Bachelor's Degree: Manual List of Outcomes

10:24

P (X=k)

19:37

Putting Together # of Outcomes with the Multiplicative Rule

19:38

Expected Value and Standard Deviation in a Binomial Distribution

25:22

Expected Value and Standard Deviation in a Binomial Distribution

25:23

Example 1: Coin Toss

33:42

Example 2: College Graduates

38:03

Example 3: Types of Blood and Probability

45:39

Example 4: Expected Number and Standard Deviation

51:11

Section 9: Sampling Distributions of Statistics

Introduction to Sampling Distributions

48m 17s

Intro

0:00

Roadmap

0:08

Roadmap

0:09

Probability Distributions vs. Sampling Distributions

0:55

Probability Distributions vs. Sampling Distributions

0:56

Same Logic

3:55

Logic of Probability Distribution

3:56

Example: Rolling Two Die

6:56

Simulating Samples

9:53

To Come Up with Probability Distributions

9:54

In Sampling Distributions

11:12

Connecting Sampling and Research Methods with Sampling Distributions

12:11

Connecting Sampling and Research Methods with Sampling Distributions

12:12

Simulating a Sampling Distribution

14:14

Experimental Design: Regular Sleep vs. Less Sleep

14:15

Logic of Sampling Distributions

23:08

Logic of Sampling Distributions

23:09

General Method of Simulating Sampling Distributions

25:38

General Method of Simulating Sampling Distributions

25:39

Questions that Remain

28:45

Questions that Remain

28:46

Example 1: Mean and Standard Error of Sampling Distribution

30:57

Example 2: What is the Best Way to Describe Sampling Distributions?

37:12

Example 3: Matching Sampling Distributions

38:21

Example 4: Mean and Standard Error of Sampling Distribution

41:51

Sampling Distribution of the Mean

1h 8m 48s

Intro

0:00

Roadmap

0:05

Roadmap

0:06

Special Case of General Method for Simulating a Sampling Distribution

1:53

Special Case of General Method for Simulating a Sampling Distribution

1:54

Computer Simulation

3:43

Using Simulations to See Principles behind Shape of SDoM

15:50

Using Simulations to See Principles behind Shape of SDoM

15:51

Conditions

17:38

Using Simulations to See Principles behind Center (Mean) of SDoM

20:15

Using Simulations to See Principles behind Center (Mean) of SDoM

20:16

Conditions: Does n Matter?

21:31

Conditions: Does Number of Simulation Matter?

24:37

Using Simulations to See Principles behind Standard Deviation of SDoM

27:13

Using Simulations to See Principles behind Standard Deviation of SDoM

27:14

Conditions: Does n Matter?

34:45

Conditions: Does Number of Simulation Matter?

36:24

Central Limit Theorem

37:13

SHAPE

38:08

CENTER

39:34

SPREAD

39:52

Comparing Population, Sample, and SDoM

43:10

Comparing Population, Sample, and SDoM

43:11

Answering the 'Questions that Remain'

48:24

What Happens When We Don't Know What the Population Looks Like?

48:25

Can We Have Sampling Distributions for Summary Statistics Other than the Mean?

49:42

How Do We Know whether a Sample is Sufficiently Unlikely?

53:36

Do We Always Have to Simulate a Large Number of Samples in Order to get a Sampling Distribution?

54:40

Example 1: Mean Batting Average

55:25

Example 2: Mean Sampling Distribution and Standard Error

59:07

Example 3: Sampling Distribution of the Mean

1:01:04

Sampling Distribution of Sample Proportions

54m 37s

Intro

0:00

Roadmap

0:06

Roadmap

0:07

Intro to Sampling Distribution of Sample Proportions (SDoSP)

0:51

Categorical Data (Examples)

0:52

Wish to Estimate Proportion of Population from Sample…

2:00

Notation

3:34

Population Proportion and Sample Proportion Notations

3:35

What's the Difference?

9:19

SDoM vs. SDoSP: Type of Data

9:20

SDoM vs. SDoSP: Shape

11:24

SDoM vs. SDoSP: Center

12:30

SDoM vs. SDoSP: Spread

15:34

Binomial Distribution vs. Sampling Distribution of Sample Proportions

19:14

Binomial Distribution vs. SDoSP: Type of Data

19:17

Binomial Distribution vs. SDoSP: Shape

21:07

Binomial Distribution vs. SDoSP: Center

21:43

Binomial Distribution vs. SDoSP: Spread

24:08

Example 1: Sampling Distribution of Sample Proportions

26:07

Example 2: Sampling Distribution of Sample Proportions

37:58

Example 3: Sampling Distribution of Sample Proportions

44:42

Example 4: Sampling Distribution of Sample Proportions

45:57

Section 10: Inferential Statistics

Introduction to Confidence Intervals

42m 53s

Intro

0:00

Roadmap

0:06

Roadmap

0:07

Inferential Statistics

0:50

Inferential Statistics

0:51

Two Problems with This Picture…

3:20

Two Problems with This Picture…

3:21

Solution: Confidence Intervals (CI)

4:59

Solution: Hypotheiss Testing (HT)

5:49

Which Parameters are Known?

6:45

Which Parameters are Known?

6:46

Confidence Interval - Goal

7:56

When We Don't Know m but know s

7:57

When We Don't Know

18:27

When We Don't Know m nor s

18:28

Example 1: Confidence Intervals

26:18

Example 2: Confidence Intervals

29:46

Example 3: Confidence Intervals

32:18

Example 4: Confidence Intervals

38:31

t Distributions

1h 2m 6s

Intro

0:00

Roadmap

0:04

Roadmap

0:05

When to Use z vs. t?

1:07

When to Use z vs. t?

1:08

What is z and t?

3:02

z-score and t-score: Commonality

3:03

z-score and t-score: Formulas

3:34

z-score and t-score: Difference

5:22

Why not z? (Why t?)

7:24

Why not z? (Why t?)

7:25

But Don't Worry!

15:13

Gossett and t-distributions

15:14

Rules of t Distributions

17:05

t-distributions are More Normal as n Gets Bigger

17:06

t-distributions are a Family of Distributions

18:55

Degrees of Freedom (df)

20:02

Degrees of Freedom (df)

20:03

t Family of Distributions

24:07

t Family of Distributions : df = 2 , 4, and 60

24:08

df = 60

29:16

df = 2

29:59

How to Find It?

31:01

'Student's t-distribution' or 't-distribution'

31:02

Excel Example

33:06

Example 1: Which Distribution Do You Use? Z or t?

45:26

Example 2: Friends on Facebook

47:41

Example 3: t Distributions

52:15

Example 4: t Distributions , confidence interval, and mean

55:59

Introduction to Hypothesis Testing

1h 6m 33s

Intro

0:00

Roadmap

0:06

Roadmap

0:07

Issues to Overcome in Inferential Statistics

1:35

Issues to Overcome in Inferential Statistics

1:36

What Happens When We Don't Know What the Population Looks Like?

2:57

How Do We Know whether a sample is Sufficiently Unlikely

3:43

Hypothesizing a Population

6:44

Hypothesizing a Population

6:45

Null Hypothesis

8:07

Alternative Hypothesis

8:56

Hypotheses

11:58

Hypotheses

11:59

Errors in Hypothesis Testing

14:22

Errors in Hypothesis Testing

14:23

Steps of Hypothesis Testing

21:15

Steps of Hypothesis Testing

21:16

Single Sample HT ( When Sigma Available)

26:08

Example: Average Facebook Friends

26:09

Step1

27:08

Step 2

27:58

Step 3

28:17

Step 4

32:18

Single Sample HT (When Sigma Not Available)

36:33

Example: Average Facebook Friends

36:34

Step1: Hypothesis Testing

36:58

Step 2: Significance Level

37:25

Step 3: Decision Stage

37:40

Step 4: Sample

41:36

Sigma and p-value

45:04

Sigma and p-value

45:05

On tailed vs. Two Tailed Hypotheses

45:51

Example 1: Hypothesis Testing

48:37

Example 2: Heights of Women in the US

57:43

Example 3: Select the Best Way to Complete This Sentence

1:03:23

Confidence Intervals for the Difference of Two Independent Means

55m 14s

Intro

0:00

Roadmap

0:14

Roadmap

0:15

One Mean vs. Two Means

1:17

One Mean vs. Two Means

1:18

Notation

2:41

A Sample! A Set!

2:42

Mean of X, Mean of Y, and Difference of Two Means

3:56

SE of X

4:34

SE of Y

6:28

Sampling Distribution of the Difference between Two Means (SDoD)

7:48

Sampling Distribution of the Difference between Two Means (SDoD)

7:49

Rules of the SDoD (similar to CLT!)

15:00

Mean for the SDoD Null Hypothesis

15:01

Standard Error

17:39

When can We Construct a CI for the Difference between Two Means?

21:28

Three Conditions

21:29

Finding CI

23:56

One Mean CI

23:57

Two Means CI

25:45

Finding t

29:16

Finding t

29:17

Interpreting CI

30:25

Interpreting CI

30:26

Better Estimate of s (s pool)

34:15

Better Estimate of s (s pool)

34:16

Example 1: Confidence Intervals

42:32

Example 2: SE of the Difference

52:36

Hypothesis Testing for the Difference of Two Independent Means

50m

Intro

0:00

Roadmap

0:06

Roadmap

0:07

The Goal of Hypothesis Testing

0:56

One Sample and Two Samples

0:57

Sampling Distribution of the Difference between Two Means (SDoD)

3:42

Sampling Distribution of the Difference between Two Means (SDoD)

3:43

Rules of the SDoD (Similar to CLT!)

6:46

Shape

6:47

Mean for the Null Hypothesis

7:26

Standard Error for Independent Samples (When Variance is Homogenous)

8:18

Standard Error for Independent Samples (When Variance is not Homogenous)

9:25

Same Conditions for HT as for CI

10:08

Three Conditions

10:09

Steps of Hypothesis Testing

11:04

Steps of Hypothesis Testing

11:05

Formulas that Go with Steps of Hypothesis Testing

13:21

Step 1

13:25

Step 2

14:18

Step 3

15:00

Step 4

16:57

Example 1: Hypothesis Testing for the Difference of Two Independent Means

18:47

Example 2: Hypothesis Testing for the Difference of Two Independent Means

33:55

Example 3: Hypothesis Testing for the Difference of Two Independent Means

44:22

Confidence Intervals & Hypothesis Testing for the Difference of Two Paired Means

1h 14m 11s

Intro

0:00

Roadmap

0:09

Roadmap

0:10

The Goal of Hypothesis Testing

1:27

One Sample and Two Samples

1:28

Independent Samples vs. Paired Samples

3:16

Independent Samples vs. Paired Samples

3:17

Which is Which?

5:20

Independent SAMPLES vs. Independent VARIABLES

7:43

independent SAMPLES vs. Independent VARIABLES

7:44

T-tests Always…

10:48

T-tests Always…

10:49

Notation for Paired Samples

12:59

Notation for Paired Samples

13:00

Steps of Hypothesis Testing for Paired Samples

16:13

Steps of Hypothesis Testing for Paired Samples

16:14

Rules of the SDoD (Adding on Paired Samples)

18:03

Shape

18:04

Mean for the Null Hypothesis

18:31

Standard Error for Independent Samples (When Variance is Homogenous)

19:25

Standard Error for Paired Samples

20:39

Formulas that go with Steps of Hypothesis Testing

22:59

Formulas that go with Steps of Hypothesis Testing

23:00

Confidence Intervals for Paired Samples

30:32

Confidence Intervals for Paired Samples

30:33

Example 1: Confidence Intervals & Hypothesis Testing for the Difference of Two Paired Means

32:28

Example 2: Confidence Intervals & Hypothesis Testing for the Difference of Two Paired Means

44:02

Example 3: Confidence Intervals & Hypothesis Testing for the Difference of Two Paired Means

52:23

Type I and Type II Errors

31m 27s

Intro

0:00

Roadmap

0:18

Roadmap

0:19

Errors and Relationship to HT and the Sample Statistic?

1:11

Errors and Relationship to HT and the Sample Statistic?

1:12

Instead of a Box…Distributions!

7:00

One Sample t-test: Friends on Facebook

7:01

Two Sample t-test: Friends on Facebook

13:46

Usually, Lots of Overlap between Null and Alternative Distributions

16:59

Overlap between Null and Alternative Distributions

17:00

How Distributions and 'Box' Fit Together

22:45

How Distributions and 'Box' Fit Together

22:46

Example 1: Types of Errors

25:54

Example 2: Types of Errors

27:30

Example 3: What is the Danger of the Type I Error?

29:38

Effect Size & Power

44m 41s

Intro

0:00

Roadmap

0:05

Roadmap

0:06

Distance between Distributions: Sample t

0:49

Distance between Distributions: Sample t

0:50

Problem with Distance in Terms of Standard Error

2:56

Problem with Distance in Terms of Standard Error

2:57

Test Statistic (t) vs. Effect Size (d or g)

4:38

Test Statistic (t) vs. Effect Size (d or g)

4:39

Rules of Effect Size

6:09

Rules of Effect Size

6:10

Why Do We Need Effect Size?

8:21

Tells You the Practical Significance

8:22

HT can be Deceiving…

10:25

Important Note

10:42

What is Power?

11:20

What is Power?

11:21

Why Do We Need Power?

14:19

Conditional Probability and Power

14:20

Power is:

16:27

Can We Calculate Power?

19:00

Can We Calculate Power?

19:01

How Does Alpha Affect Power?

20:36

How Does Alpha Affect Power?

20:37

How Does Effect Size Affect Power?

25:38

How Does Effect Size Affect Power?

25:39

How Does Variability and Sample Size Affect Power?

27:56

How Does Variability and Sample Size Affect Power?

27:57

How Do We Increase Power?

32:47

Increasing Power

32:48

Example 1: Effect Size & Power

35:40

Example 2: Effect Size & Power

37:38

Example 3: Effect Size & Power

40:55

Section 11: Analysis of Variance

F-distributions

24m 46s

Intro

0:00

Roadmap

0:04

Roadmap

0:05

Z- & T-statistic and Their Distribution

0:34

Z- & T-statistic and Their Distribution

0:35

F-statistic

4:55

The F Ration ( the Variance Ratio)

4:56

F-distribution

12:29

F-distribution

12:30

s and p-value

15:00

s and p-value

15:01

Example 1: Why Does F-distribution Stop At 0 But Go On Until Infinity?

18:33

Example 2: F-distributions

19:29

Example 3: F-distributions and Heights

21:29

ANOVA with Independent Samples

1h 9m 25s

Intro

0:00

Roadmap

0:05

Roadmap

0:06

The Limitations of t-tests

1:12

The Limitations of t-tests

1:13

Two Major Limitations of Many t-tests

3:26

Two Major Limitations of Many t-tests

3:27

Ronald Fisher's Solution… F-test! New Null Hypothesis

4:43

Ronald Fisher's Solution… F-test! New Null Hypothesis (Omnibus Test - One Test to Rule Them All!)

4:44

Analysis of Variance (ANoVA) Notation

7:47

Analysis of Variance (ANoVA) Notation

7:48

Partitioning (Analyzing) Variance

9:58

Total Variance

9:59

Within-group Variation

14:00

Between-group Variation

16:22

Time out: Review Variance & SS

17:05

Time out: Review Variance & SS

17:06

F-statistic

19:22

The F Ratio (the Variance Ratio)

19:23

S²bet = SSbet / dfbet

22:13

What is This?

22:14

How Many Means?

23:20

So What is the dfbet?

23:38

So What is SSbet?

24:15

S²w = SSw / dfw

26:05

What is This?

26:06

How Many Means?

27:20

So What is the dfw?

27:36

So What is SSw?

28:18

Chart of Independent Samples ANOVA

29:25

Chart of Independent Samples ANOVA

29:26

Example 1: Who Uploads More Photos: Unknown Ethnicity, Latino, Asian, Black, or White Facebook Users?

35:52

Hypotheses

35:53

Significance Level

39:40

Decision Stage

40:05

Calculate Samples' Statistic and p-Value

44:10

Reject or Fail to Reject H0

55:54

Example 2: ANOVA with Independent Samples

58:21

Repeated Measures ANOVA

1h 15m 13s

Intro

0:00

Roadmap

0:05

Roadmap

0:06

The Limitations of t-tests

0:36

Who Uploads more Pictures and Which Photo-Type is Most Frequently Used on Facebook?

0:37

ANOVA (F-test) to the Rescue!

5:49

Omnibus Hypothesis

5:50

Analyze Variance

7:27

Independent Samples vs. Repeated Measures

9:12

Same Start

9:13

Independent Samples ANOVA

10:43

Repeated Measures ANOVA

12:00

Independent Samples ANOVA

16:00

Same Start: All the Variance Around Grand Mean

16:01

Independent Samples

16:23

Repeated Measures ANOVA

18:18

Same Start: All the Variance Around Grand Mean

18:19

Repeated Measures

18:33

Repeated Measures F-statistic

21:22

The F Ratio (The Variance Ratio)

21:23

S²bet = SSbet / dfbet

23:07

What is This?

23:08

How Many Means?

23:39

So What is the dfbet?

23:54

So What is SSbet?

24:32

S² resid = SS resid / df resid

25:46

What is This?

25:47

So What is SS resid?

26:44

So What is the df resid?

27:36

SS subj and df subj

28:11

What is This?

28:12

How Many Subject Means?

29:43

So What is df subj?

30:01

So What is SS subj?

30:09

SS total and df total

31:42

What is This?

31:43

What is the Total Number of Data Points?

32:02

So What is df total?

32:34

so What is SS total?

32:47

Chart of Repeated Measures ANOVA

33:19

Chart of Repeated Measures ANOVA: F and Between-samples Variability

33:20

Chart of Repeated Measures ANOVA: Total Variability, Within-subject (case) Variability, Residual Variability

35:50

Example 1: Which is More Prevalent on Facebook: Tagged, Uploaded, Mobile, or Profile Photos?

40:25

Hypotheses

40:26

Significance Level

41:46

Decision Stage

42:09

Calculate Samples' Statistic and p-Value

46:18

Reject or Fail to Reject H0

57:55

Example 2: Repeated Measures ANOVA

58:57

Example 3: What's the Problem with a Bunch of Tiny t-tests?

1:13:59

Section 12: Chi-square Test

Chi-Square Goodness-of-Fit Test

58m 23s

Intro

0:00

Roadmap

0:05

Roadmap

0:06

Where Does the Chi-Square Test Belong?

0:50

Where Does the Chi-Square Test Belong?

0:51

A New Twist on HT: Goodness-of-Fit

7:23

HT in General

7:24

Goodness-of-Fit HT

8:26

Hypotheses about Proportions

12:17

Null Hypothesis

12:18

Alternative Hypothesis

13:23

Example

14:38

Chi-Square Statistic

17:52

Chi-Square Statistic

17:53

Chi-Square Distributions

24:31

Chi-Square Distributions

24:32

Conditions for Chi-Square

28:58

Condition 1

28:59

Condition 2

30:20

Condition 3

30:32

Condition 4

31:47

Example 1: Chi-Square Goodness-of-Fit Test

32:23

Example 2: Chi-Square Goodness-of-Fit Test

44:34

Example 3: Which of These Statements Describe Properties of the Chi-Square Goodness-of-Fit Test?

56:06

Chi-Square Test of Homogeneity

51m 36s

Intro

0:00

Roadmap

0:09

Roadmap

0:10

Goodness-of-Fit vs. Homogeneity

1:13

Goodness-of-Fit HT

1:14

Homogeneity

2:00

Analogy

2:38

Hypotheses About Proportions

5:00

Null Hypothesis

5:01

Alternative Hypothesis

6:11

Example

6:33

Chi-Square Statistic

10:12

Same as Goodness-of-Fit Test

10:13

Set Up Data

12:28

Setting Up Data Example

12:29

Expected Frequency

16:53

Expected Frequency

16:54

Chi-Square Distributions & df

19:26

Chi-Square Distributions & df

19:27

Conditions for Test of Homogeneity

20:54

Condition 1

20:55

Condition 2

21:39

Condition 3

22:05

Condition 4

22:23

Example 1: Chi-Square Test of Homogeneity

22:52

Example 2: Chi-Square Test of Homogeneity

32:10

Section 13: Overview of Statistics

Overview of Statistics

18m 11s

Intro

0:00

Roadmap

0:07

Roadmap

0:08

The Statistical Tests (HT) We've Covered

0:28

The Statistical Tests (HT) We've Covered

0:29

Organizing the Tests We've Covered…

1:08

One Sample: Continuous DV and Categorical DV

1:09

Two Samples: Continuous DV and Categorical DV

5:41

More Than Two Samples: Continuous DV and Categorical DV

8:21

The Following Data: OK Cupid

10:10

The Following Data: OK Cupid

10:11

Example 1: Weird-MySpace-Angle Profile Photo

10:38

Example 2: Geniuses

12:30

Example 3: Promiscuous iPhone Users

13:37

Example 4: Women, Aging, and Messaging

16:07

This is a quick preview of the lesson. For full access, please Log In or Sign up.
For more information, please see full course syllabus of Statistics

Statistics Confidence Intervals & Hypothesis Testing for the Difference of Two Paired Means

Name: Statistics: Confidence Intervals & Hypothesis Testing for the Difference of Two Paired Means
Brand: Educator.com
Price: 35 USD
Availability: InStock

Section 10: Inferential Statistics: Lecture 6 | 1:14:11 min

Lecture Description

Next Lecture

Previous Lecture

Discussion
Answer Engine
Download Lecture Slides
Table of Contents
Transcription
Related Books

Lecture Comments (2)

0 answers

Post by Samuel Tindell on January 30, 2019

when calculating the SE for the independent samples why didn't you square the Sx and Sy. Instead it seems like it's just sqrt (Sx/Nx) + (Sy/Ny) which in the example yielded 2.26. I thought it was sqrt(Sx^2/Nx)+ (Sy^2/Ny) which would be 17.8 in this example but I admit seems very wrong which is why I'm confused.

0 answers

Post by Brijesh Bolar on August 22, 2012

I really like the way you simplified hypothesis testing..

Answer EngineGet answers to any question!Ask any question related to Statistics

Working on the solution...

Confidence Intervals & Hypothesis Testing for the Difference of Two Paired Means

Lecture Slides are screen-captured images of important points in the lecture. Students can download and print out these lecture slide images to do practice problems as well as take notes while watching the lecture.

Intro

Roadmap

Roadmap

The Goal of Hypothesis Testing

One Sample and Two Samples

Independent Samples vs. Paired Samples

Independent SAMPLES vs. Independent VARIABLES

independent SAMPLES vs. Independent VARIABLES

T-tests Always…

T-tests Always…

Notation for Paired Samples

Notation for Paired Samples

Steps of Hypothesis Testing for Paired Samples

Steps of Hypothesis Testing for Paired Samples

Rules of the SDoD (Adding on Paired Samples)

Formulas that go with Steps of Hypothesis Testing

Formulas that go with Steps of Hypothesis Testing

Confidence Intervals for Paired Samples

Confidence Intervals for Paired Samples

Example 1: Confidence Intervals & Hypothesis Testing for the Difference of Two Paired Means

Example 2: Confidence Intervals & Hypothesis Testing for the Difference of Two Paired Means

Example 3: Confidence Intervals & Hypothesis Testing for the Difference of Two Paired Means

Intro 0:00
Roadmap 0:09

Roadmap

The Goal of Hypothesis Testing 1:27

One Sample and Two Samples

Independent Samples vs. Paired Samples 3:16

Independent Samples vs. Paired Samples
Which is Which?

Independent SAMPLES vs. Independent VARIABLES 7:43

independent SAMPLES vs. Independent VARIABLES

T-tests Always… 10:48

T-tests Always…

Notation for Paired Samples 12:59

Notation for Paired Samples

Steps of Hypothesis Testing for Paired Samples 16:13

Steps of Hypothesis Testing for Paired Samples

Rules of the SDoD (Adding on Paired Samples) 18:03

Shape
Mean for the Null Hypothesis
Standard Error for Independent Samples (When Variance is Homogenous)
Standard Error for Paired Samples

Formulas that go with Steps of Hypothesis Testing 22:59

Formulas that go with Steps of Hypothesis Testing

Confidence Intervals for Paired Samples 30:32

Confidence Intervals for Paired Samples

Example 1: Confidence Intervals & Hypothesis Testing for the Difference of Two Paired Means 32:28
Example 2: Confidence Intervals & Hypothesis Testing for the Difference of Two Paired Means 44:02
Example 3: Confidence Intervals & Hypothesis Testing for the Difference of Two Paired Means 52:23

General Statistics Online Course

Section 1: Introduction
	Descriptive Statistics vs. Inferential Statistics	25:31
Section 2: About Samples: Cases, Variables, Measurements
	About Samples: Cases, Variables, Measurements	32:14
Section 3: Visualizing Distributions
	Introduction to Excel	8:09
	Frequency Distributions in Excel	39:10
	Frequency Distributions and Features	25:29
	Dotplots and Histograms in Excel	42:42
	Stemplots	12:23
	Bar Graphs	22:49
Section 4: Summarizing Distributions
	Central Tendency: Mean, Median, Mode	38:50
	Variability	42:40
	Five Number Summary & Boxplots	57:15
	Shape: Calculating Skewness & Kurtosis	41:51
	Normal Distribution	34:33
	Standard Normal Distributions & Z-Scores	41:44
	Normal Distribution: PDF vs. CDF	55:44
Section 5: Linear Regression
	Scatterplots	47:19
	Regression	32:02
	Least Squares Regression	56:36
	Correlation	43:58
	Correlation: r vs. r-squared	52:52
	Transformations of Data	27:08
Section 6: Collecting Data in an Experiment
	Sampling & Bias	54:44
	Sampling Methods	14:25
	Research Design	53:54
	Between and Within Treatment Variability	41:31
Section 7: Review of Probability Axioms
	Sample Spaces	37:52
	Addition Rule for Disjoint Events	20:29
	Conditional Probability	57:19
	Independent Events	24:27
Section 8: Probability Distributions
	Introduction to Probability Distributions	56:45
	Expected Value & Variance of Probability Distributions	53:41
	Binomial Distribution	55:15
Section 9: Sampling Distributions of Statistics
	Introduction to Sampling Distributions	48:17
	Sampling Distribution of the Mean	1:08:48
	Sampling Distribution of Sample Proportions	54:37
Section 10: Inferential Statistics
	Introduction to Confidence Intervals	42:53
	t Distributions	1:02:06
	Introduction to Hypothesis Testing	1:06:33
	Confidence Intervals for the Difference of Two Independent Means	55:14
	Hypothesis Testing for the Difference of Two Independent Means	50:00
	Confidence Intervals & Hypothesis Testing for the Difference of Two Paired Means	1:14:11
	Type I and Type II Errors	31:27
	Effect Size & Power	44:41
Section 11: Analysis of Variance
	F-distributions	24:46
	ANOVA with Independent Samples	1:09:25
	Repeated Measures ANOVA	1:15:13
Section 12: Chi-square Test
	Chi-Square Goodness-of-Fit Test	58:23
	Chi-Square Test of Homogeneity	51:36
Section 13: Overview of Statistics
	Overview of Statistics	18:11

Transcription: Confidence Intervals & Hypothesis Testing for the Difference of Two Paired Means

Hi and welcome to www.educator.com.0000

We are going to talk about confidence interval and hypothesis testing for the difference of two paired means.0002

We have been talking about independent samples so far, one example, two independent samples.0008

We are going to talk about paired samples.0017

We are going to look at the difference between independent samples and paired samples.0020

We are also going to try and clarify the difference between independent sample0025

and independent variables because paired samples still use independent variables.0029

We are going to talk about two types of t-tests.0035

One that we covered or also called hypothesis testing and one that we covered so far with independent samples.0039

The new one that was cover with paired samples.0046

We are going to introduce some notation for paired samples, go through the steps of hypothesis testing0050

for paired samples and adjust or add on to the rules of SDOD that we already have looked at.0058

Finally we are going to go over the formulas that go with the steps of hypothesis testing for independent as well as paired samples.0069

We are going to briefly cover confidence interval for paired samples.0081

Here is the goal of hypothesis testing.0085

Remember, with one sample our goal was to reject the null when we get a sample0091

that significantly different from the hypothesized population.0098

When we talk about two-tailed hypotheses we are really saying the0102

hypothesized population might be significantly higher or significantly lower.0107

Either way, we do not care.0114

The sample is too low or too high, it is too extreme in some way.0116

If that is the case, we reject the null.0123

In two samples, what we do is we reject the null when we get samples that0125

are significantly different from each other in some way.0132

Either one is significantly lower than the other or the other is significantly lower than the one.0135

It does not matter.0141

Our null hypothesis becomes this idea that x - y either = 0 because they are the same0142

and the alternative is that it does not equal 0 because they are different from each other.0152

If they are the same that is considered the null hypotheses and when they are different that considered the alternative hypotheses.0159

Remember another way you could write this is by adding y to each side and then you get x=y.0168

X = y they are the same.0174

In that way you know that you are covering the entire space of all the differences and the end of the day0176

we can figure out whether they are the same or we do not think that the they are the same.0186

Let us talk about independent samples versus paired samples because from here on out,0195

we are totally going to be dealing with paired samples.0203

It would help to know what those are.0205

Independent samples, the scores are derived separately from each other.0208

For instance they came from separate people, separate schools, separate dishes.0212

The samples are independent from each other.0219

My getting of the sample had nothing to do with my getting of this other sample.0222

In dependent, another word for paired, in dependent or paired samples the scores are linked in some way.0227

For instance, they are linked by the same person so my score on the math test and my score on the english test are linked because they both come from me.0236

Maybe we are one married couple, we ask one spouse how many children would you like to have0248

and you ask the other spouse how many children would you like to have?0258

In that way, although they come from different people these scores are linked because they come from the same married couple.0262

Another thing might be a pre and post tests of the class.0269

Maybe a statistics class might do a pre and post test.0276

Maybe 10 different statistics classes from all over the United States picked to do a pre and post test.0279

Those tests are linked because the same class did the first test and the second test.0287

10 different classes did the pairs.0295

It is not just a hodgepodge of pretests scores and a hodgepodge of posttest scores, it is more like a neat line0298

where the pretests scores for this guy, but for this class is lined up with the pretests scores for that class.0309

They are all lined up next to each other.0317

We know these definitions, let us see if we can pick them out.0319

Which of these is which?0327

The test scores from Professor x’s class versus test scores from professor y class.0329

Will these be independent samples because they just come from different classes?0336

They are not each score is not linked in any particular way.0341

River samples from 8 feet deep versus 16 feet deep.0346

This also does not really seem like paired samples unless they went through0350

some procedure to make sure it is the same spot in the river.0355

That is probably an independent sample.0360

Male heights versus female heights, they just a jumble of heights over here and a jumble of heights over here.0364

They are not like match to each other.0370

They are independent samples.0372

Left hand span versus right hand span will in this case basically these two spans came from the same person.0375

It is not a hodgepodge like left hand right hand from person 1, left hand right hand for person2 or person 3.0384

I would say this is a paired sample.0392

Productive vocabulary of two-year-old infant often raised by bilingual parents versus monolingual parents.0395

It is a bunch of scores here and a bunch of scores here.0402

They are not lined up in any way.0406

I would say independent.0408

Productive vocabulary of identical twins, twin 1, twin 2.0410

Here we see paired samples.0417

Scores on an eye gaze by autistic individual and age matched controls.0420

Autistic individuals often have trouble with eye gaze and in order to know that you0427

would have to match them with people who are the same age who are not autistic.0432

Here we have autistic individual lined up with somebody who is their same age is not autistic.0438

They are these nice even pairs and each pair has eye gaze scores.0445

I would say these are paired samples.0452

Hopefully that give you a better idea of some examples of paired samples.0457

What about independence samples versus independent variables?0462

What you will also see is IV.0469

In multi sample statistics like 2, 3, 4 samples we are often trying to find some0471

predictive relationship between the IV and the DV.0477

The independent variable and the dependent variable.0481

Usually this is often called the test or the score.0484

The independent variable is seen as the predictor and the dependent variable0488

is the thing that is been predicted the outcome.0495

We might be interested in the IV of parent language and you might have two levels of bilingual and monolingual.0498

You might be interested in how that impacts the DV of children’s vocabulary.0519

Here we have these two groups, bilingual and monolingual.0534

We have these scorers from children and these are independent samples because0542

although we have two groups these scores are not linked to each other in any particular way.0550

They are just a hodgepodge of scores here and a hodgepodge of scores here.0556

On the other hand, if our IV is something like age of twin.0560

We have slightly older like a couple of minutes or seconds, and younger.0572

We want to know is that has an impact on vocabulary.0582

We will have a bunch of scorers for older twins versus younger twins, but these scores are not just in a jumble.0593

They are linked to each other because these are twins.0611

They are identical.0615

This is the picture you could draw and the IV tells you how you determine these groups. The paired parts tells you whether these groups scores are linked to some scores0617

in the other group for some reason or another.0640

Here they are linked but here they are not linked.0642

In all t tests, we are calling them hypothesis testing.0646

We are going to have other hypothesis tests but so far we are using t test.0657

T tests always have some sort of categorical IV so that you can create different groups0662

and in t-tests it is always technically two groups, two means, paired means.0668

The DV is always continuous.0674

The reason that the dependent variable or the scores always continuous is because you need to calculate means in order to do a t test.0678

We are comparing means too and looking at standard error and you can compute mean0687

and standard error for categorical variables.0694

If you have a categorical variables such as you know, yes or no, you cannot quite compute a mean for that.0697

Or if you have a categorical variable like red or yellow, you cannot compute a standard error for that.0707

If you did have a categorical DV and a categorical IV, you would use what it is called the logistic test.0713

We are actually not going to cover that.0721

That does not usually get covered in descriptive and inferential statistics. T0723

Usually you have to graduate level work or higher level statistics courses.0727

There are two types of t test given all of this.0735

Remember all t tests have this.0740

These are all t tests.0742

Both of these t tests are going to use categorical IV and continuous DV.0743

The first kind of t test is what we have been talking about so far, independent samples t tests.0750

The second type is what we are going to cover today called paired or dependent samples.0762

Both of these have categorical IV and continuous DV.0769

Let us have some notations for paired samples.0778

Just like before, with two sample independent sample t test, for one example,0784

you might call it x so that its individual members are x sub 1, x sub 2, x sub 3.0792

Remember each sample is a set of numbers.0797

It is not just one number but a set of numbers.0800

Second sample, you might call y.0803

I did not have to pick x and y though.0807

I could pick other letters.0809

Y could just mean another sample.0810

You could have picked w or p or n.0816

We usually try to reserve n, t, f, d, k for other things in statistics, but it is mostly by culture more than we have to do it by rules.0820

Here is the third thing you need to know for paired samples.0837

With paired samples remember x sub 1 and y sub 1 are somehow linked to each other.0842

They either come from the same person or the same married couple or0848

they are a set of twins or it is an autistic person and age matched control.0853

All these reasons why these are linked to each other in some way.0859

And because of that you can actually subtract these scores from each other and get a set of different scores.0865

That is what we call d.0872

D is x sub 1 – y sub 1.0874

What is the difference between these two scores?0877

What is the difference between these two scores and what is the difference between these two scores?0882

These are paired differences.0888

Let us think about this.0891

If the mean of x is denoted as x bar and the mean of y is denoted as y bar, what do you think the mean of d might be?0894

I guess d bar and that is what it is.0902

If you got the mean of this entire set that would be d bar.0907

Once you have d bar, you could imagine having a sampling distribution made of d bars.0912

It is not x bars anymore, sampling distribution of the mean is the sampling distribution of the mean of a whole bunch of differences.0924

That is a new idea here.0942

Imagine getting a sample of d, calculating the mean d bar and placing it somewhere here.0945

You will get a sampling distribution of d bars.0959

That is what we are going to learn about next.0964

These are means of a bunch of linked differences.0966

When we go through the steps of hypothesis testing for paired samples it is going0971

to be very similar to hypothesis testing for independent samples with just a few tweaks.0979

First you need to stay to hypothesis and often our null hypothesis is that the two groups of scores, the two samples x and y are the same.0985

Usually that is the null hypothesis.0997

You put the significance level, how weird does our sample has to be for us to reject that null hypothesis.1004

We set a decision stage and we draw here the SDOD d bar.1013

We identify the critical limits and rejection regions and we find the critical test statistic.1020

From here on out I am going to assume that you are almost never going to1027

be given the actual standard deviation of the population.1033

From here on out I am usually going to be using t instead of z.1038

Then we use the actual sample differences and SDOD in order to compute the mean differences.1041

We are not dealing with just the means, we are dealing with mean differences, test statistics, and p value.1053

We compare the sample to the population and we decide whether to reject the null or not.1061

Things are very similar so far.1069

It is going to make us figure out what SDOD is all about.1073

The rules of SDOD we are now adding on to sampling distribution of1083

the differences between means that we talked about before you.1093

We are going to add onto that.1100

The SDOM for x and y are normal then the SDOD is normal too.1103

That is the same here.1109

The mean for the null hypotheses now looks like this.1111

Remember the SDOD with the bar, the mean here is no longer called the mu sub x bar - y bar because it is actually x bar - y bar.1116

A whole bunch of them and then you find the mean of them.1132

That is called d bar.1136

That is the new notation for the differences of paired samples.1137

Here the mu of d bar for the null hypotheses equal 0.1147

Remember for independent samples = that for mu sub x bar - y bar that = 0.1153

It is very similar.1162

For standard error for independent samples when various is not homogenous, which is largely the case,1164

what we would use is s sub x bar - y bar.1174

Instead here for paired samples, we would use s sub d bar.1182

Here what we would do is take the square root of the variance of1188

the standard error from x and the standard error variance of y bar and add that together.1194

If you wanted to write that out more fully, that would be s sub x² the variance of x / n sub x + variance of y / n sub y.1207

That is what you would do if life was easy and you have independent samples.1228

That is what we know so far.1238

What about for paired samples?1240

For paired samples you have to think about the world differently.1242

You have to think first we are getting a whole bunch of differences then we are finding the standard error of those differences.1245

Here is that we are going to do.1253

Here we would find standard error of those differences by looking at1256

the standard deviation of the differences ÷ how many differences we have.1263

This is a little crazy, but when I show you it, it will be much more easy to understand.1272

I think a lot of people have trouble understanding what is difference between this and this?1281

I cannot keep track all these differences.1287

We have to draw SDOD.1291

You have to remember it is made up of a whole bunch of d bars.1302

He is made up of a whole bunch of these.1312

You have to imagine pulling out samples, finding the differences,1314

averaging those differences together, then plotting it here.1324

Each single sample it has a standard deviation made up of differences.1328

Once you plot a whole bunch of these d bars on here, this is going to have a standard deviation and that is called standard error.1337

Here we have mu sub d bar and this standard error is standard error sub d bar.1347

Standard deviations of d bar whereas this is just for one sample.1359

This guy is for entire sampling distribution.1367

Let us talk about the different formulas that go with the steps of hypothesis testing.1378

Hopefully we can drive home the difference between SDOD from before and SDOD now, we will call it SDOD bar.1385

For independent samples, first we had to write down a null hypothesis and alternative hypothesis.1398

Often a null hypothesis was that the mu sub x bar - y bar = 0 or mu sub x bar - y bar does not equal 0 as the alternative.1408

In paired samples our hypothesis looks very similar except now we are not dealing with x bar - y bars but we are dealing with difference bars.1421

The average of differences.1438

The mean differences.1440

This is the differences of means.1442

This is mean of differences.1448

We will get into the other one.1453

Mu sub d bar does not =0.1457

This so far it seems like okay.1463

Here difference of means and d bar is the mean of a whole bunch of differences.1467

We get a whole bunch of differences first, then we find the mean of it.1484

Here we find the means first and we find the difference between the means.1489

This part is actually the same.1495

It is alpha =.05 usually two tailed.1500

Step 2, we got that.1510

Significant level, we get it.1515

Step 3 is where we draw the SDOD here.1517

Here we draw the SDOD bar.1521

Thankfully you could draw it in similar ways, but conceptually they are talking about different things.1530

Here how we got it was we pulled a bunch of x.1538

We got the mean then we pulled a bunch of y then we got the mean and subtracted those means and plotted that here.1543

We did that millions and millions of time with a whole bunch of that.1550

We got the entire sampling distribution of differences of means.1554

Here what we did was we pull the sample of x and y.1560

We got a bunch of the differences and then we average those differences and then we plot it back.1568

Here this is the sampling distribution of the mean of differences.1579

Where the mean go in the order is really important.1591

Here we get mu sub x bar - y bar, but here we get mu sub d bar.1599

In order to find the degrees of freedom for the differences here what we did was1607

we found the degrees of freedom for x and add it to it the degrees of freedom for y.1615

We are going to do something else in order to find the degrees of freedom for1620

the difference we are going to count how many differences we average together and subtract 1.1626

This is how many n sub d – 1.1637

Finally we need to know the standard error of the sucker.1644

The standard error of differences here we called it s sub x bar - y bar and that1650

was the standard error of x, the variance of x bar + the variance of y bar.1659

The variance of these two things added together then take the square root.1670

This refers to this distribution with the spread of this distribution.1676

This difference here is actually going to be called s sub d bar and that is1688

going to be standard deviation of your sample of differences ÷ √n of those differences.1696

Last thing, I am leaving off step 5 because step 5 is explanatory.1707

Step 4, now we have to find the sample t.1719

Our sample is really two independent samples.1723

We have a sample of x and a sample of y.1732

Because of that we need to find the difference between those two means.1734

We find the mean of this group first, the mean of this group and we subtract.1741

We find the means first then we subtract - the mu sub x bar - y bar.1747

I want you to contrast this with this new sample t.1756

Here we get a bunch of x and y, we have two samples.1761

We find the differences first then we average.1766

Here we find the average first and find a different.1773

Here we find the differences then we find the average.1776

That is going to be d bar.1782

D bar – mu sub d bar.1784

This is getting a little bit cramped.1790

We divide all of that by the standard error of the difference and you could substitute that in.1796

Divide all that by the standard error of the differences.1803

You see how here it really matters when you take the differences.1811

Here you find the differences first and then you just deal with the differences.1820

Here, you have to keep finding means first then you find the differences between those means.1824

Let us talk about the confidence interval for these paired samples.1830

The confidence intervals are going to be very similar to the confidence intervals that you saw before with independent samples.1841

I am just covering it very briefly.1849

Let us think about independent samples.1851

In this case, the confidence interval was just going to be the difference of means and + or - t × the standard error.1854

You need to put in the appropriate standard error and use the appropriate degrees of freedom as well.1877

In confidence intervals for paired samples it is going to look very similar except instead of having the differences of means1884

you are going to put in the mean difference d bar + or - t × the standard error.1897

Remember standard error here is going to mean s sub x bar - y bar.1906

The standard error here is going to be s sub d bar.1914

In order to find degrees of freedom you have to take the degrees of freedom for x and add that to the degrees of freedom for y.1918

In order to find degrees of freedom you have to find the degrees of freedom for d1928

your sample of differences and that equals how many differences you have -1.1935

Let us talk about examples.1945

There is a download available for you and says this data set includes the highway1953

and city gas mileage for random sample of 8 cars.1958

Assume gas mileage is normally distributed.1962

It says that because we could see your sample is quite small so we do not have1965

a reason to assume that normal distribution of the SDOM.1970

Construct and interpret the confidence interval and also conduct an appropriate t test to check your confidence interval interpretation.1974

Here I have my example and going to example 1.1984

Here we have 8 models of cars, their highway miles per gallon, as well as their city miles per gallon.1989

You can see that there is a reason to consider these things as linked.2004

They are linked because they come from the same model car.2010

Let us construct the confidence interval.2013

Remember in confidence interval what we are going to do is use our sample in order to predict something about our population.2018

Here we will use our sample differences to say something about the real difference between these two populations.2028

Here is the big step of difference when you work with paired samples.2036

You have to first find the paired differences so the set of d.2042

That is going to be one of these will take highway - the city.2048

That x1 – y1, x2 – y2, x sub 3 – y sub 3.2054

Here are all our differences and we can now find the average differences.2062

We can find the standard deviation of these differences and all the stuff.2067

Let us find confidence interval and this helps me to say what I need is my d bar + or - t × the standard error.2071

In order to find my t but in order to do that I need to find my degrees of freedom.2090

My degrees of freedom is just going to be the degrees of freedom of the d.2098

How many differences I have -1.2107

That is count how many differences they should have the same number of differences as cars -1 =7.2110

Once I have that, I could find my t.2121

I also need to find d bar.2126

Let us find t.2130

I need to find t and t inverse and I probably am going to assume a 95% confidence interval.2134

My two tailed probability is .05 and my degrees of freedom is down here and so that will be 2.36.2146

Those are my outer boundaries and let us also find d bar, the average.2157

I almost have everything I need.2165

I just need standard error.2172

Standard error here is going to be s sub d ÷ the square root of how many differences I have.2174

That is going to be the standard deviation of my differences ÷ the square root of 8 because I have 8 differences.2187

Once I have that, then I can find the confidence interval.2206

The upper boundary will be the d bar + t × standard error and the lower boundary is the same thing, except that this - t × standard error.2209

My upper boundary is that 10.6.2244

My lower boundary is that 7.6.2249

To interpret my confidence interval I would say the real difference between highway miles per gallon2253

and city miles per gallon I have 95% confidence that the real difference in the population is between 10.6 and 7.6.2264

Notice that 0 is not included in here in this confidence interval.2274

It would be 0 if highway and city miles per gallon could be equal to each other by chance.2280

There is less than 5% chance of them being equal to each other.2288

Because of that, I would guess that we would also reject the null because it does not include 0.2295

Let us do hypothesis testing to see if we do really reject the null because it does not include 02304

I would predict that we would reject the null.2312

Let us go straight into hypothesis testing here.2314

First things first.2317

The step 1, the null hypothesis this should be that the mu of d bar.2320

Here let us do hypothesis testing.2332

The first step is mu sub d bar is equal to 0.2344

Highway and city gas mileage are the same but the alternative is that one of them is different from the other.2356

That they are different from each other in some way.2366

It is significantly stand out.2369

This difference stands out.2371

That would be that mu sub d bar does not equal 0.2373

Step 2, my significance level, the false alarm rate is very low .05 and two tailed.2378

Let us set our decision stage.2392

I need to draw an SDOD bar and here I put my mu as 0 because the mu sub d bar will be 0.2397

Let us also find the standard error here.2418

The standard error here is going to be s sub d bar and that is really the standard deviation of the d / √n sub d.2421

That I could compute here.2434

Actually, we already computed that because we have the standard deviation of the d bars / the square root of how many d I have.2439

That is .64.2449

What is my degrees of freedom?2455

That is 7 because that is how many differences I have -1.2458

Based on that I can find my t and my t is going to be + or - 2.36.2466

Let us deal with our sample.2476

When we talk about the sample t, what we really mean is what the x bar of our sample differences that would be d bar.2483

I would just put x bar sub d because it is a simpler way of doing it.2502

- the mu which is 0 / the standard error which is .64.2505

I could just put this here so I can skip directly to step 4 and I will compute my sample t.2512

I should say this is my critical t so that I do not get confused.2527

My sample t is going to be d bar - mu / standard error.2533

That is d bar - mu which is 0 ÷ standard error = 14.3.2546

I can also find the p value and I'm guessing my p value is probably be tiny.2564

Here 14.3 is really small.2573

My p value is going to be t dist because I want my probability.2577

I put in my t, my degrees of freedom which is 7, and I have a two-tailed hypotheses.2586

That is going to be 2 × 10^-6.2593

Imagine .000002 given this tiny p value much smaller than .05 we should say at step 5 reject the null.2610

We had predicted that we would reject the null because the CI, the confidence interval did include 0.2630

Good job confidence interval and hypothesis testing working together.2636

Example 2, see the download again, this data set shows the average salary earned by first-year college graduates.2641

Graduated at the bottom or top 15% of their class for random sample of 10 colleges ranked in the top 100 public colleges in the US.2650

Is there a significant difference in earnings that is unlikely to have occurred by chance alone?2661

We want to know is there a difference between these top 15% folks and the bottom 15% folks.2667

They are linked to having graduated from the same college.2674

We would not necessarily want to compare people from the top 15% of one college that might be really good to one2678

to the bottom percentage of people from a college that might be not as great.2687

We would really want from the same college does not matter if you are in the top 15 or bottom 15%.2693

If you go to example 2, you will see these randomly selected colleges and the earnings in dollars per year, salary per year for the bottom 15%, as well as the top 15%.2699

Because it is a paired sample what we want to do is start off with d or set up d.2718

What is the difference between bottom and top?2724

We are going to get probably a whole bunch of negative numbers assuming that top earners earn more than bottom.2729

Indeed we do, we have a bunch of negative numbers.2738

If you wanted to turn these negatives into positives, you just have to remember2740

which one you decided as x and which one you decided to be y.2745

I will call this one x and I will call this one y.2750

It will help me remember which one I subtracted from which.2759

I am going to reverse all of these and it is just going to give me the positive versions of this.2764

Here is my d.2771

Let us go ahead and start with hypothesis testing.2773

This part I will do by hand.2777

Step 1, the null hypothesis says something that the top 15% folks and the bottom 15% folks are the same.2783

Their difference is going to be 0.2796

The mu sub d bar should be 0 but the alternative is that they are different.2800

We are neutral as to how they are different.2807

We do not know whether one earns more than the other.2811

Whether they are top earns more than bottom or bottom earns more than the top.2813

We can use our common sense to predict that the top ranking folks might earn more, but right now we are neutral.2818

Step 2, is our alpha level .05 or significance level.2827

Let us say two details.2834

Step 3, drawing the SDOD, the mean differences and here we will put 0.2837

And let us figure out the standard error.2850

The standard error here would be s sub d bar and that would be the standard deviation of d / √(n ) sub d.2857

We also want to figure out the degrees of freedom so that is going to be n sub b -1 and we also want to find out the t.2871

These are all things you can do in Excel.2881

Step 3, standard error is going to be s sub d bar and that will be s sub d ÷ √n sub d.2884

That will be the standard deviation of our sample of d ÷ the square root of how many there are and there is 10.2903

Here is our standard error.2923

What is our degrees of freedom?2926

That is going to be 10-1 =9.2930

What is our critical t?2935

We know it is a critical t because we are still in step 3 the decision stage.2939

We are just setting up our boundaries.2944

That is going to be t inverse because we already know the probability .05 two-tailed,2947

degrees of freedom being 9 and we get 2.26.2953

It is + or -2.26 those are our boundaries of t.2959

Step 4, this will say what is our sample t?2966

And that is going to be our d bar – mu / standard error/2973

I will write step 4 here and so I need to find t which is d bar – mu/ standard error.2981

I need to find the bar for sure and standard error.2994

My d bar is the average of all my differences and that is about $12,000 - $13,000 a year.3000

That is just right after college.3016

I need to find the d bar - 0 ÷ the standard error to give me my sample t.3018

That is the difference between sample t and critical t.3033

8.05 is actually the average of the differences.3041

The top 15% are on average earning $13,000 more than the bottom 15%.3056

The sample t gives us how far that differences from 0 in terms of standard error.3065

We know that is way more extreme than 2.26.3075

Let us find the p value.3080

We put it in t dist because we want to know the probability.3083

Put in our t, degrees of freedom, and we have a two-tailed hypotheses.3087

That would be 2 × 10^-5.3094

Our p value = 2 × 10^-5 which is a very tiny, tiny number, much smaller than the alpha.3100

We would reject the null hypotheses.3113

Is there a significant difference in earnings that is unlikely to have occurred by chance alone?3118

There is always going to be a difference in earnings between these two groups of people, the top 15 and the bottom 15%.3125

Is this difference greater than would be expected by chance?3131

Yes it is because we are rejecting the model that they are equal to each other.3135

Example 3, in fitting hearing aids to individuals, researchers wanted to examine whether3141

there was a difference between hearing words in silence or in the presence of background noise.3151

Two equally difficult wordless are randomly presented to each person.3156

One less than silence and the other with white noise in a random order for each person.3160

This means that some people get silence than noise, other people get noise and silence.3166

Are the hearing aid equally effective in silence or with background noise?3171

First conduct the t test assuming that these are independent samples then conduct the t test assuming that these are paired samples.3178

Which is more powerful?3185

The independent sample t-test or paired samples t test?3188

We need to figure out what it means by more powerful.3192

I need some scratch paper here because the problem was so long I am just going to divide the space in half.3196

This top part I am going to use for assuming independent samples.3205

They are not actually independent samples, but I want you to see the difference between doing them as independent sample and doing them as paired samples doing this hypothesis testing as paired samples.3211

Step 1, the hypothesis, the null hypothesis is that if I get these sample3224

and they are independent this difference of means on average is going to be 0.3231

The mu sub x bar - y bar is going to = 0.3240

The alternative hypothesis is that the mu sub x bar - y bar does not equal to 0.3244

Here I am going to put alpha =.05, two-tailed.3252

I am going to draw myself an SDOD.3261

Just to let you know it is the differences of means.3268

Here we know that this is going to be 0 and we probably should find out the standard error.3273

The standard error of this difference of means is going to be the square roots of the variance of x bar + the variance of y bar.3283

I am going to write this out to be s sub x²/ n sub x + s sub y² /n sub y.3303

The variance of x and x bar, the variance of x /n, the variance of y/n.3315

We will probably need to find the degrees of freedom and that is going to be n sub x – 1 + n sub y -1.3321

Finally we will probably need to know the critical t but I will put that up here.3340

Let us look at this data, go to example 3.3345

Click on example 3 and take a look at this data.3353

Let us assume independent samples.3356

Here we are going to assume that this silence is just one group of scores and3359

this background noise is another group of scores and they are not paired.3368

They are actually paired.3372

This belongs to subject one, these 2 belongs to subject 3, this belongs to subject 5.3374

Here is the list order, it is A, B.3380

We get A list first then list B and here is the noise order.3384

They get it silent first then noisy.3388

This guy gets noisy first then silent.3390

All these orders are randomly assigned and the noise orders are randomly assigned as well.3393

For this exercise, we are going to assume we do not have any of this stuff.3406

We are going to assume this is gone and that this just a bunch of scores from one group of subjects3412

that listen to a list of words in silence and another group of subjects that listen to list of words in background noise.3418

We do the independent samples t test and we start with step 3.3427

We know we need to find the standard error, which is going to be the square root of the variance of x ÷ n(x) + the variance of y / n sub y.3433

All that added together and a square root.3473

We need to find the variance of x.3476

We need to find n sub x.3479

We also need to find the variance of y and n sub y before we can find standard error.3481

Variance is pretty easy.3488

We will just call silence x and the count of this is 24.3491

The count for y is going to be the same, but what is the variance of y?3504

The variance of y slightly different.3513

In order to find this guy, the standard error, we are going to put in square root of the variance of x ÷ 24 + the variance of y ÷ 24.3521

We get a standard error of 2.26 and standard error gives it just in terms of number of words accurately heard.3547

We also need to find the degrees of freedom.3564

In order to find degrees of freedom, we need the degrees of freedom for x + degrees of freedom for y.3567

The degrees of freedom for x is just going to be 24 - 1 and the degrees of freedom for y is also going to be 24 – 1.3574

The new degrees of freedom is 23 + 23 = 46.3586

Once we have that we can find our critical t.3593

Our critical t, we know that alpha is .05 so we are going to put in t in and3606

put in our two-tailed probability and the degrees of freedom 46.3615

We get a critical t of + or -2.01.3620

Our critical t is + or -2.01.3625

I will just leave that stuff on the Excel file.3631

Given all this now let us deal with the sample.3636

When we find the sample t what we are doing is finding the difference in means and then find the difference3641

between that difference and our expected difference 0 and divide all of that by standard error to find how many standard errors away we are.3653

Here I will put step 4, sample t.3666

In order to find sample t we need to find x bar - y bar - mu and all of that ÷ standard error.3672

Thankfully, we have a bunch of those things available to us quite easily.3697

We have x bar, we can get y bar, we can get standard error.3701

Let us find x bar, the average number of words heard accurately in silence and that is about 33 words.3708

The average number of words heard correctly with background noise, and that is 29 words.3723

Is the difference of about 4 words big enough to be statistically different?3732

We would take this - this and we know mu = 0 so I am going to ignore that / standard error found up here.3741

That would give us 1.75.3754

1.75 that is more extreme than + or -2.01.3758

1.75 we will actually say do not reject.3765

We should find the p value too.3769

This p value should be greater than .05.3773

We will put in t dist then our sample t, degrees of freedom which is 46 and we want a two tailed and we get .09.3777

.09 is greater than .05.3790

Step 5, fail to reject.3797

Now that we have all that, we want to know is it more sensitive?3809

Can we detect the statistical difference better if we used paired examples?3821

Let us start.3829

Here we would say p =.09 and 5 is failed to reject.3831

It is not outside of our rejection zone, it is inside our fail to reject zone.3846

Let us talk about the null hypotheses here.3855

What we are going to do is find the differences first then the mean of those differences.3860

We are saying if they are indeed not that different from each other that mean different should be 0.3866

The alternative is that the mean difference is not equal to 0.3871

Once again alpha = .05 two tailed and now we will draw our SDOD bar which means3877

it is a standard sampling distribution of means mean made of differences.3892

Here we want to put 0.3909

We probably also want to figure out standard error somewhere along the line,3919

which is going to be s sub d bar which is s sub d ÷ √n sub d.3924

We probably also want to find the degrees of freedom, which is going to be n sub d -1.3933

We probably also want to find the critical t.3941

Let us find out that.3945

Here I will start my paired samples section.3949

I will also start with step 3.3955

Let me move all of these over here.3957

Let us start here with step 33965

Let us find standard error and that is going to be s sub d not d bar ÷ √n sub d.3970

We can find s sub d very easily and we could also find n sub d.3986

First we need to create a column of d.3994

I will find the standard deviation of the d but I realized that I do not have any d.4002

The d look something like this silence - background noise.4008

This is how many more words, they are able to hear accurately in silence and background noise.4020

Here we see that some people hear a lot of words better in silence.4026

Some people here words better with a little bit of background noise.4032

Some people are exactly the same.4035

We could find a standard deviation of all these differences.4037

We could also find the mean of them4045

The n of them will be the same as 24 because there are 24 people that came from.4053

There is 24 differences.4062

We could find out standard error.4065

Standard deviation of d ÷ √24.4070

That is standard error, notice that is quite different from finding a standard error of independent samples.4076

Let us find degrees of freedom for d and that is going to be n sub d -1 and that is 24 -1.4086

Our critical t should be t inverse .05 two tailed=23 and we get 2.07.4101

So far it seems that our standard for how extreme it has to be is more far out.4127

That makes sense because the degrees of freedom is smaller than 46.4134

+ or -2.07.4139

Let us talk about our sample.4152

In order to find our sample t, we want to find the average of difference subtract from the hypothesized mu4154

and divide all of that by standard error to find out how many standard errors away our sample mean difference is.4169

We also want to find p value.4179

Here is step 4, our sample t would be d bar - mu ÷ standard error.4181

What is d bar and how would we find it?4196

Just use the average function and the average of our d like this d bar.4205

We can do d bar -0 / standard error= 2.97.4211

That is more extreme than 2.06.4226

Let us figure out why.4233

We might look at standard error, the standard error is much smaller and the steps are smaller.4235

How many steps we need to take to get all the way out to this d bar?4251

There is more of them than these the bigger steps.4257

These are almost twice as big.4261

These bigger steps, there is few of them that you need.4263

That is what the sample t I get is how many of these standard errors,4267

how many of these steps does it take to get all the way out to d bar or x bar – y bar?4273

We need almost 3 steps out.4279

What is our p value?4282

Our p value should be less than .05 that is going to be t dist.4287

Here is our t value I will put in our degrees of freedom and two tailed and its .007.4293

That certainly less than .05.4303

Step 5, here we reject whereas here we fail to reject.4306

Since there is this difference and we detected it with this one but not with this one,4313

we would say that this is the more sensitive test given that there is something to detect out there.4321

This is the difference if it does exist.4328

This one is a little coarser, there is a couple of reasons for that.4331

One of the reasons is because the standard error are usually larger than the standard error of differences.4336

Another issue is that x bar - y bar, the difference here if we look at x bar - y bar this difference is roughly around the same.4343

This difference is the same as this difference.4362

It is not that bad but it is that you are dividing by a smaller standard error here then you are here.4366

Here, the standard error is quite large.4373

The steps are quite large.4375

Here, the standard errors are small.4376

The steps are quite small.4379

It is because you are taking out some of the variation caused by having some people4380

just being able to hear a lot of words accurately all the time with noise.4385

Some people are very good at hearing anyway.4392

They might have over a low number of scores but with d bar you do not care about those individual differences.4397

You end up accounting for those by subtracting them out.4405

Here this is a more sensitive test.4409

Here we get p=.006 and we reject.4412

Which test is more sensitive?4425

Which test is able to detect the difference, if there is a difference?4431

Paired samples.4435

That principles are little more complicated to collect that data but it is worth it because it is a more sensitive test.4436

Thanks for using www.educator.com.4448

Related Books

Statistics by Witte, 10th Edition

Authors: Robert S. Witte, John S . Witte

ISBN: 1118450531

Publisher: Wiley

Year: 2013

This book provides a clear and methodical approach to essential statistical procedures. It clearly explains the basic concepts and procedures of descriptive and inferential statistical analysis. This book features a new emphasis on expressions involving sums of squares and degrees of freedom as well as a stronger stress on the importance of variability.

Related Books

Name	Description	Link
BookRenter.com	BookRenter.com is simply the most reliable online textbook rental service.	Visit BookRenter.com
PhysicsForums.com Homework Help	Physics Forums is a scientific community for students looking for math & science help.	Visit PhysicsForums.com Homework Help

Statistics Confidence Intervals & Hypothesis Testing for the Difference of Two Paired Means

Share this knowledge with your friends!

Copy & Paste this embed code into your website’s HTML

Discussion

Answer Engine

Download Lecture Slides

Table of Contents

Transcription

Related Books

Answer EngineGet answers to any question!Ask any question related to Statistics

Confidence Intervals & Hypothesis Testing for the Difference of Two Paired Means

General Statistics Online Course

Transcription: Confidence Intervals & Hypothesis Testing for the Difference of Two Paired Means

Related Books

Related Books

Start Learning Now

Membership Overview

Statistics Confidence Intervals & Hypothesis Testing for the Difference of Two Paired Means

Share this knowledge with your friends!

Copy & Paste this embed code into your website’s HTML

Discussion

Answer Engine

Download Lecture Slides

Table of Contents

Transcription

Related Books

Answer EngineGet answers to any question!Ask any question related to Statistics

Confidence Intervals & Hypothesis Testing for the Difference of Two Paired Means

General Statistics Online Course

Transcription: Confidence Intervals & Hypothesis Testing for the Difference of Two Paired Means

Related Books

Related Books

Available 24/7. Unlimited Access to Our Entire Library.

Searchable Lessons

Get Answers & Community Support

Downloadable Lecture Notes

Study Guides, Worksheets and Extra Example Lessons

Start Learning Now

Membership Overview