Dr. Ji Son

Hypothesis Testing for the Difference of Two Independent Means

Slide Duration:

Table of Contents

Section 1: Introduction

Descriptive Statistics vs. Inferential Statistics

25m 31s

Intro

0:00

Roadmap

0:10

Roadmap

0:11

Statistics

0:35

Statistics

0:36

Let's Think About High School Science

1:12

Measurement and Find Patterns (Mathematical Formula)

1:13

Statistics = Math of Distributions

4:58

Distributions

4:59

Problematic… but also GREAT

5:58

Statistics

7:33

How is It Different from Other Specializations in Mathematics?

7:34

Statistics is Fundamental in Natural and Social Sciences

7:53

Two Skills of Statistics

8:20

Description (Exploration)

8:21

Inference

9:13

Descriptive Statistics vs. Inferential Statistics: Apply to Distributions

9:58

Descriptive Statistics

9:59

Inferential Statistics

11:05

Populations vs. Samples

12:19

Populations vs. Samples: Is it the Truth?

12:20

Populations vs. Samples: Pros & Cons

13:36

Populations vs. Samples: Descriptive Values

16:12

Putting Together Descriptive/Inferential Stats & Populations/Samples

17:10

Putting Together Descriptive/Inferential Stats & Populations/Samples

17:11

Example 1: Descriptive Statistics vs. Inferential Statistics

19:09

Example 2: Descriptive Statistics vs. Inferential Statistics

20:47

Example 3: Sample, Parameter, Population, and Statistic

21:40

Example 4: Sample, Parameter, Population, and Statistic

23:28

Section 2: About Samples: Cases, Variables, Measurements

About Samples: Cases, Variables, Measurements

32m 14s

Intro

0:00

Data

0:09

Data, Cases, Variables, and Values

0:10

Rows, Columns, and Cells

2:03

Example: Aircrafts

3:52

How Do We Get Data?

5:38

Research: Question and Hypothesis

5:39

Research Design

7:11

Measurement

7:29

Research Analysis

8:33

Research Conclusion

9:30

Types of Variables

10:03

Discrete Variables

10:04

Continuous Variables

12:07

Types of Measurements

14:17

Types of Measurements

14:18

Types of Measurements (Scales)

17:22

Nominal

17:23

Ordinal

19:11

Interval

21:33

Ratio

24:24

Example 1: Cases, Variables, Measurements

25:20

Example 2: Which Scale of Measurement is Used?

26:55

Example 3: What Kind of a Scale of Measurement is This?

27:26

Example 4: Discrete vs. Continuous Variables.

30:31

Section 3: Visualizing Distributions

Introduction to Excel

8m 9s

Intro

0:00

Before Visualizing Distribution

0:10

Excel

0:11

Excel: Organization

0:45

Workbook

0:46

Column x Rows

1:50

Tools: Menu Bar, Standard Toolbar, and Formula Bar

3:00

Excel + Data

6:07

Exce and Data

6:08

Frequency Distributions in Excel

39m 10s

Intro

0:00

Roadmap

0:08

Data in Excel and Frequency Distributions

0:09

Raw Data to Frequency Tables

0:42

Raw Data to Frequency Tables

0:43

Frequency Tables: Using Formulas and Pivot Tables

1:28

Example 1: Number of Births

7:17

Example 2: Age Distribution

20:41

Example 3: Height Distribution

27:45

Example 4: Height Distribution of Males

32:19

Frequency Distributions and Features

25m 29s

Intro

0:00

Roadmap

0:10

Data in Excel, Frequency Distributions, and Features of Frequency Distributions

0:11

Example #1

1:35

Uniform

1:36

Example #2

2:58

Unimodal, Skewed Right, and Asymmetric

2:59

Example #3

6:29

Bimodal

6:30

Example #4a

8:29

Symmetric, Unimodal, and Normal

8:30

Point of Inflection and Standard Deviation

11:13

Example #4b

12:43

Normal Distribution

12:44

Summary

13:56

Uniform, Skewed, Bimodal, and Normal

13:57

Sketch Problem 1: Driver's License

17:34

Sketch Problem 2: Life Expectancy

20:01

Sketch Problem 3: Telephone Numbers

22:01

Sketch Problem 4: Length of Time Used to Complete a Final Exam

23:43

Dotplots and Histograms in Excel

42m 42s

Intro

0:00

Roadmap

0:06

Roadmap

0:07

Previously

1:02

Data, Frequency Table, and visualization

1:03

Dotplots

1:22

Dotplots Excel Example

1:23

Dotplots: Pros and Cons

7:22

Pros and Cons of Dotplots

7:23

Dotplots Excel Example Cont.

9:07

Histograms

12:47

Histograms Overview

12:48

Example of Histograms

15:29

Histograms: Pros and Cons

31:39

Pros

31:40

Cons

32:31

Frequency vs. Relative Frequency

32:53

Frequency

32:54

Relative Frequency

33:36

Example 1: Dotplots vs. Histograms

34:36

Example 2: Age of Pennies Dotplot

36:21

Example 3: Histogram of Mammal Speeds

38:27

Example 4: Histogram of Life Expectancy

40:30

Stemplots

12m 23s

Intro

0:00

Roadmap

0:05

Roadmap

0:06

What Sets Stemplots Apart?

0:46

Data Sets, Dotplots, Histograms, and Stemplots

0:47

Example 1: What Do Stemplots Look Like?

1:58

Example 2: Back-to-Back Stemplots

5:00

Example 3: Quiz Grade Stemplot

7:46

Example 4: Quiz Grade & Afterschool Tutoring Stemplot

9:56

Bar Graphs

22m 49s

Intro

0:00

Roadmap

0:05

Roadmap

0:08

Review of Frequency Distributions

0:44

Y-axis and X-axis

0:45

Types of Frequency Visualizations Covered so Far

2:16

Introduction to Bar Graphs

4:07

Example 1: Bar Graph

5:32

Example 1: Bar Graph

5:33

Do Shapes, Center, and Spread of Distributions Apply to Bar Graphs?

11:07

Do Shapes, Center, and Spread of Distributions Apply to Bar Graphs?

11:08

Example 2: Create a Frequency Visualization for Gender

14:02

Example 3: Cases, Variables, and Frequency Visualization

16:34

Example 4: What Kind of Graphs are Shown Below?

19:29

Section 4: Summarizing Distributions

Central Tendency: Mean, Median, Mode

38m 50s

Intro

0:00

Roadmap

0:07

Roadmap

0:08

Central Tendency 1

0:56

Way to Summarize a Distribution of Scores

0:57

Mode

1:32

Median

2:02

Mean

2:36

Central Tendency 2

3:47

Mode

3:48

Median

4:20

Mean

5:25

Summation Symbol

6:11

Summation Symbol

6:12

Population vs. Sample

10:46

Population vs. Sample

10:47

Excel Examples

15:08

Finding Mode, Median, and Mean in Excel

15:09

Median vs. Mean

21:45

Effect of Outliers

21:46

Relationship Between Parameter and Statistic

22:44

Type of Measurements

24:00

Which Distributions to Use With

24:55

Example 1: Mean

25:30

Example 2: Using Summation Symbol

29:50

Example 3: Average Calorie Count

32:50

Example 4: Creating an Example Set

35:46

Variability

42m 40s

Intro

0:00

Roadmap

0:05

Roadmap

0:06

Variability (or Spread)

0:45

Variability (or Spread)

0:46

Things to Think About

5:45

Things to Think About

5:46

Range, Quartiles and Interquartile Range

6:37

Range

6:38

Interquartile Range

8:42

Interquartile Range Example

10:58

Interquartile Range Example

10:59

Variance and Standard Deviation

12:27

Deviations

12:28

Sum of Squares

14:35

Variance

16:55

Standard Deviation

17:44

Sum of Squares (SS)

18:34

Sum of Squares (SS)

18:35

Population vs. Sample SD

22:00

Population vs. Sample SD

22:01

Population vs. Sample

23:20

Mean

23:21

23:51

Example 1: Find the Mean and Standard Deviation of the Variable Friends in the Excel File

27:21

Example 2: Find the Mean and Standard Deviation of the Tagged Photos in the Excel File

35:25

Example 3: Sum of Squares

38:58

Example 4: Standard Deviation

41:48

Five Number Summary & Boxplots

57m 15s

Intro

0:00

Roadmap

0:06

Roadmap

0:07

Summarizing Distributions

0:37

Shape, Center, and Spread

0:38

5 Number Summary

1:14

Boxplot: Visualizing 5 Number Summary

3:37

Boxplot: Visualizing 5 Number Summary

3:38

Boxplots on Excel

9:01

Using 'Stocks' and Using Stacked Columns

9:02

Boxplots on Excel Example

10:14

When are Boxplots Useful?

32:14

Pros

32:15

Cons

32:59

How to Determine Outlier Status

33:24

Rule of Thumb: Upper Limit

33:25

Rule of Thumb: Lower Limit

34:16

Signal Outliers in an Excel Data File Using Conditional Formatting

34:52

Modified Boxplot

48:38

Modified Boxplot

48:39

Example 1: Percentage Values & Lower and Upper Whisker

49:10

Example 2: Boxplot

50:10

Example 3: Estimating IQR From Boxplot

53:46

Example 4: Boxplot and Missing Whisker

54:35

Shape: Calculating Skewness & Kurtosis

41m 51s

Intro

0:00

Roadmap

0:16

Roadmap

0:17

Skewness Concept

1:09

Skewness Concept

1:10

Calculating Skewness

3:26

Calculating Skewness

3:27

Interpreting Skewness

7:36

Interpreting Skewness

7:37

Excel Example

8:49

Kurtosis Concept

20:29

Kurtosis Concept

20:30

Calculating Kurtosis

24:17

Calculating Kurtosis

24:18

Interpreting Kurtosis

29:01

Leptokurtic

29:35

Mesokurtic

30:10

Platykurtic

31:06

Excel Example

32:04

Example 1: Shape of Distribution

38:28

Example 2: Shape of Distribution

39:29

Example 3: Shape of Distribution

40:14

Example 4: Kurtosis

41:10

Normal Distribution

34m 33s

Intro

0:00

Roadmap

0:13

Roadmap

0:14

What is a Normal Distribution

0:44

The Normal Distribution As a Theoretical Model

0:45

Possible Range of Probabilities

3:05

Possible Range of Probabilities

3:06

What is a Normal Distribution

5:07

Can Be Described By

5:08

Properties

5:49

'Same' Shape: Illusion of Different Shape!

7:35

'Same' Shape: Illusion of Different Shape!

7:36

Types of Problems

13:45

Example: Distribution of SAT Scores

13:46

Shape Analogy

19:48

Shape Analogy

19:49

Example 1: The Standard Normal Distribution and Z-Scores

22:34

Example 2: The Standard Normal Distribution and Z-Scores

25:54

Example 3: Sketching and Normal Distribution

28:55

Example 4: Sketching and Normal Distribution

32:32

Standard Normal Distributions & Z-Scores

41m 44s

Intro

0:00

Roadmap

0:06

Roadmap

0:07

A Family of Distributions

0:28

Infinite Set of Distributions

0:29

Transforming Normal Distributions to 'Standard' Normal Distribution

1:04

Normal Distribution vs. Standard Normal Distribution

2:58

Normal Distribution vs. Standard Normal Distribution

2:59

Z-Score, Raw Score, Mean, & SD

4:08

Z-Score, Raw Score, Mean, & SD

4:09

Weird Z-Scores

9:40

Weird Z-Scores

9:41

Excel

16:45

For Normal Distributions

16:46

For Standard Normal Distributions

19:11

Excel Example

20:24

Types of Problems

25:18

Percentage Problem: P(x)

25:19

Raw Score and Z-Score Problems

26:28

Standard Deviation Problems

27:01

Shape Analogy

27:44

Shape Analogy

27:45

Example 1: Deaths Due to Heart Disease vs. Deaths Due to Cancer

28:24

Example 2: Heights of Male College Students

33:15

Example 3: Mean and Standard Deviation

37:14

Example 4: Finding Percentage of Values in a Standard Normal Distribution

37:49

Normal Distribution: PDF vs. CDF

55m 44s

Intro

0:00

Roadmap

0:15

Roadmap

0:16

Frequency vs. Cumulative Frequency

0:56

Frequency vs. Cumulative Frequency

0:57

Frequency vs. Cumulative Frequency

4:32

Frequency vs. Cumulative Frequency Cont.

4:33

Calculus in Brief

6:21

Derivative-Integral Continuum

6:22

PDF

10:08

PDF for Standard Normal Distribution

10:09

PDF for Normal Distribution

14:32

Integral of PDF = CDF

21:27

Integral of PDF = CDF

21:28

Example 1: Cumulative Frequency Graph

23:31

Example 2: Mean, Standard Deviation, and Probability

24:43

Example 3: Mean and Standard Deviation

35:50

Example 4: Age of Cars

49:32

Section 5: Linear Regression

Scatterplots

47m 19s

Intro

0:00

Roadmap

0:04

Roadmap

0:05

Previous Visualizations

0:30

Frequency Distributions

0:31

Compare & Contrast

2:26

Frequency Distributions Vs. Scatterplots

2:27

Summary Values

4:53

Shape

4:54

Center & Trend

6:41

Spread & Strength

8:22

Univariate & Bivariate

10:25

Example Scatterplot

10:48

Shape, Trend, and Strength

10:49

Positive and Negative Association

14:05

Positive and Negative Association

14:06

Linearity, Strength, and Consistency

18:30

Linearity

18:31

Strength

19:14

Consistency

20:40

Summarizing a Scatterplot

22:58

Summarizing a Scatterplot

22:59

Example 1: Gapminder.org, Income x Life Expectancy

26:32

Example 2: Gapminder.org, Income x Infant Mortality

36:12

Example 3: Trend and Strength of Variables

40:14

Example 4: Trend, Strength and Shape for Scatterplots

43:27

Regression

32m 2s

Intro

0:00

Roadmap

0:05

Roadmap

0:06

Linear Equations

0:34

Linear Equations: y = mx + b

0:35

Rough Line

5:16

Rough Line

5:17

Regression - A 'Center' Line

7:41

Reasons for Summarizing with a Regression Line

7:42

Predictor and Response Variable

10:04

Goal of Regression

12:29

Goal of Regression

12:30

Prediction

14:50

Example: Servings of Mile Per Year Shown By Age

14:51

Intrapolation

17:06

Extrapolation

17:58

Error in Prediction

20:34

Prediction Error

20:35

Residual

21:40

Example 1: Residual

23:34

Example 2: Large and Negative Residual

26:30

Example 3: Positive Residual

28:13

Example 4: Interpret Regression Line & Extrapolate

29:40

Least Squares Regression

56m 36s

Intro

0:00

Roadmap

0:13

Roadmap

0:14

Best Fit

0:47

Best Fit

0:48

Sum of Squared Errors (SSE)

1:50

Sum of Squared Errors (SSE)

1:51

Why Squared?

3:38

Why Squared?

3:39

Quantitative Properties of Regression Line

4:51

Quantitative Properties of Regression Line

4:52

So How do we Find Such a Line?

6:49

SSEs of Different Line Equations & Lowest SSE

6:50

Carl Gauss' Method

8:01

How Do We Find Slope (b1)

11:00

How Do We Find Slope (b1)

11:01

Hoe Do We Find Intercept

15:11

Hoe Do We Find Intercept

15:12

Example 1: Which of These Equations Fit the Above Data Best?

17:18

Example 2: Find the Regression Line for These Data Points and Interpret It

26:31

Example 3: Summarize the Scatterplot and Find the Regression Line.

34:31

Example 4: Examine the Mean of Residuals

43:52

Correlation

43m 58s

Intro

0:00

Roadmap

0:05

Roadmap

0:06

Summarizing a Scatterplot Quantitatively

0:47

Shape

0:48

Trend

1:11

Strength: Correlation ®

1:45

Correlation Coefficient ( r )

2:30

Correlation Coefficient ( r )

2:31

Trees vs. Forest

11:59

Trees vs. Forest

12:00

Calculating r

15:07

Average Product of z-scores for x and y

15:08

Relationship between Correlation and Slope

21:10

Relationship between Correlation and Slope

21:11

Example 1: Find the Correlation between Grams of Fat and Cost

24:11

Example 2: Relationship between r and b1

30:24

Example 3: Find the Regression Line

33:35

Example 4: Find the Correlation Coefficient for this Set of Data

37:37

Correlation: r vs. r-squared

52m 52s

Intro

0:00

Roadmap

0:07

Roadmap

0:08

R-squared

0:44

What is the Meaning of It? Why Squared?

0:45

Parsing Sum of Squared (Parsing Variability)

2:25

SST = SSR + SSE

2:26

What is SST and SSE?

7:46

What is SST and SSE?

7:47

r-squared

18:33

Coefficient of Determination

18:34

If the Correlation is Strong…

20:25

If the Correlation is Strong…

20:26

If the Correlation is Weak…

22:36

If the Correlation is Weak…

22:37

Example 1: Find r-squared for this Set of Data

23:56

Example 2: What Does it Mean that the Simple Linear Regression is a 'Model' of Variance?

33:54

Example 3: Why Does r-squared Only Range from 0 to 1

37:29

Example 4: Find the r-squared for This Set of Data

39:55

Transformations of Data

27m 8s

Intro

0:00

Roadmap

0:05

Roadmap

0:06

Why Transform?

0:26

Why Transform?

0:27

Shape-preserving vs. Shape-changing Transformations

5:14

Shape-preserving = Linear Transformations

5:15

Shape-changing Transformations = Non-linear Transformations

6:20

Common Shape-Preserving Transformations

7:08

Common Shape-Preserving Transformations

7:09

Common Shape-Changing Transformations

8:59

Powers

9:00

Logarithms

9:39

Change Just One Variable? Both?

10:38

Log-log Transformations

10:39

Log Transformations

14:38

Example 1: Create, Graph, and Transform the Data Set

15:19

Example 2: Create, Graph, and Transform the Data Set

20:08

Example 3: What Kind of Model would You Choose for this Data?

22:44

Example 4: Transformation of Data

25:46

Section 6: Collecting Data in an Experiment

Sampling & Bias

54m 44s

Intro

0:00

Roadmap

0:05

Roadmap

0:06

Descriptive vs. Inferential Statistics

1:04

Descriptive Statistics: Data Exploration

1:05

Example

2:03

To tackle Generalization…

4:31

Generalization

4:32

Sampling

6:06

'Good' Sample

6:40

Defining Samples and Populations

8:55

Population

8:56

Sample

11:16

Why Use Sampling?

13:09

Why Use Sampling?

13:10

Goal of Sampling: Avoiding Bias

15:04

What is Bias?

15:05

Where does Bias Come from: Sampling Bias

17:53

Where does Bias Come from: Response Bias

18:27

Sampling Bias: Bias from Bas Sampling Methods

19:34

Size Bias

19:35

Voluntary Response Bias

21:13

Convenience Sample

22:22

Judgment Sample

23:58

Inadequate Sample Frame

25:40

Response Bias: Bias from 'Bad' Data Collection Methods

28:00

Nonresponse Bias

29:31

Questionnaire Bias

31:10

Incorrect Response or Measurement Bias

37:32

Example 1: What Kind of Biases?

40:29

Example 2: What Biases Might Arise?

44:46

Example 3: What Kind of Biases?

48:34

Example 4: What Kind of Biases?

51:43

Sampling Methods

14m 25s

Intro

0:00

Roadmap

0:05

Roadmap

0:06

Biased vs. Unbiased Sampling Methods

0:32

Biased Sampling

0:33

Unbiased Sampling

1:13

Probability Sampling Methods

2:31

Simple Random

2:54

Stratified Random Sampling

4:06

Cluster Sampling

5:24

Two-staged Sampling

6:22

Systematic Sampling

7:25

Example 1: Which Type(s) of Sampling was this?

8:33

Example 2: Describe How to Take a Two-Stage Sample from this Book

10:16

Example 3: Sampling Methods

11:58

Example 4: Cluster Sample Plan

12:48

Research Design

53m 54s

Intro

0:00

Roadmap

0:06

Roadmap

0:07

Descriptive vs. Inferential Statistics

0:51

Descriptive Statistics: Data Exploration

0:52

Inferential Statistics

1:02

Variables and Relationships

1:44

Variables

1:45

Relationships

2:49

Not Every Type of Study is an Experiment…

4:16

Category I - Descriptive Study

4:54

Category II - Correlational Study

5:50

Category III - Experimental, Quasi-experimental, Non-experimental

6:33

Category III

7:42

Experimental, Quasi-experimental, and Non-experimental

7:43

Why CAN'T the Other Strategies Determine Causation?

10:18

Third-variable Problem

10:19

Directionality Problem

15:49

What Makes Experiments Special?

17:54

Manipulation

17:55

Control (and Comparison)

21:58

Methods of Control

26:38

Holding Constant

26:39

Matching

29:11

Random Assignment

31:48

Experiment Terminology

34:09

'true' Experiment vs. Study

34:10

Independent Variable (IV)

35:16

Dependent Variable (DV)

35:45

Factors

36:07

Treatment Conditions

36:23

Levels

37:43

Confounds or Extraneous Variables

38:04

Blind

38:38

Blind Experiments

38:39

Double-blind Experiments

39:29

How Categories Relate to Statistics

41:35

Category I - Descriptive Study

41:36

Category II - Correlational Study

42:05

Category III - Experimental, Quasi-experimental, Non-experimental

42:43

Example 1: Research Design

43:50

Example 2: Research Design

47:37

Example 3: Research Design

50:12

Example 4: Research Design

52:00

Between and Within Treatment Variability

41m 31s

Intro

0:00

Roadmap

0:06

Roadmap

0:07

Experimental Designs

0:51

Experimental Designs: Manipulation & Control

0:52

Two Types of Variability

2:09

Between Treatment Variability

2:10

Within Treatment Variability

3:31

Updated Goal of Experimental Design

5:47

Updated Goal of Experimental Design

5:48

Example: Drugs and Driving

6:56

Example: Drugs and Driving

6:57

Different Types of Random Assignment

11:27

All Experiments

11:28

Completely Random Design

12:02

Randomized Block Design

13:19

Randomized Block Design

15:48

Matched Pairs Design

15:49

Repeated Measures Design

19:47

Between-subject Variable vs. Within-subject Variable

22:43

Completely Randomized Design

22:44

Repeated Measures Design

25:03

Example 1: Design a Completely Random, Matched Pair, and Repeated Measures Experiment

26:16

Example 2: Block Design

31:41

Example 3: Completely Randomized Designs

35:11

Example 4: Completely Random, Matched Pairs, or Repeated Measures Experiments?

39:01

Section 7: Review of Probability Axioms

Sample Spaces

37m 52s

Intro

0:00

Roadmap

0:07

Roadmap

0:08

Why is Probability Involved in Statistics

0:48

Probability

0:49

Can People Tell the Difference between Cheap and Gourmet Coffee?

2:08

Taste Test with Coffee Drinkers

3:37

If No One can Actually Taste the Difference

3:38

If Everyone can Actually Taste the Difference

5:36

Creating a Probability Model

7:09

Creating a Probability Model

7:10

D'Alembert vs. Necker

9:41

D'Alembert vs. Necker

9:42

Problem with D'Alembert's Model

13:29

Problem with D'Alembert's Model

13:30

Covering Entire Sample Space

15:08

Fundamental Principle of Counting

15:09

Where Do Probabilities Come From?

22:54

Observed Data, Symmetry, and Subjective Estimates

22:55

Checking whether Model Matches Real World

24:27

Law of Large Numbers

24:28

Example 1: Law of Large Numbers

27:46

Example 2: Possible Outcomes

30:43

Example 3: Brands of Coffee and Taste

33:25

Example 4: How Many Different Treatments are there?

35:33

Addition Rule for Disjoint Events

20m 29s

Intro

0:00

Roadmap

0:08

Roadmap

0:09

Disjoint Events

0:41

Disjoint Events

0:42

Meaning of 'or'

2:39

In Regular Life

2:40

In Math/Statistics/Computer Science

3:10

Addition Rule for Disjoin Events

3:55

If A and B are Disjoint: P (A and B)

3:56

If A and B are Disjoint: P (A or B)

5:15

General Addition Rule

5:41

General Addition Rule

5:42

Generalized Addition Rule

8:31

If A and B are not Disjoint: P (A or B)

8:32

Example 1: Which of These are Mutually Exclusive?

10:50

Example 2: What is the Probability that You will Have a Combination of One Heads and Two Tails?

12:57

Example 3: Engagement Party

15:17

Example 4: Home Owner's Insurance

18:30

Conditional Probability

57m 19s

Intro

0:00

Roadmap

0:05

Roadmap

0:06

'or' vs. 'and' vs. Conditional Probability

1:07

'or' vs. 'and' vs. Conditional Probability

1:08

'and' vs. Conditional Probability

5:57

P (M or L)

5:58

P (M and L)

8:41

P (M|L)

11:04

P (L|M)

12:24

Tree Diagram

15:02

Tree Diagram

15:03

Defining Conditional Probability

22:42

Defining Conditional Probability

22:43

Common Contexts for Conditional Probability

30:56

Medical Testing: Positive Predictive Value

30:57

Medical Testing: Sensitivity

33:03

Statistical Tests

34:27

Example 1: Drug and Disease

36:41

Example 2: Marbles and Conditional Probability

40:04

Example 3: Cards and Conditional Probability

45:59

Example 4: Votes and Conditional Probability

50:21

Independent Events

24m 27s

Intro

0:00

Roadmap

0:05

Roadmap

0:06

Independent Events & Conditional Probability

0:26

Non-independent Events

0:27

Independent Events

2:00

Non-independent and Independent Events

3:08

Non-independent and Independent Events

3:09

Defining Independent Events

5:52

Defining Independent Events

5:53

Multiplication Rule

7:29

Previously…

7:30

But with Independent Evens

8:53

Example 1: Which of These Pairs of Events are Independent?

11:12

Example 2: Health Insurance and Probability

15:12

Example 3: Independent Events

17:42

Example 4: Independent Events

20:03

Section 8: Probability Distributions

Introduction to Probability Distributions

56m 45s

Intro

0:00

Roadmap

0:08

Roadmap

0:09

Sampling vs. Probability

0:57

Sampling

0:58

Missing

1:30

What is Missing?

3:06

Insight: Probability Distributions

5:26

Insight: Probability Distributions

5:27

What is a Probability Distribution?

7:29

From Sample Spaces to Probability Distributions

8:44

Sample Space

8:45

Probability Distribution of the Sum of Two Die

11:16

The Random Variable

17:43

The Random Variable

17:44

Expected Value

21:52

Expected Value

21:53

Example 1: Probability Distributions

28:45

Example 2: Probability Distributions

35:30

Example 3: Probability Distributions

43:37

Example 4: Probability Distributions

47:20

Expected Value & Variance of Probability Distributions

53m 41s

Intro

0:00

Roadmap

0:06

Roadmap

0:07

Discrete vs. Continuous Random Variables

1:04

Discrete vs. Continuous Random Variables

1:05

Mean and Variance Review

4:44

Mean: Sample, Population, and Probability Distribution

4:45

Variance: Sample, Population, and Probability Distribution

9:12

Example Situation

14:10

Example Situation

14:11

Some Special Cases…

16:13

Some Special Cases…

16:14

Linear Transformations

19:22

Linear Transformations

19:23

What Happens to Mean and Variance of the Probability Distribution?

20:12

n Independent Values of X

25:38

n Independent Values of X

25:39

Compare These Two Situations

30:56

Compare These Two Situations

30:57

Two Random Variables, X and Y

32:02

Two Random Variables, X and Y

32:03

Example 1: Expected Value & Variance of Probability Distributions

35:35

Example 2: Expected Values & Standard Deviation

44:17

Example 3: Expected Winnings and Standard Deviation

48:18

Binomial Distribution

55m 15s

Intro

0:00

Roadmap

0:05

Roadmap

0:06

Discrete Probability Distributions

1:42

Discrete Probability Distributions

1:43

Binomial Distribution

2:36

Binomial Distribution

2:37

Multiplicative Rule Review

6:54

Multiplicative Rule Review

6:55

How Many Outcomes with k 'Successes'

10:23

Adults and Bachelor's Degree: Manual List of Outcomes

10:24

P (X=k)

19:37

Putting Together # of Outcomes with the Multiplicative Rule

19:38

Expected Value and Standard Deviation in a Binomial Distribution

25:22

Expected Value and Standard Deviation in a Binomial Distribution

25:23

Example 1: Coin Toss

33:42

Example 2: College Graduates

38:03

Example 3: Types of Blood and Probability

45:39

Example 4: Expected Number and Standard Deviation

51:11

Section 9: Sampling Distributions of Statistics

Introduction to Sampling Distributions

48m 17s

Intro

0:00

Roadmap

0:08

Roadmap

0:09

Probability Distributions vs. Sampling Distributions

0:55

Probability Distributions vs. Sampling Distributions

0:56

Same Logic

3:55

Logic of Probability Distribution

3:56

Example: Rolling Two Die

6:56

Simulating Samples

9:53

To Come Up with Probability Distributions

9:54

In Sampling Distributions

11:12

Connecting Sampling and Research Methods with Sampling Distributions

12:11

Connecting Sampling and Research Methods with Sampling Distributions

12:12

Simulating a Sampling Distribution

14:14

Experimental Design: Regular Sleep vs. Less Sleep

14:15

Logic of Sampling Distributions

23:08

Logic of Sampling Distributions

23:09

General Method of Simulating Sampling Distributions

25:38

General Method of Simulating Sampling Distributions

25:39

Questions that Remain

28:45

Questions that Remain

28:46

Example 1: Mean and Standard Error of Sampling Distribution

30:57

Example 2: What is the Best Way to Describe Sampling Distributions?

37:12

Example 3: Matching Sampling Distributions

38:21

Example 4: Mean and Standard Error of Sampling Distribution

41:51

Sampling Distribution of the Mean

1h 8m 48s

Intro

0:00

Roadmap

0:05

Roadmap

0:06

Special Case of General Method for Simulating a Sampling Distribution

1:53

Special Case of General Method for Simulating a Sampling Distribution

1:54

Computer Simulation

3:43

Using Simulations to See Principles behind Shape of SDoM

15:50

Using Simulations to See Principles behind Shape of SDoM

15:51

Conditions

17:38

Using Simulations to See Principles behind Center (Mean) of SDoM

20:15

Using Simulations to See Principles behind Center (Mean) of SDoM

20:16

Conditions: Does n Matter?

21:31

Conditions: Does Number of Simulation Matter?

24:37

Using Simulations to See Principles behind Standard Deviation of SDoM

27:13

Using Simulations to See Principles behind Standard Deviation of SDoM

27:14

Conditions: Does n Matter?

34:45

Conditions: Does Number of Simulation Matter?

36:24

Central Limit Theorem

37:13

SHAPE

38:08

CENTER

39:34

SPREAD

39:52

Comparing Population, Sample, and SDoM

43:10

Comparing Population, Sample, and SDoM

43:11

Answering the 'Questions that Remain'

48:24

What Happens When We Don't Know What the Population Looks Like?

48:25

Can We Have Sampling Distributions for Summary Statistics Other than the Mean?

49:42

How Do We Know whether a Sample is Sufficiently Unlikely?

53:36

Do We Always Have to Simulate a Large Number of Samples in Order to get a Sampling Distribution?

54:40

Example 1: Mean Batting Average

55:25

Example 2: Mean Sampling Distribution and Standard Error

59:07

Example 3: Sampling Distribution of the Mean

1:01:04

Sampling Distribution of Sample Proportions

54m 37s

Intro

0:00

Roadmap

0:06

Roadmap

0:07

Intro to Sampling Distribution of Sample Proportions (SDoSP)

0:51

Categorical Data (Examples)

0:52

Wish to Estimate Proportion of Population from Sample…

2:00

Notation

3:34

Population Proportion and Sample Proportion Notations

3:35

What's the Difference?

9:19

SDoM vs. SDoSP: Type of Data

9:20

SDoM vs. SDoSP: Shape

11:24

SDoM vs. SDoSP: Center

12:30

SDoM vs. SDoSP: Spread

15:34

Binomial Distribution vs. Sampling Distribution of Sample Proportions

19:14

Binomial Distribution vs. SDoSP: Type of Data

19:17

Binomial Distribution vs. SDoSP: Shape

21:07

Binomial Distribution vs. SDoSP: Center

21:43

Binomial Distribution vs. SDoSP: Spread

24:08

Example 1: Sampling Distribution of Sample Proportions

26:07

Example 2: Sampling Distribution of Sample Proportions

37:58

Example 3: Sampling Distribution of Sample Proportions

44:42

Example 4: Sampling Distribution of Sample Proportions

45:57

Section 10: Inferential Statistics

Introduction to Confidence Intervals

42m 53s

Intro

0:00

Roadmap

0:06

Roadmap

0:07

Inferential Statistics

0:50

Inferential Statistics

0:51

Two Problems with This Picture…

3:20

Two Problems with This Picture…

3:21

Solution: Confidence Intervals (CI)

4:59

Solution: Hypotheiss Testing (HT)

5:49

Which Parameters are Known?

6:45

Which Parameters are Known?

6:46

Confidence Interval - Goal

7:56

When We Don't Know m but know s

7:57

When We Don't Know

18:27

When We Don't Know m nor s

18:28

Example 1: Confidence Intervals

26:18

Example 2: Confidence Intervals

29:46

Example 3: Confidence Intervals

32:18

Example 4: Confidence Intervals

38:31

t Distributions

1h 2m 6s

Intro

0:00

Roadmap

0:04

Roadmap

0:05

When to Use z vs. t?

1:07

When to Use z vs. t?

1:08

What is z and t?

3:02

z-score and t-score: Commonality

3:03

z-score and t-score: Formulas

3:34

z-score and t-score: Difference

5:22

Why not z? (Why t?)

7:24

Why not z? (Why t?)

7:25

But Don't Worry!

15:13

Gossett and t-distributions

15:14

Rules of t Distributions

17:05

t-distributions are More Normal as n Gets Bigger

17:06

t-distributions are a Family of Distributions

18:55

Degrees of Freedom (df)

20:02

Degrees of Freedom (df)

20:03

t Family of Distributions

24:07

t Family of Distributions : df = 2 , 4, and 60

24:08

df = 60

29:16

df = 2

29:59

How to Find It?

31:01

'Student's t-distribution' or 't-distribution'

31:02

Excel Example

33:06

Example 1: Which Distribution Do You Use? Z or t?

45:26

Example 2: Friends on Facebook

47:41

Example 3: t Distributions

52:15

Example 4: t Distributions , confidence interval, and mean

55:59

Introduction to Hypothesis Testing

1h 6m 33s

Intro

0:00

Roadmap

0:06

Roadmap

0:07

Issues to Overcome in Inferential Statistics

1:35

Issues to Overcome in Inferential Statistics

1:36

What Happens When We Don't Know What the Population Looks Like?

2:57

How Do We Know whether a sample is Sufficiently Unlikely

3:43

Hypothesizing a Population

6:44

Hypothesizing a Population

6:45

Null Hypothesis

8:07

Alternative Hypothesis

8:56

Hypotheses

11:58

Hypotheses

11:59

Errors in Hypothesis Testing

14:22

Errors in Hypothesis Testing

14:23

Steps of Hypothesis Testing

21:15

Steps of Hypothesis Testing

21:16

Single Sample HT ( When Sigma Available)

26:08

Example: Average Facebook Friends

26:09

Step1

27:08

Step 2

27:58

Step 3

28:17

Step 4

32:18

Single Sample HT (When Sigma Not Available)

36:33

Example: Average Facebook Friends

36:34

Step1: Hypothesis Testing

36:58

Step 2: Significance Level

37:25

Step 3: Decision Stage

37:40

Step 4: Sample

41:36

Sigma and p-value

45:04

Sigma and p-value

45:05

On tailed vs. Two Tailed Hypotheses

45:51

Example 1: Hypothesis Testing

48:37

Example 2: Heights of Women in the US

57:43

Example 3: Select the Best Way to Complete This Sentence

1:03:23

Confidence Intervals for the Difference of Two Independent Means

55m 14s

Intro

0:00

Roadmap

0:14

Roadmap

0:15

One Mean vs. Two Means

1:17

One Mean vs. Two Means

1:18

Notation

2:41

A Sample! A Set!

2:42

Mean of X, Mean of Y, and Difference of Two Means

3:56

SE of X

4:34

SE of Y

6:28

Sampling Distribution of the Difference between Two Means (SDoD)

7:48

Sampling Distribution of the Difference between Two Means (SDoD)

7:49

Rules of the SDoD (similar to CLT!)

15:00

Mean for the SDoD Null Hypothesis

15:01

Standard Error

17:39

When can We Construct a CI for the Difference between Two Means?

21:28

Three Conditions

21:29

Finding CI

23:56

One Mean CI

23:57

Two Means CI

25:45

Finding t

29:16

Finding t

29:17

Interpreting CI

30:25

Interpreting CI

30:26

Better Estimate of s (s pool)

34:15

Better Estimate of s (s pool)

34:16

Example 1: Confidence Intervals

42:32

Example 2: SE of the Difference

52:36

Hypothesis Testing for the Difference of Two Independent Means

50m

Intro

0:00

Roadmap

0:06

Roadmap

0:07

The Goal of Hypothesis Testing

0:56

One Sample and Two Samples

0:57

Sampling Distribution of the Difference between Two Means (SDoD)

3:42

Sampling Distribution of the Difference between Two Means (SDoD)

3:43

Rules of the SDoD (Similar to CLT!)

6:46

Shape

6:47

Mean for the Null Hypothesis

7:26

Standard Error for Independent Samples (When Variance is Homogenous)

8:18

Standard Error for Independent Samples (When Variance is not Homogenous)

9:25

Same Conditions for HT as for CI

10:08

Three Conditions

10:09

Steps of Hypothesis Testing

11:04

Steps of Hypothesis Testing

11:05

Formulas that Go with Steps of Hypothesis Testing

13:21

Step 1

13:25

Step 2

14:18

Step 3

15:00

Step 4

16:57

Example 1: Hypothesis Testing for the Difference of Two Independent Means

18:47

Example 2: Hypothesis Testing for the Difference of Two Independent Means

33:55

Example 3: Hypothesis Testing for the Difference of Two Independent Means

44:22

Confidence Intervals & Hypothesis Testing for the Difference of Two Paired Means

1h 14m 11s

Intro

0:00

Roadmap

0:09

Roadmap

0:10

The Goal of Hypothesis Testing

1:27

One Sample and Two Samples

1:28

Independent Samples vs. Paired Samples

3:16

Independent Samples vs. Paired Samples

3:17

Which is Which?

5:20

Independent SAMPLES vs. Independent VARIABLES

7:43

independent SAMPLES vs. Independent VARIABLES

7:44

T-tests Always…

10:48

T-tests Always…

10:49

Notation for Paired Samples

12:59

Notation for Paired Samples

13:00

Steps of Hypothesis Testing for Paired Samples

16:13

Steps of Hypothesis Testing for Paired Samples

16:14

Rules of the SDoD (Adding on Paired Samples)

18:03

Shape

18:04

Mean for the Null Hypothesis

18:31

Standard Error for Independent Samples (When Variance is Homogenous)

19:25

Standard Error for Paired Samples

20:39

Formulas that go with Steps of Hypothesis Testing

22:59

Formulas that go with Steps of Hypothesis Testing

23:00

Confidence Intervals for Paired Samples

30:32

Confidence Intervals for Paired Samples

30:33

Example 1: Confidence Intervals & Hypothesis Testing for the Difference of Two Paired Means

32:28

Example 2: Confidence Intervals & Hypothesis Testing for the Difference of Two Paired Means

44:02

Example 3: Confidence Intervals & Hypothesis Testing for the Difference of Two Paired Means

52:23

Type I and Type II Errors

31m 27s

Intro

0:00

Roadmap

0:18

Roadmap

0:19

Errors and Relationship to HT and the Sample Statistic?

1:11

Errors and Relationship to HT and the Sample Statistic?

1:12

Instead of a Box…Distributions!

7:00

One Sample t-test: Friends on Facebook

7:01

Two Sample t-test: Friends on Facebook

13:46

Usually, Lots of Overlap between Null and Alternative Distributions

16:59

Overlap between Null and Alternative Distributions

17:00

How Distributions and 'Box' Fit Together

22:45

How Distributions and 'Box' Fit Together

22:46

Example 1: Types of Errors

25:54

Example 2: Types of Errors

27:30

Example 3: What is the Danger of the Type I Error?

29:38

Effect Size & Power

44m 41s

Intro

0:00

Roadmap

0:05

Roadmap

0:06

Distance between Distributions: Sample t

0:49

Distance between Distributions: Sample t

0:50

Problem with Distance in Terms of Standard Error

2:56

Problem with Distance in Terms of Standard Error

2:57

Test Statistic (t) vs. Effect Size (d or g)

4:38

Test Statistic (t) vs. Effect Size (d or g)

4:39

Rules of Effect Size

6:09

Rules of Effect Size

6:10

Why Do We Need Effect Size?

8:21

Tells You the Practical Significance

8:22

HT can be Deceiving…

10:25

Important Note

10:42

What is Power?

11:20

What is Power?

11:21

Why Do We Need Power?

14:19

Conditional Probability and Power

14:20

Power is:

16:27

Can We Calculate Power?

19:00

Can We Calculate Power?

19:01

How Does Alpha Affect Power?

20:36

How Does Alpha Affect Power?

20:37

How Does Effect Size Affect Power?

25:38

How Does Effect Size Affect Power?

25:39

How Does Variability and Sample Size Affect Power?

27:56

How Does Variability and Sample Size Affect Power?

27:57

How Do We Increase Power?

32:47

Increasing Power

32:48

Example 1: Effect Size & Power

35:40

Example 2: Effect Size & Power

37:38

Example 3: Effect Size & Power

40:55

Section 11: Analysis of Variance

F-distributions

24m 46s

Intro

0:00

Roadmap

0:04

Roadmap

0:05

Z- & T-statistic and Their Distribution

0:34

Z- & T-statistic and Their Distribution

0:35

F-statistic

4:55

The F Ration ( the Variance Ratio)

4:56

F-distribution

12:29

F-distribution

12:30

s and p-value

15:00

s and p-value

15:01

Example 1: Why Does F-distribution Stop At 0 But Go On Until Infinity?

18:33

Example 2: F-distributions

19:29

Example 3: F-distributions and Heights

21:29

ANOVA with Independent Samples

1h 9m 25s

Intro

0:00

Roadmap

0:05

Roadmap

0:06

The Limitations of t-tests

1:12

The Limitations of t-tests

1:13

Two Major Limitations of Many t-tests

3:26

Two Major Limitations of Many t-tests

3:27

Ronald Fisher's Solution… F-test! New Null Hypothesis

4:43

Ronald Fisher's Solution… F-test! New Null Hypothesis (Omnibus Test - One Test to Rule Them All!)

4:44

Analysis of Variance (ANoVA) Notation

7:47

Analysis of Variance (ANoVA) Notation

7:48

Partitioning (Analyzing) Variance

9:58

Total Variance

9:59

Within-group Variation

14:00

Between-group Variation

16:22

Time out: Review Variance & SS

17:05

Time out: Review Variance & SS

17:06

F-statistic

19:22

The F Ratio (the Variance Ratio)

19:23

S²bet = SSbet / dfbet

22:13

What is This?

22:14

How Many Means?

23:20

So What is the dfbet?

23:38

So What is SSbet?

24:15

S²w = SSw / dfw

26:05

What is This?

26:06

How Many Means?

27:20

So What is the dfw?

27:36

So What is SSw?

28:18

Chart of Independent Samples ANOVA

29:25

Chart of Independent Samples ANOVA

29:26

Example 1: Who Uploads More Photos: Unknown Ethnicity, Latino, Asian, Black, or White Facebook Users?

35:52

Hypotheses

35:53

Significance Level

39:40

Decision Stage

40:05

Calculate Samples' Statistic and p-Value

44:10

Reject or Fail to Reject H0

55:54

Example 2: ANOVA with Independent Samples

58:21

Repeated Measures ANOVA

1h 15m 13s

Intro

0:00

Roadmap

0:05

Roadmap

0:06

The Limitations of t-tests

0:36

Who Uploads more Pictures and Which Photo-Type is Most Frequently Used on Facebook?

0:37

ANOVA (F-test) to the Rescue!

5:49

Omnibus Hypothesis

5:50

Analyze Variance

7:27

Independent Samples vs. Repeated Measures

9:12

Same Start

9:13

Independent Samples ANOVA

10:43

Repeated Measures ANOVA

12:00

Independent Samples ANOVA

16:00

Same Start: All the Variance Around Grand Mean

16:01

Independent Samples

16:23

Repeated Measures ANOVA

18:18

Same Start: All the Variance Around Grand Mean

18:19

Repeated Measures

18:33

Repeated Measures F-statistic

21:22

The F Ratio (The Variance Ratio)

21:23

S²bet = SSbet / dfbet

23:07

What is This?

23:08

How Many Means?

23:39

So What is the dfbet?

23:54

So What is SSbet?

24:32

S² resid = SS resid / df resid

25:46

What is This?

25:47

So What is SS resid?

26:44

So What is the df resid?

27:36

SS subj and df subj

28:11

What is This?

28:12

How Many Subject Means?

29:43

So What is df subj?

30:01

So What is SS subj?

30:09

SS total and df total

31:42

What is This?

31:43

What is the Total Number of Data Points?

32:02

So What is df total?

32:34

so What is SS total?

32:47

Chart of Repeated Measures ANOVA

33:19

Chart of Repeated Measures ANOVA: F and Between-samples Variability

33:20

Chart of Repeated Measures ANOVA: Total Variability, Within-subject (case) Variability, Residual Variability

35:50

Example 1: Which is More Prevalent on Facebook: Tagged, Uploaded, Mobile, or Profile Photos?

40:25

Hypotheses

40:26

Significance Level

41:46

Decision Stage

42:09

Calculate Samples' Statistic and p-Value

46:18

Reject or Fail to Reject H0

57:55

Example 2: Repeated Measures ANOVA

58:57

Example 3: What's the Problem with a Bunch of Tiny t-tests?

1:13:59

Section 12: Chi-square Test

Chi-Square Goodness-of-Fit Test

58m 23s

Intro

0:00

Roadmap

0:05

Roadmap

0:06

Where Does the Chi-Square Test Belong?

0:50

Where Does the Chi-Square Test Belong?

0:51

A New Twist on HT: Goodness-of-Fit

7:23

HT in General

7:24

Goodness-of-Fit HT

8:26

Hypotheses about Proportions

12:17

Null Hypothesis

12:18

Alternative Hypothesis

13:23

Example

14:38

Chi-Square Statistic

17:52

Chi-Square Statistic

17:53

Chi-Square Distributions

24:31

Chi-Square Distributions

24:32

Conditions for Chi-Square

28:58

Condition 1

28:59

Condition 2

30:20

Condition 3

30:32

Condition 4

31:47

Example 1: Chi-Square Goodness-of-Fit Test

32:23

Example 2: Chi-Square Goodness-of-Fit Test

44:34

Example 3: Which of These Statements Describe Properties of the Chi-Square Goodness-of-Fit Test?

56:06

Chi-Square Test of Homogeneity

51m 36s

Intro

0:00

Roadmap

0:09

Roadmap

0:10

Goodness-of-Fit vs. Homogeneity

1:13

Goodness-of-Fit HT

1:14

Homogeneity

2:00

Analogy

2:38

Hypotheses About Proportions

5:00

Null Hypothesis

5:01

Alternative Hypothesis

6:11

Example

6:33

Chi-Square Statistic

10:12

Same as Goodness-of-Fit Test

10:13

Set Up Data

12:28

Setting Up Data Example

12:29

Expected Frequency

16:53

Expected Frequency

16:54

Chi-Square Distributions & df

19:26

Chi-Square Distributions & df

19:27

Conditions for Test of Homogeneity

20:54

Condition 1

20:55

Condition 2

21:39

Condition 3

22:05

Condition 4

22:23

Example 1: Chi-Square Test of Homogeneity

22:52

Example 2: Chi-Square Test of Homogeneity

32:10

Section 13: Overview of Statistics

Overview of Statistics

18m 11s

Intro

0:00

Roadmap

0:07

Roadmap

0:08

The Statistical Tests (HT) We've Covered

0:28

The Statistical Tests (HT) We've Covered

0:29

Organizing the Tests We've Covered…

1:08

One Sample: Continuous DV and Categorical DV

1:09

Two Samples: Continuous DV and Categorical DV

5:41

More Than Two Samples: Continuous DV and Categorical DV

8:21

The Following Data: OK Cupid

10:10

The Following Data: OK Cupid

10:11

Example 1: Weird-MySpace-Angle Profile Photo

10:38

Example 2: Geniuses

12:30

Example 3: Promiscuous iPhone Users

13:37

Example 4: Women, Aging, and Messaging

16:07

This is a quick preview of the lesson. For full access, please Log In or Sign up.
For more information, please see full course syllabus of Statistics

Statistics Hypothesis Testing for the Difference of Two Independent Means

Name: Statistics: Hypothesis Testing for the Difference of Two Independent Means
Brand: Educator.com
Price: 35 USD
Availability: InStock

Section 10: Inferential Statistics: Lecture 5 | 50:00 min

Lecture Description

Next Lecture

Previous Lecture

Discussion
Answer Engine
Download Lecture Slides
Table of Contents
Transcription
Related Books

Please login to ask a question and view discussion.

Start Learning Now

Our free lessons will get you started (Adobe Flash^® required).
Get immediate access to our entire library.

Membership Overview

Unlimited access to our entire library of courses.
Search and jump to exactly what you want to learn.
*Ask questions and get answers from the community and our teachers!
Practice questions with step-by-step solutions.
Download lesson files for programming and software training practice.
Track your course viewing progress.
Download lecture slides for taking notes.
Learn at your own pace... anytime, anywhere!

Answer EngineGet answers to any question!Ask any question related to Statistics

Working on the solution...

Hypothesis Testing for the Difference of Two Independent Means

Lecture Slides are screen-captured images of important points in the lecture. Students can download and print out these lecture slide images to do practice problems as well as take notes while watching the lecture.

Intro 0:00
Roadmap 0:06

Roadmap

The Goal of Hypothesis Testing 0:56

One Sample and Two Samples

Sampling Distribution of the Difference between Two Means (SDoD) 3:42

Sampling Distribution of the Difference between Two Means (SDoD)

Rules of the SDoD (Similar to CLT!) 6:46

Shape
Mean for the Null Hypothesis
Standard Error for Independent Samples (When Variance is Homogenous)
Standard Error for Independent Samples (When Variance is not Homogenous)

Same Conditions for HT as for CI 10:08

Three Conditions

Steps of Hypothesis Testing 11:04

Steps of Hypothesis Testing

Formulas that Go with Steps of Hypothesis Testing 13:21

Step 1
Step 2
Step 3
Step 4

Example 1: Hypothesis Testing for the Difference of Two Independent Means 18:47
Example 2: Hypothesis Testing for the Difference of Two Independent Means 33:55
Example 3: Hypothesis Testing for the Difference of Two Independent Means 44:22

General Statistics Online Course

Section 1: Introduction
	Descriptive Statistics vs. Inferential Statistics	25:31
Section 2: About Samples: Cases, Variables, Measurements
	About Samples: Cases, Variables, Measurements	32:14
Section 3: Visualizing Distributions
	Introduction to Excel	8:09
	Frequency Distributions in Excel	39:10
	Frequency Distributions and Features	25:29
	Dotplots and Histograms in Excel	42:42
	Stemplots	12:23
	Bar Graphs	22:49
Section 4: Summarizing Distributions
	Central Tendency: Mean, Median, Mode	38:50
	Variability	42:40
	Five Number Summary & Boxplots	57:15
	Shape: Calculating Skewness & Kurtosis	41:51
	Normal Distribution	34:33
	Standard Normal Distributions & Z-Scores	41:44
	Normal Distribution: PDF vs. CDF	55:44
Section 5: Linear Regression
	Scatterplots	47:19
	Regression	32:02
	Least Squares Regression	56:36
	Correlation	43:58
	Correlation: r vs. r-squared	52:52
	Transformations of Data	27:08
Section 6: Collecting Data in an Experiment
	Sampling & Bias	54:44
	Sampling Methods	14:25
	Research Design	53:54
	Between and Within Treatment Variability	41:31
Section 7: Review of Probability Axioms
	Sample Spaces	37:52
	Addition Rule for Disjoint Events	20:29
	Conditional Probability	57:19
	Independent Events	24:27
Section 8: Probability Distributions
	Introduction to Probability Distributions	56:45
	Expected Value & Variance of Probability Distributions	53:41
	Binomial Distribution	55:15
Section 9: Sampling Distributions of Statistics
	Introduction to Sampling Distributions	48:17
	Sampling Distribution of the Mean	1:08:48
	Sampling Distribution of Sample Proportions	54:37
Section 10: Inferential Statistics
	Introduction to Confidence Intervals	42:53
	t Distributions	1:02:06
	Introduction to Hypothesis Testing	1:06:33
	Confidence Intervals for the Difference of Two Independent Means	55:14
	Hypothesis Testing for the Difference of Two Independent Means	50:00
	Confidence Intervals & Hypothesis Testing for the Difference of Two Paired Means	1:14:11
	Type I and Type II Errors	31:27
	Effect Size & Power	44:41
Section 11: Analysis of Variance
	F-distributions	24:46
	ANOVA with Independent Samples	1:09:25
	Repeated Measures ANOVA	1:15:13
Section 12: Chi-square Test
	Chi-Square Goodness-of-Fit Test	58:23
	Chi-Square Test of Homogeneity	51:36
Section 13: Overview of Statistics
	Overview of Statistics	18:11

Transcription: Hypothesis Testing for the Difference of Two Independent Means

Hi and welcome to www.educator.com.0000

We are going to be talking about hypothesis testing for the difference between two independent means.0001

We are going to go over the goal of hypothesis testing in general.0005

We have only looked at it for one means so far, but we are going to look at0012

how it changes just very suddenly when we talk about two means.0015

We are going to re-talk about the sampling distribution of the difference between two means.0019

You have just watched the confidence interval for two means, then you do not need to watch this one.0025

You do not need to watch that section.0032

We are going to talk about the same conditions for doing hypothesis testing as first confidence interval.0034

They need to meet three conditions before you could do either of these two.0043

When we talk about the modified steps of hypothesis testing for two means and the formulas that go with those steps.0047

Let us talk about the goal of hypothesis testing.0055

In one sample what we wanted to do was reject the null if0060

we got a sample that was significantly different from the hypothesized mu.0065

For instance, significantly lower or significantly higher.0073

A significant does not mean important like it does in our modern use of the word.0076

It actually means does it standout?0083

Is it weird enough?0086

Does it stand out from the hypothesized mu?0088

In those cases we reject the null.0091

Our goal is to reject the null.0095

We can only say whether something is sufficiently weird w cannot say whether it is sufficiently similar.0097

Experiment is actually a success if they reject the null.0106

If they do not reject the null it is considered a null experiment or what we think of as uninformative which is not actually true.0110

That is how traditionally is that.0118

This is the case where we only have one sample and we have a hypothesized population.0123

Here we have two samples and in order to reject the null we need to get samples that are significantly different from each other.0130

They stand out from each other so x is different from y, y is different from x.0144

That is what we are really looking for.0151

Once again, just like the one sample, we cannot say whether they are sufficiently similar,0154

but we can say whether they are sufficiently different.0159

It is okay if x is significantly lower than y or significantly higher.0163

We do not really care.0170

We just care about significantly different.0171

If you do not care about which direction these are called two-tailed hypotheses.0173

Let us think if x and y are different from each other then x - y should not be 0.0179

But if x and y are exactly the same, x = y then x – y =0.0189

Because you can think about this as x – x because x – y.0196

If you want to think about it algebraically even if you add y to each side you would get perfectly x= y.0201

If x and y were the same, we should expect their difference to be 0.0211

Let us just review very briefly the sampling distribution of the difference between two means.0218

This is the case where we do not know what the population is like,0228

but because of the CLT we actually end up knowing quite a bit about the SDOM.0233

This is x the population of x and population of y.0242

This is the SDOM of x bar, so the whole bunch of x bars and this is the SDOM for y which is a whole bunch of y bars.0247

We know some things about these guys and we also know we can figure out the standard error from the sample.0258

What is nice about this if we do not need to know anything about the population.0280

All we have to do is know the standard deviation of the sample which we could easily calculate0284

in order to estimate the standard error of these two populations.0288

Once we have that now we can start talking about the SDOD (the sampling distribution of the difference between means).0294

What we want to do is instead of finding mu sub x or mu sub y, we want to know mu sub x bar – y bar.0306

Here you have to think of pulling out one sample from here and one sample from here getting the difference and plotting it.0322

If these guys are normal, we can assume this one to be normal.0332

Not only that but we can figure out the standard error of this guy as well just0336

from knowing these because the standard error is going to be square roots of s sub x².0342

The variance of s/n sub x + variance of y/ n sub y.0357

These are all things that we have.0366

We do not need anything special.0368

We do not need sigma or anything like that.0370

We just need samples in order to calculate this.0372

If these two distributions or if these two distributions, the population distribution,0374

if we have a reason to suspect that these have homogeneous variance.0384

If their variances are the same then instead of s sub s² and s sub y²,0389

we can actually use spull² but we would not be doing that in this lesson, but you can.0395

Remember the rules of the SDOD are very similar to the CLT and if the SDOM for x is normal0405

and SDOM for y is normal then SDOD is normal too.0415

There is two ways that this could be true.0419

The first way is if populations are normal.0421

If population of x and y are normal then we could assume SDOM for x and y are normal.0428

Or are your other possibility is if n is large enough.0435

We want to talk about the mean for the null hypothesis.0443

The null hypotheses is saying that the population of x and population of y,0450

the difference between them is going to be 0 because they are similar.0457

The null hypotheses is saying both are similar, which means that the means of0461

the sampling distribution of the means, the SDOM means is going to be similar.0467

Which means that is strap in and will give us 0.0474

The null hypothesis says the mean of these differences of means it is going to be 0.0478

That is the null hypotheses and that is really saying that the SDOM for x and SDOM for the y are very similar.0486

Let us talk about standard error for independent samples.0497

Remember, we are still talking just about independent samples.0502

When variance is homogenous that is only used as Spull idea.0506

That means that x sub x bar - y bar is going to be equal to and pretend you are0511

writing just the regular idea where you are dividing by n sub x and n sub y.0521

Instead of using the variance from x and the variance from y, we are going to use that pulled variance idea.0529

That is going to be s pulled.0536

Some people think why do we just put that on top and put n sub x and n sub y at the bottom?0547

That will be algebraically wrong because remember, these are the denominators we would have0554

to have common denominators in order for us to put these together and we do not have common denominators yet.0559

What about in the case where variance is not homogenous and this is the vast majority of time and when in doubt,0565

when you do not know anything about the variance of the population go with this one.0576

It is just a safer option.0582

This is going to mean that this standard error is represented by the variance of x /n + variance of y /n.0584

Add these together and square the whole thing.0602

Just to recap, same conditions must be met in order to do hypothesis testing0605

for two means as the conditions for doing a confidence interval for two means.0616

It is that the two samples were randomly and independently selected from two different populations,0622

it is reasonable to assume that both populations that the sample come from are0632

normally distributed or the sample sizes are sufficiently large.0636

This was to ensure the normality of the SDOM.0641

Also in the case of the sample surveys, the population size should be at least 10 times larger than the sample size for each sample.0643

That is just assume so that we could assume replacement because probability actions change when you do not assume replacement.0651

Let us go in the steps of the hypothesis testing.0663

These are the same steps as you did when you have one mean, except now that we are subtly changing a few things.0669

I'm going to highlight those changes as we go through this.0677

First we need to state our hypotheses and remember now instead of having just the hypotheses that0679

the mean of the population equals this, what we are saying is that the mean of x,0686

population of x and the mean of the population of y those are the same.0696

Mu sub x - y will be 0.0701

You can also write it as mu sub x = mu sub y.0707

The alternative is that they are different from each other in some way.0712

Then we pick a significance level.0718

How different do these two populations have to be for us to say they are different?0721

We set a decision stage, but instead of drawing the SDOM now we draw the SDOD.0726

Because now we are looking at the differences between these to means.0734

We identify critical limits and rejection regions.0739

We also find the critical test statistic, the boundaries.0743

In order to do this we have to find the degrees of freedom for the difference.0747

We cannot just use the degrees of freedom for 1, degrees of freedom for the other but we actually add them together.0753

And then use the samples and the SDOD to compute the mean difference.0759

We are not just computing mean, but we are computing mean difference test statistics, as well as the p value.0764

And then we compare the sample to the hypothesized population.0773

We either reject the null or not.0779

We reject the null if our test statistic and p value lie in those zones of rejection.0781

It is like these are the weirdo zone.0792

This is all we know that our sample is really different from this population.0794

Let us talk about the different formulas that go along with these steps.0799

Remember the first step is going to be, what is the hypothesis, the null hypotheses, as well as the alternative.0806

This is not really a formula, but it is helpful to remember that this is what we really mean versus x bar – y bar does not equal 0.0817

This is often what is going to be the case and you can rewrite this as mu sub x bar – mu sub y bar sometimes,0836

but there are some mathematical ideas that you have to learn before you can write that.0846

I will leave that aside for now.0857

Second thing is significance level.0859

Here there are no formulas but you should know that when we say alpha= .05 we are talking about that false alarm rate.0862

This is the rate of rejecting the null when the null is actually true.0873

This is a very low rate of false alarms.0877

When we say alpha = .05 it is not that we calculated it but it is just that0881

by convention science tends to say this is the reasonable level of significance.0887

Sometimes people are more conservative than 1.0 or 1.001.0895

Number 3, we need to set that decision stage.0900

It is helpful to draw the SDOD and it is helpful to have our hypothesized population here.0905

Mu sub x bay – y bar = 0.0924

We assume that this point is 0.0930

One thing you probably also want to know about the SDOD is the formula for standard error.0932

The formula for standard error of the SDOD we written this a lot of times,0941

is the variance of x / n sub x + the variance of y / n sub y.0951

Another thing, you probably want to know is that we need to find these critical t.0959

We need to find the t values here and in order to find that you will need to know0965

the degrees of freedom for the difference and it is pretty easy.0973

It is the degrees of freedom for x + the degrees of freedom for y.0979

To find this, it is n sub x -1.0983

To find that it is n sub y -1.0988

We could write this as n sub x -1 + n sub y -1.0990

You could write it like that and then I think that is all you need to know for the decision stage.1002

Step 4, if you have to compute the samples mean difference you need to calculate its test statistic as well as its p value.1011

Remember we are going to be using t from here on out because obviously we are using s instead of sigma.1039

Let us talk about how to come to the sample t.1046

Let me write this as sample t.1050

The sample t is really the distance between where our sample differences versus the hypothesized difference.1058

We do not want it just in terms of that raw distance, we want in terms of the standard error.1069

It is going to be whatever our x bar - y bar is the actual sample difference -0.1075

That is our hypothesized population divided by the standard error s sub x bar – y bar.1085

That will give you how many standard errors away our actual mean difference is from 0.1097

Once you have this t value and you have the degrees of freedom,1104

then you can find the p value and then you could reject or accept the null hypotheses.1113

Reject or do not reject, that is really the technical idea there.1121

Let us go onto some examples.1126

The Cheesy Cheesy cookies company wanted to know whether they should have a coarse or fine texture in their cheesy cookies.1131

They assembled a series of taste testing panels that tasted either the coarse1140

or fine textured cookies and gave it a palatability score.1143

The higher score the better.1153

Is there a statistical difference in the mean palatability score between the two texture levels?1154

If you download the examples below and you look under the example 1, you should see a data set that looks like this.1162

This is the palatability score and this is the texture.1174

I believe that 0 = coarse and 1= fine, just so that we can make some sort of recommendation at the end.1177

Here we go, we have these different sets of scores, so this is the score that1200

one panel came up with and that panel tasted coarse textured cheesy cookies.1209

This panel also tasted coarse and that is the score it gave it.1214

Let us go up to fine.1221

They tasted fine texture and they give it that score.1223

They also tasted fine and they give it that score.1227

You could go and see what the different scores are and what texture they had.1231

First, let us think about what our x and y?1240

What are our two independent samples?1245

The two independent samples here seem to come from the two different textures.1247

One group of scores they all tasted coarse texture cheesy cookies.1251

The other group of scores tasted fine textured cheesy cookies.1260

It might be helpful to us to sort this data by texture.1264

I am going to take this and I am going to ask.1270

It would work if I move score over.1281

What I am going to do is just hit sort.1291

Here these are all our coarse cheesy cookie, the palatability scores and here are my fine cheesy cookie palatability scores.1296

Let us think about how we want to approach this problem.1311

First thing we want to do is create some sort of hypothesize population.1315

Our hypothesize population is really going to say that the coarse and1322

fine textured cheesy cookies there is really no difference between them.1327

They are the same.1330

The mu sub x bar - y bar should equal 0.1332

The alternative is that they are different from each other in some way.1337

We do not know which one taste better.1346

Let us just be neutral and say we do not know whether the coarse cheesy cookies1352

are better than the fine or to fine cheesy cookies are better than the coarse.1358

We want to know whether these palatability scores are different or the same.1364

Let us set a significance level for how different they have to be.1370

Our significance level could be alpha= .05.1377

Finally let us set a decision stage.1386

Here I am going to draw SDOD, can we assume normality?1390

Well, they are different and let us look here.1398

We have 8 scores and 8 scores, the n is low.1405

Technically, we might not be able to do hypothesis testing.1416

Let us say for some reason that your teacher wants you doing anyway.1424

But one of the things that should come up when you see low n like this is that you should question1430

whether hypothesis testing is the right way to go because it may not reflect the conditions1436

that we need to have set before we can assume all the stuff.1446

Just for the problem solving and practice here, let us go with that.1449

But if you want it to be smaller you can tell your instructor the conditions are meet for hypothesis testing.1454

Here we set our little lower n rejection and why do we just go ahead and put in our mu here.1466

It is going to be 0 and it will be helpful to find out that t values out here.1478

Let us go ahead and do that.1483

What are our critical t?1486

Critical t or the boundaries.1491

In order to find the critical t, we are going to have to find the degrees of freedom, DF of differences.1494

N sub x we will call x coarses.1503

X will be coarse cheesy cookies and y will be fine.1512

You can use c and f if you want to.1521

This is going to be 8 and this is also 8.1524

The degrees of freedom for each of these is 7 so this is going to be 14.1528

That is a pretty low degrees of freedom.1534

That is all we can assume normality here.1537

Let us find the critical t.1540

In order to find that we would use t inverse because we have the two tailed probability .05 and we have the degrees of freedom.1545

This gives us a positive version.1562

The negative version would just be the negative of that number because they are perfectly symmetrical.1565

2.14 the critical t is + or -2.14.1573

Now that we have that, then we could go ahead and look at the actual samples themselves.1581

Step 4, is we need to find the samples mean difference.1589

We need to find x bar – y bar, but we also need to find this mean differences t.1598

The t sub x bar - y bar.1606

We need to find that as well as the p value.1610

Let us go ahead and do that.1613

We just started from step 3 and step 4 is really the mean difference and that is just the average of these guys - the average of these guys.1618

That is their average difference.1656

This is saying that the coarse scores tend to be on average lower than1662

the fine scores because we do course score – fine score.1668

We get a negative number.1671

The coarse score number must have been small.1672

Actually before we go on, it might be helpful to find the standard error of this situation.1677

In order to find the standard error of the difference we need to find1690

the square roots of the variance of x ÷ n sub x + the variance of y ÷ n sub y.1699

This is going to be our standard error that we need.1717

In order to find that it would be helpful to find each of these pieces by themselves.1724

I guess we could find the whole thing, the variance of x ÷ n sub x and the variance of y ÷ n sub y.1731

I will put each of these on different lines like we can do all of it together.1750

We could just add them all up here.1754

Let us find that.1757

The variance, thankfully Excel has all these functions.1763

Let us check and make sure that this variance will give us n-1.1771

The variance of x ÷ 8 and the variance of all my fine cheesy cookie values ÷ 8.1778

We have these two variances and when we divide by n sub x we are getting the variance of the SDOM.1799

If we add those together then get the square root, then we get the standard error of the difference.1811

The square root of these two guys added together and that is 11.16.1820

Here I will just add this information so the standard error of the difference =11.16.1830

In order to find this t, we need to have this difference between the means -0 / the standard error of the difference.1851

We can easily do that now.1866

Here in order to find the sample t we could put the mean difference -0.1871

If you want to keep it technical you do not need that -0 / the standard error of the difference.1891

Our sample t says the difference is not at 0 it is actually way down here.1901

It is not significantly different.1914

Well, one thing we could do is just operate here and compare this number to this number.1917

This sub boundary here is -2.14.1923

-4.73 is like out here so we definitely know it is way significant.1928

It is way standing out from the expected mean but we can also find the p value.1935

Now remember in Excel one of the things it needs a positive t value.1944

If you have a negative t value you have to turn it into a positive one, but it is okay because it is perfectly symmetrical.1951

The degrees of freedom that we are talking about are going to be this1959

new combined degrees of freedom because we are always talking that the SDOM now.1963

This is the degrees of freedom for this SDOD and that is 14 and it is a two-tailed hypothesis.1969

Our p value is .0003.1976

I will not write the last up here but we can just talk about it.1981

The last step would be we reject or do not reject the null.1991

Well, we reject the null here because our t value is much lower than our significance level.1997

Our t value, our sample t is more extreme than our critical t.2003

Here what we would say is that there is a statistical difference between the two texture levels.2010

One that is very unlikely to be attributed to by chance, because that is what this t values.2018

If it was by chance it would have .03% probability.2026

It is pretty low.2033

Example 2, scientists have found certain tree resins that are deadly to termites.2035

To test the protective power of resin protecting the tree, a lab prepared 16 dishes with 25 termites in each.2042

Each dish was randomly assigned to be treated with 5 mg or 10 mg of resin.2050

At the end of 15 days, the number of surviving termites was counted.2055

Assume that termites survival tends to be normally distributed with both dosage levels.2060

Is there a statistical significant difference in the mean number of survival for those two doses?2066

Now here I think it is worth than just discussing what will be our x and y.2072

Our x might be the 5 mg population and our y might be the 10 mg population.2077

The n sub x some people might think there are 25 termites but actually there is 25 termites in each of 10 Peachtree dishes.2087

There are 8 Peachtree dishes that have been randomly treated with 5 mg and 8 have been treated with 10 mg.2099

This is 8 and 8.2109

When I say 8, we mean the dishes of treatment and the termites are not the subject they are the cases that we are interested in.2113

The termites are the test.2124

You can get 25 termites surviving or you could get 0 surviving.2128

How many termites survived?2134

That is our dependent variable.2135

Okay, let us see.2137

Well one thing we could do is start off with our hypotheses.2142

Our null hypotheses is that these two dosage levels are roughly the same.2146

We might say something like the mu sub x bar - y bar which is equal 0 are the same.2153

The alternative is that they are not the same.2161

Maybe that one is more powerful than the other.2166

We do not know which one.2169

We could easily set our significance level to be .05.2173

Let us talk about the actual set up, the decision stage.2179

In the decision stage, let us see what we have here.2184

We have set up this .05 level rejection and we could just go ahead and this is the x bar - y bar, but what would be that t?2195

The nice thing about this being 0 is that the t distribution as well as the x bar – y bar start off the same.2213

They are not going to have the same numbers out here.2226

Okay, so that is why we do have to put them on different lines.2229

They are still talking about different things.2233

Let us talk about the t values.2235

Before we do, it might be helpful to figure out the new degrees of freedom.2240

The degrees of freedom of differences will be 7 + 7 =14.2247

Here we can do hypothesis testing just jump in right away because given2255

the termite survival tends to be normally distributed within these two dosage rates.2261

If you go to example 2, you will actually see the data here.2267

Here we see dosage and here is the 5 mg, as well as the 10 mg.2284

Here are the survival counts.2293

How many termites survived?2294

Notice that there is no survival count over 25.2296

25 is the maximum you can have, but even the highest gives me 16.2299

What if the survival count cannot go below 0 because we cannot have negative termite surviving.2304

Here we have the survival count.2311

Let us see what we have here.2317

Can we figure out what the critical t is.2323

Can we figure out what the critical t is?2329

I think we can.2335

Let us see.2336

You can use the book but I am going to use Excel to find the critical t.2338

I am going to write for myself step 4.2344

I know the two-tailed probability that I need .05 and I know my degrees of freedom is 14.2347

I see that the critical t is the same as before and because we use2362

the same two tailed probability and the same degrees of freedom of differences.2367

Here we know that it is -2.14, as well as positive 2.14.2372

What we can do is now from here go on to looking at our actual sample.2384

This is actually step 3, it is a part of our decision stage.2394

Step 4, is now actually talking about the sample.2406

It will help to find the sample mean difference, so that is going to be the average of one of these x - the average y.2410

We want to know is this is difference going to be significantly different from 0?2431

We cannot just look at the raw scores because we need to figure out how many standard errors away we are.2436

How shall we find the standard error for the difference?2443

That is equal to the square root of the variance of x/ n sub x + variance of y/ n sub y.2448

Let us find the variance of x/ n sub x over and variance of y/ n sub y.2458

Let us find the variance of x/8 and the variance of y /8.2468

We see that the variance for y is a lot different than the variance for x.2486

That is helpful for us to just look at briefly right now just because this will probably give us an idea2493

that the variance of samples are so different we probably do not have a good reason to pull these two together.2500

We do not have a good reason to assume that the populations are similar.2507

When in doubt go with non homogenous variances.2511

Just assume that they are different.2518

Once we have that then we can find the square root of adding these two standard errors together and we get 2.5.2520

Once we have all of that then we can find the samples mean difference t.2535

And that would be the samples mean difference -0 divided by the standard error of the SDOD.2548

What would that be?2572

That would be this guy and I am going to leave that subtract 0 part divided by the standard error and we get to 2.15.2575

We are close but it is still more extreme than 2.14.2586

It does not have to be extreme and the -n could be either extreme in the negative n or extreme in the positive n.2595

This is extreme in the positive n.2603

It is just right outside our borders.2607

Let us find the p value.2609

In order to find that p value we use t distribution because we have the t value that2611

we want the degrees of freedom and we wanted to be a two-tailed p value.2620

It is going to add up this little chunk and this little chunk together and that can be .049.2625

We will just skip step 4, our p value =.0449 that is right just a hair underneath our alpha.05.2635

We would probably reject the null.2653

Example 3, 2 months before smoking ban in bars, a random sample of bar employees were assessed on respiratory health.2657

Two months after the ban, another random sample of employees were assessed.2672

Researchers saw a statistically significant increase in the mean scores of health.2678

P= .049 we had an example of that two tailed.2684

Which of the following is the best interpretation for this result?2689

The probability is only .049 that the mean score for all of our employees increased from before to after the ban.2693

Is that what this means?2706

For me it helps to draw that SDOD and it is saying the null hypotheses would be2708

the same like before and after are the same.2715

What they actually found is that there is some extreme value.2720

There is the increase in mean scores.2727

There is a positive difference from after – before.2735

There is the increased.2742

It is somewhere up here, that increase tells us that.2745

P= .04.2749

We can actually draw this carefully, it is just right above that cut off.2753

There is only .049 probability that the mean score for all bar employees increase.2760

That is not what this means.2775

It is not saying that there is only a small chance that it increase.2778

It is actually saying there is a pretty good chance that it is not the same.2783

There is a pretty small chance that it is the same.2787

This one we can just rule out.2792

Another possibility is that the mean score for all bar employees increased by more than 4.9%.2796

Does this p value actually talk about the raw score on respiratory health?2805

It does not talk about that score at all, it is the probability of finding such a difference.2814

It does not have anything to do with actual scores.2821

What about this one?2825

An observed difference in the sample means as large or larger than the sample is unlikely to occur2828

if the mean score for all bar employees before and after the ban were the same.2835

This actually have something we can use.2839

This is about considering that the means score for before and after are the same.2842

That is important because that is what the SDOM actually represents.2851

That is what this p value is actually talking something about this idea that when we get the sample,2854

we consider that they were just the same.2865

This is saying an observed difference in sample means as large or larger than a sample is very unlikely to occur.2867

It is likely to occur with .049% if the mean score for all bar employees the true score is actually the same.2876

This is a pretty good contender because the SDOD is talking about how .049 means very unlikely.2889

This I would leave as a definite contender.2900

Maybe there is a better answer.2902

There is a 4.9% chance that the mean score of all bar employees after the ban is actually lower than before the ban.2905

There is a small chance of the opposite hypotheses picture that is probably not the case.2915

It depends on what the null hypothesis was.2925

The null hypothesis and a two mean hypotheses test is usually the same not the one is less than the other.2934

We do not usually do that.2953

Maybe there is a way and that could be true.2954

It is probably not true if we did hypothesis testing at all.2958

Only 4.9% of the bar employees had their score drop but the other 95% had their scores increase.2961

This would be a correct interpretation if we are not talking about the SDOD.2971

If this was not a reflection of the population then maybe that would be true.2977

This is not talking about population, it is talking about the SDOD.2982

This is a wrong interpretation.2987

The correct answer is c.2990

That is our last example for hypotheses testing with two independent means.2992

Thank you for joining us on www.educator.com.2998

Related Books

Statistics by Witte, 10th Edition

Authors: Robert S. Witte, John S . Witte

ISBN: 1118450531

Publisher: Wiley

Year: 2013

This book provides a clear and methodical approach to essential statistical procedures. It clearly explains the basic concepts and procedures of descriptive and inferential statistical analysis. This book features a new emphasis on expressions involving sums of squares and degrees of freedom as well as a stronger stress on the importance of variability.

Related Books

Name	Description	Link
BookRenter.com	BookRenter.com is simply the most reliable online textbook rental service.	Visit BookRenter.com
PhysicsForums.com Homework Help	Physics Forums is a scientific community for students looking for math & science help.	Visit PhysicsForums.com Homework Help

Statistics Hypothesis Testing for the Difference of Two Independent Means

Share this knowledge with your friends!

Copy & Paste this embed code into your website’s HTML

Discussion

Answer Engine

Download Lecture Slides

Table of Contents

Transcription

Related Books

Start Learning Now

Membership Overview

Answer EngineGet answers to any question!Ask any question related to Statistics

Hypothesis Testing for the Difference of Two Independent Means

General Statistics Online Course

Transcription: Hypothesis Testing for the Difference of Two Independent Means

Related Books

Related Books

Start Learning Now

Membership Overview

Statistics Hypothesis Testing for the Difference of Two Independent Means

Share this knowledge with your friends!

Copy & Paste this embed code into your website’s HTML

Discussion

Answer Engine

Download Lecture Slides

Table of Contents

Transcription

Related Books

Start Learning Now

Membership Overview

Answer EngineGet answers to any question!Ask any question related to Statistics

Hypothesis Testing for the Difference of Two Independent Means

General Statistics Online Course

Transcription: Hypothesis Testing for the Difference of Two Independent Means

Related Books

Related Books

Available 24/7. Unlimited Access to Our Entire Library.

Searchable Lessons

Get Answers & Community Support

Downloadable Lecture Notes

Study Guides, Worksheets and Extra Example Lessons

Start Learning Now

Membership Overview