Type I and II Errors

Category:Hypothesis Testing

title: False Alarm, Missed Detection (Type I and Type II error)
author: Bernard Szlachta (NobleProg Ltd)

Prerequisites

Introduction to Hypothesis Testing, Statistical Significance

Questions。

What is Type I and Type II errors?
How to interpret significant and non-significant differences?
Why the null hypothesis should not be rejected when the effect is not significant?

Simplification

Let us assume that null hypothesis is always about something being not different
E.g.
- new application is as popular as old one (there is not difference in popularity)
- new hardware is as fast as old one (there is no difference in speed)
- drug doesn't cure the disease (makes no difference to patient health)

Overview。

State of the world	There is no difference	There is a difference
Null hypothesis (no difference)	Rejected (there is a difference)	Not Rejected (there is no difference)
Error	Type I False Alarm False Positive	Type II Missed Detection False Negative

Example 1。

Patient have a disease
There is a problem

State of the world	Patient doesn't have cancer	Patient have cancer
What we tell the patient	has cancer (H0 rejected)	no cancer (H0 not rejected)
Error	Type I False Alarm False Positive	Type II Missed Detection False Negative

Example 2。

New hardware is different than old one
Customer is more satisfied with new application than old one
There is a increase in people buying our product after running Ad-words campaign

State of the world	No difference between new and old (no difference)	There is a between old and new(difference)
Null hypothesis (no difference)	Rejected (difference)	Not Rejected (no difference)
Error	Type I False detection	Type II Missed Detection

Supplier vs Customer

Type I error。

Type I error is a rejection of a true null hypothesis
False Positive or False Alarm can be used instead in business world
Rejecting the null hypothesis is not an all-or-nothing decision
The Type I error rate is affected by the α level: the lower the α level the lower the Type I error rate

Probability of Type I error。

It might seem that α is the probability of a Type I error
However, this is not correct
Instead, α is the probability of a Type I error given that the null hypothesis is true

You can only make a Type I error if the null hypothesis is true.

Type II error。

{\mbox{beta}}=\mathbb {P} {\big (}{\mbox{not detecting a difference}}{\big |}{\mbox{there is a difference}}{\big )}

Type II error is failing to reject a false null hypothesis
Unlike a Type I error, a Type II error is not really an error
When a statistical test is not significant, it means that the data do not provide strong evidence that the null hypothesis is false

Lack of significance does not support the conclusion that the null hypothesis is true

Therefore, a researcher would not make the mistake of incorrectly concluding that the null hypothesis is true when a statistical test was not significant
Instead, the researcher would consider the test inconclusive

Contrast this with a Type I error in which the researcher erroneously concludes that the null hypothesis is false when, in fact, it is true.

A Type II error can only occur if the null hypothesis is false

If the null hypothesis is false, then the probability of a Type II error is called β
The probability of correctly rejecting a false null hypothesis equals 1- β and is called Power
Power is the probability of being able to find a difference if it really exists

Errors and Decision Making。

Increasing the significance level, increases the changes of False Alarm and decrease the changes of Miss Detection
To decrease changes of both errors, the sample size has be increased

What is more serious?。

False Alarm is usually more serious, as the test is convulsive
Missed Detection usually is less harmful e.g.
- if patient failed to detected a disease, they can repeat the test
- on the other hand, if the therapy was implemented to a misdiagnosed problem, the consequence can be worse
When selecting alpha level which related to probability of False Alarm, it is important to keep that in mind what is more harmful:
- If the administering treatment even for a healthy patient is consider cheap and not harmful, but not detecting the disease of sick patient would be dangerous setting significance level (alpha) high (e.g. 5%) is right thing to do

Supplier v Customer。

We cannot calculate the probability of Type II error without knowing the true state of the world
Reduction in probability of Type I error will increase the chance of Type II error

Google v Hard Drive provider

Hard-drives are provided to the cloud by Company X (supplier) to Google
H0 is the all hard drives have specification as provided by the supplier
In other words, the hard drives Google buys, was drawn from the population of the hard drives complying with the spec
Google tests a sample of the hard drives and test whether they are compliant using Hypothesis Testing
If Type I error occurs, it is to the suppliers detriment, since hard drives are fine, but will be rejected by Google (after testing)
If Type II error occurs, Google accepts drives which are not up to the standard
The sample size can be increased, but that will increase the cost of testing as well
There is a trade-off between the reduction of errors and the cost of sampling

Supplier v Customer Solution。

Google and Company X (supplier) can set up alpha and beta to the level where the probability of both errors is the same
This can be achieved by changing the sample size (sometimes decreasing it), and changing significance level (alpha)

Questions

Template:Statistics Links

Type I and II Errors

Contents

Questions。

Simplification

Overview。

Example 1。

Example 2。

Supplier vs Customer

Type I error。

Probability of Type I error。

Type II error。

Errors and Decision Making。

What is more serious?。

Supplier v Customer。

Supplier v Customer Solution。

Questions

Navigation menu

Personal tools

Namespaces

Variants

Views

Search

Opportunities

Navigation

Tools

	Type I (False Alarm)
	Type II (Missed Detection)

	Type I (False Alarm)
	Type II (Missed Detection)

	Supplier
	Client