Type I and II Errors

From Training Material
Jump to navigation Jump to search
title
False Alarm, Missed Detection (Type I and Type II error)
author
Bernard Szlachta (NobleProg Ltd)

Prerequisites

Questions。

  • What is Type I and Type II errors?
  • How to interpret significant and non-significant differences?
  • Why the null hypothesis should not be rejected when the effect is not significant?

Simplification

  • Let us assume that null hypothesis is always about something being not different
  • E.g.
    • new application is as popular as old one (there is not difference in popularity)
    • new hardware is as fast as old one (there is no difference in speed)
    • drug doesn't cure the disease (makes no difference to patient health)

Overview。

State of the world There is no difference There is a difference
Null hypothesis (no difference) Rejected
(there is a difference)
Not Rejected
(there is no difference)
Error Type I
False Alarm
False Positive
Type II
Missed Detection
False Negative

Example 1。

  1. Patient have a disease
  2. There is a problem
State of the world Patient doesn't have cancer Patient have cancer
What we tell the patient has cancer
(H0 rejected)
no cancer
(H0 not rejected)
Error Type I
False Alarm
False Positive
Type II
Missed Detection
False Negative

Example 2。

  1. New hardware is different than old one
  2. Customer is more satisfied with new application than old one
  3. There is a increase in people buying our product after running Ad-words campaign
State of the world No difference between new and old (no difference) There is a between old and new(difference)
Null hypothesis (no difference) Rejected (difference) Not Rejected (no difference)
Error Type I
False detection
Type II
Missed Detection

Supplier vs Customer

Type I error。

  • Type I error is a rejection of a true null hypothesis
  • False Positive or False Alarm can be used instead in business world
  • Rejecting the null hypothesis is not an all-or-nothing decision
  • The Type I error rate is affected by the α level: the lower the α level the lower the Type I error rate

Probability of Type I error。

  • It might seem that α is the probability of a Type I error
  • However, this is not correct
  • Instead, α is the probability of a Type I error given that the null hypothesis is true
You can only make a Type I error if the null hypothesis is true.

Type II error。

  • Type II error is failing to reject a false null hypothesis
  • Unlike a Type I error, a Type II error is not really an error
  • When a statistical test is not significant, it means that the data do not provide strong evidence that the null hypothesis is false
Lack of significance does not support the conclusion that the null hypothesis is true
  • Therefore, a researcher would not make the mistake of incorrectly concluding that the null hypothesis is true when a statistical test was not significant
  • Instead, the researcher would consider the test inconclusive
  • Contrast this with a Type I error in which the researcher erroneously concludes that the null hypothesis is false when, in fact, it is true.
A Type II error can only occur if the null hypothesis is false
  • If the null hypothesis is false, then the probability of a Type II error is called β
  • The probability of correctly rejecting a false null hypothesis equals 1- β and is called Power
  • Power is the probability of being able to find a difference if it really exists

Errors and Decision Making。

  • Increasing the significance level, increases the changes of False Alarm and decrease the changes of Miss Detection
  • To decrease changes of both errors, the sample size has be increased

What is more serious?。

  • False Alarm is usually more serious, as the test is convulsive
  • Missed Detection usually is less harmful e.g.
    • if patient failed to detected a disease, they can repeat the test
    • on the other hand, if the therapy was implemented to a misdiagnosed problem, the consequence can be worse
  • When selecting alpha level which related to probability of False Alarm, it is important to keep that in mind what is more harmful:
    • If the administering treatment even for a healthy patient is consider cheap and not harmful, but not detecting the disease of sick patient would be dangerous setting significance level (alpha) high (e.g. 5%) is right thing to do

Supplier v Customer。

  • We cannot calculate the probability of Type II error without knowing the true state of the world
  • Reduction in probability of Type I error will increase the chance of Type II error

Google v Hard Drive provider

  • Hard-drives are provided to the cloud by Company X (supplier) to Google
  • H0 is the all hard drives have specification as provided by the supplier
  • In other words, the hard drives Google buys, was drawn from the population of the hard drives complying with the spec
  • Google tests a sample of the hard drives and test whether they are compliant using Hypothesis Testing
  • If Type I error occurs, it is to the suppliers detriment, since hard drives are fine, but will be rejected by Google (after testing)
  • If Type II error occurs, Google accepts drives which are not up to the standard
  • The sample size can be increased, but that will increase the cost of testing as well
  • There is a trade-off between the reduction of errors and the cost of sampling

Supplier v Customer Solution。

  • Google and Company X (supplier) can set up alpha and beta to the level where the probability of both errors is the same
  • This can be achieved by changing the sample size (sometimes decreasing it), and changing significance level (alpha)

Questions

1 It has been shown many times that on a certain test, women perform better than men. However, the probability value for the data from your sample was .12, so you were unable to reject the null hypothesis that women and men perform alike. What type of error did you make?

Type I (False Alarm)
Type II (Missed Detection)

Answer >>

In this example, there is really a difference in the population between men and women, but you did not find a significant difference in your sample. Failing to reject a false null hypothesis is a Type II error.


2 BaiDu rather than Google is preferred by Chinese people as a search engine. However, the probability value for the data from your sample was 0.6, so you were unable to reject the null hypothesis that BaiDu and Google have similar preference. What type of error did you make?

Type I (False Alarm)
Type II (Missed Detection)

Answer >>

In this example, there is really a difference in the population between men and women, but you did not find a significant difference in your sample. Failing to reject a false null hypothesis is a Type II error.


3 As the alpha level gets lower, which error rate also gets lower?

Type I
Type II

Answer >>

The Type I error rate is affected by the alpha level; the lower the alpha level is, the lower the Type I error rate gets. Alpha is the probability of a Type I error given that the null hypothesis is true.


4 Beta is the probability of which kind of error?

Type I
Type II

Answer >>

The probability of a Type II error is called beta. The probability of correctly rejecting a false null hypothesis equals 1- beta and is called power.


5 If the null hypothesis is false, you cannot make which kind of error?

Type I
Type II

Answer >>

A Type I error occurs when a significance test results in the rejection of a TRUE null hypothesis.


6 Failing to detect hardware problem can lead to multi-million dollars loss, whereas False Alarm would not be that serious. What should be the alpha level (significance level) to prevent the loss?

5%
1%

Answer >>

Significance level increases probability of detecting difference when there is no difference (e.g. the difference observed where due pure chance)


7 Who will benefit from the high probability of Type I Error?

Supplier
Client

Answer >>

High probability of Type I Error (False Alarm), will make the rejection of high quality product more likely, therefore benefiting Client


Template:Statistics Links