<?xml version="1.0"?>
<feed xmlns="http://www.w3.org/2005/Atom" xml:lang="en-GB">
	<id>https://training-course-material.com/index.php?action=history&amp;feed=atom&amp;title=Variability</id>
	<title>Variability - Revision history</title>
	<link rel="self" type="application/atom+xml" href="https://training-course-material.com/index.php?action=history&amp;feed=atom&amp;title=Variability"/>
	<link rel="alternate" type="text/html" href="https://training-course-material.com/index.php?title=Variability&amp;action=history"/>
	<updated>2026-05-13T10:33:00Z</updated>
	<subtitle>Revision history for this page on the wiki</subtitle>
	<generator>MediaWiki 1.45.1</generator>
	<entry>
		<id>https://training-course-material.com/index.php?title=Variability&amp;diff=16867&amp;oldid=prev</id>
		<title>Bernard Szlachta: /* Population Variance */</title>
		<link rel="alternate" type="text/html" href="https://training-course-material.com/index.php?title=Variability&amp;diff=16867&amp;oldid=prev"/>
		<updated>2014-05-27T08:35:37Z</updated>

		<summary type="html">&lt;p&gt;&lt;span class=&quot;autocomment&quot;&gt;Population Variance&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;b&gt;New page&lt;/b&gt;&lt;/p&gt;&lt;div&gt;{{Cat|Summarizing Distributions| 03}}&lt;br /&gt;
&lt;br /&gt;
=What is Variability=&lt;br /&gt;
Variability refers to &lt;br /&gt;
*how much the numbers in a distribution differ from each other. &lt;br /&gt;
*how &amp;quot;spread out&amp;quot; a group of scores is. &lt;br /&gt;
The terms variability, spread, and dispersion are synonyms, and refer to how spread out a distribution is. &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&amp;#039;&amp;#039;&amp;#039;Quiz Example&amp;#039;&amp;#039;&amp;#039;&lt;br /&gt;
&lt;br /&gt;
&amp;#039;&amp;#039;Quiz1&amp;#039;&amp;#039;&lt;br /&gt;
&lt;br /&gt;
[[File:Variability-definition1.jpg|300px]]&lt;br /&gt;
&lt;br /&gt;
&amp;#039;&amp;#039;Quiz2&amp;#039;&amp;#039;&lt;br /&gt;
&lt;br /&gt;
[[File:Variability-definition.jpg|300px]]&lt;br /&gt;
&lt;br /&gt;
* The graphs above represent the scores on two quizzes. &lt;br /&gt;
* The mean score for each quiz is 7.0. &lt;br /&gt;
* Despite the equality of means, you can see that the distributions are quite different. &lt;br /&gt;
* Specifically, the scores on Quiz 1 are more densely packed and those on Quiz 2 are more spread out. &lt;br /&gt;
* The differences among students were much greater on Quiz 2 than on Quiz 1.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=Measures of Variability=&lt;br /&gt;
There are four frequently used measures of variability: the range, interquartile range, variance, and standard deviation.&lt;br /&gt;
&lt;br /&gt;
==Range==&lt;br /&gt;
* The range is the simplest measure of variability to calculate, and one you have probably encountered many times in your life. &lt;br /&gt;
* The range is simply the highest score minus the lowest score. &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&amp;#039;&amp;#039;&amp;#039;Example 1&amp;#039;&amp;#039;&amp;#039;&lt;br /&gt;
* What is the range of the following group of numbers: 10, 2, 5, 6, 7, 3, 4? &lt;br /&gt;
* The highest number is 10, and the lowest number is 2, so 10 - 2 = 8. &lt;br /&gt;
* The range is 8. &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&amp;#039;&amp;#039;&amp;#039;Example 2&amp;#039;&amp;#039;&amp;#039;&lt;br /&gt;
* Here’s a dataset with 10 numbers: 99, 45, 23, 67, 45, 91, 82, 78, 62, 51. &lt;br /&gt;
* The highest number is 99 and the lowest number is 23, so 99 - 23 equals 76; the range is 76. &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&amp;#039;&amp;#039;&amp;#039;Quiz Example&amp;#039;&amp;#039;&amp;#039;&lt;br /&gt;
* Vonsider the two quizzes shown in the graphs above. &lt;br /&gt;
* On Quiz 1, the lowest score is 5 and the highest score is  9; the range is 4. &lt;br /&gt;
* On Quiz 2, the lowest score is 4 and the highest score is 10; the range is 6.&lt;br /&gt;
* The range on Quiz 2 was larger&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==Interquartile Range==&lt;br /&gt;
* The interquartile range (IQR) is the range of the middle 50% of the scores in a distribution. &lt;br /&gt;
* It is computed as follows:&lt;br /&gt;
 IQR = 75th percentile - 25th percentile&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&amp;#039;&amp;#039;&amp;#039;Quiz Example&amp;#039;&amp;#039;&amp;#039;&lt;br /&gt;
* For Quiz 1, the 75th percentile is 8 and the 25th percentile is 6; The interquartile range is 2. &lt;br /&gt;
* For Quiz 2, which has greater spread, the 75th percentile is 9, the 25th percentile is 5, and the interquartile range is 4. &lt;br /&gt;
* In box plots, the 75th percentile was called the upper hinge and the 25th percentile was called the lower hinge. &lt;br /&gt;
* Using this terminology, the interquartile range is referred to as the H-spread.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
===semi-interquartile range===&lt;br /&gt;
* The semi-interquartile range is defined simply as the interquartile range divided by 2. &lt;br /&gt;
* If a distribution is symmetric, the median plus or minus the semi-interquartile range contains half the scores in the distribution.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==Variance==&lt;br /&gt;
* Variability can also be defined in terms of how close the scores in the distribution are to the middle of the distribution. &lt;br /&gt;
* Using the mean as the measure of the middle of the distribution, the variance is defined as the average squared difference of the scores from the mean. &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
===Population Variance===&lt;br /&gt;
The formula for the variance is:&lt;br /&gt;
 [[File:Pop_var.gif]]&lt;br /&gt;
 where σ2 is the variance, μ is the mean, and N is the number of numbers&lt;br /&gt;
&lt;br /&gt;
:[[File:ClipCapIt-140527-093540.PNG]]&lt;br /&gt;
&lt;br /&gt;
&amp;#039;&amp;#039;&amp;#039;Quiz Example&amp;#039;&amp;#039;&amp;#039;&lt;br /&gt;
&lt;br /&gt;
The data from Quiz 1 are shown in Table below. &lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot; style=&amp;quot;text-align: center;&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Scores&lt;br /&gt;
! Deviation from Mean&lt;br /&gt;
! Squared Deviation&lt;br /&gt;
|-&lt;br /&gt;
| 9&lt;br /&gt;
| 2&lt;br /&gt;
| 4&lt;br /&gt;
|-&lt;br /&gt;
| 9&lt;br /&gt;
| 2&lt;br /&gt;
| 4&lt;br /&gt;
|-&lt;br /&gt;
| 9&lt;br /&gt;
| 2&lt;br /&gt;
| 4&lt;br /&gt;
|-&lt;br /&gt;
| 8&lt;br /&gt;
| 1&lt;br /&gt;
| 1&lt;br /&gt;
|-&lt;br /&gt;
| 8&lt;br /&gt;
| 1&lt;br /&gt;
| 1&lt;br /&gt;
|-&lt;br /&gt;
| 8&lt;br /&gt;
| 1&lt;br /&gt;
| 1&lt;br /&gt;
|-&lt;br /&gt;
| 8&lt;br /&gt;
| 1&lt;br /&gt;
| 1&lt;br /&gt;
|-&lt;br /&gt;
| 7&lt;br /&gt;
| 0&lt;br /&gt;
| 0&lt;br /&gt;
|-&lt;br /&gt;
| 7&lt;br /&gt;
| 0&lt;br /&gt;
| 0&lt;br /&gt;
|-&lt;br /&gt;
| 7&lt;br /&gt;
| 0&lt;br /&gt;
| 0&lt;br /&gt;
|-&lt;br /&gt;
| 7&lt;br /&gt;
| 0&lt;br /&gt;
| 0&lt;br /&gt;
|-&lt;br /&gt;
| 7&lt;br /&gt;
| 0&lt;br /&gt;
| 0&lt;br /&gt;
|-&lt;br /&gt;
| 6&lt;br /&gt;
| -1&lt;br /&gt;
| 1&lt;br /&gt;
|-&lt;br /&gt;
| 6&lt;br /&gt;
| -1&lt;br /&gt;
| 1&lt;br /&gt;
|-&lt;br /&gt;
| 6&lt;br /&gt;
| -1&lt;br /&gt;
| 1&lt;br /&gt;
|-&lt;br /&gt;
| 6&lt;br /&gt;
| -1&lt;br /&gt;
| 1&lt;br /&gt;
|-&lt;br /&gt;
| 6&lt;br /&gt;
| -1&lt;br /&gt;
| 1&lt;br /&gt;
|-&lt;br /&gt;
| 5&lt;br /&gt;
| -2&lt;br /&gt;
| 4&lt;br /&gt;
|-&lt;br /&gt;
| 5&lt;br /&gt;
| -2&lt;br /&gt;
| 4&lt;br /&gt;
|-&lt;br /&gt;
|colspan=&amp;quot;3&amp;quot; | Means&lt;br /&gt;
|-&lt;br /&gt;
| 7&lt;br /&gt;
| 0&lt;br /&gt;
| 1.5&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
* The mean score is 7.0.&lt;br /&gt;
* Therefore, the column &amp;quot;Deviation from Mean&amp;quot; contains the score minus 7. &lt;br /&gt;
* The column &amp;quot;Squared Deviation&amp;quot; is simply the previous column squared.&lt;br /&gt;
&lt;br /&gt;
* The mean deviation from the mean is 0. This will always be the case. &lt;br /&gt;
* The mean of the squared deviations is 1.5. &lt;br /&gt;
* Therefore, the variance is 1.5. &lt;br /&gt;
* Analogous calculations with Quiz 2 show that its variance is 6.7. &lt;br /&gt;
* Using the formula, for Quiz 1, μ = 7 and N = 20.&lt;br /&gt;
&lt;br /&gt;
===Sample Variance===&lt;br /&gt;
* If the variance in a sample is used to estimate the variance in a population, then the formula for Population Variance underestimates the variance and the following formula should be used is below.&lt;br /&gt;
 [[File:Sample var.gif]]&lt;br /&gt;
 where s2 is the estimate of the variance and M is the sample mean. &lt;br /&gt;
* Note that M is the mean of a sample taken from a population with a mean of μ. &lt;br /&gt;
* Since, in practice, the variance is usually computed in a sample, this formula is most often used.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&amp;#039;&amp;#039;&amp;#039;Example&amp;#039;&amp;#039;&amp;#039;&lt;br /&gt;
* Assume the scores 1, 2, 4, and 5 were sampled from a larger population. &lt;br /&gt;
* To estimate the variance in the population you would compute s2 as follows:&lt;br /&gt;
  M = (1 + 2 + 4 + 5)/4 = 12/4 = 3.&lt;br /&gt;
  s&amp;lt;sup&amp;gt;2&amp;lt;/sup&amp;gt; = [(1-3)&amp;lt;sup&amp;gt;2&amp;lt;/sup&amp;gt; + (2-3)&amp;lt;sup&amp;gt;2&amp;lt;/sup&amp;gt; + (4-3)&amp;lt;sup&amp;gt;2&amp;lt;/sup&amp;gt; + (5-3)&amp;lt;sup&amp;gt;2&amp;lt;/sup&amp;gt;]/(4-1)&lt;br /&gt;
     = (4 + 1 + 1 + 4)/3 = 10/3 = 3.333&lt;br /&gt;
&lt;br /&gt;
===Alternate Formulas ===&lt;br /&gt;
* There are alternate formulas that can be easier to use if you are doing your calculations with a hand calculator.&lt;br /&gt;
* These formulas are subject to rounding error if your values are very large and/or you have an extremely large number of observations.&lt;br /&gt;
[[File:Comp varp.gif]] [[File:Comp var.gif]]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
For the example above,&lt;br /&gt;
&lt;br /&gt;
[[File:Formula5.jpg]]&lt;br /&gt;
&lt;br /&gt;
==Standard Deviation==&lt;br /&gt;
* The standard deviation is the square root of the variance. &lt;br /&gt;
&lt;br /&gt;
&amp;#039;&amp;#039;&amp;#039;Quiz Example&amp;#039;&amp;#039;&amp;#039;&lt;br /&gt;
The standard deviations of the two quiz distributions 1.225 and 2.588. &lt;br /&gt;
&lt;br /&gt;
===Standard Deviation and Normal Distribution===&lt;br /&gt;
* The standard deviation is an especially useful measure of variability when the distribution is normal or approximately normal because the proportion of the distribution within a given number of standard deviations from the mean can be calculated. &lt;br /&gt;
&lt;br /&gt;
* 68% of the distribution is within one standard deviation of the mean and approximately 95% of the distribution is within two standard deviations of the mean. &lt;br /&gt;
* Therefore, if you had a normal distribution with a mean of 50 and a standard deviation of 10, then 68% of the distribution would be between 50 - 10 = 40 and 50 +10 =60. &lt;br /&gt;
&lt;br /&gt;
* Similarly, about 95% of the distribution would be between 50 - 2 x 10 = 30 and 50 + 2 x 10 = 70. &lt;br /&gt;
* The symbol for the population standard deviation is σ; the symbol for an estimate computed in a sample is s. &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&amp;#039;&amp;#039;&amp;#039;Example&amp;#039;&amp;#039;&amp;#039;&lt;br /&gt;
&lt;br /&gt;
[[File:Std.PNG]]&lt;br /&gt;
&lt;br /&gt;
Figure above shows two normal distributions. &lt;br /&gt;
* The red distribution has a mean of 40 and a standard deviation of 5&lt;br /&gt;
* The blue distribution has a mean of 60 and a standard deviation of 10&lt;br /&gt;
* For the red distribution, 68% of the distribution is between 35 and 45&lt;br /&gt;
* for the blue distribution, 68% is between 50 and 70.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==Quiz==&lt;br /&gt;
&amp;lt;quiz display=simple &amp;gt;&lt;br /&gt;
{ What is the range of 2, 4, 6, and 8?  &lt;br /&gt;
&lt;br /&gt;
|type=&amp;quot;{}&amp;quot;}&lt;br /&gt;
{ 6 }&lt;br /&gt;
&lt;br /&gt;
{&lt;br /&gt;
{{Show Answer|&lt;br /&gt;
6&lt;br /&gt;
&lt;br /&gt;
8 - 2 is 6&lt;br /&gt;
}}&lt;br /&gt;
}&lt;br /&gt;
&lt;br /&gt;
{Would the variance of 10, 12, 17, 20, 25, 27, 42, and 45 be larger if the numbers represented a population or a sample?&lt;br /&gt;
&lt;br /&gt;
|type=&amp;quot;()&amp;quot;}&lt;br /&gt;
-Population&lt;br /&gt;
+Sample&lt;br /&gt;
&lt;br /&gt;
{&lt;br /&gt;
{{Show Answer|&lt;br /&gt;
Sample&lt;br /&gt;
&lt;br /&gt;
The variance would be larger if these numbers represented a sample because you would divide by N-1 (instead of just N).&lt;br /&gt;
}}&lt;br /&gt;
}&lt;br /&gt;
&lt;br /&gt;
{What is the standard deviation of this sample?&lt;br /&gt;
 Y&lt;br /&gt;
  8&lt;br /&gt;
 15&lt;br /&gt;
 20&lt;br /&gt;
 12&lt;br /&gt;
 13&lt;br /&gt;
 11&lt;br /&gt;
 13&lt;br /&gt;
 15&lt;br /&gt;
&lt;br /&gt;
|type=&amp;quot;{}&amp;quot;}&lt;br /&gt;
{ 3.5026 }&lt;br /&gt;
&lt;br /&gt;
{&lt;br /&gt;
{{Show Answer|&lt;br /&gt;
3.5026&lt;br /&gt;
}}&lt;br /&gt;
}&lt;br /&gt;
&lt;br /&gt;
{What is the interquartile range of these numbers?&lt;br /&gt;
 Z&lt;br /&gt;
 12&lt;br /&gt;
 13&lt;br /&gt;
 14&lt;br /&gt;
 15&lt;br /&gt;
  9&lt;br /&gt;
 10&lt;br /&gt;
 16&lt;br /&gt;
 10&lt;br /&gt;
  8&lt;br /&gt;
 10&lt;br /&gt;
 11&lt;br /&gt;
 12&lt;br /&gt;
 13&lt;br /&gt;
 22&lt;br /&gt;
 23&lt;br /&gt;
 24&lt;br /&gt;
 25&lt;br /&gt;
&lt;br /&gt;
|type=&amp;quot;{}&amp;quot;}&lt;br /&gt;
{ 9 }&lt;br /&gt;
&lt;br /&gt;
{&lt;br /&gt;
{{Show Answer|&lt;br /&gt;
9&lt;br /&gt;
&lt;br /&gt;
25th% is 10, 75th% is 19, 19 - 10 is 9&lt;br /&gt;
}}&lt;br /&gt;
}&lt;/div&gt;</summary>
		<author><name>Bernard Szlachta</name></author>
	</entry>
</feed>