Sigma Rules and Measures of Central Location
๐น Video Overviewโ
๐ฏ What We're Learning Todayโ
Main Topics:
-
Sigma Rules (โ) - Math shortcuts for summations
-
Measures of Central Location - Finding the "middle" or "typical" value
-
Mode (most common)
-
Mean (average)
-
Median (middle value)
-
Part 1: Sigma Rules (โ)โ
๐ค What is Sigma (โ)?โ
Sigma (โ) = Mathematical shorthand for "add everything up"
Example: Defective items in 10 shipments: 3, 4, 1, 5, 2, 3, 2, 6, 3, 1
Instead of writing: xโ + xโ + xโ + xโ + xโ + xโ + xโ + xโ + xโ + xโโ
We write:
๐ก Memory hack: Think of โ as a "smart calculator" that knows to add everything from position 1 to n.
Breaking down the notation:
n โ Stop here (n = 10 in our example)
โ x_i โ Sum all x values
i=1 โ Start here (first observation)
๐ The 5 Essential Sigma Rulesโ
Rule 1: Sum of a Constantโ
Formula:
Plain English: If you add the same number n times, just multiply!
Example: Data series: 3, 3, 3, 3, 3 (n = 5)
๐ก Memory hack: Adding 5 threes is just 5 ร 3. Don't overthink it!
Rule 2: Sum of a Constant ร Variableโ
Formula:
Plain English: You can "pull out" the constant from the sum!
Example: Defective items: 3, 4, 1, 5, 2, 3, 2, 6, 3, 1
If each defective item costs 5 NIS (a = 5), what's the total damage?
๐ก Memory hack: The constant is like a "multiplier hat" - you can put it on at the end instead of on each item!
Rule 3: Sum of Additionโ
Formula:
Plain English: Sum of sums = sum of each separately!
Example:
| xแตข | yแตข | xแตข + yแตข |
|---|---|---|
| 1 | 3 | 4 |
| 2 | 4 | 6 |
| 3 | 6 | 9 |
OR:
๐ก Memory hack: Addition is "friendly" - you can split it up!
Rule 4: Sum of Multiplication (TRICKY!)โ
Formula:
Plain English: You CANNOT split multiplication! Must multiply first, then sum.
Example:
| xแตข | yแตข | xแตข ร yแตข |
|---|---|---|
| 1 | 3 | 3 |
| 2 | 4 | 8 |
| 3 | 6 | 18 |
CORRECT:
WRONG:
โ ๏ธ CRITICAL: Multiplication is NOT friendly! Don't split it!
๐ก Memory hack: "Multiply INSIDE the sum, not outside!"
Rule 5: Sum of Squares (ALSO TRICKY!)โ
Formula:
Plain English: Square each value first, THEN sum. Not the other way around!
Example:
| xแตข | xแตขยฒ |
|---|---|
| 1 | 1 |
| 2 | 4 |
| 3 | 9 |
CORRECT:
WRONG:
โ ๏ธ CRITICAL: Square INSIDE first, then add!
๐ก Memory hack: "Square the individuals, not the team!"
๐ Sigma Rules Quick Referenceโ
๐งฎ Practice Problem: Complete Walkthroughโ
Given:
| xแตข | yแตข |
|---|---|
| 1 | 3 |
| 2 | 4 |
| 3 | 6 |
Problem 1: Calculate
Solution:
Problem 2: Calculate
Solution:
First expand:
Calculate each part:
Final answer:
Part 2: Measures of Central Locationโ
๐ฏ The Big Questionโ
What's a "typical" or "representative" value in the data?
Example claim: "Economists earn more than teachers"
Does EVERY economist earn more than EVERY teacher? No!
So we need a way to describe the "center" of the data.
Three main measures:

๐ Measure 1: Mode (xฬ)โ
Definition: The value (or category) that appears MOST frequently
Notation: xฬ (x-hat)
For Qualitative Variables:โ
Example: Preferred social network
| Social Network | f(x) |
|---|---|
| 43 | |
| 16 | |
| 4 | |
| TikTok | 2 |
| 1 | |
| None | 7 |
Mode = Instagram (highest frequency)
๐ก Memory hack: Mode = Most popular = Most Often Displayed Everywhere
For Discrete Quantitative Variables:โ
Example: Number of people in family
| People (x) | f(x) |
|---|---|
| 2 | 3 |
| 3 | 2 |
| 4 | 1 |
| 5 | 3 |
| 6 | 1 |
Mode = 2 and 5 (both appear 3 times - bimodal!)
For Continuous Quantitative Variables:โ
Example: Test scores
| Scores | f(x) | l | d |
|---|---|---|---|
| 40-60 | 5 | 20 | 0.25 |
| 60-70 | 5 | 10 | 0.5 |
| 70-75 | 10 | 5 | 2 |
| 75-85 | 10 | 10 | 1 |
| 85-100 | 15 | 15 | 1 |
Mode = 72.5 (middle of class 70-75, which has highest density d = 2)
โ ๏ธ Important: For continuous variables, use DENSITY (d), not frequency!
๐ก Memory hack: The modal class is the "densest crowd"
๐ Characteristics of Mode:โ
โ Not affected by extreme values
โ Not affected by other frequencies (only cares about the winner)
โ Can have multiple modes (bimodal, multimodal)
โ Might not represent the "center" well
๐ Measure 2: Mean (xฬ)โ
Definition: The arithmetic average - sum all values and divide by count
Notation: xฬ (x-bar)
Basic Formula:โ
Example: Number of people in family: 2, 2, 6, 5, 3, 5, 5, 4, 3, 2
๐ก Memory hack: Mean = What everyone would get if we distributed equally
Mean from Frequency Table:โ
Formula:
Why? If 2 appears 3 times, instead of writing 2+2+2, write 2ร3!
Example:
| x | f(x) | p(x) | x ร f(x) | x ร p(x) |
|---|---|---|---|---|
| 2 | 3 | 30% | 6 | 0.6 |
| 3 | 2 | 20% | 6 | 0.6 |
| 4 | 1 | 10% | 4 | 0.4 |
| 5 | 3 | 30% | 15 | 1.5 |
| 6 | 1 | 10% | 6 | 0.6 |
| Total | 10 | 100% | 37 | 3.7 |
Using frequencies:
Using relative frequencies:
๐ก Memory hack: "Multiply before you divide" - weight each value by how often it appears!
Mean for Continuous Variables:โ
Use the MIDPOINT of each class!
Example: Test scores
| Scores | f(x) | Midpoint (xแตข) | xแตข ร f(x) |
|---|---|---|---|
| 40-60 | 5 | 50 | 250 |
| 60-70 | 5 | 65 | 325 |
| 70-75 | 10 | 72.5 | 725 |
| 75-85 | 10 | 80 | 800 |
| 85-100 | 10 | 92.5 | 925 |
| Total | 40 | - | 3025 |
How to find midpoint:
Example: For 40-60 โ Midpoint = (40+60)/2 = 50
๐ Characteristics of Mean:โ
โ Uses all data - every value matters
โ Most common measure - used everywhere
โ Affected by extreme values (outliers can pull it way off!)
โ May not be an actual data value (can't have 3.7 people!)
Special Property: Sum of deviations from mean = 0
Example: Data: 2, 2, 6, 5, 3, 5, 5, 4, 3, 2 (xฬ = 3.7)
| xแตข | xแตข - xฬ |
|---|---|
| 2 | -1.7 |
| 2 | -1.7 |
| 6 | +2.3 |
| 5 | +1.3 |
| 3 | -0.7 |
| 5 | +1.3 |
| 5 | +1.3 |
| 4 | +0.3 |
| 3 | -0.7 |
| 2 | -1.7 |
Sum = 0 โ
๐ก Memory hack: The mean is like a "balance point" - negatives and positives cancel out!
๐ Measure 3: Median (xฬ)โ
Definition: The middle value when data is sorted - splits data 50-50
Notation: xฬ (x-tilde)
How to Find Median:โ
Step 1: Sort data from smallest to largest
Step 2: Find the middle position
If n is ODD:โ
Formula:
Example: Scores: 50, 60, 60, 70, 80 (n = 5)
Position of median = (5+1)/2 = 3rd value
Median = 60
If n is EVEN:โ
Formula:
Example: Scores: 50, 60, 60, 70, 80, 90 (n = 6)
Positions: 6/2 = 3rd and 4th values
Median = (60 + 70)/2 = 65
๐ก Memory hack:
-
Odd: One clear middle person
-
Even: Two middle people, average them!
Median from Frequency Table:โ
Example: Number of people in family (n = 20)
| x | f(x) | F(x) |
|---|---|---|
| 2 | 3 | 3 |
| 3 | 5 | 8 |
| 4 | 6 | 14 |
| 5 | 3 | 17 |
| 6 | 3 | 20 |
Find: Position n/2 = 20/2 = 10th value
Look at F(x):
-
F(4) = 14 (14 observations up to x=4)
-
F(3) = 8 (8 observations up to x=3)
The 10th value is in the x=4 category!
Median = (xโโ + xโโ)/2 = (4 + 4)/2 = 4
Median for Continuous Variables:โ
Find the "median class" - the class containing position n/2
Example: Test scores (n = 100)
| Scores | f(x) | F(x) |
|---|---|---|
| 40-60 | 5 | 5 |
| 60-70 | 10 | 15 |
| 70-75 | 20 | 35 |
| 75-85 | 40 | 75 |
| 85-100 | 25 | 100 |
Position n/2 = 100/2 = 50th value
-
F(70-75) = 35 (not enough)
-
F(75-85) = 75 (includes position 50!)
Median is in class 75-85
๐ก Memory hack: Keep adding F(x) until you pass n/2!
๐ Characteristics of Median:โ
โ Not affected by extreme values (only cares about position, not actual values!)
โ Always an actual possible value (for discrete) or in a real class
โ Doesn't use information from all values
Powerful example:
Salaries: 3000, 4000, 4700, 5000, 5500
Median = 4700 (middle value)
Now change 5500 to 5,500,000!
Median still = 4700 (position unchanged!)
Mean would jump dramatically!
๐ก Memory hack: Median is the "bodyguard" - protects against extreme outliers!
๐ Linear Transformationsโ
What if we add/multiply all values by a constant?
Rules:
If (where a and b are constants):
Example: Test scores of 7 students: 91, 77, 65, 83, 88, 71, 98
Teacher adds 2 points to everyone:
New mean:
๐ก Memory hack: "What you do to the data, you do to the measures!"
๐ Distribution Shapes & Central Measuresโ
Symmetric Bell-Shaped (Normal Distribution):โ
๐
๐๐๐
๐๐๐๐๐
๐๐๐๐๐๐๐
Mode = Median = Mean
All three are equal!
Symmetric (Two Peaks):โ
๐ ๐
๐๐ ๐๐
๐๐๐๐๐๐
Mode < Median = Mean
Two modes, median and mean still equal
Uniform Distribution:โ
๐๐๐๐๐๐๐
No clear mode
Median = Mean
Positive Skew (Right Tail):โ
๐๐๐
๐๐๐๐
๐๐๐๐๐๐ โ
Mode < Median < Mean
Mean pulled by high values!
๐ก Memory hack: "Mean follows the tail" - gets pulled toward outliers
Example: Salaries - few very high earners pull mean up
Negative Skew (Left Tail):โ
๐๐๐
๐๐๐๐
โ ๐๐๐๐๐๐
Mean < Median < Mode
Mean pulled by low values!
๐ Decision Guide: Which Measure to Use?โ
| Situation | Best Measure | Why |
|---|---|---|
| Symmetric data, no outliers | Mean | Uses all info, most precise |
| Skewed data | Median | Not affected by extreme values |
| Categorical data | Mode | Only option! |
| Need "most typical" | Mode | Most common actual value |
| Extreme outliers present | Median | Robust against extremes |
๐ฏ Test Question Practiceโ
Question: "University graduate salaries are positively skewed. Therefore, the percentage earning above average is greater than the percentage earning below average."
True or False?
Answer: FALSE!
Explanation:
-
Positively skewed โ tail on right (high salaries)
-
Mean gets pulled UP by extreme high salaries
-
Mean > Median
-
Since median splits data 50-50:
-
50% earn below median
-
50% earn above median
-
-
Since mean > median:
-
MORE than 50% earn below mean
-
LESS than 50% earn above mean
-
๐ก Memory hack: In positive skew, the mean "chases" the few rich people, leaving most below average!
๐ Advanced Conceptsโ
Skewnessโ
Measures asymmetry of distribution:
Rule of thumb:
-
Between -0.5 and 0.5 โ Approximately symmetric
-
Between 0.5 and 1 (or -0.5 and -1) โ Moderately skewed
-
Beyond ยฑ1 โ Highly skewed
Kurtosisโ
Measures "peakedness" of distribution:
-
Mesokurtic (Kurt = 3): Normal distribution
-
Leptokurtic (Kurt > 3): Tall and thin peak, heavy tails
-
Platykurtic (Kurt < 3): Flat peak, light tails
๐ Key Formulas Summaryโ
| Measure | Formula | Use When |
|---|---|---|
| Mode | Value with highest f(x) or d | Any variable type |
| Mean (raw) | Individual data | |
| Mean (frequency) | Frequency table | |
| Mean (relative) | Relative frequency | |
| Median (odd n) | Sorted odd data | |
| Median (even n) | Sorted even data | |
| Linear transform | When transforming all data |
๐ก Master Memory Hacksโ
-
Sigma โ = Smart calculator that adds for you
-
Mode = Most Often Displayed Everywhere
-
Mean = Everyone gets equal share
-
Median = The middleman/bodyguard (protects from extremes)
-
Multiplication & Squares = Cannot split the sum!
-
Mean follows the tail in skewed distributions
-
Midpoint = Average of class limits
-
F(x) = Climbing stairs - keep adding
๐ฏ Quick Reference Mind Mapโ
โ ๏ธ Common Exam Mistakesโ
โ Splitting multiplication/squares in sigma sums
โ Using frequency instead of density for continuous mode
โ Forgetting to sort data before finding median
โ Confusing "at least" with "at most"
โ Thinking positive skew means more above average
โ Using mean when there are extreme outliers
๐ Pro Exam Tipsโ
-
See โ? Check if you can pull constants out!
-
Finding mode in continuous? Look for highest DENSITY (d), not frequency!
-
Extreme values present? Use median, not mean
-
Positive skew? Mean > Median > Mode (remember: mean follows tail)
-
Linear transformation? Apply it to the measures directly
-
Frequency table? Use weighted formula (x ร f)
Remember: The distribution shape tells you the relationship between mode, median, and mean!