chapter 1 descriptive stats 2 2012

26
CHAPTER 1 DESCRIPTIVE STATISTICS WEEK 1 L2 - Graphical display of Data

Upload: jollychang1

Post on 16-Feb-2016

232 views

Category:

Documents


0 download

DESCRIPTION

Probability and Stat

TRANSCRIPT

Page 1: Chapter 1 Descriptive Stats 2 2012

CHAPTER 1 DESCRIPTIVE STATISTICS

WEEK 1

L2 - Graphical display of Data

Page 2: Chapter 1 Descriptive Stats 2 2012

At the end of the lesson, students should be able to:

Construct and interpret pictorial and tabular display of data

Learning Objectives:

Page 3: Chapter 1 Descriptive Stats 2 2012

Pictorial & Tabular Methods

1. Stem-and-Leaf Displays:

How to construct a Stem-and-Leaf Display:

1. Each numerical data is divided into two parts:

- The leading digit(s) becomes the stem,

and the remaining digit(s) becomes the leaf

2. List the stem values in a vertical column.

3. Record the leaf for each observation beside its stem.

4. Write the units for stems and leaves on the display.

Page 4: Chapter 1 Descriptive Stats 2 2012

Result of Math. Exam.

of a 50-student class:

35 42 56 41 63 26 37 66 92 16

49 28 56 64 72 59 17 45 56 29

30 45 39 37 43 76 73 64 51 60

40 52 57 65 83 68 52 84 91 64

45 76 56 90 73 34 26 57 41 56

Stem & Leaf Display

Page 5: Chapter 1 Descriptive Stats 2 2012

Place the numbers in order from smallest to the largest

16 17 26 26 28

29 30 34 35 37

37 39 40 41 41

42 43 45 45 45

49 51 52 52 56

56 56 56 56 57

57 59 60 63 64

64 64 65 66 68

72 73 73 76 76

83 84 90 91 92

1 6 7

2 6 6 8 9

3 0 4 5 7 7 9

4 0 1 1 2 3 5 5 5 9

5 1 2 2 6 6 6 6 6 7 7 9

6 0 3 4 4 4 5 6 8

7 2 3 3 6 6

8 3 4

9 0 1 2

Stem-and-Leaf Display

Stem & Leaf Display

Stem: tens digit

Leaf: ones digit

Page 6: Chapter 1 Descriptive Stats 2 2012

2. Histogram:

A bar graph representing a frequency distribution of a

quantitative variable.A histogram is made up of the following

components. Histograms are used to summarize large data sets.

0.20

0.30 Rel. Freq.

Histogram: ages of 100 students Age Freq. Rel. Freq.

18 20 0.20

19 24 0.24

20 26 0.26

21 18 0.18

22 5 0.05

23 3 0.03

24 2 0.02

25 2 0.02

Sum 100 1.00

Page 7: Chapter 1 Descriptive Stats 2 2012

3. Box plot:

a graphical display that simultaneously describes several

important features of a data set:

center

Spread

departure from symmetry

identification of outliers

a box plot displays the median, the first quartile and the third

quartiles on a rectangular box, aligned either horizontally or

vertically.

sometimes called box whiskers plot.

Page 8: Chapter 1 Descriptive Stats 2 2012

HOW TO CONSTRUCT A BOX PLOT

Page 9: Chapter 1 Descriptive Stats 2 2012

Arrange the observations x1, …, xn in increasing

order:

)n(2)1( xxx

odd.isnifx

evenisnifxx2

1

x

)(

1)()(

21n

2n

2n~

Use the following rule:

Numerical Summary : Sample Median

The median of a sample depends on whether the number of terms in the

sample is even or odd.

•If the number of terms is odd, then the median is the value of the

term in the middle.

•If the number of terms is even, then the median is the average of

the two terms in the middle

Page 10: Chapter 1 Descriptive Stats 2 2012

Example 1: Find Median for the following observations:

0.3 7.8 4.6 3.7 9.2 12.1 -5 -2.5 10.8

Numerical Summary : Sample Median

Page 11: Chapter 1 Descriptive Stats 2 2012

odd.isnifx

evenisnifxx2

1

x

)(

1)()(

21n

2n

2n~

Example 1: Find Median for the following observations:

0.3 7.8 4.6 3.7 9.2 12.1 -5 -2.5 10.8

Arrange the observations in increasing order: n = 9

- 5 -2.5 0.3 3.7 4.6 7.8 9.2 10.8 12.1

Numerical Summary : Sample Median

Page 12: Chapter 1 Descriptive Stats 2 2012

Example 2: Find Median for given observations :

2.8 5.2 -2.3 2.6 3.6 1.4 6.9 4.3 8.4 2.8

Page 13: Chapter 1 Descriptive Stats 2 2012

Example 2: Find Median for given observations :

2.8 5.2 -2.3 2.6 3.6 1.4 6.9 4.3 8.4 2.8

Rearrange the observations in increasing order:

- 2.3 1.4 2.6 2.8 2.8 3.6 4.3 5.2 6.9 8.4

odd.isnifx

evenisnifxx2

1

x

)(

1)()(

21n

2n

2n~

Median = (2.8 + 3.6)/2 = 3.2

Page 14: Chapter 1 Descriptive Stats 2 2012

Percentile:

Measure of central tendency that divide a group of data into 100 parts.

At least n% of the data lie between the nth percentile and at most (100-n)% of the data lie above the nth percentile

90 percentile:

Nth percentile:

At least 90% of the data lie between the 90th percentile and at most (10)% of the data lie above the 90th percentile

Page 15: Chapter 1 Descriptive Stats 2 2012

LQ (Q1) is 25 percentile

Median (Q2) is 50 percentile

UP (Q3) is 75 percentile

At least 25% of the data lie between the 25th percentile and at most (75)% of the data lie above the 25th percentile

25 percentile = Q1

Page 16: Chapter 1 Descriptive Stats 2 2012

LQ (Q1) and UQ (Q3) are defined as follows

Step 1. Arrange the values in increasing order

Step 2. Q1 is the value in position 0.25(n+1)

Q3 is the value in position 0.75(n+1)

Step 3. If the positions are not integers, Q1 and Q3 are found by interpolation, using adjacent values

IQR = Q3 – Q1

Page 17: Chapter 1 Descriptive Stats 2 2012

Example 1: (values are arranged in increasing order)

- 5 -2.5 0.4 3.7 4.6 7.8 9.2 10.8 12.1 13.5 14

n = 11, 0.25(n+1) = 0.25(12) = 3;

0.75(n+1) = 0.75(12) = 9

Q1 = x(3) = 0.4,

Q3 = x(9) = 12.1,

and IQR = 12.1 – 0.4 = 11.7

Page 18: Chapter 1 Descriptive Stats 2 2012

Example 2: (values are arranged in increasing order)

- 5 - 4 2 6 6.5 7.8 9.2 10.8 12.5 14.5 15 16.4

n=12,

0.25(n+1) = 0.25(13)=3.25; 0.75(n+1)=0.75(13) = 9.75

Q1 = x(3) + 0.25(x(4) - x(3))= 2 + 0.25(6 – 2) = 2 + 0.25(4) = 3

Q3 = x(9) + 0.75(x(10) - x(9))= 12.5 + 0.75(14.5 – 12.5) = 14

Page 19: Chapter 1 Descriptive Stats 2 2012

Example 3: (values are arranged in increasing order)

2 5 9 9.8 10.2 10.8 12.5 14 16.4 18.7

n=10,

0.25(n+1) = 0.25(11)=2.75; 0.75(n+1)=0.75(11) = 8.25

Q1 = x(2) + 0.75(x(3) - x(2))= 5 + 0.75(9 – 5) = 5 + 0.75(4) = 8

Q3 = x(8) + 0.25(x(9) - x(8))= 14 + 0.25(16.4 – 14) = 14.6

Page 20: Chapter 1 Descriptive Stats 2 2012

Example 2

The following “ cold start ignition time” of an automobile

engine obtained for a test vehicle are as follows:

1.75 1.92 2.62 2.35 3.09 3.15 2.53 1.91

Solution:

a) Calculate the sample median, the quartiles and the IQR

b) Construct a box plot of the data.

Rank the n = 8 measurements from smallest to largest

1.75 1.91 1.92 2.35 2.53 2.62 3.09 3.15

sample median: since n is even )(

2

1~12/2/ nn xxx

44.2)53.235.2(2

1)(

2

1~54 xxx

Page 21: Chapter 1 Descriptive Stats 2 2012

Solution:

1.75 1.91 1.92 2.35 2.53 2.62 3.09 3.15

Lower quartile: )25.2())18(25.0())1(25.0(1 xxxQ n

9125.1)91.192.1(25.091.1)(25.0 23)2(1 xxxQ

Upper quartile: )75.6())18(75.0())1(75.0(3 xxxQ n

9725.2)62.209.3(75.062.2)(75.0 67)6(3 xxxQ

IQR:

06.19725.19125.213 QQ

Page 22: Chapter 1 Descriptive Stats 2 2012

b) Construct a box plot of the data.

Max: 3.15

Q3 = 2.9725

Q2 = 2.44 IQR = 1.06

Q1 = 1.9125

Min: 1.75

4.5625

0.3225

1.5IQR

1.5IQR

Page 23: Chapter 1 Descriptive Stats 2 2012

Example 3 Suppose that the ages of thirty UTP students live in village 2 are follow:

Solution:

a) Calculate the sample median, the quartiles and the IQR

b) Construct a box plot of the data.

18, 20, 21, 26, 24, 19, 25, 20, 22, 21,

19, 24, 25, 28, 24, 20, 26, 20, 35, 17,

18, 24, 20, 21, 22, 27, 25, 28, 27, 24.

Step 1: Place the number in order from smallest to the largest

17, 18, 18, 19, 19, 20, 20, 20, 20, 20,

21, 21, 21, 22, 22, 24, 24, 24, 24, 24,

25, 25, 25, 25, 26, 26, 27, 27, 28, 35.

Step 2: The median, Q2 , the lower quartile, Q1, the upper quartile, Q3, and

the IQR are:

Page 24: Chapter 1 Descriptive Stats 2 2012

The median, Q2= (X15+X16)/2 = (22 + 24)/2 = 23

The position of Q1= 0.25(n+1) = 0.25(31) = 7.75

Q1 = X7+0.75(X8 - X7 ) = 20 + 0.75(20-20) = 20

The position of Q3= 0.75(n+1) = 0.75(31) = 23.25

Q3 = X23 + 0.25(X24 – X23 ) = 25 + 0.25(26 - 25) = 25.25

Step 2: The IQR = Q3 - Q1 = 25.25 – 20 = 5.25, 1.5IQR = 1.5(5.25) = 7.875

Step 2: Start to draw the Box-Plot:

17 28 o35 outlier

Q1 =20 Q2 =23 Q3=25.25

12.125.<………1.5IQR………>< ------------…..IQR=5.25--------------><……1.5IQR………>.33.125

Page 25: Chapter 1 Descriptive Stats 2 2012

QUIZ 1

79, 53, 82, 91, 87, 98, 80, 93

Lower quartile: )25.2())18(25.0())1(25.0(1 xxxQ n

913.1)91.192.1(25.091.1)(25.0 23)2(1 xxxQ

Upper quartile: )75.6())18(75.0())1(75.0(3 xxxQ n

973.2)62.209.3(75.062.2)(75.0 67)6(3 xxxQ

IQR:

06.1913.1973.213 QQ

Page 26: Chapter 1 Descriptive Stats 2 2012

QUIZ 1

The following data are given as follows:

82 79 53 91 87 98 93 80

a) Calculate the sample mean and sample variance

c) Construct a complete box plot of the data

d) Identify the possible outlier(s) if any

b) Calculate the sample median, the quartiles and the IQR