essentials of business statistics: communicating with numbers by sanjiv jaggia and alison kelly...
TRANSCRIPT
![Page 1: Essentials of Business Statistics: Communicating with Numbers By Sanjiv Jaggia and Alison Kelly Copyright © 2014 by McGraw-Hill Higher Education. All rights](https://reader033.vdocuments.us/reader033/viewer/2022061306/55147355550346494e8b6186/html5/thumbnails/1.jpg)
Essentials of Business Statistics: Communicating with Numbers
By Sanjiv Jaggia and Alison KellyCopyright © 2014 by McGraw-Hill Higher Education. All rights reserved.
![Page 2: Essentials of Business Statistics: Communicating with Numbers By Sanjiv Jaggia and Alison Kelly Copyright © 2014 by McGraw-Hill Higher Education. All rights](https://reader033.vdocuments.us/reader033/viewer/2022061306/55147355550346494e8b6186/html5/thumbnails/2.jpg)
3-2Numerical Descriptive Measures
Chapter 3 Learning Objectives
LO 3.1 Calculate and interpret the mean, the median, and the mode.
LO 3.2 Calculate and interpret percentiles and a box plot.LO 3.3 Calculate and interpret the range, the mean absolute
deviation, the variance, the standard deviation, and the coefficient of variation.
LO 3.4 Explain mean-variance analysis and the Sharpe ratio.LO 3.5 Apply Chebyshev’s Theorem, the empirical rule, and z-
scores.LO 3.6 Calculate the mean and the variance for grouped data.LO 3.7 Calculate and interpret the covariance and the correlation
coefficient.
![Page 3: Essentials of Business Statistics: Communicating with Numbers By Sanjiv Jaggia and Alison Kelly Copyright © 2014 by McGraw-Hill Higher Education. All rights](https://reader033.vdocuments.us/reader033/viewer/2022061306/55147355550346494e8b6186/html5/thumbnails/3.jpg)
3-3Numerical Descriptive Measures
The arithmetic mean is a primary measure of central location.
LO 3.1 Calculate and interpret the arithmetic mean, the median, and the mode.
3.1 Measures of Central Location
x
Sample Mean Population Mean
![Page 4: Essentials of Business Statistics: Communicating with Numbers By Sanjiv Jaggia and Alison Kelly Copyright © 2014 by McGraw-Hill Higher Education. All rights](https://reader033.vdocuments.us/reader033/viewer/2022061306/55147355550346494e8b6186/html5/thumbnails/4.jpg)
3-4Numerical Descriptive Measures
The mean is sensitive to outliers. Consider the salaries of employees at Acetech.
LO 3.1 3.1 Measures of Central Location
This mean does not reflect the typical salary!
![Page 5: Essentials of Business Statistics: Communicating with Numbers By Sanjiv Jaggia and Alison Kelly Copyright © 2014 by McGraw-Hill Higher Education. All rights](https://reader033.vdocuments.us/reader033/viewer/2022061306/55147355550346494e8b6186/html5/thumbnails/5.jpg)
3-5Numerical Descriptive Measures
The median is another measure of central location that is not affected by outliers.
When the data are arranged in ascending order, the median is
the middle value if the number of observations is odd, or
the average of the two middle values if the number of observations is even.
LO 3.1 3.1 Measures of Central Location
![Page 6: Essentials of Business Statistics: Communicating with Numbers By Sanjiv Jaggia and Alison Kelly Copyright © 2014 by McGraw-Hill Higher Education. All rights](https://reader033.vdocuments.us/reader033/viewer/2022061306/55147355550346494e8b6186/html5/thumbnails/6.jpg)
3-6Numerical Descriptive Measures
The mode is another measure of central location. The most frequently occurring value in a data set Used to summarize qualitative data A data set can have no mode, one mode (unimodal),
or many modes (multimodal). Consider the salary of employees at Acetech
The mode is $40,000 since this value appears most often.
LO 3.1 3.1 Measures of Central Location
![Page 7: Essentials of Business Statistics: Communicating with Numbers By Sanjiv Jaggia and Alison Kelly Copyright © 2014 by McGraw-Hill Higher Education. All rights](https://reader033.vdocuments.us/reader033/viewer/2022061306/55147355550346494e8b6186/html5/thumbnails/7.jpg)
3-7Numerical Descriptive Measures
Weighted Mean
Let w1, w2, . . . , wn denote the weights of the sample observations x1, x2, . . . , xn such that
w1 + w2 + . . . + wn = 1, then
i ix w x
LO 3.1 3.1 Measures of Central Location
![Page 8: Essentials of Business Statistics: Communicating with Numbers By Sanjiv Jaggia and Alison Kelly Copyright © 2014 by McGraw-Hill Higher Education. All rights](https://reader033.vdocuments.us/reader033/viewer/2022061306/55147355550346494e8b6186/html5/thumbnails/8.jpg)
3-8Numerical Descriptive Measures
In general, the pth percentile divides a data set into two parts: Approximately p percent of the observations have
values less than the pth percentile; Approximately (100 p ) percent of the
observations have values greater than the pth percentile.
LO 3.2 Calculate and interpret percentiles and a box plot.
3.2 Percentiles and Box Plots
![Page 9: Essentials of Business Statistics: Communicating with Numbers By Sanjiv Jaggia and Alison Kelly Copyright © 2014 by McGraw-Hill Higher Education. All rights](https://reader033.vdocuments.us/reader033/viewer/2022061306/55147355550346494e8b6186/html5/thumbnails/9.jpg)
3-9Numerical Descriptive Measures
Calculating the pth percentile: First arrange the data in ascending order. Locate the position, Lp, of the pth percentile by using
the formula:
We use this position to find the percentile as shown next.
1100p
pL n
LO 3.2 Percentiles and Box Plots
![Page 10: Essentials of Business Statistics: Communicating with Numbers By Sanjiv Jaggia and Alison Kelly Copyright © 2014 by McGraw-Hill Higher Education. All rights](https://reader033.vdocuments.us/reader033/viewer/2022061306/55147355550346494e8b6186/html5/thumbnails/10.jpg)
3-10Numerical Descriptive Measures
Calculating the pth percentile
Once you find Lp, observe whether or not it is an integer. If Lp is an integer, then the Lpth observation in the
sorted data set is the pth percentile. If Lp is not an integer, then interpolate between two
corresponding observations to approximate the pth percentile.
LO 3.2 Percentiles and Box Plots
![Page 11: Essentials of Business Statistics: Communicating with Numbers By Sanjiv Jaggia and Alison Kelly Copyright © 2014 by McGraw-Hill Higher Education. All rights](https://reader033.vdocuments.us/reader033/viewer/2022061306/55147355550346494e8b6186/html5/thumbnails/11.jpg)
3-11Numerical Descriptive Measures
Both L25 = 2.75 and L75 = 8.25 are not integers, thus
The 25th percentile is located 75% of the distance between the second and third observations, and it is
The 75th percentile is located 25% of the distance between the eighth and ninth observations, and it is
7.34 0.75(8.09 ( 7.34)) 4.23
43.79 0.25(59.45 43.79) 47.71
LO 3.2 Percentiles and Box Plots
![Page 12: Essentials of Business Statistics: Communicating with Numbers By Sanjiv Jaggia and Alison Kelly Copyright © 2014 by McGraw-Hill Higher Education. All rights](https://reader033.vdocuments.us/reader033/viewer/2022061306/55147355550346494e8b6186/html5/thumbnails/12.jpg)
3-12
A box plot allows you to: Graphically display the distribution of a data set. Compare two or more distributions. Identify outliers in a data set.
Numerical Descriptive Measures
Box
* *
WhiskersOutliers
LO 3.2 Percentiles and Box Plots
![Page 13: Essentials of Business Statistics: Communicating with Numbers By Sanjiv Jaggia and Alison Kelly Copyright © 2014 by McGraw-Hill Higher Education. All rights](https://reader033.vdocuments.us/reader033/viewer/2022061306/55147355550346494e8b6186/html5/thumbnails/13.jpg)
3-13Numerical Descriptive Measures
The box plot displays 5 summary values: Min = smallest value Max = largest value Q1 = first quartile = 25th percentile Q2 = median = second quartile = 50th percentile Q3 = third quartile = 75th percentile
LO 3.2 Percentiles and Box Plots
Min Max
![Page 14: Essentials of Business Statistics: Communicating with Numbers By Sanjiv Jaggia and Alison Kelly Copyright © 2014 by McGraw-Hill Higher Education. All rights](https://reader033.vdocuments.us/reader033/viewer/2022061306/55147355550346494e8b6186/html5/thumbnails/14.jpg)
3-14
Detecting outliers Calculate IQR, then multiply by 1.5 There are outliers if
Q1 – Smallest Value > 1.5*IQR Largest Value – Q3 > 1.5*IQR
Numerical Descriptive Measures
LO 3.2 Percentiles and Box Plots
![Page 15: Essentials of Business Statistics: Communicating with Numbers By Sanjiv Jaggia and Alison Kelly Copyright © 2014 by McGraw-Hill Higher Education. All rights](https://reader033.vdocuments.us/reader033/viewer/2022061306/55147355550346494e8b6186/html5/thumbnails/15.jpg)
3-15Numerical Descriptive Measures
Measures of dispersion gauge the variability of a data set.
Measures of dispersion include: Range Mean Absolute Deviation (MAD) Variance and Standard Deviation Coefficient of Variation (CV)
LO 3.4 Calculate and interpret the range, the mean absolute deviation, the variance, the standard deviation, and the coefficient of variation.
3.3 Measures of Dispersion
![Page 16: Essentials of Business Statistics: Communicating with Numbers By Sanjiv Jaggia and Alison Kelly Copyright © 2014 by McGraw-Hill Higher Education. All rights](https://reader033.vdocuments.us/reader033/viewer/2022061306/55147355550346494e8b6186/html5/thumbnails/16.jpg)
3-16Numerical Descriptive Measures
Range
It is the simplest measure. It is focused on extreme values.
Range Maximum Value Minimum Value
LO 3.4 3.3 Measures of Dispersion
![Page 17: Essentials of Business Statistics: Communicating with Numbers By Sanjiv Jaggia and Alison Kelly Copyright © 2014 by McGraw-Hill Higher Education. All rights](https://reader033.vdocuments.us/reader033/viewer/2022061306/55147355550346494e8b6186/html5/thumbnails/17.jpg)
3-17Numerical Descriptive Measures
Mean Absolute Deviation (MAD) MAD is an average of the absolute difference of each
observation from the mean.
Sample MAD ix x
n
Population MAD ix
N
LO 3.4 3.3 Measures of Dispersion
![Page 18: Essentials of Business Statistics: Communicating with Numbers By Sanjiv Jaggia and Alison Kelly Copyright © 2014 by McGraw-Hill Higher Education. All rights](https://reader033.vdocuments.us/reader033/viewer/2022061306/55147355550346494e8b6186/html5/thumbnails/18.jpg)
3-18Numerical Descriptive Measures
Variance and standard deviation
For a given sample,
For a given population,
2
2 2 and 1
ix xs s s
n
2
2 2 and ix
N
LO 3.4 3.3 Measures of Dispersion
![Page 19: Essentials of Business Statistics: Communicating with Numbers By Sanjiv Jaggia and Alison Kelly Copyright © 2014 by McGraw-Hill Higher Education. All rights](https://reader033.vdocuments.us/reader033/viewer/2022061306/55147355550346494e8b6186/html5/thumbnails/19.jpg)
3-19Numerical Descriptive Measures
Coefficient of variation (CV) CV adjusts for differences in the magnitudes of the
means. CV is unitless, allowing easy comparisons of mean-
adjusted dispersion across different data sets.
Sample CV
Population CV
s
x
LO 3.4 3.3 Measures of Dispersion
![Page 20: Essentials of Business Statistics: Communicating with Numbers By Sanjiv Jaggia and Alison Kelly Copyright © 2014 by McGraw-Hill Higher Education. All rights](https://reader033.vdocuments.us/reader033/viewer/2022061306/55147355550346494e8b6186/html5/thumbnails/20.jpg)
3-20Numerical Descriptive Measures
Mean-variance analysis: The performance of an asset is measured by its rate of
return. The rate of return may be evaluated in terms of its reward
(mean) and risk (variance). Higher average returns are often associated with higher
risk. The Sharpe ratio uses the mean and variance to
evaluate risk.
LO 3.5 Explain mean-variance analysis and the Sharpe Ratio.
3.4 Mean-Variance Analysis andthe Sharpe ratio
![Page 21: Essentials of Business Statistics: Communicating with Numbers By Sanjiv Jaggia and Alison Kelly Copyright © 2014 by McGraw-Hill Higher Education. All rights](https://reader033.vdocuments.us/reader033/viewer/2022061306/55147355550346494e8b6186/html5/thumbnails/21.jpg)
3-21Numerical Descriptive Measures
Sharpe Ratio Measures the extra reward per unit of risk. For an investment І , the Sharpe ratio is computed as:
where is the mean return for the investment is the mean return for a risk-free asset is the standard deviation for the investment
Sharpe Ratiox R
s
LO 3.4 3.4 Mean-Variance Analysis andthe Sharpe Ratio
![Page 22: Essentials of Business Statistics: Communicating with Numbers By Sanjiv Jaggia and Alison Kelly Copyright © 2014 by McGraw-Hill Higher Education. All rights](https://reader033.vdocuments.us/reader033/viewer/2022061306/55147355550346494e8b6186/html5/thumbnails/22.jpg)
3-22Numerical Descriptive Measures
Chebyshev’s Theorem For any data set, the proportion of observations that lie
within k standard deviations from the mean is at least 11/k2 , where k is any number greater than 1.
Consider a large lecture class with 280 students. The mean score on an exam is 74 with a standard deviation of 8. At least how many students scored within 58 and 90?With k = 2, we have 1(1/2)2 = 0.75. At least 75% of 280 or 210 students scored within 58 and 90.
LO 3.5 Apply Chebyshev’s Theorem and the empirical rule.
3.5 Analysis of Relative Location
![Page 23: Essentials of Business Statistics: Communicating with Numbers By Sanjiv Jaggia and Alison Kelly Copyright © 2014 by McGraw-Hill Higher Education. All rights](https://reader033.vdocuments.us/reader033/viewer/2022061306/55147355550346494e8b6186/html5/thumbnails/23.jpg)
3-23Numerical Descriptive Measures
The Empirical Rule: Approximately 68% of all observations fall in the interval
Approximately 95% of all
observations fall in the interval
Almost all observations fall in the interval
x s
2x s
3x s
LO 3.5 Analysis of Relative Location
![Page 24: Essentials of Business Statistics: Communicating with Numbers By Sanjiv Jaggia and Alison Kelly Copyright © 2014 by McGraw-Hill Higher Education. All rights](https://reader033.vdocuments.us/reader033/viewer/2022061306/55147355550346494e8b6186/html5/thumbnails/24.jpg)
3-24Numerical Descriptive Measures
LO 3.5 Analysis of Relative Location z-Scores
Often useful to use z-score to denote relative location A sample value’s z-score indicates that sample value’s
distance from the mean z-score calculated as
A z-score of 1.25 indicates that that sample value is 1.25 standard deviations above the mean. A z-score of –2.43 would indicate a sample value that is 2.43 standard deviations below the mean.
s
xxz
![Page 25: Essentials of Business Statistics: Communicating with Numbers By Sanjiv Jaggia and Alison Kelly Copyright © 2014 by McGraw-Hill Higher Education. All rights](https://reader033.vdocuments.us/reader033/viewer/2022061306/55147355550346494e8b6186/html5/thumbnails/25.jpg)
3-25Numerical Descriptive Measures
When data are grouped or aggregated, we use these formulas:
where mi and i are the midpoint and frequency of the ith class, respectively.
LO 3.6 Calculate the mean and the variance for grouped data.
2
2
2
Mean:
Variance: 1
Standard Deviation:
i i
i i
mx
n
m xs
n
s s
3.6 Summarizing Grouped Data
![Page 26: Essentials of Business Statistics: Communicating with Numbers By Sanjiv Jaggia and Alison Kelly Copyright © 2014 by McGraw-Hill Higher Education. All rights](https://reader033.vdocuments.us/reader033/viewer/2022061306/55147355550346494e8b6186/html5/thumbnails/26.jpg)
3-26Numerical Descriptive Measures
The covariance (sxy or xy) describes the direction of the linear relationship between two variables, x and y.
The correlation coefficient (rxy or xy) describes both the direction and strength of the relationship between x and y.
LO 3.7 Calculate and interpret the covariance and the correlation coefficient.
3.7 Covariance and Correlation
![Page 27: Essentials of Business Statistics: Communicating with Numbers By Sanjiv Jaggia and Alison Kelly Copyright © 2014 by McGraw-Hill Higher Education. All rights](https://reader033.vdocuments.us/reader033/viewer/2022061306/55147355550346494e8b6186/html5/thumbnails/27.jpg)
3-27Numerical Descriptive Measures
The sample covariance sxy is computed as
1
i ixy
x x y ys
n
The population covariance xy is computed as
i x i y
xy
x y
N
LO 3.7 Covariance and Correlation
![Page 28: Essentials of Business Statistics: Communicating with Numbers By Sanjiv Jaggia and Alison Kelly Copyright © 2014 by McGraw-Hill Higher Education. All rights](https://reader033.vdocuments.us/reader033/viewer/2022061306/55147355550346494e8b6186/html5/thumbnails/28.jpg)
3-28Numerical Descriptive Measures
The sample correlation rxy is computed as
xyxy
x y
sr
s s
The population correlation xy is computed as
Note, 1 < rxy < +1 or 1 < xy < +1
xyxy
x y
LO 3.7 Covariance and Correlation