![Page 1: 2015-7-3 xlpeng 1 Ratio Estimation and Regression Estimation (Chapter 4, Textbook, Barnett, V., 1991) 2.1 Estimation of a population ratio:](https://reader038.vdocuments.us/reader038/viewer/2022102622/56649d755503460f94a55d2f/html5/thumbnails/1.jpg)
23/4/19 www.uic.edu.hk/~xlpeng 1
Ratio Estimation and Regression Estimation(Chapter 4, Textbook, Barnett, V., 1991)
2.1 Estimation of a population ratio: The ratio estimator In some situations it is useful to estimate a (positive) ratio of two
population characteristics: the totals, or means, of two (positive) variables X and Y.
![Page 2: 2015-7-3 xlpeng 1 Ratio Estimation and Regression Estimation (Chapter 4, Textbook, Barnett, V., 1991) 2.1 Estimation of a population ratio:](https://reader038.vdocuments.us/reader038/viewer/2022102622/56649d755503460f94a55d2f/html5/thumbnails/2.jpg)
The sample average of ratio
unbiased for estimating the population mean
Two obvious estimators of R are
The ratio of the sample averages
is widely used.
23/4/19 www.uic.edu.hk/~xlpeng 2
1 1
1 1( / )
n n
i i ii i
r y x rn n
/ /T Tr y x y x
1 1
1 1( / )
N N
j j jj j
R R Y XN N
but biased for estimating R
![Page 3: 2015-7-3 xlpeng 1 Ratio Estimation and Regression Estimation (Chapter 4, Textbook, Barnett, V., 1991) 2.1 Estimation of a population ratio:](https://reader038.vdocuments.us/reader038/viewer/2022102622/56649d755503460f94a55d2f/html5/thumbnails/3.jpg)
The bias in estimating R by r
The bias in estimating R by r is the expectation of the following difference:
(2.3)
23/4/19 www.uic.edu.hk/~xlpeng 3
( ) /r R y Rx x 1
1y Rx x X
X X
2
1 .y Rx x X x X
X X X
2
[( )( )]( )
y Rx E y Rx x XE r R E
X X
![Page 4: 2015-7-3 xlpeng 1 Ratio Estimation and Regression Estimation (Chapter 4, Textbook, Barnett, V., 1991) 2.1 Estimation of a population ratio:](https://reader038.vdocuments.us/reader038/viewer/2022102622/56649d755503460f94a55d2f/html5/thumbnails/4.jpg)
Discussion about the bias
23/4/19 www.uic.edu.hk/~xlpeng 4
≈
![Page 5: 2015-7-3 xlpeng 1 Ratio Estimation and Regression Estimation (Chapter 4, Textbook, Barnett, V., 1991) 2.1 Estimation of a population ratio:](https://reader038.vdocuments.us/reader038/viewer/2022102622/56649d755503460f94a55d2f/html5/thumbnails/5.jpg)
23/4/19 www.uic.edu.hk/~xlpeng 5
(2.5)
2
21
2 2 22
( )1
1
12
Nj j
j
Y YX X
Y RXf
nX N
fS RS R S
nX
( ) ( )j j j j jZ Y RX Y Y RX RX
![Page 6: 2015-7-3 xlpeng 1 Ratio Estimation and Regression Estimation (Chapter 4, Textbook, Barnett, V., 1991) 2.1 Estimation of a population ratio:](https://reader038.vdocuments.us/reader038/viewer/2022102622/56649d755503460f94a55d2f/html5/thumbnails/6.jpg)
23/4/19 www.uic.edu.hk/~xlpeng 6
A (slightly) biased estimate of the true variance parameter ( )V r
2 2
2 21
( ) (1 )1ˆ( )1
ni i r
i
y rx f sfV r
nx n nx
For large n, an approximate 100 (1-α) % (symmetric) two-sided confidence interval for the population ratio R is:
)(ˆ)(ˆ rVzrRrVzr
And the required sample size is
2 2 2/V d x z
2 2 / 4d x
![Page 7: 2015-7-3 xlpeng 1 Ratio Estimation and Regression Estimation (Chapter 4, Textbook, Barnett, V., 1991) 2.1 Estimation of a population ratio:](https://reader038.vdocuments.us/reader038/viewer/2022102622/56649d755503460f94a55d2f/html5/thumbnails/7.jpg)
2.2 Ratio estimation of a population mean or total
23/4/19 www.uic.edu.hk/~xlpeng 7
( / )Ry rX X x y
( / )TR T Ry rX NX x y Ny
![Page 8: 2015-7-3 xlpeng 1 Ratio Estimation and Regression Estimation (Chapter 4, Textbook, Barnett, V., 1991) 2.1 Estimation of a population ratio:](https://reader038.vdocuments.us/reader038/viewer/2022102622/56649d755503460f94a55d2f/html5/thumbnails/8.jpg)
Variance of ratio estimator
23/4/19 www.uic.edu.hk/~xlpeng 8
![Page 9: 2015-7-3 xlpeng 1 Ratio Estimation and Regression Estimation (Chapter 4, Textbook, Barnett, V., 1991) 2.1 Estimation of a population ratio:](https://reader038.vdocuments.us/reader038/viewer/2022102622/56649d755503460f94a55d2f/html5/thumbnails/9.jpg)
23/4/19 www.uic.edu.hk/~xlpeng 9
![Page 10: 2015-7-3 xlpeng 1 Ratio Estimation and Regression Estimation (Chapter 4, Textbook, Barnett, V., 1991) 2.1 Estimation of a population ratio:](https://reader038.vdocuments.us/reader038/viewer/2022102622/56649d755503460f94a55d2f/html5/thumbnails/10.jpg)
Example: (Food additive)
A researcher was investigating a new food additive for cattle. Midway through the two-month study, he was interested in estimating the average weight for the entire herd of N = 500 steers. A simple random sample of n = 12 steers was selected from the herd and weighed. These data and prestudy weights are presented in the accompanying table for all cattle sampled. Assume the prestudy average = 880 pounds.
Estimate the ratio of present weight to prestudy weight of the herd, and provide an estimate of the standard error for your answer.
Which points have greatest influence on the estimate?
23/4/19 www.uic.edu.hk/~xlpeng 10
![Page 11: 2015-7-3 xlpeng 1 Ratio Estimation and Regression Estimation (Chapter 4, Textbook, Barnett, V., 1991) 2.1 Estimation of a population ratio:](https://reader038.vdocuments.us/reader038/viewer/2022102622/56649d755503460f94a55d2f/html5/thumbnails/11.jpg)
23/4/19 www.uic.edu.hk/~xlpeng 11
![Page 12: 2015-7-3 xlpeng 1 Ratio Estimation and Regression Estimation (Chapter 4, Textbook, Barnett, V., 1991) 2.1 Estimation of a population ratio:](https://reader038.vdocuments.us/reader038/viewer/2022102622/56649d755503460f94a55d2f/html5/thumbnails/12.jpg)
23/4/19 www.uic.edu.hk/~xlpeng 12
Solution:
![Page 13: 2015-7-3 xlpeng 1 Ratio Estimation and Regression Estimation (Chapter 4, Textbook, Barnett, V., 1991) 2.1 Estimation of a population ratio:](https://reader038.vdocuments.us/reader038/viewer/2022102622/56649d755503460f94a55d2f/html5/thumbnails/13.jpg)
23/4/19 www.uic.edu.hk/~xlpeng 13
The estimate of the ratio R of the present weight to prestudy weight for the herd is:
Solution:
000929.012
646.848,8)
500
121(
880
11)(
22
2
rSXn
frVar
030485.0000929.0)( rse
![Page 14: 2015-7-3 xlpeng 1 Ratio Estimation and Regression Estimation (Chapter 4, Textbook, Barnett, V., 1991) 2.1 Estimation of a population ratio:](https://reader038.vdocuments.us/reader038/viewer/2022102622/56649d755503460f94a55d2f/html5/thumbnails/14.jpg)
Sample size determination
23/4/19 www.uic.edu.hk/~xlpeng 14
Hence we should sample 94 steers to estimate R, the change in weight of herd after the study with error bound of 1%.
![Page 15: 2015-7-3 xlpeng 1 Ratio Estimation and Regression Estimation (Chapter 4, Textbook, Barnett, V., 1991) 2.1 Estimation of a population ratio:](https://reader038.vdocuments.us/reader038/viewer/2022102622/56649d755503460f94a55d2f/html5/thumbnails/15.jpg)
Example: (Sugar content)
23/4/19 www.uic.edu.hk/~xlpeng 15
In a study to estimate the total sugar content of a truckload of oranges, a SRS of n = 10 oranges was juiced and weighted. The total weight of all the oranges, obtained by first weighing the truck loaded and then unloaded, was found to be 1800 pounds. Estimate Y , the total sugar content for the oranges and provide the standard error of the estimate.
![Page 16: 2015-7-3 xlpeng 1 Ratio Estimation and Regression Estimation (Chapter 4, Textbook, Barnett, V., 1991) 2.1 Estimation of a population ratio:](https://reader038.vdocuments.us/reader038/viewer/2022102622/56649d755503460f94a55d2f/html5/thumbnails/16.jpg)
23/4/19 www.uic.edu.hk/~xlpeng 16
Solution:
The scatter plot shows a strong positive association between sugar content and weight, making the ratio estimator a reasonable choice.
An estimate of Y is
![Page 17: 2015-7-3 xlpeng 1 Ratio Estimation and Regression Estimation (Chapter 4, Textbook, Barnett, V., 1991) 2.1 Estimation of a population ratio:](https://reader038.vdocuments.us/reader038/viewer/2022102622/56649d755503460f94a55d2f/html5/thumbnails/17.jpg)
23/4/19 www.uic.edu.hk/~xlpeng 17
Solution:
![Page 18: 2015-7-3 xlpeng 1 Ratio Estimation and Regression Estimation (Chapter 4, Textbook, Barnett, V., 1991) 2.1 Estimation of a population ratio:](https://reader038.vdocuments.us/reader038/viewer/2022102622/56649d755503460f94a55d2f/html5/thumbnails/18.jpg)
Example: (Promotional campaign)
23/4/19 www.uic.edu.hk/~xlpeng 18
An advertising firm is concerned about the effect of a new regional promotional campaign on the total dollar sales for a particular product. A SRS of n = 20
stores is drawn from the N = 452 regional stores in which the product is sold. Quarterly sales data are obtained for the current three-month period and the three-month period prior to the new campaign. The pre-campaign sales for all stores X = 260, 256. Check the scatter plot to see if these stores are in two different size groups.
![Page 19: 2015-7-3 xlpeng 1 Ratio Estimation and Regression Estimation (Chapter 4, Textbook, Barnett, V., 1991) 2.1 Estimation of a population ratio:](https://reader038.vdocuments.us/reader038/viewer/2022102622/56649d755503460f94a55d2f/html5/thumbnails/19.jpg)
23/4/19 www.uic.edu.hk/~xlpeng 19
Example: (Promotional campaign)
![Page 20: 2015-7-3 xlpeng 1 Ratio Estimation and Regression Estimation (Chapter 4, Textbook, Barnett, V., 1991) 2.1 Estimation of a population ratio:](https://reader038.vdocuments.us/reader038/viewer/2022102622/56649d755503460f94a55d2f/html5/thumbnails/20.jpg)
23/4/19 www.uic.edu.hk/~xlpeng 20
Solution:
(a) Without using the auxiliary information, the estimate of the average current three-month sales using ordinary estimator is
![Page 21: 2015-7-3 xlpeng 1 Ratio Estimation and Regression Estimation (Chapter 4, Textbook, Barnett, V., 1991) 2.1 Estimation of a population ratio:](https://reader038.vdocuments.us/reader038/viewer/2022102622/56649d755503460f94a55d2f/html5/thumbnails/21.jpg)
23/4/19 www.uic.edu.hk/~xlpeng 21
Solution:
(b) When the total pre-campaign three-month sales is known to be X = 260256, the average pre-campaign three-month sales is
Then the estimate of the average current three-month sales using ratio estimator is
which represent an average increase of 7.1% of the current three-month sales from the pre-campaign three-month sales.
![Page 22: 2015-7-3 xlpeng 1 Ratio Estimation and Regression Estimation (Chapter 4, Textbook, Barnett, V., 1991) 2.1 Estimation of a population ratio:](https://reader038.vdocuments.us/reader038/viewer/2022102622/56649d755503460f94a55d2f/html5/thumbnails/22.jpg)
Solution:
23/4/19 www.uic.edu.hk/~xlpeng 22
The ratio estimator here is much better than ordinary estimator since the current three-month sales yi is closely and positively related to the pre-campaign three-month sales xi with correlation coefficient 0.9986.
![Page 23: 2015-7-3 xlpeng 1 Ratio Estimation and Regression Estimation (Chapter 4, Textbook, Barnett, V., 1991) 2.1 Estimation of a population ratio:](https://reader038.vdocuments.us/reader038/viewer/2022102622/56649d755503460f94a55d2f/html5/thumbnails/23.jpg)
23/4/19 www.uic.edu.hk/~xlpeng 23
![Page 24: 2015-7-3 xlpeng 1 Ratio Estimation and Regression Estimation (Chapter 4, Textbook, Barnett, V., 1991) 2.1 Estimation of a population ratio:](https://reader038.vdocuments.us/reader038/viewer/2022102622/56649d755503460f94a55d2f/html5/thumbnails/24.jpg)
23/4/19 www.uic.edu.hk/~xlpeng 24
This examines when the variance of (2.10) could be less or greater than that of (1.9)
![Page 25: 2015-7-3 xlpeng 1 Ratio Estimation and Regression Estimation (Chapter 4, Textbook, Barnett, V., 1991) 2.1 Estimation of a population ratio:](https://reader038.vdocuments.us/reader038/viewer/2022102622/56649d755503460f94a55d2f/html5/thumbnails/25.jpg)
23/4/19 www.uic.edu.hk/~xlpeng 25
![Page 26: 2015-7-3 xlpeng 1 Ratio Estimation and Regression Estimation (Chapter 4, Textbook, Barnett, V., 1991) 2.1 Estimation of a population ratio:](https://reader038.vdocuments.us/reader038/viewer/2022102622/56649d755503460f94a55d2f/html5/thumbnails/26.jpg)
2.3 Regression estimation
Condition (2.15.1) demands that X and Y be linearly related, but, if the linear relationship does not pass through the origin, then, it suggests considering an alternative estimator known as regression estimator.
23/4/19 www.uic.edu.hk/~xlpeng 26
![Page 27: 2015-7-3 xlpeng 1 Ratio Estimation and Regression Estimation (Chapter 4, Textbook, Barnett, V., 1991) 2.1 Estimation of a population ratio:](https://reader038.vdocuments.us/reader038/viewer/2022102622/56649d755503460f94a55d2f/html5/thumbnails/27.jpg)
2.3 Regression estimation
23/4/19 www.uic.edu.hk/~xlpeng 27
A practicable simple linear regression model is (2.17)
.
An ideal (perfect) linear relationship is
(2.16)
)( jj XXbYY
(2.18)
jjj EXXbYY )(
![Page 28: 2015-7-3 xlpeng 1 Ratio Estimation and Regression Estimation (Chapter 4, Textbook, Barnett, V., 1991) 2.1 Estimation of a population ratio:](https://reader038.vdocuments.us/reader038/viewer/2022102622/56649d755503460f94a55d2f/html5/thumbnails/28.jpg)
2.3 Regression estimation
23/4/19 www.uic.edu.hk/~xlpeng 28
Consider the average (mean) of either (2.16) or (2.17),
( )Ly y b X x (2.19)
![Page 29: 2015-7-3 xlpeng 1 Ratio Estimation and Regression Estimation (Chapter 4, Textbook, Barnett, V., 1991) 2.1 Estimation of a population ratio:](https://reader038.vdocuments.us/reader038/viewer/2022102622/56649d755503460f94a55d2f/html5/thumbnails/29.jpg)
2.3 Regression estimation
23/4/19 www.uic.edu.hk/~xlpeng 29
2( ) [( ) ]L LVar y E y Y 2
2 2 2
2 2
{[( ) ( )] }
1( 2 )
1(1 )
L
Y YX X
Y YX
E y Y b x X
fS bS b S
nfS
n
21( )Y
fS Var y
n
(2.20)
y
![Page 30: 2015-7-3 xlpeng 1 Ratio Estimation and Regression Estimation (Chapter 4, Textbook, Barnett, V., 1991) 2.1 Estimation of a population ratio:](https://reader038.vdocuments.us/reader038/viewer/2022102622/56649d755503460f94a55d2f/html5/thumbnails/30.jpg)
2.3 Regression estimation
23/4/19 www.uic.edu.hk/~xlpeng 30
(2.21) 2 2 21ˆ( ) ( 2 )L y yx x
fV y s bs b s
n
![Page 31: 2015-7-3 xlpeng 1 Ratio Estimation and Regression Estimation (Chapter 4, Textbook, Barnett, V., 1991) 2.1 Estimation of a population ratio:](https://reader038.vdocuments.us/reader038/viewer/2022102622/56649d755503460f94a55d2f/html5/thumbnails/31.jpg)
2.3 Regression estimation
23/4/19 www.uic.edu.hk/~xlpeng 31
From (2.20),
2 2 21min { ( )} min ( 2 )b L b Y YX X
fVar y S bS b S
n
2 21(1 )Y YX
fS
n
The minimum is obtained with 2min / /YX X YX Y Xb b S S S S
Y
Thus the most efficient regression estimator of is
( / )( )L YX Y Xy y S S X x
![Page 32: 2015-7-3 xlpeng 1 Ratio Estimation and Regression Estimation (Chapter 4, Textbook, Barnett, V., 1991) 2.1 Estimation of a population ratio:](https://reader038.vdocuments.us/reader038/viewer/2022102622/56649d755503460f94a55d2f/html5/thumbnails/32.jpg)
2.3 Regression estimation
23/4/19 www.uic.edu.hk/~xlpeng 32
The optimal value of b of (2.22) suggests the obvious estimate:
1min 2 2
1
( )( )( )
( )
n
i iyx in
x ii
y y x xsb b
s x x
(2.24)
( )Ly y b X x (2.25)
which enjoys the following asymptotic properties:
1( ) ( )LE y Y O n
![Page 33: 2015-7-3 xlpeng 1 Ratio Estimation and Regression Estimation (Chapter 4, Textbook, Barnett, V., 1991) 2.1 Estimation of a population ratio:](https://reader038.vdocuments.us/reader038/viewer/2022102622/56649d755503460f94a55d2f/html5/thumbnails/33.jpg)
2.3 Regression estimation
23/4/19 www.uic.edu.hk/~xlpeng 33
Asymptotic properties:
( )LVar y 2 2 2 3/21( / ) ( )Y YX X
fS S S O n
n
21( ) ( )L y yx
fV y s bs
n
(2.27)
![Page 34: 2015-7-3 xlpeng 1 Ratio Estimation and Regression Estimation (Chapter 4, Textbook, Barnett, V., 1991) 2.1 Estimation of a population ratio:](https://reader038.vdocuments.us/reader038/viewer/2022102622/56649d755503460f94a55d2f/html5/thumbnails/34.jpg)
2.4 Comparison of ratio and regression estimators
23/4/19 www.uic.edu.hk/~xlpeng 34
![Page 35: 2015-7-3 xlpeng 1 Ratio Estimation and Regression Estimation (Chapter 4, Textbook, Barnett, V., 1991) 2.1 Estimation of a population ratio:](https://reader038.vdocuments.us/reader038/viewer/2022102622/56649d755503460f94a55d2f/html5/thumbnails/35.jpg)
23/4/19 www.uic.edu.hk/~xlpeng 35
2.4 Comparison of ratio and regression estimators
2 2 2 21( ) ( ) 2R L X YX Y X YX Y
fV y Var y R S R S S S
n
21X YX Y
fRS S
n