topic statistics
TRANSCRIPT
-
8/6/2019 Topic Statistics
1/43
Engineering Mathematics 2
STATISTICS
-
8/6/2019 Topic Statistics
2/43
-
8/6/2019 Topic Statistics
3/43
Mathematics for Technology II (CT034-3-2) Statistics Slide 3 (of 59)
Learning Outcomes
A the end of this topic, You should be able to:
Understand the concepts of sampling
theory and Central limit theorem.
Perform statistical inference using
estimation and hypothesis testing.
-
8/6/2019 Topic Statistics
4/43
Mathematics for Technology II (CT034-3-2) Statistics Slide 4 (of 59)
K ey Terms you must be able to use
If you have mastered this topic, you should beable to use the following terms correctly in your assignments and final exam :
-
8/6/2019 Topic Statistics
5/43
Mathematics for Technology II (CT034-3-2) Statistics
Central Limit Theorem
If
1 2, 3
2
1 2 3
2
1
, , ... , ...is a sequence of independent random variables
with ( ) var( ) , 1, 2,3, 4,5,6........ and if ... ,...then under certain general conditions
and variance
n
i i i i
n n
n
i
i
X X X X
E X X i nS X X X X
Q W
Q Q W !
! ! !!
! 21
as n tends to infinity.n
i
i
W !
!
-
8/6/2019 Topic Statistics
6/43
Mathematics for Technology II (CT034-3-2) Statistics
Sampling Theory- Population or universe
An aggregate of objects (a nim ate/ inanim ate)und er stu dy is calle d populat ion or u niv erse. Itis thus a collect ion of individ uals or of the irattr ibutes or of results of operat ions wh ichcan be num er ically spec if ied .
A univ erse co n ta inin g a f ini te num ber of individ uals or m em bers is calle d f ini teuniv erse . For exa m ple , the u niv erse of th ewe ight of the stu den ts in a part icular class.
-
8/6/2019 Topic Statistics
7/43
Mathematics for Technology II (CT034-3-2) Statistics
Sampling Theory
A univ erse of co ncrete objects is a n ex iste n tuniv erse .
The collect ion of all poss ible wa ys in wh ich aspec if ied even t ca n happe n is calle d ahypothet ical u niv erse.
The u niv erse of hea ds a nd ta ils obta ined by toss ing a co in in an in f ini te num ber of t im es.
-
8/6/2019 Topic Statistics
8/43
Mathematics for Technology II (CT034-3-2) Statistics
Sampling Theory
The stat ist ician is ofte n con fro n te d with theproble m of discuss ing univ erse of wh ich he
cann ot exa min e e ver y m em ber i.e. of wh ichcom plete e num erat ion is im pract icable.
For exa m ple if we wa n t to ha ve a n idea of theaverage per cap ital incom e of the people of Indi a ,e num erat ion of e ver y ear nin g individ ualin the cou n tr y is a ver y di ff icult task.
-
8/6/2019 Topic Statistics
9/43
Mathematics for Technology II (CT034-3-2) Statistics
Sampling Theory
N aturall y the quest ion ar ises:What ca n be sa id about a u niv erse of wh ich
we ca n exa min e o n ly a limi te d num ber of m em bers?The quest ion is the or igin of the THEORY OF
SAMPLING
.
-
8/6/2019 Topic Statistics
10/43
Mathematics for Technology II (CT034-3-2) Statistics
Sampling Theory
Sam ple: A f ini te subset is calle d a sa m ple. A sa m ple is thus a s m all port ion of the u niv erse.
Sam ple s ize: The num ber of individ uals in asa m ple is calle d the sa m ple s ize.
Sam pling : The process of select ing a sa m plefro m a u niv erse is calle d a sa m pling.
-
8/6/2019 Topic Statistics
11/43
Mathematics for Technology II (CT034-3-2) Statistics
Sampling Theory
The theor y of sa m pling is a stu dy of relat ionsh ip ex ist ing betwee n a populat ion
and sa m ples d raw n fro m the populat ion .The fu nd am en tal object of sa m pling is to getthe in for m at ion as poss ible of the wholeuniv erse b y exa minin g on ly a part of it.An atte m pt of is thus m ade through sa m plingto g ive the m axim um infor m at ion about thepare n t u niv erse w ith the minim um effort .
-
8/6/2019 Topic Statistics
12/43
Mathematics for Technology II (CT034-3-2) Statistics
Sampling Theory
Sam pling is qu ite ofte n use d in our da ily life.
For exa m ple , in a shop we asses the qual ity of sugar , r ice or a ny other pro duct o n ly by tak ing it fro m the bag a nd the n dec idewhether to purchase it or not .
-
8/6/2019 Topic Statistics
13/43
Mathematics for Technology II (CT034-3-2) Statistics
Sampling Theory
A house w ife nor m ally tests the cooke d pro ducts to f ind if the y are properl y cooke d
and con ta in the proper qual ity of salt or sugar,by tak ing a spoo n ful of it.
-
8/6/2019 Topic Statistics
14/43
Mathematics for Technology II (CT034-3-2) Statistics
Random Sampling
The select ion of a n individ ual fro m theuniv erse in such a wa y that each individ ual of
the u niv erse has the sa m e cha nce of be ingselecte d is calle d R andom SamplingA sa m ple obta ined by the ra nd om sa m pling iscalle d a ra nd om sa m pleThe s im plest m etho d wh ich is nor m ally use d for ra nd om sa m pling is the lotter y syste m
-
8/6/2019 Topic Statistics
15/43
Mathematics for Technology II (CT034-3-2) Statistics
Random Sampling
Sam pling of attr ibutes: The sa m pling of attr ibutes m ay be regar ded as the d raw ing of
sa m ples fro m a u niv erse whose num berspossess the attr ibutes A or B
The u niv erse is thus divid ed in to two m utuall y exclus ive a nd collect ively exhaust ive classes-one class possess ing the attr ibute A and theother class not possess ing the attr ibute A
-
8/6/2019 Topic Statistics
16/43
Mathematics for Technology II (CT034-3-2) Statistics
Random Sampling
The prese nce of a part icular attr ibute in sa m ple d uni t m ay be ter m ed as success a nd
its abse nce is fa ilure.
-
8/6/2019 Topic Statistics
17/43
Mathematics for Technology II (CT034-3-2) Statistics
Random Sampling
By sim ple sa m pling we m ea n ra nd om sa m pling in wh ich each e ven t has the sa m e
probab ility of success a nd the probab ility of even t is ind epe nd en t of the success or fa ilureof e ven ts in the prece din g tr ials.Thus the s im ple sa m pling is a spec ial case of ra nd om sa m pling in wh ich the tr ials areind epe nd en t a nd probab ility of success isconsta n t.
-
8/6/2019 Topic Statistics
18/43
Mathematics for Technology II (CT034-3-2) Statistics
Random Sampling
For exa m ple , cou n t ing the num ber of successes in the throw ing of a dice or toss ing
of a co in is a case of s im ple sa m pling since theprobab ility of gett ing hea ds w ith the a co in isunaffecte d by the pre vious tr ials a nd re m a insconsta n t irrespect ive of the num ber of tr ialsm ade pro vid ed the co in re m a ins u nb iase d .
-
8/6/2019 Topic Statistics
19/43
Mathematics for Technology II (CT034-3-2) Statistics
Random Sampling
Sam pling , through ra nd om , nee d not besim ple.
Rand om sa m pling fro m an in f ini te u niv ersealwa ys s im ple but ra nd om sa m pling fro m af ini te u niv erse m ay or m ay not be s im pleaccor din g as the m em bers d raw n are replace d or not.
-
8/6/2019 Topic Statistics
20/43
Mathematics for Technology II (CT034-3-2) Statistics
Random Sampling
The co ndi t ions of s im ple sa m pling viz,consta n t probab ility p, a nd ind epe nd en t
even ts sat isf y the bas ic assu m pt ions of theBinomi al Distr ibut ion .
The b inomi al probab ility distr ibut ion thusdeter min ed is calle d the sa m pling distr ibut ion of the num ber of successes in the sa m ple.
-
8/6/2019 Topic Statistics
21/43
Mathematics for Technology II (CT034-3-2) Statistics
Students t test
G osset , who wrote u nd er the pe n nam e of stu den t , der ived a theoret ical distr ibut ion wh ich has co m e to be k now n asStu den t s t distr ibut ion .
The qua n t ity t is def ined as
n = number of the obser vat ions in the sa m ple
Where is the sa mple m ea n i s the populat ion m ea n.S is the sa m ple sta nd ar d deviat ion .
xt
sn
Q!
x
Q
-
8/6/2019 Topic Statistics
22/43
Mathematics for Technology II (CT034-3-2) Statistics
Application of Students t test
To test the s igni f icance of a sa m ple m ea n
To test the s igni f icance of the di ffere ncebetwee n two sa m ple m ea ns
To test the s igni f icance of the coeff icien t of correlat ion .
-
8/6/2019 Topic Statistics
23/43
Mathematics for Technology II (CT034-3-2) Statistics
SNEDECORS F-test-For equality of Population Variances
Let
Let be the sa m ple m ea ns.
Let
Then
( 1, 2,... ) ( 1, 2, ... )
var .i j x i n and y j n be the values of two independent random samples
drawn from the normal populations withthe same iance
! !
xand y
1 222
2 2
1 11 2
1 1&
1 1
n n
x i y ji i
S x x S y yn n
! !
! !
22 2
2
22 2
2
x x y
y
y y x
x
S F if S S
S
or
S F if S S
S
! "
! "
-
8/6/2019 Topic Statistics
24/43
Mathematics for Technology II (CT034-3-2) Statistics
SNEDECORS F-test-For equalityof Population Variances
Tak ing the h ypothes is that the two sa m ples ha ved raw n fro m nor m al populat ions w ith the sa m evar iance , we co m pare the calculate d value F w ith itstable value.If the calculate d value of F > the table value of F w ith5% level of s ignif icance , the rat io is s igni f ican t a nd the h ypothes is m ay be rejecte d If the calculate d value of F < the table value of F w ith5% level of s ignif icance ,the h ypothes is m ay berejecte d
-
8/6/2019 Topic Statistics
25/43
Mathematics for Technology II (CT034-3-2) Statistics
Chi Square Test
Whe n a co in is tosse d 200 t im es , the theoret icalconsiderat ions lea d us to expect 100 hea ds a nd 100 ta ils.But in pract ice the results are rarel y ach ieved .
The qua n t ity a G reek letter , pro nou nce d as CHI SQUAREdescr ibes the m agni tu de of discrepa ncy betwee n the theor y and obser vat ion .Then
2 G
2
2
1
( 1 , 2 ,... )
( 1 , 2 ,... )
ni i
i i
i
i
O
O i n Ob seved frequency
E i n Expected frequency
G !
! ! !! !
-
8/6/2019 Topic Statistics
26/43
Mathematics for Technology II (CT034-3-2) Statistics
Chi Square Test
Degrees of free dom = n -1Chi square test is o ne of the s im plest a nd
m ost ge neral test k now n .It is appl icable to a ver y large num ber of proble m s in pract ice wh ich ca n be su mm ed up
und er the follow ing hea ds:Chi square test of goo dn ess of f itChi square test of ind epe nd ence of attr ibutes.
-
8/6/2019 Topic Statistics
27/43
Mathematics for Technology II (CT034-3-2) Statistics
Chi Square Test
Condi t ions for appl yin g ch i square test:Follow ing are the co ndi t ions wh ich shoul d be
sat isf ied before Ch i square test is appl ied .N , the total num ber of freque ncies shoul d belarge.
It is di ff icult to sa y what co nst itutes large ness,but as a n arb itrar y f igure , we m ay sa y that N shoul d be at least 50, howe ver , few the cells.
-
8/6/2019 Topic Statistics
28/43
Mathematics for Technology II (CT034-3-2) Statistics
Chi Square Test
N o theoret ical cell freque ncy shoul d be s m all.
5 shoul d be regar ded as the ver y minim um and 10 is better.
It is im porta n t to re m em ber that the num berof degrees of free dom is deter min ed with thenum ber of classes after regroup ing
-
8/6/2019 Topic Statistics
29/43
Mathematics for Technology II (CT034-3-2) Statistics
R egression
Regress ion is the est im at ion or pre dict ion of unknow n values of o ne var iable fro m know n
values of a nother var iable.After establ ish ing the fact of correlat ion betwee n two var iables , it is natural cur ios ity to k now the exte n t to wh ich o ne var iablevar ies in respo nse to a g iven var iat ion in another var iable
-
8/6/2019 Topic Statistics
30/43
Mathematics for Technology II (CT034-3-2) Statistics
R egression
To know about the nature of relat ionsh ipbetwee n the two var iables.
R egression measures the nature and extentof correlation.
-
8/6/2019 Topic Statistics
31/43
Mathematics for Technology II (CT034-3-2) Statistics
Linear R egression
If two var iables x a nd y are correlate d i.e.there ex ists a n assoc iat ion or relat ionsh ip
betwee n the m , the n the scatter diagra m willbe m ore or less co nce n trate d rou nd a cur ve.
This cur ve is calle d the cur ve of regress ion .The relat ion sh ip is sa id to be expresse d by m ea ns of cur vilinear regress ion .
-
8/6/2019 Topic Statistics
32/43
Mathematics for Technology II (CT034-3-2) Statistics
Linear R egression
Whe n the cur ve is a stra ight l ine , it is calle d aline of regress ion and the regress ion is sa id to
be l inearA line of regression is the straight line whichgives the best fit in the least square sense tothe given frequency.
-
8/6/2019 Topic Statistics
33/43
Mathematics for Technology II (CT034-3-2) Statistics
Regression Lines
If the l ine of regress ion is so chose n that thesu m of squares of deviat ion parallel to the ax is
of y is minimiz ed , it is calle d the l ine of regress ion of y on x
It gives the best est im ate of y for a ny given values of x
-
8/6/2019 Topic Statistics
34/43
Mathematics for Technology II (CT034-3-2) Statistics
Regression Lines
If the l ine of regress ion is so chose n that thesu m of squares of deviat ions parallel to the
axis of x is minimiz ed , it is calle d the l ine of regress ion of x o n y
It gives the best est im ate of x for a ny given values of y
-
8/6/2019 Topic Statistics
35/43
Mathematics for Technology II (CT034-3-2) Statistics
Lines of Regression
Y on X
X on Y
( ) y yx x
y y b r x xW
W
! !
( ) x xy y
x x r y yW
W
! !
-
8/6/2019 Topic Statistics
36/43
Mathematics for Technology II (CT034-3-2) Statistics
Regression Lines
If r = 0 , the two l ines of regress ion beco m es
The two stra ight l ines are parallel to X a nd Y axes respect ively and pass ing through the m ea ns
They are m utuall y perpe ndi cular
If , the two l ines of regress ion will coincide.
, y y x x
! !
1r ! s
-
8/6/2019 Topic Statistics
37/43
Mathematics for Technology II (CT034-3-2) Statistics
Properties of Regression coefficients
Correlat ion coeff icien ts is the geo m etr ic m ea n betwee n the regress ion coeff icien ts.
If one of the regress ion coeff icien ts is greatertha n uni ty, the other m ust be less tha n uni ty.
Arith m et ic m ea n of regress ion coeff icien ts isgreater tha n the correlat ion coeff icien t.
-
8/6/2019 Topic Statistics
38/43
Mathematics for Technology II (CT034-3-2) Statistics
Properties of Regression coefficients
Regress ion coeff icien ts are ind epe nd en t of cha nge of or igin but not of scale.
The correlat ion coeff icien t a nd the tworegress ion coeff icien ts ha ve the sa m e s ign
-
8/6/2019 Topic Statistics
39/43
Mathematics for Technology II (CT034-3-2) Statistics
A ngle between two R egression lines
Angle betwee n two regress ion lines
2
2 2
1an x y
x y
r
r
W W U
W W !
-
8/6/2019 Topic Statistics
40/43
Mathematics for Technology II (CT034-3-2) Statistics Slide 56 (of 59)
Q uick R eview Q uestion
?
-
8/6/2019 Topic Statistics
41/43
Mathematics for Technology II (CT034-3-2) Statistics Slide 57 (of 59)
T opic OutlineCen tral Limit theore mTest ing of h ypothes isCorrelat ionRegress ion
S ummary of Main Teaching Points
-
8/6/2019 Topic Statistics
42/43
Mathematics for Technology II (CT034-3-2) Statistics Slide 58 (of 59)
Q & A
Q uestion and Answer S ession
-
8/6/2019 Topic Statistics
43/43
Mathematics for Technology II (CT034-3-2) Statistics Slide 59 (of 59)
Matrices
and Determin
ants
N ext S ession