statistics slides

Post on 19-Oct-2014

5.603 Views

Category:

Documents

2 Downloads

Preview:

Click to see full reader

DESCRIPTION

 

TRANSCRIPT

Ritesh Singhal

WELCOME

To all PGDM Students

from

Ritesh Singhal{M.Sc.(Maths), MIT, M.Phil.}

Ritesh Singhal

Statistics The systematic and scientific treatment of

quantitative measurement is precisely known as statistics.

Statistics may be called as science of counting.

Statistics is concerned with the collection, classification (or organization), presentation and analysis of data which are measurable in numerical terms.

Ritesh Singhal

Stages of Statistical Investigation

Collection of Data

Organization of data

Presentation of data

Analysis

Interpretation of Results

Ritesh Singhal

Statistics It is divided into two major parts:

Descriptive and Inferential Statistics. Descriptive statistics, is a set of methods

to describe data that we have collected. i.e. summarization of data.

Inferential statistics, is a set of methods used to make a generalization, estimate, prediction or decision. When we want to draw conclusions about a distribution.

Ritesh Singhal

Collection of Data

Data can be collected by two ways:>>> Primary Data CollectionIt is the data collected by a particular person

or organization for his own use.

>>> Secondary Data CollectionIt is the data collected by some other person

or organization, but the investigator also get it for his use.

Ritesh Singhal

Methods of Primary data collection

Direct personal interview Data through questionnaire Indirect investigationEtc.

Ritesh Singhal

Methods of Secondary data collection

Data collected through newspapers & periodicals.

Data collected from research papers. Data collected from government

officials. Data collected from various NGO, UN,

UNESCO, WHO, ILO, UNICEF etc. Other published resources

Ritesh Singhal

Classification of data Classification is a process of arranging

data into sequences and groups according to their common characteristics or separating them into different but related parts.

It is a process of arranging data into various homogeneous classes and subclasses according to some common characteristics.

Ritesh Singhal

Presentation of Data Data should be presented in such a

manner, so that it may be easily understood and grasped, and the conclusion may be drawn promptly from the data presented. e.g.

>>> Histogram>>> Frequency polygon & curve>>> Pie Chart>>> Ogives>>> Pictogram & Cartogram>>> Bar Chart

Ritesh Singhal

Variables Discrete Variablee.g. No. of books, table, chairs Continuous Variablee.g. Height, Weight Quantitative VariableThat can be measured on a scale Qualitative VariableThat can not be measured on a scale

Ritesh Singhal

Frequency Distribution

The observations can be recorded by three ways:

1. Individual SeriesData recorded for individual member.2. Discrete SeriesThis variable can assume values after an

interval (or jumps).3. Continuous SeriesHere the variable may be having any value,

integer or fraction.

Ritesh Singhal

Statistics functions & Uses It simplifies complex data It provides techniques for comparison It studies relationship It helps in formulating policies It helps in forecasting It is helpful for common man Statistical methods merges with speed of

computer can make wonders; SPSS, STATA MATLAB, MINITAB etc.

Ritesh Singhal

Scope of Statistics

In Business Decision Making In Medical Sciences In Actuarial Science In Economic Planning In Agricultural Sciences In Banking & Insurance In Politics & Social Science

Ritesh Singhal

Distrust & Misuse of Statistics

Statistics is like a clay of which one can make a God or Devil.

Statistics are the liers of first order. Statistics can prove or disprove

anything.

Ritesh Singhal

Measure of Central Tendency

It is a single value represent the entire mass of data. Generally, these are the central part of the distribution.

It facilitates comparison & decision-making

There are mainly three type of measure1. Arithmetic mean2. Median3. Mode

Ritesh Singhal

Arithmetic Mean

This single representative value can be determined by:

A.M. =Sum/No. of observationsProperties:1. The sum of the deviations from AM is

always zero.2. If every value of the variable increased or

decreased by a constant then new AM will also change in same ratio.

Ritesh Singhal

Arithmetic Mean (contd..)

3. If every value of the variable multiplied or divide by a constant then new AM will also change in same ratio.

4. The sum of squares of deviations from AM is minimum.

5. The combined AM of two or more related group is defined as

Ritesh Singhal

Median

The median is that value of the variable which divides the group into two equal parts, one part comprising all values greater, and the other part having lesser value than median.

Determination of Median>>> Arrange the data first>>> Find the size of (N+1)/2 th item.

Ritesh Singhal

Mode

Mode is that value which occurs most often in the series.

It is the value around which, the items tends to be heavily concentrated.

It is important average when we talk about “most common size of shoe or shirt”.

Ritesh Singhal

Relationship among Mean, Median & Mode

For a symmetric distribution: Mode = Median = Mean

The empirical relationship between mean, median and mode for asymmetric distribution is: Mode = 3 Median – 2 Mean

Ritesh Singhal

Skewness

Mode: Peak of the curve.Median: Divide the curve into two equal

parts.Mean: Center of gravity of the curve.

For a positively skewed distribution:Mean>Median>Mode For a Negatively skewed distribution: Mean<Median<Mode

Ritesh Singhal

Dispersion or Variation

The average does not enable us to draw a full picture of the distribution. So a further description is necessary to get a better description.

The extent or degree to which data tends to spread around an average is called dispersion & Variation.

Ritesh Singhal

Objectives

For judging the reliability of averages. Comparison of distributions Useful for controlling variability Useful in further analysis

Ritesh Singhal

Measure of Dispersion

Range Inter quartile Range Mean Deviation Standard Deviation

Ritesh Singhal

Range

Range is the difference between the largest and the smallest observation.

Range = L-S It is easy to calculate and provides a

full picture of variation of the data quickly.

It is crude measure & not based on all the observations.

Ritesh Singhal

Correlation Analysis

Correlation denotes the degree of interdependence between variables or the tendency of simultaneous variation between variables.

Types of Correlation:1. Positive & Negative2. Linear & Non-linear3. Multiple & Partial

Ritesh Singhal

Positive & Negative Correlation Positive Income Vs

Expenditure Agricultural Prod Vs

Rainfall Sales Vs Advt Expd Cost of raw

material Vs Cost of Industrial Prod

Negative Price Vs

Consumption Day temp Vs Sale

of Woolen clothes

Ritesh Singhal

Measure of Correlation

Scatter Diagram Method Karl Pearson’s Coefficient of

Correlation Spearman’s Coefficient of Rank

Correlation Concurrent Deviation Method

Ritesh Singhal

Scatter Diagram Method

It is a graphical method to find the correlation between variables.

Here the pair of the observations are plotted on a 2-D space.

After joining the these points we can have the idea about the relationship between variables.

Ritesh Singhal

Karl-Pearson’s coefficient of correlation (r)

The value of r lying between -1 and +1 i.e., -1≤r ≤+1

Coefficient of correlation is independent of change origin and scale.

Coefficient ‘r’ is symmetric rxy=ryx

The Probable error of ‘r’ is used to interpreting its estimated value.

Ritesh Singhal

Spearman’s Coefficient of Rank Correlation

Karl-Pearson’s method discusses the relationship between the quantitative variable where as Spearman’s coefficient suitable for qualitative variable like, rank given to the participant in any contest by two judges and we want to measure the relationship between rank given by these judges.

Ritesh Singhal

Concurrent Deviation Method

This is the simplest method in which only the direction of change is taken into consideration rather than magnitude of variation.

It gives a general idea about the correlation between variables quickly.

Ritesh Singhal

Regression Analysis It is concerned with the formulation

and determination of algebraic expression for the relationship between variables.

For this purpose we use regression lines.

These regression line are used for predicting the value of one variable from that of other.

Ritesh Singhal

Regression Analysis contd..

Here the variable whose value is to be predicted is called dependent (Explained) variable and the variable used for prediction is called independent (Explanatory) variable.

This method first introduced by “Sir Francis Galton”.

It helps in prediction & estimation.

Ritesh Singhal

Properties of Regression Lines & Coefficient

The regression line Y on X is used to estimate the best value of Y (Dep.) for a given value of X (Indep.).

The regression line X on Y is used to estimate the best value of X (Dep.) for a given value of Y (Indep.).

Both the regression coefficients are independent of change of origin & scale.

Ritesh Singhal

Properties of Regression Lines & Coefficient (contd..)

The relation between r, byx and bxy is

r = ±√ byx bxy

Both the regression coefficient should have same sign.

Both the regression coefficient could not more than one simultaneously.

Regression coefficient denotes the rate of change. i.e. byx measure the change in Y for a unit change in X.

Ritesh Singhal

Properties of Regression Lines & Coefficient (contd..)

Both lines cut each other at (X, Y). If r=0, both lines perpendicular to

each other. If the regression lines are identical,

the correlation between the variable is perfect.

Ritesh Singhal

Standard Error of Estimate

It provides us a measure of scatter of the observations about an average line, the standard error of estimate of Y on X is:

SY.X = √ [Σ(Y-Yest)2 / N]

Ritesh Singhal

Probability

Probability is a concept which numerically measures the degree of uncertainty or certainty of the occurrence of any event. i.e. the chance of occurrence of any event.

The probability of an event A is No. of Favorable cases

P(A)= Total No. of Cases

Ritesh Singhal

Probability

If P(A)=0, Impossible Event If P(A)=1, Sure Event 0≤P(A)≤1 P(A)= Probability of occurrence P(Ā)= Probability of Non-occurrence P(A) + P(Ā) = 1

Ritesh Singhal

Some Keywords Equally Likely Events: When the

chance of occurrence of all the events are same in an experiment.

Mutually Exclusive Events: If the occurrence of any one of them prevents the occurrence of other in the same experiment.

Sample Space: the set of all possible outcomes.

Ritesh Singhal

Some Keywords

Independent Events: If two or more events occur in such a way that the occurrence of one does not effect the occurrence of other.

Dependent Events: If the occurrence of one event influences the occurrence of the other.

Ritesh Singhal

Classical or Priori Probability

If a trial result in ‘n’ exhaustive, mutually exclusive and equally likely cases and ‘m’ of them are favorable to the happenings of an event E, then the probability ‘P’ of happening of E is given by:

P(E) = m / n

Ritesh Singhal

Empirical or Posteriori Probability

The classical def requires that ‘n’ is finite and that all cases are equally likely.

This condition is very restrictive and can not cover all situations.

The above conditions are not necessarily active in this case.

Ritesh Singhal

Fundamental rule of counting

If an event can occur in ‘m’ ways and following it, a second event can occur in ‘n’ ways, then these two event in succession can occur in ‘mxn’ ways.

E.g. A tricolor can be formed out of 6 colors in 6x5x4=120 ways.

No. of words of 3 characters out of 26 alphabets 26x25x24= 15600 ways.

Ritesh Singhal

Permutations The different arrangement can be

made out of a given no. of things by taking some or all at a time are called permutations.

P (n,r) = n! / (n-r)! E.g. permutations made with letters

a,b,c by taking two at a time:P(3,2)=6ab, ba, ac, ca, bc, cb

Ritesh Singhal

Combinations The combination of ‘n’ different

objects taken ‘r’ at a time is a selection of ‘r’ out of ‘n’ objects with no attention given to order of arrangement

C (n,r) = n!/r!(n-r)!e.g. From 5 boys & 6 girls a group of 3 is

to be formed having 2 boys & 1 girl is C(5,2) x C(6,1) = 60 ways

Ritesh Singhal

Example

A coin is tossed three times. Find the probability of getting:

i) Exactly one headii) Exactly two headiii) One or two head

Ritesh Singhal

Example

One card is randomly drawn from a pack of 52 cards. Find the probability that

i) Drawn card is redii) Drawn card is an aceiii) Drawn card is red and kingiv) Drawn card is red or king

Ritesh Singhal

Example

A bag contains 3 red, 6 white and 7 blue balls. Two balls are drawn at random. Find the probability that

i) Both the balls are white.ii) Both the balls are blue.iii) One ball is red & other is white.iv) One ball is white & other is blue.

Ritesh Singhal

Addition Theorem

For any two event A and B the probability for the occurrence of A or B is given by:P(AUB)= P(A) + P(B) – P(AПB)If A & B are mutually Exclusive then P(AПB)=0 P(AUB)= P(A) + P(B)

Ritesh Singhal

Multiplication or Conditional Probability

The probability of an event B when it is known that the event A has occurred already: P(B/A)= P(AПB) / P(A) ;if P(A)>0

ie. P(AПB)= P(A).P(B/A) If A and B are Independent event:

P(AПB)= P(A).P(B)

Ritesh Singhal

Example A bag contains 25 balls numbered from 1

to 25. Two balls are drawn at random from the bag with replacement. Find the probability of selecting:

i) Both odd numbers.ii) One odd & one even.iii) At least one odd.iv) No odd numbers.v) Both even numbers.

Ritesh Singhal

Example

Five men in a company of 20 are graduate. If 3 men are picked up at random, what is the probability that they are all graduate? What is the probability that at least one is graduate.

Ritesh Singhal

Example

The probability that A hits a target is 1/3 and the probability that B hits the target is 2/5. What is the probability that the target will be hit, if each one of A and B shoots at the target.

Ritesh Singhal

Expected Value of Probability

Let X be the random variable with the following distribution:

X : x1 x2 x3………..

P(X) :P(x1) P(x2) P(x3)……..

Expected Value is given by:E(X) = Σ xi . P (xi)

Ritesh Singhal

Example

A player tossed two coins. If two heads show he wins Rs. 4. if one head shows he wins Rs. 2, but if two tails show he pays Rs. 3 as penalty. Calculate the expected value of the game to him.

Solution:E(X)= (-3) ¼ + (2) ½ + (4) ¼ =1.25

Ritesh Singhal

Example An insurance company sells a

particular life insurance policy with a face value of Rs. 1000 and a yearly premium of Rs. 20. If 0.2% of the policy holder can be expected to die in the course of a year, what would be the company’s expected earning per policy holder per year.

E(X)= (-980) 0.002 + (20) 0.998=18

Ritesh Singhal

Theoretical Probability Distribution

top related