history of numbers and expectations 1 krishna.v.palem kenneth and audrey kennedy professor of...

41
History of Numbers and Expectations 1 Krishna.V.Palem Kenneth and Audrey Kennedy Professor of Computing Department of Computer Science, Rice University

Upload: harvey-miles

Post on 02-Jan-2016

214 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: History of Numbers and Expectations 1 Krishna.V.Palem Kenneth and Audrey Kennedy Professor of Computing Department of Computer Science, Rice University

History of Numbers and Expectations

1

Krishna.V.PalemKenneth and Audrey Kennedy Professor of ComputingDepartment of Computer Science, Rice University

Page 2: History of Numbers and Expectations 1 Krishna.V.Palem Kenneth and Audrey Kennedy Professor of Computing Department of Computer Science, Rice University

ContentTake home exerciseContinuation of History of NumbersBayes’ TheoremGeneral Product RuleExpectations

2

Page 3: History of Numbers and Expectations 1 Krishna.V.Palem Kenneth and Audrey Kennedy Professor of Computing Department of Computer Science, Rice University

Take Home Exercise - 7 Suppose two players A and B play a game of dice Let the dice be fair and unique: Suppose one is colored blue and other black Let Player A rolls this pair of dice and player B guess the numbers on them Let the outcome of rolling be (x,y), where x is outcome on the blue die and y

is the outcome on the black die.This implies (x, y) ≠ (y, x)

Player A “may” inform Player B about the number of “even numbers” in the outcome of rolling the pair of dies.For example, (2,3) has only 1 even number while (4,6) has 2 even numbers

(1)Calculate the probabilities of guessing the correct (x,y) for each pair of x,yWithout any informationWith the information of number of even numbers as stated above.

Tabulate the results of all the (x,y) with and without the information Take a single case say (2,5) and derive the above probability of guessing

(2,5) using conditional probability.

3

Page 4: History of Numbers and Expectations 1 Krishna.V.Palem Kenneth and Audrey Kennedy Professor of Computing Department of Computer Science, Rice University

Take Home Exercise - 7Part 2: (Slightly tricky)Consider the same game but we are not

interested in individual cases of (x,y)Instead we are only interested in the probability

of player B guessing the correct numberHow often will player B guess the correct

answerWithout any informationAnd with informationHint: Find the probabilities of getting “1 even

number”, “2 even numbers and “3 even numbers”

4

Page 5: History of Numbers and Expectations 1 Krishna.V.Palem Kenneth and Audrey Kennedy Professor of Computing Department of Computer Science, Rice University

ContentTake home exerciseContinuation of History of NumbersBayes’ TheoremGeneral Product RuleExpectations

5

Page 6: History of Numbers and Expectations 1 Krishna.V.Palem Kenneth and Audrey Kennedy Professor of Computing Department of Computer Science, Rice University

The Babylonian Number systemThe Babylonians lived in Mesopotamia, a fertile plain

between the Tigris and Euphrates rivers.3400 BC: The Egyptians and Babylonians were first

recorded as using the natural numbers and rational numbers.

The base of a number system is the number of symbols available for representation.The modern day numbers are a base-10 system, because we

have 10 different symbols for all our numbers.

Babylonians had a base of 60. That means they had 60 symbols.

6

0 1 2 3 4 5 6 7 8 9

…. …. ….

Notice the white space for a zero !!

0 1 2 3 4 58 59

Page 7: History of Numbers and Expectations 1 Krishna.V.Palem Kenneth and Audrey Kennedy Professor of Computing Department of Computer Science, Rice University

Sumerian/ Babylonian Numerals

7

Though we mentioned that the Babylonians had a base of 60 which means that they have to remember 60 different symbols to use numbers, they had invented a clever way of creating allthe 60 numerals from just two symbols . And

They needed 59 numerals to represent all the numbers from 1 and 59. So they had a symbol for 10 and another one for 1. Thus the number of times each one of them is repeated gives the numeral forthat particular number.

Now let us create the numeral 7

Thus to create a numeral for 7, we need to combine 7 of the symbols for 1.

Page 8: History of Numbers and Expectations 1 Krishna.V.Palem Kenneth and Audrey Kennedy Professor of Computing Department of Computer Science, Rice University

Sumerian/ Babylonian Numerals

8

Thus to create a symbol for 23, we need two symbolsof ‘10’ and three of ‘1’.

Now let us consider another example

Let us create the numeral 23

Similarly

Page 9: History of Numbers and Expectations 1 Krishna.V.Palem Kenneth and Audrey Kennedy Professor of Computing Department of Computer Science, Rice University

Sumerian/ Babylonian Numerals

9

To create other numerals there are some simple rules that have to be followed.

1. When we are stacking symbols for 1 to create numerals, each stack should have at most three in each row.

For example, To create the number 3We stack all the three symbols in a single row.

To create the number 5 We stack three in the first rowand then the remaining two in the next row.

Similarly when we stack the symbols for 10, the symbols are stacked in the same way except diagonally.

Can you infer from the following ?

Page 10: History of Numbers and Expectations 1 Krishna.V.Palem Kenneth and Audrey Kennedy Professor of Computing Department of Computer Science, Rice University

Constructing bigger numerals from small numerals

10

Thus we obtain the complete set of symbols for all the 59 numbers.

0

Now as we have all the numerals for the number system, we need to understand how to write to write a number.

Page 11: History of Numbers and Expectations 1 Krishna.V.Palem Kenneth and Audrey Kennedy Professor of Computing Department of Computer Science, Rice University

Positional Significance

11

Consider the number 4256. This is a numeral.

What is the value of this numeral ?

How is it evaluated?

4256 = 4*1000 + 2*100 + 5*10 + 6

Positional significance

Positional significance is the mechanism by which a symbol is elevated in it’s value to easilycreate bigger numbers.

This is done by writing the symbols adjacent to each other to create bigger numerals.

Powers of 10

Page 12: History of Numbers and Expectations 1 Krishna.V.Palem Kenneth and Audrey Kennedy Professor of Computing Department of Computer Science, Rice University

Sumerian/ Babylonian Numbers

12

For example, let us try writing in Babylonian system, the value 4256

So, 4256 can be written as 4256 = 1*3600 + 10*60 + 56*1 = 1*602 + 10*601 + 56*600

As the Babylonian system has a base of 60, the positional significance of each symbol varies with as a power of 60.

Now, we have to represent this in terms of the Babylonian numerals

Solution:

Exercise: Represent the quantity 2764 in Babylonian number system.

Page 13: History of Numbers and Expectations 1 Krishna.V.Palem Kenneth and Audrey Kennedy Professor of Computing Department of Computer Science, Rice University

A quick recap

13

Topic Tally Marks Conventional

Sumerian

Symbol 0,1,2,3… ,9

Numerals 4256

Base 1 10 60

Positional significance

Powers of 1 Powers of 10 Powers of 60

Page 14: History of Numbers and Expectations 1 Krishna.V.Palem Kenneth and Audrey Kennedy Professor of Computing Department of Computer Science, Rice University

Problems in Sumerian/ Babylonian System

14

Now there is a potential problem with the system.

Using this number system let us represent the two numbers 61 and 2.

First 61 = 1*60 +1*1 is represented as

And 2 = 2*1 is represented as

The only difference being the spacebetween the symbols.

A much more serious problem was the fact that there was no symbol for zero.

They have exactly the same representation and now there was no

way that spacing could help.

Let us see for ourselves. Let us represent the numbers 1 and 60.

1 = 1*1 60= 1*60

Page 15: History of Numbers and Expectations 1 Krishna.V.Palem Kenneth and Audrey Kennedy Professor of Computing Department of Computer Science, Rice University

Egyptian CivilizationThe ancient Egyptians were possibly

the first civilization to practice the scientific arts.

But each symbol represented a power of 10. All other decimal numbers were represented using the above symbols

15

Page 16: History of Numbers and Expectations 1 Krishna.V.Palem Kenneth and Audrey Kennedy Professor of Computing Department of Computer Science, Rice University

16

Egyptian Number SystemTo represent a quantity in Egyptian system, we first represent the quantity in terms of the powers of 10, similar to the present day system.

3244 3*1000 + 2*100 + 4*10 +4*1For example,

Exercise: Represent in Egyptian number system the quantity 21,237.

Solution:

Page 17: History of Numbers and Expectations 1 Krishna.V.Palem Kenneth and Audrey Kennedy Professor of Computing Department of Computer Science, Rice University

The different number systemsTally Marks(20000BC)

Sumerian(3000BC) Egyptian(3000BC)

Easy to update the number

Only two symbols used to generate all numerals.

Ease of representation and manipulation.

Larger numbers become difficult to represent and manipulate.

Manipulation is cumbersome because of the larger number of numerals.

Difficult to use too, but has a hint of the modern base-10 number system in it’s positional significance.

17

Page 18: History of Numbers and Expectations 1 Krishna.V.Palem Kenneth and Audrey Kennedy Professor of Computing Department of Computer Science, Rice University

ContentTake home exerciseContinuation of History of NumbersBayes’ TheoremGeneral Product RuleExpectations

18

Page 19: History of Numbers and Expectations 1 Krishna.V.Palem Kenneth and Audrey Kennedy Professor of Computing Department of Computer Science, Rice University

Who was Bayes?Thomas Bayes was a British mathematician and a Church

minister He formulated the famous Bayes theorem his work on this was published posthumously

as Essay Towards Solving a Problem in the Doctrine of Chances (1764)

His work on Bayes theorem gave birth to the branch of Statistics

Bayesian probability is the name given to several related interpretations of probability,

they have in common the notion of probability as something like a partial belief, rather than a frequency.

"Bayesian" has been used in this sense since about 1950

19

Page 20: History of Numbers and Expectations 1 Krishna.V.Palem Kenneth and Audrey Kennedy Professor of Computing Department of Computer Science, Rice University

Deriving Bayes Theorem with an Example

Suppose you have a closed box containing a large number of black and white balls.you do not know the proportion of black and white balls

You take out a sample of balls from the box and find that there are three-fourths of black balls in the sample

Bayes worked out a theorem which indicates exactly how opinions held before the experiment should be modified by the evidence of the sample

20

What is your guess about the composition of balls in the box?

Now, what is your guess about the composition of balls in the box?

Page 21: History of Numbers and Expectations 1 Krishna.V.Palem Kenneth and Audrey Kennedy Professor of Computing Department of Computer Science, Rice University

Birth of StatisticsStatistics arose from the need of states to collect data

on their people and economiesfor administrative purposesstarted in 18th century

Bayes theorem provided the mathematical basis for this branchinitial intuition was given by Francis BaconThomas Bayes provided the first mathematical basis to this

branch of logic

Its meaning broadened in the early 19th century to include the collection and analysis of data in general. today statistics is widely employed in government, business,

and the natural and social sciences.

21

Page 22: History of Numbers and Expectations 1 Krishna.V.Palem Kenneth and Audrey Kennedy Professor of Computing Department of Computer Science, Rice University

Probability Theory Vs Statistics

Probability theory computes the probability that future (and hence presently unknown) samples out of a known population turn out to have stated characteristics

Statistics looks at the present and hence known sample taken out of an unknown population, and makes estimates of what the population is likely to be, compares likelihood of various populations and tells how confident you have a right to be about these estimates

22

Page 23: History of Numbers and Expectations 1 Krishna.V.Palem Kenneth and Audrey Kennedy Professor of Computing Department of Computer Science, Rice University

What is Bayes Theorem?Bayes' theorem relates the conditional and

unconditional probabilities of events A and B, where B has a non-zero probability:

Each term in Bayes' theorem has a conventional name:P(A) is the prior probability or unconditional probability of A.

It is "prior" in the sense that it does not take into account any information about B.

P(A|B) is the conditional probability of A, given B. P(B|A) is the conditional probability of B given A.P(B) is the prior or marginal probability of B

23

Page 24: History of Numbers and Expectations 1 Krishna.V.Palem Kenneth and Audrey Kennedy Professor of Computing Department of Computer Science, Rice University

Alternate Form of Bayes Theorem

Consider that A has two events : A1 and A2

If we want to compute the probability of A1 given B, then

But, P(B) can be written as

Hence, we get

More generally, Bayes theorem can be written as

24

)A(P)A|B(P)A(P)A|B(P)B(P 2211

)A(P)A|B(P)A(P)A|B(P

)A(P)A|B(P)B|A(P

2211

111

Page 25: History of Numbers and Expectations 1 Krishna.V.Palem Kenneth and Audrey Kennedy Professor of Computing Department of Computer Science, Rice University

Understanding Bayes Theorem

Bayes theorem is often used to compute posterior probabilities given observations. For example, a patient may be observed to have

certain symptoms. Bayes' theorem can be used to compute the probability

that a proposed diagnosis is correct, given that observation.

Intuitively, Bayes’ theorem in this form describes the way in which one's beliefs about observing ‘A’ are updated by having observed ‘B’.

25

Page 26: History of Numbers and Expectations 1 Krishna.V.Palem Kenneth and Audrey Kennedy Professor of Computing Department of Computer Science, Rice University

Derivation of Bayes TheoremTo derive the theorem, we start from the

definition of conditional probability. The probability of event A given event B is

Equivalently, the probability of event B given event A is

Rearranging and combining these two equations, we find

Dividing both sides by P(B), provided that it is non-zero, we obtain Bayes' theorem:

26

( is probability of A and B occurring simultaneously)

Page 27: History of Numbers and Expectations 1 Krishna.V.Palem Kenneth and Audrey Kennedy Professor of Computing Department of Computer Science, Rice University

ContentTake home exerciseContinuation of History of NumbersBayes’ TheoremGeneral Product RuleExpectations

27

Page 28: History of Numbers and Expectations 1 Krishna.V.Palem Kenneth and Audrey Kennedy Professor of Computing Department of Computer Science, Rice University

General Product RuleAll along, we have been using product rule as given below

P(A and B and C and …) = P(A)P(B)P(C)….The above formula is a “Special case” of the general

Product Rule.All the problems we have been dealing with have

consisted of “Independent” EventsRolling of a pair of diesTossing of coinsTherefore, P(A and B and C and ….) = P(A)P(B)P(C)…..

But what if they were not independent? Will the same formula work?NO!!

So is there a general product rule which can be applied?YES!!

28

Page 29: History of Numbers and Expectations 1 Krishna.V.Palem Kenneth and Audrey Kennedy Professor of Computing Department of Computer Science, Rice University

General Product RuleSuppose we are interested in simultaneous

occurrence of event A, B and C. Suppose these events are all dependent on

each otherP(A and B and C) = P(A)P(B|A)P(C|A,B)In general for n different dependent events A1,

A2, A3….An

P(A1 and A2 and A3 …. An) = P(A1)P(A2|A1)P(A3|A1,A2)P(A4|A1,A2,A3)………………P(An|A1,A2,A3,….,An-1)

Can we derive it?29

Page 30: History of Numbers and Expectations 1 Krishna.V.Palem Kenneth and Audrey Kennedy Professor of Computing Department of Computer Science, Rice University

Proof for General Product Rule

30

Page 31: History of Numbers and Expectations 1 Krishna.V.Palem Kenneth and Audrey Kennedy Professor of Computing Department of Computer Science, Rice University

ContentTake home exerciseContinuation of History of NumbersBayes’ TheoremGeneral Product RuleIntroduction to Expectations

31

Page 32: History of Numbers and Expectations 1 Krishna.V.Palem Kenneth and Audrey Kennedy Professor of Computing Department of Computer Science, Rice University

In-class exercise -1

32

Let us change gears and move to a new topicConsider the following exercise

You will be given a coin and you toss itIf you get heads you get some reward (2 chips)

And if you get tails you do not get anythingAfter 10 rounds, how many chips do you think

you will have ?

Let us test it out for ourselves.

Page 33: History of Numbers and Expectations 1 Krishna.V.Palem Kenneth and Audrey Kennedy Professor of Computing Department of Computer Science, Rice University

33

Maintain the record of your game in the following way

Toss OutcomeNumber of chips till

that point

1

2

3

4

5

6

7

8

9

10

Observe these numbers

Can you now take a guessof your earnings after 20turns, 50 turns, 100 turns ?

Play this game for 20 rounds

Page 34: History of Numbers and Expectations 1 Krishna.V.Palem Kenneth and Audrey Kennedy Professor of Computing Department of Computer Science, Rice University

34

Can you now take a guessof your earnings after 20turns, 50 turns, 100 turns ?

Answering this question is very important to get an idea of how much you are going to earn.

Let us use probability

First let us calculate the earnings we can expect in one turn

i p

HEAD=0 ½

TAIL=1 ½

So we have ½ chance of earning 2 chips

We have ½ chance of earning nothing

So we can expect to earn ½ * 2 = 1 chip at the end of every turn on an average

Very important

Page 35: History of Numbers and Expectations 1 Krishna.V.Palem Kenneth and Audrey Kennedy Professor of Computing Department of Computer Science, Rice University

35

Let us see another example

This time we will consider another randomizing device – our friendly die

Earnings

Outcome

Earning

1 2

2 2

3 2

4 1

5 1

6 1

Throw OutcomeNumber of chips till

that point

1

2

3

4

5

6

7

8

9

10

Maintain a similar record till 20 throws in the following format

Page 36: History of Numbers and Expectations 1 Krishna.V.Palem Kenneth and Audrey Kennedy Professor of Computing Department of Computer Science, Rice University

36

We again visit the same question.Can you now take a guessof your earnings after 20turns, 50 turns, 100 turns ?

This time we know that the probability function of a die can be represented as

i p(i)

1 1/6

2 1/6

3 1/6

4 1/6

5 1/6

6 1/6

There is a 1/6th chance of winning 2 chipsThere is a 1/6th chance of winning 2 chipsThere is a 1/6th chance of winning 2 chipsThere is a 1/6th chance of winning 1 chipThere is a 1/6th chance of winning 1 chipThere is a 1/6th chance of winning 1 chip

On an average we can expect (1/6)*2 + (1/6)*2 + (1/6)*2 + (1/6)*1+ (1/6)*1 + (1/6)*1 = 3/2 chips every throw

What does this mean ?

Page 37: History of Numbers and Expectations 1 Krishna.V.Palem Kenneth and Audrey Kennedy Professor of Computing Department of Computer Science, Rice University

37

What do we mean when we say that the earning per game is 3/2 chips?

That does not seem right!!

To understand this, let us first consider the following question.

Consider an unbiased coin toss.

The probability of obtaining a HEAD = ½

But for n trials of the experiment do we always get n/2 HEADs and n/2 TAILs ?

Consider the following experiment:

Toss a coin 5, 10, 50, 100, 500, 1000 … 10000 times. At each point collect the data regarding number of HEADs and number of TAILs.

Now let us analyze data obtained from one such experiment.

Page 38: History of Numbers and Expectations 1 Krishna.V.Palem Kenneth and Audrey Kennedy Professor of Computing Department of Computer Science, Rice University

38

Noticethat the Y-axisis in logarithmicscale

What do you observe as the number of trials grows large ?

Page 39: History of Numbers and Expectations 1 Krishna.V.Palem Kenneth and Audrey Kennedy Professor of Computing Department of Computer Science, Rice University

39

Can you observe that as the number of trials grows “large” the result of the experimenttends to agree with the ideal case ?

Page 40: History of Numbers and Expectations 1 Krishna.V.Palem Kenneth and Audrey Kennedy Professor of Computing Department of Computer Science, Rice University

40

What does this mean with regard to expectation of 3/2 chip for each experiment ?

It means that as the number of trials(n) grows “large” then it can be expected thatthe earnings will be equal to 3/2 * n

Take Home:

Perform a similar analysis of the coin to the die experiment.Show that on an average when the number of trials grows very large the earnings is 3/2 per trial.

Page 41: History of Numbers and Expectations 1 Krishna.V.Palem Kenneth and Audrey Kennedy Professor of Computing Department of Computer Science, Rice University

END

41