dr. ameer kadhim hussein m.b.ch.b.ficms (com.)

Post on 01-Oct-2021

4 Views

Category:

Documents

0 Downloads

Preview:

Click to see full reader

TRANSCRIPT

Biostatistics

Presentation of data

DR. AMEER KADHIM HUSSEIN

M.B.CH.B.FICMS (COM.)

PRESENTATION OF DATA

1. Mathematical presentation (measures of

central tendency and measures of

dispersion).

2. Tabular presentation.

3. Graphical presentation.

4. Pictorial presentation.

(Map chart).

TABULAR PRESENTATION

Presentation of data in tables make them

into a compact, concise and readily

comprehensible form. They can display

the characteristics of data more efficiently

than the raw data.

TYPES OF TABLES

1.Simple Table including one variable (quantitative or

qualitative) and the corresponding frequency.

2. Cross tabulation is a tabular method for simultaneously

summarizing the data for two

categorical variables.

CRITERIA FOR PROPER TABLE

1.Simple. 2.Understandable and self explanatory (all symbols should be explained in details in a foot note,each row or column should be labeled clearly, units of the data should be clearly mentioned,the title should be clear, precise, and should answer the questions, what? where? and when? and totals should be shown. 3.The title should be separated from the body of the table by lines or spaces. 4.Avoid too much ruling. 5.If the data are not original, their source should be mentioned as a foot note or in the title.

GRAPHICAL AND PICTORIAL

PRESENTATION

The use of diagrams or pictures to describe the

distribution or characteristics of one or more

sets of data in a compact and readily

comprehensible form. They can provide a

better visual presentation

of characteristics of data than

tabular presentation.

1. Vertical and horizontal scales should be clearly

labeled and units identified.

2. Keep graphs as simple as possible – avoid too

many bars or lines – two or three is appropriate –

more than four is probably too many.

3. Graphs are designed to provide a “snapshot” of

the results – use tables for details.

4. Avoid presentation of numbers

in the body of a graph.

Criteria For proper graph

Qualitative Data:

Tabular presentation include:

1. Frequency distribution.

2. Relative frequency distribution.

3. Percent frequency distribution.

4. Cross tabulation.

Graphical presentation include:

1.Bar chart.

2. Pie chart.

Tabular and Graphical Presentation of data

Quantitative data

Tabular presentation include:

1. Frequency distribution.

2. Relative frequency distribution.

3. Cumulative frequency.

4. Cumulative relative frequency.

Graphical presentation include:

1. Histogram.

2. Frequency polygon.

3. Scatter diagrams.

4. Line graph.

Tabular and Graphical Presentation of data

Qualitative Data

Frequency: It determines the number of

observations falling into each category.

Relative frequency: It determines the proportion

of observation in the particular class relative to

the total observations.

A relative frequency distribution

is a tabular summary of a set of data showing the

relative frequency for each class.

The percent frequency of a class is the relative

frequency multiplied by 100.

FREQUENCY DISTRIBUTION

FREQUENCY DISTRIBUTION

Example:

A sample of 10 students were examined by

certain teacher and the results of examination

was as below:

1. good 2. very good 3. good

4. excellent 5. poor 6. very good

7. good 8. poor 9. excellent

10. poor

FREQUENCY DISTRIBUTION

Frequency Results

3 poor

3 good

2 Very good

2 excellent

10 Total

RELATIVE FREQUENCY AND PERCENT

FREQUENCY DISTRIBUTION

Percent

frequency

Relative

frequency Results

30% 0.3 poor

30% 0.3 good

20% 0.2 Very good

20% 0.2 excellent

100% 1 Total

BAR GRAPH

A bar graph is a graphical device for depicting qualitative data. On the horizontal axis we specify the labels that are used for each of the classes. A frequency, relative frequency, or percent frequency scale can be used for the vertical axis. Using a bar of fixed width drawn above each class label, we extend the height appropriately. The bars are separated to emphasize the fact that each class is a separate category.

BAR GRAPH

0%

10%

20%

30%

PoorGood

Very goodExcellent

30% 30%

20% 20%

PIE CHART

The pie chart is a commonly used graphical device for presenting relative frequency distributions for qualitative data. First draw a circle; then use the relative frequencies to subdivide the circle into sectors that correspond to the relative frequency for each class. Since there are 360 degrees in a circle, a class with a relative frequency of 0.25 would consume 0.25(360) = 90 degrees of the circle.

PIE CHART

30%

30%

20%

20%

Poor

Good

Very good

Excellent

CROSS-TABULATIONS

Cross-tabulation : is a tabular method for simultaneously summarizing the data for two categorical variables. Steps for Constructing a Cross-tabulation

1. Put the categories of one variable at the top of each column, and the categories of the other variable at the beginning of each row.

2. For each row and column combination, enter the number of observations that fall in the two categories.

3.The bottom of the table gives the column totals, and the right-hand column gives the row totals.

CROSS-TABULATIONS

Total

Group

Gender Control Case

40 10 30 Male

60 40 20 Female

100 50 50 Total

Table: Distribution of case and control groups by gender

CLUSTERED BAR GRAPHS

Clustered bar graphs are useful for comparing two

categorical variables and are often used in conjunction

with cross-tabulations . (we can use frequency or

relative frequency ).

Quantitative Data

To group a set of observations, we select a set of

contagious, non overlapping intervals, such that each

value in the set of observation can be placed in one, and

only one, of the interval, and no single observation

should be missed.

The interval is called: Class interval

THE FREQUENCY DISTRIBUTION

Number of class interval : Too few intervals are not good because information will be lost. Too many intervals are not helpful to summarize the data.

A commonly followed rule is that number of class interval should be not fewer than 6 and not more than 15.

The specific guidance to decide the number of classes is (Sturges formula).

k = 1 + 3.322 (log n) k= number of class intervals. n= number of observations in the set. The result should not be regarded as final, but can be regard as guide only. The number of class intervals obtain by sturges rule can be increase or decrease for convenience and clear presentation.

THE FREQUENCY DISTRIBUTION

Range: It is the difference between the largest and the smallest

observation in the data set.

The Width of the interval (w):

Class intervals generally should be of the same width, but

sometimes this is impossible to do. Width of class interval can be

obtain by the following formula:

W= Width of the class interval , R= Range , K= Number of class

intervals.

To make the summarization more comprehensible, the class width

may be 5 or 10 or the multiples of 10.

THE FREQUENCY DISTRIBUTION

Frequency: It determines the number of observations falling into

each class interval.

Relative frequency: It determines the proportion of observation in

the particular class interval relative to the total observations in the

set.

Cumulative frequency: This is calculated by adding the number of

observation in each class interval to the number of observations in

the class interval above, starting from the second class interval

onward.

Cumulative relative frequency: This calculated by adding the

relative frequency in each class interval to the relative frequency in

the class interval above, starting also from the second class interval

onward.

THE FREQUENCY DISTRIBUTION

Cumulative frequency and cumulative relative frequency

distributions are used to facilitate obtaining information regarding

the frequency or relative frequency within two or more contagious

class intervals.

The Mid-interval (midpoint):

It can be computed by adding the lower bound of the interval plus

the upper bound of it and then divide by 2.

THE FREQUENCY DISTRIBUTION

91 78 93 57 75 52 99 80 97 62

71 69 72 89 66 75 79 75 72 76

104 74 62 68 97 105 77 65 80 109

85 97 88 68 83 68 71 69 67 74

62 82 98 101 79 105 79 69 62 73

The following are the heart rate of 50 patients

1. Number of classes :

(K) = 1+3.322 Log 50

= 1+3.322 (1.69)

= 1+ 5.64 = 6.64 6

2. Width of class interval

W = R/K

= 109 – 52 /6 = 57/6 = 9.5 10

ANSWER

Frequency, cumulative frequency, relative frequency

and cumulative relative frequency distribution of

heart rate of 50 patients

Cumulative

relative

frequency

Relative

frequency

Cumulative

frequency

Frequency Class

interval

0.04 0.04 2 2 50-59

0.3 0.26 15 13 60-69

0.62 0.32 31 16 70-79

0.76 0.14 38 7 80-89

0.9 0.14 45 7 90-99

1 0.1 50 5 100-109

1 50 Total

Example: The following are the hemoglobin values

(g/100ml) of 30 children receiving treatment for hemolytic anemia.

10.0 8.7 6.7 7.8 8.9 10.8

9.7 9.9 8.5 7.5 9.0 10.0

9.1 9.1 8.4 10.6 10.2 8.5

8.6 9.7 9.7 9.6 10.2 11.4

12.2 9.4 9.3 8.4 8.2 9.2

Order the sample observations by size,

6.7 7.5 7.8 8.2 8.4 8.4

8.5 8.5 8.6 8.7 8.9 9.0

9.1 9.1 9.2 9.3 9.4 9.6

9.7 9.7 9.7 9.9 10.0 10.0

10.2 10.2 10.6 10.8 11.4 12.2

No. of classes = 1+ 3.322 (Log10 30) 6

Width = (12.2 – 6.7) / 6 1

Cumulative

relative

frequency

Relative

frequency

Cumulative

frequency Midpoint Frequency

True class

limits

0.033 0.033 1 7 1 6.5 -7.5

0.2 0.167 6 8 5 7.5 - 8.5

0.567 0.367 17 9 11 8.5 - 9.5

0.867 0.300 26 10 9 9.5-10.5

0.967 0.100 29 11 3 10.5 - 11.5

1 0.033 30 12 1 11.5 - 12.5

1 30 Total

A common graphical presentation of quantitative data is a

histogram.

The variable of interest is placed on the horizontal axis.

A rectangle is drawn above each class interval with its height

corresponding to the interval’s frequency, relative frequency, or

percent frequency.

Unlike a bar graph, a histogram has no natural separation between

rectangles of adjacent classes.

To draw the histogram, the true classes limits should be used.

They can be computed by subtracting 0.5 from the lower limit and

adding 0.5 to the upper limit for each interval.

HISTOGRAM

0

2

4

6

8

10

12N

o. o

f ch

ildre

n

Hemoglobin values (g/100ml)Fig ( ) Hemoglobin values of children receiving

treatment for hemolytic anemia

6.5 7.5 8.5 9.5 10.5 11.5 12.5

FREQUENCY POLYGON

Another form of graphical presentation of frequency distribution

of quantitative variables.

It is similar to the histogram, but instead of using rectangles to

present data, the midpoint of the top of each rectangle are

plotted, and connected together by straight lines.

SCATTER DIAGRAM

A scatter diagram is a graphical presentation of

the relationship between two quantitative

variables.

One variable is shown on the horizontal axis and

the other variable is shown on the vertical axis.

The general pattern of the plotted points suggests

the overall relationship between the variables.

SCATTER DIAGRAM

LINE GRAPH

A line graph is used to show trend of events with passage of time and

show how frequency of particular event change over time. Time

could be (Seconds - Minutes - Hours – Days - Weeks - Months –

Years - Decades - Centuries – etc).

Money spent this week

$0.00

$5.00

$10.00

$15.00

$20.00

$25.00

Mon. Tues. Wed. Thurs. Fri.

Day

Am

outn

of $

LINE GRAPH

Calories burned while running

0

20

40

60

80

100

120

140

160

180

200

220

240

30 60 90 120

150

180

210

240

270

Hours

Cal

orie

s

PICTORIAL PRESENTATION

Small pictures or symbols are used to present data.

For example: picture of no horn on road near a hospital.

1. PICTOGRAM

PICTORIAL PRESENTATION

These maps are prepared to show the geographical distribution of

frequencies of a characteristic.

2. MAP DIAGRAM OR SPOT MAP

Thank you

top related