making tables and figures with stata biostatistics 212 lecture 6
TRANSCRIPT
![Page 1: Making Tables and Figures with Stata Biostatistics 212 Lecture 6](https://reader035.vdocuments.us/reader035/viewer/2022062721/56649f1c5503460f94c320f5/html5/thumbnails/1.jpg)
Making Tables and Figures with Stata
Biostatistics 212
Lecture 6
![Page 2: Making Tables and Figures with Stata Biostatistics 212 Lecture 6](https://reader035.vdocuments.us/reader035/viewer/2022062721/56649f1c5503460f94c320f5/html5/thumbnails/2.jpg)
Housekeeping
• Brackets indicate optional parts of a command (usually!)• use vs. insheet• Low p-value for heterogeneity ≠ important interaction
• “The strata never lie”
• Final projects– Read 1-page directions closely
![Page 3: Making Tables and Figures with Stata Biostatistics 212 Lecture 6](https://reader035.vdocuments.us/reader035/viewer/2022062721/56649f1c5503460f94c320f5/html5/thumbnails/3.jpg)
Today
• Organizing your Stata files
• Making a table
• Making a figure
![Page 4: Making Tables and Figures with Stata Biostatistics 212 Lecture 6](https://reader035.vdocuments.us/reader035/viewer/2022062721/56649f1c5503460f94c320f5/html5/thumbnails/4.jpg)
Today
• Organizing your Stata files• Making a table Lab practice = Final Project
• Making a figure Lab practice = Lab 6
![Page 5: Making Tables and Figures with Stata Biostatistics 212 Lecture 6](https://reader035.vdocuments.us/reader035/viewer/2022062721/56649f1c5503460f94c320f5/html5/thumbnails/5.jpg)
Organizing your Stata files
• Pitfalls– Proliferating dataset– Can’t remember what you did– Can’t remember why you did it– Can’t easily redo with new data
![Page 6: Making Tables and Figures with Stata Biostatistics 212 Lecture 6](https://reader035.vdocuments.us/reader035/viewer/2022062721/56649f1c5503460f94c320f5/html5/thumbnails/6.jpg)
Organizing your Stata files
My system (it’s not perfect)1) Import data into Stata
a) Using a Stata command (e.g., insheet or import) within a do file
b) Using other method (e.g., StatTransfer?) outside a do file, then SAVE the “raw” Stata file immediately
2) Write a do file that “cleans” your data, and saves it as a new clean dataset
3) Write do files for each component of your analysis
![Page 7: Making Tables and Figures with Stata Biostatistics 212 Lecture 6](https://reader035.vdocuments.us/reader035/viewer/2022062721/56649f1c5503460f94c320f5/html5/thumbnails/7.jpg)
Raw data
My organizational scheme
![Page 8: Making Tables and Figures with Stata Biostatistics 212 Lecture 6](https://reader035.vdocuments.us/reader035/viewer/2022062721/56649f1c5503460f94c320f5/html5/thumbnails/8.jpg)
Raw data
Raw data.csv
Pre-process
My organizational scheme
![Page 9: Making Tables and Figures with Stata Biostatistics 212 Lecture 6](https://reader035.vdocuments.us/reader035/viewer/2022062721/56649f1c5503460f94c320f5/html5/thumbnails/9.jpg)
Raw data
Raw data.csv
In Stata
My organizational scheme
Pre-process
![Page 10: Making Tables and Figures with Stata Biostatistics 212 Lecture 6](https://reader035.vdocuments.us/reader035/viewer/2022062721/56649f1c5503460f94c320f5/html5/thumbnails/10.jpg)
Raw data
Raw data.csv
In Stata
Clean data.dta
Data prep.do Data prep.log
My organizational scheme
Pre-process
![Page 11: Making Tables and Figures with Stata Biostatistics 212 Lecture 6](https://reader035.vdocuments.us/reader035/viewer/2022062721/56649f1c5503460f94c320f5/html5/thumbnails/11.jpg)
Raw data
Raw data.csv
In Stata
Clean data.dta
Data prep.do Data prep.log Table 1.do
Table 1.log
My organizational scheme
Pre-process
![Page 12: Making Tables and Figures with Stata Biostatistics 212 Lecture 6](https://reader035.vdocuments.us/reader035/viewer/2022062721/56649f1c5503460f94c320f5/html5/thumbnails/12.jpg)
Raw data
Raw data.csv
In Stata
Clean data.dta
Data prep.do Data prep.log Table 1.do
Table 1.log
Table 1.xls
Cut and paste
My organizational scheme
Pre-process
![Page 13: Making Tables and Figures with Stata Biostatistics 212 Lecture 6](https://reader035.vdocuments.us/reader035/viewer/2022062721/56649f1c5503460f94c320f5/html5/thumbnails/13.jpg)
Raw data
Raw data.csv
In Stata
Clean data.dta
Data prep.do Data prep.log Table 1.do
Table 1.log
Table 1.xls
Cut and paste
My organizational scheme
Table 1.doc
Cut and paste
Pre-process
![Page 14: Making Tables and Figures with Stata Biostatistics 212 Lecture 6](https://reader035.vdocuments.us/reader035/viewer/2022062721/56649f1c5503460f94c320f5/html5/thumbnails/14.jpg)
Raw data
Raw data.csv
In Stata
Clean data.dta
Data prep.do Data prep.log Table 1.doTable 2.do
Table 1.logTable 2.log
Table 1.xls Table 2.xls
Cut and paste
My organizational schemeTable 1.doc Table 2.doc
Cut and paste
Pre-process
![Page 15: Making Tables and Figures with Stata Biostatistics 212 Lecture 6](https://reader035.vdocuments.us/reader035/viewer/2022062721/56649f1c5503460f94c320f5/html5/thumbnails/15.jpg)
Organizing your Stata files
• You will end up with:– 1 or 2 Stata datasets
• Data, from Excel.dta (only if you import outside your do file)
• Data.dta
– 1 do file used for cleaning• Data prep.do
– 1 do file to create each Table and Figure• Table 1.do, Figure 1.do, Text data.do, etc
– Matching log files (with the same names) for each do file• Data prep.log, Table 1.log, Figure 2.log, Text data.log, etc
![Page 16: Making Tables and Figures with Stata Biostatistics 212 Lecture 6](https://reader035.vdocuments.us/reader035/viewer/2022062721/56649f1c5503460f94c320f5/html5/thumbnails/16.jpg)
Organizing your Stata files
• Put them all in one folder called, “Stata files”, sort by file type.
• Example
![Page 17: Making Tables and Figures with Stata Biostatistics 212 Lecture 6](https://reader035.vdocuments.us/reader035/viewer/2022062721/56649f1c5503460f94c320f5/html5/thumbnails/17.jpg)
![Page 18: Making Tables and Figures with Stata Biostatistics 212 Lecture 6](https://reader035.vdocuments.us/reader035/viewer/2022062721/56649f1c5503460f94c320f5/html5/thumbnails/18.jpg)
Raw data
Raw data.csv
In Stata
Clean data.dta
Data prep.do Data prep.log Table 1.doTable 2.do
Table 1.logTable 2.log
Table 1.xls Table 2.xls
Cut and paste
Any questions?Table 1.doc Table 2.doc
Cut and paste
Pre-process
![Page 19: Making Tables and Figures with Stata Biostatistics 212 Lecture 6](https://reader035.vdocuments.us/reader035/viewer/2022062721/56649f1c5503460f94c320f5/html5/thumbnails/19.jpg)
Raw data
Raw data.csv
In Stata
Clean data.dta
Data prep.do Data prep.log Table 1.doTable 2.do
Table 1.logTable 2.log
Table 1.xls Table 2.xls
Cut and paste
My organizational schemeTable 1.doc Table 2.doc
Cut and paste
Lecture 3Pre-process
![Page 20: Making Tables and Figures with Stata Biostatistics 212 Lecture 6](https://reader035.vdocuments.us/reader035/viewer/2022062721/56649f1c5503460f94c320f5/html5/thumbnails/20.jpg)
Raw data
Raw data.csv
In Stata
Clean data.dta
Data prep.do Data prep.log Table 1.doTable 2.do
Table 1.logTable 2.log
Table 1.xls Table 2.xls
Cut and paste
My organizational schemeTable 1.doc Table 2.doc
Cut and paste
Lecture 3 Lecture 5Pre-process
![Page 21: Making Tables and Figures with Stata Biostatistics 212 Lecture 6](https://reader035.vdocuments.us/reader035/viewer/2022062721/56649f1c5503460f94c320f5/html5/thumbnails/21.jpg)
Raw data
Raw data.csv
In Stata
Clean data.dta
Data prep.do Data prep.log Table 1.doTable 2.do
Table 1.logTable 2.log
Table 1.xls Table 2.xls
Cut and paste
My organizational schemeTable 1.doc Table 2.doc
Cut and paste
Lecture 3 Lecture 5
Lecture 7
Pre-process
![Page 22: Making Tables and Figures with Stata Biostatistics 212 Lecture 6](https://reader035.vdocuments.us/reader035/viewer/2022062721/56649f1c5503460f94c320f5/html5/thumbnails/22.jpg)
Tables
• Two main purposes– Present the facts in a compact format
– Provide side-by-side comparisons
• Six main components:– Data
– Title, row heading, column headings
– Row names
– Footnotes
Browner, W. Publishing and Presenting Clinical Research
![Page 23: Making Tables and Figures with Stata Biostatistics 212 Lecture 6](https://reader035.vdocuments.us/reader035/viewer/2022062721/56649f1c5503460f94c320f5/html5/thumbnails/23.jpg)
5 Steps to Making a Table
• Step 1: Decide what the Table will be about– Sketch it out on paper
• Title, column headings, etc
![Page 24: Making Tables and Figures with Stata Biostatistics 212 Lecture 6](https://reader035.vdocuments.us/reader035/viewer/2022062721/56649f1c5503460f94c320f5/html5/thumbnails/24.jpg)
5 Steps to Making a Table
• Step 2: Make the dummy table– Excel or Word– Makes you specify what you actually want!
• Row headings
• Decide on category cut-offs, labels
• Decide on reference categories for regression, etc
• Footnote liberally
– Leave data cells blank
![Page 25: Making Tables and Figures with Stata Biostatistics 212 Lecture 6](https://reader035.vdocuments.us/reader035/viewer/2022062721/56649f1c5503460f94c320f5/html5/thumbnails/25.jpg)
5 Steps to Making a Table
• Step 3: Write a do file that will produce each number you need– Iterative process, as you know
![Page 26: Making Tables and Figures with Stata Biostatistics 212 Lecture 6](https://reader035.vdocuments.us/reader035/viewer/2022062721/56649f1c5503460f94c320f5/html5/thumbnails/26.jpg)
5 Steps to Making a Table
• Step 4: Copy and Paste the data in– Copy and Paste each number, or– “Copy Table” (under the “Edit” menu)
• http://www.stata.com/support/faqs/data/copytable.html
– Minimize manual retyping, rounding– Use Excel to calculate and round for you
![Page 27: Making Tables and Figures with Stata Biostatistics 212 Lecture 6](https://reader035.vdocuments.us/reader035/viewer/2022062721/56649f1c5503460f94c320f5/html5/thumbnails/27.jpg)
5 Steps to Making a Table
• Step 5: Format it so it looks nice– Standard, plain style – usually:
• Horizontal lines, not vertical
• Double-spaced
• Footnotes - *, †, ‡, §, ║, ¶ (or a,b,c,d,…)
– Create a template for yourself
![Page 28: Making Tables and Figures with Stata Biostatistics 212 Lecture 6](https://reader035.vdocuments.us/reader035/viewer/2022062721/56649f1c5503460f94c320f5/html5/thumbnails/28.jpg)
Word vs. Excel for Tables
• Stata Word– Fewer steps, fewer files– But…
• Can’t cut and paste full tables
• Doesn’t do any calculations for you
• Formatting can become “corrupted”
![Page 29: Making Tables and Figures with Stata Biostatistics 212 Lecture 6](https://reader035.vdocuments.us/reader035/viewer/2022062721/56649f1c5503460f94c320f5/html5/thumbnails/29.jpg)
Word vs. Excel for Tables
• Stata Excel Word– Can cut and paste values or whole tables– Set rounding, do calculations easily– Formatting easier?– Copy and Paste into Word (extra step)
![Page 30: Making Tables and Figures with Stata Biostatistics 212 Lecture 6](https://reader035.vdocuments.us/reader035/viewer/2022062721/56649f1c5503460f94c320f5/html5/thumbnails/30.jpg)
Demo
• Table 1 for “Moderate drinking and coronary calcium in young adults: The CARDIA Study”– Basic content
– Sketch
– Generate numbers in Stata
– Copy and paste into Word
– Show final table
– Demonstrate pasting a full table into Excel
![Page 31: Making Tables and Figures with Stata Biostatistics 212 Lecture 6](https://reader035.vdocuments.us/reader035/viewer/2022062721/56649f1c5503460f94c320f5/html5/thumbnails/31.jpg)
Figures
• When use a figure?
• Making a figure with Excel
• Making a figure with Stata
![Page 32: Making Tables and Figures with Stata Biostatistics 212 Lecture 6](https://reader035.vdocuments.us/reader035/viewer/2022062721/56649f1c5503460f94c320f5/html5/thumbnails/32.jpg)
When use a figure?
• When a graphical display of information more effectively conveys the intended message than words.
• “A picture is worth a thousand words”
![Page 33: Making Tables and Figures with Stata Biostatistics 212 Lecture 6](https://reader035.vdocuments.us/reader035/viewer/2022062721/56649f1c5503460f94c320f5/html5/thumbnails/33.jpg)
Figures
• “A picture is worth a thousand words”
52%48%
No Yes
Moderate alcohol consumption in CARDIA participants
How many words is this picture worth?
![Page 34: Making Tables and Figures with Stata Biostatistics 212 Lecture 6](https://reader035.vdocuments.us/reader035/viewer/2022062721/56649f1c5503460f94c320f5/html5/thumbnails/34.jpg)
Figures
• “A picture is worth a thousand words”
How many words is this picture worth?
48% of CARDIA participants consume alcohol moderately.
Worth = 7 words
![Page 35: Making Tables and Figures with Stata Biostatistics 212 Lecture 6](https://reader035.vdocuments.us/reader035/viewer/2022062721/56649f1c5503460f94c320f5/html5/thumbnails/35.jpg)
Figures
• “A picture is worth a thousand words”
How many words is this picture worth?
40%
39%
13%
8%
57%26%
9%8%
White Black
0 <1
1-1.9 2+
Alcohol consumption, in drinks/day
![Page 36: Making Tables and Figures with Stata Biostatistics 212 Lecture 6](https://reader035.vdocuments.us/reader035/viewer/2022062721/56649f1c5503460f94c320f5/html5/thumbnails/36.jpg)
Figures
• “A picture is worth a thousand words”
How many words is this picture worth?
White Black
Drinks/day n=1935 n=1727
0 40% 57%
0.1-0.9 39% 26%
1-1.9 13% 9%
2+ 8% 8%
Worth = 1 small table?
(and avoid pie charts in general…)
![Page 37: Making Tables and Figures with Stata Biostatistics 212 Lecture 6](https://reader035.vdocuments.us/reader035/viewer/2022062721/56649f1c5503460f94c320f5/html5/thumbnails/37.jpg)
Figures• “A picture is worth a thousand words”
How many words is this picture worth?
0.0
5.1
.15
.2P
reva
lenc
e of
cor
onar
y ca
lcifi
catio
n
Black women White women Black men White men
By race and genderPrevalence of coronary calcification in moderate drinkers and abstainers
Abstainer Moderate drinker
![Page 38: Making Tables and Figures with Stata Biostatistics 212 Lecture 6](https://reader035.vdocuments.us/reader035/viewer/2022062721/56649f1c5503460f94c320f5/html5/thumbnails/38.jpg)
Figures
• “A picture is worth a thousand words”
How many words is this picture worth?
Proportion with CAC
Abstainer Mod drinker
Black women .047 .036
White women .054 .049
Black men .068 .132
White men .180 .167
Can you see the interaction in this table without a figure?
(Figures are good for illustrating interactions)
![Page 39: Making Tables and Figures with Stata Biostatistics 212 Lecture 6](https://reader035.vdocuments.us/reader035/viewer/2022062721/56649f1c5503460f94c320f5/html5/thumbnails/39.jpg)
Figures
• “A picture is worth a thousand words”
How many words is this picture worth?
-20
00
-10
00
01
00
02
00
0
Ch
an
ge
in
FE
V1 (
mill
ilite
rs)
0 20 40 60
Pack-years of exposure to tobacco
Menthol smokers Non-menthol smokers
Menthol regression Non-menthol regression
![Page 40: Making Tables and Figures with Stata Biostatistics 212 Lecture 6](https://reader035.vdocuments.us/reader035/viewer/2022062721/56649f1c5503460f94c320f5/html5/thumbnails/40.jpg)
Figures
• “A picture is worth a thousand words”
How many words is this picture worth?
Worth = 968 data points?
Nice to show actual data points along with main effect, if possible!
![Page 41: Making Tables and Figures with Stata Biostatistics 212 Lecture 6](https://reader035.vdocuments.us/reader035/viewer/2022062721/56649f1c5503460f94c320f5/html5/thumbnails/41.jpg)
Making a figure
• With Excel– First make a TABLE in Excel!
• Use Stata to generate numbers for the table
– Create a figure from the Table using Excel tools
• With Stata– Use Stata commands to create the figure directly
![Page 42: Making Tables and Figures with Stata Biostatistics 212 Lecture 6](https://reader035.vdocuments.us/reader035/viewer/2022062721/56649f1c5503460f94c320f5/html5/thumbnails/42.jpg)
Steps in making an Excel figure
• Sketch your figure
• Make a dummy TABLE
• Write a .do file to fill in the table
• Copy and paste from the log file or the results window into the Table
• Use the Chart Wizard to create the Figure
• Format, format, format until it looks nice
![Page 43: Making Tables and Figures with Stata Biostatistics 212 Lecture 6](https://reader035.vdocuments.us/reader035/viewer/2022062721/56649f1c5503460f94c320f5/html5/thumbnails/43.jpg)
Example
• Figure 2 from Lipids paper
![Page 44: Making Tables and Figures with Stata Biostatistics 212 Lecture 6](https://reader035.vdocuments.us/reader035/viewer/2022062721/56649f1c5503460f94c320f5/html5/thumbnails/44.jpg)
Steps in making an Excel figure
• Sketch your figure
• Make a dummy TABLE
• Write a .do file to fill in the table
• Copy and paste from the log file or the results window into the Table
• Use the Chart Wizard to create the Figure
• Format, format, format until it looks nice
![Page 45: Making Tables and Figures with Stata Biostatistics 212 Lecture 6](https://reader035.vdocuments.us/reader035/viewer/2022062721/56649f1c5503460f94c320f5/html5/thumbnails/45.jpg)
Steps in making a Stata figure
• Sketch your figure
• Make a dummy TABLE
• Write a .do file with a graph command
• Copy and paste from the log file or the results window into the Table
• Use the Chart Wizard to create the Figure
• Format, format, format until it looks nice
![Page 46: Making Tables and Figures with Stata Biostatistics 212 Lecture 6](https://reader035.vdocuments.us/reader035/viewer/2022062721/56649f1c5503460f94c320f5/html5/thumbnails/46.jpg)
Pay attention to…
• Formatting– Make it look nice and professional, but not
gaudy• Black and white, usually
– The time-consuming part of making a figure is usually related to formatting.
![Page 47: Making Tables and Figures with Stata Biostatistics 212 Lecture 6](https://reader035.vdocuments.us/reader035/viewer/2022062721/56649f1c5503460f94c320f5/html5/thumbnails/47.jpg)
Pay attention to…
• Labeling– Your figure should be understandable by itself,
without the rest of the manuscript– All axes should be labeled.– Include important p-values
![Page 48: Making Tables and Figures with Stata Biostatistics 212 Lecture 6](https://reader035.vdocuments.us/reader035/viewer/2022062721/56649f1c5503460f94c320f5/html5/thumbnails/48.jpg)
Pay attention to…
• The Figure Legend– Title, explanations, extra p-values, etc– Separate section in manuscript or at bottom of
page – depends on journal
![Page 49: Making Tables and Figures with Stata Biostatistics 212 Lecture 6](https://reader035.vdocuments.us/reader035/viewer/2022062721/56649f1c5503460f94c320f5/html5/thumbnails/49.jpg)
Stata vs. Excel for Figures
• Excel– Flexible and intuitive point-and-click figures
• Easy to create and modify• Flexible, more options, error bars, adjusted
estimates, good for bar graphs, etc
– But…• Requires an extra step – copy/pasting to Excel• Harder to reproduce• Much harder to do scatter plots
![Page 50: Making Tables and Figures with Stata Biostatistics 212 Lecture 6](https://reader035.vdocuments.us/reader035/viewer/2022062721/56649f1c5503460f94c320f5/html5/thumbnails/50.jpg)
Stata vs. Excel for Figures
• Stata– Can create very customizable figures using 1 complex
Stata command• Easy to recreate – simple do file
• No error
• Scatter plots are MUCH easier with Stata
– But…• Harder to create the first time? - no point and click
• A little less flexible?
• Difficult to format: Graphic Editor helps address this
![Page 51: Making Tables and Figures with Stata Biostatistics 212 Lecture 6](https://reader035.vdocuments.us/reader035/viewer/2022062721/56649f1c5503460f94c320f5/html5/thumbnails/51.jpg)
Stata vs. Excel for Figures
Easiest tool
• Bar/Line* figures Excel
• Confidence intervals Excel
• Scatter plots Stata
• Box plots Stata
• Dot plots Stata
* Where x-axis is categorical
![Page 52: Making Tables and Figures with Stata Biostatistics 212 Lecture 6](https://reader035.vdocuments.us/reader035/viewer/2022062721/56649f1c5503460f94c320f5/html5/thumbnails/52.jpg)
Stata Demo
• Scatter plots: bmi vs. lipids– Iterative process of adding commands to do file– Cutting and pasting with substitution– Lowess smoother and linear fit lines– Post-graph editing
![Page 53: Making Tables and Figures with Stata Biostatistics 212 Lecture 6](https://reader035.vdocuments.us/reader035/viewer/2022062721/56649f1c5503460f94c320f5/html5/thumbnails/53.jpg)
twoway (scatter dfev1 cumpy10 if menthol1==1, msymbol(plus) msize(small) mcolor(black)) /// (scatter dfev1 cumpy10 if menthol1==0, msymbol(circle_hollow)) /// (line m cumpy10 if menthol1==1, sort clcolor(black) clpat(dash) clwidth(thick)) /// (line nm cumpy10 if menthol1==0, sort clcolor(black) clpat(solid) clwidth(thick)) /// , ytitle(Change in FEV1 (milliliters), size(large)) yscale(titlegap(5)) /// xtitle(Pack-years of exposure to tobacco, size(large)) /// xscale(titlegap(3)) /// legend(order(1 "Menthol smokers" 2 "Non-menthol smokers" 3 "Menthol regression" /// 4 "Non-menthol regression")) /// scheme(s1mono) /// graphregion(fcolor(none) lcolor(none) ifcolor(none) ilcolor(none)) /// plotregion(fcolor(none) lcolor(none) ifcolor(none) ilcolor(none))
-20
00
-10
00
01
00
02
00
0
Ch
an
ge
in
FE
V1 (
mill
ilite
rs)
0 20 40 60
Pack-years of exposure to tobacco
Menthol smokers Non-menthol smokersMenthol regression Non-menthol regression
![Page 54: Making Tables and Figures with Stata Biostatistics 212 Lecture 6](https://reader035.vdocuments.us/reader035/viewer/2022062721/56649f1c5503460f94c320f5/html5/thumbnails/54.jpg)
Key points• It’s worth putting thought into your file organization
• Tables:– First sketch it on paper– Next make your dummy table– Only then write your do file
• Figures:– Make it with Stata if you can– Use dialog boxes to get the correct syntax, paste into your do file– Make bar charts with Excel
• Document everything you do!
![Page 55: Making Tables and Figures with Stata Biostatistics 212 Lecture 6](https://reader035.vdocuments.us/reader035/viewer/2022062721/56649f1c5503460f94c320f5/html5/thumbnails/55.jpg)
Today’s Lab
• You will create 4 figures using stata.
• The focus will be on learning to use the dialog boxes to get the syntax right, and then transferring commands and options into a do file.