penn uspstf breast cancer screening: science, policy & politics j. sanford schwartz, md leon...

Penn

USPSTF Breast Cancer Screening:Science, Policy & Politics

J. Sanford Schwartz, MD

Leon Hess Professor of Medicine andHealth Management & Economics

Perelman School of Medicine &The Wharton School

University of Pennsylvania

The Good, The Bad and The Ugly

Penn

USPSTF Screening Mammography Recommendation for Women Ages 40-50

• What is the USPSTF? (mandate, membership)

• The USPSTF decision process

• Why recommendation was made (and timing)

• Data on which recommendation was generated

• The recommendation and why it changed from the previous USPSTF recommendation

• Why the recommendation generated controversy – Importance– How recommendation was communicated– Political context

• My subjective assessment of how things went, why and why topic will remain controversial

Penn

“The USPSTF recommends against routine screening mammography in women aged 40-49. The decision to start…should be an individualized one and take patient context into account, including the patient’s values regarding specific benefits and harms.”

(C recommendation)

Issued October 2009

Moderate certainty that the net benefit is small

Penn

Government appointed, independent advisory group established 1984 byCongressional mandate• Recommend preventive services that

should be incorporated routinely intoprimary care medical care populations (age, gender, risk factors)

• Identify research agenda for preventive care• 16 PCPs (IM, FP, Peds, Ob–Gyn)• Rotating 4–6 year terms• Review scientific evidence clinical preventive

services (Members with COI excluded)

United States Preventive Services Task Force

Penn

Staffed by AHRQ (staff, fellows, medical officers)

Partner organizations:Federal• CDC• NIH• VAProfessional Societies• ACP• APA• ACOG• ACFMPublic Advocacy Groups• AARP


Penn

USPSTF Methodology:A (Very) Short Primer

• Select topic (largely subjective process)• Identify interventions and outcomes of interest• Examines key questions via chain of evidence

within specified analytic framework• Systematic review of evidence (AHRQ EPC)• Assesses evidence, estimates magnitude and

certainty of benefits and harms, assigns consensus recommendation grade

• Peer review evidence report & recommendation• Draft recommendation posted on website*• Final recommendation issued• US government and Ann Intern Med reviewhttp://www.ahrq.gov/clinic/uspstf08/methods/procmanualap7.htm

Penn

USPSTF Recommendation Grade

Certainty Net Benefit

High

Moderate

Low

Magnitude of Net Benefit

Substantial Moderate SmallZero/Neg

A B C D

B B C D

Insufficient

Evidence: Convincing, Adequate, Inadequate

Penn

USPSTF Recommendation

A Strongly recommendsB RecommendsC Recommends against routinely providingD Recommends againstI insufficient evidence

• Highlights Clinical/Other Considerations

• Discussion & Recommendation of Others

Do Not Provide

Provide Routinely

Individual Risk/Benefit

Penn

Does not:

• Advise insurers

• Make health care coverage decisions

However, the Affordable Care Act of 2009 mandates that all preventive services that receive an ‘A’ or ‘B’ recommendation by the USPSTF must be covered by insurers at no cost to the beneficiary


Penn

Screening Mammography:Primary MD and Patient Questions

• Should I get mammograms?

• If so, starting at what age, and how often?

• When, if ever, should I stop?

– What is the benefit?

– What are the harms?

– How do my personal risk factors for breast cancer affect the decision?

Penn

“I have yet to see any problem,however complicated,

which … looked at it in the right way,

did not become still more complicated”

– Poul Anderson

Penn

USPSTF Breast Ca Screening:Methods of Analysis

• Meta-analysis of RCTs of screening effectiveness

• Trials rated “fair-quality” or better from 2002 review and any new trials or updates since then

• Rates and proportions calculated using primary data from Breast Cancer Surveillance Consortium

• Outcomes Table constructed to estimate magnitude of screening benefits & harms (by age)

• Natural history modeling (CISNET)

Penn

Diagnostic Test Performance

Test Result Disease State

Disease Present Disease Absent

Test Positive True Positive(TP)

False Positive(FP)

Test Negative False Negative(FN)

True Negative(TN)

Sensitivity (Se) = TP Predictive Value (PV) + = TP_

TP+FN TP+FP

Specificity (Sp) = TN Predictive Value (PV) – = TN_

TN+FP TN+FN

Penn

Lead Time

Time between disease detection by screening and time of usual symptomatic diagnosis

• Rate biological progression disease

• Screening test sensitivity

Lead time bias

Artifactual survival prolongation resulting from earlier disease detection in the absence of increased effectiveness of earlier intervention

Penn

Lead Time Bias

Penn

Length/Time Bias

Artifactual increased measured survival from selectively increased detection of less aggressive disease with better prognosis

Penn

Overestimation Screening Test Benefit:Prevalence Bias

Unrepresentative impact of detection of prevalent cases in early screening cycles

Impact incident cases increases with number subsequent cycles

Penn

Length Time Bias

Death

Death

DiseaseBegins

DiseaseBegins

Clinical symptoms

Screen detection

Screen detection

Clinical symptoms

Courtesy of Emily Conant, MD. University of Pennsylvania

Penn

Screen Detection Capability Based on Tumor Biology and Growth Rates

Penn

Overdiagnosis Bias

Overdiagnosis of a condition (pseudodisease) that would not become clinically significant in a patient’s lifetime

The disease has no affect on mortality and is the major harm of screening

Penn

New Evidence: Age 40-49 YrsAge Trial (UK 1991–1997)

Study Design

• RCT annual mammography to age 48 yrs (n=53,884) vs. “usual care” (n=106,956)

• F/U through National Health Service register

– 81% attended at least 1 screen;

– 4.5 mean rounds

– 10.7 yrs follow-up.

Results

• Breast cancer mortality: RR 0.83 (0.66-1.04)

NNI 2,512 (1,149-13,544)

• All-cause mortality: RR 0.97 (0.89-1.04)

Penn

Strengths

• Designed to determine effectiveness age 40-49

• Largest trial, community population

• Most recently conducted RCT

• Consistent with results of meta-analysis previous RCTs

Limitations

• Applicability to US not clear (recall rate 3%–5%)

• Mortality lower than expected in control group

• Only 10 yrs follow–up

• 30% attrition, contamination not reported

New Evidence: Age 40-49 YrsAge Trial (UK 1991–1997)

Penn

New Evidence: Age 40-49 Yrs:Additional F/U Gothenburg Trial

RCT, ages 39–59 yrs, Gothenburg, Sweden 1982

• Mammography q18 mo (n = 20,724) vs. “usual care” (n = 29,200)

• All offered screening at end of trial (year 5)

• 85% attended first screen; 5 mean rounds;14 yrs follow-up.

• 25-40% attrition, 20% contamination

• Results: age 40-49 yrs:Breast cancer mortality RR 0.69 (0.45-1.05)

Penn

Meta-analysis Screening RCTs:Women Ages 39 to 49 Year

Screening every 1-3 years, all “fair quality”

Penn

10–Year Risk of Death from Breast Cancer:

Beginning Routine Screening Age 40 vs. Age 50

Ages 40-49 Ages 50-59

Without screening 0.33% 0.89%

With screening 0.28% 0.69%

Absolute RR 0.05% 0.20%

Source: Steve Woloshin, Veterans Affairs Outcomes Group

Penn

Breast Cancer Surveillance Consortium Data

• Women in BCSC who had at least one prior screening mammogram within 2 years (“routine screening”)

• Screened between 2000-2005 at all 7 sites

• Data provided by age in decades beginning at 40 years (also collapsed for women 70+)

Penn

Breast Cancer Surveillance Consortium:Registry Advantages

• Represent current U.S. practice

• National multi-site sample of 8M mammograms.

• Reflects real world rather than study population (especially useful when evaluating harms)

Penn

• Cancer rates increase and false positive mammogram rates decrease with age

• Number women undergoing additional imaging and biopsy per BCa diagnosed decrease with age

• Biopsy rates are lower in younger than older women

• Cancer detection rates similar in US, UK, Europe

• Rates of false positives and recall rates in the US at least twice rates in Canada, UK, Europe

Breast Cancer Surveillance Consortium Data

Penn

Outcomes Table

Penn

Incremental Benefit of Extending Screening Age 50–69 to Age 40–69:

CISNET Models

Penn

• Subgroup analysis by age excludes data

• Trials use intention-to-treat analysis and report “number needed to invite for screening,” not those actually screened

• Trials are only “fair-quality” due to attrition (>30%) and contamination (>20%)

• Applicability questionable: only one U.S. study, >20 yrs ago, prior to current diagnostic and treatment practices

• Harms and CISNET data are for those actually screened

Meta-analysis of Screening Trials and CISNET Modeling: Limitations

Penn

Screening Mammography: Benefits

• Eight RCTS enrolling more than 600,000 women:

– Screening mammography reduces breast cancer mortality.

– Observed mortality reduction is ~ 15% (0 to 30%, with better designed trials – i.e., less biased mortality ascertainment and randomization)

• Effects on all-cause mortality are unknown.

Penn

Screening Mammography: Harms

• Overdiagnosis (screen detection and subsequent treatment of breast cancer that never would have surfaced clinically)– Extent of overdiagnosis difficult to estimate,

requiring life-long f/u of screened and unscreened cohorts

– Best estimate 2%-10%, with higher estimates in more rigorous studies (i.e., up to 18% of screen detected breast cancers would never surface clinically)

• False positive mammograms resulting in unnecessary biopsies, anxiety and expense

Penn

USPSTF Screening Mammography:Benefits vs. Harms Beginning Age 40 vs. Age 50

Benefit Harms

Magnitude Very LargeVery small – moderate

FrequencyRare Very common

< 1:1,000 10–50/1,000

Timing Late Early

Penn

http://images.google.com/imgres?imgurl=http://therivalryrun.com/michigan/images/AmerCancer.jpg&imgrefurl=http://therivalryrun.com/michigan/donate.html&usg=__-0MdrL-GGX6jkz4FlyNQ5AR3wME=&h=414&w=637&sz=48&hl=en&start=2&sig2=Th2zWV1pIP6f6WjvIZhzJA&tbnid=ymgsmHi3p0g4DM:&tbnh=89&tbnw=137&prev=/images?q=american+cancer+society&gbv=2&hl=en&safe=active&ei=aPxRS9-RG5PsNZmO-I8J

http://images.google.com/imgres?imgurl=http://sacrosanctgospel.files.wordpress.com/2008/06/button_enemies.jpg&imgrefurl=http://sacrosanctgospel.wordpress.com/2008/06/06/the-love-of-christ-and-our-enemies-matthew-538-44/&usg=__RD0-X9w8BFnUMiWLzZ-DwB4tnl4=&h=382&w=386&sz=27&hl=en&start=1&sig2=c9y4rAMhTHJl5C-mYG1-vw&um=1&tbnid=XSQkP72zkHlV0M:&tbnh=122&tbnw=123&prev=/images?q=enemies&hl=en&safe=active&sa=G&um=1&ei=QgdSS4eQB5C2NO_-tIoJ

Penn

http://images.google.com/imgres?imgurl=http://informedmedicaldecisions.org/images/large_gary.jpg&imgrefurl=http://informedmedicaldecisions.org/ask_schwitzer.html&usg=__KACU_9woHnRutnSTh_xPHEApvjc=&h=556&w=544&sz=74&hl=en&start=4&sig2=O3ZK9oM8Z5FqUK-JOPODfw&um=1&tbnid=qo4YKl2_zZEjiM:&tbnh=133&tbnw=130&prev=/images?q=gary+schwitzer&hl=en&safe=active&um=1&ei=rABSS7yHAY7WNdSXyM4D

http://images.google.com/imgres?imgurl=http://mitracreative.com/blog/wp-content/uploads/2009/12/nbc-logo.gif&imgrefurl=http://www.mitracreative.com/blog/tag/online/&usg=__zE35klU94Jby9I0U2ThOzwrM3dE=&h=432&w=432&sz=30&hl=en&start=1&sig2=Rbn_epVwIDa3pms8xFudiQ&um=1&tbnid=DCCthAefLnWJTM:&tbnh=126&tbnw=126&prev=/images?q=NBC&hl=en&safe=active&um=1&ei=zwRSS7zIOJ6ONdfZjYgJ

http://www.aafp.org/online/en/home.html

http://images.google.com/imgres?imgurl=http://blogs.drgreene.com/perspectives/wp-content/uploads/image/new_york_times_logo.gif&imgrefurl=http://blogs.drgreene.com/perspectives/2009/02/23/clothes-and-health/&usg=__39DSCH101OZi6uk-_trZQlLBGO0=&h=313&w=316&sz=7&hl=en&start=17&sig2=EFtu9401jMeXvXidtO6kQw&um=1&tbnid=CgNmNxBFeXMEAM:&tbnh=116&tbnw=117&prev=/images?q=new+york+times&hl=en&safe=active&um=1&ei=GABSS9jFEor2Nej3oJYJ

Penn

Summary of Evidence

• Primary evidence is not changing (and likely will not change, given no active prospective trials)

• Interpretation of evidence is changing, but slowly (as usual)– Benefits are modest– Consensus benefits of mammography

outweigh harms in women ages 50–69– Disagreement RE: frequency (annual vs.

biennial)– Disagreement RE: screening ages 40–49– Disagreement RE: when to discontinue

screening (age 74; age 79; never)

Penn

Evidence Limitations:Why there is so much disagreement

Data limitations

• RCTs comparing start ages 40 vs. 50 inadequate power and f/u duration

• No RCTs RE: screening frequency (and unlikely to be conducted)

Cultural limitations

• Harms difficult for many people to grasp

• Bias toward inherent belief in earlier detection, regardless of impact on outcome

• Misinformation (incidence, prevalence, benefits, harms)

Penn

Evidence Limitations:Why there is so much disagreement

Evidence based medicine is not value free:

• Harms and benefits involve comparison of dissimilar outcomes

• Subjective expertise – just locus of control shifted from physician to methodologist

Penn

"What we've got hereis a failure to communicate”

Paul NewmanCool Hand Luke, 1967

Penn

The decision to start…should be an individualized one and take patient context into account, including the patient’s values regarding specific benefits and harms.”

(C recommendation)

Moderate certainty that the net benefit is small

Penn

“In the midst of every challengelies opportunity”

-Albert Einstein

penn uspstf breast cancer screening: science, policy & politics j. sanford schwartz, md leon...

Documents

b recommendation

c recommendation

generatedthe recommendation

chain of evidence

controversialpennthe

preventive care16 pcps

affordable care act

routine screening mammography