hr cvd final hcl 2011

68
Health Consultation HUDSON RIVER PCBs RESIDENTIAL PROXIMITY TO THE HUDSON RIVER AND HOSPITALIZATION RATES FOR ISCHEMIC HEART DISEASE AND STROKE: 1990-2005 Westchester, Rockland, Putnam, Orange, Dutchess, Ulster, Columbia, Greene, Rensselaer, Albany, Washington and Saratoga Counties, New York EPA FACILITY ID: NYD980763841 Prepared by: The New York State Department of Health JUNE 16, 2011 Prepared under a Cooperative Agreement with the U.S. DEPARTMENT OF HEALTH AND HUMAN SERVICES Agency for Toxic Substances and Disease Registry Division of Health Assessment and Consultation Atlanta, Georgia 30333

Upload: others

Post on 18-Dec-2021

2 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: HR CVD Final HCl 2011

Health Consultation HUDSON RIVER PCBs

RESIDENTIAL PROXIMITY TO THE HUDSON RIVER AND HOSPITALIZATION RATES FOR

ISCHEMIC HEART DISEASE AND STROKE 1990-2005

Westchester Rockland Putnam Orange Dutchess Ulster Columbia Greene Rensselaer Albany Washington and Saratoga Counties

New York

EPA FACILITY ID NYD980763841

Prepared by The New York State Department of Health

JUNE 16 2011

Prepared under a Cooperative Agreement with the US DEPARTMENT OF HEALTH AND HUMAN SERVICES

Agency for Toxic Substances and Disease Registry Division of Health Assessment and Consultation

Atlanta Georgia 30333

Health Consultation A Note of Explanation

A health consultation is a verbal or written response from ATSDR or ATSDRrsquos Cooperative Agreement Partners to a specific request for information about health risks related to a specific site a chemical release or the presence of hazardous material In order to prevent or mitigate exposures a consultation may lead to specific actions such as restricting use of or replacing water supplies intensifying environmental sampling restricting site access or removing the contaminated material

In addition consultations may recommend additional public health actions such as conducting health surveillance activities to evaluate exposure or trends in adverse health outcomes conducting biological indicators of exposure studies to assess exposure and providing health education for health care providers and community members This concludes the health consultation process for this site unless additional information is obtained by ATSDR or ATSDRrsquos Cooperative Agreement Partner which in the Agencyrsquos opinion indicates a need to revise or append the conclusions previously issued

You May Contact ATSDR Toll Free at 1-800-CDC-INFO

or Visit our Home Page at httpwwwatsdrcdcgov

HEALTH CONSULTATION

HUDSON RIVER PCBs

RESIDENTIAL PROXIMITY TO THE HUDSON RIVER AND HOSPITALIZATION RATES FOR

ISCHEMIC HEART DISEASE AND STROKE 1990-2005

Westchester Rockland Putnam Orange Dutchess Ulster Columbia Greene Rensselaer Albany Washington and Saratoga Counties

New York

EPA FACILITY ID NYD980763841

Prepared By

The New York State Department of Health Center for Environmental Health

Troy New York Under cooperative agreement with

The US Department of Health and Human Services Agency for Toxic Substances and Disease Registry

Division of Health Assessment and Consultation Atlanta Georgia

For additional information about this document you may contact the

New York State Department of Health Center for Environmental Health

547 River Street Flanigan Square Room 300 Troy New York 12180-2216

1-518-402-7950 or

1-800-458-1158

E-mail beoehealthstatenyus

Table of Contents

Executive Summary helliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphellipii

1 Introduction helliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphellip 1

2 Background helliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphellip 3 Cardiovascular Disease Hudson River PCB Contamination

` POPs and PCBs PCBs and Cardiovascular Disease Health Outcomes

3 Follow-up Study Data Sources and Methodshelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphellip8 Study Area and Exposure Areas

Hospitalization Data Statistical Analysis

4 Results helliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphellip 13 Demographics Associations with Proximity to the River

Statistical Analysis Proximity to the River and CVD Hospitalization Risk Summary of Results

5 Discussion helliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphellip 19

6 Conclusion and Recommendations helliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphellip25

7 Agency Information (Preparers of the Report) helliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphellip 27

8 References helliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphellip 28

9 Tables helliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphellip 33

10 Figures helliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphellip 44

11 Appendices helliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphellip 48

EXECUTIVE SUMMARY

Background In October 2006 Member of Congress Maurice Hinchey requested that ATSDR conduct a health consultation to assess cardiovascular disease among people residing near the Hudson River A 200-mile segment of the Hudson River from Hudson Falls south to New York City is a National Priorities List (NPL) (Superfund) site due to contamination of the river with polychlorinated biphenyls (PCBs) from GE sites in Hudson Falls and Ft Edward In support of his request Member of Congress Hinchey cited three studies of cardiovascular hospitalizations conducted with similar data and methods These 2004-2005 statewide studies suggest residents of ZIP codes containing hazardous waste sites with persistent organic pollutants (POPs) such as PCBs have elevated risk for hospitalizations for cardiovascular disease including ischemic heart disease stroke and hypertension Analyses conducted separately for ZIP codes next to the Hudson River suggest residents of these specific ZIP codes also have increased rates of hospitalization for cardiovascular disease

In response to Congressman Hincheyrsquos request for follow-up of the 2004-2005 findings for the Hudson River area ATSDR and NYS DOH agreed to conduct a similar group-level study but at a different level of geography the Census block group By using a level of geography that is generally smaller in area and population than a ZIP code this follow-up seeks to more accurately adjust for socio-economic and other factors that affect cardiovascular hospitalizations and to improve upon the geographic classification of relative distance to the Hudson River The follow-up investigationrsquos analyses are restricted to the twelve counties abutting the Hudson River from Washington County south to Westchester County Within the study arearsquos twelve counties cardiovascular hospitalization rates in areas closest to the Hudson River are compared with rates in areas farther from the river

Objectives The follow-up investigationrsquos specific objectives are to increase the studyrsquos ability to control for confounding by socio-economic status through the

use of block groups rather than ZIP codes as the unit of analysis compare hospitalization rates among residents of block groups nearest the Hudson River to

those among residents of block groups farther from the Hudson River and compare the findings of this review to those of the earlier studies

Methods This study examines CVD hospitalization rates and residence near the Hudson River in the 12 counties abutting the Hudson River below Hudson Falls from 1990 through 2005 The hospitalization data for ischemic heart disease and stroke were acquired from the NYS DOH Statewide Planning and Research Cooperative System (SPARCS) established in 1979 to collect detailed records on discharges from hospitals located in New York State Statewide US Census data at the block group level from 1990 and 2000 were used to calculate the number of people in each race sex and age group for the study years

Classification of the population into potential exposure categories was based exclusively on location relative to the Hudson River Two types of exposure proxy categories were developed block group adjacency and block group centroid distance to the river shore For the adjacency categories the hospitalizations were assigned to the adjacent category if the residential address was in a block group directly abutting the Hudson River For the distance categories block

ii

group centroid (population-weighted) distance to the river shore was used to assign hospitalizations in each block group to one of three categories less than one-half mile from the Hudson River (closest) between one-half to one mile (close) from the river and farther than one mile from the river (distant) Multivariable regression analyses were conducted to compare rates of hospitalizations among the exposure categories assigned at the block group level These analyses controlled for age sex and race using individual-level data and population density percent Hispanic median income education and distance to the nearest hospital at the block group level

Results The use of block groups resulted in 2371 block groups as the units of analysis for the study rather than the 368 ZIP codes in the 12-County study area The average population per block group in the study area is 1200 compared to average population of 8000 per ZIP code The smaller geographical area and smaller populations in block groups contributed to improving the accuracy of the assignment of income and other group-level demographics to the hospitalizations included in the study and enhancing control for confounding by socio-economic status (Tables 8a and 8b)

The studyrsquos multivariable analyses showed relatively small but statistically significant elevations of CVD hospitalization risk for residents of block groups categorized as adjacent in close and closest proximity to the Hudson River There is no evidence that residents of the closest block groups (less than frac12 mile) had higher hospitalization rates than residents of the close block groups (between frac12 to 1 mile) If there were a strong association between exposures from living near the river and the risk of CVD hospitalization such a ldquodose-responserdquo might have been shown by the analyses

The data showed strong statistically significant associations between CVD risk and socioshyeconomic status To enhance the control for socio-economic status separate analyses for each income quartile were conducted These stratified analyses showed that residence in block groups near (close closest or adjacent to) the Hudson River was associated with increased risk for CVD hospitalizations among residents of lower income block groups Conversely residence in block groups near the Hudson River was associated with decreased risk for CVD hospitalizations among residents of higher income block groups In other words in analyses that held income constant by looking only within the lower or higher income populations living near the river shows opposite effects depending upon which income group is considered The estimated increased and decreased risks are of sufficient magnitude to cancel each other out when average rate ratios for the four income quartiles are calculated The average rate ratios can be interpreted as showing no evidence that hospitalizations among the general population are elevated for residents living near the river

This type of group-level statistical analysis is not able to precisely identify the reason for these differing results However the strength and direction of associations among variables in the statistical models can be evaluated in light of known risk factors to suggest likely explanations In this case socio-economic status appears to be the most likely explanation for the findings The strong association between socio-economic status and residential location (the indicator of potential exposure) and between socio-economic status and CVD hospitalization risk makes it likely that the small but statistically significant elevations shown in the multivariate models

iii

(Tables 8a amp 8b) are the result of the inability to completely control for socio-economic statusrsquo effects on CVD hospitalization risk The strong effect of socio-economic status on residential location (exposure proxy) and on CVD hospitalization risk makes it likely that these overall multivariable regression results are affected by uncontrolled ie residual confounding by socioshyeconomic status

Because of the similarities in study design and statistical analysis the prior studyrsquos findings can be compared to some degree with the findings of the multivariate regression analyses in the current study The major difference between the studies is the composition of the comparison areas for the analyses Because of this difference as well as other differences described in the report the studies are not entirely comparable

CVD hospitalization rate ratios estimated for living in block groups adjacent closest or close to the river in the current study are substantially lower than the rate ratios for residence in ZIP codes adjacent to the Hudson River in the previously published studies In most cases the current analyses show elevations below 10 in contrast to elevations of 20 to 39 in the prior studies The results of the current study are quite similar to the prior study results in terms of the associations reported for age sex race and income These similarities and the general methodological similarities between the current and prior studies suggest the likelihood that the prior studiesrsquo findings also were affected by confounding by socio-economic status

Conclusions and Recommendations While this type of study can neither prove nor disprove causal links among risk factors and health effects it is useful for examining evidence for unusual patterns that might warrant additional investigation This evaluation of hospitalization data residential location and socio-economic factors in block groups in the 12 Counties abutting the Hudson River revealed a closely coinciding pattern of lower socio-economic status and residential proximity to the Hudson River the proxy for potential exposure and a strong association of lower socio-economic status and risk for CVD hospitalizations The strong associations among residential location socio-economic status and CVD risk prevent us from drawing definitive conclusions from the multivariable regression analyses that showed modest but statistically significant elevations of CVD hospitalization risk for residents of block groups near the river

To draw more definitive conclusions particularly for outcomes such as CVD that are determined by multiple known risk factors studies seeking to investigate the role of PCB or POP exposures will require resource-intensive methods that include gathering individual-level exposure or biomonitoring information as well as medical histories and information on other risk factors

This current investigation did not include environmental or biological sampling data One study did gather such data in the Hudson River region with the highest potential exposures (Hudson Falls Fort Edward) This study detected a statistically significant reduction in the total PCB concentrations in outdoor air samples at a distance of 1200 meters (34 mile) from the river However the PCB levels in outdoor air in this area were lower than those in other communities with known PCB-contaminated sites and similar to levels reported in other locations in the northeastern United States The study found no detectable differences in study subjectsrsquo serum

iv

PCB levels associated with proximity to contaminated portions of the river or other PCB sources (Fitzgerald 2007 Palmer 2008)

It is recommended that the results of the current investigation of CVD hospitalization rates and residence near the Hudson River be shared with NYS residents and other stakeholders with interest in this issue and be submitted for publication in a peer-reviewed journal

v

1 INTRODUCTION

The New York State Department of Health (NYS DOH) has a cooperative agreement with the Agency for Toxic Substances and Disease Registry (ATSDR) to perform environmental exposure and health assessments conduct health statistics reviews and perform epidemiological studies of populations in New York State which may have been exposed to environmental contaminants In October 2006 Member of Congress Maurice Hinchey requested that ATSDR conduct a health consultation to assess cardiovascular disease among people residing near the Hudson River A 200-mile segment of the Hudson River from Hudson Falls south to New York City is a National Priority List (NPL) (Superfund) site due to contamination of the river with polychlorinated biphenyls (PCBs) from GE sites in Hudson Falls and Ft Edward (See Appendix A for Member of Congress Hincheyrsquos letter)

In support of his request Member of Congress Hinchey cited three studies (Sergeev and Carpenter 2005 [ischemic heart disease] Shcherbatykh et al 2005 [stroke] Huang et al 2006 [hypertension]) These statewide studies suggest residents of ZIP codes containing hazardous waste sites with persistent organic pollutants (POPs) such as PCBs have elevated risk for hospitalizations for cardiovascular disease including ischemic heart disease stroke and hypertension Analyses conducted separately for ZIP codes next to the Hudson River suggest residents of these specific ZIP codes also have increased rates of hospitalization for cardiovascular disease

These studies published in 2005-2006 are ecological studies In an ecological study data are analyzed by evaluating risk factors and health outcomes among groups For example rates of disease are compared for populations of ZIP codes with varying median income levels to see if income levels appear to be related to disease outcome rates In these previously conducted studies rates of hospitalization for residents of ZIP codes with hazardous waste sites containing POPs were compared with hospitalization rates for residents of ZIP codes without such waste sites to look for evidence that potential exposures might be impacting health outcomes The studies report statistically significant elevations of cardiovascular disease hospitalizations in ZIP codes with POPs For coronary heart disease and a subset acute myocardial infarction the study showed elevations of approximately 15 and 20 respectively (Sergeev and Carpenter 2005) for stroke a 15 elevation (Shcherbatykh et al 2005) and for hypertension a 19 elevation (Huang et al 2006) in ZIP codes containing waste sites with POPs compared with hospitalization rates in other New York State ZIP codes (excluding NYC) that did not contain waste sites with POPs (Table 1)

In sub-analyses in these studies cardiovascular hospitalization rates for ZIP codes adjacent to the Hudson River were compared with cardiovascular hospitalization rates for ZIP codes in the rest of the state (excluding NYC) identified as not having inactive hazardous waste sites with POPs These sub-analyses showed statistically significantly elevated hospitalization rates of 36 and 39 for coronary heart disease and the subset acute myocardial infarction a 20 elevation for stroke and a 14 elevation for hypertension (Table 1) The Sergeev and Carpenter Shcherbatykh et al and Huang et al articles state that the Hudson River ZIP codes are in an area where people have higher income smoke less exercise more and have healthier diets This information is cited in these articles as evidence that known risk factors related to lower socioshy

1

economic status are likely not the cause for the elevated hospitalization rates in these Hudson River area ZIP codes

In these previously published reports Behavioral Risk Factor Surveillance System (BRFSS) data for entire counties are provided to support statements about the population of Hudson River ZIP codes having higher income and healthier lifestyles However because the ZIP codes adjacent to the Hudson River include only a proportion of the population of the counties abutting the river these county-level BRFSS data may be misleading regarding characteristics of the ZIP codes adjacent to the river (Figure 1) In fact data from the US Census show that the ZIP codes adjacent to the river have lower median income than the counties within which they are located The median income for the total population of the twelve counties adjacent to the Hudson River was $61010 in 2000 while the median income for the ZIP codes adjacent to the river was $45356 This median income for the population of the ZIP codes adjacent to the river is also slightly lower than the statewide (excluding NYC) median income of $47735 (US Census 2000) Thus the Hudson River ZIP codes analyzed in the Sergeev and Carpenter Shcherbatykh et al and Huang et al studies include a population of lower not higher socio-economic status compared to the rest of NYS (excluding NYC)

Because CVD and many CVD risk factors are associated with lower socio-economic status it is especially important to adjust as much as possible for socio-economic status when conducting comparative analyses of CVD hospitalizations or disease incidence or prevalence Regarding the hypothesis about POPs and CVD research studies described in more detail below have suggested possible biological mechanisms for POP exposuresrsquo effects on the heart There is very little research on human exposures to POPs and potential cardiovascular effects however

In response to Congressman Hincheyrsquos request for follow-up of the 2004-2005 findings ATSDR and NYS DOH agreed to conduct a similar study also ecological but at a different level of geography the Census block group By using a level of geography that is generally smaller in area and population than a ZIP code this follow-up seeks to more accurately adjust for socioshyeconomic and other factors that affect cardiovascular hospitalizations and to improve upon the geographic classification of relative distance to the Hudson River In addition rather than comparing hospitalizations among residents of Hudson River area ZIP codes to hospitalizations among residents of all other NYS (exclusive of NYC) ZIP codes with no waste sites with POPs this follow-up investigationrsquos analyses are restricted to the twelve counties abutting the Hudson river from Washington County south to Westchester County Within the study arearsquos twelve counties cardiovascular hospitalization rates in areas closest to the Hudson River are compared with rates in areas farther from the river This more localized approach may also assist with reducing the potential impact of regional differences in medical care practice or reporting factors that could affect the hospitalization data (Ko et al 2007) (Table 2 shows a more detailed listing of the methods used in the prior studies and follow-up study)

Regarding environmental exposures it is important to note that living near a hazardous waste site that contains PCBs or other POPs does not necessarily mean that human exposure from the site has occurred Depending upon the conditions at a site or the specific chemicals exposures may be highly improbable Information on body burden for example PCB levels in blood or on levels of the chemical in off-site air dust or soil is needed to support a claim of exposure The

2

studies published in 2005-2006 did not provide specific information such as environmental or biomonitoring data in support of the assumption that residence in ZIP codes containing hazardous waste sites with POPs or residence in ZIP codes adjacent to the Hudson River resulted in increased exposures to POPs in the population For the purpose of this follow-up we did not conduct environmental or human biological sampling to address this issue Rather this follow-up replicates the prior studiesrsquo approach with some enhancements to better control for confounding particularly confounding by socio-economic status

This descriptive epidemiologic investigation uses already existing hospitalization data from the Statewide Planning and Research Cooperative System and population data from the United States Census to evaluate comparative rates of cardiovascular hospitalizations While this type of investigation can not show proof of cause and effect it can assist in interpreting relationships among indicators of CVD burden and hypothesized exposures associated with distance from the Hudson River This descriptive investigation may also assist in determining whether additional health investigations of these associations are warranted

2 BACKGROUND

Cardiovascular Disease

The widely known risk factors for cardiovascular disease (CVD) in addition to age sex race and family history include tobacco use high blood pressure high serum cholesterol alcohol use obesity low fruit and vegetable intake physical inactivity diabetes low socio-economic status mental ill-health psychosocial stress and the use of certain medications (WHO 2010 NYS DOH 2009 Wing 1988 NYS DOH 2009a)

CVD Definitions CVD is a general term for any disease of the circulatory system The two main types of CVD are ischemic heart disease also termed coronary heart disease and cerebrovascular disease also called stroke Ischemic heart disease (IHD) results from restricted blood flow through the arteries supplying the heart muscle The most serious risk from IHD is acute myocardial infarction (AMI) another term for heart attack AMI occurs when blood flow to the heart is suddenly reduced or stopped usually due to a blockage in a coronary artery Angina is severe constricting pain in the chest often radiating from the area immediately over the heart to the left shoulder and down the arm Cerebrovascular disease refers primarily to stroke the sudden impairment of brain function resulting from interruption of circulation to the brain Ischemic stroke results from blockage of an artery supplying blood to the brain Hemorrhagic stroke occurs due to the escape of blood from a ruptured artery supplying blood to the brain

CVD Burden and Trends CVD is the leading cause of death for men and women of all ages in the United States and in New York State In 1999 for every death from IHD there were five hospitalizations for ischemic disease in NYS (NYS DOH 2009a) Mortality rates from CVD have been declining steadily since the 1980rsquos Research studies estimate that approximately half of the decline is due to improved treatments for CVD and its risk factors such as high cholesterol and high blood pressure and approximately half is due to decreasing incidence of CVD The decrease in incidence is due to trends in health behaviors primarily reduced smoking rates

3

CVD mortality rates do not appear to be declining for all segments of the population however and may actually be increasing among some groups particularly people of lower socio-economic status (Barnett et al 1999) This may be due in part to higher rates of smoking among people of lower socio-economic status as well as to other factors related to socio-economic status and race A growing literature points to multiple causes for CVD and other health disparities including health practices psychosocial stress limited resources discrimination and access to health care (Health United States 2007) Increases in the prevalence of diabetes and obesity in the general population partially offset advances in treatment and favorable trends for other risk factors An increasing ischemic heart disease mortality rate in women ages 35 to 54 since 1997 is attributed to these unfavorable risk factor trends (Ford 2007 Lloyd-Jones 2009) New York State (NYS) and other northeast states have high IHD and low stroke mortality with NYS having the highest IHD mortality and lowest stroke mortality among the 50 states (Howard et al 2009)

CVD Data Sources CVD mortality rates are more easily measured than incidence prevalence or hospitalizations because mortality data from vital records are a comprehensive and relatively accessible data source Information for estimating the incidence and prevalence of CVD outcomes comes primarily from health interview physical examination and record surveys Hospitalization data are available for NYS and the US but are generally collected and analyzed in order to track trends in hospital care and resources expended Actual CVD incidence data validated by physician diagnosis physician examination or reviews of medical records are not directly available for the entire US or for specific geographic areas of NYS Such information is available from population-based cohort studies that follow people over time in selected communities While the concern underlying this research project is to understand CVD levels in the community ie incidence and prevalence we do not have data that directly provide us with this information In the absence of such data hospitalizations are frequently used as indicators of the burden of disease in communities particularly in surveillance or hypothesis-generating investigations such as this one In NYS the Behavioral Risk Factor Surveillance System (BRFSS) an ongoing monthly telephone survey of adults aged 18 years and older provides information about the burden of CVD by age sex race income and education

CVD Risk Factor Data The BRFSS data for NYS show that over 20 of the population aged 65 and older reported having been diagnosed with CVD versus less than five percent of the population under 65 Males were slightly more likely to report having CVD than females Nonshyhispanic whites compared to other ethnic groups combined were slightly more likely to report having CVD None of these slight differences were statistically significant however The only statistically significant demographic risk factors in the BRFSS data were low income and education levels Statistically significant elevations of reported CVD were observed for respondents with incomes below $25000 and respondents with a high school degree or less education (NYS DOH 2009a)

Another factor that can affect the outcome measure hospitalizations is the likelihood of mortality prior to reaching the hospital Studies show a variety of factors affect whether an individual experiencing a heart attack or stroke outside the hospital survives to be counted in hospitalization data These factors include lack of knowledge about personal risk or prior disease socio-economic status race marital status and distance to the hospital (Galea 2007

4

Barnett 2006 Ayala 2003) Population-based studies report sudden cardiac death usually outside the hospital to be the first overt manifestation of heart disease for 40 to 60 of all cases ( Kannel 1987 deVreede-Swagemakers 1997 UK Heart Attack Study 1998) An analysis of out-of-hospital mortality from AMI in Pennsylvania in 1998 showed that in relatively rural areas where travel time to the hospital was 25 minutes or longer 72 of all AMI deaths occurred outside the hospital compared with a statewide average of 49 of AMI deaths occurring outside of the hospital (OrsquoNeill 2003) In New York State 55 of IHD deaths occur outside the hospital compared to 35 of stroke deaths (NYS DOH 2009b)

Hudson River PCB Contamination

During an approximate 30-year period ending in 1977 the General Electric (GE) plants in Hudson Falls and Fort Edward used PCBs in the manufacture of electrical capacitors and discharged large quantities of PCBs into the upper Hudson River Estimates of the total direct discharges to the river are as high as 1330000 pounds In 1983 the US EPA classified a 200shymile section of the Hudson River from Hudson Falls to New York City as a National Priority List site making it one of the largest Superfund sites in the United States In 2002 the US EPA issued a Record of Decision calling for targeted environmental dredging and removal of PCB-contaminated sediment in the upper River (USEPA 2002) The dredging began in May 2009

The Superfund environmental investigations and planned remedial actions have focused on the areas of the river near Hudson Falls and Fort Edward where the highest levels of PCBs in sediments are located The 2002 Record of Decision provides information about the human health risk assessment conducted for the Upper and Mid-Hudson and the ecological risk assessment conducted for the Upper and Lower Hudson River (USEPA 2002) Fish contamination and fish consumption are the primary concerns at the site

In 1976 fishing was banned in the upper river and in 1995 a catch and release policy replaced the ban NYS DOH continues to recommend people eat no fish from the Upper Hudson River that children under age 15 and women of childbearing age eat no fish from the entire 200 mile stretch of river below Hudson Falls and that the general population eat none of most species of fish caught between the federal dam at Troy and Catskill approximately 40 miles south of Troy The Lower Hudson remains closed to commercial fishing for striped bass and eight other species

The fishing bans and advisories were issued because ingestion of fish is the major potential pathway for exposure for the general population Many previous studies have shown relationships between consumption of fish from contaminated waters and human PCB levels Another potential pathway for exposure less frequently studied to date is inhalation of PCBs NYS DOH and ATSDR launched ldquoThe Hudson River Communities Projectrdquo in 2000 to address this issue (More information on this project is provided below)

POPS and PCBs

The category ldquopersistent organic pollutantsrdquo (POPs) includes man-made chemicals that are by definition persistent in the environment Examples of POPs are polychlorinated biphenyls

5

(PCBs) polychlorinated dibenzo-p-dioxins (PCDDs) polychorinated dibenso-furans (PCDFs) and pprsquo-diphenyldichloroethene (DDE) a breakdown product of the pesticide dichlorodiphenyltrichloroethane (DDT) These chemicals are stored in human fat tissues accumulating over an individualrsquos lifetime They also accumulate in fish and other species thereby concentrating up the food chain Levels of most POPs measured in blood serum of Americans have decreased in the last few decades An exception is polybrominated diphenyl ethers (PBDEs) levels of which have been increasing in recent years While levels of many POPs are declining a large number continue to be detectable in blood for most of the US population (USDHHS 2009)

This follow-up investigation focuses on the Hudson River where the POPs of concern are PCBs PCBs are a group of synthetic chlorinated compounds manufactured in the US through 1977 Production was halted due to concerns about the persistence of PCBs in the environment and the toxicity of PCBs shown in animal models PCBs were used primarily by the electrical industry for capacitors and transformers and were also used in hydraulic fluids fluorescent light fixtures flame retardants inks adhesives carbonless copy paper paints pesticide extenders plasticizers and wire insulators PCBs can be oily liquids or solids and some are volatile and can exist as vapor in air PCBs in the environment are generally mixtures of the many different types known as congeners (ATSDR 2000)

The variety of types of PCBs adds to the complexity of studying adverse animal and human health effects Increasingly PCB research focuses on specific PCB congeners and groups of congeners because specific types of congeners appear to have different mechanisms of action and toxicity Dioxin-like PCBs (DLPCBs) are an important subset of PCBs due to concerns about their toxicity The focus on dioxin-like PCBs is paralleled by increasing research on the group of POPs with dioxin-like properties This type of PCB and POP research assigns toxic potency values relative to dioxin to various PCBs and POPs to provide an overall toxic equivalency value to specific PCB and other POPs

From 2002 through 2007 NYS DOH and SUNY Albany School of Public Health researchers conducted the Hudson River Communities Study among residents of Hudson Falls and Fort Edward the villages where the PCB discharges from two GE electrical capacitor plants occurred The comparison community was upriver from the discharges in Glens Falls This studyrsquos evaluation of exposures included measuring PCBs in outdoor air and in blood serum The study found that PCB levels in outdoor air were statistically significantly higher in the Hudson FallsFort Edward area (072 nanograms per cubic meter) than in the upriver Glens Falls area (040 nanograms per cubic meter) but that the PCB concentrations in the Hudson FallsFort Edward area were lower than those in other communities with known PCB-contaminated sites and similar to levels reported in other locations in the northeastern United States A statistically significant reduction in the total PCB concentrations in outdoor air samples was observed at a distance of 1200 meters (34 mile) from the river from 0760 ngm3 to 0497 ngm3 (Fitzgerald et al 2007 Palmer et al 2008)

The average PCB concentrations in serum for the study and comparison populations did not differ significantly (307 ppb wet weight versus 323 ppb) The average toxic equivalency quantities (TEQ) for dioxin-like PCB congeners for the study and comparison area populations

6

also did not differ In addition the study found no detectable differences in study subjectsrsquo serum PCB levels associated with proximity to contaminated portions of the river or other PCB sources The study did find that lifetime consumption of Hudson River fish was associated with higher serum PCB levels (Fitzgerald 2007 Palmer 2008) Another recent biomonitoring study of Brooklyn Manhattan and New Jersey fisherman who consumed versus those who did not consume their catch from the lower Hudson River and New YorkNew Jersey harbor suggests eating fish from these waters is not associated with an increased body burden of PCBs or most other organochlorines This study did not address the inhalation pathway (Morland 2008)

PCBs and Cardiovascular Disease Outcomes

Research studies on adverse health effects associated with PCB exposure have focused on many types of outcomes organs and biological systems including cancer liver reproductive and developmental effects thyroid endocrine immune system and neurological effects as well as specific effects on the skin and eyes Relatively little research has focused to date on associations between PCB exposures and CVD outcomes The majority of human studies have assessed health effects in occupational groups having relatively high-level exposures among people consuming fish from contaminated waters or people exposed to relatively high levels due to accidental contamination of food (USDHHS NIH 2010)

PCBs have been shown to produce a variety of toxic effects in animals These effects include cancer allergies hypersensitivity damage to the central and peripheral nervous systems reproductive disorders and disruption of the immune system (ATSDR 2000) In contrast research to date does not tell us whether specific types of human health effects have occurred from the low-level exposures experienced by the general population Some human epidemiological studies of relatively low levels of PCDDs PCDFs and PCBs suggest possible relationships not only with neurodevelopment of infants but also diabetes and a condition called insulin resistance conditions that increase CVD risk (Longnecker 2001 Carpenter 2006 Arisawa 2005)

In 1998 a study of a population in Seveso Italy exposed to relatively pure TCDD (dioxin) from an industrial accident showed increased mortality from CVD respiratory diseases and diabetes (Bertazzi 1998) These findings suggested the possibility that other dioxin-like chemicals such as dioxin-like PCBs might increase risk for these diseases Some more recent animal studies described below suggest possible mechanisms of action for PCB effects that increase risk for CVD diabetes and hypertension Another relevant source of information is from research using the biological monitoring data from the National Health and Nutrition Examination Survey (NHANES) project to look at the association between PCB levels in blood and self-reported diabetes and heart disease The body of evidence based on analytical human epidemiological studies remains relatively sparse however

Some recent studies using NHANES biomonitoring data show associations between PCB levels and CVD (Lee 2006 Lee 2007 Ha 2007 Everett 2008) These studies are cross-sectional (comparisons at one point in time) evaluations of PCB levels measured in blood samples from individuals and self-reported CVD and diabetes a risk factor for CVD While these studies use individual-level biomonitoring and self-reported health outcome data they are considered to be

7

relatively weak in terms of providing evidence of causality because both exposure and outcome are measured at the same point in time Given the nature of NHANES data collection and reporting practices these studies are not able to control for possible geographical confounding which may occur if urban versus rural residence is associated with higher PCB levels and higher risk for CVD for example The higher PCB levels among African Americans in these data suggest urban residence may be playing a role in the PCB data

The results of animal studies suggest multiple mechanisms by which PCB exposures could affect cardiovascular risk as well as multiple types of adverse health outcomes One hypothesized mechanisms of action specifically for dioxin-like PCBs is that they may have anti-estrogenic effects which impact CVD risk factors such as serum cholesterol levels and blood pressure (Lind 2004) Several studies suggest PCBs interact with a specific receptor (AhR) and increase cellular oxidative stress causing inflammation cell injury and cardiovascular dysfunction (Hennig 2002) Similarly some studies suggest PCB-induced inflammation of fat cells leads to obesity and increased risk for coronary artery disease (atherosclerosis) (Arsenescu 2008)

3 FOLLOW-UP STUDY DATA SOURCES AND METHODS

This follow-up investigation will provide perspective on the 2004-2005 findings (Sergeev and Carpenter 2005 Shcherbatykh et al 2005) by producing results from analyses conducted at a more detailed level of geography the block group rather than ZIP code in order to better control for confounding When the term confounding is used by epidemioogists it refers to the problem that can arise when there is a factor such as income level that is not the factor of interest in the study but that is related to the risk factor being studied (residential location ndash proxy for exposure) and to the disease being studied (CVD) This type of factor is labeled a ldquoconfounderrdquo because it can confound ie interfere with or confuse the evaluation of associations among the variables of interest in the data All types of epidemiological studies face issues of confounding but ecological ie group-level analyses face additional challenges related to potential confounding because analyses can not account for some important individual differences within the groups that are the units of analysis (Morganstern 1998b Hertz-Picciotto 1998) This investigation was able to control for each individualrsquos age sex and race but not for individual-level risk factors such as income or education level

The follow-up investigationrsquos specific objectives are to

improve the ability to control for confounding by socio-economic status through the use of block groups rather than ZIP codes as the unit of analysis

compare the hospitalization rates among residents of block groups nearest the Hudson River to those among residents of block groups farther from the Hudson River

compare the findings of this follow-up study to those of the earlier studies

Study Area and Exposure Areas

The overall study area is comprised of the 12 counties with geographic areas abutting the Hudson River south of the source of PCB contamination from the General Electric facility in Hudson Falls The 12 counties are Westchester Rockland Putnam Orange Dutchess Ulster

8

Columbia Greene Rensselaer Albany Washington and Saratoga US Census boundaries are used to define the study area because census population estimates provide the data for the study area populationrsquos size and the populationrsquos age sex and race distribution Because the study period spanned 1990 through 2005 and populations change over time 1990 and 2000 Census data were used to linearly interpolate the between-year populations from 1991 through 1999 for each block group 2000 data were used without extrapolation to estimate block group populations from 2001 through 2005

Because block group boundaries change between the Census years 1990 block populations were normalized to fit 2000 block group boundaries before the estimation of sex age and race group specific populations In the digital map 2000 block group boundaries were overlaid on the 1990 block centroids to assign 1990 block population counts to the corresponding 2000 block group boundaries Then 1990 block populations were summed to fit the block group boundaries Next population counts by age group (0-14 15-24 25-34 35-44 45-54 55-64 65-74 75+) sex (male female) and race (white black asian other) were estimated for each year at the block group level This investigation includes all ages over 25 years and the study population is divided into four race categories white black or African-American Asian and ldquoOtherrdquo The Pacific Island group was combined with the Asian group and American Indian ldquootherrdquo and ldquomultiple racesrdquo categories are combined into a larger ldquootherrdquo category

Classification of the population into potential exposure categories for this study is based exclusively on location relative to the Hudson River Two types of exposure proxy categories were developed block group adjacency and block group centroid distance to the river shore For the adjacency categories block groups were assigned to the adjacent category if they directly abutted the Hudson River and all other block groups in the 12 counties were assigned to the nonshyadjacent category

For the distance categories a central point (centroid) within each block group was identified and used to assign block group centroid distance to the river shore The block group centroids were population-weighted which means the geographic distribution of households within 2000 census blocks were used to locate block group centroids based on the geographic population distribution By locating a population-weighted block centroid rather than a more straightforward distance centroid the assignment of block group distance to the river shore more accurately reflects distance to the river for the majority of block group residents Buffers of one-half mile and one mile from the Hudson River boundary were created for the assignment of each block group into one of the three distance categories less than frac12 mile (closest) frac12 to one mile (close) and farther than one mile from the river (distant)

Distance categories were selected a priori at the start of the study Because environmental sampling data were not collected along the length of the river included in this investigation we are not able to determine whether these distance categories represent actual differences in exposures As discussed previously a statistically significant reduction in the total PCB concentrations in outdoor air samples was observed at a distance of 1200 meters (34 mile) from the river from 0760 ngm3 to 0497 ngm3 in a study conducted in the Hudson Falls Fort Edward area near the source of PCBs (Fitzgerald 2007 Palmer 2007)

9

For the adjacency categories of the 2371 block groups in the study area 255 (11) were classified as adjacent to the Hudson River and 2116 (89) were classified as non-adjacent (Figure 2) For the distance categories 225 (9) were classified as closest to the river 216 (9) as close and 1930 block groups (81) as distant from the river

Comparison of Figures 1 and 2 show that block groups include smaller geographic areas than ZIP codes and that block groups are relatively smaller in area in urban versus rural areas The study area includes a total of 368 ZIP codes compared to a total of 2371 block groups ZIP code average population in the study area is approximately 8000 compared to block group average population of 1200 Comparison of Figures 2 and 3 shows that using population-weighted centroid distance to the river produces different exposure areas than adjacency Fifty-three (20) of the adjacent block groups were categorized for the distance indicator as distant because the population in the block group was concentrated in the area farther than one mile from the river On the other hand about 11 of the 2116 non-adjacent block groups were classified as closest (77) or close (162)

Hospitalization Data

Hospitalizations for ischemic heart disease (IHD) and cerebrovascular disease (stroke) were included in this study More specifically hospital discharge records that listed as the principal diagnosis either IHD (ICD-9-CM 410-414 and 4297) or cerebrovascular disease (ICD-9-CM 430-436) and subsets of each of these categories similar to the subsets evaluated in the Sergeev et al and Shcherbatykh studies were included The subsets are AMI and angina subsets of IHD and ischemic and hemorrhagic stroke which are subsets of cerebrovascular disease (stroke) (Principal diagnosis represents the illness for which the person was admitted to the hospital) Angina is an outcome not evaluated in the prior studies It was added here for comparative purposes because it represents an outcome appropriately addressed by primary outpatient rather than inpatient hospital care Angina is one of a group of outcomes included in the recently developed national ldquoPrevention Quality Indicatorsrdquo (USDHHS AHRQ 2007) as an indicator for geographic areas or populations for whom primary care is inadequate Table 3 lists the specific ICD-9-CM codes and definitions included in this investigation

While one of the previously published studies (Huang et al 2006) showed elevations in hypertension (high blood pressure) rates this outcome is rarely listed as the principal reason for hospitalization As a secondary diagnosis however hypertension is extremely common with substantial overlap with the primary diagnoses of IHD and stroke In addition hypertension hospitalizations were not as elevated in the ZIP codes adjacent to the Hudson River as the IHD and stroke outcomes As a practical matter geocoding the hypertension hospitalization data (1364000 hospitalizations) would overwhelm the resources for this project For these reasons hypertension was not included in this follow-up investigation

The source of the hospitalization data was the NYS DOH Statewide Planning and Research Cooperative System (SPARCS) established in 1979 to collect detailed records on discharges from hospitals located in New York State Persons seen in the emergency department but not admitted are not included in the SPARCS data Hospitalizations at US Veterans Administration

10

Hospitals are also not included in this dataset We identified the hospital transfers in the data and excluded them so that individuals are only counted once for the same event

The NYS DOH Data Protection Review Board for SPARCS hospitalization data approved our use of hospitalization records for this investigation and provided approximately 460000 records with a principal diagnosis of either IHD or stroke Because of the large number of cardiovascular hospitalizations and this studyrsquos requirement for geocoding each hospitalized personrsquos address we selected a 50 random sample of the hospital discharges with principal diagnoses of IHD or stroke for inclusion in the study (229948 records) Approximately one percent of these records (2737) were eliminated from the study because they contained no residential address resulting in 227211 hospitalization records 24939 of these records were eliminated after geocoding because they were not within the 12 county study area This left 202272 records remaining in the study

For geocoding to the block group level the hospitalization street addresses were modified as needed using US Postal Service standards The addresses were assigned geographic coordinates using commercially available geocoding software (MapMarker 2004) A combination of land parcel data and street centerline files were used to assign geographic coordinates No contact was made with cases parents legal guardians or next of kin of cases to determine residential locations

Once geographic coordinates were assigned to cases through address-matching the case locations were overlaid onto digital maps of the study area using a geographic information system so that the number of observed cases falling within the study area boundaries could be determined and a 2000 Census block group could be assigned to each hospitalization In order to protect confidentiality no maps of individual case locations are provided Geographic coordinates were not able to be assigned for some addresses for example rural routes or post office boxes Internet searches were conducted for addresses that were located in facilities such as nursing homes and trailer parks

While fewer than one percent of records were missing sex or age information approximately 15000 records seven percent were missing race information Evaluation of the geographic and temporal distribution of these records showed they were not randomly geographically distributed 47 of the records with no race information were from one hospital for that hospital race was missing from 100 of the records in all years except one after 1997 mid-way through the study time-frame Evaluation of other records missing race showed similar patterns of 100 of records missing race in specific years In order to prevent biased hospitalization rate estimates records missing race were assigned race randomly assuming a race distribution equal to that of the individualrsquos residential block group

Statistical Analysis

This investigation used negative binomial multivariable regression models the statistical methods used in the previous studies (Sergeev and Carpenter 2005 [IHD] Shcherbatykh et al 2005 [stroke]) The follow-up studyrsquos analyses compared hospitalization rates for populations in block groups adjacent or in close proximity to the Hudson River to rates for populations more

11

distant from the Hudson River while adjusting for the potential influence of other factors Sex age and race-specific hospitalization count data comprise the dependent variable and the statistical methods attempt to predict the counts using the sex age and race distributions from Census data as well as block group-level demographics such as median income Models were first estimated using basic Census information for age sex and race Then more fully adjusted models were estimated using information at the block group level for income education population density Hispanic ethnicity and distance to the nearest hospital

More specifically each hospitalized personrsquos block group was used to assign quartile categories of block group median household income of population with less than a college education population density of the population identified as Hispanic and distance from the block group centroid to the nearest hospital These variables were selected in advance The population density variable was selected to account for potential effects on hospitalization rates of large differences in population density from the relatively rural northern counties to the more urban southern counties in the study area of the population identified as Hispanic was included because Hispanic ethnicity may be associated with CVD risk but Hispanic ethnicity was not available in the hospitalization records Distance to the hospital was included because studies show that shorter distances increase the likelihood that individuals with AMI and stroke survive to be admitted to the hospital

As described previously the literature about risk factors for CVD shows strong associations between lower income and education and higher CVD risk Regarding the effects of income and education on hospitalization rates the literature from analytic studies specifically of hospitalizations for CVD is quite sparse Most analytic studies focus on prevalence incidence or mortality in population-based studies or cohort studies and there are not many of these An example of a study that was useful as background is a study of income level and CVD mortality that showed strong associations of lower income and higher CVD mortality for men and women with the relationship for men being linear throughout the range of income levels while for women the relationship was only strong at levels of income below the median (Rehkopf 2008)

This follow-up studyrsquos approach was to use the same set of variables for the more fully adjusted models using quartile cut points for all the sexrace groups and types of CVD diagnostic subsets The 2000 Census data were used to create the population-weighted quartile categories to be used in the statistical analyses The quartile cut points are shown below

population density(persons per square mile)

median 1999 household income

with less than a college education

with Hispanic ethnicity

Hospital distance

Quartile 1 61 - lt507 $2499-lt$40968 0 - lt31 0 - lt2 lt25 (median) Quartile 2 507 - lt102 $40968 - lt54650 31 - lt43 2 - lt5 25 - lt6 (50-75) Quartile 3 102 - lt6410 $54650 - lt72944 43 - lt55 5 - lt10 6 ndash 11 (75-90) Quartile 4 6410 - 94372 $72944 - 200001 55 - 100 10 - 86 11 ndash 45 (90-100)

SAS version 9 provided the statistical programs used to estimate hospitalization rates for exposure proxy categories and demographic categories The models provide ldquocontrast

12

estimatesrdquo ie rate ratios for one category versus a reference category of a variable and 95 confidence intervals (CI) for these rate ratios For the location categories rate ratios for the areas adjacent are compared to areas not adjacent to the river (reference) and areas closest and close to the river are compared to areas distant from the river (reference) If a rate ratio is greater than 100 the model has estimated an elevation of hospitalizations in that category of the study population compared to the reference population If the rate ratio is less than 100 then there are fewer than expected hospitalizations compared to the reference population

The magnitude of the elevation or deficit is also estimated by the rate ratio For instance if the hospitalization rate is twice as high in the adjacent population it would result in a rate ratio of 200 while a 50 higher rate would result in a rate ratio of 150 If the hospitalization rate in the comparison group was only half as big as in the reference group this would result in a rate ratio of 050 The 95 confidence interval assists with determining whether the observed differences are statistically meaningful ie statistically significant The confidence interval provides the range in which there is a 95 probability of including the true rate ratio

In addition to directly assess potential confounding we examined the associations between income and exposure proxy categories and between income and CVD hospitalization risk Because the modeling process provides rate ratios that provide comparisons among groups but not the rates themselves we also calculated the rates per population for each sex race and income category in order to directly observe the relative magnitudes of CVD hospitalization rates Finally analyses stratified by income quartile were also conducted in order to assess the potential role of confounding by income The stratified analyses consist of negative binomial regressions conducted separately for each quartile of income

4 RESULTS

Demographics Associations with Proximity to the River

Study Population Table 4 provides detailed demographic characteristics for the total study population and the populations of the adjacent versus non-adjacent and nearer versus farther from the river block groups from the 2000 Census The total population of the study area for this one year of the study time period is 2924631 persons The first row of Table 4 shows that approximately 11 percent of the study area population is classified as living in block groups adjacent to the Hudson River and 89 live in non-adjacent block groups Approximately 8 of the studyrsquos population lives in the closest block groups approximately 10 lives in close block groups and approximately 82 in distant block groups The table also shows that approximately 81 of the total study population is identified as white 9 black 3 Asian less than 01 Pacific Islander 02 American Indian 4 ldquootherrdquo and 2 ldquomultiple racesrdquo

The 2000 Census data (Table 4) also show that the populations in block groups adjacent and nonshyadjacent to the river are similar in terms of the percent of minority population but the adjacent block groups include a smaller percentage of persons who have attended college and a higher percentage of households below the poverty level Median income is $47320 for block groups adjacent versus $62668 for block groups not adjacent to the river Data for the study area from the 1990 Census show similar distributions (data not shown)

13

For the distance to the river categories there is a gradient for the three levels of distance from the river with the closest block groups on average having higher percentages of minorities smaller percentages of persons who have attended college and higher percentages of households below the poverty level (Table 4) The populations of the close and closest block groups are more similar to each other than to the populations of the distant block groups The population of the closest block groups is estimated to be 19 black close block groups 14 black and distant block groups 8 black The proportions for the ldquootherrdquo category used in the analyses which includes multiple races and American Indians are approximately 12 in both the closest block groups and in the close block groups compared to 5 in the block groups most distant from the river Median income for the closest block groups is $40748 close block groups $44962 and most distant block groups $64847

The bar chart below illustrates the correlations between US Census 2000 indicators of block group socio-economic status and block group centroid (population weighted) distance from the river for the study population

2000 Census Demographics for Study Population by Distance to River Category

black other hispanic no college below poverty

of pop ulati on

0

10

20

30

40

50

60

lt12 mile

12 to 1 mile

gt1 mile

Table 5 shows selected demographic information for the person-year estimates used in the study analyses for the 1990-2005 study period Person-year data represent individuals from Census estimates counted repeatedly for each year of the study timeframe As described in the Methods section the estimation of person years used interpolation to account for population changes from 1990-2000 and techniques for taking account of block boundary changes The person-years estimates for demographic categories (complete data not shown) show similar patterns as the 1990 Census data (data not shown) and 2000 Census data (Table 5) Table 5 for example shows nearly identical percentages of population in each of the river distance categories as Table 4

Table 5 also shows person year information for income quartiles and exposure proxy categories as well as race and exposure proxy categories that is consistent with the Table 4 data for the

14

2000 population Table 5 row percentages show for example that while approximately 11 of the total study population live in adjacent block groups approximately 19 of the population in the lowest income quartile live in adjacent block groups By design approximately 25 of the total study population is in each of the income quartiles Table 5 column percentages show however that in the adjacent block groups 41 of the population is in the lowest income quartile and only 9 are in the highest income quartile

For the block groups within frac12 mile of the river the distribution among income quartiles is even more skewed towards the lowest income quartile 61 of the population living within frac12 mile of the river is in the lowest income quartile and only 6 is in the highest income quartile Given that approximately 25 of the total study population is within each income quartile these percentages indicate that living within the block groups closest to the river is associated with more than a doubling of the likelihood of being in a block group in the lowest socio-economic status quartile 436 of those living in the block groups close to the river (12 to one mile from the river) are in block groups in the lowest income quartile While less than a doubling this indicates that living in the close block groups is associated with a 75 increase in the likelihood of being in the lowest quartile for socio-economic status

Hospitalizations Table 6 shows the process of removing hospitalization records from the study for a variety of reasons The first column includes the 202272 records that contained a residential address and were located within the 12-county study area as described in the Methods section The next five columns show the records remaining after data cleaning exclusion of transfers elimination of records with missing covariates and exclusions due to poor geocoding results Approximately 33000 records were eliminated because they were transfers from one hospital to another for the same event For approximately 1100 records geocoding was inadequate for assigning block group for imputing race as described in the Methods section 336 records were missing sex (11) or age (325) Of the 167733 records remaining 154220 (92) were able to be adequately geocoded to a street address for assignment in a specific block group While approximately eight percent of hospitalizations were eliminated due to inadequate geocoding Table 6 shows that the percentages of hospitalization records for white black Asian and ldquootherrdquo categories and the distribution among diagnostic categories change very little after these records were dropped

The final column of Table 6 shows that of the 154220 hospitalizations in the analysis 101012 IHD hospitalizations comprise the largest health outcome category and are 65 of the hospitalizations in the study AMI hospitalizations (35811) account for 35 of the IHD hospitalizations and angina (4731) accounts for five percent of the IHD hospitalizations Stroke hospitalizations (53208) comprise 34 of the studyrsquos hospitalizations Approximately 36 of the stroke hospitalizations are for ischemic stroke (19371) and approximately 11 are for hemorrhagic stroke (5995)

Table 7 shows the distributions of hospitalizations among the four income quartiles While each income quartile (by definition) contains approximately 25 of the study population for all types of hospitalizations we see that the lowest income quartile includes more than 25 of hospitalizations from a minimum of 286 (IHD) to a maximum of 356 of hospitalizations (angina) The highest income quartile includes fewer than 25 of hospitalizations for each CVD

15

category ranging from 182 for angina to 224 for hemorrhagic stroke These deviations from the 25 quartile distribution show a strong linear pattern with the largest percentage of hospitalizations coming from residents of block groups in the lowest income quartile and the smallest percentages coming from residents of the most wealthy block groups For AMI the lowest income quartilersquos share of hospitalizations is 50 higher than the highest income quartilersquos share of hospitalizations (30 vs 20) IHD and stroke show similar differences and angina shows a doubling with the lowest income quartile contributing 36 of hospitalizations compared to the highest income quartilersquos 18 share of hospitalizations

Table 7 also shows the distribution of hospitalizations among the race categories and the study population race category distribution for comparison Blacks show lower percentages of IHD and AMI than their population percentages and higher percentages of angina and stroke While full presentation of race and sex findings is beyond the scope of this report the lower than expected percentages of IHD and AMI among blacks who on average have lower socioshyeconomic status and are at greater risk for CVD hospitalizations raised questions that were addressed by calculating hospitalization rates per 1000 population for each sex and race group for each income quartile (Appendix B Table 2) Figures 4 5 and 6 show these rates for the white black and Asian categories (These rates for the ldquootherrdquo category are comparatively very high and are not shown on these graphs This issue is addressed in the Discussion section) Figures 4 5 and 6 show the consistent and strong effects of income quartile on hospitalization rates for AMI and stroke for whites blacks and Asians of both sexes The other outcomes show similar patterns (The ldquootherrdquo group showed an opposite pattern [See Discussion Section])

Demographics Summary

The strong associations of lower income with residence closer to the river (Table 5) and lower income with higher hospitalization rates (Tables 7 amp 8) create a difficult challenge for this investigation The multivariable analyses of hospitalization rates attempt to control for socioshyeconomic differences However because these income differences which strongly affect hospitalization rates are also strongly correlated with the gradations of exposure (river distance) it will be difficult to disentangle the influence of socio-economic status from the influence of potential exposures

Statistical Analyses Proximity to the River and CVD Hospitalization Risk

All the variables described above were included in the multivariable regression models and almost every quartile category for every variable showed statistically significant and expected types of effects for all the CVD hospitalization categories (Appendix B Tables 1a and 1b) The following presentation of results focuses on the rate ratios for the exposure proxy variables Table 8a (IHD) and 8b (stroke) show the regression modeling results Results are shown first for models that include only age sex and race in addition to the exposure proxy categories and second for models that also include median income education population density Hispanic ethnicity and distance to nearest hospital The tables provide numbers of hospitalizations and person-years to assist with interpretation For example for total IHD we can see that while 108 of the study population lives in adjacent block groups 122 of the hospitalizations are among people in adjacent block groups This difference of 14 percentage points is the basis for

16

the finding of a statistically significantly elevated rate ratio (risk) for IHD among people living in the adjacent block groups (RR=133 CI 112-157)

Looking across the columns of Tables 8a and 8b we can see the rate ratios for each of the outcomes for the adjacent versus non-adjacent block groups and the results are similar to those for IHD The elevated rate ratios range from 123 for angina to 136 for AMI The only rate ratio not elevated in these first models adjusting only for age sex and race is for hemorrhagic stroke Going down the rows of the Table the next set of rate ratios are for the distance to the river categories The findings are very similar to those for the adjacent block groups Here however we have two exposure proxy categories allowing comparison of the two These rate ratios show no evidence that living in the block groups closest to the river (lt12 mile to the river) produces greater risk than living in the block groups close to the river (12 to 1 mile from the river)

The next set of rate ratios are for models with additional adjustment for the demographics of each hospitalized personrsquos block group as well as distance to the nearest hospital As expected adjustment for block group median income and other demographics accounts for some of the variability in hospitalization counts and thereby reduces the magnitude of the rate ratios estimated for the river adjacency and distance categories The rate ratio elevation for total IHD for example is reduced from 133 to 107 (RR=107 CI 104-111) The elevated rate ratios in these more fully adjusted models for the adjacency analyses are of similarly small magnitude ranging from 106 for AMI to 110 for ischemic stroke Four of the six outcomes show statistically significant elevations Again only hemorrhagic stroke shows no evidence of an elevated rate ratio

For the river distance categories none of the rate ratios show any evidence that living in the block groups closest to the river (lt12 mile to the river) produces greater risk than living in the block groups close to the river (12 to 1 mile) Of the six types of outcomes only two show statistically significantly elevated rate ratios for living in the closest block groups (total stroke and ischemic stroke) For the close block groups four out of six outcomes show statistically significantly elevated rate ratios The statistically significantly elevated rate ratios vary from 106 (AMI stroke) to 113 (angina)

To address the strong evidence described earlier for confounding in these data ie the association of lower income with both CVD hospitalizations and residence closer to the river we also conducted the analyses separately for each quartile of income These results are presented in Table 9 The first results are from analyses adjusting only for age sex and race and the second set of results are from models including the additional demographic factors For simplicity rate ratios are shown only for block groups closest to the river compared as in the other models to block groups farther than one mile from the river (Results were similar for the other distance categories (data not shown)

For the two lowest income quartiles Table 9 shows results similar to those for the study population as a whole (Table 8a 8b) However for the two highest income quartiles which represent frac12 of the study population Table 9 shows a consistent pattern of reduced rate ratios for residence in the closest block groups within frac12 mile of the river In the highest income quartile

17

these reduced rate ratios are statistically significant for almost all of the outcomes in both the partially and more fully adjusted models In addition the magnitude of the rate ratio estimates ie their departure from 100 is relatively high For the population living in block groups with the highest median income (income gt$73000) living within frac12 mile of the river is associated with having about 70 of the risk for IHD AMI and stroke (CI=068 069 069) as the population in equally high income block groups farther than one mile from the river In other words among the highest income quartile living this close to the river is associated with an estimated 30 reduction in risk for these types of hospitalizations For angina (CI=028) there is an estimated 70 reduction in risk and for ischemic stroke (CI=058) the reduction is estimated to be 40 These stratified analyses show that in one-half of the study population the two higher income quartiles there is no evidence of elevated hospitalization risk for populations in block groups closer to the river On the contrary reduced rate ratios are shown for all outcomes in the higher income quartiles for residents of block groups within frac12 mile of the river compared to those living farther than one mile from the river

Weighted average rate ratio estimates are a standard method for estimating a general population finding from stratified analyses and are also provided in Table 9 These average rate ratios range from 078 to 102 showing no evidence of elevated risk for the overall population associated with residence within frac12 mile of the river compared to living farther than one mile from the river Results for the other categories of river proximity showed similar results (data not shown) (Given the evidence of confounding interfering with analyses of the entire study population and the opposing findings depending on income quartile no confidence intervals are provided here for the overall population-weighted average estimates)

Summary of Results

In summary the multivariable regression results for the entire study population (Tables 8a 8b) show statistically significantly elevated rate ratio estimates for the adjacent versus non-adjacent block groups for four out of six hospitalization diagnosis categories The analyses also produce statistically significantly elevated rate ratio estimates for block groups closest to the river (less than frac12 mile from the river) and close to the river (between 12 to 1 mile from the river) compared to distant block groups (farther than 1 mile from the river) for most of the outcome categories However the findings provide no evidence that the closest block groups have higher hospitalization rates than the close block groups

The most elevated rate ratio estimate is 113 (angina) indicating an estimated 13 percent increase in risk for angina hospitalizations for residents of close versus distant block groups The only outcome with no statistically significant elevations for living near (adjacent closest or close) the river in the more fully adjusted models is hemorrhagic stroke It is noteworthy that the outcome that is considered an indicator of inadequate primary care rather than of incidence or prevalence angina hospitalizations shows the highest elevations associated with residence near the river (Table 8a) In addition the hospitalization outcome with the least association with socioshyeconomic status in our data (Appendix B Table 1b) and the lowest out-of-hospital mortality (NYS DOH2009b) hemorrhagic stroke is the outcome that shows no pattern of elevations associated with residence near the river (Table 8b) This pattern of findings is consistent with the interpretation that the rate ratios for living closer to the river are reflecting the influence of

18

lower socio-economic status and other known risk factors for hospitalizations for CVD rather than any unusual environmental exposures in areas near the river

The stratified results add strong support to this conclusion They suggest that classical confounding of the exposure-disease relationship by socio-economic status produced biased results in the overall multivariable regression analyses While the regression analysis of the entire study population shows elevated hospitalization rates for residents of block groups near the river compared to residents living farther away (Table 8a and 8b) the stratified analyses (Table 9) show this association applies only to the 50 of residents living in block groups with lower median incomes For the other 50 of residents those who live in block groups with higher median incomes hospitalization rate ratios are reduced for those living near the river Classical confounding refers to the type of confounding that can occur in individual-level or group-level (ecological) studies when a factor such as socio-economic status affects both the exposure (residential location) and the health effect In this study of CVD hospitalizations there are disproportionate numbers of outcomes and exposures (residence near the river) among specific levels of the confounder the lower income quartiles (Tables 4 5 7) The disproportionate numbers of hospitalizations and potential exposures (block groups close to the river) among the lower income quartiles create an artifactual exaggerated ie biased effect on the estimates for the overall analysis (Greenland 1999 Greenland 2003)

5 DISCUSSION

The key finding of this investigation is that the results of the multivariable regression modeling for the entire study population the elevated CVD hospitalization rates for populations living closer to the Hudson River are likely to be artifacts of classical confounding Stratifying on the confounding variable ie holding median income level constant the standard method for controlling for confounding showed the decisive role of income as a confounding variable The associations between living closer to the river and having higher hospitalization rates were evident only for people in the two lowest income quartiles For people in the two highest income quartiles the association was completely opposite ie people living closer to the river had lower rates of hospitalizations This variability of effects can be interpreted as evidence that there are no actual effects of the exposure-proxy categories on hospitalization rates

This type of classical confounding due to the strong influence of a variable that precedes causes or strongly affects both the exposure and the health outcome can occur in any type of epidemiological study The ecologic ie group-level analysis in this study is not the primary cause of the confounding Rather the complex associations in the population among income education CVD risk geographic residential patterns and geographic assignment of exposure status are the basis for the confounding However the studyrsquos group-level design contributes to the studyrsquos inability to completely control for confounding using multivariable regression methods and contributes to difficulties of interpreting the study findings

The stratified results shown in Table 9 may also be a reflection of inadequate control for socioshyeconomic status using quartile groupings A possible explanation for the Table 9 findings is that among lower income quartile populations people living in closest proximity to the river may have the lowest incomes while in higher income quartile populations people living in closest

19

proximity to the river may have the highest incomes This potential income difference associated with river distance within quartile groupings would explain why the stratified analyses showed positive associations with hospitalization risk for lower income residents closest to the river and negative associations with hospital risk for higher income residents closest to the river In this example of a possible explanation it is assumed that river proximity categories are capturing relative income (and hospitalization risk) differences rather than exposure Because the exposure proxy river distance categories are assigned at the group level this type of analysis is not able to distinguish whether these categories are capturing income rather than potential exposure in the analyses

When trying to interpret this studyrsquos findings it is important to keep in mind the many factors associated with CVD risk that are not controlled for in this studyrsquos analyses These include tobacco use high blood pressure high serum cholesterol alcohol use obesity low fruit and vegetable intake physical inactivity diabetes mental ill-health psychosocial stress and the use of certain medications Another factor that is not known to be associated with CVD risk but that is relevant for PCB exposure is consumption of contaminated fish It is possible that lower income populations are more exposed to PCBs because they are more likely to eat fish from the Hudson River This however would not help explain why higher income people near the river showed lowered CVD hospitalization risk than their counterparts living farther from the river

In addition to the confounding due to socio-economic statusrsquos effects on both the risk of CVD and residential location (the exposure proxy) there is the possibility that another pattern of effects also associated with socio-economic status called effect modification plays a role in these data Effect modification occurs when different groups people with differing types of health insurance or differing behaviors for example show different patterns or levels of health effects associated with the same exposures The differing effects (CVD hospitalization risks) of residence in block groups closest to the river by income quartile could be interpreted as evidence of effect modification (Table 9) However the magnitude and the change in direction of effects is unlikely to be due solely to differing biological or behavioral factors associated with income level that affect the results of exposures from living near the river Given the strong associations of socio-economic status and CVD hospitalization risk it is more likely that the analyses stratified by income enhance protection against bias compared to the multivariable models and that the stratified analyses represent findings from better control for confounding as opposed to effect modification

Effect modification is a concept used to describe important relationships between exposures and effects (that may differ among sub-groups for example) that the researcher wants to identify and understand Confounding on the other hand refers to associations among the variables that the researcher is trying to control so that the effects of interest (effects from exposures rather than age or sex for example) can be identified and the strength of the effects can be estimated The ecological design of this study contributes to the difficulty of interpreting whether the study shows evidence of effect modification as opposed to confounding (Morgenstern 1998b Hertz-Picciotto 1998)

Another limitation associated with this studyrsquos design is that block-group level socio-economic and environmental attributes that are assigned to individuals in this type of study can be more

20

strongly correlated at the group level than at the individual level In this way the ecological assignment of variables may lead to increasing problems of confounding because of strong associations among indicators of socio-economic status and environmental exposure categories (Morganstern 1998a) This problem is likely to have affected this studyrsquos results As with effect modification by group described above this type of confounding by group cannot be evaluated with ecologic data because individual-level information is needed to check for this type of ldquocross-levelrdquo bias

Ecological analyses are an important tool for environmental epidemiology and there is therefore a large and growing literature on their strengths and limitations and methods to address limitations (Piantadosi 1988 Greenland 1989 Greenland 1992 Greenland 1994 Richardson 2000) One particular concern is the interpretation of ecological study findings showing modest elevations of risk such as in the current study with rate ratios in the more fully adjusted models generally estimated to be below 110 (Wakefield 2003 Morgenstern 1998) Wakefield states that slight elevations need to be interpreted with great caution

in particular for cancer and heart disease (because) there are risk factors that are far more predictive of disease than environmental factors hellip Consequently the potential for confounding is strong since ecological studies do not directly use individual-level risk factor data although stratification by age and gender is routinely carried out (Frequently an area-level measure of socio-economic status is also used as a confounding variable but being an aggregate summary it only provides very crude control) (Wakefield 2003 p 9-10)

A strength of this study is that it used Census block groups which are relatively small average population of 1200 in the study area to assign socio-economic characteristics rather than the larger ZIP codes with average population size 8000 in this area By using block groups this study sought to reduce confounding by socio-economic status because the assignment of median income and other variables at the block group level was expected to be more accurate with block group populations expected to be more homogenous than larger and more geographically diffuse ZIP code populations used in the prior studies

The use of census block groups rather than ZIP codes likely enhanced the accuracy of the group-level category assignments However this type of design that assigns characteristics to hospitalizations using group-level characteristics and distance to the river is limited in comparison to individual-level designs that gather such information from personal interviews for example Most importantly for this investigation the group-level assignment of median income and other demographic variables to each hospitalization limited the studyrsquos ability to distinguish among individuals within the exposure proxy categories and thereby reduced the studyrsquos ability to control for confounding

This type of measurement error or inaccuracy from assigning group-level potential exposure income or educational level to the individual hospitalizations is generally assumed to bias a study towards findings of no associations or no effects But in situations where the measurement error is more severe for some variables than others and these variables are correlated then the study findings can be biased either towards exaggerated effects or no effect In this

21

investigation potential measurement error associated with the assignment of exposure categories based on block group centroid distance to the river is expected to be relatively low because a large proportion of the hospitalization addresses are very clearly in areas distant from the river The measurement error associated with the assignment of group-level demographic characteristics (income and education for example) from the Census is potentially relatively higher because this type of error can occur throughout every block group in the study

Another strength of this study is the use of NYS SPARCS data a relatively comprehensive population-based source of information about hospitalizations These data include individual-level information used in this study for patient address sex age race and diagnosis Of these variables only the address and race categories were missing information for substantial numbers of records and this limitation could have contributed to imprecision sufficient to create bias in the findings These issues are discussed in greater detail below

While geocoding to block groups represents a strength of this study this procedure required high quality address information One percent of the study arearsquos hospitalization records contained no address and eight percent of the addresses could not be accurately geocoded to a block group resulting in nine percent of hospitalizations being eliminated from the study because of address andor geocoding issues The distributions of race categories and distribution among the types of diagnoses did not change after dropping records with no or insufficient address information suggesting that the dropped records did not represent a substantially skewed group of records

Inaccuracy in the studyrsquos assignments of race categories results from limitations associated with both the hospitalization and Census data The SPARCS hospitalization database did not include information on race for seven percent of the studyrsquos hospitalizations We imputed race by assigning race category randomly based on the race distribution of the personrsquos block group This method would have biased the study toward a finding of no effect of race on hospitalization risk

Another limitation of the SPARCS data used in this investigation is that repeat hospitalizations for the same individual were not able to be identified If an individual was admitted to the hospital many times over the course of the study period then that individual would be counted multiple times This limitation may have contributed to an exaggerated association of lower socio-economic status and CVD hospitalizations because some people with lower incomes receive less adequate outpatient care and may have more repeat admissions Because of the association of lower income with residence near the river this limitation may have contributed to the difficulty of adequately controlling for confounding by socio-economic status in this investigation

While the use of Census data at the block group level rather than at the ZIP code level is a strength in terms of relative homogeneity among the population within block groups it is possible that block group population estimation errors contributed to imprecise estimation of hospitalization rates Particularly with regard to the race categories the study finding that the ldquootherrdquo category showed a counter-trend of higher income and higher CVD hospitalization rates (Appendix B Table 2) is evidence that differences between the categories used in the Census race designations and hospitalization race assignments affected the estimated hospitalization

22

rates The evolving choices for identifying onersquos race in the Census are not concurrently reflected in the SPARCS data and these inconsistencies likely contributed to imprecision in the estimates of CVD hospitalization risk associated with specific race categories These issues did not contribute to bias for the overall findings however We modeled the data with a variety of race category groupings combining black and ldquootherrdquo for example and the overall results did not change (data not shown)

Also regarding the Census undercounting of approximately two percent for the US population in 2000 was estimated to be primarily among the lowest socio-economic groups and minorities while over-counting of one percent was estimated to have occurred among the most wealthy who sometimes received and completed forms at multiple residences (United States Censusrsquo Monitoring Board 2001) The undercounting of the lower income population would result in over-estimation of rates in this population Undercounting of recent Asian immigrants for example particularly those of lower socio-economic status is a potential reason for the relatively high rates of CVD hospitalizations estimated for lower income quartiles of the Asian population (Figures 4-6) Models that grouped the Asian population with other race categories were also estimated and these showed the same results as the models reported here (data not shown) Undercounting of lower income groups in general would have contributed to the finding of higher hospitalization rates in these groups and added to the difficulty of adequately controlling for confounding of lower income with residence near the river

This study used a 50 percent sample of the CVD hospitalizations in the study area This was accomplished by using every other record provided in the complete dataset This would be expected to provide a non-biased random sample As a general check for data provision management or analysis errors we compared our estimates of hospitalization rates per population with those from other sources Table 10 shows these comparisons and shows that our estimates are generally consistent with rates from other sources

Actual exposures were not measured in this investigation Rather proximity to the river was used as a proxy for potential exposures associated with the river In addition other potential exposures that may impact CVD were not evaluated or measured Exposures to lead and air pollution would be expected to be higher in more urban areas and areas near heavy traffic which may tend to coincide with proximity to the river These issues were not addressed in this investigation If these exposures were occurring more often near the river and contributing to cardiovascular hospitalization rates this would contribute to findings of elevations in areas closer to the river

It is important to keep in mind the limitations associated with using CVD hospitalizations when incidence andor prevalence of CVD are the actual outcomes of interest As described previously mortality prior to reaching the hospital may reduce hospitalization rates particularly for AMI and stroke for some groups or areas more than others and introduce bias if hospitalizations are being interpreted as a surrogate for incidence or prevalence

23

Addressing Study Objectives

This follow-up report has addressed the studyrsquos first two objectives to increase control for confounding by socio-economic status through the use of block groups rather than ZIP codes as the unit of analysis and to compare the hospitalization rates among residents of block groups nearest the Hudson River to those among residents of block groups farther from the Hudson River Regarding the first objective the use of smaller units of analysis block groups rather than ZIP codes improves the accuracy of the assignment of income and other group-level demographic categories to the hospitalizations

Regarding the studyrsquos second objective to compare hospitalization rates among residents of block groups near to the river to hospitalization rates among residents far from the river this studyrsquos multivariable analyses showed relatively small but statistically significant elevations of CVD hospitalization risk for residents of block groups in close proximity to the Hudson River Stratified analyses that controlled for confounding directly by conducting analyses within each income quartile showed that residence in block groups near the Hudson River was associated with increased risk for CVD hospitalizations among residents of lower income block groups and conversely residence in block groups near the Hudson River was associated with decreased risk for CVD hospitalizations among residents of higher income block groups The estimated increased and decreased risks were of sufficient magnitude to nearly cancel each other out when average rate ratios for the four income quartiles were calculated The average rate ratios may be interpreted as showing no evidence that CVD hospitalization rates are elevated for residents living near the river for the general study population This finding suggests that the multivariable regression findings for the overall population show elevated risks for hospitalizations among residents of block groups near the river due to confounding of the relationship of residential location and CVD risk by socio-economic status

The findings from analyses stratified by income quartile may also be interpreted as showing that residing in a higher versus lower income quartile block group produces different effects from similar exposures (effect modification) As discussed previously the ecological ie group-level analyses are unable to directly evaluate confounding and effect modification Rather information about the strengths and direction of associations among variables in the analyses need to be evaluated in light of known risk factors to draw conclusions about likely explanations and plausible interpretations In this case uncontrolled confounding by socio-economic status appears to be the most likely explanation for the findings This is due to the strong association between residential proximity to the river and socio-economic status

The studyrsquos third objective was to compare the findings of this review to those of the earlier studies that prompted this investigation Because of the similarities in study design and statistical analysis the prior studyrsquos findings can be compared with the findings of the multivariate regression analyses in the current study using Table 1 and Tables 8a and 8b The major difference between the studies is the composition of the comparison areas for the analyses Because of this difference as well as other differences (Table 3) the studies are not entirely comparable

24

Table 1 shows some findings from the prior studies Comparison with Table 8a and 8b from this study shows that the CV hospitalization rate ratios estimated for living in block groups adjacent or near the river in the current study are substantially lower than the rate ratios for residence in ZIP codes adjacent to the Hudson River in the previously published studies In most cases the current analyses show elevations below 10 in contrast to elevations of 20 to 39 in the prior studies The results of the current study are quite similar to the prior study results in terms of the associations reported for age sex race and income (Appendix B Tables 1a and 1b) (Sergeev and Carpenter 2005 Tables 4 amp 5 Shcherbatykh et al 2005 Table 3) These similarities and the general methodological similarities between the current and prior studies suggest the likelihood that the prior studiesrsquo findings also were affected by confounding by socio-economic status

6 CONCLUSION AND RECOMMENDATIONS

While this type of study can neither prove nor disprove causal links among risk factors and health effects it is useful for examining evidence for unusual patterns that might warrant additional investigation This evaluation of hospitalization data residential location and socioshyeconomic factors in block groups in the 12 Counties abutting the Hudson River revealed a closely coinciding pattern of lower socio-economic status and residential proximity to the Hudson River the proxy for potential exposure and a strong association of lower socioshyeconomic status and risk for CVD hospitalizations The strong associations among residential location socio-economic status and CVD risk prevent us from drawing definitive conclusions from the multivariable regression analyses

The multivariable analyses showed relatively small but statistically significant elevations of CVD hospitalization risk for residents of block groups in close proximity to the Hudson River Stratified analyses conducted within each of the four income quartiles showed such elevations only among residents of lower income block groups This studyrsquos data methods and analyses can not provide a conclusive interpretation for this finding However the strong effect of socioshyeconomic status on residential location (exposure proxy) and on CVD hospitalization risk makes it likely that the overall multivariable regression results are affected by uncontrolled confounding by socio-economic status

To draw more definitive conclusions particularly for outcomes such as CVD that are determined by multiple known risk factors studies seeking to investigate the role of PCB or POP exposures will require resource-intensive methods that include gathering individual-level exposure or biomonitoring information as well as individual-level medical histories and information on other risk factors

This current investigation did not include environmental or biological sampling data One study described previously did gather such data in the Hudson River region with the highest potential exposures (Hudson Falls Fort Edward) This study detected a statistically significant reduction in the total PCB concentrations in outdoor air samples at a distance of 1200 meters (34 mile) from the river However the PCB levels in outdoor air in this area were lower than those in other communities with known PCB-contaminated sites and similar to levels reported in other locations in the northeastern United States The study found no detectable differences in study

25

subjectsrsquo serum PCB levels associated with proximity to contaminated portions of the river or other PCB sources (Fitzgerald 2007 Palmer 2008)

It is recommended that the results of the current investigation of CVD hospitalizations and residence near the Hudson River be shared with NYS residents and other stakeholders with interest in this issue and be submitted for publication in a peer-reviewed journal

26

AGENCY INFORMATION

New York State Department of Health Authors Elizabeth Lewis-Michl PhD

Research Scientist Bureau of Environmental and Occupational Epidemiology

June Beckman-Moore MPH Research Scientist Bureau of Environmental and Occupational Epidemiology

Marta Gomez MS Research Scientist Bureau of Environmental and Occupational Epidemiology

Acknowledgments This study was supported by ATSDR and the Centers for Disease Control and Prevention

through a Cooperative Agreement Grant to NYS DOH (3U61TS200002-19W1)

The authors would like to thank the following people for their contribution to this project Alissa Caton PhD of the Bureau of Environmental and Occupational Epidemiology NYS DOH developed the study protocol assembled the preliminary data and conducted preliminary analyses Syni-an Hwang PhD Bureau Director and Shao Lin PhD Research Scientist Bureau of Environmental and Occupational Epidemiology NYS DOH and Edward Fitzgerald PhD Professor Division of Environmental Health and Toxicology SUNY Albany School of Public Health reviewed and provided comments on draft versions

New York State Department of Health Cooperative Agreement Coordinator Don Miles MS

Public Health Specialist Bureau of Environmental Exposure Investigation

ATSDR Technical Project Officer Gregory V Ulirsch PhD

Environmental Health Scientist Cooperative Agreement and Program Evaluation Branch

Division of Health Assessment and Consultation

Mohammed Uddin MD Epidemiologist

Division of Health Studies

ATSDR Regional Representatives Leah Graziano

Senior Regional Representative - Region 2 Office of Regional Operations

Racquel Stephenson Regional Representative - Region 2

Office of Regional Operations

27

REFERENCES Agency for Toxic Substances and Disease Registry (ATSDR) 2000 Toxicological Profile for Polychlorinated Biphenyls (PCBs) ATSDR Division of ToxicologyToxicology Information Branch Atlanta GA

Arisawa K Takeda H Mikasa H 2005 ldquoBackground exposure to PCDDsPCDFsPCBs and its potential health effects a review of epidemiologic studiesrdquo The Journal of Medical Investigation Feb 52(1-2)10-21

Arsenescu V Arsenescu RI King V Swanson H Cassis LA 2008 ldquoPolychlorinated biphenylshy77 induces adipocyte differentiation and proinflammatory adipokines and promotes obesity and atherosclerosisrdquo Environmental Health Perspectives 116 (6) 761-768

Ayala C Croft J Keenan N Neff L Greenlund K Donohoo R Zheng Zhi Mensah G 2003 ldquoIncreasing Trends in Pre-transport stroke deaths- United States 1990-1998rdquo Ethnicity amp Disease 12 Spr S2131-S2137

Barnett E Armstrong D Casper M 1999 Evidence of increasing coronary heart disease mortality among black men of lower social class AEP 98 464-471

Bertazzi PA Bernucci I Brambilla G Consonni D Pesatori AC 1998 ldquoThe Seveso studies on early and long-term effects of dioxin exposure a reviewrdquo Environmental Health Perspectives April 1062 625-633

Carpenter DO 2006 ldquoPolychlorinated biphenyls (PCBs) routes of exposure and effects on human healthrdquo Reviews in Environmental Health Jan-Mar 21(1)1-23

deVreede-Swagemakers JJ Gorgels APDubois-Arbouw WI van Ree JW et al 1997 ldquoOut-of hospital cardiac arrest in the 1990s a population-based study in the Maastricht area on incidence characteristics and survivalrdquo Journal of the American College of Cardiology 30(6) 1500-5

Everett CJ Mainous AG Frithsen IL Player MS Matheson EM 2008 ldquoAssociation of polychlorinated biphenyls with hypertension in the 1999-2002 National Health and Nutrition Examination Surveyrdquo Environmental Research 108 94-97

Fitzgerald EF Belanger EE Gomez MI Hwang S Jansing RL Hicks HE 2007 ldquoEnvironmental exposures to polychlorinated biphenyls (PCBS) among older residents of upper Hudson River communitiesrdquo Environmental Research 104 352-360

Galea S Blaney S Nandi A Silverman R Vlahov D Foltin G Kusick M Tunik M Richmond N 2007 ldquoExplaining racial disparities in incidence of and survival from out-of-hospital cardiac arrestrdquo American Journal of Epidemiology 1665 534-543

Greenland S Morgenstern H 1989 ldquoEcological bias confounding and effect modification International Journal of Epidemiology 18269-74

28

Greenland S 1992 ldquoDivergent biases in ecologic and individual level studiesrdquo Statistics in Medicine 111209-23

Greenland S Robins J 1994 ldquoInvited commentary ecologic studies ndash biases misconceptions and counterexamples American Journal of Epidemiology 139747-60

Greenland S Pearl J Robins JM 1999 Causal diagrams for epidemiologic research Epidemiology 1037-48

Greenland S 2003 ldquoQuantifying biases in causal models classical confounding vs collidershystratification biasrdquo Epidemiology May Vol 143 300-306

Ha M Lee D Jacobs DR 2007 ldquoAssociation between serum concentrations of persistent organic pollutants and self-reported cardiovascular disease prevalence results from the national health and nutrition examination survey 1999-2002rdquo Environmental Health Perspectives 1158 1204-1209

Hennig B Meerarani P Slim R Toborek M Daugherty A Silverstone AE Robertson LW 2002 ldquoProinflammatory properties of coplanar PCBs in vitro and in vivo evidence Toxicology and Applied Pharmacology Jun 15181(3)174-83

Hertz-Picciotto I ldquoEnvironmental Epidemiologyrdquo in Modern Epidemiology Rothman KJ and Greenland S 2nd ed Philadelphia Lippincott Williams amp Williams 1998 pp555-584

Howard G Cushman M Prineas R Howard V Moy C Sullivan L DrsquoAgostino R McClure L Pulley L Safford M 2009 ldquoAdvancing the hypothesis that geographic variations in risk factors contribute relatively little to observbed geographic variations in heart disease and stroke mortalityrdquo Preventive Medicine 49 129-132

Huang X Lessner L Carpenter DO 2006 ldquoExposure to persistent organic pollutants and hypertensive diseaserdquo Environmental Research 102(2005) 101-106

Kannel WB Cupples LA DrsquoAgostino RB 1987 ldquoSudden death risk in overt coronary heart disease the Framingham Studyrdquo American Heart Journal 113(3) 799-804

Ko DT Krumholz HM Wang Y Foody JM Masoudi FA Havranek EP You JJ Alter DA Stukel TA Newman AM Tu JV 2007 ldquoRegional differences in process of care and outcomes for older acute myocardial infarction patients in the United States and Ontario Canadardquo Circulation 115196-203

Lee D Lee I Song K Steffes M Toscano W Baker BA Jacobs DR 2006 ldquoA strong dose-response relation between serum concentrations of persistent organic pollutant and diabetesrdquo Diabetes Care 29 (7) 1638-1644

Lee D Lee I Porta M Steffes M Jacobs DR 2007 ldquoRelationship between serum concentrations of persistent organic pollutants and the prevalence of metabolic syndrome among non-diabetic

29

adults results from the National Health and Nutrition Examination Survey 1999-2002rdquo Diabetologia 50 1841-1851

Lind PM Orberg J Edlund U Sjoblom L Lind L 2004 ldquoThe dioxin-like pollutant PCB126 (33rsquo44rsquo5-pentachlorobiphenyl) affects risk factors for cardiovascular disease in female ratsrdquo Toxicology Letters 150 293-299

Longnecker MP Klebanoff MA Brock JW Zhou H 2001 ldquopolychlorinated biphenyl seum levels in pregnant subjects with diabetesrdquo Diabetes Care 24(6) 1099-1101

Lloyd-Jones D Adams R Carnethon et al (AHA Statistics Committee and Stroke Statstics Subcommittee) 2009 ldquoAHA Statistical Update Heart disease and stroke statisticsmdash2009 updaterdquo Circulation 119e21-e181

MapMarker Plus V 100 MapInfo Corp 2004

Morgenstern H 1998 ldquoEcologic studyrdquo In Armitage P Colton T eds Encyclopedia of biostatistics Vol2 NY NY John Wiley and Sons Inc 1255-76

Morganstern H ldquoEcologic Studiesrdquo Chapter 23 in Modern Epidemiology Rothman KJ and Greenland S 2nd ed Philadelphia Lippincott Williams amp Williams 1998 pp459-480

Morland K Wolff M Bopp R Godbold J Landrigan P 2008 ldquoFish consumption and body burden of organochlorines among Lower Hudson urban anglersrdquo American Journal of Industrial Medicine 51587-594

New York State Department of Health (NYS DOH) 2009a ldquoThe Burden of Cardiovascular Disease in New York Mortality Prevalence Risk Factors Costs and Selected Populationsrdquo Bureau of Chronic Disease Epidemiology and Surveillance Bureau of Health Risk Reduction httpwwwhealthstatenyusnysdohchronic_diseasecardiovascularburdenofcvdinnyspdf

New York State Department of Health (NYS DOH) 2009b ldquoHeart Disease and Stroke Indicatorsrdquo Community health assessment indicators Bureau of Biometrics and Health Statistics httpwwwhealthstatenyusstatisticschacchai

OrsquoNeill L 2003 ldquoEstimating out-of-hospital mortality due to myocardial infarctionrdquo Health Care Management Science 6 147-154

Palmer PM Belanger EE Wilson LR Hwang SA Narang RS Gomez MI Cayo MR Durocher LA Fitzgerald EF ldquoOutdoor air PCB concentrations in three communities along the Upper Hudson River New Yorkrdquo 2008 Archives of Environmental Contamination and Toxicology 54363-371

Piantadosi S Byar DP Green SB 1988 ldquoThe ecological fallacyrdquo American Journal of Epidemiology 127893-904

30

Rehkopf DH Berkman LF Coull B Krieger N 2008 ldquoThe non-linear risk of mortality by income level in a healthy population UgtSgt National Health and Nutrition Examination Survey mortality follow-up cohort 1988-2001rdquo British Medical Journal Public Health 8383

Richardson S Montifort C 2000 ldquoEcological correlation studies In Elliott P Wakefield JC Best NG et al eds Spatial epidemiology methods and applications NY NY Oxford University Press 205-220

SAS version 9 Cary North Carolina

Schoenbach V 2001 ldquoMulticausality Effect Modification wwwepidemiolognet Ch12 381shy413

Schoenbach V 2004 ldquoMulticausality Confounding wwwepidemiolognet Ch11 335-375

Sergeev AV and Carpenter DO 2005 Hospitalization rates for coronary heart disease in relation to residence near areas contaminated with persistent organic pollutants and other pollutants Environmental Health Perspectives 113(6)756-761

Shcherbatykh I Huang X Lessner L Carpenter DO 2005 Hazardous waste sites and stroke in New York State Environmental Health A Global Access Science Source 4(18) httpwwwehjournalnetcontent4118

Siemiatycki J Wacholder S Dewar R Cardis E Greenwood C and Richardson L 1988 ldquoDegree of confounding bias related to smoking ethnic group and socio-economic status in estimates of the associations between occupation and cancerrdquo Journal of Occupational Medicine 30 617-625

Tu YK Gunnell D Gilthorpe MS 2008 ldquoSimpsonrsquos paradox Lordrsquos paradox and suppression effects are the same phenomenon- the reversal paradoxrdquo Emerging Themes in Epidemiology 52

United Kingdom Heart Attack Study Collaborative Group Norris RM 1998 ldquoFatality outside hospital from acute coronary event in three British health districts 1994-1995rdquo British Medical Journal 316(7137) 1065-70

United States Bureau of the Census 2000 Census of population and housing summary file 1 (SF1) United States Department of Commerce 2001

United States Bureau of the Census 2000 Census of population and housing summary file 3 (SF3) United States Department of Commerce 2002

United States Bureau of the Census 1990 Census of population and housing summary file 1 (SF1) United States Department of Commerce 1991

31

United States Bureau of the Census 1990 Census of population and housing summary file 3 (SF3) United States Department of Commerce 1992

United States Censusrsquo Monitoring Board ldquoFinal Report to Congressrdquo September 1 2001 httpgovinfolibraryunteducmbcmbpreportsfinal)report

United States Department of Health and Human Services (USDHHS) Agency for Healthcare Research and Quality 2007 ldquoAHRQ Quality Indicators Guide to Prevention Quality Indicators Hospital Admission for Ambulatory Care Sensitive Conditionsrdquo October 2001 Version 31 (March 12 2007) downloaded 2009 httpwwwqualityindicatorsahrqgov

United States Department of Health and Human Services (USDHHS) Centers for Disease Control 2007 Health United States A Chartbook National Center for Health Statistics wwwcdcgovnchsdatahushus07pdf

United States Department of Health and Human Services (USDHHS) Centers for Disease Control National Center for Health Statistics 2007 ldquo2005 National Hospital Discharge Surveyrdquo No 385 by De Frances C and Hall M Division of Health Care Statistics

United States Department of Health and Human Services (USDHHS) 2009 Centers for Disease Control and Prevention National Center for Environmental Health 4th National Report on Environmental Chemicals httpwwwcdcgovexposurereportpdfFourthReportpdf

United States Department of Health and Human Services (USDHHS) National Institutes of Health 2010 National Institute of Environmental Health Sciences National Toxicology Program 11th Report on Carcinogens httpntpniehsnihgovntproceleventhprofiless149pcbpdf

United States Environmental Protection Agency (USEPA) 2002 EPA Superfund record of decision Hudson River PCBs Site Hudson Falls to New York City New York Washington DC

Wakefield J 2003 Sensitivity Analyses for Ecological Regression Biometrics March 59 9-17

Wakefield J 2004

Wing S 1988 Social inequalities in the decline of coronary mortality American Journal of Public Health 781415-1416

World Health Organization 2010 Atlas of Heart Disease and Stroke Part 2 Ch 3 Cardiovascular Disease Risk Factors httpwwwwhointcardiovascular_diseasesencvd_atlas_03_risk_factorspdf

32

Table 1 Previous Study Results Statewide Analyses and Hudson River Specific Analyses

Principal Diagnosis of Hospitalization

Adjusted Rate Ratioa (Lower 95CI shyUpper 95 CI)

NYS analysis Hudson River Subset

analysis Ischemic Heart Diseaseb

115 (103-129) 136 (118-156) Acute Myocardial Infarction 120 (103-139) 139 (118-163)

Cerebrovascular Diseasec 115 (105-126) 120 (110-132)

Ischemic Stroke 117(104-139) --shy Hemorrhagic Stroke 110 (099-122) ---

Hypertensionc 119 (108-131) 114 (105-123)

Table adapted from Sergeev et al 2005 Shcherbatykh et al 2005 and Huang et al 2006 Ischemic heart disease and stroke hospitalization rates for NYS ZIP codes containing (or

adjacent to) inactive hazardous waste sites with persistent organic pollutants compared to ZIP codes without (or not adjacent to) such sites

a All analyses include a 2nd exposure category for other types of inactive hazardous waste sites and adjust for age race and sex

b Ischemic heart disease analyses include all ages over 25 Ischemic heart disease analyses adjust for quartiles of median household income and health insurance coverage

c Cerebrovascular disease and hypertension analyses include ages 25-64

33

Table 2 Comparison of Design amp Methods for Previously Published Studies and Current Follow-up Study

OUTCOME ASSESSMENT CONFOUNDER ASSESSMENT EXPOSURE ASSESSMENT Previous Study Follow-up Study Previous Study Follow-up Study Previous Study Follow-up Study

Coronary Heart Disease 1993-2000 More years 1990-2005 ZIP code level Block group level ZIP code level Block group level

Ischemic heart disease (ICD-9 410shy Same outcomes with 4 levels of income ndash Same income Exposure = residence Two types of exposure 414) and sequelae of myocardial one addition separate 2000 Census categories within 196 ZIP codes indicator categories within infarction not elsewhere classified analysis of angina (ICDshy quartiles with hazardous waste 12 counties abutting (ICD-9 4297) Separate analysis of 9 413) sites containing POPs Hudson River acute myocardial infarction 6 age groups Same age groups or 222 ZIP codes with (ICD-9 410) age 25 and above hazardous waste sites Adjacency 255 block

without POPs versus groups adjacent versus Principal or other diagnoses (up to 14) Only principal diagnoses 4 race groups Different race residence within 996 2116 not adjacent to the combined AsianPacific categories White ldquocleanrdquo ZIP codes Hudson River

Use the admission Islander White black AsianPacific statewide Reference Sergeev AV Carpenter source field to eliminate black Native Island and Other which Distance Block group DO Hospitalization rates for coronary intrahospital transfers for American includes Native Sub-analyses (focus of population-weighted heart disease in relation to residence the same individual American and Multiple follow-up) Hudson centroid distance from the near areas contaminated with Race River area 78 POP ZIP Hudson River lt12 mile persistent organic pollutants and other codes versus 996 ldquocleanrdquo (225 block groups) pollutants Environ Health Perspect Add education ZIP codes statewide between frac12 and 1 mile 2005 113(6)756-761 population density (216 block groups) farther

hispanic and distance to than 1 mile (1930 block nearest hospital as groups) additional covariates

Stroke 1993-2000 More years 1990-2005 Restricted to 2nd and (same as above) (same as above) (same as above) 3rd quartile income

Cerebrovascular disease excluding Same outcomes ZIP codes other ill-defined and late effects of cerebrovascular disease (ICD9 430shy 4 age groups 25-64 436) Separate analyses of ischemic stroke (433x1 434x1 436) and 2 race groups hemorrhagic stroke ( 430-432) White black

Principal or secondary diagnoses (up Only principal diagnoses to 14) combined

Use the admission Reference Shcherbatykh I Huang X source field to eliminate Lessner L Carpenter DO Hazardous intrahospital transfers for waste sites and stroke in New York the same individual State Environ Health 2005 418

34

Table 3 Study Cardiovascular Disease Outcomes ICD-9-CM Codes and Definitions

Ischemic Heart Disease

ICD-9-CM 410 - ldquoAcute myocardial infarction (AMI)rdquo is defined as a sudden insufficiency of blood supply to an area of the heart muscle usually due to a coronary artery occlusion (obstruction) commonly known as heart attack

ICD-9-CM 411- ldquoOther acute and subacute forms of ischemic heart diseaserdquo refers to complications or symptoms occurring before or after AMI or a coronary artery occlusion (obstruction) interrupting blood flow to the heart without AMI

ICD-9-CM 412 - ldquoOld myocardial infarctionrdquo refers to a past AMI that is currently presenting no symptoms ICD-9-CM 413 - ldquoAngina pectorisrdquo is defined as severe constricting pain in the chest often radiating from the

area immediately over the heart to the left shoulder and down the arm ICD-9-CM 414 - ldquoOther forms of chronic ischemic heart diseaserdquo includes coronary atherosclerosis a chronic

condition marked by thickening and loss of elasticity of the coronary artery caused by deposits of plaque

ICD-9-CM 4297 - ldquoCertain sequelae of myocardial infarction not elsewhere classifiedrdquo refers to other symptoms or complications associated with a prior AMI

Cerebrovascular Disease

ICD-9-CM 430 - ldquoSubarachnoid hemorrhagerdquo is bleeding in the space between the brain and lining ICD-9-CM 431 - ldquoIntracerebral hemorrhagerdquo is bleeding within the brain ICD-9-CM 432 - ldquoOther and unspecified intracranial hemorrhagerdquo is bleeding nontraumatic between the skull

and brain lining ICD-9-CM 433 - ldquoOcclusion and stenosis of precerebral arteriesrdquo is blockage or stricture of the arteries

branching into the brain ICD-9-CM 434 - ldquoOcclusion of cerebral arteriesrdquo includes cerebral thrombosis and cerebral embolism ICD-9-CM 435 - ldquoTransient cerebral ischemiardquo includes cerebrovascular insufficiency (acute) with transient

neurological signs and symptoms ICD-9-CM 436 - ldquoAcute but ill-defined cerebrovascular diseaserdquo

35

Table 4 ndash Census Demographics for Study Area by Proximity to the Hudson River 2000

Census Demographics Study Area Within Block Groups Adjacent to River Block Group Centroid Distance to River Yes No lt frac12 mile frac12-1 mile gt 1 mile

n n n n n n Total Population and distribution within river distance categories

2924631 1000 315662 108 2608969 892 231414 79 283321 97 2409896 824

RaceEthnic Distribution 1000 1000 1000 1000 1000 1000 White 2383274 815 255023 808 2128251 816 154371 667 201457 711 2027446 841 Black 277302 95 34039 108 243263 93 43895 190 38959 138 194448 81 Total ldquoOtherrdquo (listed below)

American Indian 7147 02 882 03 6265 02 975 04 1058 04 5114 02

Asian 86465 30 6233 20 80232 31 4711 20 8979 32 72775 30 Pacific Islander 1058 lt01 111 lt01 947 lt01 104 lt01 145 lt01 809 lt01 Other 105182 36 12492 40 92690 36 19288 83 23338 82 62556 26 Multiple Races 64203 22 6882 22 57321 22 8070 35 9385 33 46748 19

Hispanic 268036 92 28449 90 239587 92 40691 176 46978 166 180367 75

Education lt9th grade 108312 56 13370 64 94942 55 13585 91 14518 81 80209 50 9th-12th grade 200374 103 27023 130 173351 100 23306 155 25161 140 151907 94 High school or GED 519236 267 62727 302 456509 263 44366 296 51585 287 423285 263 Some college 336283 173 36464 175 299819 173 24995 167 31141 173 280147 174 Associates degree 150134 77 16574 80 133560 77 10134 68 12939 72 127061 79 Bachelors degree 342297 176 28578 137 313719 181 18898 126 24584 137 298815 185 Graduate or professional degree

285823 147 23199 112 262624 151 14813 99 19909 111 251101 156

+Other SES Factors Households below poverty 91940 87 12875 108 79065 83 13977 154 14685 140 63278 72

mean SD mean SD mean SD mean SD mean SD mean SD Population density 57466 9589 54496 11391 57826 93468 13501 16071 11068 16248 43763 66909 Median household income 61010 31989 47320 23630 62668 32468 40748 21923 44962 20829 64847 32564

Sources US Bureau of the Census 2000 Census of population and housing summary file 1(SF1) US Department of Commerce 2001 US Bureau of the Census 2000 Census of population and housing summary file 3 (SF3) US Department of Commerce 2001

36

Table 5 Study Population Person Years by River Proximity Category For Income Quartiles and Race Categories

Person Years Total Adjacent Non-Adjacent lt12 mile frac12 to 1 mile gt1 mile n (row)

[column ] n (row)

[column ] n ()

[column ] n ()

[column ] n ()

[column ] n ()

[column ] Total Population 30219001 (1000)

[1000]

3269547 (108)

[1000]

26949454 (892)

[1000]

2380477 (079)

[1000]

2834956 (094)

[1000]

25003568 (827)

[1000]

Income Quartile

Lowest Income 7182853 (1000)

[237]

1345251 (187)

[411]

5837601 (813)

[217]

1462710 (204)

[614]

1236936 (172)

[436]

4483206 (624)

[179]

2nd Income 7778921 (1000)

[257]

1077888 (139)

[330]

6701033 (861

[249]

420361 (054)

[177]

808782 (104)

[285]

6549777 (842)

[262]

3rd Income 7671943 (1000)

[254]

551617 (072)

[169]

7120326 (928)

[264]

344840 (045)

[145]

511237 (067)

[180]

6815866 (888)

[273]

Highest Income 7585283 (1000)

[251]

294790 (039)

[090]

7290493 (961)

[271]

152565 (020)

[064]

278000 (037)

[098]

7154718 (943)

[286]

Race Category

White 25859811 (1000)

[856]

2784914 (108)

[851]

23074897 (892)

[856]

1761910 (68)

[740]

2238348 (87)

[790]

21859554 (845)

[874]

Black 2466185 (1000

[82]

297220 (121)

[91]

2168965 (879)

[81]

389721 (158)

[164]

316704 (128)

[112]

1759760 (714)

[70]

Asian 792737 (1000)

[26]

58007 (73)

[18]

734731 (827)

[27]

45151 (57)

[19]

78161 (99)

[28]

669426 (844)

[27]

Multiple Other 1100269 (1000)

[36]

129407 (118)

[40]

970862 (882)

[36]

183696 (167)

[77]

201743 (183)

[71]

714830 (650)

[29]

37

Table 6 Study Hospitalization Record Exclusions

After After dropping After imputing dropping records with

After excluding race amp poorly missing continuation After excluding eliminating geocoded covariates

Geocoded records transfers missing race records (racesex)

Total 202272 201814 02 169143 162 167733 08 154236 80 154220 001 (Loss ) Demograp hic group

White 155216 849 154805 849 132749 859 144262 860 132343 858 132332 858 Black 12878 70 12849 70 11109 72 12010 72 11328 73 11326 73 OtherNa 12745 70 12741 70 8814 57 9313 55 8647 56 8644 56 AsianPI 2021 11 2020 11 1810 12 2148 13 1918 12 1918 12 Missing Race 19412 96 19399 96 14661 87 (1410) Diagnosis

IHD 137569 680 137303 680 111005 656 110005 656 110000 656 101012 655

AMI 51840 256 51635 256 39338 233 39338 233 39013 233 35811 232

Angina Stroke

5792 64703

29 320

5788 64511

29 320

5316 58138

31 344

5316 58138

31 344

5285 57712

31 344

4731 53208

31 345

Isch str 23673 117 23543 117 21012 124 21012 124 20912 125 19371 126 Hem str 8097 40 8074 40 6544 39 6444 39 6492 39 5995 39

Missing covariates 11 records were missing sex 325 were missing age and 14661 records were missing race Race was imputed by random assignment based on block group of residencersquos race distribution for 13251 records (310 were also missing other covariates For approximately 1100 records with missing race geocoding was inadequate for assigning block group and 310 of those missing race were also missing additional covariates) One discharge can have one or more covariates missing (ie missing covariates are not mutually exclusive

38

Table 7 Income Quartile and Race Distributions for CVD Hospitalizations

Study Population

IHD n=101012

AMI n=35811

Angina n=4731

Stroke n=53208

Ischemic Stroke n=19371

Hemorrhagic Stroke n=5995

Income (Quartiles) Lowest Income 237 286 300 356 298 309 301 Second Income 257 254 265 237 256 256 252 Third Income 254 241 236 224 236 234 223 Highest Income 251 221 201 182 209 201 224

Total 1000 999 999 1000 1000

White 855 859 885 833 858 844 809 Black 82 64 62 102 92 108 110 Asian 26 12 12 11 12 14 19 Other 36 65 41 53 38 34 62

Study population income quartiles are based on census block median income Quartile cut-points were developed by weighting block group median income levels by block group population to identify the cut-points within the continuous income distribution After creating the cut-points the study population of each block group was assigned to the appropriate quartile category This resulted in the percentage of population in each quartile varying slightly from 25 because the entire populations of block groups were assigned to the same quartile

39

Table 8a Exposure Area Distributions and Rate Ratios for Ischemic Heart Disease Hospitalizations

Exposure Person-Years

(N=30219001) in

Ischemic Heart Disease Hospitalizations Total AMI Angina

No () No () RR (95 CI) No () RR (95 CI) No () RR (95 CI)

Adjusted for Age Sex Race Adjacent Block Group Yes 3269547 (108) 12327 (122) 133 (112-157) 4398 (123) 136 (114-163) 591 (125) 123 (106-142) No 26949454 (892) 88611 (878) 100 Reference 31380 (877) 100 Reference 4137 (875) 100 Reference Distance From River lt frac12 mile 2380477 (079) 9817 (97) 120 (103-140) 3399 (095) 126 (109-145) 542 (115) 160 (137-188) frac12 - 1 mile 2834956 (094) 11236 (111) 120 (104-140) 4048 (113) 125 (109-144) 610 (129) 156 (133-183) gt1 mile 25003569 (827) 79885 (791) 100 Reference 28331(792) 100 Reference 3576 (756) 100 Reference

Addrsquol adjustment for SES and other factors Adjacent Block Group Yes 3269547 (108) 12327 (122) 107 (104-111) 4398 (123) 106 (101-110) 591 (125) 106 (096-117) No 26949454 (892) 88611 (878) 100 Reference 31380 (877) 100 Reference 4137 (875) 100 Reference Distance From River lt frac12 mile 2380477 (079) 9817 (97) 102 (098-106) 3399 (095) 103 (098-109) 542 (115) 109 (098-121) frac12 - 1 mile 2834956 (094) 11236 (111) 103 (099-108) 4048 (113) 106 (101-111) 610 (129) 113 (102-125) gt 1 mile 25003569 (827) 79885 (791) 100 Reference 28331(792) 100 Reference 3576 (756) 100 Reference

Agegt25 1990-2005 50 sample Adjusted for age (25-34 35-44 45-54 55-64 65-74 75+ years) race (white black other) and sex (male female) Adjusted for SES factors quartiles of population density (61-5066 5066lt-21020 21020lt-64102 64102lt-943716 persons per square mile) median 1999 household income ($249900-4096800 $4096800lt-5465000 $5465000lt-7294400 $7294400lt-20000100) with less than a college education (0-306 306lt-427 427lt-555 555lt-1000) with Hispanic ethnicity (0-21 21lt-47 47lt- 96 96lt-860) and distance to nearest hospital

40

Table 8b Exposure Area Distributions and Rate Ratios for Stroke Hospitalizations Age gt= 25 1990-2005

Exposure Person-Years

(N=30219001) in Study Area

Stroke Hospitalizations

Total Ischemic Hemorrhagic

No () No () RR (95 CI) No () RR (95 CI) No () RR (95 CI)

Adjusted for Age Sex Race Adjacent Block Group Yes 3269547 (108) 6276 (118) 125 (109-144) 2334 (121) 128 (110-148) 635 (108) 099 (086-113)

No 26949454 (892) 46748 (882) 100 Reference 16989 (879) 100 Reference 5263 (892) 100 Reference

Distance From River lt frac12 mile 2380477 (079) 5280 (099) 116 (104-131) 1925 (100) 120 (105-137) 518 (088) 102 (088-117)

frac12 - 1 mile 2834956 (094) 6023 (114) 118 (106-132) 2227 (115) 127 (112-145) 658 (111) 115 (100-131)

gt1 mile 25003569 (827) 41721 (787) 100 Reference 15171 (785) 100 Reference 4722 (801) 100 Reference

Addl Adjustment for SES and Other Factors Adjacent Block Group Yes 3269547 (108) 6276 (118) 107 (103-112) 2334 (121) 110 (104-116) 635 (108) 097 (088 ndash 106)

No 26949454 (892) 46748 (882) 100 Reference 16989 (879) 100 Reference 5263 (892) 100 Reference Distance From River lt frac12 mile 2380477 (079) 5280 (099) 106 (101-110) 1925 (100) 108 (101-115) 518 (088) 089 (081-099)

frac12 - 1 mile 2834956 (094) 6023 (114) 106 (102-111) 2227 (115) 111 (104-117) 658 (111) 108 (099-119)

gt 1 mile 25003569 (827) 41721 (787) 100 Reference 15171 (785) 100 Reference 4722 (801) 100 Reference

Age gt 25 years 1990-200650 sample Adjusted for age (25-34 35-44 45-54 55-64 65-74 75+ years) race (white black other) and sex (male female) Adjusted for SES factors quartiles of population density (61-5066 5066lt-21020 21020lt-64102 64102lt-943716 persons per square mile) median 1999 household income ($249900-4096800 $4096800lt-5465000 $5465000lt-7294400 $7294400lt-20000100) with less than a college education (0-306 306lt-427 427lt-555 555lt-1000) with Hispanic ethnicity (0-21 21lt-47 47lt- 96 96lt-860) and distance to nearest hospital

41

Table 9 Rate Ratios for CVD Hospitalizations Stratified by Income Quartile

Rate Ratio for lt frac12 mile versus gt 1 mile from river

IHD AMI Angina Stroke Ischemic Stroke Hemorrhagic Stroke

RR (95CI) RR (95CI) RR (95CI) RR (95CI) RR (95CI) RR (95CI)

Adjusted for Age Sex Race Lowest Income 1 123 (108-140) 124 (110-139) 139 (120-160) 120 (109-133) 122 (107-138) 118 (103-137) Income 2 115 (101-131) 118 (101-139) 155 (124-195) 124 (113-135) 127 (109-147) --shyIncome 3 088 (075-103) 083 (068-102) 102 (076-137) 092 (079-107) 093 (076-113) --shyHighest Income 4 080 (064-099) 074 (054-103) 030 (013-070) 078 (065-093) 071 (053-095) 043 (024-076) Weighted average 100 098 091 102 100 071

Addrsquol Adjustment for other demographic factors Lowest Income 1 114 (107-120) 114 (106-123) 112 (097-130) 116 (109-124) 120 (110-131) 109 (095-125) Income 2 110 (102-118) 111 (101-123) 136 (108-172) 110 (101-119) 112 (099-128) 095 (076-119) Income 3 085 (077-094) 086 (075-097) 093 (068-126) 097 (087-108) 096 (082-112) 046 (033-066) Highest Income 4 068 (060-078) 069 (057-085) 028 (012-062) 069 (059-081) 058 (044-077) --shyWeighted average 092 093 079 096 093 078

To reduce the complexity of this Table only one of the distance categories is provided The other distance categories showed similar results Adjusted for age (25-34 35-44 45-54 55-64 65-74 75+ years) race (white black other) and sex (male female) Block group median income quartile ranges $249900-4096800 $4096800lt-5465000 $5465000lt-7294400 $7294400lt-20000100 Adjusted for demographic factors quartiles of population density (61-5066 5066lt-21020 21020lt-64102 64102lt-943716 persons per square mile) with less than a college education (0-306 306lt-427 427lt-555 555lt-1000) with Hispanic ethnicity (0-21 21lt-47 47lt- 96 96lt-860) and distance to nearest hospital

42

Table 10 Hospitalization Rate Estimates for Current Study and from Published Sources (age-adjusted to 2000 US standard per 10000)

Current Study 12 Counties abutting Hudson River 1990-2005

Total US CDC 2005

Northeastern US CDC 2005

SPARCS NYS excluding NYC 2005-2007

IHD 649 619 691

AMI 232 231 278

Angina 31 Stroke 349 303 307 351

Ischemic Stroke 127 39 Hemorrhagic Stroke

Source USDHHS 2005 National Hospital Discharge Survey No 385 July 12 2007 Source NYS DOH SPARCS 2009b Comparable estimates were not able to be provided because published outcome groupings differed

43

Figure 1 ZIP codes by Adjacency to the Hudson River

44

Figure 2 Census Block Groups by Adjacency to the Hudson River

45

Figure 3 Census Block Groups by Population-Weighted Centroid Distance to the Hudson River

46

IHD Hospitalization Rates by Sex Race Income (age-adjusted) (other race

group excluded)

000

200

400

600

800

1000

1200

1400

1600

Low est IncQ

Q2 Q3 Highest Inc Q

Income Quartile

Hos

pita

lizat

ions

per

10

00

w hite men black men

asian men w hite w omen

black w omen asian w omen

AMI Hospitalization Rates by Sex Race Income (age-adjusted) (other race

group excluded)

000

100

200

300

400

500

600

700

Low est IncQ

Q2 Q3 Highest Inc Q Income Quartile

Hos

pita

lizat

ions

per

10

00

w hite men black men

asian men w hite w omen

black w omen asian w omen

Stroke Hospitalization Rates by Sex Race Income (age-adjusted) (other

race group excluded)

000

100

200

300

400

500

600

700

800

900

Low est In Q

Q2 Q3 Highest Inc Q Income quartile

Hos

pita

lizat

ions

per

10

00w hite men black men

asian men w hite w omen

black w omen asian w omen

Figure 4 Figure 5 Figure 6

47

MAURICE D HINCHEY 22NO DISTRICT NEW YORK

COMMITTEE ON APPROPRIATIONS

~lIC()foltIIInas

AGRICUJTlAE RURAL OEElOPWNT rooo AND ORUG ADMINISTRATION

AND RBATEO AGEfjC~S

INTERIOR

~onlJltss of tl)t ~nttn ~tatt5 ~oU5t of ItpttUtlltl1tibeg lHaJington 111laquo 20515-3222

October 4 2006

Howard Frumkin MD DrPH Director Agency for Toxic Substances and

Disease Registty and CDCs National Center for Environmental Health

1600 Clifton Road NE MSE-28 Atlanta GA 30333

Dear Dr Frumkin

First I would like to congratulate you on your appoinbnent as Director of A TSDR and CDCs National Center for Environmental Health llook forward to the opportunity to work with you on important public health issues this year and in the future

My second reason for writing today is to request that ATSDR assist my office in conducting a health consultation study for residents along a 200 mile segment of the Hudson River a Superfund site on the National Priority List containing PCBs in order to measure whether further evaluation may be warranted

As you may know Dr David Carpenter has done a number of studies at SUNY Albany in New York that suggest that resideilcy in proximity to hazardous waste sites with Persistant Organic Pollutants (POPs) may be associated with increased hospitalization rates for Coronary Heart Disease (CHD) when compared to hospitalization rates tor areas with no waste sites or other types of waste not catagorized as POPs I have enclosed a copy of three of these research papers that demonstrate this correlation for heart disease hypertension and stroke

I am very concerned about the health of the residents living along the Hudson River and respectfully request that your agency look into what seems to be a very serious health threat For purposes of my request T would appreciate your agency looking into the subset population Dr Carpenter examined in 78 zip codes along the 200 mile stretch of the Hudson River from Hudson Falls to New York City This subset population studied by Dr Carpenter showed a statistically significant increase in hospital discharges for heart disease hypertension and stroke Interestingly according to CDCs data base the Behavioral Risk Factor Surveillance System this population on average has higher incomes exercises more frequently and conswnes more fruits and vegetables than the average New Yorker Additionally this population has more nonsmokers than the rest of Upstate New Yorks population

WASHINGTON OIFCE

2411 RAYBURN HOUSE OfFICE BUILDING WASH1NcrroN DC 20amp1amp-3222

(Zl2) 225-6335

_fOMetoYht lnchIY

BlNGI-UMTON OfRCE m-wA OfFICE KIGSTCN OFFIepound MIDDLnWJN OffICE MOJmCEllO OffiCE IDDA IfOCML lUllDlNG 123 SOUTH CAYJOA smra U l WAll

m STREET em HALL THlflD flOOR 18 ANWANf LAKf CAD

RIIrlGHAtATON NY 13901 SUITE 201 KINGSTOfl 12401 te JUpoundS STilEET MONTlCfllO NY 12701 (6171773-2768 ITHACA HY 14860 (M5)3~ MlDgtlETOWN NY 109(0 (amptS) 7S1-71e

(607) 273-~388 (845) l4C-3l11

Appendix A Representative Hinchey Letter

Dr Howard Frumkin Page 2 October 4 2006

Since this type of population would typically exhibit better than average cardiac health Im sme you can understand and appreciate my concern and sense of urgency that we must follow up on Dr Carpenters good work and detcnnine what further study is warranted to protect the health of the populatioD living and working aloDg the Hudson River

I than1c you for yom kind attention to my request I look forward to hearing back from you at your earliest convenience

Best regards

Sincerely

~ane~ey Member of Congress

MHlag

49

Appendix B Table 1a IHD Hospitalization Rate Ratios agesexrace adjusted versus additionally adjusted For residents (age 25 and older) of block groups closer versus more distant from the Hudson River in models adjusting for sex age race and block group median household income population density education level percent Hispanic and block group centroid distance to nearest hospital 1990-2005

Adjusted Rate Ratios by Block Group River Proximity (95 Confidence Intervals) IHD (N=101012) AMI (N=35811) Angina (N=4731)

Risk Factor Agesexrace adjusted

Addrsquol adjustment Agesexrace adjusted

Addrsquol adjustment Agesexrace adjusted

Addrsquol adjustment

Block group distance from River

lt frac12 mile vs gt 1 mile 120 (103-140) 102 (098-106) 126 (109-145) 103 (098-109) 160 (137-188) 109 (098-121) frac12 - 1 mile vs gt 1 mile 120 (104-140) 103 (099-107) 125 (109-144) 106 (101-111) 156 (133-183) 113 (102-125)

Sex Male vs female 197 (174-223) 214 (208-220) 211 (187-238) 209 (202-215) 113 (099-129) 113 (106-121) Race Black vs white 120 (102-142) 093 (089-097) 109 (094-127) 092 (087-097) 178 (152-208) 121 (108-136)

AsianPacific Is ldquo 080 (066-096) 073 (069-079) 085 (070-103) 072 (065-081) 075 (055-102) 065 (049-087) Other ldquo 246 (208-291) 377 (363-392) 189 (162-220) 204 (191-218) 257 (215-307) 205 (177-238)

Age 35-44 vs 25-34 809 (630-104) 824 (743-913) 647 (494-849) 658 (568-762) 491 (348-692) 498 (376-660) 45-54 ldquo 333 (261-426) 344 (312-380) 228 (175-296) 230 (200-265) 1539 (1108-214) 157 (120-206) 55-64 ldquo 827 (647-1057) 802 (727-885) 576 (443-748) 516 (449-593) 278 (200-386) 284 (217-371) 65-74 ldquo 1477 (1155-1890) 1384 (1255-1527) 1067 (820-1387) 899 (782-1033) 409 (294-567) 449 (344-586) 75+ ldquo 2124 (1656-2724) 1942 (1760-2142) 2190 (1683-2849) 1849 (1610-2123) 661 (476-919) 695 (533-905)

Population density

2nd lowest vs least dense

126 (121-132) 131 (124-137) 142 (127-159)

more dense ldquo 130 (124-136) 134 (127-141) 131 (116-147) most dense ldquo 120 (114-126) 121 (114-129) 115 (100-131)

Median Household Income

2nd highest vs highest income

105 (101-110) 113 (108-119) 123 (110-137)

lower ldquo 109 (104-113) 123 (117-130) 125 (111-141) lowest ldquo 116 (111-122) 129 (122-137) 162 (142-185)

with less than college

2nd lowest vs least college

114 (109-118) 118 (113-124) 110 (098-122)

higher ldquo 122 (117-127) 126 (119-132) 115(102-129) highest

ldquo

122 (116-128) 128 (120-135) 129 (113-148)

Hispanic 2nd lowest vs least Hispanic

093 (089-097) 094 (0890-098) 101 (091-112)

higher ldquo 092 (088-096) 096 (092-101) 118 (107-131) highest

ldquo

085 (081-088) 088 (084-093) 138 (123-154)

Distance to Hospital

2nd closest vs closest 097 (094-100) 094 (090-098) 107 (098-118) more distant ldquo 090 (087-094) 093 (089-098) 096 (086-107) furthest ldquo 087 (083-091) 091 (087-096) 089 (079-101)

50

Appendix B Table 1b Stroke Hospitalization Rate Ratios agesexrace adjusted versus additionally adjusted For residents (age 25 and older) of block groups closer versus more distant from the Hudson River in models adjusting for sex age race and block group median household income population density education level percent Hispanic and block group centroid distance to nearest hospital 1990-2005

Adjusted Rate Ratios by Block Group River Proximity (95 Confidence Intervals) Total Stroke (N=53208) Ischemic Stroke (N=19371) Hemorrhagic Stroke (N=5995)

Risk Factor Agesexrace adjusted

Addrsquol adjustment Agesexrace adjusted

Addrsquol adjustment Agesexrace adjusted

Addrsquol adjustment

Block group distance from River

lt frac12 mile vs gt 1 mile 116 (104-131) 106 (101-110) 120 (105-137) 108 (101-115) 102 (088-117) 089 (081-099) frac12 - 1 mile vs gt 1 mile 118 (106-132) 106 (102-111) 127 (112-145) 111 (104-117) 115 (100-131) 108 (099-119)

Sex Male vs female 114 (104-125) 127 (123-130) 126 (113-141) 131 (126-136) 114 (103-127) 123 (116-130) Race Black vs white 194 (173-219) 153 (147-160) 233 (204-267) 191 (180-203) 215 (188-246) 190 (173-209)

AsianPacific Isl ldquo 097 (083-113) 088 (080-096) 114 (094-138) 101 (088-116) 142 (114-177) 129 (106-157) Other ldquo 230 (203-260) 232 (219-245) 229 (197-265) 212 (193-232) 360 (310-417) 334 (296-377)

Age 35-44 vs 25-34 309 (251-379) 303 (269-343) 359 (267-483) 362 (289-455) 274 (212-353) 268 (219-329) 45-54 ldquo 114 (938-139) 1087 (970-1217) 149 (113-197) 135 (109-167) 655 (513-836) 605 (500-734) 55-64 ldquo 331 (273-402) 329 (295-368) 433 (329-569) 418 (340-514) 119 (933-152) 109 (904-132) 65-74 ldquo 826 (682-1001 861 (772-959) 1145 (872-1504) 1114 (908-1367) 214 (168-272) 224 (186-270) 75+ ldquo 1890 (1559-2290) 2037 (1828-2269) 2768 (2112-3627) 3060 (2497-3750) 554 (438-700) 587 (490-703)

Population density

2nd lowest vs least dense

126 (120-132) 124 (117-132) 129 (117-142)

more dense ldquo 126 (120-132) 121 (113-129) 142 (128-157) most dense ldquo 124 (118-132) 128 (119-138) 150 (133-169)

Median Household Income

2nd highest vs highest income

106 (101-110) 110 (103-117) 099 (090-108)

lower ldquo 113 (108-118) 113 (106-121) 107 (097-118) lowest ldquo 116 (110-122) 117 (108-126) 114 (102-127)

with less than college

2nd lowest vs least college

111 (107-116) 112 (106-119) 098 (090-107)

higher ldquo 120 (114-125) 120 (113-128) 102 (093-112) highest ldquo 116 (110-122) 119 (110-128) 093 (090-108)

Hispanic 2nd lowest vs least Hispanic

096 (092-100) 094 (088-099) 084 (077-091)

higher ldquo 101 (096-105) 096 (091-102) 084 (077-091) highest ldquo 096 (092-100) 088 (082-093) 078 (070-085)

Distance to Hospital

2nd closest vs closest 095 (091-098) 092 (087-097) 097 (089-105) more distant ldquo 092 (088-096) 091 (086-097) 089 (081-097) furthest ldquo 085 (081-090) 082 (077-088) 088 (079-098)

51

Appendix B Table 2 Sex and Race-Specific Age-Adjusted IHD AMI and Stroke Rates by Income (per 1000)

Men Women white black asian other white black asian other

IHD Lowest IncQ 470 401 492 940 283 354 477 543 Q2 425 339 675 1443 221 236 364 690 Q3 436 301 420 2463 199 233 236 902 Highest Inc Q 399 325 218 4307 160 172 120 1247 AMI Lowest IncQ 180 154 203 269 101 112 221 159 Q2 159 141 300 306 085 088 138 179 Q3 152 114 139 480 078 084 096 214 Highest Inc Q 135 130 046 820 062 065 052 275 Stroke Lowest IncQ 201 292 330 370 166 281 343 261 Q2 182 254 389 466 150 247 274 373 Q3 192 238 231 598 145 245 201 401 Highest Inc Q 177 275 102 905 132 197 070 776

These rates are estimated from the 50 sample of hospitalizations in the 12-county study area included in this investigation The rates are adjusted to the 2000 standard US population

52

CERTIFICATION

The health consultation for the Hudson River PCBs site Residential Proximity to the Hudson River and Hospitalization Rates for Ischemic Heart Disease and Stroke 1990shy2005 was prepared by the New York State Department of Health under a cooperative agreement with the Agency for Toxic Substances and Disease Registry (ATSDR) It is in accordance with approved methodology and procedures existing at the time the health consultation was initiated Editorial review was completed by the cooperative agreement partner

The Division of Health Assessment and Consultation (DHAC) ATSDR has reviewed this public health assessment and concurs with its findings

ATSDR

Page 2: HR CVD Final HCl 2011

Health Consultation A Note of Explanation

A health consultation is a verbal or written response from ATSDR or ATSDRrsquos Cooperative Agreement Partners to a specific request for information about health risks related to a specific site a chemical release or the presence of hazardous material In order to prevent or mitigate exposures a consultation may lead to specific actions such as restricting use of or replacing water supplies intensifying environmental sampling restricting site access or removing the contaminated material

In addition consultations may recommend additional public health actions such as conducting health surveillance activities to evaluate exposure or trends in adverse health outcomes conducting biological indicators of exposure studies to assess exposure and providing health education for health care providers and community members This concludes the health consultation process for this site unless additional information is obtained by ATSDR or ATSDRrsquos Cooperative Agreement Partner which in the Agencyrsquos opinion indicates a need to revise or append the conclusions previously issued

You May Contact ATSDR Toll Free at 1-800-CDC-INFO

or Visit our Home Page at httpwwwatsdrcdcgov

HEALTH CONSULTATION

HUDSON RIVER PCBs

RESIDENTIAL PROXIMITY TO THE HUDSON RIVER AND HOSPITALIZATION RATES FOR

ISCHEMIC HEART DISEASE AND STROKE 1990-2005

Westchester Rockland Putnam Orange Dutchess Ulster Columbia Greene Rensselaer Albany Washington and Saratoga Counties

New York

EPA FACILITY ID NYD980763841

Prepared By

The New York State Department of Health Center for Environmental Health

Troy New York Under cooperative agreement with

The US Department of Health and Human Services Agency for Toxic Substances and Disease Registry

Division of Health Assessment and Consultation Atlanta Georgia

For additional information about this document you may contact the

New York State Department of Health Center for Environmental Health

547 River Street Flanigan Square Room 300 Troy New York 12180-2216

1-518-402-7950 or

1-800-458-1158

E-mail beoehealthstatenyus

Table of Contents

Executive Summary helliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphellipii

1 Introduction helliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphellip 1

2 Background helliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphellip 3 Cardiovascular Disease Hudson River PCB Contamination

` POPs and PCBs PCBs and Cardiovascular Disease Health Outcomes

3 Follow-up Study Data Sources and Methodshelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphellip8 Study Area and Exposure Areas

Hospitalization Data Statistical Analysis

4 Results helliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphellip 13 Demographics Associations with Proximity to the River

Statistical Analysis Proximity to the River and CVD Hospitalization Risk Summary of Results

5 Discussion helliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphellip 19

6 Conclusion and Recommendations helliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphellip25

7 Agency Information (Preparers of the Report) helliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphellip 27

8 References helliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphellip 28

9 Tables helliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphellip 33

10 Figures helliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphellip 44

11 Appendices helliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphellip 48

EXECUTIVE SUMMARY

Background In October 2006 Member of Congress Maurice Hinchey requested that ATSDR conduct a health consultation to assess cardiovascular disease among people residing near the Hudson River A 200-mile segment of the Hudson River from Hudson Falls south to New York City is a National Priorities List (NPL) (Superfund) site due to contamination of the river with polychlorinated biphenyls (PCBs) from GE sites in Hudson Falls and Ft Edward In support of his request Member of Congress Hinchey cited three studies of cardiovascular hospitalizations conducted with similar data and methods These 2004-2005 statewide studies suggest residents of ZIP codes containing hazardous waste sites with persistent organic pollutants (POPs) such as PCBs have elevated risk for hospitalizations for cardiovascular disease including ischemic heart disease stroke and hypertension Analyses conducted separately for ZIP codes next to the Hudson River suggest residents of these specific ZIP codes also have increased rates of hospitalization for cardiovascular disease

In response to Congressman Hincheyrsquos request for follow-up of the 2004-2005 findings for the Hudson River area ATSDR and NYS DOH agreed to conduct a similar group-level study but at a different level of geography the Census block group By using a level of geography that is generally smaller in area and population than a ZIP code this follow-up seeks to more accurately adjust for socio-economic and other factors that affect cardiovascular hospitalizations and to improve upon the geographic classification of relative distance to the Hudson River The follow-up investigationrsquos analyses are restricted to the twelve counties abutting the Hudson River from Washington County south to Westchester County Within the study arearsquos twelve counties cardiovascular hospitalization rates in areas closest to the Hudson River are compared with rates in areas farther from the river

Objectives The follow-up investigationrsquos specific objectives are to increase the studyrsquos ability to control for confounding by socio-economic status through the

use of block groups rather than ZIP codes as the unit of analysis compare hospitalization rates among residents of block groups nearest the Hudson River to

those among residents of block groups farther from the Hudson River and compare the findings of this review to those of the earlier studies

Methods This study examines CVD hospitalization rates and residence near the Hudson River in the 12 counties abutting the Hudson River below Hudson Falls from 1990 through 2005 The hospitalization data for ischemic heart disease and stroke were acquired from the NYS DOH Statewide Planning and Research Cooperative System (SPARCS) established in 1979 to collect detailed records on discharges from hospitals located in New York State Statewide US Census data at the block group level from 1990 and 2000 were used to calculate the number of people in each race sex and age group for the study years

Classification of the population into potential exposure categories was based exclusively on location relative to the Hudson River Two types of exposure proxy categories were developed block group adjacency and block group centroid distance to the river shore For the adjacency categories the hospitalizations were assigned to the adjacent category if the residential address was in a block group directly abutting the Hudson River For the distance categories block

ii

group centroid (population-weighted) distance to the river shore was used to assign hospitalizations in each block group to one of three categories less than one-half mile from the Hudson River (closest) between one-half to one mile (close) from the river and farther than one mile from the river (distant) Multivariable regression analyses were conducted to compare rates of hospitalizations among the exposure categories assigned at the block group level These analyses controlled for age sex and race using individual-level data and population density percent Hispanic median income education and distance to the nearest hospital at the block group level

Results The use of block groups resulted in 2371 block groups as the units of analysis for the study rather than the 368 ZIP codes in the 12-County study area The average population per block group in the study area is 1200 compared to average population of 8000 per ZIP code The smaller geographical area and smaller populations in block groups contributed to improving the accuracy of the assignment of income and other group-level demographics to the hospitalizations included in the study and enhancing control for confounding by socio-economic status (Tables 8a and 8b)

The studyrsquos multivariable analyses showed relatively small but statistically significant elevations of CVD hospitalization risk for residents of block groups categorized as adjacent in close and closest proximity to the Hudson River There is no evidence that residents of the closest block groups (less than frac12 mile) had higher hospitalization rates than residents of the close block groups (between frac12 to 1 mile) If there were a strong association between exposures from living near the river and the risk of CVD hospitalization such a ldquodose-responserdquo might have been shown by the analyses

The data showed strong statistically significant associations between CVD risk and socioshyeconomic status To enhance the control for socio-economic status separate analyses for each income quartile were conducted These stratified analyses showed that residence in block groups near (close closest or adjacent to) the Hudson River was associated with increased risk for CVD hospitalizations among residents of lower income block groups Conversely residence in block groups near the Hudson River was associated with decreased risk for CVD hospitalizations among residents of higher income block groups In other words in analyses that held income constant by looking only within the lower or higher income populations living near the river shows opposite effects depending upon which income group is considered The estimated increased and decreased risks are of sufficient magnitude to cancel each other out when average rate ratios for the four income quartiles are calculated The average rate ratios can be interpreted as showing no evidence that hospitalizations among the general population are elevated for residents living near the river

This type of group-level statistical analysis is not able to precisely identify the reason for these differing results However the strength and direction of associations among variables in the statistical models can be evaluated in light of known risk factors to suggest likely explanations In this case socio-economic status appears to be the most likely explanation for the findings The strong association between socio-economic status and residential location (the indicator of potential exposure) and between socio-economic status and CVD hospitalization risk makes it likely that the small but statistically significant elevations shown in the multivariate models

iii

(Tables 8a amp 8b) are the result of the inability to completely control for socio-economic statusrsquo effects on CVD hospitalization risk The strong effect of socio-economic status on residential location (exposure proxy) and on CVD hospitalization risk makes it likely that these overall multivariable regression results are affected by uncontrolled ie residual confounding by socioshyeconomic status

Because of the similarities in study design and statistical analysis the prior studyrsquos findings can be compared to some degree with the findings of the multivariate regression analyses in the current study The major difference between the studies is the composition of the comparison areas for the analyses Because of this difference as well as other differences described in the report the studies are not entirely comparable

CVD hospitalization rate ratios estimated for living in block groups adjacent closest or close to the river in the current study are substantially lower than the rate ratios for residence in ZIP codes adjacent to the Hudson River in the previously published studies In most cases the current analyses show elevations below 10 in contrast to elevations of 20 to 39 in the prior studies The results of the current study are quite similar to the prior study results in terms of the associations reported for age sex race and income These similarities and the general methodological similarities between the current and prior studies suggest the likelihood that the prior studiesrsquo findings also were affected by confounding by socio-economic status

Conclusions and Recommendations While this type of study can neither prove nor disprove causal links among risk factors and health effects it is useful for examining evidence for unusual patterns that might warrant additional investigation This evaluation of hospitalization data residential location and socio-economic factors in block groups in the 12 Counties abutting the Hudson River revealed a closely coinciding pattern of lower socio-economic status and residential proximity to the Hudson River the proxy for potential exposure and a strong association of lower socio-economic status and risk for CVD hospitalizations The strong associations among residential location socio-economic status and CVD risk prevent us from drawing definitive conclusions from the multivariable regression analyses that showed modest but statistically significant elevations of CVD hospitalization risk for residents of block groups near the river

To draw more definitive conclusions particularly for outcomes such as CVD that are determined by multiple known risk factors studies seeking to investigate the role of PCB or POP exposures will require resource-intensive methods that include gathering individual-level exposure or biomonitoring information as well as medical histories and information on other risk factors

This current investigation did not include environmental or biological sampling data One study did gather such data in the Hudson River region with the highest potential exposures (Hudson Falls Fort Edward) This study detected a statistically significant reduction in the total PCB concentrations in outdoor air samples at a distance of 1200 meters (34 mile) from the river However the PCB levels in outdoor air in this area were lower than those in other communities with known PCB-contaminated sites and similar to levels reported in other locations in the northeastern United States The study found no detectable differences in study subjectsrsquo serum

iv

PCB levels associated with proximity to contaminated portions of the river or other PCB sources (Fitzgerald 2007 Palmer 2008)

It is recommended that the results of the current investigation of CVD hospitalization rates and residence near the Hudson River be shared with NYS residents and other stakeholders with interest in this issue and be submitted for publication in a peer-reviewed journal

v

1 INTRODUCTION

The New York State Department of Health (NYS DOH) has a cooperative agreement with the Agency for Toxic Substances and Disease Registry (ATSDR) to perform environmental exposure and health assessments conduct health statistics reviews and perform epidemiological studies of populations in New York State which may have been exposed to environmental contaminants In October 2006 Member of Congress Maurice Hinchey requested that ATSDR conduct a health consultation to assess cardiovascular disease among people residing near the Hudson River A 200-mile segment of the Hudson River from Hudson Falls south to New York City is a National Priority List (NPL) (Superfund) site due to contamination of the river with polychlorinated biphenyls (PCBs) from GE sites in Hudson Falls and Ft Edward (See Appendix A for Member of Congress Hincheyrsquos letter)

In support of his request Member of Congress Hinchey cited three studies (Sergeev and Carpenter 2005 [ischemic heart disease] Shcherbatykh et al 2005 [stroke] Huang et al 2006 [hypertension]) These statewide studies suggest residents of ZIP codes containing hazardous waste sites with persistent organic pollutants (POPs) such as PCBs have elevated risk for hospitalizations for cardiovascular disease including ischemic heart disease stroke and hypertension Analyses conducted separately for ZIP codes next to the Hudson River suggest residents of these specific ZIP codes also have increased rates of hospitalization for cardiovascular disease

These studies published in 2005-2006 are ecological studies In an ecological study data are analyzed by evaluating risk factors and health outcomes among groups For example rates of disease are compared for populations of ZIP codes with varying median income levels to see if income levels appear to be related to disease outcome rates In these previously conducted studies rates of hospitalization for residents of ZIP codes with hazardous waste sites containing POPs were compared with hospitalization rates for residents of ZIP codes without such waste sites to look for evidence that potential exposures might be impacting health outcomes The studies report statistically significant elevations of cardiovascular disease hospitalizations in ZIP codes with POPs For coronary heart disease and a subset acute myocardial infarction the study showed elevations of approximately 15 and 20 respectively (Sergeev and Carpenter 2005) for stroke a 15 elevation (Shcherbatykh et al 2005) and for hypertension a 19 elevation (Huang et al 2006) in ZIP codes containing waste sites with POPs compared with hospitalization rates in other New York State ZIP codes (excluding NYC) that did not contain waste sites with POPs (Table 1)

In sub-analyses in these studies cardiovascular hospitalization rates for ZIP codes adjacent to the Hudson River were compared with cardiovascular hospitalization rates for ZIP codes in the rest of the state (excluding NYC) identified as not having inactive hazardous waste sites with POPs These sub-analyses showed statistically significantly elevated hospitalization rates of 36 and 39 for coronary heart disease and the subset acute myocardial infarction a 20 elevation for stroke and a 14 elevation for hypertension (Table 1) The Sergeev and Carpenter Shcherbatykh et al and Huang et al articles state that the Hudson River ZIP codes are in an area where people have higher income smoke less exercise more and have healthier diets This information is cited in these articles as evidence that known risk factors related to lower socioshy

1

economic status are likely not the cause for the elevated hospitalization rates in these Hudson River area ZIP codes

In these previously published reports Behavioral Risk Factor Surveillance System (BRFSS) data for entire counties are provided to support statements about the population of Hudson River ZIP codes having higher income and healthier lifestyles However because the ZIP codes adjacent to the Hudson River include only a proportion of the population of the counties abutting the river these county-level BRFSS data may be misleading regarding characteristics of the ZIP codes adjacent to the river (Figure 1) In fact data from the US Census show that the ZIP codes adjacent to the river have lower median income than the counties within which they are located The median income for the total population of the twelve counties adjacent to the Hudson River was $61010 in 2000 while the median income for the ZIP codes adjacent to the river was $45356 This median income for the population of the ZIP codes adjacent to the river is also slightly lower than the statewide (excluding NYC) median income of $47735 (US Census 2000) Thus the Hudson River ZIP codes analyzed in the Sergeev and Carpenter Shcherbatykh et al and Huang et al studies include a population of lower not higher socio-economic status compared to the rest of NYS (excluding NYC)

Because CVD and many CVD risk factors are associated with lower socio-economic status it is especially important to adjust as much as possible for socio-economic status when conducting comparative analyses of CVD hospitalizations or disease incidence or prevalence Regarding the hypothesis about POPs and CVD research studies described in more detail below have suggested possible biological mechanisms for POP exposuresrsquo effects on the heart There is very little research on human exposures to POPs and potential cardiovascular effects however

In response to Congressman Hincheyrsquos request for follow-up of the 2004-2005 findings ATSDR and NYS DOH agreed to conduct a similar study also ecological but at a different level of geography the Census block group By using a level of geography that is generally smaller in area and population than a ZIP code this follow-up seeks to more accurately adjust for socioshyeconomic and other factors that affect cardiovascular hospitalizations and to improve upon the geographic classification of relative distance to the Hudson River In addition rather than comparing hospitalizations among residents of Hudson River area ZIP codes to hospitalizations among residents of all other NYS (exclusive of NYC) ZIP codes with no waste sites with POPs this follow-up investigationrsquos analyses are restricted to the twelve counties abutting the Hudson river from Washington County south to Westchester County Within the study arearsquos twelve counties cardiovascular hospitalization rates in areas closest to the Hudson River are compared with rates in areas farther from the river This more localized approach may also assist with reducing the potential impact of regional differences in medical care practice or reporting factors that could affect the hospitalization data (Ko et al 2007) (Table 2 shows a more detailed listing of the methods used in the prior studies and follow-up study)

Regarding environmental exposures it is important to note that living near a hazardous waste site that contains PCBs or other POPs does not necessarily mean that human exposure from the site has occurred Depending upon the conditions at a site or the specific chemicals exposures may be highly improbable Information on body burden for example PCB levels in blood or on levels of the chemical in off-site air dust or soil is needed to support a claim of exposure The

2

studies published in 2005-2006 did not provide specific information such as environmental or biomonitoring data in support of the assumption that residence in ZIP codes containing hazardous waste sites with POPs or residence in ZIP codes adjacent to the Hudson River resulted in increased exposures to POPs in the population For the purpose of this follow-up we did not conduct environmental or human biological sampling to address this issue Rather this follow-up replicates the prior studiesrsquo approach with some enhancements to better control for confounding particularly confounding by socio-economic status

This descriptive epidemiologic investigation uses already existing hospitalization data from the Statewide Planning and Research Cooperative System and population data from the United States Census to evaluate comparative rates of cardiovascular hospitalizations While this type of investigation can not show proof of cause and effect it can assist in interpreting relationships among indicators of CVD burden and hypothesized exposures associated with distance from the Hudson River This descriptive investigation may also assist in determining whether additional health investigations of these associations are warranted

2 BACKGROUND

Cardiovascular Disease

The widely known risk factors for cardiovascular disease (CVD) in addition to age sex race and family history include tobacco use high blood pressure high serum cholesterol alcohol use obesity low fruit and vegetable intake physical inactivity diabetes low socio-economic status mental ill-health psychosocial stress and the use of certain medications (WHO 2010 NYS DOH 2009 Wing 1988 NYS DOH 2009a)

CVD Definitions CVD is a general term for any disease of the circulatory system The two main types of CVD are ischemic heart disease also termed coronary heart disease and cerebrovascular disease also called stroke Ischemic heart disease (IHD) results from restricted blood flow through the arteries supplying the heart muscle The most serious risk from IHD is acute myocardial infarction (AMI) another term for heart attack AMI occurs when blood flow to the heart is suddenly reduced or stopped usually due to a blockage in a coronary artery Angina is severe constricting pain in the chest often radiating from the area immediately over the heart to the left shoulder and down the arm Cerebrovascular disease refers primarily to stroke the sudden impairment of brain function resulting from interruption of circulation to the brain Ischemic stroke results from blockage of an artery supplying blood to the brain Hemorrhagic stroke occurs due to the escape of blood from a ruptured artery supplying blood to the brain

CVD Burden and Trends CVD is the leading cause of death for men and women of all ages in the United States and in New York State In 1999 for every death from IHD there were five hospitalizations for ischemic disease in NYS (NYS DOH 2009a) Mortality rates from CVD have been declining steadily since the 1980rsquos Research studies estimate that approximately half of the decline is due to improved treatments for CVD and its risk factors such as high cholesterol and high blood pressure and approximately half is due to decreasing incidence of CVD The decrease in incidence is due to trends in health behaviors primarily reduced smoking rates

3

CVD mortality rates do not appear to be declining for all segments of the population however and may actually be increasing among some groups particularly people of lower socio-economic status (Barnett et al 1999) This may be due in part to higher rates of smoking among people of lower socio-economic status as well as to other factors related to socio-economic status and race A growing literature points to multiple causes for CVD and other health disparities including health practices psychosocial stress limited resources discrimination and access to health care (Health United States 2007) Increases in the prevalence of diabetes and obesity in the general population partially offset advances in treatment and favorable trends for other risk factors An increasing ischemic heart disease mortality rate in women ages 35 to 54 since 1997 is attributed to these unfavorable risk factor trends (Ford 2007 Lloyd-Jones 2009) New York State (NYS) and other northeast states have high IHD and low stroke mortality with NYS having the highest IHD mortality and lowest stroke mortality among the 50 states (Howard et al 2009)

CVD Data Sources CVD mortality rates are more easily measured than incidence prevalence or hospitalizations because mortality data from vital records are a comprehensive and relatively accessible data source Information for estimating the incidence and prevalence of CVD outcomes comes primarily from health interview physical examination and record surveys Hospitalization data are available for NYS and the US but are generally collected and analyzed in order to track trends in hospital care and resources expended Actual CVD incidence data validated by physician diagnosis physician examination or reviews of medical records are not directly available for the entire US or for specific geographic areas of NYS Such information is available from population-based cohort studies that follow people over time in selected communities While the concern underlying this research project is to understand CVD levels in the community ie incidence and prevalence we do not have data that directly provide us with this information In the absence of such data hospitalizations are frequently used as indicators of the burden of disease in communities particularly in surveillance or hypothesis-generating investigations such as this one In NYS the Behavioral Risk Factor Surveillance System (BRFSS) an ongoing monthly telephone survey of adults aged 18 years and older provides information about the burden of CVD by age sex race income and education

CVD Risk Factor Data The BRFSS data for NYS show that over 20 of the population aged 65 and older reported having been diagnosed with CVD versus less than five percent of the population under 65 Males were slightly more likely to report having CVD than females Nonshyhispanic whites compared to other ethnic groups combined were slightly more likely to report having CVD None of these slight differences were statistically significant however The only statistically significant demographic risk factors in the BRFSS data were low income and education levels Statistically significant elevations of reported CVD were observed for respondents with incomes below $25000 and respondents with a high school degree or less education (NYS DOH 2009a)

Another factor that can affect the outcome measure hospitalizations is the likelihood of mortality prior to reaching the hospital Studies show a variety of factors affect whether an individual experiencing a heart attack or stroke outside the hospital survives to be counted in hospitalization data These factors include lack of knowledge about personal risk or prior disease socio-economic status race marital status and distance to the hospital (Galea 2007

4

Barnett 2006 Ayala 2003) Population-based studies report sudden cardiac death usually outside the hospital to be the first overt manifestation of heart disease for 40 to 60 of all cases ( Kannel 1987 deVreede-Swagemakers 1997 UK Heart Attack Study 1998) An analysis of out-of-hospital mortality from AMI in Pennsylvania in 1998 showed that in relatively rural areas where travel time to the hospital was 25 minutes or longer 72 of all AMI deaths occurred outside the hospital compared with a statewide average of 49 of AMI deaths occurring outside of the hospital (OrsquoNeill 2003) In New York State 55 of IHD deaths occur outside the hospital compared to 35 of stroke deaths (NYS DOH 2009b)

Hudson River PCB Contamination

During an approximate 30-year period ending in 1977 the General Electric (GE) plants in Hudson Falls and Fort Edward used PCBs in the manufacture of electrical capacitors and discharged large quantities of PCBs into the upper Hudson River Estimates of the total direct discharges to the river are as high as 1330000 pounds In 1983 the US EPA classified a 200shymile section of the Hudson River from Hudson Falls to New York City as a National Priority List site making it one of the largest Superfund sites in the United States In 2002 the US EPA issued a Record of Decision calling for targeted environmental dredging and removal of PCB-contaminated sediment in the upper River (USEPA 2002) The dredging began in May 2009

The Superfund environmental investigations and planned remedial actions have focused on the areas of the river near Hudson Falls and Fort Edward where the highest levels of PCBs in sediments are located The 2002 Record of Decision provides information about the human health risk assessment conducted for the Upper and Mid-Hudson and the ecological risk assessment conducted for the Upper and Lower Hudson River (USEPA 2002) Fish contamination and fish consumption are the primary concerns at the site

In 1976 fishing was banned in the upper river and in 1995 a catch and release policy replaced the ban NYS DOH continues to recommend people eat no fish from the Upper Hudson River that children under age 15 and women of childbearing age eat no fish from the entire 200 mile stretch of river below Hudson Falls and that the general population eat none of most species of fish caught between the federal dam at Troy and Catskill approximately 40 miles south of Troy The Lower Hudson remains closed to commercial fishing for striped bass and eight other species

The fishing bans and advisories were issued because ingestion of fish is the major potential pathway for exposure for the general population Many previous studies have shown relationships between consumption of fish from contaminated waters and human PCB levels Another potential pathway for exposure less frequently studied to date is inhalation of PCBs NYS DOH and ATSDR launched ldquoThe Hudson River Communities Projectrdquo in 2000 to address this issue (More information on this project is provided below)

POPS and PCBs

The category ldquopersistent organic pollutantsrdquo (POPs) includes man-made chemicals that are by definition persistent in the environment Examples of POPs are polychlorinated biphenyls

5

(PCBs) polychlorinated dibenzo-p-dioxins (PCDDs) polychorinated dibenso-furans (PCDFs) and pprsquo-diphenyldichloroethene (DDE) a breakdown product of the pesticide dichlorodiphenyltrichloroethane (DDT) These chemicals are stored in human fat tissues accumulating over an individualrsquos lifetime They also accumulate in fish and other species thereby concentrating up the food chain Levels of most POPs measured in blood serum of Americans have decreased in the last few decades An exception is polybrominated diphenyl ethers (PBDEs) levels of which have been increasing in recent years While levels of many POPs are declining a large number continue to be detectable in blood for most of the US population (USDHHS 2009)

This follow-up investigation focuses on the Hudson River where the POPs of concern are PCBs PCBs are a group of synthetic chlorinated compounds manufactured in the US through 1977 Production was halted due to concerns about the persistence of PCBs in the environment and the toxicity of PCBs shown in animal models PCBs were used primarily by the electrical industry for capacitors and transformers and were also used in hydraulic fluids fluorescent light fixtures flame retardants inks adhesives carbonless copy paper paints pesticide extenders plasticizers and wire insulators PCBs can be oily liquids or solids and some are volatile and can exist as vapor in air PCBs in the environment are generally mixtures of the many different types known as congeners (ATSDR 2000)

The variety of types of PCBs adds to the complexity of studying adverse animal and human health effects Increasingly PCB research focuses on specific PCB congeners and groups of congeners because specific types of congeners appear to have different mechanisms of action and toxicity Dioxin-like PCBs (DLPCBs) are an important subset of PCBs due to concerns about their toxicity The focus on dioxin-like PCBs is paralleled by increasing research on the group of POPs with dioxin-like properties This type of PCB and POP research assigns toxic potency values relative to dioxin to various PCBs and POPs to provide an overall toxic equivalency value to specific PCB and other POPs

From 2002 through 2007 NYS DOH and SUNY Albany School of Public Health researchers conducted the Hudson River Communities Study among residents of Hudson Falls and Fort Edward the villages where the PCB discharges from two GE electrical capacitor plants occurred The comparison community was upriver from the discharges in Glens Falls This studyrsquos evaluation of exposures included measuring PCBs in outdoor air and in blood serum The study found that PCB levels in outdoor air were statistically significantly higher in the Hudson FallsFort Edward area (072 nanograms per cubic meter) than in the upriver Glens Falls area (040 nanograms per cubic meter) but that the PCB concentrations in the Hudson FallsFort Edward area were lower than those in other communities with known PCB-contaminated sites and similar to levels reported in other locations in the northeastern United States A statistically significant reduction in the total PCB concentrations in outdoor air samples was observed at a distance of 1200 meters (34 mile) from the river from 0760 ngm3 to 0497 ngm3 (Fitzgerald et al 2007 Palmer et al 2008)

The average PCB concentrations in serum for the study and comparison populations did not differ significantly (307 ppb wet weight versus 323 ppb) The average toxic equivalency quantities (TEQ) for dioxin-like PCB congeners for the study and comparison area populations

6

also did not differ In addition the study found no detectable differences in study subjectsrsquo serum PCB levels associated with proximity to contaminated portions of the river or other PCB sources The study did find that lifetime consumption of Hudson River fish was associated with higher serum PCB levels (Fitzgerald 2007 Palmer 2008) Another recent biomonitoring study of Brooklyn Manhattan and New Jersey fisherman who consumed versus those who did not consume their catch from the lower Hudson River and New YorkNew Jersey harbor suggests eating fish from these waters is not associated with an increased body burden of PCBs or most other organochlorines This study did not address the inhalation pathway (Morland 2008)

PCBs and Cardiovascular Disease Outcomes

Research studies on adverse health effects associated with PCB exposure have focused on many types of outcomes organs and biological systems including cancer liver reproductive and developmental effects thyroid endocrine immune system and neurological effects as well as specific effects on the skin and eyes Relatively little research has focused to date on associations between PCB exposures and CVD outcomes The majority of human studies have assessed health effects in occupational groups having relatively high-level exposures among people consuming fish from contaminated waters or people exposed to relatively high levels due to accidental contamination of food (USDHHS NIH 2010)

PCBs have been shown to produce a variety of toxic effects in animals These effects include cancer allergies hypersensitivity damage to the central and peripheral nervous systems reproductive disorders and disruption of the immune system (ATSDR 2000) In contrast research to date does not tell us whether specific types of human health effects have occurred from the low-level exposures experienced by the general population Some human epidemiological studies of relatively low levels of PCDDs PCDFs and PCBs suggest possible relationships not only with neurodevelopment of infants but also diabetes and a condition called insulin resistance conditions that increase CVD risk (Longnecker 2001 Carpenter 2006 Arisawa 2005)

In 1998 a study of a population in Seveso Italy exposed to relatively pure TCDD (dioxin) from an industrial accident showed increased mortality from CVD respiratory diseases and diabetes (Bertazzi 1998) These findings suggested the possibility that other dioxin-like chemicals such as dioxin-like PCBs might increase risk for these diseases Some more recent animal studies described below suggest possible mechanisms of action for PCB effects that increase risk for CVD diabetes and hypertension Another relevant source of information is from research using the biological monitoring data from the National Health and Nutrition Examination Survey (NHANES) project to look at the association between PCB levels in blood and self-reported diabetes and heart disease The body of evidence based on analytical human epidemiological studies remains relatively sparse however

Some recent studies using NHANES biomonitoring data show associations between PCB levels and CVD (Lee 2006 Lee 2007 Ha 2007 Everett 2008) These studies are cross-sectional (comparisons at one point in time) evaluations of PCB levels measured in blood samples from individuals and self-reported CVD and diabetes a risk factor for CVD While these studies use individual-level biomonitoring and self-reported health outcome data they are considered to be

7

relatively weak in terms of providing evidence of causality because both exposure and outcome are measured at the same point in time Given the nature of NHANES data collection and reporting practices these studies are not able to control for possible geographical confounding which may occur if urban versus rural residence is associated with higher PCB levels and higher risk for CVD for example The higher PCB levels among African Americans in these data suggest urban residence may be playing a role in the PCB data

The results of animal studies suggest multiple mechanisms by which PCB exposures could affect cardiovascular risk as well as multiple types of adverse health outcomes One hypothesized mechanisms of action specifically for dioxin-like PCBs is that they may have anti-estrogenic effects which impact CVD risk factors such as serum cholesterol levels and blood pressure (Lind 2004) Several studies suggest PCBs interact with a specific receptor (AhR) and increase cellular oxidative stress causing inflammation cell injury and cardiovascular dysfunction (Hennig 2002) Similarly some studies suggest PCB-induced inflammation of fat cells leads to obesity and increased risk for coronary artery disease (atherosclerosis) (Arsenescu 2008)

3 FOLLOW-UP STUDY DATA SOURCES AND METHODS

This follow-up investigation will provide perspective on the 2004-2005 findings (Sergeev and Carpenter 2005 Shcherbatykh et al 2005) by producing results from analyses conducted at a more detailed level of geography the block group rather than ZIP code in order to better control for confounding When the term confounding is used by epidemioogists it refers to the problem that can arise when there is a factor such as income level that is not the factor of interest in the study but that is related to the risk factor being studied (residential location ndash proxy for exposure) and to the disease being studied (CVD) This type of factor is labeled a ldquoconfounderrdquo because it can confound ie interfere with or confuse the evaluation of associations among the variables of interest in the data All types of epidemiological studies face issues of confounding but ecological ie group-level analyses face additional challenges related to potential confounding because analyses can not account for some important individual differences within the groups that are the units of analysis (Morganstern 1998b Hertz-Picciotto 1998) This investigation was able to control for each individualrsquos age sex and race but not for individual-level risk factors such as income or education level

The follow-up investigationrsquos specific objectives are to

improve the ability to control for confounding by socio-economic status through the use of block groups rather than ZIP codes as the unit of analysis

compare the hospitalization rates among residents of block groups nearest the Hudson River to those among residents of block groups farther from the Hudson River

compare the findings of this follow-up study to those of the earlier studies

Study Area and Exposure Areas

The overall study area is comprised of the 12 counties with geographic areas abutting the Hudson River south of the source of PCB contamination from the General Electric facility in Hudson Falls The 12 counties are Westchester Rockland Putnam Orange Dutchess Ulster

8

Columbia Greene Rensselaer Albany Washington and Saratoga US Census boundaries are used to define the study area because census population estimates provide the data for the study area populationrsquos size and the populationrsquos age sex and race distribution Because the study period spanned 1990 through 2005 and populations change over time 1990 and 2000 Census data were used to linearly interpolate the between-year populations from 1991 through 1999 for each block group 2000 data were used without extrapolation to estimate block group populations from 2001 through 2005

Because block group boundaries change between the Census years 1990 block populations were normalized to fit 2000 block group boundaries before the estimation of sex age and race group specific populations In the digital map 2000 block group boundaries were overlaid on the 1990 block centroids to assign 1990 block population counts to the corresponding 2000 block group boundaries Then 1990 block populations were summed to fit the block group boundaries Next population counts by age group (0-14 15-24 25-34 35-44 45-54 55-64 65-74 75+) sex (male female) and race (white black asian other) were estimated for each year at the block group level This investigation includes all ages over 25 years and the study population is divided into four race categories white black or African-American Asian and ldquoOtherrdquo The Pacific Island group was combined with the Asian group and American Indian ldquootherrdquo and ldquomultiple racesrdquo categories are combined into a larger ldquootherrdquo category

Classification of the population into potential exposure categories for this study is based exclusively on location relative to the Hudson River Two types of exposure proxy categories were developed block group adjacency and block group centroid distance to the river shore For the adjacency categories block groups were assigned to the adjacent category if they directly abutted the Hudson River and all other block groups in the 12 counties were assigned to the nonshyadjacent category

For the distance categories a central point (centroid) within each block group was identified and used to assign block group centroid distance to the river shore The block group centroids were population-weighted which means the geographic distribution of households within 2000 census blocks were used to locate block group centroids based on the geographic population distribution By locating a population-weighted block centroid rather than a more straightforward distance centroid the assignment of block group distance to the river shore more accurately reflects distance to the river for the majority of block group residents Buffers of one-half mile and one mile from the Hudson River boundary were created for the assignment of each block group into one of the three distance categories less than frac12 mile (closest) frac12 to one mile (close) and farther than one mile from the river (distant)

Distance categories were selected a priori at the start of the study Because environmental sampling data were not collected along the length of the river included in this investigation we are not able to determine whether these distance categories represent actual differences in exposures As discussed previously a statistically significant reduction in the total PCB concentrations in outdoor air samples was observed at a distance of 1200 meters (34 mile) from the river from 0760 ngm3 to 0497 ngm3 in a study conducted in the Hudson Falls Fort Edward area near the source of PCBs (Fitzgerald 2007 Palmer 2007)

9

For the adjacency categories of the 2371 block groups in the study area 255 (11) were classified as adjacent to the Hudson River and 2116 (89) were classified as non-adjacent (Figure 2) For the distance categories 225 (9) were classified as closest to the river 216 (9) as close and 1930 block groups (81) as distant from the river

Comparison of Figures 1 and 2 show that block groups include smaller geographic areas than ZIP codes and that block groups are relatively smaller in area in urban versus rural areas The study area includes a total of 368 ZIP codes compared to a total of 2371 block groups ZIP code average population in the study area is approximately 8000 compared to block group average population of 1200 Comparison of Figures 2 and 3 shows that using population-weighted centroid distance to the river produces different exposure areas than adjacency Fifty-three (20) of the adjacent block groups were categorized for the distance indicator as distant because the population in the block group was concentrated in the area farther than one mile from the river On the other hand about 11 of the 2116 non-adjacent block groups were classified as closest (77) or close (162)

Hospitalization Data

Hospitalizations for ischemic heart disease (IHD) and cerebrovascular disease (stroke) were included in this study More specifically hospital discharge records that listed as the principal diagnosis either IHD (ICD-9-CM 410-414 and 4297) or cerebrovascular disease (ICD-9-CM 430-436) and subsets of each of these categories similar to the subsets evaluated in the Sergeev et al and Shcherbatykh studies were included The subsets are AMI and angina subsets of IHD and ischemic and hemorrhagic stroke which are subsets of cerebrovascular disease (stroke) (Principal diagnosis represents the illness for which the person was admitted to the hospital) Angina is an outcome not evaluated in the prior studies It was added here for comparative purposes because it represents an outcome appropriately addressed by primary outpatient rather than inpatient hospital care Angina is one of a group of outcomes included in the recently developed national ldquoPrevention Quality Indicatorsrdquo (USDHHS AHRQ 2007) as an indicator for geographic areas or populations for whom primary care is inadequate Table 3 lists the specific ICD-9-CM codes and definitions included in this investigation

While one of the previously published studies (Huang et al 2006) showed elevations in hypertension (high blood pressure) rates this outcome is rarely listed as the principal reason for hospitalization As a secondary diagnosis however hypertension is extremely common with substantial overlap with the primary diagnoses of IHD and stroke In addition hypertension hospitalizations were not as elevated in the ZIP codes adjacent to the Hudson River as the IHD and stroke outcomes As a practical matter geocoding the hypertension hospitalization data (1364000 hospitalizations) would overwhelm the resources for this project For these reasons hypertension was not included in this follow-up investigation

The source of the hospitalization data was the NYS DOH Statewide Planning and Research Cooperative System (SPARCS) established in 1979 to collect detailed records on discharges from hospitals located in New York State Persons seen in the emergency department but not admitted are not included in the SPARCS data Hospitalizations at US Veterans Administration

10

Hospitals are also not included in this dataset We identified the hospital transfers in the data and excluded them so that individuals are only counted once for the same event

The NYS DOH Data Protection Review Board for SPARCS hospitalization data approved our use of hospitalization records for this investigation and provided approximately 460000 records with a principal diagnosis of either IHD or stroke Because of the large number of cardiovascular hospitalizations and this studyrsquos requirement for geocoding each hospitalized personrsquos address we selected a 50 random sample of the hospital discharges with principal diagnoses of IHD or stroke for inclusion in the study (229948 records) Approximately one percent of these records (2737) were eliminated from the study because they contained no residential address resulting in 227211 hospitalization records 24939 of these records were eliminated after geocoding because they were not within the 12 county study area This left 202272 records remaining in the study

For geocoding to the block group level the hospitalization street addresses were modified as needed using US Postal Service standards The addresses were assigned geographic coordinates using commercially available geocoding software (MapMarker 2004) A combination of land parcel data and street centerline files were used to assign geographic coordinates No contact was made with cases parents legal guardians or next of kin of cases to determine residential locations

Once geographic coordinates were assigned to cases through address-matching the case locations were overlaid onto digital maps of the study area using a geographic information system so that the number of observed cases falling within the study area boundaries could be determined and a 2000 Census block group could be assigned to each hospitalization In order to protect confidentiality no maps of individual case locations are provided Geographic coordinates were not able to be assigned for some addresses for example rural routes or post office boxes Internet searches were conducted for addresses that were located in facilities such as nursing homes and trailer parks

While fewer than one percent of records were missing sex or age information approximately 15000 records seven percent were missing race information Evaluation of the geographic and temporal distribution of these records showed they were not randomly geographically distributed 47 of the records with no race information were from one hospital for that hospital race was missing from 100 of the records in all years except one after 1997 mid-way through the study time-frame Evaluation of other records missing race showed similar patterns of 100 of records missing race in specific years In order to prevent biased hospitalization rate estimates records missing race were assigned race randomly assuming a race distribution equal to that of the individualrsquos residential block group

Statistical Analysis

This investigation used negative binomial multivariable regression models the statistical methods used in the previous studies (Sergeev and Carpenter 2005 [IHD] Shcherbatykh et al 2005 [stroke]) The follow-up studyrsquos analyses compared hospitalization rates for populations in block groups adjacent or in close proximity to the Hudson River to rates for populations more

11

distant from the Hudson River while adjusting for the potential influence of other factors Sex age and race-specific hospitalization count data comprise the dependent variable and the statistical methods attempt to predict the counts using the sex age and race distributions from Census data as well as block group-level demographics such as median income Models were first estimated using basic Census information for age sex and race Then more fully adjusted models were estimated using information at the block group level for income education population density Hispanic ethnicity and distance to the nearest hospital

More specifically each hospitalized personrsquos block group was used to assign quartile categories of block group median household income of population with less than a college education population density of the population identified as Hispanic and distance from the block group centroid to the nearest hospital These variables were selected in advance The population density variable was selected to account for potential effects on hospitalization rates of large differences in population density from the relatively rural northern counties to the more urban southern counties in the study area of the population identified as Hispanic was included because Hispanic ethnicity may be associated with CVD risk but Hispanic ethnicity was not available in the hospitalization records Distance to the hospital was included because studies show that shorter distances increase the likelihood that individuals with AMI and stroke survive to be admitted to the hospital

As described previously the literature about risk factors for CVD shows strong associations between lower income and education and higher CVD risk Regarding the effects of income and education on hospitalization rates the literature from analytic studies specifically of hospitalizations for CVD is quite sparse Most analytic studies focus on prevalence incidence or mortality in population-based studies or cohort studies and there are not many of these An example of a study that was useful as background is a study of income level and CVD mortality that showed strong associations of lower income and higher CVD mortality for men and women with the relationship for men being linear throughout the range of income levels while for women the relationship was only strong at levels of income below the median (Rehkopf 2008)

This follow-up studyrsquos approach was to use the same set of variables for the more fully adjusted models using quartile cut points for all the sexrace groups and types of CVD diagnostic subsets The 2000 Census data were used to create the population-weighted quartile categories to be used in the statistical analyses The quartile cut points are shown below

population density(persons per square mile)

median 1999 household income

with less than a college education

with Hispanic ethnicity

Hospital distance

Quartile 1 61 - lt507 $2499-lt$40968 0 - lt31 0 - lt2 lt25 (median) Quartile 2 507 - lt102 $40968 - lt54650 31 - lt43 2 - lt5 25 - lt6 (50-75) Quartile 3 102 - lt6410 $54650 - lt72944 43 - lt55 5 - lt10 6 ndash 11 (75-90) Quartile 4 6410 - 94372 $72944 - 200001 55 - 100 10 - 86 11 ndash 45 (90-100)

SAS version 9 provided the statistical programs used to estimate hospitalization rates for exposure proxy categories and demographic categories The models provide ldquocontrast

12

estimatesrdquo ie rate ratios for one category versus a reference category of a variable and 95 confidence intervals (CI) for these rate ratios For the location categories rate ratios for the areas adjacent are compared to areas not adjacent to the river (reference) and areas closest and close to the river are compared to areas distant from the river (reference) If a rate ratio is greater than 100 the model has estimated an elevation of hospitalizations in that category of the study population compared to the reference population If the rate ratio is less than 100 then there are fewer than expected hospitalizations compared to the reference population

The magnitude of the elevation or deficit is also estimated by the rate ratio For instance if the hospitalization rate is twice as high in the adjacent population it would result in a rate ratio of 200 while a 50 higher rate would result in a rate ratio of 150 If the hospitalization rate in the comparison group was only half as big as in the reference group this would result in a rate ratio of 050 The 95 confidence interval assists with determining whether the observed differences are statistically meaningful ie statistically significant The confidence interval provides the range in which there is a 95 probability of including the true rate ratio

In addition to directly assess potential confounding we examined the associations between income and exposure proxy categories and between income and CVD hospitalization risk Because the modeling process provides rate ratios that provide comparisons among groups but not the rates themselves we also calculated the rates per population for each sex race and income category in order to directly observe the relative magnitudes of CVD hospitalization rates Finally analyses stratified by income quartile were also conducted in order to assess the potential role of confounding by income The stratified analyses consist of negative binomial regressions conducted separately for each quartile of income

4 RESULTS

Demographics Associations with Proximity to the River

Study Population Table 4 provides detailed demographic characteristics for the total study population and the populations of the adjacent versus non-adjacent and nearer versus farther from the river block groups from the 2000 Census The total population of the study area for this one year of the study time period is 2924631 persons The first row of Table 4 shows that approximately 11 percent of the study area population is classified as living in block groups adjacent to the Hudson River and 89 live in non-adjacent block groups Approximately 8 of the studyrsquos population lives in the closest block groups approximately 10 lives in close block groups and approximately 82 in distant block groups The table also shows that approximately 81 of the total study population is identified as white 9 black 3 Asian less than 01 Pacific Islander 02 American Indian 4 ldquootherrdquo and 2 ldquomultiple racesrdquo

The 2000 Census data (Table 4) also show that the populations in block groups adjacent and nonshyadjacent to the river are similar in terms of the percent of minority population but the adjacent block groups include a smaller percentage of persons who have attended college and a higher percentage of households below the poverty level Median income is $47320 for block groups adjacent versus $62668 for block groups not adjacent to the river Data for the study area from the 1990 Census show similar distributions (data not shown)

13

For the distance to the river categories there is a gradient for the three levels of distance from the river with the closest block groups on average having higher percentages of minorities smaller percentages of persons who have attended college and higher percentages of households below the poverty level (Table 4) The populations of the close and closest block groups are more similar to each other than to the populations of the distant block groups The population of the closest block groups is estimated to be 19 black close block groups 14 black and distant block groups 8 black The proportions for the ldquootherrdquo category used in the analyses which includes multiple races and American Indians are approximately 12 in both the closest block groups and in the close block groups compared to 5 in the block groups most distant from the river Median income for the closest block groups is $40748 close block groups $44962 and most distant block groups $64847

The bar chart below illustrates the correlations between US Census 2000 indicators of block group socio-economic status and block group centroid (population weighted) distance from the river for the study population

2000 Census Demographics for Study Population by Distance to River Category

black other hispanic no college below poverty

of pop ulati on

0

10

20

30

40

50

60

lt12 mile

12 to 1 mile

gt1 mile

Table 5 shows selected demographic information for the person-year estimates used in the study analyses for the 1990-2005 study period Person-year data represent individuals from Census estimates counted repeatedly for each year of the study timeframe As described in the Methods section the estimation of person years used interpolation to account for population changes from 1990-2000 and techniques for taking account of block boundary changes The person-years estimates for demographic categories (complete data not shown) show similar patterns as the 1990 Census data (data not shown) and 2000 Census data (Table 5) Table 5 for example shows nearly identical percentages of population in each of the river distance categories as Table 4

Table 5 also shows person year information for income quartiles and exposure proxy categories as well as race and exposure proxy categories that is consistent with the Table 4 data for the

14

2000 population Table 5 row percentages show for example that while approximately 11 of the total study population live in adjacent block groups approximately 19 of the population in the lowest income quartile live in adjacent block groups By design approximately 25 of the total study population is in each of the income quartiles Table 5 column percentages show however that in the adjacent block groups 41 of the population is in the lowest income quartile and only 9 are in the highest income quartile

For the block groups within frac12 mile of the river the distribution among income quartiles is even more skewed towards the lowest income quartile 61 of the population living within frac12 mile of the river is in the lowest income quartile and only 6 is in the highest income quartile Given that approximately 25 of the total study population is within each income quartile these percentages indicate that living within the block groups closest to the river is associated with more than a doubling of the likelihood of being in a block group in the lowest socio-economic status quartile 436 of those living in the block groups close to the river (12 to one mile from the river) are in block groups in the lowest income quartile While less than a doubling this indicates that living in the close block groups is associated with a 75 increase in the likelihood of being in the lowest quartile for socio-economic status

Hospitalizations Table 6 shows the process of removing hospitalization records from the study for a variety of reasons The first column includes the 202272 records that contained a residential address and were located within the 12-county study area as described in the Methods section The next five columns show the records remaining after data cleaning exclusion of transfers elimination of records with missing covariates and exclusions due to poor geocoding results Approximately 33000 records were eliminated because they were transfers from one hospital to another for the same event For approximately 1100 records geocoding was inadequate for assigning block group for imputing race as described in the Methods section 336 records were missing sex (11) or age (325) Of the 167733 records remaining 154220 (92) were able to be adequately geocoded to a street address for assignment in a specific block group While approximately eight percent of hospitalizations were eliminated due to inadequate geocoding Table 6 shows that the percentages of hospitalization records for white black Asian and ldquootherrdquo categories and the distribution among diagnostic categories change very little after these records were dropped

The final column of Table 6 shows that of the 154220 hospitalizations in the analysis 101012 IHD hospitalizations comprise the largest health outcome category and are 65 of the hospitalizations in the study AMI hospitalizations (35811) account for 35 of the IHD hospitalizations and angina (4731) accounts for five percent of the IHD hospitalizations Stroke hospitalizations (53208) comprise 34 of the studyrsquos hospitalizations Approximately 36 of the stroke hospitalizations are for ischemic stroke (19371) and approximately 11 are for hemorrhagic stroke (5995)

Table 7 shows the distributions of hospitalizations among the four income quartiles While each income quartile (by definition) contains approximately 25 of the study population for all types of hospitalizations we see that the lowest income quartile includes more than 25 of hospitalizations from a minimum of 286 (IHD) to a maximum of 356 of hospitalizations (angina) The highest income quartile includes fewer than 25 of hospitalizations for each CVD

15

category ranging from 182 for angina to 224 for hemorrhagic stroke These deviations from the 25 quartile distribution show a strong linear pattern with the largest percentage of hospitalizations coming from residents of block groups in the lowest income quartile and the smallest percentages coming from residents of the most wealthy block groups For AMI the lowest income quartilersquos share of hospitalizations is 50 higher than the highest income quartilersquos share of hospitalizations (30 vs 20) IHD and stroke show similar differences and angina shows a doubling with the lowest income quartile contributing 36 of hospitalizations compared to the highest income quartilersquos 18 share of hospitalizations

Table 7 also shows the distribution of hospitalizations among the race categories and the study population race category distribution for comparison Blacks show lower percentages of IHD and AMI than their population percentages and higher percentages of angina and stroke While full presentation of race and sex findings is beyond the scope of this report the lower than expected percentages of IHD and AMI among blacks who on average have lower socioshyeconomic status and are at greater risk for CVD hospitalizations raised questions that were addressed by calculating hospitalization rates per 1000 population for each sex and race group for each income quartile (Appendix B Table 2) Figures 4 5 and 6 show these rates for the white black and Asian categories (These rates for the ldquootherrdquo category are comparatively very high and are not shown on these graphs This issue is addressed in the Discussion section) Figures 4 5 and 6 show the consistent and strong effects of income quartile on hospitalization rates for AMI and stroke for whites blacks and Asians of both sexes The other outcomes show similar patterns (The ldquootherrdquo group showed an opposite pattern [See Discussion Section])

Demographics Summary

The strong associations of lower income with residence closer to the river (Table 5) and lower income with higher hospitalization rates (Tables 7 amp 8) create a difficult challenge for this investigation The multivariable analyses of hospitalization rates attempt to control for socioshyeconomic differences However because these income differences which strongly affect hospitalization rates are also strongly correlated with the gradations of exposure (river distance) it will be difficult to disentangle the influence of socio-economic status from the influence of potential exposures

Statistical Analyses Proximity to the River and CVD Hospitalization Risk

All the variables described above were included in the multivariable regression models and almost every quartile category for every variable showed statistically significant and expected types of effects for all the CVD hospitalization categories (Appendix B Tables 1a and 1b) The following presentation of results focuses on the rate ratios for the exposure proxy variables Table 8a (IHD) and 8b (stroke) show the regression modeling results Results are shown first for models that include only age sex and race in addition to the exposure proxy categories and second for models that also include median income education population density Hispanic ethnicity and distance to nearest hospital The tables provide numbers of hospitalizations and person-years to assist with interpretation For example for total IHD we can see that while 108 of the study population lives in adjacent block groups 122 of the hospitalizations are among people in adjacent block groups This difference of 14 percentage points is the basis for

16

the finding of a statistically significantly elevated rate ratio (risk) for IHD among people living in the adjacent block groups (RR=133 CI 112-157)

Looking across the columns of Tables 8a and 8b we can see the rate ratios for each of the outcomes for the adjacent versus non-adjacent block groups and the results are similar to those for IHD The elevated rate ratios range from 123 for angina to 136 for AMI The only rate ratio not elevated in these first models adjusting only for age sex and race is for hemorrhagic stroke Going down the rows of the Table the next set of rate ratios are for the distance to the river categories The findings are very similar to those for the adjacent block groups Here however we have two exposure proxy categories allowing comparison of the two These rate ratios show no evidence that living in the block groups closest to the river (lt12 mile to the river) produces greater risk than living in the block groups close to the river (12 to 1 mile from the river)

The next set of rate ratios are for models with additional adjustment for the demographics of each hospitalized personrsquos block group as well as distance to the nearest hospital As expected adjustment for block group median income and other demographics accounts for some of the variability in hospitalization counts and thereby reduces the magnitude of the rate ratios estimated for the river adjacency and distance categories The rate ratio elevation for total IHD for example is reduced from 133 to 107 (RR=107 CI 104-111) The elevated rate ratios in these more fully adjusted models for the adjacency analyses are of similarly small magnitude ranging from 106 for AMI to 110 for ischemic stroke Four of the six outcomes show statistically significant elevations Again only hemorrhagic stroke shows no evidence of an elevated rate ratio

For the river distance categories none of the rate ratios show any evidence that living in the block groups closest to the river (lt12 mile to the river) produces greater risk than living in the block groups close to the river (12 to 1 mile) Of the six types of outcomes only two show statistically significantly elevated rate ratios for living in the closest block groups (total stroke and ischemic stroke) For the close block groups four out of six outcomes show statistically significantly elevated rate ratios The statistically significantly elevated rate ratios vary from 106 (AMI stroke) to 113 (angina)

To address the strong evidence described earlier for confounding in these data ie the association of lower income with both CVD hospitalizations and residence closer to the river we also conducted the analyses separately for each quartile of income These results are presented in Table 9 The first results are from analyses adjusting only for age sex and race and the second set of results are from models including the additional demographic factors For simplicity rate ratios are shown only for block groups closest to the river compared as in the other models to block groups farther than one mile from the river (Results were similar for the other distance categories (data not shown)

For the two lowest income quartiles Table 9 shows results similar to those for the study population as a whole (Table 8a 8b) However for the two highest income quartiles which represent frac12 of the study population Table 9 shows a consistent pattern of reduced rate ratios for residence in the closest block groups within frac12 mile of the river In the highest income quartile

17

these reduced rate ratios are statistically significant for almost all of the outcomes in both the partially and more fully adjusted models In addition the magnitude of the rate ratio estimates ie their departure from 100 is relatively high For the population living in block groups with the highest median income (income gt$73000) living within frac12 mile of the river is associated with having about 70 of the risk for IHD AMI and stroke (CI=068 069 069) as the population in equally high income block groups farther than one mile from the river In other words among the highest income quartile living this close to the river is associated with an estimated 30 reduction in risk for these types of hospitalizations For angina (CI=028) there is an estimated 70 reduction in risk and for ischemic stroke (CI=058) the reduction is estimated to be 40 These stratified analyses show that in one-half of the study population the two higher income quartiles there is no evidence of elevated hospitalization risk for populations in block groups closer to the river On the contrary reduced rate ratios are shown for all outcomes in the higher income quartiles for residents of block groups within frac12 mile of the river compared to those living farther than one mile from the river

Weighted average rate ratio estimates are a standard method for estimating a general population finding from stratified analyses and are also provided in Table 9 These average rate ratios range from 078 to 102 showing no evidence of elevated risk for the overall population associated with residence within frac12 mile of the river compared to living farther than one mile from the river Results for the other categories of river proximity showed similar results (data not shown) (Given the evidence of confounding interfering with analyses of the entire study population and the opposing findings depending on income quartile no confidence intervals are provided here for the overall population-weighted average estimates)

Summary of Results

In summary the multivariable regression results for the entire study population (Tables 8a 8b) show statistically significantly elevated rate ratio estimates for the adjacent versus non-adjacent block groups for four out of six hospitalization diagnosis categories The analyses also produce statistically significantly elevated rate ratio estimates for block groups closest to the river (less than frac12 mile from the river) and close to the river (between 12 to 1 mile from the river) compared to distant block groups (farther than 1 mile from the river) for most of the outcome categories However the findings provide no evidence that the closest block groups have higher hospitalization rates than the close block groups

The most elevated rate ratio estimate is 113 (angina) indicating an estimated 13 percent increase in risk for angina hospitalizations for residents of close versus distant block groups The only outcome with no statistically significant elevations for living near (adjacent closest or close) the river in the more fully adjusted models is hemorrhagic stroke It is noteworthy that the outcome that is considered an indicator of inadequate primary care rather than of incidence or prevalence angina hospitalizations shows the highest elevations associated with residence near the river (Table 8a) In addition the hospitalization outcome with the least association with socioshyeconomic status in our data (Appendix B Table 1b) and the lowest out-of-hospital mortality (NYS DOH2009b) hemorrhagic stroke is the outcome that shows no pattern of elevations associated with residence near the river (Table 8b) This pattern of findings is consistent with the interpretation that the rate ratios for living closer to the river are reflecting the influence of

18

lower socio-economic status and other known risk factors for hospitalizations for CVD rather than any unusual environmental exposures in areas near the river

The stratified results add strong support to this conclusion They suggest that classical confounding of the exposure-disease relationship by socio-economic status produced biased results in the overall multivariable regression analyses While the regression analysis of the entire study population shows elevated hospitalization rates for residents of block groups near the river compared to residents living farther away (Table 8a and 8b) the stratified analyses (Table 9) show this association applies only to the 50 of residents living in block groups with lower median incomes For the other 50 of residents those who live in block groups with higher median incomes hospitalization rate ratios are reduced for those living near the river Classical confounding refers to the type of confounding that can occur in individual-level or group-level (ecological) studies when a factor such as socio-economic status affects both the exposure (residential location) and the health effect In this study of CVD hospitalizations there are disproportionate numbers of outcomes and exposures (residence near the river) among specific levels of the confounder the lower income quartiles (Tables 4 5 7) The disproportionate numbers of hospitalizations and potential exposures (block groups close to the river) among the lower income quartiles create an artifactual exaggerated ie biased effect on the estimates for the overall analysis (Greenland 1999 Greenland 2003)

5 DISCUSSION

The key finding of this investigation is that the results of the multivariable regression modeling for the entire study population the elevated CVD hospitalization rates for populations living closer to the Hudson River are likely to be artifacts of classical confounding Stratifying on the confounding variable ie holding median income level constant the standard method for controlling for confounding showed the decisive role of income as a confounding variable The associations between living closer to the river and having higher hospitalization rates were evident only for people in the two lowest income quartiles For people in the two highest income quartiles the association was completely opposite ie people living closer to the river had lower rates of hospitalizations This variability of effects can be interpreted as evidence that there are no actual effects of the exposure-proxy categories on hospitalization rates

This type of classical confounding due to the strong influence of a variable that precedes causes or strongly affects both the exposure and the health outcome can occur in any type of epidemiological study The ecologic ie group-level analysis in this study is not the primary cause of the confounding Rather the complex associations in the population among income education CVD risk geographic residential patterns and geographic assignment of exposure status are the basis for the confounding However the studyrsquos group-level design contributes to the studyrsquos inability to completely control for confounding using multivariable regression methods and contributes to difficulties of interpreting the study findings

The stratified results shown in Table 9 may also be a reflection of inadequate control for socioshyeconomic status using quartile groupings A possible explanation for the Table 9 findings is that among lower income quartile populations people living in closest proximity to the river may have the lowest incomes while in higher income quartile populations people living in closest

19

proximity to the river may have the highest incomes This potential income difference associated with river distance within quartile groupings would explain why the stratified analyses showed positive associations with hospitalization risk for lower income residents closest to the river and negative associations with hospital risk for higher income residents closest to the river In this example of a possible explanation it is assumed that river proximity categories are capturing relative income (and hospitalization risk) differences rather than exposure Because the exposure proxy river distance categories are assigned at the group level this type of analysis is not able to distinguish whether these categories are capturing income rather than potential exposure in the analyses

When trying to interpret this studyrsquos findings it is important to keep in mind the many factors associated with CVD risk that are not controlled for in this studyrsquos analyses These include tobacco use high blood pressure high serum cholesterol alcohol use obesity low fruit and vegetable intake physical inactivity diabetes mental ill-health psychosocial stress and the use of certain medications Another factor that is not known to be associated with CVD risk but that is relevant for PCB exposure is consumption of contaminated fish It is possible that lower income populations are more exposed to PCBs because they are more likely to eat fish from the Hudson River This however would not help explain why higher income people near the river showed lowered CVD hospitalization risk than their counterparts living farther from the river

In addition to the confounding due to socio-economic statusrsquos effects on both the risk of CVD and residential location (the exposure proxy) there is the possibility that another pattern of effects also associated with socio-economic status called effect modification plays a role in these data Effect modification occurs when different groups people with differing types of health insurance or differing behaviors for example show different patterns or levels of health effects associated with the same exposures The differing effects (CVD hospitalization risks) of residence in block groups closest to the river by income quartile could be interpreted as evidence of effect modification (Table 9) However the magnitude and the change in direction of effects is unlikely to be due solely to differing biological or behavioral factors associated with income level that affect the results of exposures from living near the river Given the strong associations of socio-economic status and CVD hospitalization risk it is more likely that the analyses stratified by income enhance protection against bias compared to the multivariable models and that the stratified analyses represent findings from better control for confounding as opposed to effect modification

Effect modification is a concept used to describe important relationships between exposures and effects (that may differ among sub-groups for example) that the researcher wants to identify and understand Confounding on the other hand refers to associations among the variables that the researcher is trying to control so that the effects of interest (effects from exposures rather than age or sex for example) can be identified and the strength of the effects can be estimated The ecological design of this study contributes to the difficulty of interpreting whether the study shows evidence of effect modification as opposed to confounding (Morgenstern 1998b Hertz-Picciotto 1998)

Another limitation associated with this studyrsquos design is that block-group level socio-economic and environmental attributes that are assigned to individuals in this type of study can be more

20

strongly correlated at the group level than at the individual level In this way the ecological assignment of variables may lead to increasing problems of confounding because of strong associations among indicators of socio-economic status and environmental exposure categories (Morganstern 1998a) This problem is likely to have affected this studyrsquos results As with effect modification by group described above this type of confounding by group cannot be evaluated with ecologic data because individual-level information is needed to check for this type of ldquocross-levelrdquo bias

Ecological analyses are an important tool for environmental epidemiology and there is therefore a large and growing literature on their strengths and limitations and methods to address limitations (Piantadosi 1988 Greenland 1989 Greenland 1992 Greenland 1994 Richardson 2000) One particular concern is the interpretation of ecological study findings showing modest elevations of risk such as in the current study with rate ratios in the more fully adjusted models generally estimated to be below 110 (Wakefield 2003 Morgenstern 1998) Wakefield states that slight elevations need to be interpreted with great caution

in particular for cancer and heart disease (because) there are risk factors that are far more predictive of disease than environmental factors hellip Consequently the potential for confounding is strong since ecological studies do not directly use individual-level risk factor data although stratification by age and gender is routinely carried out (Frequently an area-level measure of socio-economic status is also used as a confounding variable but being an aggregate summary it only provides very crude control) (Wakefield 2003 p 9-10)

A strength of this study is that it used Census block groups which are relatively small average population of 1200 in the study area to assign socio-economic characteristics rather than the larger ZIP codes with average population size 8000 in this area By using block groups this study sought to reduce confounding by socio-economic status because the assignment of median income and other variables at the block group level was expected to be more accurate with block group populations expected to be more homogenous than larger and more geographically diffuse ZIP code populations used in the prior studies

The use of census block groups rather than ZIP codes likely enhanced the accuracy of the group-level category assignments However this type of design that assigns characteristics to hospitalizations using group-level characteristics and distance to the river is limited in comparison to individual-level designs that gather such information from personal interviews for example Most importantly for this investigation the group-level assignment of median income and other demographic variables to each hospitalization limited the studyrsquos ability to distinguish among individuals within the exposure proxy categories and thereby reduced the studyrsquos ability to control for confounding

This type of measurement error or inaccuracy from assigning group-level potential exposure income or educational level to the individual hospitalizations is generally assumed to bias a study towards findings of no associations or no effects But in situations where the measurement error is more severe for some variables than others and these variables are correlated then the study findings can be biased either towards exaggerated effects or no effect In this

21

investigation potential measurement error associated with the assignment of exposure categories based on block group centroid distance to the river is expected to be relatively low because a large proportion of the hospitalization addresses are very clearly in areas distant from the river The measurement error associated with the assignment of group-level demographic characteristics (income and education for example) from the Census is potentially relatively higher because this type of error can occur throughout every block group in the study

Another strength of this study is the use of NYS SPARCS data a relatively comprehensive population-based source of information about hospitalizations These data include individual-level information used in this study for patient address sex age race and diagnosis Of these variables only the address and race categories were missing information for substantial numbers of records and this limitation could have contributed to imprecision sufficient to create bias in the findings These issues are discussed in greater detail below

While geocoding to block groups represents a strength of this study this procedure required high quality address information One percent of the study arearsquos hospitalization records contained no address and eight percent of the addresses could not be accurately geocoded to a block group resulting in nine percent of hospitalizations being eliminated from the study because of address andor geocoding issues The distributions of race categories and distribution among the types of diagnoses did not change after dropping records with no or insufficient address information suggesting that the dropped records did not represent a substantially skewed group of records

Inaccuracy in the studyrsquos assignments of race categories results from limitations associated with both the hospitalization and Census data The SPARCS hospitalization database did not include information on race for seven percent of the studyrsquos hospitalizations We imputed race by assigning race category randomly based on the race distribution of the personrsquos block group This method would have biased the study toward a finding of no effect of race on hospitalization risk

Another limitation of the SPARCS data used in this investigation is that repeat hospitalizations for the same individual were not able to be identified If an individual was admitted to the hospital many times over the course of the study period then that individual would be counted multiple times This limitation may have contributed to an exaggerated association of lower socio-economic status and CVD hospitalizations because some people with lower incomes receive less adequate outpatient care and may have more repeat admissions Because of the association of lower income with residence near the river this limitation may have contributed to the difficulty of adequately controlling for confounding by socio-economic status in this investigation

While the use of Census data at the block group level rather than at the ZIP code level is a strength in terms of relative homogeneity among the population within block groups it is possible that block group population estimation errors contributed to imprecise estimation of hospitalization rates Particularly with regard to the race categories the study finding that the ldquootherrdquo category showed a counter-trend of higher income and higher CVD hospitalization rates (Appendix B Table 2) is evidence that differences between the categories used in the Census race designations and hospitalization race assignments affected the estimated hospitalization

22

rates The evolving choices for identifying onersquos race in the Census are not concurrently reflected in the SPARCS data and these inconsistencies likely contributed to imprecision in the estimates of CVD hospitalization risk associated with specific race categories These issues did not contribute to bias for the overall findings however We modeled the data with a variety of race category groupings combining black and ldquootherrdquo for example and the overall results did not change (data not shown)

Also regarding the Census undercounting of approximately two percent for the US population in 2000 was estimated to be primarily among the lowest socio-economic groups and minorities while over-counting of one percent was estimated to have occurred among the most wealthy who sometimes received and completed forms at multiple residences (United States Censusrsquo Monitoring Board 2001) The undercounting of the lower income population would result in over-estimation of rates in this population Undercounting of recent Asian immigrants for example particularly those of lower socio-economic status is a potential reason for the relatively high rates of CVD hospitalizations estimated for lower income quartiles of the Asian population (Figures 4-6) Models that grouped the Asian population with other race categories were also estimated and these showed the same results as the models reported here (data not shown) Undercounting of lower income groups in general would have contributed to the finding of higher hospitalization rates in these groups and added to the difficulty of adequately controlling for confounding of lower income with residence near the river

This study used a 50 percent sample of the CVD hospitalizations in the study area This was accomplished by using every other record provided in the complete dataset This would be expected to provide a non-biased random sample As a general check for data provision management or analysis errors we compared our estimates of hospitalization rates per population with those from other sources Table 10 shows these comparisons and shows that our estimates are generally consistent with rates from other sources

Actual exposures were not measured in this investigation Rather proximity to the river was used as a proxy for potential exposures associated with the river In addition other potential exposures that may impact CVD were not evaluated or measured Exposures to lead and air pollution would be expected to be higher in more urban areas and areas near heavy traffic which may tend to coincide with proximity to the river These issues were not addressed in this investigation If these exposures were occurring more often near the river and contributing to cardiovascular hospitalization rates this would contribute to findings of elevations in areas closer to the river

It is important to keep in mind the limitations associated with using CVD hospitalizations when incidence andor prevalence of CVD are the actual outcomes of interest As described previously mortality prior to reaching the hospital may reduce hospitalization rates particularly for AMI and stroke for some groups or areas more than others and introduce bias if hospitalizations are being interpreted as a surrogate for incidence or prevalence

23

Addressing Study Objectives

This follow-up report has addressed the studyrsquos first two objectives to increase control for confounding by socio-economic status through the use of block groups rather than ZIP codes as the unit of analysis and to compare the hospitalization rates among residents of block groups nearest the Hudson River to those among residents of block groups farther from the Hudson River Regarding the first objective the use of smaller units of analysis block groups rather than ZIP codes improves the accuracy of the assignment of income and other group-level demographic categories to the hospitalizations

Regarding the studyrsquos second objective to compare hospitalization rates among residents of block groups near to the river to hospitalization rates among residents far from the river this studyrsquos multivariable analyses showed relatively small but statistically significant elevations of CVD hospitalization risk for residents of block groups in close proximity to the Hudson River Stratified analyses that controlled for confounding directly by conducting analyses within each income quartile showed that residence in block groups near the Hudson River was associated with increased risk for CVD hospitalizations among residents of lower income block groups and conversely residence in block groups near the Hudson River was associated with decreased risk for CVD hospitalizations among residents of higher income block groups The estimated increased and decreased risks were of sufficient magnitude to nearly cancel each other out when average rate ratios for the four income quartiles were calculated The average rate ratios may be interpreted as showing no evidence that CVD hospitalization rates are elevated for residents living near the river for the general study population This finding suggests that the multivariable regression findings for the overall population show elevated risks for hospitalizations among residents of block groups near the river due to confounding of the relationship of residential location and CVD risk by socio-economic status

The findings from analyses stratified by income quartile may also be interpreted as showing that residing in a higher versus lower income quartile block group produces different effects from similar exposures (effect modification) As discussed previously the ecological ie group-level analyses are unable to directly evaluate confounding and effect modification Rather information about the strengths and direction of associations among variables in the analyses need to be evaluated in light of known risk factors to draw conclusions about likely explanations and plausible interpretations In this case uncontrolled confounding by socio-economic status appears to be the most likely explanation for the findings This is due to the strong association between residential proximity to the river and socio-economic status

The studyrsquos third objective was to compare the findings of this review to those of the earlier studies that prompted this investigation Because of the similarities in study design and statistical analysis the prior studyrsquos findings can be compared with the findings of the multivariate regression analyses in the current study using Table 1 and Tables 8a and 8b The major difference between the studies is the composition of the comparison areas for the analyses Because of this difference as well as other differences (Table 3) the studies are not entirely comparable

24

Table 1 shows some findings from the prior studies Comparison with Table 8a and 8b from this study shows that the CV hospitalization rate ratios estimated for living in block groups adjacent or near the river in the current study are substantially lower than the rate ratios for residence in ZIP codes adjacent to the Hudson River in the previously published studies In most cases the current analyses show elevations below 10 in contrast to elevations of 20 to 39 in the prior studies The results of the current study are quite similar to the prior study results in terms of the associations reported for age sex race and income (Appendix B Tables 1a and 1b) (Sergeev and Carpenter 2005 Tables 4 amp 5 Shcherbatykh et al 2005 Table 3) These similarities and the general methodological similarities between the current and prior studies suggest the likelihood that the prior studiesrsquo findings also were affected by confounding by socio-economic status

6 CONCLUSION AND RECOMMENDATIONS

While this type of study can neither prove nor disprove causal links among risk factors and health effects it is useful for examining evidence for unusual patterns that might warrant additional investigation This evaluation of hospitalization data residential location and socioshyeconomic factors in block groups in the 12 Counties abutting the Hudson River revealed a closely coinciding pattern of lower socio-economic status and residential proximity to the Hudson River the proxy for potential exposure and a strong association of lower socioshyeconomic status and risk for CVD hospitalizations The strong associations among residential location socio-economic status and CVD risk prevent us from drawing definitive conclusions from the multivariable regression analyses

The multivariable analyses showed relatively small but statistically significant elevations of CVD hospitalization risk for residents of block groups in close proximity to the Hudson River Stratified analyses conducted within each of the four income quartiles showed such elevations only among residents of lower income block groups This studyrsquos data methods and analyses can not provide a conclusive interpretation for this finding However the strong effect of socioshyeconomic status on residential location (exposure proxy) and on CVD hospitalization risk makes it likely that the overall multivariable regression results are affected by uncontrolled confounding by socio-economic status

To draw more definitive conclusions particularly for outcomes such as CVD that are determined by multiple known risk factors studies seeking to investigate the role of PCB or POP exposures will require resource-intensive methods that include gathering individual-level exposure or biomonitoring information as well as individual-level medical histories and information on other risk factors

This current investigation did not include environmental or biological sampling data One study described previously did gather such data in the Hudson River region with the highest potential exposures (Hudson Falls Fort Edward) This study detected a statistically significant reduction in the total PCB concentrations in outdoor air samples at a distance of 1200 meters (34 mile) from the river However the PCB levels in outdoor air in this area were lower than those in other communities with known PCB-contaminated sites and similar to levels reported in other locations in the northeastern United States The study found no detectable differences in study

25

subjectsrsquo serum PCB levels associated with proximity to contaminated portions of the river or other PCB sources (Fitzgerald 2007 Palmer 2008)

It is recommended that the results of the current investigation of CVD hospitalizations and residence near the Hudson River be shared with NYS residents and other stakeholders with interest in this issue and be submitted for publication in a peer-reviewed journal

26

AGENCY INFORMATION

New York State Department of Health Authors Elizabeth Lewis-Michl PhD

Research Scientist Bureau of Environmental and Occupational Epidemiology

June Beckman-Moore MPH Research Scientist Bureau of Environmental and Occupational Epidemiology

Marta Gomez MS Research Scientist Bureau of Environmental and Occupational Epidemiology

Acknowledgments This study was supported by ATSDR and the Centers for Disease Control and Prevention

through a Cooperative Agreement Grant to NYS DOH (3U61TS200002-19W1)

The authors would like to thank the following people for their contribution to this project Alissa Caton PhD of the Bureau of Environmental and Occupational Epidemiology NYS DOH developed the study protocol assembled the preliminary data and conducted preliminary analyses Syni-an Hwang PhD Bureau Director and Shao Lin PhD Research Scientist Bureau of Environmental and Occupational Epidemiology NYS DOH and Edward Fitzgerald PhD Professor Division of Environmental Health and Toxicology SUNY Albany School of Public Health reviewed and provided comments on draft versions

New York State Department of Health Cooperative Agreement Coordinator Don Miles MS

Public Health Specialist Bureau of Environmental Exposure Investigation

ATSDR Technical Project Officer Gregory V Ulirsch PhD

Environmental Health Scientist Cooperative Agreement and Program Evaluation Branch

Division of Health Assessment and Consultation

Mohammed Uddin MD Epidemiologist

Division of Health Studies

ATSDR Regional Representatives Leah Graziano

Senior Regional Representative - Region 2 Office of Regional Operations

Racquel Stephenson Regional Representative - Region 2

Office of Regional Operations

27

REFERENCES Agency for Toxic Substances and Disease Registry (ATSDR) 2000 Toxicological Profile for Polychlorinated Biphenyls (PCBs) ATSDR Division of ToxicologyToxicology Information Branch Atlanta GA

Arisawa K Takeda H Mikasa H 2005 ldquoBackground exposure to PCDDsPCDFsPCBs and its potential health effects a review of epidemiologic studiesrdquo The Journal of Medical Investigation Feb 52(1-2)10-21

Arsenescu V Arsenescu RI King V Swanson H Cassis LA 2008 ldquoPolychlorinated biphenylshy77 induces adipocyte differentiation and proinflammatory adipokines and promotes obesity and atherosclerosisrdquo Environmental Health Perspectives 116 (6) 761-768

Ayala C Croft J Keenan N Neff L Greenlund K Donohoo R Zheng Zhi Mensah G 2003 ldquoIncreasing Trends in Pre-transport stroke deaths- United States 1990-1998rdquo Ethnicity amp Disease 12 Spr S2131-S2137

Barnett E Armstrong D Casper M 1999 Evidence of increasing coronary heart disease mortality among black men of lower social class AEP 98 464-471

Bertazzi PA Bernucci I Brambilla G Consonni D Pesatori AC 1998 ldquoThe Seveso studies on early and long-term effects of dioxin exposure a reviewrdquo Environmental Health Perspectives April 1062 625-633

Carpenter DO 2006 ldquoPolychlorinated biphenyls (PCBs) routes of exposure and effects on human healthrdquo Reviews in Environmental Health Jan-Mar 21(1)1-23

deVreede-Swagemakers JJ Gorgels APDubois-Arbouw WI van Ree JW et al 1997 ldquoOut-of hospital cardiac arrest in the 1990s a population-based study in the Maastricht area on incidence characteristics and survivalrdquo Journal of the American College of Cardiology 30(6) 1500-5

Everett CJ Mainous AG Frithsen IL Player MS Matheson EM 2008 ldquoAssociation of polychlorinated biphenyls with hypertension in the 1999-2002 National Health and Nutrition Examination Surveyrdquo Environmental Research 108 94-97

Fitzgerald EF Belanger EE Gomez MI Hwang S Jansing RL Hicks HE 2007 ldquoEnvironmental exposures to polychlorinated biphenyls (PCBS) among older residents of upper Hudson River communitiesrdquo Environmental Research 104 352-360

Galea S Blaney S Nandi A Silverman R Vlahov D Foltin G Kusick M Tunik M Richmond N 2007 ldquoExplaining racial disparities in incidence of and survival from out-of-hospital cardiac arrestrdquo American Journal of Epidemiology 1665 534-543

Greenland S Morgenstern H 1989 ldquoEcological bias confounding and effect modification International Journal of Epidemiology 18269-74

28

Greenland S 1992 ldquoDivergent biases in ecologic and individual level studiesrdquo Statistics in Medicine 111209-23

Greenland S Robins J 1994 ldquoInvited commentary ecologic studies ndash biases misconceptions and counterexamples American Journal of Epidemiology 139747-60

Greenland S Pearl J Robins JM 1999 Causal diagrams for epidemiologic research Epidemiology 1037-48

Greenland S 2003 ldquoQuantifying biases in causal models classical confounding vs collidershystratification biasrdquo Epidemiology May Vol 143 300-306

Ha M Lee D Jacobs DR 2007 ldquoAssociation between serum concentrations of persistent organic pollutants and self-reported cardiovascular disease prevalence results from the national health and nutrition examination survey 1999-2002rdquo Environmental Health Perspectives 1158 1204-1209

Hennig B Meerarani P Slim R Toborek M Daugherty A Silverstone AE Robertson LW 2002 ldquoProinflammatory properties of coplanar PCBs in vitro and in vivo evidence Toxicology and Applied Pharmacology Jun 15181(3)174-83

Hertz-Picciotto I ldquoEnvironmental Epidemiologyrdquo in Modern Epidemiology Rothman KJ and Greenland S 2nd ed Philadelphia Lippincott Williams amp Williams 1998 pp555-584

Howard G Cushman M Prineas R Howard V Moy C Sullivan L DrsquoAgostino R McClure L Pulley L Safford M 2009 ldquoAdvancing the hypothesis that geographic variations in risk factors contribute relatively little to observbed geographic variations in heart disease and stroke mortalityrdquo Preventive Medicine 49 129-132

Huang X Lessner L Carpenter DO 2006 ldquoExposure to persistent organic pollutants and hypertensive diseaserdquo Environmental Research 102(2005) 101-106

Kannel WB Cupples LA DrsquoAgostino RB 1987 ldquoSudden death risk in overt coronary heart disease the Framingham Studyrdquo American Heart Journal 113(3) 799-804

Ko DT Krumholz HM Wang Y Foody JM Masoudi FA Havranek EP You JJ Alter DA Stukel TA Newman AM Tu JV 2007 ldquoRegional differences in process of care and outcomes for older acute myocardial infarction patients in the United States and Ontario Canadardquo Circulation 115196-203

Lee D Lee I Song K Steffes M Toscano W Baker BA Jacobs DR 2006 ldquoA strong dose-response relation between serum concentrations of persistent organic pollutant and diabetesrdquo Diabetes Care 29 (7) 1638-1644

Lee D Lee I Porta M Steffes M Jacobs DR 2007 ldquoRelationship between serum concentrations of persistent organic pollutants and the prevalence of metabolic syndrome among non-diabetic

29

adults results from the National Health and Nutrition Examination Survey 1999-2002rdquo Diabetologia 50 1841-1851

Lind PM Orberg J Edlund U Sjoblom L Lind L 2004 ldquoThe dioxin-like pollutant PCB126 (33rsquo44rsquo5-pentachlorobiphenyl) affects risk factors for cardiovascular disease in female ratsrdquo Toxicology Letters 150 293-299

Longnecker MP Klebanoff MA Brock JW Zhou H 2001 ldquopolychlorinated biphenyl seum levels in pregnant subjects with diabetesrdquo Diabetes Care 24(6) 1099-1101

Lloyd-Jones D Adams R Carnethon et al (AHA Statistics Committee and Stroke Statstics Subcommittee) 2009 ldquoAHA Statistical Update Heart disease and stroke statisticsmdash2009 updaterdquo Circulation 119e21-e181

MapMarker Plus V 100 MapInfo Corp 2004

Morgenstern H 1998 ldquoEcologic studyrdquo In Armitage P Colton T eds Encyclopedia of biostatistics Vol2 NY NY John Wiley and Sons Inc 1255-76

Morganstern H ldquoEcologic Studiesrdquo Chapter 23 in Modern Epidemiology Rothman KJ and Greenland S 2nd ed Philadelphia Lippincott Williams amp Williams 1998 pp459-480

Morland K Wolff M Bopp R Godbold J Landrigan P 2008 ldquoFish consumption and body burden of organochlorines among Lower Hudson urban anglersrdquo American Journal of Industrial Medicine 51587-594

New York State Department of Health (NYS DOH) 2009a ldquoThe Burden of Cardiovascular Disease in New York Mortality Prevalence Risk Factors Costs and Selected Populationsrdquo Bureau of Chronic Disease Epidemiology and Surveillance Bureau of Health Risk Reduction httpwwwhealthstatenyusnysdohchronic_diseasecardiovascularburdenofcvdinnyspdf

New York State Department of Health (NYS DOH) 2009b ldquoHeart Disease and Stroke Indicatorsrdquo Community health assessment indicators Bureau of Biometrics and Health Statistics httpwwwhealthstatenyusstatisticschacchai

OrsquoNeill L 2003 ldquoEstimating out-of-hospital mortality due to myocardial infarctionrdquo Health Care Management Science 6 147-154

Palmer PM Belanger EE Wilson LR Hwang SA Narang RS Gomez MI Cayo MR Durocher LA Fitzgerald EF ldquoOutdoor air PCB concentrations in three communities along the Upper Hudson River New Yorkrdquo 2008 Archives of Environmental Contamination and Toxicology 54363-371

Piantadosi S Byar DP Green SB 1988 ldquoThe ecological fallacyrdquo American Journal of Epidemiology 127893-904

30

Rehkopf DH Berkman LF Coull B Krieger N 2008 ldquoThe non-linear risk of mortality by income level in a healthy population UgtSgt National Health and Nutrition Examination Survey mortality follow-up cohort 1988-2001rdquo British Medical Journal Public Health 8383

Richardson S Montifort C 2000 ldquoEcological correlation studies In Elliott P Wakefield JC Best NG et al eds Spatial epidemiology methods and applications NY NY Oxford University Press 205-220

SAS version 9 Cary North Carolina

Schoenbach V 2001 ldquoMulticausality Effect Modification wwwepidemiolognet Ch12 381shy413

Schoenbach V 2004 ldquoMulticausality Confounding wwwepidemiolognet Ch11 335-375

Sergeev AV and Carpenter DO 2005 Hospitalization rates for coronary heart disease in relation to residence near areas contaminated with persistent organic pollutants and other pollutants Environmental Health Perspectives 113(6)756-761

Shcherbatykh I Huang X Lessner L Carpenter DO 2005 Hazardous waste sites and stroke in New York State Environmental Health A Global Access Science Source 4(18) httpwwwehjournalnetcontent4118

Siemiatycki J Wacholder S Dewar R Cardis E Greenwood C and Richardson L 1988 ldquoDegree of confounding bias related to smoking ethnic group and socio-economic status in estimates of the associations between occupation and cancerrdquo Journal of Occupational Medicine 30 617-625

Tu YK Gunnell D Gilthorpe MS 2008 ldquoSimpsonrsquos paradox Lordrsquos paradox and suppression effects are the same phenomenon- the reversal paradoxrdquo Emerging Themes in Epidemiology 52

United Kingdom Heart Attack Study Collaborative Group Norris RM 1998 ldquoFatality outside hospital from acute coronary event in three British health districts 1994-1995rdquo British Medical Journal 316(7137) 1065-70

United States Bureau of the Census 2000 Census of population and housing summary file 1 (SF1) United States Department of Commerce 2001

United States Bureau of the Census 2000 Census of population and housing summary file 3 (SF3) United States Department of Commerce 2002

United States Bureau of the Census 1990 Census of population and housing summary file 1 (SF1) United States Department of Commerce 1991

31

United States Bureau of the Census 1990 Census of population and housing summary file 3 (SF3) United States Department of Commerce 1992

United States Censusrsquo Monitoring Board ldquoFinal Report to Congressrdquo September 1 2001 httpgovinfolibraryunteducmbcmbpreportsfinal)report

United States Department of Health and Human Services (USDHHS) Agency for Healthcare Research and Quality 2007 ldquoAHRQ Quality Indicators Guide to Prevention Quality Indicators Hospital Admission for Ambulatory Care Sensitive Conditionsrdquo October 2001 Version 31 (March 12 2007) downloaded 2009 httpwwwqualityindicatorsahrqgov

United States Department of Health and Human Services (USDHHS) Centers for Disease Control 2007 Health United States A Chartbook National Center for Health Statistics wwwcdcgovnchsdatahushus07pdf

United States Department of Health and Human Services (USDHHS) Centers for Disease Control National Center for Health Statistics 2007 ldquo2005 National Hospital Discharge Surveyrdquo No 385 by De Frances C and Hall M Division of Health Care Statistics

United States Department of Health and Human Services (USDHHS) 2009 Centers for Disease Control and Prevention National Center for Environmental Health 4th National Report on Environmental Chemicals httpwwwcdcgovexposurereportpdfFourthReportpdf

United States Department of Health and Human Services (USDHHS) National Institutes of Health 2010 National Institute of Environmental Health Sciences National Toxicology Program 11th Report on Carcinogens httpntpniehsnihgovntproceleventhprofiless149pcbpdf

United States Environmental Protection Agency (USEPA) 2002 EPA Superfund record of decision Hudson River PCBs Site Hudson Falls to New York City New York Washington DC

Wakefield J 2003 Sensitivity Analyses for Ecological Regression Biometrics March 59 9-17

Wakefield J 2004

Wing S 1988 Social inequalities in the decline of coronary mortality American Journal of Public Health 781415-1416

World Health Organization 2010 Atlas of Heart Disease and Stroke Part 2 Ch 3 Cardiovascular Disease Risk Factors httpwwwwhointcardiovascular_diseasesencvd_atlas_03_risk_factorspdf

32

Table 1 Previous Study Results Statewide Analyses and Hudson River Specific Analyses

Principal Diagnosis of Hospitalization

Adjusted Rate Ratioa (Lower 95CI shyUpper 95 CI)

NYS analysis Hudson River Subset

analysis Ischemic Heart Diseaseb

115 (103-129) 136 (118-156) Acute Myocardial Infarction 120 (103-139) 139 (118-163)

Cerebrovascular Diseasec 115 (105-126) 120 (110-132)

Ischemic Stroke 117(104-139) --shy Hemorrhagic Stroke 110 (099-122) ---

Hypertensionc 119 (108-131) 114 (105-123)

Table adapted from Sergeev et al 2005 Shcherbatykh et al 2005 and Huang et al 2006 Ischemic heart disease and stroke hospitalization rates for NYS ZIP codes containing (or

adjacent to) inactive hazardous waste sites with persistent organic pollutants compared to ZIP codes without (or not adjacent to) such sites

a All analyses include a 2nd exposure category for other types of inactive hazardous waste sites and adjust for age race and sex

b Ischemic heart disease analyses include all ages over 25 Ischemic heart disease analyses adjust for quartiles of median household income and health insurance coverage

c Cerebrovascular disease and hypertension analyses include ages 25-64

33

Table 2 Comparison of Design amp Methods for Previously Published Studies and Current Follow-up Study

OUTCOME ASSESSMENT CONFOUNDER ASSESSMENT EXPOSURE ASSESSMENT Previous Study Follow-up Study Previous Study Follow-up Study Previous Study Follow-up Study

Coronary Heart Disease 1993-2000 More years 1990-2005 ZIP code level Block group level ZIP code level Block group level

Ischemic heart disease (ICD-9 410shy Same outcomes with 4 levels of income ndash Same income Exposure = residence Two types of exposure 414) and sequelae of myocardial one addition separate 2000 Census categories within 196 ZIP codes indicator categories within infarction not elsewhere classified analysis of angina (ICDshy quartiles with hazardous waste 12 counties abutting (ICD-9 4297) Separate analysis of 9 413) sites containing POPs Hudson River acute myocardial infarction 6 age groups Same age groups or 222 ZIP codes with (ICD-9 410) age 25 and above hazardous waste sites Adjacency 255 block

without POPs versus groups adjacent versus Principal or other diagnoses (up to 14) Only principal diagnoses 4 race groups Different race residence within 996 2116 not adjacent to the combined AsianPacific categories White ldquocleanrdquo ZIP codes Hudson River

Use the admission Islander White black AsianPacific statewide Reference Sergeev AV Carpenter source field to eliminate black Native Island and Other which Distance Block group DO Hospitalization rates for coronary intrahospital transfers for American includes Native Sub-analyses (focus of population-weighted heart disease in relation to residence the same individual American and Multiple follow-up) Hudson centroid distance from the near areas contaminated with Race River area 78 POP ZIP Hudson River lt12 mile persistent organic pollutants and other codes versus 996 ldquocleanrdquo (225 block groups) pollutants Environ Health Perspect Add education ZIP codes statewide between frac12 and 1 mile 2005 113(6)756-761 population density (216 block groups) farther

hispanic and distance to than 1 mile (1930 block nearest hospital as groups) additional covariates

Stroke 1993-2000 More years 1990-2005 Restricted to 2nd and (same as above) (same as above) (same as above) 3rd quartile income

Cerebrovascular disease excluding Same outcomes ZIP codes other ill-defined and late effects of cerebrovascular disease (ICD9 430shy 4 age groups 25-64 436) Separate analyses of ischemic stroke (433x1 434x1 436) and 2 race groups hemorrhagic stroke ( 430-432) White black

Principal or secondary diagnoses (up Only principal diagnoses to 14) combined

Use the admission Reference Shcherbatykh I Huang X source field to eliminate Lessner L Carpenter DO Hazardous intrahospital transfers for waste sites and stroke in New York the same individual State Environ Health 2005 418

34

Table 3 Study Cardiovascular Disease Outcomes ICD-9-CM Codes and Definitions

Ischemic Heart Disease

ICD-9-CM 410 - ldquoAcute myocardial infarction (AMI)rdquo is defined as a sudden insufficiency of blood supply to an area of the heart muscle usually due to a coronary artery occlusion (obstruction) commonly known as heart attack

ICD-9-CM 411- ldquoOther acute and subacute forms of ischemic heart diseaserdquo refers to complications or symptoms occurring before or after AMI or a coronary artery occlusion (obstruction) interrupting blood flow to the heart without AMI

ICD-9-CM 412 - ldquoOld myocardial infarctionrdquo refers to a past AMI that is currently presenting no symptoms ICD-9-CM 413 - ldquoAngina pectorisrdquo is defined as severe constricting pain in the chest often radiating from the

area immediately over the heart to the left shoulder and down the arm ICD-9-CM 414 - ldquoOther forms of chronic ischemic heart diseaserdquo includes coronary atherosclerosis a chronic

condition marked by thickening and loss of elasticity of the coronary artery caused by deposits of plaque

ICD-9-CM 4297 - ldquoCertain sequelae of myocardial infarction not elsewhere classifiedrdquo refers to other symptoms or complications associated with a prior AMI

Cerebrovascular Disease

ICD-9-CM 430 - ldquoSubarachnoid hemorrhagerdquo is bleeding in the space between the brain and lining ICD-9-CM 431 - ldquoIntracerebral hemorrhagerdquo is bleeding within the brain ICD-9-CM 432 - ldquoOther and unspecified intracranial hemorrhagerdquo is bleeding nontraumatic between the skull

and brain lining ICD-9-CM 433 - ldquoOcclusion and stenosis of precerebral arteriesrdquo is blockage or stricture of the arteries

branching into the brain ICD-9-CM 434 - ldquoOcclusion of cerebral arteriesrdquo includes cerebral thrombosis and cerebral embolism ICD-9-CM 435 - ldquoTransient cerebral ischemiardquo includes cerebrovascular insufficiency (acute) with transient

neurological signs and symptoms ICD-9-CM 436 - ldquoAcute but ill-defined cerebrovascular diseaserdquo

35

Table 4 ndash Census Demographics for Study Area by Proximity to the Hudson River 2000

Census Demographics Study Area Within Block Groups Adjacent to River Block Group Centroid Distance to River Yes No lt frac12 mile frac12-1 mile gt 1 mile

n n n n n n Total Population and distribution within river distance categories

2924631 1000 315662 108 2608969 892 231414 79 283321 97 2409896 824

RaceEthnic Distribution 1000 1000 1000 1000 1000 1000 White 2383274 815 255023 808 2128251 816 154371 667 201457 711 2027446 841 Black 277302 95 34039 108 243263 93 43895 190 38959 138 194448 81 Total ldquoOtherrdquo (listed below)

American Indian 7147 02 882 03 6265 02 975 04 1058 04 5114 02

Asian 86465 30 6233 20 80232 31 4711 20 8979 32 72775 30 Pacific Islander 1058 lt01 111 lt01 947 lt01 104 lt01 145 lt01 809 lt01 Other 105182 36 12492 40 92690 36 19288 83 23338 82 62556 26 Multiple Races 64203 22 6882 22 57321 22 8070 35 9385 33 46748 19

Hispanic 268036 92 28449 90 239587 92 40691 176 46978 166 180367 75

Education lt9th grade 108312 56 13370 64 94942 55 13585 91 14518 81 80209 50 9th-12th grade 200374 103 27023 130 173351 100 23306 155 25161 140 151907 94 High school or GED 519236 267 62727 302 456509 263 44366 296 51585 287 423285 263 Some college 336283 173 36464 175 299819 173 24995 167 31141 173 280147 174 Associates degree 150134 77 16574 80 133560 77 10134 68 12939 72 127061 79 Bachelors degree 342297 176 28578 137 313719 181 18898 126 24584 137 298815 185 Graduate or professional degree

285823 147 23199 112 262624 151 14813 99 19909 111 251101 156

+Other SES Factors Households below poverty 91940 87 12875 108 79065 83 13977 154 14685 140 63278 72

mean SD mean SD mean SD mean SD mean SD mean SD Population density 57466 9589 54496 11391 57826 93468 13501 16071 11068 16248 43763 66909 Median household income 61010 31989 47320 23630 62668 32468 40748 21923 44962 20829 64847 32564

Sources US Bureau of the Census 2000 Census of population and housing summary file 1(SF1) US Department of Commerce 2001 US Bureau of the Census 2000 Census of population and housing summary file 3 (SF3) US Department of Commerce 2001

36

Table 5 Study Population Person Years by River Proximity Category For Income Quartiles and Race Categories

Person Years Total Adjacent Non-Adjacent lt12 mile frac12 to 1 mile gt1 mile n (row)

[column ] n (row)

[column ] n ()

[column ] n ()

[column ] n ()

[column ] n ()

[column ] Total Population 30219001 (1000)

[1000]

3269547 (108)

[1000]

26949454 (892)

[1000]

2380477 (079)

[1000]

2834956 (094)

[1000]

25003568 (827)

[1000]

Income Quartile

Lowest Income 7182853 (1000)

[237]

1345251 (187)

[411]

5837601 (813)

[217]

1462710 (204)

[614]

1236936 (172)

[436]

4483206 (624)

[179]

2nd Income 7778921 (1000)

[257]

1077888 (139)

[330]

6701033 (861

[249]

420361 (054)

[177]

808782 (104)

[285]

6549777 (842)

[262]

3rd Income 7671943 (1000)

[254]

551617 (072)

[169]

7120326 (928)

[264]

344840 (045)

[145]

511237 (067)

[180]

6815866 (888)

[273]

Highest Income 7585283 (1000)

[251]

294790 (039)

[090]

7290493 (961)

[271]

152565 (020)

[064]

278000 (037)

[098]

7154718 (943)

[286]

Race Category

White 25859811 (1000)

[856]

2784914 (108)

[851]

23074897 (892)

[856]

1761910 (68)

[740]

2238348 (87)

[790]

21859554 (845)

[874]

Black 2466185 (1000

[82]

297220 (121)

[91]

2168965 (879)

[81]

389721 (158)

[164]

316704 (128)

[112]

1759760 (714)

[70]

Asian 792737 (1000)

[26]

58007 (73)

[18]

734731 (827)

[27]

45151 (57)

[19]

78161 (99)

[28]

669426 (844)

[27]

Multiple Other 1100269 (1000)

[36]

129407 (118)

[40]

970862 (882)

[36]

183696 (167)

[77]

201743 (183)

[71]

714830 (650)

[29]

37

Table 6 Study Hospitalization Record Exclusions

After After dropping After imputing dropping records with

After excluding race amp poorly missing continuation After excluding eliminating geocoded covariates

Geocoded records transfers missing race records (racesex)

Total 202272 201814 02 169143 162 167733 08 154236 80 154220 001 (Loss ) Demograp hic group

White 155216 849 154805 849 132749 859 144262 860 132343 858 132332 858 Black 12878 70 12849 70 11109 72 12010 72 11328 73 11326 73 OtherNa 12745 70 12741 70 8814 57 9313 55 8647 56 8644 56 AsianPI 2021 11 2020 11 1810 12 2148 13 1918 12 1918 12 Missing Race 19412 96 19399 96 14661 87 (1410) Diagnosis

IHD 137569 680 137303 680 111005 656 110005 656 110000 656 101012 655

AMI 51840 256 51635 256 39338 233 39338 233 39013 233 35811 232

Angina Stroke

5792 64703

29 320

5788 64511

29 320

5316 58138

31 344

5316 58138

31 344

5285 57712

31 344

4731 53208

31 345

Isch str 23673 117 23543 117 21012 124 21012 124 20912 125 19371 126 Hem str 8097 40 8074 40 6544 39 6444 39 6492 39 5995 39

Missing covariates 11 records were missing sex 325 were missing age and 14661 records were missing race Race was imputed by random assignment based on block group of residencersquos race distribution for 13251 records (310 were also missing other covariates For approximately 1100 records with missing race geocoding was inadequate for assigning block group and 310 of those missing race were also missing additional covariates) One discharge can have one or more covariates missing (ie missing covariates are not mutually exclusive

38

Table 7 Income Quartile and Race Distributions for CVD Hospitalizations

Study Population

IHD n=101012

AMI n=35811

Angina n=4731

Stroke n=53208

Ischemic Stroke n=19371

Hemorrhagic Stroke n=5995

Income (Quartiles) Lowest Income 237 286 300 356 298 309 301 Second Income 257 254 265 237 256 256 252 Third Income 254 241 236 224 236 234 223 Highest Income 251 221 201 182 209 201 224

Total 1000 999 999 1000 1000

White 855 859 885 833 858 844 809 Black 82 64 62 102 92 108 110 Asian 26 12 12 11 12 14 19 Other 36 65 41 53 38 34 62

Study population income quartiles are based on census block median income Quartile cut-points were developed by weighting block group median income levels by block group population to identify the cut-points within the continuous income distribution After creating the cut-points the study population of each block group was assigned to the appropriate quartile category This resulted in the percentage of population in each quartile varying slightly from 25 because the entire populations of block groups were assigned to the same quartile

39

Table 8a Exposure Area Distributions and Rate Ratios for Ischemic Heart Disease Hospitalizations

Exposure Person-Years

(N=30219001) in

Ischemic Heart Disease Hospitalizations Total AMI Angina

No () No () RR (95 CI) No () RR (95 CI) No () RR (95 CI)

Adjusted for Age Sex Race Adjacent Block Group Yes 3269547 (108) 12327 (122) 133 (112-157) 4398 (123) 136 (114-163) 591 (125) 123 (106-142) No 26949454 (892) 88611 (878) 100 Reference 31380 (877) 100 Reference 4137 (875) 100 Reference Distance From River lt frac12 mile 2380477 (079) 9817 (97) 120 (103-140) 3399 (095) 126 (109-145) 542 (115) 160 (137-188) frac12 - 1 mile 2834956 (094) 11236 (111) 120 (104-140) 4048 (113) 125 (109-144) 610 (129) 156 (133-183) gt1 mile 25003569 (827) 79885 (791) 100 Reference 28331(792) 100 Reference 3576 (756) 100 Reference

Addrsquol adjustment for SES and other factors Adjacent Block Group Yes 3269547 (108) 12327 (122) 107 (104-111) 4398 (123) 106 (101-110) 591 (125) 106 (096-117) No 26949454 (892) 88611 (878) 100 Reference 31380 (877) 100 Reference 4137 (875) 100 Reference Distance From River lt frac12 mile 2380477 (079) 9817 (97) 102 (098-106) 3399 (095) 103 (098-109) 542 (115) 109 (098-121) frac12 - 1 mile 2834956 (094) 11236 (111) 103 (099-108) 4048 (113) 106 (101-111) 610 (129) 113 (102-125) gt 1 mile 25003569 (827) 79885 (791) 100 Reference 28331(792) 100 Reference 3576 (756) 100 Reference

Agegt25 1990-2005 50 sample Adjusted for age (25-34 35-44 45-54 55-64 65-74 75+ years) race (white black other) and sex (male female) Adjusted for SES factors quartiles of population density (61-5066 5066lt-21020 21020lt-64102 64102lt-943716 persons per square mile) median 1999 household income ($249900-4096800 $4096800lt-5465000 $5465000lt-7294400 $7294400lt-20000100) with less than a college education (0-306 306lt-427 427lt-555 555lt-1000) with Hispanic ethnicity (0-21 21lt-47 47lt- 96 96lt-860) and distance to nearest hospital

40

Table 8b Exposure Area Distributions and Rate Ratios for Stroke Hospitalizations Age gt= 25 1990-2005

Exposure Person-Years

(N=30219001) in Study Area

Stroke Hospitalizations

Total Ischemic Hemorrhagic

No () No () RR (95 CI) No () RR (95 CI) No () RR (95 CI)

Adjusted for Age Sex Race Adjacent Block Group Yes 3269547 (108) 6276 (118) 125 (109-144) 2334 (121) 128 (110-148) 635 (108) 099 (086-113)

No 26949454 (892) 46748 (882) 100 Reference 16989 (879) 100 Reference 5263 (892) 100 Reference

Distance From River lt frac12 mile 2380477 (079) 5280 (099) 116 (104-131) 1925 (100) 120 (105-137) 518 (088) 102 (088-117)

frac12 - 1 mile 2834956 (094) 6023 (114) 118 (106-132) 2227 (115) 127 (112-145) 658 (111) 115 (100-131)

gt1 mile 25003569 (827) 41721 (787) 100 Reference 15171 (785) 100 Reference 4722 (801) 100 Reference

Addl Adjustment for SES and Other Factors Adjacent Block Group Yes 3269547 (108) 6276 (118) 107 (103-112) 2334 (121) 110 (104-116) 635 (108) 097 (088 ndash 106)

No 26949454 (892) 46748 (882) 100 Reference 16989 (879) 100 Reference 5263 (892) 100 Reference Distance From River lt frac12 mile 2380477 (079) 5280 (099) 106 (101-110) 1925 (100) 108 (101-115) 518 (088) 089 (081-099)

frac12 - 1 mile 2834956 (094) 6023 (114) 106 (102-111) 2227 (115) 111 (104-117) 658 (111) 108 (099-119)

gt 1 mile 25003569 (827) 41721 (787) 100 Reference 15171 (785) 100 Reference 4722 (801) 100 Reference

Age gt 25 years 1990-200650 sample Adjusted for age (25-34 35-44 45-54 55-64 65-74 75+ years) race (white black other) and sex (male female) Adjusted for SES factors quartiles of population density (61-5066 5066lt-21020 21020lt-64102 64102lt-943716 persons per square mile) median 1999 household income ($249900-4096800 $4096800lt-5465000 $5465000lt-7294400 $7294400lt-20000100) with less than a college education (0-306 306lt-427 427lt-555 555lt-1000) with Hispanic ethnicity (0-21 21lt-47 47lt- 96 96lt-860) and distance to nearest hospital

41

Table 9 Rate Ratios for CVD Hospitalizations Stratified by Income Quartile

Rate Ratio for lt frac12 mile versus gt 1 mile from river

IHD AMI Angina Stroke Ischemic Stroke Hemorrhagic Stroke

RR (95CI) RR (95CI) RR (95CI) RR (95CI) RR (95CI) RR (95CI)

Adjusted for Age Sex Race Lowest Income 1 123 (108-140) 124 (110-139) 139 (120-160) 120 (109-133) 122 (107-138) 118 (103-137) Income 2 115 (101-131) 118 (101-139) 155 (124-195) 124 (113-135) 127 (109-147) --shyIncome 3 088 (075-103) 083 (068-102) 102 (076-137) 092 (079-107) 093 (076-113) --shyHighest Income 4 080 (064-099) 074 (054-103) 030 (013-070) 078 (065-093) 071 (053-095) 043 (024-076) Weighted average 100 098 091 102 100 071

Addrsquol Adjustment for other demographic factors Lowest Income 1 114 (107-120) 114 (106-123) 112 (097-130) 116 (109-124) 120 (110-131) 109 (095-125) Income 2 110 (102-118) 111 (101-123) 136 (108-172) 110 (101-119) 112 (099-128) 095 (076-119) Income 3 085 (077-094) 086 (075-097) 093 (068-126) 097 (087-108) 096 (082-112) 046 (033-066) Highest Income 4 068 (060-078) 069 (057-085) 028 (012-062) 069 (059-081) 058 (044-077) --shyWeighted average 092 093 079 096 093 078

To reduce the complexity of this Table only one of the distance categories is provided The other distance categories showed similar results Adjusted for age (25-34 35-44 45-54 55-64 65-74 75+ years) race (white black other) and sex (male female) Block group median income quartile ranges $249900-4096800 $4096800lt-5465000 $5465000lt-7294400 $7294400lt-20000100 Adjusted for demographic factors quartiles of population density (61-5066 5066lt-21020 21020lt-64102 64102lt-943716 persons per square mile) with less than a college education (0-306 306lt-427 427lt-555 555lt-1000) with Hispanic ethnicity (0-21 21lt-47 47lt- 96 96lt-860) and distance to nearest hospital

42

Table 10 Hospitalization Rate Estimates for Current Study and from Published Sources (age-adjusted to 2000 US standard per 10000)

Current Study 12 Counties abutting Hudson River 1990-2005

Total US CDC 2005

Northeastern US CDC 2005

SPARCS NYS excluding NYC 2005-2007

IHD 649 619 691

AMI 232 231 278

Angina 31 Stroke 349 303 307 351

Ischemic Stroke 127 39 Hemorrhagic Stroke

Source USDHHS 2005 National Hospital Discharge Survey No 385 July 12 2007 Source NYS DOH SPARCS 2009b Comparable estimates were not able to be provided because published outcome groupings differed

43

Figure 1 ZIP codes by Adjacency to the Hudson River

44

Figure 2 Census Block Groups by Adjacency to the Hudson River

45

Figure 3 Census Block Groups by Population-Weighted Centroid Distance to the Hudson River

46

IHD Hospitalization Rates by Sex Race Income (age-adjusted) (other race

group excluded)

000

200

400

600

800

1000

1200

1400

1600

Low est IncQ

Q2 Q3 Highest Inc Q

Income Quartile

Hos

pita

lizat

ions

per

10

00

w hite men black men

asian men w hite w omen

black w omen asian w omen

AMI Hospitalization Rates by Sex Race Income (age-adjusted) (other race

group excluded)

000

100

200

300

400

500

600

700

Low est IncQ

Q2 Q3 Highest Inc Q Income Quartile

Hos

pita

lizat

ions

per

10

00

w hite men black men

asian men w hite w omen

black w omen asian w omen

Stroke Hospitalization Rates by Sex Race Income (age-adjusted) (other

race group excluded)

000

100

200

300

400

500

600

700

800

900

Low est In Q

Q2 Q3 Highest Inc Q Income quartile

Hos

pita

lizat

ions

per

10

00w hite men black men

asian men w hite w omen

black w omen asian w omen

Figure 4 Figure 5 Figure 6

47

MAURICE D HINCHEY 22NO DISTRICT NEW YORK

COMMITTEE ON APPROPRIATIONS

~lIC()foltIIInas

AGRICUJTlAE RURAL OEElOPWNT rooo AND ORUG ADMINISTRATION

AND RBATEO AGEfjC~S

INTERIOR

~onlJltss of tl)t ~nttn ~tatt5 ~oU5t of ItpttUtlltl1tibeg lHaJington 111laquo 20515-3222

October 4 2006

Howard Frumkin MD DrPH Director Agency for Toxic Substances and

Disease Registty and CDCs National Center for Environmental Health

1600 Clifton Road NE MSE-28 Atlanta GA 30333

Dear Dr Frumkin

First I would like to congratulate you on your appoinbnent as Director of A TSDR and CDCs National Center for Environmental Health llook forward to the opportunity to work with you on important public health issues this year and in the future

My second reason for writing today is to request that ATSDR assist my office in conducting a health consultation study for residents along a 200 mile segment of the Hudson River a Superfund site on the National Priority List containing PCBs in order to measure whether further evaluation may be warranted

As you may know Dr David Carpenter has done a number of studies at SUNY Albany in New York that suggest that resideilcy in proximity to hazardous waste sites with Persistant Organic Pollutants (POPs) may be associated with increased hospitalization rates for Coronary Heart Disease (CHD) when compared to hospitalization rates tor areas with no waste sites or other types of waste not catagorized as POPs I have enclosed a copy of three of these research papers that demonstrate this correlation for heart disease hypertension and stroke

I am very concerned about the health of the residents living along the Hudson River and respectfully request that your agency look into what seems to be a very serious health threat For purposes of my request T would appreciate your agency looking into the subset population Dr Carpenter examined in 78 zip codes along the 200 mile stretch of the Hudson River from Hudson Falls to New York City This subset population studied by Dr Carpenter showed a statistically significant increase in hospital discharges for heart disease hypertension and stroke Interestingly according to CDCs data base the Behavioral Risk Factor Surveillance System this population on average has higher incomes exercises more frequently and conswnes more fruits and vegetables than the average New Yorker Additionally this population has more nonsmokers than the rest of Upstate New Yorks population

WASHINGTON OIFCE

2411 RAYBURN HOUSE OfFICE BUILDING WASH1NcrroN DC 20amp1amp-3222

(Zl2) 225-6335

_fOMetoYht lnchIY

BlNGI-UMTON OfRCE m-wA OfFICE KIGSTCN OFFIepound MIDDLnWJN OffICE MOJmCEllO OffiCE IDDA IfOCML lUllDlNG 123 SOUTH CAYJOA smra U l WAll

m STREET em HALL THlflD flOOR 18 ANWANf LAKf CAD

RIIrlGHAtATON NY 13901 SUITE 201 KINGSTOfl 12401 te JUpoundS STilEET MONTlCfllO NY 12701 (6171773-2768 ITHACA HY 14860 (M5)3~ MlDgtlETOWN NY 109(0 (amptS) 7S1-71e

(607) 273-~388 (845) l4C-3l11

Appendix A Representative Hinchey Letter

Dr Howard Frumkin Page 2 October 4 2006

Since this type of population would typically exhibit better than average cardiac health Im sme you can understand and appreciate my concern and sense of urgency that we must follow up on Dr Carpenters good work and detcnnine what further study is warranted to protect the health of the populatioD living and working aloDg the Hudson River

I than1c you for yom kind attention to my request I look forward to hearing back from you at your earliest convenience

Best regards

Sincerely

~ane~ey Member of Congress

MHlag

49

Appendix B Table 1a IHD Hospitalization Rate Ratios agesexrace adjusted versus additionally adjusted For residents (age 25 and older) of block groups closer versus more distant from the Hudson River in models adjusting for sex age race and block group median household income population density education level percent Hispanic and block group centroid distance to nearest hospital 1990-2005

Adjusted Rate Ratios by Block Group River Proximity (95 Confidence Intervals) IHD (N=101012) AMI (N=35811) Angina (N=4731)

Risk Factor Agesexrace adjusted

Addrsquol adjustment Agesexrace adjusted

Addrsquol adjustment Agesexrace adjusted

Addrsquol adjustment

Block group distance from River

lt frac12 mile vs gt 1 mile 120 (103-140) 102 (098-106) 126 (109-145) 103 (098-109) 160 (137-188) 109 (098-121) frac12 - 1 mile vs gt 1 mile 120 (104-140) 103 (099-107) 125 (109-144) 106 (101-111) 156 (133-183) 113 (102-125)

Sex Male vs female 197 (174-223) 214 (208-220) 211 (187-238) 209 (202-215) 113 (099-129) 113 (106-121) Race Black vs white 120 (102-142) 093 (089-097) 109 (094-127) 092 (087-097) 178 (152-208) 121 (108-136)

AsianPacific Is ldquo 080 (066-096) 073 (069-079) 085 (070-103) 072 (065-081) 075 (055-102) 065 (049-087) Other ldquo 246 (208-291) 377 (363-392) 189 (162-220) 204 (191-218) 257 (215-307) 205 (177-238)

Age 35-44 vs 25-34 809 (630-104) 824 (743-913) 647 (494-849) 658 (568-762) 491 (348-692) 498 (376-660) 45-54 ldquo 333 (261-426) 344 (312-380) 228 (175-296) 230 (200-265) 1539 (1108-214) 157 (120-206) 55-64 ldquo 827 (647-1057) 802 (727-885) 576 (443-748) 516 (449-593) 278 (200-386) 284 (217-371) 65-74 ldquo 1477 (1155-1890) 1384 (1255-1527) 1067 (820-1387) 899 (782-1033) 409 (294-567) 449 (344-586) 75+ ldquo 2124 (1656-2724) 1942 (1760-2142) 2190 (1683-2849) 1849 (1610-2123) 661 (476-919) 695 (533-905)

Population density

2nd lowest vs least dense

126 (121-132) 131 (124-137) 142 (127-159)

more dense ldquo 130 (124-136) 134 (127-141) 131 (116-147) most dense ldquo 120 (114-126) 121 (114-129) 115 (100-131)

Median Household Income

2nd highest vs highest income

105 (101-110) 113 (108-119) 123 (110-137)

lower ldquo 109 (104-113) 123 (117-130) 125 (111-141) lowest ldquo 116 (111-122) 129 (122-137) 162 (142-185)

with less than college

2nd lowest vs least college

114 (109-118) 118 (113-124) 110 (098-122)

higher ldquo 122 (117-127) 126 (119-132) 115(102-129) highest

ldquo

122 (116-128) 128 (120-135) 129 (113-148)

Hispanic 2nd lowest vs least Hispanic

093 (089-097) 094 (0890-098) 101 (091-112)

higher ldquo 092 (088-096) 096 (092-101) 118 (107-131) highest

ldquo

085 (081-088) 088 (084-093) 138 (123-154)

Distance to Hospital

2nd closest vs closest 097 (094-100) 094 (090-098) 107 (098-118) more distant ldquo 090 (087-094) 093 (089-098) 096 (086-107) furthest ldquo 087 (083-091) 091 (087-096) 089 (079-101)

50

Appendix B Table 1b Stroke Hospitalization Rate Ratios agesexrace adjusted versus additionally adjusted For residents (age 25 and older) of block groups closer versus more distant from the Hudson River in models adjusting for sex age race and block group median household income population density education level percent Hispanic and block group centroid distance to nearest hospital 1990-2005

Adjusted Rate Ratios by Block Group River Proximity (95 Confidence Intervals) Total Stroke (N=53208) Ischemic Stroke (N=19371) Hemorrhagic Stroke (N=5995)

Risk Factor Agesexrace adjusted

Addrsquol adjustment Agesexrace adjusted

Addrsquol adjustment Agesexrace adjusted

Addrsquol adjustment

Block group distance from River

lt frac12 mile vs gt 1 mile 116 (104-131) 106 (101-110) 120 (105-137) 108 (101-115) 102 (088-117) 089 (081-099) frac12 - 1 mile vs gt 1 mile 118 (106-132) 106 (102-111) 127 (112-145) 111 (104-117) 115 (100-131) 108 (099-119)

Sex Male vs female 114 (104-125) 127 (123-130) 126 (113-141) 131 (126-136) 114 (103-127) 123 (116-130) Race Black vs white 194 (173-219) 153 (147-160) 233 (204-267) 191 (180-203) 215 (188-246) 190 (173-209)

AsianPacific Isl ldquo 097 (083-113) 088 (080-096) 114 (094-138) 101 (088-116) 142 (114-177) 129 (106-157) Other ldquo 230 (203-260) 232 (219-245) 229 (197-265) 212 (193-232) 360 (310-417) 334 (296-377)

Age 35-44 vs 25-34 309 (251-379) 303 (269-343) 359 (267-483) 362 (289-455) 274 (212-353) 268 (219-329) 45-54 ldquo 114 (938-139) 1087 (970-1217) 149 (113-197) 135 (109-167) 655 (513-836) 605 (500-734) 55-64 ldquo 331 (273-402) 329 (295-368) 433 (329-569) 418 (340-514) 119 (933-152) 109 (904-132) 65-74 ldquo 826 (682-1001 861 (772-959) 1145 (872-1504) 1114 (908-1367) 214 (168-272) 224 (186-270) 75+ ldquo 1890 (1559-2290) 2037 (1828-2269) 2768 (2112-3627) 3060 (2497-3750) 554 (438-700) 587 (490-703)

Population density

2nd lowest vs least dense

126 (120-132) 124 (117-132) 129 (117-142)

more dense ldquo 126 (120-132) 121 (113-129) 142 (128-157) most dense ldquo 124 (118-132) 128 (119-138) 150 (133-169)

Median Household Income

2nd highest vs highest income

106 (101-110) 110 (103-117) 099 (090-108)

lower ldquo 113 (108-118) 113 (106-121) 107 (097-118) lowest ldquo 116 (110-122) 117 (108-126) 114 (102-127)

with less than college

2nd lowest vs least college

111 (107-116) 112 (106-119) 098 (090-107)

higher ldquo 120 (114-125) 120 (113-128) 102 (093-112) highest ldquo 116 (110-122) 119 (110-128) 093 (090-108)

Hispanic 2nd lowest vs least Hispanic

096 (092-100) 094 (088-099) 084 (077-091)

higher ldquo 101 (096-105) 096 (091-102) 084 (077-091) highest ldquo 096 (092-100) 088 (082-093) 078 (070-085)

Distance to Hospital

2nd closest vs closest 095 (091-098) 092 (087-097) 097 (089-105) more distant ldquo 092 (088-096) 091 (086-097) 089 (081-097) furthest ldquo 085 (081-090) 082 (077-088) 088 (079-098)

51

Appendix B Table 2 Sex and Race-Specific Age-Adjusted IHD AMI and Stroke Rates by Income (per 1000)

Men Women white black asian other white black asian other

IHD Lowest IncQ 470 401 492 940 283 354 477 543 Q2 425 339 675 1443 221 236 364 690 Q3 436 301 420 2463 199 233 236 902 Highest Inc Q 399 325 218 4307 160 172 120 1247 AMI Lowest IncQ 180 154 203 269 101 112 221 159 Q2 159 141 300 306 085 088 138 179 Q3 152 114 139 480 078 084 096 214 Highest Inc Q 135 130 046 820 062 065 052 275 Stroke Lowest IncQ 201 292 330 370 166 281 343 261 Q2 182 254 389 466 150 247 274 373 Q3 192 238 231 598 145 245 201 401 Highest Inc Q 177 275 102 905 132 197 070 776

These rates are estimated from the 50 sample of hospitalizations in the 12-county study area included in this investigation The rates are adjusted to the 2000 standard US population

52

CERTIFICATION

The health consultation for the Hudson River PCBs site Residential Proximity to the Hudson River and Hospitalization Rates for Ischemic Heart Disease and Stroke 1990shy2005 was prepared by the New York State Department of Health under a cooperative agreement with the Agency for Toxic Substances and Disease Registry (ATSDR) It is in accordance with approved methodology and procedures existing at the time the health consultation was initiated Editorial review was completed by the cooperative agreement partner

The Division of Health Assessment and Consultation (DHAC) ATSDR has reviewed this public health assessment and concurs with its findings

ATSDR

Page 3: HR CVD Final HCl 2011

HEALTH CONSULTATION

HUDSON RIVER PCBs

RESIDENTIAL PROXIMITY TO THE HUDSON RIVER AND HOSPITALIZATION RATES FOR

ISCHEMIC HEART DISEASE AND STROKE 1990-2005

Westchester Rockland Putnam Orange Dutchess Ulster Columbia Greene Rensselaer Albany Washington and Saratoga Counties

New York

EPA FACILITY ID NYD980763841

Prepared By

The New York State Department of Health Center for Environmental Health

Troy New York Under cooperative agreement with

The US Department of Health and Human Services Agency for Toxic Substances and Disease Registry

Division of Health Assessment and Consultation Atlanta Georgia

For additional information about this document you may contact the

New York State Department of Health Center for Environmental Health

547 River Street Flanigan Square Room 300 Troy New York 12180-2216

1-518-402-7950 or

1-800-458-1158

E-mail beoehealthstatenyus

Table of Contents

Executive Summary helliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphellipii

1 Introduction helliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphellip 1

2 Background helliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphellip 3 Cardiovascular Disease Hudson River PCB Contamination

` POPs and PCBs PCBs and Cardiovascular Disease Health Outcomes

3 Follow-up Study Data Sources and Methodshelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphellip8 Study Area and Exposure Areas

Hospitalization Data Statistical Analysis

4 Results helliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphellip 13 Demographics Associations with Proximity to the River

Statistical Analysis Proximity to the River and CVD Hospitalization Risk Summary of Results

5 Discussion helliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphellip 19

6 Conclusion and Recommendations helliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphellip25

7 Agency Information (Preparers of the Report) helliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphellip 27

8 References helliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphellip 28

9 Tables helliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphellip 33

10 Figures helliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphellip 44

11 Appendices helliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphellip 48

EXECUTIVE SUMMARY

Background In October 2006 Member of Congress Maurice Hinchey requested that ATSDR conduct a health consultation to assess cardiovascular disease among people residing near the Hudson River A 200-mile segment of the Hudson River from Hudson Falls south to New York City is a National Priorities List (NPL) (Superfund) site due to contamination of the river with polychlorinated biphenyls (PCBs) from GE sites in Hudson Falls and Ft Edward In support of his request Member of Congress Hinchey cited three studies of cardiovascular hospitalizations conducted with similar data and methods These 2004-2005 statewide studies suggest residents of ZIP codes containing hazardous waste sites with persistent organic pollutants (POPs) such as PCBs have elevated risk for hospitalizations for cardiovascular disease including ischemic heart disease stroke and hypertension Analyses conducted separately for ZIP codes next to the Hudson River suggest residents of these specific ZIP codes also have increased rates of hospitalization for cardiovascular disease

In response to Congressman Hincheyrsquos request for follow-up of the 2004-2005 findings for the Hudson River area ATSDR and NYS DOH agreed to conduct a similar group-level study but at a different level of geography the Census block group By using a level of geography that is generally smaller in area and population than a ZIP code this follow-up seeks to more accurately adjust for socio-economic and other factors that affect cardiovascular hospitalizations and to improve upon the geographic classification of relative distance to the Hudson River The follow-up investigationrsquos analyses are restricted to the twelve counties abutting the Hudson River from Washington County south to Westchester County Within the study arearsquos twelve counties cardiovascular hospitalization rates in areas closest to the Hudson River are compared with rates in areas farther from the river

Objectives The follow-up investigationrsquos specific objectives are to increase the studyrsquos ability to control for confounding by socio-economic status through the

use of block groups rather than ZIP codes as the unit of analysis compare hospitalization rates among residents of block groups nearest the Hudson River to

those among residents of block groups farther from the Hudson River and compare the findings of this review to those of the earlier studies

Methods This study examines CVD hospitalization rates and residence near the Hudson River in the 12 counties abutting the Hudson River below Hudson Falls from 1990 through 2005 The hospitalization data for ischemic heart disease and stroke were acquired from the NYS DOH Statewide Planning and Research Cooperative System (SPARCS) established in 1979 to collect detailed records on discharges from hospitals located in New York State Statewide US Census data at the block group level from 1990 and 2000 were used to calculate the number of people in each race sex and age group for the study years

Classification of the population into potential exposure categories was based exclusively on location relative to the Hudson River Two types of exposure proxy categories were developed block group adjacency and block group centroid distance to the river shore For the adjacency categories the hospitalizations were assigned to the adjacent category if the residential address was in a block group directly abutting the Hudson River For the distance categories block

ii

group centroid (population-weighted) distance to the river shore was used to assign hospitalizations in each block group to one of three categories less than one-half mile from the Hudson River (closest) between one-half to one mile (close) from the river and farther than one mile from the river (distant) Multivariable regression analyses were conducted to compare rates of hospitalizations among the exposure categories assigned at the block group level These analyses controlled for age sex and race using individual-level data and population density percent Hispanic median income education and distance to the nearest hospital at the block group level

Results The use of block groups resulted in 2371 block groups as the units of analysis for the study rather than the 368 ZIP codes in the 12-County study area The average population per block group in the study area is 1200 compared to average population of 8000 per ZIP code The smaller geographical area and smaller populations in block groups contributed to improving the accuracy of the assignment of income and other group-level demographics to the hospitalizations included in the study and enhancing control for confounding by socio-economic status (Tables 8a and 8b)

The studyrsquos multivariable analyses showed relatively small but statistically significant elevations of CVD hospitalization risk for residents of block groups categorized as adjacent in close and closest proximity to the Hudson River There is no evidence that residents of the closest block groups (less than frac12 mile) had higher hospitalization rates than residents of the close block groups (between frac12 to 1 mile) If there were a strong association between exposures from living near the river and the risk of CVD hospitalization such a ldquodose-responserdquo might have been shown by the analyses

The data showed strong statistically significant associations between CVD risk and socioshyeconomic status To enhance the control for socio-economic status separate analyses for each income quartile were conducted These stratified analyses showed that residence in block groups near (close closest or adjacent to) the Hudson River was associated with increased risk for CVD hospitalizations among residents of lower income block groups Conversely residence in block groups near the Hudson River was associated with decreased risk for CVD hospitalizations among residents of higher income block groups In other words in analyses that held income constant by looking only within the lower or higher income populations living near the river shows opposite effects depending upon which income group is considered The estimated increased and decreased risks are of sufficient magnitude to cancel each other out when average rate ratios for the four income quartiles are calculated The average rate ratios can be interpreted as showing no evidence that hospitalizations among the general population are elevated for residents living near the river

This type of group-level statistical analysis is not able to precisely identify the reason for these differing results However the strength and direction of associations among variables in the statistical models can be evaluated in light of known risk factors to suggest likely explanations In this case socio-economic status appears to be the most likely explanation for the findings The strong association between socio-economic status and residential location (the indicator of potential exposure) and between socio-economic status and CVD hospitalization risk makes it likely that the small but statistically significant elevations shown in the multivariate models

iii

(Tables 8a amp 8b) are the result of the inability to completely control for socio-economic statusrsquo effects on CVD hospitalization risk The strong effect of socio-economic status on residential location (exposure proxy) and on CVD hospitalization risk makes it likely that these overall multivariable regression results are affected by uncontrolled ie residual confounding by socioshyeconomic status

Because of the similarities in study design and statistical analysis the prior studyrsquos findings can be compared to some degree with the findings of the multivariate regression analyses in the current study The major difference between the studies is the composition of the comparison areas for the analyses Because of this difference as well as other differences described in the report the studies are not entirely comparable

CVD hospitalization rate ratios estimated for living in block groups adjacent closest or close to the river in the current study are substantially lower than the rate ratios for residence in ZIP codes adjacent to the Hudson River in the previously published studies In most cases the current analyses show elevations below 10 in contrast to elevations of 20 to 39 in the prior studies The results of the current study are quite similar to the prior study results in terms of the associations reported for age sex race and income These similarities and the general methodological similarities between the current and prior studies suggest the likelihood that the prior studiesrsquo findings also were affected by confounding by socio-economic status

Conclusions and Recommendations While this type of study can neither prove nor disprove causal links among risk factors and health effects it is useful for examining evidence for unusual patterns that might warrant additional investigation This evaluation of hospitalization data residential location and socio-economic factors in block groups in the 12 Counties abutting the Hudson River revealed a closely coinciding pattern of lower socio-economic status and residential proximity to the Hudson River the proxy for potential exposure and a strong association of lower socio-economic status and risk for CVD hospitalizations The strong associations among residential location socio-economic status and CVD risk prevent us from drawing definitive conclusions from the multivariable regression analyses that showed modest but statistically significant elevations of CVD hospitalization risk for residents of block groups near the river

To draw more definitive conclusions particularly for outcomes such as CVD that are determined by multiple known risk factors studies seeking to investigate the role of PCB or POP exposures will require resource-intensive methods that include gathering individual-level exposure or biomonitoring information as well as medical histories and information on other risk factors

This current investigation did not include environmental or biological sampling data One study did gather such data in the Hudson River region with the highest potential exposures (Hudson Falls Fort Edward) This study detected a statistically significant reduction in the total PCB concentrations in outdoor air samples at a distance of 1200 meters (34 mile) from the river However the PCB levels in outdoor air in this area were lower than those in other communities with known PCB-contaminated sites and similar to levels reported in other locations in the northeastern United States The study found no detectable differences in study subjectsrsquo serum

iv

PCB levels associated with proximity to contaminated portions of the river or other PCB sources (Fitzgerald 2007 Palmer 2008)

It is recommended that the results of the current investigation of CVD hospitalization rates and residence near the Hudson River be shared with NYS residents and other stakeholders with interest in this issue and be submitted for publication in a peer-reviewed journal

v

1 INTRODUCTION

The New York State Department of Health (NYS DOH) has a cooperative agreement with the Agency for Toxic Substances and Disease Registry (ATSDR) to perform environmental exposure and health assessments conduct health statistics reviews and perform epidemiological studies of populations in New York State which may have been exposed to environmental contaminants In October 2006 Member of Congress Maurice Hinchey requested that ATSDR conduct a health consultation to assess cardiovascular disease among people residing near the Hudson River A 200-mile segment of the Hudson River from Hudson Falls south to New York City is a National Priority List (NPL) (Superfund) site due to contamination of the river with polychlorinated biphenyls (PCBs) from GE sites in Hudson Falls and Ft Edward (See Appendix A for Member of Congress Hincheyrsquos letter)

In support of his request Member of Congress Hinchey cited three studies (Sergeev and Carpenter 2005 [ischemic heart disease] Shcherbatykh et al 2005 [stroke] Huang et al 2006 [hypertension]) These statewide studies suggest residents of ZIP codes containing hazardous waste sites with persistent organic pollutants (POPs) such as PCBs have elevated risk for hospitalizations for cardiovascular disease including ischemic heart disease stroke and hypertension Analyses conducted separately for ZIP codes next to the Hudson River suggest residents of these specific ZIP codes also have increased rates of hospitalization for cardiovascular disease

These studies published in 2005-2006 are ecological studies In an ecological study data are analyzed by evaluating risk factors and health outcomes among groups For example rates of disease are compared for populations of ZIP codes with varying median income levels to see if income levels appear to be related to disease outcome rates In these previously conducted studies rates of hospitalization for residents of ZIP codes with hazardous waste sites containing POPs were compared with hospitalization rates for residents of ZIP codes without such waste sites to look for evidence that potential exposures might be impacting health outcomes The studies report statistically significant elevations of cardiovascular disease hospitalizations in ZIP codes with POPs For coronary heart disease and a subset acute myocardial infarction the study showed elevations of approximately 15 and 20 respectively (Sergeev and Carpenter 2005) for stroke a 15 elevation (Shcherbatykh et al 2005) and for hypertension a 19 elevation (Huang et al 2006) in ZIP codes containing waste sites with POPs compared with hospitalization rates in other New York State ZIP codes (excluding NYC) that did not contain waste sites with POPs (Table 1)

In sub-analyses in these studies cardiovascular hospitalization rates for ZIP codes adjacent to the Hudson River were compared with cardiovascular hospitalization rates for ZIP codes in the rest of the state (excluding NYC) identified as not having inactive hazardous waste sites with POPs These sub-analyses showed statistically significantly elevated hospitalization rates of 36 and 39 for coronary heart disease and the subset acute myocardial infarction a 20 elevation for stroke and a 14 elevation for hypertension (Table 1) The Sergeev and Carpenter Shcherbatykh et al and Huang et al articles state that the Hudson River ZIP codes are in an area where people have higher income smoke less exercise more and have healthier diets This information is cited in these articles as evidence that known risk factors related to lower socioshy

1

economic status are likely not the cause for the elevated hospitalization rates in these Hudson River area ZIP codes

In these previously published reports Behavioral Risk Factor Surveillance System (BRFSS) data for entire counties are provided to support statements about the population of Hudson River ZIP codes having higher income and healthier lifestyles However because the ZIP codes adjacent to the Hudson River include only a proportion of the population of the counties abutting the river these county-level BRFSS data may be misleading regarding characteristics of the ZIP codes adjacent to the river (Figure 1) In fact data from the US Census show that the ZIP codes adjacent to the river have lower median income than the counties within which they are located The median income for the total population of the twelve counties adjacent to the Hudson River was $61010 in 2000 while the median income for the ZIP codes adjacent to the river was $45356 This median income for the population of the ZIP codes adjacent to the river is also slightly lower than the statewide (excluding NYC) median income of $47735 (US Census 2000) Thus the Hudson River ZIP codes analyzed in the Sergeev and Carpenter Shcherbatykh et al and Huang et al studies include a population of lower not higher socio-economic status compared to the rest of NYS (excluding NYC)

Because CVD and many CVD risk factors are associated with lower socio-economic status it is especially important to adjust as much as possible for socio-economic status when conducting comparative analyses of CVD hospitalizations or disease incidence or prevalence Regarding the hypothesis about POPs and CVD research studies described in more detail below have suggested possible biological mechanisms for POP exposuresrsquo effects on the heart There is very little research on human exposures to POPs and potential cardiovascular effects however

In response to Congressman Hincheyrsquos request for follow-up of the 2004-2005 findings ATSDR and NYS DOH agreed to conduct a similar study also ecological but at a different level of geography the Census block group By using a level of geography that is generally smaller in area and population than a ZIP code this follow-up seeks to more accurately adjust for socioshyeconomic and other factors that affect cardiovascular hospitalizations and to improve upon the geographic classification of relative distance to the Hudson River In addition rather than comparing hospitalizations among residents of Hudson River area ZIP codes to hospitalizations among residents of all other NYS (exclusive of NYC) ZIP codes with no waste sites with POPs this follow-up investigationrsquos analyses are restricted to the twelve counties abutting the Hudson river from Washington County south to Westchester County Within the study arearsquos twelve counties cardiovascular hospitalization rates in areas closest to the Hudson River are compared with rates in areas farther from the river This more localized approach may also assist with reducing the potential impact of regional differences in medical care practice or reporting factors that could affect the hospitalization data (Ko et al 2007) (Table 2 shows a more detailed listing of the methods used in the prior studies and follow-up study)

Regarding environmental exposures it is important to note that living near a hazardous waste site that contains PCBs or other POPs does not necessarily mean that human exposure from the site has occurred Depending upon the conditions at a site or the specific chemicals exposures may be highly improbable Information on body burden for example PCB levels in blood or on levels of the chemical in off-site air dust or soil is needed to support a claim of exposure The

2

studies published in 2005-2006 did not provide specific information such as environmental or biomonitoring data in support of the assumption that residence in ZIP codes containing hazardous waste sites with POPs or residence in ZIP codes adjacent to the Hudson River resulted in increased exposures to POPs in the population For the purpose of this follow-up we did not conduct environmental or human biological sampling to address this issue Rather this follow-up replicates the prior studiesrsquo approach with some enhancements to better control for confounding particularly confounding by socio-economic status

This descriptive epidemiologic investigation uses already existing hospitalization data from the Statewide Planning and Research Cooperative System and population data from the United States Census to evaluate comparative rates of cardiovascular hospitalizations While this type of investigation can not show proof of cause and effect it can assist in interpreting relationships among indicators of CVD burden and hypothesized exposures associated with distance from the Hudson River This descriptive investigation may also assist in determining whether additional health investigations of these associations are warranted

2 BACKGROUND

Cardiovascular Disease

The widely known risk factors for cardiovascular disease (CVD) in addition to age sex race and family history include tobacco use high blood pressure high serum cholesterol alcohol use obesity low fruit and vegetable intake physical inactivity diabetes low socio-economic status mental ill-health psychosocial stress and the use of certain medications (WHO 2010 NYS DOH 2009 Wing 1988 NYS DOH 2009a)

CVD Definitions CVD is a general term for any disease of the circulatory system The two main types of CVD are ischemic heart disease also termed coronary heart disease and cerebrovascular disease also called stroke Ischemic heart disease (IHD) results from restricted blood flow through the arteries supplying the heart muscle The most serious risk from IHD is acute myocardial infarction (AMI) another term for heart attack AMI occurs when blood flow to the heart is suddenly reduced or stopped usually due to a blockage in a coronary artery Angina is severe constricting pain in the chest often radiating from the area immediately over the heart to the left shoulder and down the arm Cerebrovascular disease refers primarily to stroke the sudden impairment of brain function resulting from interruption of circulation to the brain Ischemic stroke results from blockage of an artery supplying blood to the brain Hemorrhagic stroke occurs due to the escape of blood from a ruptured artery supplying blood to the brain

CVD Burden and Trends CVD is the leading cause of death for men and women of all ages in the United States and in New York State In 1999 for every death from IHD there were five hospitalizations for ischemic disease in NYS (NYS DOH 2009a) Mortality rates from CVD have been declining steadily since the 1980rsquos Research studies estimate that approximately half of the decline is due to improved treatments for CVD and its risk factors such as high cholesterol and high blood pressure and approximately half is due to decreasing incidence of CVD The decrease in incidence is due to trends in health behaviors primarily reduced smoking rates

3

CVD mortality rates do not appear to be declining for all segments of the population however and may actually be increasing among some groups particularly people of lower socio-economic status (Barnett et al 1999) This may be due in part to higher rates of smoking among people of lower socio-economic status as well as to other factors related to socio-economic status and race A growing literature points to multiple causes for CVD and other health disparities including health practices psychosocial stress limited resources discrimination and access to health care (Health United States 2007) Increases in the prevalence of diabetes and obesity in the general population partially offset advances in treatment and favorable trends for other risk factors An increasing ischemic heart disease mortality rate in women ages 35 to 54 since 1997 is attributed to these unfavorable risk factor trends (Ford 2007 Lloyd-Jones 2009) New York State (NYS) and other northeast states have high IHD and low stroke mortality with NYS having the highest IHD mortality and lowest stroke mortality among the 50 states (Howard et al 2009)

CVD Data Sources CVD mortality rates are more easily measured than incidence prevalence or hospitalizations because mortality data from vital records are a comprehensive and relatively accessible data source Information for estimating the incidence and prevalence of CVD outcomes comes primarily from health interview physical examination and record surveys Hospitalization data are available for NYS and the US but are generally collected and analyzed in order to track trends in hospital care and resources expended Actual CVD incidence data validated by physician diagnosis physician examination or reviews of medical records are not directly available for the entire US or for specific geographic areas of NYS Such information is available from population-based cohort studies that follow people over time in selected communities While the concern underlying this research project is to understand CVD levels in the community ie incidence and prevalence we do not have data that directly provide us with this information In the absence of such data hospitalizations are frequently used as indicators of the burden of disease in communities particularly in surveillance or hypothesis-generating investigations such as this one In NYS the Behavioral Risk Factor Surveillance System (BRFSS) an ongoing monthly telephone survey of adults aged 18 years and older provides information about the burden of CVD by age sex race income and education

CVD Risk Factor Data The BRFSS data for NYS show that over 20 of the population aged 65 and older reported having been diagnosed with CVD versus less than five percent of the population under 65 Males were slightly more likely to report having CVD than females Nonshyhispanic whites compared to other ethnic groups combined were slightly more likely to report having CVD None of these slight differences were statistically significant however The only statistically significant demographic risk factors in the BRFSS data were low income and education levels Statistically significant elevations of reported CVD were observed for respondents with incomes below $25000 and respondents with a high school degree or less education (NYS DOH 2009a)

Another factor that can affect the outcome measure hospitalizations is the likelihood of mortality prior to reaching the hospital Studies show a variety of factors affect whether an individual experiencing a heart attack or stroke outside the hospital survives to be counted in hospitalization data These factors include lack of knowledge about personal risk or prior disease socio-economic status race marital status and distance to the hospital (Galea 2007

4

Barnett 2006 Ayala 2003) Population-based studies report sudden cardiac death usually outside the hospital to be the first overt manifestation of heart disease for 40 to 60 of all cases ( Kannel 1987 deVreede-Swagemakers 1997 UK Heart Attack Study 1998) An analysis of out-of-hospital mortality from AMI in Pennsylvania in 1998 showed that in relatively rural areas where travel time to the hospital was 25 minutes or longer 72 of all AMI deaths occurred outside the hospital compared with a statewide average of 49 of AMI deaths occurring outside of the hospital (OrsquoNeill 2003) In New York State 55 of IHD deaths occur outside the hospital compared to 35 of stroke deaths (NYS DOH 2009b)

Hudson River PCB Contamination

During an approximate 30-year period ending in 1977 the General Electric (GE) plants in Hudson Falls and Fort Edward used PCBs in the manufacture of electrical capacitors and discharged large quantities of PCBs into the upper Hudson River Estimates of the total direct discharges to the river are as high as 1330000 pounds In 1983 the US EPA classified a 200shymile section of the Hudson River from Hudson Falls to New York City as a National Priority List site making it one of the largest Superfund sites in the United States In 2002 the US EPA issued a Record of Decision calling for targeted environmental dredging and removal of PCB-contaminated sediment in the upper River (USEPA 2002) The dredging began in May 2009

The Superfund environmental investigations and planned remedial actions have focused on the areas of the river near Hudson Falls and Fort Edward where the highest levels of PCBs in sediments are located The 2002 Record of Decision provides information about the human health risk assessment conducted for the Upper and Mid-Hudson and the ecological risk assessment conducted for the Upper and Lower Hudson River (USEPA 2002) Fish contamination and fish consumption are the primary concerns at the site

In 1976 fishing was banned in the upper river and in 1995 a catch and release policy replaced the ban NYS DOH continues to recommend people eat no fish from the Upper Hudson River that children under age 15 and women of childbearing age eat no fish from the entire 200 mile stretch of river below Hudson Falls and that the general population eat none of most species of fish caught between the federal dam at Troy and Catskill approximately 40 miles south of Troy The Lower Hudson remains closed to commercial fishing for striped bass and eight other species

The fishing bans and advisories were issued because ingestion of fish is the major potential pathway for exposure for the general population Many previous studies have shown relationships between consumption of fish from contaminated waters and human PCB levels Another potential pathway for exposure less frequently studied to date is inhalation of PCBs NYS DOH and ATSDR launched ldquoThe Hudson River Communities Projectrdquo in 2000 to address this issue (More information on this project is provided below)

POPS and PCBs

The category ldquopersistent organic pollutantsrdquo (POPs) includes man-made chemicals that are by definition persistent in the environment Examples of POPs are polychlorinated biphenyls

5

(PCBs) polychlorinated dibenzo-p-dioxins (PCDDs) polychorinated dibenso-furans (PCDFs) and pprsquo-diphenyldichloroethene (DDE) a breakdown product of the pesticide dichlorodiphenyltrichloroethane (DDT) These chemicals are stored in human fat tissues accumulating over an individualrsquos lifetime They also accumulate in fish and other species thereby concentrating up the food chain Levels of most POPs measured in blood serum of Americans have decreased in the last few decades An exception is polybrominated diphenyl ethers (PBDEs) levels of which have been increasing in recent years While levels of many POPs are declining a large number continue to be detectable in blood for most of the US population (USDHHS 2009)

This follow-up investigation focuses on the Hudson River where the POPs of concern are PCBs PCBs are a group of synthetic chlorinated compounds manufactured in the US through 1977 Production was halted due to concerns about the persistence of PCBs in the environment and the toxicity of PCBs shown in animal models PCBs were used primarily by the electrical industry for capacitors and transformers and were also used in hydraulic fluids fluorescent light fixtures flame retardants inks adhesives carbonless copy paper paints pesticide extenders plasticizers and wire insulators PCBs can be oily liquids or solids and some are volatile and can exist as vapor in air PCBs in the environment are generally mixtures of the many different types known as congeners (ATSDR 2000)

The variety of types of PCBs adds to the complexity of studying adverse animal and human health effects Increasingly PCB research focuses on specific PCB congeners and groups of congeners because specific types of congeners appear to have different mechanisms of action and toxicity Dioxin-like PCBs (DLPCBs) are an important subset of PCBs due to concerns about their toxicity The focus on dioxin-like PCBs is paralleled by increasing research on the group of POPs with dioxin-like properties This type of PCB and POP research assigns toxic potency values relative to dioxin to various PCBs and POPs to provide an overall toxic equivalency value to specific PCB and other POPs

From 2002 through 2007 NYS DOH and SUNY Albany School of Public Health researchers conducted the Hudson River Communities Study among residents of Hudson Falls and Fort Edward the villages where the PCB discharges from two GE electrical capacitor plants occurred The comparison community was upriver from the discharges in Glens Falls This studyrsquos evaluation of exposures included measuring PCBs in outdoor air and in blood serum The study found that PCB levels in outdoor air were statistically significantly higher in the Hudson FallsFort Edward area (072 nanograms per cubic meter) than in the upriver Glens Falls area (040 nanograms per cubic meter) but that the PCB concentrations in the Hudson FallsFort Edward area were lower than those in other communities with known PCB-contaminated sites and similar to levels reported in other locations in the northeastern United States A statistically significant reduction in the total PCB concentrations in outdoor air samples was observed at a distance of 1200 meters (34 mile) from the river from 0760 ngm3 to 0497 ngm3 (Fitzgerald et al 2007 Palmer et al 2008)

The average PCB concentrations in serum for the study and comparison populations did not differ significantly (307 ppb wet weight versus 323 ppb) The average toxic equivalency quantities (TEQ) for dioxin-like PCB congeners for the study and comparison area populations

6

also did not differ In addition the study found no detectable differences in study subjectsrsquo serum PCB levels associated with proximity to contaminated portions of the river or other PCB sources The study did find that lifetime consumption of Hudson River fish was associated with higher serum PCB levels (Fitzgerald 2007 Palmer 2008) Another recent biomonitoring study of Brooklyn Manhattan and New Jersey fisherman who consumed versus those who did not consume their catch from the lower Hudson River and New YorkNew Jersey harbor suggests eating fish from these waters is not associated with an increased body burden of PCBs or most other organochlorines This study did not address the inhalation pathway (Morland 2008)

PCBs and Cardiovascular Disease Outcomes

Research studies on adverse health effects associated with PCB exposure have focused on many types of outcomes organs and biological systems including cancer liver reproductive and developmental effects thyroid endocrine immune system and neurological effects as well as specific effects on the skin and eyes Relatively little research has focused to date on associations between PCB exposures and CVD outcomes The majority of human studies have assessed health effects in occupational groups having relatively high-level exposures among people consuming fish from contaminated waters or people exposed to relatively high levels due to accidental contamination of food (USDHHS NIH 2010)

PCBs have been shown to produce a variety of toxic effects in animals These effects include cancer allergies hypersensitivity damage to the central and peripheral nervous systems reproductive disorders and disruption of the immune system (ATSDR 2000) In contrast research to date does not tell us whether specific types of human health effects have occurred from the low-level exposures experienced by the general population Some human epidemiological studies of relatively low levels of PCDDs PCDFs and PCBs suggest possible relationships not only with neurodevelopment of infants but also diabetes and a condition called insulin resistance conditions that increase CVD risk (Longnecker 2001 Carpenter 2006 Arisawa 2005)

In 1998 a study of a population in Seveso Italy exposed to relatively pure TCDD (dioxin) from an industrial accident showed increased mortality from CVD respiratory diseases and diabetes (Bertazzi 1998) These findings suggested the possibility that other dioxin-like chemicals such as dioxin-like PCBs might increase risk for these diseases Some more recent animal studies described below suggest possible mechanisms of action for PCB effects that increase risk for CVD diabetes and hypertension Another relevant source of information is from research using the biological monitoring data from the National Health and Nutrition Examination Survey (NHANES) project to look at the association between PCB levels in blood and self-reported diabetes and heart disease The body of evidence based on analytical human epidemiological studies remains relatively sparse however

Some recent studies using NHANES biomonitoring data show associations between PCB levels and CVD (Lee 2006 Lee 2007 Ha 2007 Everett 2008) These studies are cross-sectional (comparisons at one point in time) evaluations of PCB levels measured in blood samples from individuals and self-reported CVD and diabetes a risk factor for CVD While these studies use individual-level biomonitoring and self-reported health outcome data they are considered to be

7

relatively weak in terms of providing evidence of causality because both exposure and outcome are measured at the same point in time Given the nature of NHANES data collection and reporting practices these studies are not able to control for possible geographical confounding which may occur if urban versus rural residence is associated with higher PCB levels and higher risk for CVD for example The higher PCB levels among African Americans in these data suggest urban residence may be playing a role in the PCB data

The results of animal studies suggest multiple mechanisms by which PCB exposures could affect cardiovascular risk as well as multiple types of adverse health outcomes One hypothesized mechanisms of action specifically for dioxin-like PCBs is that they may have anti-estrogenic effects which impact CVD risk factors such as serum cholesterol levels and blood pressure (Lind 2004) Several studies suggest PCBs interact with a specific receptor (AhR) and increase cellular oxidative stress causing inflammation cell injury and cardiovascular dysfunction (Hennig 2002) Similarly some studies suggest PCB-induced inflammation of fat cells leads to obesity and increased risk for coronary artery disease (atherosclerosis) (Arsenescu 2008)

3 FOLLOW-UP STUDY DATA SOURCES AND METHODS

This follow-up investigation will provide perspective on the 2004-2005 findings (Sergeev and Carpenter 2005 Shcherbatykh et al 2005) by producing results from analyses conducted at a more detailed level of geography the block group rather than ZIP code in order to better control for confounding When the term confounding is used by epidemioogists it refers to the problem that can arise when there is a factor such as income level that is not the factor of interest in the study but that is related to the risk factor being studied (residential location ndash proxy for exposure) and to the disease being studied (CVD) This type of factor is labeled a ldquoconfounderrdquo because it can confound ie interfere with or confuse the evaluation of associations among the variables of interest in the data All types of epidemiological studies face issues of confounding but ecological ie group-level analyses face additional challenges related to potential confounding because analyses can not account for some important individual differences within the groups that are the units of analysis (Morganstern 1998b Hertz-Picciotto 1998) This investigation was able to control for each individualrsquos age sex and race but not for individual-level risk factors such as income or education level

The follow-up investigationrsquos specific objectives are to

improve the ability to control for confounding by socio-economic status through the use of block groups rather than ZIP codes as the unit of analysis

compare the hospitalization rates among residents of block groups nearest the Hudson River to those among residents of block groups farther from the Hudson River

compare the findings of this follow-up study to those of the earlier studies

Study Area and Exposure Areas

The overall study area is comprised of the 12 counties with geographic areas abutting the Hudson River south of the source of PCB contamination from the General Electric facility in Hudson Falls The 12 counties are Westchester Rockland Putnam Orange Dutchess Ulster

8

Columbia Greene Rensselaer Albany Washington and Saratoga US Census boundaries are used to define the study area because census population estimates provide the data for the study area populationrsquos size and the populationrsquos age sex and race distribution Because the study period spanned 1990 through 2005 and populations change over time 1990 and 2000 Census data were used to linearly interpolate the between-year populations from 1991 through 1999 for each block group 2000 data were used without extrapolation to estimate block group populations from 2001 through 2005

Because block group boundaries change between the Census years 1990 block populations were normalized to fit 2000 block group boundaries before the estimation of sex age and race group specific populations In the digital map 2000 block group boundaries were overlaid on the 1990 block centroids to assign 1990 block population counts to the corresponding 2000 block group boundaries Then 1990 block populations were summed to fit the block group boundaries Next population counts by age group (0-14 15-24 25-34 35-44 45-54 55-64 65-74 75+) sex (male female) and race (white black asian other) were estimated for each year at the block group level This investigation includes all ages over 25 years and the study population is divided into four race categories white black or African-American Asian and ldquoOtherrdquo The Pacific Island group was combined with the Asian group and American Indian ldquootherrdquo and ldquomultiple racesrdquo categories are combined into a larger ldquootherrdquo category

Classification of the population into potential exposure categories for this study is based exclusively on location relative to the Hudson River Two types of exposure proxy categories were developed block group adjacency and block group centroid distance to the river shore For the adjacency categories block groups were assigned to the adjacent category if they directly abutted the Hudson River and all other block groups in the 12 counties were assigned to the nonshyadjacent category

For the distance categories a central point (centroid) within each block group was identified and used to assign block group centroid distance to the river shore The block group centroids were population-weighted which means the geographic distribution of households within 2000 census blocks were used to locate block group centroids based on the geographic population distribution By locating a population-weighted block centroid rather than a more straightforward distance centroid the assignment of block group distance to the river shore more accurately reflects distance to the river for the majority of block group residents Buffers of one-half mile and one mile from the Hudson River boundary were created for the assignment of each block group into one of the three distance categories less than frac12 mile (closest) frac12 to one mile (close) and farther than one mile from the river (distant)

Distance categories were selected a priori at the start of the study Because environmental sampling data were not collected along the length of the river included in this investigation we are not able to determine whether these distance categories represent actual differences in exposures As discussed previously a statistically significant reduction in the total PCB concentrations in outdoor air samples was observed at a distance of 1200 meters (34 mile) from the river from 0760 ngm3 to 0497 ngm3 in a study conducted in the Hudson Falls Fort Edward area near the source of PCBs (Fitzgerald 2007 Palmer 2007)

9

For the adjacency categories of the 2371 block groups in the study area 255 (11) were classified as adjacent to the Hudson River and 2116 (89) were classified as non-adjacent (Figure 2) For the distance categories 225 (9) were classified as closest to the river 216 (9) as close and 1930 block groups (81) as distant from the river

Comparison of Figures 1 and 2 show that block groups include smaller geographic areas than ZIP codes and that block groups are relatively smaller in area in urban versus rural areas The study area includes a total of 368 ZIP codes compared to a total of 2371 block groups ZIP code average population in the study area is approximately 8000 compared to block group average population of 1200 Comparison of Figures 2 and 3 shows that using population-weighted centroid distance to the river produces different exposure areas than adjacency Fifty-three (20) of the adjacent block groups were categorized for the distance indicator as distant because the population in the block group was concentrated in the area farther than one mile from the river On the other hand about 11 of the 2116 non-adjacent block groups were classified as closest (77) or close (162)

Hospitalization Data

Hospitalizations for ischemic heart disease (IHD) and cerebrovascular disease (stroke) were included in this study More specifically hospital discharge records that listed as the principal diagnosis either IHD (ICD-9-CM 410-414 and 4297) or cerebrovascular disease (ICD-9-CM 430-436) and subsets of each of these categories similar to the subsets evaluated in the Sergeev et al and Shcherbatykh studies were included The subsets are AMI and angina subsets of IHD and ischemic and hemorrhagic stroke which are subsets of cerebrovascular disease (stroke) (Principal diagnosis represents the illness for which the person was admitted to the hospital) Angina is an outcome not evaluated in the prior studies It was added here for comparative purposes because it represents an outcome appropriately addressed by primary outpatient rather than inpatient hospital care Angina is one of a group of outcomes included in the recently developed national ldquoPrevention Quality Indicatorsrdquo (USDHHS AHRQ 2007) as an indicator for geographic areas or populations for whom primary care is inadequate Table 3 lists the specific ICD-9-CM codes and definitions included in this investigation

While one of the previously published studies (Huang et al 2006) showed elevations in hypertension (high blood pressure) rates this outcome is rarely listed as the principal reason for hospitalization As a secondary diagnosis however hypertension is extremely common with substantial overlap with the primary diagnoses of IHD and stroke In addition hypertension hospitalizations were not as elevated in the ZIP codes adjacent to the Hudson River as the IHD and stroke outcomes As a practical matter geocoding the hypertension hospitalization data (1364000 hospitalizations) would overwhelm the resources for this project For these reasons hypertension was not included in this follow-up investigation

The source of the hospitalization data was the NYS DOH Statewide Planning and Research Cooperative System (SPARCS) established in 1979 to collect detailed records on discharges from hospitals located in New York State Persons seen in the emergency department but not admitted are not included in the SPARCS data Hospitalizations at US Veterans Administration

10

Hospitals are also not included in this dataset We identified the hospital transfers in the data and excluded them so that individuals are only counted once for the same event

The NYS DOH Data Protection Review Board for SPARCS hospitalization data approved our use of hospitalization records for this investigation and provided approximately 460000 records with a principal diagnosis of either IHD or stroke Because of the large number of cardiovascular hospitalizations and this studyrsquos requirement for geocoding each hospitalized personrsquos address we selected a 50 random sample of the hospital discharges with principal diagnoses of IHD or stroke for inclusion in the study (229948 records) Approximately one percent of these records (2737) were eliminated from the study because they contained no residential address resulting in 227211 hospitalization records 24939 of these records were eliminated after geocoding because they were not within the 12 county study area This left 202272 records remaining in the study

For geocoding to the block group level the hospitalization street addresses were modified as needed using US Postal Service standards The addresses were assigned geographic coordinates using commercially available geocoding software (MapMarker 2004) A combination of land parcel data and street centerline files were used to assign geographic coordinates No contact was made with cases parents legal guardians or next of kin of cases to determine residential locations

Once geographic coordinates were assigned to cases through address-matching the case locations were overlaid onto digital maps of the study area using a geographic information system so that the number of observed cases falling within the study area boundaries could be determined and a 2000 Census block group could be assigned to each hospitalization In order to protect confidentiality no maps of individual case locations are provided Geographic coordinates were not able to be assigned for some addresses for example rural routes or post office boxes Internet searches were conducted for addresses that were located in facilities such as nursing homes and trailer parks

While fewer than one percent of records were missing sex or age information approximately 15000 records seven percent were missing race information Evaluation of the geographic and temporal distribution of these records showed they were not randomly geographically distributed 47 of the records with no race information were from one hospital for that hospital race was missing from 100 of the records in all years except one after 1997 mid-way through the study time-frame Evaluation of other records missing race showed similar patterns of 100 of records missing race in specific years In order to prevent biased hospitalization rate estimates records missing race were assigned race randomly assuming a race distribution equal to that of the individualrsquos residential block group

Statistical Analysis

This investigation used negative binomial multivariable regression models the statistical methods used in the previous studies (Sergeev and Carpenter 2005 [IHD] Shcherbatykh et al 2005 [stroke]) The follow-up studyrsquos analyses compared hospitalization rates for populations in block groups adjacent or in close proximity to the Hudson River to rates for populations more

11

distant from the Hudson River while adjusting for the potential influence of other factors Sex age and race-specific hospitalization count data comprise the dependent variable and the statistical methods attempt to predict the counts using the sex age and race distributions from Census data as well as block group-level demographics such as median income Models were first estimated using basic Census information for age sex and race Then more fully adjusted models were estimated using information at the block group level for income education population density Hispanic ethnicity and distance to the nearest hospital

More specifically each hospitalized personrsquos block group was used to assign quartile categories of block group median household income of population with less than a college education population density of the population identified as Hispanic and distance from the block group centroid to the nearest hospital These variables were selected in advance The population density variable was selected to account for potential effects on hospitalization rates of large differences in population density from the relatively rural northern counties to the more urban southern counties in the study area of the population identified as Hispanic was included because Hispanic ethnicity may be associated with CVD risk but Hispanic ethnicity was not available in the hospitalization records Distance to the hospital was included because studies show that shorter distances increase the likelihood that individuals with AMI and stroke survive to be admitted to the hospital

As described previously the literature about risk factors for CVD shows strong associations between lower income and education and higher CVD risk Regarding the effects of income and education on hospitalization rates the literature from analytic studies specifically of hospitalizations for CVD is quite sparse Most analytic studies focus on prevalence incidence or mortality in population-based studies or cohort studies and there are not many of these An example of a study that was useful as background is a study of income level and CVD mortality that showed strong associations of lower income and higher CVD mortality for men and women with the relationship for men being linear throughout the range of income levels while for women the relationship was only strong at levels of income below the median (Rehkopf 2008)

This follow-up studyrsquos approach was to use the same set of variables for the more fully adjusted models using quartile cut points for all the sexrace groups and types of CVD diagnostic subsets The 2000 Census data were used to create the population-weighted quartile categories to be used in the statistical analyses The quartile cut points are shown below

population density(persons per square mile)

median 1999 household income

with less than a college education

with Hispanic ethnicity

Hospital distance

Quartile 1 61 - lt507 $2499-lt$40968 0 - lt31 0 - lt2 lt25 (median) Quartile 2 507 - lt102 $40968 - lt54650 31 - lt43 2 - lt5 25 - lt6 (50-75) Quartile 3 102 - lt6410 $54650 - lt72944 43 - lt55 5 - lt10 6 ndash 11 (75-90) Quartile 4 6410 - 94372 $72944 - 200001 55 - 100 10 - 86 11 ndash 45 (90-100)

SAS version 9 provided the statistical programs used to estimate hospitalization rates for exposure proxy categories and demographic categories The models provide ldquocontrast

12

estimatesrdquo ie rate ratios for one category versus a reference category of a variable and 95 confidence intervals (CI) for these rate ratios For the location categories rate ratios for the areas adjacent are compared to areas not adjacent to the river (reference) and areas closest and close to the river are compared to areas distant from the river (reference) If a rate ratio is greater than 100 the model has estimated an elevation of hospitalizations in that category of the study population compared to the reference population If the rate ratio is less than 100 then there are fewer than expected hospitalizations compared to the reference population

The magnitude of the elevation or deficit is also estimated by the rate ratio For instance if the hospitalization rate is twice as high in the adjacent population it would result in a rate ratio of 200 while a 50 higher rate would result in a rate ratio of 150 If the hospitalization rate in the comparison group was only half as big as in the reference group this would result in a rate ratio of 050 The 95 confidence interval assists with determining whether the observed differences are statistically meaningful ie statistically significant The confidence interval provides the range in which there is a 95 probability of including the true rate ratio

In addition to directly assess potential confounding we examined the associations between income and exposure proxy categories and between income and CVD hospitalization risk Because the modeling process provides rate ratios that provide comparisons among groups but not the rates themselves we also calculated the rates per population for each sex race and income category in order to directly observe the relative magnitudes of CVD hospitalization rates Finally analyses stratified by income quartile were also conducted in order to assess the potential role of confounding by income The stratified analyses consist of negative binomial regressions conducted separately for each quartile of income

4 RESULTS

Demographics Associations with Proximity to the River

Study Population Table 4 provides detailed demographic characteristics for the total study population and the populations of the adjacent versus non-adjacent and nearer versus farther from the river block groups from the 2000 Census The total population of the study area for this one year of the study time period is 2924631 persons The first row of Table 4 shows that approximately 11 percent of the study area population is classified as living in block groups adjacent to the Hudson River and 89 live in non-adjacent block groups Approximately 8 of the studyrsquos population lives in the closest block groups approximately 10 lives in close block groups and approximately 82 in distant block groups The table also shows that approximately 81 of the total study population is identified as white 9 black 3 Asian less than 01 Pacific Islander 02 American Indian 4 ldquootherrdquo and 2 ldquomultiple racesrdquo

The 2000 Census data (Table 4) also show that the populations in block groups adjacent and nonshyadjacent to the river are similar in terms of the percent of minority population but the adjacent block groups include a smaller percentage of persons who have attended college and a higher percentage of households below the poverty level Median income is $47320 for block groups adjacent versus $62668 for block groups not adjacent to the river Data for the study area from the 1990 Census show similar distributions (data not shown)

13

For the distance to the river categories there is a gradient for the three levels of distance from the river with the closest block groups on average having higher percentages of minorities smaller percentages of persons who have attended college and higher percentages of households below the poverty level (Table 4) The populations of the close and closest block groups are more similar to each other than to the populations of the distant block groups The population of the closest block groups is estimated to be 19 black close block groups 14 black and distant block groups 8 black The proportions for the ldquootherrdquo category used in the analyses which includes multiple races and American Indians are approximately 12 in both the closest block groups and in the close block groups compared to 5 in the block groups most distant from the river Median income for the closest block groups is $40748 close block groups $44962 and most distant block groups $64847

The bar chart below illustrates the correlations between US Census 2000 indicators of block group socio-economic status and block group centroid (population weighted) distance from the river for the study population

2000 Census Demographics for Study Population by Distance to River Category

black other hispanic no college below poverty

of pop ulati on

0

10

20

30

40

50

60

lt12 mile

12 to 1 mile

gt1 mile

Table 5 shows selected demographic information for the person-year estimates used in the study analyses for the 1990-2005 study period Person-year data represent individuals from Census estimates counted repeatedly for each year of the study timeframe As described in the Methods section the estimation of person years used interpolation to account for population changes from 1990-2000 and techniques for taking account of block boundary changes The person-years estimates for demographic categories (complete data not shown) show similar patterns as the 1990 Census data (data not shown) and 2000 Census data (Table 5) Table 5 for example shows nearly identical percentages of population in each of the river distance categories as Table 4

Table 5 also shows person year information for income quartiles and exposure proxy categories as well as race and exposure proxy categories that is consistent with the Table 4 data for the

14

2000 population Table 5 row percentages show for example that while approximately 11 of the total study population live in adjacent block groups approximately 19 of the population in the lowest income quartile live in adjacent block groups By design approximately 25 of the total study population is in each of the income quartiles Table 5 column percentages show however that in the adjacent block groups 41 of the population is in the lowest income quartile and only 9 are in the highest income quartile

For the block groups within frac12 mile of the river the distribution among income quartiles is even more skewed towards the lowest income quartile 61 of the population living within frac12 mile of the river is in the lowest income quartile and only 6 is in the highest income quartile Given that approximately 25 of the total study population is within each income quartile these percentages indicate that living within the block groups closest to the river is associated with more than a doubling of the likelihood of being in a block group in the lowest socio-economic status quartile 436 of those living in the block groups close to the river (12 to one mile from the river) are in block groups in the lowest income quartile While less than a doubling this indicates that living in the close block groups is associated with a 75 increase in the likelihood of being in the lowest quartile for socio-economic status

Hospitalizations Table 6 shows the process of removing hospitalization records from the study for a variety of reasons The first column includes the 202272 records that contained a residential address and were located within the 12-county study area as described in the Methods section The next five columns show the records remaining after data cleaning exclusion of transfers elimination of records with missing covariates and exclusions due to poor geocoding results Approximately 33000 records were eliminated because they were transfers from one hospital to another for the same event For approximately 1100 records geocoding was inadequate for assigning block group for imputing race as described in the Methods section 336 records were missing sex (11) or age (325) Of the 167733 records remaining 154220 (92) were able to be adequately geocoded to a street address for assignment in a specific block group While approximately eight percent of hospitalizations were eliminated due to inadequate geocoding Table 6 shows that the percentages of hospitalization records for white black Asian and ldquootherrdquo categories and the distribution among diagnostic categories change very little after these records were dropped

The final column of Table 6 shows that of the 154220 hospitalizations in the analysis 101012 IHD hospitalizations comprise the largest health outcome category and are 65 of the hospitalizations in the study AMI hospitalizations (35811) account for 35 of the IHD hospitalizations and angina (4731) accounts for five percent of the IHD hospitalizations Stroke hospitalizations (53208) comprise 34 of the studyrsquos hospitalizations Approximately 36 of the stroke hospitalizations are for ischemic stroke (19371) and approximately 11 are for hemorrhagic stroke (5995)

Table 7 shows the distributions of hospitalizations among the four income quartiles While each income quartile (by definition) contains approximately 25 of the study population for all types of hospitalizations we see that the lowest income quartile includes more than 25 of hospitalizations from a minimum of 286 (IHD) to a maximum of 356 of hospitalizations (angina) The highest income quartile includes fewer than 25 of hospitalizations for each CVD

15

category ranging from 182 for angina to 224 for hemorrhagic stroke These deviations from the 25 quartile distribution show a strong linear pattern with the largest percentage of hospitalizations coming from residents of block groups in the lowest income quartile and the smallest percentages coming from residents of the most wealthy block groups For AMI the lowest income quartilersquos share of hospitalizations is 50 higher than the highest income quartilersquos share of hospitalizations (30 vs 20) IHD and stroke show similar differences and angina shows a doubling with the lowest income quartile contributing 36 of hospitalizations compared to the highest income quartilersquos 18 share of hospitalizations

Table 7 also shows the distribution of hospitalizations among the race categories and the study population race category distribution for comparison Blacks show lower percentages of IHD and AMI than their population percentages and higher percentages of angina and stroke While full presentation of race and sex findings is beyond the scope of this report the lower than expected percentages of IHD and AMI among blacks who on average have lower socioshyeconomic status and are at greater risk for CVD hospitalizations raised questions that were addressed by calculating hospitalization rates per 1000 population for each sex and race group for each income quartile (Appendix B Table 2) Figures 4 5 and 6 show these rates for the white black and Asian categories (These rates for the ldquootherrdquo category are comparatively very high and are not shown on these graphs This issue is addressed in the Discussion section) Figures 4 5 and 6 show the consistent and strong effects of income quartile on hospitalization rates for AMI and stroke for whites blacks and Asians of both sexes The other outcomes show similar patterns (The ldquootherrdquo group showed an opposite pattern [See Discussion Section])

Demographics Summary

The strong associations of lower income with residence closer to the river (Table 5) and lower income with higher hospitalization rates (Tables 7 amp 8) create a difficult challenge for this investigation The multivariable analyses of hospitalization rates attempt to control for socioshyeconomic differences However because these income differences which strongly affect hospitalization rates are also strongly correlated with the gradations of exposure (river distance) it will be difficult to disentangle the influence of socio-economic status from the influence of potential exposures

Statistical Analyses Proximity to the River and CVD Hospitalization Risk

All the variables described above were included in the multivariable regression models and almost every quartile category for every variable showed statistically significant and expected types of effects for all the CVD hospitalization categories (Appendix B Tables 1a and 1b) The following presentation of results focuses on the rate ratios for the exposure proxy variables Table 8a (IHD) and 8b (stroke) show the regression modeling results Results are shown first for models that include only age sex and race in addition to the exposure proxy categories and second for models that also include median income education population density Hispanic ethnicity and distance to nearest hospital The tables provide numbers of hospitalizations and person-years to assist with interpretation For example for total IHD we can see that while 108 of the study population lives in adjacent block groups 122 of the hospitalizations are among people in adjacent block groups This difference of 14 percentage points is the basis for

16

the finding of a statistically significantly elevated rate ratio (risk) for IHD among people living in the adjacent block groups (RR=133 CI 112-157)

Looking across the columns of Tables 8a and 8b we can see the rate ratios for each of the outcomes for the adjacent versus non-adjacent block groups and the results are similar to those for IHD The elevated rate ratios range from 123 for angina to 136 for AMI The only rate ratio not elevated in these first models adjusting only for age sex and race is for hemorrhagic stroke Going down the rows of the Table the next set of rate ratios are for the distance to the river categories The findings are very similar to those for the adjacent block groups Here however we have two exposure proxy categories allowing comparison of the two These rate ratios show no evidence that living in the block groups closest to the river (lt12 mile to the river) produces greater risk than living in the block groups close to the river (12 to 1 mile from the river)

The next set of rate ratios are for models with additional adjustment for the demographics of each hospitalized personrsquos block group as well as distance to the nearest hospital As expected adjustment for block group median income and other demographics accounts for some of the variability in hospitalization counts and thereby reduces the magnitude of the rate ratios estimated for the river adjacency and distance categories The rate ratio elevation for total IHD for example is reduced from 133 to 107 (RR=107 CI 104-111) The elevated rate ratios in these more fully adjusted models for the adjacency analyses are of similarly small magnitude ranging from 106 for AMI to 110 for ischemic stroke Four of the six outcomes show statistically significant elevations Again only hemorrhagic stroke shows no evidence of an elevated rate ratio

For the river distance categories none of the rate ratios show any evidence that living in the block groups closest to the river (lt12 mile to the river) produces greater risk than living in the block groups close to the river (12 to 1 mile) Of the six types of outcomes only two show statistically significantly elevated rate ratios for living in the closest block groups (total stroke and ischemic stroke) For the close block groups four out of six outcomes show statistically significantly elevated rate ratios The statistically significantly elevated rate ratios vary from 106 (AMI stroke) to 113 (angina)

To address the strong evidence described earlier for confounding in these data ie the association of lower income with both CVD hospitalizations and residence closer to the river we also conducted the analyses separately for each quartile of income These results are presented in Table 9 The first results are from analyses adjusting only for age sex and race and the second set of results are from models including the additional demographic factors For simplicity rate ratios are shown only for block groups closest to the river compared as in the other models to block groups farther than one mile from the river (Results were similar for the other distance categories (data not shown)

For the two lowest income quartiles Table 9 shows results similar to those for the study population as a whole (Table 8a 8b) However for the two highest income quartiles which represent frac12 of the study population Table 9 shows a consistent pattern of reduced rate ratios for residence in the closest block groups within frac12 mile of the river In the highest income quartile

17

these reduced rate ratios are statistically significant for almost all of the outcomes in both the partially and more fully adjusted models In addition the magnitude of the rate ratio estimates ie their departure from 100 is relatively high For the population living in block groups with the highest median income (income gt$73000) living within frac12 mile of the river is associated with having about 70 of the risk for IHD AMI and stroke (CI=068 069 069) as the population in equally high income block groups farther than one mile from the river In other words among the highest income quartile living this close to the river is associated with an estimated 30 reduction in risk for these types of hospitalizations For angina (CI=028) there is an estimated 70 reduction in risk and for ischemic stroke (CI=058) the reduction is estimated to be 40 These stratified analyses show that in one-half of the study population the two higher income quartiles there is no evidence of elevated hospitalization risk for populations in block groups closer to the river On the contrary reduced rate ratios are shown for all outcomes in the higher income quartiles for residents of block groups within frac12 mile of the river compared to those living farther than one mile from the river

Weighted average rate ratio estimates are a standard method for estimating a general population finding from stratified analyses and are also provided in Table 9 These average rate ratios range from 078 to 102 showing no evidence of elevated risk for the overall population associated with residence within frac12 mile of the river compared to living farther than one mile from the river Results for the other categories of river proximity showed similar results (data not shown) (Given the evidence of confounding interfering with analyses of the entire study population and the opposing findings depending on income quartile no confidence intervals are provided here for the overall population-weighted average estimates)

Summary of Results

In summary the multivariable regression results for the entire study population (Tables 8a 8b) show statistically significantly elevated rate ratio estimates for the adjacent versus non-adjacent block groups for four out of six hospitalization diagnosis categories The analyses also produce statistically significantly elevated rate ratio estimates for block groups closest to the river (less than frac12 mile from the river) and close to the river (between 12 to 1 mile from the river) compared to distant block groups (farther than 1 mile from the river) for most of the outcome categories However the findings provide no evidence that the closest block groups have higher hospitalization rates than the close block groups

The most elevated rate ratio estimate is 113 (angina) indicating an estimated 13 percent increase in risk for angina hospitalizations for residents of close versus distant block groups The only outcome with no statistically significant elevations for living near (adjacent closest or close) the river in the more fully adjusted models is hemorrhagic stroke It is noteworthy that the outcome that is considered an indicator of inadequate primary care rather than of incidence or prevalence angina hospitalizations shows the highest elevations associated with residence near the river (Table 8a) In addition the hospitalization outcome with the least association with socioshyeconomic status in our data (Appendix B Table 1b) and the lowest out-of-hospital mortality (NYS DOH2009b) hemorrhagic stroke is the outcome that shows no pattern of elevations associated with residence near the river (Table 8b) This pattern of findings is consistent with the interpretation that the rate ratios for living closer to the river are reflecting the influence of

18

lower socio-economic status and other known risk factors for hospitalizations for CVD rather than any unusual environmental exposures in areas near the river

The stratified results add strong support to this conclusion They suggest that classical confounding of the exposure-disease relationship by socio-economic status produced biased results in the overall multivariable regression analyses While the regression analysis of the entire study population shows elevated hospitalization rates for residents of block groups near the river compared to residents living farther away (Table 8a and 8b) the stratified analyses (Table 9) show this association applies only to the 50 of residents living in block groups with lower median incomes For the other 50 of residents those who live in block groups with higher median incomes hospitalization rate ratios are reduced for those living near the river Classical confounding refers to the type of confounding that can occur in individual-level or group-level (ecological) studies when a factor such as socio-economic status affects both the exposure (residential location) and the health effect In this study of CVD hospitalizations there are disproportionate numbers of outcomes and exposures (residence near the river) among specific levels of the confounder the lower income quartiles (Tables 4 5 7) The disproportionate numbers of hospitalizations and potential exposures (block groups close to the river) among the lower income quartiles create an artifactual exaggerated ie biased effect on the estimates for the overall analysis (Greenland 1999 Greenland 2003)

5 DISCUSSION

The key finding of this investigation is that the results of the multivariable regression modeling for the entire study population the elevated CVD hospitalization rates for populations living closer to the Hudson River are likely to be artifacts of classical confounding Stratifying on the confounding variable ie holding median income level constant the standard method for controlling for confounding showed the decisive role of income as a confounding variable The associations between living closer to the river and having higher hospitalization rates were evident only for people in the two lowest income quartiles For people in the two highest income quartiles the association was completely opposite ie people living closer to the river had lower rates of hospitalizations This variability of effects can be interpreted as evidence that there are no actual effects of the exposure-proxy categories on hospitalization rates

This type of classical confounding due to the strong influence of a variable that precedes causes or strongly affects both the exposure and the health outcome can occur in any type of epidemiological study The ecologic ie group-level analysis in this study is not the primary cause of the confounding Rather the complex associations in the population among income education CVD risk geographic residential patterns and geographic assignment of exposure status are the basis for the confounding However the studyrsquos group-level design contributes to the studyrsquos inability to completely control for confounding using multivariable regression methods and contributes to difficulties of interpreting the study findings

The stratified results shown in Table 9 may also be a reflection of inadequate control for socioshyeconomic status using quartile groupings A possible explanation for the Table 9 findings is that among lower income quartile populations people living in closest proximity to the river may have the lowest incomes while in higher income quartile populations people living in closest

19

proximity to the river may have the highest incomes This potential income difference associated with river distance within quartile groupings would explain why the stratified analyses showed positive associations with hospitalization risk for lower income residents closest to the river and negative associations with hospital risk for higher income residents closest to the river In this example of a possible explanation it is assumed that river proximity categories are capturing relative income (and hospitalization risk) differences rather than exposure Because the exposure proxy river distance categories are assigned at the group level this type of analysis is not able to distinguish whether these categories are capturing income rather than potential exposure in the analyses

When trying to interpret this studyrsquos findings it is important to keep in mind the many factors associated with CVD risk that are not controlled for in this studyrsquos analyses These include tobacco use high blood pressure high serum cholesterol alcohol use obesity low fruit and vegetable intake physical inactivity diabetes mental ill-health psychosocial stress and the use of certain medications Another factor that is not known to be associated with CVD risk but that is relevant for PCB exposure is consumption of contaminated fish It is possible that lower income populations are more exposed to PCBs because they are more likely to eat fish from the Hudson River This however would not help explain why higher income people near the river showed lowered CVD hospitalization risk than their counterparts living farther from the river

In addition to the confounding due to socio-economic statusrsquos effects on both the risk of CVD and residential location (the exposure proxy) there is the possibility that another pattern of effects also associated with socio-economic status called effect modification plays a role in these data Effect modification occurs when different groups people with differing types of health insurance or differing behaviors for example show different patterns or levels of health effects associated with the same exposures The differing effects (CVD hospitalization risks) of residence in block groups closest to the river by income quartile could be interpreted as evidence of effect modification (Table 9) However the magnitude and the change in direction of effects is unlikely to be due solely to differing biological or behavioral factors associated with income level that affect the results of exposures from living near the river Given the strong associations of socio-economic status and CVD hospitalization risk it is more likely that the analyses stratified by income enhance protection against bias compared to the multivariable models and that the stratified analyses represent findings from better control for confounding as opposed to effect modification

Effect modification is a concept used to describe important relationships between exposures and effects (that may differ among sub-groups for example) that the researcher wants to identify and understand Confounding on the other hand refers to associations among the variables that the researcher is trying to control so that the effects of interest (effects from exposures rather than age or sex for example) can be identified and the strength of the effects can be estimated The ecological design of this study contributes to the difficulty of interpreting whether the study shows evidence of effect modification as opposed to confounding (Morgenstern 1998b Hertz-Picciotto 1998)

Another limitation associated with this studyrsquos design is that block-group level socio-economic and environmental attributes that are assigned to individuals in this type of study can be more

20

strongly correlated at the group level than at the individual level In this way the ecological assignment of variables may lead to increasing problems of confounding because of strong associations among indicators of socio-economic status and environmental exposure categories (Morganstern 1998a) This problem is likely to have affected this studyrsquos results As with effect modification by group described above this type of confounding by group cannot be evaluated with ecologic data because individual-level information is needed to check for this type of ldquocross-levelrdquo bias

Ecological analyses are an important tool for environmental epidemiology and there is therefore a large and growing literature on their strengths and limitations and methods to address limitations (Piantadosi 1988 Greenland 1989 Greenland 1992 Greenland 1994 Richardson 2000) One particular concern is the interpretation of ecological study findings showing modest elevations of risk such as in the current study with rate ratios in the more fully adjusted models generally estimated to be below 110 (Wakefield 2003 Morgenstern 1998) Wakefield states that slight elevations need to be interpreted with great caution

in particular for cancer and heart disease (because) there are risk factors that are far more predictive of disease than environmental factors hellip Consequently the potential for confounding is strong since ecological studies do not directly use individual-level risk factor data although stratification by age and gender is routinely carried out (Frequently an area-level measure of socio-economic status is also used as a confounding variable but being an aggregate summary it only provides very crude control) (Wakefield 2003 p 9-10)

A strength of this study is that it used Census block groups which are relatively small average population of 1200 in the study area to assign socio-economic characteristics rather than the larger ZIP codes with average population size 8000 in this area By using block groups this study sought to reduce confounding by socio-economic status because the assignment of median income and other variables at the block group level was expected to be more accurate with block group populations expected to be more homogenous than larger and more geographically diffuse ZIP code populations used in the prior studies

The use of census block groups rather than ZIP codes likely enhanced the accuracy of the group-level category assignments However this type of design that assigns characteristics to hospitalizations using group-level characteristics and distance to the river is limited in comparison to individual-level designs that gather such information from personal interviews for example Most importantly for this investigation the group-level assignment of median income and other demographic variables to each hospitalization limited the studyrsquos ability to distinguish among individuals within the exposure proxy categories and thereby reduced the studyrsquos ability to control for confounding

This type of measurement error or inaccuracy from assigning group-level potential exposure income or educational level to the individual hospitalizations is generally assumed to bias a study towards findings of no associations or no effects But in situations where the measurement error is more severe for some variables than others and these variables are correlated then the study findings can be biased either towards exaggerated effects or no effect In this

21

investigation potential measurement error associated with the assignment of exposure categories based on block group centroid distance to the river is expected to be relatively low because a large proportion of the hospitalization addresses are very clearly in areas distant from the river The measurement error associated with the assignment of group-level demographic characteristics (income and education for example) from the Census is potentially relatively higher because this type of error can occur throughout every block group in the study

Another strength of this study is the use of NYS SPARCS data a relatively comprehensive population-based source of information about hospitalizations These data include individual-level information used in this study for patient address sex age race and diagnosis Of these variables only the address and race categories were missing information for substantial numbers of records and this limitation could have contributed to imprecision sufficient to create bias in the findings These issues are discussed in greater detail below

While geocoding to block groups represents a strength of this study this procedure required high quality address information One percent of the study arearsquos hospitalization records contained no address and eight percent of the addresses could not be accurately geocoded to a block group resulting in nine percent of hospitalizations being eliminated from the study because of address andor geocoding issues The distributions of race categories and distribution among the types of diagnoses did not change after dropping records with no or insufficient address information suggesting that the dropped records did not represent a substantially skewed group of records

Inaccuracy in the studyrsquos assignments of race categories results from limitations associated with both the hospitalization and Census data The SPARCS hospitalization database did not include information on race for seven percent of the studyrsquos hospitalizations We imputed race by assigning race category randomly based on the race distribution of the personrsquos block group This method would have biased the study toward a finding of no effect of race on hospitalization risk

Another limitation of the SPARCS data used in this investigation is that repeat hospitalizations for the same individual were not able to be identified If an individual was admitted to the hospital many times over the course of the study period then that individual would be counted multiple times This limitation may have contributed to an exaggerated association of lower socio-economic status and CVD hospitalizations because some people with lower incomes receive less adequate outpatient care and may have more repeat admissions Because of the association of lower income with residence near the river this limitation may have contributed to the difficulty of adequately controlling for confounding by socio-economic status in this investigation

While the use of Census data at the block group level rather than at the ZIP code level is a strength in terms of relative homogeneity among the population within block groups it is possible that block group population estimation errors contributed to imprecise estimation of hospitalization rates Particularly with regard to the race categories the study finding that the ldquootherrdquo category showed a counter-trend of higher income and higher CVD hospitalization rates (Appendix B Table 2) is evidence that differences between the categories used in the Census race designations and hospitalization race assignments affected the estimated hospitalization

22

rates The evolving choices for identifying onersquos race in the Census are not concurrently reflected in the SPARCS data and these inconsistencies likely contributed to imprecision in the estimates of CVD hospitalization risk associated with specific race categories These issues did not contribute to bias for the overall findings however We modeled the data with a variety of race category groupings combining black and ldquootherrdquo for example and the overall results did not change (data not shown)

Also regarding the Census undercounting of approximately two percent for the US population in 2000 was estimated to be primarily among the lowest socio-economic groups and minorities while over-counting of one percent was estimated to have occurred among the most wealthy who sometimes received and completed forms at multiple residences (United States Censusrsquo Monitoring Board 2001) The undercounting of the lower income population would result in over-estimation of rates in this population Undercounting of recent Asian immigrants for example particularly those of lower socio-economic status is a potential reason for the relatively high rates of CVD hospitalizations estimated for lower income quartiles of the Asian population (Figures 4-6) Models that grouped the Asian population with other race categories were also estimated and these showed the same results as the models reported here (data not shown) Undercounting of lower income groups in general would have contributed to the finding of higher hospitalization rates in these groups and added to the difficulty of adequately controlling for confounding of lower income with residence near the river

This study used a 50 percent sample of the CVD hospitalizations in the study area This was accomplished by using every other record provided in the complete dataset This would be expected to provide a non-biased random sample As a general check for data provision management or analysis errors we compared our estimates of hospitalization rates per population with those from other sources Table 10 shows these comparisons and shows that our estimates are generally consistent with rates from other sources

Actual exposures were not measured in this investigation Rather proximity to the river was used as a proxy for potential exposures associated with the river In addition other potential exposures that may impact CVD were not evaluated or measured Exposures to lead and air pollution would be expected to be higher in more urban areas and areas near heavy traffic which may tend to coincide with proximity to the river These issues were not addressed in this investigation If these exposures were occurring more often near the river and contributing to cardiovascular hospitalization rates this would contribute to findings of elevations in areas closer to the river

It is important to keep in mind the limitations associated with using CVD hospitalizations when incidence andor prevalence of CVD are the actual outcomes of interest As described previously mortality prior to reaching the hospital may reduce hospitalization rates particularly for AMI and stroke for some groups or areas more than others and introduce bias if hospitalizations are being interpreted as a surrogate for incidence or prevalence

23

Addressing Study Objectives

This follow-up report has addressed the studyrsquos first two objectives to increase control for confounding by socio-economic status through the use of block groups rather than ZIP codes as the unit of analysis and to compare the hospitalization rates among residents of block groups nearest the Hudson River to those among residents of block groups farther from the Hudson River Regarding the first objective the use of smaller units of analysis block groups rather than ZIP codes improves the accuracy of the assignment of income and other group-level demographic categories to the hospitalizations

Regarding the studyrsquos second objective to compare hospitalization rates among residents of block groups near to the river to hospitalization rates among residents far from the river this studyrsquos multivariable analyses showed relatively small but statistically significant elevations of CVD hospitalization risk for residents of block groups in close proximity to the Hudson River Stratified analyses that controlled for confounding directly by conducting analyses within each income quartile showed that residence in block groups near the Hudson River was associated with increased risk for CVD hospitalizations among residents of lower income block groups and conversely residence in block groups near the Hudson River was associated with decreased risk for CVD hospitalizations among residents of higher income block groups The estimated increased and decreased risks were of sufficient magnitude to nearly cancel each other out when average rate ratios for the four income quartiles were calculated The average rate ratios may be interpreted as showing no evidence that CVD hospitalization rates are elevated for residents living near the river for the general study population This finding suggests that the multivariable regression findings for the overall population show elevated risks for hospitalizations among residents of block groups near the river due to confounding of the relationship of residential location and CVD risk by socio-economic status

The findings from analyses stratified by income quartile may also be interpreted as showing that residing in a higher versus lower income quartile block group produces different effects from similar exposures (effect modification) As discussed previously the ecological ie group-level analyses are unable to directly evaluate confounding and effect modification Rather information about the strengths and direction of associations among variables in the analyses need to be evaluated in light of known risk factors to draw conclusions about likely explanations and plausible interpretations In this case uncontrolled confounding by socio-economic status appears to be the most likely explanation for the findings This is due to the strong association between residential proximity to the river and socio-economic status

The studyrsquos third objective was to compare the findings of this review to those of the earlier studies that prompted this investigation Because of the similarities in study design and statistical analysis the prior studyrsquos findings can be compared with the findings of the multivariate regression analyses in the current study using Table 1 and Tables 8a and 8b The major difference between the studies is the composition of the comparison areas for the analyses Because of this difference as well as other differences (Table 3) the studies are not entirely comparable

24

Table 1 shows some findings from the prior studies Comparison with Table 8a and 8b from this study shows that the CV hospitalization rate ratios estimated for living in block groups adjacent or near the river in the current study are substantially lower than the rate ratios for residence in ZIP codes adjacent to the Hudson River in the previously published studies In most cases the current analyses show elevations below 10 in contrast to elevations of 20 to 39 in the prior studies The results of the current study are quite similar to the prior study results in terms of the associations reported for age sex race and income (Appendix B Tables 1a and 1b) (Sergeev and Carpenter 2005 Tables 4 amp 5 Shcherbatykh et al 2005 Table 3) These similarities and the general methodological similarities between the current and prior studies suggest the likelihood that the prior studiesrsquo findings also were affected by confounding by socio-economic status

6 CONCLUSION AND RECOMMENDATIONS

While this type of study can neither prove nor disprove causal links among risk factors and health effects it is useful for examining evidence for unusual patterns that might warrant additional investigation This evaluation of hospitalization data residential location and socioshyeconomic factors in block groups in the 12 Counties abutting the Hudson River revealed a closely coinciding pattern of lower socio-economic status and residential proximity to the Hudson River the proxy for potential exposure and a strong association of lower socioshyeconomic status and risk for CVD hospitalizations The strong associations among residential location socio-economic status and CVD risk prevent us from drawing definitive conclusions from the multivariable regression analyses

The multivariable analyses showed relatively small but statistically significant elevations of CVD hospitalization risk for residents of block groups in close proximity to the Hudson River Stratified analyses conducted within each of the four income quartiles showed such elevations only among residents of lower income block groups This studyrsquos data methods and analyses can not provide a conclusive interpretation for this finding However the strong effect of socioshyeconomic status on residential location (exposure proxy) and on CVD hospitalization risk makes it likely that the overall multivariable regression results are affected by uncontrolled confounding by socio-economic status

To draw more definitive conclusions particularly for outcomes such as CVD that are determined by multiple known risk factors studies seeking to investigate the role of PCB or POP exposures will require resource-intensive methods that include gathering individual-level exposure or biomonitoring information as well as individual-level medical histories and information on other risk factors

This current investigation did not include environmental or biological sampling data One study described previously did gather such data in the Hudson River region with the highest potential exposures (Hudson Falls Fort Edward) This study detected a statistically significant reduction in the total PCB concentrations in outdoor air samples at a distance of 1200 meters (34 mile) from the river However the PCB levels in outdoor air in this area were lower than those in other communities with known PCB-contaminated sites and similar to levels reported in other locations in the northeastern United States The study found no detectable differences in study

25

subjectsrsquo serum PCB levels associated with proximity to contaminated portions of the river or other PCB sources (Fitzgerald 2007 Palmer 2008)

It is recommended that the results of the current investigation of CVD hospitalizations and residence near the Hudson River be shared with NYS residents and other stakeholders with interest in this issue and be submitted for publication in a peer-reviewed journal

26

AGENCY INFORMATION

New York State Department of Health Authors Elizabeth Lewis-Michl PhD

Research Scientist Bureau of Environmental and Occupational Epidemiology

June Beckman-Moore MPH Research Scientist Bureau of Environmental and Occupational Epidemiology

Marta Gomez MS Research Scientist Bureau of Environmental and Occupational Epidemiology

Acknowledgments This study was supported by ATSDR and the Centers for Disease Control and Prevention

through a Cooperative Agreement Grant to NYS DOH (3U61TS200002-19W1)

The authors would like to thank the following people for their contribution to this project Alissa Caton PhD of the Bureau of Environmental and Occupational Epidemiology NYS DOH developed the study protocol assembled the preliminary data and conducted preliminary analyses Syni-an Hwang PhD Bureau Director and Shao Lin PhD Research Scientist Bureau of Environmental and Occupational Epidemiology NYS DOH and Edward Fitzgerald PhD Professor Division of Environmental Health and Toxicology SUNY Albany School of Public Health reviewed and provided comments on draft versions

New York State Department of Health Cooperative Agreement Coordinator Don Miles MS

Public Health Specialist Bureau of Environmental Exposure Investigation

ATSDR Technical Project Officer Gregory V Ulirsch PhD

Environmental Health Scientist Cooperative Agreement and Program Evaluation Branch

Division of Health Assessment and Consultation

Mohammed Uddin MD Epidemiologist

Division of Health Studies

ATSDR Regional Representatives Leah Graziano

Senior Regional Representative - Region 2 Office of Regional Operations

Racquel Stephenson Regional Representative - Region 2

Office of Regional Operations

27

REFERENCES Agency for Toxic Substances and Disease Registry (ATSDR) 2000 Toxicological Profile for Polychlorinated Biphenyls (PCBs) ATSDR Division of ToxicologyToxicology Information Branch Atlanta GA

Arisawa K Takeda H Mikasa H 2005 ldquoBackground exposure to PCDDsPCDFsPCBs and its potential health effects a review of epidemiologic studiesrdquo The Journal of Medical Investigation Feb 52(1-2)10-21

Arsenescu V Arsenescu RI King V Swanson H Cassis LA 2008 ldquoPolychlorinated biphenylshy77 induces adipocyte differentiation and proinflammatory adipokines and promotes obesity and atherosclerosisrdquo Environmental Health Perspectives 116 (6) 761-768

Ayala C Croft J Keenan N Neff L Greenlund K Donohoo R Zheng Zhi Mensah G 2003 ldquoIncreasing Trends in Pre-transport stroke deaths- United States 1990-1998rdquo Ethnicity amp Disease 12 Spr S2131-S2137

Barnett E Armstrong D Casper M 1999 Evidence of increasing coronary heart disease mortality among black men of lower social class AEP 98 464-471

Bertazzi PA Bernucci I Brambilla G Consonni D Pesatori AC 1998 ldquoThe Seveso studies on early and long-term effects of dioxin exposure a reviewrdquo Environmental Health Perspectives April 1062 625-633

Carpenter DO 2006 ldquoPolychlorinated biphenyls (PCBs) routes of exposure and effects on human healthrdquo Reviews in Environmental Health Jan-Mar 21(1)1-23

deVreede-Swagemakers JJ Gorgels APDubois-Arbouw WI van Ree JW et al 1997 ldquoOut-of hospital cardiac arrest in the 1990s a population-based study in the Maastricht area on incidence characteristics and survivalrdquo Journal of the American College of Cardiology 30(6) 1500-5

Everett CJ Mainous AG Frithsen IL Player MS Matheson EM 2008 ldquoAssociation of polychlorinated biphenyls with hypertension in the 1999-2002 National Health and Nutrition Examination Surveyrdquo Environmental Research 108 94-97

Fitzgerald EF Belanger EE Gomez MI Hwang S Jansing RL Hicks HE 2007 ldquoEnvironmental exposures to polychlorinated biphenyls (PCBS) among older residents of upper Hudson River communitiesrdquo Environmental Research 104 352-360

Galea S Blaney S Nandi A Silverman R Vlahov D Foltin G Kusick M Tunik M Richmond N 2007 ldquoExplaining racial disparities in incidence of and survival from out-of-hospital cardiac arrestrdquo American Journal of Epidemiology 1665 534-543

Greenland S Morgenstern H 1989 ldquoEcological bias confounding and effect modification International Journal of Epidemiology 18269-74

28

Greenland S 1992 ldquoDivergent biases in ecologic and individual level studiesrdquo Statistics in Medicine 111209-23

Greenland S Robins J 1994 ldquoInvited commentary ecologic studies ndash biases misconceptions and counterexamples American Journal of Epidemiology 139747-60

Greenland S Pearl J Robins JM 1999 Causal diagrams for epidemiologic research Epidemiology 1037-48

Greenland S 2003 ldquoQuantifying biases in causal models classical confounding vs collidershystratification biasrdquo Epidemiology May Vol 143 300-306

Ha M Lee D Jacobs DR 2007 ldquoAssociation between serum concentrations of persistent organic pollutants and self-reported cardiovascular disease prevalence results from the national health and nutrition examination survey 1999-2002rdquo Environmental Health Perspectives 1158 1204-1209

Hennig B Meerarani P Slim R Toborek M Daugherty A Silverstone AE Robertson LW 2002 ldquoProinflammatory properties of coplanar PCBs in vitro and in vivo evidence Toxicology and Applied Pharmacology Jun 15181(3)174-83

Hertz-Picciotto I ldquoEnvironmental Epidemiologyrdquo in Modern Epidemiology Rothman KJ and Greenland S 2nd ed Philadelphia Lippincott Williams amp Williams 1998 pp555-584

Howard G Cushman M Prineas R Howard V Moy C Sullivan L DrsquoAgostino R McClure L Pulley L Safford M 2009 ldquoAdvancing the hypothesis that geographic variations in risk factors contribute relatively little to observbed geographic variations in heart disease and stroke mortalityrdquo Preventive Medicine 49 129-132

Huang X Lessner L Carpenter DO 2006 ldquoExposure to persistent organic pollutants and hypertensive diseaserdquo Environmental Research 102(2005) 101-106

Kannel WB Cupples LA DrsquoAgostino RB 1987 ldquoSudden death risk in overt coronary heart disease the Framingham Studyrdquo American Heart Journal 113(3) 799-804

Ko DT Krumholz HM Wang Y Foody JM Masoudi FA Havranek EP You JJ Alter DA Stukel TA Newman AM Tu JV 2007 ldquoRegional differences in process of care and outcomes for older acute myocardial infarction patients in the United States and Ontario Canadardquo Circulation 115196-203

Lee D Lee I Song K Steffes M Toscano W Baker BA Jacobs DR 2006 ldquoA strong dose-response relation between serum concentrations of persistent organic pollutant and diabetesrdquo Diabetes Care 29 (7) 1638-1644

Lee D Lee I Porta M Steffes M Jacobs DR 2007 ldquoRelationship between serum concentrations of persistent organic pollutants and the prevalence of metabolic syndrome among non-diabetic

29

adults results from the National Health and Nutrition Examination Survey 1999-2002rdquo Diabetologia 50 1841-1851

Lind PM Orberg J Edlund U Sjoblom L Lind L 2004 ldquoThe dioxin-like pollutant PCB126 (33rsquo44rsquo5-pentachlorobiphenyl) affects risk factors for cardiovascular disease in female ratsrdquo Toxicology Letters 150 293-299

Longnecker MP Klebanoff MA Brock JW Zhou H 2001 ldquopolychlorinated biphenyl seum levels in pregnant subjects with diabetesrdquo Diabetes Care 24(6) 1099-1101

Lloyd-Jones D Adams R Carnethon et al (AHA Statistics Committee and Stroke Statstics Subcommittee) 2009 ldquoAHA Statistical Update Heart disease and stroke statisticsmdash2009 updaterdquo Circulation 119e21-e181

MapMarker Plus V 100 MapInfo Corp 2004

Morgenstern H 1998 ldquoEcologic studyrdquo In Armitage P Colton T eds Encyclopedia of biostatistics Vol2 NY NY John Wiley and Sons Inc 1255-76

Morganstern H ldquoEcologic Studiesrdquo Chapter 23 in Modern Epidemiology Rothman KJ and Greenland S 2nd ed Philadelphia Lippincott Williams amp Williams 1998 pp459-480

Morland K Wolff M Bopp R Godbold J Landrigan P 2008 ldquoFish consumption and body burden of organochlorines among Lower Hudson urban anglersrdquo American Journal of Industrial Medicine 51587-594

New York State Department of Health (NYS DOH) 2009a ldquoThe Burden of Cardiovascular Disease in New York Mortality Prevalence Risk Factors Costs and Selected Populationsrdquo Bureau of Chronic Disease Epidemiology and Surveillance Bureau of Health Risk Reduction httpwwwhealthstatenyusnysdohchronic_diseasecardiovascularburdenofcvdinnyspdf

New York State Department of Health (NYS DOH) 2009b ldquoHeart Disease and Stroke Indicatorsrdquo Community health assessment indicators Bureau of Biometrics and Health Statistics httpwwwhealthstatenyusstatisticschacchai

OrsquoNeill L 2003 ldquoEstimating out-of-hospital mortality due to myocardial infarctionrdquo Health Care Management Science 6 147-154

Palmer PM Belanger EE Wilson LR Hwang SA Narang RS Gomez MI Cayo MR Durocher LA Fitzgerald EF ldquoOutdoor air PCB concentrations in three communities along the Upper Hudson River New Yorkrdquo 2008 Archives of Environmental Contamination and Toxicology 54363-371

Piantadosi S Byar DP Green SB 1988 ldquoThe ecological fallacyrdquo American Journal of Epidemiology 127893-904

30

Rehkopf DH Berkman LF Coull B Krieger N 2008 ldquoThe non-linear risk of mortality by income level in a healthy population UgtSgt National Health and Nutrition Examination Survey mortality follow-up cohort 1988-2001rdquo British Medical Journal Public Health 8383

Richardson S Montifort C 2000 ldquoEcological correlation studies In Elliott P Wakefield JC Best NG et al eds Spatial epidemiology methods and applications NY NY Oxford University Press 205-220

SAS version 9 Cary North Carolina

Schoenbach V 2001 ldquoMulticausality Effect Modification wwwepidemiolognet Ch12 381shy413

Schoenbach V 2004 ldquoMulticausality Confounding wwwepidemiolognet Ch11 335-375

Sergeev AV and Carpenter DO 2005 Hospitalization rates for coronary heart disease in relation to residence near areas contaminated with persistent organic pollutants and other pollutants Environmental Health Perspectives 113(6)756-761

Shcherbatykh I Huang X Lessner L Carpenter DO 2005 Hazardous waste sites and stroke in New York State Environmental Health A Global Access Science Source 4(18) httpwwwehjournalnetcontent4118

Siemiatycki J Wacholder S Dewar R Cardis E Greenwood C and Richardson L 1988 ldquoDegree of confounding bias related to smoking ethnic group and socio-economic status in estimates of the associations between occupation and cancerrdquo Journal of Occupational Medicine 30 617-625

Tu YK Gunnell D Gilthorpe MS 2008 ldquoSimpsonrsquos paradox Lordrsquos paradox and suppression effects are the same phenomenon- the reversal paradoxrdquo Emerging Themes in Epidemiology 52

United Kingdom Heart Attack Study Collaborative Group Norris RM 1998 ldquoFatality outside hospital from acute coronary event in three British health districts 1994-1995rdquo British Medical Journal 316(7137) 1065-70

United States Bureau of the Census 2000 Census of population and housing summary file 1 (SF1) United States Department of Commerce 2001

United States Bureau of the Census 2000 Census of population and housing summary file 3 (SF3) United States Department of Commerce 2002

United States Bureau of the Census 1990 Census of population and housing summary file 1 (SF1) United States Department of Commerce 1991

31

United States Bureau of the Census 1990 Census of population and housing summary file 3 (SF3) United States Department of Commerce 1992

United States Censusrsquo Monitoring Board ldquoFinal Report to Congressrdquo September 1 2001 httpgovinfolibraryunteducmbcmbpreportsfinal)report

United States Department of Health and Human Services (USDHHS) Agency for Healthcare Research and Quality 2007 ldquoAHRQ Quality Indicators Guide to Prevention Quality Indicators Hospital Admission for Ambulatory Care Sensitive Conditionsrdquo October 2001 Version 31 (March 12 2007) downloaded 2009 httpwwwqualityindicatorsahrqgov

United States Department of Health and Human Services (USDHHS) Centers for Disease Control 2007 Health United States A Chartbook National Center for Health Statistics wwwcdcgovnchsdatahushus07pdf

United States Department of Health and Human Services (USDHHS) Centers for Disease Control National Center for Health Statistics 2007 ldquo2005 National Hospital Discharge Surveyrdquo No 385 by De Frances C and Hall M Division of Health Care Statistics

United States Department of Health and Human Services (USDHHS) 2009 Centers for Disease Control and Prevention National Center for Environmental Health 4th National Report on Environmental Chemicals httpwwwcdcgovexposurereportpdfFourthReportpdf

United States Department of Health and Human Services (USDHHS) National Institutes of Health 2010 National Institute of Environmental Health Sciences National Toxicology Program 11th Report on Carcinogens httpntpniehsnihgovntproceleventhprofiless149pcbpdf

United States Environmental Protection Agency (USEPA) 2002 EPA Superfund record of decision Hudson River PCBs Site Hudson Falls to New York City New York Washington DC

Wakefield J 2003 Sensitivity Analyses for Ecological Regression Biometrics March 59 9-17

Wakefield J 2004

Wing S 1988 Social inequalities in the decline of coronary mortality American Journal of Public Health 781415-1416

World Health Organization 2010 Atlas of Heart Disease and Stroke Part 2 Ch 3 Cardiovascular Disease Risk Factors httpwwwwhointcardiovascular_diseasesencvd_atlas_03_risk_factorspdf

32

Table 1 Previous Study Results Statewide Analyses and Hudson River Specific Analyses

Principal Diagnosis of Hospitalization

Adjusted Rate Ratioa (Lower 95CI shyUpper 95 CI)

NYS analysis Hudson River Subset

analysis Ischemic Heart Diseaseb

115 (103-129) 136 (118-156) Acute Myocardial Infarction 120 (103-139) 139 (118-163)

Cerebrovascular Diseasec 115 (105-126) 120 (110-132)

Ischemic Stroke 117(104-139) --shy Hemorrhagic Stroke 110 (099-122) ---

Hypertensionc 119 (108-131) 114 (105-123)

Table adapted from Sergeev et al 2005 Shcherbatykh et al 2005 and Huang et al 2006 Ischemic heart disease and stroke hospitalization rates for NYS ZIP codes containing (or

adjacent to) inactive hazardous waste sites with persistent organic pollutants compared to ZIP codes without (or not adjacent to) such sites

a All analyses include a 2nd exposure category for other types of inactive hazardous waste sites and adjust for age race and sex

b Ischemic heart disease analyses include all ages over 25 Ischemic heart disease analyses adjust for quartiles of median household income and health insurance coverage

c Cerebrovascular disease and hypertension analyses include ages 25-64

33

Table 2 Comparison of Design amp Methods for Previously Published Studies and Current Follow-up Study

OUTCOME ASSESSMENT CONFOUNDER ASSESSMENT EXPOSURE ASSESSMENT Previous Study Follow-up Study Previous Study Follow-up Study Previous Study Follow-up Study

Coronary Heart Disease 1993-2000 More years 1990-2005 ZIP code level Block group level ZIP code level Block group level

Ischemic heart disease (ICD-9 410shy Same outcomes with 4 levels of income ndash Same income Exposure = residence Two types of exposure 414) and sequelae of myocardial one addition separate 2000 Census categories within 196 ZIP codes indicator categories within infarction not elsewhere classified analysis of angina (ICDshy quartiles with hazardous waste 12 counties abutting (ICD-9 4297) Separate analysis of 9 413) sites containing POPs Hudson River acute myocardial infarction 6 age groups Same age groups or 222 ZIP codes with (ICD-9 410) age 25 and above hazardous waste sites Adjacency 255 block

without POPs versus groups adjacent versus Principal or other diagnoses (up to 14) Only principal diagnoses 4 race groups Different race residence within 996 2116 not adjacent to the combined AsianPacific categories White ldquocleanrdquo ZIP codes Hudson River

Use the admission Islander White black AsianPacific statewide Reference Sergeev AV Carpenter source field to eliminate black Native Island and Other which Distance Block group DO Hospitalization rates for coronary intrahospital transfers for American includes Native Sub-analyses (focus of population-weighted heart disease in relation to residence the same individual American and Multiple follow-up) Hudson centroid distance from the near areas contaminated with Race River area 78 POP ZIP Hudson River lt12 mile persistent organic pollutants and other codes versus 996 ldquocleanrdquo (225 block groups) pollutants Environ Health Perspect Add education ZIP codes statewide between frac12 and 1 mile 2005 113(6)756-761 population density (216 block groups) farther

hispanic and distance to than 1 mile (1930 block nearest hospital as groups) additional covariates

Stroke 1993-2000 More years 1990-2005 Restricted to 2nd and (same as above) (same as above) (same as above) 3rd quartile income

Cerebrovascular disease excluding Same outcomes ZIP codes other ill-defined and late effects of cerebrovascular disease (ICD9 430shy 4 age groups 25-64 436) Separate analyses of ischemic stroke (433x1 434x1 436) and 2 race groups hemorrhagic stroke ( 430-432) White black

Principal or secondary diagnoses (up Only principal diagnoses to 14) combined

Use the admission Reference Shcherbatykh I Huang X source field to eliminate Lessner L Carpenter DO Hazardous intrahospital transfers for waste sites and stroke in New York the same individual State Environ Health 2005 418

34

Table 3 Study Cardiovascular Disease Outcomes ICD-9-CM Codes and Definitions

Ischemic Heart Disease

ICD-9-CM 410 - ldquoAcute myocardial infarction (AMI)rdquo is defined as a sudden insufficiency of blood supply to an area of the heart muscle usually due to a coronary artery occlusion (obstruction) commonly known as heart attack

ICD-9-CM 411- ldquoOther acute and subacute forms of ischemic heart diseaserdquo refers to complications or symptoms occurring before or after AMI or a coronary artery occlusion (obstruction) interrupting blood flow to the heart without AMI

ICD-9-CM 412 - ldquoOld myocardial infarctionrdquo refers to a past AMI that is currently presenting no symptoms ICD-9-CM 413 - ldquoAngina pectorisrdquo is defined as severe constricting pain in the chest often radiating from the

area immediately over the heart to the left shoulder and down the arm ICD-9-CM 414 - ldquoOther forms of chronic ischemic heart diseaserdquo includes coronary atherosclerosis a chronic

condition marked by thickening and loss of elasticity of the coronary artery caused by deposits of plaque

ICD-9-CM 4297 - ldquoCertain sequelae of myocardial infarction not elsewhere classifiedrdquo refers to other symptoms or complications associated with a prior AMI

Cerebrovascular Disease

ICD-9-CM 430 - ldquoSubarachnoid hemorrhagerdquo is bleeding in the space between the brain and lining ICD-9-CM 431 - ldquoIntracerebral hemorrhagerdquo is bleeding within the brain ICD-9-CM 432 - ldquoOther and unspecified intracranial hemorrhagerdquo is bleeding nontraumatic between the skull

and brain lining ICD-9-CM 433 - ldquoOcclusion and stenosis of precerebral arteriesrdquo is blockage or stricture of the arteries

branching into the brain ICD-9-CM 434 - ldquoOcclusion of cerebral arteriesrdquo includes cerebral thrombosis and cerebral embolism ICD-9-CM 435 - ldquoTransient cerebral ischemiardquo includes cerebrovascular insufficiency (acute) with transient

neurological signs and symptoms ICD-9-CM 436 - ldquoAcute but ill-defined cerebrovascular diseaserdquo

35

Table 4 ndash Census Demographics for Study Area by Proximity to the Hudson River 2000

Census Demographics Study Area Within Block Groups Adjacent to River Block Group Centroid Distance to River Yes No lt frac12 mile frac12-1 mile gt 1 mile

n n n n n n Total Population and distribution within river distance categories

2924631 1000 315662 108 2608969 892 231414 79 283321 97 2409896 824

RaceEthnic Distribution 1000 1000 1000 1000 1000 1000 White 2383274 815 255023 808 2128251 816 154371 667 201457 711 2027446 841 Black 277302 95 34039 108 243263 93 43895 190 38959 138 194448 81 Total ldquoOtherrdquo (listed below)

American Indian 7147 02 882 03 6265 02 975 04 1058 04 5114 02

Asian 86465 30 6233 20 80232 31 4711 20 8979 32 72775 30 Pacific Islander 1058 lt01 111 lt01 947 lt01 104 lt01 145 lt01 809 lt01 Other 105182 36 12492 40 92690 36 19288 83 23338 82 62556 26 Multiple Races 64203 22 6882 22 57321 22 8070 35 9385 33 46748 19

Hispanic 268036 92 28449 90 239587 92 40691 176 46978 166 180367 75

Education lt9th grade 108312 56 13370 64 94942 55 13585 91 14518 81 80209 50 9th-12th grade 200374 103 27023 130 173351 100 23306 155 25161 140 151907 94 High school or GED 519236 267 62727 302 456509 263 44366 296 51585 287 423285 263 Some college 336283 173 36464 175 299819 173 24995 167 31141 173 280147 174 Associates degree 150134 77 16574 80 133560 77 10134 68 12939 72 127061 79 Bachelors degree 342297 176 28578 137 313719 181 18898 126 24584 137 298815 185 Graduate or professional degree

285823 147 23199 112 262624 151 14813 99 19909 111 251101 156

+Other SES Factors Households below poverty 91940 87 12875 108 79065 83 13977 154 14685 140 63278 72

mean SD mean SD mean SD mean SD mean SD mean SD Population density 57466 9589 54496 11391 57826 93468 13501 16071 11068 16248 43763 66909 Median household income 61010 31989 47320 23630 62668 32468 40748 21923 44962 20829 64847 32564

Sources US Bureau of the Census 2000 Census of population and housing summary file 1(SF1) US Department of Commerce 2001 US Bureau of the Census 2000 Census of population and housing summary file 3 (SF3) US Department of Commerce 2001

36

Table 5 Study Population Person Years by River Proximity Category For Income Quartiles and Race Categories

Person Years Total Adjacent Non-Adjacent lt12 mile frac12 to 1 mile gt1 mile n (row)

[column ] n (row)

[column ] n ()

[column ] n ()

[column ] n ()

[column ] n ()

[column ] Total Population 30219001 (1000)

[1000]

3269547 (108)

[1000]

26949454 (892)

[1000]

2380477 (079)

[1000]

2834956 (094)

[1000]

25003568 (827)

[1000]

Income Quartile

Lowest Income 7182853 (1000)

[237]

1345251 (187)

[411]

5837601 (813)

[217]

1462710 (204)

[614]

1236936 (172)

[436]

4483206 (624)

[179]

2nd Income 7778921 (1000)

[257]

1077888 (139)

[330]

6701033 (861

[249]

420361 (054)

[177]

808782 (104)

[285]

6549777 (842)

[262]

3rd Income 7671943 (1000)

[254]

551617 (072)

[169]

7120326 (928)

[264]

344840 (045)

[145]

511237 (067)

[180]

6815866 (888)

[273]

Highest Income 7585283 (1000)

[251]

294790 (039)

[090]

7290493 (961)

[271]

152565 (020)

[064]

278000 (037)

[098]

7154718 (943)

[286]

Race Category

White 25859811 (1000)

[856]

2784914 (108)

[851]

23074897 (892)

[856]

1761910 (68)

[740]

2238348 (87)

[790]

21859554 (845)

[874]

Black 2466185 (1000

[82]

297220 (121)

[91]

2168965 (879)

[81]

389721 (158)

[164]

316704 (128)

[112]

1759760 (714)

[70]

Asian 792737 (1000)

[26]

58007 (73)

[18]

734731 (827)

[27]

45151 (57)

[19]

78161 (99)

[28]

669426 (844)

[27]

Multiple Other 1100269 (1000)

[36]

129407 (118)

[40]

970862 (882)

[36]

183696 (167)

[77]

201743 (183)

[71]

714830 (650)

[29]

37

Table 6 Study Hospitalization Record Exclusions

After After dropping After imputing dropping records with

After excluding race amp poorly missing continuation After excluding eliminating geocoded covariates

Geocoded records transfers missing race records (racesex)

Total 202272 201814 02 169143 162 167733 08 154236 80 154220 001 (Loss ) Demograp hic group

White 155216 849 154805 849 132749 859 144262 860 132343 858 132332 858 Black 12878 70 12849 70 11109 72 12010 72 11328 73 11326 73 OtherNa 12745 70 12741 70 8814 57 9313 55 8647 56 8644 56 AsianPI 2021 11 2020 11 1810 12 2148 13 1918 12 1918 12 Missing Race 19412 96 19399 96 14661 87 (1410) Diagnosis

IHD 137569 680 137303 680 111005 656 110005 656 110000 656 101012 655

AMI 51840 256 51635 256 39338 233 39338 233 39013 233 35811 232

Angina Stroke

5792 64703

29 320

5788 64511

29 320

5316 58138

31 344

5316 58138

31 344

5285 57712

31 344

4731 53208

31 345

Isch str 23673 117 23543 117 21012 124 21012 124 20912 125 19371 126 Hem str 8097 40 8074 40 6544 39 6444 39 6492 39 5995 39

Missing covariates 11 records were missing sex 325 were missing age and 14661 records were missing race Race was imputed by random assignment based on block group of residencersquos race distribution for 13251 records (310 were also missing other covariates For approximately 1100 records with missing race geocoding was inadequate for assigning block group and 310 of those missing race were also missing additional covariates) One discharge can have one or more covariates missing (ie missing covariates are not mutually exclusive

38

Table 7 Income Quartile and Race Distributions for CVD Hospitalizations

Study Population

IHD n=101012

AMI n=35811

Angina n=4731

Stroke n=53208

Ischemic Stroke n=19371

Hemorrhagic Stroke n=5995

Income (Quartiles) Lowest Income 237 286 300 356 298 309 301 Second Income 257 254 265 237 256 256 252 Third Income 254 241 236 224 236 234 223 Highest Income 251 221 201 182 209 201 224

Total 1000 999 999 1000 1000

White 855 859 885 833 858 844 809 Black 82 64 62 102 92 108 110 Asian 26 12 12 11 12 14 19 Other 36 65 41 53 38 34 62

Study population income quartiles are based on census block median income Quartile cut-points were developed by weighting block group median income levels by block group population to identify the cut-points within the continuous income distribution After creating the cut-points the study population of each block group was assigned to the appropriate quartile category This resulted in the percentage of population in each quartile varying slightly from 25 because the entire populations of block groups were assigned to the same quartile

39

Table 8a Exposure Area Distributions and Rate Ratios for Ischemic Heart Disease Hospitalizations

Exposure Person-Years

(N=30219001) in

Ischemic Heart Disease Hospitalizations Total AMI Angina

No () No () RR (95 CI) No () RR (95 CI) No () RR (95 CI)

Adjusted for Age Sex Race Adjacent Block Group Yes 3269547 (108) 12327 (122) 133 (112-157) 4398 (123) 136 (114-163) 591 (125) 123 (106-142) No 26949454 (892) 88611 (878) 100 Reference 31380 (877) 100 Reference 4137 (875) 100 Reference Distance From River lt frac12 mile 2380477 (079) 9817 (97) 120 (103-140) 3399 (095) 126 (109-145) 542 (115) 160 (137-188) frac12 - 1 mile 2834956 (094) 11236 (111) 120 (104-140) 4048 (113) 125 (109-144) 610 (129) 156 (133-183) gt1 mile 25003569 (827) 79885 (791) 100 Reference 28331(792) 100 Reference 3576 (756) 100 Reference

Addrsquol adjustment for SES and other factors Adjacent Block Group Yes 3269547 (108) 12327 (122) 107 (104-111) 4398 (123) 106 (101-110) 591 (125) 106 (096-117) No 26949454 (892) 88611 (878) 100 Reference 31380 (877) 100 Reference 4137 (875) 100 Reference Distance From River lt frac12 mile 2380477 (079) 9817 (97) 102 (098-106) 3399 (095) 103 (098-109) 542 (115) 109 (098-121) frac12 - 1 mile 2834956 (094) 11236 (111) 103 (099-108) 4048 (113) 106 (101-111) 610 (129) 113 (102-125) gt 1 mile 25003569 (827) 79885 (791) 100 Reference 28331(792) 100 Reference 3576 (756) 100 Reference

Agegt25 1990-2005 50 sample Adjusted for age (25-34 35-44 45-54 55-64 65-74 75+ years) race (white black other) and sex (male female) Adjusted for SES factors quartiles of population density (61-5066 5066lt-21020 21020lt-64102 64102lt-943716 persons per square mile) median 1999 household income ($249900-4096800 $4096800lt-5465000 $5465000lt-7294400 $7294400lt-20000100) with less than a college education (0-306 306lt-427 427lt-555 555lt-1000) with Hispanic ethnicity (0-21 21lt-47 47lt- 96 96lt-860) and distance to nearest hospital

40

Table 8b Exposure Area Distributions and Rate Ratios for Stroke Hospitalizations Age gt= 25 1990-2005

Exposure Person-Years

(N=30219001) in Study Area

Stroke Hospitalizations

Total Ischemic Hemorrhagic

No () No () RR (95 CI) No () RR (95 CI) No () RR (95 CI)

Adjusted for Age Sex Race Adjacent Block Group Yes 3269547 (108) 6276 (118) 125 (109-144) 2334 (121) 128 (110-148) 635 (108) 099 (086-113)

No 26949454 (892) 46748 (882) 100 Reference 16989 (879) 100 Reference 5263 (892) 100 Reference

Distance From River lt frac12 mile 2380477 (079) 5280 (099) 116 (104-131) 1925 (100) 120 (105-137) 518 (088) 102 (088-117)

frac12 - 1 mile 2834956 (094) 6023 (114) 118 (106-132) 2227 (115) 127 (112-145) 658 (111) 115 (100-131)

gt1 mile 25003569 (827) 41721 (787) 100 Reference 15171 (785) 100 Reference 4722 (801) 100 Reference

Addl Adjustment for SES and Other Factors Adjacent Block Group Yes 3269547 (108) 6276 (118) 107 (103-112) 2334 (121) 110 (104-116) 635 (108) 097 (088 ndash 106)

No 26949454 (892) 46748 (882) 100 Reference 16989 (879) 100 Reference 5263 (892) 100 Reference Distance From River lt frac12 mile 2380477 (079) 5280 (099) 106 (101-110) 1925 (100) 108 (101-115) 518 (088) 089 (081-099)

frac12 - 1 mile 2834956 (094) 6023 (114) 106 (102-111) 2227 (115) 111 (104-117) 658 (111) 108 (099-119)

gt 1 mile 25003569 (827) 41721 (787) 100 Reference 15171 (785) 100 Reference 4722 (801) 100 Reference

Age gt 25 years 1990-200650 sample Adjusted for age (25-34 35-44 45-54 55-64 65-74 75+ years) race (white black other) and sex (male female) Adjusted for SES factors quartiles of population density (61-5066 5066lt-21020 21020lt-64102 64102lt-943716 persons per square mile) median 1999 household income ($249900-4096800 $4096800lt-5465000 $5465000lt-7294400 $7294400lt-20000100) with less than a college education (0-306 306lt-427 427lt-555 555lt-1000) with Hispanic ethnicity (0-21 21lt-47 47lt- 96 96lt-860) and distance to nearest hospital

41

Table 9 Rate Ratios for CVD Hospitalizations Stratified by Income Quartile

Rate Ratio for lt frac12 mile versus gt 1 mile from river

IHD AMI Angina Stroke Ischemic Stroke Hemorrhagic Stroke

RR (95CI) RR (95CI) RR (95CI) RR (95CI) RR (95CI) RR (95CI)

Adjusted for Age Sex Race Lowest Income 1 123 (108-140) 124 (110-139) 139 (120-160) 120 (109-133) 122 (107-138) 118 (103-137) Income 2 115 (101-131) 118 (101-139) 155 (124-195) 124 (113-135) 127 (109-147) --shyIncome 3 088 (075-103) 083 (068-102) 102 (076-137) 092 (079-107) 093 (076-113) --shyHighest Income 4 080 (064-099) 074 (054-103) 030 (013-070) 078 (065-093) 071 (053-095) 043 (024-076) Weighted average 100 098 091 102 100 071

Addrsquol Adjustment for other demographic factors Lowest Income 1 114 (107-120) 114 (106-123) 112 (097-130) 116 (109-124) 120 (110-131) 109 (095-125) Income 2 110 (102-118) 111 (101-123) 136 (108-172) 110 (101-119) 112 (099-128) 095 (076-119) Income 3 085 (077-094) 086 (075-097) 093 (068-126) 097 (087-108) 096 (082-112) 046 (033-066) Highest Income 4 068 (060-078) 069 (057-085) 028 (012-062) 069 (059-081) 058 (044-077) --shyWeighted average 092 093 079 096 093 078

To reduce the complexity of this Table only one of the distance categories is provided The other distance categories showed similar results Adjusted for age (25-34 35-44 45-54 55-64 65-74 75+ years) race (white black other) and sex (male female) Block group median income quartile ranges $249900-4096800 $4096800lt-5465000 $5465000lt-7294400 $7294400lt-20000100 Adjusted for demographic factors quartiles of population density (61-5066 5066lt-21020 21020lt-64102 64102lt-943716 persons per square mile) with less than a college education (0-306 306lt-427 427lt-555 555lt-1000) with Hispanic ethnicity (0-21 21lt-47 47lt- 96 96lt-860) and distance to nearest hospital

42

Table 10 Hospitalization Rate Estimates for Current Study and from Published Sources (age-adjusted to 2000 US standard per 10000)

Current Study 12 Counties abutting Hudson River 1990-2005

Total US CDC 2005

Northeastern US CDC 2005

SPARCS NYS excluding NYC 2005-2007

IHD 649 619 691

AMI 232 231 278

Angina 31 Stroke 349 303 307 351

Ischemic Stroke 127 39 Hemorrhagic Stroke

Source USDHHS 2005 National Hospital Discharge Survey No 385 July 12 2007 Source NYS DOH SPARCS 2009b Comparable estimates were not able to be provided because published outcome groupings differed

43

Figure 1 ZIP codes by Adjacency to the Hudson River

44

Figure 2 Census Block Groups by Adjacency to the Hudson River

45

Figure 3 Census Block Groups by Population-Weighted Centroid Distance to the Hudson River

46

IHD Hospitalization Rates by Sex Race Income (age-adjusted) (other race

group excluded)

000

200

400

600

800

1000

1200

1400

1600

Low est IncQ

Q2 Q3 Highest Inc Q

Income Quartile

Hos

pita

lizat

ions

per

10

00

w hite men black men

asian men w hite w omen

black w omen asian w omen

AMI Hospitalization Rates by Sex Race Income (age-adjusted) (other race

group excluded)

000

100

200

300

400

500

600

700

Low est IncQ

Q2 Q3 Highest Inc Q Income Quartile

Hos

pita

lizat

ions

per

10

00

w hite men black men

asian men w hite w omen

black w omen asian w omen

Stroke Hospitalization Rates by Sex Race Income (age-adjusted) (other

race group excluded)

000

100

200

300

400

500

600

700

800

900

Low est In Q

Q2 Q3 Highest Inc Q Income quartile

Hos

pita

lizat

ions

per

10

00w hite men black men

asian men w hite w omen

black w omen asian w omen

Figure 4 Figure 5 Figure 6

47

MAURICE D HINCHEY 22NO DISTRICT NEW YORK

COMMITTEE ON APPROPRIATIONS

~lIC()foltIIInas

AGRICUJTlAE RURAL OEElOPWNT rooo AND ORUG ADMINISTRATION

AND RBATEO AGEfjC~S

INTERIOR

~onlJltss of tl)t ~nttn ~tatt5 ~oU5t of ItpttUtlltl1tibeg lHaJington 111laquo 20515-3222

October 4 2006

Howard Frumkin MD DrPH Director Agency for Toxic Substances and

Disease Registty and CDCs National Center for Environmental Health

1600 Clifton Road NE MSE-28 Atlanta GA 30333

Dear Dr Frumkin

First I would like to congratulate you on your appoinbnent as Director of A TSDR and CDCs National Center for Environmental Health llook forward to the opportunity to work with you on important public health issues this year and in the future

My second reason for writing today is to request that ATSDR assist my office in conducting a health consultation study for residents along a 200 mile segment of the Hudson River a Superfund site on the National Priority List containing PCBs in order to measure whether further evaluation may be warranted

As you may know Dr David Carpenter has done a number of studies at SUNY Albany in New York that suggest that resideilcy in proximity to hazardous waste sites with Persistant Organic Pollutants (POPs) may be associated with increased hospitalization rates for Coronary Heart Disease (CHD) when compared to hospitalization rates tor areas with no waste sites or other types of waste not catagorized as POPs I have enclosed a copy of three of these research papers that demonstrate this correlation for heart disease hypertension and stroke

I am very concerned about the health of the residents living along the Hudson River and respectfully request that your agency look into what seems to be a very serious health threat For purposes of my request T would appreciate your agency looking into the subset population Dr Carpenter examined in 78 zip codes along the 200 mile stretch of the Hudson River from Hudson Falls to New York City This subset population studied by Dr Carpenter showed a statistically significant increase in hospital discharges for heart disease hypertension and stroke Interestingly according to CDCs data base the Behavioral Risk Factor Surveillance System this population on average has higher incomes exercises more frequently and conswnes more fruits and vegetables than the average New Yorker Additionally this population has more nonsmokers than the rest of Upstate New Yorks population

WASHINGTON OIFCE

2411 RAYBURN HOUSE OfFICE BUILDING WASH1NcrroN DC 20amp1amp-3222

(Zl2) 225-6335

_fOMetoYht lnchIY

BlNGI-UMTON OfRCE m-wA OfFICE KIGSTCN OFFIepound MIDDLnWJN OffICE MOJmCEllO OffiCE IDDA IfOCML lUllDlNG 123 SOUTH CAYJOA smra U l WAll

m STREET em HALL THlflD flOOR 18 ANWANf LAKf CAD

RIIrlGHAtATON NY 13901 SUITE 201 KINGSTOfl 12401 te JUpoundS STilEET MONTlCfllO NY 12701 (6171773-2768 ITHACA HY 14860 (M5)3~ MlDgtlETOWN NY 109(0 (amptS) 7S1-71e

(607) 273-~388 (845) l4C-3l11

Appendix A Representative Hinchey Letter

Dr Howard Frumkin Page 2 October 4 2006

Since this type of population would typically exhibit better than average cardiac health Im sme you can understand and appreciate my concern and sense of urgency that we must follow up on Dr Carpenters good work and detcnnine what further study is warranted to protect the health of the populatioD living and working aloDg the Hudson River

I than1c you for yom kind attention to my request I look forward to hearing back from you at your earliest convenience

Best regards

Sincerely

~ane~ey Member of Congress

MHlag

49

Appendix B Table 1a IHD Hospitalization Rate Ratios agesexrace adjusted versus additionally adjusted For residents (age 25 and older) of block groups closer versus more distant from the Hudson River in models adjusting for sex age race and block group median household income population density education level percent Hispanic and block group centroid distance to nearest hospital 1990-2005

Adjusted Rate Ratios by Block Group River Proximity (95 Confidence Intervals) IHD (N=101012) AMI (N=35811) Angina (N=4731)

Risk Factor Agesexrace adjusted

Addrsquol adjustment Agesexrace adjusted

Addrsquol adjustment Agesexrace adjusted

Addrsquol adjustment

Block group distance from River

lt frac12 mile vs gt 1 mile 120 (103-140) 102 (098-106) 126 (109-145) 103 (098-109) 160 (137-188) 109 (098-121) frac12 - 1 mile vs gt 1 mile 120 (104-140) 103 (099-107) 125 (109-144) 106 (101-111) 156 (133-183) 113 (102-125)

Sex Male vs female 197 (174-223) 214 (208-220) 211 (187-238) 209 (202-215) 113 (099-129) 113 (106-121) Race Black vs white 120 (102-142) 093 (089-097) 109 (094-127) 092 (087-097) 178 (152-208) 121 (108-136)

AsianPacific Is ldquo 080 (066-096) 073 (069-079) 085 (070-103) 072 (065-081) 075 (055-102) 065 (049-087) Other ldquo 246 (208-291) 377 (363-392) 189 (162-220) 204 (191-218) 257 (215-307) 205 (177-238)

Age 35-44 vs 25-34 809 (630-104) 824 (743-913) 647 (494-849) 658 (568-762) 491 (348-692) 498 (376-660) 45-54 ldquo 333 (261-426) 344 (312-380) 228 (175-296) 230 (200-265) 1539 (1108-214) 157 (120-206) 55-64 ldquo 827 (647-1057) 802 (727-885) 576 (443-748) 516 (449-593) 278 (200-386) 284 (217-371) 65-74 ldquo 1477 (1155-1890) 1384 (1255-1527) 1067 (820-1387) 899 (782-1033) 409 (294-567) 449 (344-586) 75+ ldquo 2124 (1656-2724) 1942 (1760-2142) 2190 (1683-2849) 1849 (1610-2123) 661 (476-919) 695 (533-905)

Population density

2nd lowest vs least dense

126 (121-132) 131 (124-137) 142 (127-159)

more dense ldquo 130 (124-136) 134 (127-141) 131 (116-147) most dense ldquo 120 (114-126) 121 (114-129) 115 (100-131)

Median Household Income

2nd highest vs highest income

105 (101-110) 113 (108-119) 123 (110-137)

lower ldquo 109 (104-113) 123 (117-130) 125 (111-141) lowest ldquo 116 (111-122) 129 (122-137) 162 (142-185)

with less than college

2nd lowest vs least college

114 (109-118) 118 (113-124) 110 (098-122)

higher ldquo 122 (117-127) 126 (119-132) 115(102-129) highest

ldquo

122 (116-128) 128 (120-135) 129 (113-148)

Hispanic 2nd lowest vs least Hispanic

093 (089-097) 094 (0890-098) 101 (091-112)

higher ldquo 092 (088-096) 096 (092-101) 118 (107-131) highest

ldquo

085 (081-088) 088 (084-093) 138 (123-154)

Distance to Hospital

2nd closest vs closest 097 (094-100) 094 (090-098) 107 (098-118) more distant ldquo 090 (087-094) 093 (089-098) 096 (086-107) furthest ldquo 087 (083-091) 091 (087-096) 089 (079-101)

50

Appendix B Table 1b Stroke Hospitalization Rate Ratios agesexrace adjusted versus additionally adjusted For residents (age 25 and older) of block groups closer versus more distant from the Hudson River in models adjusting for sex age race and block group median household income population density education level percent Hispanic and block group centroid distance to nearest hospital 1990-2005

Adjusted Rate Ratios by Block Group River Proximity (95 Confidence Intervals) Total Stroke (N=53208) Ischemic Stroke (N=19371) Hemorrhagic Stroke (N=5995)

Risk Factor Agesexrace adjusted

Addrsquol adjustment Agesexrace adjusted

Addrsquol adjustment Agesexrace adjusted

Addrsquol adjustment

Block group distance from River

lt frac12 mile vs gt 1 mile 116 (104-131) 106 (101-110) 120 (105-137) 108 (101-115) 102 (088-117) 089 (081-099) frac12 - 1 mile vs gt 1 mile 118 (106-132) 106 (102-111) 127 (112-145) 111 (104-117) 115 (100-131) 108 (099-119)

Sex Male vs female 114 (104-125) 127 (123-130) 126 (113-141) 131 (126-136) 114 (103-127) 123 (116-130) Race Black vs white 194 (173-219) 153 (147-160) 233 (204-267) 191 (180-203) 215 (188-246) 190 (173-209)

AsianPacific Isl ldquo 097 (083-113) 088 (080-096) 114 (094-138) 101 (088-116) 142 (114-177) 129 (106-157) Other ldquo 230 (203-260) 232 (219-245) 229 (197-265) 212 (193-232) 360 (310-417) 334 (296-377)

Age 35-44 vs 25-34 309 (251-379) 303 (269-343) 359 (267-483) 362 (289-455) 274 (212-353) 268 (219-329) 45-54 ldquo 114 (938-139) 1087 (970-1217) 149 (113-197) 135 (109-167) 655 (513-836) 605 (500-734) 55-64 ldquo 331 (273-402) 329 (295-368) 433 (329-569) 418 (340-514) 119 (933-152) 109 (904-132) 65-74 ldquo 826 (682-1001 861 (772-959) 1145 (872-1504) 1114 (908-1367) 214 (168-272) 224 (186-270) 75+ ldquo 1890 (1559-2290) 2037 (1828-2269) 2768 (2112-3627) 3060 (2497-3750) 554 (438-700) 587 (490-703)

Population density

2nd lowest vs least dense

126 (120-132) 124 (117-132) 129 (117-142)

more dense ldquo 126 (120-132) 121 (113-129) 142 (128-157) most dense ldquo 124 (118-132) 128 (119-138) 150 (133-169)

Median Household Income

2nd highest vs highest income

106 (101-110) 110 (103-117) 099 (090-108)

lower ldquo 113 (108-118) 113 (106-121) 107 (097-118) lowest ldquo 116 (110-122) 117 (108-126) 114 (102-127)

with less than college

2nd lowest vs least college

111 (107-116) 112 (106-119) 098 (090-107)

higher ldquo 120 (114-125) 120 (113-128) 102 (093-112) highest ldquo 116 (110-122) 119 (110-128) 093 (090-108)

Hispanic 2nd lowest vs least Hispanic

096 (092-100) 094 (088-099) 084 (077-091)

higher ldquo 101 (096-105) 096 (091-102) 084 (077-091) highest ldquo 096 (092-100) 088 (082-093) 078 (070-085)

Distance to Hospital

2nd closest vs closest 095 (091-098) 092 (087-097) 097 (089-105) more distant ldquo 092 (088-096) 091 (086-097) 089 (081-097) furthest ldquo 085 (081-090) 082 (077-088) 088 (079-098)

51

Appendix B Table 2 Sex and Race-Specific Age-Adjusted IHD AMI and Stroke Rates by Income (per 1000)

Men Women white black asian other white black asian other

IHD Lowest IncQ 470 401 492 940 283 354 477 543 Q2 425 339 675 1443 221 236 364 690 Q3 436 301 420 2463 199 233 236 902 Highest Inc Q 399 325 218 4307 160 172 120 1247 AMI Lowest IncQ 180 154 203 269 101 112 221 159 Q2 159 141 300 306 085 088 138 179 Q3 152 114 139 480 078 084 096 214 Highest Inc Q 135 130 046 820 062 065 052 275 Stroke Lowest IncQ 201 292 330 370 166 281 343 261 Q2 182 254 389 466 150 247 274 373 Q3 192 238 231 598 145 245 201 401 Highest Inc Q 177 275 102 905 132 197 070 776

These rates are estimated from the 50 sample of hospitalizations in the 12-county study area included in this investigation The rates are adjusted to the 2000 standard US population

52

CERTIFICATION

The health consultation for the Hudson River PCBs site Residential Proximity to the Hudson River and Hospitalization Rates for Ischemic Heart Disease and Stroke 1990shy2005 was prepared by the New York State Department of Health under a cooperative agreement with the Agency for Toxic Substances and Disease Registry (ATSDR) It is in accordance with approved methodology and procedures existing at the time the health consultation was initiated Editorial review was completed by the cooperative agreement partner

The Division of Health Assessment and Consultation (DHAC) ATSDR has reviewed this public health assessment and concurs with its findings

ATSDR

Page 4: HR CVD Final HCl 2011

For additional information about this document you may contact the

New York State Department of Health Center for Environmental Health

547 River Street Flanigan Square Room 300 Troy New York 12180-2216

1-518-402-7950 or

1-800-458-1158

E-mail beoehealthstatenyus

Table of Contents

Executive Summary helliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphellipii

1 Introduction helliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphellip 1

2 Background helliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphellip 3 Cardiovascular Disease Hudson River PCB Contamination

` POPs and PCBs PCBs and Cardiovascular Disease Health Outcomes

3 Follow-up Study Data Sources and Methodshelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphellip8 Study Area and Exposure Areas

Hospitalization Data Statistical Analysis

4 Results helliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphellip 13 Demographics Associations with Proximity to the River

Statistical Analysis Proximity to the River and CVD Hospitalization Risk Summary of Results

5 Discussion helliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphellip 19

6 Conclusion and Recommendations helliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphellip25

7 Agency Information (Preparers of the Report) helliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphellip 27

8 References helliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphellip 28

9 Tables helliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphellip 33

10 Figures helliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphellip 44

11 Appendices helliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphellip 48

EXECUTIVE SUMMARY

Background In October 2006 Member of Congress Maurice Hinchey requested that ATSDR conduct a health consultation to assess cardiovascular disease among people residing near the Hudson River A 200-mile segment of the Hudson River from Hudson Falls south to New York City is a National Priorities List (NPL) (Superfund) site due to contamination of the river with polychlorinated biphenyls (PCBs) from GE sites in Hudson Falls and Ft Edward In support of his request Member of Congress Hinchey cited three studies of cardiovascular hospitalizations conducted with similar data and methods These 2004-2005 statewide studies suggest residents of ZIP codes containing hazardous waste sites with persistent organic pollutants (POPs) such as PCBs have elevated risk for hospitalizations for cardiovascular disease including ischemic heart disease stroke and hypertension Analyses conducted separately for ZIP codes next to the Hudson River suggest residents of these specific ZIP codes also have increased rates of hospitalization for cardiovascular disease

In response to Congressman Hincheyrsquos request for follow-up of the 2004-2005 findings for the Hudson River area ATSDR and NYS DOH agreed to conduct a similar group-level study but at a different level of geography the Census block group By using a level of geography that is generally smaller in area and population than a ZIP code this follow-up seeks to more accurately adjust for socio-economic and other factors that affect cardiovascular hospitalizations and to improve upon the geographic classification of relative distance to the Hudson River The follow-up investigationrsquos analyses are restricted to the twelve counties abutting the Hudson River from Washington County south to Westchester County Within the study arearsquos twelve counties cardiovascular hospitalization rates in areas closest to the Hudson River are compared with rates in areas farther from the river

Objectives The follow-up investigationrsquos specific objectives are to increase the studyrsquos ability to control for confounding by socio-economic status through the

use of block groups rather than ZIP codes as the unit of analysis compare hospitalization rates among residents of block groups nearest the Hudson River to

those among residents of block groups farther from the Hudson River and compare the findings of this review to those of the earlier studies

Methods This study examines CVD hospitalization rates and residence near the Hudson River in the 12 counties abutting the Hudson River below Hudson Falls from 1990 through 2005 The hospitalization data for ischemic heart disease and stroke were acquired from the NYS DOH Statewide Planning and Research Cooperative System (SPARCS) established in 1979 to collect detailed records on discharges from hospitals located in New York State Statewide US Census data at the block group level from 1990 and 2000 were used to calculate the number of people in each race sex and age group for the study years

Classification of the population into potential exposure categories was based exclusively on location relative to the Hudson River Two types of exposure proxy categories were developed block group adjacency and block group centroid distance to the river shore For the adjacency categories the hospitalizations were assigned to the adjacent category if the residential address was in a block group directly abutting the Hudson River For the distance categories block

ii

group centroid (population-weighted) distance to the river shore was used to assign hospitalizations in each block group to one of three categories less than one-half mile from the Hudson River (closest) between one-half to one mile (close) from the river and farther than one mile from the river (distant) Multivariable regression analyses were conducted to compare rates of hospitalizations among the exposure categories assigned at the block group level These analyses controlled for age sex and race using individual-level data and population density percent Hispanic median income education and distance to the nearest hospital at the block group level

Results The use of block groups resulted in 2371 block groups as the units of analysis for the study rather than the 368 ZIP codes in the 12-County study area The average population per block group in the study area is 1200 compared to average population of 8000 per ZIP code The smaller geographical area and smaller populations in block groups contributed to improving the accuracy of the assignment of income and other group-level demographics to the hospitalizations included in the study and enhancing control for confounding by socio-economic status (Tables 8a and 8b)

The studyrsquos multivariable analyses showed relatively small but statistically significant elevations of CVD hospitalization risk for residents of block groups categorized as adjacent in close and closest proximity to the Hudson River There is no evidence that residents of the closest block groups (less than frac12 mile) had higher hospitalization rates than residents of the close block groups (between frac12 to 1 mile) If there were a strong association between exposures from living near the river and the risk of CVD hospitalization such a ldquodose-responserdquo might have been shown by the analyses

The data showed strong statistically significant associations between CVD risk and socioshyeconomic status To enhance the control for socio-economic status separate analyses for each income quartile were conducted These stratified analyses showed that residence in block groups near (close closest or adjacent to) the Hudson River was associated with increased risk for CVD hospitalizations among residents of lower income block groups Conversely residence in block groups near the Hudson River was associated with decreased risk for CVD hospitalizations among residents of higher income block groups In other words in analyses that held income constant by looking only within the lower or higher income populations living near the river shows opposite effects depending upon which income group is considered The estimated increased and decreased risks are of sufficient magnitude to cancel each other out when average rate ratios for the four income quartiles are calculated The average rate ratios can be interpreted as showing no evidence that hospitalizations among the general population are elevated for residents living near the river

This type of group-level statistical analysis is not able to precisely identify the reason for these differing results However the strength and direction of associations among variables in the statistical models can be evaluated in light of known risk factors to suggest likely explanations In this case socio-economic status appears to be the most likely explanation for the findings The strong association between socio-economic status and residential location (the indicator of potential exposure) and between socio-economic status and CVD hospitalization risk makes it likely that the small but statistically significant elevations shown in the multivariate models

iii

(Tables 8a amp 8b) are the result of the inability to completely control for socio-economic statusrsquo effects on CVD hospitalization risk The strong effect of socio-economic status on residential location (exposure proxy) and on CVD hospitalization risk makes it likely that these overall multivariable regression results are affected by uncontrolled ie residual confounding by socioshyeconomic status

Because of the similarities in study design and statistical analysis the prior studyrsquos findings can be compared to some degree with the findings of the multivariate regression analyses in the current study The major difference between the studies is the composition of the comparison areas for the analyses Because of this difference as well as other differences described in the report the studies are not entirely comparable

CVD hospitalization rate ratios estimated for living in block groups adjacent closest or close to the river in the current study are substantially lower than the rate ratios for residence in ZIP codes adjacent to the Hudson River in the previously published studies In most cases the current analyses show elevations below 10 in contrast to elevations of 20 to 39 in the prior studies The results of the current study are quite similar to the prior study results in terms of the associations reported for age sex race and income These similarities and the general methodological similarities between the current and prior studies suggest the likelihood that the prior studiesrsquo findings also were affected by confounding by socio-economic status

Conclusions and Recommendations While this type of study can neither prove nor disprove causal links among risk factors and health effects it is useful for examining evidence for unusual patterns that might warrant additional investigation This evaluation of hospitalization data residential location and socio-economic factors in block groups in the 12 Counties abutting the Hudson River revealed a closely coinciding pattern of lower socio-economic status and residential proximity to the Hudson River the proxy for potential exposure and a strong association of lower socio-economic status and risk for CVD hospitalizations The strong associations among residential location socio-economic status and CVD risk prevent us from drawing definitive conclusions from the multivariable regression analyses that showed modest but statistically significant elevations of CVD hospitalization risk for residents of block groups near the river

To draw more definitive conclusions particularly for outcomes such as CVD that are determined by multiple known risk factors studies seeking to investigate the role of PCB or POP exposures will require resource-intensive methods that include gathering individual-level exposure or biomonitoring information as well as medical histories and information on other risk factors

This current investigation did not include environmental or biological sampling data One study did gather such data in the Hudson River region with the highest potential exposures (Hudson Falls Fort Edward) This study detected a statistically significant reduction in the total PCB concentrations in outdoor air samples at a distance of 1200 meters (34 mile) from the river However the PCB levels in outdoor air in this area were lower than those in other communities with known PCB-contaminated sites and similar to levels reported in other locations in the northeastern United States The study found no detectable differences in study subjectsrsquo serum

iv

PCB levels associated with proximity to contaminated portions of the river or other PCB sources (Fitzgerald 2007 Palmer 2008)

It is recommended that the results of the current investigation of CVD hospitalization rates and residence near the Hudson River be shared with NYS residents and other stakeholders with interest in this issue and be submitted for publication in a peer-reviewed journal

v

1 INTRODUCTION

The New York State Department of Health (NYS DOH) has a cooperative agreement with the Agency for Toxic Substances and Disease Registry (ATSDR) to perform environmental exposure and health assessments conduct health statistics reviews and perform epidemiological studies of populations in New York State which may have been exposed to environmental contaminants In October 2006 Member of Congress Maurice Hinchey requested that ATSDR conduct a health consultation to assess cardiovascular disease among people residing near the Hudson River A 200-mile segment of the Hudson River from Hudson Falls south to New York City is a National Priority List (NPL) (Superfund) site due to contamination of the river with polychlorinated biphenyls (PCBs) from GE sites in Hudson Falls and Ft Edward (See Appendix A for Member of Congress Hincheyrsquos letter)

In support of his request Member of Congress Hinchey cited three studies (Sergeev and Carpenter 2005 [ischemic heart disease] Shcherbatykh et al 2005 [stroke] Huang et al 2006 [hypertension]) These statewide studies suggest residents of ZIP codes containing hazardous waste sites with persistent organic pollutants (POPs) such as PCBs have elevated risk for hospitalizations for cardiovascular disease including ischemic heart disease stroke and hypertension Analyses conducted separately for ZIP codes next to the Hudson River suggest residents of these specific ZIP codes also have increased rates of hospitalization for cardiovascular disease

These studies published in 2005-2006 are ecological studies In an ecological study data are analyzed by evaluating risk factors and health outcomes among groups For example rates of disease are compared for populations of ZIP codes with varying median income levels to see if income levels appear to be related to disease outcome rates In these previously conducted studies rates of hospitalization for residents of ZIP codes with hazardous waste sites containing POPs were compared with hospitalization rates for residents of ZIP codes without such waste sites to look for evidence that potential exposures might be impacting health outcomes The studies report statistically significant elevations of cardiovascular disease hospitalizations in ZIP codes with POPs For coronary heart disease and a subset acute myocardial infarction the study showed elevations of approximately 15 and 20 respectively (Sergeev and Carpenter 2005) for stroke a 15 elevation (Shcherbatykh et al 2005) and for hypertension a 19 elevation (Huang et al 2006) in ZIP codes containing waste sites with POPs compared with hospitalization rates in other New York State ZIP codes (excluding NYC) that did not contain waste sites with POPs (Table 1)

In sub-analyses in these studies cardiovascular hospitalization rates for ZIP codes adjacent to the Hudson River were compared with cardiovascular hospitalization rates for ZIP codes in the rest of the state (excluding NYC) identified as not having inactive hazardous waste sites with POPs These sub-analyses showed statistically significantly elevated hospitalization rates of 36 and 39 for coronary heart disease and the subset acute myocardial infarction a 20 elevation for stroke and a 14 elevation for hypertension (Table 1) The Sergeev and Carpenter Shcherbatykh et al and Huang et al articles state that the Hudson River ZIP codes are in an area where people have higher income smoke less exercise more and have healthier diets This information is cited in these articles as evidence that known risk factors related to lower socioshy

1

economic status are likely not the cause for the elevated hospitalization rates in these Hudson River area ZIP codes

In these previously published reports Behavioral Risk Factor Surveillance System (BRFSS) data for entire counties are provided to support statements about the population of Hudson River ZIP codes having higher income and healthier lifestyles However because the ZIP codes adjacent to the Hudson River include only a proportion of the population of the counties abutting the river these county-level BRFSS data may be misleading regarding characteristics of the ZIP codes adjacent to the river (Figure 1) In fact data from the US Census show that the ZIP codes adjacent to the river have lower median income than the counties within which they are located The median income for the total population of the twelve counties adjacent to the Hudson River was $61010 in 2000 while the median income for the ZIP codes adjacent to the river was $45356 This median income for the population of the ZIP codes adjacent to the river is also slightly lower than the statewide (excluding NYC) median income of $47735 (US Census 2000) Thus the Hudson River ZIP codes analyzed in the Sergeev and Carpenter Shcherbatykh et al and Huang et al studies include a population of lower not higher socio-economic status compared to the rest of NYS (excluding NYC)

Because CVD and many CVD risk factors are associated with lower socio-economic status it is especially important to adjust as much as possible for socio-economic status when conducting comparative analyses of CVD hospitalizations or disease incidence or prevalence Regarding the hypothesis about POPs and CVD research studies described in more detail below have suggested possible biological mechanisms for POP exposuresrsquo effects on the heart There is very little research on human exposures to POPs and potential cardiovascular effects however

In response to Congressman Hincheyrsquos request for follow-up of the 2004-2005 findings ATSDR and NYS DOH agreed to conduct a similar study also ecological but at a different level of geography the Census block group By using a level of geography that is generally smaller in area and population than a ZIP code this follow-up seeks to more accurately adjust for socioshyeconomic and other factors that affect cardiovascular hospitalizations and to improve upon the geographic classification of relative distance to the Hudson River In addition rather than comparing hospitalizations among residents of Hudson River area ZIP codes to hospitalizations among residents of all other NYS (exclusive of NYC) ZIP codes with no waste sites with POPs this follow-up investigationrsquos analyses are restricted to the twelve counties abutting the Hudson river from Washington County south to Westchester County Within the study arearsquos twelve counties cardiovascular hospitalization rates in areas closest to the Hudson River are compared with rates in areas farther from the river This more localized approach may also assist with reducing the potential impact of regional differences in medical care practice or reporting factors that could affect the hospitalization data (Ko et al 2007) (Table 2 shows a more detailed listing of the methods used in the prior studies and follow-up study)

Regarding environmental exposures it is important to note that living near a hazardous waste site that contains PCBs or other POPs does not necessarily mean that human exposure from the site has occurred Depending upon the conditions at a site or the specific chemicals exposures may be highly improbable Information on body burden for example PCB levels in blood or on levels of the chemical in off-site air dust or soil is needed to support a claim of exposure The

2

studies published in 2005-2006 did not provide specific information such as environmental or biomonitoring data in support of the assumption that residence in ZIP codes containing hazardous waste sites with POPs or residence in ZIP codes adjacent to the Hudson River resulted in increased exposures to POPs in the population For the purpose of this follow-up we did not conduct environmental or human biological sampling to address this issue Rather this follow-up replicates the prior studiesrsquo approach with some enhancements to better control for confounding particularly confounding by socio-economic status

This descriptive epidemiologic investigation uses already existing hospitalization data from the Statewide Planning and Research Cooperative System and population data from the United States Census to evaluate comparative rates of cardiovascular hospitalizations While this type of investigation can not show proof of cause and effect it can assist in interpreting relationships among indicators of CVD burden and hypothesized exposures associated with distance from the Hudson River This descriptive investigation may also assist in determining whether additional health investigations of these associations are warranted

2 BACKGROUND

Cardiovascular Disease

The widely known risk factors for cardiovascular disease (CVD) in addition to age sex race and family history include tobacco use high blood pressure high serum cholesterol alcohol use obesity low fruit and vegetable intake physical inactivity diabetes low socio-economic status mental ill-health psychosocial stress and the use of certain medications (WHO 2010 NYS DOH 2009 Wing 1988 NYS DOH 2009a)

CVD Definitions CVD is a general term for any disease of the circulatory system The two main types of CVD are ischemic heart disease also termed coronary heart disease and cerebrovascular disease also called stroke Ischemic heart disease (IHD) results from restricted blood flow through the arteries supplying the heart muscle The most serious risk from IHD is acute myocardial infarction (AMI) another term for heart attack AMI occurs when blood flow to the heart is suddenly reduced or stopped usually due to a blockage in a coronary artery Angina is severe constricting pain in the chest often radiating from the area immediately over the heart to the left shoulder and down the arm Cerebrovascular disease refers primarily to stroke the sudden impairment of brain function resulting from interruption of circulation to the brain Ischemic stroke results from blockage of an artery supplying blood to the brain Hemorrhagic stroke occurs due to the escape of blood from a ruptured artery supplying blood to the brain

CVD Burden and Trends CVD is the leading cause of death for men and women of all ages in the United States and in New York State In 1999 for every death from IHD there were five hospitalizations for ischemic disease in NYS (NYS DOH 2009a) Mortality rates from CVD have been declining steadily since the 1980rsquos Research studies estimate that approximately half of the decline is due to improved treatments for CVD and its risk factors such as high cholesterol and high blood pressure and approximately half is due to decreasing incidence of CVD The decrease in incidence is due to trends in health behaviors primarily reduced smoking rates

3

CVD mortality rates do not appear to be declining for all segments of the population however and may actually be increasing among some groups particularly people of lower socio-economic status (Barnett et al 1999) This may be due in part to higher rates of smoking among people of lower socio-economic status as well as to other factors related to socio-economic status and race A growing literature points to multiple causes for CVD and other health disparities including health practices psychosocial stress limited resources discrimination and access to health care (Health United States 2007) Increases in the prevalence of diabetes and obesity in the general population partially offset advances in treatment and favorable trends for other risk factors An increasing ischemic heart disease mortality rate in women ages 35 to 54 since 1997 is attributed to these unfavorable risk factor trends (Ford 2007 Lloyd-Jones 2009) New York State (NYS) and other northeast states have high IHD and low stroke mortality with NYS having the highest IHD mortality and lowest stroke mortality among the 50 states (Howard et al 2009)

CVD Data Sources CVD mortality rates are more easily measured than incidence prevalence or hospitalizations because mortality data from vital records are a comprehensive and relatively accessible data source Information for estimating the incidence and prevalence of CVD outcomes comes primarily from health interview physical examination and record surveys Hospitalization data are available for NYS and the US but are generally collected and analyzed in order to track trends in hospital care and resources expended Actual CVD incidence data validated by physician diagnosis physician examination or reviews of medical records are not directly available for the entire US or for specific geographic areas of NYS Such information is available from population-based cohort studies that follow people over time in selected communities While the concern underlying this research project is to understand CVD levels in the community ie incidence and prevalence we do not have data that directly provide us with this information In the absence of such data hospitalizations are frequently used as indicators of the burden of disease in communities particularly in surveillance or hypothesis-generating investigations such as this one In NYS the Behavioral Risk Factor Surveillance System (BRFSS) an ongoing monthly telephone survey of adults aged 18 years and older provides information about the burden of CVD by age sex race income and education

CVD Risk Factor Data The BRFSS data for NYS show that over 20 of the population aged 65 and older reported having been diagnosed with CVD versus less than five percent of the population under 65 Males were slightly more likely to report having CVD than females Nonshyhispanic whites compared to other ethnic groups combined were slightly more likely to report having CVD None of these slight differences were statistically significant however The only statistically significant demographic risk factors in the BRFSS data were low income and education levels Statistically significant elevations of reported CVD were observed for respondents with incomes below $25000 and respondents with a high school degree or less education (NYS DOH 2009a)

Another factor that can affect the outcome measure hospitalizations is the likelihood of mortality prior to reaching the hospital Studies show a variety of factors affect whether an individual experiencing a heart attack or stroke outside the hospital survives to be counted in hospitalization data These factors include lack of knowledge about personal risk or prior disease socio-economic status race marital status and distance to the hospital (Galea 2007

4

Barnett 2006 Ayala 2003) Population-based studies report sudden cardiac death usually outside the hospital to be the first overt manifestation of heart disease for 40 to 60 of all cases ( Kannel 1987 deVreede-Swagemakers 1997 UK Heart Attack Study 1998) An analysis of out-of-hospital mortality from AMI in Pennsylvania in 1998 showed that in relatively rural areas where travel time to the hospital was 25 minutes or longer 72 of all AMI deaths occurred outside the hospital compared with a statewide average of 49 of AMI deaths occurring outside of the hospital (OrsquoNeill 2003) In New York State 55 of IHD deaths occur outside the hospital compared to 35 of stroke deaths (NYS DOH 2009b)

Hudson River PCB Contamination

During an approximate 30-year period ending in 1977 the General Electric (GE) plants in Hudson Falls and Fort Edward used PCBs in the manufacture of electrical capacitors and discharged large quantities of PCBs into the upper Hudson River Estimates of the total direct discharges to the river are as high as 1330000 pounds In 1983 the US EPA classified a 200shymile section of the Hudson River from Hudson Falls to New York City as a National Priority List site making it one of the largest Superfund sites in the United States In 2002 the US EPA issued a Record of Decision calling for targeted environmental dredging and removal of PCB-contaminated sediment in the upper River (USEPA 2002) The dredging began in May 2009

The Superfund environmental investigations and planned remedial actions have focused on the areas of the river near Hudson Falls and Fort Edward where the highest levels of PCBs in sediments are located The 2002 Record of Decision provides information about the human health risk assessment conducted for the Upper and Mid-Hudson and the ecological risk assessment conducted for the Upper and Lower Hudson River (USEPA 2002) Fish contamination and fish consumption are the primary concerns at the site

In 1976 fishing was banned in the upper river and in 1995 a catch and release policy replaced the ban NYS DOH continues to recommend people eat no fish from the Upper Hudson River that children under age 15 and women of childbearing age eat no fish from the entire 200 mile stretch of river below Hudson Falls and that the general population eat none of most species of fish caught between the federal dam at Troy and Catskill approximately 40 miles south of Troy The Lower Hudson remains closed to commercial fishing for striped bass and eight other species

The fishing bans and advisories were issued because ingestion of fish is the major potential pathway for exposure for the general population Many previous studies have shown relationships between consumption of fish from contaminated waters and human PCB levels Another potential pathway for exposure less frequently studied to date is inhalation of PCBs NYS DOH and ATSDR launched ldquoThe Hudson River Communities Projectrdquo in 2000 to address this issue (More information on this project is provided below)

POPS and PCBs

The category ldquopersistent organic pollutantsrdquo (POPs) includes man-made chemicals that are by definition persistent in the environment Examples of POPs are polychlorinated biphenyls

5

(PCBs) polychlorinated dibenzo-p-dioxins (PCDDs) polychorinated dibenso-furans (PCDFs) and pprsquo-diphenyldichloroethene (DDE) a breakdown product of the pesticide dichlorodiphenyltrichloroethane (DDT) These chemicals are stored in human fat tissues accumulating over an individualrsquos lifetime They also accumulate in fish and other species thereby concentrating up the food chain Levels of most POPs measured in blood serum of Americans have decreased in the last few decades An exception is polybrominated diphenyl ethers (PBDEs) levels of which have been increasing in recent years While levels of many POPs are declining a large number continue to be detectable in blood for most of the US population (USDHHS 2009)

This follow-up investigation focuses on the Hudson River where the POPs of concern are PCBs PCBs are a group of synthetic chlorinated compounds manufactured in the US through 1977 Production was halted due to concerns about the persistence of PCBs in the environment and the toxicity of PCBs shown in animal models PCBs were used primarily by the electrical industry for capacitors and transformers and were also used in hydraulic fluids fluorescent light fixtures flame retardants inks adhesives carbonless copy paper paints pesticide extenders plasticizers and wire insulators PCBs can be oily liquids or solids and some are volatile and can exist as vapor in air PCBs in the environment are generally mixtures of the many different types known as congeners (ATSDR 2000)

The variety of types of PCBs adds to the complexity of studying adverse animal and human health effects Increasingly PCB research focuses on specific PCB congeners and groups of congeners because specific types of congeners appear to have different mechanisms of action and toxicity Dioxin-like PCBs (DLPCBs) are an important subset of PCBs due to concerns about their toxicity The focus on dioxin-like PCBs is paralleled by increasing research on the group of POPs with dioxin-like properties This type of PCB and POP research assigns toxic potency values relative to dioxin to various PCBs and POPs to provide an overall toxic equivalency value to specific PCB and other POPs

From 2002 through 2007 NYS DOH and SUNY Albany School of Public Health researchers conducted the Hudson River Communities Study among residents of Hudson Falls and Fort Edward the villages where the PCB discharges from two GE electrical capacitor plants occurred The comparison community was upriver from the discharges in Glens Falls This studyrsquos evaluation of exposures included measuring PCBs in outdoor air and in blood serum The study found that PCB levels in outdoor air were statistically significantly higher in the Hudson FallsFort Edward area (072 nanograms per cubic meter) than in the upriver Glens Falls area (040 nanograms per cubic meter) but that the PCB concentrations in the Hudson FallsFort Edward area were lower than those in other communities with known PCB-contaminated sites and similar to levels reported in other locations in the northeastern United States A statistically significant reduction in the total PCB concentrations in outdoor air samples was observed at a distance of 1200 meters (34 mile) from the river from 0760 ngm3 to 0497 ngm3 (Fitzgerald et al 2007 Palmer et al 2008)

The average PCB concentrations in serum for the study and comparison populations did not differ significantly (307 ppb wet weight versus 323 ppb) The average toxic equivalency quantities (TEQ) for dioxin-like PCB congeners for the study and comparison area populations

6

also did not differ In addition the study found no detectable differences in study subjectsrsquo serum PCB levels associated with proximity to contaminated portions of the river or other PCB sources The study did find that lifetime consumption of Hudson River fish was associated with higher serum PCB levels (Fitzgerald 2007 Palmer 2008) Another recent biomonitoring study of Brooklyn Manhattan and New Jersey fisherman who consumed versus those who did not consume their catch from the lower Hudson River and New YorkNew Jersey harbor suggests eating fish from these waters is not associated with an increased body burden of PCBs or most other organochlorines This study did not address the inhalation pathway (Morland 2008)

PCBs and Cardiovascular Disease Outcomes

Research studies on adverse health effects associated with PCB exposure have focused on many types of outcomes organs and biological systems including cancer liver reproductive and developmental effects thyroid endocrine immune system and neurological effects as well as specific effects on the skin and eyes Relatively little research has focused to date on associations between PCB exposures and CVD outcomes The majority of human studies have assessed health effects in occupational groups having relatively high-level exposures among people consuming fish from contaminated waters or people exposed to relatively high levels due to accidental contamination of food (USDHHS NIH 2010)

PCBs have been shown to produce a variety of toxic effects in animals These effects include cancer allergies hypersensitivity damage to the central and peripheral nervous systems reproductive disorders and disruption of the immune system (ATSDR 2000) In contrast research to date does not tell us whether specific types of human health effects have occurred from the low-level exposures experienced by the general population Some human epidemiological studies of relatively low levels of PCDDs PCDFs and PCBs suggest possible relationships not only with neurodevelopment of infants but also diabetes and a condition called insulin resistance conditions that increase CVD risk (Longnecker 2001 Carpenter 2006 Arisawa 2005)

In 1998 a study of a population in Seveso Italy exposed to relatively pure TCDD (dioxin) from an industrial accident showed increased mortality from CVD respiratory diseases and diabetes (Bertazzi 1998) These findings suggested the possibility that other dioxin-like chemicals such as dioxin-like PCBs might increase risk for these diseases Some more recent animal studies described below suggest possible mechanisms of action for PCB effects that increase risk for CVD diabetes and hypertension Another relevant source of information is from research using the biological monitoring data from the National Health and Nutrition Examination Survey (NHANES) project to look at the association between PCB levels in blood and self-reported diabetes and heart disease The body of evidence based on analytical human epidemiological studies remains relatively sparse however

Some recent studies using NHANES biomonitoring data show associations between PCB levels and CVD (Lee 2006 Lee 2007 Ha 2007 Everett 2008) These studies are cross-sectional (comparisons at one point in time) evaluations of PCB levels measured in blood samples from individuals and self-reported CVD and diabetes a risk factor for CVD While these studies use individual-level biomonitoring and self-reported health outcome data they are considered to be

7

relatively weak in terms of providing evidence of causality because both exposure and outcome are measured at the same point in time Given the nature of NHANES data collection and reporting practices these studies are not able to control for possible geographical confounding which may occur if urban versus rural residence is associated with higher PCB levels and higher risk for CVD for example The higher PCB levels among African Americans in these data suggest urban residence may be playing a role in the PCB data

The results of animal studies suggest multiple mechanisms by which PCB exposures could affect cardiovascular risk as well as multiple types of adverse health outcomes One hypothesized mechanisms of action specifically for dioxin-like PCBs is that they may have anti-estrogenic effects which impact CVD risk factors such as serum cholesterol levels and blood pressure (Lind 2004) Several studies suggest PCBs interact with a specific receptor (AhR) and increase cellular oxidative stress causing inflammation cell injury and cardiovascular dysfunction (Hennig 2002) Similarly some studies suggest PCB-induced inflammation of fat cells leads to obesity and increased risk for coronary artery disease (atherosclerosis) (Arsenescu 2008)

3 FOLLOW-UP STUDY DATA SOURCES AND METHODS

This follow-up investigation will provide perspective on the 2004-2005 findings (Sergeev and Carpenter 2005 Shcherbatykh et al 2005) by producing results from analyses conducted at a more detailed level of geography the block group rather than ZIP code in order to better control for confounding When the term confounding is used by epidemioogists it refers to the problem that can arise when there is a factor such as income level that is not the factor of interest in the study but that is related to the risk factor being studied (residential location ndash proxy for exposure) and to the disease being studied (CVD) This type of factor is labeled a ldquoconfounderrdquo because it can confound ie interfere with or confuse the evaluation of associations among the variables of interest in the data All types of epidemiological studies face issues of confounding but ecological ie group-level analyses face additional challenges related to potential confounding because analyses can not account for some important individual differences within the groups that are the units of analysis (Morganstern 1998b Hertz-Picciotto 1998) This investigation was able to control for each individualrsquos age sex and race but not for individual-level risk factors such as income or education level

The follow-up investigationrsquos specific objectives are to

improve the ability to control for confounding by socio-economic status through the use of block groups rather than ZIP codes as the unit of analysis

compare the hospitalization rates among residents of block groups nearest the Hudson River to those among residents of block groups farther from the Hudson River

compare the findings of this follow-up study to those of the earlier studies

Study Area and Exposure Areas

The overall study area is comprised of the 12 counties with geographic areas abutting the Hudson River south of the source of PCB contamination from the General Electric facility in Hudson Falls The 12 counties are Westchester Rockland Putnam Orange Dutchess Ulster

8

Columbia Greene Rensselaer Albany Washington and Saratoga US Census boundaries are used to define the study area because census population estimates provide the data for the study area populationrsquos size and the populationrsquos age sex and race distribution Because the study period spanned 1990 through 2005 and populations change over time 1990 and 2000 Census data were used to linearly interpolate the between-year populations from 1991 through 1999 for each block group 2000 data were used without extrapolation to estimate block group populations from 2001 through 2005

Because block group boundaries change between the Census years 1990 block populations were normalized to fit 2000 block group boundaries before the estimation of sex age and race group specific populations In the digital map 2000 block group boundaries were overlaid on the 1990 block centroids to assign 1990 block population counts to the corresponding 2000 block group boundaries Then 1990 block populations were summed to fit the block group boundaries Next population counts by age group (0-14 15-24 25-34 35-44 45-54 55-64 65-74 75+) sex (male female) and race (white black asian other) were estimated for each year at the block group level This investigation includes all ages over 25 years and the study population is divided into four race categories white black or African-American Asian and ldquoOtherrdquo The Pacific Island group was combined with the Asian group and American Indian ldquootherrdquo and ldquomultiple racesrdquo categories are combined into a larger ldquootherrdquo category

Classification of the population into potential exposure categories for this study is based exclusively on location relative to the Hudson River Two types of exposure proxy categories were developed block group adjacency and block group centroid distance to the river shore For the adjacency categories block groups were assigned to the adjacent category if they directly abutted the Hudson River and all other block groups in the 12 counties were assigned to the nonshyadjacent category

For the distance categories a central point (centroid) within each block group was identified and used to assign block group centroid distance to the river shore The block group centroids were population-weighted which means the geographic distribution of households within 2000 census blocks were used to locate block group centroids based on the geographic population distribution By locating a population-weighted block centroid rather than a more straightforward distance centroid the assignment of block group distance to the river shore more accurately reflects distance to the river for the majority of block group residents Buffers of one-half mile and one mile from the Hudson River boundary were created for the assignment of each block group into one of the three distance categories less than frac12 mile (closest) frac12 to one mile (close) and farther than one mile from the river (distant)

Distance categories were selected a priori at the start of the study Because environmental sampling data were not collected along the length of the river included in this investigation we are not able to determine whether these distance categories represent actual differences in exposures As discussed previously a statistically significant reduction in the total PCB concentrations in outdoor air samples was observed at a distance of 1200 meters (34 mile) from the river from 0760 ngm3 to 0497 ngm3 in a study conducted in the Hudson Falls Fort Edward area near the source of PCBs (Fitzgerald 2007 Palmer 2007)

9

For the adjacency categories of the 2371 block groups in the study area 255 (11) were classified as adjacent to the Hudson River and 2116 (89) were classified as non-adjacent (Figure 2) For the distance categories 225 (9) were classified as closest to the river 216 (9) as close and 1930 block groups (81) as distant from the river

Comparison of Figures 1 and 2 show that block groups include smaller geographic areas than ZIP codes and that block groups are relatively smaller in area in urban versus rural areas The study area includes a total of 368 ZIP codes compared to a total of 2371 block groups ZIP code average population in the study area is approximately 8000 compared to block group average population of 1200 Comparison of Figures 2 and 3 shows that using population-weighted centroid distance to the river produces different exposure areas than adjacency Fifty-three (20) of the adjacent block groups were categorized for the distance indicator as distant because the population in the block group was concentrated in the area farther than one mile from the river On the other hand about 11 of the 2116 non-adjacent block groups were classified as closest (77) or close (162)

Hospitalization Data

Hospitalizations for ischemic heart disease (IHD) and cerebrovascular disease (stroke) were included in this study More specifically hospital discharge records that listed as the principal diagnosis either IHD (ICD-9-CM 410-414 and 4297) or cerebrovascular disease (ICD-9-CM 430-436) and subsets of each of these categories similar to the subsets evaluated in the Sergeev et al and Shcherbatykh studies were included The subsets are AMI and angina subsets of IHD and ischemic and hemorrhagic stroke which are subsets of cerebrovascular disease (stroke) (Principal diagnosis represents the illness for which the person was admitted to the hospital) Angina is an outcome not evaluated in the prior studies It was added here for comparative purposes because it represents an outcome appropriately addressed by primary outpatient rather than inpatient hospital care Angina is one of a group of outcomes included in the recently developed national ldquoPrevention Quality Indicatorsrdquo (USDHHS AHRQ 2007) as an indicator for geographic areas or populations for whom primary care is inadequate Table 3 lists the specific ICD-9-CM codes and definitions included in this investigation

While one of the previously published studies (Huang et al 2006) showed elevations in hypertension (high blood pressure) rates this outcome is rarely listed as the principal reason for hospitalization As a secondary diagnosis however hypertension is extremely common with substantial overlap with the primary diagnoses of IHD and stroke In addition hypertension hospitalizations were not as elevated in the ZIP codes adjacent to the Hudson River as the IHD and stroke outcomes As a practical matter geocoding the hypertension hospitalization data (1364000 hospitalizations) would overwhelm the resources for this project For these reasons hypertension was not included in this follow-up investigation

The source of the hospitalization data was the NYS DOH Statewide Planning and Research Cooperative System (SPARCS) established in 1979 to collect detailed records on discharges from hospitals located in New York State Persons seen in the emergency department but not admitted are not included in the SPARCS data Hospitalizations at US Veterans Administration

10

Hospitals are also not included in this dataset We identified the hospital transfers in the data and excluded them so that individuals are only counted once for the same event

The NYS DOH Data Protection Review Board for SPARCS hospitalization data approved our use of hospitalization records for this investigation and provided approximately 460000 records with a principal diagnosis of either IHD or stroke Because of the large number of cardiovascular hospitalizations and this studyrsquos requirement for geocoding each hospitalized personrsquos address we selected a 50 random sample of the hospital discharges with principal diagnoses of IHD or stroke for inclusion in the study (229948 records) Approximately one percent of these records (2737) were eliminated from the study because they contained no residential address resulting in 227211 hospitalization records 24939 of these records were eliminated after geocoding because they were not within the 12 county study area This left 202272 records remaining in the study

For geocoding to the block group level the hospitalization street addresses were modified as needed using US Postal Service standards The addresses were assigned geographic coordinates using commercially available geocoding software (MapMarker 2004) A combination of land parcel data and street centerline files were used to assign geographic coordinates No contact was made with cases parents legal guardians or next of kin of cases to determine residential locations

Once geographic coordinates were assigned to cases through address-matching the case locations were overlaid onto digital maps of the study area using a geographic information system so that the number of observed cases falling within the study area boundaries could be determined and a 2000 Census block group could be assigned to each hospitalization In order to protect confidentiality no maps of individual case locations are provided Geographic coordinates were not able to be assigned for some addresses for example rural routes or post office boxes Internet searches were conducted for addresses that were located in facilities such as nursing homes and trailer parks

While fewer than one percent of records were missing sex or age information approximately 15000 records seven percent were missing race information Evaluation of the geographic and temporal distribution of these records showed they were not randomly geographically distributed 47 of the records with no race information were from one hospital for that hospital race was missing from 100 of the records in all years except one after 1997 mid-way through the study time-frame Evaluation of other records missing race showed similar patterns of 100 of records missing race in specific years In order to prevent biased hospitalization rate estimates records missing race were assigned race randomly assuming a race distribution equal to that of the individualrsquos residential block group

Statistical Analysis

This investigation used negative binomial multivariable regression models the statistical methods used in the previous studies (Sergeev and Carpenter 2005 [IHD] Shcherbatykh et al 2005 [stroke]) The follow-up studyrsquos analyses compared hospitalization rates for populations in block groups adjacent or in close proximity to the Hudson River to rates for populations more

11

distant from the Hudson River while adjusting for the potential influence of other factors Sex age and race-specific hospitalization count data comprise the dependent variable and the statistical methods attempt to predict the counts using the sex age and race distributions from Census data as well as block group-level demographics such as median income Models were first estimated using basic Census information for age sex and race Then more fully adjusted models were estimated using information at the block group level for income education population density Hispanic ethnicity and distance to the nearest hospital

More specifically each hospitalized personrsquos block group was used to assign quartile categories of block group median household income of population with less than a college education population density of the population identified as Hispanic and distance from the block group centroid to the nearest hospital These variables were selected in advance The population density variable was selected to account for potential effects on hospitalization rates of large differences in population density from the relatively rural northern counties to the more urban southern counties in the study area of the population identified as Hispanic was included because Hispanic ethnicity may be associated with CVD risk but Hispanic ethnicity was not available in the hospitalization records Distance to the hospital was included because studies show that shorter distances increase the likelihood that individuals with AMI and stroke survive to be admitted to the hospital

As described previously the literature about risk factors for CVD shows strong associations between lower income and education and higher CVD risk Regarding the effects of income and education on hospitalization rates the literature from analytic studies specifically of hospitalizations for CVD is quite sparse Most analytic studies focus on prevalence incidence or mortality in population-based studies or cohort studies and there are not many of these An example of a study that was useful as background is a study of income level and CVD mortality that showed strong associations of lower income and higher CVD mortality for men and women with the relationship for men being linear throughout the range of income levels while for women the relationship was only strong at levels of income below the median (Rehkopf 2008)

This follow-up studyrsquos approach was to use the same set of variables for the more fully adjusted models using quartile cut points for all the sexrace groups and types of CVD diagnostic subsets The 2000 Census data were used to create the population-weighted quartile categories to be used in the statistical analyses The quartile cut points are shown below

population density(persons per square mile)

median 1999 household income

with less than a college education

with Hispanic ethnicity

Hospital distance

Quartile 1 61 - lt507 $2499-lt$40968 0 - lt31 0 - lt2 lt25 (median) Quartile 2 507 - lt102 $40968 - lt54650 31 - lt43 2 - lt5 25 - lt6 (50-75) Quartile 3 102 - lt6410 $54650 - lt72944 43 - lt55 5 - lt10 6 ndash 11 (75-90) Quartile 4 6410 - 94372 $72944 - 200001 55 - 100 10 - 86 11 ndash 45 (90-100)

SAS version 9 provided the statistical programs used to estimate hospitalization rates for exposure proxy categories and demographic categories The models provide ldquocontrast

12

estimatesrdquo ie rate ratios for one category versus a reference category of a variable and 95 confidence intervals (CI) for these rate ratios For the location categories rate ratios for the areas adjacent are compared to areas not adjacent to the river (reference) and areas closest and close to the river are compared to areas distant from the river (reference) If a rate ratio is greater than 100 the model has estimated an elevation of hospitalizations in that category of the study population compared to the reference population If the rate ratio is less than 100 then there are fewer than expected hospitalizations compared to the reference population

The magnitude of the elevation or deficit is also estimated by the rate ratio For instance if the hospitalization rate is twice as high in the adjacent population it would result in a rate ratio of 200 while a 50 higher rate would result in a rate ratio of 150 If the hospitalization rate in the comparison group was only half as big as in the reference group this would result in a rate ratio of 050 The 95 confidence interval assists with determining whether the observed differences are statistically meaningful ie statistically significant The confidence interval provides the range in which there is a 95 probability of including the true rate ratio

In addition to directly assess potential confounding we examined the associations between income and exposure proxy categories and between income and CVD hospitalization risk Because the modeling process provides rate ratios that provide comparisons among groups but not the rates themselves we also calculated the rates per population for each sex race and income category in order to directly observe the relative magnitudes of CVD hospitalization rates Finally analyses stratified by income quartile were also conducted in order to assess the potential role of confounding by income The stratified analyses consist of negative binomial regressions conducted separately for each quartile of income

4 RESULTS

Demographics Associations with Proximity to the River

Study Population Table 4 provides detailed demographic characteristics for the total study population and the populations of the adjacent versus non-adjacent and nearer versus farther from the river block groups from the 2000 Census The total population of the study area for this one year of the study time period is 2924631 persons The first row of Table 4 shows that approximately 11 percent of the study area population is classified as living in block groups adjacent to the Hudson River and 89 live in non-adjacent block groups Approximately 8 of the studyrsquos population lives in the closest block groups approximately 10 lives in close block groups and approximately 82 in distant block groups The table also shows that approximately 81 of the total study population is identified as white 9 black 3 Asian less than 01 Pacific Islander 02 American Indian 4 ldquootherrdquo and 2 ldquomultiple racesrdquo

The 2000 Census data (Table 4) also show that the populations in block groups adjacent and nonshyadjacent to the river are similar in terms of the percent of minority population but the adjacent block groups include a smaller percentage of persons who have attended college and a higher percentage of households below the poverty level Median income is $47320 for block groups adjacent versus $62668 for block groups not adjacent to the river Data for the study area from the 1990 Census show similar distributions (data not shown)

13

For the distance to the river categories there is a gradient for the three levels of distance from the river with the closest block groups on average having higher percentages of minorities smaller percentages of persons who have attended college and higher percentages of households below the poverty level (Table 4) The populations of the close and closest block groups are more similar to each other than to the populations of the distant block groups The population of the closest block groups is estimated to be 19 black close block groups 14 black and distant block groups 8 black The proportions for the ldquootherrdquo category used in the analyses which includes multiple races and American Indians are approximately 12 in both the closest block groups and in the close block groups compared to 5 in the block groups most distant from the river Median income for the closest block groups is $40748 close block groups $44962 and most distant block groups $64847

The bar chart below illustrates the correlations between US Census 2000 indicators of block group socio-economic status and block group centroid (population weighted) distance from the river for the study population

2000 Census Demographics for Study Population by Distance to River Category

black other hispanic no college below poverty

of pop ulati on

0

10

20

30

40

50

60

lt12 mile

12 to 1 mile

gt1 mile

Table 5 shows selected demographic information for the person-year estimates used in the study analyses for the 1990-2005 study period Person-year data represent individuals from Census estimates counted repeatedly for each year of the study timeframe As described in the Methods section the estimation of person years used interpolation to account for population changes from 1990-2000 and techniques for taking account of block boundary changes The person-years estimates for demographic categories (complete data not shown) show similar patterns as the 1990 Census data (data not shown) and 2000 Census data (Table 5) Table 5 for example shows nearly identical percentages of population in each of the river distance categories as Table 4

Table 5 also shows person year information for income quartiles and exposure proxy categories as well as race and exposure proxy categories that is consistent with the Table 4 data for the

14

2000 population Table 5 row percentages show for example that while approximately 11 of the total study population live in adjacent block groups approximately 19 of the population in the lowest income quartile live in adjacent block groups By design approximately 25 of the total study population is in each of the income quartiles Table 5 column percentages show however that in the adjacent block groups 41 of the population is in the lowest income quartile and only 9 are in the highest income quartile

For the block groups within frac12 mile of the river the distribution among income quartiles is even more skewed towards the lowest income quartile 61 of the population living within frac12 mile of the river is in the lowest income quartile and only 6 is in the highest income quartile Given that approximately 25 of the total study population is within each income quartile these percentages indicate that living within the block groups closest to the river is associated with more than a doubling of the likelihood of being in a block group in the lowest socio-economic status quartile 436 of those living in the block groups close to the river (12 to one mile from the river) are in block groups in the lowest income quartile While less than a doubling this indicates that living in the close block groups is associated with a 75 increase in the likelihood of being in the lowest quartile for socio-economic status

Hospitalizations Table 6 shows the process of removing hospitalization records from the study for a variety of reasons The first column includes the 202272 records that contained a residential address and were located within the 12-county study area as described in the Methods section The next five columns show the records remaining after data cleaning exclusion of transfers elimination of records with missing covariates and exclusions due to poor geocoding results Approximately 33000 records were eliminated because they were transfers from one hospital to another for the same event For approximately 1100 records geocoding was inadequate for assigning block group for imputing race as described in the Methods section 336 records were missing sex (11) or age (325) Of the 167733 records remaining 154220 (92) were able to be adequately geocoded to a street address for assignment in a specific block group While approximately eight percent of hospitalizations were eliminated due to inadequate geocoding Table 6 shows that the percentages of hospitalization records for white black Asian and ldquootherrdquo categories and the distribution among diagnostic categories change very little after these records were dropped

The final column of Table 6 shows that of the 154220 hospitalizations in the analysis 101012 IHD hospitalizations comprise the largest health outcome category and are 65 of the hospitalizations in the study AMI hospitalizations (35811) account for 35 of the IHD hospitalizations and angina (4731) accounts for five percent of the IHD hospitalizations Stroke hospitalizations (53208) comprise 34 of the studyrsquos hospitalizations Approximately 36 of the stroke hospitalizations are for ischemic stroke (19371) and approximately 11 are for hemorrhagic stroke (5995)

Table 7 shows the distributions of hospitalizations among the four income quartiles While each income quartile (by definition) contains approximately 25 of the study population for all types of hospitalizations we see that the lowest income quartile includes more than 25 of hospitalizations from a minimum of 286 (IHD) to a maximum of 356 of hospitalizations (angina) The highest income quartile includes fewer than 25 of hospitalizations for each CVD

15

category ranging from 182 for angina to 224 for hemorrhagic stroke These deviations from the 25 quartile distribution show a strong linear pattern with the largest percentage of hospitalizations coming from residents of block groups in the lowest income quartile and the smallest percentages coming from residents of the most wealthy block groups For AMI the lowest income quartilersquos share of hospitalizations is 50 higher than the highest income quartilersquos share of hospitalizations (30 vs 20) IHD and stroke show similar differences and angina shows a doubling with the lowest income quartile contributing 36 of hospitalizations compared to the highest income quartilersquos 18 share of hospitalizations

Table 7 also shows the distribution of hospitalizations among the race categories and the study population race category distribution for comparison Blacks show lower percentages of IHD and AMI than their population percentages and higher percentages of angina and stroke While full presentation of race and sex findings is beyond the scope of this report the lower than expected percentages of IHD and AMI among blacks who on average have lower socioshyeconomic status and are at greater risk for CVD hospitalizations raised questions that were addressed by calculating hospitalization rates per 1000 population for each sex and race group for each income quartile (Appendix B Table 2) Figures 4 5 and 6 show these rates for the white black and Asian categories (These rates for the ldquootherrdquo category are comparatively very high and are not shown on these graphs This issue is addressed in the Discussion section) Figures 4 5 and 6 show the consistent and strong effects of income quartile on hospitalization rates for AMI and stroke for whites blacks and Asians of both sexes The other outcomes show similar patterns (The ldquootherrdquo group showed an opposite pattern [See Discussion Section])

Demographics Summary

The strong associations of lower income with residence closer to the river (Table 5) and lower income with higher hospitalization rates (Tables 7 amp 8) create a difficult challenge for this investigation The multivariable analyses of hospitalization rates attempt to control for socioshyeconomic differences However because these income differences which strongly affect hospitalization rates are also strongly correlated with the gradations of exposure (river distance) it will be difficult to disentangle the influence of socio-economic status from the influence of potential exposures

Statistical Analyses Proximity to the River and CVD Hospitalization Risk

All the variables described above were included in the multivariable regression models and almost every quartile category for every variable showed statistically significant and expected types of effects for all the CVD hospitalization categories (Appendix B Tables 1a and 1b) The following presentation of results focuses on the rate ratios for the exposure proxy variables Table 8a (IHD) and 8b (stroke) show the regression modeling results Results are shown first for models that include only age sex and race in addition to the exposure proxy categories and second for models that also include median income education population density Hispanic ethnicity and distance to nearest hospital The tables provide numbers of hospitalizations and person-years to assist with interpretation For example for total IHD we can see that while 108 of the study population lives in adjacent block groups 122 of the hospitalizations are among people in adjacent block groups This difference of 14 percentage points is the basis for

16

the finding of a statistically significantly elevated rate ratio (risk) for IHD among people living in the adjacent block groups (RR=133 CI 112-157)

Looking across the columns of Tables 8a and 8b we can see the rate ratios for each of the outcomes for the adjacent versus non-adjacent block groups and the results are similar to those for IHD The elevated rate ratios range from 123 for angina to 136 for AMI The only rate ratio not elevated in these first models adjusting only for age sex and race is for hemorrhagic stroke Going down the rows of the Table the next set of rate ratios are for the distance to the river categories The findings are very similar to those for the adjacent block groups Here however we have two exposure proxy categories allowing comparison of the two These rate ratios show no evidence that living in the block groups closest to the river (lt12 mile to the river) produces greater risk than living in the block groups close to the river (12 to 1 mile from the river)

The next set of rate ratios are for models with additional adjustment for the demographics of each hospitalized personrsquos block group as well as distance to the nearest hospital As expected adjustment for block group median income and other demographics accounts for some of the variability in hospitalization counts and thereby reduces the magnitude of the rate ratios estimated for the river adjacency and distance categories The rate ratio elevation for total IHD for example is reduced from 133 to 107 (RR=107 CI 104-111) The elevated rate ratios in these more fully adjusted models for the adjacency analyses are of similarly small magnitude ranging from 106 for AMI to 110 for ischemic stroke Four of the six outcomes show statistically significant elevations Again only hemorrhagic stroke shows no evidence of an elevated rate ratio

For the river distance categories none of the rate ratios show any evidence that living in the block groups closest to the river (lt12 mile to the river) produces greater risk than living in the block groups close to the river (12 to 1 mile) Of the six types of outcomes only two show statistically significantly elevated rate ratios for living in the closest block groups (total stroke and ischemic stroke) For the close block groups four out of six outcomes show statistically significantly elevated rate ratios The statistically significantly elevated rate ratios vary from 106 (AMI stroke) to 113 (angina)

To address the strong evidence described earlier for confounding in these data ie the association of lower income with both CVD hospitalizations and residence closer to the river we also conducted the analyses separately for each quartile of income These results are presented in Table 9 The first results are from analyses adjusting only for age sex and race and the second set of results are from models including the additional demographic factors For simplicity rate ratios are shown only for block groups closest to the river compared as in the other models to block groups farther than one mile from the river (Results were similar for the other distance categories (data not shown)

For the two lowest income quartiles Table 9 shows results similar to those for the study population as a whole (Table 8a 8b) However for the two highest income quartiles which represent frac12 of the study population Table 9 shows a consistent pattern of reduced rate ratios for residence in the closest block groups within frac12 mile of the river In the highest income quartile

17

these reduced rate ratios are statistically significant for almost all of the outcomes in both the partially and more fully adjusted models In addition the magnitude of the rate ratio estimates ie their departure from 100 is relatively high For the population living in block groups with the highest median income (income gt$73000) living within frac12 mile of the river is associated with having about 70 of the risk for IHD AMI and stroke (CI=068 069 069) as the population in equally high income block groups farther than one mile from the river In other words among the highest income quartile living this close to the river is associated with an estimated 30 reduction in risk for these types of hospitalizations For angina (CI=028) there is an estimated 70 reduction in risk and for ischemic stroke (CI=058) the reduction is estimated to be 40 These stratified analyses show that in one-half of the study population the two higher income quartiles there is no evidence of elevated hospitalization risk for populations in block groups closer to the river On the contrary reduced rate ratios are shown for all outcomes in the higher income quartiles for residents of block groups within frac12 mile of the river compared to those living farther than one mile from the river

Weighted average rate ratio estimates are a standard method for estimating a general population finding from stratified analyses and are also provided in Table 9 These average rate ratios range from 078 to 102 showing no evidence of elevated risk for the overall population associated with residence within frac12 mile of the river compared to living farther than one mile from the river Results for the other categories of river proximity showed similar results (data not shown) (Given the evidence of confounding interfering with analyses of the entire study population and the opposing findings depending on income quartile no confidence intervals are provided here for the overall population-weighted average estimates)

Summary of Results

In summary the multivariable regression results for the entire study population (Tables 8a 8b) show statistically significantly elevated rate ratio estimates for the adjacent versus non-adjacent block groups for four out of six hospitalization diagnosis categories The analyses also produce statistically significantly elevated rate ratio estimates for block groups closest to the river (less than frac12 mile from the river) and close to the river (between 12 to 1 mile from the river) compared to distant block groups (farther than 1 mile from the river) for most of the outcome categories However the findings provide no evidence that the closest block groups have higher hospitalization rates than the close block groups

The most elevated rate ratio estimate is 113 (angina) indicating an estimated 13 percent increase in risk for angina hospitalizations for residents of close versus distant block groups The only outcome with no statistically significant elevations for living near (adjacent closest or close) the river in the more fully adjusted models is hemorrhagic stroke It is noteworthy that the outcome that is considered an indicator of inadequate primary care rather than of incidence or prevalence angina hospitalizations shows the highest elevations associated with residence near the river (Table 8a) In addition the hospitalization outcome with the least association with socioshyeconomic status in our data (Appendix B Table 1b) and the lowest out-of-hospital mortality (NYS DOH2009b) hemorrhagic stroke is the outcome that shows no pattern of elevations associated with residence near the river (Table 8b) This pattern of findings is consistent with the interpretation that the rate ratios for living closer to the river are reflecting the influence of

18

lower socio-economic status and other known risk factors for hospitalizations for CVD rather than any unusual environmental exposures in areas near the river

The stratified results add strong support to this conclusion They suggest that classical confounding of the exposure-disease relationship by socio-economic status produced biased results in the overall multivariable regression analyses While the regression analysis of the entire study population shows elevated hospitalization rates for residents of block groups near the river compared to residents living farther away (Table 8a and 8b) the stratified analyses (Table 9) show this association applies only to the 50 of residents living in block groups with lower median incomes For the other 50 of residents those who live in block groups with higher median incomes hospitalization rate ratios are reduced for those living near the river Classical confounding refers to the type of confounding that can occur in individual-level or group-level (ecological) studies when a factor such as socio-economic status affects both the exposure (residential location) and the health effect In this study of CVD hospitalizations there are disproportionate numbers of outcomes and exposures (residence near the river) among specific levels of the confounder the lower income quartiles (Tables 4 5 7) The disproportionate numbers of hospitalizations and potential exposures (block groups close to the river) among the lower income quartiles create an artifactual exaggerated ie biased effect on the estimates for the overall analysis (Greenland 1999 Greenland 2003)

5 DISCUSSION

The key finding of this investigation is that the results of the multivariable regression modeling for the entire study population the elevated CVD hospitalization rates for populations living closer to the Hudson River are likely to be artifacts of classical confounding Stratifying on the confounding variable ie holding median income level constant the standard method for controlling for confounding showed the decisive role of income as a confounding variable The associations between living closer to the river and having higher hospitalization rates were evident only for people in the two lowest income quartiles For people in the two highest income quartiles the association was completely opposite ie people living closer to the river had lower rates of hospitalizations This variability of effects can be interpreted as evidence that there are no actual effects of the exposure-proxy categories on hospitalization rates

This type of classical confounding due to the strong influence of a variable that precedes causes or strongly affects both the exposure and the health outcome can occur in any type of epidemiological study The ecologic ie group-level analysis in this study is not the primary cause of the confounding Rather the complex associations in the population among income education CVD risk geographic residential patterns and geographic assignment of exposure status are the basis for the confounding However the studyrsquos group-level design contributes to the studyrsquos inability to completely control for confounding using multivariable regression methods and contributes to difficulties of interpreting the study findings

The stratified results shown in Table 9 may also be a reflection of inadequate control for socioshyeconomic status using quartile groupings A possible explanation for the Table 9 findings is that among lower income quartile populations people living in closest proximity to the river may have the lowest incomes while in higher income quartile populations people living in closest

19

proximity to the river may have the highest incomes This potential income difference associated with river distance within quartile groupings would explain why the stratified analyses showed positive associations with hospitalization risk for lower income residents closest to the river and negative associations with hospital risk for higher income residents closest to the river In this example of a possible explanation it is assumed that river proximity categories are capturing relative income (and hospitalization risk) differences rather than exposure Because the exposure proxy river distance categories are assigned at the group level this type of analysis is not able to distinguish whether these categories are capturing income rather than potential exposure in the analyses

When trying to interpret this studyrsquos findings it is important to keep in mind the many factors associated with CVD risk that are not controlled for in this studyrsquos analyses These include tobacco use high blood pressure high serum cholesterol alcohol use obesity low fruit and vegetable intake physical inactivity diabetes mental ill-health psychosocial stress and the use of certain medications Another factor that is not known to be associated with CVD risk but that is relevant for PCB exposure is consumption of contaminated fish It is possible that lower income populations are more exposed to PCBs because they are more likely to eat fish from the Hudson River This however would not help explain why higher income people near the river showed lowered CVD hospitalization risk than their counterparts living farther from the river

In addition to the confounding due to socio-economic statusrsquos effects on both the risk of CVD and residential location (the exposure proxy) there is the possibility that another pattern of effects also associated with socio-economic status called effect modification plays a role in these data Effect modification occurs when different groups people with differing types of health insurance or differing behaviors for example show different patterns or levels of health effects associated with the same exposures The differing effects (CVD hospitalization risks) of residence in block groups closest to the river by income quartile could be interpreted as evidence of effect modification (Table 9) However the magnitude and the change in direction of effects is unlikely to be due solely to differing biological or behavioral factors associated with income level that affect the results of exposures from living near the river Given the strong associations of socio-economic status and CVD hospitalization risk it is more likely that the analyses stratified by income enhance protection against bias compared to the multivariable models and that the stratified analyses represent findings from better control for confounding as opposed to effect modification

Effect modification is a concept used to describe important relationships between exposures and effects (that may differ among sub-groups for example) that the researcher wants to identify and understand Confounding on the other hand refers to associations among the variables that the researcher is trying to control so that the effects of interest (effects from exposures rather than age or sex for example) can be identified and the strength of the effects can be estimated The ecological design of this study contributes to the difficulty of interpreting whether the study shows evidence of effect modification as opposed to confounding (Morgenstern 1998b Hertz-Picciotto 1998)

Another limitation associated with this studyrsquos design is that block-group level socio-economic and environmental attributes that are assigned to individuals in this type of study can be more

20

strongly correlated at the group level than at the individual level In this way the ecological assignment of variables may lead to increasing problems of confounding because of strong associations among indicators of socio-economic status and environmental exposure categories (Morganstern 1998a) This problem is likely to have affected this studyrsquos results As with effect modification by group described above this type of confounding by group cannot be evaluated with ecologic data because individual-level information is needed to check for this type of ldquocross-levelrdquo bias

Ecological analyses are an important tool for environmental epidemiology and there is therefore a large and growing literature on their strengths and limitations and methods to address limitations (Piantadosi 1988 Greenland 1989 Greenland 1992 Greenland 1994 Richardson 2000) One particular concern is the interpretation of ecological study findings showing modest elevations of risk such as in the current study with rate ratios in the more fully adjusted models generally estimated to be below 110 (Wakefield 2003 Morgenstern 1998) Wakefield states that slight elevations need to be interpreted with great caution

in particular for cancer and heart disease (because) there are risk factors that are far more predictive of disease than environmental factors hellip Consequently the potential for confounding is strong since ecological studies do not directly use individual-level risk factor data although stratification by age and gender is routinely carried out (Frequently an area-level measure of socio-economic status is also used as a confounding variable but being an aggregate summary it only provides very crude control) (Wakefield 2003 p 9-10)

A strength of this study is that it used Census block groups which are relatively small average population of 1200 in the study area to assign socio-economic characteristics rather than the larger ZIP codes with average population size 8000 in this area By using block groups this study sought to reduce confounding by socio-economic status because the assignment of median income and other variables at the block group level was expected to be more accurate with block group populations expected to be more homogenous than larger and more geographically diffuse ZIP code populations used in the prior studies

The use of census block groups rather than ZIP codes likely enhanced the accuracy of the group-level category assignments However this type of design that assigns characteristics to hospitalizations using group-level characteristics and distance to the river is limited in comparison to individual-level designs that gather such information from personal interviews for example Most importantly for this investigation the group-level assignment of median income and other demographic variables to each hospitalization limited the studyrsquos ability to distinguish among individuals within the exposure proxy categories and thereby reduced the studyrsquos ability to control for confounding

This type of measurement error or inaccuracy from assigning group-level potential exposure income or educational level to the individual hospitalizations is generally assumed to bias a study towards findings of no associations or no effects But in situations where the measurement error is more severe for some variables than others and these variables are correlated then the study findings can be biased either towards exaggerated effects or no effect In this

21

investigation potential measurement error associated with the assignment of exposure categories based on block group centroid distance to the river is expected to be relatively low because a large proportion of the hospitalization addresses are very clearly in areas distant from the river The measurement error associated with the assignment of group-level demographic characteristics (income and education for example) from the Census is potentially relatively higher because this type of error can occur throughout every block group in the study

Another strength of this study is the use of NYS SPARCS data a relatively comprehensive population-based source of information about hospitalizations These data include individual-level information used in this study for patient address sex age race and diagnosis Of these variables only the address and race categories were missing information for substantial numbers of records and this limitation could have contributed to imprecision sufficient to create bias in the findings These issues are discussed in greater detail below

While geocoding to block groups represents a strength of this study this procedure required high quality address information One percent of the study arearsquos hospitalization records contained no address and eight percent of the addresses could not be accurately geocoded to a block group resulting in nine percent of hospitalizations being eliminated from the study because of address andor geocoding issues The distributions of race categories and distribution among the types of diagnoses did not change after dropping records with no or insufficient address information suggesting that the dropped records did not represent a substantially skewed group of records

Inaccuracy in the studyrsquos assignments of race categories results from limitations associated with both the hospitalization and Census data The SPARCS hospitalization database did not include information on race for seven percent of the studyrsquos hospitalizations We imputed race by assigning race category randomly based on the race distribution of the personrsquos block group This method would have biased the study toward a finding of no effect of race on hospitalization risk

Another limitation of the SPARCS data used in this investigation is that repeat hospitalizations for the same individual were not able to be identified If an individual was admitted to the hospital many times over the course of the study period then that individual would be counted multiple times This limitation may have contributed to an exaggerated association of lower socio-economic status and CVD hospitalizations because some people with lower incomes receive less adequate outpatient care and may have more repeat admissions Because of the association of lower income with residence near the river this limitation may have contributed to the difficulty of adequately controlling for confounding by socio-economic status in this investigation

While the use of Census data at the block group level rather than at the ZIP code level is a strength in terms of relative homogeneity among the population within block groups it is possible that block group population estimation errors contributed to imprecise estimation of hospitalization rates Particularly with regard to the race categories the study finding that the ldquootherrdquo category showed a counter-trend of higher income and higher CVD hospitalization rates (Appendix B Table 2) is evidence that differences between the categories used in the Census race designations and hospitalization race assignments affected the estimated hospitalization

22

rates The evolving choices for identifying onersquos race in the Census are not concurrently reflected in the SPARCS data and these inconsistencies likely contributed to imprecision in the estimates of CVD hospitalization risk associated with specific race categories These issues did not contribute to bias for the overall findings however We modeled the data with a variety of race category groupings combining black and ldquootherrdquo for example and the overall results did not change (data not shown)

Also regarding the Census undercounting of approximately two percent for the US population in 2000 was estimated to be primarily among the lowest socio-economic groups and minorities while over-counting of one percent was estimated to have occurred among the most wealthy who sometimes received and completed forms at multiple residences (United States Censusrsquo Monitoring Board 2001) The undercounting of the lower income population would result in over-estimation of rates in this population Undercounting of recent Asian immigrants for example particularly those of lower socio-economic status is a potential reason for the relatively high rates of CVD hospitalizations estimated for lower income quartiles of the Asian population (Figures 4-6) Models that grouped the Asian population with other race categories were also estimated and these showed the same results as the models reported here (data not shown) Undercounting of lower income groups in general would have contributed to the finding of higher hospitalization rates in these groups and added to the difficulty of adequately controlling for confounding of lower income with residence near the river

This study used a 50 percent sample of the CVD hospitalizations in the study area This was accomplished by using every other record provided in the complete dataset This would be expected to provide a non-biased random sample As a general check for data provision management or analysis errors we compared our estimates of hospitalization rates per population with those from other sources Table 10 shows these comparisons and shows that our estimates are generally consistent with rates from other sources

Actual exposures were not measured in this investigation Rather proximity to the river was used as a proxy for potential exposures associated with the river In addition other potential exposures that may impact CVD were not evaluated or measured Exposures to lead and air pollution would be expected to be higher in more urban areas and areas near heavy traffic which may tend to coincide with proximity to the river These issues were not addressed in this investigation If these exposures were occurring more often near the river and contributing to cardiovascular hospitalization rates this would contribute to findings of elevations in areas closer to the river

It is important to keep in mind the limitations associated with using CVD hospitalizations when incidence andor prevalence of CVD are the actual outcomes of interest As described previously mortality prior to reaching the hospital may reduce hospitalization rates particularly for AMI and stroke for some groups or areas more than others and introduce bias if hospitalizations are being interpreted as a surrogate for incidence or prevalence

23

Addressing Study Objectives

This follow-up report has addressed the studyrsquos first two objectives to increase control for confounding by socio-economic status through the use of block groups rather than ZIP codes as the unit of analysis and to compare the hospitalization rates among residents of block groups nearest the Hudson River to those among residents of block groups farther from the Hudson River Regarding the first objective the use of smaller units of analysis block groups rather than ZIP codes improves the accuracy of the assignment of income and other group-level demographic categories to the hospitalizations

Regarding the studyrsquos second objective to compare hospitalization rates among residents of block groups near to the river to hospitalization rates among residents far from the river this studyrsquos multivariable analyses showed relatively small but statistically significant elevations of CVD hospitalization risk for residents of block groups in close proximity to the Hudson River Stratified analyses that controlled for confounding directly by conducting analyses within each income quartile showed that residence in block groups near the Hudson River was associated with increased risk for CVD hospitalizations among residents of lower income block groups and conversely residence in block groups near the Hudson River was associated with decreased risk for CVD hospitalizations among residents of higher income block groups The estimated increased and decreased risks were of sufficient magnitude to nearly cancel each other out when average rate ratios for the four income quartiles were calculated The average rate ratios may be interpreted as showing no evidence that CVD hospitalization rates are elevated for residents living near the river for the general study population This finding suggests that the multivariable regression findings for the overall population show elevated risks for hospitalizations among residents of block groups near the river due to confounding of the relationship of residential location and CVD risk by socio-economic status

The findings from analyses stratified by income quartile may also be interpreted as showing that residing in a higher versus lower income quartile block group produces different effects from similar exposures (effect modification) As discussed previously the ecological ie group-level analyses are unable to directly evaluate confounding and effect modification Rather information about the strengths and direction of associations among variables in the analyses need to be evaluated in light of known risk factors to draw conclusions about likely explanations and plausible interpretations In this case uncontrolled confounding by socio-economic status appears to be the most likely explanation for the findings This is due to the strong association between residential proximity to the river and socio-economic status

The studyrsquos third objective was to compare the findings of this review to those of the earlier studies that prompted this investigation Because of the similarities in study design and statistical analysis the prior studyrsquos findings can be compared with the findings of the multivariate regression analyses in the current study using Table 1 and Tables 8a and 8b The major difference between the studies is the composition of the comparison areas for the analyses Because of this difference as well as other differences (Table 3) the studies are not entirely comparable

24

Table 1 shows some findings from the prior studies Comparison with Table 8a and 8b from this study shows that the CV hospitalization rate ratios estimated for living in block groups adjacent or near the river in the current study are substantially lower than the rate ratios for residence in ZIP codes adjacent to the Hudson River in the previously published studies In most cases the current analyses show elevations below 10 in contrast to elevations of 20 to 39 in the prior studies The results of the current study are quite similar to the prior study results in terms of the associations reported for age sex race and income (Appendix B Tables 1a and 1b) (Sergeev and Carpenter 2005 Tables 4 amp 5 Shcherbatykh et al 2005 Table 3) These similarities and the general methodological similarities between the current and prior studies suggest the likelihood that the prior studiesrsquo findings also were affected by confounding by socio-economic status

6 CONCLUSION AND RECOMMENDATIONS

While this type of study can neither prove nor disprove causal links among risk factors and health effects it is useful for examining evidence for unusual patterns that might warrant additional investigation This evaluation of hospitalization data residential location and socioshyeconomic factors in block groups in the 12 Counties abutting the Hudson River revealed a closely coinciding pattern of lower socio-economic status and residential proximity to the Hudson River the proxy for potential exposure and a strong association of lower socioshyeconomic status and risk for CVD hospitalizations The strong associations among residential location socio-economic status and CVD risk prevent us from drawing definitive conclusions from the multivariable regression analyses

The multivariable analyses showed relatively small but statistically significant elevations of CVD hospitalization risk for residents of block groups in close proximity to the Hudson River Stratified analyses conducted within each of the four income quartiles showed such elevations only among residents of lower income block groups This studyrsquos data methods and analyses can not provide a conclusive interpretation for this finding However the strong effect of socioshyeconomic status on residential location (exposure proxy) and on CVD hospitalization risk makes it likely that the overall multivariable regression results are affected by uncontrolled confounding by socio-economic status

To draw more definitive conclusions particularly for outcomes such as CVD that are determined by multiple known risk factors studies seeking to investigate the role of PCB or POP exposures will require resource-intensive methods that include gathering individual-level exposure or biomonitoring information as well as individual-level medical histories and information on other risk factors

This current investigation did not include environmental or biological sampling data One study described previously did gather such data in the Hudson River region with the highest potential exposures (Hudson Falls Fort Edward) This study detected a statistically significant reduction in the total PCB concentrations in outdoor air samples at a distance of 1200 meters (34 mile) from the river However the PCB levels in outdoor air in this area were lower than those in other communities with known PCB-contaminated sites and similar to levels reported in other locations in the northeastern United States The study found no detectable differences in study

25

subjectsrsquo serum PCB levels associated with proximity to contaminated portions of the river or other PCB sources (Fitzgerald 2007 Palmer 2008)

It is recommended that the results of the current investigation of CVD hospitalizations and residence near the Hudson River be shared with NYS residents and other stakeholders with interest in this issue and be submitted for publication in a peer-reviewed journal

26

AGENCY INFORMATION

New York State Department of Health Authors Elizabeth Lewis-Michl PhD

Research Scientist Bureau of Environmental and Occupational Epidemiology

June Beckman-Moore MPH Research Scientist Bureau of Environmental and Occupational Epidemiology

Marta Gomez MS Research Scientist Bureau of Environmental and Occupational Epidemiology

Acknowledgments This study was supported by ATSDR and the Centers for Disease Control and Prevention

through a Cooperative Agreement Grant to NYS DOH (3U61TS200002-19W1)

The authors would like to thank the following people for their contribution to this project Alissa Caton PhD of the Bureau of Environmental and Occupational Epidemiology NYS DOH developed the study protocol assembled the preliminary data and conducted preliminary analyses Syni-an Hwang PhD Bureau Director and Shao Lin PhD Research Scientist Bureau of Environmental and Occupational Epidemiology NYS DOH and Edward Fitzgerald PhD Professor Division of Environmental Health and Toxicology SUNY Albany School of Public Health reviewed and provided comments on draft versions

New York State Department of Health Cooperative Agreement Coordinator Don Miles MS

Public Health Specialist Bureau of Environmental Exposure Investigation

ATSDR Technical Project Officer Gregory V Ulirsch PhD

Environmental Health Scientist Cooperative Agreement and Program Evaluation Branch

Division of Health Assessment and Consultation

Mohammed Uddin MD Epidemiologist

Division of Health Studies

ATSDR Regional Representatives Leah Graziano

Senior Regional Representative - Region 2 Office of Regional Operations

Racquel Stephenson Regional Representative - Region 2

Office of Regional Operations

27

REFERENCES Agency for Toxic Substances and Disease Registry (ATSDR) 2000 Toxicological Profile for Polychlorinated Biphenyls (PCBs) ATSDR Division of ToxicologyToxicology Information Branch Atlanta GA

Arisawa K Takeda H Mikasa H 2005 ldquoBackground exposure to PCDDsPCDFsPCBs and its potential health effects a review of epidemiologic studiesrdquo The Journal of Medical Investigation Feb 52(1-2)10-21

Arsenescu V Arsenescu RI King V Swanson H Cassis LA 2008 ldquoPolychlorinated biphenylshy77 induces adipocyte differentiation and proinflammatory adipokines and promotes obesity and atherosclerosisrdquo Environmental Health Perspectives 116 (6) 761-768

Ayala C Croft J Keenan N Neff L Greenlund K Donohoo R Zheng Zhi Mensah G 2003 ldquoIncreasing Trends in Pre-transport stroke deaths- United States 1990-1998rdquo Ethnicity amp Disease 12 Spr S2131-S2137

Barnett E Armstrong D Casper M 1999 Evidence of increasing coronary heart disease mortality among black men of lower social class AEP 98 464-471

Bertazzi PA Bernucci I Brambilla G Consonni D Pesatori AC 1998 ldquoThe Seveso studies on early and long-term effects of dioxin exposure a reviewrdquo Environmental Health Perspectives April 1062 625-633

Carpenter DO 2006 ldquoPolychlorinated biphenyls (PCBs) routes of exposure and effects on human healthrdquo Reviews in Environmental Health Jan-Mar 21(1)1-23

deVreede-Swagemakers JJ Gorgels APDubois-Arbouw WI van Ree JW et al 1997 ldquoOut-of hospital cardiac arrest in the 1990s a population-based study in the Maastricht area on incidence characteristics and survivalrdquo Journal of the American College of Cardiology 30(6) 1500-5

Everett CJ Mainous AG Frithsen IL Player MS Matheson EM 2008 ldquoAssociation of polychlorinated biphenyls with hypertension in the 1999-2002 National Health and Nutrition Examination Surveyrdquo Environmental Research 108 94-97

Fitzgerald EF Belanger EE Gomez MI Hwang S Jansing RL Hicks HE 2007 ldquoEnvironmental exposures to polychlorinated biphenyls (PCBS) among older residents of upper Hudson River communitiesrdquo Environmental Research 104 352-360

Galea S Blaney S Nandi A Silverman R Vlahov D Foltin G Kusick M Tunik M Richmond N 2007 ldquoExplaining racial disparities in incidence of and survival from out-of-hospital cardiac arrestrdquo American Journal of Epidemiology 1665 534-543

Greenland S Morgenstern H 1989 ldquoEcological bias confounding and effect modification International Journal of Epidemiology 18269-74

28

Greenland S 1992 ldquoDivergent biases in ecologic and individual level studiesrdquo Statistics in Medicine 111209-23

Greenland S Robins J 1994 ldquoInvited commentary ecologic studies ndash biases misconceptions and counterexamples American Journal of Epidemiology 139747-60

Greenland S Pearl J Robins JM 1999 Causal diagrams for epidemiologic research Epidemiology 1037-48

Greenland S 2003 ldquoQuantifying biases in causal models classical confounding vs collidershystratification biasrdquo Epidemiology May Vol 143 300-306

Ha M Lee D Jacobs DR 2007 ldquoAssociation between serum concentrations of persistent organic pollutants and self-reported cardiovascular disease prevalence results from the national health and nutrition examination survey 1999-2002rdquo Environmental Health Perspectives 1158 1204-1209

Hennig B Meerarani P Slim R Toborek M Daugherty A Silverstone AE Robertson LW 2002 ldquoProinflammatory properties of coplanar PCBs in vitro and in vivo evidence Toxicology and Applied Pharmacology Jun 15181(3)174-83

Hertz-Picciotto I ldquoEnvironmental Epidemiologyrdquo in Modern Epidemiology Rothman KJ and Greenland S 2nd ed Philadelphia Lippincott Williams amp Williams 1998 pp555-584

Howard G Cushman M Prineas R Howard V Moy C Sullivan L DrsquoAgostino R McClure L Pulley L Safford M 2009 ldquoAdvancing the hypothesis that geographic variations in risk factors contribute relatively little to observbed geographic variations in heart disease and stroke mortalityrdquo Preventive Medicine 49 129-132

Huang X Lessner L Carpenter DO 2006 ldquoExposure to persistent organic pollutants and hypertensive diseaserdquo Environmental Research 102(2005) 101-106

Kannel WB Cupples LA DrsquoAgostino RB 1987 ldquoSudden death risk in overt coronary heart disease the Framingham Studyrdquo American Heart Journal 113(3) 799-804

Ko DT Krumholz HM Wang Y Foody JM Masoudi FA Havranek EP You JJ Alter DA Stukel TA Newman AM Tu JV 2007 ldquoRegional differences in process of care and outcomes for older acute myocardial infarction patients in the United States and Ontario Canadardquo Circulation 115196-203

Lee D Lee I Song K Steffes M Toscano W Baker BA Jacobs DR 2006 ldquoA strong dose-response relation between serum concentrations of persistent organic pollutant and diabetesrdquo Diabetes Care 29 (7) 1638-1644

Lee D Lee I Porta M Steffes M Jacobs DR 2007 ldquoRelationship between serum concentrations of persistent organic pollutants and the prevalence of metabolic syndrome among non-diabetic

29

adults results from the National Health and Nutrition Examination Survey 1999-2002rdquo Diabetologia 50 1841-1851

Lind PM Orberg J Edlund U Sjoblom L Lind L 2004 ldquoThe dioxin-like pollutant PCB126 (33rsquo44rsquo5-pentachlorobiphenyl) affects risk factors for cardiovascular disease in female ratsrdquo Toxicology Letters 150 293-299

Longnecker MP Klebanoff MA Brock JW Zhou H 2001 ldquopolychlorinated biphenyl seum levels in pregnant subjects with diabetesrdquo Diabetes Care 24(6) 1099-1101

Lloyd-Jones D Adams R Carnethon et al (AHA Statistics Committee and Stroke Statstics Subcommittee) 2009 ldquoAHA Statistical Update Heart disease and stroke statisticsmdash2009 updaterdquo Circulation 119e21-e181

MapMarker Plus V 100 MapInfo Corp 2004

Morgenstern H 1998 ldquoEcologic studyrdquo In Armitage P Colton T eds Encyclopedia of biostatistics Vol2 NY NY John Wiley and Sons Inc 1255-76

Morganstern H ldquoEcologic Studiesrdquo Chapter 23 in Modern Epidemiology Rothman KJ and Greenland S 2nd ed Philadelphia Lippincott Williams amp Williams 1998 pp459-480

Morland K Wolff M Bopp R Godbold J Landrigan P 2008 ldquoFish consumption and body burden of organochlorines among Lower Hudson urban anglersrdquo American Journal of Industrial Medicine 51587-594

New York State Department of Health (NYS DOH) 2009a ldquoThe Burden of Cardiovascular Disease in New York Mortality Prevalence Risk Factors Costs and Selected Populationsrdquo Bureau of Chronic Disease Epidemiology and Surveillance Bureau of Health Risk Reduction httpwwwhealthstatenyusnysdohchronic_diseasecardiovascularburdenofcvdinnyspdf

New York State Department of Health (NYS DOH) 2009b ldquoHeart Disease and Stroke Indicatorsrdquo Community health assessment indicators Bureau of Biometrics and Health Statistics httpwwwhealthstatenyusstatisticschacchai

OrsquoNeill L 2003 ldquoEstimating out-of-hospital mortality due to myocardial infarctionrdquo Health Care Management Science 6 147-154

Palmer PM Belanger EE Wilson LR Hwang SA Narang RS Gomez MI Cayo MR Durocher LA Fitzgerald EF ldquoOutdoor air PCB concentrations in three communities along the Upper Hudson River New Yorkrdquo 2008 Archives of Environmental Contamination and Toxicology 54363-371

Piantadosi S Byar DP Green SB 1988 ldquoThe ecological fallacyrdquo American Journal of Epidemiology 127893-904

30

Rehkopf DH Berkman LF Coull B Krieger N 2008 ldquoThe non-linear risk of mortality by income level in a healthy population UgtSgt National Health and Nutrition Examination Survey mortality follow-up cohort 1988-2001rdquo British Medical Journal Public Health 8383

Richardson S Montifort C 2000 ldquoEcological correlation studies In Elliott P Wakefield JC Best NG et al eds Spatial epidemiology methods and applications NY NY Oxford University Press 205-220

SAS version 9 Cary North Carolina

Schoenbach V 2001 ldquoMulticausality Effect Modification wwwepidemiolognet Ch12 381shy413

Schoenbach V 2004 ldquoMulticausality Confounding wwwepidemiolognet Ch11 335-375

Sergeev AV and Carpenter DO 2005 Hospitalization rates for coronary heart disease in relation to residence near areas contaminated with persistent organic pollutants and other pollutants Environmental Health Perspectives 113(6)756-761

Shcherbatykh I Huang X Lessner L Carpenter DO 2005 Hazardous waste sites and stroke in New York State Environmental Health A Global Access Science Source 4(18) httpwwwehjournalnetcontent4118

Siemiatycki J Wacholder S Dewar R Cardis E Greenwood C and Richardson L 1988 ldquoDegree of confounding bias related to smoking ethnic group and socio-economic status in estimates of the associations between occupation and cancerrdquo Journal of Occupational Medicine 30 617-625

Tu YK Gunnell D Gilthorpe MS 2008 ldquoSimpsonrsquos paradox Lordrsquos paradox and suppression effects are the same phenomenon- the reversal paradoxrdquo Emerging Themes in Epidemiology 52

United Kingdom Heart Attack Study Collaborative Group Norris RM 1998 ldquoFatality outside hospital from acute coronary event in three British health districts 1994-1995rdquo British Medical Journal 316(7137) 1065-70

United States Bureau of the Census 2000 Census of population and housing summary file 1 (SF1) United States Department of Commerce 2001

United States Bureau of the Census 2000 Census of population and housing summary file 3 (SF3) United States Department of Commerce 2002

United States Bureau of the Census 1990 Census of population and housing summary file 1 (SF1) United States Department of Commerce 1991

31

United States Bureau of the Census 1990 Census of population and housing summary file 3 (SF3) United States Department of Commerce 1992

United States Censusrsquo Monitoring Board ldquoFinal Report to Congressrdquo September 1 2001 httpgovinfolibraryunteducmbcmbpreportsfinal)report

United States Department of Health and Human Services (USDHHS) Agency for Healthcare Research and Quality 2007 ldquoAHRQ Quality Indicators Guide to Prevention Quality Indicators Hospital Admission for Ambulatory Care Sensitive Conditionsrdquo October 2001 Version 31 (March 12 2007) downloaded 2009 httpwwwqualityindicatorsahrqgov

United States Department of Health and Human Services (USDHHS) Centers for Disease Control 2007 Health United States A Chartbook National Center for Health Statistics wwwcdcgovnchsdatahushus07pdf

United States Department of Health and Human Services (USDHHS) Centers for Disease Control National Center for Health Statistics 2007 ldquo2005 National Hospital Discharge Surveyrdquo No 385 by De Frances C and Hall M Division of Health Care Statistics

United States Department of Health and Human Services (USDHHS) 2009 Centers for Disease Control and Prevention National Center for Environmental Health 4th National Report on Environmental Chemicals httpwwwcdcgovexposurereportpdfFourthReportpdf

United States Department of Health and Human Services (USDHHS) National Institutes of Health 2010 National Institute of Environmental Health Sciences National Toxicology Program 11th Report on Carcinogens httpntpniehsnihgovntproceleventhprofiless149pcbpdf

United States Environmental Protection Agency (USEPA) 2002 EPA Superfund record of decision Hudson River PCBs Site Hudson Falls to New York City New York Washington DC

Wakefield J 2003 Sensitivity Analyses for Ecological Regression Biometrics March 59 9-17

Wakefield J 2004

Wing S 1988 Social inequalities in the decline of coronary mortality American Journal of Public Health 781415-1416

World Health Organization 2010 Atlas of Heart Disease and Stroke Part 2 Ch 3 Cardiovascular Disease Risk Factors httpwwwwhointcardiovascular_diseasesencvd_atlas_03_risk_factorspdf

32

Table 1 Previous Study Results Statewide Analyses and Hudson River Specific Analyses

Principal Diagnosis of Hospitalization

Adjusted Rate Ratioa (Lower 95CI shyUpper 95 CI)

NYS analysis Hudson River Subset

analysis Ischemic Heart Diseaseb

115 (103-129) 136 (118-156) Acute Myocardial Infarction 120 (103-139) 139 (118-163)

Cerebrovascular Diseasec 115 (105-126) 120 (110-132)

Ischemic Stroke 117(104-139) --shy Hemorrhagic Stroke 110 (099-122) ---

Hypertensionc 119 (108-131) 114 (105-123)

Table adapted from Sergeev et al 2005 Shcherbatykh et al 2005 and Huang et al 2006 Ischemic heart disease and stroke hospitalization rates for NYS ZIP codes containing (or

adjacent to) inactive hazardous waste sites with persistent organic pollutants compared to ZIP codes without (or not adjacent to) such sites

a All analyses include a 2nd exposure category for other types of inactive hazardous waste sites and adjust for age race and sex

b Ischemic heart disease analyses include all ages over 25 Ischemic heart disease analyses adjust for quartiles of median household income and health insurance coverage

c Cerebrovascular disease and hypertension analyses include ages 25-64

33

Table 2 Comparison of Design amp Methods for Previously Published Studies and Current Follow-up Study

OUTCOME ASSESSMENT CONFOUNDER ASSESSMENT EXPOSURE ASSESSMENT Previous Study Follow-up Study Previous Study Follow-up Study Previous Study Follow-up Study

Coronary Heart Disease 1993-2000 More years 1990-2005 ZIP code level Block group level ZIP code level Block group level

Ischemic heart disease (ICD-9 410shy Same outcomes with 4 levels of income ndash Same income Exposure = residence Two types of exposure 414) and sequelae of myocardial one addition separate 2000 Census categories within 196 ZIP codes indicator categories within infarction not elsewhere classified analysis of angina (ICDshy quartiles with hazardous waste 12 counties abutting (ICD-9 4297) Separate analysis of 9 413) sites containing POPs Hudson River acute myocardial infarction 6 age groups Same age groups or 222 ZIP codes with (ICD-9 410) age 25 and above hazardous waste sites Adjacency 255 block

without POPs versus groups adjacent versus Principal or other diagnoses (up to 14) Only principal diagnoses 4 race groups Different race residence within 996 2116 not adjacent to the combined AsianPacific categories White ldquocleanrdquo ZIP codes Hudson River

Use the admission Islander White black AsianPacific statewide Reference Sergeev AV Carpenter source field to eliminate black Native Island and Other which Distance Block group DO Hospitalization rates for coronary intrahospital transfers for American includes Native Sub-analyses (focus of population-weighted heart disease in relation to residence the same individual American and Multiple follow-up) Hudson centroid distance from the near areas contaminated with Race River area 78 POP ZIP Hudson River lt12 mile persistent organic pollutants and other codes versus 996 ldquocleanrdquo (225 block groups) pollutants Environ Health Perspect Add education ZIP codes statewide between frac12 and 1 mile 2005 113(6)756-761 population density (216 block groups) farther

hispanic and distance to than 1 mile (1930 block nearest hospital as groups) additional covariates

Stroke 1993-2000 More years 1990-2005 Restricted to 2nd and (same as above) (same as above) (same as above) 3rd quartile income

Cerebrovascular disease excluding Same outcomes ZIP codes other ill-defined and late effects of cerebrovascular disease (ICD9 430shy 4 age groups 25-64 436) Separate analyses of ischemic stroke (433x1 434x1 436) and 2 race groups hemorrhagic stroke ( 430-432) White black

Principal or secondary diagnoses (up Only principal diagnoses to 14) combined

Use the admission Reference Shcherbatykh I Huang X source field to eliminate Lessner L Carpenter DO Hazardous intrahospital transfers for waste sites and stroke in New York the same individual State Environ Health 2005 418

34

Table 3 Study Cardiovascular Disease Outcomes ICD-9-CM Codes and Definitions

Ischemic Heart Disease

ICD-9-CM 410 - ldquoAcute myocardial infarction (AMI)rdquo is defined as a sudden insufficiency of blood supply to an area of the heart muscle usually due to a coronary artery occlusion (obstruction) commonly known as heart attack

ICD-9-CM 411- ldquoOther acute and subacute forms of ischemic heart diseaserdquo refers to complications or symptoms occurring before or after AMI or a coronary artery occlusion (obstruction) interrupting blood flow to the heart without AMI

ICD-9-CM 412 - ldquoOld myocardial infarctionrdquo refers to a past AMI that is currently presenting no symptoms ICD-9-CM 413 - ldquoAngina pectorisrdquo is defined as severe constricting pain in the chest often radiating from the

area immediately over the heart to the left shoulder and down the arm ICD-9-CM 414 - ldquoOther forms of chronic ischemic heart diseaserdquo includes coronary atherosclerosis a chronic

condition marked by thickening and loss of elasticity of the coronary artery caused by deposits of plaque

ICD-9-CM 4297 - ldquoCertain sequelae of myocardial infarction not elsewhere classifiedrdquo refers to other symptoms or complications associated with a prior AMI

Cerebrovascular Disease

ICD-9-CM 430 - ldquoSubarachnoid hemorrhagerdquo is bleeding in the space between the brain and lining ICD-9-CM 431 - ldquoIntracerebral hemorrhagerdquo is bleeding within the brain ICD-9-CM 432 - ldquoOther and unspecified intracranial hemorrhagerdquo is bleeding nontraumatic between the skull

and brain lining ICD-9-CM 433 - ldquoOcclusion and stenosis of precerebral arteriesrdquo is blockage or stricture of the arteries

branching into the brain ICD-9-CM 434 - ldquoOcclusion of cerebral arteriesrdquo includes cerebral thrombosis and cerebral embolism ICD-9-CM 435 - ldquoTransient cerebral ischemiardquo includes cerebrovascular insufficiency (acute) with transient

neurological signs and symptoms ICD-9-CM 436 - ldquoAcute but ill-defined cerebrovascular diseaserdquo

35

Table 4 ndash Census Demographics for Study Area by Proximity to the Hudson River 2000

Census Demographics Study Area Within Block Groups Adjacent to River Block Group Centroid Distance to River Yes No lt frac12 mile frac12-1 mile gt 1 mile

n n n n n n Total Population and distribution within river distance categories

2924631 1000 315662 108 2608969 892 231414 79 283321 97 2409896 824

RaceEthnic Distribution 1000 1000 1000 1000 1000 1000 White 2383274 815 255023 808 2128251 816 154371 667 201457 711 2027446 841 Black 277302 95 34039 108 243263 93 43895 190 38959 138 194448 81 Total ldquoOtherrdquo (listed below)

American Indian 7147 02 882 03 6265 02 975 04 1058 04 5114 02

Asian 86465 30 6233 20 80232 31 4711 20 8979 32 72775 30 Pacific Islander 1058 lt01 111 lt01 947 lt01 104 lt01 145 lt01 809 lt01 Other 105182 36 12492 40 92690 36 19288 83 23338 82 62556 26 Multiple Races 64203 22 6882 22 57321 22 8070 35 9385 33 46748 19

Hispanic 268036 92 28449 90 239587 92 40691 176 46978 166 180367 75

Education lt9th grade 108312 56 13370 64 94942 55 13585 91 14518 81 80209 50 9th-12th grade 200374 103 27023 130 173351 100 23306 155 25161 140 151907 94 High school or GED 519236 267 62727 302 456509 263 44366 296 51585 287 423285 263 Some college 336283 173 36464 175 299819 173 24995 167 31141 173 280147 174 Associates degree 150134 77 16574 80 133560 77 10134 68 12939 72 127061 79 Bachelors degree 342297 176 28578 137 313719 181 18898 126 24584 137 298815 185 Graduate or professional degree

285823 147 23199 112 262624 151 14813 99 19909 111 251101 156

+Other SES Factors Households below poverty 91940 87 12875 108 79065 83 13977 154 14685 140 63278 72

mean SD mean SD mean SD mean SD mean SD mean SD Population density 57466 9589 54496 11391 57826 93468 13501 16071 11068 16248 43763 66909 Median household income 61010 31989 47320 23630 62668 32468 40748 21923 44962 20829 64847 32564

Sources US Bureau of the Census 2000 Census of population and housing summary file 1(SF1) US Department of Commerce 2001 US Bureau of the Census 2000 Census of population and housing summary file 3 (SF3) US Department of Commerce 2001

36

Table 5 Study Population Person Years by River Proximity Category For Income Quartiles and Race Categories

Person Years Total Adjacent Non-Adjacent lt12 mile frac12 to 1 mile gt1 mile n (row)

[column ] n (row)

[column ] n ()

[column ] n ()

[column ] n ()

[column ] n ()

[column ] Total Population 30219001 (1000)

[1000]

3269547 (108)

[1000]

26949454 (892)

[1000]

2380477 (079)

[1000]

2834956 (094)

[1000]

25003568 (827)

[1000]

Income Quartile

Lowest Income 7182853 (1000)

[237]

1345251 (187)

[411]

5837601 (813)

[217]

1462710 (204)

[614]

1236936 (172)

[436]

4483206 (624)

[179]

2nd Income 7778921 (1000)

[257]

1077888 (139)

[330]

6701033 (861

[249]

420361 (054)

[177]

808782 (104)

[285]

6549777 (842)

[262]

3rd Income 7671943 (1000)

[254]

551617 (072)

[169]

7120326 (928)

[264]

344840 (045)

[145]

511237 (067)

[180]

6815866 (888)

[273]

Highest Income 7585283 (1000)

[251]

294790 (039)

[090]

7290493 (961)

[271]

152565 (020)

[064]

278000 (037)

[098]

7154718 (943)

[286]

Race Category

White 25859811 (1000)

[856]

2784914 (108)

[851]

23074897 (892)

[856]

1761910 (68)

[740]

2238348 (87)

[790]

21859554 (845)

[874]

Black 2466185 (1000

[82]

297220 (121)

[91]

2168965 (879)

[81]

389721 (158)

[164]

316704 (128)

[112]

1759760 (714)

[70]

Asian 792737 (1000)

[26]

58007 (73)

[18]

734731 (827)

[27]

45151 (57)

[19]

78161 (99)

[28]

669426 (844)

[27]

Multiple Other 1100269 (1000)

[36]

129407 (118)

[40]

970862 (882)

[36]

183696 (167)

[77]

201743 (183)

[71]

714830 (650)

[29]

37

Table 6 Study Hospitalization Record Exclusions

After After dropping After imputing dropping records with

After excluding race amp poorly missing continuation After excluding eliminating geocoded covariates

Geocoded records transfers missing race records (racesex)

Total 202272 201814 02 169143 162 167733 08 154236 80 154220 001 (Loss ) Demograp hic group

White 155216 849 154805 849 132749 859 144262 860 132343 858 132332 858 Black 12878 70 12849 70 11109 72 12010 72 11328 73 11326 73 OtherNa 12745 70 12741 70 8814 57 9313 55 8647 56 8644 56 AsianPI 2021 11 2020 11 1810 12 2148 13 1918 12 1918 12 Missing Race 19412 96 19399 96 14661 87 (1410) Diagnosis

IHD 137569 680 137303 680 111005 656 110005 656 110000 656 101012 655

AMI 51840 256 51635 256 39338 233 39338 233 39013 233 35811 232

Angina Stroke

5792 64703

29 320

5788 64511

29 320

5316 58138

31 344

5316 58138

31 344

5285 57712

31 344

4731 53208

31 345

Isch str 23673 117 23543 117 21012 124 21012 124 20912 125 19371 126 Hem str 8097 40 8074 40 6544 39 6444 39 6492 39 5995 39

Missing covariates 11 records were missing sex 325 were missing age and 14661 records were missing race Race was imputed by random assignment based on block group of residencersquos race distribution for 13251 records (310 were also missing other covariates For approximately 1100 records with missing race geocoding was inadequate for assigning block group and 310 of those missing race were also missing additional covariates) One discharge can have one or more covariates missing (ie missing covariates are not mutually exclusive

38

Table 7 Income Quartile and Race Distributions for CVD Hospitalizations

Study Population

IHD n=101012

AMI n=35811

Angina n=4731

Stroke n=53208

Ischemic Stroke n=19371

Hemorrhagic Stroke n=5995

Income (Quartiles) Lowest Income 237 286 300 356 298 309 301 Second Income 257 254 265 237 256 256 252 Third Income 254 241 236 224 236 234 223 Highest Income 251 221 201 182 209 201 224

Total 1000 999 999 1000 1000

White 855 859 885 833 858 844 809 Black 82 64 62 102 92 108 110 Asian 26 12 12 11 12 14 19 Other 36 65 41 53 38 34 62

Study population income quartiles are based on census block median income Quartile cut-points were developed by weighting block group median income levels by block group population to identify the cut-points within the continuous income distribution After creating the cut-points the study population of each block group was assigned to the appropriate quartile category This resulted in the percentage of population in each quartile varying slightly from 25 because the entire populations of block groups were assigned to the same quartile

39

Table 8a Exposure Area Distributions and Rate Ratios for Ischemic Heart Disease Hospitalizations

Exposure Person-Years

(N=30219001) in

Ischemic Heart Disease Hospitalizations Total AMI Angina

No () No () RR (95 CI) No () RR (95 CI) No () RR (95 CI)

Adjusted for Age Sex Race Adjacent Block Group Yes 3269547 (108) 12327 (122) 133 (112-157) 4398 (123) 136 (114-163) 591 (125) 123 (106-142) No 26949454 (892) 88611 (878) 100 Reference 31380 (877) 100 Reference 4137 (875) 100 Reference Distance From River lt frac12 mile 2380477 (079) 9817 (97) 120 (103-140) 3399 (095) 126 (109-145) 542 (115) 160 (137-188) frac12 - 1 mile 2834956 (094) 11236 (111) 120 (104-140) 4048 (113) 125 (109-144) 610 (129) 156 (133-183) gt1 mile 25003569 (827) 79885 (791) 100 Reference 28331(792) 100 Reference 3576 (756) 100 Reference

Addrsquol adjustment for SES and other factors Adjacent Block Group Yes 3269547 (108) 12327 (122) 107 (104-111) 4398 (123) 106 (101-110) 591 (125) 106 (096-117) No 26949454 (892) 88611 (878) 100 Reference 31380 (877) 100 Reference 4137 (875) 100 Reference Distance From River lt frac12 mile 2380477 (079) 9817 (97) 102 (098-106) 3399 (095) 103 (098-109) 542 (115) 109 (098-121) frac12 - 1 mile 2834956 (094) 11236 (111) 103 (099-108) 4048 (113) 106 (101-111) 610 (129) 113 (102-125) gt 1 mile 25003569 (827) 79885 (791) 100 Reference 28331(792) 100 Reference 3576 (756) 100 Reference

Agegt25 1990-2005 50 sample Adjusted for age (25-34 35-44 45-54 55-64 65-74 75+ years) race (white black other) and sex (male female) Adjusted for SES factors quartiles of population density (61-5066 5066lt-21020 21020lt-64102 64102lt-943716 persons per square mile) median 1999 household income ($249900-4096800 $4096800lt-5465000 $5465000lt-7294400 $7294400lt-20000100) with less than a college education (0-306 306lt-427 427lt-555 555lt-1000) with Hispanic ethnicity (0-21 21lt-47 47lt- 96 96lt-860) and distance to nearest hospital

40

Table 8b Exposure Area Distributions and Rate Ratios for Stroke Hospitalizations Age gt= 25 1990-2005

Exposure Person-Years

(N=30219001) in Study Area

Stroke Hospitalizations

Total Ischemic Hemorrhagic

No () No () RR (95 CI) No () RR (95 CI) No () RR (95 CI)

Adjusted for Age Sex Race Adjacent Block Group Yes 3269547 (108) 6276 (118) 125 (109-144) 2334 (121) 128 (110-148) 635 (108) 099 (086-113)

No 26949454 (892) 46748 (882) 100 Reference 16989 (879) 100 Reference 5263 (892) 100 Reference

Distance From River lt frac12 mile 2380477 (079) 5280 (099) 116 (104-131) 1925 (100) 120 (105-137) 518 (088) 102 (088-117)

frac12 - 1 mile 2834956 (094) 6023 (114) 118 (106-132) 2227 (115) 127 (112-145) 658 (111) 115 (100-131)

gt1 mile 25003569 (827) 41721 (787) 100 Reference 15171 (785) 100 Reference 4722 (801) 100 Reference

Addl Adjustment for SES and Other Factors Adjacent Block Group Yes 3269547 (108) 6276 (118) 107 (103-112) 2334 (121) 110 (104-116) 635 (108) 097 (088 ndash 106)

No 26949454 (892) 46748 (882) 100 Reference 16989 (879) 100 Reference 5263 (892) 100 Reference Distance From River lt frac12 mile 2380477 (079) 5280 (099) 106 (101-110) 1925 (100) 108 (101-115) 518 (088) 089 (081-099)

frac12 - 1 mile 2834956 (094) 6023 (114) 106 (102-111) 2227 (115) 111 (104-117) 658 (111) 108 (099-119)

gt 1 mile 25003569 (827) 41721 (787) 100 Reference 15171 (785) 100 Reference 4722 (801) 100 Reference

Age gt 25 years 1990-200650 sample Adjusted for age (25-34 35-44 45-54 55-64 65-74 75+ years) race (white black other) and sex (male female) Adjusted for SES factors quartiles of population density (61-5066 5066lt-21020 21020lt-64102 64102lt-943716 persons per square mile) median 1999 household income ($249900-4096800 $4096800lt-5465000 $5465000lt-7294400 $7294400lt-20000100) with less than a college education (0-306 306lt-427 427lt-555 555lt-1000) with Hispanic ethnicity (0-21 21lt-47 47lt- 96 96lt-860) and distance to nearest hospital

41

Table 9 Rate Ratios for CVD Hospitalizations Stratified by Income Quartile

Rate Ratio for lt frac12 mile versus gt 1 mile from river

IHD AMI Angina Stroke Ischemic Stroke Hemorrhagic Stroke

RR (95CI) RR (95CI) RR (95CI) RR (95CI) RR (95CI) RR (95CI)

Adjusted for Age Sex Race Lowest Income 1 123 (108-140) 124 (110-139) 139 (120-160) 120 (109-133) 122 (107-138) 118 (103-137) Income 2 115 (101-131) 118 (101-139) 155 (124-195) 124 (113-135) 127 (109-147) --shyIncome 3 088 (075-103) 083 (068-102) 102 (076-137) 092 (079-107) 093 (076-113) --shyHighest Income 4 080 (064-099) 074 (054-103) 030 (013-070) 078 (065-093) 071 (053-095) 043 (024-076) Weighted average 100 098 091 102 100 071

Addrsquol Adjustment for other demographic factors Lowest Income 1 114 (107-120) 114 (106-123) 112 (097-130) 116 (109-124) 120 (110-131) 109 (095-125) Income 2 110 (102-118) 111 (101-123) 136 (108-172) 110 (101-119) 112 (099-128) 095 (076-119) Income 3 085 (077-094) 086 (075-097) 093 (068-126) 097 (087-108) 096 (082-112) 046 (033-066) Highest Income 4 068 (060-078) 069 (057-085) 028 (012-062) 069 (059-081) 058 (044-077) --shyWeighted average 092 093 079 096 093 078

To reduce the complexity of this Table only one of the distance categories is provided The other distance categories showed similar results Adjusted for age (25-34 35-44 45-54 55-64 65-74 75+ years) race (white black other) and sex (male female) Block group median income quartile ranges $249900-4096800 $4096800lt-5465000 $5465000lt-7294400 $7294400lt-20000100 Adjusted for demographic factors quartiles of population density (61-5066 5066lt-21020 21020lt-64102 64102lt-943716 persons per square mile) with less than a college education (0-306 306lt-427 427lt-555 555lt-1000) with Hispanic ethnicity (0-21 21lt-47 47lt- 96 96lt-860) and distance to nearest hospital

42

Table 10 Hospitalization Rate Estimates for Current Study and from Published Sources (age-adjusted to 2000 US standard per 10000)

Current Study 12 Counties abutting Hudson River 1990-2005

Total US CDC 2005

Northeastern US CDC 2005

SPARCS NYS excluding NYC 2005-2007

IHD 649 619 691

AMI 232 231 278

Angina 31 Stroke 349 303 307 351

Ischemic Stroke 127 39 Hemorrhagic Stroke

Source USDHHS 2005 National Hospital Discharge Survey No 385 July 12 2007 Source NYS DOH SPARCS 2009b Comparable estimates were not able to be provided because published outcome groupings differed

43

Figure 1 ZIP codes by Adjacency to the Hudson River

44

Figure 2 Census Block Groups by Adjacency to the Hudson River

45

Figure 3 Census Block Groups by Population-Weighted Centroid Distance to the Hudson River

46

IHD Hospitalization Rates by Sex Race Income (age-adjusted) (other race

group excluded)

000

200

400

600

800

1000

1200

1400

1600

Low est IncQ

Q2 Q3 Highest Inc Q

Income Quartile

Hos

pita

lizat

ions

per

10

00

w hite men black men

asian men w hite w omen

black w omen asian w omen

AMI Hospitalization Rates by Sex Race Income (age-adjusted) (other race

group excluded)

000

100

200

300

400

500

600

700

Low est IncQ

Q2 Q3 Highest Inc Q Income Quartile

Hos

pita

lizat

ions

per

10

00

w hite men black men

asian men w hite w omen

black w omen asian w omen

Stroke Hospitalization Rates by Sex Race Income (age-adjusted) (other

race group excluded)

000

100

200

300

400

500

600

700

800

900

Low est In Q

Q2 Q3 Highest Inc Q Income quartile

Hos

pita

lizat

ions

per

10

00w hite men black men

asian men w hite w omen

black w omen asian w omen

Figure 4 Figure 5 Figure 6

47

MAURICE D HINCHEY 22NO DISTRICT NEW YORK

COMMITTEE ON APPROPRIATIONS

~lIC()foltIIInas

AGRICUJTlAE RURAL OEElOPWNT rooo AND ORUG ADMINISTRATION

AND RBATEO AGEfjC~S

INTERIOR

~onlJltss of tl)t ~nttn ~tatt5 ~oU5t of ItpttUtlltl1tibeg lHaJington 111laquo 20515-3222

October 4 2006

Howard Frumkin MD DrPH Director Agency for Toxic Substances and

Disease Registty and CDCs National Center for Environmental Health

1600 Clifton Road NE MSE-28 Atlanta GA 30333

Dear Dr Frumkin

First I would like to congratulate you on your appoinbnent as Director of A TSDR and CDCs National Center for Environmental Health llook forward to the opportunity to work with you on important public health issues this year and in the future

My second reason for writing today is to request that ATSDR assist my office in conducting a health consultation study for residents along a 200 mile segment of the Hudson River a Superfund site on the National Priority List containing PCBs in order to measure whether further evaluation may be warranted

As you may know Dr David Carpenter has done a number of studies at SUNY Albany in New York that suggest that resideilcy in proximity to hazardous waste sites with Persistant Organic Pollutants (POPs) may be associated with increased hospitalization rates for Coronary Heart Disease (CHD) when compared to hospitalization rates tor areas with no waste sites or other types of waste not catagorized as POPs I have enclosed a copy of three of these research papers that demonstrate this correlation for heart disease hypertension and stroke

I am very concerned about the health of the residents living along the Hudson River and respectfully request that your agency look into what seems to be a very serious health threat For purposes of my request T would appreciate your agency looking into the subset population Dr Carpenter examined in 78 zip codes along the 200 mile stretch of the Hudson River from Hudson Falls to New York City This subset population studied by Dr Carpenter showed a statistically significant increase in hospital discharges for heart disease hypertension and stroke Interestingly according to CDCs data base the Behavioral Risk Factor Surveillance System this population on average has higher incomes exercises more frequently and conswnes more fruits and vegetables than the average New Yorker Additionally this population has more nonsmokers than the rest of Upstate New Yorks population

WASHINGTON OIFCE

2411 RAYBURN HOUSE OfFICE BUILDING WASH1NcrroN DC 20amp1amp-3222

(Zl2) 225-6335

_fOMetoYht lnchIY

BlNGI-UMTON OfRCE m-wA OfFICE KIGSTCN OFFIepound MIDDLnWJN OffICE MOJmCEllO OffiCE IDDA IfOCML lUllDlNG 123 SOUTH CAYJOA smra U l WAll

m STREET em HALL THlflD flOOR 18 ANWANf LAKf CAD

RIIrlGHAtATON NY 13901 SUITE 201 KINGSTOfl 12401 te JUpoundS STilEET MONTlCfllO NY 12701 (6171773-2768 ITHACA HY 14860 (M5)3~ MlDgtlETOWN NY 109(0 (amptS) 7S1-71e

(607) 273-~388 (845) l4C-3l11

Appendix A Representative Hinchey Letter

Dr Howard Frumkin Page 2 October 4 2006

Since this type of population would typically exhibit better than average cardiac health Im sme you can understand and appreciate my concern and sense of urgency that we must follow up on Dr Carpenters good work and detcnnine what further study is warranted to protect the health of the populatioD living and working aloDg the Hudson River

I than1c you for yom kind attention to my request I look forward to hearing back from you at your earliest convenience

Best regards

Sincerely

~ane~ey Member of Congress

MHlag

49

Appendix B Table 1a IHD Hospitalization Rate Ratios agesexrace adjusted versus additionally adjusted For residents (age 25 and older) of block groups closer versus more distant from the Hudson River in models adjusting for sex age race and block group median household income population density education level percent Hispanic and block group centroid distance to nearest hospital 1990-2005

Adjusted Rate Ratios by Block Group River Proximity (95 Confidence Intervals) IHD (N=101012) AMI (N=35811) Angina (N=4731)

Risk Factor Agesexrace adjusted

Addrsquol adjustment Agesexrace adjusted

Addrsquol adjustment Agesexrace adjusted

Addrsquol adjustment

Block group distance from River

lt frac12 mile vs gt 1 mile 120 (103-140) 102 (098-106) 126 (109-145) 103 (098-109) 160 (137-188) 109 (098-121) frac12 - 1 mile vs gt 1 mile 120 (104-140) 103 (099-107) 125 (109-144) 106 (101-111) 156 (133-183) 113 (102-125)

Sex Male vs female 197 (174-223) 214 (208-220) 211 (187-238) 209 (202-215) 113 (099-129) 113 (106-121) Race Black vs white 120 (102-142) 093 (089-097) 109 (094-127) 092 (087-097) 178 (152-208) 121 (108-136)

AsianPacific Is ldquo 080 (066-096) 073 (069-079) 085 (070-103) 072 (065-081) 075 (055-102) 065 (049-087) Other ldquo 246 (208-291) 377 (363-392) 189 (162-220) 204 (191-218) 257 (215-307) 205 (177-238)

Age 35-44 vs 25-34 809 (630-104) 824 (743-913) 647 (494-849) 658 (568-762) 491 (348-692) 498 (376-660) 45-54 ldquo 333 (261-426) 344 (312-380) 228 (175-296) 230 (200-265) 1539 (1108-214) 157 (120-206) 55-64 ldquo 827 (647-1057) 802 (727-885) 576 (443-748) 516 (449-593) 278 (200-386) 284 (217-371) 65-74 ldquo 1477 (1155-1890) 1384 (1255-1527) 1067 (820-1387) 899 (782-1033) 409 (294-567) 449 (344-586) 75+ ldquo 2124 (1656-2724) 1942 (1760-2142) 2190 (1683-2849) 1849 (1610-2123) 661 (476-919) 695 (533-905)

Population density

2nd lowest vs least dense

126 (121-132) 131 (124-137) 142 (127-159)

more dense ldquo 130 (124-136) 134 (127-141) 131 (116-147) most dense ldquo 120 (114-126) 121 (114-129) 115 (100-131)

Median Household Income

2nd highest vs highest income

105 (101-110) 113 (108-119) 123 (110-137)

lower ldquo 109 (104-113) 123 (117-130) 125 (111-141) lowest ldquo 116 (111-122) 129 (122-137) 162 (142-185)

with less than college

2nd lowest vs least college

114 (109-118) 118 (113-124) 110 (098-122)

higher ldquo 122 (117-127) 126 (119-132) 115(102-129) highest

ldquo

122 (116-128) 128 (120-135) 129 (113-148)

Hispanic 2nd lowest vs least Hispanic

093 (089-097) 094 (0890-098) 101 (091-112)

higher ldquo 092 (088-096) 096 (092-101) 118 (107-131) highest

ldquo

085 (081-088) 088 (084-093) 138 (123-154)

Distance to Hospital

2nd closest vs closest 097 (094-100) 094 (090-098) 107 (098-118) more distant ldquo 090 (087-094) 093 (089-098) 096 (086-107) furthest ldquo 087 (083-091) 091 (087-096) 089 (079-101)

50

Appendix B Table 1b Stroke Hospitalization Rate Ratios agesexrace adjusted versus additionally adjusted For residents (age 25 and older) of block groups closer versus more distant from the Hudson River in models adjusting for sex age race and block group median household income population density education level percent Hispanic and block group centroid distance to nearest hospital 1990-2005

Adjusted Rate Ratios by Block Group River Proximity (95 Confidence Intervals) Total Stroke (N=53208) Ischemic Stroke (N=19371) Hemorrhagic Stroke (N=5995)

Risk Factor Agesexrace adjusted

Addrsquol adjustment Agesexrace adjusted

Addrsquol adjustment Agesexrace adjusted

Addrsquol adjustment

Block group distance from River

lt frac12 mile vs gt 1 mile 116 (104-131) 106 (101-110) 120 (105-137) 108 (101-115) 102 (088-117) 089 (081-099) frac12 - 1 mile vs gt 1 mile 118 (106-132) 106 (102-111) 127 (112-145) 111 (104-117) 115 (100-131) 108 (099-119)

Sex Male vs female 114 (104-125) 127 (123-130) 126 (113-141) 131 (126-136) 114 (103-127) 123 (116-130) Race Black vs white 194 (173-219) 153 (147-160) 233 (204-267) 191 (180-203) 215 (188-246) 190 (173-209)

AsianPacific Isl ldquo 097 (083-113) 088 (080-096) 114 (094-138) 101 (088-116) 142 (114-177) 129 (106-157) Other ldquo 230 (203-260) 232 (219-245) 229 (197-265) 212 (193-232) 360 (310-417) 334 (296-377)

Age 35-44 vs 25-34 309 (251-379) 303 (269-343) 359 (267-483) 362 (289-455) 274 (212-353) 268 (219-329) 45-54 ldquo 114 (938-139) 1087 (970-1217) 149 (113-197) 135 (109-167) 655 (513-836) 605 (500-734) 55-64 ldquo 331 (273-402) 329 (295-368) 433 (329-569) 418 (340-514) 119 (933-152) 109 (904-132) 65-74 ldquo 826 (682-1001 861 (772-959) 1145 (872-1504) 1114 (908-1367) 214 (168-272) 224 (186-270) 75+ ldquo 1890 (1559-2290) 2037 (1828-2269) 2768 (2112-3627) 3060 (2497-3750) 554 (438-700) 587 (490-703)

Population density

2nd lowest vs least dense

126 (120-132) 124 (117-132) 129 (117-142)

more dense ldquo 126 (120-132) 121 (113-129) 142 (128-157) most dense ldquo 124 (118-132) 128 (119-138) 150 (133-169)

Median Household Income

2nd highest vs highest income

106 (101-110) 110 (103-117) 099 (090-108)

lower ldquo 113 (108-118) 113 (106-121) 107 (097-118) lowest ldquo 116 (110-122) 117 (108-126) 114 (102-127)

with less than college

2nd lowest vs least college

111 (107-116) 112 (106-119) 098 (090-107)

higher ldquo 120 (114-125) 120 (113-128) 102 (093-112) highest ldquo 116 (110-122) 119 (110-128) 093 (090-108)

Hispanic 2nd lowest vs least Hispanic

096 (092-100) 094 (088-099) 084 (077-091)

higher ldquo 101 (096-105) 096 (091-102) 084 (077-091) highest ldquo 096 (092-100) 088 (082-093) 078 (070-085)

Distance to Hospital

2nd closest vs closest 095 (091-098) 092 (087-097) 097 (089-105) more distant ldquo 092 (088-096) 091 (086-097) 089 (081-097) furthest ldquo 085 (081-090) 082 (077-088) 088 (079-098)

51

Appendix B Table 2 Sex and Race-Specific Age-Adjusted IHD AMI and Stroke Rates by Income (per 1000)

Men Women white black asian other white black asian other

IHD Lowest IncQ 470 401 492 940 283 354 477 543 Q2 425 339 675 1443 221 236 364 690 Q3 436 301 420 2463 199 233 236 902 Highest Inc Q 399 325 218 4307 160 172 120 1247 AMI Lowest IncQ 180 154 203 269 101 112 221 159 Q2 159 141 300 306 085 088 138 179 Q3 152 114 139 480 078 084 096 214 Highest Inc Q 135 130 046 820 062 065 052 275 Stroke Lowest IncQ 201 292 330 370 166 281 343 261 Q2 182 254 389 466 150 247 274 373 Q3 192 238 231 598 145 245 201 401 Highest Inc Q 177 275 102 905 132 197 070 776

These rates are estimated from the 50 sample of hospitalizations in the 12-county study area included in this investigation The rates are adjusted to the 2000 standard US population

52

CERTIFICATION

The health consultation for the Hudson River PCBs site Residential Proximity to the Hudson River and Hospitalization Rates for Ischemic Heart Disease and Stroke 1990shy2005 was prepared by the New York State Department of Health under a cooperative agreement with the Agency for Toxic Substances and Disease Registry (ATSDR) It is in accordance with approved methodology and procedures existing at the time the health consultation was initiated Editorial review was completed by the cooperative agreement partner

The Division of Health Assessment and Consultation (DHAC) ATSDR has reviewed this public health assessment and concurs with its findings

ATSDR

Page 5: HR CVD Final HCl 2011

Table of Contents

Executive Summary helliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphellipii

1 Introduction helliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphellip 1

2 Background helliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphellip 3 Cardiovascular Disease Hudson River PCB Contamination

` POPs and PCBs PCBs and Cardiovascular Disease Health Outcomes

3 Follow-up Study Data Sources and Methodshelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphellip8 Study Area and Exposure Areas

Hospitalization Data Statistical Analysis

4 Results helliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphellip 13 Demographics Associations with Proximity to the River

Statistical Analysis Proximity to the River and CVD Hospitalization Risk Summary of Results

5 Discussion helliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphellip 19

6 Conclusion and Recommendations helliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphellip25

7 Agency Information (Preparers of the Report) helliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphellip 27

8 References helliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphellip 28

9 Tables helliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphellip 33

10 Figures helliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphellip 44

11 Appendices helliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphelliphellip 48

EXECUTIVE SUMMARY

Background In October 2006 Member of Congress Maurice Hinchey requested that ATSDR conduct a health consultation to assess cardiovascular disease among people residing near the Hudson River A 200-mile segment of the Hudson River from Hudson Falls south to New York City is a National Priorities List (NPL) (Superfund) site due to contamination of the river with polychlorinated biphenyls (PCBs) from GE sites in Hudson Falls and Ft Edward In support of his request Member of Congress Hinchey cited three studies of cardiovascular hospitalizations conducted with similar data and methods These 2004-2005 statewide studies suggest residents of ZIP codes containing hazardous waste sites with persistent organic pollutants (POPs) such as PCBs have elevated risk for hospitalizations for cardiovascular disease including ischemic heart disease stroke and hypertension Analyses conducted separately for ZIP codes next to the Hudson River suggest residents of these specific ZIP codes also have increased rates of hospitalization for cardiovascular disease

In response to Congressman Hincheyrsquos request for follow-up of the 2004-2005 findings for the Hudson River area ATSDR and NYS DOH agreed to conduct a similar group-level study but at a different level of geography the Census block group By using a level of geography that is generally smaller in area and population than a ZIP code this follow-up seeks to more accurately adjust for socio-economic and other factors that affect cardiovascular hospitalizations and to improve upon the geographic classification of relative distance to the Hudson River The follow-up investigationrsquos analyses are restricted to the twelve counties abutting the Hudson River from Washington County south to Westchester County Within the study arearsquos twelve counties cardiovascular hospitalization rates in areas closest to the Hudson River are compared with rates in areas farther from the river

Objectives The follow-up investigationrsquos specific objectives are to increase the studyrsquos ability to control for confounding by socio-economic status through the

use of block groups rather than ZIP codes as the unit of analysis compare hospitalization rates among residents of block groups nearest the Hudson River to

those among residents of block groups farther from the Hudson River and compare the findings of this review to those of the earlier studies

Methods This study examines CVD hospitalization rates and residence near the Hudson River in the 12 counties abutting the Hudson River below Hudson Falls from 1990 through 2005 The hospitalization data for ischemic heart disease and stroke were acquired from the NYS DOH Statewide Planning and Research Cooperative System (SPARCS) established in 1979 to collect detailed records on discharges from hospitals located in New York State Statewide US Census data at the block group level from 1990 and 2000 were used to calculate the number of people in each race sex and age group for the study years

Classification of the population into potential exposure categories was based exclusively on location relative to the Hudson River Two types of exposure proxy categories were developed block group adjacency and block group centroid distance to the river shore For the adjacency categories the hospitalizations were assigned to the adjacent category if the residential address was in a block group directly abutting the Hudson River For the distance categories block

ii

group centroid (population-weighted) distance to the river shore was used to assign hospitalizations in each block group to one of three categories less than one-half mile from the Hudson River (closest) between one-half to one mile (close) from the river and farther than one mile from the river (distant) Multivariable regression analyses were conducted to compare rates of hospitalizations among the exposure categories assigned at the block group level These analyses controlled for age sex and race using individual-level data and population density percent Hispanic median income education and distance to the nearest hospital at the block group level

Results The use of block groups resulted in 2371 block groups as the units of analysis for the study rather than the 368 ZIP codes in the 12-County study area The average population per block group in the study area is 1200 compared to average population of 8000 per ZIP code The smaller geographical area and smaller populations in block groups contributed to improving the accuracy of the assignment of income and other group-level demographics to the hospitalizations included in the study and enhancing control for confounding by socio-economic status (Tables 8a and 8b)

The studyrsquos multivariable analyses showed relatively small but statistically significant elevations of CVD hospitalization risk for residents of block groups categorized as adjacent in close and closest proximity to the Hudson River There is no evidence that residents of the closest block groups (less than frac12 mile) had higher hospitalization rates than residents of the close block groups (between frac12 to 1 mile) If there were a strong association between exposures from living near the river and the risk of CVD hospitalization such a ldquodose-responserdquo might have been shown by the analyses

The data showed strong statistically significant associations between CVD risk and socioshyeconomic status To enhance the control for socio-economic status separate analyses for each income quartile were conducted These stratified analyses showed that residence in block groups near (close closest or adjacent to) the Hudson River was associated with increased risk for CVD hospitalizations among residents of lower income block groups Conversely residence in block groups near the Hudson River was associated with decreased risk for CVD hospitalizations among residents of higher income block groups In other words in analyses that held income constant by looking only within the lower or higher income populations living near the river shows opposite effects depending upon which income group is considered The estimated increased and decreased risks are of sufficient magnitude to cancel each other out when average rate ratios for the four income quartiles are calculated The average rate ratios can be interpreted as showing no evidence that hospitalizations among the general population are elevated for residents living near the river

This type of group-level statistical analysis is not able to precisely identify the reason for these differing results However the strength and direction of associations among variables in the statistical models can be evaluated in light of known risk factors to suggest likely explanations In this case socio-economic status appears to be the most likely explanation for the findings The strong association between socio-economic status and residential location (the indicator of potential exposure) and between socio-economic status and CVD hospitalization risk makes it likely that the small but statistically significant elevations shown in the multivariate models

iii

(Tables 8a amp 8b) are the result of the inability to completely control for socio-economic statusrsquo effects on CVD hospitalization risk The strong effect of socio-economic status on residential location (exposure proxy) and on CVD hospitalization risk makes it likely that these overall multivariable regression results are affected by uncontrolled ie residual confounding by socioshyeconomic status

Because of the similarities in study design and statistical analysis the prior studyrsquos findings can be compared to some degree with the findings of the multivariate regression analyses in the current study The major difference between the studies is the composition of the comparison areas for the analyses Because of this difference as well as other differences described in the report the studies are not entirely comparable

CVD hospitalization rate ratios estimated for living in block groups adjacent closest or close to the river in the current study are substantially lower than the rate ratios for residence in ZIP codes adjacent to the Hudson River in the previously published studies In most cases the current analyses show elevations below 10 in contrast to elevations of 20 to 39 in the prior studies The results of the current study are quite similar to the prior study results in terms of the associations reported for age sex race and income These similarities and the general methodological similarities between the current and prior studies suggest the likelihood that the prior studiesrsquo findings also were affected by confounding by socio-economic status

Conclusions and Recommendations While this type of study can neither prove nor disprove causal links among risk factors and health effects it is useful for examining evidence for unusual patterns that might warrant additional investigation This evaluation of hospitalization data residential location and socio-economic factors in block groups in the 12 Counties abutting the Hudson River revealed a closely coinciding pattern of lower socio-economic status and residential proximity to the Hudson River the proxy for potential exposure and a strong association of lower socio-economic status and risk for CVD hospitalizations The strong associations among residential location socio-economic status and CVD risk prevent us from drawing definitive conclusions from the multivariable regression analyses that showed modest but statistically significant elevations of CVD hospitalization risk for residents of block groups near the river

To draw more definitive conclusions particularly for outcomes such as CVD that are determined by multiple known risk factors studies seeking to investigate the role of PCB or POP exposures will require resource-intensive methods that include gathering individual-level exposure or biomonitoring information as well as medical histories and information on other risk factors

This current investigation did not include environmental or biological sampling data One study did gather such data in the Hudson River region with the highest potential exposures (Hudson Falls Fort Edward) This study detected a statistically significant reduction in the total PCB concentrations in outdoor air samples at a distance of 1200 meters (34 mile) from the river However the PCB levels in outdoor air in this area were lower than those in other communities with known PCB-contaminated sites and similar to levels reported in other locations in the northeastern United States The study found no detectable differences in study subjectsrsquo serum

iv

PCB levels associated with proximity to contaminated portions of the river or other PCB sources (Fitzgerald 2007 Palmer 2008)

It is recommended that the results of the current investigation of CVD hospitalization rates and residence near the Hudson River be shared with NYS residents and other stakeholders with interest in this issue and be submitted for publication in a peer-reviewed journal

v

1 INTRODUCTION

The New York State Department of Health (NYS DOH) has a cooperative agreement with the Agency for Toxic Substances and Disease Registry (ATSDR) to perform environmental exposure and health assessments conduct health statistics reviews and perform epidemiological studies of populations in New York State which may have been exposed to environmental contaminants In October 2006 Member of Congress Maurice Hinchey requested that ATSDR conduct a health consultation to assess cardiovascular disease among people residing near the Hudson River A 200-mile segment of the Hudson River from Hudson Falls south to New York City is a National Priority List (NPL) (Superfund) site due to contamination of the river with polychlorinated biphenyls (PCBs) from GE sites in Hudson Falls and Ft Edward (See Appendix A for Member of Congress Hincheyrsquos letter)

In support of his request Member of Congress Hinchey cited three studies (Sergeev and Carpenter 2005 [ischemic heart disease] Shcherbatykh et al 2005 [stroke] Huang et al 2006 [hypertension]) These statewide studies suggest residents of ZIP codes containing hazardous waste sites with persistent organic pollutants (POPs) such as PCBs have elevated risk for hospitalizations for cardiovascular disease including ischemic heart disease stroke and hypertension Analyses conducted separately for ZIP codes next to the Hudson River suggest residents of these specific ZIP codes also have increased rates of hospitalization for cardiovascular disease

These studies published in 2005-2006 are ecological studies In an ecological study data are analyzed by evaluating risk factors and health outcomes among groups For example rates of disease are compared for populations of ZIP codes with varying median income levels to see if income levels appear to be related to disease outcome rates In these previously conducted studies rates of hospitalization for residents of ZIP codes with hazardous waste sites containing POPs were compared with hospitalization rates for residents of ZIP codes without such waste sites to look for evidence that potential exposures might be impacting health outcomes The studies report statistically significant elevations of cardiovascular disease hospitalizations in ZIP codes with POPs For coronary heart disease and a subset acute myocardial infarction the study showed elevations of approximately 15 and 20 respectively (Sergeev and Carpenter 2005) for stroke a 15 elevation (Shcherbatykh et al 2005) and for hypertension a 19 elevation (Huang et al 2006) in ZIP codes containing waste sites with POPs compared with hospitalization rates in other New York State ZIP codes (excluding NYC) that did not contain waste sites with POPs (Table 1)

In sub-analyses in these studies cardiovascular hospitalization rates for ZIP codes adjacent to the Hudson River were compared with cardiovascular hospitalization rates for ZIP codes in the rest of the state (excluding NYC) identified as not having inactive hazardous waste sites with POPs These sub-analyses showed statistically significantly elevated hospitalization rates of 36 and 39 for coronary heart disease and the subset acute myocardial infarction a 20 elevation for stroke and a 14 elevation for hypertension (Table 1) The Sergeev and Carpenter Shcherbatykh et al and Huang et al articles state that the Hudson River ZIP codes are in an area where people have higher income smoke less exercise more and have healthier diets This information is cited in these articles as evidence that known risk factors related to lower socioshy

1

economic status are likely not the cause for the elevated hospitalization rates in these Hudson River area ZIP codes

In these previously published reports Behavioral Risk Factor Surveillance System (BRFSS) data for entire counties are provided to support statements about the population of Hudson River ZIP codes having higher income and healthier lifestyles However because the ZIP codes adjacent to the Hudson River include only a proportion of the population of the counties abutting the river these county-level BRFSS data may be misleading regarding characteristics of the ZIP codes adjacent to the river (Figure 1) In fact data from the US Census show that the ZIP codes adjacent to the river have lower median income than the counties within which they are located The median income for the total population of the twelve counties adjacent to the Hudson River was $61010 in 2000 while the median income for the ZIP codes adjacent to the river was $45356 This median income for the population of the ZIP codes adjacent to the river is also slightly lower than the statewide (excluding NYC) median income of $47735 (US Census 2000) Thus the Hudson River ZIP codes analyzed in the Sergeev and Carpenter Shcherbatykh et al and Huang et al studies include a population of lower not higher socio-economic status compared to the rest of NYS (excluding NYC)

Because CVD and many CVD risk factors are associated with lower socio-economic status it is especially important to adjust as much as possible for socio-economic status when conducting comparative analyses of CVD hospitalizations or disease incidence or prevalence Regarding the hypothesis about POPs and CVD research studies described in more detail below have suggested possible biological mechanisms for POP exposuresrsquo effects on the heart There is very little research on human exposures to POPs and potential cardiovascular effects however

In response to Congressman Hincheyrsquos request for follow-up of the 2004-2005 findings ATSDR and NYS DOH agreed to conduct a similar study also ecological but at a different level of geography the Census block group By using a level of geography that is generally smaller in area and population than a ZIP code this follow-up seeks to more accurately adjust for socioshyeconomic and other factors that affect cardiovascular hospitalizations and to improve upon the geographic classification of relative distance to the Hudson River In addition rather than comparing hospitalizations among residents of Hudson River area ZIP codes to hospitalizations among residents of all other NYS (exclusive of NYC) ZIP codes with no waste sites with POPs this follow-up investigationrsquos analyses are restricted to the twelve counties abutting the Hudson river from Washington County south to Westchester County Within the study arearsquos twelve counties cardiovascular hospitalization rates in areas closest to the Hudson River are compared with rates in areas farther from the river This more localized approach may also assist with reducing the potential impact of regional differences in medical care practice or reporting factors that could affect the hospitalization data (Ko et al 2007) (Table 2 shows a more detailed listing of the methods used in the prior studies and follow-up study)

Regarding environmental exposures it is important to note that living near a hazardous waste site that contains PCBs or other POPs does not necessarily mean that human exposure from the site has occurred Depending upon the conditions at a site or the specific chemicals exposures may be highly improbable Information on body burden for example PCB levels in blood or on levels of the chemical in off-site air dust or soil is needed to support a claim of exposure The

2

studies published in 2005-2006 did not provide specific information such as environmental or biomonitoring data in support of the assumption that residence in ZIP codes containing hazardous waste sites with POPs or residence in ZIP codes adjacent to the Hudson River resulted in increased exposures to POPs in the population For the purpose of this follow-up we did not conduct environmental or human biological sampling to address this issue Rather this follow-up replicates the prior studiesrsquo approach with some enhancements to better control for confounding particularly confounding by socio-economic status

This descriptive epidemiologic investigation uses already existing hospitalization data from the Statewide Planning and Research Cooperative System and population data from the United States Census to evaluate comparative rates of cardiovascular hospitalizations While this type of investigation can not show proof of cause and effect it can assist in interpreting relationships among indicators of CVD burden and hypothesized exposures associated with distance from the Hudson River This descriptive investigation may also assist in determining whether additional health investigations of these associations are warranted

2 BACKGROUND

Cardiovascular Disease

The widely known risk factors for cardiovascular disease (CVD) in addition to age sex race and family history include tobacco use high blood pressure high serum cholesterol alcohol use obesity low fruit and vegetable intake physical inactivity diabetes low socio-economic status mental ill-health psychosocial stress and the use of certain medications (WHO 2010 NYS DOH 2009 Wing 1988 NYS DOH 2009a)

CVD Definitions CVD is a general term for any disease of the circulatory system The two main types of CVD are ischemic heart disease also termed coronary heart disease and cerebrovascular disease also called stroke Ischemic heart disease (IHD) results from restricted blood flow through the arteries supplying the heart muscle The most serious risk from IHD is acute myocardial infarction (AMI) another term for heart attack AMI occurs when blood flow to the heart is suddenly reduced or stopped usually due to a blockage in a coronary artery Angina is severe constricting pain in the chest often radiating from the area immediately over the heart to the left shoulder and down the arm Cerebrovascular disease refers primarily to stroke the sudden impairment of brain function resulting from interruption of circulation to the brain Ischemic stroke results from blockage of an artery supplying blood to the brain Hemorrhagic stroke occurs due to the escape of blood from a ruptured artery supplying blood to the brain

CVD Burden and Trends CVD is the leading cause of death for men and women of all ages in the United States and in New York State In 1999 for every death from IHD there were five hospitalizations for ischemic disease in NYS (NYS DOH 2009a) Mortality rates from CVD have been declining steadily since the 1980rsquos Research studies estimate that approximately half of the decline is due to improved treatments for CVD and its risk factors such as high cholesterol and high blood pressure and approximately half is due to decreasing incidence of CVD The decrease in incidence is due to trends in health behaviors primarily reduced smoking rates

3

CVD mortality rates do not appear to be declining for all segments of the population however and may actually be increasing among some groups particularly people of lower socio-economic status (Barnett et al 1999) This may be due in part to higher rates of smoking among people of lower socio-economic status as well as to other factors related to socio-economic status and race A growing literature points to multiple causes for CVD and other health disparities including health practices psychosocial stress limited resources discrimination and access to health care (Health United States 2007) Increases in the prevalence of diabetes and obesity in the general population partially offset advances in treatment and favorable trends for other risk factors An increasing ischemic heart disease mortality rate in women ages 35 to 54 since 1997 is attributed to these unfavorable risk factor trends (Ford 2007 Lloyd-Jones 2009) New York State (NYS) and other northeast states have high IHD and low stroke mortality with NYS having the highest IHD mortality and lowest stroke mortality among the 50 states (Howard et al 2009)

CVD Data Sources CVD mortality rates are more easily measured than incidence prevalence or hospitalizations because mortality data from vital records are a comprehensive and relatively accessible data source Information for estimating the incidence and prevalence of CVD outcomes comes primarily from health interview physical examination and record surveys Hospitalization data are available for NYS and the US but are generally collected and analyzed in order to track trends in hospital care and resources expended Actual CVD incidence data validated by physician diagnosis physician examination or reviews of medical records are not directly available for the entire US or for specific geographic areas of NYS Such information is available from population-based cohort studies that follow people over time in selected communities While the concern underlying this research project is to understand CVD levels in the community ie incidence and prevalence we do not have data that directly provide us with this information In the absence of such data hospitalizations are frequently used as indicators of the burden of disease in communities particularly in surveillance or hypothesis-generating investigations such as this one In NYS the Behavioral Risk Factor Surveillance System (BRFSS) an ongoing monthly telephone survey of adults aged 18 years and older provides information about the burden of CVD by age sex race income and education

CVD Risk Factor Data The BRFSS data for NYS show that over 20 of the population aged 65 and older reported having been diagnosed with CVD versus less than five percent of the population under 65 Males were slightly more likely to report having CVD than females Nonshyhispanic whites compared to other ethnic groups combined were slightly more likely to report having CVD None of these slight differences were statistically significant however The only statistically significant demographic risk factors in the BRFSS data were low income and education levels Statistically significant elevations of reported CVD were observed for respondents with incomes below $25000 and respondents with a high school degree or less education (NYS DOH 2009a)

Another factor that can affect the outcome measure hospitalizations is the likelihood of mortality prior to reaching the hospital Studies show a variety of factors affect whether an individual experiencing a heart attack or stroke outside the hospital survives to be counted in hospitalization data These factors include lack of knowledge about personal risk or prior disease socio-economic status race marital status and distance to the hospital (Galea 2007

4

Barnett 2006 Ayala 2003) Population-based studies report sudden cardiac death usually outside the hospital to be the first overt manifestation of heart disease for 40 to 60 of all cases ( Kannel 1987 deVreede-Swagemakers 1997 UK Heart Attack Study 1998) An analysis of out-of-hospital mortality from AMI in Pennsylvania in 1998 showed that in relatively rural areas where travel time to the hospital was 25 minutes or longer 72 of all AMI deaths occurred outside the hospital compared with a statewide average of 49 of AMI deaths occurring outside of the hospital (OrsquoNeill 2003) In New York State 55 of IHD deaths occur outside the hospital compared to 35 of stroke deaths (NYS DOH 2009b)

Hudson River PCB Contamination

During an approximate 30-year period ending in 1977 the General Electric (GE) plants in Hudson Falls and Fort Edward used PCBs in the manufacture of electrical capacitors and discharged large quantities of PCBs into the upper Hudson River Estimates of the total direct discharges to the river are as high as 1330000 pounds In 1983 the US EPA classified a 200shymile section of the Hudson River from Hudson Falls to New York City as a National Priority List site making it one of the largest Superfund sites in the United States In 2002 the US EPA issued a Record of Decision calling for targeted environmental dredging and removal of PCB-contaminated sediment in the upper River (USEPA 2002) The dredging began in May 2009

The Superfund environmental investigations and planned remedial actions have focused on the areas of the river near Hudson Falls and Fort Edward where the highest levels of PCBs in sediments are located The 2002 Record of Decision provides information about the human health risk assessment conducted for the Upper and Mid-Hudson and the ecological risk assessment conducted for the Upper and Lower Hudson River (USEPA 2002) Fish contamination and fish consumption are the primary concerns at the site

In 1976 fishing was banned in the upper river and in 1995 a catch and release policy replaced the ban NYS DOH continues to recommend people eat no fish from the Upper Hudson River that children under age 15 and women of childbearing age eat no fish from the entire 200 mile stretch of river below Hudson Falls and that the general population eat none of most species of fish caught between the federal dam at Troy and Catskill approximately 40 miles south of Troy The Lower Hudson remains closed to commercial fishing for striped bass and eight other species

The fishing bans and advisories were issued because ingestion of fish is the major potential pathway for exposure for the general population Many previous studies have shown relationships between consumption of fish from contaminated waters and human PCB levels Another potential pathway for exposure less frequently studied to date is inhalation of PCBs NYS DOH and ATSDR launched ldquoThe Hudson River Communities Projectrdquo in 2000 to address this issue (More information on this project is provided below)

POPS and PCBs

The category ldquopersistent organic pollutantsrdquo (POPs) includes man-made chemicals that are by definition persistent in the environment Examples of POPs are polychlorinated biphenyls

5

(PCBs) polychlorinated dibenzo-p-dioxins (PCDDs) polychorinated dibenso-furans (PCDFs) and pprsquo-diphenyldichloroethene (DDE) a breakdown product of the pesticide dichlorodiphenyltrichloroethane (DDT) These chemicals are stored in human fat tissues accumulating over an individualrsquos lifetime They also accumulate in fish and other species thereby concentrating up the food chain Levels of most POPs measured in blood serum of Americans have decreased in the last few decades An exception is polybrominated diphenyl ethers (PBDEs) levels of which have been increasing in recent years While levels of many POPs are declining a large number continue to be detectable in blood for most of the US population (USDHHS 2009)

This follow-up investigation focuses on the Hudson River where the POPs of concern are PCBs PCBs are a group of synthetic chlorinated compounds manufactured in the US through 1977 Production was halted due to concerns about the persistence of PCBs in the environment and the toxicity of PCBs shown in animal models PCBs were used primarily by the electrical industry for capacitors and transformers and were also used in hydraulic fluids fluorescent light fixtures flame retardants inks adhesives carbonless copy paper paints pesticide extenders plasticizers and wire insulators PCBs can be oily liquids or solids and some are volatile and can exist as vapor in air PCBs in the environment are generally mixtures of the many different types known as congeners (ATSDR 2000)

The variety of types of PCBs adds to the complexity of studying adverse animal and human health effects Increasingly PCB research focuses on specific PCB congeners and groups of congeners because specific types of congeners appear to have different mechanisms of action and toxicity Dioxin-like PCBs (DLPCBs) are an important subset of PCBs due to concerns about their toxicity The focus on dioxin-like PCBs is paralleled by increasing research on the group of POPs with dioxin-like properties This type of PCB and POP research assigns toxic potency values relative to dioxin to various PCBs and POPs to provide an overall toxic equivalency value to specific PCB and other POPs

From 2002 through 2007 NYS DOH and SUNY Albany School of Public Health researchers conducted the Hudson River Communities Study among residents of Hudson Falls and Fort Edward the villages where the PCB discharges from two GE electrical capacitor plants occurred The comparison community was upriver from the discharges in Glens Falls This studyrsquos evaluation of exposures included measuring PCBs in outdoor air and in blood serum The study found that PCB levels in outdoor air were statistically significantly higher in the Hudson FallsFort Edward area (072 nanograms per cubic meter) than in the upriver Glens Falls area (040 nanograms per cubic meter) but that the PCB concentrations in the Hudson FallsFort Edward area were lower than those in other communities with known PCB-contaminated sites and similar to levels reported in other locations in the northeastern United States A statistically significant reduction in the total PCB concentrations in outdoor air samples was observed at a distance of 1200 meters (34 mile) from the river from 0760 ngm3 to 0497 ngm3 (Fitzgerald et al 2007 Palmer et al 2008)

The average PCB concentrations in serum for the study and comparison populations did not differ significantly (307 ppb wet weight versus 323 ppb) The average toxic equivalency quantities (TEQ) for dioxin-like PCB congeners for the study and comparison area populations

6

also did not differ In addition the study found no detectable differences in study subjectsrsquo serum PCB levels associated with proximity to contaminated portions of the river or other PCB sources The study did find that lifetime consumption of Hudson River fish was associated with higher serum PCB levels (Fitzgerald 2007 Palmer 2008) Another recent biomonitoring study of Brooklyn Manhattan and New Jersey fisherman who consumed versus those who did not consume their catch from the lower Hudson River and New YorkNew Jersey harbor suggests eating fish from these waters is not associated with an increased body burden of PCBs or most other organochlorines This study did not address the inhalation pathway (Morland 2008)

PCBs and Cardiovascular Disease Outcomes

Research studies on adverse health effects associated with PCB exposure have focused on many types of outcomes organs and biological systems including cancer liver reproductive and developmental effects thyroid endocrine immune system and neurological effects as well as specific effects on the skin and eyes Relatively little research has focused to date on associations between PCB exposures and CVD outcomes The majority of human studies have assessed health effects in occupational groups having relatively high-level exposures among people consuming fish from contaminated waters or people exposed to relatively high levels due to accidental contamination of food (USDHHS NIH 2010)

PCBs have been shown to produce a variety of toxic effects in animals These effects include cancer allergies hypersensitivity damage to the central and peripheral nervous systems reproductive disorders and disruption of the immune system (ATSDR 2000) In contrast research to date does not tell us whether specific types of human health effects have occurred from the low-level exposures experienced by the general population Some human epidemiological studies of relatively low levels of PCDDs PCDFs and PCBs suggest possible relationships not only with neurodevelopment of infants but also diabetes and a condition called insulin resistance conditions that increase CVD risk (Longnecker 2001 Carpenter 2006 Arisawa 2005)

In 1998 a study of a population in Seveso Italy exposed to relatively pure TCDD (dioxin) from an industrial accident showed increased mortality from CVD respiratory diseases and diabetes (Bertazzi 1998) These findings suggested the possibility that other dioxin-like chemicals such as dioxin-like PCBs might increase risk for these diseases Some more recent animal studies described below suggest possible mechanisms of action for PCB effects that increase risk for CVD diabetes and hypertension Another relevant source of information is from research using the biological monitoring data from the National Health and Nutrition Examination Survey (NHANES) project to look at the association between PCB levels in blood and self-reported diabetes and heart disease The body of evidence based on analytical human epidemiological studies remains relatively sparse however

Some recent studies using NHANES biomonitoring data show associations between PCB levels and CVD (Lee 2006 Lee 2007 Ha 2007 Everett 2008) These studies are cross-sectional (comparisons at one point in time) evaluations of PCB levels measured in blood samples from individuals and self-reported CVD and diabetes a risk factor for CVD While these studies use individual-level biomonitoring and self-reported health outcome data they are considered to be

7

relatively weak in terms of providing evidence of causality because both exposure and outcome are measured at the same point in time Given the nature of NHANES data collection and reporting practices these studies are not able to control for possible geographical confounding which may occur if urban versus rural residence is associated with higher PCB levels and higher risk for CVD for example The higher PCB levels among African Americans in these data suggest urban residence may be playing a role in the PCB data

The results of animal studies suggest multiple mechanisms by which PCB exposures could affect cardiovascular risk as well as multiple types of adverse health outcomes One hypothesized mechanisms of action specifically for dioxin-like PCBs is that they may have anti-estrogenic effects which impact CVD risk factors such as serum cholesterol levels and blood pressure (Lind 2004) Several studies suggest PCBs interact with a specific receptor (AhR) and increase cellular oxidative stress causing inflammation cell injury and cardiovascular dysfunction (Hennig 2002) Similarly some studies suggest PCB-induced inflammation of fat cells leads to obesity and increased risk for coronary artery disease (atherosclerosis) (Arsenescu 2008)

3 FOLLOW-UP STUDY DATA SOURCES AND METHODS

This follow-up investigation will provide perspective on the 2004-2005 findings (Sergeev and Carpenter 2005 Shcherbatykh et al 2005) by producing results from analyses conducted at a more detailed level of geography the block group rather than ZIP code in order to better control for confounding When the term confounding is used by epidemioogists it refers to the problem that can arise when there is a factor such as income level that is not the factor of interest in the study but that is related to the risk factor being studied (residential location ndash proxy for exposure) and to the disease being studied (CVD) This type of factor is labeled a ldquoconfounderrdquo because it can confound ie interfere with or confuse the evaluation of associations among the variables of interest in the data All types of epidemiological studies face issues of confounding but ecological ie group-level analyses face additional challenges related to potential confounding because analyses can not account for some important individual differences within the groups that are the units of analysis (Morganstern 1998b Hertz-Picciotto 1998) This investigation was able to control for each individualrsquos age sex and race but not for individual-level risk factors such as income or education level

The follow-up investigationrsquos specific objectives are to

improve the ability to control for confounding by socio-economic status through the use of block groups rather than ZIP codes as the unit of analysis

compare the hospitalization rates among residents of block groups nearest the Hudson River to those among residents of block groups farther from the Hudson River

compare the findings of this follow-up study to those of the earlier studies

Study Area and Exposure Areas

The overall study area is comprised of the 12 counties with geographic areas abutting the Hudson River south of the source of PCB contamination from the General Electric facility in Hudson Falls The 12 counties are Westchester Rockland Putnam Orange Dutchess Ulster

8

Columbia Greene Rensselaer Albany Washington and Saratoga US Census boundaries are used to define the study area because census population estimates provide the data for the study area populationrsquos size and the populationrsquos age sex and race distribution Because the study period spanned 1990 through 2005 and populations change over time 1990 and 2000 Census data were used to linearly interpolate the between-year populations from 1991 through 1999 for each block group 2000 data were used without extrapolation to estimate block group populations from 2001 through 2005

Because block group boundaries change between the Census years 1990 block populations were normalized to fit 2000 block group boundaries before the estimation of sex age and race group specific populations In the digital map 2000 block group boundaries were overlaid on the 1990 block centroids to assign 1990 block population counts to the corresponding 2000 block group boundaries Then 1990 block populations were summed to fit the block group boundaries Next population counts by age group (0-14 15-24 25-34 35-44 45-54 55-64 65-74 75+) sex (male female) and race (white black asian other) were estimated for each year at the block group level This investigation includes all ages over 25 years and the study population is divided into four race categories white black or African-American Asian and ldquoOtherrdquo The Pacific Island group was combined with the Asian group and American Indian ldquootherrdquo and ldquomultiple racesrdquo categories are combined into a larger ldquootherrdquo category

Classification of the population into potential exposure categories for this study is based exclusively on location relative to the Hudson River Two types of exposure proxy categories were developed block group adjacency and block group centroid distance to the river shore For the adjacency categories block groups were assigned to the adjacent category if they directly abutted the Hudson River and all other block groups in the 12 counties were assigned to the nonshyadjacent category

For the distance categories a central point (centroid) within each block group was identified and used to assign block group centroid distance to the river shore The block group centroids were population-weighted which means the geographic distribution of households within 2000 census blocks were used to locate block group centroids based on the geographic population distribution By locating a population-weighted block centroid rather than a more straightforward distance centroid the assignment of block group distance to the river shore more accurately reflects distance to the river for the majority of block group residents Buffers of one-half mile and one mile from the Hudson River boundary were created for the assignment of each block group into one of the three distance categories less than frac12 mile (closest) frac12 to one mile (close) and farther than one mile from the river (distant)

Distance categories were selected a priori at the start of the study Because environmental sampling data were not collected along the length of the river included in this investigation we are not able to determine whether these distance categories represent actual differences in exposures As discussed previously a statistically significant reduction in the total PCB concentrations in outdoor air samples was observed at a distance of 1200 meters (34 mile) from the river from 0760 ngm3 to 0497 ngm3 in a study conducted in the Hudson Falls Fort Edward area near the source of PCBs (Fitzgerald 2007 Palmer 2007)

9

For the adjacency categories of the 2371 block groups in the study area 255 (11) were classified as adjacent to the Hudson River and 2116 (89) were classified as non-adjacent (Figure 2) For the distance categories 225 (9) were classified as closest to the river 216 (9) as close and 1930 block groups (81) as distant from the river

Comparison of Figures 1 and 2 show that block groups include smaller geographic areas than ZIP codes and that block groups are relatively smaller in area in urban versus rural areas The study area includes a total of 368 ZIP codes compared to a total of 2371 block groups ZIP code average population in the study area is approximately 8000 compared to block group average population of 1200 Comparison of Figures 2 and 3 shows that using population-weighted centroid distance to the river produces different exposure areas than adjacency Fifty-three (20) of the adjacent block groups were categorized for the distance indicator as distant because the population in the block group was concentrated in the area farther than one mile from the river On the other hand about 11 of the 2116 non-adjacent block groups were classified as closest (77) or close (162)

Hospitalization Data

Hospitalizations for ischemic heart disease (IHD) and cerebrovascular disease (stroke) were included in this study More specifically hospital discharge records that listed as the principal diagnosis either IHD (ICD-9-CM 410-414 and 4297) or cerebrovascular disease (ICD-9-CM 430-436) and subsets of each of these categories similar to the subsets evaluated in the Sergeev et al and Shcherbatykh studies were included The subsets are AMI and angina subsets of IHD and ischemic and hemorrhagic stroke which are subsets of cerebrovascular disease (stroke) (Principal diagnosis represents the illness for which the person was admitted to the hospital) Angina is an outcome not evaluated in the prior studies It was added here for comparative purposes because it represents an outcome appropriately addressed by primary outpatient rather than inpatient hospital care Angina is one of a group of outcomes included in the recently developed national ldquoPrevention Quality Indicatorsrdquo (USDHHS AHRQ 2007) as an indicator for geographic areas or populations for whom primary care is inadequate Table 3 lists the specific ICD-9-CM codes and definitions included in this investigation

While one of the previously published studies (Huang et al 2006) showed elevations in hypertension (high blood pressure) rates this outcome is rarely listed as the principal reason for hospitalization As a secondary diagnosis however hypertension is extremely common with substantial overlap with the primary diagnoses of IHD and stroke In addition hypertension hospitalizations were not as elevated in the ZIP codes adjacent to the Hudson River as the IHD and stroke outcomes As a practical matter geocoding the hypertension hospitalization data (1364000 hospitalizations) would overwhelm the resources for this project For these reasons hypertension was not included in this follow-up investigation

The source of the hospitalization data was the NYS DOH Statewide Planning and Research Cooperative System (SPARCS) established in 1979 to collect detailed records on discharges from hospitals located in New York State Persons seen in the emergency department but not admitted are not included in the SPARCS data Hospitalizations at US Veterans Administration

10

Hospitals are also not included in this dataset We identified the hospital transfers in the data and excluded them so that individuals are only counted once for the same event

The NYS DOH Data Protection Review Board for SPARCS hospitalization data approved our use of hospitalization records for this investigation and provided approximately 460000 records with a principal diagnosis of either IHD or stroke Because of the large number of cardiovascular hospitalizations and this studyrsquos requirement for geocoding each hospitalized personrsquos address we selected a 50 random sample of the hospital discharges with principal diagnoses of IHD or stroke for inclusion in the study (229948 records) Approximately one percent of these records (2737) were eliminated from the study because they contained no residential address resulting in 227211 hospitalization records 24939 of these records were eliminated after geocoding because they were not within the 12 county study area This left 202272 records remaining in the study

For geocoding to the block group level the hospitalization street addresses were modified as needed using US Postal Service standards The addresses were assigned geographic coordinates using commercially available geocoding software (MapMarker 2004) A combination of land parcel data and street centerline files were used to assign geographic coordinates No contact was made with cases parents legal guardians or next of kin of cases to determine residential locations

Once geographic coordinates were assigned to cases through address-matching the case locations were overlaid onto digital maps of the study area using a geographic information system so that the number of observed cases falling within the study area boundaries could be determined and a 2000 Census block group could be assigned to each hospitalization In order to protect confidentiality no maps of individual case locations are provided Geographic coordinates were not able to be assigned for some addresses for example rural routes or post office boxes Internet searches were conducted for addresses that were located in facilities such as nursing homes and trailer parks

While fewer than one percent of records were missing sex or age information approximately 15000 records seven percent were missing race information Evaluation of the geographic and temporal distribution of these records showed they were not randomly geographically distributed 47 of the records with no race information were from one hospital for that hospital race was missing from 100 of the records in all years except one after 1997 mid-way through the study time-frame Evaluation of other records missing race showed similar patterns of 100 of records missing race in specific years In order to prevent biased hospitalization rate estimates records missing race were assigned race randomly assuming a race distribution equal to that of the individualrsquos residential block group

Statistical Analysis

This investigation used negative binomial multivariable regression models the statistical methods used in the previous studies (Sergeev and Carpenter 2005 [IHD] Shcherbatykh et al 2005 [stroke]) The follow-up studyrsquos analyses compared hospitalization rates for populations in block groups adjacent or in close proximity to the Hudson River to rates for populations more

11

distant from the Hudson River while adjusting for the potential influence of other factors Sex age and race-specific hospitalization count data comprise the dependent variable and the statistical methods attempt to predict the counts using the sex age and race distributions from Census data as well as block group-level demographics such as median income Models were first estimated using basic Census information for age sex and race Then more fully adjusted models were estimated using information at the block group level for income education population density Hispanic ethnicity and distance to the nearest hospital

More specifically each hospitalized personrsquos block group was used to assign quartile categories of block group median household income of population with less than a college education population density of the population identified as Hispanic and distance from the block group centroid to the nearest hospital These variables were selected in advance The population density variable was selected to account for potential effects on hospitalization rates of large differences in population density from the relatively rural northern counties to the more urban southern counties in the study area of the population identified as Hispanic was included because Hispanic ethnicity may be associated with CVD risk but Hispanic ethnicity was not available in the hospitalization records Distance to the hospital was included because studies show that shorter distances increase the likelihood that individuals with AMI and stroke survive to be admitted to the hospital

As described previously the literature about risk factors for CVD shows strong associations between lower income and education and higher CVD risk Regarding the effects of income and education on hospitalization rates the literature from analytic studies specifically of hospitalizations for CVD is quite sparse Most analytic studies focus on prevalence incidence or mortality in population-based studies or cohort studies and there are not many of these An example of a study that was useful as background is a study of income level and CVD mortality that showed strong associations of lower income and higher CVD mortality for men and women with the relationship for men being linear throughout the range of income levels while for women the relationship was only strong at levels of income below the median (Rehkopf 2008)

This follow-up studyrsquos approach was to use the same set of variables for the more fully adjusted models using quartile cut points for all the sexrace groups and types of CVD diagnostic subsets The 2000 Census data were used to create the population-weighted quartile categories to be used in the statistical analyses The quartile cut points are shown below

population density(persons per square mile)

median 1999 household income

with less than a college education

with Hispanic ethnicity

Hospital distance

Quartile 1 61 - lt507 $2499-lt$40968 0 - lt31 0 - lt2 lt25 (median) Quartile 2 507 - lt102 $40968 - lt54650 31 - lt43 2 - lt5 25 - lt6 (50-75) Quartile 3 102 - lt6410 $54650 - lt72944 43 - lt55 5 - lt10 6 ndash 11 (75-90) Quartile 4 6410 - 94372 $72944 - 200001 55 - 100 10 - 86 11 ndash 45 (90-100)

SAS version 9 provided the statistical programs used to estimate hospitalization rates for exposure proxy categories and demographic categories The models provide ldquocontrast

12

estimatesrdquo ie rate ratios for one category versus a reference category of a variable and 95 confidence intervals (CI) for these rate ratios For the location categories rate ratios for the areas adjacent are compared to areas not adjacent to the river (reference) and areas closest and close to the river are compared to areas distant from the river (reference) If a rate ratio is greater than 100 the model has estimated an elevation of hospitalizations in that category of the study population compared to the reference population If the rate ratio is less than 100 then there are fewer than expected hospitalizations compared to the reference population

The magnitude of the elevation or deficit is also estimated by the rate ratio For instance if the hospitalization rate is twice as high in the adjacent population it would result in a rate ratio of 200 while a 50 higher rate would result in a rate ratio of 150 If the hospitalization rate in the comparison group was only half as big as in the reference group this would result in a rate ratio of 050 The 95 confidence interval assists with determining whether the observed differences are statistically meaningful ie statistically significant The confidence interval provides the range in which there is a 95 probability of including the true rate ratio

In addition to directly assess potential confounding we examined the associations between income and exposure proxy categories and between income and CVD hospitalization risk Because the modeling process provides rate ratios that provide comparisons among groups but not the rates themselves we also calculated the rates per population for each sex race and income category in order to directly observe the relative magnitudes of CVD hospitalization rates Finally analyses stratified by income quartile were also conducted in order to assess the potential role of confounding by income The stratified analyses consist of negative binomial regressions conducted separately for each quartile of income

4 RESULTS

Demographics Associations with Proximity to the River

Study Population Table 4 provides detailed demographic characteristics for the total study population and the populations of the adjacent versus non-adjacent and nearer versus farther from the river block groups from the 2000 Census The total population of the study area for this one year of the study time period is 2924631 persons The first row of Table 4 shows that approximately 11 percent of the study area population is classified as living in block groups adjacent to the Hudson River and 89 live in non-adjacent block groups Approximately 8 of the studyrsquos population lives in the closest block groups approximately 10 lives in close block groups and approximately 82 in distant block groups The table also shows that approximately 81 of the total study population is identified as white 9 black 3 Asian less than 01 Pacific Islander 02 American Indian 4 ldquootherrdquo and 2 ldquomultiple racesrdquo

The 2000 Census data (Table 4) also show that the populations in block groups adjacent and nonshyadjacent to the river are similar in terms of the percent of minority population but the adjacent block groups include a smaller percentage of persons who have attended college and a higher percentage of households below the poverty level Median income is $47320 for block groups adjacent versus $62668 for block groups not adjacent to the river Data for the study area from the 1990 Census show similar distributions (data not shown)

13

For the distance to the river categories there is a gradient for the three levels of distance from the river with the closest block groups on average having higher percentages of minorities smaller percentages of persons who have attended college and higher percentages of households below the poverty level (Table 4) The populations of the close and closest block groups are more similar to each other than to the populations of the distant block groups The population of the closest block groups is estimated to be 19 black close block groups 14 black and distant block groups 8 black The proportions for the ldquootherrdquo category used in the analyses which includes multiple races and American Indians are approximately 12 in both the closest block groups and in the close block groups compared to 5 in the block groups most distant from the river Median income for the closest block groups is $40748 close block groups $44962 and most distant block groups $64847

The bar chart below illustrates the correlations between US Census 2000 indicators of block group socio-economic status and block group centroid (population weighted) distance from the river for the study population

2000 Census Demographics for Study Population by Distance to River Category

black other hispanic no college below poverty

of pop ulati on

0

10

20

30

40

50

60

lt12 mile

12 to 1 mile

gt1 mile

Table 5 shows selected demographic information for the person-year estimates used in the study analyses for the 1990-2005 study period Person-year data represent individuals from Census estimates counted repeatedly for each year of the study timeframe As described in the Methods section the estimation of person years used interpolation to account for population changes from 1990-2000 and techniques for taking account of block boundary changes The person-years estimates for demographic categories (complete data not shown) show similar patterns as the 1990 Census data (data not shown) and 2000 Census data (Table 5) Table 5 for example shows nearly identical percentages of population in each of the river distance categories as Table 4

Table 5 also shows person year information for income quartiles and exposure proxy categories as well as race and exposure proxy categories that is consistent with the Table 4 data for the

14

2000 population Table 5 row percentages show for example that while approximately 11 of the total study population live in adjacent block groups approximately 19 of the population in the lowest income quartile live in adjacent block groups By design approximately 25 of the total study population is in each of the income quartiles Table 5 column percentages show however that in the adjacent block groups 41 of the population is in the lowest income quartile and only 9 are in the highest income quartile

For the block groups within frac12 mile of the river the distribution among income quartiles is even more skewed towards the lowest income quartile 61 of the population living within frac12 mile of the river is in the lowest income quartile and only 6 is in the highest income quartile Given that approximately 25 of the total study population is within each income quartile these percentages indicate that living within the block groups closest to the river is associated with more than a doubling of the likelihood of being in a block group in the lowest socio-economic status quartile 436 of those living in the block groups close to the river (12 to one mile from the river) are in block groups in the lowest income quartile While less than a doubling this indicates that living in the close block groups is associated with a 75 increase in the likelihood of being in the lowest quartile for socio-economic status

Hospitalizations Table 6 shows the process of removing hospitalization records from the study for a variety of reasons The first column includes the 202272 records that contained a residential address and were located within the 12-county study area as described in the Methods section The next five columns show the records remaining after data cleaning exclusion of transfers elimination of records with missing covariates and exclusions due to poor geocoding results Approximately 33000 records were eliminated because they were transfers from one hospital to another for the same event For approximately 1100 records geocoding was inadequate for assigning block group for imputing race as described in the Methods section 336 records were missing sex (11) or age (325) Of the 167733 records remaining 154220 (92) were able to be adequately geocoded to a street address for assignment in a specific block group While approximately eight percent of hospitalizations were eliminated due to inadequate geocoding Table 6 shows that the percentages of hospitalization records for white black Asian and ldquootherrdquo categories and the distribution among diagnostic categories change very little after these records were dropped

The final column of Table 6 shows that of the 154220 hospitalizations in the analysis 101012 IHD hospitalizations comprise the largest health outcome category and are 65 of the hospitalizations in the study AMI hospitalizations (35811) account for 35 of the IHD hospitalizations and angina (4731) accounts for five percent of the IHD hospitalizations Stroke hospitalizations (53208) comprise 34 of the studyrsquos hospitalizations Approximately 36 of the stroke hospitalizations are for ischemic stroke (19371) and approximately 11 are for hemorrhagic stroke (5995)

Table 7 shows the distributions of hospitalizations among the four income quartiles While each income quartile (by definition) contains approximately 25 of the study population for all types of hospitalizations we see that the lowest income quartile includes more than 25 of hospitalizations from a minimum of 286 (IHD) to a maximum of 356 of hospitalizations (angina) The highest income quartile includes fewer than 25 of hospitalizations for each CVD

15

category ranging from 182 for angina to 224 for hemorrhagic stroke These deviations from the 25 quartile distribution show a strong linear pattern with the largest percentage of hospitalizations coming from residents of block groups in the lowest income quartile and the smallest percentages coming from residents of the most wealthy block groups For AMI the lowest income quartilersquos share of hospitalizations is 50 higher than the highest income quartilersquos share of hospitalizations (30 vs 20) IHD and stroke show similar differences and angina shows a doubling with the lowest income quartile contributing 36 of hospitalizations compared to the highest income quartilersquos 18 share of hospitalizations

Table 7 also shows the distribution of hospitalizations among the race categories and the study population race category distribution for comparison Blacks show lower percentages of IHD and AMI than their population percentages and higher percentages of angina and stroke While full presentation of race and sex findings is beyond the scope of this report the lower than expected percentages of IHD and AMI among blacks who on average have lower socioshyeconomic status and are at greater risk for CVD hospitalizations raised questions that were addressed by calculating hospitalization rates per 1000 population for each sex and race group for each income quartile (Appendix B Table 2) Figures 4 5 and 6 show these rates for the white black and Asian categories (These rates for the ldquootherrdquo category are comparatively very high and are not shown on these graphs This issue is addressed in the Discussion section) Figures 4 5 and 6 show the consistent and strong effects of income quartile on hospitalization rates for AMI and stroke for whites blacks and Asians of both sexes The other outcomes show similar patterns (The ldquootherrdquo group showed an opposite pattern [See Discussion Section])

Demographics Summary

The strong associations of lower income with residence closer to the river (Table 5) and lower income with higher hospitalization rates (Tables 7 amp 8) create a difficult challenge for this investigation The multivariable analyses of hospitalization rates attempt to control for socioshyeconomic differences However because these income differences which strongly affect hospitalization rates are also strongly correlated with the gradations of exposure (river distance) it will be difficult to disentangle the influence of socio-economic status from the influence of potential exposures

Statistical Analyses Proximity to the River and CVD Hospitalization Risk

All the variables described above were included in the multivariable regression models and almost every quartile category for every variable showed statistically significant and expected types of effects for all the CVD hospitalization categories (Appendix B Tables 1a and 1b) The following presentation of results focuses on the rate ratios for the exposure proxy variables Table 8a (IHD) and 8b (stroke) show the regression modeling results Results are shown first for models that include only age sex and race in addition to the exposure proxy categories and second for models that also include median income education population density Hispanic ethnicity and distance to nearest hospital The tables provide numbers of hospitalizations and person-years to assist with interpretation For example for total IHD we can see that while 108 of the study population lives in adjacent block groups 122 of the hospitalizations are among people in adjacent block groups This difference of 14 percentage points is the basis for

16

the finding of a statistically significantly elevated rate ratio (risk) for IHD among people living in the adjacent block groups (RR=133 CI 112-157)

Looking across the columns of Tables 8a and 8b we can see the rate ratios for each of the outcomes for the adjacent versus non-adjacent block groups and the results are similar to those for IHD The elevated rate ratios range from 123 for angina to 136 for AMI The only rate ratio not elevated in these first models adjusting only for age sex and race is for hemorrhagic stroke Going down the rows of the Table the next set of rate ratios are for the distance to the river categories The findings are very similar to those for the adjacent block groups Here however we have two exposure proxy categories allowing comparison of the two These rate ratios show no evidence that living in the block groups closest to the river (lt12 mile to the river) produces greater risk than living in the block groups close to the river (12 to 1 mile from the river)

The next set of rate ratios are for models with additional adjustment for the demographics of each hospitalized personrsquos block group as well as distance to the nearest hospital As expected adjustment for block group median income and other demographics accounts for some of the variability in hospitalization counts and thereby reduces the magnitude of the rate ratios estimated for the river adjacency and distance categories The rate ratio elevation for total IHD for example is reduced from 133 to 107 (RR=107 CI 104-111) The elevated rate ratios in these more fully adjusted models for the adjacency analyses are of similarly small magnitude ranging from 106 for AMI to 110 for ischemic stroke Four of the six outcomes show statistically significant elevations Again only hemorrhagic stroke shows no evidence of an elevated rate ratio

For the river distance categories none of the rate ratios show any evidence that living in the block groups closest to the river (lt12 mile to the river) produces greater risk than living in the block groups close to the river (12 to 1 mile) Of the six types of outcomes only two show statistically significantly elevated rate ratios for living in the closest block groups (total stroke and ischemic stroke) For the close block groups four out of six outcomes show statistically significantly elevated rate ratios The statistically significantly elevated rate ratios vary from 106 (AMI stroke) to 113 (angina)

To address the strong evidence described earlier for confounding in these data ie the association of lower income with both CVD hospitalizations and residence closer to the river we also conducted the analyses separately for each quartile of income These results are presented in Table 9 The first results are from analyses adjusting only for age sex and race and the second set of results are from models including the additional demographic factors For simplicity rate ratios are shown only for block groups closest to the river compared as in the other models to block groups farther than one mile from the river (Results were similar for the other distance categories (data not shown)

For the two lowest income quartiles Table 9 shows results similar to those for the study population as a whole (Table 8a 8b) However for the two highest income quartiles which represent frac12 of the study population Table 9 shows a consistent pattern of reduced rate ratios for residence in the closest block groups within frac12 mile of the river In the highest income quartile

17

these reduced rate ratios are statistically significant for almost all of the outcomes in both the partially and more fully adjusted models In addition the magnitude of the rate ratio estimates ie their departure from 100 is relatively high For the population living in block groups with the highest median income (income gt$73000) living within frac12 mile of the river is associated with having about 70 of the risk for IHD AMI and stroke (CI=068 069 069) as the population in equally high income block groups farther than one mile from the river In other words among the highest income quartile living this close to the river is associated with an estimated 30 reduction in risk for these types of hospitalizations For angina (CI=028) there is an estimated 70 reduction in risk and for ischemic stroke (CI=058) the reduction is estimated to be 40 These stratified analyses show that in one-half of the study population the two higher income quartiles there is no evidence of elevated hospitalization risk for populations in block groups closer to the river On the contrary reduced rate ratios are shown for all outcomes in the higher income quartiles for residents of block groups within frac12 mile of the river compared to those living farther than one mile from the river

Weighted average rate ratio estimates are a standard method for estimating a general population finding from stratified analyses and are also provided in Table 9 These average rate ratios range from 078 to 102 showing no evidence of elevated risk for the overall population associated with residence within frac12 mile of the river compared to living farther than one mile from the river Results for the other categories of river proximity showed similar results (data not shown) (Given the evidence of confounding interfering with analyses of the entire study population and the opposing findings depending on income quartile no confidence intervals are provided here for the overall population-weighted average estimates)

Summary of Results

In summary the multivariable regression results for the entire study population (Tables 8a 8b) show statistically significantly elevated rate ratio estimates for the adjacent versus non-adjacent block groups for four out of six hospitalization diagnosis categories The analyses also produce statistically significantly elevated rate ratio estimates for block groups closest to the river (less than frac12 mile from the river) and close to the river (between 12 to 1 mile from the river) compared to distant block groups (farther than 1 mile from the river) for most of the outcome categories However the findings provide no evidence that the closest block groups have higher hospitalization rates than the close block groups

The most elevated rate ratio estimate is 113 (angina) indicating an estimated 13 percent increase in risk for angina hospitalizations for residents of close versus distant block groups The only outcome with no statistically significant elevations for living near (adjacent closest or close) the river in the more fully adjusted models is hemorrhagic stroke It is noteworthy that the outcome that is considered an indicator of inadequate primary care rather than of incidence or prevalence angina hospitalizations shows the highest elevations associated with residence near the river (Table 8a) In addition the hospitalization outcome with the least association with socioshyeconomic status in our data (Appendix B Table 1b) and the lowest out-of-hospital mortality (NYS DOH2009b) hemorrhagic stroke is the outcome that shows no pattern of elevations associated with residence near the river (Table 8b) This pattern of findings is consistent with the interpretation that the rate ratios for living closer to the river are reflecting the influence of

18

lower socio-economic status and other known risk factors for hospitalizations for CVD rather than any unusual environmental exposures in areas near the river

The stratified results add strong support to this conclusion They suggest that classical confounding of the exposure-disease relationship by socio-economic status produced biased results in the overall multivariable regression analyses While the regression analysis of the entire study population shows elevated hospitalization rates for residents of block groups near the river compared to residents living farther away (Table 8a and 8b) the stratified analyses (Table 9) show this association applies only to the 50 of residents living in block groups with lower median incomes For the other 50 of residents those who live in block groups with higher median incomes hospitalization rate ratios are reduced for those living near the river Classical confounding refers to the type of confounding that can occur in individual-level or group-level (ecological) studies when a factor such as socio-economic status affects both the exposure (residential location) and the health effect In this study of CVD hospitalizations there are disproportionate numbers of outcomes and exposures (residence near the river) among specific levels of the confounder the lower income quartiles (Tables 4 5 7) The disproportionate numbers of hospitalizations and potential exposures (block groups close to the river) among the lower income quartiles create an artifactual exaggerated ie biased effect on the estimates for the overall analysis (Greenland 1999 Greenland 2003)

5 DISCUSSION

The key finding of this investigation is that the results of the multivariable regression modeling for the entire study population the elevated CVD hospitalization rates for populations living closer to the Hudson River are likely to be artifacts of classical confounding Stratifying on the confounding variable ie holding median income level constant the standard method for controlling for confounding showed the decisive role of income as a confounding variable The associations between living closer to the river and having higher hospitalization rates were evident only for people in the two lowest income quartiles For people in the two highest income quartiles the association was completely opposite ie people living closer to the river had lower rates of hospitalizations This variability of effects can be interpreted as evidence that there are no actual effects of the exposure-proxy categories on hospitalization rates

This type of classical confounding due to the strong influence of a variable that precedes causes or strongly affects both the exposure and the health outcome can occur in any type of epidemiological study The ecologic ie group-level analysis in this study is not the primary cause of the confounding Rather the complex associations in the population among income education CVD risk geographic residential patterns and geographic assignment of exposure status are the basis for the confounding However the studyrsquos group-level design contributes to the studyrsquos inability to completely control for confounding using multivariable regression methods and contributes to difficulties of interpreting the study findings

The stratified results shown in Table 9 may also be a reflection of inadequate control for socioshyeconomic status using quartile groupings A possible explanation for the Table 9 findings is that among lower income quartile populations people living in closest proximity to the river may have the lowest incomes while in higher income quartile populations people living in closest

19

proximity to the river may have the highest incomes This potential income difference associated with river distance within quartile groupings would explain why the stratified analyses showed positive associations with hospitalization risk for lower income residents closest to the river and negative associations with hospital risk for higher income residents closest to the river In this example of a possible explanation it is assumed that river proximity categories are capturing relative income (and hospitalization risk) differences rather than exposure Because the exposure proxy river distance categories are assigned at the group level this type of analysis is not able to distinguish whether these categories are capturing income rather than potential exposure in the analyses

When trying to interpret this studyrsquos findings it is important to keep in mind the many factors associated with CVD risk that are not controlled for in this studyrsquos analyses These include tobacco use high blood pressure high serum cholesterol alcohol use obesity low fruit and vegetable intake physical inactivity diabetes mental ill-health psychosocial stress and the use of certain medications Another factor that is not known to be associated with CVD risk but that is relevant for PCB exposure is consumption of contaminated fish It is possible that lower income populations are more exposed to PCBs because they are more likely to eat fish from the Hudson River This however would not help explain why higher income people near the river showed lowered CVD hospitalization risk than their counterparts living farther from the river

In addition to the confounding due to socio-economic statusrsquos effects on both the risk of CVD and residential location (the exposure proxy) there is the possibility that another pattern of effects also associated with socio-economic status called effect modification plays a role in these data Effect modification occurs when different groups people with differing types of health insurance or differing behaviors for example show different patterns or levels of health effects associated with the same exposures The differing effects (CVD hospitalization risks) of residence in block groups closest to the river by income quartile could be interpreted as evidence of effect modification (Table 9) However the magnitude and the change in direction of effects is unlikely to be due solely to differing biological or behavioral factors associated with income level that affect the results of exposures from living near the river Given the strong associations of socio-economic status and CVD hospitalization risk it is more likely that the analyses stratified by income enhance protection against bias compared to the multivariable models and that the stratified analyses represent findings from better control for confounding as opposed to effect modification

Effect modification is a concept used to describe important relationships between exposures and effects (that may differ among sub-groups for example) that the researcher wants to identify and understand Confounding on the other hand refers to associations among the variables that the researcher is trying to control so that the effects of interest (effects from exposures rather than age or sex for example) can be identified and the strength of the effects can be estimated The ecological design of this study contributes to the difficulty of interpreting whether the study shows evidence of effect modification as opposed to confounding (Morgenstern 1998b Hertz-Picciotto 1998)

Another limitation associated with this studyrsquos design is that block-group level socio-economic and environmental attributes that are assigned to individuals in this type of study can be more

20

strongly correlated at the group level than at the individual level In this way the ecological assignment of variables may lead to increasing problems of confounding because of strong associations among indicators of socio-economic status and environmental exposure categories (Morganstern 1998a) This problem is likely to have affected this studyrsquos results As with effect modification by group described above this type of confounding by group cannot be evaluated with ecologic data because individual-level information is needed to check for this type of ldquocross-levelrdquo bias

Ecological analyses are an important tool for environmental epidemiology and there is therefore a large and growing literature on their strengths and limitations and methods to address limitations (Piantadosi 1988 Greenland 1989 Greenland 1992 Greenland 1994 Richardson 2000) One particular concern is the interpretation of ecological study findings showing modest elevations of risk such as in the current study with rate ratios in the more fully adjusted models generally estimated to be below 110 (Wakefield 2003 Morgenstern 1998) Wakefield states that slight elevations need to be interpreted with great caution

in particular for cancer and heart disease (because) there are risk factors that are far more predictive of disease than environmental factors hellip Consequently the potential for confounding is strong since ecological studies do not directly use individual-level risk factor data although stratification by age and gender is routinely carried out (Frequently an area-level measure of socio-economic status is also used as a confounding variable but being an aggregate summary it only provides very crude control) (Wakefield 2003 p 9-10)

A strength of this study is that it used Census block groups which are relatively small average population of 1200 in the study area to assign socio-economic characteristics rather than the larger ZIP codes with average population size 8000 in this area By using block groups this study sought to reduce confounding by socio-economic status because the assignment of median income and other variables at the block group level was expected to be more accurate with block group populations expected to be more homogenous than larger and more geographically diffuse ZIP code populations used in the prior studies

The use of census block groups rather than ZIP codes likely enhanced the accuracy of the group-level category assignments However this type of design that assigns characteristics to hospitalizations using group-level characteristics and distance to the river is limited in comparison to individual-level designs that gather such information from personal interviews for example Most importantly for this investigation the group-level assignment of median income and other demographic variables to each hospitalization limited the studyrsquos ability to distinguish among individuals within the exposure proxy categories and thereby reduced the studyrsquos ability to control for confounding

This type of measurement error or inaccuracy from assigning group-level potential exposure income or educational level to the individual hospitalizations is generally assumed to bias a study towards findings of no associations or no effects But in situations where the measurement error is more severe for some variables than others and these variables are correlated then the study findings can be biased either towards exaggerated effects or no effect In this

21

investigation potential measurement error associated with the assignment of exposure categories based on block group centroid distance to the river is expected to be relatively low because a large proportion of the hospitalization addresses are very clearly in areas distant from the river The measurement error associated with the assignment of group-level demographic characteristics (income and education for example) from the Census is potentially relatively higher because this type of error can occur throughout every block group in the study

Another strength of this study is the use of NYS SPARCS data a relatively comprehensive population-based source of information about hospitalizations These data include individual-level information used in this study for patient address sex age race and diagnosis Of these variables only the address and race categories were missing information for substantial numbers of records and this limitation could have contributed to imprecision sufficient to create bias in the findings These issues are discussed in greater detail below

While geocoding to block groups represents a strength of this study this procedure required high quality address information One percent of the study arearsquos hospitalization records contained no address and eight percent of the addresses could not be accurately geocoded to a block group resulting in nine percent of hospitalizations being eliminated from the study because of address andor geocoding issues The distributions of race categories and distribution among the types of diagnoses did not change after dropping records with no or insufficient address information suggesting that the dropped records did not represent a substantially skewed group of records

Inaccuracy in the studyrsquos assignments of race categories results from limitations associated with both the hospitalization and Census data The SPARCS hospitalization database did not include information on race for seven percent of the studyrsquos hospitalizations We imputed race by assigning race category randomly based on the race distribution of the personrsquos block group This method would have biased the study toward a finding of no effect of race on hospitalization risk

Another limitation of the SPARCS data used in this investigation is that repeat hospitalizations for the same individual were not able to be identified If an individual was admitted to the hospital many times over the course of the study period then that individual would be counted multiple times This limitation may have contributed to an exaggerated association of lower socio-economic status and CVD hospitalizations because some people with lower incomes receive less adequate outpatient care and may have more repeat admissions Because of the association of lower income with residence near the river this limitation may have contributed to the difficulty of adequately controlling for confounding by socio-economic status in this investigation

While the use of Census data at the block group level rather than at the ZIP code level is a strength in terms of relative homogeneity among the population within block groups it is possible that block group population estimation errors contributed to imprecise estimation of hospitalization rates Particularly with regard to the race categories the study finding that the ldquootherrdquo category showed a counter-trend of higher income and higher CVD hospitalization rates (Appendix B Table 2) is evidence that differences between the categories used in the Census race designations and hospitalization race assignments affected the estimated hospitalization

22

rates The evolving choices for identifying onersquos race in the Census are not concurrently reflected in the SPARCS data and these inconsistencies likely contributed to imprecision in the estimates of CVD hospitalization risk associated with specific race categories These issues did not contribute to bias for the overall findings however We modeled the data with a variety of race category groupings combining black and ldquootherrdquo for example and the overall results did not change (data not shown)

Also regarding the Census undercounting of approximately two percent for the US population in 2000 was estimated to be primarily among the lowest socio-economic groups and minorities while over-counting of one percent was estimated to have occurred among the most wealthy who sometimes received and completed forms at multiple residences (United States Censusrsquo Monitoring Board 2001) The undercounting of the lower income population would result in over-estimation of rates in this population Undercounting of recent Asian immigrants for example particularly those of lower socio-economic status is a potential reason for the relatively high rates of CVD hospitalizations estimated for lower income quartiles of the Asian population (Figures 4-6) Models that grouped the Asian population with other race categories were also estimated and these showed the same results as the models reported here (data not shown) Undercounting of lower income groups in general would have contributed to the finding of higher hospitalization rates in these groups and added to the difficulty of adequately controlling for confounding of lower income with residence near the river

This study used a 50 percent sample of the CVD hospitalizations in the study area This was accomplished by using every other record provided in the complete dataset This would be expected to provide a non-biased random sample As a general check for data provision management or analysis errors we compared our estimates of hospitalization rates per population with those from other sources Table 10 shows these comparisons and shows that our estimates are generally consistent with rates from other sources

Actual exposures were not measured in this investigation Rather proximity to the river was used as a proxy for potential exposures associated with the river In addition other potential exposures that may impact CVD were not evaluated or measured Exposures to lead and air pollution would be expected to be higher in more urban areas and areas near heavy traffic which may tend to coincide with proximity to the river These issues were not addressed in this investigation If these exposures were occurring more often near the river and contributing to cardiovascular hospitalization rates this would contribute to findings of elevations in areas closer to the river

It is important to keep in mind the limitations associated with using CVD hospitalizations when incidence andor prevalence of CVD are the actual outcomes of interest As described previously mortality prior to reaching the hospital may reduce hospitalization rates particularly for AMI and stroke for some groups or areas more than others and introduce bias if hospitalizations are being interpreted as a surrogate for incidence or prevalence

23

Addressing Study Objectives

This follow-up report has addressed the studyrsquos first two objectives to increase control for confounding by socio-economic status through the use of block groups rather than ZIP codes as the unit of analysis and to compare the hospitalization rates among residents of block groups nearest the Hudson River to those among residents of block groups farther from the Hudson River Regarding the first objective the use of smaller units of analysis block groups rather than ZIP codes improves the accuracy of the assignment of income and other group-level demographic categories to the hospitalizations

Regarding the studyrsquos second objective to compare hospitalization rates among residents of block groups near to the river to hospitalization rates among residents far from the river this studyrsquos multivariable analyses showed relatively small but statistically significant elevations of CVD hospitalization risk for residents of block groups in close proximity to the Hudson River Stratified analyses that controlled for confounding directly by conducting analyses within each income quartile showed that residence in block groups near the Hudson River was associated with increased risk for CVD hospitalizations among residents of lower income block groups and conversely residence in block groups near the Hudson River was associated with decreased risk for CVD hospitalizations among residents of higher income block groups The estimated increased and decreased risks were of sufficient magnitude to nearly cancel each other out when average rate ratios for the four income quartiles were calculated The average rate ratios may be interpreted as showing no evidence that CVD hospitalization rates are elevated for residents living near the river for the general study population This finding suggests that the multivariable regression findings for the overall population show elevated risks for hospitalizations among residents of block groups near the river due to confounding of the relationship of residential location and CVD risk by socio-economic status

The findings from analyses stratified by income quartile may also be interpreted as showing that residing in a higher versus lower income quartile block group produces different effects from similar exposures (effect modification) As discussed previously the ecological ie group-level analyses are unable to directly evaluate confounding and effect modification Rather information about the strengths and direction of associations among variables in the analyses need to be evaluated in light of known risk factors to draw conclusions about likely explanations and plausible interpretations In this case uncontrolled confounding by socio-economic status appears to be the most likely explanation for the findings This is due to the strong association between residential proximity to the river and socio-economic status

The studyrsquos third objective was to compare the findings of this review to those of the earlier studies that prompted this investigation Because of the similarities in study design and statistical analysis the prior studyrsquos findings can be compared with the findings of the multivariate regression analyses in the current study using Table 1 and Tables 8a and 8b The major difference between the studies is the composition of the comparison areas for the analyses Because of this difference as well as other differences (Table 3) the studies are not entirely comparable

24

Table 1 shows some findings from the prior studies Comparison with Table 8a and 8b from this study shows that the CV hospitalization rate ratios estimated for living in block groups adjacent or near the river in the current study are substantially lower than the rate ratios for residence in ZIP codes adjacent to the Hudson River in the previously published studies In most cases the current analyses show elevations below 10 in contrast to elevations of 20 to 39 in the prior studies The results of the current study are quite similar to the prior study results in terms of the associations reported for age sex race and income (Appendix B Tables 1a and 1b) (Sergeev and Carpenter 2005 Tables 4 amp 5 Shcherbatykh et al 2005 Table 3) These similarities and the general methodological similarities between the current and prior studies suggest the likelihood that the prior studiesrsquo findings also were affected by confounding by socio-economic status

6 CONCLUSION AND RECOMMENDATIONS

While this type of study can neither prove nor disprove causal links among risk factors and health effects it is useful for examining evidence for unusual patterns that might warrant additional investigation This evaluation of hospitalization data residential location and socioshyeconomic factors in block groups in the 12 Counties abutting the Hudson River revealed a closely coinciding pattern of lower socio-economic status and residential proximity to the Hudson River the proxy for potential exposure and a strong association of lower socioshyeconomic status and risk for CVD hospitalizations The strong associations among residential location socio-economic status and CVD risk prevent us from drawing definitive conclusions from the multivariable regression analyses

The multivariable analyses showed relatively small but statistically significant elevations of CVD hospitalization risk for residents of block groups in close proximity to the Hudson River Stratified analyses conducted within each of the four income quartiles showed such elevations only among residents of lower income block groups This studyrsquos data methods and analyses can not provide a conclusive interpretation for this finding However the strong effect of socioshyeconomic status on residential location (exposure proxy) and on CVD hospitalization risk makes it likely that the overall multivariable regression results are affected by uncontrolled confounding by socio-economic status

To draw more definitive conclusions particularly for outcomes such as CVD that are determined by multiple known risk factors studies seeking to investigate the role of PCB or POP exposures will require resource-intensive methods that include gathering individual-level exposure or biomonitoring information as well as individual-level medical histories and information on other risk factors

This current investigation did not include environmental or biological sampling data One study described previously did gather such data in the Hudson River region with the highest potential exposures (Hudson Falls Fort Edward) This study detected a statistically significant reduction in the total PCB concentrations in outdoor air samples at a distance of 1200 meters (34 mile) from the river However the PCB levels in outdoor air in this area were lower than those in other communities with known PCB-contaminated sites and similar to levels reported in other locations in the northeastern United States The study found no detectable differences in study

25

subjectsrsquo serum PCB levels associated with proximity to contaminated portions of the river or other PCB sources (Fitzgerald 2007 Palmer 2008)

It is recommended that the results of the current investigation of CVD hospitalizations and residence near the Hudson River be shared with NYS residents and other stakeholders with interest in this issue and be submitted for publication in a peer-reviewed journal

26

AGENCY INFORMATION

New York State Department of Health Authors Elizabeth Lewis-Michl PhD

Research Scientist Bureau of Environmental and Occupational Epidemiology

June Beckman-Moore MPH Research Scientist Bureau of Environmental and Occupational Epidemiology

Marta Gomez MS Research Scientist Bureau of Environmental and Occupational Epidemiology

Acknowledgments This study was supported by ATSDR and the Centers for Disease Control and Prevention

through a Cooperative Agreement Grant to NYS DOH (3U61TS200002-19W1)

The authors would like to thank the following people for their contribution to this project Alissa Caton PhD of the Bureau of Environmental and Occupational Epidemiology NYS DOH developed the study protocol assembled the preliminary data and conducted preliminary analyses Syni-an Hwang PhD Bureau Director and Shao Lin PhD Research Scientist Bureau of Environmental and Occupational Epidemiology NYS DOH and Edward Fitzgerald PhD Professor Division of Environmental Health and Toxicology SUNY Albany School of Public Health reviewed and provided comments on draft versions

New York State Department of Health Cooperative Agreement Coordinator Don Miles MS

Public Health Specialist Bureau of Environmental Exposure Investigation

ATSDR Technical Project Officer Gregory V Ulirsch PhD

Environmental Health Scientist Cooperative Agreement and Program Evaluation Branch

Division of Health Assessment and Consultation

Mohammed Uddin MD Epidemiologist

Division of Health Studies

ATSDR Regional Representatives Leah Graziano

Senior Regional Representative - Region 2 Office of Regional Operations

Racquel Stephenson Regional Representative - Region 2

Office of Regional Operations

27

REFERENCES Agency for Toxic Substances and Disease Registry (ATSDR) 2000 Toxicological Profile for Polychlorinated Biphenyls (PCBs) ATSDR Division of ToxicologyToxicology Information Branch Atlanta GA

Arisawa K Takeda H Mikasa H 2005 ldquoBackground exposure to PCDDsPCDFsPCBs and its potential health effects a review of epidemiologic studiesrdquo The Journal of Medical Investigation Feb 52(1-2)10-21

Arsenescu V Arsenescu RI King V Swanson H Cassis LA 2008 ldquoPolychlorinated biphenylshy77 induces adipocyte differentiation and proinflammatory adipokines and promotes obesity and atherosclerosisrdquo Environmental Health Perspectives 116 (6) 761-768

Ayala C Croft J Keenan N Neff L Greenlund K Donohoo R Zheng Zhi Mensah G 2003 ldquoIncreasing Trends in Pre-transport stroke deaths- United States 1990-1998rdquo Ethnicity amp Disease 12 Spr S2131-S2137

Barnett E Armstrong D Casper M 1999 Evidence of increasing coronary heart disease mortality among black men of lower social class AEP 98 464-471

Bertazzi PA Bernucci I Brambilla G Consonni D Pesatori AC 1998 ldquoThe Seveso studies on early and long-term effects of dioxin exposure a reviewrdquo Environmental Health Perspectives April 1062 625-633

Carpenter DO 2006 ldquoPolychlorinated biphenyls (PCBs) routes of exposure and effects on human healthrdquo Reviews in Environmental Health Jan-Mar 21(1)1-23

deVreede-Swagemakers JJ Gorgels APDubois-Arbouw WI van Ree JW et al 1997 ldquoOut-of hospital cardiac arrest in the 1990s a population-based study in the Maastricht area on incidence characteristics and survivalrdquo Journal of the American College of Cardiology 30(6) 1500-5

Everett CJ Mainous AG Frithsen IL Player MS Matheson EM 2008 ldquoAssociation of polychlorinated biphenyls with hypertension in the 1999-2002 National Health and Nutrition Examination Surveyrdquo Environmental Research 108 94-97

Fitzgerald EF Belanger EE Gomez MI Hwang S Jansing RL Hicks HE 2007 ldquoEnvironmental exposures to polychlorinated biphenyls (PCBS) among older residents of upper Hudson River communitiesrdquo Environmental Research 104 352-360

Galea S Blaney S Nandi A Silverman R Vlahov D Foltin G Kusick M Tunik M Richmond N 2007 ldquoExplaining racial disparities in incidence of and survival from out-of-hospital cardiac arrestrdquo American Journal of Epidemiology 1665 534-543

Greenland S Morgenstern H 1989 ldquoEcological bias confounding and effect modification International Journal of Epidemiology 18269-74

28

Greenland S 1992 ldquoDivergent biases in ecologic and individual level studiesrdquo Statistics in Medicine 111209-23

Greenland S Robins J 1994 ldquoInvited commentary ecologic studies ndash biases misconceptions and counterexamples American Journal of Epidemiology 139747-60

Greenland S Pearl J Robins JM 1999 Causal diagrams for epidemiologic research Epidemiology 1037-48

Greenland S 2003 ldquoQuantifying biases in causal models classical confounding vs collidershystratification biasrdquo Epidemiology May Vol 143 300-306

Ha M Lee D Jacobs DR 2007 ldquoAssociation between serum concentrations of persistent organic pollutants and self-reported cardiovascular disease prevalence results from the national health and nutrition examination survey 1999-2002rdquo Environmental Health Perspectives 1158 1204-1209

Hennig B Meerarani P Slim R Toborek M Daugherty A Silverstone AE Robertson LW 2002 ldquoProinflammatory properties of coplanar PCBs in vitro and in vivo evidence Toxicology and Applied Pharmacology Jun 15181(3)174-83

Hertz-Picciotto I ldquoEnvironmental Epidemiologyrdquo in Modern Epidemiology Rothman KJ and Greenland S 2nd ed Philadelphia Lippincott Williams amp Williams 1998 pp555-584

Howard G Cushman M Prineas R Howard V Moy C Sullivan L DrsquoAgostino R McClure L Pulley L Safford M 2009 ldquoAdvancing the hypothesis that geographic variations in risk factors contribute relatively little to observbed geographic variations in heart disease and stroke mortalityrdquo Preventive Medicine 49 129-132

Huang X Lessner L Carpenter DO 2006 ldquoExposure to persistent organic pollutants and hypertensive diseaserdquo Environmental Research 102(2005) 101-106

Kannel WB Cupples LA DrsquoAgostino RB 1987 ldquoSudden death risk in overt coronary heart disease the Framingham Studyrdquo American Heart Journal 113(3) 799-804

Ko DT Krumholz HM Wang Y Foody JM Masoudi FA Havranek EP You JJ Alter DA Stukel TA Newman AM Tu JV 2007 ldquoRegional differences in process of care and outcomes for older acute myocardial infarction patients in the United States and Ontario Canadardquo Circulation 115196-203

Lee D Lee I Song K Steffes M Toscano W Baker BA Jacobs DR 2006 ldquoA strong dose-response relation between serum concentrations of persistent organic pollutant and diabetesrdquo Diabetes Care 29 (7) 1638-1644

Lee D Lee I Porta M Steffes M Jacobs DR 2007 ldquoRelationship between serum concentrations of persistent organic pollutants and the prevalence of metabolic syndrome among non-diabetic

29

adults results from the National Health and Nutrition Examination Survey 1999-2002rdquo Diabetologia 50 1841-1851

Lind PM Orberg J Edlund U Sjoblom L Lind L 2004 ldquoThe dioxin-like pollutant PCB126 (33rsquo44rsquo5-pentachlorobiphenyl) affects risk factors for cardiovascular disease in female ratsrdquo Toxicology Letters 150 293-299

Longnecker MP Klebanoff MA Brock JW Zhou H 2001 ldquopolychlorinated biphenyl seum levels in pregnant subjects with diabetesrdquo Diabetes Care 24(6) 1099-1101

Lloyd-Jones D Adams R Carnethon et al (AHA Statistics Committee and Stroke Statstics Subcommittee) 2009 ldquoAHA Statistical Update Heart disease and stroke statisticsmdash2009 updaterdquo Circulation 119e21-e181

MapMarker Plus V 100 MapInfo Corp 2004

Morgenstern H 1998 ldquoEcologic studyrdquo In Armitage P Colton T eds Encyclopedia of biostatistics Vol2 NY NY John Wiley and Sons Inc 1255-76

Morganstern H ldquoEcologic Studiesrdquo Chapter 23 in Modern Epidemiology Rothman KJ and Greenland S 2nd ed Philadelphia Lippincott Williams amp Williams 1998 pp459-480

Morland K Wolff M Bopp R Godbold J Landrigan P 2008 ldquoFish consumption and body burden of organochlorines among Lower Hudson urban anglersrdquo American Journal of Industrial Medicine 51587-594

New York State Department of Health (NYS DOH) 2009a ldquoThe Burden of Cardiovascular Disease in New York Mortality Prevalence Risk Factors Costs and Selected Populationsrdquo Bureau of Chronic Disease Epidemiology and Surveillance Bureau of Health Risk Reduction httpwwwhealthstatenyusnysdohchronic_diseasecardiovascularburdenofcvdinnyspdf

New York State Department of Health (NYS DOH) 2009b ldquoHeart Disease and Stroke Indicatorsrdquo Community health assessment indicators Bureau of Biometrics and Health Statistics httpwwwhealthstatenyusstatisticschacchai

OrsquoNeill L 2003 ldquoEstimating out-of-hospital mortality due to myocardial infarctionrdquo Health Care Management Science 6 147-154

Palmer PM Belanger EE Wilson LR Hwang SA Narang RS Gomez MI Cayo MR Durocher LA Fitzgerald EF ldquoOutdoor air PCB concentrations in three communities along the Upper Hudson River New Yorkrdquo 2008 Archives of Environmental Contamination and Toxicology 54363-371

Piantadosi S Byar DP Green SB 1988 ldquoThe ecological fallacyrdquo American Journal of Epidemiology 127893-904

30

Rehkopf DH Berkman LF Coull B Krieger N 2008 ldquoThe non-linear risk of mortality by income level in a healthy population UgtSgt National Health and Nutrition Examination Survey mortality follow-up cohort 1988-2001rdquo British Medical Journal Public Health 8383

Richardson S Montifort C 2000 ldquoEcological correlation studies In Elliott P Wakefield JC Best NG et al eds Spatial epidemiology methods and applications NY NY Oxford University Press 205-220

SAS version 9 Cary North Carolina

Schoenbach V 2001 ldquoMulticausality Effect Modification wwwepidemiolognet Ch12 381shy413

Schoenbach V 2004 ldquoMulticausality Confounding wwwepidemiolognet Ch11 335-375

Sergeev AV and Carpenter DO 2005 Hospitalization rates for coronary heart disease in relation to residence near areas contaminated with persistent organic pollutants and other pollutants Environmental Health Perspectives 113(6)756-761

Shcherbatykh I Huang X Lessner L Carpenter DO 2005 Hazardous waste sites and stroke in New York State Environmental Health A Global Access Science Source 4(18) httpwwwehjournalnetcontent4118

Siemiatycki J Wacholder S Dewar R Cardis E Greenwood C and Richardson L 1988 ldquoDegree of confounding bias related to smoking ethnic group and socio-economic status in estimates of the associations between occupation and cancerrdquo Journal of Occupational Medicine 30 617-625

Tu YK Gunnell D Gilthorpe MS 2008 ldquoSimpsonrsquos paradox Lordrsquos paradox and suppression effects are the same phenomenon- the reversal paradoxrdquo Emerging Themes in Epidemiology 52

United Kingdom Heart Attack Study Collaborative Group Norris RM 1998 ldquoFatality outside hospital from acute coronary event in three British health districts 1994-1995rdquo British Medical Journal 316(7137) 1065-70

United States Bureau of the Census 2000 Census of population and housing summary file 1 (SF1) United States Department of Commerce 2001

United States Bureau of the Census 2000 Census of population and housing summary file 3 (SF3) United States Department of Commerce 2002

United States Bureau of the Census 1990 Census of population and housing summary file 1 (SF1) United States Department of Commerce 1991

31

United States Bureau of the Census 1990 Census of population and housing summary file 3 (SF3) United States Department of Commerce 1992

United States Censusrsquo Monitoring Board ldquoFinal Report to Congressrdquo September 1 2001 httpgovinfolibraryunteducmbcmbpreportsfinal)report

United States Department of Health and Human Services (USDHHS) Agency for Healthcare Research and Quality 2007 ldquoAHRQ Quality Indicators Guide to Prevention Quality Indicators Hospital Admission for Ambulatory Care Sensitive Conditionsrdquo October 2001 Version 31 (March 12 2007) downloaded 2009 httpwwwqualityindicatorsahrqgov

United States Department of Health and Human Services (USDHHS) Centers for Disease Control 2007 Health United States A Chartbook National Center for Health Statistics wwwcdcgovnchsdatahushus07pdf

United States Department of Health and Human Services (USDHHS) Centers for Disease Control National Center for Health Statistics 2007 ldquo2005 National Hospital Discharge Surveyrdquo No 385 by De Frances C and Hall M Division of Health Care Statistics

United States Department of Health and Human Services (USDHHS) 2009 Centers for Disease Control and Prevention National Center for Environmental Health 4th National Report on Environmental Chemicals httpwwwcdcgovexposurereportpdfFourthReportpdf

United States Department of Health and Human Services (USDHHS) National Institutes of Health 2010 National Institute of Environmental Health Sciences National Toxicology Program 11th Report on Carcinogens httpntpniehsnihgovntproceleventhprofiless149pcbpdf

United States Environmental Protection Agency (USEPA) 2002 EPA Superfund record of decision Hudson River PCBs Site Hudson Falls to New York City New York Washington DC

Wakefield J 2003 Sensitivity Analyses for Ecological Regression Biometrics March 59 9-17

Wakefield J 2004

Wing S 1988 Social inequalities in the decline of coronary mortality American Journal of Public Health 781415-1416

World Health Organization 2010 Atlas of Heart Disease and Stroke Part 2 Ch 3 Cardiovascular Disease Risk Factors httpwwwwhointcardiovascular_diseasesencvd_atlas_03_risk_factorspdf

32

Table 1 Previous Study Results Statewide Analyses and Hudson River Specific Analyses

Principal Diagnosis of Hospitalization

Adjusted Rate Ratioa (Lower 95CI shyUpper 95 CI)

NYS analysis Hudson River Subset

analysis Ischemic Heart Diseaseb

115 (103-129) 136 (118-156) Acute Myocardial Infarction 120 (103-139) 139 (118-163)

Cerebrovascular Diseasec 115 (105-126) 120 (110-132)

Ischemic Stroke 117(104-139) --shy Hemorrhagic Stroke 110 (099-122) ---

Hypertensionc 119 (108-131) 114 (105-123)

Table adapted from Sergeev et al 2005 Shcherbatykh et al 2005 and Huang et al 2006 Ischemic heart disease and stroke hospitalization rates for NYS ZIP codes containing (or

adjacent to) inactive hazardous waste sites with persistent organic pollutants compared to ZIP codes without (or not adjacent to) such sites

a All analyses include a 2nd exposure category for other types of inactive hazardous waste sites and adjust for age race and sex

b Ischemic heart disease analyses include all ages over 25 Ischemic heart disease analyses adjust for quartiles of median household income and health insurance coverage

c Cerebrovascular disease and hypertension analyses include ages 25-64

33

Table 2 Comparison of Design amp Methods for Previously Published Studies and Current Follow-up Study

OUTCOME ASSESSMENT CONFOUNDER ASSESSMENT EXPOSURE ASSESSMENT Previous Study Follow-up Study Previous Study Follow-up Study Previous Study Follow-up Study

Coronary Heart Disease 1993-2000 More years 1990-2005 ZIP code level Block group level ZIP code level Block group level

Ischemic heart disease (ICD-9 410shy Same outcomes with 4 levels of income ndash Same income Exposure = residence Two types of exposure 414) and sequelae of myocardial one addition separate 2000 Census categories within 196 ZIP codes indicator categories within infarction not elsewhere classified analysis of angina (ICDshy quartiles with hazardous waste 12 counties abutting (ICD-9 4297) Separate analysis of 9 413) sites containing POPs Hudson River acute myocardial infarction 6 age groups Same age groups or 222 ZIP codes with (ICD-9 410) age 25 and above hazardous waste sites Adjacency 255 block

without POPs versus groups adjacent versus Principal or other diagnoses (up to 14) Only principal diagnoses 4 race groups Different race residence within 996 2116 not adjacent to the combined AsianPacific categories White ldquocleanrdquo ZIP codes Hudson River

Use the admission Islander White black AsianPacific statewide Reference Sergeev AV Carpenter source field to eliminate black Native Island and Other which Distance Block group DO Hospitalization rates for coronary intrahospital transfers for American includes Native Sub-analyses (focus of population-weighted heart disease in relation to residence the same individual American and Multiple follow-up) Hudson centroid distance from the near areas contaminated with Race River area 78 POP ZIP Hudson River lt12 mile persistent organic pollutants and other codes versus 996 ldquocleanrdquo (225 block groups) pollutants Environ Health Perspect Add education ZIP codes statewide between frac12 and 1 mile 2005 113(6)756-761 population density (216 block groups) farther

hispanic and distance to than 1 mile (1930 block nearest hospital as groups) additional covariates

Stroke 1993-2000 More years 1990-2005 Restricted to 2nd and (same as above) (same as above) (same as above) 3rd quartile income

Cerebrovascular disease excluding Same outcomes ZIP codes other ill-defined and late effects of cerebrovascular disease (ICD9 430shy 4 age groups 25-64 436) Separate analyses of ischemic stroke (433x1 434x1 436) and 2 race groups hemorrhagic stroke ( 430-432) White black

Principal or secondary diagnoses (up Only principal diagnoses to 14) combined

Use the admission Reference Shcherbatykh I Huang X source field to eliminate Lessner L Carpenter DO Hazardous intrahospital transfers for waste sites and stroke in New York the same individual State Environ Health 2005 418

34

Table 3 Study Cardiovascular Disease Outcomes ICD-9-CM Codes and Definitions

Ischemic Heart Disease

ICD-9-CM 410 - ldquoAcute myocardial infarction (AMI)rdquo is defined as a sudden insufficiency of blood supply to an area of the heart muscle usually due to a coronary artery occlusion (obstruction) commonly known as heart attack

ICD-9-CM 411- ldquoOther acute and subacute forms of ischemic heart diseaserdquo refers to complications or symptoms occurring before or after AMI or a coronary artery occlusion (obstruction) interrupting blood flow to the heart without AMI

ICD-9-CM 412 - ldquoOld myocardial infarctionrdquo refers to a past AMI that is currently presenting no symptoms ICD-9-CM 413 - ldquoAngina pectorisrdquo is defined as severe constricting pain in the chest often radiating from the

area immediately over the heart to the left shoulder and down the arm ICD-9-CM 414 - ldquoOther forms of chronic ischemic heart diseaserdquo includes coronary atherosclerosis a chronic

condition marked by thickening and loss of elasticity of the coronary artery caused by deposits of plaque

ICD-9-CM 4297 - ldquoCertain sequelae of myocardial infarction not elsewhere classifiedrdquo refers to other symptoms or complications associated with a prior AMI

Cerebrovascular Disease

ICD-9-CM 430 - ldquoSubarachnoid hemorrhagerdquo is bleeding in the space between the brain and lining ICD-9-CM 431 - ldquoIntracerebral hemorrhagerdquo is bleeding within the brain ICD-9-CM 432 - ldquoOther and unspecified intracranial hemorrhagerdquo is bleeding nontraumatic between the skull

and brain lining ICD-9-CM 433 - ldquoOcclusion and stenosis of precerebral arteriesrdquo is blockage or stricture of the arteries

branching into the brain ICD-9-CM 434 - ldquoOcclusion of cerebral arteriesrdquo includes cerebral thrombosis and cerebral embolism ICD-9-CM 435 - ldquoTransient cerebral ischemiardquo includes cerebrovascular insufficiency (acute) with transient

neurological signs and symptoms ICD-9-CM 436 - ldquoAcute but ill-defined cerebrovascular diseaserdquo

35

Table 4 ndash Census Demographics for Study Area by Proximity to the Hudson River 2000

Census Demographics Study Area Within Block Groups Adjacent to River Block Group Centroid Distance to River Yes No lt frac12 mile frac12-1 mile gt 1 mile

n n n n n n Total Population and distribution within river distance categories

2924631 1000 315662 108 2608969 892 231414 79 283321 97 2409896 824

RaceEthnic Distribution 1000 1000 1000 1000 1000 1000 White 2383274 815 255023 808 2128251 816 154371 667 201457 711 2027446 841 Black 277302 95 34039 108 243263 93 43895 190 38959 138 194448 81 Total ldquoOtherrdquo (listed below)

American Indian 7147 02 882 03 6265 02 975 04 1058 04 5114 02

Asian 86465 30 6233 20 80232 31 4711 20 8979 32 72775 30 Pacific Islander 1058 lt01 111 lt01 947 lt01 104 lt01 145 lt01 809 lt01 Other 105182 36 12492 40 92690 36 19288 83 23338 82 62556 26 Multiple Races 64203 22 6882 22 57321 22 8070 35 9385 33 46748 19

Hispanic 268036 92 28449 90 239587 92 40691 176 46978 166 180367 75

Education lt9th grade 108312 56 13370 64 94942 55 13585 91 14518 81 80209 50 9th-12th grade 200374 103 27023 130 173351 100 23306 155 25161 140 151907 94 High school or GED 519236 267 62727 302 456509 263 44366 296 51585 287 423285 263 Some college 336283 173 36464 175 299819 173 24995 167 31141 173 280147 174 Associates degree 150134 77 16574 80 133560 77 10134 68 12939 72 127061 79 Bachelors degree 342297 176 28578 137 313719 181 18898 126 24584 137 298815 185 Graduate or professional degree

285823 147 23199 112 262624 151 14813 99 19909 111 251101 156

+Other SES Factors Households below poverty 91940 87 12875 108 79065 83 13977 154 14685 140 63278 72

mean SD mean SD mean SD mean SD mean SD mean SD Population density 57466 9589 54496 11391 57826 93468 13501 16071 11068 16248 43763 66909 Median household income 61010 31989 47320 23630 62668 32468 40748 21923 44962 20829 64847 32564

Sources US Bureau of the Census 2000 Census of population and housing summary file 1(SF1) US Department of Commerce 2001 US Bureau of the Census 2000 Census of population and housing summary file 3 (SF3) US Department of Commerce 2001

36

Table 5 Study Population Person Years by River Proximity Category For Income Quartiles and Race Categories

Person Years Total Adjacent Non-Adjacent lt12 mile frac12 to 1 mile gt1 mile n (row)

[column ] n (row)

[column ] n ()

[column ] n ()

[column ] n ()

[column ] n ()

[column ] Total Population 30219001 (1000)

[1000]

3269547 (108)

[1000]

26949454 (892)

[1000]

2380477 (079)

[1000]

2834956 (094)

[1000]

25003568 (827)

[1000]

Income Quartile

Lowest Income 7182853 (1000)

[237]

1345251 (187)

[411]

5837601 (813)

[217]

1462710 (204)

[614]

1236936 (172)

[436]

4483206 (624)

[179]

2nd Income 7778921 (1000)

[257]

1077888 (139)

[330]

6701033 (861

[249]

420361 (054)

[177]

808782 (104)

[285]

6549777 (842)

[262]

3rd Income 7671943 (1000)

[254]

551617 (072)

[169]

7120326 (928)

[264]

344840 (045)

[145]

511237 (067)

[180]

6815866 (888)

[273]

Highest Income 7585283 (1000)

[251]

294790 (039)

[090]

7290493 (961)

[271]

152565 (020)

[064]

278000 (037)

[098]

7154718 (943)

[286]

Race Category

White 25859811 (1000)

[856]

2784914 (108)

[851]

23074897 (892)

[856]

1761910 (68)

[740]

2238348 (87)

[790]

21859554 (845)

[874]

Black 2466185 (1000

[82]

297220 (121)

[91]

2168965 (879)

[81]

389721 (158)

[164]

316704 (128)

[112]

1759760 (714)

[70]

Asian 792737 (1000)

[26]

58007 (73)

[18]

734731 (827)

[27]

45151 (57)

[19]

78161 (99)

[28]

669426 (844)

[27]

Multiple Other 1100269 (1000)

[36]

129407 (118)

[40]

970862 (882)

[36]

183696 (167)

[77]

201743 (183)

[71]

714830 (650)

[29]

37

Table 6 Study Hospitalization Record Exclusions

After After dropping After imputing dropping records with

After excluding race amp poorly missing continuation After excluding eliminating geocoded covariates

Geocoded records transfers missing race records (racesex)

Total 202272 201814 02 169143 162 167733 08 154236 80 154220 001 (Loss ) Demograp hic group

White 155216 849 154805 849 132749 859 144262 860 132343 858 132332 858 Black 12878 70 12849 70 11109 72 12010 72 11328 73 11326 73 OtherNa 12745 70 12741 70 8814 57 9313 55 8647 56 8644 56 AsianPI 2021 11 2020 11 1810 12 2148 13 1918 12 1918 12 Missing Race 19412 96 19399 96 14661 87 (1410) Diagnosis

IHD 137569 680 137303 680 111005 656 110005 656 110000 656 101012 655

AMI 51840 256 51635 256 39338 233 39338 233 39013 233 35811 232

Angina Stroke

5792 64703

29 320

5788 64511

29 320

5316 58138

31 344

5316 58138

31 344

5285 57712

31 344

4731 53208

31 345

Isch str 23673 117 23543 117 21012 124 21012 124 20912 125 19371 126 Hem str 8097 40 8074 40 6544 39 6444 39 6492 39 5995 39

Missing covariates 11 records were missing sex 325 were missing age and 14661 records were missing race Race was imputed by random assignment based on block group of residencersquos race distribution for 13251 records (310 were also missing other covariates For approximately 1100 records with missing race geocoding was inadequate for assigning block group and 310 of those missing race were also missing additional covariates) One discharge can have one or more covariates missing (ie missing covariates are not mutually exclusive

38

Table 7 Income Quartile and Race Distributions for CVD Hospitalizations

Study Population

IHD n=101012

AMI n=35811

Angina n=4731

Stroke n=53208

Ischemic Stroke n=19371

Hemorrhagic Stroke n=5995

Income (Quartiles) Lowest Income 237 286 300 356 298 309 301 Second Income 257 254 265 237 256 256 252 Third Income 254 241 236 224 236 234 223 Highest Income 251 221 201 182 209 201 224

Total 1000 999 999 1000 1000

White 855 859 885 833 858 844 809 Black 82 64 62 102 92 108 110 Asian 26 12 12 11 12 14 19 Other 36 65 41 53 38 34 62

Study population income quartiles are based on census block median income Quartile cut-points were developed by weighting block group median income levels by block group population to identify the cut-points within the continuous income distribution After creating the cut-points the study population of each block group was assigned to the appropriate quartile category This resulted in the percentage of population in each quartile varying slightly from 25 because the entire populations of block groups were assigned to the same quartile

39

Table 8a Exposure Area Distributions and Rate Ratios for Ischemic Heart Disease Hospitalizations

Exposure Person-Years

(N=30219001) in

Ischemic Heart Disease Hospitalizations Total AMI Angina

No () No () RR (95 CI) No () RR (95 CI) No () RR (95 CI)

Adjusted for Age Sex Race Adjacent Block Group Yes 3269547 (108) 12327 (122) 133 (112-157) 4398 (123) 136 (114-163) 591 (125) 123 (106-142) No 26949454 (892) 88611 (878) 100 Reference 31380 (877) 100 Reference 4137 (875) 100 Reference Distance From River lt frac12 mile 2380477 (079) 9817 (97) 120 (103-140) 3399 (095) 126 (109-145) 542 (115) 160 (137-188) frac12 - 1 mile 2834956 (094) 11236 (111) 120 (104-140) 4048 (113) 125 (109-144) 610 (129) 156 (133-183) gt1 mile 25003569 (827) 79885 (791) 100 Reference 28331(792) 100 Reference 3576 (756) 100 Reference

Addrsquol adjustment for SES and other factors Adjacent Block Group Yes 3269547 (108) 12327 (122) 107 (104-111) 4398 (123) 106 (101-110) 591 (125) 106 (096-117) No 26949454 (892) 88611 (878) 100 Reference 31380 (877) 100 Reference 4137 (875) 100 Reference Distance From River lt frac12 mile 2380477 (079) 9817 (97) 102 (098-106) 3399 (095) 103 (098-109) 542 (115) 109 (098-121) frac12 - 1 mile 2834956 (094) 11236 (111) 103 (099-108) 4048 (113) 106 (101-111) 610 (129) 113 (102-125) gt 1 mile 25003569 (827) 79885 (791) 100 Reference 28331(792) 100 Reference 3576 (756) 100 Reference

Agegt25 1990-2005 50 sample Adjusted for age (25-34 35-44 45-54 55-64 65-74 75+ years) race (white black other) and sex (male female) Adjusted for SES factors quartiles of population density (61-5066 5066lt-21020 21020lt-64102 64102lt-943716 persons per square mile) median 1999 household income ($249900-4096800 $4096800lt-5465000 $5465000lt-7294400 $7294400lt-20000100) with less than a college education (0-306 306lt-427 427lt-555 555lt-1000) with Hispanic ethnicity (0-21 21lt-47 47lt- 96 96lt-860) and distance to nearest hospital

40

Table 8b Exposure Area Distributions and Rate Ratios for Stroke Hospitalizations Age gt= 25 1990-2005

Exposure Person-Years

(N=30219001) in Study Area

Stroke Hospitalizations

Total Ischemic Hemorrhagic

No () No () RR (95 CI) No () RR (95 CI) No () RR (95 CI)

Adjusted for Age Sex Race Adjacent Block Group Yes 3269547 (108) 6276 (118) 125 (109-144) 2334 (121) 128 (110-148) 635 (108) 099 (086-113)

No 26949454 (892) 46748 (882) 100 Reference 16989 (879) 100 Reference 5263 (892) 100 Reference

Distance From River lt frac12 mile 2380477 (079) 5280 (099) 116 (104-131) 1925 (100) 120 (105-137) 518 (088) 102 (088-117)

frac12 - 1 mile 2834956 (094) 6023 (114) 118 (106-132) 2227 (115) 127 (112-145) 658 (111) 115 (100-131)

gt1 mile 25003569 (827) 41721 (787) 100 Reference 15171 (785) 100 Reference 4722 (801) 100 Reference

Addl Adjustment for SES and Other Factors Adjacent Block Group Yes 3269547 (108) 6276 (118) 107 (103-112) 2334 (121) 110 (104-116) 635 (108) 097 (088 ndash 106)

No 26949454 (892) 46748 (882) 100 Reference 16989 (879) 100 Reference 5263 (892) 100 Reference Distance From River lt frac12 mile 2380477 (079) 5280 (099) 106 (101-110) 1925 (100) 108 (101-115) 518 (088) 089 (081-099)

frac12 - 1 mile 2834956 (094) 6023 (114) 106 (102-111) 2227 (115) 111 (104-117) 658 (111) 108 (099-119)

gt 1 mile 25003569 (827) 41721 (787) 100 Reference 15171 (785) 100 Reference 4722 (801) 100 Reference

Age gt 25 years 1990-200650 sample Adjusted for age (25-34 35-44 45-54 55-64 65-74 75+ years) race (white black other) and sex (male female) Adjusted for SES factors quartiles of population density (61-5066 5066lt-21020 21020lt-64102 64102lt-943716 persons per square mile) median 1999 household income ($249900-4096800 $4096800lt-5465000 $5465000lt-7294400 $7294400lt-20000100) with less than a college education (0-306 306lt-427 427lt-555 555lt-1000) with Hispanic ethnicity (0-21 21lt-47 47lt- 96 96lt-860) and distance to nearest hospital

41

Table 9 Rate Ratios for CVD Hospitalizations Stratified by Income Quartile

Rate Ratio for lt frac12 mile versus gt 1 mile from river

IHD AMI Angina Stroke Ischemic Stroke Hemorrhagic Stroke

RR (95CI) RR (95CI) RR (95CI) RR (95CI) RR (95CI) RR (95CI)

Adjusted for Age Sex Race Lowest Income 1 123 (108-140) 124 (110-139) 139 (120-160) 120 (109-133) 122 (107-138) 118 (103-137) Income 2 115 (101-131) 118 (101-139) 155 (124-195) 124 (113-135) 127 (109-147) --shyIncome 3 088 (075-103) 083 (068-102) 102 (076-137) 092 (079-107) 093 (076-113) --shyHighest Income 4 080 (064-099) 074 (054-103) 030 (013-070) 078 (065-093) 071 (053-095) 043 (024-076) Weighted average 100 098 091 102 100 071

Addrsquol Adjustment for other demographic factors Lowest Income 1 114 (107-120) 114 (106-123) 112 (097-130) 116 (109-124) 120 (110-131) 109 (095-125) Income 2 110 (102-118) 111 (101-123) 136 (108-172) 110 (101-119) 112 (099-128) 095 (076-119) Income 3 085 (077-094) 086 (075-097) 093 (068-126) 097 (087-108) 096 (082-112) 046 (033-066) Highest Income 4 068 (060-078) 069 (057-085) 028 (012-062) 069 (059-081) 058 (044-077) --shyWeighted average 092 093 079 096 093 078

To reduce the complexity of this Table only one of the distance categories is provided The other distance categories showed similar results Adjusted for age (25-34 35-44 45-54 55-64 65-74 75+ years) race (white black other) and sex (male female) Block group median income quartile ranges $249900-4096800 $4096800lt-5465000 $5465000lt-7294400 $7294400lt-20000100 Adjusted for demographic factors quartiles of population density (61-5066 5066lt-21020 21020lt-64102 64102lt-943716 persons per square mile) with less than a college education (0-306 306lt-427 427lt-555 555lt-1000) with Hispanic ethnicity (0-21 21lt-47 47lt- 96 96lt-860) and distance to nearest hospital

42

Table 10 Hospitalization Rate Estimates for Current Study and from Published Sources (age-adjusted to 2000 US standard per 10000)

Current Study 12 Counties abutting Hudson River 1990-2005

Total US CDC 2005

Northeastern US CDC 2005

SPARCS NYS excluding NYC 2005-2007

IHD 649 619 691

AMI 232 231 278

Angina 31 Stroke 349 303 307 351

Ischemic Stroke 127 39 Hemorrhagic Stroke

Source USDHHS 2005 National Hospital Discharge Survey No 385 July 12 2007 Source NYS DOH SPARCS 2009b Comparable estimates were not able to be provided because published outcome groupings differed

43

Figure 1 ZIP codes by Adjacency to the Hudson River

44

Figure 2 Census Block Groups by Adjacency to the Hudson River

45

Figure 3 Census Block Groups by Population-Weighted Centroid Distance to the Hudson River

46

IHD Hospitalization Rates by Sex Race Income (age-adjusted) (other race

group excluded)

000

200

400

600

800

1000

1200

1400

1600

Low est IncQ

Q2 Q3 Highest Inc Q

Income Quartile

Hos

pita

lizat

ions

per

10

00

w hite men black men

asian men w hite w omen

black w omen asian w omen

AMI Hospitalization Rates by Sex Race Income (age-adjusted) (other race

group excluded)

000

100

200

300

400

500

600

700

Low est IncQ

Q2 Q3 Highest Inc Q Income Quartile

Hos

pita

lizat

ions

per

10

00

w hite men black men

asian men w hite w omen

black w omen asian w omen

Stroke Hospitalization Rates by Sex Race Income (age-adjusted) (other

race group excluded)

000

100

200

300

400

500

600

700

800

900

Low est In Q

Q2 Q3 Highest Inc Q Income quartile

Hos

pita

lizat

ions

per

10

00w hite men black men

asian men w hite w omen

black w omen asian w omen

Figure 4 Figure 5 Figure 6

47

MAURICE D HINCHEY 22NO DISTRICT NEW YORK

COMMITTEE ON APPROPRIATIONS

~lIC()foltIIInas

AGRICUJTlAE RURAL OEElOPWNT rooo AND ORUG ADMINISTRATION

AND RBATEO AGEfjC~S

INTERIOR

~onlJltss of tl)t ~nttn ~tatt5 ~oU5t of ItpttUtlltl1tibeg lHaJington 111laquo 20515-3222

October 4 2006

Howard Frumkin MD DrPH Director Agency for Toxic Substances and

Disease Registty and CDCs National Center for Environmental Health

1600 Clifton Road NE MSE-28 Atlanta GA 30333

Dear Dr Frumkin

First I would like to congratulate you on your appoinbnent as Director of A TSDR and CDCs National Center for Environmental Health llook forward to the opportunity to work with you on important public health issues this year and in the future

My second reason for writing today is to request that ATSDR assist my office in conducting a health consultation study for residents along a 200 mile segment of the Hudson River a Superfund site on the National Priority List containing PCBs in order to measure whether further evaluation may be warranted

As you may know Dr David Carpenter has done a number of studies at SUNY Albany in New York that suggest that resideilcy in proximity to hazardous waste sites with Persistant Organic Pollutants (POPs) may be associated with increased hospitalization rates for Coronary Heart Disease (CHD) when compared to hospitalization rates tor areas with no waste sites or other types of waste not catagorized as POPs I have enclosed a copy of three of these research papers that demonstrate this correlation for heart disease hypertension and stroke

I am very concerned about the health of the residents living along the Hudson River and respectfully request that your agency look into what seems to be a very serious health threat For purposes of my request T would appreciate your agency looking into the subset population Dr Carpenter examined in 78 zip codes along the 200 mile stretch of the Hudson River from Hudson Falls to New York City This subset population studied by Dr Carpenter showed a statistically significant increase in hospital discharges for heart disease hypertension and stroke Interestingly according to CDCs data base the Behavioral Risk Factor Surveillance System this population on average has higher incomes exercises more frequently and conswnes more fruits and vegetables than the average New Yorker Additionally this population has more nonsmokers than the rest of Upstate New Yorks population

WASHINGTON OIFCE

2411 RAYBURN HOUSE OfFICE BUILDING WASH1NcrroN DC 20amp1amp-3222

(Zl2) 225-6335

_fOMetoYht lnchIY

BlNGI-UMTON OfRCE m-wA OfFICE KIGSTCN OFFIepound MIDDLnWJN OffICE MOJmCEllO OffiCE IDDA IfOCML lUllDlNG 123 SOUTH CAYJOA smra U l WAll

m STREET em HALL THlflD flOOR 18 ANWANf LAKf CAD

RIIrlGHAtATON NY 13901 SUITE 201 KINGSTOfl 12401 te JUpoundS STilEET MONTlCfllO NY 12701 (6171773-2768 ITHACA HY 14860 (M5)3~ MlDgtlETOWN NY 109(0 (amptS) 7S1-71e

(607) 273-~388 (845) l4C-3l11

Appendix A Representative Hinchey Letter

Dr Howard Frumkin Page 2 October 4 2006

Since this type of population would typically exhibit better than average cardiac health Im sme you can understand and appreciate my concern and sense of urgency that we must follow up on Dr Carpenters good work and detcnnine what further study is warranted to protect the health of the populatioD living and working aloDg the Hudson River

I than1c you for yom kind attention to my request I look forward to hearing back from you at your earliest convenience

Best regards

Sincerely

~ane~ey Member of Congress

MHlag

49

Appendix B Table 1a IHD Hospitalization Rate Ratios agesexrace adjusted versus additionally adjusted For residents (age 25 and older) of block groups closer versus more distant from the Hudson River in models adjusting for sex age race and block group median household income population density education level percent Hispanic and block group centroid distance to nearest hospital 1990-2005

Adjusted Rate Ratios by Block Group River Proximity (95 Confidence Intervals) IHD (N=101012) AMI (N=35811) Angina (N=4731)

Risk Factor Agesexrace adjusted

Addrsquol adjustment Agesexrace adjusted

Addrsquol adjustment Agesexrace adjusted

Addrsquol adjustment

Block group distance from River

lt frac12 mile vs gt 1 mile 120 (103-140) 102 (098-106) 126 (109-145) 103 (098-109) 160 (137-188) 109 (098-121) frac12 - 1 mile vs gt 1 mile 120 (104-140) 103 (099-107) 125 (109-144) 106 (101-111) 156 (133-183) 113 (102-125)

Sex Male vs female 197 (174-223) 214 (208-220) 211 (187-238) 209 (202-215) 113 (099-129) 113 (106-121) Race Black vs white 120 (102-142) 093 (089-097) 109 (094-127) 092 (087-097) 178 (152-208) 121 (108-136)

AsianPacific Is ldquo 080 (066-096) 073 (069-079) 085 (070-103) 072 (065-081) 075 (055-102) 065 (049-087) Other ldquo 246 (208-291) 377 (363-392) 189 (162-220) 204 (191-218) 257 (215-307) 205 (177-238)

Age 35-44 vs 25-34 809 (630-104) 824 (743-913) 647 (494-849) 658 (568-762) 491 (348-692) 498 (376-660) 45-54 ldquo 333 (261-426) 344 (312-380) 228 (175-296) 230 (200-265) 1539 (1108-214) 157 (120-206) 55-64 ldquo 827 (647-1057) 802 (727-885) 576 (443-748) 516 (449-593) 278 (200-386) 284 (217-371) 65-74 ldquo 1477 (1155-1890) 1384 (1255-1527) 1067 (820-1387) 899 (782-1033) 409 (294-567) 449 (344-586) 75+ ldquo 2124 (1656-2724) 1942 (1760-2142) 2190 (1683-2849) 1849 (1610-2123) 661 (476-919) 695 (533-905)

Population density

2nd lowest vs least dense

126 (121-132) 131 (124-137) 142 (127-159)

more dense ldquo 130 (124-136) 134 (127-141) 131 (116-147) most dense ldquo 120 (114-126) 121 (114-129) 115 (100-131)

Median Household Income

2nd highest vs highest income

105 (101-110) 113 (108-119) 123 (110-137)

lower ldquo 109 (104-113) 123 (117-130) 125 (111-141) lowest ldquo 116 (111-122) 129 (122-137) 162 (142-185)

with less than college

2nd lowest vs least college

114 (109-118) 118 (113-124) 110 (098-122)

higher ldquo 122 (117-127) 126 (119-132) 115(102-129) highest

ldquo

122 (116-128) 128 (120-135) 129 (113-148)

Hispanic 2nd lowest vs least Hispanic

093 (089-097) 094 (0890-098) 101 (091-112)

higher ldquo 092 (088-096) 096 (092-101) 118 (107-131) highest

ldquo

085 (081-088) 088 (084-093) 138 (123-154)

Distance to Hospital

2nd closest vs closest 097 (094-100) 094 (090-098) 107 (098-118) more distant ldquo 090 (087-094) 093 (089-098) 096 (086-107) furthest ldquo 087 (083-091) 091 (087-096) 089 (079-101)

50

Appendix B Table 1b Stroke Hospitalization Rate Ratios agesexrace adjusted versus additionally adjusted For residents (age 25 and older) of block groups closer versus more distant from the Hudson River in models adjusting for sex age race and block group median household income population density education level percent Hispanic and block group centroid distance to nearest hospital 1990-2005

Adjusted Rate Ratios by Block Group River Proximity (95 Confidence Intervals) Total Stroke (N=53208) Ischemic Stroke (N=19371) Hemorrhagic Stroke (N=5995)

Risk Factor Agesexrace adjusted

Addrsquol adjustment Agesexrace adjusted

Addrsquol adjustment Agesexrace adjusted

Addrsquol adjustment

Block group distance from River

lt frac12 mile vs gt 1 mile 116 (104-131) 106 (101-110) 120 (105-137) 108 (101-115) 102 (088-117) 089 (081-099) frac12 - 1 mile vs gt 1 mile 118 (106-132) 106 (102-111) 127 (112-145) 111 (104-117) 115 (100-131) 108 (099-119)

Sex Male vs female 114 (104-125) 127 (123-130) 126 (113-141) 131 (126-136) 114 (103-127) 123 (116-130) Race Black vs white 194 (173-219) 153 (147-160) 233 (204-267) 191 (180-203) 215 (188-246) 190 (173-209)

AsianPacific Isl ldquo 097 (083-113) 088 (080-096) 114 (094-138) 101 (088-116) 142 (114-177) 129 (106-157) Other ldquo 230 (203-260) 232 (219-245) 229 (197-265) 212 (193-232) 360 (310-417) 334 (296-377)

Age 35-44 vs 25-34 309 (251-379) 303 (269-343) 359 (267-483) 362 (289-455) 274 (212-353) 268 (219-329) 45-54 ldquo 114 (938-139) 1087 (970-1217) 149 (113-197) 135 (109-167) 655 (513-836) 605 (500-734) 55-64 ldquo 331 (273-402) 329 (295-368) 433 (329-569) 418 (340-514) 119 (933-152) 109 (904-132) 65-74 ldquo 826 (682-1001 861 (772-959) 1145 (872-1504) 1114 (908-1367) 214 (168-272) 224 (186-270) 75+ ldquo 1890 (1559-2290) 2037 (1828-2269) 2768 (2112-3627) 3060 (2497-3750) 554 (438-700) 587 (490-703)

Population density

2nd lowest vs least dense

126 (120-132) 124 (117-132) 129 (117-142)

more dense ldquo 126 (120-132) 121 (113-129) 142 (128-157) most dense ldquo 124 (118-132) 128 (119-138) 150 (133-169)

Median Household Income

2nd highest vs highest income

106 (101-110) 110 (103-117) 099 (090-108)

lower ldquo 113 (108-118) 113 (106-121) 107 (097-118) lowest ldquo 116 (110-122) 117 (108-126) 114 (102-127)

with less than college

2nd lowest vs least college

111 (107-116) 112 (106-119) 098 (090-107)

higher ldquo 120 (114-125) 120 (113-128) 102 (093-112) highest ldquo 116 (110-122) 119 (110-128) 093 (090-108)

Hispanic 2nd lowest vs least Hispanic

096 (092-100) 094 (088-099) 084 (077-091)

higher ldquo 101 (096-105) 096 (091-102) 084 (077-091) highest ldquo 096 (092-100) 088 (082-093) 078 (070-085)

Distance to Hospital

2nd closest vs closest 095 (091-098) 092 (087-097) 097 (089-105) more distant ldquo 092 (088-096) 091 (086-097) 089 (081-097) furthest ldquo 085 (081-090) 082 (077-088) 088 (079-098)

51

Appendix B Table 2 Sex and Race-Specific Age-Adjusted IHD AMI and Stroke Rates by Income (per 1000)

Men Women white black asian other white black asian other

IHD Lowest IncQ 470 401 492 940 283 354 477 543 Q2 425 339 675 1443 221 236 364 690 Q3 436 301 420 2463 199 233 236 902 Highest Inc Q 399 325 218 4307 160 172 120 1247 AMI Lowest IncQ 180 154 203 269 101 112 221 159 Q2 159 141 300 306 085 088 138 179 Q3 152 114 139 480 078 084 096 214 Highest Inc Q 135 130 046 820 062 065 052 275 Stroke Lowest IncQ 201 292 330 370 166 281 343 261 Q2 182 254 389 466 150 247 274 373 Q3 192 238 231 598 145 245 201 401 Highest Inc Q 177 275 102 905 132 197 070 776

These rates are estimated from the 50 sample of hospitalizations in the 12-county study area included in this investigation The rates are adjusted to the 2000 standard US population

52

CERTIFICATION

The health consultation for the Hudson River PCBs site Residential Proximity to the Hudson River and Hospitalization Rates for Ischemic Heart Disease and Stroke 1990shy2005 was prepared by the New York State Department of Health under a cooperative agreement with the Agency for Toxic Substances and Disease Registry (ATSDR) It is in accordance with approved methodology and procedures existing at the time the health consultation was initiated Editorial review was completed by the cooperative agreement partner

The Division of Health Assessment and Consultation (DHAC) ATSDR has reviewed this public health assessment and concurs with its findings

ATSDR

Page 6: HR CVD Final HCl 2011

EXECUTIVE SUMMARY

Background In October 2006 Member of Congress Maurice Hinchey requested that ATSDR conduct a health consultation to assess cardiovascular disease among people residing near the Hudson River A 200-mile segment of the Hudson River from Hudson Falls south to New York City is a National Priorities List (NPL) (Superfund) site due to contamination of the river with polychlorinated biphenyls (PCBs) from GE sites in Hudson Falls and Ft Edward In support of his request Member of Congress Hinchey cited three studies of cardiovascular hospitalizations conducted with similar data and methods These 2004-2005 statewide studies suggest residents of ZIP codes containing hazardous waste sites with persistent organic pollutants (POPs) such as PCBs have elevated risk for hospitalizations for cardiovascular disease including ischemic heart disease stroke and hypertension Analyses conducted separately for ZIP codes next to the Hudson River suggest residents of these specific ZIP codes also have increased rates of hospitalization for cardiovascular disease

In response to Congressman Hincheyrsquos request for follow-up of the 2004-2005 findings for the Hudson River area ATSDR and NYS DOH agreed to conduct a similar group-level study but at a different level of geography the Census block group By using a level of geography that is generally smaller in area and population than a ZIP code this follow-up seeks to more accurately adjust for socio-economic and other factors that affect cardiovascular hospitalizations and to improve upon the geographic classification of relative distance to the Hudson River The follow-up investigationrsquos analyses are restricted to the twelve counties abutting the Hudson River from Washington County south to Westchester County Within the study arearsquos twelve counties cardiovascular hospitalization rates in areas closest to the Hudson River are compared with rates in areas farther from the river

Objectives The follow-up investigationrsquos specific objectives are to increase the studyrsquos ability to control for confounding by socio-economic status through the

use of block groups rather than ZIP codes as the unit of analysis compare hospitalization rates among residents of block groups nearest the Hudson River to

those among residents of block groups farther from the Hudson River and compare the findings of this review to those of the earlier studies

Methods This study examines CVD hospitalization rates and residence near the Hudson River in the 12 counties abutting the Hudson River below Hudson Falls from 1990 through 2005 The hospitalization data for ischemic heart disease and stroke were acquired from the NYS DOH Statewide Planning and Research Cooperative System (SPARCS) established in 1979 to collect detailed records on discharges from hospitals located in New York State Statewide US Census data at the block group level from 1990 and 2000 were used to calculate the number of people in each race sex and age group for the study years

Classification of the population into potential exposure categories was based exclusively on location relative to the Hudson River Two types of exposure proxy categories were developed block group adjacency and block group centroid distance to the river shore For the adjacency categories the hospitalizations were assigned to the adjacent category if the residential address was in a block group directly abutting the Hudson River For the distance categories block

ii

group centroid (population-weighted) distance to the river shore was used to assign hospitalizations in each block group to one of three categories less than one-half mile from the Hudson River (closest) between one-half to one mile (close) from the river and farther than one mile from the river (distant) Multivariable regression analyses were conducted to compare rates of hospitalizations among the exposure categories assigned at the block group level These analyses controlled for age sex and race using individual-level data and population density percent Hispanic median income education and distance to the nearest hospital at the block group level

Results The use of block groups resulted in 2371 block groups as the units of analysis for the study rather than the 368 ZIP codes in the 12-County study area The average population per block group in the study area is 1200 compared to average population of 8000 per ZIP code The smaller geographical area and smaller populations in block groups contributed to improving the accuracy of the assignment of income and other group-level demographics to the hospitalizations included in the study and enhancing control for confounding by socio-economic status (Tables 8a and 8b)

The studyrsquos multivariable analyses showed relatively small but statistically significant elevations of CVD hospitalization risk for residents of block groups categorized as adjacent in close and closest proximity to the Hudson River There is no evidence that residents of the closest block groups (less than frac12 mile) had higher hospitalization rates than residents of the close block groups (between frac12 to 1 mile) If there were a strong association between exposures from living near the river and the risk of CVD hospitalization such a ldquodose-responserdquo might have been shown by the analyses

The data showed strong statistically significant associations between CVD risk and socioshyeconomic status To enhance the control for socio-economic status separate analyses for each income quartile were conducted These stratified analyses showed that residence in block groups near (close closest or adjacent to) the Hudson River was associated with increased risk for CVD hospitalizations among residents of lower income block groups Conversely residence in block groups near the Hudson River was associated with decreased risk for CVD hospitalizations among residents of higher income block groups In other words in analyses that held income constant by looking only within the lower or higher income populations living near the river shows opposite effects depending upon which income group is considered The estimated increased and decreased risks are of sufficient magnitude to cancel each other out when average rate ratios for the four income quartiles are calculated The average rate ratios can be interpreted as showing no evidence that hospitalizations among the general population are elevated for residents living near the river

This type of group-level statistical analysis is not able to precisely identify the reason for these differing results However the strength and direction of associations among variables in the statistical models can be evaluated in light of known risk factors to suggest likely explanations In this case socio-economic status appears to be the most likely explanation for the findings The strong association between socio-economic status and residential location (the indicator of potential exposure) and between socio-economic status and CVD hospitalization risk makes it likely that the small but statistically significant elevations shown in the multivariate models

iii

(Tables 8a amp 8b) are the result of the inability to completely control for socio-economic statusrsquo effects on CVD hospitalization risk The strong effect of socio-economic status on residential location (exposure proxy) and on CVD hospitalization risk makes it likely that these overall multivariable regression results are affected by uncontrolled ie residual confounding by socioshyeconomic status

Because of the similarities in study design and statistical analysis the prior studyrsquos findings can be compared to some degree with the findings of the multivariate regression analyses in the current study The major difference between the studies is the composition of the comparison areas for the analyses Because of this difference as well as other differences described in the report the studies are not entirely comparable

CVD hospitalization rate ratios estimated for living in block groups adjacent closest or close to the river in the current study are substantially lower than the rate ratios for residence in ZIP codes adjacent to the Hudson River in the previously published studies In most cases the current analyses show elevations below 10 in contrast to elevations of 20 to 39 in the prior studies The results of the current study are quite similar to the prior study results in terms of the associations reported for age sex race and income These similarities and the general methodological similarities between the current and prior studies suggest the likelihood that the prior studiesrsquo findings also were affected by confounding by socio-economic status

Conclusions and Recommendations While this type of study can neither prove nor disprove causal links among risk factors and health effects it is useful for examining evidence for unusual patterns that might warrant additional investigation This evaluation of hospitalization data residential location and socio-economic factors in block groups in the 12 Counties abutting the Hudson River revealed a closely coinciding pattern of lower socio-economic status and residential proximity to the Hudson River the proxy for potential exposure and a strong association of lower socio-economic status and risk for CVD hospitalizations The strong associations among residential location socio-economic status and CVD risk prevent us from drawing definitive conclusions from the multivariable regression analyses that showed modest but statistically significant elevations of CVD hospitalization risk for residents of block groups near the river

To draw more definitive conclusions particularly for outcomes such as CVD that are determined by multiple known risk factors studies seeking to investigate the role of PCB or POP exposures will require resource-intensive methods that include gathering individual-level exposure or biomonitoring information as well as medical histories and information on other risk factors

This current investigation did not include environmental or biological sampling data One study did gather such data in the Hudson River region with the highest potential exposures (Hudson Falls Fort Edward) This study detected a statistically significant reduction in the total PCB concentrations in outdoor air samples at a distance of 1200 meters (34 mile) from the river However the PCB levels in outdoor air in this area were lower than those in other communities with known PCB-contaminated sites and similar to levels reported in other locations in the northeastern United States The study found no detectable differences in study subjectsrsquo serum

iv

PCB levels associated with proximity to contaminated portions of the river or other PCB sources (Fitzgerald 2007 Palmer 2008)

It is recommended that the results of the current investigation of CVD hospitalization rates and residence near the Hudson River be shared with NYS residents and other stakeholders with interest in this issue and be submitted for publication in a peer-reviewed journal

v

1 INTRODUCTION

The New York State Department of Health (NYS DOH) has a cooperative agreement with the Agency for Toxic Substances and Disease Registry (ATSDR) to perform environmental exposure and health assessments conduct health statistics reviews and perform epidemiological studies of populations in New York State which may have been exposed to environmental contaminants In October 2006 Member of Congress Maurice Hinchey requested that ATSDR conduct a health consultation to assess cardiovascular disease among people residing near the Hudson River A 200-mile segment of the Hudson River from Hudson Falls south to New York City is a National Priority List (NPL) (Superfund) site due to contamination of the river with polychlorinated biphenyls (PCBs) from GE sites in Hudson Falls and Ft Edward (See Appendix A for Member of Congress Hincheyrsquos letter)

In support of his request Member of Congress Hinchey cited three studies (Sergeev and Carpenter 2005 [ischemic heart disease] Shcherbatykh et al 2005 [stroke] Huang et al 2006 [hypertension]) These statewide studies suggest residents of ZIP codes containing hazardous waste sites with persistent organic pollutants (POPs) such as PCBs have elevated risk for hospitalizations for cardiovascular disease including ischemic heart disease stroke and hypertension Analyses conducted separately for ZIP codes next to the Hudson River suggest residents of these specific ZIP codes also have increased rates of hospitalization for cardiovascular disease

These studies published in 2005-2006 are ecological studies In an ecological study data are analyzed by evaluating risk factors and health outcomes among groups For example rates of disease are compared for populations of ZIP codes with varying median income levels to see if income levels appear to be related to disease outcome rates In these previously conducted studies rates of hospitalization for residents of ZIP codes with hazardous waste sites containing POPs were compared with hospitalization rates for residents of ZIP codes without such waste sites to look for evidence that potential exposures might be impacting health outcomes The studies report statistically significant elevations of cardiovascular disease hospitalizations in ZIP codes with POPs For coronary heart disease and a subset acute myocardial infarction the study showed elevations of approximately 15 and 20 respectively (Sergeev and Carpenter 2005) for stroke a 15 elevation (Shcherbatykh et al 2005) and for hypertension a 19 elevation (Huang et al 2006) in ZIP codes containing waste sites with POPs compared with hospitalization rates in other New York State ZIP codes (excluding NYC) that did not contain waste sites with POPs (Table 1)

In sub-analyses in these studies cardiovascular hospitalization rates for ZIP codes adjacent to the Hudson River were compared with cardiovascular hospitalization rates for ZIP codes in the rest of the state (excluding NYC) identified as not having inactive hazardous waste sites with POPs These sub-analyses showed statistically significantly elevated hospitalization rates of 36 and 39 for coronary heart disease and the subset acute myocardial infarction a 20 elevation for stroke and a 14 elevation for hypertension (Table 1) The Sergeev and Carpenter Shcherbatykh et al and Huang et al articles state that the Hudson River ZIP codes are in an area where people have higher income smoke less exercise more and have healthier diets This information is cited in these articles as evidence that known risk factors related to lower socioshy

1

economic status are likely not the cause for the elevated hospitalization rates in these Hudson River area ZIP codes

In these previously published reports Behavioral Risk Factor Surveillance System (BRFSS) data for entire counties are provided to support statements about the population of Hudson River ZIP codes having higher income and healthier lifestyles However because the ZIP codes adjacent to the Hudson River include only a proportion of the population of the counties abutting the river these county-level BRFSS data may be misleading regarding characteristics of the ZIP codes adjacent to the river (Figure 1) In fact data from the US Census show that the ZIP codes adjacent to the river have lower median income than the counties within which they are located The median income for the total population of the twelve counties adjacent to the Hudson River was $61010 in 2000 while the median income for the ZIP codes adjacent to the river was $45356 This median income for the population of the ZIP codes adjacent to the river is also slightly lower than the statewide (excluding NYC) median income of $47735 (US Census 2000) Thus the Hudson River ZIP codes analyzed in the Sergeev and Carpenter Shcherbatykh et al and Huang et al studies include a population of lower not higher socio-economic status compared to the rest of NYS (excluding NYC)

Because CVD and many CVD risk factors are associated with lower socio-economic status it is especially important to adjust as much as possible for socio-economic status when conducting comparative analyses of CVD hospitalizations or disease incidence or prevalence Regarding the hypothesis about POPs and CVD research studies described in more detail below have suggested possible biological mechanisms for POP exposuresrsquo effects on the heart There is very little research on human exposures to POPs and potential cardiovascular effects however

In response to Congressman Hincheyrsquos request for follow-up of the 2004-2005 findings ATSDR and NYS DOH agreed to conduct a similar study also ecological but at a different level of geography the Census block group By using a level of geography that is generally smaller in area and population than a ZIP code this follow-up seeks to more accurately adjust for socioshyeconomic and other factors that affect cardiovascular hospitalizations and to improve upon the geographic classification of relative distance to the Hudson River In addition rather than comparing hospitalizations among residents of Hudson River area ZIP codes to hospitalizations among residents of all other NYS (exclusive of NYC) ZIP codes with no waste sites with POPs this follow-up investigationrsquos analyses are restricted to the twelve counties abutting the Hudson river from Washington County south to Westchester County Within the study arearsquos twelve counties cardiovascular hospitalization rates in areas closest to the Hudson River are compared with rates in areas farther from the river This more localized approach may also assist with reducing the potential impact of regional differences in medical care practice or reporting factors that could affect the hospitalization data (Ko et al 2007) (Table 2 shows a more detailed listing of the methods used in the prior studies and follow-up study)

Regarding environmental exposures it is important to note that living near a hazardous waste site that contains PCBs or other POPs does not necessarily mean that human exposure from the site has occurred Depending upon the conditions at a site or the specific chemicals exposures may be highly improbable Information on body burden for example PCB levels in blood or on levels of the chemical in off-site air dust or soil is needed to support a claim of exposure The

2

studies published in 2005-2006 did not provide specific information such as environmental or biomonitoring data in support of the assumption that residence in ZIP codes containing hazardous waste sites with POPs or residence in ZIP codes adjacent to the Hudson River resulted in increased exposures to POPs in the population For the purpose of this follow-up we did not conduct environmental or human biological sampling to address this issue Rather this follow-up replicates the prior studiesrsquo approach with some enhancements to better control for confounding particularly confounding by socio-economic status

This descriptive epidemiologic investigation uses already existing hospitalization data from the Statewide Planning and Research Cooperative System and population data from the United States Census to evaluate comparative rates of cardiovascular hospitalizations While this type of investigation can not show proof of cause and effect it can assist in interpreting relationships among indicators of CVD burden and hypothesized exposures associated with distance from the Hudson River This descriptive investigation may also assist in determining whether additional health investigations of these associations are warranted

2 BACKGROUND

Cardiovascular Disease

The widely known risk factors for cardiovascular disease (CVD) in addition to age sex race and family history include tobacco use high blood pressure high serum cholesterol alcohol use obesity low fruit and vegetable intake physical inactivity diabetes low socio-economic status mental ill-health psychosocial stress and the use of certain medications (WHO 2010 NYS DOH 2009 Wing 1988 NYS DOH 2009a)

CVD Definitions CVD is a general term for any disease of the circulatory system The two main types of CVD are ischemic heart disease also termed coronary heart disease and cerebrovascular disease also called stroke Ischemic heart disease (IHD) results from restricted blood flow through the arteries supplying the heart muscle The most serious risk from IHD is acute myocardial infarction (AMI) another term for heart attack AMI occurs when blood flow to the heart is suddenly reduced or stopped usually due to a blockage in a coronary artery Angina is severe constricting pain in the chest often radiating from the area immediately over the heart to the left shoulder and down the arm Cerebrovascular disease refers primarily to stroke the sudden impairment of brain function resulting from interruption of circulation to the brain Ischemic stroke results from blockage of an artery supplying blood to the brain Hemorrhagic stroke occurs due to the escape of blood from a ruptured artery supplying blood to the brain

CVD Burden and Trends CVD is the leading cause of death for men and women of all ages in the United States and in New York State In 1999 for every death from IHD there were five hospitalizations for ischemic disease in NYS (NYS DOH 2009a) Mortality rates from CVD have been declining steadily since the 1980rsquos Research studies estimate that approximately half of the decline is due to improved treatments for CVD and its risk factors such as high cholesterol and high blood pressure and approximately half is due to decreasing incidence of CVD The decrease in incidence is due to trends in health behaviors primarily reduced smoking rates

3

CVD mortality rates do not appear to be declining for all segments of the population however and may actually be increasing among some groups particularly people of lower socio-economic status (Barnett et al 1999) This may be due in part to higher rates of smoking among people of lower socio-economic status as well as to other factors related to socio-economic status and race A growing literature points to multiple causes for CVD and other health disparities including health practices psychosocial stress limited resources discrimination and access to health care (Health United States 2007) Increases in the prevalence of diabetes and obesity in the general population partially offset advances in treatment and favorable trends for other risk factors An increasing ischemic heart disease mortality rate in women ages 35 to 54 since 1997 is attributed to these unfavorable risk factor trends (Ford 2007 Lloyd-Jones 2009) New York State (NYS) and other northeast states have high IHD and low stroke mortality with NYS having the highest IHD mortality and lowest stroke mortality among the 50 states (Howard et al 2009)

CVD Data Sources CVD mortality rates are more easily measured than incidence prevalence or hospitalizations because mortality data from vital records are a comprehensive and relatively accessible data source Information for estimating the incidence and prevalence of CVD outcomes comes primarily from health interview physical examination and record surveys Hospitalization data are available for NYS and the US but are generally collected and analyzed in order to track trends in hospital care and resources expended Actual CVD incidence data validated by physician diagnosis physician examination or reviews of medical records are not directly available for the entire US or for specific geographic areas of NYS Such information is available from population-based cohort studies that follow people over time in selected communities While the concern underlying this research project is to understand CVD levels in the community ie incidence and prevalence we do not have data that directly provide us with this information In the absence of such data hospitalizations are frequently used as indicators of the burden of disease in communities particularly in surveillance or hypothesis-generating investigations such as this one In NYS the Behavioral Risk Factor Surveillance System (BRFSS) an ongoing monthly telephone survey of adults aged 18 years and older provides information about the burden of CVD by age sex race income and education

CVD Risk Factor Data The BRFSS data for NYS show that over 20 of the population aged 65 and older reported having been diagnosed with CVD versus less than five percent of the population under 65 Males were slightly more likely to report having CVD than females Nonshyhispanic whites compared to other ethnic groups combined were slightly more likely to report having CVD None of these slight differences were statistically significant however The only statistically significant demographic risk factors in the BRFSS data were low income and education levels Statistically significant elevations of reported CVD were observed for respondents with incomes below $25000 and respondents with a high school degree or less education (NYS DOH 2009a)

Another factor that can affect the outcome measure hospitalizations is the likelihood of mortality prior to reaching the hospital Studies show a variety of factors affect whether an individual experiencing a heart attack or stroke outside the hospital survives to be counted in hospitalization data These factors include lack of knowledge about personal risk or prior disease socio-economic status race marital status and distance to the hospital (Galea 2007

4

Barnett 2006 Ayala 2003) Population-based studies report sudden cardiac death usually outside the hospital to be the first overt manifestation of heart disease for 40 to 60 of all cases ( Kannel 1987 deVreede-Swagemakers 1997 UK Heart Attack Study 1998) An analysis of out-of-hospital mortality from AMI in Pennsylvania in 1998 showed that in relatively rural areas where travel time to the hospital was 25 minutes or longer 72 of all AMI deaths occurred outside the hospital compared with a statewide average of 49 of AMI deaths occurring outside of the hospital (OrsquoNeill 2003) In New York State 55 of IHD deaths occur outside the hospital compared to 35 of stroke deaths (NYS DOH 2009b)

Hudson River PCB Contamination

During an approximate 30-year period ending in 1977 the General Electric (GE) plants in Hudson Falls and Fort Edward used PCBs in the manufacture of electrical capacitors and discharged large quantities of PCBs into the upper Hudson River Estimates of the total direct discharges to the river are as high as 1330000 pounds In 1983 the US EPA classified a 200shymile section of the Hudson River from Hudson Falls to New York City as a National Priority List site making it one of the largest Superfund sites in the United States In 2002 the US EPA issued a Record of Decision calling for targeted environmental dredging and removal of PCB-contaminated sediment in the upper River (USEPA 2002) The dredging began in May 2009

The Superfund environmental investigations and planned remedial actions have focused on the areas of the river near Hudson Falls and Fort Edward where the highest levels of PCBs in sediments are located The 2002 Record of Decision provides information about the human health risk assessment conducted for the Upper and Mid-Hudson and the ecological risk assessment conducted for the Upper and Lower Hudson River (USEPA 2002) Fish contamination and fish consumption are the primary concerns at the site

In 1976 fishing was banned in the upper river and in 1995 a catch and release policy replaced the ban NYS DOH continues to recommend people eat no fish from the Upper Hudson River that children under age 15 and women of childbearing age eat no fish from the entire 200 mile stretch of river below Hudson Falls and that the general population eat none of most species of fish caught between the federal dam at Troy and Catskill approximately 40 miles south of Troy The Lower Hudson remains closed to commercial fishing for striped bass and eight other species

The fishing bans and advisories were issued because ingestion of fish is the major potential pathway for exposure for the general population Many previous studies have shown relationships between consumption of fish from contaminated waters and human PCB levels Another potential pathway for exposure less frequently studied to date is inhalation of PCBs NYS DOH and ATSDR launched ldquoThe Hudson River Communities Projectrdquo in 2000 to address this issue (More information on this project is provided below)

POPS and PCBs

The category ldquopersistent organic pollutantsrdquo (POPs) includes man-made chemicals that are by definition persistent in the environment Examples of POPs are polychlorinated biphenyls

5

(PCBs) polychlorinated dibenzo-p-dioxins (PCDDs) polychorinated dibenso-furans (PCDFs) and pprsquo-diphenyldichloroethene (DDE) a breakdown product of the pesticide dichlorodiphenyltrichloroethane (DDT) These chemicals are stored in human fat tissues accumulating over an individualrsquos lifetime They also accumulate in fish and other species thereby concentrating up the food chain Levels of most POPs measured in blood serum of Americans have decreased in the last few decades An exception is polybrominated diphenyl ethers (PBDEs) levels of which have been increasing in recent years While levels of many POPs are declining a large number continue to be detectable in blood for most of the US population (USDHHS 2009)

This follow-up investigation focuses on the Hudson River where the POPs of concern are PCBs PCBs are a group of synthetic chlorinated compounds manufactured in the US through 1977 Production was halted due to concerns about the persistence of PCBs in the environment and the toxicity of PCBs shown in animal models PCBs were used primarily by the electrical industry for capacitors and transformers and were also used in hydraulic fluids fluorescent light fixtures flame retardants inks adhesives carbonless copy paper paints pesticide extenders plasticizers and wire insulators PCBs can be oily liquids or solids and some are volatile and can exist as vapor in air PCBs in the environment are generally mixtures of the many different types known as congeners (ATSDR 2000)

The variety of types of PCBs adds to the complexity of studying adverse animal and human health effects Increasingly PCB research focuses on specific PCB congeners and groups of congeners because specific types of congeners appear to have different mechanisms of action and toxicity Dioxin-like PCBs (DLPCBs) are an important subset of PCBs due to concerns about their toxicity The focus on dioxin-like PCBs is paralleled by increasing research on the group of POPs with dioxin-like properties This type of PCB and POP research assigns toxic potency values relative to dioxin to various PCBs and POPs to provide an overall toxic equivalency value to specific PCB and other POPs

From 2002 through 2007 NYS DOH and SUNY Albany School of Public Health researchers conducted the Hudson River Communities Study among residents of Hudson Falls and Fort Edward the villages where the PCB discharges from two GE electrical capacitor plants occurred The comparison community was upriver from the discharges in Glens Falls This studyrsquos evaluation of exposures included measuring PCBs in outdoor air and in blood serum The study found that PCB levels in outdoor air were statistically significantly higher in the Hudson FallsFort Edward area (072 nanograms per cubic meter) than in the upriver Glens Falls area (040 nanograms per cubic meter) but that the PCB concentrations in the Hudson FallsFort Edward area were lower than those in other communities with known PCB-contaminated sites and similar to levels reported in other locations in the northeastern United States A statistically significant reduction in the total PCB concentrations in outdoor air samples was observed at a distance of 1200 meters (34 mile) from the river from 0760 ngm3 to 0497 ngm3 (Fitzgerald et al 2007 Palmer et al 2008)

The average PCB concentrations in serum for the study and comparison populations did not differ significantly (307 ppb wet weight versus 323 ppb) The average toxic equivalency quantities (TEQ) for dioxin-like PCB congeners for the study and comparison area populations

6

also did not differ In addition the study found no detectable differences in study subjectsrsquo serum PCB levels associated with proximity to contaminated portions of the river or other PCB sources The study did find that lifetime consumption of Hudson River fish was associated with higher serum PCB levels (Fitzgerald 2007 Palmer 2008) Another recent biomonitoring study of Brooklyn Manhattan and New Jersey fisherman who consumed versus those who did not consume their catch from the lower Hudson River and New YorkNew Jersey harbor suggests eating fish from these waters is not associated with an increased body burden of PCBs or most other organochlorines This study did not address the inhalation pathway (Morland 2008)

PCBs and Cardiovascular Disease Outcomes

Research studies on adverse health effects associated with PCB exposure have focused on many types of outcomes organs and biological systems including cancer liver reproductive and developmental effects thyroid endocrine immune system and neurological effects as well as specific effects on the skin and eyes Relatively little research has focused to date on associations between PCB exposures and CVD outcomes The majority of human studies have assessed health effects in occupational groups having relatively high-level exposures among people consuming fish from contaminated waters or people exposed to relatively high levels due to accidental contamination of food (USDHHS NIH 2010)

PCBs have been shown to produce a variety of toxic effects in animals These effects include cancer allergies hypersensitivity damage to the central and peripheral nervous systems reproductive disorders and disruption of the immune system (ATSDR 2000) In contrast research to date does not tell us whether specific types of human health effects have occurred from the low-level exposures experienced by the general population Some human epidemiological studies of relatively low levels of PCDDs PCDFs and PCBs suggest possible relationships not only with neurodevelopment of infants but also diabetes and a condition called insulin resistance conditions that increase CVD risk (Longnecker 2001 Carpenter 2006 Arisawa 2005)

In 1998 a study of a population in Seveso Italy exposed to relatively pure TCDD (dioxin) from an industrial accident showed increased mortality from CVD respiratory diseases and diabetes (Bertazzi 1998) These findings suggested the possibility that other dioxin-like chemicals such as dioxin-like PCBs might increase risk for these diseases Some more recent animal studies described below suggest possible mechanisms of action for PCB effects that increase risk for CVD diabetes and hypertension Another relevant source of information is from research using the biological monitoring data from the National Health and Nutrition Examination Survey (NHANES) project to look at the association between PCB levels in blood and self-reported diabetes and heart disease The body of evidence based on analytical human epidemiological studies remains relatively sparse however

Some recent studies using NHANES biomonitoring data show associations between PCB levels and CVD (Lee 2006 Lee 2007 Ha 2007 Everett 2008) These studies are cross-sectional (comparisons at one point in time) evaluations of PCB levels measured in blood samples from individuals and self-reported CVD and diabetes a risk factor for CVD While these studies use individual-level biomonitoring and self-reported health outcome data they are considered to be

7

relatively weak in terms of providing evidence of causality because both exposure and outcome are measured at the same point in time Given the nature of NHANES data collection and reporting practices these studies are not able to control for possible geographical confounding which may occur if urban versus rural residence is associated with higher PCB levels and higher risk for CVD for example The higher PCB levels among African Americans in these data suggest urban residence may be playing a role in the PCB data

The results of animal studies suggest multiple mechanisms by which PCB exposures could affect cardiovascular risk as well as multiple types of adverse health outcomes One hypothesized mechanisms of action specifically for dioxin-like PCBs is that they may have anti-estrogenic effects which impact CVD risk factors such as serum cholesterol levels and blood pressure (Lind 2004) Several studies suggest PCBs interact with a specific receptor (AhR) and increase cellular oxidative stress causing inflammation cell injury and cardiovascular dysfunction (Hennig 2002) Similarly some studies suggest PCB-induced inflammation of fat cells leads to obesity and increased risk for coronary artery disease (atherosclerosis) (Arsenescu 2008)

3 FOLLOW-UP STUDY DATA SOURCES AND METHODS

This follow-up investigation will provide perspective on the 2004-2005 findings (Sergeev and Carpenter 2005 Shcherbatykh et al 2005) by producing results from analyses conducted at a more detailed level of geography the block group rather than ZIP code in order to better control for confounding When the term confounding is used by epidemioogists it refers to the problem that can arise when there is a factor such as income level that is not the factor of interest in the study but that is related to the risk factor being studied (residential location ndash proxy for exposure) and to the disease being studied (CVD) This type of factor is labeled a ldquoconfounderrdquo because it can confound ie interfere with or confuse the evaluation of associations among the variables of interest in the data All types of epidemiological studies face issues of confounding but ecological ie group-level analyses face additional challenges related to potential confounding because analyses can not account for some important individual differences within the groups that are the units of analysis (Morganstern 1998b Hertz-Picciotto 1998) This investigation was able to control for each individualrsquos age sex and race but not for individual-level risk factors such as income or education level

The follow-up investigationrsquos specific objectives are to

improve the ability to control for confounding by socio-economic status through the use of block groups rather than ZIP codes as the unit of analysis

compare the hospitalization rates among residents of block groups nearest the Hudson River to those among residents of block groups farther from the Hudson River

compare the findings of this follow-up study to those of the earlier studies

Study Area and Exposure Areas

The overall study area is comprised of the 12 counties with geographic areas abutting the Hudson River south of the source of PCB contamination from the General Electric facility in Hudson Falls The 12 counties are Westchester Rockland Putnam Orange Dutchess Ulster

8

Columbia Greene Rensselaer Albany Washington and Saratoga US Census boundaries are used to define the study area because census population estimates provide the data for the study area populationrsquos size and the populationrsquos age sex and race distribution Because the study period spanned 1990 through 2005 and populations change over time 1990 and 2000 Census data were used to linearly interpolate the between-year populations from 1991 through 1999 for each block group 2000 data were used without extrapolation to estimate block group populations from 2001 through 2005

Because block group boundaries change between the Census years 1990 block populations were normalized to fit 2000 block group boundaries before the estimation of sex age and race group specific populations In the digital map 2000 block group boundaries were overlaid on the 1990 block centroids to assign 1990 block population counts to the corresponding 2000 block group boundaries Then 1990 block populations were summed to fit the block group boundaries Next population counts by age group (0-14 15-24 25-34 35-44 45-54 55-64 65-74 75+) sex (male female) and race (white black asian other) were estimated for each year at the block group level This investigation includes all ages over 25 years and the study population is divided into four race categories white black or African-American Asian and ldquoOtherrdquo The Pacific Island group was combined with the Asian group and American Indian ldquootherrdquo and ldquomultiple racesrdquo categories are combined into a larger ldquootherrdquo category

Classification of the population into potential exposure categories for this study is based exclusively on location relative to the Hudson River Two types of exposure proxy categories were developed block group adjacency and block group centroid distance to the river shore For the adjacency categories block groups were assigned to the adjacent category if they directly abutted the Hudson River and all other block groups in the 12 counties were assigned to the nonshyadjacent category

For the distance categories a central point (centroid) within each block group was identified and used to assign block group centroid distance to the river shore The block group centroids were population-weighted which means the geographic distribution of households within 2000 census blocks were used to locate block group centroids based on the geographic population distribution By locating a population-weighted block centroid rather than a more straightforward distance centroid the assignment of block group distance to the river shore more accurately reflects distance to the river for the majority of block group residents Buffers of one-half mile and one mile from the Hudson River boundary were created for the assignment of each block group into one of the three distance categories less than frac12 mile (closest) frac12 to one mile (close) and farther than one mile from the river (distant)

Distance categories were selected a priori at the start of the study Because environmental sampling data were not collected along the length of the river included in this investigation we are not able to determine whether these distance categories represent actual differences in exposures As discussed previously a statistically significant reduction in the total PCB concentrations in outdoor air samples was observed at a distance of 1200 meters (34 mile) from the river from 0760 ngm3 to 0497 ngm3 in a study conducted in the Hudson Falls Fort Edward area near the source of PCBs (Fitzgerald 2007 Palmer 2007)

9

For the adjacency categories of the 2371 block groups in the study area 255 (11) were classified as adjacent to the Hudson River and 2116 (89) were classified as non-adjacent (Figure 2) For the distance categories 225 (9) were classified as closest to the river 216 (9) as close and 1930 block groups (81) as distant from the river

Comparison of Figures 1 and 2 show that block groups include smaller geographic areas than ZIP codes and that block groups are relatively smaller in area in urban versus rural areas The study area includes a total of 368 ZIP codes compared to a total of 2371 block groups ZIP code average population in the study area is approximately 8000 compared to block group average population of 1200 Comparison of Figures 2 and 3 shows that using population-weighted centroid distance to the river produces different exposure areas than adjacency Fifty-three (20) of the adjacent block groups were categorized for the distance indicator as distant because the population in the block group was concentrated in the area farther than one mile from the river On the other hand about 11 of the 2116 non-adjacent block groups were classified as closest (77) or close (162)

Hospitalization Data

Hospitalizations for ischemic heart disease (IHD) and cerebrovascular disease (stroke) were included in this study More specifically hospital discharge records that listed as the principal diagnosis either IHD (ICD-9-CM 410-414 and 4297) or cerebrovascular disease (ICD-9-CM 430-436) and subsets of each of these categories similar to the subsets evaluated in the Sergeev et al and Shcherbatykh studies were included The subsets are AMI and angina subsets of IHD and ischemic and hemorrhagic stroke which are subsets of cerebrovascular disease (stroke) (Principal diagnosis represents the illness for which the person was admitted to the hospital) Angina is an outcome not evaluated in the prior studies It was added here for comparative purposes because it represents an outcome appropriately addressed by primary outpatient rather than inpatient hospital care Angina is one of a group of outcomes included in the recently developed national ldquoPrevention Quality Indicatorsrdquo (USDHHS AHRQ 2007) as an indicator for geographic areas or populations for whom primary care is inadequate Table 3 lists the specific ICD-9-CM codes and definitions included in this investigation

While one of the previously published studies (Huang et al 2006) showed elevations in hypertension (high blood pressure) rates this outcome is rarely listed as the principal reason for hospitalization As a secondary diagnosis however hypertension is extremely common with substantial overlap with the primary diagnoses of IHD and stroke In addition hypertension hospitalizations were not as elevated in the ZIP codes adjacent to the Hudson River as the IHD and stroke outcomes As a practical matter geocoding the hypertension hospitalization data (1364000 hospitalizations) would overwhelm the resources for this project For these reasons hypertension was not included in this follow-up investigation

The source of the hospitalization data was the NYS DOH Statewide Planning and Research Cooperative System (SPARCS) established in 1979 to collect detailed records on discharges from hospitals located in New York State Persons seen in the emergency department but not admitted are not included in the SPARCS data Hospitalizations at US Veterans Administration

10

Hospitals are also not included in this dataset We identified the hospital transfers in the data and excluded them so that individuals are only counted once for the same event

The NYS DOH Data Protection Review Board for SPARCS hospitalization data approved our use of hospitalization records for this investigation and provided approximately 460000 records with a principal diagnosis of either IHD or stroke Because of the large number of cardiovascular hospitalizations and this studyrsquos requirement for geocoding each hospitalized personrsquos address we selected a 50 random sample of the hospital discharges with principal diagnoses of IHD or stroke for inclusion in the study (229948 records) Approximately one percent of these records (2737) were eliminated from the study because they contained no residential address resulting in 227211 hospitalization records 24939 of these records were eliminated after geocoding because they were not within the 12 county study area This left 202272 records remaining in the study

For geocoding to the block group level the hospitalization street addresses were modified as needed using US Postal Service standards The addresses were assigned geographic coordinates using commercially available geocoding software (MapMarker 2004) A combination of land parcel data and street centerline files were used to assign geographic coordinates No contact was made with cases parents legal guardians or next of kin of cases to determine residential locations

Once geographic coordinates were assigned to cases through address-matching the case locations were overlaid onto digital maps of the study area using a geographic information system so that the number of observed cases falling within the study area boundaries could be determined and a 2000 Census block group could be assigned to each hospitalization In order to protect confidentiality no maps of individual case locations are provided Geographic coordinates were not able to be assigned for some addresses for example rural routes or post office boxes Internet searches were conducted for addresses that were located in facilities such as nursing homes and trailer parks

While fewer than one percent of records were missing sex or age information approximately 15000 records seven percent were missing race information Evaluation of the geographic and temporal distribution of these records showed they were not randomly geographically distributed 47 of the records with no race information were from one hospital for that hospital race was missing from 100 of the records in all years except one after 1997 mid-way through the study time-frame Evaluation of other records missing race showed similar patterns of 100 of records missing race in specific years In order to prevent biased hospitalization rate estimates records missing race were assigned race randomly assuming a race distribution equal to that of the individualrsquos residential block group

Statistical Analysis

This investigation used negative binomial multivariable regression models the statistical methods used in the previous studies (Sergeev and Carpenter 2005 [IHD] Shcherbatykh et al 2005 [stroke]) The follow-up studyrsquos analyses compared hospitalization rates for populations in block groups adjacent or in close proximity to the Hudson River to rates for populations more

11

distant from the Hudson River while adjusting for the potential influence of other factors Sex age and race-specific hospitalization count data comprise the dependent variable and the statistical methods attempt to predict the counts using the sex age and race distributions from Census data as well as block group-level demographics such as median income Models were first estimated using basic Census information for age sex and race Then more fully adjusted models were estimated using information at the block group level for income education population density Hispanic ethnicity and distance to the nearest hospital

More specifically each hospitalized personrsquos block group was used to assign quartile categories of block group median household income of population with less than a college education population density of the population identified as Hispanic and distance from the block group centroid to the nearest hospital These variables were selected in advance The population density variable was selected to account for potential effects on hospitalization rates of large differences in population density from the relatively rural northern counties to the more urban southern counties in the study area of the population identified as Hispanic was included because Hispanic ethnicity may be associated with CVD risk but Hispanic ethnicity was not available in the hospitalization records Distance to the hospital was included because studies show that shorter distances increase the likelihood that individuals with AMI and stroke survive to be admitted to the hospital

As described previously the literature about risk factors for CVD shows strong associations between lower income and education and higher CVD risk Regarding the effects of income and education on hospitalization rates the literature from analytic studies specifically of hospitalizations for CVD is quite sparse Most analytic studies focus on prevalence incidence or mortality in population-based studies or cohort studies and there are not many of these An example of a study that was useful as background is a study of income level and CVD mortality that showed strong associations of lower income and higher CVD mortality for men and women with the relationship for men being linear throughout the range of income levels while for women the relationship was only strong at levels of income below the median (Rehkopf 2008)

This follow-up studyrsquos approach was to use the same set of variables for the more fully adjusted models using quartile cut points for all the sexrace groups and types of CVD diagnostic subsets The 2000 Census data were used to create the population-weighted quartile categories to be used in the statistical analyses The quartile cut points are shown below

population density(persons per square mile)

median 1999 household income

with less than a college education

with Hispanic ethnicity

Hospital distance

Quartile 1 61 - lt507 $2499-lt$40968 0 - lt31 0 - lt2 lt25 (median) Quartile 2 507 - lt102 $40968 - lt54650 31 - lt43 2 - lt5 25 - lt6 (50-75) Quartile 3 102 - lt6410 $54650 - lt72944 43 - lt55 5 - lt10 6 ndash 11 (75-90) Quartile 4 6410 - 94372 $72944 - 200001 55 - 100 10 - 86 11 ndash 45 (90-100)

SAS version 9 provided the statistical programs used to estimate hospitalization rates for exposure proxy categories and demographic categories The models provide ldquocontrast

12

estimatesrdquo ie rate ratios for one category versus a reference category of a variable and 95 confidence intervals (CI) for these rate ratios For the location categories rate ratios for the areas adjacent are compared to areas not adjacent to the river (reference) and areas closest and close to the river are compared to areas distant from the river (reference) If a rate ratio is greater than 100 the model has estimated an elevation of hospitalizations in that category of the study population compared to the reference population If the rate ratio is less than 100 then there are fewer than expected hospitalizations compared to the reference population

The magnitude of the elevation or deficit is also estimated by the rate ratio For instance if the hospitalization rate is twice as high in the adjacent population it would result in a rate ratio of 200 while a 50 higher rate would result in a rate ratio of 150 If the hospitalization rate in the comparison group was only half as big as in the reference group this would result in a rate ratio of 050 The 95 confidence interval assists with determining whether the observed differences are statistically meaningful ie statistically significant The confidence interval provides the range in which there is a 95 probability of including the true rate ratio

In addition to directly assess potential confounding we examined the associations between income and exposure proxy categories and between income and CVD hospitalization risk Because the modeling process provides rate ratios that provide comparisons among groups but not the rates themselves we also calculated the rates per population for each sex race and income category in order to directly observe the relative magnitudes of CVD hospitalization rates Finally analyses stratified by income quartile were also conducted in order to assess the potential role of confounding by income The stratified analyses consist of negative binomial regressions conducted separately for each quartile of income

4 RESULTS

Demographics Associations with Proximity to the River

Study Population Table 4 provides detailed demographic characteristics for the total study population and the populations of the adjacent versus non-adjacent and nearer versus farther from the river block groups from the 2000 Census The total population of the study area for this one year of the study time period is 2924631 persons The first row of Table 4 shows that approximately 11 percent of the study area population is classified as living in block groups adjacent to the Hudson River and 89 live in non-adjacent block groups Approximately 8 of the studyrsquos population lives in the closest block groups approximately 10 lives in close block groups and approximately 82 in distant block groups The table also shows that approximately 81 of the total study population is identified as white 9 black 3 Asian less than 01 Pacific Islander 02 American Indian 4 ldquootherrdquo and 2 ldquomultiple racesrdquo

The 2000 Census data (Table 4) also show that the populations in block groups adjacent and nonshyadjacent to the river are similar in terms of the percent of minority population but the adjacent block groups include a smaller percentage of persons who have attended college and a higher percentage of households below the poverty level Median income is $47320 for block groups adjacent versus $62668 for block groups not adjacent to the river Data for the study area from the 1990 Census show similar distributions (data not shown)

13

For the distance to the river categories there is a gradient for the three levels of distance from the river with the closest block groups on average having higher percentages of minorities smaller percentages of persons who have attended college and higher percentages of households below the poverty level (Table 4) The populations of the close and closest block groups are more similar to each other than to the populations of the distant block groups The population of the closest block groups is estimated to be 19 black close block groups 14 black and distant block groups 8 black The proportions for the ldquootherrdquo category used in the analyses which includes multiple races and American Indians are approximately 12 in both the closest block groups and in the close block groups compared to 5 in the block groups most distant from the river Median income for the closest block groups is $40748 close block groups $44962 and most distant block groups $64847

The bar chart below illustrates the correlations between US Census 2000 indicators of block group socio-economic status and block group centroid (population weighted) distance from the river for the study population

2000 Census Demographics for Study Population by Distance to River Category

black other hispanic no college below poverty

of pop ulati on

0

10

20

30

40

50

60

lt12 mile

12 to 1 mile

gt1 mile

Table 5 shows selected demographic information for the person-year estimates used in the study analyses for the 1990-2005 study period Person-year data represent individuals from Census estimates counted repeatedly for each year of the study timeframe As described in the Methods section the estimation of person years used interpolation to account for population changes from 1990-2000 and techniques for taking account of block boundary changes The person-years estimates for demographic categories (complete data not shown) show similar patterns as the 1990 Census data (data not shown) and 2000 Census data (Table 5) Table 5 for example shows nearly identical percentages of population in each of the river distance categories as Table 4

Table 5 also shows person year information for income quartiles and exposure proxy categories as well as race and exposure proxy categories that is consistent with the Table 4 data for the

14

2000 population Table 5 row percentages show for example that while approximately 11 of the total study population live in adjacent block groups approximately 19 of the population in the lowest income quartile live in adjacent block groups By design approximately 25 of the total study population is in each of the income quartiles Table 5 column percentages show however that in the adjacent block groups 41 of the population is in the lowest income quartile and only 9 are in the highest income quartile

For the block groups within frac12 mile of the river the distribution among income quartiles is even more skewed towards the lowest income quartile 61 of the population living within frac12 mile of the river is in the lowest income quartile and only 6 is in the highest income quartile Given that approximately 25 of the total study population is within each income quartile these percentages indicate that living within the block groups closest to the river is associated with more than a doubling of the likelihood of being in a block group in the lowest socio-economic status quartile 436 of those living in the block groups close to the river (12 to one mile from the river) are in block groups in the lowest income quartile While less than a doubling this indicates that living in the close block groups is associated with a 75 increase in the likelihood of being in the lowest quartile for socio-economic status

Hospitalizations Table 6 shows the process of removing hospitalization records from the study for a variety of reasons The first column includes the 202272 records that contained a residential address and were located within the 12-county study area as described in the Methods section The next five columns show the records remaining after data cleaning exclusion of transfers elimination of records with missing covariates and exclusions due to poor geocoding results Approximately 33000 records were eliminated because they were transfers from one hospital to another for the same event For approximately 1100 records geocoding was inadequate for assigning block group for imputing race as described in the Methods section 336 records were missing sex (11) or age (325) Of the 167733 records remaining 154220 (92) were able to be adequately geocoded to a street address for assignment in a specific block group While approximately eight percent of hospitalizations were eliminated due to inadequate geocoding Table 6 shows that the percentages of hospitalization records for white black Asian and ldquootherrdquo categories and the distribution among diagnostic categories change very little after these records were dropped

The final column of Table 6 shows that of the 154220 hospitalizations in the analysis 101012 IHD hospitalizations comprise the largest health outcome category and are 65 of the hospitalizations in the study AMI hospitalizations (35811) account for 35 of the IHD hospitalizations and angina (4731) accounts for five percent of the IHD hospitalizations Stroke hospitalizations (53208) comprise 34 of the studyrsquos hospitalizations Approximately 36 of the stroke hospitalizations are for ischemic stroke (19371) and approximately 11 are for hemorrhagic stroke (5995)

Table 7 shows the distributions of hospitalizations among the four income quartiles While each income quartile (by definition) contains approximately 25 of the study population for all types of hospitalizations we see that the lowest income quartile includes more than 25 of hospitalizations from a minimum of 286 (IHD) to a maximum of 356 of hospitalizations (angina) The highest income quartile includes fewer than 25 of hospitalizations for each CVD

15

category ranging from 182 for angina to 224 for hemorrhagic stroke These deviations from the 25 quartile distribution show a strong linear pattern with the largest percentage of hospitalizations coming from residents of block groups in the lowest income quartile and the smallest percentages coming from residents of the most wealthy block groups For AMI the lowest income quartilersquos share of hospitalizations is 50 higher than the highest income quartilersquos share of hospitalizations (30 vs 20) IHD and stroke show similar differences and angina shows a doubling with the lowest income quartile contributing 36 of hospitalizations compared to the highest income quartilersquos 18 share of hospitalizations

Table 7 also shows the distribution of hospitalizations among the race categories and the study population race category distribution for comparison Blacks show lower percentages of IHD and AMI than their population percentages and higher percentages of angina and stroke While full presentation of race and sex findings is beyond the scope of this report the lower than expected percentages of IHD and AMI among blacks who on average have lower socioshyeconomic status and are at greater risk for CVD hospitalizations raised questions that were addressed by calculating hospitalization rates per 1000 population for each sex and race group for each income quartile (Appendix B Table 2) Figures 4 5 and 6 show these rates for the white black and Asian categories (These rates for the ldquootherrdquo category are comparatively very high and are not shown on these graphs This issue is addressed in the Discussion section) Figures 4 5 and 6 show the consistent and strong effects of income quartile on hospitalization rates for AMI and stroke for whites blacks and Asians of both sexes The other outcomes show similar patterns (The ldquootherrdquo group showed an opposite pattern [See Discussion Section])

Demographics Summary

The strong associations of lower income with residence closer to the river (Table 5) and lower income with higher hospitalization rates (Tables 7 amp 8) create a difficult challenge for this investigation The multivariable analyses of hospitalization rates attempt to control for socioshyeconomic differences However because these income differences which strongly affect hospitalization rates are also strongly correlated with the gradations of exposure (river distance) it will be difficult to disentangle the influence of socio-economic status from the influence of potential exposures

Statistical Analyses Proximity to the River and CVD Hospitalization Risk

All the variables described above were included in the multivariable regression models and almost every quartile category for every variable showed statistically significant and expected types of effects for all the CVD hospitalization categories (Appendix B Tables 1a and 1b) The following presentation of results focuses on the rate ratios for the exposure proxy variables Table 8a (IHD) and 8b (stroke) show the regression modeling results Results are shown first for models that include only age sex and race in addition to the exposure proxy categories and second for models that also include median income education population density Hispanic ethnicity and distance to nearest hospital The tables provide numbers of hospitalizations and person-years to assist with interpretation For example for total IHD we can see that while 108 of the study population lives in adjacent block groups 122 of the hospitalizations are among people in adjacent block groups This difference of 14 percentage points is the basis for

16

the finding of a statistically significantly elevated rate ratio (risk) for IHD among people living in the adjacent block groups (RR=133 CI 112-157)

Looking across the columns of Tables 8a and 8b we can see the rate ratios for each of the outcomes for the adjacent versus non-adjacent block groups and the results are similar to those for IHD The elevated rate ratios range from 123 for angina to 136 for AMI The only rate ratio not elevated in these first models adjusting only for age sex and race is for hemorrhagic stroke Going down the rows of the Table the next set of rate ratios are for the distance to the river categories The findings are very similar to those for the adjacent block groups Here however we have two exposure proxy categories allowing comparison of the two These rate ratios show no evidence that living in the block groups closest to the river (lt12 mile to the river) produces greater risk than living in the block groups close to the river (12 to 1 mile from the river)

The next set of rate ratios are for models with additional adjustment for the demographics of each hospitalized personrsquos block group as well as distance to the nearest hospital As expected adjustment for block group median income and other demographics accounts for some of the variability in hospitalization counts and thereby reduces the magnitude of the rate ratios estimated for the river adjacency and distance categories The rate ratio elevation for total IHD for example is reduced from 133 to 107 (RR=107 CI 104-111) The elevated rate ratios in these more fully adjusted models for the adjacency analyses are of similarly small magnitude ranging from 106 for AMI to 110 for ischemic stroke Four of the six outcomes show statistically significant elevations Again only hemorrhagic stroke shows no evidence of an elevated rate ratio

For the river distance categories none of the rate ratios show any evidence that living in the block groups closest to the river (lt12 mile to the river) produces greater risk than living in the block groups close to the river (12 to 1 mile) Of the six types of outcomes only two show statistically significantly elevated rate ratios for living in the closest block groups (total stroke and ischemic stroke) For the close block groups four out of six outcomes show statistically significantly elevated rate ratios The statistically significantly elevated rate ratios vary from 106 (AMI stroke) to 113 (angina)

To address the strong evidence described earlier for confounding in these data ie the association of lower income with both CVD hospitalizations and residence closer to the river we also conducted the analyses separately for each quartile of income These results are presented in Table 9 The first results are from analyses adjusting only for age sex and race and the second set of results are from models including the additional demographic factors For simplicity rate ratios are shown only for block groups closest to the river compared as in the other models to block groups farther than one mile from the river (Results were similar for the other distance categories (data not shown)

For the two lowest income quartiles Table 9 shows results similar to those for the study population as a whole (Table 8a 8b) However for the two highest income quartiles which represent frac12 of the study population Table 9 shows a consistent pattern of reduced rate ratios for residence in the closest block groups within frac12 mile of the river In the highest income quartile

17

these reduced rate ratios are statistically significant for almost all of the outcomes in both the partially and more fully adjusted models In addition the magnitude of the rate ratio estimates ie their departure from 100 is relatively high For the population living in block groups with the highest median income (income gt$73000) living within frac12 mile of the river is associated with having about 70 of the risk for IHD AMI and stroke (CI=068 069 069) as the population in equally high income block groups farther than one mile from the river In other words among the highest income quartile living this close to the river is associated with an estimated 30 reduction in risk for these types of hospitalizations For angina (CI=028) there is an estimated 70 reduction in risk and for ischemic stroke (CI=058) the reduction is estimated to be 40 These stratified analyses show that in one-half of the study population the two higher income quartiles there is no evidence of elevated hospitalization risk for populations in block groups closer to the river On the contrary reduced rate ratios are shown for all outcomes in the higher income quartiles for residents of block groups within frac12 mile of the river compared to those living farther than one mile from the river

Weighted average rate ratio estimates are a standard method for estimating a general population finding from stratified analyses and are also provided in Table 9 These average rate ratios range from 078 to 102 showing no evidence of elevated risk for the overall population associated with residence within frac12 mile of the river compared to living farther than one mile from the river Results for the other categories of river proximity showed similar results (data not shown) (Given the evidence of confounding interfering with analyses of the entire study population and the opposing findings depending on income quartile no confidence intervals are provided here for the overall population-weighted average estimates)

Summary of Results

In summary the multivariable regression results for the entire study population (Tables 8a 8b) show statistically significantly elevated rate ratio estimates for the adjacent versus non-adjacent block groups for four out of six hospitalization diagnosis categories The analyses also produce statistically significantly elevated rate ratio estimates for block groups closest to the river (less than frac12 mile from the river) and close to the river (between 12 to 1 mile from the river) compared to distant block groups (farther than 1 mile from the river) for most of the outcome categories However the findings provide no evidence that the closest block groups have higher hospitalization rates than the close block groups

The most elevated rate ratio estimate is 113 (angina) indicating an estimated 13 percent increase in risk for angina hospitalizations for residents of close versus distant block groups The only outcome with no statistically significant elevations for living near (adjacent closest or close) the river in the more fully adjusted models is hemorrhagic stroke It is noteworthy that the outcome that is considered an indicator of inadequate primary care rather than of incidence or prevalence angina hospitalizations shows the highest elevations associated with residence near the river (Table 8a) In addition the hospitalization outcome with the least association with socioshyeconomic status in our data (Appendix B Table 1b) and the lowest out-of-hospital mortality (NYS DOH2009b) hemorrhagic stroke is the outcome that shows no pattern of elevations associated with residence near the river (Table 8b) This pattern of findings is consistent with the interpretation that the rate ratios for living closer to the river are reflecting the influence of

18

lower socio-economic status and other known risk factors for hospitalizations for CVD rather than any unusual environmental exposures in areas near the river

The stratified results add strong support to this conclusion They suggest that classical confounding of the exposure-disease relationship by socio-economic status produced biased results in the overall multivariable regression analyses While the regression analysis of the entire study population shows elevated hospitalization rates for residents of block groups near the river compared to residents living farther away (Table 8a and 8b) the stratified analyses (Table 9) show this association applies only to the 50 of residents living in block groups with lower median incomes For the other 50 of residents those who live in block groups with higher median incomes hospitalization rate ratios are reduced for those living near the river Classical confounding refers to the type of confounding that can occur in individual-level or group-level (ecological) studies when a factor such as socio-economic status affects both the exposure (residential location) and the health effect In this study of CVD hospitalizations there are disproportionate numbers of outcomes and exposures (residence near the river) among specific levels of the confounder the lower income quartiles (Tables 4 5 7) The disproportionate numbers of hospitalizations and potential exposures (block groups close to the river) among the lower income quartiles create an artifactual exaggerated ie biased effect on the estimates for the overall analysis (Greenland 1999 Greenland 2003)

5 DISCUSSION

The key finding of this investigation is that the results of the multivariable regression modeling for the entire study population the elevated CVD hospitalization rates for populations living closer to the Hudson River are likely to be artifacts of classical confounding Stratifying on the confounding variable ie holding median income level constant the standard method for controlling for confounding showed the decisive role of income as a confounding variable The associations between living closer to the river and having higher hospitalization rates were evident only for people in the two lowest income quartiles For people in the two highest income quartiles the association was completely opposite ie people living closer to the river had lower rates of hospitalizations This variability of effects can be interpreted as evidence that there are no actual effects of the exposure-proxy categories on hospitalization rates

This type of classical confounding due to the strong influence of a variable that precedes causes or strongly affects both the exposure and the health outcome can occur in any type of epidemiological study The ecologic ie group-level analysis in this study is not the primary cause of the confounding Rather the complex associations in the population among income education CVD risk geographic residential patterns and geographic assignment of exposure status are the basis for the confounding However the studyrsquos group-level design contributes to the studyrsquos inability to completely control for confounding using multivariable regression methods and contributes to difficulties of interpreting the study findings

The stratified results shown in Table 9 may also be a reflection of inadequate control for socioshyeconomic status using quartile groupings A possible explanation for the Table 9 findings is that among lower income quartile populations people living in closest proximity to the river may have the lowest incomes while in higher income quartile populations people living in closest

19

proximity to the river may have the highest incomes This potential income difference associated with river distance within quartile groupings would explain why the stratified analyses showed positive associations with hospitalization risk for lower income residents closest to the river and negative associations with hospital risk for higher income residents closest to the river In this example of a possible explanation it is assumed that river proximity categories are capturing relative income (and hospitalization risk) differences rather than exposure Because the exposure proxy river distance categories are assigned at the group level this type of analysis is not able to distinguish whether these categories are capturing income rather than potential exposure in the analyses

When trying to interpret this studyrsquos findings it is important to keep in mind the many factors associated with CVD risk that are not controlled for in this studyrsquos analyses These include tobacco use high blood pressure high serum cholesterol alcohol use obesity low fruit and vegetable intake physical inactivity diabetes mental ill-health psychosocial stress and the use of certain medications Another factor that is not known to be associated with CVD risk but that is relevant for PCB exposure is consumption of contaminated fish It is possible that lower income populations are more exposed to PCBs because they are more likely to eat fish from the Hudson River This however would not help explain why higher income people near the river showed lowered CVD hospitalization risk than their counterparts living farther from the river

In addition to the confounding due to socio-economic statusrsquos effects on both the risk of CVD and residential location (the exposure proxy) there is the possibility that another pattern of effects also associated with socio-economic status called effect modification plays a role in these data Effect modification occurs when different groups people with differing types of health insurance or differing behaviors for example show different patterns or levels of health effects associated with the same exposures The differing effects (CVD hospitalization risks) of residence in block groups closest to the river by income quartile could be interpreted as evidence of effect modification (Table 9) However the magnitude and the change in direction of effects is unlikely to be due solely to differing biological or behavioral factors associated with income level that affect the results of exposures from living near the river Given the strong associations of socio-economic status and CVD hospitalization risk it is more likely that the analyses stratified by income enhance protection against bias compared to the multivariable models and that the stratified analyses represent findings from better control for confounding as opposed to effect modification

Effect modification is a concept used to describe important relationships between exposures and effects (that may differ among sub-groups for example) that the researcher wants to identify and understand Confounding on the other hand refers to associations among the variables that the researcher is trying to control so that the effects of interest (effects from exposures rather than age or sex for example) can be identified and the strength of the effects can be estimated The ecological design of this study contributes to the difficulty of interpreting whether the study shows evidence of effect modification as opposed to confounding (Morgenstern 1998b Hertz-Picciotto 1998)

Another limitation associated with this studyrsquos design is that block-group level socio-economic and environmental attributes that are assigned to individuals in this type of study can be more

20

strongly correlated at the group level than at the individual level In this way the ecological assignment of variables may lead to increasing problems of confounding because of strong associations among indicators of socio-economic status and environmental exposure categories (Morganstern 1998a) This problem is likely to have affected this studyrsquos results As with effect modification by group described above this type of confounding by group cannot be evaluated with ecologic data because individual-level information is needed to check for this type of ldquocross-levelrdquo bias

Ecological analyses are an important tool for environmental epidemiology and there is therefore a large and growing literature on their strengths and limitations and methods to address limitations (Piantadosi 1988 Greenland 1989 Greenland 1992 Greenland 1994 Richardson 2000) One particular concern is the interpretation of ecological study findings showing modest elevations of risk such as in the current study with rate ratios in the more fully adjusted models generally estimated to be below 110 (Wakefield 2003 Morgenstern 1998) Wakefield states that slight elevations need to be interpreted with great caution

in particular for cancer and heart disease (because) there are risk factors that are far more predictive of disease than environmental factors hellip Consequently the potential for confounding is strong since ecological studies do not directly use individual-level risk factor data although stratification by age and gender is routinely carried out (Frequently an area-level measure of socio-economic status is also used as a confounding variable but being an aggregate summary it only provides very crude control) (Wakefield 2003 p 9-10)

A strength of this study is that it used Census block groups which are relatively small average population of 1200 in the study area to assign socio-economic characteristics rather than the larger ZIP codes with average population size 8000 in this area By using block groups this study sought to reduce confounding by socio-economic status because the assignment of median income and other variables at the block group level was expected to be more accurate with block group populations expected to be more homogenous than larger and more geographically diffuse ZIP code populations used in the prior studies

The use of census block groups rather than ZIP codes likely enhanced the accuracy of the group-level category assignments However this type of design that assigns characteristics to hospitalizations using group-level characteristics and distance to the river is limited in comparison to individual-level designs that gather such information from personal interviews for example Most importantly for this investigation the group-level assignment of median income and other demographic variables to each hospitalization limited the studyrsquos ability to distinguish among individuals within the exposure proxy categories and thereby reduced the studyrsquos ability to control for confounding

This type of measurement error or inaccuracy from assigning group-level potential exposure income or educational level to the individual hospitalizations is generally assumed to bias a study towards findings of no associations or no effects But in situations where the measurement error is more severe for some variables than others and these variables are correlated then the study findings can be biased either towards exaggerated effects or no effect In this

21

investigation potential measurement error associated with the assignment of exposure categories based on block group centroid distance to the river is expected to be relatively low because a large proportion of the hospitalization addresses are very clearly in areas distant from the river The measurement error associated with the assignment of group-level demographic characteristics (income and education for example) from the Census is potentially relatively higher because this type of error can occur throughout every block group in the study

Another strength of this study is the use of NYS SPARCS data a relatively comprehensive population-based source of information about hospitalizations These data include individual-level information used in this study for patient address sex age race and diagnosis Of these variables only the address and race categories were missing information for substantial numbers of records and this limitation could have contributed to imprecision sufficient to create bias in the findings These issues are discussed in greater detail below

While geocoding to block groups represents a strength of this study this procedure required high quality address information One percent of the study arearsquos hospitalization records contained no address and eight percent of the addresses could not be accurately geocoded to a block group resulting in nine percent of hospitalizations being eliminated from the study because of address andor geocoding issues The distributions of race categories and distribution among the types of diagnoses did not change after dropping records with no or insufficient address information suggesting that the dropped records did not represent a substantially skewed group of records

Inaccuracy in the studyrsquos assignments of race categories results from limitations associated with both the hospitalization and Census data The SPARCS hospitalization database did not include information on race for seven percent of the studyrsquos hospitalizations We imputed race by assigning race category randomly based on the race distribution of the personrsquos block group This method would have biased the study toward a finding of no effect of race on hospitalization risk

Another limitation of the SPARCS data used in this investigation is that repeat hospitalizations for the same individual were not able to be identified If an individual was admitted to the hospital many times over the course of the study period then that individual would be counted multiple times This limitation may have contributed to an exaggerated association of lower socio-economic status and CVD hospitalizations because some people with lower incomes receive less adequate outpatient care and may have more repeat admissions Because of the association of lower income with residence near the river this limitation may have contributed to the difficulty of adequately controlling for confounding by socio-economic status in this investigation

While the use of Census data at the block group level rather than at the ZIP code level is a strength in terms of relative homogeneity among the population within block groups it is possible that block group population estimation errors contributed to imprecise estimation of hospitalization rates Particularly with regard to the race categories the study finding that the ldquootherrdquo category showed a counter-trend of higher income and higher CVD hospitalization rates (Appendix B Table 2) is evidence that differences between the categories used in the Census race designations and hospitalization race assignments affected the estimated hospitalization

22

rates The evolving choices for identifying onersquos race in the Census are not concurrently reflected in the SPARCS data and these inconsistencies likely contributed to imprecision in the estimates of CVD hospitalization risk associated with specific race categories These issues did not contribute to bias for the overall findings however We modeled the data with a variety of race category groupings combining black and ldquootherrdquo for example and the overall results did not change (data not shown)

Also regarding the Census undercounting of approximately two percent for the US population in 2000 was estimated to be primarily among the lowest socio-economic groups and minorities while over-counting of one percent was estimated to have occurred among the most wealthy who sometimes received and completed forms at multiple residences (United States Censusrsquo Monitoring Board 2001) The undercounting of the lower income population would result in over-estimation of rates in this population Undercounting of recent Asian immigrants for example particularly those of lower socio-economic status is a potential reason for the relatively high rates of CVD hospitalizations estimated for lower income quartiles of the Asian population (Figures 4-6) Models that grouped the Asian population with other race categories were also estimated and these showed the same results as the models reported here (data not shown) Undercounting of lower income groups in general would have contributed to the finding of higher hospitalization rates in these groups and added to the difficulty of adequately controlling for confounding of lower income with residence near the river

This study used a 50 percent sample of the CVD hospitalizations in the study area This was accomplished by using every other record provided in the complete dataset This would be expected to provide a non-biased random sample As a general check for data provision management or analysis errors we compared our estimates of hospitalization rates per population with those from other sources Table 10 shows these comparisons and shows that our estimates are generally consistent with rates from other sources

Actual exposures were not measured in this investigation Rather proximity to the river was used as a proxy for potential exposures associated with the river In addition other potential exposures that may impact CVD were not evaluated or measured Exposures to lead and air pollution would be expected to be higher in more urban areas and areas near heavy traffic which may tend to coincide with proximity to the river These issues were not addressed in this investigation If these exposures were occurring more often near the river and contributing to cardiovascular hospitalization rates this would contribute to findings of elevations in areas closer to the river

It is important to keep in mind the limitations associated with using CVD hospitalizations when incidence andor prevalence of CVD are the actual outcomes of interest As described previously mortality prior to reaching the hospital may reduce hospitalization rates particularly for AMI and stroke for some groups or areas more than others and introduce bias if hospitalizations are being interpreted as a surrogate for incidence or prevalence

23

Addressing Study Objectives

This follow-up report has addressed the studyrsquos first two objectives to increase control for confounding by socio-economic status through the use of block groups rather than ZIP codes as the unit of analysis and to compare the hospitalization rates among residents of block groups nearest the Hudson River to those among residents of block groups farther from the Hudson River Regarding the first objective the use of smaller units of analysis block groups rather than ZIP codes improves the accuracy of the assignment of income and other group-level demographic categories to the hospitalizations

Regarding the studyrsquos second objective to compare hospitalization rates among residents of block groups near to the river to hospitalization rates among residents far from the river this studyrsquos multivariable analyses showed relatively small but statistically significant elevations of CVD hospitalization risk for residents of block groups in close proximity to the Hudson River Stratified analyses that controlled for confounding directly by conducting analyses within each income quartile showed that residence in block groups near the Hudson River was associated with increased risk for CVD hospitalizations among residents of lower income block groups and conversely residence in block groups near the Hudson River was associated with decreased risk for CVD hospitalizations among residents of higher income block groups The estimated increased and decreased risks were of sufficient magnitude to nearly cancel each other out when average rate ratios for the four income quartiles were calculated The average rate ratios may be interpreted as showing no evidence that CVD hospitalization rates are elevated for residents living near the river for the general study population This finding suggests that the multivariable regression findings for the overall population show elevated risks for hospitalizations among residents of block groups near the river due to confounding of the relationship of residential location and CVD risk by socio-economic status

The findings from analyses stratified by income quartile may also be interpreted as showing that residing in a higher versus lower income quartile block group produces different effects from similar exposures (effect modification) As discussed previously the ecological ie group-level analyses are unable to directly evaluate confounding and effect modification Rather information about the strengths and direction of associations among variables in the analyses need to be evaluated in light of known risk factors to draw conclusions about likely explanations and plausible interpretations In this case uncontrolled confounding by socio-economic status appears to be the most likely explanation for the findings This is due to the strong association between residential proximity to the river and socio-economic status

The studyrsquos third objective was to compare the findings of this review to those of the earlier studies that prompted this investigation Because of the similarities in study design and statistical analysis the prior studyrsquos findings can be compared with the findings of the multivariate regression analyses in the current study using Table 1 and Tables 8a and 8b The major difference between the studies is the composition of the comparison areas for the analyses Because of this difference as well as other differences (Table 3) the studies are not entirely comparable

24

Table 1 shows some findings from the prior studies Comparison with Table 8a and 8b from this study shows that the CV hospitalization rate ratios estimated for living in block groups adjacent or near the river in the current study are substantially lower than the rate ratios for residence in ZIP codes adjacent to the Hudson River in the previously published studies In most cases the current analyses show elevations below 10 in contrast to elevations of 20 to 39 in the prior studies The results of the current study are quite similar to the prior study results in terms of the associations reported for age sex race and income (Appendix B Tables 1a and 1b) (Sergeev and Carpenter 2005 Tables 4 amp 5 Shcherbatykh et al 2005 Table 3) These similarities and the general methodological similarities between the current and prior studies suggest the likelihood that the prior studiesrsquo findings also were affected by confounding by socio-economic status

6 CONCLUSION AND RECOMMENDATIONS

While this type of study can neither prove nor disprove causal links among risk factors and health effects it is useful for examining evidence for unusual patterns that might warrant additional investigation This evaluation of hospitalization data residential location and socioshyeconomic factors in block groups in the 12 Counties abutting the Hudson River revealed a closely coinciding pattern of lower socio-economic status and residential proximity to the Hudson River the proxy for potential exposure and a strong association of lower socioshyeconomic status and risk for CVD hospitalizations The strong associations among residential location socio-economic status and CVD risk prevent us from drawing definitive conclusions from the multivariable regression analyses

The multivariable analyses showed relatively small but statistically significant elevations of CVD hospitalization risk for residents of block groups in close proximity to the Hudson River Stratified analyses conducted within each of the four income quartiles showed such elevations only among residents of lower income block groups This studyrsquos data methods and analyses can not provide a conclusive interpretation for this finding However the strong effect of socioshyeconomic status on residential location (exposure proxy) and on CVD hospitalization risk makes it likely that the overall multivariable regression results are affected by uncontrolled confounding by socio-economic status

To draw more definitive conclusions particularly for outcomes such as CVD that are determined by multiple known risk factors studies seeking to investigate the role of PCB or POP exposures will require resource-intensive methods that include gathering individual-level exposure or biomonitoring information as well as individual-level medical histories and information on other risk factors

This current investigation did not include environmental or biological sampling data One study described previously did gather such data in the Hudson River region with the highest potential exposures (Hudson Falls Fort Edward) This study detected a statistically significant reduction in the total PCB concentrations in outdoor air samples at a distance of 1200 meters (34 mile) from the river However the PCB levels in outdoor air in this area were lower than those in other communities with known PCB-contaminated sites and similar to levels reported in other locations in the northeastern United States The study found no detectable differences in study

25

subjectsrsquo serum PCB levels associated with proximity to contaminated portions of the river or other PCB sources (Fitzgerald 2007 Palmer 2008)

It is recommended that the results of the current investigation of CVD hospitalizations and residence near the Hudson River be shared with NYS residents and other stakeholders with interest in this issue and be submitted for publication in a peer-reviewed journal

26

AGENCY INFORMATION

New York State Department of Health Authors Elizabeth Lewis-Michl PhD

Research Scientist Bureau of Environmental and Occupational Epidemiology

June Beckman-Moore MPH Research Scientist Bureau of Environmental and Occupational Epidemiology

Marta Gomez MS Research Scientist Bureau of Environmental and Occupational Epidemiology

Acknowledgments This study was supported by ATSDR and the Centers for Disease Control and Prevention

through a Cooperative Agreement Grant to NYS DOH (3U61TS200002-19W1)

The authors would like to thank the following people for their contribution to this project Alissa Caton PhD of the Bureau of Environmental and Occupational Epidemiology NYS DOH developed the study protocol assembled the preliminary data and conducted preliminary analyses Syni-an Hwang PhD Bureau Director and Shao Lin PhD Research Scientist Bureau of Environmental and Occupational Epidemiology NYS DOH and Edward Fitzgerald PhD Professor Division of Environmental Health and Toxicology SUNY Albany School of Public Health reviewed and provided comments on draft versions

New York State Department of Health Cooperative Agreement Coordinator Don Miles MS

Public Health Specialist Bureau of Environmental Exposure Investigation

ATSDR Technical Project Officer Gregory V Ulirsch PhD

Environmental Health Scientist Cooperative Agreement and Program Evaluation Branch

Division of Health Assessment and Consultation

Mohammed Uddin MD Epidemiologist

Division of Health Studies

ATSDR Regional Representatives Leah Graziano

Senior Regional Representative - Region 2 Office of Regional Operations

Racquel Stephenson Regional Representative - Region 2

Office of Regional Operations

27

REFERENCES Agency for Toxic Substances and Disease Registry (ATSDR) 2000 Toxicological Profile for Polychlorinated Biphenyls (PCBs) ATSDR Division of ToxicologyToxicology Information Branch Atlanta GA

Arisawa K Takeda H Mikasa H 2005 ldquoBackground exposure to PCDDsPCDFsPCBs and its potential health effects a review of epidemiologic studiesrdquo The Journal of Medical Investigation Feb 52(1-2)10-21

Arsenescu V Arsenescu RI King V Swanson H Cassis LA 2008 ldquoPolychlorinated biphenylshy77 induces adipocyte differentiation and proinflammatory adipokines and promotes obesity and atherosclerosisrdquo Environmental Health Perspectives 116 (6) 761-768

Ayala C Croft J Keenan N Neff L Greenlund K Donohoo R Zheng Zhi Mensah G 2003 ldquoIncreasing Trends in Pre-transport stroke deaths- United States 1990-1998rdquo Ethnicity amp Disease 12 Spr S2131-S2137

Barnett E Armstrong D Casper M 1999 Evidence of increasing coronary heart disease mortality among black men of lower social class AEP 98 464-471

Bertazzi PA Bernucci I Brambilla G Consonni D Pesatori AC 1998 ldquoThe Seveso studies on early and long-term effects of dioxin exposure a reviewrdquo Environmental Health Perspectives April 1062 625-633

Carpenter DO 2006 ldquoPolychlorinated biphenyls (PCBs) routes of exposure and effects on human healthrdquo Reviews in Environmental Health Jan-Mar 21(1)1-23

deVreede-Swagemakers JJ Gorgels APDubois-Arbouw WI van Ree JW et al 1997 ldquoOut-of hospital cardiac arrest in the 1990s a population-based study in the Maastricht area on incidence characteristics and survivalrdquo Journal of the American College of Cardiology 30(6) 1500-5

Everett CJ Mainous AG Frithsen IL Player MS Matheson EM 2008 ldquoAssociation of polychlorinated biphenyls with hypertension in the 1999-2002 National Health and Nutrition Examination Surveyrdquo Environmental Research 108 94-97

Fitzgerald EF Belanger EE Gomez MI Hwang S Jansing RL Hicks HE 2007 ldquoEnvironmental exposures to polychlorinated biphenyls (PCBS) among older residents of upper Hudson River communitiesrdquo Environmental Research 104 352-360

Galea S Blaney S Nandi A Silverman R Vlahov D Foltin G Kusick M Tunik M Richmond N 2007 ldquoExplaining racial disparities in incidence of and survival from out-of-hospital cardiac arrestrdquo American Journal of Epidemiology 1665 534-543

Greenland S Morgenstern H 1989 ldquoEcological bias confounding and effect modification International Journal of Epidemiology 18269-74

28

Greenland S 1992 ldquoDivergent biases in ecologic and individual level studiesrdquo Statistics in Medicine 111209-23

Greenland S Robins J 1994 ldquoInvited commentary ecologic studies ndash biases misconceptions and counterexamples American Journal of Epidemiology 139747-60

Greenland S Pearl J Robins JM 1999 Causal diagrams for epidemiologic research Epidemiology 1037-48

Greenland S 2003 ldquoQuantifying biases in causal models classical confounding vs collidershystratification biasrdquo Epidemiology May Vol 143 300-306

Ha M Lee D Jacobs DR 2007 ldquoAssociation between serum concentrations of persistent organic pollutants and self-reported cardiovascular disease prevalence results from the national health and nutrition examination survey 1999-2002rdquo Environmental Health Perspectives 1158 1204-1209

Hennig B Meerarani P Slim R Toborek M Daugherty A Silverstone AE Robertson LW 2002 ldquoProinflammatory properties of coplanar PCBs in vitro and in vivo evidence Toxicology and Applied Pharmacology Jun 15181(3)174-83

Hertz-Picciotto I ldquoEnvironmental Epidemiologyrdquo in Modern Epidemiology Rothman KJ and Greenland S 2nd ed Philadelphia Lippincott Williams amp Williams 1998 pp555-584

Howard G Cushman M Prineas R Howard V Moy C Sullivan L DrsquoAgostino R McClure L Pulley L Safford M 2009 ldquoAdvancing the hypothesis that geographic variations in risk factors contribute relatively little to observbed geographic variations in heart disease and stroke mortalityrdquo Preventive Medicine 49 129-132

Huang X Lessner L Carpenter DO 2006 ldquoExposure to persistent organic pollutants and hypertensive diseaserdquo Environmental Research 102(2005) 101-106

Kannel WB Cupples LA DrsquoAgostino RB 1987 ldquoSudden death risk in overt coronary heart disease the Framingham Studyrdquo American Heart Journal 113(3) 799-804

Ko DT Krumholz HM Wang Y Foody JM Masoudi FA Havranek EP You JJ Alter DA Stukel TA Newman AM Tu JV 2007 ldquoRegional differences in process of care and outcomes for older acute myocardial infarction patients in the United States and Ontario Canadardquo Circulation 115196-203

Lee D Lee I Song K Steffes M Toscano W Baker BA Jacobs DR 2006 ldquoA strong dose-response relation between serum concentrations of persistent organic pollutant and diabetesrdquo Diabetes Care 29 (7) 1638-1644

Lee D Lee I Porta M Steffes M Jacobs DR 2007 ldquoRelationship between serum concentrations of persistent organic pollutants and the prevalence of metabolic syndrome among non-diabetic

29

adults results from the National Health and Nutrition Examination Survey 1999-2002rdquo Diabetologia 50 1841-1851

Lind PM Orberg J Edlund U Sjoblom L Lind L 2004 ldquoThe dioxin-like pollutant PCB126 (33rsquo44rsquo5-pentachlorobiphenyl) affects risk factors for cardiovascular disease in female ratsrdquo Toxicology Letters 150 293-299

Longnecker MP Klebanoff MA Brock JW Zhou H 2001 ldquopolychlorinated biphenyl seum levels in pregnant subjects with diabetesrdquo Diabetes Care 24(6) 1099-1101

Lloyd-Jones D Adams R Carnethon et al (AHA Statistics Committee and Stroke Statstics Subcommittee) 2009 ldquoAHA Statistical Update Heart disease and stroke statisticsmdash2009 updaterdquo Circulation 119e21-e181

MapMarker Plus V 100 MapInfo Corp 2004

Morgenstern H 1998 ldquoEcologic studyrdquo In Armitage P Colton T eds Encyclopedia of biostatistics Vol2 NY NY John Wiley and Sons Inc 1255-76

Morganstern H ldquoEcologic Studiesrdquo Chapter 23 in Modern Epidemiology Rothman KJ and Greenland S 2nd ed Philadelphia Lippincott Williams amp Williams 1998 pp459-480

Morland K Wolff M Bopp R Godbold J Landrigan P 2008 ldquoFish consumption and body burden of organochlorines among Lower Hudson urban anglersrdquo American Journal of Industrial Medicine 51587-594

New York State Department of Health (NYS DOH) 2009a ldquoThe Burden of Cardiovascular Disease in New York Mortality Prevalence Risk Factors Costs and Selected Populationsrdquo Bureau of Chronic Disease Epidemiology and Surveillance Bureau of Health Risk Reduction httpwwwhealthstatenyusnysdohchronic_diseasecardiovascularburdenofcvdinnyspdf

New York State Department of Health (NYS DOH) 2009b ldquoHeart Disease and Stroke Indicatorsrdquo Community health assessment indicators Bureau of Biometrics and Health Statistics httpwwwhealthstatenyusstatisticschacchai

OrsquoNeill L 2003 ldquoEstimating out-of-hospital mortality due to myocardial infarctionrdquo Health Care Management Science 6 147-154

Palmer PM Belanger EE Wilson LR Hwang SA Narang RS Gomez MI Cayo MR Durocher LA Fitzgerald EF ldquoOutdoor air PCB concentrations in three communities along the Upper Hudson River New Yorkrdquo 2008 Archives of Environmental Contamination and Toxicology 54363-371

Piantadosi S Byar DP Green SB 1988 ldquoThe ecological fallacyrdquo American Journal of Epidemiology 127893-904

30

Rehkopf DH Berkman LF Coull B Krieger N 2008 ldquoThe non-linear risk of mortality by income level in a healthy population UgtSgt National Health and Nutrition Examination Survey mortality follow-up cohort 1988-2001rdquo British Medical Journal Public Health 8383

Richardson S Montifort C 2000 ldquoEcological correlation studies In Elliott P Wakefield JC Best NG et al eds Spatial epidemiology methods and applications NY NY Oxford University Press 205-220

SAS version 9 Cary North Carolina

Schoenbach V 2001 ldquoMulticausality Effect Modification wwwepidemiolognet Ch12 381shy413

Schoenbach V 2004 ldquoMulticausality Confounding wwwepidemiolognet Ch11 335-375

Sergeev AV and Carpenter DO 2005 Hospitalization rates for coronary heart disease in relation to residence near areas contaminated with persistent organic pollutants and other pollutants Environmental Health Perspectives 113(6)756-761

Shcherbatykh I Huang X Lessner L Carpenter DO 2005 Hazardous waste sites and stroke in New York State Environmental Health A Global Access Science Source 4(18) httpwwwehjournalnetcontent4118

Siemiatycki J Wacholder S Dewar R Cardis E Greenwood C and Richardson L 1988 ldquoDegree of confounding bias related to smoking ethnic group and socio-economic status in estimates of the associations between occupation and cancerrdquo Journal of Occupational Medicine 30 617-625

Tu YK Gunnell D Gilthorpe MS 2008 ldquoSimpsonrsquos paradox Lordrsquos paradox and suppression effects are the same phenomenon- the reversal paradoxrdquo Emerging Themes in Epidemiology 52

United Kingdom Heart Attack Study Collaborative Group Norris RM 1998 ldquoFatality outside hospital from acute coronary event in three British health districts 1994-1995rdquo British Medical Journal 316(7137) 1065-70

United States Bureau of the Census 2000 Census of population and housing summary file 1 (SF1) United States Department of Commerce 2001

United States Bureau of the Census 2000 Census of population and housing summary file 3 (SF3) United States Department of Commerce 2002

United States Bureau of the Census 1990 Census of population and housing summary file 1 (SF1) United States Department of Commerce 1991

31

United States Bureau of the Census 1990 Census of population and housing summary file 3 (SF3) United States Department of Commerce 1992

United States Censusrsquo Monitoring Board ldquoFinal Report to Congressrdquo September 1 2001 httpgovinfolibraryunteducmbcmbpreportsfinal)report

United States Department of Health and Human Services (USDHHS) Agency for Healthcare Research and Quality 2007 ldquoAHRQ Quality Indicators Guide to Prevention Quality Indicators Hospital Admission for Ambulatory Care Sensitive Conditionsrdquo October 2001 Version 31 (March 12 2007) downloaded 2009 httpwwwqualityindicatorsahrqgov

United States Department of Health and Human Services (USDHHS) Centers for Disease Control 2007 Health United States A Chartbook National Center for Health Statistics wwwcdcgovnchsdatahushus07pdf

United States Department of Health and Human Services (USDHHS) Centers for Disease Control National Center for Health Statistics 2007 ldquo2005 National Hospital Discharge Surveyrdquo No 385 by De Frances C and Hall M Division of Health Care Statistics

United States Department of Health and Human Services (USDHHS) 2009 Centers for Disease Control and Prevention National Center for Environmental Health 4th National Report on Environmental Chemicals httpwwwcdcgovexposurereportpdfFourthReportpdf

United States Department of Health and Human Services (USDHHS) National Institutes of Health 2010 National Institute of Environmental Health Sciences National Toxicology Program 11th Report on Carcinogens httpntpniehsnihgovntproceleventhprofiless149pcbpdf

United States Environmental Protection Agency (USEPA) 2002 EPA Superfund record of decision Hudson River PCBs Site Hudson Falls to New York City New York Washington DC

Wakefield J 2003 Sensitivity Analyses for Ecological Regression Biometrics March 59 9-17

Wakefield J 2004

Wing S 1988 Social inequalities in the decline of coronary mortality American Journal of Public Health 781415-1416

World Health Organization 2010 Atlas of Heart Disease and Stroke Part 2 Ch 3 Cardiovascular Disease Risk Factors httpwwwwhointcardiovascular_diseasesencvd_atlas_03_risk_factorspdf

32

Table 1 Previous Study Results Statewide Analyses and Hudson River Specific Analyses

Principal Diagnosis of Hospitalization

Adjusted Rate Ratioa (Lower 95CI shyUpper 95 CI)

NYS analysis Hudson River Subset

analysis Ischemic Heart Diseaseb

115 (103-129) 136 (118-156) Acute Myocardial Infarction 120 (103-139) 139 (118-163)

Cerebrovascular Diseasec 115 (105-126) 120 (110-132)

Ischemic Stroke 117(104-139) --shy Hemorrhagic Stroke 110 (099-122) ---

Hypertensionc 119 (108-131) 114 (105-123)

Table adapted from Sergeev et al 2005 Shcherbatykh et al 2005 and Huang et al 2006 Ischemic heart disease and stroke hospitalization rates for NYS ZIP codes containing (or

adjacent to) inactive hazardous waste sites with persistent organic pollutants compared to ZIP codes without (or not adjacent to) such sites

a All analyses include a 2nd exposure category for other types of inactive hazardous waste sites and adjust for age race and sex

b Ischemic heart disease analyses include all ages over 25 Ischemic heart disease analyses adjust for quartiles of median household income and health insurance coverage

c Cerebrovascular disease and hypertension analyses include ages 25-64

33

Table 2 Comparison of Design amp Methods for Previously Published Studies and Current Follow-up Study

OUTCOME ASSESSMENT CONFOUNDER ASSESSMENT EXPOSURE ASSESSMENT Previous Study Follow-up Study Previous Study Follow-up Study Previous Study Follow-up Study

Coronary Heart Disease 1993-2000 More years 1990-2005 ZIP code level Block group level ZIP code level Block group level

Ischemic heart disease (ICD-9 410shy Same outcomes with 4 levels of income ndash Same income Exposure = residence Two types of exposure 414) and sequelae of myocardial one addition separate 2000 Census categories within 196 ZIP codes indicator categories within infarction not elsewhere classified analysis of angina (ICDshy quartiles with hazardous waste 12 counties abutting (ICD-9 4297) Separate analysis of 9 413) sites containing POPs Hudson River acute myocardial infarction 6 age groups Same age groups or 222 ZIP codes with (ICD-9 410) age 25 and above hazardous waste sites Adjacency 255 block

without POPs versus groups adjacent versus Principal or other diagnoses (up to 14) Only principal diagnoses 4 race groups Different race residence within 996 2116 not adjacent to the combined AsianPacific categories White ldquocleanrdquo ZIP codes Hudson River

Use the admission Islander White black AsianPacific statewide Reference Sergeev AV Carpenter source field to eliminate black Native Island and Other which Distance Block group DO Hospitalization rates for coronary intrahospital transfers for American includes Native Sub-analyses (focus of population-weighted heart disease in relation to residence the same individual American and Multiple follow-up) Hudson centroid distance from the near areas contaminated with Race River area 78 POP ZIP Hudson River lt12 mile persistent organic pollutants and other codes versus 996 ldquocleanrdquo (225 block groups) pollutants Environ Health Perspect Add education ZIP codes statewide between frac12 and 1 mile 2005 113(6)756-761 population density (216 block groups) farther

hispanic and distance to than 1 mile (1930 block nearest hospital as groups) additional covariates

Stroke 1993-2000 More years 1990-2005 Restricted to 2nd and (same as above) (same as above) (same as above) 3rd quartile income

Cerebrovascular disease excluding Same outcomes ZIP codes other ill-defined and late effects of cerebrovascular disease (ICD9 430shy 4 age groups 25-64 436) Separate analyses of ischemic stroke (433x1 434x1 436) and 2 race groups hemorrhagic stroke ( 430-432) White black

Principal or secondary diagnoses (up Only principal diagnoses to 14) combined

Use the admission Reference Shcherbatykh I Huang X source field to eliminate Lessner L Carpenter DO Hazardous intrahospital transfers for waste sites and stroke in New York the same individual State Environ Health 2005 418

34

Table 3 Study Cardiovascular Disease Outcomes ICD-9-CM Codes and Definitions

Ischemic Heart Disease

ICD-9-CM 410 - ldquoAcute myocardial infarction (AMI)rdquo is defined as a sudden insufficiency of blood supply to an area of the heart muscle usually due to a coronary artery occlusion (obstruction) commonly known as heart attack

ICD-9-CM 411- ldquoOther acute and subacute forms of ischemic heart diseaserdquo refers to complications or symptoms occurring before or after AMI or a coronary artery occlusion (obstruction) interrupting blood flow to the heart without AMI

ICD-9-CM 412 - ldquoOld myocardial infarctionrdquo refers to a past AMI that is currently presenting no symptoms ICD-9-CM 413 - ldquoAngina pectorisrdquo is defined as severe constricting pain in the chest often radiating from the

area immediately over the heart to the left shoulder and down the arm ICD-9-CM 414 - ldquoOther forms of chronic ischemic heart diseaserdquo includes coronary atherosclerosis a chronic

condition marked by thickening and loss of elasticity of the coronary artery caused by deposits of plaque

ICD-9-CM 4297 - ldquoCertain sequelae of myocardial infarction not elsewhere classifiedrdquo refers to other symptoms or complications associated with a prior AMI

Cerebrovascular Disease

ICD-9-CM 430 - ldquoSubarachnoid hemorrhagerdquo is bleeding in the space between the brain and lining ICD-9-CM 431 - ldquoIntracerebral hemorrhagerdquo is bleeding within the brain ICD-9-CM 432 - ldquoOther and unspecified intracranial hemorrhagerdquo is bleeding nontraumatic between the skull

and brain lining ICD-9-CM 433 - ldquoOcclusion and stenosis of precerebral arteriesrdquo is blockage or stricture of the arteries

branching into the brain ICD-9-CM 434 - ldquoOcclusion of cerebral arteriesrdquo includes cerebral thrombosis and cerebral embolism ICD-9-CM 435 - ldquoTransient cerebral ischemiardquo includes cerebrovascular insufficiency (acute) with transient

neurological signs and symptoms ICD-9-CM 436 - ldquoAcute but ill-defined cerebrovascular diseaserdquo

35

Table 4 ndash Census Demographics for Study Area by Proximity to the Hudson River 2000

Census Demographics Study Area Within Block Groups Adjacent to River Block Group Centroid Distance to River Yes No lt frac12 mile frac12-1 mile gt 1 mile

n n n n n n Total Population and distribution within river distance categories

2924631 1000 315662 108 2608969 892 231414 79 283321 97 2409896 824

RaceEthnic Distribution 1000 1000 1000 1000 1000 1000 White 2383274 815 255023 808 2128251 816 154371 667 201457 711 2027446 841 Black 277302 95 34039 108 243263 93 43895 190 38959 138 194448 81 Total ldquoOtherrdquo (listed below)

American Indian 7147 02 882 03 6265 02 975 04 1058 04 5114 02

Asian 86465 30 6233 20 80232 31 4711 20 8979 32 72775 30 Pacific Islander 1058 lt01 111 lt01 947 lt01 104 lt01 145 lt01 809 lt01 Other 105182 36 12492 40 92690 36 19288 83 23338 82 62556 26 Multiple Races 64203 22 6882 22 57321 22 8070 35 9385 33 46748 19

Hispanic 268036 92 28449 90 239587 92 40691 176 46978 166 180367 75

Education lt9th grade 108312 56 13370 64 94942 55 13585 91 14518 81 80209 50 9th-12th grade 200374 103 27023 130 173351 100 23306 155 25161 140 151907 94 High school or GED 519236 267 62727 302 456509 263 44366 296 51585 287 423285 263 Some college 336283 173 36464 175 299819 173 24995 167 31141 173 280147 174 Associates degree 150134 77 16574 80 133560 77 10134 68 12939 72 127061 79 Bachelors degree 342297 176 28578 137 313719 181 18898 126 24584 137 298815 185 Graduate or professional degree

285823 147 23199 112 262624 151 14813 99 19909 111 251101 156

+Other SES Factors Households below poverty 91940 87 12875 108 79065 83 13977 154 14685 140 63278 72

mean SD mean SD mean SD mean SD mean SD mean SD Population density 57466 9589 54496 11391 57826 93468 13501 16071 11068 16248 43763 66909 Median household income 61010 31989 47320 23630 62668 32468 40748 21923 44962 20829 64847 32564

Sources US Bureau of the Census 2000 Census of population and housing summary file 1(SF1) US Department of Commerce 2001 US Bureau of the Census 2000 Census of population and housing summary file 3 (SF3) US Department of Commerce 2001

36

Table 5 Study Population Person Years by River Proximity Category For Income Quartiles and Race Categories

Person Years Total Adjacent Non-Adjacent lt12 mile frac12 to 1 mile gt1 mile n (row)

[column ] n (row)

[column ] n ()

[column ] n ()

[column ] n ()

[column ] n ()

[column ] Total Population 30219001 (1000)

[1000]

3269547 (108)

[1000]

26949454 (892)

[1000]

2380477 (079)

[1000]

2834956 (094)

[1000]

25003568 (827)

[1000]

Income Quartile

Lowest Income 7182853 (1000)

[237]

1345251 (187)

[411]

5837601 (813)

[217]

1462710 (204)

[614]

1236936 (172)

[436]

4483206 (624)

[179]

2nd Income 7778921 (1000)

[257]

1077888 (139)

[330]

6701033 (861

[249]

420361 (054)

[177]

808782 (104)

[285]

6549777 (842)

[262]

3rd Income 7671943 (1000)

[254]

551617 (072)

[169]

7120326 (928)

[264]

344840 (045)

[145]

511237 (067)

[180]

6815866 (888)

[273]

Highest Income 7585283 (1000)

[251]

294790 (039)

[090]

7290493 (961)

[271]

152565 (020)

[064]

278000 (037)

[098]

7154718 (943)

[286]

Race Category

White 25859811 (1000)

[856]

2784914 (108)

[851]

23074897 (892)

[856]

1761910 (68)

[740]

2238348 (87)

[790]

21859554 (845)

[874]

Black 2466185 (1000

[82]

297220 (121)

[91]

2168965 (879)

[81]

389721 (158)

[164]

316704 (128)

[112]

1759760 (714)

[70]

Asian 792737 (1000)

[26]

58007 (73)

[18]

734731 (827)

[27]

45151 (57)

[19]

78161 (99)

[28]

669426 (844)

[27]

Multiple Other 1100269 (1000)

[36]

129407 (118)

[40]

970862 (882)

[36]

183696 (167)

[77]

201743 (183)

[71]

714830 (650)

[29]

37

Table 6 Study Hospitalization Record Exclusions

After After dropping After imputing dropping records with

After excluding race amp poorly missing continuation After excluding eliminating geocoded covariates

Geocoded records transfers missing race records (racesex)

Total 202272 201814 02 169143 162 167733 08 154236 80 154220 001 (Loss ) Demograp hic group

White 155216 849 154805 849 132749 859 144262 860 132343 858 132332 858 Black 12878 70 12849 70 11109 72 12010 72 11328 73 11326 73 OtherNa 12745 70 12741 70 8814 57 9313 55 8647 56 8644 56 AsianPI 2021 11 2020 11 1810 12 2148 13 1918 12 1918 12 Missing Race 19412 96 19399 96 14661 87 (1410) Diagnosis

IHD 137569 680 137303 680 111005 656 110005 656 110000 656 101012 655

AMI 51840 256 51635 256 39338 233 39338 233 39013 233 35811 232

Angina Stroke

5792 64703

29 320

5788 64511

29 320

5316 58138

31 344

5316 58138

31 344

5285 57712

31 344

4731 53208

31 345

Isch str 23673 117 23543 117 21012 124 21012 124 20912 125 19371 126 Hem str 8097 40 8074 40 6544 39 6444 39 6492 39 5995 39

Missing covariates 11 records were missing sex 325 were missing age and 14661 records were missing race Race was imputed by random assignment based on block group of residencersquos race distribution for 13251 records (310 were also missing other covariates For approximately 1100 records with missing race geocoding was inadequate for assigning block group and 310 of those missing race were also missing additional covariates) One discharge can have one or more covariates missing (ie missing covariates are not mutually exclusive

38

Table 7 Income Quartile and Race Distributions for CVD Hospitalizations

Study Population

IHD n=101012

AMI n=35811

Angina n=4731

Stroke n=53208

Ischemic Stroke n=19371

Hemorrhagic Stroke n=5995

Income (Quartiles) Lowest Income 237 286 300 356 298 309 301 Second Income 257 254 265 237 256 256 252 Third Income 254 241 236 224 236 234 223 Highest Income 251 221 201 182 209 201 224

Total 1000 999 999 1000 1000

White 855 859 885 833 858 844 809 Black 82 64 62 102 92 108 110 Asian 26 12 12 11 12 14 19 Other 36 65 41 53 38 34 62

Study population income quartiles are based on census block median income Quartile cut-points were developed by weighting block group median income levels by block group population to identify the cut-points within the continuous income distribution After creating the cut-points the study population of each block group was assigned to the appropriate quartile category This resulted in the percentage of population in each quartile varying slightly from 25 because the entire populations of block groups were assigned to the same quartile

39

Table 8a Exposure Area Distributions and Rate Ratios for Ischemic Heart Disease Hospitalizations

Exposure Person-Years

(N=30219001) in

Ischemic Heart Disease Hospitalizations Total AMI Angina

No () No () RR (95 CI) No () RR (95 CI) No () RR (95 CI)

Adjusted for Age Sex Race Adjacent Block Group Yes 3269547 (108) 12327 (122) 133 (112-157) 4398 (123) 136 (114-163) 591 (125) 123 (106-142) No 26949454 (892) 88611 (878) 100 Reference 31380 (877) 100 Reference 4137 (875) 100 Reference Distance From River lt frac12 mile 2380477 (079) 9817 (97) 120 (103-140) 3399 (095) 126 (109-145) 542 (115) 160 (137-188) frac12 - 1 mile 2834956 (094) 11236 (111) 120 (104-140) 4048 (113) 125 (109-144) 610 (129) 156 (133-183) gt1 mile 25003569 (827) 79885 (791) 100 Reference 28331(792) 100 Reference 3576 (756) 100 Reference

Addrsquol adjustment for SES and other factors Adjacent Block Group Yes 3269547 (108) 12327 (122) 107 (104-111) 4398 (123) 106 (101-110) 591 (125) 106 (096-117) No 26949454 (892) 88611 (878) 100 Reference 31380 (877) 100 Reference 4137 (875) 100 Reference Distance From River lt frac12 mile 2380477 (079) 9817 (97) 102 (098-106) 3399 (095) 103 (098-109) 542 (115) 109 (098-121) frac12 - 1 mile 2834956 (094) 11236 (111) 103 (099-108) 4048 (113) 106 (101-111) 610 (129) 113 (102-125) gt 1 mile 25003569 (827) 79885 (791) 100 Reference 28331(792) 100 Reference 3576 (756) 100 Reference

Agegt25 1990-2005 50 sample Adjusted for age (25-34 35-44 45-54 55-64 65-74 75+ years) race (white black other) and sex (male female) Adjusted for SES factors quartiles of population density (61-5066 5066lt-21020 21020lt-64102 64102lt-943716 persons per square mile) median 1999 household income ($249900-4096800 $4096800lt-5465000 $5465000lt-7294400 $7294400lt-20000100) with less than a college education (0-306 306lt-427 427lt-555 555lt-1000) with Hispanic ethnicity (0-21 21lt-47 47lt- 96 96lt-860) and distance to nearest hospital

40

Table 8b Exposure Area Distributions and Rate Ratios for Stroke Hospitalizations Age gt= 25 1990-2005

Exposure Person-Years

(N=30219001) in Study Area

Stroke Hospitalizations

Total Ischemic Hemorrhagic

No () No () RR (95 CI) No () RR (95 CI) No () RR (95 CI)

Adjusted for Age Sex Race Adjacent Block Group Yes 3269547 (108) 6276 (118) 125 (109-144) 2334 (121) 128 (110-148) 635 (108) 099 (086-113)

No 26949454 (892) 46748 (882) 100 Reference 16989 (879) 100 Reference 5263 (892) 100 Reference

Distance From River lt frac12 mile 2380477 (079) 5280 (099) 116 (104-131) 1925 (100) 120 (105-137) 518 (088) 102 (088-117)

frac12 - 1 mile 2834956 (094) 6023 (114) 118 (106-132) 2227 (115) 127 (112-145) 658 (111) 115 (100-131)

gt1 mile 25003569 (827) 41721 (787) 100 Reference 15171 (785) 100 Reference 4722 (801) 100 Reference

Addl Adjustment for SES and Other Factors Adjacent Block Group Yes 3269547 (108) 6276 (118) 107 (103-112) 2334 (121) 110 (104-116) 635 (108) 097 (088 ndash 106)

No 26949454 (892) 46748 (882) 100 Reference 16989 (879) 100 Reference 5263 (892) 100 Reference Distance From River lt frac12 mile 2380477 (079) 5280 (099) 106 (101-110) 1925 (100) 108 (101-115) 518 (088) 089 (081-099)

frac12 - 1 mile 2834956 (094) 6023 (114) 106 (102-111) 2227 (115) 111 (104-117) 658 (111) 108 (099-119)

gt 1 mile 25003569 (827) 41721 (787) 100 Reference 15171 (785) 100 Reference 4722 (801) 100 Reference

Age gt 25 years 1990-200650 sample Adjusted for age (25-34 35-44 45-54 55-64 65-74 75+ years) race (white black other) and sex (male female) Adjusted for SES factors quartiles of population density (61-5066 5066lt-21020 21020lt-64102 64102lt-943716 persons per square mile) median 1999 household income ($249900-4096800 $4096800lt-5465000 $5465000lt-7294400 $7294400lt-20000100) with less than a college education (0-306 306lt-427 427lt-555 555lt-1000) with Hispanic ethnicity (0-21 21lt-47 47lt- 96 96lt-860) and distance to nearest hospital

41

Table 9 Rate Ratios for CVD Hospitalizations Stratified by Income Quartile

Rate Ratio for lt frac12 mile versus gt 1 mile from river

IHD AMI Angina Stroke Ischemic Stroke Hemorrhagic Stroke

RR (95CI) RR (95CI) RR (95CI) RR (95CI) RR (95CI) RR (95CI)

Adjusted for Age Sex Race Lowest Income 1 123 (108-140) 124 (110-139) 139 (120-160) 120 (109-133) 122 (107-138) 118 (103-137) Income 2 115 (101-131) 118 (101-139) 155 (124-195) 124 (113-135) 127 (109-147) --shyIncome 3 088 (075-103) 083 (068-102) 102 (076-137) 092 (079-107) 093 (076-113) --shyHighest Income 4 080 (064-099) 074 (054-103) 030 (013-070) 078 (065-093) 071 (053-095) 043 (024-076) Weighted average 100 098 091 102 100 071

Addrsquol Adjustment for other demographic factors Lowest Income 1 114 (107-120) 114 (106-123) 112 (097-130) 116 (109-124) 120 (110-131) 109 (095-125) Income 2 110 (102-118) 111 (101-123) 136 (108-172) 110 (101-119) 112 (099-128) 095 (076-119) Income 3 085 (077-094) 086 (075-097) 093 (068-126) 097 (087-108) 096 (082-112) 046 (033-066) Highest Income 4 068 (060-078) 069 (057-085) 028 (012-062) 069 (059-081) 058 (044-077) --shyWeighted average 092 093 079 096 093 078

To reduce the complexity of this Table only one of the distance categories is provided The other distance categories showed similar results Adjusted for age (25-34 35-44 45-54 55-64 65-74 75+ years) race (white black other) and sex (male female) Block group median income quartile ranges $249900-4096800 $4096800lt-5465000 $5465000lt-7294400 $7294400lt-20000100 Adjusted for demographic factors quartiles of population density (61-5066 5066lt-21020 21020lt-64102 64102lt-943716 persons per square mile) with less than a college education (0-306 306lt-427 427lt-555 555lt-1000) with Hispanic ethnicity (0-21 21lt-47 47lt- 96 96lt-860) and distance to nearest hospital

42

Table 10 Hospitalization Rate Estimates for Current Study and from Published Sources (age-adjusted to 2000 US standard per 10000)

Current Study 12 Counties abutting Hudson River 1990-2005

Total US CDC 2005

Northeastern US CDC 2005

SPARCS NYS excluding NYC 2005-2007

IHD 649 619 691

AMI 232 231 278

Angina 31 Stroke 349 303 307 351

Ischemic Stroke 127 39 Hemorrhagic Stroke

Source USDHHS 2005 National Hospital Discharge Survey No 385 July 12 2007 Source NYS DOH SPARCS 2009b Comparable estimates were not able to be provided because published outcome groupings differed

43

Figure 1 ZIP codes by Adjacency to the Hudson River

44

Figure 2 Census Block Groups by Adjacency to the Hudson River

45

Figure 3 Census Block Groups by Population-Weighted Centroid Distance to the Hudson River

46

IHD Hospitalization Rates by Sex Race Income (age-adjusted) (other race

group excluded)

000

200

400

600

800

1000

1200

1400

1600

Low est IncQ

Q2 Q3 Highest Inc Q

Income Quartile

Hos

pita

lizat

ions

per

10

00

w hite men black men

asian men w hite w omen

black w omen asian w omen

AMI Hospitalization Rates by Sex Race Income (age-adjusted) (other race

group excluded)

000

100

200

300

400

500

600

700

Low est IncQ

Q2 Q3 Highest Inc Q Income Quartile

Hos

pita

lizat

ions

per

10

00

w hite men black men

asian men w hite w omen

black w omen asian w omen

Stroke Hospitalization Rates by Sex Race Income (age-adjusted) (other

race group excluded)

000

100

200

300

400

500

600

700

800

900

Low est In Q

Q2 Q3 Highest Inc Q Income quartile

Hos

pita

lizat

ions

per

10

00w hite men black men

asian men w hite w omen

black w omen asian w omen

Figure 4 Figure 5 Figure 6

47

MAURICE D HINCHEY 22NO DISTRICT NEW YORK

COMMITTEE ON APPROPRIATIONS

~lIC()foltIIInas

AGRICUJTlAE RURAL OEElOPWNT rooo AND ORUG ADMINISTRATION

AND RBATEO AGEfjC~S

INTERIOR

~onlJltss of tl)t ~nttn ~tatt5 ~oU5t of ItpttUtlltl1tibeg lHaJington 111laquo 20515-3222

October 4 2006

Howard Frumkin MD DrPH Director Agency for Toxic Substances and

Disease Registty and CDCs National Center for Environmental Health

1600 Clifton Road NE MSE-28 Atlanta GA 30333

Dear Dr Frumkin

First I would like to congratulate you on your appoinbnent as Director of A TSDR and CDCs National Center for Environmental Health llook forward to the opportunity to work with you on important public health issues this year and in the future

My second reason for writing today is to request that ATSDR assist my office in conducting a health consultation study for residents along a 200 mile segment of the Hudson River a Superfund site on the National Priority List containing PCBs in order to measure whether further evaluation may be warranted

As you may know Dr David Carpenter has done a number of studies at SUNY Albany in New York that suggest that resideilcy in proximity to hazardous waste sites with Persistant Organic Pollutants (POPs) may be associated with increased hospitalization rates for Coronary Heart Disease (CHD) when compared to hospitalization rates tor areas with no waste sites or other types of waste not catagorized as POPs I have enclosed a copy of three of these research papers that demonstrate this correlation for heart disease hypertension and stroke

I am very concerned about the health of the residents living along the Hudson River and respectfully request that your agency look into what seems to be a very serious health threat For purposes of my request T would appreciate your agency looking into the subset population Dr Carpenter examined in 78 zip codes along the 200 mile stretch of the Hudson River from Hudson Falls to New York City This subset population studied by Dr Carpenter showed a statistically significant increase in hospital discharges for heart disease hypertension and stroke Interestingly according to CDCs data base the Behavioral Risk Factor Surveillance System this population on average has higher incomes exercises more frequently and conswnes more fruits and vegetables than the average New Yorker Additionally this population has more nonsmokers than the rest of Upstate New Yorks population

WASHINGTON OIFCE

2411 RAYBURN HOUSE OfFICE BUILDING WASH1NcrroN DC 20amp1amp-3222

(Zl2) 225-6335

_fOMetoYht lnchIY

BlNGI-UMTON OfRCE m-wA OfFICE KIGSTCN OFFIepound MIDDLnWJN OffICE MOJmCEllO OffiCE IDDA IfOCML lUllDlNG 123 SOUTH CAYJOA smra U l WAll

m STREET em HALL THlflD flOOR 18 ANWANf LAKf CAD

RIIrlGHAtATON NY 13901 SUITE 201 KINGSTOfl 12401 te JUpoundS STilEET MONTlCfllO NY 12701 (6171773-2768 ITHACA HY 14860 (M5)3~ MlDgtlETOWN NY 109(0 (amptS) 7S1-71e

(607) 273-~388 (845) l4C-3l11

Appendix A Representative Hinchey Letter

Dr Howard Frumkin Page 2 October 4 2006

Since this type of population would typically exhibit better than average cardiac health Im sme you can understand and appreciate my concern and sense of urgency that we must follow up on Dr Carpenters good work and detcnnine what further study is warranted to protect the health of the populatioD living and working aloDg the Hudson River

I than1c you for yom kind attention to my request I look forward to hearing back from you at your earliest convenience

Best regards

Sincerely

~ane~ey Member of Congress

MHlag

49

Appendix B Table 1a IHD Hospitalization Rate Ratios agesexrace adjusted versus additionally adjusted For residents (age 25 and older) of block groups closer versus more distant from the Hudson River in models adjusting for sex age race and block group median household income population density education level percent Hispanic and block group centroid distance to nearest hospital 1990-2005

Adjusted Rate Ratios by Block Group River Proximity (95 Confidence Intervals) IHD (N=101012) AMI (N=35811) Angina (N=4731)

Risk Factor Agesexrace adjusted

Addrsquol adjustment Agesexrace adjusted

Addrsquol adjustment Agesexrace adjusted

Addrsquol adjustment

Block group distance from River

lt frac12 mile vs gt 1 mile 120 (103-140) 102 (098-106) 126 (109-145) 103 (098-109) 160 (137-188) 109 (098-121) frac12 - 1 mile vs gt 1 mile 120 (104-140) 103 (099-107) 125 (109-144) 106 (101-111) 156 (133-183) 113 (102-125)

Sex Male vs female 197 (174-223) 214 (208-220) 211 (187-238) 209 (202-215) 113 (099-129) 113 (106-121) Race Black vs white 120 (102-142) 093 (089-097) 109 (094-127) 092 (087-097) 178 (152-208) 121 (108-136)

AsianPacific Is ldquo 080 (066-096) 073 (069-079) 085 (070-103) 072 (065-081) 075 (055-102) 065 (049-087) Other ldquo 246 (208-291) 377 (363-392) 189 (162-220) 204 (191-218) 257 (215-307) 205 (177-238)

Age 35-44 vs 25-34 809 (630-104) 824 (743-913) 647 (494-849) 658 (568-762) 491 (348-692) 498 (376-660) 45-54 ldquo 333 (261-426) 344 (312-380) 228 (175-296) 230 (200-265) 1539 (1108-214) 157 (120-206) 55-64 ldquo 827 (647-1057) 802 (727-885) 576 (443-748) 516 (449-593) 278 (200-386) 284 (217-371) 65-74 ldquo 1477 (1155-1890) 1384 (1255-1527) 1067 (820-1387) 899 (782-1033) 409 (294-567) 449 (344-586) 75+ ldquo 2124 (1656-2724) 1942 (1760-2142) 2190 (1683-2849) 1849 (1610-2123) 661 (476-919) 695 (533-905)

Population density

2nd lowest vs least dense

126 (121-132) 131 (124-137) 142 (127-159)

more dense ldquo 130 (124-136) 134 (127-141) 131 (116-147) most dense ldquo 120 (114-126) 121 (114-129) 115 (100-131)

Median Household Income

2nd highest vs highest income

105 (101-110) 113 (108-119) 123 (110-137)

lower ldquo 109 (104-113) 123 (117-130) 125 (111-141) lowest ldquo 116 (111-122) 129 (122-137) 162 (142-185)

with less than college

2nd lowest vs least college

114 (109-118) 118 (113-124) 110 (098-122)

higher ldquo 122 (117-127) 126 (119-132) 115(102-129) highest

ldquo

122 (116-128) 128 (120-135) 129 (113-148)

Hispanic 2nd lowest vs least Hispanic

093 (089-097) 094 (0890-098) 101 (091-112)

higher ldquo 092 (088-096) 096 (092-101) 118 (107-131) highest

ldquo

085 (081-088) 088 (084-093) 138 (123-154)

Distance to Hospital

2nd closest vs closest 097 (094-100) 094 (090-098) 107 (098-118) more distant ldquo 090 (087-094) 093 (089-098) 096 (086-107) furthest ldquo 087 (083-091) 091 (087-096) 089 (079-101)

50

Appendix B Table 1b Stroke Hospitalization Rate Ratios agesexrace adjusted versus additionally adjusted For residents (age 25 and older) of block groups closer versus more distant from the Hudson River in models adjusting for sex age race and block group median household income population density education level percent Hispanic and block group centroid distance to nearest hospital 1990-2005

Adjusted Rate Ratios by Block Group River Proximity (95 Confidence Intervals) Total Stroke (N=53208) Ischemic Stroke (N=19371) Hemorrhagic Stroke (N=5995)

Risk Factor Agesexrace adjusted

Addrsquol adjustment Agesexrace adjusted

Addrsquol adjustment Agesexrace adjusted

Addrsquol adjustment

Block group distance from River

lt frac12 mile vs gt 1 mile 116 (104-131) 106 (101-110) 120 (105-137) 108 (101-115) 102 (088-117) 089 (081-099) frac12 - 1 mile vs gt 1 mile 118 (106-132) 106 (102-111) 127 (112-145) 111 (104-117) 115 (100-131) 108 (099-119)

Sex Male vs female 114 (104-125) 127 (123-130) 126 (113-141) 131 (126-136) 114 (103-127) 123 (116-130) Race Black vs white 194 (173-219) 153 (147-160) 233 (204-267) 191 (180-203) 215 (188-246) 190 (173-209)

AsianPacific Isl ldquo 097 (083-113) 088 (080-096) 114 (094-138) 101 (088-116) 142 (114-177) 129 (106-157) Other ldquo 230 (203-260) 232 (219-245) 229 (197-265) 212 (193-232) 360 (310-417) 334 (296-377)

Age 35-44 vs 25-34 309 (251-379) 303 (269-343) 359 (267-483) 362 (289-455) 274 (212-353) 268 (219-329) 45-54 ldquo 114 (938-139) 1087 (970-1217) 149 (113-197) 135 (109-167) 655 (513-836) 605 (500-734) 55-64 ldquo 331 (273-402) 329 (295-368) 433 (329-569) 418 (340-514) 119 (933-152) 109 (904-132) 65-74 ldquo 826 (682-1001 861 (772-959) 1145 (872-1504) 1114 (908-1367) 214 (168-272) 224 (186-270) 75+ ldquo 1890 (1559-2290) 2037 (1828-2269) 2768 (2112-3627) 3060 (2497-3750) 554 (438-700) 587 (490-703)

Population density

2nd lowest vs least dense

126 (120-132) 124 (117-132) 129 (117-142)

more dense ldquo 126 (120-132) 121 (113-129) 142 (128-157) most dense ldquo 124 (118-132) 128 (119-138) 150 (133-169)

Median Household Income

2nd highest vs highest income

106 (101-110) 110 (103-117) 099 (090-108)

lower ldquo 113 (108-118) 113 (106-121) 107 (097-118) lowest ldquo 116 (110-122) 117 (108-126) 114 (102-127)

with less than college

2nd lowest vs least college

111 (107-116) 112 (106-119) 098 (090-107)

higher ldquo 120 (114-125) 120 (113-128) 102 (093-112) highest ldquo 116 (110-122) 119 (110-128) 093 (090-108)

Hispanic 2nd lowest vs least Hispanic

096 (092-100) 094 (088-099) 084 (077-091)

higher ldquo 101 (096-105) 096 (091-102) 084 (077-091) highest ldquo 096 (092-100) 088 (082-093) 078 (070-085)

Distance to Hospital

2nd closest vs closest 095 (091-098) 092 (087-097) 097 (089-105) more distant ldquo 092 (088-096) 091 (086-097) 089 (081-097) furthest ldquo 085 (081-090) 082 (077-088) 088 (079-098)

51

Appendix B Table 2 Sex and Race-Specific Age-Adjusted IHD AMI and Stroke Rates by Income (per 1000)

Men Women white black asian other white black asian other

IHD Lowest IncQ 470 401 492 940 283 354 477 543 Q2 425 339 675 1443 221 236 364 690 Q3 436 301 420 2463 199 233 236 902 Highest Inc Q 399 325 218 4307 160 172 120 1247 AMI Lowest IncQ 180 154 203 269 101 112 221 159 Q2 159 141 300 306 085 088 138 179 Q3 152 114 139 480 078 084 096 214 Highest Inc Q 135 130 046 820 062 065 052 275 Stroke Lowest IncQ 201 292 330 370 166 281 343 261 Q2 182 254 389 466 150 247 274 373 Q3 192 238 231 598 145 245 201 401 Highest Inc Q 177 275 102 905 132 197 070 776

These rates are estimated from the 50 sample of hospitalizations in the 12-county study area included in this investigation The rates are adjusted to the 2000 standard US population

52

CERTIFICATION

The health consultation for the Hudson River PCBs site Residential Proximity to the Hudson River and Hospitalization Rates for Ischemic Heart Disease and Stroke 1990shy2005 was prepared by the New York State Department of Health under a cooperative agreement with the Agency for Toxic Substances and Disease Registry (ATSDR) It is in accordance with approved methodology and procedures existing at the time the health consultation was initiated Editorial review was completed by the cooperative agreement partner

The Division of Health Assessment and Consultation (DHAC) ATSDR has reviewed this public health assessment and concurs with its findings

ATSDR

Page 7: HR CVD Final HCl 2011

group centroid (population-weighted) distance to the river shore was used to assign hospitalizations in each block group to one of three categories less than one-half mile from the Hudson River (closest) between one-half to one mile (close) from the river and farther than one mile from the river (distant) Multivariable regression analyses were conducted to compare rates of hospitalizations among the exposure categories assigned at the block group level These analyses controlled for age sex and race using individual-level data and population density percent Hispanic median income education and distance to the nearest hospital at the block group level

Results The use of block groups resulted in 2371 block groups as the units of analysis for the study rather than the 368 ZIP codes in the 12-County study area The average population per block group in the study area is 1200 compared to average population of 8000 per ZIP code The smaller geographical area and smaller populations in block groups contributed to improving the accuracy of the assignment of income and other group-level demographics to the hospitalizations included in the study and enhancing control for confounding by socio-economic status (Tables 8a and 8b)

The studyrsquos multivariable analyses showed relatively small but statistically significant elevations of CVD hospitalization risk for residents of block groups categorized as adjacent in close and closest proximity to the Hudson River There is no evidence that residents of the closest block groups (less than frac12 mile) had higher hospitalization rates than residents of the close block groups (between frac12 to 1 mile) If there were a strong association between exposures from living near the river and the risk of CVD hospitalization such a ldquodose-responserdquo might have been shown by the analyses

The data showed strong statistically significant associations between CVD risk and socioshyeconomic status To enhance the control for socio-economic status separate analyses for each income quartile were conducted These stratified analyses showed that residence in block groups near (close closest or adjacent to) the Hudson River was associated with increased risk for CVD hospitalizations among residents of lower income block groups Conversely residence in block groups near the Hudson River was associated with decreased risk for CVD hospitalizations among residents of higher income block groups In other words in analyses that held income constant by looking only within the lower or higher income populations living near the river shows opposite effects depending upon which income group is considered The estimated increased and decreased risks are of sufficient magnitude to cancel each other out when average rate ratios for the four income quartiles are calculated The average rate ratios can be interpreted as showing no evidence that hospitalizations among the general population are elevated for residents living near the river

This type of group-level statistical analysis is not able to precisely identify the reason for these differing results However the strength and direction of associations among variables in the statistical models can be evaluated in light of known risk factors to suggest likely explanations In this case socio-economic status appears to be the most likely explanation for the findings The strong association between socio-economic status and residential location (the indicator of potential exposure) and between socio-economic status and CVD hospitalization risk makes it likely that the small but statistically significant elevations shown in the multivariate models

iii

(Tables 8a amp 8b) are the result of the inability to completely control for socio-economic statusrsquo effects on CVD hospitalization risk The strong effect of socio-economic status on residential location (exposure proxy) and on CVD hospitalization risk makes it likely that these overall multivariable regression results are affected by uncontrolled ie residual confounding by socioshyeconomic status

Because of the similarities in study design and statistical analysis the prior studyrsquos findings can be compared to some degree with the findings of the multivariate regression analyses in the current study The major difference between the studies is the composition of the comparison areas for the analyses Because of this difference as well as other differences described in the report the studies are not entirely comparable

CVD hospitalization rate ratios estimated for living in block groups adjacent closest or close to the river in the current study are substantially lower than the rate ratios for residence in ZIP codes adjacent to the Hudson River in the previously published studies In most cases the current analyses show elevations below 10 in contrast to elevations of 20 to 39 in the prior studies The results of the current study are quite similar to the prior study results in terms of the associations reported for age sex race and income These similarities and the general methodological similarities between the current and prior studies suggest the likelihood that the prior studiesrsquo findings also were affected by confounding by socio-economic status

Conclusions and Recommendations While this type of study can neither prove nor disprove causal links among risk factors and health effects it is useful for examining evidence for unusual patterns that might warrant additional investigation This evaluation of hospitalization data residential location and socio-economic factors in block groups in the 12 Counties abutting the Hudson River revealed a closely coinciding pattern of lower socio-economic status and residential proximity to the Hudson River the proxy for potential exposure and a strong association of lower socio-economic status and risk for CVD hospitalizations The strong associations among residential location socio-economic status and CVD risk prevent us from drawing definitive conclusions from the multivariable regression analyses that showed modest but statistically significant elevations of CVD hospitalization risk for residents of block groups near the river

To draw more definitive conclusions particularly for outcomes such as CVD that are determined by multiple known risk factors studies seeking to investigate the role of PCB or POP exposures will require resource-intensive methods that include gathering individual-level exposure or biomonitoring information as well as medical histories and information on other risk factors

This current investigation did not include environmental or biological sampling data One study did gather such data in the Hudson River region with the highest potential exposures (Hudson Falls Fort Edward) This study detected a statistically significant reduction in the total PCB concentrations in outdoor air samples at a distance of 1200 meters (34 mile) from the river However the PCB levels in outdoor air in this area were lower than those in other communities with known PCB-contaminated sites and similar to levels reported in other locations in the northeastern United States The study found no detectable differences in study subjectsrsquo serum

iv

PCB levels associated with proximity to contaminated portions of the river or other PCB sources (Fitzgerald 2007 Palmer 2008)

It is recommended that the results of the current investigation of CVD hospitalization rates and residence near the Hudson River be shared with NYS residents and other stakeholders with interest in this issue and be submitted for publication in a peer-reviewed journal

v

1 INTRODUCTION

The New York State Department of Health (NYS DOH) has a cooperative agreement with the Agency for Toxic Substances and Disease Registry (ATSDR) to perform environmental exposure and health assessments conduct health statistics reviews and perform epidemiological studies of populations in New York State which may have been exposed to environmental contaminants In October 2006 Member of Congress Maurice Hinchey requested that ATSDR conduct a health consultation to assess cardiovascular disease among people residing near the Hudson River A 200-mile segment of the Hudson River from Hudson Falls south to New York City is a National Priority List (NPL) (Superfund) site due to contamination of the river with polychlorinated biphenyls (PCBs) from GE sites in Hudson Falls and Ft Edward (See Appendix A for Member of Congress Hincheyrsquos letter)

In support of his request Member of Congress Hinchey cited three studies (Sergeev and Carpenter 2005 [ischemic heart disease] Shcherbatykh et al 2005 [stroke] Huang et al 2006 [hypertension]) These statewide studies suggest residents of ZIP codes containing hazardous waste sites with persistent organic pollutants (POPs) such as PCBs have elevated risk for hospitalizations for cardiovascular disease including ischemic heart disease stroke and hypertension Analyses conducted separately for ZIP codes next to the Hudson River suggest residents of these specific ZIP codes also have increased rates of hospitalization for cardiovascular disease

These studies published in 2005-2006 are ecological studies In an ecological study data are analyzed by evaluating risk factors and health outcomes among groups For example rates of disease are compared for populations of ZIP codes with varying median income levels to see if income levels appear to be related to disease outcome rates In these previously conducted studies rates of hospitalization for residents of ZIP codes with hazardous waste sites containing POPs were compared with hospitalization rates for residents of ZIP codes without such waste sites to look for evidence that potential exposures might be impacting health outcomes The studies report statistically significant elevations of cardiovascular disease hospitalizations in ZIP codes with POPs For coronary heart disease and a subset acute myocardial infarction the study showed elevations of approximately 15 and 20 respectively (Sergeev and Carpenter 2005) for stroke a 15 elevation (Shcherbatykh et al 2005) and for hypertension a 19 elevation (Huang et al 2006) in ZIP codes containing waste sites with POPs compared with hospitalization rates in other New York State ZIP codes (excluding NYC) that did not contain waste sites with POPs (Table 1)

In sub-analyses in these studies cardiovascular hospitalization rates for ZIP codes adjacent to the Hudson River were compared with cardiovascular hospitalization rates for ZIP codes in the rest of the state (excluding NYC) identified as not having inactive hazardous waste sites with POPs These sub-analyses showed statistically significantly elevated hospitalization rates of 36 and 39 for coronary heart disease and the subset acute myocardial infarction a 20 elevation for stroke and a 14 elevation for hypertension (Table 1) The Sergeev and Carpenter Shcherbatykh et al and Huang et al articles state that the Hudson River ZIP codes are in an area where people have higher income smoke less exercise more and have healthier diets This information is cited in these articles as evidence that known risk factors related to lower socioshy

1

economic status are likely not the cause for the elevated hospitalization rates in these Hudson River area ZIP codes

In these previously published reports Behavioral Risk Factor Surveillance System (BRFSS) data for entire counties are provided to support statements about the population of Hudson River ZIP codes having higher income and healthier lifestyles However because the ZIP codes adjacent to the Hudson River include only a proportion of the population of the counties abutting the river these county-level BRFSS data may be misleading regarding characteristics of the ZIP codes adjacent to the river (Figure 1) In fact data from the US Census show that the ZIP codes adjacent to the river have lower median income than the counties within which they are located The median income for the total population of the twelve counties adjacent to the Hudson River was $61010 in 2000 while the median income for the ZIP codes adjacent to the river was $45356 This median income for the population of the ZIP codes adjacent to the river is also slightly lower than the statewide (excluding NYC) median income of $47735 (US Census 2000) Thus the Hudson River ZIP codes analyzed in the Sergeev and Carpenter Shcherbatykh et al and Huang et al studies include a population of lower not higher socio-economic status compared to the rest of NYS (excluding NYC)

Because CVD and many CVD risk factors are associated with lower socio-economic status it is especially important to adjust as much as possible for socio-economic status when conducting comparative analyses of CVD hospitalizations or disease incidence or prevalence Regarding the hypothesis about POPs and CVD research studies described in more detail below have suggested possible biological mechanisms for POP exposuresrsquo effects on the heart There is very little research on human exposures to POPs and potential cardiovascular effects however

In response to Congressman Hincheyrsquos request for follow-up of the 2004-2005 findings ATSDR and NYS DOH agreed to conduct a similar study also ecological but at a different level of geography the Census block group By using a level of geography that is generally smaller in area and population than a ZIP code this follow-up seeks to more accurately adjust for socioshyeconomic and other factors that affect cardiovascular hospitalizations and to improve upon the geographic classification of relative distance to the Hudson River In addition rather than comparing hospitalizations among residents of Hudson River area ZIP codes to hospitalizations among residents of all other NYS (exclusive of NYC) ZIP codes with no waste sites with POPs this follow-up investigationrsquos analyses are restricted to the twelve counties abutting the Hudson river from Washington County south to Westchester County Within the study arearsquos twelve counties cardiovascular hospitalization rates in areas closest to the Hudson River are compared with rates in areas farther from the river This more localized approach may also assist with reducing the potential impact of regional differences in medical care practice or reporting factors that could affect the hospitalization data (Ko et al 2007) (Table 2 shows a more detailed listing of the methods used in the prior studies and follow-up study)

Regarding environmental exposures it is important to note that living near a hazardous waste site that contains PCBs or other POPs does not necessarily mean that human exposure from the site has occurred Depending upon the conditions at a site or the specific chemicals exposures may be highly improbable Information on body burden for example PCB levels in blood or on levels of the chemical in off-site air dust or soil is needed to support a claim of exposure The

2

studies published in 2005-2006 did not provide specific information such as environmental or biomonitoring data in support of the assumption that residence in ZIP codes containing hazardous waste sites with POPs or residence in ZIP codes adjacent to the Hudson River resulted in increased exposures to POPs in the population For the purpose of this follow-up we did not conduct environmental or human biological sampling to address this issue Rather this follow-up replicates the prior studiesrsquo approach with some enhancements to better control for confounding particularly confounding by socio-economic status

This descriptive epidemiologic investigation uses already existing hospitalization data from the Statewide Planning and Research Cooperative System and population data from the United States Census to evaluate comparative rates of cardiovascular hospitalizations While this type of investigation can not show proof of cause and effect it can assist in interpreting relationships among indicators of CVD burden and hypothesized exposures associated with distance from the Hudson River This descriptive investigation may also assist in determining whether additional health investigations of these associations are warranted

2 BACKGROUND

Cardiovascular Disease

The widely known risk factors for cardiovascular disease (CVD) in addition to age sex race and family history include tobacco use high blood pressure high serum cholesterol alcohol use obesity low fruit and vegetable intake physical inactivity diabetes low socio-economic status mental ill-health psychosocial stress and the use of certain medications (WHO 2010 NYS DOH 2009 Wing 1988 NYS DOH 2009a)

CVD Definitions CVD is a general term for any disease of the circulatory system The two main types of CVD are ischemic heart disease also termed coronary heart disease and cerebrovascular disease also called stroke Ischemic heart disease (IHD) results from restricted blood flow through the arteries supplying the heart muscle The most serious risk from IHD is acute myocardial infarction (AMI) another term for heart attack AMI occurs when blood flow to the heart is suddenly reduced or stopped usually due to a blockage in a coronary artery Angina is severe constricting pain in the chest often radiating from the area immediately over the heart to the left shoulder and down the arm Cerebrovascular disease refers primarily to stroke the sudden impairment of brain function resulting from interruption of circulation to the brain Ischemic stroke results from blockage of an artery supplying blood to the brain Hemorrhagic stroke occurs due to the escape of blood from a ruptured artery supplying blood to the brain

CVD Burden and Trends CVD is the leading cause of death for men and women of all ages in the United States and in New York State In 1999 for every death from IHD there were five hospitalizations for ischemic disease in NYS (NYS DOH 2009a) Mortality rates from CVD have been declining steadily since the 1980rsquos Research studies estimate that approximately half of the decline is due to improved treatments for CVD and its risk factors such as high cholesterol and high blood pressure and approximately half is due to decreasing incidence of CVD The decrease in incidence is due to trends in health behaviors primarily reduced smoking rates

3

CVD mortality rates do not appear to be declining for all segments of the population however and may actually be increasing among some groups particularly people of lower socio-economic status (Barnett et al 1999) This may be due in part to higher rates of smoking among people of lower socio-economic status as well as to other factors related to socio-economic status and race A growing literature points to multiple causes for CVD and other health disparities including health practices psychosocial stress limited resources discrimination and access to health care (Health United States 2007) Increases in the prevalence of diabetes and obesity in the general population partially offset advances in treatment and favorable trends for other risk factors An increasing ischemic heart disease mortality rate in women ages 35 to 54 since 1997 is attributed to these unfavorable risk factor trends (Ford 2007 Lloyd-Jones 2009) New York State (NYS) and other northeast states have high IHD and low stroke mortality with NYS having the highest IHD mortality and lowest stroke mortality among the 50 states (Howard et al 2009)

CVD Data Sources CVD mortality rates are more easily measured than incidence prevalence or hospitalizations because mortality data from vital records are a comprehensive and relatively accessible data source Information for estimating the incidence and prevalence of CVD outcomes comes primarily from health interview physical examination and record surveys Hospitalization data are available for NYS and the US but are generally collected and analyzed in order to track trends in hospital care and resources expended Actual CVD incidence data validated by physician diagnosis physician examination or reviews of medical records are not directly available for the entire US or for specific geographic areas of NYS Such information is available from population-based cohort studies that follow people over time in selected communities While the concern underlying this research project is to understand CVD levels in the community ie incidence and prevalence we do not have data that directly provide us with this information In the absence of such data hospitalizations are frequently used as indicators of the burden of disease in communities particularly in surveillance or hypothesis-generating investigations such as this one In NYS the Behavioral Risk Factor Surveillance System (BRFSS) an ongoing monthly telephone survey of adults aged 18 years and older provides information about the burden of CVD by age sex race income and education

CVD Risk Factor Data The BRFSS data for NYS show that over 20 of the population aged 65 and older reported having been diagnosed with CVD versus less than five percent of the population under 65 Males were slightly more likely to report having CVD than females Nonshyhispanic whites compared to other ethnic groups combined were slightly more likely to report having CVD None of these slight differences were statistically significant however The only statistically significant demographic risk factors in the BRFSS data were low income and education levels Statistically significant elevations of reported CVD were observed for respondents with incomes below $25000 and respondents with a high school degree or less education (NYS DOH 2009a)

Another factor that can affect the outcome measure hospitalizations is the likelihood of mortality prior to reaching the hospital Studies show a variety of factors affect whether an individual experiencing a heart attack or stroke outside the hospital survives to be counted in hospitalization data These factors include lack of knowledge about personal risk or prior disease socio-economic status race marital status and distance to the hospital (Galea 2007

4

Barnett 2006 Ayala 2003) Population-based studies report sudden cardiac death usually outside the hospital to be the first overt manifestation of heart disease for 40 to 60 of all cases ( Kannel 1987 deVreede-Swagemakers 1997 UK Heart Attack Study 1998) An analysis of out-of-hospital mortality from AMI in Pennsylvania in 1998 showed that in relatively rural areas where travel time to the hospital was 25 minutes or longer 72 of all AMI deaths occurred outside the hospital compared with a statewide average of 49 of AMI deaths occurring outside of the hospital (OrsquoNeill 2003) In New York State 55 of IHD deaths occur outside the hospital compared to 35 of stroke deaths (NYS DOH 2009b)

Hudson River PCB Contamination

During an approximate 30-year period ending in 1977 the General Electric (GE) plants in Hudson Falls and Fort Edward used PCBs in the manufacture of electrical capacitors and discharged large quantities of PCBs into the upper Hudson River Estimates of the total direct discharges to the river are as high as 1330000 pounds In 1983 the US EPA classified a 200shymile section of the Hudson River from Hudson Falls to New York City as a National Priority List site making it one of the largest Superfund sites in the United States In 2002 the US EPA issued a Record of Decision calling for targeted environmental dredging and removal of PCB-contaminated sediment in the upper River (USEPA 2002) The dredging began in May 2009

The Superfund environmental investigations and planned remedial actions have focused on the areas of the river near Hudson Falls and Fort Edward where the highest levels of PCBs in sediments are located The 2002 Record of Decision provides information about the human health risk assessment conducted for the Upper and Mid-Hudson and the ecological risk assessment conducted for the Upper and Lower Hudson River (USEPA 2002) Fish contamination and fish consumption are the primary concerns at the site

In 1976 fishing was banned in the upper river and in 1995 a catch and release policy replaced the ban NYS DOH continues to recommend people eat no fish from the Upper Hudson River that children under age 15 and women of childbearing age eat no fish from the entire 200 mile stretch of river below Hudson Falls and that the general population eat none of most species of fish caught between the federal dam at Troy and Catskill approximately 40 miles south of Troy The Lower Hudson remains closed to commercial fishing for striped bass and eight other species

The fishing bans and advisories were issued because ingestion of fish is the major potential pathway for exposure for the general population Many previous studies have shown relationships between consumption of fish from contaminated waters and human PCB levels Another potential pathway for exposure less frequently studied to date is inhalation of PCBs NYS DOH and ATSDR launched ldquoThe Hudson River Communities Projectrdquo in 2000 to address this issue (More information on this project is provided below)

POPS and PCBs

The category ldquopersistent organic pollutantsrdquo (POPs) includes man-made chemicals that are by definition persistent in the environment Examples of POPs are polychlorinated biphenyls

5

(PCBs) polychlorinated dibenzo-p-dioxins (PCDDs) polychorinated dibenso-furans (PCDFs) and pprsquo-diphenyldichloroethene (DDE) a breakdown product of the pesticide dichlorodiphenyltrichloroethane (DDT) These chemicals are stored in human fat tissues accumulating over an individualrsquos lifetime They also accumulate in fish and other species thereby concentrating up the food chain Levels of most POPs measured in blood serum of Americans have decreased in the last few decades An exception is polybrominated diphenyl ethers (PBDEs) levels of which have been increasing in recent years While levels of many POPs are declining a large number continue to be detectable in blood for most of the US population (USDHHS 2009)

This follow-up investigation focuses on the Hudson River where the POPs of concern are PCBs PCBs are a group of synthetic chlorinated compounds manufactured in the US through 1977 Production was halted due to concerns about the persistence of PCBs in the environment and the toxicity of PCBs shown in animal models PCBs were used primarily by the electrical industry for capacitors and transformers and were also used in hydraulic fluids fluorescent light fixtures flame retardants inks adhesives carbonless copy paper paints pesticide extenders plasticizers and wire insulators PCBs can be oily liquids or solids and some are volatile and can exist as vapor in air PCBs in the environment are generally mixtures of the many different types known as congeners (ATSDR 2000)

The variety of types of PCBs adds to the complexity of studying adverse animal and human health effects Increasingly PCB research focuses on specific PCB congeners and groups of congeners because specific types of congeners appear to have different mechanisms of action and toxicity Dioxin-like PCBs (DLPCBs) are an important subset of PCBs due to concerns about their toxicity The focus on dioxin-like PCBs is paralleled by increasing research on the group of POPs with dioxin-like properties This type of PCB and POP research assigns toxic potency values relative to dioxin to various PCBs and POPs to provide an overall toxic equivalency value to specific PCB and other POPs

From 2002 through 2007 NYS DOH and SUNY Albany School of Public Health researchers conducted the Hudson River Communities Study among residents of Hudson Falls and Fort Edward the villages where the PCB discharges from two GE electrical capacitor plants occurred The comparison community was upriver from the discharges in Glens Falls This studyrsquos evaluation of exposures included measuring PCBs in outdoor air and in blood serum The study found that PCB levels in outdoor air were statistically significantly higher in the Hudson FallsFort Edward area (072 nanograms per cubic meter) than in the upriver Glens Falls area (040 nanograms per cubic meter) but that the PCB concentrations in the Hudson FallsFort Edward area were lower than those in other communities with known PCB-contaminated sites and similar to levels reported in other locations in the northeastern United States A statistically significant reduction in the total PCB concentrations in outdoor air samples was observed at a distance of 1200 meters (34 mile) from the river from 0760 ngm3 to 0497 ngm3 (Fitzgerald et al 2007 Palmer et al 2008)

The average PCB concentrations in serum for the study and comparison populations did not differ significantly (307 ppb wet weight versus 323 ppb) The average toxic equivalency quantities (TEQ) for dioxin-like PCB congeners for the study and comparison area populations

6

also did not differ In addition the study found no detectable differences in study subjectsrsquo serum PCB levels associated with proximity to contaminated portions of the river or other PCB sources The study did find that lifetime consumption of Hudson River fish was associated with higher serum PCB levels (Fitzgerald 2007 Palmer 2008) Another recent biomonitoring study of Brooklyn Manhattan and New Jersey fisherman who consumed versus those who did not consume their catch from the lower Hudson River and New YorkNew Jersey harbor suggests eating fish from these waters is not associated with an increased body burden of PCBs or most other organochlorines This study did not address the inhalation pathway (Morland 2008)

PCBs and Cardiovascular Disease Outcomes

Research studies on adverse health effects associated with PCB exposure have focused on many types of outcomes organs and biological systems including cancer liver reproductive and developmental effects thyroid endocrine immune system and neurological effects as well as specific effects on the skin and eyes Relatively little research has focused to date on associations between PCB exposures and CVD outcomes The majority of human studies have assessed health effects in occupational groups having relatively high-level exposures among people consuming fish from contaminated waters or people exposed to relatively high levels due to accidental contamination of food (USDHHS NIH 2010)

PCBs have been shown to produce a variety of toxic effects in animals These effects include cancer allergies hypersensitivity damage to the central and peripheral nervous systems reproductive disorders and disruption of the immune system (ATSDR 2000) In contrast research to date does not tell us whether specific types of human health effects have occurred from the low-level exposures experienced by the general population Some human epidemiological studies of relatively low levels of PCDDs PCDFs and PCBs suggest possible relationships not only with neurodevelopment of infants but also diabetes and a condition called insulin resistance conditions that increase CVD risk (Longnecker 2001 Carpenter 2006 Arisawa 2005)

In 1998 a study of a population in Seveso Italy exposed to relatively pure TCDD (dioxin) from an industrial accident showed increased mortality from CVD respiratory diseases and diabetes (Bertazzi 1998) These findings suggested the possibility that other dioxin-like chemicals such as dioxin-like PCBs might increase risk for these diseases Some more recent animal studies described below suggest possible mechanisms of action for PCB effects that increase risk for CVD diabetes and hypertension Another relevant source of information is from research using the biological monitoring data from the National Health and Nutrition Examination Survey (NHANES) project to look at the association between PCB levels in blood and self-reported diabetes and heart disease The body of evidence based on analytical human epidemiological studies remains relatively sparse however

Some recent studies using NHANES biomonitoring data show associations between PCB levels and CVD (Lee 2006 Lee 2007 Ha 2007 Everett 2008) These studies are cross-sectional (comparisons at one point in time) evaluations of PCB levels measured in blood samples from individuals and self-reported CVD and diabetes a risk factor for CVD While these studies use individual-level biomonitoring and self-reported health outcome data they are considered to be

7

relatively weak in terms of providing evidence of causality because both exposure and outcome are measured at the same point in time Given the nature of NHANES data collection and reporting practices these studies are not able to control for possible geographical confounding which may occur if urban versus rural residence is associated with higher PCB levels and higher risk for CVD for example The higher PCB levels among African Americans in these data suggest urban residence may be playing a role in the PCB data

The results of animal studies suggest multiple mechanisms by which PCB exposures could affect cardiovascular risk as well as multiple types of adverse health outcomes One hypothesized mechanisms of action specifically for dioxin-like PCBs is that they may have anti-estrogenic effects which impact CVD risk factors such as serum cholesterol levels and blood pressure (Lind 2004) Several studies suggest PCBs interact with a specific receptor (AhR) and increase cellular oxidative stress causing inflammation cell injury and cardiovascular dysfunction (Hennig 2002) Similarly some studies suggest PCB-induced inflammation of fat cells leads to obesity and increased risk for coronary artery disease (atherosclerosis) (Arsenescu 2008)

3 FOLLOW-UP STUDY DATA SOURCES AND METHODS

This follow-up investigation will provide perspective on the 2004-2005 findings (Sergeev and Carpenter 2005 Shcherbatykh et al 2005) by producing results from analyses conducted at a more detailed level of geography the block group rather than ZIP code in order to better control for confounding When the term confounding is used by epidemioogists it refers to the problem that can arise when there is a factor such as income level that is not the factor of interest in the study but that is related to the risk factor being studied (residential location ndash proxy for exposure) and to the disease being studied (CVD) This type of factor is labeled a ldquoconfounderrdquo because it can confound ie interfere with or confuse the evaluation of associations among the variables of interest in the data All types of epidemiological studies face issues of confounding but ecological ie group-level analyses face additional challenges related to potential confounding because analyses can not account for some important individual differences within the groups that are the units of analysis (Morganstern 1998b Hertz-Picciotto 1998) This investigation was able to control for each individualrsquos age sex and race but not for individual-level risk factors such as income or education level

The follow-up investigationrsquos specific objectives are to

improve the ability to control for confounding by socio-economic status through the use of block groups rather than ZIP codes as the unit of analysis

compare the hospitalization rates among residents of block groups nearest the Hudson River to those among residents of block groups farther from the Hudson River

compare the findings of this follow-up study to those of the earlier studies

Study Area and Exposure Areas

The overall study area is comprised of the 12 counties with geographic areas abutting the Hudson River south of the source of PCB contamination from the General Electric facility in Hudson Falls The 12 counties are Westchester Rockland Putnam Orange Dutchess Ulster

8

Columbia Greene Rensselaer Albany Washington and Saratoga US Census boundaries are used to define the study area because census population estimates provide the data for the study area populationrsquos size and the populationrsquos age sex and race distribution Because the study period spanned 1990 through 2005 and populations change over time 1990 and 2000 Census data were used to linearly interpolate the between-year populations from 1991 through 1999 for each block group 2000 data were used without extrapolation to estimate block group populations from 2001 through 2005

Because block group boundaries change between the Census years 1990 block populations were normalized to fit 2000 block group boundaries before the estimation of sex age and race group specific populations In the digital map 2000 block group boundaries were overlaid on the 1990 block centroids to assign 1990 block population counts to the corresponding 2000 block group boundaries Then 1990 block populations were summed to fit the block group boundaries Next population counts by age group (0-14 15-24 25-34 35-44 45-54 55-64 65-74 75+) sex (male female) and race (white black asian other) were estimated for each year at the block group level This investigation includes all ages over 25 years and the study population is divided into four race categories white black or African-American Asian and ldquoOtherrdquo The Pacific Island group was combined with the Asian group and American Indian ldquootherrdquo and ldquomultiple racesrdquo categories are combined into a larger ldquootherrdquo category

Classification of the population into potential exposure categories for this study is based exclusively on location relative to the Hudson River Two types of exposure proxy categories were developed block group adjacency and block group centroid distance to the river shore For the adjacency categories block groups were assigned to the adjacent category if they directly abutted the Hudson River and all other block groups in the 12 counties were assigned to the nonshyadjacent category

For the distance categories a central point (centroid) within each block group was identified and used to assign block group centroid distance to the river shore The block group centroids were population-weighted which means the geographic distribution of households within 2000 census blocks were used to locate block group centroids based on the geographic population distribution By locating a population-weighted block centroid rather than a more straightforward distance centroid the assignment of block group distance to the river shore more accurately reflects distance to the river for the majority of block group residents Buffers of one-half mile and one mile from the Hudson River boundary were created for the assignment of each block group into one of the three distance categories less than frac12 mile (closest) frac12 to one mile (close) and farther than one mile from the river (distant)

Distance categories were selected a priori at the start of the study Because environmental sampling data were not collected along the length of the river included in this investigation we are not able to determine whether these distance categories represent actual differences in exposures As discussed previously a statistically significant reduction in the total PCB concentrations in outdoor air samples was observed at a distance of 1200 meters (34 mile) from the river from 0760 ngm3 to 0497 ngm3 in a study conducted in the Hudson Falls Fort Edward area near the source of PCBs (Fitzgerald 2007 Palmer 2007)

9

For the adjacency categories of the 2371 block groups in the study area 255 (11) were classified as adjacent to the Hudson River and 2116 (89) were classified as non-adjacent (Figure 2) For the distance categories 225 (9) were classified as closest to the river 216 (9) as close and 1930 block groups (81) as distant from the river

Comparison of Figures 1 and 2 show that block groups include smaller geographic areas than ZIP codes and that block groups are relatively smaller in area in urban versus rural areas The study area includes a total of 368 ZIP codes compared to a total of 2371 block groups ZIP code average population in the study area is approximately 8000 compared to block group average population of 1200 Comparison of Figures 2 and 3 shows that using population-weighted centroid distance to the river produces different exposure areas than adjacency Fifty-three (20) of the adjacent block groups were categorized for the distance indicator as distant because the population in the block group was concentrated in the area farther than one mile from the river On the other hand about 11 of the 2116 non-adjacent block groups were classified as closest (77) or close (162)

Hospitalization Data

Hospitalizations for ischemic heart disease (IHD) and cerebrovascular disease (stroke) were included in this study More specifically hospital discharge records that listed as the principal diagnosis either IHD (ICD-9-CM 410-414 and 4297) or cerebrovascular disease (ICD-9-CM 430-436) and subsets of each of these categories similar to the subsets evaluated in the Sergeev et al and Shcherbatykh studies were included The subsets are AMI and angina subsets of IHD and ischemic and hemorrhagic stroke which are subsets of cerebrovascular disease (stroke) (Principal diagnosis represents the illness for which the person was admitted to the hospital) Angina is an outcome not evaluated in the prior studies It was added here for comparative purposes because it represents an outcome appropriately addressed by primary outpatient rather than inpatient hospital care Angina is one of a group of outcomes included in the recently developed national ldquoPrevention Quality Indicatorsrdquo (USDHHS AHRQ 2007) as an indicator for geographic areas or populations for whom primary care is inadequate Table 3 lists the specific ICD-9-CM codes and definitions included in this investigation

While one of the previously published studies (Huang et al 2006) showed elevations in hypertension (high blood pressure) rates this outcome is rarely listed as the principal reason for hospitalization As a secondary diagnosis however hypertension is extremely common with substantial overlap with the primary diagnoses of IHD and stroke In addition hypertension hospitalizations were not as elevated in the ZIP codes adjacent to the Hudson River as the IHD and stroke outcomes As a practical matter geocoding the hypertension hospitalization data (1364000 hospitalizations) would overwhelm the resources for this project For these reasons hypertension was not included in this follow-up investigation

The source of the hospitalization data was the NYS DOH Statewide Planning and Research Cooperative System (SPARCS) established in 1979 to collect detailed records on discharges from hospitals located in New York State Persons seen in the emergency department but not admitted are not included in the SPARCS data Hospitalizations at US Veterans Administration

10

Hospitals are also not included in this dataset We identified the hospital transfers in the data and excluded them so that individuals are only counted once for the same event

The NYS DOH Data Protection Review Board for SPARCS hospitalization data approved our use of hospitalization records for this investigation and provided approximately 460000 records with a principal diagnosis of either IHD or stroke Because of the large number of cardiovascular hospitalizations and this studyrsquos requirement for geocoding each hospitalized personrsquos address we selected a 50 random sample of the hospital discharges with principal diagnoses of IHD or stroke for inclusion in the study (229948 records) Approximately one percent of these records (2737) were eliminated from the study because they contained no residential address resulting in 227211 hospitalization records 24939 of these records were eliminated after geocoding because they were not within the 12 county study area This left 202272 records remaining in the study

For geocoding to the block group level the hospitalization street addresses were modified as needed using US Postal Service standards The addresses were assigned geographic coordinates using commercially available geocoding software (MapMarker 2004) A combination of land parcel data and street centerline files were used to assign geographic coordinates No contact was made with cases parents legal guardians or next of kin of cases to determine residential locations

Once geographic coordinates were assigned to cases through address-matching the case locations were overlaid onto digital maps of the study area using a geographic information system so that the number of observed cases falling within the study area boundaries could be determined and a 2000 Census block group could be assigned to each hospitalization In order to protect confidentiality no maps of individual case locations are provided Geographic coordinates were not able to be assigned for some addresses for example rural routes or post office boxes Internet searches were conducted for addresses that were located in facilities such as nursing homes and trailer parks

While fewer than one percent of records were missing sex or age information approximately 15000 records seven percent were missing race information Evaluation of the geographic and temporal distribution of these records showed they were not randomly geographically distributed 47 of the records with no race information were from one hospital for that hospital race was missing from 100 of the records in all years except one after 1997 mid-way through the study time-frame Evaluation of other records missing race showed similar patterns of 100 of records missing race in specific years In order to prevent biased hospitalization rate estimates records missing race were assigned race randomly assuming a race distribution equal to that of the individualrsquos residential block group

Statistical Analysis

This investigation used negative binomial multivariable regression models the statistical methods used in the previous studies (Sergeev and Carpenter 2005 [IHD] Shcherbatykh et al 2005 [stroke]) The follow-up studyrsquos analyses compared hospitalization rates for populations in block groups adjacent or in close proximity to the Hudson River to rates for populations more

11

distant from the Hudson River while adjusting for the potential influence of other factors Sex age and race-specific hospitalization count data comprise the dependent variable and the statistical methods attempt to predict the counts using the sex age and race distributions from Census data as well as block group-level demographics such as median income Models were first estimated using basic Census information for age sex and race Then more fully adjusted models were estimated using information at the block group level for income education population density Hispanic ethnicity and distance to the nearest hospital

More specifically each hospitalized personrsquos block group was used to assign quartile categories of block group median household income of population with less than a college education population density of the population identified as Hispanic and distance from the block group centroid to the nearest hospital These variables were selected in advance The population density variable was selected to account for potential effects on hospitalization rates of large differences in population density from the relatively rural northern counties to the more urban southern counties in the study area of the population identified as Hispanic was included because Hispanic ethnicity may be associated with CVD risk but Hispanic ethnicity was not available in the hospitalization records Distance to the hospital was included because studies show that shorter distances increase the likelihood that individuals with AMI and stroke survive to be admitted to the hospital

As described previously the literature about risk factors for CVD shows strong associations between lower income and education and higher CVD risk Regarding the effects of income and education on hospitalization rates the literature from analytic studies specifically of hospitalizations for CVD is quite sparse Most analytic studies focus on prevalence incidence or mortality in population-based studies or cohort studies and there are not many of these An example of a study that was useful as background is a study of income level and CVD mortality that showed strong associations of lower income and higher CVD mortality for men and women with the relationship for men being linear throughout the range of income levels while for women the relationship was only strong at levels of income below the median (Rehkopf 2008)

This follow-up studyrsquos approach was to use the same set of variables for the more fully adjusted models using quartile cut points for all the sexrace groups and types of CVD diagnostic subsets The 2000 Census data were used to create the population-weighted quartile categories to be used in the statistical analyses The quartile cut points are shown below

population density(persons per square mile)

median 1999 household income

with less than a college education

with Hispanic ethnicity

Hospital distance

Quartile 1 61 - lt507 $2499-lt$40968 0 - lt31 0 - lt2 lt25 (median) Quartile 2 507 - lt102 $40968 - lt54650 31 - lt43 2 - lt5 25 - lt6 (50-75) Quartile 3 102 - lt6410 $54650 - lt72944 43 - lt55 5 - lt10 6 ndash 11 (75-90) Quartile 4 6410 - 94372 $72944 - 200001 55 - 100 10 - 86 11 ndash 45 (90-100)

SAS version 9 provided the statistical programs used to estimate hospitalization rates for exposure proxy categories and demographic categories The models provide ldquocontrast

12

estimatesrdquo ie rate ratios for one category versus a reference category of a variable and 95 confidence intervals (CI) for these rate ratios For the location categories rate ratios for the areas adjacent are compared to areas not adjacent to the river (reference) and areas closest and close to the river are compared to areas distant from the river (reference) If a rate ratio is greater than 100 the model has estimated an elevation of hospitalizations in that category of the study population compared to the reference population If the rate ratio is less than 100 then there are fewer than expected hospitalizations compared to the reference population

The magnitude of the elevation or deficit is also estimated by the rate ratio For instance if the hospitalization rate is twice as high in the adjacent population it would result in a rate ratio of 200 while a 50 higher rate would result in a rate ratio of 150 If the hospitalization rate in the comparison group was only half as big as in the reference group this would result in a rate ratio of 050 The 95 confidence interval assists with determining whether the observed differences are statistically meaningful ie statistically significant The confidence interval provides the range in which there is a 95 probability of including the true rate ratio

In addition to directly assess potential confounding we examined the associations between income and exposure proxy categories and between income and CVD hospitalization risk Because the modeling process provides rate ratios that provide comparisons among groups but not the rates themselves we also calculated the rates per population for each sex race and income category in order to directly observe the relative magnitudes of CVD hospitalization rates Finally analyses stratified by income quartile were also conducted in order to assess the potential role of confounding by income The stratified analyses consist of negative binomial regressions conducted separately for each quartile of income

4 RESULTS

Demographics Associations with Proximity to the River

Study Population Table 4 provides detailed demographic characteristics for the total study population and the populations of the adjacent versus non-adjacent and nearer versus farther from the river block groups from the 2000 Census The total population of the study area for this one year of the study time period is 2924631 persons The first row of Table 4 shows that approximately 11 percent of the study area population is classified as living in block groups adjacent to the Hudson River and 89 live in non-adjacent block groups Approximately 8 of the studyrsquos population lives in the closest block groups approximately 10 lives in close block groups and approximately 82 in distant block groups The table also shows that approximately 81 of the total study population is identified as white 9 black 3 Asian less than 01 Pacific Islander 02 American Indian 4 ldquootherrdquo and 2 ldquomultiple racesrdquo

The 2000 Census data (Table 4) also show that the populations in block groups adjacent and nonshyadjacent to the river are similar in terms of the percent of minority population but the adjacent block groups include a smaller percentage of persons who have attended college and a higher percentage of households below the poverty level Median income is $47320 for block groups adjacent versus $62668 for block groups not adjacent to the river Data for the study area from the 1990 Census show similar distributions (data not shown)

13

For the distance to the river categories there is a gradient for the three levels of distance from the river with the closest block groups on average having higher percentages of minorities smaller percentages of persons who have attended college and higher percentages of households below the poverty level (Table 4) The populations of the close and closest block groups are more similar to each other than to the populations of the distant block groups The population of the closest block groups is estimated to be 19 black close block groups 14 black and distant block groups 8 black The proportions for the ldquootherrdquo category used in the analyses which includes multiple races and American Indians are approximately 12 in both the closest block groups and in the close block groups compared to 5 in the block groups most distant from the river Median income for the closest block groups is $40748 close block groups $44962 and most distant block groups $64847

The bar chart below illustrates the correlations between US Census 2000 indicators of block group socio-economic status and block group centroid (population weighted) distance from the river for the study population

2000 Census Demographics for Study Population by Distance to River Category

black other hispanic no college below poverty

of pop ulati on

0

10

20

30

40

50

60

lt12 mile

12 to 1 mile

gt1 mile

Table 5 shows selected demographic information for the person-year estimates used in the study analyses for the 1990-2005 study period Person-year data represent individuals from Census estimates counted repeatedly for each year of the study timeframe As described in the Methods section the estimation of person years used interpolation to account for population changes from 1990-2000 and techniques for taking account of block boundary changes The person-years estimates for demographic categories (complete data not shown) show similar patterns as the 1990 Census data (data not shown) and 2000 Census data (Table 5) Table 5 for example shows nearly identical percentages of population in each of the river distance categories as Table 4

Table 5 also shows person year information for income quartiles and exposure proxy categories as well as race and exposure proxy categories that is consistent with the Table 4 data for the

14

2000 population Table 5 row percentages show for example that while approximately 11 of the total study population live in adjacent block groups approximately 19 of the population in the lowest income quartile live in adjacent block groups By design approximately 25 of the total study population is in each of the income quartiles Table 5 column percentages show however that in the adjacent block groups 41 of the population is in the lowest income quartile and only 9 are in the highest income quartile

For the block groups within frac12 mile of the river the distribution among income quartiles is even more skewed towards the lowest income quartile 61 of the population living within frac12 mile of the river is in the lowest income quartile and only 6 is in the highest income quartile Given that approximately 25 of the total study population is within each income quartile these percentages indicate that living within the block groups closest to the river is associated with more than a doubling of the likelihood of being in a block group in the lowest socio-economic status quartile 436 of those living in the block groups close to the river (12 to one mile from the river) are in block groups in the lowest income quartile While less than a doubling this indicates that living in the close block groups is associated with a 75 increase in the likelihood of being in the lowest quartile for socio-economic status

Hospitalizations Table 6 shows the process of removing hospitalization records from the study for a variety of reasons The first column includes the 202272 records that contained a residential address and were located within the 12-county study area as described in the Methods section The next five columns show the records remaining after data cleaning exclusion of transfers elimination of records with missing covariates and exclusions due to poor geocoding results Approximately 33000 records were eliminated because they were transfers from one hospital to another for the same event For approximately 1100 records geocoding was inadequate for assigning block group for imputing race as described in the Methods section 336 records were missing sex (11) or age (325) Of the 167733 records remaining 154220 (92) were able to be adequately geocoded to a street address for assignment in a specific block group While approximately eight percent of hospitalizations were eliminated due to inadequate geocoding Table 6 shows that the percentages of hospitalization records for white black Asian and ldquootherrdquo categories and the distribution among diagnostic categories change very little after these records were dropped

The final column of Table 6 shows that of the 154220 hospitalizations in the analysis 101012 IHD hospitalizations comprise the largest health outcome category and are 65 of the hospitalizations in the study AMI hospitalizations (35811) account for 35 of the IHD hospitalizations and angina (4731) accounts for five percent of the IHD hospitalizations Stroke hospitalizations (53208) comprise 34 of the studyrsquos hospitalizations Approximately 36 of the stroke hospitalizations are for ischemic stroke (19371) and approximately 11 are for hemorrhagic stroke (5995)

Table 7 shows the distributions of hospitalizations among the four income quartiles While each income quartile (by definition) contains approximately 25 of the study population for all types of hospitalizations we see that the lowest income quartile includes more than 25 of hospitalizations from a minimum of 286 (IHD) to a maximum of 356 of hospitalizations (angina) The highest income quartile includes fewer than 25 of hospitalizations for each CVD

15

category ranging from 182 for angina to 224 for hemorrhagic stroke These deviations from the 25 quartile distribution show a strong linear pattern with the largest percentage of hospitalizations coming from residents of block groups in the lowest income quartile and the smallest percentages coming from residents of the most wealthy block groups For AMI the lowest income quartilersquos share of hospitalizations is 50 higher than the highest income quartilersquos share of hospitalizations (30 vs 20) IHD and stroke show similar differences and angina shows a doubling with the lowest income quartile contributing 36 of hospitalizations compared to the highest income quartilersquos 18 share of hospitalizations

Table 7 also shows the distribution of hospitalizations among the race categories and the study population race category distribution for comparison Blacks show lower percentages of IHD and AMI than their population percentages and higher percentages of angina and stroke While full presentation of race and sex findings is beyond the scope of this report the lower than expected percentages of IHD and AMI among blacks who on average have lower socioshyeconomic status and are at greater risk for CVD hospitalizations raised questions that were addressed by calculating hospitalization rates per 1000 population for each sex and race group for each income quartile (Appendix B Table 2) Figures 4 5 and 6 show these rates for the white black and Asian categories (These rates for the ldquootherrdquo category are comparatively very high and are not shown on these graphs This issue is addressed in the Discussion section) Figures 4 5 and 6 show the consistent and strong effects of income quartile on hospitalization rates for AMI and stroke for whites blacks and Asians of both sexes The other outcomes show similar patterns (The ldquootherrdquo group showed an opposite pattern [See Discussion Section])

Demographics Summary

The strong associations of lower income with residence closer to the river (Table 5) and lower income with higher hospitalization rates (Tables 7 amp 8) create a difficult challenge for this investigation The multivariable analyses of hospitalization rates attempt to control for socioshyeconomic differences However because these income differences which strongly affect hospitalization rates are also strongly correlated with the gradations of exposure (river distance) it will be difficult to disentangle the influence of socio-economic status from the influence of potential exposures

Statistical Analyses Proximity to the River and CVD Hospitalization Risk

All the variables described above were included in the multivariable regression models and almost every quartile category for every variable showed statistically significant and expected types of effects for all the CVD hospitalization categories (Appendix B Tables 1a and 1b) The following presentation of results focuses on the rate ratios for the exposure proxy variables Table 8a (IHD) and 8b (stroke) show the regression modeling results Results are shown first for models that include only age sex and race in addition to the exposure proxy categories and second for models that also include median income education population density Hispanic ethnicity and distance to nearest hospital The tables provide numbers of hospitalizations and person-years to assist with interpretation For example for total IHD we can see that while 108 of the study population lives in adjacent block groups 122 of the hospitalizations are among people in adjacent block groups This difference of 14 percentage points is the basis for

16

the finding of a statistically significantly elevated rate ratio (risk) for IHD among people living in the adjacent block groups (RR=133 CI 112-157)

Looking across the columns of Tables 8a and 8b we can see the rate ratios for each of the outcomes for the adjacent versus non-adjacent block groups and the results are similar to those for IHD The elevated rate ratios range from 123 for angina to 136 for AMI The only rate ratio not elevated in these first models adjusting only for age sex and race is for hemorrhagic stroke Going down the rows of the Table the next set of rate ratios are for the distance to the river categories The findings are very similar to those for the adjacent block groups Here however we have two exposure proxy categories allowing comparison of the two These rate ratios show no evidence that living in the block groups closest to the river (lt12 mile to the river) produces greater risk than living in the block groups close to the river (12 to 1 mile from the river)

The next set of rate ratios are for models with additional adjustment for the demographics of each hospitalized personrsquos block group as well as distance to the nearest hospital As expected adjustment for block group median income and other demographics accounts for some of the variability in hospitalization counts and thereby reduces the magnitude of the rate ratios estimated for the river adjacency and distance categories The rate ratio elevation for total IHD for example is reduced from 133 to 107 (RR=107 CI 104-111) The elevated rate ratios in these more fully adjusted models for the adjacency analyses are of similarly small magnitude ranging from 106 for AMI to 110 for ischemic stroke Four of the six outcomes show statistically significant elevations Again only hemorrhagic stroke shows no evidence of an elevated rate ratio

For the river distance categories none of the rate ratios show any evidence that living in the block groups closest to the river (lt12 mile to the river) produces greater risk than living in the block groups close to the river (12 to 1 mile) Of the six types of outcomes only two show statistically significantly elevated rate ratios for living in the closest block groups (total stroke and ischemic stroke) For the close block groups four out of six outcomes show statistically significantly elevated rate ratios The statistically significantly elevated rate ratios vary from 106 (AMI stroke) to 113 (angina)

To address the strong evidence described earlier for confounding in these data ie the association of lower income with both CVD hospitalizations and residence closer to the river we also conducted the analyses separately for each quartile of income These results are presented in Table 9 The first results are from analyses adjusting only for age sex and race and the second set of results are from models including the additional demographic factors For simplicity rate ratios are shown only for block groups closest to the river compared as in the other models to block groups farther than one mile from the river (Results were similar for the other distance categories (data not shown)

For the two lowest income quartiles Table 9 shows results similar to those for the study population as a whole (Table 8a 8b) However for the two highest income quartiles which represent frac12 of the study population Table 9 shows a consistent pattern of reduced rate ratios for residence in the closest block groups within frac12 mile of the river In the highest income quartile

17

these reduced rate ratios are statistically significant for almost all of the outcomes in both the partially and more fully adjusted models In addition the magnitude of the rate ratio estimates ie their departure from 100 is relatively high For the population living in block groups with the highest median income (income gt$73000) living within frac12 mile of the river is associated with having about 70 of the risk for IHD AMI and stroke (CI=068 069 069) as the population in equally high income block groups farther than one mile from the river In other words among the highest income quartile living this close to the river is associated with an estimated 30 reduction in risk for these types of hospitalizations For angina (CI=028) there is an estimated 70 reduction in risk and for ischemic stroke (CI=058) the reduction is estimated to be 40 These stratified analyses show that in one-half of the study population the two higher income quartiles there is no evidence of elevated hospitalization risk for populations in block groups closer to the river On the contrary reduced rate ratios are shown for all outcomes in the higher income quartiles for residents of block groups within frac12 mile of the river compared to those living farther than one mile from the river

Weighted average rate ratio estimates are a standard method for estimating a general population finding from stratified analyses and are also provided in Table 9 These average rate ratios range from 078 to 102 showing no evidence of elevated risk for the overall population associated with residence within frac12 mile of the river compared to living farther than one mile from the river Results for the other categories of river proximity showed similar results (data not shown) (Given the evidence of confounding interfering with analyses of the entire study population and the opposing findings depending on income quartile no confidence intervals are provided here for the overall population-weighted average estimates)

Summary of Results

In summary the multivariable regression results for the entire study population (Tables 8a 8b) show statistically significantly elevated rate ratio estimates for the adjacent versus non-adjacent block groups for four out of six hospitalization diagnosis categories The analyses also produce statistically significantly elevated rate ratio estimates for block groups closest to the river (less than frac12 mile from the river) and close to the river (between 12 to 1 mile from the river) compared to distant block groups (farther than 1 mile from the river) for most of the outcome categories However the findings provide no evidence that the closest block groups have higher hospitalization rates than the close block groups

The most elevated rate ratio estimate is 113 (angina) indicating an estimated 13 percent increase in risk for angina hospitalizations for residents of close versus distant block groups The only outcome with no statistically significant elevations for living near (adjacent closest or close) the river in the more fully adjusted models is hemorrhagic stroke It is noteworthy that the outcome that is considered an indicator of inadequate primary care rather than of incidence or prevalence angina hospitalizations shows the highest elevations associated with residence near the river (Table 8a) In addition the hospitalization outcome with the least association with socioshyeconomic status in our data (Appendix B Table 1b) and the lowest out-of-hospital mortality (NYS DOH2009b) hemorrhagic stroke is the outcome that shows no pattern of elevations associated with residence near the river (Table 8b) This pattern of findings is consistent with the interpretation that the rate ratios for living closer to the river are reflecting the influence of

18

lower socio-economic status and other known risk factors for hospitalizations for CVD rather than any unusual environmental exposures in areas near the river

The stratified results add strong support to this conclusion They suggest that classical confounding of the exposure-disease relationship by socio-economic status produced biased results in the overall multivariable regression analyses While the regression analysis of the entire study population shows elevated hospitalization rates for residents of block groups near the river compared to residents living farther away (Table 8a and 8b) the stratified analyses (Table 9) show this association applies only to the 50 of residents living in block groups with lower median incomes For the other 50 of residents those who live in block groups with higher median incomes hospitalization rate ratios are reduced for those living near the river Classical confounding refers to the type of confounding that can occur in individual-level or group-level (ecological) studies when a factor such as socio-economic status affects both the exposure (residential location) and the health effect In this study of CVD hospitalizations there are disproportionate numbers of outcomes and exposures (residence near the river) among specific levels of the confounder the lower income quartiles (Tables 4 5 7) The disproportionate numbers of hospitalizations and potential exposures (block groups close to the river) among the lower income quartiles create an artifactual exaggerated ie biased effect on the estimates for the overall analysis (Greenland 1999 Greenland 2003)

5 DISCUSSION

The key finding of this investigation is that the results of the multivariable regression modeling for the entire study population the elevated CVD hospitalization rates for populations living closer to the Hudson River are likely to be artifacts of classical confounding Stratifying on the confounding variable ie holding median income level constant the standard method for controlling for confounding showed the decisive role of income as a confounding variable The associations between living closer to the river and having higher hospitalization rates were evident only for people in the two lowest income quartiles For people in the two highest income quartiles the association was completely opposite ie people living closer to the river had lower rates of hospitalizations This variability of effects can be interpreted as evidence that there are no actual effects of the exposure-proxy categories on hospitalization rates

This type of classical confounding due to the strong influence of a variable that precedes causes or strongly affects both the exposure and the health outcome can occur in any type of epidemiological study The ecologic ie group-level analysis in this study is not the primary cause of the confounding Rather the complex associations in the population among income education CVD risk geographic residential patterns and geographic assignment of exposure status are the basis for the confounding However the studyrsquos group-level design contributes to the studyrsquos inability to completely control for confounding using multivariable regression methods and contributes to difficulties of interpreting the study findings

The stratified results shown in Table 9 may also be a reflection of inadequate control for socioshyeconomic status using quartile groupings A possible explanation for the Table 9 findings is that among lower income quartile populations people living in closest proximity to the river may have the lowest incomes while in higher income quartile populations people living in closest

19

proximity to the river may have the highest incomes This potential income difference associated with river distance within quartile groupings would explain why the stratified analyses showed positive associations with hospitalization risk for lower income residents closest to the river and negative associations with hospital risk for higher income residents closest to the river In this example of a possible explanation it is assumed that river proximity categories are capturing relative income (and hospitalization risk) differences rather than exposure Because the exposure proxy river distance categories are assigned at the group level this type of analysis is not able to distinguish whether these categories are capturing income rather than potential exposure in the analyses

When trying to interpret this studyrsquos findings it is important to keep in mind the many factors associated with CVD risk that are not controlled for in this studyrsquos analyses These include tobacco use high blood pressure high serum cholesterol alcohol use obesity low fruit and vegetable intake physical inactivity diabetes mental ill-health psychosocial stress and the use of certain medications Another factor that is not known to be associated with CVD risk but that is relevant for PCB exposure is consumption of contaminated fish It is possible that lower income populations are more exposed to PCBs because they are more likely to eat fish from the Hudson River This however would not help explain why higher income people near the river showed lowered CVD hospitalization risk than their counterparts living farther from the river

In addition to the confounding due to socio-economic statusrsquos effects on both the risk of CVD and residential location (the exposure proxy) there is the possibility that another pattern of effects also associated with socio-economic status called effect modification plays a role in these data Effect modification occurs when different groups people with differing types of health insurance or differing behaviors for example show different patterns or levels of health effects associated with the same exposures The differing effects (CVD hospitalization risks) of residence in block groups closest to the river by income quartile could be interpreted as evidence of effect modification (Table 9) However the magnitude and the change in direction of effects is unlikely to be due solely to differing biological or behavioral factors associated with income level that affect the results of exposures from living near the river Given the strong associations of socio-economic status and CVD hospitalization risk it is more likely that the analyses stratified by income enhance protection against bias compared to the multivariable models and that the stratified analyses represent findings from better control for confounding as opposed to effect modification

Effect modification is a concept used to describe important relationships between exposures and effects (that may differ among sub-groups for example) that the researcher wants to identify and understand Confounding on the other hand refers to associations among the variables that the researcher is trying to control so that the effects of interest (effects from exposures rather than age or sex for example) can be identified and the strength of the effects can be estimated The ecological design of this study contributes to the difficulty of interpreting whether the study shows evidence of effect modification as opposed to confounding (Morgenstern 1998b Hertz-Picciotto 1998)

Another limitation associated with this studyrsquos design is that block-group level socio-economic and environmental attributes that are assigned to individuals in this type of study can be more

20

strongly correlated at the group level than at the individual level In this way the ecological assignment of variables may lead to increasing problems of confounding because of strong associations among indicators of socio-economic status and environmental exposure categories (Morganstern 1998a) This problem is likely to have affected this studyrsquos results As with effect modification by group described above this type of confounding by group cannot be evaluated with ecologic data because individual-level information is needed to check for this type of ldquocross-levelrdquo bias

Ecological analyses are an important tool for environmental epidemiology and there is therefore a large and growing literature on their strengths and limitations and methods to address limitations (Piantadosi 1988 Greenland 1989 Greenland 1992 Greenland 1994 Richardson 2000) One particular concern is the interpretation of ecological study findings showing modest elevations of risk such as in the current study with rate ratios in the more fully adjusted models generally estimated to be below 110 (Wakefield 2003 Morgenstern 1998) Wakefield states that slight elevations need to be interpreted with great caution

in particular for cancer and heart disease (because) there are risk factors that are far more predictive of disease than environmental factors hellip Consequently the potential for confounding is strong since ecological studies do not directly use individual-level risk factor data although stratification by age and gender is routinely carried out (Frequently an area-level measure of socio-economic status is also used as a confounding variable but being an aggregate summary it only provides very crude control) (Wakefield 2003 p 9-10)

A strength of this study is that it used Census block groups which are relatively small average population of 1200 in the study area to assign socio-economic characteristics rather than the larger ZIP codes with average population size 8000 in this area By using block groups this study sought to reduce confounding by socio-economic status because the assignment of median income and other variables at the block group level was expected to be more accurate with block group populations expected to be more homogenous than larger and more geographically diffuse ZIP code populations used in the prior studies

The use of census block groups rather than ZIP codes likely enhanced the accuracy of the group-level category assignments However this type of design that assigns characteristics to hospitalizations using group-level characteristics and distance to the river is limited in comparison to individual-level designs that gather such information from personal interviews for example Most importantly for this investigation the group-level assignment of median income and other demographic variables to each hospitalization limited the studyrsquos ability to distinguish among individuals within the exposure proxy categories and thereby reduced the studyrsquos ability to control for confounding

This type of measurement error or inaccuracy from assigning group-level potential exposure income or educational level to the individual hospitalizations is generally assumed to bias a study towards findings of no associations or no effects But in situations where the measurement error is more severe for some variables than others and these variables are correlated then the study findings can be biased either towards exaggerated effects or no effect In this

21

investigation potential measurement error associated with the assignment of exposure categories based on block group centroid distance to the river is expected to be relatively low because a large proportion of the hospitalization addresses are very clearly in areas distant from the river The measurement error associated with the assignment of group-level demographic characteristics (income and education for example) from the Census is potentially relatively higher because this type of error can occur throughout every block group in the study

Another strength of this study is the use of NYS SPARCS data a relatively comprehensive population-based source of information about hospitalizations These data include individual-level information used in this study for patient address sex age race and diagnosis Of these variables only the address and race categories were missing information for substantial numbers of records and this limitation could have contributed to imprecision sufficient to create bias in the findings These issues are discussed in greater detail below

While geocoding to block groups represents a strength of this study this procedure required high quality address information One percent of the study arearsquos hospitalization records contained no address and eight percent of the addresses could not be accurately geocoded to a block group resulting in nine percent of hospitalizations being eliminated from the study because of address andor geocoding issues The distributions of race categories and distribution among the types of diagnoses did not change after dropping records with no or insufficient address information suggesting that the dropped records did not represent a substantially skewed group of records

Inaccuracy in the studyrsquos assignments of race categories results from limitations associated with both the hospitalization and Census data The SPARCS hospitalization database did not include information on race for seven percent of the studyrsquos hospitalizations We imputed race by assigning race category randomly based on the race distribution of the personrsquos block group This method would have biased the study toward a finding of no effect of race on hospitalization risk

Another limitation of the SPARCS data used in this investigation is that repeat hospitalizations for the same individual were not able to be identified If an individual was admitted to the hospital many times over the course of the study period then that individual would be counted multiple times This limitation may have contributed to an exaggerated association of lower socio-economic status and CVD hospitalizations because some people with lower incomes receive less adequate outpatient care and may have more repeat admissions Because of the association of lower income with residence near the river this limitation may have contributed to the difficulty of adequately controlling for confounding by socio-economic status in this investigation

While the use of Census data at the block group level rather than at the ZIP code level is a strength in terms of relative homogeneity among the population within block groups it is possible that block group population estimation errors contributed to imprecise estimation of hospitalization rates Particularly with regard to the race categories the study finding that the ldquootherrdquo category showed a counter-trend of higher income and higher CVD hospitalization rates (Appendix B Table 2) is evidence that differences between the categories used in the Census race designations and hospitalization race assignments affected the estimated hospitalization

22

rates The evolving choices for identifying onersquos race in the Census are not concurrently reflected in the SPARCS data and these inconsistencies likely contributed to imprecision in the estimates of CVD hospitalization risk associated with specific race categories These issues did not contribute to bias for the overall findings however We modeled the data with a variety of race category groupings combining black and ldquootherrdquo for example and the overall results did not change (data not shown)

Also regarding the Census undercounting of approximately two percent for the US population in 2000 was estimated to be primarily among the lowest socio-economic groups and minorities while over-counting of one percent was estimated to have occurred among the most wealthy who sometimes received and completed forms at multiple residences (United States Censusrsquo Monitoring Board 2001) The undercounting of the lower income population would result in over-estimation of rates in this population Undercounting of recent Asian immigrants for example particularly those of lower socio-economic status is a potential reason for the relatively high rates of CVD hospitalizations estimated for lower income quartiles of the Asian population (Figures 4-6) Models that grouped the Asian population with other race categories were also estimated and these showed the same results as the models reported here (data not shown) Undercounting of lower income groups in general would have contributed to the finding of higher hospitalization rates in these groups and added to the difficulty of adequately controlling for confounding of lower income with residence near the river

This study used a 50 percent sample of the CVD hospitalizations in the study area This was accomplished by using every other record provided in the complete dataset This would be expected to provide a non-biased random sample As a general check for data provision management or analysis errors we compared our estimates of hospitalization rates per population with those from other sources Table 10 shows these comparisons and shows that our estimates are generally consistent with rates from other sources

Actual exposures were not measured in this investigation Rather proximity to the river was used as a proxy for potential exposures associated with the river In addition other potential exposures that may impact CVD were not evaluated or measured Exposures to lead and air pollution would be expected to be higher in more urban areas and areas near heavy traffic which may tend to coincide with proximity to the river These issues were not addressed in this investigation If these exposures were occurring more often near the river and contributing to cardiovascular hospitalization rates this would contribute to findings of elevations in areas closer to the river

It is important to keep in mind the limitations associated with using CVD hospitalizations when incidence andor prevalence of CVD are the actual outcomes of interest As described previously mortality prior to reaching the hospital may reduce hospitalization rates particularly for AMI and stroke for some groups or areas more than others and introduce bias if hospitalizations are being interpreted as a surrogate for incidence or prevalence

23

Addressing Study Objectives

This follow-up report has addressed the studyrsquos first two objectives to increase control for confounding by socio-economic status through the use of block groups rather than ZIP codes as the unit of analysis and to compare the hospitalization rates among residents of block groups nearest the Hudson River to those among residents of block groups farther from the Hudson River Regarding the first objective the use of smaller units of analysis block groups rather than ZIP codes improves the accuracy of the assignment of income and other group-level demographic categories to the hospitalizations

Regarding the studyrsquos second objective to compare hospitalization rates among residents of block groups near to the river to hospitalization rates among residents far from the river this studyrsquos multivariable analyses showed relatively small but statistically significant elevations of CVD hospitalization risk for residents of block groups in close proximity to the Hudson River Stratified analyses that controlled for confounding directly by conducting analyses within each income quartile showed that residence in block groups near the Hudson River was associated with increased risk for CVD hospitalizations among residents of lower income block groups and conversely residence in block groups near the Hudson River was associated with decreased risk for CVD hospitalizations among residents of higher income block groups The estimated increased and decreased risks were of sufficient magnitude to nearly cancel each other out when average rate ratios for the four income quartiles were calculated The average rate ratios may be interpreted as showing no evidence that CVD hospitalization rates are elevated for residents living near the river for the general study population This finding suggests that the multivariable regression findings for the overall population show elevated risks for hospitalizations among residents of block groups near the river due to confounding of the relationship of residential location and CVD risk by socio-economic status

The findings from analyses stratified by income quartile may also be interpreted as showing that residing in a higher versus lower income quartile block group produces different effects from similar exposures (effect modification) As discussed previously the ecological ie group-level analyses are unable to directly evaluate confounding and effect modification Rather information about the strengths and direction of associations among variables in the analyses need to be evaluated in light of known risk factors to draw conclusions about likely explanations and plausible interpretations In this case uncontrolled confounding by socio-economic status appears to be the most likely explanation for the findings This is due to the strong association between residential proximity to the river and socio-economic status

The studyrsquos third objective was to compare the findings of this review to those of the earlier studies that prompted this investigation Because of the similarities in study design and statistical analysis the prior studyrsquos findings can be compared with the findings of the multivariate regression analyses in the current study using Table 1 and Tables 8a and 8b The major difference between the studies is the composition of the comparison areas for the analyses Because of this difference as well as other differences (Table 3) the studies are not entirely comparable

24

Table 1 shows some findings from the prior studies Comparison with Table 8a and 8b from this study shows that the CV hospitalization rate ratios estimated for living in block groups adjacent or near the river in the current study are substantially lower than the rate ratios for residence in ZIP codes adjacent to the Hudson River in the previously published studies In most cases the current analyses show elevations below 10 in contrast to elevations of 20 to 39 in the prior studies The results of the current study are quite similar to the prior study results in terms of the associations reported for age sex race and income (Appendix B Tables 1a and 1b) (Sergeev and Carpenter 2005 Tables 4 amp 5 Shcherbatykh et al 2005 Table 3) These similarities and the general methodological similarities between the current and prior studies suggest the likelihood that the prior studiesrsquo findings also were affected by confounding by socio-economic status

6 CONCLUSION AND RECOMMENDATIONS

While this type of study can neither prove nor disprove causal links among risk factors and health effects it is useful for examining evidence for unusual patterns that might warrant additional investigation This evaluation of hospitalization data residential location and socioshyeconomic factors in block groups in the 12 Counties abutting the Hudson River revealed a closely coinciding pattern of lower socio-economic status and residential proximity to the Hudson River the proxy for potential exposure and a strong association of lower socioshyeconomic status and risk for CVD hospitalizations The strong associations among residential location socio-economic status and CVD risk prevent us from drawing definitive conclusions from the multivariable regression analyses

The multivariable analyses showed relatively small but statistically significant elevations of CVD hospitalization risk for residents of block groups in close proximity to the Hudson River Stratified analyses conducted within each of the four income quartiles showed such elevations only among residents of lower income block groups This studyrsquos data methods and analyses can not provide a conclusive interpretation for this finding However the strong effect of socioshyeconomic status on residential location (exposure proxy) and on CVD hospitalization risk makes it likely that the overall multivariable regression results are affected by uncontrolled confounding by socio-economic status

To draw more definitive conclusions particularly for outcomes such as CVD that are determined by multiple known risk factors studies seeking to investigate the role of PCB or POP exposures will require resource-intensive methods that include gathering individual-level exposure or biomonitoring information as well as individual-level medical histories and information on other risk factors

This current investigation did not include environmental or biological sampling data One study described previously did gather such data in the Hudson River region with the highest potential exposures (Hudson Falls Fort Edward) This study detected a statistically significant reduction in the total PCB concentrations in outdoor air samples at a distance of 1200 meters (34 mile) from the river However the PCB levels in outdoor air in this area were lower than those in other communities with known PCB-contaminated sites and similar to levels reported in other locations in the northeastern United States The study found no detectable differences in study

25

subjectsrsquo serum PCB levels associated with proximity to contaminated portions of the river or other PCB sources (Fitzgerald 2007 Palmer 2008)

It is recommended that the results of the current investigation of CVD hospitalizations and residence near the Hudson River be shared with NYS residents and other stakeholders with interest in this issue and be submitted for publication in a peer-reviewed journal

26

AGENCY INFORMATION

New York State Department of Health Authors Elizabeth Lewis-Michl PhD

Research Scientist Bureau of Environmental and Occupational Epidemiology

June Beckman-Moore MPH Research Scientist Bureau of Environmental and Occupational Epidemiology

Marta Gomez MS Research Scientist Bureau of Environmental and Occupational Epidemiology

Acknowledgments This study was supported by ATSDR and the Centers for Disease Control and Prevention

through a Cooperative Agreement Grant to NYS DOH (3U61TS200002-19W1)

The authors would like to thank the following people for their contribution to this project Alissa Caton PhD of the Bureau of Environmental and Occupational Epidemiology NYS DOH developed the study protocol assembled the preliminary data and conducted preliminary analyses Syni-an Hwang PhD Bureau Director and Shao Lin PhD Research Scientist Bureau of Environmental and Occupational Epidemiology NYS DOH and Edward Fitzgerald PhD Professor Division of Environmental Health and Toxicology SUNY Albany School of Public Health reviewed and provided comments on draft versions

New York State Department of Health Cooperative Agreement Coordinator Don Miles MS

Public Health Specialist Bureau of Environmental Exposure Investigation

ATSDR Technical Project Officer Gregory V Ulirsch PhD

Environmental Health Scientist Cooperative Agreement and Program Evaluation Branch

Division of Health Assessment and Consultation

Mohammed Uddin MD Epidemiologist

Division of Health Studies

ATSDR Regional Representatives Leah Graziano

Senior Regional Representative - Region 2 Office of Regional Operations

Racquel Stephenson Regional Representative - Region 2

Office of Regional Operations

27

REFERENCES Agency for Toxic Substances and Disease Registry (ATSDR) 2000 Toxicological Profile for Polychlorinated Biphenyls (PCBs) ATSDR Division of ToxicologyToxicology Information Branch Atlanta GA

Arisawa K Takeda H Mikasa H 2005 ldquoBackground exposure to PCDDsPCDFsPCBs and its potential health effects a review of epidemiologic studiesrdquo The Journal of Medical Investigation Feb 52(1-2)10-21

Arsenescu V Arsenescu RI King V Swanson H Cassis LA 2008 ldquoPolychlorinated biphenylshy77 induces adipocyte differentiation and proinflammatory adipokines and promotes obesity and atherosclerosisrdquo Environmental Health Perspectives 116 (6) 761-768

Ayala C Croft J Keenan N Neff L Greenlund K Donohoo R Zheng Zhi Mensah G 2003 ldquoIncreasing Trends in Pre-transport stroke deaths- United States 1990-1998rdquo Ethnicity amp Disease 12 Spr S2131-S2137

Barnett E Armstrong D Casper M 1999 Evidence of increasing coronary heart disease mortality among black men of lower social class AEP 98 464-471

Bertazzi PA Bernucci I Brambilla G Consonni D Pesatori AC 1998 ldquoThe Seveso studies on early and long-term effects of dioxin exposure a reviewrdquo Environmental Health Perspectives April 1062 625-633

Carpenter DO 2006 ldquoPolychlorinated biphenyls (PCBs) routes of exposure and effects on human healthrdquo Reviews in Environmental Health Jan-Mar 21(1)1-23

deVreede-Swagemakers JJ Gorgels APDubois-Arbouw WI van Ree JW et al 1997 ldquoOut-of hospital cardiac arrest in the 1990s a population-based study in the Maastricht area on incidence characteristics and survivalrdquo Journal of the American College of Cardiology 30(6) 1500-5

Everett CJ Mainous AG Frithsen IL Player MS Matheson EM 2008 ldquoAssociation of polychlorinated biphenyls with hypertension in the 1999-2002 National Health and Nutrition Examination Surveyrdquo Environmental Research 108 94-97

Fitzgerald EF Belanger EE Gomez MI Hwang S Jansing RL Hicks HE 2007 ldquoEnvironmental exposures to polychlorinated biphenyls (PCBS) among older residents of upper Hudson River communitiesrdquo Environmental Research 104 352-360

Galea S Blaney S Nandi A Silverman R Vlahov D Foltin G Kusick M Tunik M Richmond N 2007 ldquoExplaining racial disparities in incidence of and survival from out-of-hospital cardiac arrestrdquo American Journal of Epidemiology 1665 534-543

Greenland S Morgenstern H 1989 ldquoEcological bias confounding and effect modification International Journal of Epidemiology 18269-74

28

Greenland S 1992 ldquoDivergent biases in ecologic and individual level studiesrdquo Statistics in Medicine 111209-23

Greenland S Robins J 1994 ldquoInvited commentary ecologic studies ndash biases misconceptions and counterexamples American Journal of Epidemiology 139747-60

Greenland S Pearl J Robins JM 1999 Causal diagrams for epidemiologic research Epidemiology 1037-48

Greenland S 2003 ldquoQuantifying biases in causal models classical confounding vs collidershystratification biasrdquo Epidemiology May Vol 143 300-306

Ha M Lee D Jacobs DR 2007 ldquoAssociation between serum concentrations of persistent organic pollutants and self-reported cardiovascular disease prevalence results from the national health and nutrition examination survey 1999-2002rdquo Environmental Health Perspectives 1158 1204-1209

Hennig B Meerarani P Slim R Toborek M Daugherty A Silverstone AE Robertson LW 2002 ldquoProinflammatory properties of coplanar PCBs in vitro and in vivo evidence Toxicology and Applied Pharmacology Jun 15181(3)174-83

Hertz-Picciotto I ldquoEnvironmental Epidemiologyrdquo in Modern Epidemiology Rothman KJ and Greenland S 2nd ed Philadelphia Lippincott Williams amp Williams 1998 pp555-584

Howard G Cushman M Prineas R Howard V Moy C Sullivan L DrsquoAgostino R McClure L Pulley L Safford M 2009 ldquoAdvancing the hypothesis that geographic variations in risk factors contribute relatively little to observbed geographic variations in heart disease and stroke mortalityrdquo Preventive Medicine 49 129-132

Huang X Lessner L Carpenter DO 2006 ldquoExposure to persistent organic pollutants and hypertensive diseaserdquo Environmental Research 102(2005) 101-106

Kannel WB Cupples LA DrsquoAgostino RB 1987 ldquoSudden death risk in overt coronary heart disease the Framingham Studyrdquo American Heart Journal 113(3) 799-804

Ko DT Krumholz HM Wang Y Foody JM Masoudi FA Havranek EP You JJ Alter DA Stukel TA Newman AM Tu JV 2007 ldquoRegional differences in process of care and outcomes for older acute myocardial infarction patients in the United States and Ontario Canadardquo Circulation 115196-203

Lee D Lee I Song K Steffes M Toscano W Baker BA Jacobs DR 2006 ldquoA strong dose-response relation between serum concentrations of persistent organic pollutant and diabetesrdquo Diabetes Care 29 (7) 1638-1644

Lee D Lee I Porta M Steffes M Jacobs DR 2007 ldquoRelationship between serum concentrations of persistent organic pollutants and the prevalence of metabolic syndrome among non-diabetic

29

adults results from the National Health and Nutrition Examination Survey 1999-2002rdquo Diabetologia 50 1841-1851

Lind PM Orberg J Edlund U Sjoblom L Lind L 2004 ldquoThe dioxin-like pollutant PCB126 (33rsquo44rsquo5-pentachlorobiphenyl) affects risk factors for cardiovascular disease in female ratsrdquo Toxicology Letters 150 293-299

Longnecker MP Klebanoff MA Brock JW Zhou H 2001 ldquopolychlorinated biphenyl seum levels in pregnant subjects with diabetesrdquo Diabetes Care 24(6) 1099-1101

Lloyd-Jones D Adams R Carnethon et al (AHA Statistics Committee and Stroke Statstics Subcommittee) 2009 ldquoAHA Statistical Update Heart disease and stroke statisticsmdash2009 updaterdquo Circulation 119e21-e181

MapMarker Plus V 100 MapInfo Corp 2004

Morgenstern H 1998 ldquoEcologic studyrdquo In Armitage P Colton T eds Encyclopedia of biostatistics Vol2 NY NY John Wiley and Sons Inc 1255-76

Morganstern H ldquoEcologic Studiesrdquo Chapter 23 in Modern Epidemiology Rothman KJ and Greenland S 2nd ed Philadelphia Lippincott Williams amp Williams 1998 pp459-480

Morland K Wolff M Bopp R Godbold J Landrigan P 2008 ldquoFish consumption and body burden of organochlorines among Lower Hudson urban anglersrdquo American Journal of Industrial Medicine 51587-594

New York State Department of Health (NYS DOH) 2009a ldquoThe Burden of Cardiovascular Disease in New York Mortality Prevalence Risk Factors Costs and Selected Populationsrdquo Bureau of Chronic Disease Epidemiology and Surveillance Bureau of Health Risk Reduction httpwwwhealthstatenyusnysdohchronic_diseasecardiovascularburdenofcvdinnyspdf

New York State Department of Health (NYS DOH) 2009b ldquoHeart Disease and Stroke Indicatorsrdquo Community health assessment indicators Bureau of Biometrics and Health Statistics httpwwwhealthstatenyusstatisticschacchai

OrsquoNeill L 2003 ldquoEstimating out-of-hospital mortality due to myocardial infarctionrdquo Health Care Management Science 6 147-154

Palmer PM Belanger EE Wilson LR Hwang SA Narang RS Gomez MI Cayo MR Durocher LA Fitzgerald EF ldquoOutdoor air PCB concentrations in three communities along the Upper Hudson River New Yorkrdquo 2008 Archives of Environmental Contamination and Toxicology 54363-371

Piantadosi S Byar DP Green SB 1988 ldquoThe ecological fallacyrdquo American Journal of Epidemiology 127893-904

30

Rehkopf DH Berkman LF Coull B Krieger N 2008 ldquoThe non-linear risk of mortality by income level in a healthy population UgtSgt National Health and Nutrition Examination Survey mortality follow-up cohort 1988-2001rdquo British Medical Journal Public Health 8383

Richardson S Montifort C 2000 ldquoEcological correlation studies In Elliott P Wakefield JC Best NG et al eds Spatial epidemiology methods and applications NY NY Oxford University Press 205-220

SAS version 9 Cary North Carolina

Schoenbach V 2001 ldquoMulticausality Effect Modification wwwepidemiolognet Ch12 381shy413

Schoenbach V 2004 ldquoMulticausality Confounding wwwepidemiolognet Ch11 335-375

Sergeev AV and Carpenter DO 2005 Hospitalization rates for coronary heart disease in relation to residence near areas contaminated with persistent organic pollutants and other pollutants Environmental Health Perspectives 113(6)756-761

Shcherbatykh I Huang X Lessner L Carpenter DO 2005 Hazardous waste sites and stroke in New York State Environmental Health A Global Access Science Source 4(18) httpwwwehjournalnetcontent4118

Siemiatycki J Wacholder S Dewar R Cardis E Greenwood C and Richardson L 1988 ldquoDegree of confounding bias related to smoking ethnic group and socio-economic status in estimates of the associations between occupation and cancerrdquo Journal of Occupational Medicine 30 617-625

Tu YK Gunnell D Gilthorpe MS 2008 ldquoSimpsonrsquos paradox Lordrsquos paradox and suppression effects are the same phenomenon- the reversal paradoxrdquo Emerging Themes in Epidemiology 52

United Kingdom Heart Attack Study Collaborative Group Norris RM 1998 ldquoFatality outside hospital from acute coronary event in three British health districts 1994-1995rdquo British Medical Journal 316(7137) 1065-70

United States Bureau of the Census 2000 Census of population and housing summary file 1 (SF1) United States Department of Commerce 2001

United States Bureau of the Census 2000 Census of population and housing summary file 3 (SF3) United States Department of Commerce 2002

United States Bureau of the Census 1990 Census of population and housing summary file 1 (SF1) United States Department of Commerce 1991

31

United States Bureau of the Census 1990 Census of population and housing summary file 3 (SF3) United States Department of Commerce 1992

United States Censusrsquo Monitoring Board ldquoFinal Report to Congressrdquo September 1 2001 httpgovinfolibraryunteducmbcmbpreportsfinal)report

United States Department of Health and Human Services (USDHHS) Agency for Healthcare Research and Quality 2007 ldquoAHRQ Quality Indicators Guide to Prevention Quality Indicators Hospital Admission for Ambulatory Care Sensitive Conditionsrdquo October 2001 Version 31 (March 12 2007) downloaded 2009 httpwwwqualityindicatorsahrqgov

United States Department of Health and Human Services (USDHHS) Centers for Disease Control 2007 Health United States A Chartbook National Center for Health Statistics wwwcdcgovnchsdatahushus07pdf

United States Department of Health and Human Services (USDHHS) Centers for Disease Control National Center for Health Statistics 2007 ldquo2005 National Hospital Discharge Surveyrdquo No 385 by De Frances C and Hall M Division of Health Care Statistics

United States Department of Health and Human Services (USDHHS) 2009 Centers for Disease Control and Prevention National Center for Environmental Health 4th National Report on Environmental Chemicals httpwwwcdcgovexposurereportpdfFourthReportpdf

United States Department of Health and Human Services (USDHHS) National Institutes of Health 2010 National Institute of Environmental Health Sciences National Toxicology Program 11th Report on Carcinogens httpntpniehsnihgovntproceleventhprofiless149pcbpdf

United States Environmental Protection Agency (USEPA) 2002 EPA Superfund record of decision Hudson River PCBs Site Hudson Falls to New York City New York Washington DC

Wakefield J 2003 Sensitivity Analyses for Ecological Regression Biometrics March 59 9-17

Wakefield J 2004

Wing S 1988 Social inequalities in the decline of coronary mortality American Journal of Public Health 781415-1416

World Health Organization 2010 Atlas of Heart Disease and Stroke Part 2 Ch 3 Cardiovascular Disease Risk Factors httpwwwwhointcardiovascular_diseasesencvd_atlas_03_risk_factorspdf

32

Table 1 Previous Study Results Statewide Analyses and Hudson River Specific Analyses

Principal Diagnosis of Hospitalization

Adjusted Rate Ratioa (Lower 95CI shyUpper 95 CI)

NYS analysis Hudson River Subset

analysis Ischemic Heart Diseaseb

115 (103-129) 136 (118-156) Acute Myocardial Infarction 120 (103-139) 139 (118-163)

Cerebrovascular Diseasec 115 (105-126) 120 (110-132)

Ischemic Stroke 117(104-139) --shy Hemorrhagic Stroke 110 (099-122) ---

Hypertensionc 119 (108-131) 114 (105-123)

Table adapted from Sergeev et al 2005 Shcherbatykh et al 2005 and Huang et al 2006 Ischemic heart disease and stroke hospitalization rates for NYS ZIP codes containing (or

adjacent to) inactive hazardous waste sites with persistent organic pollutants compared to ZIP codes without (or not adjacent to) such sites

a All analyses include a 2nd exposure category for other types of inactive hazardous waste sites and adjust for age race and sex

b Ischemic heart disease analyses include all ages over 25 Ischemic heart disease analyses adjust for quartiles of median household income and health insurance coverage

c Cerebrovascular disease and hypertension analyses include ages 25-64

33

Table 2 Comparison of Design amp Methods for Previously Published Studies and Current Follow-up Study

OUTCOME ASSESSMENT CONFOUNDER ASSESSMENT EXPOSURE ASSESSMENT Previous Study Follow-up Study Previous Study Follow-up Study Previous Study Follow-up Study

Coronary Heart Disease 1993-2000 More years 1990-2005 ZIP code level Block group level ZIP code level Block group level

Ischemic heart disease (ICD-9 410shy Same outcomes with 4 levels of income ndash Same income Exposure = residence Two types of exposure 414) and sequelae of myocardial one addition separate 2000 Census categories within 196 ZIP codes indicator categories within infarction not elsewhere classified analysis of angina (ICDshy quartiles with hazardous waste 12 counties abutting (ICD-9 4297) Separate analysis of 9 413) sites containing POPs Hudson River acute myocardial infarction 6 age groups Same age groups or 222 ZIP codes with (ICD-9 410) age 25 and above hazardous waste sites Adjacency 255 block

without POPs versus groups adjacent versus Principal or other diagnoses (up to 14) Only principal diagnoses 4 race groups Different race residence within 996 2116 not adjacent to the combined AsianPacific categories White ldquocleanrdquo ZIP codes Hudson River

Use the admission Islander White black AsianPacific statewide Reference Sergeev AV Carpenter source field to eliminate black Native Island and Other which Distance Block group DO Hospitalization rates for coronary intrahospital transfers for American includes Native Sub-analyses (focus of population-weighted heart disease in relation to residence the same individual American and Multiple follow-up) Hudson centroid distance from the near areas contaminated with Race River area 78 POP ZIP Hudson River lt12 mile persistent organic pollutants and other codes versus 996 ldquocleanrdquo (225 block groups) pollutants Environ Health Perspect Add education ZIP codes statewide between frac12 and 1 mile 2005 113(6)756-761 population density (216 block groups) farther

hispanic and distance to than 1 mile (1930 block nearest hospital as groups) additional covariates

Stroke 1993-2000 More years 1990-2005 Restricted to 2nd and (same as above) (same as above) (same as above) 3rd quartile income

Cerebrovascular disease excluding Same outcomes ZIP codes other ill-defined and late effects of cerebrovascular disease (ICD9 430shy 4 age groups 25-64 436) Separate analyses of ischemic stroke (433x1 434x1 436) and 2 race groups hemorrhagic stroke ( 430-432) White black

Principal or secondary diagnoses (up Only principal diagnoses to 14) combined

Use the admission Reference Shcherbatykh I Huang X source field to eliminate Lessner L Carpenter DO Hazardous intrahospital transfers for waste sites and stroke in New York the same individual State Environ Health 2005 418

34

Table 3 Study Cardiovascular Disease Outcomes ICD-9-CM Codes and Definitions

Ischemic Heart Disease

ICD-9-CM 410 - ldquoAcute myocardial infarction (AMI)rdquo is defined as a sudden insufficiency of blood supply to an area of the heart muscle usually due to a coronary artery occlusion (obstruction) commonly known as heart attack

ICD-9-CM 411- ldquoOther acute and subacute forms of ischemic heart diseaserdquo refers to complications or symptoms occurring before or after AMI or a coronary artery occlusion (obstruction) interrupting blood flow to the heart without AMI

ICD-9-CM 412 - ldquoOld myocardial infarctionrdquo refers to a past AMI that is currently presenting no symptoms ICD-9-CM 413 - ldquoAngina pectorisrdquo is defined as severe constricting pain in the chest often radiating from the

area immediately over the heart to the left shoulder and down the arm ICD-9-CM 414 - ldquoOther forms of chronic ischemic heart diseaserdquo includes coronary atherosclerosis a chronic

condition marked by thickening and loss of elasticity of the coronary artery caused by deposits of plaque

ICD-9-CM 4297 - ldquoCertain sequelae of myocardial infarction not elsewhere classifiedrdquo refers to other symptoms or complications associated with a prior AMI

Cerebrovascular Disease

ICD-9-CM 430 - ldquoSubarachnoid hemorrhagerdquo is bleeding in the space between the brain and lining ICD-9-CM 431 - ldquoIntracerebral hemorrhagerdquo is bleeding within the brain ICD-9-CM 432 - ldquoOther and unspecified intracranial hemorrhagerdquo is bleeding nontraumatic between the skull

and brain lining ICD-9-CM 433 - ldquoOcclusion and stenosis of precerebral arteriesrdquo is blockage or stricture of the arteries

branching into the brain ICD-9-CM 434 - ldquoOcclusion of cerebral arteriesrdquo includes cerebral thrombosis and cerebral embolism ICD-9-CM 435 - ldquoTransient cerebral ischemiardquo includes cerebrovascular insufficiency (acute) with transient

neurological signs and symptoms ICD-9-CM 436 - ldquoAcute but ill-defined cerebrovascular diseaserdquo

35

Table 4 ndash Census Demographics for Study Area by Proximity to the Hudson River 2000

Census Demographics Study Area Within Block Groups Adjacent to River Block Group Centroid Distance to River Yes No lt frac12 mile frac12-1 mile gt 1 mile

n n n n n n Total Population and distribution within river distance categories

2924631 1000 315662 108 2608969 892 231414 79 283321 97 2409896 824

RaceEthnic Distribution 1000 1000 1000 1000 1000 1000 White 2383274 815 255023 808 2128251 816 154371 667 201457 711 2027446 841 Black 277302 95 34039 108 243263 93 43895 190 38959 138 194448 81 Total ldquoOtherrdquo (listed below)

American Indian 7147 02 882 03 6265 02 975 04 1058 04 5114 02

Asian 86465 30 6233 20 80232 31 4711 20 8979 32 72775 30 Pacific Islander 1058 lt01 111 lt01 947 lt01 104 lt01 145 lt01 809 lt01 Other 105182 36 12492 40 92690 36 19288 83 23338 82 62556 26 Multiple Races 64203 22 6882 22 57321 22 8070 35 9385 33 46748 19

Hispanic 268036 92 28449 90 239587 92 40691 176 46978 166 180367 75

Education lt9th grade 108312 56 13370 64 94942 55 13585 91 14518 81 80209 50 9th-12th grade 200374 103 27023 130 173351 100 23306 155 25161 140 151907 94 High school or GED 519236 267 62727 302 456509 263 44366 296 51585 287 423285 263 Some college 336283 173 36464 175 299819 173 24995 167 31141 173 280147 174 Associates degree 150134 77 16574 80 133560 77 10134 68 12939 72 127061 79 Bachelors degree 342297 176 28578 137 313719 181 18898 126 24584 137 298815 185 Graduate or professional degree

285823 147 23199 112 262624 151 14813 99 19909 111 251101 156

+Other SES Factors Households below poverty 91940 87 12875 108 79065 83 13977 154 14685 140 63278 72

mean SD mean SD mean SD mean SD mean SD mean SD Population density 57466 9589 54496 11391 57826 93468 13501 16071 11068 16248 43763 66909 Median household income 61010 31989 47320 23630 62668 32468 40748 21923 44962 20829 64847 32564

Sources US Bureau of the Census 2000 Census of population and housing summary file 1(SF1) US Department of Commerce 2001 US Bureau of the Census 2000 Census of population and housing summary file 3 (SF3) US Department of Commerce 2001

36

Table 5 Study Population Person Years by River Proximity Category For Income Quartiles and Race Categories

Person Years Total Adjacent Non-Adjacent lt12 mile frac12 to 1 mile gt1 mile n (row)

[column ] n (row)

[column ] n ()

[column ] n ()

[column ] n ()

[column ] n ()

[column ] Total Population 30219001 (1000)

[1000]

3269547 (108)

[1000]

26949454 (892)

[1000]

2380477 (079)

[1000]

2834956 (094)

[1000]

25003568 (827)

[1000]

Income Quartile

Lowest Income 7182853 (1000)

[237]

1345251 (187)

[411]

5837601 (813)

[217]

1462710 (204)

[614]

1236936 (172)

[436]

4483206 (624)

[179]

2nd Income 7778921 (1000)

[257]

1077888 (139)

[330]

6701033 (861

[249]

420361 (054)

[177]

808782 (104)

[285]

6549777 (842)

[262]

3rd Income 7671943 (1000)

[254]

551617 (072)

[169]

7120326 (928)

[264]

344840 (045)

[145]

511237 (067)

[180]

6815866 (888)

[273]

Highest Income 7585283 (1000)

[251]

294790 (039)

[090]

7290493 (961)

[271]

152565 (020)

[064]

278000 (037)

[098]

7154718 (943)

[286]

Race Category

White 25859811 (1000)

[856]

2784914 (108)

[851]

23074897 (892)

[856]

1761910 (68)

[740]

2238348 (87)

[790]

21859554 (845)

[874]

Black 2466185 (1000

[82]

297220 (121)

[91]

2168965 (879)

[81]

389721 (158)

[164]

316704 (128)

[112]

1759760 (714)

[70]

Asian 792737 (1000)

[26]

58007 (73)

[18]

734731 (827)

[27]

45151 (57)

[19]

78161 (99)

[28]

669426 (844)

[27]

Multiple Other 1100269 (1000)

[36]

129407 (118)

[40]

970862 (882)

[36]

183696 (167)

[77]

201743 (183)

[71]

714830 (650)

[29]

37

Table 6 Study Hospitalization Record Exclusions

After After dropping After imputing dropping records with

After excluding race amp poorly missing continuation After excluding eliminating geocoded covariates

Geocoded records transfers missing race records (racesex)

Total 202272 201814 02 169143 162 167733 08 154236 80 154220 001 (Loss ) Demograp hic group

White 155216 849 154805 849 132749 859 144262 860 132343 858 132332 858 Black 12878 70 12849 70 11109 72 12010 72 11328 73 11326 73 OtherNa 12745 70 12741 70 8814 57 9313 55 8647 56 8644 56 AsianPI 2021 11 2020 11 1810 12 2148 13 1918 12 1918 12 Missing Race 19412 96 19399 96 14661 87 (1410) Diagnosis

IHD 137569 680 137303 680 111005 656 110005 656 110000 656 101012 655

AMI 51840 256 51635 256 39338 233 39338 233 39013 233 35811 232

Angina Stroke

5792 64703

29 320

5788 64511

29 320

5316 58138

31 344

5316 58138

31 344

5285 57712

31 344

4731 53208

31 345

Isch str 23673 117 23543 117 21012 124 21012 124 20912 125 19371 126 Hem str 8097 40 8074 40 6544 39 6444 39 6492 39 5995 39

Missing covariates 11 records were missing sex 325 were missing age and 14661 records were missing race Race was imputed by random assignment based on block group of residencersquos race distribution for 13251 records (310 were also missing other covariates For approximately 1100 records with missing race geocoding was inadequate for assigning block group and 310 of those missing race were also missing additional covariates) One discharge can have one or more covariates missing (ie missing covariates are not mutually exclusive

38

Table 7 Income Quartile and Race Distributions for CVD Hospitalizations

Study Population

IHD n=101012

AMI n=35811

Angina n=4731

Stroke n=53208

Ischemic Stroke n=19371

Hemorrhagic Stroke n=5995

Income (Quartiles) Lowest Income 237 286 300 356 298 309 301 Second Income 257 254 265 237 256 256 252 Third Income 254 241 236 224 236 234 223 Highest Income 251 221 201 182 209 201 224

Total 1000 999 999 1000 1000

White 855 859 885 833 858 844 809 Black 82 64 62 102 92 108 110 Asian 26 12 12 11 12 14 19 Other 36 65 41 53 38 34 62

Study population income quartiles are based on census block median income Quartile cut-points were developed by weighting block group median income levels by block group population to identify the cut-points within the continuous income distribution After creating the cut-points the study population of each block group was assigned to the appropriate quartile category This resulted in the percentage of population in each quartile varying slightly from 25 because the entire populations of block groups were assigned to the same quartile

39

Table 8a Exposure Area Distributions and Rate Ratios for Ischemic Heart Disease Hospitalizations

Exposure Person-Years

(N=30219001) in

Ischemic Heart Disease Hospitalizations Total AMI Angina

No () No () RR (95 CI) No () RR (95 CI) No () RR (95 CI)

Adjusted for Age Sex Race Adjacent Block Group Yes 3269547 (108) 12327 (122) 133 (112-157) 4398 (123) 136 (114-163) 591 (125) 123 (106-142) No 26949454 (892) 88611 (878) 100 Reference 31380 (877) 100 Reference 4137 (875) 100 Reference Distance From River lt frac12 mile 2380477 (079) 9817 (97) 120 (103-140) 3399 (095) 126 (109-145) 542 (115) 160 (137-188) frac12 - 1 mile 2834956 (094) 11236 (111) 120 (104-140) 4048 (113) 125 (109-144) 610 (129) 156 (133-183) gt1 mile 25003569 (827) 79885 (791) 100 Reference 28331(792) 100 Reference 3576 (756) 100 Reference

Addrsquol adjustment for SES and other factors Adjacent Block Group Yes 3269547 (108) 12327 (122) 107 (104-111) 4398 (123) 106 (101-110) 591 (125) 106 (096-117) No 26949454 (892) 88611 (878) 100 Reference 31380 (877) 100 Reference 4137 (875) 100 Reference Distance From River lt frac12 mile 2380477 (079) 9817 (97) 102 (098-106) 3399 (095) 103 (098-109) 542 (115) 109 (098-121) frac12 - 1 mile 2834956 (094) 11236 (111) 103 (099-108) 4048 (113) 106 (101-111) 610 (129) 113 (102-125) gt 1 mile 25003569 (827) 79885 (791) 100 Reference 28331(792) 100 Reference 3576 (756) 100 Reference

Agegt25 1990-2005 50 sample Adjusted for age (25-34 35-44 45-54 55-64 65-74 75+ years) race (white black other) and sex (male female) Adjusted for SES factors quartiles of population density (61-5066 5066lt-21020 21020lt-64102 64102lt-943716 persons per square mile) median 1999 household income ($249900-4096800 $4096800lt-5465000 $5465000lt-7294400 $7294400lt-20000100) with less than a college education (0-306 306lt-427 427lt-555 555lt-1000) with Hispanic ethnicity (0-21 21lt-47 47lt- 96 96lt-860) and distance to nearest hospital

40

Table 8b Exposure Area Distributions and Rate Ratios for Stroke Hospitalizations Age gt= 25 1990-2005

Exposure Person-Years

(N=30219001) in Study Area

Stroke Hospitalizations

Total Ischemic Hemorrhagic

No () No () RR (95 CI) No () RR (95 CI) No () RR (95 CI)

Adjusted for Age Sex Race Adjacent Block Group Yes 3269547 (108) 6276 (118) 125 (109-144) 2334 (121) 128 (110-148) 635 (108) 099 (086-113)

No 26949454 (892) 46748 (882) 100 Reference 16989 (879) 100 Reference 5263 (892) 100 Reference

Distance From River lt frac12 mile 2380477 (079) 5280 (099) 116 (104-131) 1925 (100) 120 (105-137) 518 (088) 102 (088-117)

frac12 - 1 mile 2834956 (094) 6023 (114) 118 (106-132) 2227 (115) 127 (112-145) 658 (111) 115 (100-131)

gt1 mile 25003569 (827) 41721 (787) 100 Reference 15171 (785) 100 Reference 4722 (801) 100 Reference

Addl Adjustment for SES and Other Factors Adjacent Block Group Yes 3269547 (108) 6276 (118) 107 (103-112) 2334 (121) 110 (104-116) 635 (108) 097 (088 ndash 106)

No 26949454 (892) 46748 (882) 100 Reference 16989 (879) 100 Reference 5263 (892) 100 Reference Distance From River lt frac12 mile 2380477 (079) 5280 (099) 106 (101-110) 1925 (100) 108 (101-115) 518 (088) 089 (081-099)

frac12 - 1 mile 2834956 (094) 6023 (114) 106 (102-111) 2227 (115) 111 (104-117) 658 (111) 108 (099-119)

gt 1 mile 25003569 (827) 41721 (787) 100 Reference 15171 (785) 100 Reference 4722 (801) 100 Reference

Age gt 25 years 1990-200650 sample Adjusted for age (25-34 35-44 45-54 55-64 65-74 75+ years) race (white black other) and sex (male female) Adjusted for SES factors quartiles of population density (61-5066 5066lt-21020 21020lt-64102 64102lt-943716 persons per square mile) median 1999 household income ($249900-4096800 $4096800lt-5465000 $5465000lt-7294400 $7294400lt-20000100) with less than a college education (0-306 306lt-427 427lt-555 555lt-1000) with Hispanic ethnicity (0-21 21lt-47 47lt- 96 96lt-860) and distance to nearest hospital

41

Table 9 Rate Ratios for CVD Hospitalizations Stratified by Income Quartile

Rate Ratio for lt frac12 mile versus gt 1 mile from river

IHD AMI Angina Stroke Ischemic Stroke Hemorrhagic Stroke

RR (95CI) RR (95CI) RR (95CI) RR (95CI) RR (95CI) RR (95CI)

Adjusted for Age Sex Race Lowest Income 1 123 (108-140) 124 (110-139) 139 (120-160) 120 (109-133) 122 (107-138) 118 (103-137) Income 2 115 (101-131) 118 (101-139) 155 (124-195) 124 (113-135) 127 (109-147) --shyIncome 3 088 (075-103) 083 (068-102) 102 (076-137) 092 (079-107) 093 (076-113) --shyHighest Income 4 080 (064-099) 074 (054-103) 030 (013-070) 078 (065-093) 071 (053-095) 043 (024-076) Weighted average 100 098 091 102 100 071

Addrsquol Adjustment for other demographic factors Lowest Income 1 114 (107-120) 114 (106-123) 112 (097-130) 116 (109-124) 120 (110-131) 109 (095-125) Income 2 110 (102-118) 111 (101-123) 136 (108-172) 110 (101-119) 112 (099-128) 095 (076-119) Income 3 085 (077-094) 086 (075-097) 093 (068-126) 097 (087-108) 096 (082-112) 046 (033-066) Highest Income 4 068 (060-078) 069 (057-085) 028 (012-062) 069 (059-081) 058 (044-077) --shyWeighted average 092 093 079 096 093 078

To reduce the complexity of this Table only one of the distance categories is provided The other distance categories showed similar results Adjusted for age (25-34 35-44 45-54 55-64 65-74 75+ years) race (white black other) and sex (male female) Block group median income quartile ranges $249900-4096800 $4096800lt-5465000 $5465000lt-7294400 $7294400lt-20000100 Adjusted for demographic factors quartiles of population density (61-5066 5066lt-21020 21020lt-64102 64102lt-943716 persons per square mile) with less than a college education (0-306 306lt-427 427lt-555 555lt-1000) with Hispanic ethnicity (0-21 21lt-47 47lt- 96 96lt-860) and distance to nearest hospital

42

Table 10 Hospitalization Rate Estimates for Current Study and from Published Sources (age-adjusted to 2000 US standard per 10000)

Current Study 12 Counties abutting Hudson River 1990-2005

Total US CDC 2005

Northeastern US CDC 2005

SPARCS NYS excluding NYC 2005-2007

IHD 649 619 691

AMI 232 231 278

Angina 31 Stroke 349 303 307 351

Ischemic Stroke 127 39 Hemorrhagic Stroke

Source USDHHS 2005 National Hospital Discharge Survey No 385 July 12 2007 Source NYS DOH SPARCS 2009b Comparable estimates were not able to be provided because published outcome groupings differed

43

Figure 1 ZIP codes by Adjacency to the Hudson River

44

Figure 2 Census Block Groups by Adjacency to the Hudson River

45

Figure 3 Census Block Groups by Population-Weighted Centroid Distance to the Hudson River

46

IHD Hospitalization Rates by Sex Race Income (age-adjusted) (other race

group excluded)

000

200

400

600

800

1000

1200

1400

1600

Low est IncQ

Q2 Q3 Highest Inc Q

Income Quartile

Hos

pita

lizat

ions

per

10

00

w hite men black men

asian men w hite w omen

black w omen asian w omen

AMI Hospitalization Rates by Sex Race Income (age-adjusted) (other race

group excluded)

000

100

200

300

400

500

600

700

Low est IncQ

Q2 Q3 Highest Inc Q Income Quartile

Hos

pita

lizat

ions

per

10

00

w hite men black men

asian men w hite w omen

black w omen asian w omen

Stroke Hospitalization Rates by Sex Race Income (age-adjusted) (other

race group excluded)

000

100

200

300

400

500

600

700

800

900

Low est In Q

Q2 Q3 Highest Inc Q Income quartile

Hos

pita

lizat

ions

per

10

00w hite men black men

asian men w hite w omen

black w omen asian w omen

Figure 4 Figure 5 Figure 6

47

MAURICE D HINCHEY 22NO DISTRICT NEW YORK

COMMITTEE ON APPROPRIATIONS

~lIC()foltIIInas

AGRICUJTlAE RURAL OEElOPWNT rooo AND ORUG ADMINISTRATION

AND RBATEO AGEfjC~S

INTERIOR

~onlJltss of tl)t ~nttn ~tatt5 ~oU5t of ItpttUtlltl1tibeg lHaJington 111laquo 20515-3222

October 4 2006

Howard Frumkin MD DrPH Director Agency for Toxic Substances and

Disease Registty and CDCs National Center for Environmental Health

1600 Clifton Road NE MSE-28 Atlanta GA 30333

Dear Dr Frumkin

First I would like to congratulate you on your appoinbnent as Director of A TSDR and CDCs National Center for Environmental Health llook forward to the opportunity to work with you on important public health issues this year and in the future

My second reason for writing today is to request that ATSDR assist my office in conducting a health consultation study for residents along a 200 mile segment of the Hudson River a Superfund site on the National Priority List containing PCBs in order to measure whether further evaluation may be warranted

As you may know Dr David Carpenter has done a number of studies at SUNY Albany in New York that suggest that resideilcy in proximity to hazardous waste sites with Persistant Organic Pollutants (POPs) may be associated with increased hospitalization rates for Coronary Heart Disease (CHD) when compared to hospitalization rates tor areas with no waste sites or other types of waste not catagorized as POPs I have enclosed a copy of three of these research papers that demonstrate this correlation for heart disease hypertension and stroke

I am very concerned about the health of the residents living along the Hudson River and respectfully request that your agency look into what seems to be a very serious health threat For purposes of my request T would appreciate your agency looking into the subset population Dr Carpenter examined in 78 zip codes along the 200 mile stretch of the Hudson River from Hudson Falls to New York City This subset population studied by Dr Carpenter showed a statistically significant increase in hospital discharges for heart disease hypertension and stroke Interestingly according to CDCs data base the Behavioral Risk Factor Surveillance System this population on average has higher incomes exercises more frequently and conswnes more fruits and vegetables than the average New Yorker Additionally this population has more nonsmokers than the rest of Upstate New Yorks population

WASHINGTON OIFCE

2411 RAYBURN HOUSE OfFICE BUILDING WASH1NcrroN DC 20amp1amp-3222

(Zl2) 225-6335

_fOMetoYht lnchIY

BlNGI-UMTON OfRCE m-wA OfFICE KIGSTCN OFFIepound MIDDLnWJN OffICE MOJmCEllO OffiCE IDDA IfOCML lUllDlNG 123 SOUTH CAYJOA smra U l WAll

m STREET em HALL THlflD flOOR 18 ANWANf LAKf CAD

RIIrlGHAtATON NY 13901 SUITE 201 KINGSTOfl 12401 te JUpoundS STilEET MONTlCfllO NY 12701 (6171773-2768 ITHACA HY 14860 (M5)3~ MlDgtlETOWN NY 109(0 (amptS) 7S1-71e

(607) 273-~388 (845) l4C-3l11

Appendix A Representative Hinchey Letter

Dr Howard Frumkin Page 2 October 4 2006

Since this type of population would typically exhibit better than average cardiac health Im sme you can understand and appreciate my concern and sense of urgency that we must follow up on Dr Carpenters good work and detcnnine what further study is warranted to protect the health of the populatioD living and working aloDg the Hudson River

I than1c you for yom kind attention to my request I look forward to hearing back from you at your earliest convenience

Best regards

Sincerely

~ane~ey Member of Congress

MHlag

49

Appendix B Table 1a IHD Hospitalization Rate Ratios agesexrace adjusted versus additionally adjusted For residents (age 25 and older) of block groups closer versus more distant from the Hudson River in models adjusting for sex age race and block group median household income population density education level percent Hispanic and block group centroid distance to nearest hospital 1990-2005

Adjusted Rate Ratios by Block Group River Proximity (95 Confidence Intervals) IHD (N=101012) AMI (N=35811) Angina (N=4731)

Risk Factor Agesexrace adjusted

Addrsquol adjustment Agesexrace adjusted

Addrsquol adjustment Agesexrace adjusted

Addrsquol adjustment

Block group distance from River

lt frac12 mile vs gt 1 mile 120 (103-140) 102 (098-106) 126 (109-145) 103 (098-109) 160 (137-188) 109 (098-121) frac12 - 1 mile vs gt 1 mile 120 (104-140) 103 (099-107) 125 (109-144) 106 (101-111) 156 (133-183) 113 (102-125)

Sex Male vs female 197 (174-223) 214 (208-220) 211 (187-238) 209 (202-215) 113 (099-129) 113 (106-121) Race Black vs white 120 (102-142) 093 (089-097) 109 (094-127) 092 (087-097) 178 (152-208) 121 (108-136)

AsianPacific Is ldquo 080 (066-096) 073 (069-079) 085 (070-103) 072 (065-081) 075 (055-102) 065 (049-087) Other ldquo 246 (208-291) 377 (363-392) 189 (162-220) 204 (191-218) 257 (215-307) 205 (177-238)

Age 35-44 vs 25-34 809 (630-104) 824 (743-913) 647 (494-849) 658 (568-762) 491 (348-692) 498 (376-660) 45-54 ldquo 333 (261-426) 344 (312-380) 228 (175-296) 230 (200-265) 1539 (1108-214) 157 (120-206) 55-64 ldquo 827 (647-1057) 802 (727-885) 576 (443-748) 516 (449-593) 278 (200-386) 284 (217-371) 65-74 ldquo 1477 (1155-1890) 1384 (1255-1527) 1067 (820-1387) 899 (782-1033) 409 (294-567) 449 (344-586) 75+ ldquo 2124 (1656-2724) 1942 (1760-2142) 2190 (1683-2849) 1849 (1610-2123) 661 (476-919) 695 (533-905)

Population density

2nd lowest vs least dense

126 (121-132) 131 (124-137) 142 (127-159)

more dense ldquo 130 (124-136) 134 (127-141) 131 (116-147) most dense ldquo 120 (114-126) 121 (114-129) 115 (100-131)

Median Household Income

2nd highest vs highest income

105 (101-110) 113 (108-119) 123 (110-137)

lower ldquo 109 (104-113) 123 (117-130) 125 (111-141) lowest ldquo 116 (111-122) 129 (122-137) 162 (142-185)

with less than college

2nd lowest vs least college

114 (109-118) 118 (113-124) 110 (098-122)

higher ldquo 122 (117-127) 126 (119-132) 115(102-129) highest

ldquo

122 (116-128) 128 (120-135) 129 (113-148)

Hispanic 2nd lowest vs least Hispanic

093 (089-097) 094 (0890-098) 101 (091-112)

higher ldquo 092 (088-096) 096 (092-101) 118 (107-131) highest

ldquo

085 (081-088) 088 (084-093) 138 (123-154)

Distance to Hospital

2nd closest vs closest 097 (094-100) 094 (090-098) 107 (098-118) more distant ldquo 090 (087-094) 093 (089-098) 096 (086-107) furthest ldquo 087 (083-091) 091 (087-096) 089 (079-101)

50

Appendix B Table 1b Stroke Hospitalization Rate Ratios agesexrace adjusted versus additionally adjusted For residents (age 25 and older) of block groups closer versus more distant from the Hudson River in models adjusting for sex age race and block group median household income population density education level percent Hispanic and block group centroid distance to nearest hospital 1990-2005

Adjusted Rate Ratios by Block Group River Proximity (95 Confidence Intervals) Total Stroke (N=53208) Ischemic Stroke (N=19371) Hemorrhagic Stroke (N=5995)

Risk Factor Agesexrace adjusted

Addrsquol adjustment Agesexrace adjusted

Addrsquol adjustment Agesexrace adjusted

Addrsquol adjustment

Block group distance from River

lt frac12 mile vs gt 1 mile 116 (104-131) 106 (101-110) 120 (105-137) 108 (101-115) 102 (088-117) 089 (081-099) frac12 - 1 mile vs gt 1 mile 118 (106-132) 106 (102-111) 127 (112-145) 111 (104-117) 115 (100-131) 108 (099-119)

Sex Male vs female 114 (104-125) 127 (123-130) 126 (113-141) 131 (126-136) 114 (103-127) 123 (116-130) Race Black vs white 194 (173-219) 153 (147-160) 233 (204-267) 191 (180-203) 215 (188-246) 190 (173-209)

AsianPacific Isl ldquo 097 (083-113) 088 (080-096) 114 (094-138) 101 (088-116) 142 (114-177) 129 (106-157) Other ldquo 230 (203-260) 232 (219-245) 229 (197-265) 212 (193-232) 360 (310-417) 334 (296-377)

Age 35-44 vs 25-34 309 (251-379) 303 (269-343) 359 (267-483) 362 (289-455) 274 (212-353) 268 (219-329) 45-54 ldquo 114 (938-139) 1087 (970-1217) 149 (113-197) 135 (109-167) 655 (513-836) 605 (500-734) 55-64 ldquo 331 (273-402) 329 (295-368) 433 (329-569) 418 (340-514) 119 (933-152) 109 (904-132) 65-74 ldquo 826 (682-1001 861 (772-959) 1145 (872-1504) 1114 (908-1367) 214 (168-272) 224 (186-270) 75+ ldquo 1890 (1559-2290) 2037 (1828-2269) 2768 (2112-3627) 3060 (2497-3750) 554 (438-700) 587 (490-703)

Population density

2nd lowest vs least dense

126 (120-132) 124 (117-132) 129 (117-142)

more dense ldquo 126 (120-132) 121 (113-129) 142 (128-157) most dense ldquo 124 (118-132) 128 (119-138) 150 (133-169)

Median Household Income

2nd highest vs highest income

106 (101-110) 110 (103-117) 099 (090-108)

lower ldquo 113 (108-118) 113 (106-121) 107 (097-118) lowest ldquo 116 (110-122) 117 (108-126) 114 (102-127)

with less than college

2nd lowest vs least college

111 (107-116) 112 (106-119) 098 (090-107)

higher ldquo 120 (114-125) 120 (113-128) 102 (093-112) highest ldquo 116 (110-122) 119 (110-128) 093 (090-108)

Hispanic 2nd lowest vs least Hispanic

096 (092-100) 094 (088-099) 084 (077-091)

higher ldquo 101 (096-105) 096 (091-102) 084 (077-091) highest ldquo 096 (092-100) 088 (082-093) 078 (070-085)

Distance to Hospital

2nd closest vs closest 095 (091-098) 092 (087-097) 097 (089-105) more distant ldquo 092 (088-096) 091 (086-097) 089 (081-097) furthest ldquo 085 (081-090) 082 (077-088) 088 (079-098)

51

Appendix B Table 2 Sex and Race-Specific Age-Adjusted IHD AMI and Stroke Rates by Income (per 1000)

Men Women white black asian other white black asian other

IHD Lowest IncQ 470 401 492 940 283 354 477 543 Q2 425 339 675 1443 221 236 364 690 Q3 436 301 420 2463 199 233 236 902 Highest Inc Q 399 325 218 4307 160 172 120 1247 AMI Lowest IncQ 180 154 203 269 101 112 221 159 Q2 159 141 300 306 085 088 138 179 Q3 152 114 139 480 078 084 096 214 Highest Inc Q 135 130 046 820 062 065 052 275 Stroke Lowest IncQ 201 292 330 370 166 281 343 261 Q2 182 254 389 466 150 247 274 373 Q3 192 238 231 598 145 245 201 401 Highest Inc Q 177 275 102 905 132 197 070 776

These rates are estimated from the 50 sample of hospitalizations in the 12-county study area included in this investigation The rates are adjusted to the 2000 standard US population

52

CERTIFICATION

The health consultation for the Hudson River PCBs site Residential Proximity to the Hudson River and Hospitalization Rates for Ischemic Heart Disease and Stroke 1990shy2005 was prepared by the New York State Department of Health under a cooperative agreement with the Agency for Toxic Substances and Disease Registry (ATSDR) It is in accordance with approved methodology and procedures existing at the time the health consultation was initiated Editorial review was completed by the cooperative agreement partner

The Division of Health Assessment and Consultation (DHAC) ATSDR has reviewed this public health assessment and concurs with its findings

ATSDR

Page 8: HR CVD Final HCl 2011

(Tables 8a amp 8b) are the result of the inability to completely control for socio-economic statusrsquo effects on CVD hospitalization risk The strong effect of socio-economic status on residential location (exposure proxy) and on CVD hospitalization risk makes it likely that these overall multivariable regression results are affected by uncontrolled ie residual confounding by socioshyeconomic status

Because of the similarities in study design and statistical analysis the prior studyrsquos findings can be compared to some degree with the findings of the multivariate regression analyses in the current study The major difference between the studies is the composition of the comparison areas for the analyses Because of this difference as well as other differences described in the report the studies are not entirely comparable

CVD hospitalization rate ratios estimated for living in block groups adjacent closest or close to the river in the current study are substantially lower than the rate ratios for residence in ZIP codes adjacent to the Hudson River in the previously published studies In most cases the current analyses show elevations below 10 in contrast to elevations of 20 to 39 in the prior studies The results of the current study are quite similar to the prior study results in terms of the associations reported for age sex race and income These similarities and the general methodological similarities between the current and prior studies suggest the likelihood that the prior studiesrsquo findings also were affected by confounding by socio-economic status

Conclusions and Recommendations While this type of study can neither prove nor disprove causal links among risk factors and health effects it is useful for examining evidence for unusual patterns that might warrant additional investigation This evaluation of hospitalization data residential location and socio-economic factors in block groups in the 12 Counties abutting the Hudson River revealed a closely coinciding pattern of lower socio-economic status and residential proximity to the Hudson River the proxy for potential exposure and a strong association of lower socio-economic status and risk for CVD hospitalizations The strong associations among residential location socio-economic status and CVD risk prevent us from drawing definitive conclusions from the multivariable regression analyses that showed modest but statistically significant elevations of CVD hospitalization risk for residents of block groups near the river

To draw more definitive conclusions particularly for outcomes such as CVD that are determined by multiple known risk factors studies seeking to investigate the role of PCB or POP exposures will require resource-intensive methods that include gathering individual-level exposure or biomonitoring information as well as medical histories and information on other risk factors

This current investigation did not include environmental or biological sampling data One study did gather such data in the Hudson River region with the highest potential exposures (Hudson Falls Fort Edward) This study detected a statistically significant reduction in the total PCB concentrations in outdoor air samples at a distance of 1200 meters (34 mile) from the river However the PCB levels in outdoor air in this area were lower than those in other communities with known PCB-contaminated sites and similar to levels reported in other locations in the northeastern United States The study found no detectable differences in study subjectsrsquo serum

iv

PCB levels associated with proximity to contaminated portions of the river or other PCB sources (Fitzgerald 2007 Palmer 2008)

It is recommended that the results of the current investigation of CVD hospitalization rates and residence near the Hudson River be shared with NYS residents and other stakeholders with interest in this issue and be submitted for publication in a peer-reviewed journal

v

1 INTRODUCTION

The New York State Department of Health (NYS DOH) has a cooperative agreement with the Agency for Toxic Substances and Disease Registry (ATSDR) to perform environmental exposure and health assessments conduct health statistics reviews and perform epidemiological studies of populations in New York State which may have been exposed to environmental contaminants In October 2006 Member of Congress Maurice Hinchey requested that ATSDR conduct a health consultation to assess cardiovascular disease among people residing near the Hudson River A 200-mile segment of the Hudson River from Hudson Falls south to New York City is a National Priority List (NPL) (Superfund) site due to contamination of the river with polychlorinated biphenyls (PCBs) from GE sites in Hudson Falls and Ft Edward (See Appendix A for Member of Congress Hincheyrsquos letter)

In support of his request Member of Congress Hinchey cited three studies (Sergeev and Carpenter 2005 [ischemic heart disease] Shcherbatykh et al 2005 [stroke] Huang et al 2006 [hypertension]) These statewide studies suggest residents of ZIP codes containing hazardous waste sites with persistent organic pollutants (POPs) such as PCBs have elevated risk for hospitalizations for cardiovascular disease including ischemic heart disease stroke and hypertension Analyses conducted separately for ZIP codes next to the Hudson River suggest residents of these specific ZIP codes also have increased rates of hospitalization for cardiovascular disease

These studies published in 2005-2006 are ecological studies In an ecological study data are analyzed by evaluating risk factors and health outcomes among groups For example rates of disease are compared for populations of ZIP codes with varying median income levels to see if income levels appear to be related to disease outcome rates In these previously conducted studies rates of hospitalization for residents of ZIP codes with hazardous waste sites containing POPs were compared with hospitalization rates for residents of ZIP codes without such waste sites to look for evidence that potential exposures might be impacting health outcomes The studies report statistically significant elevations of cardiovascular disease hospitalizations in ZIP codes with POPs For coronary heart disease and a subset acute myocardial infarction the study showed elevations of approximately 15 and 20 respectively (Sergeev and Carpenter 2005) for stroke a 15 elevation (Shcherbatykh et al 2005) and for hypertension a 19 elevation (Huang et al 2006) in ZIP codes containing waste sites with POPs compared with hospitalization rates in other New York State ZIP codes (excluding NYC) that did not contain waste sites with POPs (Table 1)

In sub-analyses in these studies cardiovascular hospitalization rates for ZIP codes adjacent to the Hudson River were compared with cardiovascular hospitalization rates for ZIP codes in the rest of the state (excluding NYC) identified as not having inactive hazardous waste sites with POPs These sub-analyses showed statistically significantly elevated hospitalization rates of 36 and 39 for coronary heart disease and the subset acute myocardial infarction a 20 elevation for stroke and a 14 elevation for hypertension (Table 1) The Sergeev and Carpenter Shcherbatykh et al and Huang et al articles state that the Hudson River ZIP codes are in an area where people have higher income smoke less exercise more and have healthier diets This information is cited in these articles as evidence that known risk factors related to lower socioshy

1

economic status are likely not the cause for the elevated hospitalization rates in these Hudson River area ZIP codes

In these previously published reports Behavioral Risk Factor Surveillance System (BRFSS) data for entire counties are provided to support statements about the population of Hudson River ZIP codes having higher income and healthier lifestyles However because the ZIP codes adjacent to the Hudson River include only a proportion of the population of the counties abutting the river these county-level BRFSS data may be misleading regarding characteristics of the ZIP codes adjacent to the river (Figure 1) In fact data from the US Census show that the ZIP codes adjacent to the river have lower median income than the counties within which they are located The median income for the total population of the twelve counties adjacent to the Hudson River was $61010 in 2000 while the median income for the ZIP codes adjacent to the river was $45356 This median income for the population of the ZIP codes adjacent to the river is also slightly lower than the statewide (excluding NYC) median income of $47735 (US Census 2000) Thus the Hudson River ZIP codes analyzed in the Sergeev and Carpenter Shcherbatykh et al and Huang et al studies include a population of lower not higher socio-economic status compared to the rest of NYS (excluding NYC)

Because CVD and many CVD risk factors are associated with lower socio-economic status it is especially important to adjust as much as possible for socio-economic status when conducting comparative analyses of CVD hospitalizations or disease incidence or prevalence Regarding the hypothesis about POPs and CVD research studies described in more detail below have suggested possible biological mechanisms for POP exposuresrsquo effects on the heart There is very little research on human exposures to POPs and potential cardiovascular effects however

In response to Congressman Hincheyrsquos request for follow-up of the 2004-2005 findings ATSDR and NYS DOH agreed to conduct a similar study also ecological but at a different level of geography the Census block group By using a level of geography that is generally smaller in area and population than a ZIP code this follow-up seeks to more accurately adjust for socioshyeconomic and other factors that affect cardiovascular hospitalizations and to improve upon the geographic classification of relative distance to the Hudson River In addition rather than comparing hospitalizations among residents of Hudson River area ZIP codes to hospitalizations among residents of all other NYS (exclusive of NYC) ZIP codes with no waste sites with POPs this follow-up investigationrsquos analyses are restricted to the twelve counties abutting the Hudson river from Washington County south to Westchester County Within the study arearsquos twelve counties cardiovascular hospitalization rates in areas closest to the Hudson River are compared with rates in areas farther from the river This more localized approach may also assist with reducing the potential impact of regional differences in medical care practice or reporting factors that could affect the hospitalization data (Ko et al 2007) (Table 2 shows a more detailed listing of the methods used in the prior studies and follow-up study)

Regarding environmental exposures it is important to note that living near a hazardous waste site that contains PCBs or other POPs does not necessarily mean that human exposure from the site has occurred Depending upon the conditions at a site or the specific chemicals exposures may be highly improbable Information on body burden for example PCB levels in blood or on levels of the chemical in off-site air dust or soil is needed to support a claim of exposure The

2

studies published in 2005-2006 did not provide specific information such as environmental or biomonitoring data in support of the assumption that residence in ZIP codes containing hazardous waste sites with POPs or residence in ZIP codes adjacent to the Hudson River resulted in increased exposures to POPs in the population For the purpose of this follow-up we did not conduct environmental or human biological sampling to address this issue Rather this follow-up replicates the prior studiesrsquo approach with some enhancements to better control for confounding particularly confounding by socio-economic status

This descriptive epidemiologic investigation uses already existing hospitalization data from the Statewide Planning and Research Cooperative System and population data from the United States Census to evaluate comparative rates of cardiovascular hospitalizations While this type of investigation can not show proof of cause and effect it can assist in interpreting relationships among indicators of CVD burden and hypothesized exposures associated with distance from the Hudson River This descriptive investigation may also assist in determining whether additional health investigations of these associations are warranted

2 BACKGROUND

Cardiovascular Disease

The widely known risk factors for cardiovascular disease (CVD) in addition to age sex race and family history include tobacco use high blood pressure high serum cholesterol alcohol use obesity low fruit and vegetable intake physical inactivity diabetes low socio-economic status mental ill-health psychosocial stress and the use of certain medications (WHO 2010 NYS DOH 2009 Wing 1988 NYS DOH 2009a)

CVD Definitions CVD is a general term for any disease of the circulatory system The two main types of CVD are ischemic heart disease also termed coronary heart disease and cerebrovascular disease also called stroke Ischemic heart disease (IHD) results from restricted blood flow through the arteries supplying the heart muscle The most serious risk from IHD is acute myocardial infarction (AMI) another term for heart attack AMI occurs when blood flow to the heart is suddenly reduced or stopped usually due to a blockage in a coronary artery Angina is severe constricting pain in the chest often radiating from the area immediately over the heart to the left shoulder and down the arm Cerebrovascular disease refers primarily to stroke the sudden impairment of brain function resulting from interruption of circulation to the brain Ischemic stroke results from blockage of an artery supplying blood to the brain Hemorrhagic stroke occurs due to the escape of blood from a ruptured artery supplying blood to the brain

CVD Burden and Trends CVD is the leading cause of death for men and women of all ages in the United States and in New York State In 1999 for every death from IHD there were five hospitalizations for ischemic disease in NYS (NYS DOH 2009a) Mortality rates from CVD have been declining steadily since the 1980rsquos Research studies estimate that approximately half of the decline is due to improved treatments for CVD and its risk factors such as high cholesterol and high blood pressure and approximately half is due to decreasing incidence of CVD The decrease in incidence is due to trends in health behaviors primarily reduced smoking rates

3

CVD mortality rates do not appear to be declining for all segments of the population however and may actually be increasing among some groups particularly people of lower socio-economic status (Barnett et al 1999) This may be due in part to higher rates of smoking among people of lower socio-economic status as well as to other factors related to socio-economic status and race A growing literature points to multiple causes for CVD and other health disparities including health practices psychosocial stress limited resources discrimination and access to health care (Health United States 2007) Increases in the prevalence of diabetes and obesity in the general population partially offset advances in treatment and favorable trends for other risk factors An increasing ischemic heart disease mortality rate in women ages 35 to 54 since 1997 is attributed to these unfavorable risk factor trends (Ford 2007 Lloyd-Jones 2009) New York State (NYS) and other northeast states have high IHD and low stroke mortality with NYS having the highest IHD mortality and lowest stroke mortality among the 50 states (Howard et al 2009)

CVD Data Sources CVD mortality rates are more easily measured than incidence prevalence or hospitalizations because mortality data from vital records are a comprehensive and relatively accessible data source Information for estimating the incidence and prevalence of CVD outcomes comes primarily from health interview physical examination and record surveys Hospitalization data are available for NYS and the US but are generally collected and analyzed in order to track trends in hospital care and resources expended Actual CVD incidence data validated by physician diagnosis physician examination or reviews of medical records are not directly available for the entire US or for specific geographic areas of NYS Such information is available from population-based cohort studies that follow people over time in selected communities While the concern underlying this research project is to understand CVD levels in the community ie incidence and prevalence we do not have data that directly provide us with this information In the absence of such data hospitalizations are frequently used as indicators of the burden of disease in communities particularly in surveillance or hypothesis-generating investigations such as this one In NYS the Behavioral Risk Factor Surveillance System (BRFSS) an ongoing monthly telephone survey of adults aged 18 years and older provides information about the burden of CVD by age sex race income and education

CVD Risk Factor Data The BRFSS data for NYS show that over 20 of the population aged 65 and older reported having been diagnosed with CVD versus less than five percent of the population under 65 Males were slightly more likely to report having CVD than females Nonshyhispanic whites compared to other ethnic groups combined were slightly more likely to report having CVD None of these slight differences were statistically significant however The only statistically significant demographic risk factors in the BRFSS data were low income and education levels Statistically significant elevations of reported CVD were observed for respondents with incomes below $25000 and respondents with a high school degree or less education (NYS DOH 2009a)

Another factor that can affect the outcome measure hospitalizations is the likelihood of mortality prior to reaching the hospital Studies show a variety of factors affect whether an individual experiencing a heart attack or stroke outside the hospital survives to be counted in hospitalization data These factors include lack of knowledge about personal risk or prior disease socio-economic status race marital status and distance to the hospital (Galea 2007

4

Barnett 2006 Ayala 2003) Population-based studies report sudden cardiac death usually outside the hospital to be the first overt manifestation of heart disease for 40 to 60 of all cases ( Kannel 1987 deVreede-Swagemakers 1997 UK Heart Attack Study 1998) An analysis of out-of-hospital mortality from AMI in Pennsylvania in 1998 showed that in relatively rural areas where travel time to the hospital was 25 minutes or longer 72 of all AMI deaths occurred outside the hospital compared with a statewide average of 49 of AMI deaths occurring outside of the hospital (OrsquoNeill 2003) In New York State 55 of IHD deaths occur outside the hospital compared to 35 of stroke deaths (NYS DOH 2009b)

Hudson River PCB Contamination

During an approximate 30-year period ending in 1977 the General Electric (GE) plants in Hudson Falls and Fort Edward used PCBs in the manufacture of electrical capacitors and discharged large quantities of PCBs into the upper Hudson River Estimates of the total direct discharges to the river are as high as 1330000 pounds In 1983 the US EPA classified a 200shymile section of the Hudson River from Hudson Falls to New York City as a National Priority List site making it one of the largest Superfund sites in the United States In 2002 the US EPA issued a Record of Decision calling for targeted environmental dredging and removal of PCB-contaminated sediment in the upper River (USEPA 2002) The dredging began in May 2009

The Superfund environmental investigations and planned remedial actions have focused on the areas of the river near Hudson Falls and Fort Edward where the highest levels of PCBs in sediments are located The 2002 Record of Decision provides information about the human health risk assessment conducted for the Upper and Mid-Hudson and the ecological risk assessment conducted for the Upper and Lower Hudson River (USEPA 2002) Fish contamination and fish consumption are the primary concerns at the site

In 1976 fishing was banned in the upper river and in 1995 a catch and release policy replaced the ban NYS DOH continues to recommend people eat no fish from the Upper Hudson River that children under age 15 and women of childbearing age eat no fish from the entire 200 mile stretch of river below Hudson Falls and that the general population eat none of most species of fish caught between the federal dam at Troy and Catskill approximately 40 miles south of Troy The Lower Hudson remains closed to commercial fishing for striped bass and eight other species

The fishing bans and advisories were issued because ingestion of fish is the major potential pathway for exposure for the general population Many previous studies have shown relationships between consumption of fish from contaminated waters and human PCB levels Another potential pathway for exposure less frequently studied to date is inhalation of PCBs NYS DOH and ATSDR launched ldquoThe Hudson River Communities Projectrdquo in 2000 to address this issue (More information on this project is provided below)

POPS and PCBs

The category ldquopersistent organic pollutantsrdquo (POPs) includes man-made chemicals that are by definition persistent in the environment Examples of POPs are polychlorinated biphenyls

5

(PCBs) polychlorinated dibenzo-p-dioxins (PCDDs) polychorinated dibenso-furans (PCDFs) and pprsquo-diphenyldichloroethene (DDE) a breakdown product of the pesticide dichlorodiphenyltrichloroethane (DDT) These chemicals are stored in human fat tissues accumulating over an individualrsquos lifetime They also accumulate in fish and other species thereby concentrating up the food chain Levels of most POPs measured in blood serum of Americans have decreased in the last few decades An exception is polybrominated diphenyl ethers (PBDEs) levels of which have been increasing in recent years While levels of many POPs are declining a large number continue to be detectable in blood for most of the US population (USDHHS 2009)

This follow-up investigation focuses on the Hudson River where the POPs of concern are PCBs PCBs are a group of synthetic chlorinated compounds manufactured in the US through 1977 Production was halted due to concerns about the persistence of PCBs in the environment and the toxicity of PCBs shown in animal models PCBs were used primarily by the electrical industry for capacitors and transformers and were also used in hydraulic fluids fluorescent light fixtures flame retardants inks adhesives carbonless copy paper paints pesticide extenders plasticizers and wire insulators PCBs can be oily liquids or solids and some are volatile and can exist as vapor in air PCBs in the environment are generally mixtures of the many different types known as congeners (ATSDR 2000)

The variety of types of PCBs adds to the complexity of studying adverse animal and human health effects Increasingly PCB research focuses on specific PCB congeners and groups of congeners because specific types of congeners appear to have different mechanisms of action and toxicity Dioxin-like PCBs (DLPCBs) are an important subset of PCBs due to concerns about their toxicity The focus on dioxin-like PCBs is paralleled by increasing research on the group of POPs with dioxin-like properties This type of PCB and POP research assigns toxic potency values relative to dioxin to various PCBs and POPs to provide an overall toxic equivalency value to specific PCB and other POPs

From 2002 through 2007 NYS DOH and SUNY Albany School of Public Health researchers conducted the Hudson River Communities Study among residents of Hudson Falls and Fort Edward the villages where the PCB discharges from two GE electrical capacitor plants occurred The comparison community was upriver from the discharges in Glens Falls This studyrsquos evaluation of exposures included measuring PCBs in outdoor air and in blood serum The study found that PCB levels in outdoor air were statistically significantly higher in the Hudson FallsFort Edward area (072 nanograms per cubic meter) than in the upriver Glens Falls area (040 nanograms per cubic meter) but that the PCB concentrations in the Hudson FallsFort Edward area were lower than those in other communities with known PCB-contaminated sites and similar to levels reported in other locations in the northeastern United States A statistically significant reduction in the total PCB concentrations in outdoor air samples was observed at a distance of 1200 meters (34 mile) from the river from 0760 ngm3 to 0497 ngm3 (Fitzgerald et al 2007 Palmer et al 2008)

The average PCB concentrations in serum for the study and comparison populations did not differ significantly (307 ppb wet weight versus 323 ppb) The average toxic equivalency quantities (TEQ) for dioxin-like PCB congeners for the study and comparison area populations

6

also did not differ In addition the study found no detectable differences in study subjectsrsquo serum PCB levels associated with proximity to contaminated portions of the river or other PCB sources The study did find that lifetime consumption of Hudson River fish was associated with higher serum PCB levels (Fitzgerald 2007 Palmer 2008) Another recent biomonitoring study of Brooklyn Manhattan and New Jersey fisherman who consumed versus those who did not consume their catch from the lower Hudson River and New YorkNew Jersey harbor suggests eating fish from these waters is not associated with an increased body burden of PCBs or most other organochlorines This study did not address the inhalation pathway (Morland 2008)

PCBs and Cardiovascular Disease Outcomes

Research studies on adverse health effects associated with PCB exposure have focused on many types of outcomes organs and biological systems including cancer liver reproductive and developmental effects thyroid endocrine immune system and neurological effects as well as specific effects on the skin and eyes Relatively little research has focused to date on associations between PCB exposures and CVD outcomes The majority of human studies have assessed health effects in occupational groups having relatively high-level exposures among people consuming fish from contaminated waters or people exposed to relatively high levels due to accidental contamination of food (USDHHS NIH 2010)

PCBs have been shown to produce a variety of toxic effects in animals These effects include cancer allergies hypersensitivity damage to the central and peripheral nervous systems reproductive disorders and disruption of the immune system (ATSDR 2000) In contrast research to date does not tell us whether specific types of human health effects have occurred from the low-level exposures experienced by the general population Some human epidemiological studies of relatively low levels of PCDDs PCDFs and PCBs suggest possible relationships not only with neurodevelopment of infants but also diabetes and a condition called insulin resistance conditions that increase CVD risk (Longnecker 2001 Carpenter 2006 Arisawa 2005)

In 1998 a study of a population in Seveso Italy exposed to relatively pure TCDD (dioxin) from an industrial accident showed increased mortality from CVD respiratory diseases and diabetes (Bertazzi 1998) These findings suggested the possibility that other dioxin-like chemicals such as dioxin-like PCBs might increase risk for these diseases Some more recent animal studies described below suggest possible mechanisms of action for PCB effects that increase risk for CVD diabetes and hypertension Another relevant source of information is from research using the biological monitoring data from the National Health and Nutrition Examination Survey (NHANES) project to look at the association between PCB levels in blood and self-reported diabetes and heart disease The body of evidence based on analytical human epidemiological studies remains relatively sparse however

Some recent studies using NHANES biomonitoring data show associations between PCB levels and CVD (Lee 2006 Lee 2007 Ha 2007 Everett 2008) These studies are cross-sectional (comparisons at one point in time) evaluations of PCB levels measured in blood samples from individuals and self-reported CVD and diabetes a risk factor for CVD While these studies use individual-level biomonitoring and self-reported health outcome data they are considered to be

7

relatively weak in terms of providing evidence of causality because both exposure and outcome are measured at the same point in time Given the nature of NHANES data collection and reporting practices these studies are not able to control for possible geographical confounding which may occur if urban versus rural residence is associated with higher PCB levels and higher risk for CVD for example The higher PCB levels among African Americans in these data suggest urban residence may be playing a role in the PCB data

The results of animal studies suggest multiple mechanisms by which PCB exposures could affect cardiovascular risk as well as multiple types of adverse health outcomes One hypothesized mechanisms of action specifically for dioxin-like PCBs is that they may have anti-estrogenic effects which impact CVD risk factors such as serum cholesterol levels and blood pressure (Lind 2004) Several studies suggest PCBs interact with a specific receptor (AhR) and increase cellular oxidative stress causing inflammation cell injury and cardiovascular dysfunction (Hennig 2002) Similarly some studies suggest PCB-induced inflammation of fat cells leads to obesity and increased risk for coronary artery disease (atherosclerosis) (Arsenescu 2008)

3 FOLLOW-UP STUDY DATA SOURCES AND METHODS

This follow-up investigation will provide perspective on the 2004-2005 findings (Sergeev and Carpenter 2005 Shcherbatykh et al 2005) by producing results from analyses conducted at a more detailed level of geography the block group rather than ZIP code in order to better control for confounding When the term confounding is used by epidemioogists it refers to the problem that can arise when there is a factor such as income level that is not the factor of interest in the study but that is related to the risk factor being studied (residential location ndash proxy for exposure) and to the disease being studied (CVD) This type of factor is labeled a ldquoconfounderrdquo because it can confound ie interfere with or confuse the evaluation of associations among the variables of interest in the data All types of epidemiological studies face issues of confounding but ecological ie group-level analyses face additional challenges related to potential confounding because analyses can not account for some important individual differences within the groups that are the units of analysis (Morganstern 1998b Hertz-Picciotto 1998) This investigation was able to control for each individualrsquos age sex and race but not for individual-level risk factors such as income or education level

The follow-up investigationrsquos specific objectives are to

improve the ability to control for confounding by socio-economic status through the use of block groups rather than ZIP codes as the unit of analysis

compare the hospitalization rates among residents of block groups nearest the Hudson River to those among residents of block groups farther from the Hudson River

compare the findings of this follow-up study to those of the earlier studies

Study Area and Exposure Areas

The overall study area is comprised of the 12 counties with geographic areas abutting the Hudson River south of the source of PCB contamination from the General Electric facility in Hudson Falls The 12 counties are Westchester Rockland Putnam Orange Dutchess Ulster

8

Columbia Greene Rensselaer Albany Washington and Saratoga US Census boundaries are used to define the study area because census population estimates provide the data for the study area populationrsquos size and the populationrsquos age sex and race distribution Because the study period spanned 1990 through 2005 and populations change over time 1990 and 2000 Census data were used to linearly interpolate the between-year populations from 1991 through 1999 for each block group 2000 data were used without extrapolation to estimate block group populations from 2001 through 2005

Because block group boundaries change between the Census years 1990 block populations were normalized to fit 2000 block group boundaries before the estimation of sex age and race group specific populations In the digital map 2000 block group boundaries were overlaid on the 1990 block centroids to assign 1990 block population counts to the corresponding 2000 block group boundaries Then 1990 block populations were summed to fit the block group boundaries Next population counts by age group (0-14 15-24 25-34 35-44 45-54 55-64 65-74 75+) sex (male female) and race (white black asian other) were estimated for each year at the block group level This investigation includes all ages over 25 years and the study population is divided into four race categories white black or African-American Asian and ldquoOtherrdquo The Pacific Island group was combined with the Asian group and American Indian ldquootherrdquo and ldquomultiple racesrdquo categories are combined into a larger ldquootherrdquo category

Classification of the population into potential exposure categories for this study is based exclusively on location relative to the Hudson River Two types of exposure proxy categories were developed block group adjacency and block group centroid distance to the river shore For the adjacency categories block groups were assigned to the adjacent category if they directly abutted the Hudson River and all other block groups in the 12 counties were assigned to the nonshyadjacent category

For the distance categories a central point (centroid) within each block group was identified and used to assign block group centroid distance to the river shore The block group centroids were population-weighted which means the geographic distribution of households within 2000 census blocks were used to locate block group centroids based on the geographic population distribution By locating a population-weighted block centroid rather than a more straightforward distance centroid the assignment of block group distance to the river shore more accurately reflects distance to the river for the majority of block group residents Buffers of one-half mile and one mile from the Hudson River boundary were created for the assignment of each block group into one of the three distance categories less than frac12 mile (closest) frac12 to one mile (close) and farther than one mile from the river (distant)

Distance categories were selected a priori at the start of the study Because environmental sampling data were not collected along the length of the river included in this investigation we are not able to determine whether these distance categories represent actual differences in exposures As discussed previously a statistically significant reduction in the total PCB concentrations in outdoor air samples was observed at a distance of 1200 meters (34 mile) from the river from 0760 ngm3 to 0497 ngm3 in a study conducted in the Hudson Falls Fort Edward area near the source of PCBs (Fitzgerald 2007 Palmer 2007)

9

For the adjacency categories of the 2371 block groups in the study area 255 (11) were classified as adjacent to the Hudson River and 2116 (89) were classified as non-adjacent (Figure 2) For the distance categories 225 (9) were classified as closest to the river 216 (9) as close and 1930 block groups (81) as distant from the river

Comparison of Figures 1 and 2 show that block groups include smaller geographic areas than ZIP codes and that block groups are relatively smaller in area in urban versus rural areas The study area includes a total of 368 ZIP codes compared to a total of 2371 block groups ZIP code average population in the study area is approximately 8000 compared to block group average population of 1200 Comparison of Figures 2 and 3 shows that using population-weighted centroid distance to the river produces different exposure areas than adjacency Fifty-three (20) of the adjacent block groups were categorized for the distance indicator as distant because the population in the block group was concentrated in the area farther than one mile from the river On the other hand about 11 of the 2116 non-adjacent block groups were classified as closest (77) or close (162)

Hospitalization Data

Hospitalizations for ischemic heart disease (IHD) and cerebrovascular disease (stroke) were included in this study More specifically hospital discharge records that listed as the principal diagnosis either IHD (ICD-9-CM 410-414 and 4297) or cerebrovascular disease (ICD-9-CM 430-436) and subsets of each of these categories similar to the subsets evaluated in the Sergeev et al and Shcherbatykh studies were included The subsets are AMI and angina subsets of IHD and ischemic and hemorrhagic stroke which are subsets of cerebrovascular disease (stroke) (Principal diagnosis represents the illness for which the person was admitted to the hospital) Angina is an outcome not evaluated in the prior studies It was added here for comparative purposes because it represents an outcome appropriately addressed by primary outpatient rather than inpatient hospital care Angina is one of a group of outcomes included in the recently developed national ldquoPrevention Quality Indicatorsrdquo (USDHHS AHRQ 2007) as an indicator for geographic areas or populations for whom primary care is inadequate Table 3 lists the specific ICD-9-CM codes and definitions included in this investigation

While one of the previously published studies (Huang et al 2006) showed elevations in hypertension (high blood pressure) rates this outcome is rarely listed as the principal reason for hospitalization As a secondary diagnosis however hypertension is extremely common with substantial overlap with the primary diagnoses of IHD and stroke In addition hypertension hospitalizations were not as elevated in the ZIP codes adjacent to the Hudson River as the IHD and stroke outcomes As a practical matter geocoding the hypertension hospitalization data (1364000 hospitalizations) would overwhelm the resources for this project For these reasons hypertension was not included in this follow-up investigation

The source of the hospitalization data was the NYS DOH Statewide Planning and Research Cooperative System (SPARCS) established in 1979 to collect detailed records on discharges from hospitals located in New York State Persons seen in the emergency department but not admitted are not included in the SPARCS data Hospitalizations at US Veterans Administration

10

Hospitals are also not included in this dataset We identified the hospital transfers in the data and excluded them so that individuals are only counted once for the same event

The NYS DOH Data Protection Review Board for SPARCS hospitalization data approved our use of hospitalization records for this investigation and provided approximately 460000 records with a principal diagnosis of either IHD or stroke Because of the large number of cardiovascular hospitalizations and this studyrsquos requirement for geocoding each hospitalized personrsquos address we selected a 50 random sample of the hospital discharges with principal diagnoses of IHD or stroke for inclusion in the study (229948 records) Approximately one percent of these records (2737) were eliminated from the study because they contained no residential address resulting in 227211 hospitalization records 24939 of these records were eliminated after geocoding because they were not within the 12 county study area This left 202272 records remaining in the study

For geocoding to the block group level the hospitalization street addresses were modified as needed using US Postal Service standards The addresses were assigned geographic coordinates using commercially available geocoding software (MapMarker 2004) A combination of land parcel data and street centerline files were used to assign geographic coordinates No contact was made with cases parents legal guardians or next of kin of cases to determine residential locations

Once geographic coordinates were assigned to cases through address-matching the case locations were overlaid onto digital maps of the study area using a geographic information system so that the number of observed cases falling within the study area boundaries could be determined and a 2000 Census block group could be assigned to each hospitalization In order to protect confidentiality no maps of individual case locations are provided Geographic coordinates were not able to be assigned for some addresses for example rural routes or post office boxes Internet searches were conducted for addresses that were located in facilities such as nursing homes and trailer parks

While fewer than one percent of records were missing sex or age information approximately 15000 records seven percent were missing race information Evaluation of the geographic and temporal distribution of these records showed they were not randomly geographically distributed 47 of the records with no race information were from one hospital for that hospital race was missing from 100 of the records in all years except one after 1997 mid-way through the study time-frame Evaluation of other records missing race showed similar patterns of 100 of records missing race in specific years In order to prevent biased hospitalization rate estimates records missing race were assigned race randomly assuming a race distribution equal to that of the individualrsquos residential block group

Statistical Analysis

This investigation used negative binomial multivariable regression models the statistical methods used in the previous studies (Sergeev and Carpenter 2005 [IHD] Shcherbatykh et al 2005 [stroke]) The follow-up studyrsquos analyses compared hospitalization rates for populations in block groups adjacent or in close proximity to the Hudson River to rates for populations more

11

distant from the Hudson River while adjusting for the potential influence of other factors Sex age and race-specific hospitalization count data comprise the dependent variable and the statistical methods attempt to predict the counts using the sex age and race distributions from Census data as well as block group-level demographics such as median income Models were first estimated using basic Census information for age sex and race Then more fully adjusted models were estimated using information at the block group level for income education population density Hispanic ethnicity and distance to the nearest hospital

More specifically each hospitalized personrsquos block group was used to assign quartile categories of block group median household income of population with less than a college education population density of the population identified as Hispanic and distance from the block group centroid to the nearest hospital These variables were selected in advance The population density variable was selected to account for potential effects on hospitalization rates of large differences in population density from the relatively rural northern counties to the more urban southern counties in the study area of the population identified as Hispanic was included because Hispanic ethnicity may be associated with CVD risk but Hispanic ethnicity was not available in the hospitalization records Distance to the hospital was included because studies show that shorter distances increase the likelihood that individuals with AMI and stroke survive to be admitted to the hospital

As described previously the literature about risk factors for CVD shows strong associations between lower income and education and higher CVD risk Regarding the effects of income and education on hospitalization rates the literature from analytic studies specifically of hospitalizations for CVD is quite sparse Most analytic studies focus on prevalence incidence or mortality in population-based studies or cohort studies and there are not many of these An example of a study that was useful as background is a study of income level and CVD mortality that showed strong associations of lower income and higher CVD mortality for men and women with the relationship for men being linear throughout the range of income levels while for women the relationship was only strong at levels of income below the median (Rehkopf 2008)

This follow-up studyrsquos approach was to use the same set of variables for the more fully adjusted models using quartile cut points for all the sexrace groups and types of CVD diagnostic subsets The 2000 Census data were used to create the population-weighted quartile categories to be used in the statistical analyses The quartile cut points are shown below

population density(persons per square mile)

median 1999 household income

with less than a college education

with Hispanic ethnicity

Hospital distance

Quartile 1 61 - lt507 $2499-lt$40968 0 - lt31 0 - lt2 lt25 (median) Quartile 2 507 - lt102 $40968 - lt54650 31 - lt43 2 - lt5 25 - lt6 (50-75) Quartile 3 102 - lt6410 $54650 - lt72944 43 - lt55 5 - lt10 6 ndash 11 (75-90) Quartile 4 6410 - 94372 $72944 - 200001 55 - 100 10 - 86 11 ndash 45 (90-100)

SAS version 9 provided the statistical programs used to estimate hospitalization rates for exposure proxy categories and demographic categories The models provide ldquocontrast

12

estimatesrdquo ie rate ratios for one category versus a reference category of a variable and 95 confidence intervals (CI) for these rate ratios For the location categories rate ratios for the areas adjacent are compared to areas not adjacent to the river (reference) and areas closest and close to the river are compared to areas distant from the river (reference) If a rate ratio is greater than 100 the model has estimated an elevation of hospitalizations in that category of the study population compared to the reference population If the rate ratio is less than 100 then there are fewer than expected hospitalizations compared to the reference population

The magnitude of the elevation or deficit is also estimated by the rate ratio For instance if the hospitalization rate is twice as high in the adjacent population it would result in a rate ratio of 200 while a 50 higher rate would result in a rate ratio of 150 If the hospitalization rate in the comparison group was only half as big as in the reference group this would result in a rate ratio of 050 The 95 confidence interval assists with determining whether the observed differences are statistically meaningful ie statistically significant The confidence interval provides the range in which there is a 95 probability of including the true rate ratio

In addition to directly assess potential confounding we examined the associations between income and exposure proxy categories and between income and CVD hospitalization risk Because the modeling process provides rate ratios that provide comparisons among groups but not the rates themselves we also calculated the rates per population for each sex race and income category in order to directly observe the relative magnitudes of CVD hospitalization rates Finally analyses stratified by income quartile were also conducted in order to assess the potential role of confounding by income The stratified analyses consist of negative binomial regressions conducted separately for each quartile of income

4 RESULTS

Demographics Associations with Proximity to the River

Study Population Table 4 provides detailed demographic characteristics for the total study population and the populations of the adjacent versus non-adjacent and nearer versus farther from the river block groups from the 2000 Census The total population of the study area for this one year of the study time period is 2924631 persons The first row of Table 4 shows that approximately 11 percent of the study area population is classified as living in block groups adjacent to the Hudson River and 89 live in non-adjacent block groups Approximately 8 of the studyrsquos population lives in the closest block groups approximately 10 lives in close block groups and approximately 82 in distant block groups The table also shows that approximately 81 of the total study population is identified as white 9 black 3 Asian less than 01 Pacific Islander 02 American Indian 4 ldquootherrdquo and 2 ldquomultiple racesrdquo

The 2000 Census data (Table 4) also show that the populations in block groups adjacent and nonshyadjacent to the river are similar in terms of the percent of minority population but the adjacent block groups include a smaller percentage of persons who have attended college and a higher percentage of households below the poverty level Median income is $47320 for block groups adjacent versus $62668 for block groups not adjacent to the river Data for the study area from the 1990 Census show similar distributions (data not shown)

13

For the distance to the river categories there is a gradient for the three levels of distance from the river with the closest block groups on average having higher percentages of minorities smaller percentages of persons who have attended college and higher percentages of households below the poverty level (Table 4) The populations of the close and closest block groups are more similar to each other than to the populations of the distant block groups The population of the closest block groups is estimated to be 19 black close block groups 14 black and distant block groups 8 black The proportions for the ldquootherrdquo category used in the analyses which includes multiple races and American Indians are approximately 12 in both the closest block groups and in the close block groups compared to 5 in the block groups most distant from the river Median income for the closest block groups is $40748 close block groups $44962 and most distant block groups $64847

The bar chart below illustrates the correlations between US Census 2000 indicators of block group socio-economic status and block group centroid (population weighted) distance from the river for the study population

2000 Census Demographics for Study Population by Distance to River Category

black other hispanic no college below poverty

of pop ulati on

0

10

20

30

40

50

60

lt12 mile

12 to 1 mile

gt1 mile

Table 5 shows selected demographic information for the person-year estimates used in the study analyses for the 1990-2005 study period Person-year data represent individuals from Census estimates counted repeatedly for each year of the study timeframe As described in the Methods section the estimation of person years used interpolation to account for population changes from 1990-2000 and techniques for taking account of block boundary changes The person-years estimates for demographic categories (complete data not shown) show similar patterns as the 1990 Census data (data not shown) and 2000 Census data (Table 5) Table 5 for example shows nearly identical percentages of population in each of the river distance categories as Table 4

Table 5 also shows person year information for income quartiles and exposure proxy categories as well as race and exposure proxy categories that is consistent with the Table 4 data for the

14

2000 population Table 5 row percentages show for example that while approximately 11 of the total study population live in adjacent block groups approximately 19 of the population in the lowest income quartile live in adjacent block groups By design approximately 25 of the total study population is in each of the income quartiles Table 5 column percentages show however that in the adjacent block groups 41 of the population is in the lowest income quartile and only 9 are in the highest income quartile

For the block groups within frac12 mile of the river the distribution among income quartiles is even more skewed towards the lowest income quartile 61 of the population living within frac12 mile of the river is in the lowest income quartile and only 6 is in the highest income quartile Given that approximately 25 of the total study population is within each income quartile these percentages indicate that living within the block groups closest to the river is associated with more than a doubling of the likelihood of being in a block group in the lowest socio-economic status quartile 436 of those living in the block groups close to the river (12 to one mile from the river) are in block groups in the lowest income quartile While less than a doubling this indicates that living in the close block groups is associated with a 75 increase in the likelihood of being in the lowest quartile for socio-economic status

Hospitalizations Table 6 shows the process of removing hospitalization records from the study for a variety of reasons The first column includes the 202272 records that contained a residential address and were located within the 12-county study area as described in the Methods section The next five columns show the records remaining after data cleaning exclusion of transfers elimination of records with missing covariates and exclusions due to poor geocoding results Approximately 33000 records were eliminated because they were transfers from one hospital to another for the same event For approximately 1100 records geocoding was inadequate for assigning block group for imputing race as described in the Methods section 336 records were missing sex (11) or age (325) Of the 167733 records remaining 154220 (92) were able to be adequately geocoded to a street address for assignment in a specific block group While approximately eight percent of hospitalizations were eliminated due to inadequate geocoding Table 6 shows that the percentages of hospitalization records for white black Asian and ldquootherrdquo categories and the distribution among diagnostic categories change very little after these records were dropped

The final column of Table 6 shows that of the 154220 hospitalizations in the analysis 101012 IHD hospitalizations comprise the largest health outcome category and are 65 of the hospitalizations in the study AMI hospitalizations (35811) account for 35 of the IHD hospitalizations and angina (4731) accounts for five percent of the IHD hospitalizations Stroke hospitalizations (53208) comprise 34 of the studyrsquos hospitalizations Approximately 36 of the stroke hospitalizations are for ischemic stroke (19371) and approximately 11 are for hemorrhagic stroke (5995)

Table 7 shows the distributions of hospitalizations among the four income quartiles While each income quartile (by definition) contains approximately 25 of the study population for all types of hospitalizations we see that the lowest income quartile includes more than 25 of hospitalizations from a minimum of 286 (IHD) to a maximum of 356 of hospitalizations (angina) The highest income quartile includes fewer than 25 of hospitalizations for each CVD

15

category ranging from 182 for angina to 224 for hemorrhagic stroke These deviations from the 25 quartile distribution show a strong linear pattern with the largest percentage of hospitalizations coming from residents of block groups in the lowest income quartile and the smallest percentages coming from residents of the most wealthy block groups For AMI the lowest income quartilersquos share of hospitalizations is 50 higher than the highest income quartilersquos share of hospitalizations (30 vs 20) IHD and stroke show similar differences and angina shows a doubling with the lowest income quartile contributing 36 of hospitalizations compared to the highest income quartilersquos 18 share of hospitalizations

Table 7 also shows the distribution of hospitalizations among the race categories and the study population race category distribution for comparison Blacks show lower percentages of IHD and AMI than their population percentages and higher percentages of angina and stroke While full presentation of race and sex findings is beyond the scope of this report the lower than expected percentages of IHD and AMI among blacks who on average have lower socioshyeconomic status and are at greater risk for CVD hospitalizations raised questions that were addressed by calculating hospitalization rates per 1000 population for each sex and race group for each income quartile (Appendix B Table 2) Figures 4 5 and 6 show these rates for the white black and Asian categories (These rates for the ldquootherrdquo category are comparatively very high and are not shown on these graphs This issue is addressed in the Discussion section) Figures 4 5 and 6 show the consistent and strong effects of income quartile on hospitalization rates for AMI and stroke for whites blacks and Asians of both sexes The other outcomes show similar patterns (The ldquootherrdquo group showed an opposite pattern [See Discussion Section])

Demographics Summary

The strong associations of lower income with residence closer to the river (Table 5) and lower income with higher hospitalization rates (Tables 7 amp 8) create a difficult challenge for this investigation The multivariable analyses of hospitalization rates attempt to control for socioshyeconomic differences However because these income differences which strongly affect hospitalization rates are also strongly correlated with the gradations of exposure (river distance) it will be difficult to disentangle the influence of socio-economic status from the influence of potential exposures

Statistical Analyses Proximity to the River and CVD Hospitalization Risk

All the variables described above were included in the multivariable regression models and almost every quartile category for every variable showed statistically significant and expected types of effects for all the CVD hospitalization categories (Appendix B Tables 1a and 1b) The following presentation of results focuses on the rate ratios for the exposure proxy variables Table 8a (IHD) and 8b (stroke) show the regression modeling results Results are shown first for models that include only age sex and race in addition to the exposure proxy categories and second for models that also include median income education population density Hispanic ethnicity and distance to nearest hospital The tables provide numbers of hospitalizations and person-years to assist with interpretation For example for total IHD we can see that while 108 of the study population lives in adjacent block groups 122 of the hospitalizations are among people in adjacent block groups This difference of 14 percentage points is the basis for

16

the finding of a statistically significantly elevated rate ratio (risk) for IHD among people living in the adjacent block groups (RR=133 CI 112-157)

Looking across the columns of Tables 8a and 8b we can see the rate ratios for each of the outcomes for the adjacent versus non-adjacent block groups and the results are similar to those for IHD The elevated rate ratios range from 123 for angina to 136 for AMI The only rate ratio not elevated in these first models adjusting only for age sex and race is for hemorrhagic stroke Going down the rows of the Table the next set of rate ratios are for the distance to the river categories The findings are very similar to those for the adjacent block groups Here however we have two exposure proxy categories allowing comparison of the two These rate ratios show no evidence that living in the block groups closest to the river (lt12 mile to the river) produces greater risk than living in the block groups close to the river (12 to 1 mile from the river)

The next set of rate ratios are for models with additional adjustment for the demographics of each hospitalized personrsquos block group as well as distance to the nearest hospital As expected adjustment for block group median income and other demographics accounts for some of the variability in hospitalization counts and thereby reduces the magnitude of the rate ratios estimated for the river adjacency and distance categories The rate ratio elevation for total IHD for example is reduced from 133 to 107 (RR=107 CI 104-111) The elevated rate ratios in these more fully adjusted models for the adjacency analyses are of similarly small magnitude ranging from 106 for AMI to 110 for ischemic stroke Four of the six outcomes show statistically significant elevations Again only hemorrhagic stroke shows no evidence of an elevated rate ratio

For the river distance categories none of the rate ratios show any evidence that living in the block groups closest to the river (lt12 mile to the river) produces greater risk than living in the block groups close to the river (12 to 1 mile) Of the six types of outcomes only two show statistically significantly elevated rate ratios for living in the closest block groups (total stroke and ischemic stroke) For the close block groups four out of six outcomes show statistically significantly elevated rate ratios The statistically significantly elevated rate ratios vary from 106 (AMI stroke) to 113 (angina)

To address the strong evidence described earlier for confounding in these data ie the association of lower income with both CVD hospitalizations and residence closer to the river we also conducted the analyses separately for each quartile of income These results are presented in Table 9 The first results are from analyses adjusting only for age sex and race and the second set of results are from models including the additional demographic factors For simplicity rate ratios are shown only for block groups closest to the river compared as in the other models to block groups farther than one mile from the river (Results were similar for the other distance categories (data not shown)

For the two lowest income quartiles Table 9 shows results similar to those for the study population as a whole (Table 8a 8b) However for the two highest income quartiles which represent frac12 of the study population Table 9 shows a consistent pattern of reduced rate ratios for residence in the closest block groups within frac12 mile of the river In the highest income quartile

17

these reduced rate ratios are statistically significant for almost all of the outcomes in both the partially and more fully adjusted models In addition the magnitude of the rate ratio estimates ie their departure from 100 is relatively high For the population living in block groups with the highest median income (income gt$73000) living within frac12 mile of the river is associated with having about 70 of the risk for IHD AMI and stroke (CI=068 069 069) as the population in equally high income block groups farther than one mile from the river In other words among the highest income quartile living this close to the river is associated with an estimated 30 reduction in risk for these types of hospitalizations For angina (CI=028) there is an estimated 70 reduction in risk and for ischemic stroke (CI=058) the reduction is estimated to be 40 These stratified analyses show that in one-half of the study population the two higher income quartiles there is no evidence of elevated hospitalization risk for populations in block groups closer to the river On the contrary reduced rate ratios are shown for all outcomes in the higher income quartiles for residents of block groups within frac12 mile of the river compared to those living farther than one mile from the river

Weighted average rate ratio estimates are a standard method for estimating a general population finding from stratified analyses and are also provided in Table 9 These average rate ratios range from 078 to 102 showing no evidence of elevated risk for the overall population associated with residence within frac12 mile of the river compared to living farther than one mile from the river Results for the other categories of river proximity showed similar results (data not shown) (Given the evidence of confounding interfering with analyses of the entire study population and the opposing findings depending on income quartile no confidence intervals are provided here for the overall population-weighted average estimates)

Summary of Results

In summary the multivariable regression results for the entire study population (Tables 8a 8b) show statistically significantly elevated rate ratio estimates for the adjacent versus non-adjacent block groups for four out of six hospitalization diagnosis categories The analyses also produce statistically significantly elevated rate ratio estimates for block groups closest to the river (less than frac12 mile from the river) and close to the river (between 12 to 1 mile from the river) compared to distant block groups (farther than 1 mile from the river) for most of the outcome categories However the findings provide no evidence that the closest block groups have higher hospitalization rates than the close block groups

The most elevated rate ratio estimate is 113 (angina) indicating an estimated 13 percent increase in risk for angina hospitalizations for residents of close versus distant block groups The only outcome with no statistically significant elevations for living near (adjacent closest or close) the river in the more fully adjusted models is hemorrhagic stroke It is noteworthy that the outcome that is considered an indicator of inadequate primary care rather than of incidence or prevalence angina hospitalizations shows the highest elevations associated with residence near the river (Table 8a) In addition the hospitalization outcome with the least association with socioshyeconomic status in our data (Appendix B Table 1b) and the lowest out-of-hospital mortality (NYS DOH2009b) hemorrhagic stroke is the outcome that shows no pattern of elevations associated with residence near the river (Table 8b) This pattern of findings is consistent with the interpretation that the rate ratios for living closer to the river are reflecting the influence of

18

lower socio-economic status and other known risk factors for hospitalizations for CVD rather than any unusual environmental exposures in areas near the river

The stratified results add strong support to this conclusion They suggest that classical confounding of the exposure-disease relationship by socio-economic status produced biased results in the overall multivariable regression analyses While the regression analysis of the entire study population shows elevated hospitalization rates for residents of block groups near the river compared to residents living farther away (Table 8a and 8b) the stratified analyses (Table 9) show this association applies only to the 50 of residents living in block groups with lower median incomes For the other 50 of residents those who live in block groups with higher median incomes hospitalization rate ratios are reduced for those living near the river Classical confounding refers to the type of confounding that can occur in individual-level or group-level (ecological) studies when a factor such as socio-economic status affects both the exposure (residential location) and the health effect In this study of CVD hospitalizations there are disproportionate numbers of outcomes and exposures (residence near the river) among specific levels of the confounder the lower income quartiles (Tables 4 5 7) The disproportionate numbers of hospitalizations and potential exposures (block groups close to the river) among the lower income quartiles create an artifactual exaggerated ie biased effect on the estimates for the overall analysis (Greenland 1999 Greenland 2003)

5 DISCUSSION

The key finding of this investigation is that the results of the multivariable regression modeling for the entire study population the elevated CVD hospitalization rates for populations living closer to the Hudson River are likely to be artifacts of classical confounding Stratifying on the confounding variable ie holding median income level constant the standard method for controlling for confounding showed the decisive role of income as a confounding variable The associations between living closer to the river and having higher hospitalization rates were evident only for people in the two lowest income quartiles For people in the two highest income quartiles the association was completely opposite ie people living closer to the river had lower rates of hospitalizations This variability of effects can be interpreted as evidence that there are no actual effects of the exposure-proxy categories on hospitalization rates

This type of classical confounding due to the strong influence of a variable that precedes causes or strongly affects both the exposure and the health outcome can occur in any type of epidemiological study The ecologic ie group-level analysis in this study is not the primary cause of the confounding Rather the complex associations in the population among income education CVD risk geographic residential patterns and geographic assignment of exposure status are the basis for the confounding However the studyrsquos group-level design contributes to the studyrsquos inability to completely control for confounding using multivariable regression methods and contributes to difficulties of interpreting the study findings

The stratified results shown in Table 9 may also be a reflection of inadequate control for socioshyeconomic status using quartile groupings A possible explanation for the Table 9 findings is that among lower income quartile populations people living in closest proximity to the river may have the lowest incomes while in higher income quartile populations people living in closest

19

proximity to the river may have the highest incomes This potential income difference associated with river distance within quartile groupings would explain why the stratified analyses showed positive associations with hospitalization risk for lower income residents closest to the river and negative associations with hospital risk for higher income residents closest to the river In this example of a possible explanation it is assumed that river proximity categories are capturing relative income (and hospitalization risk) differences rather than exposure Because the exposure proxy river distance categories are assigned at the group level this type of analysis is not able to distinguish whether these categories are capturing income rather than potential exposure in the analyses

When trying to interpret this studyrsquos findings it is important to keep in mind the many factors associated with CVD risk that are not controlled for in this studyrsquos analyses These include tobacco use high blood pressure high serum cholesterol alcohol use obesity low fruit and vegetable intake physical inactivity diabetes mental ill-health psychosocial stress and the use of certain medications Another factor that is not known to be associated with CVD risk but that is relevant for PCB exposure is consumption of contaminated fish It is possible that lower income populations are more exposed to PCBs because they are more likely to eat fish from the Hudson River This however would not help explain why higher income people near the river showed lowered CVD hospitalization risk than their counterparts living farther from the river

In addition to the confounding due to socio-economic statusrsquos effects on both the risk of CVD and residential location (the exposure proxy) there is the possibility that another pattern of effects also associated with socio-economic status called effect modification plays a role in these data Effect modification occurs when different groups people with differing types of health insurance or differing behaviors for example show different patterns or levels of health effects associated with the same exposures The differing effects (CVD hospitalization risks) of residence in block groups closest to the river by income quartile could be interpreted as evidence of effect modification (Table 9) However the magnitude and the change in direction of effects is unlikely to be due solely to differing biological or behavioral factors associated with income level that affect the results of exposures from living near the river Given the strong associations of socio-economic status and CVD hospitalization risk it is more likely that the analyses stratified by income enhance protection against bias compared to the multivariable models and that the stratified analyses represent findings from better control for confounding as opposed to effect modification

Effect modification is a concept used to describe important relationships between exposures and effects (that may differ among sub-groups for example) that the researcher wants to identify and understand Confounding on the other hand refers to associations among the variables that the researcher is trying to control so that the effects of interest (effects from exposures rather than age or sex for example) can be identified and the strength of the effects can be estimated The ecological design of this study contributes to the difficulty of interpreting whether the study shows evidence of effect modification as opposed to confounding (Morgenstern 1998b Hertz-Picciotto 1998)

Another limitation associated with this studyrsquos design is that block-group level socio-economic and environmental attributes that are assigned to individuals in this type of study can be more

20

strongly correlated at the group level than at the individual level In this way the ecological assignment of variables may lead to increasing problems of confounding because of strong associations among indicators of socio-economic status and environmental exposure categories (Morganstern 1998a) This problem is likely to have affected this studyrsquos results As with effect modification by group described above this type of confounding by group cannot be evaluated with ecologic data because individual-level information is needed to check for this type of ldquocross-levelrdquo bias

Ecological analyses are an important tool for environmental epidemiology and there is therefore a large and growing literature on their strengths and limitations and methods to address limitations (Piantadosi 1988 Greenland 1989 Greenland 1992 Greenland 1994 Richardson 2000) One particular concern is the interpretation of ecological study findings showing modest elevations of risk such as in the current study with rate ratios in the more fully adjusted models generally estimated to be below 110 (Wakefield 2003 Morgenstern 1998) Wakefield states that slight elevations need to be interpreted with great caution

in particular for cancer and heart disease (because) there are risk factors that are far more predictive of disease than environmental factors hellip Consequently the potential for confounding is strong since ecological studies do not directly use individual-level risk factor data although stratification by age and gender is routinely carried out (Frequently an area-level measure of socio-economic status is also used as a confounding variable but being an aggregate summary it only provides very crude control) (Wakefield 2003 p 9-10)

A strength of this study is that it used Census block groups which are relatively small average population of 1200 in the study area to assign socio-economic characteristics rather than the larger ZIP codes with average population size 8000 in this area By using block groups this study sought to reduce confounding by socio-economic status because the assignment of median income and other variables at the block group level was expected to be more accurate with block group populations expected to be more homogenous than larger and more geographically diffuse ZIP code populations used in the prior studies

The use of census block groups rather than ZIP codes likely enhanced the accuracy of the group-level category assignments However this type of design that assigns characteristics to hospitalizations using group-level characteristics and distance to the river is limited in comparison to individual-level designs that gather such information from personal interviews for example Most importantly for this investigation the group-level assignment of median income and other demographic variables to each hospitalization limited the studyrsquos ability to distinguish among individuals within the exposure proxy categories and thereby reduced the studyrsquos ability to control for confounding

This type of measurement error or inaccuracy from assigning group-level potential exposure income or educational level to the individual hospitalizations is generally assumed to bias a study towards findings of no associations or no effects But in situations where the measurement error is more severe for some variables than others and these variables are correlated then the study findings can be biased either towards exaggerated effects or no effect In this

21

investigation potential measurement error associated with the assignment of exposure categories based on block group centroid distance to the river is expected to be relatively low because a large proportion of the hospitalization addresses are very clearly in areas distant from the river The measurement error associated with the assignment of group-level demographic characteristics (income and education for example) from the Census is potentially relatively higher because this type of error can occur throughout every block group in the study

Another strength of this study is the use of NYS SPARCS data a relatively comprehensive population-based source of information about hospitalizations These data include individual-level information used in this study for patient address sex age race and diagnosis Of these variables only the address and race categories were missing information for substantial numbers of records and this limitation could have contributed to imprecision sufficient to create bias in the findings These issues are discussed in greater detail below

While geocoding to block groups represents a strength of this study this procedure required high quality address information One percent of the study arearsquos hospitalization records contained no address and eight percent of the addresses could not be accurately geocoded to a block group resulting in nine percent of hospitalizations being eliminated from the study because of address andor geocoding issues The distributions of race categories and distribution among the types of diagnoses did not change after dropping records with no or insufficient address information suggesting that the dropped records did not represent a substantially skewed group of records

Inaccuracy in the studyrsquos assignments of race categories results from limitations associated with both the hospitalization and Census data The SPARCS hospitalization database did not include information on race for seven percent of the studyrsquos hospitalizations We imputed race by assigning race category randomly based on the race distribution of the personrsquos block group This method would have biased the study toward a finding of no effect of race on hospitalization risk

Another limitation of the SPARCS data used in this investigation is that repeat hospitalizations for the same individual were not able to be identified If an individual was admitted to the hospital many times over the course of the study period then that individual would be counted multiple times This limitation may have contributed to an exaggerated association of lower socio-economic status and CVD hospitalizations because some people with lower incomes receive less adequate outpatient care and may have more repeat admissions Because of the association of lower income with residence near the river this limitation may have contributed to the difficulty of adequately controlling for confounding by socio-economic status in this investigation

While the use of Census data at the block group level rather than at the ZIP code level is a strength in terms of relative homogeneity among the population within block groups it is possible that block group population estimation errors contributed to imprecise estimation of hospitalization rates Particularly with regard to the race categories the study finding that the ldquootherrdquo category showed a counter-trend of higher income and higher CVD hospitalization rates (Appendix B Table 2) is evidence that differences between the categories used in the Census race designations and hospitalization race assignments affected the estimated hospitalization

22

rates The evolving choices for identifying onersquos race in the Census are not concurrently reflected in the SPARCS data and these inconsistencies likely contributed to imprecision in the estimates of CVD hospitalization risk associated with specific race categories These issues did not contribute to bias for the overall findings however We modeled the data with a variety of race category groupings combining black and ldquootherrdquo for example and the overall results did not change (data not shown)

Also regarding the Census undercounting of approximately two percent for the US population in 2000 was estimated to be primarily among the lowest socio-economic groups and minorities while over-counting of one percent was estimated to have occurred among the most wealthy who sometimes received and completed forms at multiple residences (United States Censusrsquo Monitoring Board 2001) The undercounting of the lower income population would result in over-estimation of rates in this population Undercounting of recent Asian immigrants for example particularly those of lower socio-economic status is a potential reason for the relatively high rates of CVD hospitalizations estimated for lower income quartiles of the Asian population (Figures 4-6) Models that grouped the Asian population with other race categories were also estimated and these showed the same results as the models reported here (data not shown) Undercounting of lower income groups in general would have contributed to the finding of higher hospitalization rates in these groups and added to the difficulty of adequately controlling for confounding of lower income with residence near the river

This study used a 50 percent sample of the CVD hospitalizations in the study area This was accomplished by using every other record provided in the complete dataset This would be expected to provide a non-biased random sample As a general check for data provision management or analysis errors we compared our estimates of hospitalization rates per population with those from other sources Table 10 shows these comparisons and shows that our estimates are generally consistent with rates from other sources

Actual exposures were not measured in this investigation Rather proximity to the river was used as a proxy for potential exposures associated with the river In addition other potential exposures that may impact CVD were not evaluated or measured Exposures to lead and air pollution would be expected to be higher in more urban areas and areas near heavy traffic which may tend to coincide with proximity to the river These issues were not addressed in this investigation If these exposures were occurring more often near the river and contributing to cardiovascular hospitalization rates this would contribute to findings of elevations in areas closer to the river

It is important to keep in mind the limitations associated with using CVD hospitalizations when incidence andor prevalence of CVD are the actual outcomes of interest As described previously mortality prior to reaching the hospital may reduce hospitalization rates particularly for AMI and stroke for some groups or areas more than others and introduce bias if hospitalizations are being interpreted as a surrogate for incidence or prevalence

23

Addressing Study Objectives

This follow-up report has addressed the studyrsquos first two objectives to increase control for confounding by socio-economic status through the use of block groups rather than ZIP codes as the unit of analysis and to compare the hospitalization rates among residents of block groups nearest the Hudson River to those among residents of block groups farther from the Hudson River Regarding the first objective the use of smaller units of analysis block groups rather than ZIP codes improves the accuracy of the assignment of income and other group-level demographic categories to the hospitalizations

Regarding the studyrsquos second objective to compare hospitalization rates among residents of block groups near to the river to hospitalization rates among residents far from the river this studyrsquos multivariable analyses showed relatively small but statistically significant elevations of CVD hospitalization risk for residents of block groups in close proximity to the Hudson River Stratified analyses that controlled for confounding directly by conducting analyses within each income quartile showed that residence in block groups near the Hudson River was associated with increased risk for CVD hospitalizations among residents of lower income block groups and conversely residence in block groups near the Hudson River was associated with decreased risk for CVD hospitalizations among residents of higher income block groups The estimated increased and decreased risks were of sufficient magnitude to nearly cancel each other out when average rate ratios for the four income quartiles were calculated The average rate ratios may be interpreted as showing no evidence that CVD hospitalization rates are elevated for residents living near the river for the general study population This finding suggests that the multivariable regression findings for the overall population show elevated risks for hospitalizations among residents of block groups near the river due to confounding of the relationship of residential location and CVD risk by socio-economic status

The findings from analyses stratified by income quartile may also be interpreted as showing that residing in a higher versus lower income quartile block group produces different effects from similar exposures (effect modification) As discussed previously the ecological ie group-level analyses are unable to directly evaluate confounding and effect modification Rather information about the strengths and direction of associations among variables in the analyses need to be evaluated in light of known risk factors to draw conclusions about likely explanations and plausible interpretations In this case uncontrolled confounding by socio-economic status appears to be the most likely explanation for the findings This is due to the strong association between residential proximity to the river and socio-economic status

The studyrsquos third objective was to compare the findings of this review to those of the earlier studies that prompted this investigation Because of the similarities in study design and statistical analysis the prior studyrsquos findings can be compared with the findings of the multivariate regression analyses in the current study using Table 1 and Tables 8a and 8b The major difference between the studies is the composition of the comparison areas for the analyses Because of this difference as well as other differences (Table 3) the studies are not entirely comparable

24

Table 1 shows some findings from the prior studies Comparison with Table 8a and 8b from this study shows that the CV hospitalization rate ratios estimated for living in block groups adjacent or near the river in the current study are substantially lower than the rate ratios for residence in ZIP codes adjacent to the Hudson River in the previously published studies In most cases the current analyses show elevations below 10 in contrast to elevations of 20 to 39 in the prior studies The results of the current study are quite similar to the prior study results in terms of the associations reported for age sex race and income (Appendix B Tables 1a and 1b) (Sergeev and Carpenter 2005 Tables 4 amp 5 Shcherbatykh et al 2005 Table 3) These similarities and the general methodological similarities between the current and prior studies suggest the likelihood that the prior studiesrsquo findings also were affected by confounding by socio-economic status

6 CONCLUSION AND RECOMMENDATIONS

While this type of study can neither prove nor disprove causal links among risk factors and health effects it is useful for examining evidence for unusual patterns that might warrant additional investigation This evaluation of hospitalization data residential location and socioshyeconomic factors in block groups in the 12 Counties abutting the Hudson River revealed a closely coinciding pattern of lower socio-economic status and residential proximity to the Hudson River the proxy for potential exposure and a strong association of lower socioshyeconomic status and risk for CVD hospitalizations The strong associations among residential location socio-economic status and CVD risk prevent us from drawing definitive conclusions from the multivariable regression analyses

The multivariable analyses showed relatively small but statistically significant elevations of CVD hospitalization risk for residents of block groups in close proximity to the Hudson River Stratified analyses conducted within each of the four income quartiles showed such elevations only among residents of lower income block groups This studyrsquos data methods and analyses can not provide a conclusive interpretation for this finding However the strong effect of socioshyeconomic status on residential location (exposure proxy) and on CVD hospitalization risk makes it likely that the overall multivariable regression results are affected by uncontrolled confounding by socio-economic status

To draw more definitive conclusions particularly for outcomes such as CVD that are determined by multiple known risk factors studies seeking to investigate the role of PCB or POP exposures will require resource-intensive methods that include gathering individual-level exposure or biomonitoring information as well as individual-level medical histories and information on other risk factors

This current investigation did not include environmental or biological sampling data One study described previously did gather such data in the Hudson River region with the highest potential exposures (Hudson Falls Fort Edward) This study detected a statistically significant reduction in the total PCB concentrations in outdoor air samples at a distance of 1200 meters (34 mile) from the river However the PCB levels in outdoor air in this area were lower than those in other communities with known PCB-contaminated sites and similar to levels reported in other locations in the northeastern United States The study found no detectable differences in study

25

subjectsrsquo serum PCB levels associated with proximity to contaminated portions of the river or other PCB sources (Fitzgerald 2007 Palmer 2008)

It is recommended that the results of the current investigation of CVD hospitalizations and residence near the Hudson River be shared with NYS residents and other stakeholders with interest in this issue and be submitted for publication in a peer-reviewed journal

26

AGENCY INFORMATION

New York State Department of Health Authors Elizabeth Lewis-Michl PhD

Research Scientist Bureau of Environmental and Occupational Epidemiology

June Beckman-Moore MPH Research Scientist Bureau of Environmental and Occupational Epidemiology

Marta Gomez MS Research Scientist Bureau of Environmental and Occupational Epidemiology

Acknowledgments This study was supported by ATSDR and the Centers for Disease Control and Prevention

through a Cooperative Agreement Grant to NYS DOH (3U61TS200002-19W1)

The authors would like to thank the following people for their contribution to this project Alissa Caton PhD of the Bureau of Environmental and Occupational Epidemiology NYS DOH developed the study protocol assembled the preliminary data and conducted preliminary analyses Syni-an Hwang PhD Bureau Director and Shao Lin PhD Research Scientist Bureau of Environmental and Occupational Epidemiology NYS DOH and Edward Fitzgerald PhD Professor Division of Environmental Health and Toxicology SUNY Albany School of Public Health reviewed and provided comments on draft versions

New York State Department of Health Cooperative Agreement Coordinator Don Miles MS

Public Health Specialist Bureau of Environmental Exposure Investigation

ATSDR Technical Project Officer Gregory V Ulirsch PhD

Environmental Health Scientist Cooperative Agreement and Program Evaluation Branch

Division of Health Assessment and Consultation

Mohammed Uddin MD Epidemiologist

Division of Health Studies

ATSDR Regional Representatives Leah Graziano

Senior Regional Representative - Region 2 Office of Regional Operations

Racquel Stephenson Regional Representative - Region 2

Office of Regional Operations

27

REFERENCES Agency for Toxic Substances and Disease Registry (ATSDR) 2000 Toxicological Profile for Polychlorinated Biphenyls (PCBs) ATSDR Division of ToxicologyToxicology Information Branch Atlanta GA

Arisawa K Takeda H Mikasa H 2005 ldquoBackground exposure to PCDDsPCDFsPCBs and its potential health effects a review of epidemiologic studiesrdquo The Journal of Medical Investigation Feb 52(1-2)10-21

Arsenescu V Arsenescu RI King V Swanson H Cassis LA 2008 ldquoPolychlorinated biphenylshy77 induces adipocyte differentiation and proinflammatory adipokines and promotes obesity and atherosclerosisrdquo Environmental Health Perspectives 116 (6) 761-768

Ayala C Croft J Keenan N Neff L Greenlund K Donohoo R Zheng Zhi Mensah G 2003 ldquoIncreasing Trends in Pre-transport stroke deaths- United States 1990-1998rdquo Ethnicity amp Disease 12 Spr S2131-S2137

Barnett E Armstrong D Casper M 1999 Evidence of increasing coronary heart disease mortality among black men of lower social class AEP 98 464-471

Bertazzi PA Bernucci I Brambilla G Consonni D Pesatori AC 1998 ldquoThe Seveso studies on early and long-term effects of dioxin exposure a reviewrdquo Environmental Health Perspectives April 1062 625-633

Carpenter DO 2006 ldquoPolychlorinated biphenyls (PCBs) routes of exposure and effects on human healthrdquo Reviews in Environmental Health Jan-Mar 21(1)1-23

deVreede-Swagemakers JJ Gorgels APDubois-Arbouw WI van Ree JW et al 1997 ldquoOut-of hospital cardiac arrest in the 1990s a population-based study in the Maastricht area on incidence characteristics and survivalrdquo Journal of the American College of Cardiology 30(6) 1500-5

Everett CJ Mainous AG Frithsen IL Player MS Matheson EM 2008 ldquoAssociation of polychlorinated biphenyls with hypertension in the 1999-2002 National Health and Nutrition Examination Surveyrdquo Environmental Research 108 94-97

Fitzgerald EF Belanger EE Gomez MI Hwang S Jansing RL Hicks HE 2007 ldquoEnvironmental exposures to polychlorinated biphenyls (PCBS) among older residents of upper Hudson River communitiesrdquo Environmental Research 104 352-360

Galea S Blaney S Nandi A Silverman R Vlahov D Foltin G Kusick M Tunik M Richmond N 2007 ldquoExplaining racial disparities in incidence of and survival from out-of-hospital cardiac arrestrdquo American Journal of Epidemiology 1665 534-543

Greenland S Morgenstern H 1989 ldquoEcological bias confounding and effect modification International Journal of Epidemiology 18269-74

28

Greenland S 1992 ldquoDivergent biases in ecologic and individual level studiesrdquo Statistics in Medicine 111209-23

Greenland S Robins J 1994 ldquoInvited commentary ecologic studies ndash biases misconceptions and counterexamples American Journal of Epidemiology 139747-60

Greenland S Pearl J Robins JM 1999 Causal diagrams for epidemiologic research Epidemiology 1037-48

Greenland S 2003 ldquoQuantifying biases in causal models classical confounding vs collidershystratification biasrdquo Epidemiology May Vol 143 300-306

Ha M Lee D Jacobs DR 2007 ldquoAssociation between serum concentrations of persistent organic pollutants and self-reported cardiovascular disease prevalence results from the national health and nutrition examination survey 1999-2002rdquo Environmental Health Perspectives 1158 1204-1209

Hennig B Meerarani P Slim R Toborek M Daugherty A Silverstone AE Robertson LW 2002 ldquoProinflammatory properties of coplanar PCBs in vitro and in vivo evidence Toxicology and Applied Pharmacology Jun 15181(3)174-83

Hertz-Picciotto I ldquoEnvironmental Epidemiologyrdquo in Modern Epidemiology Rothman KJ and Greenland S 2nd ed Philadelphia Lippincott Williams amp Williams 1998 pp555-584

Howard G Cushman M Prineas R Howard V Moy C Sullivan L DrsquoAgostino R McClure L Pulley L Safford M 2009 ldquoAdvancing the hypothesis that geographic variations in risk factors contribute relatively little to observbed geographic variations in heart disease and stroke mortalityrdquo Preventive Medicine 49 129-132

Huang X Lessner L Carpenter DO 2006 ldquoExposure to persistent organic pollutants and hypertensive diseaserdquo Environmental Research 102(2005) 101-106

Kannel WB Cupples LA DrsquoAgostino RB 1987 ldquoSudden death risk in overt coronary heart disease the Framingham Studyrdquo American Heart Journal 113(3) 799-804

Ko DT Krumholz HM Wang Y Foody JM Masoudi FA Havranek EP You JJ Alter DA Stukel TA Newman AM Tu JV 2007 ldquoRegional differences in process of care and outcomes for older acute myocardial infarction patients in the United States and Ontario Canadardquo Circulation 115196-203

Lee D Lee I Song K Steffes M Toscano W Baker BA Jacobs DR 2006 ldquoA strong dose-response relation between serum concentrations of persistent organic pollutant and diabetesrdquo Diabetes Care 29 (7) 1638-1644

Lee D Lee I Porta M Steffes M Jacobs DR 2007 ldquoRelationship between serum concentrations of persistent organic pollutants and the prevalence of metabolic syndrome among non-diabetic

29

adults results from the National Health and Nutrition Examination Survey 1999-2002rdquo Diabetologia 50 1841-1851

Lind PM Orberg J Edlund U Sjoblom L Lind L 2004 ldquoThe dioxin-like pollutant PCB126 (33rsquo44rsquo5-pentachlorobiphenyl) affects risk factors for cardiovascular disease in female ratsrdquo Toxicology Letters 150 293-299

Longnecker MP Klebanoff MA Brock JW Zhou H 2001 ldquopolychlorinated biphenyl seum levels in pregnant subjects with diabetesrdquo Diabetes Care 24(6) 1099-1101

Lloyd-Jones D Adams R Carnethon et al (AHA Statistics Committee and Stroke Statstics Subcommittee) 2009 ldquoAHA Statistical Update Heart disease and stroke statisticsmdash2009 updaterdquo Circulation 119e21-e181

MapMarker Plus V 100 MapInfo Corp 2004

Morgenstern H 1998 ldquoEcologic studyrdquo In Armitage P Colton T eds Encyclopedia of biostatistics Vol2 NY NY John Wiley and Sons Inc 1255-76

Morganstern H ldquoEcologic Studiesrdquo Chapter 23 in Modern Epidemiology Rothman KJ and Greenland S 2nd ed Philadelphia Lippincott Williams amp Williams 1998 pp459-480

Morland K Wolff M Bopp R Godbold J Landrigan P 2008 ldquoFish consumption and body burden of organochlorines among Lower Hudson urban anglersrdquo American Journal of Industrial Medicine 51587-594

New York State Department of Health (NYS DOH) 2009a ldquoThe Burden of Cardiovascular Disease in New York Mortality Prevalence Risk Factors Costs and Selected Populationsrdquo Bureau of Chronic Disease Epidemiology and Surveillance Bureau of Health Risk Reduction httpwwwhealthstatenyusnysdohchronic_diseasecardiovascularburdenofcvdinnyspdf

New York State Department of Health (NYS DOH) 2009b ldquoHeart Disease and Stroke Indicatorsrdquo Community health assessment indicators Bureau of Biometrics and Health Statistics httpwwwhealthstatenyusstatisticschacchai

OrsquoNeill L 2003 ldquoEstimating out-of-hospital mortality due to myocardial infarctionrdquo Health Care Management Science 6 147-154

Palmer PM Belanger EE Wilson LR Hwang SA Narang RS Gomez MI Cayo MR Durocher LA Fitzgerald EF ldquoOutdoor air PCB concentrations in three communities along the Upper Hudson River New Yorkrdquo 2008 Archives of Environmental Contamination and Toxicology 54363-371

Piantadosi S Byar DP Green SB 1988 ldquoThe ecological fallacyrdquo American Journal of Epidemiology 127893-904

30

Rehkopf DH Berkman LF Coull B Krieger N 2008 ldquoThe non-linear risk of mortality by income level in a healthy population UgtSgt National Health and Nutrition Examination Survey mortality follow-up cohort 1988-2001rdquo British Medical Journal Public Health 8383

Richardson S Montifort C 2000 ldquoEcological correlation studies In Elliott P Wakefield JC Best NG et al eds Spatial epidemiology methods and applications NY NY Oxford University Press 205-220

SAS version 9 Cary North Carolina

Schoenbach V 2001 ldquoMulticausality Effect Modification wwwepidemiolognet Ch12 381shy413

Schoenbach V 2004 ldquoMulticausality Confounding wwwepidemiolognet Ch11 335-375

Sergeev AV and Carpenter DO 2005 Hospitalization rates for coronary heart disease in relation to residence near areas contaminated with persistent organic pollutants and other pollutants Environmental Health Perspectives 113(6)756-761

Shcherbatykh I Huang X Lessner L Carpenter DO 2005 Hazardous waste sites and stroke in New York State Environmental Health A Global Access Science Source 4(18) httpwwwehjournalnetcontent4118

Siemiatycki J Wacholder S Dewar R Cardis E Greenwood C and Richardson L 1988 ldquoDegree of confounding bias related to smoking ethnic group and socio-economic status in estimates of the associations between occupation and cancerrdquo Journal of Occupational Medicine 30 617-625

Tu YK Gunnell D Gilthorpe MS 2008 ldquoSimpsonrsquos paradox Lordrsquos paradox and suppression effects are the same phenomenon- the reversal paradoxrdquo Emerging Themes in Epidemiology 52

United Kingdom Heart Attack Study Collaborative Group Norris RM 1998 ldquoFatality outside hospital from acute coronary event in three British health districts 1994-1995rdquo British Medical Journal 316(7137) 1065-70

United States Bureau of the Census 2000 Census of population and housing summary file 1 (SF1) United States Department of Commerce 2001

United States Bureau of the Census 2000 Census of population and housing summary file 3 (SF3) United States Department of Commerce 2002

United States Bureau of the Census 1990 Census of population and housing summary file 1 (SF1) United States Department of Commerce 1991

31

United States Bureau of the Census 1990 Census of population and housing summary file 3 (SF3) United States Department of Commerce 1992

United States Censusrsquo Monitoring Board ldquoFinal Report to Congressrdquo September 1 2001 httpgovinfolibraryunteducmbcmbpreportsfinal)report

United States Department of Health and Human Services (USDHHS) Agency for Healthcare Research and Quality 2007 ldquoAHRQ Quality Indicators Guide to Prevention Quality Indicators Hospital Admission for Ambulatory Care Sensitive Conditionsrdquo October 2001 Version 31 (March 12 2007) downloaded 2009 httpwwwqualityindicatorsahrqgov

United States Department of Health and Human Services (USDHHS) Centers for Disease Control 2007 Health United States A Chartbook National Center for Health Statistics wwwcdcgovnchsdatahushus07pdf

United States Department of Health and Human Services (USDHHS) Centers for Disease Control National Center for Health Statistics 2007 ldquo2005 National Hospital Discharge Surveyrdquo No 385 by De Frances C and Hall M Division of Health Care Statistics

United States Department of Health and Human Services (USDHHS) 2009 Centers for Disease Control and Prevention National Center for Environmental Health 4th National Report on Environmental Chemicals httpwwwcdcgovexposurereportpdfFourthReportpdf

United States Department of Health and Human Services (USDHHS) National Institutes of Health 2010 National Institute of Environmental Health Sciences National Toxicology Program 11th Report on Carcinogens httpntpniehsnihgovntproceleventhprofiless149pcbpdf

United States Environmental Protection Agency (USEPA) 2002 EPA Superfund record of decision Hudson River PCBs Site Hudson Falls to New York City New York Washington DC

Wakefield J 2003 Sensitivity Analyses for Ecological Regression Biometrics March 59 9-17

Wakefield J 2004

Wing S 1988 Social inequalities in the decline of coronary mortality American Journal of Public Health 781415-1416

World Health Organization 2010 Atlas of Heart Disease and Stroke Part 2 Ch 3 Cardiovascular Disease Risk Factors httpwwwwhointcardiovascular_diseasesencvd_atlas_03_risk_factorspdf

32

Table 1 Previous Study Results Statewide Analyses and Hudson River Specific Analyses

Principal Diagnosis of Hospitalization

Adjusted Rate Ratioa (Lower 95CI shyUpper 95 CI)

NYS analysis Hudson River Subset

analysis Ischemic Heart Diseaseb

115 (103-129) 136 (118-156) Acute Myocardial Infarction 120 (103-139) 139 (118-163)

Cerebrovascular Diseasec 115 (105-126) 120 (110-132)

Ischemic Stroke 117(104-139) --shy Hemorrhagic Stroke 110 (099-122) ---

Hypertensionc 119 (108-131) 114 (105-123)

Table adapted from Sergeev et al 2005 Shcherbatykh et al 2005 and Huang et al 2006 Ischemic heart disease and stroke hospitalization rates for NYS ZIP codes containing (or

adjacent to) inactive hazardous waste sites with persistent organic pollutants compared to ZIP codes without (or not adjacent to) such sites

a All analyses include a 2nd exposure category for other types of inactive hazardous waste sites and adjust for age race and sex

b Ischemic heart disease analyses include all ages over 25 Ischemic heart disease analyses adjust for quartiles of median household income and health insurance coverage

c Cerebrovascular disease and hypertension analyses include ages 25-64

33

Table 2 Comparison of Design amp Methods for Previously Published Studies and Current Follow-up Study

OUTCOME ASSESSMENT CONFOUNDER ASSESSMENT EXPOSURE ASSESSMENT Previous Study Follow-up Study Previous Study Follow-up Study Previous Study Follow-up Study

Coronary Heart Disease 1993-2000 More years 1990-2005 ZIP code level Block group level ZIP code level Block group level

Ischemic heart disease (ICD-9 410shy Same outcomes with 4 levels of income ndash Same income Exposure = residence Two types of exposure 414) and sequelae of myocardial one addition separate 2000 Census categories within 196 ZIP codes indicator categories within infarction not elsewhere classified analysis of angina (ICDshy quartiles with hazardous waste 12 counties abutting (ICD-9 4297) Separate analysis of 9 413) sites containing POPs Hudson River acute myocardial infarction 6 age groups Same age groups or 222 ZIP codes with (ICD-9 410) age 25 and above hazardous waste sites Adjacency 255 block

without POPs versus groups adjacent versus Principal or other diagnoses (up to 14) Only principal diagnoses 4 race groups Different race residence within 996 2116 not adjacent to the combined AsianPacific categories White ldquocleanrdquo ZIP codes Hudson River

Use the admission Islander White black AsianPacific statewide Reference Sergeev AV Carpenter source field to eliminate black Native Island and Other which Distance Block group DO Hospitalization rates for coronary intrahospital transfers for American includes Native Sub-analyses (focus of population-weighted heart disease in relation to residence the same individual American and Multiple follow-up) Hudson centroid distance from the near areas contaminated with Race River area 78 POP ZIP Hudson River lt12 mile persistent organic pollutants and other codes versus 996 ldquocleanrdquo (225 block groups) pollutants Environ Health Perspect Add education ZIP codes statewide between frac12 and 1 mile 2005 113(6)756-761 population density (216 block groups) farther

hispanic and distance to than 1 mile (1930 block nearest hospital as groups) additional covariates

Stroke 1993-2000 More years 1990-2005 Restricted to 2nd and (same as above) (same as above) (same as above) 3rd quartile income

Cerebrovascular disease excluding Same outcomes ZIP codes other ill-defined and late effects of cerebrovascular disease (ICD9 430shy 4 age groups 25-64 436) Separate analyses of ischemic stroke (433x1 434x1 436) and 2 race groups hemorrhagic stroke ( 430-432) White black

Principal or secondary diagnoses (up Only principal diagnoses to 14) combined

Use the admission Reference Shcherbatykh I Huang X source field to eliminate Lessner L Carpenter DO Hazardous intrahospital transfers for waste sites and stroke in New York the same individual State Environ Health 2005 418

34

Table 3 Study Cardiovascular Disease Outcomes ICD-9-CM Codes and Definitions

Ischemic Heart Disease

ICD-9-CM 410 - ldquoAcute myocardial infarction (AMI)rdquo is defined as a sudden insufficiency of blood supply to an area of the heart muscle usually due to a coronary artery occlusion (obstruction) commonly known as heart attack

ICD-9-CM 411- ldquoOther acute and subacute forms of ischemic heart diseaserdquo refers to complications or symptoms occurring before or after AMI or a coronary artery occlusion (obstruction) interrupting blood flow to the heart without AMI

ICD-9-CM 412 - ldquoOld myocardial infarctionrdquo refers to a past AMI that is currently presenting no symptoms ICD-9-CM 413 - ldquoAngina pectorisrdquo is defined as severe constricting pain in the chest often radiating from the

area immediately over the heart to the left shoulder and down the arm ICD-9-CM 414 - ldquoOther forms of chronic ischemic heart diseaserdquo includes coronary atherosclerosis a chronic

condition marked by thickening and loss of elasticity of the coronary artery caused by deposits of plaque

ICD-9-CM 4297 - ldquoCertain sequelae of myocardial infarction not elsewhere classifiedrdquo refers to other symptoms or complications associated with a prior AMI

Cerebrovascular Disease

ICD-9-CM 430 - ldquoSubarachnoid hemorrhagerdquo is bleeding in the space between the brain and lining ICD-9-CM 431 - ldquoIntracerebral hemorrhagerdquo is bleeding within the brain ICD-9-CM 432 - ldquoOther and unspecified intracranial hemorrhagerdquo is bleeding nontraumatic between the skull

and brain lining ICD-9-CM 433 - ldquoOcclusion and stenosis of precerebral arteriesrdquo is blockage or stricture of the arteries

branching into the brain ICD-9-CM 434 - ldquoOcclusion of cerebral arteriesrdquo includes cerebral thrombosis and cerebral embolism ICD-9-CM 435 - ldquoTransient cerebral ischemiardquo includes cerebrovascular insufficiency (acute) with transient

neurological signs and symptoms ICD-9-CM 436 - ldquoAcute but ill-defined cerebrovascular diseaserdquo

35

Table 4 ndash Census Demographics for Study Area by Proximity to the Hudson River 2000

Census Demographics Study Area Within Block Groups Adjacent to River Block Group Centroid Distance to River Yes No lt frac12 mile frac12-1 mile gt 1 mile

n n n n n n Total Population and distribution within river distance categories

2924631 1000 315662 108 2608969 892 231414 79 283321 97 2409896 824

RaceEthnic Distribution 1000 1000 1000 1000 1000 1000 White 2383274 815 255023 808 2128251 816 154371 667 201457 711 2027446 841 Black 277302 95 34039 108 243263 93 43895 190 38959 138 194448 81 Total ldquoOtherrdquo (listed below)

American Indian 7147 02 882 03 6265 02 975 04 1058 04 5114 02

Asian 86465 30 6233 20 80232 31 4711 20 8979 32 72775 30 Pacific Islander 1058 lt01 111 lt01 947 lt01 104 lt01 145 lt01 809 lt01 Other 105182 36 12492 40 92690 36 19288 83 23338 82 62556 26 Multiple Races 64203 22 6882 22 57321 22 8070 35 9385 33 46748 19

Hispanic 268036 92 28449 90 239587 92 40691 176 46978 166 180367 75

Education lt9th grade 108312 56 13370 64 94942 55 13585 91 14518 81 80209 50 9th-12th grade 200374 103 27023 130 173351 100 23306 155 25161 140 151907 94 High school or GED 519236 267 62727 302 456509 263 44366 296 51585 287 423285 263 Some college 336283 173 36464 175 299819 173 24995 167 31141 173 280147 174 Associates degree 150134 77 16574 80 133560 77 10134 68 12939 72 127061 79 Bachelors degree 342297 176 28578 137 313719 181 18898 126 24584 137 298815 185 Graduate or professional degree

285823 147 23199 112 262624 151 14813 99 19909 111 251101 156

+Other SES Factors Households below poverty 91940 87 12875 108 79065 83 13977 154 14685 140 63278 72

mean SD mean SD mean SD mean SD mean SD mean SD Population density 57466 9589 54496 11391 57826 93468 13501 16071 11068 16248 43763 66909 Median household income 61010 31989 47320 23630 62668 32468 40748 21923 44962 20829 64847 32564

Sources US Bureau of the Census 2000 Census of population and housing summary file 1(SF1) US Department of Commerce 2001 US Bureau of the Census 2000 Census of population and housing summary file 3 (SF3) US Department of Commerce 2001

36

Table 5 Study Population Person Years by River Proximity Category For Income Quartiles and Race Categories

Person Years Total Adjacent Non-Adjacent lt12 mile frac12 to 1 mile gt1 mile n (row)

[column ] n (row)

[column ] n ()

[column ] n ()

[column ] n ()

[column ] n ()

[column ] Total Population 30219001 (1000)

[1000]

3269547 (108)

[1000]

26949454 (892)

[1000]

2380477 (079)

[1000]

2834956 (094)

[1000]

25003568 (827)

[1000]

Income Quartile

Lowest Income 7182853 (1000)

[237]

1345251 (187)

[411]

5837601 (813)

[217]

1462710 (204)

[614]

1236936 (172)

[436]

4483206 (624)

[179]

2nd Income 7778921 (1000)

[257]

1077888 (139)

[330]

6701033 (861

[249]

420361 (054)

[177]

808782 (104)

[285]

6549777 (842)

[262]

3rd Income 7671943 (1000)

[254]

551617 (072)

[169]

7120326 (928)

[264]

344840 (045)

[145]

511237 (067)

[180]

6815866 (888)

[273]

Highest Income 7585283 (1000)

[251]

294790 (039)

[090]

7290493 (961)

[271]

152565 (020)

[064]

278000 (037)

[098]

7154718 (943)

[286]

Race Category

White 25859811 (1000)

[856]

2784914 (108)

[851]

23074897 (892)

[856]

1761910 (68)

[740]

2238348 (87)

[790]

21859554 (845)

[874]

Black 2466185 (1000

[82]

297220 (121)

[91]

2168965 (879)

[81]

389721 (158)

[164]

316704 (128)

[112]

1759760 (714)

[70]

Asian 792737 (1000)

[26]

58007 (73)

[18]

734731 (827)

[27]

45151 (57)

[19]

78161 (99)

[28]

669426 (844)

[27]

Multiple Other 1100269 (1000)

[36]

129407 (118)

[40]

970862 (882)

[36]

183696 (167)

[77]

201743 (183)

[71]

714830 (650)

[29]

37

Table 6 Study Hospitalization Record Exclusions

After After dropping After imputing dropping records with

After excluding race amp poorly missing continuation After excluding eliminating geocoded covariates

Geocoded records transfers missing race records (racesex)

Total 202272 201814 02 169143 162 167733 08 154236 80 154220 001 (Loss ) Demograp hic group

White 155216 849 154805 849 132749 859 144262 860 132343 858 132332 858 Black 12878 70 12849 70 11109 72 12010 72 11328 73 11326 73 OtherNa 12745 70 12741 70 8814 57 9313 55 8647 56 8644 56 AsianPI 2021 11 2020 11 1810 12 2148 13 1918 12 1918 12 Missing Race 19412 96 19399 96 14661 87 (1410) Diagnosis

IHD 137569 680 137303 680 111005 656 110005 656 110000 656 101012 655

AMI 51840 256 51635 256 39338 233 39338 233 39013 233 35811 232

Angina Stroke

5792 64703

29 320

5788 64511

29 320

5316 58138

31 344

5316 58138

31 344

5285 57712

31 344

4731 53208

31 345

Isch str 23673 117 23543 117 21012 124 21012 124 20912 125 19371 126 Hem str 8097 40 8074 40 6544 39 6444 39 6492 39 5995 39

Missing covariates 11 records were missing sex 325 were missing age and 14661 records were missing race Race was imputed by random assignment based on block group of residencersquos race distribution for 13251 records (310 were also missing other covariates For approximately 1100 records with missing race geocoding was inadequate for assigning block group and 310 of those missing race were also missing additional covariates) One discharge can have one or more covariates missing (ie missing covariates are not mutually exclusive

38

Table 7 Income Quartile and Race Distributions for CVD Hospitalizations

Study Population

IHD n=101012

AMI n=35811

Angina n=4731

Stroke n=53208

Ischemic Stroke n=19371

Hemorrhagic Stroke n=5995

Income (Quartiles) Lowest Income 237 286 300 356 298 309 301 Second Income 257 254 265 237 256 256 252 Third Income 254 241 236 224 236 234 223 Highest Income 251 221 201 182 209 201 224

Total 1000 999 999 1000 1000

White 855 859 885 833 858 844 809 Black 82 64 62 102 92 108 110 Asian 26 12 12 11 12 14 19 Other 36 65 41 53 38 34 62

Study population income quartiles are based on census block median income Quartile cut-points were developed by weighting block group median income levels by block group population to identify the cut-points within the continuous income distribution After creating the cut-points the study population of each block group was assigned to the appropriate quartile category This resulted in the percentage of population in each quartile varying slightly from 25 because the entire populations of block groups were assigned to the same quartile

39

Table 8a Exposure Area Distributions and Rate Ratios for Ischemic Heart Disease Hospitalizations

Exposure Person-Years

(N=30219001) in

Ischemic Heart Disease Hospitalizations Total AMI Angina

No () No () RR (95 CI) No () RR (95 CI) No () RR (95 CI)

Adjusted for Age Sex Race Adjacent Block Group Yes 3269547 (108) 12327 (122) 133 (112-157) 4398 (123) 136 (114-163) 591 (125) 123 (106-142) No 26949454 (892) 88611 (878) 100 Reference 31380 (877) 100 Reference 4137 (875) 100 Reference Distance From River lt frac12 mile 2380477 (079) 9817 (97) 120 (103-140) 3399 (095) 126 (109-145) 542 (115) 160 (137-188) frac12 - 1 mile 2834956 (094) 11236 (111) 120 (104-140) 4048 (113) 125 (109-144) 610 (129) 156 (133-183) gt1 mile 25003569 (827) 79885 (791) 100 Reference 28331(792) 100 Reference 3576 (756) 100 Reference

Addrsquol adjustment for SES and other factors Adjacent Block Group Yes 3269547 (108) 12327 (122) 107 (104-111) 4398 (123) 106 (101-110) 591 (125) 106 (096-117) No 26949454 (892) 88611 (878) 100 Reference 31380 (877) 100 Reference 4137 (875) 100 Reference Distance From River lt frac12 mile 2380477 (079) 9817 (97) 102 (098-106) 3399 (095) 103 (098-109) 542 (115) 109 (098-121) frac12 - 1 mile 2834956 (094) 11236 (111) 103 (099-108) 4048 (113) 106 (101-111) 610 (129) 113 (102-125) gt 1 mile 25003569 (827) 79885 (791) 100 Reference 28331(792) 100 Reference 3576 (756) 100 Reference

Agegt25 1990-2005 50 sample Adjusted for age (25-34 35-44 45-54 55-64 65-74 75+ years) race (white black other) and sex (male female) Adjusted for SES factors quartiles of population density (61-5066 5066lt-21020 21020lt-64102 64102lt-943716 persons per square mile) median 1999 household income ($249900-4096800 $4096800lt-5465000 $5465000lt-7294400 $7294400lt-20000100) with less than a college education (0-306 306lt-427 427lt-555 555lt-1000) with Hispanic ethnicity (0-21 21lt-47 47lt- 96 96lt-860) and distance to nearest hospital

40

Table 8b Exposure Area Distributions and Rate Ratios for Stroke Hospitalizations Age gt= 25 1990-2005

Exposure Person-Years

(N=30219001) in Study Area

Stroke Hospitalizations

Total Ischemic Hemorrhagic

No () No () RR (95 CI) No () RR (95 CI) No () RR (95 CI)

Adjusted for Age Sex Race Adjacent Block Group Yes 3269547 (108) 6276 (118) 125 (109-144) 2334 (121) 128 (110-148) 635 (108) 099 (086-113)

No 26949454 (892) 46748 (882) 100 Reference 16989 (879) 100 Reference 5263 (892) 100 Reference

Distance From River lt frac12 mile 2380477 (079) 5280 (099) 116 (104-131) 1925 (100) 120 (105-137) 518 (088) 102 (088-117)

frac12 - 1 mile 2834956 (094) 6023 (114) 118 (106-132) 2227 (115) 127 (112-145) 658 (111) 115 (100-131)

gt1 mile 25003569 (827) 41721 (787) 100 Reference 15171 (785) 100 Reference 4722 (801) 100 Reference

Addl Adjustment for SES and Other Factors Adjacent Block Group Yes 3269547 (108) 6276 (118) 107 (103-112) 2334 (121) 110 (104-116) 635 (108) 097 (088 ndash 106)

No 26949454 (892) 46748 (882) 100 Reference 16989 (879) 100 Reference 5263 (892) 100 Reference Distance From River lt frac12 mile 2380477 (079) 5280 (099) 106 (101-110) 1925 (100) 108 (101-115) 518 (088) 089 (081-099)

frac12 - 1 mile 2834956 (094) 6023 (114) 106 (102-111) 2227 (115) 111 (104-117) 658 (111) 108 (099-119)

gt 1 mile 25003569 (827) 41721 (787) 100 Reference 15171 (785) 100 Reference 4722 (801) 100 Reference

Age gt 25 years 1990-200650 sample Adjusted for age (25-34 35-44 45-54 55-64 65-74 75+ years) race (white black other) and sex (male female) Adjusted for SES factors quartiles of population density (61-5066 5066lt-21020 21020lt-64102 64102lt-943716 persons per square mile) median 1999 household income ($249900-4096800 $4096800lt-5465000 $5465000lt-7294400 $7294400lt-20000100) with less than a college education (0-306 306lt-427 427lt-555 555lt-1000) with Hispanic ethnicity (0-21 21lt-47 47lt- 96 96lt-860) and distance to nearest hospital

41

Table 9 Rate Ratios for CVD Hospitalizations Stratified by Income Quartile

Rate Ratio for lt frac12 mile versus gt 1 mile from river

IHD AMI Angina Stroke Ischemic Stroke Hemorrhagic Stroke

RR (95CI) RR (95CI) RR (95CI) RR (95CI) RR (95CI) RR (95CI)

Adjusted for Age Sex Race Lowest Income 1 123 (108-140) 124 (110-139) 139 (120-160) 120 (109-133) 122 (107-138) 118 (103-137) Income 2 115 (101-131) 118 (101-139) 155 (124-195) 124 (113-135) 127 (109-147) --shyIncome 3 088 (075-103) 083 (068-102) 102 (076-137) 092 (079-107) 093 (076-113) --shyHighest Income 4 080 (064-099) 074 (054-103) 030 (013-070) 078 (065-093) 071 (053-095) 043 (024-076) Weighted average 100 098 091 102 100 071

Addrsquol Adjustment for other demographic factors Lowest Income 1 114 (107-120) 114 (106-123) 112 (097-130) 116 (109-124) 120 (110-131) 109 (095-125) Income 2 110 (102-118) 111 (101-123) 136 (108-172) 110 (101-119) 112 (099-128) 095 (076-119) Income 3 085 (077-094) 086 (075-097) 093 (068-126) 097 (087-108) 096 (082-112) 046 (033-066) Highest Income 4 068 (060-078) 069 (057-085) 028 (012-062) 069 (059-081) 058 (044-077) --shyWeighted average 092 093 079 096 093 078

To reduce the complexity of this Table only one of the distance categories is provided The other distance categories showed similar results Adjusted for age (25-34 35-44 45-54 55-64 65-74 75+ years) race (white black other) and sex (male female) Block group median income quartile ranges $249900-4096800 $4096800lt-5465000 $5465000lt-7294400 $7294400lt-20000100 Adjusted for demographic factors quartiles of population density (61-5066 5066lt-21020 21020lt-64102 64102lt-943716 persons per square mile) with less than a college education (0-306 306lt-427 427lt-555 555lt-1000) with Hispanic ethnicity (0-21 21lt-47 47lt- 96 96lt-860) and distance to nearest hospital

42

Table 10 Hospitalization Rate Estimates for Current Study and from Published Sources (age-adjusted to 2000 US standard per 10000)

Current Study 12 Counties abutting Hudson River 1990-2005

Total US CDC 2005

Northeastern US CDC 2005

SPARCS NYS excluding NYC 2005-2007

IHD 649 619 691

AMI 232 231 278

Angina 31 Stroke 349 303 307 351

Ischemic Stroke 127 39 Hemorrhagic Stroke

Source USDHHS 2005 National Hospital Discharge Survey No 385 July 12 2007 Source NYS DOH SPARCS 2009b Comparable estimates were not able to be provided because published outcome groupings differed

43

Figure 1 ZIP codes by Adjacency to the Hudson River

44

Figure 2 Census Block Groups by Adjacency to the Hudson River

45

Figure 3 Census Block Groups by Population-Weighted Centroid Distance to the Hudson River

46

IHD Hospitalization Rates by Sex Race Income (age-adjusted) (other race

group excluded)

000

200

400

600

800

1000

1200

1400

1600

Low est IncQ

Q2 Q3 Highest Inc Q

Income Quartile

Hos

pita

lizat

ions

per

10

00

w hite men black men

asian men w hite w omen

black w omen asian w omen

AMI Hospitalization Rates by Sex Race Income (age-adjusted) (other race

group excluded)

000

100

200

300

400

500

600

700

Low est IncQ

Q2 Q3 Highest Inc Q Income Quartile

Hos

pita

lizat

ions

per

10

00

w hite men black men

asian men w hite w omen

black w omen asian w omen

Stroke Hospitalization Rates by Sex Race Income (age-adjusted) (other

race group excluded)

000

100

200

300

400

500

600

700

800

900

Low est In Q

Q2 Q3 Highest Inc Q Income quartile

Hos

pita

lizat

ions

per

10

00w hite men black men

asian men w hite w omen

black w omen asian w omen

Figure 4 Figure 5 Figure 6

47

MAURICE D HINCHEY 22NO DISTRICT NEW YORK

COMMITTEE ON APPROPRIATIONS

~lIC()foltIIInas

AGRICUJTlAE RURAL OEElOPWNT rooo AND ORUG ADMINISTRATION

AND RBATEO AGEfjC~S

INTERIOR

~onlJltss of tl)t ~nttn ~tatt5 ~oU5t of ItpttUtlltl1tibeg lHaJington 111laquo 20515-3222

October 4 2006

Howard Frumkin MD DrPH Director Agency for Toxic Substances and

Disease Registty and CDCs National Center for Environmental Health

1600 Clifton Road NE MSE-28 Atlanta GA 30333

Dear Dr Frumkin

First I would like to congratulate you on your appoinbnent as Director of A TSDR and CDCs National Center for Environmental Health llook forward to the opportunity to work with you on important public health issues this year and in the future

My second reason for writing today is to request that ATSDR assist my office in conducting a health consultation study for residents along a 200 mile segment of the Hudson River a Superfund site on the National Priority List containing PCBs in order to measure whether further evaluation may be warranted

As you may know Dr David Carpenter has done a number of studies at SUNY Albany in New York that suggest that resideilcy in proximity to hazardous waste sites with Persistant Organic Pollutants (POPs) may be associated with increased hospitalization rates for Coronary Heart Disease (CHD) when compared to hospitalization rates tor areas with no waste sites or other types of waste not catagorized as POPs I have enclosed a copy of three of these research papers that demonstrate this correlation for heart disease hypertension and stroke

I am very concerned about the health of the residents living along the Hudson River and respectfully request that your agency look into what seems to be a very serious health threat For purposes of my request T would appreciate your agency looking into the subset population Dr Carpenter examined in 78 zip codes along the 200 mile stretch of the Hudson River from Hudson Falls to New York City This subset population studied by Dr Carpenter showed a statistically significant increase in hospital discharges for heart disease hypertension and stroke Interestingly according to CDCs data base the Behavioral Risk Factor Surveillance System this population on average has higher incomes exercises more frequently and conswnes more fruits and vegetables than the average New Yorker Additionally this population has more nonsmokers than the rest of Upstate New Yorks population

WASHINGTON OIFCE

2411 RAYBURN HOUSE OfFICE BUILDING WASH1NcrroN DC 20amp1amp-3222

(Zl2) 225-6335

_fOMetoYht lnchIY

BlNGI-UMTON OfRCE m-wA OfFICE KIGSTCN OFFIepound MIDDLnWJN OffICE MOJmCEllO OffiCE IDDA IfOCML lUllDlNG 123 SOUTH CAYJOA smra U l WAll

m STREET em HALL THlflD flOOR 18 ANWANf LAKf CAD

RIIrlGHAtATON NY 13901 SUITE 201 KINGSTOfl 12401 te JUpoundS STilEET MONTlCfllO NY 12701 (6171773-2768 ITHACA HY 14860 (M5)3~ MlDgtlETOWN NY 109(0 (amptS) 7S1-71e

(607) 273-~388 (845) l4C-3l11

Appendix A Representative Hinchey Letter

Dr Howard Frumkin Page 2 October 4 2006

Since this type of population would typically exhibit better than average cardiac health Im sme you can understand and appreciate my concern and sense of urgency that we must follow up on Dr Carpenters good work and detcnnine what further study is warranted to protect the health of the populatioD living and working aloDg the Hudson River

I than1c you for yom kind attention to my request I look forward to hearing back from you at your earliest convenience

Best regards

Sincerely

~ane~ey Member of Congress

MHlag

49

Appendix B Table 1a IHD Hospitalization Rate Ratios agesexrace adjusted versus additionally adjusted For residents (age 25 and older) of block groups closer versus more distant from the Hudson River in models adjusting for sex age race and block group median household income population density education level percent Hispanic and block group centroid distance to nearest hospital 1990-2005

Adjusted Rate Ratios by Block Group River Proximity (95 Confidence Intervals) IHD (N=101012) AMI (N=35811) Angina (N=4731)

Risk Factor Agesexrace adjusted

Addrsquol adjustment Agesexrace adjusted

Addrsquol adjustment Agesexrace adjusted

Addrsquol adjustment

Block group distance from River

lt frac12 mile vs gt 1 mile 120 (103-140) 102 (098-106) 126 (109-145) 103 (098-109) 160 (137-188) 109 (098-121) frac12 - 1 mile vs gt 1 mile 120 (104-140) 103 (099-107) 125 (109-144) 106 (101-111) 156 (133-183) 113 (102-125)

Sex Male vs female 197 (174-223) 214 (208-220) 211 (187-238) 209 (202-215) 113 (099-129) 113 (106-121) Race Black vs white 120 (102-142) 093 (089-097) 109 (094-127) 092 (087-097) 178 (152-208) 121 (108-136)

AsianPacific Is ldquo 080 (066-096) 073 (069-079) 085 (070-103) 072 (065-081) 075 (055-102) 065 (049-087) Other ldquo 246 (208-291) 377 (363-392) 189 (162-220) 204 (191-218) 257 (215-307) 205 (177-238)

Age 35-44 vs 25-34 809 (630-104) 824 (743-913) 647 (494-849) 658 (568-762) 491 (348-692) 498 (376-660) 45-54 ldquo 333 (261-426) 344 (312-380) 228 (175-296) 230 (200-265) 1539 (1108-214) 157 (120-206) 55-64 ldquo 827 (647-1057) 802 (727-885) 576 (443-748) 516 (449-593) 278 (200-386) 284 (217-371) 65-74 ldquo 1477 (1155-1890) 1384 (1255-1527) 1067 (820-1387) 899 (782-1033) 409 (294-567) 449 (344-586) 75+ ldquo 2124 (1656-2724) 1942 (1760-2142) 2190 (1683-2849) 1849 (1610-2123) 661 (476-919) 695 (533-905)

Population density

2nd lowest vs least dense

126 (121-132) 131 (124-137) 142 (127-159)

more dense ldquo 130 (124-136) 134 (127-141) 131 (116-147) most dense ldquo 120 (114-126) 121 (114-129) 115 (100-131)

Median Household Income

2nd highest vs highest income

105 (101-110) 113 (108-119) 123 (110-137)

lower ldquo 109 (104-113) 123 (117-130) 125 (111-141) lowest ldquo 116 (111-122) 129 (122-137) 162 (142-185)

with less than college

2nd lowest vs least college

114 (109-118) 118 (113-124) 110 (098-122)

higher ldquo 122 (117-127) 126 (119-132) 115(102-129) highest

ldquo

122 (116-128) 128 (120-135) 129 (113-148)

Hispanic 2nd lowest vs least Hispanic

093 (089-097) 094 (0890-098) 101 (091-112)

higher ldquo 092 (088-096) 096 (092-101) 118 (107-131) highest

ldquo

085 (081-088) 088 (084-093) 138 (123-154)

Distance to Hospital

2nd closest vs closest 097 (094-100) 094 (090-098) 107 (098-118) more distant ldquo 090 (087-094) 093 (089-098) 096 (086-107) furthest ldquo 087 (083-091) 091 (087-096) 089 (079-101)

50

Appendix B Table 1b Stroke Hospitalization Rate Ratios agesexrace adjusted versus additionally adjusted For residents (age 25 and older) of block groups closer versus more distant from the Hudson River in models adjusting for sex age race and block group median household income population density education level percent Hispanic and block group centroid distance to nearest hospital 1990-2005

Adjusted Rate Ratios by Block Group River Proximity (95 Confidence Intervals) Total Stroke (N=53208) Ischemic Stroke (N=19371) Hemorrhagic Stroke (N=5995)

Risk Factor Agesexrace adjusted

Addrsquol adjustment Agesexrace adjusted

Addrsquol adjustment Agesexrace adjusted

Addrsquol adjustment

Block group distance from River

lt frac12 mile vs gt 1 mile 116 (104-131) 106 (101-110) 120 (105-137) 108 (101-115) 102 (088-117) 089 (081-099) frac12 - 1 mile vs gt 1 mile 118 (106-132) 106 (102-111) 127 (112-145) 111 (104-117) 115 (100-131) 108 (099-119)

Sex Male vs female 114 (104-125) 127 (123-130) 126 (113-141) 131 (126-136) 114 (103-127) 123 (116-130) Race Black vs white 194 (173-219) 153 (147-160) 233 (204-267) 191 (180-203) 215 (188-246) 190 (173-209)

AsianPacific Isl ldquo 097 (083-113) 088 (080-096) 114 (094-138) 101 (088-116) 142 (114-177) 129 (106-157) Other ldquo 230 (203-260) 232 (219-245) 229 (197-265) 212 (193-232) 360 (310-417) 334 (296-377)

Age 35-44 vs 25-34 309 (251-379) 303 (269-343) 359 (267-483) 362 (289-455) 274 (212-353) 268 (219-329) 45-54 ldquo 114 (938-139) 1087 (970-1217) 149 (113-197) 135 (109-167) 655 (513-836) 605 (500-734) 55-64 ldquo 331 (273-402) 329 (295-368) 433 (329-569) 418 (340-514) 119 (933-152) 109 (904-132) 65-74 ldquo 826 (682-1001 861 (772-959) 1145 (872-1504) 1114 (908-1367) 214 (168-272) 224 (186-270) 75+ ldquo 1890 (1559-2290) 2037 (1828-2269) 2768 (2112-3627) 3060 (2497-3750) 554 (438-700) 587 (490-703)

Population density

2nd lowest vs least dense

126 (120-132) 124 (117-132) 129 (117-142)

more dense ldquo 126 (120-132) 121 (113-129) 142 (128-157) most dense ldquo 124 (118-132) 128 (119-138) 150 (133-169)

Median Household Income

2nd highest vs highest income

106 (101-110) 110 (103-117) 099 (090-108)

lower ldquo 113 (108-118) 113 (106-121) 107 (097-118) lowest ldquo 116 (110-122) 117 (108-126) 114 (102-127)

with less than college

2nd lowest vs least college

111 (107-116) 112 (106-119) 098 (090-107)

higher ldquo 120 (114-125) 120 (113-128) 102 (093-112) highest ldquo 116 (110-122) 119 (110-128) 093 (090-108)

Hispanic 2nd lowest vs least Hispanic

096 (092-100) 094 (088-099) 084 (077-091)

higher ldquo 101 (096-105) 096 (091-102) 084 (077-091) highest ldquo 096 (092-100) 088 (082-093) 078 (070-085)

Distance to Hospital

2nd closest vs closest 095 (091-098) 092 (087-097) 097 (089-105) more distant ldquo 092 (088-096) 091 (086-097) 089 (081-097) furthest ldquo 085 (081-090) 082 (077-088) 088 (079-098)

51

Appendix B Table 2 Sex and Race-Specific Age-Adjusted IHD AMI and Stroke Rates by Income (per 1000)

Men Women white black asian other white black asian other

IHD Lowest IncQ 470 401 492 940 283 354 477 543 Q2 425 339 675 1443 221 236 364 690 Q3 436 301 420 2463 199 233 236 902 Highest Inc Q 399 325 218 4307 160 172 120 1247 AMI Lowest IncQ 180 154 203 269 101 112 221 159 Q2 159 141 300 306 085 088 138 179 Q3 152 114 139 480 078 084 096 214 Highest Inc Q 135 130 046 820 062 065 052 275 Stroke Lowest IncQ 201 292 330 370 166 281 343 261 Q2 182 254 389 466 150 247 274 373 Q3 192 238 231 598 145 245 201 401 Highest Inc Q 177 275 102 905 132 197 070 776

These rates are estimated from the 50 sample of hospitalizations in the 12-county study area included in this investigation The rates are adjusted to the 2000 standard US population

52

CERTIFICATION

The health consultation for the Hudson River PCBs site Residential Proximity to the Hudson River and Hospitalization Rates for Ischemic Heart Disease and Stroke 1990shy2005 was prepared by the New York State Department of Health under a cooperative agreement with the Agency for Toxic Substances and Disease Registry (ATSDR) It is in accordance with approved methodology and procedures existing at the time the health consultation was initiated Editorial review was completed by the cooperative agreement partner

The Division of Health Assessment and Consultation (DHAC) ATSDR has reviewed this public health assessment and concurs with its findings

ATSDR

Page 9: HR CVD Final HCl 2011

PCB levels associated with proximity to contaminated portions of the river or other PCB sources (Fitzgerald 2007 Palmer 2008)

It is recommended that the results of the current investigation of CVD hospitalization rates and residence near the Hudson River be shared with NYS residents and other stakeholders with interest in this issue and be submitted for publication in a peer-reviewed journal

v

1 INTRODUCTION

The New York State Department of Health (NYS DOH) has a cooperative agreement with the Agency for Toxic Substances and Disease Registry (ATSDR) to perform environmental exposure and health assessments conduct health statistics reviews and perform epidemiological studies of populations in New York State which may have been exposed to environmental contaminants In October 2006 Member of Congress Maurice Hinchey requested that ATSDR conduct a health consultation to assess cardiovascular disease among people residing near the Hudson River A 200-mile segment of the Hudson River from Hudson Falls south to New York City is a National Priority List (NPL) (Superfund) site due to contamination of the river with polychlorinated biphenyls (PCBs) from GE sites in Hudson Falls and Ft Edward (See Appendix A for Member of Congress Hincheyrsquos letter)

In support of his request Member of Congress Hinchey cited three studies (Sergeev and Carpenter 2005 [ischemic heart disease] Shcherbatykh et al 2005 [stroke] Huang et al 2006 [hypertension]) These statewide studies suggest residents of ZIP codes containing hazardous waste sites with persistent organic pollutants (POPs) such as PCBs have elevated risk for hospitalizations for cardiovascular disease including ischemic heart disease stroke and hypertension Analyses conducted separately for ZIP codes next to the Hudson River suggest residents of these specific ZIP codes also have increased rates of hospitalization for cardiovascular disease

These studies published in 2005-2006 are ecological studies In an ecological study data are analyzed by evaluating risk factors and health outcomes among groups For example rates of disease are compared for populations of ZIP codes with varying median income levels to see if income levels appear to be related to disease outcome rates In these previously conducted studies rates of hospitalization for residents of ZIP codes with hazardous waste sites containing POPs were compared with hospitalization rates for residents of ZIP codes without such waste sites to look for evidence that potential exposures might be impacting health outcomes The studies report statistically significant elevations of cardiovascular disease hospitalizations in ZIP codes with POPs For coronary heart disease and a subset acute myocardial infarction the study showed elevations of approximately 15 and 20 respectively (Sergeev and Carpenter 2005) for stroke a 15 elevation (Shcherbatykh et al 2005) and for hypertension a 19 elevation (Huang et al 2006) in ZIP codes containing waste sites with POPs compared with hospitalization rates in other New York State ZIP codes (excluding NYC) that did not contain waste sites with POPs (Table 1)

In sub-analyses in these studies cardiovascular hospitalization rates for ZIP codes adjacent to the Hudson River were compared with cardiovascular hospitalization rates for ZIP codes in the rest of the state (excluding NYC) identified as not having inactive hazardous waste sites with POPs These sub-analyses showed statistically significantly elevated hospitalization rates of 36 and 39 for coronary heart disease and the subset acute myocardial infarction a 20 elevation for stroke and a 14 elevation for hypertension (Table 1) The Sergeev and Carpenter Shcherbatykh et al and Huang et al articles state that the Hudson River ZIP codes are in an area where people have higher income smoke less exercise more and have healthier diets This information is cited in these articles as evidence that known risk factors related to lower socioshy

1

economic status are likely not the cause for the elevated hospitalization rates in these Hudson River area ZIP codes

In these previously published reports Behavioral Risk Factor Surveillance System (BRFSS) data for entire counties are provided to support statements about the population of Hudson River ZIP codes having higher income and healthier lifestyles However because the ZIP codes adjacent to the Hudson River include only a proportion of the population of the counties abutting the river these county-level BRFSS data may be misleading regarding characteristics of the ZIP codes adjacent to the river (Figure 1) In fact data from the US Census show that the ZIP codes adjacent to the river have lower median income than the counties within which they are located The median income for the total population of the twelve counties adjacent to the Hudson River was $61010 in 2000 while the median income for the ZIP codes adjacent to the river was $45356 This median income for the population of the ZIP codes adjacent to the river is also slightly lower than the statewide (excluding NYC) median income of $47735 (US Census 2000) Thus the Hudson River ZIP codes analyzed in the Sergeev and Carpenter Shcherbatykh et al and Huang et al studies include a population of lower not higher socio-economic status compared to the rest of NYS (excluding NYC)

Because CVD and many CVD risk factors are associated with lower socio-economic status it is especially important to adjust as much as possible for socio-economic status when conducting comparative analyses of CVD hospitalizations or disease incidence or prevalence Regarding the hypothesis about POPs and CVD research studies described in more detail below have suggested possible biological mechanisms for POP exposuresrsquo effects on the heart There is very little research on human exposures to POPs and potential cardiovascular effects however

In response to Congressman Hincheyrsquos request for follow-up of the 2004-2005 findings ATSDR and NYS DOH agreed to conduct a similar study also ecological but at a different level of geography the Census block group By using a level of geography that is generally smaller in area and population than a ZIP code this follow-up seeks to more accurately adjust for socioshyeconomic and other factors that affect cardiovascular hospitalizations and to improve upon the geographic classification of relative distance to the Hudson River In addition rather than comparing hospitalizations among residents of Hudson River area ZIP codes to hospitalizations among residents of all other NYS (exclusive of NYC) ZIP codes with no waste sites with POPs this follow-up investigationrsquos analyses are restricted to the twelve counties abutting the Hudson river from Washington County south to Westchester County Within the study arearsquos twelve counties cardiovascular hospitalization rates in areas closest to the Hudson River are compared with rates in areas farther from the river This more localized approach may also assist with reducing the potential impact of regional differences in medical care practice or reporting factors that could affect the hospitalization data (Ko et al 2007) (Table 2 shows a more detailed listing of the methods used in the prior studies and follow-up study)

Regarding environmental exposures it is important to note that living near a hazardous waste site that contains PCBs or other POPs does not necessarily mean that human exposure from the site has occurred Depending upon the conditions at a site or the specific chemicals exposures may be highly improbable Information on body burden for example PCB levels in blood or on levels of the chemical in off-site air dust or soil is needed to support a claim of exposure The

2

studies published in 2005-2006 did not provide specific information such as environmental or biomonitoring data in support of the assumption that residence in ZIP codes containing hazardous waste sites with POPs or residence in ZIP codes adjacent to the Hudson River resulted in increased exposures to POPs in the population For the purpose of this follow-up we did not conduct environmental or human biological sampling to address this issue Rather this follow-up replicates the prior studiesrsquo approach with some enhancements to better control for confounding particularly confounding by socio-economic status

This descriptive epidemiologic investigation uses already existing hospitalization data from the Statewide Planning and Research Cooperative System and population data from the United States Census to evaluate comparative rates of cardiovascular hospitalizations While this type of investigation can not show proof of cause and effect it can assist in interpreting relationships among indicators of CVD burden and hypothesized exposures associated with distance from the Hudson River This descriptive investigation may also assist in determining whether additional health investigations of these associations are warranted

2 BACKGROUND

Cardiovascular Disease

The widely known risk factors for cardiovascular disease (CVD) in addition to age sex race and family history include tobacco use high blood pressure high serum cholesterol alcohol use obesity low fruit and vegetable intake physical inactivity diabetes low socio-economic status mental ill-health psychosocial stress and the use of certain medications (WHO 2010 NYS DOH 2009 Wing 1988 NYS DOH 2009a)

CVD Definitions CVD is a general term for any disease of the circulatory system The two main types of CVD are ischemic heart disease also termed coronary heart disease and cerebrovascular disease also called stroke Ischemic heart disease (IHD) results from restricted blood flow through the arteries supplying the heart muscle The most serious risk from IHD is acute myocardial infarction (AMI) another term for heart attack AMI occurs when blood flow to the heart is suddenly reduced or stopped usually due to a blockage in a coronary artery Angina is severe constricting pain in the chest often radiating from the area immediately over the heart to the left shoulder and down the arm Cerebrovascular disease refers primarily to stroke the sudden impairment of brain function resulting from interruption of circulation to the brain Ischemic stroke results from blockage of an artery supplying blood to the brain Hemorrhagic stroke occurs due to the escape of blood from a ruptured artery supplying blood to the brain

CVD Burden and Trends CVD is the leading cause of death for men and women of all ages in the United States and in New York State In 1999 for every death from IHD there were five hospitalizations for ischemic disease in NYS (NYS DOH 2009a) Mortality rates from CVD have been declining steadily since the 1980rsquos Research studies estimate that approximately half of the decline is due to improved treatments for CVD and its risk factors such as high cholesterol and high blood pressure and approximately half is due to decreasing incidence of CVD The decrease in incidence is due to trends in health behaviors primarily reduced smoking rates

3

CVD mortality rates do not appear to be declining for all segments of the population however and may actually be increasing among some groups particularly people of lower socio-economic status (Barnett et al 1999) This may be due in part to higher rates of smoking among people of lower socio-economic status as well as to other factors related to socio-economic status and race A growing literature points to multiple causes for CVD and other health disparities including health practices psychosocial stress limited resources discrimination and access to health care (Health United States 2007) Increases in the prevalence of diabetes and obesity in the general population partially offset advances in treatment and favorable trends for other risk factors An increasing ischemic heart disease mortality rate in women ages 35 to 54 since 1997 is attributed to these unfavorable risk factor trends (Ford 2007 Lloyd-Jones 2009) New York State (NYS) and other northeast states have high IHD and low stroke mortality with NYS having the highest IHD mortality and lowest stroke mortality among the 50 states (Howard et al 2009)

CVD Data Sources CVD mortality rates are more easily measured than incidence prevalence or hospitalizations because mortality data from vital records are a comprehensive and relatively accessible data source Information for estimating the incidence and prevalence of CVD outcomes comes primarily from health interview physical examination and record surveys Hospitalization data are available for NYS and the US but are generally collected and analyzed in order to track trends in hospital care and resources expended Actual CVD incidence data validated by physician diagnosis physician examination or reviews of medical records are not directly available for the entire US or for specific geographic areas of NYS Such information is available from population-based cohort studies that follow people over time in selected communities While the concern underlying this research project is to understand CVD levels in the community ie incidence and prevalence we do not have data that directly provide us with this information In the absence of such data hospitalizations are frequently used as indicators of the burden of disease in communities particularly in surveillance or hypothesis-generating investigations such as this one In NYS the Behavioral Risk Factor Surveillance System (BRFSS) an ongoing monthly telephone survey of adults aged 18 years and older provides information about the burden of CVD by age sex race income and education

CVD Risk Factor Data The BRFSS data for NYS show that over 20 of the population aged 65 and older reported having been diagnosed with CVD versus less than five percent of the population under 65 Males were slightly more likely to report having CVD than females Nonshyhispanic whites compared to other ethnic groups combined were slightly more likely to report having CVD None of these slight differences were statistically significant however The only statistically significant demographic risk factors in the BRFSS data were low income and education levels Statistically significant elevations of reported CVD were observed for respondents with incomes below $25000 and respondents with a high school degree or less education (NYS DOH 2009a)

Another factor that can affect the outcome measure hospitalizations is the likelihood of mortality prior to reaching the hospital Studies show a variety of factors affect whether an individual experiencing a heart attack or stroke outside the hospital survives to be counted in hospitalization data These factors include lack of knowledge about personal risk or prior disease socio-economic status race marital status and distance to the hospital (Galea 2007

4

Barnett 2006 Ayala 2003) Population-based studies report sudden cardiac death usually outside the hospital to be the first overt manifestation of heart disease for 40 to 60 of all cases ( Kannel 1987 deVreede-Swagemakers 1997 UK Heart Attack Study 1998) An analysis of out-of-hospital mortality from AMI in Pennsylvania in 1998 showed that in relatively rural areas where travel time to the hospital was 25 minutes or longer 72 of all AMI deaths occurred outside the hospital compared with a statewide average of 49 of AMI deaths occurring outside of the hospital (OrsquoNeill 2003) In New York State 55 of IHD deaths occur outside the hospital compared to 35 of stroke deaths (NYS DOH 2009b)

Hudson River PCB Contamination

During an approximate 30-year period ending in 1977 the General Electric (GE) plants in Hudson Falls and Fort Edward used PCBs in the manufacture of electrical capacitors and discharged large quantities of PCBs into the upper Hudson River Estimates of the total direct discharges to the river are as high as 1330000 pounds In 1983 the US EPA classified a 200shymile section of the Hudson River from Hudson Falls to New York City as a National Priority List site making it one of the largest Superfund sites in the United States In 2002 the US EPA issued a Record of Decision calling for targeted environmental dredging and removal of PCB-contaminated sediment in the upper River (USEPA 2002) The dredging began in May 2009

The Superfund environmental investigations and planned remedial actions have focused on the areas of the river near Hudson Falls and Fort Edward where the highest levels of PCBs in sediments are located The 2002 Record of Decision provides information about the human health risk assessment conducted for the Upper and Mid-Hudson and the ecological risk assessment conducted for the Upper and Lower Hudson River (USEPA 2002) Fish contamination and fish consumption are the primary concerns at the site

In 1976 fishing was banned in the upper river and in 1995 a catch and release policy replaced the ban NYS DOH continues to recommend people eat no fish from the Upper Hudson River that children under age 15 and women of childbearing age eat no fish from the entire 200 mile stretch of river below Hudson Falls and that the general population eat none of most species of fish caught between the federal dam at Troy and Catskill approximately 40 miles south of Troy The Lower Hudson remains closed to commercial fishing for striped bass and eight other species

The fishing bans and advisories were issued because ingestion of fish is the major potential pathway for exposure for the general population Many previous studies have shown relationships between consumption of fish from contaminated waters and human PCB levels Another potential pathway for exposure less frequently studied to date is inhalation of PCBs NYS DOH and ATSDR launched ldquoThe Hudson River Communities Projectrdquo in 2000 to address this issue (More information on this project is provided below)

POPS and PCBs

The category ldquopersistent organic pollutantsrdquo (POPs) includes man-made chemicals that are by definition persistent in the environment Examples of POPs are polychlorinated biphenyls

5

(PCBs) polychlorinated dibenzo-p-dioxins (PCDDs) polychorinated dibenso-furans (PCDFs) and pprsquo-diphenyldichloroethene (DDE) a breakdown product of the pesticide dichlorodiphenyltrichloroethane (DDT) These chemicals are stored in human fat tissues accumulating over an individualrsquos lifetime They also accumulate in fish and other species thereby concentrating up the food chain Levels of most POPs measured in blood serum of Americans have decreased in the last few decades An exception is polybrominated diphenyl ethers (PBDEs) levels of which have been increasing in recent years While levels of many POPs are declining a large number continue to be detectable in blood for most of the US population (USDHHS 2009)

This follow-up investigation focuses on the Hudson River where the POPs of concern are PCBs PCBs are a group of synthetic chlorinated compounds manufactured in the US through 1977 Production was halted due to concerns about the persistence of PCBs in the environment and the toxicity of PCBs shown in animal models PCBs were used primarily by the electrical industry for capacitors and transformers and were also used in hydraulic fluids fluorescent light fixtures flame retardants inks adhesives carbonless copy paper paints pesticide extenders plasticizers and wire insulators PCBs can be oily liquids or solids and some are volatile and can exist as vapor in air PCBs in the environment are generally mixtures of the many different types known as congeners (ATSDR 2000)

The variety of types of PCBs adds to the complexity of studying adverse animal and human health effects Increasingly PCB research focuses on specific PCB congeners and groups of congeners because specific types of congeners appear to have different mechanisms of action and toxicity Dioxin-like PCBs (DLPCBs) are an important subset of PCBs due to concerns about their toxicity The focus on dioxin-like PCBs is paralleled by increasing research on the group of POPs with dioxin-like properties This type of PCB and POP research assigns toxic potency values relative to dioxin to various PCBs and POPs to provide an overall toxic equivalency value to specific PCB and other POPs

From 2002 through 2007 NYS DOH and SUNY Albany School of Public Health researchers conducted the Hudson River Communities Study among residents of Hudson Falls and Fort Edward the villages where the PCB discharges from two GE electrical capacitor plants occurred The comparison community was upriver from the discharges in Glens Falls This studyrsquos evaluation of exposures included measuring PCBs in outdoor air and in blood serum The study found that PCB levels in outdoor air were statistically significantly higher in the Hudson FallsFort Edward area (072 nanograms per cubic meter) than in the upriver Glens Falls area (040 nanograms per cubic meter) but that the PCB concentrations in the Hudson FallsFort Edward area were lower than those in other communities with known PCB-contaminated sites and similar to levels reported in other locations in the northeastern United States A statistically significant reduction in the total PCB concentrations in outdoor air samples was observed at a distance of 1200 meters (34 mile) from the river from 0760 ngm3 to 0497 ngm3 (Fitzgerald et al 2007 Palmer et al 2008)

The average PCB concentrations in serum for the study and comparison populations did not differ significantly (307 ppb wet weight versus 323 ppb) The average toxic equivalency quantities (TEQ) for dioxin-like PCB congeners for the study and comparison area populations

6

also did not differ In addition the study found no detectable differences in study subjectsrsquo serum PCB levels associated with proximity to contaminated portions of the river or other PCB sources The study did find that lifetime consumption of Hudson River fish was associated with higher serum PCB levels (Fitzgerald 2007 Palmer 2008) Another recent biomonitoring study of Brooklyn Manhattan and New Jersey fisherman who consumed versus those who did not consume their catch from the lower Hudson River and New YorkNew Jersey harbor suggests eating fish from these waters is not associated with an increased body burden of PCBs or most other organochlorines This study did not address the inhalation pathway (Morland 2008)

PCBs and Cardiovascular Disease Outcomes

Research studies on adverse health effects associated with PCB exposure have focused on many types of outcomes organs and biological systems including cancer liver reproductive and developmental effects thyroid endocrine immune system and neurological effects as well as specific effects on the skin and eyes Relatively little research has focused to date on associations between PCB exposures and CVD outcomes The majority of human studies have assessed health effects in occupational groups having relatively high-level exposures among people consuming fish from contaminated waters or people exposed to relatively high levels due to accidental contamination of food (USDHHS NIH 2010)

PCBs have been shown to produce a variety of toxic effects in animals These effects include cancer allergies hypersensitivity damage to the central and peripheral nervous systems reproductive disorders and disruption of the immune system (ATSDR 2000) In contrast research to date does not tell us whether specific types of human health effects have occurred from the low-level exposures experienced by the general population Some human epidemiological studies of relatively low levels of PCDDs PCDFs and PCBs suggest possible relationships not only with neurodevelopment of infants but also diabetes and a condition called insulin resistance conditions that increase CVD risk (Longnecker 2001 Carpenter 2006 Arisawa 2005)

In 1998 a study of a population in Seveso Italy exposed to relatively pure TCDD (dioxin) from an industrial accident showed increased mortality from CVD respiratory diseases and diabetes (Bertazzi 1998) These findings suggested the possibility that other dioxin-like chemicals such as dioxin-like PCBs might increase risk for these diseases Some more recent animal studies described below suggest possible mechanisms of action for PCB effects that increase risk for CVD diabetes and hypertension Another relevant source of information is from research using the biological monitoring data from the National Health and Nutrition Examination Survey (NHANES) project to look at the association between PCB levels in blood and self-reported diabetes and heart disease The body of evidence based on analytical human epidemiological studies remains relatively sparse however

Some recent studies using NHANES biomonitoring data show associations between PCB levels and CVD (Lee 2006 Lee 2007 Ha 2007 Everett 2008) These studies are cross-sectional (comparisons at one point in time) evaluations of PCB levels measured in blood samples from individuals and self-reported CVD and diabetes a risk factor for CVD While these studies use individual-level biomonitoring and self-reported health outcome data they are considered to be

7

relatively weak in terms of providing evidence of causality because both exposure and outcome are measured at the same point in time Given the nature of NHANES data collection and reporting practices these studies are not able to control for possible geographical confounding which may occur if urban versus rural residence is associated with higher PCB levels and higher risk for CVD for example The higher PCB levels among African Americans in these data suggest urban residence may be playing a role in the PCB data

The results of animal studies suggest multiple mechanisms by which PCB exposures could affect cardiovascular risk as well as multiple types of adverse health outcomes One hypothesized mechanisms of action specifically for dioxin-like PCBs is that they may have anti-estrogenic effects which impact CVD risk factors such as serum cholesterol levels and blood pressure (Lind 2004) Several studies suggest PCBs interact with a specific receptor (AhR) and increase cellular oxidative stress causing inflammation cell injury and cardiovascular dysfunction (Hennig 2002) Similarly some studies suggest PCB-induced inflammation of fat cells leads to obesity and increased risk for coronary artery disease (atherosclerosis) (Arsenescu 2008)

3 FOLLOW-UP STUDY DATA SOURCES AND METHODS

This follow-up investigation will provide perspective on the 2004-2005 findings (Sergeev and Carpenter 2005 Shcherbatykh et al 2005) by producing results from analyses conducted at a more detailed level of geography the block group rather than ZIP code in order to better control for confounding When the term confounding is used by epidemioogists it refers to the problem that can arise when there is a factor such as income level that is not the factor of interest in the study but that is related to the risk factor being studied (residential location ndash proxy for exposure) and to the disease being studied (CVD) This type of factor is labeled a ldquoconfounderrdquo because it can confound ie interfere with or confuse the evaluation of associations among the variables of interest in the data All types of epidemiological studies face issues of confounding but ecological ie group-level analyses face additional challenges related to potential confounding because analyses can not account for some important individual differences within the groups that are the units of analysis (Morganstern 1998b Hertz-Picciotto 1998) This investigation was able to control for each individualrsquos age sex and race but not for individual-level risk factors such as income or education level

The follow-up investigationrsquos specific objectives are to

improve the ability to control for confounding by socio-economic status through the use of block groups rather than ZIP codes as the unit of analysis

compare the hospitalization rates among residents of block groups nearest the Hudson River to those among residents of block groups farther from the Hudson River

compare the findings of this follow-up study to those of the earlier studies

Study Area and Exposure Areas

The overall study area is comprised of the 12 counties with geographic areas abutting the Hudson River south of the source of PCB contamination from the General Electric facility in Hudson Falls The 12 counties are Westchester Rockland Putnam Orange Dutchess Ulster

8

Columbia Greene Rensselaer Albany Washington and Saratoga US Census boundaries are used to define the study area because census population estimates provide the data for the study area populationrsquos size and the populationrsquos age sex and race distribution Because the study period spanned 1990 through 2005 and populations change over time 1990 and 2000 Census data were used to linearly interpolate the between-year populations from 1991 through 1999 for each block group 2000 data were used without extrapolation to estimate block group populations from 2001 through 2005

Because block group boundaries change between the Census years 1990 block populations were normalized to fit 2000 block group boundaries before the estimation of sex age and race group specific populations In the digital map 2000 block group boundaries were overlaid on the 1990 block centroids to assign 1990 block population counts to the corresponding 2000 block group boundaries Then 1990 block populations were summed to fit the block group boundaries Next population counts by age group (0-14 15-24 25-34 35-44 45-54 55-64 65-74 75+) sex (male female) and race (white black asian other) were estimated for each year at the block group level This investigation includes all ages over 25 years and the study population is divided into four race categories white black or African-American Asian and ldquoOtherrdquo The Pacific Island group was combined with the Asian group and American Indian ldquootherrdquo and ldquomultiple racesrdquo categories are combined into a larger ldquootherrdquo category

Classification of the population into potential exposure categories for this study is based exclusively on location relative to the Hudson River Two types of exposure proxy categories were developed block group adjacency and block group centroid distance to the river shore For the adjacency categories block groups were assigned to the adjacent category if they directly abutted the Hudson River and all other block groups in the 12 counties were assigned to the nonshyadjacent category

For the distance categories a central point (centroid) within each block group was identified and used to assign block group centroid distance to the river shore The block group centroids were population-weighted which means the geographic distribution of households within 2000 census blocks were used to locate block group centroids based on the geographic population distribution By locating a population-weighted block centroid rather than a more straightforward distance centroid the assignment of block group distance to the river shore more accurately reflects distance to the river for the majority of block group residents Buffers of one-half mile and one mile from the Hudson River boundary were created for the assignment of each block group into one of the three distance categories less than frac12 mile (closest) frac12 to one mile (close) and farther than one mile from the river (distant)

Distance categories were selected a priori at the start of the study Because environmental sampling data were not collected along the length of the river included in this investigation we are not able to determine whether these distance categories represent actual differences in exposures As discussed previously a statistically significant reduction in the total PCB concentrations in outdoor air samples was observed at a distance of 1200 meters (34 mile) from the river from 0760 ngm3 to 0497 ngm3 in a study conducted in the Hudson Falls Fort Edward area near the source of PCBs (Fitzgerald 2007 Palmer 2007)

9

For the adjacency categories of the 2371 block groups in the study area 255 (11) were classified as adjacent to the Hudson River and 2116 (89) were classified as non-adjacent (Figure 2) For the distance categories 225 (9) were classified as closest to the river 216 (9) as close and 1930 block groups (81) as distant from the river

Comparison of Figures 1 and 2 show that block groups include smaller geographic areas than ZIP codes and that block groups are relatively smaller in area in urban versus rural areas The study area includes a total of 368 ZIP codes compared to a total of 2371 block groups ZIP code average population in the study area is approximately 8000 compared to block group average population of 1200 Comparison of Figures 2 and 3 shows that using population-weighted centroid distance to the river produces different exposure areas than adjacency Fifty-three (20) of the adjacent block groups were categorized for the distance indicator as distant because the population in the block group was concentrated in the area farther than one mile from the river On the other hand about 11 of the 2116 non-adjacent block groups were classified as closest (77) or close (162)

Hospitalization Data

Hospitalizations for ischemic heart disease (IHD) and cerebrovascular disease (stroke) were included in this study More specifically hospital discharge records that listed as the principal diagnosis either IHD (ICD-9-CM 410-414 and 4297) or cerebrovascular disease (ICD-9-CM 430-436) and subsets of each of these categories similar to the subsets evaluated in the Sergeev et al and Shcherbatykh studies were included The subsets are AMI and angina subsets of IHD and ischemic and hemorrhagic stroke which are subsets of cerebrovascular disease (stroke) (Principal diagnosis represents the illness for which the person was admitted to the hospital) Angina is an outcome not evaluated in the prior studies It was added here for comparative purposes because it represents an outcome appropriately addressed by primary outpatient rather than inpatient hospital care Angina is one of a group of outcomes included in the recently developed national ldquoPrevention Quality Indicatorsrdquo (USDHHS AHRQ 2007) as an indicator for geographic areas or populations for whom primary care is inadequate Table 3 lists the specific ICD-9-CM codes and definitions included in this investigation

While one of the previously published studies (Huang et al 2006) showed elevations in hypertension (high blood pressure) rates this outcome is rarely listed as the principal reason for hospitalization As a secondary diagnosis however hypertension is extremely common with substantial overlap with the primary diagnoses of IHD and stroke In addition hypertension hospitalizations were not as elevated in the ZIP codes adjacent to the Hudson River as the IHD and stroke outcomes As a practical matter geocoding the hypertension hospitalization data (1364000 hospitalizations) would overwhelm the resources for this project For these reasons hypertension was not included in this follow-up investigation

The source of the hospitalization data was the NYS DOH Statewide Planning and Research Cooperative System (SPARCS) established in 1979 to collect detailed records on discharges from hospitals located in New York State Persons seen in the emergency department but not admitted are not included in the SPARCS data Hospitalizations at US Veterans Administration

10

Hospitals are also not included in this dataset We identified the hospital transfers in the data and excluded them so that individuals are only counted once for the same event

The NYS DOH Data Protection Review Board for SPARCS hospitalization data approved our use of hospitalization records for this investigation and provided approximately 460000 records with a principal diagnosis of either IHD or stroke Because of the large number of cardiovascular hospitalizations and this studyrsquos requirement for geocoding each hospitalized personrsquos address we selected a 50 random sample of the hospital discharges with principal diagnoses of IHD or stroke for inclusion in the study (229948 records) Approximately one percent of these records (2737) were eliminated from the study because they contained no residential address resulting in 227211 hospitalization records 24939 of these records were eliminated after geocoding because they were not within the 12 county study area This left 202272 records remaining in the study

For geocoding to the block group level the hospitalization street addresses were modified as needed using US Postal Service standards The addresses were assigned geographic coordinates using commercially available geocoding software (MapMarker 2004) A combination of land parcel data and street centerline files were used to assign geographic coordinates No contact was made with cases parents legal guardians or next of kin of cases to determine residential locations

Once geographic coordinates were assigned to cases through address-matching the case locations were overlaid onto digital maps of the study area using a geographic information system so that the number of observed cases falling within the study area boundaries could be determined and a 2000 Census block group could be assigned to each hospitalization In order to protect confidentiality no maps of individual case locations are provided Geographic coordinates were not able to be assigned for some addresses for example rural routes or post office boxes Internet searches were conducted for addresses that were located in facilities such as nursing homes and trailer parks

While fewer than one percent of records were missing sex or age information approximately 15000 records seven percent were missing race information Evaluation of the geographic and temporal distribution of these records showed they were not randomly geographically distributed 47 of the records with no race information were from one hospital for that hospital race was missing from 100 of the records in all years except one after 1997 mid-way through the study time-frame Evaluation of other records missing race showed similar patterns of 100 of records missing race in specific years In order to prevent biased hospitalization rate estimates records missing race were assigned race randomly assuming a race distribution equal to that of the individualrsquos residential block group

Statistical Analysis

This investigation used negative binomial multivariable regression models the statistical methods used in the previous studies (Sergeev and Carpenter 2005 [IHD] Shcherbatykh et al 2005 [stroke]) The follow-up studyrsquos analyses compared hospitalization rates for populations in block groups adjacent or in close proximity to the Hudson River to rates for populations more

11

distant from the Hudson River while adjusting for the potential influence of other factors Sex age and race-specific hospitalization count data comprise the dependent variable and the statistical methods attempt to predict the counts using the sex age and race distributions from Census data as well as block group-level demographics such as median income Models were first estimated using basic Census information for age sex and race Then more fully adjusted models were estimated using information at the block group level for income education population density Hispanic ethnicity and distance to the nearest hospital

More specifically each hospitalized personrsquos block group was used to assign quartile categories of block group median household income of population with less than a college education population density of the population identified as Hispanic and distance from the block group centroid to the nearest hospital These variables were selected in advance The population density variable was selected to account for potential effects on hospitalization rates of large differences in population density from the relatively rural northern counties to the more urban southern counties in the study area of the population identified as Hispanic was included because Hispanic ethnicity may be associated with CVD risk but Hispanic ethnicity was not available in the hospitalization records Distance to the hospital was included because studies show that shorter distances increase the likelihood that individuals with AMI and stroke survive to be admitted to the hospital

As described previously the literature about risk factors for CVD shows strong associations between lower income and education and higher CVD risk Regarding the effects of income and education on hospitalization rates the literature from analytic studies specifically of hospitalizations for CVD is quite sparse Most analytic studies focus on prevalence incidence or mortality in population-based studies or cohort studies and there are not many of these An example of a study that was useful as background is a study of income level and CVD mortality that showed strong associations of lower income and higher CVD mortality for men and women with the relationship for men being linear throughout the range of income levels while for women the relationship was only strong at levels of income below the median (Rehkopf 2008)

This follow-up studyrsquos approach was to use the same set of variables for the more fully adjusted models using quartile cut points for all the sexrace groups and types of CVD diagnostic subsets The 2000 Census data were used to create the population-weighted quartile categories to be used in the statistical analyses The quartile cut points are shown below

population density(persons per square mile)

median 1999 household income

with less than a college education

with Hispanic ethnicity

Hospital distance

Quartile 1 61 - lt507 $2499-lt$40968 0 - lt31 0 - lt2 lt25 (median) Quartile 2 507 - lt102 $40968 - lt54650 31 - lt43 2 - lt5 25 - lt6 (50-75) Quartile 3 102 - lt6410 $54650 - lt72944 43 - lt55 5 - lt10 6 ndash 11 (75-90) Quartile 4 6410 - 94372 $72944 - 200001 55 - 100 10 - 86 11 ndash 45 (90-100)

SAS version 9 provided the statistical programs used to estimate hospitalization rates for exposure proxy categories and demographic categories The models provide ldquocontrast

12

estimatesrdquo ie rate ratios for one category versus a reference category of a variable and 95 confidence intervals (CI) for these rate ratios For the location categories rate ratios for the areas adjacent are compared to areas not adjacent to the river (reference) and areas closest and close to the river are compared to areas distant from the river (reference) If a rate ratio is greater than 100 the model has estimated an elevation of hospitalizations in that category of the study population compared to the reference population If the rate ratio is less than 100 then there are fewer than expected hospitalizations compared to the reference population

The magnitude of the elevation or deficit is also estimated by the rate ratio For instance if the hospitalization rate is twice as high in the adjacent population it would result in a rate ratio of 200 while a 50 higher rate would result in a rate ratio of 150 If the hospitalization rate in the comparison group was only half as big as in the reference group this would result in a rate ratio of 050 The 95 confidence interval assists with determining whether the observed differences are statistically meaningful ie statistically significant The confidence interval provides the range in which there is a 95 probability of including the true rate ratio

In addition to directly assess potential confounding we examined the associations between income and exposure proxy categories and between income and CVD hospitalization risk Because the modeling process provides rate ratios that provide comparisons among groups but not the rates themselves we also calculated the rates per population for each sex race and income category in order to directly observe the relative magnitudes of CVD hospitalization rates Finally analyses stratified by income quartile were also conducted in order to assess the potential role of confounding by income The stratified analyses consist of negative binomial regressions conducted separately for each quartile of income

4 RESULTS

Demographics Associations with Proximity to the River

Study Population Table 4 provides detailed demographic characteristics for the total study population and the populations of the adjacent versus non-adjacent and nearer versus farther from the river block groups from the 2000 Census The total population of the study area for this one year of the study time period is 2924631 persons The first row of Table 4 shows that approximately 11 percent of the study area population is classified as living in block groups adjacent to the Hudson River and 89 live in non-adjacent block groups Approximately 8 of the studyrsquos population lives in the closest block groups approximately 10 lives in close block groups and approximately 82 in distant block groups The table also shows that approximately 81 of the total study population is identified as white 9 black 3 Asian less than 01 Pacific Islander 02 American Indian 4 ldquootherrdquo and 2 ldquomultiple racesrdquo

The 2000 Census data (Table 4) also show that the populations in block groups adjacent and nonshyadjacent to the river are similar in terms of the percent of minority population but the adjacent block groups include a smaller percentage of persons who have attended college and a higher percentage of households below the poverty level Median income is $47320 for block groups adjacent versus $62668 for block groups not adjacent to the river Data for the study area from the 1990 Census show similar distributions (data not shown)

13

For the distance to the river categories there is a gradient for the three levels of distance from the river with the closest block groups on average having higher percentages of minorities smaller percentages of persons who have attended college and higher percentages of households below the poverty level (Table 4) The populations of the close and closest block groups are more similar to each other than to the populations of the distant block groups The population of the closest block groups is estimated to be 19 black close block groups 14 black and distant block groups 8 black The proportions for the ldquootherrdquo category used in the analyses which includes multiple races and American Indians are approximately 12 in both the closest block groups and in the close block groups compared to 5 in the block groups most distant from the river Median income for the closest block groups is $40748 close block groups $44962 and most distant block groups $64847

The bar chart below illustrates the correlations between US Census 2000 indicators of block group socio-economic status and block group centroid (population weighted) distance from the river for the study population

2000 Census Demographics for Study Population by Distance to River Category

black other hispanic no college below poverty

of pop ulati on

0

10

20

30

40

50

60

lt12 mile

12 to 1 mile

gt1 mile

Table 5 shows selected demographic information for the person-year estimates used in the study analyses for the 1990-2005 study period Person-year data represent individuals from Census estimates counted repeatedly for each year of the study timeframe As described in the Methods section the estimation of person years used interpolation to account for population changes from 1990-2000 and techniques for taking account of block boundary changes The person-years estimates for demographic categories (complete data not shown) show similar patterns as the 1990 Census data (data not shown) and 2000 Census data (Table 5) Table 5 for example shows nearly identical percentages of population in each of the river distance categories as Table 4

Table 5 also shows person year information for income quartiles and exposure proxy categories as well as race and exposure proxy categories that is consistent with the Table 4 data for the

14

2000 population Table 5 row percentages show for example that while approximately 11 of the total study population live in adjacent block groups approximately 19 of the population in the lowest income quartile live in adjacent block groups By design approximately 25 of the total study population is in each of the income quartiles Table 5 column percentages show however that in the adjacent block groups 41 of the population is in the lowest income quartile and only 9 are in the highest income quartile

For the block groups within frac12 mile of the river the distribution among income quartiles is even more skewed towards the lowest income quartile 61 of the population living within frac12 mile of the river is in the lowest income quartile and only 6 is in the highest income quartile Given that approximately 25 of the total study population is within each income quartile these percentages indicate that living within the block groups closest to the river is associated with more than a doubling of the likelihood of being in a block group in the lowest socio-economic status quartile 436 of those living in the block groups close to the river (12 to one mile from the river) are in block groups in the lowest income quartile While less than a doubling this indicates that living in the close block groups is associated with a 75 increase in the likelihood of being in the lowest quartile for socio-economic status

Hospitalizations Table 6 shows the process of removing hospitalization records from the study for a variety of reasons The first column includes the 202272 records that contained a residential address and were located within the 12-county study area as described in the Methods section The next five columns show the records remaining after data cleaning exclusion of transfers elimination of records with missing covariates and exclusions due to poor geocoding results Approximately 33000 records were eliminated because they were transfers from one hospital to another for the same event For approximately 1100 records geocoding was inadequate for assigning block group for imputing race as described in the Methods section 336 records were missing sex (11) or age (325) Of the 167733 records remaining 154220 (92) were able to be adequately geocoded to a street address for assignment in a specific block group While approximately eight percent of hospitalizations were eliminated due to inadequate geocoding Table 6 shows that the percentages of hospitalization records for white black Asian and ldquootherrdquo categories and the distribution among diagnostic categories change very little after these records were dropped

The final column of Table 6 shows that of the 154220 hospitalizations in the analysis 101012 IHD hospitalizations comprise the largest health outcome category and are 65 of the hospitalizations in the study AMI hospitalizations (35811) account for 35 of the IHD hospitalizations and angina (4731) accounts for five percent of the IHD hospitalizations Stroke hospitalizations (53208) comprise 34 of the studyrsquos hospitalizations Approximately 36 of the stroke hospitalizations are for ischemic stroke (19371) and approximately 11 are for hemorrhagic stroke (5995)

Table 7 shows the distributions of hospitalizations among the four income quartiles While each income quartile (by definition) contains approximately 25 of the study population for all types of hospitalizations we see that the lowest income quartile includes more than 25 of hospitalizations from a minimum of 286 (IHD) to a maximum of 356 of hospitalizations (angina) The highest income quartile includes fewer than 25 of hospitalizations for each CVD

15

category ranging from 182 for angina to 224 for hemorrhagic stroke These deviations from the 25 quartile distribution show a strong linear pattern with the largest percentage of hospitalizations coming from residents of block groups in the lowest income quartile and the smallest percentages coming from residents of the most wealthy block groups For AMI the lowest income quartilersquos share of hospitalizations is 50 higher than the highest income quartilersquos share of hospitalizations (30 vs 20) IHD and stroke show similar differences and angina shows a doubling with the lowest income quartile contributing 36 of hospitalizations compared to the highest income quartilersquos 18 share of hospitalizations

Table 7 also shows the distribution of hospitalizations among the race categories and the study population race category distribution for comparison Blacks show lower percentages of IHD and AMI than their population percentages and higher percentages of angina and stroke While full presentation of race and sex findings is beyond the scope of this report the lower than expected percentages of IHD and AMI among blacks who on average have lower socioshyeconomic status and are at greater risk for CVD hospitalizations raised questions that were addressed by calculating hospitalization rates per 1000 population for each sex and race group for each income quartile (Appendix B Table 2) Figures 4 5 and 6 show these rates for the white black and Asian categories (These rates for the ldquootherrdquo category are comparatively very high and are not shown on these graphs This issue is addressed in the Discussion section) Figures 4 5 and 6 show the consistent and strong effects of income quartile on hospitalization rates for AMI and stroke for whites blacks and Asians of both sexes The other outcomes show similar patterns (The ldquootherrdquo group showed an opposite pattern [See Discussion Section])

Demographics Summary

The strong associations of lower income with residence closer to the river (Table 5) and lower income with higher hospitalization rates (Tables 7 amp 8) create a difficult challenge for this investigation The multivariable analyses of hospitalization rates attempt to control for socioshyeconomic differences However because these income differences which strongly affect hospitalization rates are also strongly correlated with the gradations of exposure (river distance) it will be difficult to disentangle the influence of socio-economic status from the influence of potential exposures

Statistical Analyses Proximity to the River and CVD Hospitalization Risk

All the variables described above were included in the multivariable regression models and almost every quartile category for every variable showed statistically significant and expected types of effects for all the CVD hospitalization categories (Appendix B Tables 1a and 1b) The following presentation of results focuses on the rate ratios for the exposure proxy variables Table 8a (IHD) and 8b (stroke) show the regression modeling results Results are shown first for models that include only age sex and race in addition to the exposure proxy categories and second for models that also include median income education population density Hispanic ethnicity and distance to nearest hospital The tables provide numbers of hospitalizations and person-years to assist with interpretation For example for total IHD we can see that while 108 of the study population lives in adjacent block groups 122 of the hospitalizations are among people in adjacent block groups This difference of 14 percentage points is the basis for

16

the finding of a statistically significantly elevated rate ratio (risk) for IHD among people living in the adjacent block groups (RR=133 CI 112-157)

Looking across the columns of Tables 8a and 8b we can see the rate ratios for each of the outcomes for the adjacent versus non-adjacent block groups and the results are similar to those for IHD The elevated rate ratios range from 123 for angina to 136 for AMI The only rate ratio not elevated in these first models adjusting only for age sex and race is for hemorrhagic stroke Going down the rows of the Table the next set of rate ratios are for the distance to the river categories The findings are very similar to those for the adjacent block groups Here however we have two exposure proxy categories allowing comparison of the two These rate ratios show no evidence that living in the block groups closest to the river (lt12 mile to the river) produces greater risk than living in the block groups close to the river (12 to 1 mile from the river)

The next set of rate ratios are for models with additional adjustment for the demographics of each hospitalized personrsquos block group as well as distance to the nearest hospital As expected adjustment for block group median income and other demographics accounts for some of the variability in hospitalization counts and thereby reduces the magnitude of the rate ratios estimated for the river adjacency and distance categories The rate ratio elevation for total IHD for example is reduced from 133 to 107 (RR=107 CI 104-111) The elevated rate ratios in these more fully adjusted models for the adjacency analyses are of similarly small magnitude ranging from 106 for AMI to 110 for ischemic stroke Four of the six outcomes show statistically significant elevations Again only hemorrhagic stroke shows no evidence of an elevated rate ratio

For the river distance categories none of the rate ratios show any evidence that living in the block groups closest to the river (lt12 mile to the river) produces greater risk than living in the block groups close to the river (12 to 1 mile) Of the six types of outcomes only two show statistically significantly elevated rate ratios for living in the closest block groups (total stroke and ischemic stroke) For the close block groups four out of six outcomes show statistically significantly elevated rate ratios The statistically significantly elevated rate ratios vary from 106 (AMI stroke) to 113 (angina)

To address the strong evidence described earlier for confounding in these data ie the association of lower income with both CVD hospitalizations and residence closer to the river we also conducted the analyses separately for each quartile of income These results are presented in Table 9 The first results are from analyses adjusting only for age sex and race and the second set of results are from models including the additional demographic factors For simplicity rate ratios are shown only for block groups closest to the river compared as in the other models to block groups farther than one mile from the river (Results were similar for the other distance categories (data not shown)

For the two lowest income quartiles Table 9 shows results similar to those for the study population as a whole (Table 8a 8b) However for the two highest income quartiles which represent frac12 of the study population Table 9 shows a consistent pattern of reduced rate ratios for residence in the closest block groups within frac12 mile of the river In the highest income quartile

17

these reduced rate ratios are statistically significant for almost all of the outcomes in both the partially and more fully adjusted models In addition the magnitude of the rate ratio estimates ie their departure from 100 is relatively high For the population living in block groups with the highest median income (income gt$73000) living within frac12 mile of the river is associated with having about 70 of the risk for IHD AMI and stroke (CI=068 069 069) as the population in equally high income block groups farther than one mile from the river In other words among the highest income quartile living this close to the river is associated with an estimated 30 reduction in risk for these types of hospitalizations For angina (CI=028) there is an estimated 70 reduction in risk and for ischemic stroke (CI=058) the reduction is estimated to be 40 These stratified analyses show that in one-half of the study population the two higher income quartiles there is no evidence of elevated hospitalization risk for populations in block groups closer to the river On the contrary reduced rate ratios are shown for all outcomes in the higher income quartiles for residents of block groups within frac12 mile of the river compared to those living farther than one mile from the river

Weighted average rate ratio estimates are a standard method for estimating a general population finding from stratified analyses and are also provided in Table 9 These average rate ratios range from 078 to 102 showing no evidence of elevated risk for the overall population associated with residence within frac12 mile of the river compared to living farther than one mile from the river Results for the other categories of river proximity showed similar results (data not shown) (Given the evidence of confounding interfering with analyses of the entire study population and the opposing findings depending on income quartile no confidence intervals are provided here for the overall population-weighted average estimates)

Summary of Results

In summary the multivariable regression results for the entire study population (Tables 8a 8b) show statistically significantly elevated rate ratio estimates for the adjacent versus non-adjacent block groups for four out of six hospitalization diagnosis categories The analyses also produce statistically significantly elevated rate ratio estimates for block groups closest to the river (less than frac12 mile from the river) and close to the river (between 12 to 1 mile from the river) compared to distant block groups (farther than 1 mile from the river) for most of the outcome categories However the findings provide no evidence that the closest block groups have higher hospitalization rates than the close block groups

The most elevated rate ratio estimate is 113 (angina) indicating an estimated 13 percent increase in risk for angina hospitalizations for residents of close versus distant block groups The only outcome with no statistically significant elevations for living near (adjacent closest or close) the river in the more fully adjusted models is hemorrhagic stroke It is noteworthy that the outcome that is considered an indicator of inadequate primary care rather than of incidence or prevalence angina hospitalizations shows the highest elevations associated with residence near the river (Table 8a) In addition the hospitalization outcome with the least association with socioshyeconomic status in our data (Appendix B Table 1b) and the lowest out-of-hospital mortality (NYS DOH2009b) hemorrhagic stroke is the outcome that shows no pattern of elevations associated with residence near the river (Table 8b) This pattern of findings is consistent with the interpretation that the rate ratios for living closer to the river are reflecting the influence of

18

lower socio-economic status and other known risk factors for hospitalizations for CVD rather than any unusual environmental exposures in areas near the river

The stratified results add strong support to this conclusion They suggest that classical confounding of the exposure-disease relationship by socio-economic status produced biased results in the overall multivariable regression analyses While the regression analysis of the entire study population shows elevated hospitalization rates for residents of block groups near the river compared to residents living farther away (Table 8a and 8b) the stratified analyses (Table 9) show this association applies only to the 50 of residents living in block groups with lower median incomes For the other 50 of residents those who live in block groups with higher median incomes hospitalization rate ratios are reduced for those living near the river Classical confounding refers to the type of confounding that can occur in individual-level or group-level (ecological) studies when a factor such as socio-economic status affects both the exposure (residential location) and the health effect In this study of CVD hospitalizations there are disproportionate numbers of outcomes and exposures (residence near the river) among specific levels of the confounder the lower income quartiles (Tables 4 5 7) The disproportionate numbers of hospitalizations and potential exposures (block groups close to the river) among the lower income quartiles create an artifactual exaggerated ie biased effect on the estimates for the overall analysis (Greenland 1999 Greenland 2003)

5 DISCUSSION

The key finding of this investigation is that the results of the multivariable regression modeling for the entire study population the elevated CVD hospitalization rates for populations living closer to the Hudson River are likely to be artifacts of classical confounding Stratifying on the confounding variable ie holding median income level constant the standard method for controlling for confounding showed the decisive role of income as a confounding variable The associations between living closer to the river and having higher hospitalization rates were evident only for people in the two lowest income quartiles For people in the two highest income quartiles the association was completely opposite ie people living closer to the river had lower rates of hospitalizations This variability of effects can be interpreted as evidence that there are no actual effects of the exposure-proxy categories on hospitalization rates

This type of classical confounding due to the strong influence of a variable that precedes causes or strongly affects both the exposure and the health outcome can occur in any type of epidemiological study The ecologic ie group-level analysis in this study is not the primary cause of the confounding Rather the complex associations in the population among income education CVD risk geographic residential patterns and geographic assignment of exposure status are the basis for the confounding However the studyrsquos group-level design contributes to the studyrsquos inability to completely control for confounding using multivariable regression methods and contributes to difficulties of interpreting the study findings

The stratified results shown in Table 9 may also be a reflection of inadequate control for socioshyeconomic status using quartile groupings A possible explanation for the Table 9 findings is that among lower income quartile populations people living in closest proximity to the river may have the lowest incomes while in higher income quartile populations people living in closest

19

proximity to the river may have the highest incomes This potential income difference associated with river distance within quartile groupings would explain why the stratified analyses showed positive associations with hospitalization risk for lower income residents closest to the river and negative associations with hospital risk for higher income residents closest to the river In this example of a possible explanation it is assumed that river proximity categories are capturing relative income (and hospitalization risk) differences rather than exposure Because the exposure proxy river distance categories are assigned at the group level this type of analysis is not able to distinguish whether these categories are capturing income rather than potential exposure in the analyses

When trying to interpret this studyrsquos findings it is important to keep in mind the many factors associated with CVD risk that are not controlled for in this studyrsquos analyses These include tobacco use high blood pressure high serum cholesterol alcohol use obesity low fruit and vegetable intake physical inactivity diabetes mental ill-health psychosocial stress and the use of certain medications Another factor that is not known to be associated with CVD risk but that is relevant for PCB exposure is consumption of contaminated fish It is possible that lower income populations are more exposed to PCBs because they are more likely to eat fish from the Hudson River This however would not help explain why higher income people near the river showed lowered CVD hospitalization risk than their counterparts living farther from the river

In addition to the confounding due to socio-economic statusrsquos effects on both the risk of CVD and residential location (the exposure proxy) there is the possibility that another pattern of effects also associated with socio-economic status called effect modification plays a role in these data Effect modification occurs when different groups people with differing types of health insurance or differing behaviors for example show different patterns or levels of health effects associated with the same exposures The differing effects (CVD hospitalization risks) of residence in block groups closest to the river by income quartile could be interpreted as evidence of effect modification (Table 9) However the magnitude and the change in direction of effects is unlikely to be due solely to differing biological or behavioral factors associated with income level that affect the results of exposures from living near the river Given the strong associations of socio-economic status and CVD hospitalization risk it is more likely that the analyses stratified by income enhance protection against bias compared to the multivariable models and that the stratified analyses represent findings from better control for confounding as opposed to effect modification

Effect modification is a concept used to describe important relationships between exposures and effects (that may differ among sub-groups for example) that the researcher wants to identify and understand Confounding on the other hand refers to associations among the variables that the researcher is trying to control so that the effects of interest (effects from exposures rather than age or sex for example) can be identified and the strength of the effects can be estimated The ecological design of this study contributes to the difficulty of interpreting whether the study shows evidence of effect modification as opposed to confounding (Morgenstern 1998b Hertz-Picciotto 1998)

Another limitation associated with this studyrsquos design is that block-group level socio-economic and environmental attributes that are assigned to individuals in this type of study can be more

20

strongly correlated at the group level than at the individual level In this way the ecological assignment of variables may lead to increasing problems of confounding because of strong associations among indicators of socio-economic status and environmental exposure categories (Morganstern 1998a) This problem is likely to have affected this studyrsquos results As with effect modification by group described above this type of confounding by group cannot be evaluated with ecologic data because individual-level information is needed to check for this type of ldquocross-levelrdquo bias

Ecological analyses are an important tool for environmental epidemiology and there is therefore a large and growing literature on their strengths and limitations and methods to address limitations (Piantadosi 1988 Greenland 1989 Greenland 1992 Greenland 1994 Richardson 2000) One particular concern is the interpretation of ecological study findings showing modest elevations of risk such as in the current study with rate ratios in the more fully adjusted models generally estimated to be below 110 (Wakefield 2003 Morgenstern 1998) Wakefield states that slight elevations need to be interpreted with great caution

in particular for cancer and heart disease (because) there are risk factors that are far more predictive of disease than environmental factors hellip Consequently the potential for confounding is strong since ecological studies do not directly use individual-level risk factor data although stratification by age and gender is routinely carried out (Frequently an area-level measure of socio-economic status is also used as a confounding variable but being an aggregate summary it only provides very crude control) (Wakefield 2003 p 9-10)

A strength of this study is that it used Census block groups which are relatively small average population of 1200 in the study area to assign socio-economic characteristics rather than the larger ZIP codes with average population size 8000 in this area By using block groups this study sought to reduce confounding by socio-economic status because the assignment of median income and other variables at the block group level was expected to be more accurate with block group populations expected to be more homogenous than larger and more geographically diffuse ZIP code populations used in the prior studies

The use of census block groups rather than ZIP codes likely enhanced the accuracy of the group-level category assignments However this type of design that assigns characteristics to hospitalizations using group-level characteristics and distance to the river is limited in comparison to individual-level designs that gather such information from personal interviews for example Most importantly for this investigation the group-level assignment of median income and other demographic variables to each hospitalization limited the studyrsquos ability to distinguish among individuals within the exposure proxy categories and thereby reduced the studyrsquos ability to control for confounding

This type of measurement error or inaccuracy from assigning group-level potential exposure income or educational level to the individual hospitalizations is generally assumed to bias a study towards findings of no associations or no effects But in situations where the measurement error is more severe for some variables than others and these variables are correlated then the study findings can be biased either towards exaggerated effects or no effect In this

21

investigation potential measurement error associated with the assignment of exposure categories based on block group centroid distance to the river is expected to be relatively low because a large proportion of the hospitalization addresses are very clearly in areas distant from the river The measurement error associated with the assignment of group-level demographic characteristics (income and education for example) from the Census is potentially relatively higher because this type of error can occur throughout every block group in the study

Another strength of this study is the use of NYS SPARCS data a relatively comprehensive population-based source of information about hospitalizations These data include individual-level information used in this study for patient address sex age race and diagnosis Of these variables only the address and race categories were missing information for substantial numbers of records and this limitation could have contributed to imprecision sufficient to create bias in the findings These issues are discussed in greater detail below

While geocoding to block groups represents a strength of this study this procedure required high quality address information One percent of the study arearsquos hospitalization records contained no address and eight percent of the addresses could not be accurately geocoded to a block group resulting in nine percent of hospitalizations being eliminated from the study because of address andor geocoding issues The distributions of race categories and distribution among the types of diagnoses did not change after dropping records with no or insufficient address information suggesting that the dropped records did not represent a substantially skewed group of records

Inaccuracy in the studyrsquos assignments of race categories results from limitations associated with both the hospitalization and Census data The SPARCS hospitalization database did not include information on race for seven percent of the studyrsquos hospitalizations We imputed race by assigning race category randomly based on the race distribution of the personrsquos block group This method would have biased the study toward a finding of no effect of race on hospitalization risk

Another limitation of the SPARCS data used in this investigation is that repeat hospitalizations for the same individual were not able to be identified If an individual was admitted to the hospital many times over the course of the study period then that individual would be counted multiple times This limitation may have contributed to an exaggerated association of lower socio-economic status and CVD hospitalizations because some people with lower incomes receive less adequate outpatient care and may have more repeat admissions Because of the association of lower income with residence near the river this limitation may have contributed to the difficulty of adequately controlling for confounding by socio-economic status in this investigation

While the use of Census data at the block group level rather than at the ZIP code level is a strength in terms of relative homogeneity among the population within block groups it is possible that block group population estimation errors contributed to imprecise estimation of hospitalization rates Particularly with regard to the race categories the study finding that the ldquootherrdquo category showed a counter-trend of higher income and higher CVD hospitalization rates (Appendix B Table 2) is evidence that differences between the categories used in the Census race designations and hospitalization race assignments affected the estimated hospitalization

22

rates The evolving choices for identifying onersquos race in the Census are not concurrently reflected in the SPARCS data and these inconsistencies likely contributed to imprecision in the estimates of CVD hospitalization risk associated with specific race categories These issues did not contribute to bias for the overall findings however We modeled the data with a variety of race category groupings combining black and ldquootherrdquo for example and the overall results did not change (data not shown)

Also regarding the Census undercounting of approximately two percent for the US population in 2000 was estimated to be primarily among the lowest socio-economic groups and minorities while over-counting of one percent was estimated to have occurred among the most wealthy who sometimes received and completed forms at multiple residences (United States Censusrsquo Monitoring Board 2001) The undercounting of the lower income population would result in over-estimation of rates in this population Undercounting of recent Asian immigrants for example particularly those of lower socio-economic status is a potential reason for the relatively high rates of CVD hospitalizations estimated for lower income quartiles of the Asian population (Figures 4-6) Models that grouped the Asian population with other race categories were also estimated and these showed the same results as the models reported here (data not shown) Undercounting of lower income groups in general would have contributed to the finding of higher hospitalization rates in these groups and added to the difficulty of adequately controlling for confounding of lower income with residence near the river

This study used a 50 percent sample of the CVD hospitalizations in the study area This was accomplished by using every other record provided in the complete dataset This would be expected to provide a non-biased random sample As a general check for data provision management or analysis errors we compared our estimates of hospitalization rates per population with those from other sources Table 10 shows these comparisons and shows that our estimates are generally consistent with rates from other sources

Actual exposures were not measured in this investigation Rather proximity to the river was used as a proxy for potential exposures associated with the river In addition other potential exposures that may impact CVD were not evaluated or measured Exposures to lead and air pollution would be expected to be higher in more urban areas and areas near heavy traffic which may tend to coincide with proximity to the river These issues were not addressed in this investigation If these exposures were occurring more often near the river and contributing to cardiovascular hospitalization rates this would contribute to findings of elevations in areas closer to the river

It is important to keep in mind the limitations associated with using CVD hospitalizations when incidence andor prevalence of CVD are the actual outcomes of interest As described previously mortality prior to reaching the hospital may reduce hospitalization rates particularly for AMI and stroke for some groups or areas more than others and introduce bias if hospitalizations are being interpreted as a surrogate for incidence or prevalence

23

Addressing Study Objectives

This follow-up report has addressed the studyrsquos first two objectives to increase control for confounding by socio-economic status through the use of block groups rather than ZIP codes as the unit of analysis and to compare the hospitalization rates among residents of block groups nearest the Hudson River to those among residents of block groups farther from the Hudson River Regarding the first objective the use of smaller units of analysis block groups rather than ZIP codes improves the accuracy of the assignment of income and other group-level demographic categories to the hospitalizations

Regarding the studyrsquos second objective to compare hospitalization rates among residents of block groups near to the river to hospitalization rates among residents far from the river this studyrsquos multivariable analyses showed relatively small but statistically significant elevations of CVD hospitalization risk for residents of block groups in close proximity to the Hudson River Stratified analyses that controlled for confounding directly by conducting analyses within each income quartile showed that residence in block groups near the Hudson River was associated with increased risk for CVD hospitalizations among residents of lower income block groups and conversely residence in block groups near the Hudson River was associated with decreased risk for CVD hospitalizations among residents of higher income block groups The estimated increased and decreased risks were of sufficient magnitude to nearly cancel each other out when average rate ratios for the four income quartiles were calculated The average rate ratios may be interpreted as showing no evidence that CVD hospitalization rates are elevated for residents living near the river for the general study population This finding suggests that the multivariable regression findings for the overall population show elevated risks for hospitalizations among residents of block groups near the river due to confounding of the relationship of residential location and CVD risk by socio-economic status

The findings from analyses stratified by income quartile may also be interpreted as showing that residing in a higher versus lower income quartile block group produces different effects from similar exposures (effect modification) As discussed previously the ecological ie group-level analyses are unable to directly evaluate confounding and effect modification Rather information about the strengths and direction of associations among variables in the analyses need to be evaluated in light of known risk factors to draw conclusions about likely explanations and plausible interpretations In this case uncontrolled confounding by socio-economic status appears to be the most likely explanation for the findings This is due to the strong association between residential proximity to the river and socio-economic status

The studyrsquos third objective was to compare the findings of this review to those of the earlier studies that prompted this investigation Because of the similarities in study design and statistical analysis the prior studyrsquos findings can be compared with the findings of the multivariate regression analyses in the current study using Table 1 and Tables 8a and 8b The major difference between the studies is the composition of the comparison areas for the analyses Because of this difference as well as other differences (Table 3) the studies are not entirely comparable

24

Table 1 shows some findings from the prior studies Comparison with Table 8a and 8b from this study shows that the CV hospitalization rate ratios estimated for living in block groups adjacent or near the river in the current study are substantially lower than the rate ratios for residence in ZIP codes adjacent to the Hudson River in the previously published studies In most cases the current analyses show elevations below 10 in contrast to elevations of 20 to 39 in the prior studies The results of the current study are quite similar to the prior study results in terms of the associations reported for age sex race and income (Appendix B Tables 1a and 1b) (Sergeev and Carpenter 2005 Tables 4 amp 5 Shcherbatykh et al 2005 Table 3) These similarities and the general methodological similarities between the current and prior studies suggest the likelihood that the prior studiesrsquo findings also were affected by confounding by socio-economic status

6 CONCLUSION AND RECOMMENDATIONS

While this type of study can neither prove nor disprove causal links among risk factors and health effects it is useful for examining evidence for unusual patterns that might warrant additional investigation This evaluation of hospitalization data residential location and socioshyeconomic factors in block groups in the 12 Counties abutting the Hudson River revealed a closely coinciding pattern of lower socio-economic status and residential proximity to the Hudson River the proxy for potential exposure and a strong association of lower socioshyeconomic status and risk for CVD hospitalizations The strong associations among residential location socio-economic status and CVD risk prevent us from drawing definitive conclusions from the multivariable regression analyses

The multivariable analyses showed relatively small but statistically significant elevations of CVD hospitalization risk for residents of block groups in close proximity to the Hudson River Stratified analyses conducted within each of the four income quartiles showed such elevations only among residents of lower income block groups This studyrsquos data methods and analyses can not provide a conclusive interpretation for this finding However the strong effect of socioshyeconomic status on residential location (exposure proxy) and on CVD hospitalization risk makes it likely that the overall multivariable regression results are affected by uncontrolled confounding by socio-economic status

To draw more definitive conclusions particularly for outcomes such as CVD that are determined by multiple known risk factors studies seeking to investigate the role of PCB or POP exposures will require resource-intensive methods that include gathering individual-level exposure or biomonitoring information as well as individual-level medical histories and information on other risk factors

This current investigation did not include environmental or biological sampling data One study described previously did gather such data in the Hudson River region with the highest potential exposures (Hudson Falls Fort Edward) This study detected a statistically significant reduction in the total PCB concentrations in outdoor air samples at a distance of 1200 meters (34 mile) from the river However the PCB levels in outdoor air in this area were lower than those in other communities with known PCB-contaminated sites and similar to levels reported in other locations in the northeastern United States The study found no detectable differences in study

25

subjectsrsquo serum PCB levels associated with proximity to contaminated portions of the river or other PCB sources (Fitzgerald 2007 Palmer 2008)

It is recommended that the results of the current investigation of CVD hospitalizations and residence near the Hudson River be shared with NYS residents and other stakeholders with interest in this issue and be submitted for publication in a peer-reviewed journal

26

AGENCY INFORMATION

New York State Department of Health Authors Elizabeth Lewis-Michl PhD

Research Scientist Bureau of Environmental and Occupational Epidemiology

June Beckman-Moore MPH Research Scientist Bureau of Environmental and Occupational Epidemiology

Marta Gomez MS Research Scientist Bureau of Environmental and Occupational Epidemiology

Acknowledgments This study was supported by ATSDR and the Centers for Disease Control and Prevention

through a Cooperative Agreement Grant to NYS DOH (3U61TS200002-19W1)

The authors would like to thank the following people for their contribution to this project Alissa Caton PhD of the Bureau of Environmental and Occupational Epidemiology NYS DOH developed the study protocol assembled the preliminary data and conducted preliminary analyses Syni-an Hwang PhD Bureau Director and Shao Lin PhD Research Scientist Bureau of Environmental and Occupational Epidemiology NYS DOH and Edward Fitzgerald PhD Professor Division of Environmental Health and Toxicology SUNY Albany School of Public Health reviewed and provided comments on draft versions

New York State Department of Health Cooperative Agreement Coordinator Don Miles MS

Public Health Specialist Bureau of Environmental Exposure Investigation

ATSDR Technical Project Officer Gregory V Ulirsch PhD

Environmental Health Scientist Cooperative Agreement and Program Evaluation Branch

Division of Health Assessment and Consultation

Mohammed Uddin MD Epidemiologist

Division of Health Studies

ATSDR Regional Representatives Leah Graziano

Senior Regional Representative - Region 2 Office of Regional Operations

Racquel Stephenson Regional Representative - Region 2

Office of Regional Operations

27

REFERENCES Agency for Toxic Substances and Disease Registry (ATSDR) 2000 Toxicological Profile for Polychlorinated Biphenyls (PCBs) ATSDR Division of ToxicologyToxicology Information Branch Atlanta GA

Arisawa K Takeda H Mikasa H 2005 ldquoBackground exposure to PCDDsPCDFsPCBs and its potential health effects a review of epidemiologic studiesrdquo The Journal of Medical Investigation Feb 52(1-2)10-21

Arsenescu V Arsenescu RI King V Swanson H Cassis LA 2008 ldquoPolychlorinated biphenylshy77 induces adipocyte differentiation and proinflammatory adipokines and promotes obesity and atherosclerosisrdquo Environmental Health Perspectives 116 (6) 761-768

Ayala C Croft J Keenan N Neff L Greenlund K Donohoo R Zheng Zhi Mensah G 2003 ldquoIncreasing Trends in Pre-transport stroke deaths- United States 1990-1998rdquo Ethnicity amp Disease 12 Spr S2131-S2137

Barnett E Armstrong D Casper M 1999 Evidence of increasing coronary heart disease mortality among black men of lower social class AEP 98 464-471

Bertazzi PA Bernucci I Brambilla G Consonni D Pesatori AC 1998 ldquoThe Seveso studies on early and long-term effects of dioxin exposure a reviewrdquo Environmental Health Perspectives April 1062 625-633

Carpenter DO 2006 ldquoPolychlorinated biphenyls (PCBs) routes of exposure and effects on human healthrdquo Reviews in Environmental Health Jan-Mar 21(1)1-23

deVreede-Swagemakers JJ Gorgels APDubois-Arbouw WI van Ree JW et al 1997 ldquoOut-of hospital cardiac arrest in the 1990s a population-based study in the Maastricht area on incidence characteristics and survivalrdquo Journal of the American College of Cardiology 30(6) 1500-5

Everett CJ Mainous AG Frithsen IL Player MS Matheson EM 2008 ldquoAssociation of polychlorinated biphenyls with hypertension in the 1999-2002 National Health and Nutrition Examination Surveyrdquo Environmental Research 108 94-97

Fitzgerald EF Belanger EE Gomez MI Hwang S Jansing RL Hicks HE 2007 ldquoEnvironmental exposures to polychlorinated biphenyls (PCBS) among older residents of upper Hudson River communitiesrdquo Environmental Research 104 352-360

Galea S Blaney S Nandi A Silverman R Vlahov D Foltin G Kusick M Tunik M Richmond N 2007 ldquoExplaining racial disparities in incidence of and survival from out-of-hospital cardiac arrestrdquo American Journal of Epidemiology 1665 534-543

Greenland S Morgenstern H 1989 ldquoEcological bias confounding and effect modification International Journal of Epidemiology 18269-74

28

Greenland S 1992 ldquoDivergent biases in ecologic and individual level studiesrdquo Statistics in Medicine 111209-23

Greenland S Robins J 1994 ldquoInvited commentary ecologic studies ndash biases misconceptions and counterexamples American Journal of Epidemiology 139747-60

Greenland S Pearl J Robins JM 1999 Causal diagrams for epidemiologic research Epidemiology 1037-48

Greenland S 2003 ldquoQuantifying biases in causal models classical confounding vs collidershystratification biasrdquo Epidemiology May Vol 143 300-306

Ha M Lee D Jacobs DR 2007 ldquoAssociation between serum concentrations of persistent organic pollutants and self-reported cardiovascular disease prevalence results from the national health and nutrition examination survey 1999-2002rdquo Environmental Health Perspectives 1158 1204-1209

Hennig B Meerarani P Slim R Toborek M Daugherty A Silverstone AE Robertson LW 2002 ldquoProinflammatory properties of coplanar PCBs in vitro and in vivo evidence Toxicology and Applied Pharmacology Jun 15181(3)174-83

Hertz-Picciotto I ldquoEnvironmental Epidemiologyrdquo in Modern Epidemiology Rothman KJ and Greenland S 2nd ed Philadelphia Lippincott Williams amp Williams 1998 pp555-584

Howard G Cushman M Prineas R Howard V Moy C Sullivan L DrsquoAgostino R McClure L Pulley L Safford M 2009 ldquoAdvancing the hypothesis that geographic variations in risk factors contribute relatively little to observbed geographic variations in heart disease and stroke mortalityrdquo Preventive Medicine 49 129-132

Huang X Lessner L Carpenter DO 2006 ldquoExposure to persistent organic pollutants and hypertensive diseaserdquo Environmental Research 102(2005) 101-106

Kannel WB Cupples LA DrsquoAgostino RB 1987 ldquoSudden death risk in overt coronary heart disease the Framingham Studyrdquo American Heart Journal 113(3) 799-804

Ko DT Krumholz HM Wang Y Foody JM Masoudi FA Havranek EP You JJ Alter DA Stukel TA Newman AM Tu JV 2007 ldquoRegional differences in process of care and outcomes for older acute myocardial infarction patients in the United States and Ontario Canadardquo Circulation 115196-203

Lee D Lee I Song K Steffes M Toscano W Baker BA Jacobs DR 2006 ldquoA strong dose-response relation between serum concentrations of persistent organic pollutant and diabetesrdquo Diabetes Care 29 (7) 1638-1644

Lee D Lee I Porta M Steffes M Jacobs DR 2007 ldquoRelationship between serum concentrations of persistent organic pollutants and the prevalence of metabolic syndrome among non-diabetic

29

adults results from the National Health and Nutrition Examination Survey 1999-2002rdquo Diabetologia 50 1841-1851

Lind PM Orberg J Edlund U Sjoblom L Lind L 2004 ldquoThe dioxin-like pollutant PCB126 (33rsquo44rsquo5-pentachlorobiphenyl) affects risk factors for cardiovascular disease in female ratsrdquo Toxicology Letters 150 293-299

Longnecker MP Klebanoff MA Brock JW Zhou H 2001 ldquopolychlorinated biphenyl seum levels in pregnant subjects with diabetesrdquo Diabetes Care 24(6) 1099-1101

Lloyd-Jones D Adams R Carnethon et al (AHA Statistics Committee and Stroke Statstics Subcommittee) 2009 ldquoAHA Statistical Update Heart disease and stroke statisticsmdash2009 updaterdquo Circulation 119e21-e181

MapMarker Plus V 100 MapInfo Corp 2004

Morgenstern H 1998 ldquoEcologic studyrdquo In Armitage P Colton T eds Encyclopedia of biostatistics Vol2 NY NY John Wiley and Sons Inc 1255-76

Morganstern H ldquoEcologic Studiesrdquo Chapter 23 in Modern Epidemiology Rothman KJ and Greenland S 2nd ed Philadelphia Lippincott Williams amp Williams 1998 pp459-480

Morland K Wolff M Bopp R Godbold J Landrigan P 2008 ldquoFish consumption and body burden of organochlorines among Lower Hudson urban anglersrdquo American Journal of Industrial Medicine 51587-594

New York State Department of Health (NYS DOH) 2009a ldquoThe Burden of Cardiovascular Disease in New York Mortality Prevalence Risk Factors Costs and Selected Populationsrdquo Bureau of Chronic Disease Epidemiology and Surveillance Bureau of Health Risk Reduction httpwwwhealthstatenyusnysdohchronic_diseasecardiovascularburdenofcvdinnyspdf

New York State Department of Health (NYS DOH) 2009b ldquoHeart Disease and Stroke Indicatorsrdquo Community health assessment indicators Bureau of Biometrics and Health Statistics httpwwwhealthstatenyusstatisticschacchai

OrsquoNeill L 2003 ldquoEstimating out-of-hospital mortality due to myocardial infarctionrdquo Health Care Management Science 6 147-154

Palmer PM Belanger EE Wilson LR Hwang SA Narang RS Gomez MI Cayo MR Durocher LA Fitzgerald EF ldquoOutdoor air PCB concentrations in three communities along the Upper Hudson River New Yorkrdquo 2008 Archives of Environmental Contamination and Toxicology 54363-371

Piantadosi S Byar DP Green SB 1988 ldquoThe ecological fallacyrdquo American Journal of Epidemiology 127893-904

30

Rehkopf DH Berkman LF Coull B Krieger N 2008 ldquoThe non-linear risk of mortality by income level in a healthy population UgtSgt National Health and Nutrition Examination Survey mortality follow-up cohort 1988-2001rdquo British Medical Journal Public Health 8383

Richardson S Montifort C 2000 ldquoEcological correlation studies In Elliott P Wakefield JC Best NG et al eds Spatial epidemiology methods and applications NY NY Oxford University Press 205-220

SAS version 9 Cary North Carolina

Schoenbach V 2001 ldquoMulticausality Effect Modification wwwepidemiolognet Ch12 381shy413

Schoenbach V 2004 ldquoMulticausality Confounding wwwepidemiolognet Ch11 335-375

Sergeev AV and Carpenter DO 2005 Hospitalization rates for coronary heart disease in relation to residence near areas contaminated with persistent organic pollutants and other pollutants Environmental Health Perspectives 113(6)756-761

Shcherbatykh I Huang X Lessner L Carpenter DO 2005 Hazardous waste sites and stroke in New York State Environmental Health A Global Access Science Source 4(18) httpwwwehjournalnetcontent4118

Siemiatycki J Wacholder S Dewar R Cardis E Greenwood C and Richardson L 1988 ldquoDegree of confounding bias related to smoking ethnic group and socio-economic status in estimates of the associations between occupation and cancerrdquo Journal of Occupational Medicine 30 617-625

Tu YK Gunnell D Gilthorpe MS 2008 ldquoSimpsonrsquos paradox Lordrsquos paradox and suppression effects are the same phenomenon- the reversal paradoxrdquo Emerging Themes in Epidemiology 52

United Kingdom Heart Attack Study Collaborative Group Norris RM 1998 ldquoFatality outside hospital from acute coronary event in three British health districts 1994-1995rdquo British Medical Journal 316(7137) 1065-70

United States Bureau of the Census 2000 Census of population and housing summary file 1 (SF1) United States Department of Commerce 2001

United States Bureau of the Census 2000 Census of population and housing summary file 3 (SF3) United States Department of Commerce 2002

United States Bureau of the Census 1990 Census of population and housing summary file 1 (SF1) United States Department of Commerce 1991

31

United States Bureau of the Census 1990 Census of population and housing summary file 3 (SF3) United States Department of Commerce 1992

United States Censusrsquo Monitoring Board ldquoFinal Report to Congressrdquo September 1 2001 httpgovinfolibraryunteducmbcmbpreportsfinal)report

United States Department of Health and Human Services (USDHHS) Agency for Healthcare Research and Quality 2007 ldquoAHRQ Quality Indicators Guide to Prevention Quality Indicators Hospital Admission for Ambulatory Care Sensitive Conditionsrdquo October 2001 Version 31 (March 12 2007) downloaded 2009 httpwwwqualityindicatorsahrqgov

United States Department of Health and Human Services (USDHHS) Centers for Disease Control 2007 Health United States A Chartbook National Center for Health Statistics wwwcdcgovnchsdatahushus07pdf

United States Department of Health and Human Services (USDHHS) Centers for Disease Control National Center for Health Statistics 2007 ldquo2005 National Hospital Discharge Surveyrdquo No 385 by De Frances C and Hall M Division of Health Care Statistics

United States Department of Health and Human Services (USDHHS) 2009 Centers for Disease Control and Prevention National Center for Environmental Health 4th National Report on Environmental Chemicals httpwwwcdcgovexposurereportpdfFourthReportpdf

United States Department of Health and Human Services (USDHHS) National Institutes of Health 2010 National Institute of Environmental Health Sciences National Toxicology Program 11th Report on Carcinogens httpntpniehsnihgovntproceleventhprofiless149pcbpdf

United States Environmental Protection Agency (USEPA) 2002 EPA Superfund record of decision Hudson River PCBs Site Hudson Falls to New York City New York Washington DC

Wakefield J 2003 Sensitivity Analyses for Ecological Regression Biometrics March 59 9-17

Wakefield J 2004

Wing S 1988 Social inequalities in the decline of coronary mortality American Journal of Public Health 781415-1416

World Health Organization 2010 Atlas of Heart Disease and Stroke Part 2 Ch 3 Cardiovascular Disease Risk Factors httpwwwwhointcardiovascular_diseasesencvd_atlas_03_risk_factorspdf

32

Table 1 Previous Study Results Statewide Analyses and Hudson River Specific Analyses

Principal Diagnosis of Hospitalization

Adjusted Rate Ratioa (Lower 95CI shyUpper 95 CI)

NYS analysis Hudson River Subset

analysis Ischemic Heart Diseaseb

115 (103-129) 136 (118-156) Acute Myocardial Infarction 120 (103-139) 139 (118-163)

Cerebrovascular Diseasec 115 (105-126) 120 (110-132)

Ischemic Stroke 117(104-139) --shy Hemorrhagic Stroke 110 (099-122) ---

Hypertensionc 119 (108-131) 114 (105-123)

Table adapted from Sergeev et al 2005 Shcherbatykh et al 2005 and Huang et al 2006 Ischemic heart disease and stroke hospitalization rates for NYS ZIP codes containing (or

adjacent to) inactive hazardous waste sites with persistent organic pollutants compared to ZIP codes without (or not adjacent to) such sites

a All analyses include a 2nd exposure category for other types of inactive hazardous waste sites and adjust for age race and sex

b Ischemic heart disease analyses include all ages over 25 Ischemic heart disease analyses adjust for quartiles of median household income and health insurance coverage

c Cerebrovascular disease and hypertension analyses include ages 25-64

33

Table 2 Comparison of Design amp Methods for Previously Published Studies and Current Follow-up Study

OUTCOME ASSESSMENT CONFOUNDER ASSESSMENT EXPOSURE ASSESSMENT Previous Study Follow-up Study Previous Study Follow-up Study Previous Study Follow-up Study

Coronary Heart Disease 1993-2000 More years 1990-2005 ZIP code level Block group level ZIP code level Block group level

Ischemic heart disease (ICD-9 410shy Same outcomes with 4 levels of income ndash Same income Exposure = residence Two types of exposure 414) and sequelae of myocardial one addition separate 2000 Census categories within 196 ZIP codes indicator categories within infarction not elsewhere classified analysis of angina (ICDshy quartiles with hazardous waste 12 counties abutting (ICD-9 4297) Separate analysis of 9 413) sites containing POPs Hudson River acute myocardial infarction 6 age groups Same age groups or 222 ZIP codes with (ICD-9 410) age 25 and above hazardous waste sites Adjacency 255 block

without POPs versus groups adjacent versus Principal or other diagnoses (up to 14) Only principal diagnoses 4 race groups Different race residence within 996 2116 not adjacent to the combined AsianPacific categories White ldquocleanrdquo ZIP codes Hudson River

Use the admission Islander White black AsianPacific statewide Reference Sergeev AV Carpenter source field to eliminate black Native Island and Other which Distance Block group DO Hospitalization rates for coronary intrahospital transfers for American includes Native Sub-analyses (focus of population-weighted heart disease in relation to residence the same individual American and Multiple follow-up) Hudson centroid distance from the near areas contaminated with Race River area 78 POP ZIP Hudson River lt12 mile persistent organic pollutants and other codes versus 996 ldquocleanrdquo (225 block groups) pollutants Environ Health Perspect Add education ZIP codes statewide between frac12 and 1 mile 2005 113(6)756-761 population density (216 block groups) farther

hispanic and distance to than 1 mile (1930 block nearest hospital as groups) additional covariates

Stroke 1993-2000 More years 1990-2005 Restricted to 2nd and (same as above) (same as above) (same as above) 3rd quartile income

Cerebrovascular disease excluding Same outcomes ZIP codes other ill-defined and late effects of cerebrovascular disease (ICD9 430shy 4 age groups 25-64 436) Separate analyses of ischemic stroke (433x1 434x1 436) and 2 race groups hemorrhagic stroke ( 430-432) White black

Principal or secondary diagnoses (up Only principal diagnoses to 14) combined

Use the admission Reference Shcherbatykh I Huang X source field to eliminate Lessner L Carpenter DO Hazardous intrahospital transfers for waste sites and stroke in New York the same individual State Environ Health 2005 418

34

Table 3 Study Cardiovascular Disease Outcomes ICD-9-CM Codes and Definitions

Ischemic Heart Disease

ICD-9-CM 410 - ldquoAcute myocardial infarction (AMI)rdquo is defined as a sudden insufficiency of blood supply to an area of the heart muscle usually due to a coronary artery occlusion (obstruction) commonly known as heart attack

ICD-9-CM 411- ldquoOther acute and subacute forms of ischemic heart diseaserdquo refers to complications or symptoms occurring before or after AMI or a coronary artery occlusion (obstruction) interrupting blood flow to the heart without AMI

ICD-9-CM 412 - ldquoOld myocardial infarctionrdquo refers to a past AMI that is currently presenting no symptoms ICD-9-CM 413 - ldquoAngina pectorisrdquo is defined as severe constricting pain in the chest often radiating from the

area immediately over the heart to the left shoulder and down the arm ICD-9-CM 414 - ldquoOther forms of chronic ischemic heart diseaserdquo includes coronary atherosclerosis a chronic

condition marked by thickening and loss of elasticity of the coronary artery caused by deposits of plaque

ICD-9-CM 4297 - ldquoCertain sequelae of myocardial infarction not elsewhere classifiedrdquo refers to other symptoms or complications associated with a prior AMI

Cerebrovascular Disease

ICD-9-CM 430 - ldquoSubarachnoid hemorrhagerdquo is bleeding in the space between the brain and lining ICD-9-CM 431 - ldquoIntracerebral hemorrhagerdquo is bleeding within the brain ICD-9-CM 432 - ldquoOther and unspecified intracranial hemorrhagerdquo is bleeding nontraumatic between the skull

and brain lining ICD-9-CM 433 - ldquoOcclusion and stenosis of precerebral arteriesrdquo is blockage or stricture of the arteries

branching into the brain ICD-9-CM 434 - ldquoOcclusion of cerebral arteriesrdquo includes cerebral thrombosis and cerebral embolism ICD-9-CM 435 - ldquoTransient cerebral ischemiardquo includes cerebrovascular insufficiency (acute) with transient

neurological signs and symptoms ICD-9-CM 436 - ldquoAcute but ill-defined cerebrovascular diseaserdquo

35

Table 4 ndash Census Demographics for Study Area by Proximity to the Hudson River 2000

Census Demographics Study Area Within Block Groups Adjacent to River Block Group Centroid Distance to River Yes No lt frac12 mile frac12-1 mile gt 1 mile

n n n n n n Total Population and distribution within river distance categories

2924631 1000 315662 108 2608969 892 231414 79 283321 97 2409896 824

RaceEthnic Distribution 1000 1000 1000 1000 1000 1000 White 2383274 815 255023 808 2128251 816 154371 667 201457 711 2027446 841 Black 277302 95 34039 108 243263 93 43895 190 38959 138 194448 81 Total ldquoOtherrdquo (listed below)

American Indian 7147 02 882 03 6265 02 975 04 1058 04 5114 02

Asian 86465 30 6233 20 80232 31 4711 20 8979 32 72775 30 Pacific Islander 1058 lt01 111 lt01 947 lt01 104 lt01 145 lt01 809 lt01 Other 105182 36 12492 40 92690 36 19288 83 23338 82 62556 26 Multiple Races 64203 22 6882 22 57321 22 8070 35 9385 33 46748 19

Hispanic 268036 92 28449 90 239587 92 40691 176 46978 166 180367 75

Education lt9th grade 108312 56 13370 64 94942 55 13585 91 14518 81 80209 50 9th-12th grade 200374 103 27023 130 173351 100 23306 155 25161 140 151907 94 High school or GED 519236 267 62727 302 456509 263 44366 296 51585 287 423285 263 Some college 336283 173 36464 175 299819 173 24995 167 31141 173 280147 174 Associates degree 150134 77 16574 80 133560 77 10134 68 12939 72 127061 79 Bachelors degree 342297 176 28578 137 313719 181 18898 126 24584 137 298815 185 Graduate or professional degree

285823 147 23199 112 262624 151 14813 99 19909 111 251101 156

+Other SES Factors Households below poverty 91940 87 12875 108 79065 83 13977 154 14685 140 63278 72

mean SD mean SD mean SD mean SD mean SD mean SD Population density 57466 9589 54496 11391 57826 93468 13501 16071 11068 16248 43763 66909 Median household income 61010 31989 47320 23630 62668 32468 40748 21923 44962 20829 64847 32564

Sources US Bureau of the Census 2000 Census of population and housing summary file 1(SF1) US Department of Commerce 2001 US Bureau of the Census 2000 Census of population and housing summary file 3 (SF3) US Department of Commerce 2001

36

Table 5 Study Population Person Years by River Proximity Category For Income Quartiles and Race Categories

Person Years Total Adjacent Non-Adjacent lt12 mile frac12 to 1 mile gt1 mile n (row)

[column ] n (row)

[column ] n ()

[column ] n ()

[column ] n ()

[column ] n ()

[column ] Total Population 30219001 (1000)

[1000]

3269547 (108)

[1000]

26949454 (892)

[1000]

2380477 (079)

[1000]

2834956 (094)

[1000]

25003568 (827)

[1000]

Income Quartile

Lowest Income 7182853 (1000)

[237]

1345251 (187)

[411]

5837601 (813)

[217]

1462710 (204)

[614]

1236936 (172)

[436]

4483206 (624)

[179]

2nd Income 7778921 (1000)

[257]

1077888 (139)

[330]

6701033 (861

[249]

420361 (054)

[177]

808782 (104)

[285]

6549777 (842)

[262]

3rd Income 7671943 (1000)

[254]

551617 (072)

[169]

7120326 (928)

[264]

344840 (045)

[145]

511237 (067)

[180]

6815866 (888)

[273]

Highest Income 7585283 (1000)

[251]

294790 (039)

[090]

7290493 (961)

[271]

152565 (020)

[064]

278000 (037)

[098]

7154718 (943)

[286]

Race Category

White 25859811 (1000)

[856]

2784914 (108)

[851]

23074897 (892)

[856]

1761910 (68)

[740]

2238348 (87)

[790]

21859554 (845)

[874]

Black 2466185 (1000

[82]

297220 (121)

[91]

2168965 (879)

[81]

389721 (158)

[164]

316704 (128)

[112]

1759760 (714)

[70]

Asian 792737 (1000)

[26]

58007 (73)

[18]

734731 (827)

[27]

45151 (57)

[19]

78161 (99)

[28]

669426 (844)

[27]

Multiple Other 1100269 (1000)

[36]

129407 (118)

[40]

970862 (882)

[36]

183696 (167)

[77]

201743 (183)

[71]

714830 (650)

[29]

37

Table 6 Study Hospitalization Record Exclusions

After After dropping After imputing dropping records with

After excluding race amp poorly missing continuation After excluding eliminating geocoded covariates

Geocoded records transfers missing race records (racesex)

Total 202272 201814 02 169143 162 167733 08 154236 80 154220 001 (Loss ) Demograp hic group

White 155216 849 154805 849 132749 859 144262 860 132343 858 132332 858 Black 12878 70 12849 70 11109 72 12010 72 11328 73 11326 73 OtherNa 12745 70 12741 70 8814 57 9313 55 8647 56 8644 56 AsianPI 2021 11 2020 11 1810 12 2148 13 1918 12 1918 12 Missing Race 19412 96 19399 96 14661 87 (1410) Diagnosis

IHD 137569 680 137303 680 111005 656 110005 656 110000 656 101012 655

AMI 51840 256 51635 256 39338 233 39338 233 39013 233 35811 232

Angina Stroke

5792 64703

29 320

5788 64511

29 320

5316 58138

31 344

5316 58138

31 344

5285 57712

31 344

4731 53208

31 345

Isch str 23673 117 23543 117 21012 124 21012 124 20912 125 19371 126 Hem str 8097 40 8074 40 6544 39 6444 39 6492 39 5995 39

Missing covariates 11 records were missing sex 325 were missing age and 14661 records were missing race Race was imputed by random assignment based on block group of residencersquos race distribution for 13251 records (310 were also missing other covariates For approximately 1100 records with missing race geocoding was inadequate for assigning block group and 310 of those missing race were also missing additional covariates) One discharge can have one or more covariates missing (ie missing covariates are not mutually exclusive

38

Table 7 Income Quartile and Race Distributions for CVD Hospitalizations

Study Population

IHD n=101012

AMI n=35811

Angina n=4731

Stroke n=53208

Ischemic Stroke n=19371

Hemorrhagic Stroke n=5995

Income (Quartiles) Lowest Income 237 286 300 356 298 309 301 Second Income 257 254 265 237 256 256 252 Third Income 254 241 236 224 236 234 223 Highest Income 251 221 201 182 209 201 224

Total 1000 999 999 1000 1000

White 855 859 885 833 858 844 809 Black 82 64 62 102 92 108 110 Asian 26 12 12 11 12 14 19 Other 36 65 41 53 38 34 62

Study population income quartiles are based on census block median income Quartile cut-points were developed by weighting block group median income levels by block group population to identify the cut-points within the continuous income distribution After creating the cut-points the study population of each block group was assigned to the appropriate quartile category This resulted in the percentage of population in each quartile varying slightly from 25 because the entire populations of block groups were assigned to the same quartile

39

Table 8a Exposure Area Distributions and Rate Ratios for Ischemic Heart Disease Hospitalizations

Exposure Person-Years

(N=30219001) in

Ischemic Heart Disease Hospitalizations Total AMI Angina

No () No () RR (95 CI) No () RR (95 CI) No () RR (95 CI)

Adjusted for Age Sex Race Adjacent Block Group Yes 3269547 (108) 12327 (122) 133 (112-157) 4398 (123) 136 (114-163) 591 (125) 123 (106-142) No 26949454 (892) 88611 (878) 100 Reference 31380 (877) 100 Reference 4137 (875) 100 Reference Distance From River lt frac12 mile 2380477 (079) 9817 (97) 120 (103-140) 3399 (095) 126 (109-145) 542 (115) 160 (137-188) frac12 - 1 mile 2834956 (094) 11236 (111) 120 (104-140) 4048 (113) 125 (109-144) 610 (129) 156 (133-183) gt1 mile 25003569 (827) 79885 (791) 100 Reference 28331(792) 100 Reference 3576 (756) 100 Reference

Addrsquol adjustment for SES and other factors Adjacent Block Group Yes 3269547 (108) 12327 (122) 107 (104-111) 4398 (123) 106 (101-110) 591 (125) 106 (096-117) No 26949454 (892) 88611 (878) 100 Reference 31380 (877) 100 Reference 4137 (875) 100 Reference Distance From River lt frac12 mile 2380477 (079) 9817 (97) 102 (098-106) 3399 (095) 103 (098-109) 542 (115) 109 (098-121) frac12 - 1 mile 2834956 (094) 11236 (111) 103 (099-108) 4048 (113) 106 (101-111) 610 (129) 113 (102-125) gt 1 mile 25003569 (827) 79885 (791) 100 Reference 28331(792) 100 Reference 3576 (756) 100 Reference

Agegt25 1990-2005 50 sample Adjusted for age (25-34 35-44 45-54 55-64 65-74 75+ years) race (white black other) and sex (male female) Adjusted for SES factors quartiles of population density (61-5066 5066lt-21020 21020lt-64102 64102lt-943716 persons per square mile) median 1999 household income ($249900-4096800 $4096800lt-5465000 $5465000lt-7294400 $7294400lt-20000100) with less than a college education (0-306 306lt-427 427lt-555 555lt-1000) with Hispanic ethnicity (0-21 21lt-47 47lt- 96 96lt-860) and distance to nearest hospital

40

Table 8b Exposure Area Distributions and Rate Ratios for Stroke Hospitalizations Age gt= 25 1990-2005

Exposure Person-Years

(N=30219001) in Study Area

Stroke Hospitalizations

Total Ischemic Hemorrhagic

No () No () RR (95 CI) No () RR (95 CI) No () RR (95 CI)

Adjusted for Age Sex Race Adjacent Block Group Yes 3269547 (108) 6276 (118) 125 (109-144) 2334 (121) 128 (110-148) 635 (108) 099 (086-113)

No 26949454 (892) 46748 (882) 100 Reference 16989 (879) 100 Reference 5263 (892) 100 Reference

Distance From River lt frac12 mile 2380477 (079) 5280 (099) 116 (104-131) 1925 (100) 120 (105-137) 518 (088) 102 (088-117)

frac12 - 1 mile 2834956 (094) 6023 (114) 118 (106-132) 2227 (115) 127 (112-145) 658 (111) 115 (100-131)

gt1 mile 25003569 (827) 41721 (787) 100 Reference 15171 (785) 100 Reference 4722 (801) 100 Reference

Addl Adjustment for SES and Other Factors Adjacent Block Group Yes 3269547 (108) 6276 (118) 107 (103-112) 2334 (121) 110 (104-116) 635 (108) 097 (088 ndash 106)

No 26949454 (892) 46748 (882) 100 Reference 16989 (879) 100 Reference 5263 (892) 100 Reference Distance From River lt frac12 mile 2380477 (079) 5280 (099) 106 (101-110) 1925 (100) 108 (101-115) 518 (088) 089 (081-099)

frac12 - 1 mile 2834956 (094) 6023 (114) 106 (102-111) 2227 (115) 111 (104-117) 658 (111) 108 (099-119)

gt 1 mile 25003569 (827) 41721 (787) 100 Reference 15171 (785) 100 Reference 4722 (801) 100 Reference

Age gt 25 years 1990-200650 sample Adjusted for age (25-34 35-44 45-54 55-64 65-74 75+ years) race (white black other) and sex (male female) Adjusted for SES factors quartiles of population density (61-5066 5066lt-21020 21020lt-64102 64102lt-943716 persons per square mile) median 1999 household income ($249900-4096800 $4096800lt-5465000 $5465000lt-7294400 $7294400lt-20000100) with less than a college education (0-306 306lt-427 427lt-555 555lt-1000) with Hispanic ethnicity (0-21 21lt-47 47lt- 96 96lt-860) and distance to nearest hospital

41

Table 9 Rate Ratios for CVD Hospitalizations Stratified by Income Quartile

Rate Ratio for lt frac12 mile versus gt 1 mile from river

IHD AMI Angina Stroke Ischemic Stroke Hemorrhagic Stroke

RR (95CI) RR (95CI) RR (95CI) RR (95CI) RR (95CI) RR (95CI)

Adjusted for Age Sex Race Lowest Income 1 123 (108-140) 124 (110-139) 139 (120-160) 120 (109-133) 122 (107-138) 118 (103-137) Income 2 115 (101-131) 118 (101-139) 155 (124-195) 124 (113-135) 127 (109-147) --shyIncome 3 088 (075-103) 083 (068-102) 102 (076-137) 092 (079-107) 093 (076-113) --shyHighest Income 4 080 (064-099) 074 (054-103) 030 (013-070) 078 (065-093) 071 (053-095) 043 (024-076) Weighted average 100 098 091 102 100 071

Addrsquol Adjustment for other demographic factors Lowest Income 1 114 (107-120) 114 (106-123) 112 (097-130) 116 (109-124) 120 (110-131) 109 (095-125) Income 2 110 (102-118) 111 (101-123) 136 (108-172) 110 (101-119) 112 (099-128) 095 (076-119) Income 3 085 (077-094) 086 (075-097) 093 (068-126) 097 (087-108) 096 (082-112) 046 (033-066) Highest Income 4 068 (060-078) 069 (057-085) 028 (012-062) 069 (059-081) 058 (044-077) --shyWeighted average 092 093 079 096 093 078

To reduce the complexity of this Table only one of the distance categories is provided The other distance categories showed similar results Adjusted for age (25-34 35-44 45-54 55-64 65-74 75+ years) race (white black other) and sex (male female) Block group median income quartile ranges $249900-4096800 $4096800lt-5465000 $5465000lt-7294400 $7294400lt-20000100 Adjusted for demographic factors quartiles of population density (61-5066 5066lt-21020 21020lt-64102 64102lt-943716 persons per square mile) with less than a college education (0-306 306lt-427 427lt-555 555lt-1000) with Hispanic ethnicity (0-21 21lt-47 47lt- 96 96lt-860) and distance to nearest hospital

42

Table 10 Hospitalization Rate Estimates for Current Study and from Published Sources (age-adjusted to 2000 US standard per 10000)

Current Study 12 Counties abutting Hudson River 1990-2005

Total US CDC 2005

Northeastern US CDC 2005

SPARCS NYS excluding NYC 2005-2007

IHD 649 619 691

AMI 232 231 278

Angina 31 Stroke 349 303 307 351

Ischemic Stroke 127 39 Hemorrhagic Stroke

Source USDHHS 2005 National Hospital Discharge Survey No 385 July 12 2007 Source NYS DOH SPARCS 2009b Comparable estimates were not able to be provided because published outcome groupings differed

43

Figure 1 ZIP codes by Adjacency to the Hudson River

44

Figure 2 Census Block Groups by Adjacency to the Hudson River

45

Figure 3 Census Block Groups by Population-Weighted Centroid Distance to the Hudson River

46

IHD Hospitalization Rates by Sex Race Income (age-adjusted) (other race

group excluded)

000

200

400

600

800

1000

1200

1400

1600

Low est IncQ

Q2 Q3 Highest Inc Q

Income Quartile

Hos

pita

lizat

ions

per

10

00

w hite men black men

asian men w hite w omen

black w omen asian w omen

AMI Hospitalization Rates by Sex Race Income (age-adjusted) (other race

group excluded)

000

100

200

300

400

500

600

700

Low est IncQ

Q2 Q3 Highest Inc Q Income Quartile

Hos

pita

lizat

ions

per

10

00

w hite men black men

asian men w hite w omen

black w omen asian w omen

Stroke Hospitalization Rates by Sex Race Income (age-adjusted) (other

race group excluded)

000

100

200

300

400

500

600

700

800

900

Low est In Q

Q2 Q3 Highest Inc Q Income quartile

Hos

pita

lizat

ions

per

10

00w hite men black men

asian men w hite w omen

black w omen asian w omen

Figure 4 Figure 5 Figure 6

47

MAURICE D HINCHEY 22NO DISTRICT NEW YORK

COMMITTEE ON APPROPRIATIONS

~lIC()foltIIInas

AGRICUJTlAE RURAL OEElOPWNT rooo AND ORUG ADMINISTRATION

AND RBATEO AGEfjC~S

INTERIOR

~onlJltss of tl)t ~nttn ~tatt5 ~oU5t of ItpttUtlltl1tibeg lHaJington 111laquo 20515-3222

October 4 2006

Howard Frumkin MD DrPH Director Agency for Toxic Substances and

Disease Registty and CDCs National Center for Environmental Health

1600 Clifton Road NE MSE-28 Atlanta GA 30333

Dear Dr Frumkin

First I would like to congratulate you on your appoinbnent as Director of A TSDR and CDCs National Center for Environmental Health llook forward to the opportunity to work with you on important public health issues this year and in the future

My second reason for writing today is to request that ATSDR assist my office in conducting a health consultation study for residents along a 200 mile segment of the Hudson River a Superfund site on the National Priority List containing PCBs in order to measure whether further evaluation may be warranted

As you may know Dr David Carpenter has done a number of studies at SUNY Albany in New York that suggest that resideilcy in proximity to hazardous waste sites with Persistant Organic Pollutants (POPs) may be associated with increased hospitalization rates for Coronary Heart Disease (CHD) when compared to hospitalization rates tor areas with no waste sites or other types of waste not catagorized as POPs I have enclosed a copy of three of these research papers that demonstrate this correlation for heart disease hypertension and stroke

I am very concerned about the health of the residents living along the Hudson River and respectfully request that your agency look into what seems to be a very serious health threat For purposes of my request T would appreciate your agency looking into the subset population Dr Carpenter examined in 78 zip codes along the 200 mile stretch of the Hudson River from Hudson Falls to New York City This subset population studied by Dr Carpenter showed a statistically significant increase in hospital discharges for heart disease hypertension and stroke Interestingly according to CDCs data base the Behavioral Risk Factor Surveillance System this population on average has higher incomes exercises more frequently and conswnes more fruits and vegetables than the average New Yorker Additionally this population has more nonsmokers than the rest of Upstate New Yorks population

WASHINGTON OIFCE

2411 RAYBURN HOUSE OfFICE BUILDING WASH1NcrroN DC 20amp1amp-3222

(Zl2) 225-6335

_fOMetoYht lnchIY

BlNGI-UMTON OfRCE m-wA OfFICE KIGSTCN OFFIepound MIDDLnWJN OffICE MOJmCEllO OffiCE IDDA IfOCML lUllDlNG 123 SOUTH CAYJOA smra U l WAll

m STREET em HALL THlflD flOOR 18 ANWANf LAKf CAD

RIIrlGHAtATON NY 13901 SUITE 201 KINGSTOfl 12401 te JUpoundS STilEET MONTlCfllO NY 12701 (6171773-2768 ITHACA HY 14860 (M5)3~ MlDgtlETOWN NY 109(0 (amptS) 7S1-71e

(607) 273-~388 (845) l4C-3l11

Appendix A Representative Hinchey Letter

Dr Howard Frumkin Page 2 October 4 2006

Since this type of population would typically exhibit better than average cardiac health Im sme you can understand and appreciate my concern and sense of urgency that we must follow up on Dr Carpenters good work and detcnnine what further study is warranted to protect the health of the populatioD living and working aloDg the Hudson River

I than1c you for yom kind attention to my request I look forward to hearing back from you at your earliest convenience

Best regards

Sincerely

~ane~ey Member of Congress

MHlag

49

Appendix B Table 1a IHD Hospitalization Rate Ratios agesexrace adjusted versus additionally adjusted For residents (age 25 and older) of block groups closer versus more distant from the Hudson River in models adjusting for sex age race and block group median household income population density education level percent Hispanic and block group centroid distance to nearest hospital 1990-2005

Adjusted Rate Ratios by Block Group River Proximity (95 Confidence Intervals) IHD (N=101012) AMI (N=35811) Angina (N=4731)

Risk Factor Agesexrace adjusted

Addrsquol adjustment Agesexrace adjusted

Addrsquol adjustment Agesexrace adjusted

Addrsquol adjustment

Block group distance from River

lt frac12 mile vs gt 1 mile 120 (103-140) 102 (098-106) 126 (109-145) 103 (098-109) 160 (137-188) 109 (098-121) frac12 - 1 mile vs gt 1 mile 120 (104-140) 103 (099-107) 125 (109-144) 106 (101-111) 156 (133-183) 113 (102-125)

Sex Male vs female 197 (174-223) 214 (208-220) 211 (187-238) 209 (202-215) 113 (099-129) 113 (106-121) Race Black vs white 120 (102-142) 093 (089-097) 109 (094-127) 092 (087-097) 178 (152-208) 121 (108-136)

AsianPacific Is ldquo 080 (066-096) 073 (069-079) 085 (070-103) 072 (065-081) 075 (055-102) 065 (049-087) Other ldquo 246 (208-291) 377 (363-392) 189 (162-220) 204 (191-218) 257 (215-307) 205 (177-238)

Age 35-44 vs 25-34 809 (630-104) 824 (743-913) 647 (494-849) 658 (568-762) 491 (348-692) 498 (376-660) 45-54 ldquo 333 (261-426) 344 (312-380) 228 (175-296) 230 (200-265) 1539 (1108-214) 157 (120-206) 55-64 ldquo 827 (647-1057) 802 (727-885) 576 (443-748) 516 (449-593) 278 (200-386) 284 (217-371) 65-74 ldquo 1477 (1155-1890) 1384 (1255-1527) 1067 (820-1387) 899 (782-1033) 409 (294-567) 449 (344-586) 75+ ldquo 2124 (1656-2724) 1942 (1760-2142) 2190 (1683-2849) 1849 (1610-2123) 661 (476-919) 695 (533-905)

Population density

2nd lowest vs least dense

126 (121-132) 131 (124-137) 142 (127-159)

more dense ldquo 130 (124-136) 134 (127-141) 131 (116-147) most dense ldquo 120 (114-126) 121 (114-129) 115 (100-131)

Median Household Income

2nd highest vs highest income

105 (101-110) 113 (108-119) 123 (110-137)

lower ldquo 109 (104-113) 123 (117-130) 125 (111-141) lowest ldquo 116 (111-122) 129 (122-137) 162 (142-185)

with less than college

2nd lowest vs least college

114 (109-118) 118 (113-124) 110 (098-122)

higher ldquo 122 (117-127) 126 (119-132) 115(102-129) highest

ldquo

122 (116-128) 128 (120-135) 129 (113-148)

Hispanic 2nd lowest vs least Hispanic

093 (089-097) 094 (0890-098) 101 (091-112)

higher ldquo 092 (088-096) 096 (092-101) 118 (107-131) highest

ldquo

085 (081-088) 088 (084-093) 138 (123-154)

Distance to Hospital

2nd closest vs closest 097 (094-100) 094 (090-098) 107 (098-118) more distant ldquo 090 (087-094) 093 (089-098) 096 (086-107) furthest ldquo 087 (083-091) 091 (087-096) 089 (079-101)

50

Appendix B Table 1b Stroke Hospitalization Rate Ratios agesexrace adjusted versus additionally adjusted For residents (age 25 and older) of block groups closer versus more distant from the Hudson River in models adjusting for sex age race and block group median household income population density education level percent Hispanic and block group centroid distance to nearest hospital 1990-2005

Adjusted Rate Ratios by Block Group River Proximity (95 Confidence Intervals) Total Stroke (N=53208) Ischemic Stroke (N=19371) Hemorrhagic Stroke (N=5995)

Risk Factor Agesexrace adjusted

Addrsquol adjustment Agesexrace adjusted

Addrsquol adjustment Agesexrace adjusted

Addrsquol adjustment

Block group distance from River

lt frac12 mile vs gt 1 mile 116 (104-131) 106 (101-110) 120 (105-137) 108 (101-115) 102 (088-117) 089 (081-099) frac12 - 1 mile vs gt 1 mile 118 (106-132) 106 (102-111) 127 (112-145) 111 (104-117) 115 (100-131) 108 (099-119)

Sex Male vs female 114 (104-125) 127 (123-130) 126 (113-141) 131 (126-136) 114 (103-127) 123 (116-130) Race Black vs white 194 (173-219) 153 (147-160) 233 (204-267) 191 (180-203) 215 (188-246) 190 (173-209)

AsianPacific Isl ldquo 097 (083-113) 088 (080-096) 114 (094-138) 101 (088-116) 142 (114-177) 129 (106-157) Other ldquo 230 (203-260) 232 (219-245) 229 (197-265) 212 (193-232) 360 (310-417) 334 (296-377)

Age 35-44 vs 25-34 309 (251-379) 303 (269-343) 359 (267-483) 362 (289-455) 274 (212-353) 268 (219-329) 45-54 ldquo 114 (938-139) 1087 (970-1217) 149 (113-197) 135 (109-167) 655 (513-836) 605 (500-734) 55-64 ldquo 331 (273-402) 329 (295-368) 433 (329-569) 418 (340-514) 119 (933-152) 109 (904-132) 65-74 ldquo 826 (682-1001 861 (772-959) 1145 (872-1504) 1114 (908-1367) 214 (168-272) 224 (186-270) 75+ ldquo 1890 (1559-2290) 2037 (1828-2269) 2768 (2112-3627) 3060 (2497-3750) 554 (438-700) 587 (490-703)

Population density

2nd lowest vs least dense

126 (120-132) 124 (117-132) 129 (117-142)

more dense ldquo 126 (120-132) 121 (113-129) 142 (128-157) most dense ldquo 124 (118-132) 128 (119-138) 150 (133-169)

Median Household Income

2nd highest vs highest income

106 (101-110) 110 (103-117) 099 (090-108)

lower ldquo 113 (108-118) 113 (106-121) 107 (097-118) lowest ldquo 116 (110-122) 117 (108-126) 114 (102-127)

with less than college

2nd lowest vs least college

111 (107-116) 112 (106-119) 098 (090-107)

higher ldquo 120 (114-125) 120 (113-128) 102 (093-112) highest ldquo 116 (110-122) 119 (110-128) 093 (090-108)

Hispanic 2nd lowest vs least Hispanic

096 (092-100) 094 (088-099) 084 (077-091)

higher ldquo 101 (096-105) 096 (091-102) 084 (077-091) highest ldquo 096 (092-100) 088 (082-093) 078 (070-085)

Distance to Hospital

2nd closest vs closest 095 (091-098) 092 (087-097) 097 (089-105) more distant ldquo 092 (088-096) 091 (086-097) 089 (081-097) furthest ldquo 085 (081-090) 082 (077-088) 088 (079-098)

51

Appendix B Table 2 Sex and Race-Specific Age-Adjusted IHD AMI and Stroke Rates by Income (per 1000)

Men Women white black asian other white black asian other

IHD Lowest IncQ 470 401 492 940 283 354 477 543 Q2 425 339 675 1443 221 236 364 690 Q3 436 301 420 2463 199 233 236 902 Highest Inc Q 399 325 218 4307 160 172 120 1247 AMI Lowest IncQ 180 154 203 269 101 112 221 159 Q2 159 141 300 306 085 088 138 179 Q3 152 114 139 480 078 084 096 214 Highest Inc Q 135 130 046 820 062 065 052 275 Stroke Lowest IncQ 201 292 330 370 166 281 343 261 Q2 182 254 389 466 150 247 274 373 Q3 192 238 231 598 145 245 201 401 Highest Inc Q 177 275 102 905 132 197 070 776

These rates are estimated from the 50 sample of hospitalizations in the 12-county study area included in this investigation The rates are adjusted to the 2000 standard US population

52

CERTIFICATION

The health consultation for the Hudson River PCBs site Residential Proximity to the Hudson River and Hospitalization Rates for Ischemic Heart Disease and Stroke 1990shy2005 was prepared by the New York State Department of Health under a cooperative agreement with the Agency for Toxic Substances and Disease Registry (ATSDR) It is in accordance with approved methodology and procedures existing at the time the health consultation was initiated Editorial review was completed by the cooperative agreement partner

The Division of Health Assessment and Consultation (DHAC) ATSDR has reviewed this public health assessment and concurs with its findings

ATSDR

Page 10: HR CVD Final HCl 2011

1 INTRODUCTION

The New York State Department of Health (NYS DOH) has a cooperative agreement with the Agency for Toxic Substances and Disease Registry (ATSDR) to perform environmental exposure and health assessments conduct health statistics reviews and perform epidemiological studies of populations in New York State which may have been exposed to environmental contaminants In October 2006 Member of Congress Maurice Hinchey requested that ATSDR conduct a health consultation to assess cardiovascular disease among people residing near the Hudson River A 200-mile segment of the Hudson River from Hudson Falls south to New York City is a National Priority List (NPL) (Superfund) site due to contamination of the river with polychlorinated biphenyls (PCBs) from GE sites in Hudson Falls and Ft Edward (See Appendix A for Member of Congress Hincheyrsquos letter)

In support of his request Member of Congress Hinchey cited three studies (Sergeev and Carpenter 2005 [ischemic heart disease] Shcherbatykh et al 2005 [stroke] Huang et al 2006 [hypertension]) These statewide studies suggest residents of ZIP codes containing hazardous waste sites with persistent organic pollutants (POPs) such as PCBs have elevated risk for hospitalizations for cardiovascular disease including ischemic heart disease stroke and hypertension Analyses conducted separately for ZIP codes next to the Hudson River suggest residents of these specific ZIP codes also have increased rates of hospitalization for cardiovascular disease

These studies published in 2005-2006 are ecological studies In an ecological study data are analyzed by evaluating risk factors and health outcomes among groups For example rates of disease are compared for populations of ZIP codes with varying median income levels to see if income levels appear to be related to disease outcome rates In these previously conducted studies rates of hospitalization for residents of ZIP codes with hazardous waste sites containing POPs were compared with hospitalization rates for residents of ZIP codes without such waste sites to look for evidence that potential exposures might be impacting health outcomes The studies report statistically significant elevations of cardiovascular disease hospitalizations in ZIP codes with POPs For coronary heart disease and a subset acute myocardial infarction the study showed elevations of approximately 15 and 20 respectively (Sergeev and Carpenter 2005) for stroke a 15 elevation (Shcherbatykh et al 2005) and for hypertension a 19 elevation (Huang et al 2006) in ZIP codes containing waste sites with POPs compared with hospitalization rates in other New York State ZIP codes (excluding NYC) that did not contain waste sites with POPs (Table 1)

In sub-analyses in these studies cardiovascular hospitalization rates for ZIP codes adjacent to the Hudson River were compared with cardiovascular hospitalization rates for ZIP codes in the rest of the state (excluding NYC) identified as not having inactive hazardous waste sites with POPs These sub-analyses showed statistically significantly elevated hospitalization rates of 36 and 39 for coronary heart disease and the subset acute myocardial infarction a 20 elevation for stroke and a 14 elevation for hypertension (Table 1) The Sergeev and Carpenter Shcherbatykh et al and Huang et al articles state that the Hudson River ZIP codes are in an area where people have higher income smoke less exercise more and have healthier diets This information is cited in these articles as evidence that known risk factors related to lower socioshy

1

economic status are likely not the cause for the elevated hospitalization rates in these Hudson River area ZIP codes

In these previously published reports Behavioral Risk Factor Surveillance System (BRFSS) data for entire counties are provided to support statements about the population of Hudson River ZIP codes having higher income and healthier lifestyles However because the ZIP codes adjacent to the Hudson River include only a proportion of the population of the counties abutting the river these county-level BRFSS data may be misleading regarding characteristics of the ZIP codes adjacent to the river (Figure 1) In fact data from the US Census show that the ZIP codes adjacent to the river have lower median income than the counties within which they are located The median income for the total population of the twelve counties adjacent to the Hudson River was $61010 in 2000 while the median income for the ZIP codes adjacent to the river was $45356 This median income for the population of the ZIP codes adjacent to the river is also slightly lower than the statewide (excluding NYC) median income of $47735 (US Census 2000) Thus the Hudson River ZIP codes analyzed in the Sergeev and Carpenter Shcherbatykh et al and Huang et al studies include a population of lower not higher socio-economic status compared to the rest of NYS (excluding NYC)

Because CVD and many CVD risk factors are associated with lower socio-economic status it is especially important to adjust as much as possible for socio-economic status when conducting comparative analyses of CVD hospitalizations or disease incidence or prevalence Regarding the hypothesis about POPs and CVD research studies described in more detail below have suggested possible biological mechanisms for POP exposuresrsquo effects on the heart There is very little research on human exposures to POPs and potential cardiovascular effects however

In response to Congressman Hincheyrsquos request for follow-up of the 2004-2005 findings ATSDR and NYS DOH agreed to conduct a similar study also ecological but at a different level of geography the Census block group By using a level of geography that is generally smaller in area and population than a ZIP code this follow-up seeks to more accurately adjust for socioshyeconomic and other factors that affect cardiovascular hospitalizations and to improve upon the geographic classification of relative distance to the Hudson River In addition rather than comparing hospitalizations among residents of Hudson River area ZIP codes to hospitalizations among residents of all other NYS (exclusive of NYC) ZIP codes with no waste sites with POPs this follow-up investigationrsquos analyses are restricted to the twelve counties abutting the Hudson river from Washington County south to Westchester County Within the study arearsquos twelve counties cardiovascular hospitalization rates in areas closest to the Hudson River are compared with rates in areas farther from the river This more localized approach may also assist with reducing the potential impact of regional differences in medical care practice or reporting factors that could affect the hospitalization data (Ko et al 2007) (Table 2 shows a more detailed listing of the methods used in the prior studies and follow-up study)

Regarding environmental exposures it is important to note that living near a hazardous waste site that contains PCBs or other POPs does not necessarily mean that human exposure from the site has occurred Depending upon the conditions at a site or the specific chemicals exposures may be highly improbable Information on body burden for example PCB levels in blood or on levels of the chemical in off-site air dust or soil is needed to support a claim of exposure The

2

studies published in 2005-2006 did not provide specific information such as environmental or biomonitoring data in support of the assumption that residence in ZIP codes containing hazardous waste sites with POPs or residence in ZIP codes adjacent to the Hudson River resulted in increased exposures to POPs in the population For the purpose of this follow-up we did not conduct environmental or human biological sampling to address this issue Rather this follow-up replicates the prior studiesrsquo approach with some enhancements to better control for confounding particularly confounding by socio-economic status

This descriptive epidemiologic investigation uses already existing hospitalization data from the Statewide Planning and Research Cooperative System and population data from the United States Census to evaluate comparative rates of cardiovascular hospitalizations While this type of investigation can not show proof of cause and effect it can assist in interpreting relationships among indicators of CVD burden and hypothesized exposures associated with distance from the Hudson River This descriptive investigation may also assist in determining whether additional health investigations of these associations are warranted

2 BACKGROUND

Cardiovascular Disease

The widely known risk factors for cardiovascular disease (CVD) in addition to age sex race and family history include tobacco use high blood pressure high serum cholesterol alcohol use obesity low fruit and vegetable intake physical inactivity diabetes low socio-economic status mental ill-health psychosocial stress and the use of certain medications (WHO 2010 NYS DOH 2009 Wing 1988 NYS DOH 2009a)

CVD Definitions CVD is a general term for any disease of the circulatory system The two main types of CVD are ischemic heart disease also termed coronary heart disease and cerebrovascular disease also called stroke Ischemic heart disease (IHD) results from restricted blood flow through the arteries supplying the heart muscle The most serious risk from IHD is acute myocardial infarction (AMI) another term for heart attack AMI occurs when blood flow to the heart is suddenly reduced or stopped usually due to a blockage in a coronary artery Angina is severe constricting pain in the chest often radiating from the area immediately over the heart to the left shoulder and down the arm Cerebrovascular disease refers primarily to stroke the sudden impairment of brain function resulting from interruption of circulation to the brain Ischemic stroke results from blockage of an artery supplying blood to the brain Hemorrhagic stroke occurs due to the escape of blood from a ruptured artery supplying blood to the brain

CVD Burden and Trends CVD is the leading cause of death for men and women of all ages in the United States and in New York State In 1999 for every death from IHD there were five hospitalizations for ischemic disease in NYS (NYS DOH 2009a) Mortality rates from CVD have been declining steadily since the 1980rsquos Research studies estimate that approximately half of the decline is due to improved treatments for CVD and its risk factors such as high cholesterol and high blood pressure and approximately half is due to decreasing incidence of CVD The decrease in incidence is due to trends in health behaviors primarily reduced smoking rates

3

CVD mortality rates do not appear to be declining for all segments of the population however and may actually be increasing among some groups particularly people of lower socio-economic status (Barnett et al 1999) This may be due in part to higher rates of smoking among people of lower socio-economic status as well as to other factors related to socio-economic status and race A growing literature points to multiple causes for CVD and other health disparities including health practices psychosocial stress limited resources discrimination and access to health care (Health United States 2007) Increases in the prevalence of diabetes and obesity in the general population partially offset advances in treatment and favorable trends for other risk factors An increasing ischemic heart disease mortality rate in women ages 35 to 54 since 1997 is attributed to these unfavorable risk factor trends (Ford 2007 Lloyd-Jones 2009) New York State (NYS) and other northeast states have high IHD and low stroke mortality with NYS having the highest IHD mortality and lowest stroke mortality among the 50 states (Howard et al 2009)

CVD Data Sources CVD mortality rates are more easily measured than incidence prevalence or hospitalizations because mortality data from vital records are a comprehensive and relatively accessible data source Information for estimating the incidence and prevalence of CVD outcomes comes primarily from health interview physical examination and record surveys Hospitalization data are available for NYS and the US but are generally collected and analyzed in order to track trends in hospital care and resources expended Actual CVD incidence data validated by physician diagnosis physician examination or reviews of medical records are not directly available for the entire US or for specific geographic areas of NYS Such information is available from population-based cohort studies that follow people over time in selected communities While the concern underlying this research project is to understand CVD levels in the community ie incidence and prevalence we do not have data that directly provide us with this information In the absence of such data hospitalizations are frequently used as indicators of the burden of disease in communities particularly in surveillance or hypothesis-generating investigations such as this one In NYS the Behavioral Risk Factor Surveillance System (BRFSS) an ongoing monthly telephone survey of adults aged 18 years and older provides information about the burden of CVD by age sex race income and education

CVD Risk Factor Data The BRFSS data for NYS show that over 20 of the population aged 65 and older reported having been diagnosed with CVD versus less than five percent of the population under 65 Males were slightly more likely to report having CVD than females Nonshyhispanic whites compared to other ethnic groups combined were slightly more likely to report having CVD None of these slight differences were statistically significant however The only statistically significant demographic risk factors in the BRFSS data were low income and education levels Statistically significant elevations of reported CVD were observed for respondents with incomes below $25000 and respondents with a high school degree or less education (NYS DOH 2009a)

Another factor that can affect the outcome measure hospitalizations is the likelihood of mortality prior to reaching the hospital Studies show a variety of factors affect whether an individual experiencing a heart attack or stroke outside the hospital survives to be counted in hospitalization data These factors include lack of knowledge about personal risk or prior disease socio-economic status race marital status and distance to the hospital (Galea 2007

4

Barnett 2006 Ayala 2003) Population-based studies report sudden cardiac death usually outside the hospital to be the first overt manifestation of heart disease for 40 to 60 of all cases ( Kannel 1987 deVreede-Swagemakers 1997 UK Heart Attack Study 1998) An analysis of out-of-hospital mortality from AMI in Pennsylvania in 1998 showed that in relatively rural areas where travel time to the hospital was 25 minutes or longer 72 of all AMI deaths occurred outside the hospital compared with a statewide average of 49 of AMI deaths occurring outside of the hospital (OrsquoNeill 2003) In New York State 55 of IHD deaths occur outside the hospital compared to 35 of stroke deaths (NYS DOH 2009b)

Hudson River PCB Contamination

During an approximate 30-year period ending in 1977 the General Electric (GE) plants in Hudson Falls and Fort Edward used PCBs in the manufacture of electrical capacitors and discharged large quantities of PCBs into the upper Hudson River Estimates of the total direct discharges to the river are as high as 1330000 pounds In 1983 the US EPA classified a 200shymile section of the Hudson River from Hudson Falls to New York City as a National Priority List site making it one of the largest Superfund sites in the United States In 2002 the US EPA issued a Record of Decision calling for targeted environmental dredging and removal of PCB-contaminated sediment in the upper River (USEPA 2002) The dredging began in May 2009

The Superfund environmental investigations and planned remedial actions have focused on the areas of the river near Hudson Falls and Fort Edward where the highest levels of PCBs in sediments are located The 2002 Record of Decision provides information about the human health risk assessment conducted for the Upper and Mid-Hudson and the ecological risk assessment conducted for the Upper and Lower Hudson River (USEPA 2002) Fish contamination and fish consumption are the primary concerns at the site

In 1976 fishing was banned in the upper river and in 1995 a catch and release policy replaced the ban NYS DOH continues to recommend people eat no fish from the Upper Hudson River that children under age 15 and women of childbearing age eat no fish from the entire 200 mile stretch of river below Hudson Falls and that the general population eat none of most species of fish caught between the federal dam at Troy and Catskill approximately 40 miles south of Troy The Lower Hudson remains closed to commercial fishing for striped bass and eight other species

The fishing bans and advisories were issued because ingestion of fish is the major potential pathway for exposure for the general population Many previous studies have shown relationships between consumption of fish from contaminated waters and human PCB levels Another potential pathway for exposure less frequently studied to date is inhalation of PCBs NYS DOH and ATSDR launched ldquoThe Hudson River Communities Projectrdquo in 2000 to address this issue (More information on this project is provided below)

POPS and PCBs

The category ldquopersistent organic pollutantsrdquo (POPs) includes man-made chemicals that are by definition persistent in the environment Examples of POPs are polychlorinated biphenyls

5

(PCBs) polychlorinated dibenzo-p-dioxins (PCDDs) polychorinated dibenso-furans (PCDFs) and pprsquo-diphenyldichloroethene (DDE) a breakdown product of the pesticide dichlorodiphenyltrichloroethane (DDT) These chemicals are stored in human fat tissues accumulating over an individualrsquos lifetime They also accumulate in fish and other species thereby concentrating up the food chain Levels of most POPs measured in blood serum of Americans have decreased in the last few decades An exception is polybrominated diphenyl ethers (PBDEs) levels of which have been increasing in recent years While levels of many POPs are declining a large number continue to be detectable in blood for most of the US population (USDHHS 2009)

This follow-up investigation focuses on the Hudson River where the POPs of concern are PCBs PCBs are a group of synthetic chlorinated compounds manufactured in the US through 1977 Production was halted due to concerns about the persistence of PCBs in the environment and the toxicity of PCBs shown in animal models PCBs were used primarily by the electrical industry for capacitors and transformers and were also used in hydraulic fluids fluorescent light fixtures flame retardants inks adhesives carbonless copy paper paints pesticide extenders plasticizers and wire insulators PCBs can be oily liquids or solids and some are volatile and can exist as vapor in air PCBs in the environment are generally mixtures of the many different types known as congeners (ATSDR 2000)

The variety of types of PCBs adds to the complexity of studying adverse animal and human health effects Increasingly PCB research focuses on specific PCB congeners and groups of congeners because specific types of congeners appear to have different mechanisms of action and toxicity Dioxin-like PCBs (DLPCBs) are an important subset of PCBs due to concerns about their toxicity The focus on dioxin-like PCBs is paralleled by increasing research on the group of POPs with dioxin-like properties This type of PCB and POP research assigns toxic potency values relative to dioxin to various PCBs and POPs to provide an overall toxic equivalency value to specific PCB and other POPs

From 2002 through 2007 NYS DOH and SUNY Albany School of Public Health researchers conducted the Hudson River Communities Study among residents of Hudson Falls and Fort Edward the villages where the PCB discharges from two GE electrical capacitor plants occurred The comparison community was upriver from the discharges in Glens Falls This studyrsquos evaluation of exposures included measuring PCBs in outdoor air and in blood serum The study found that PCB levels in outdoor air were statistically significantly higher in the Hudson FallsFort Edward area (072 nanograms per cubic meter) than in the upriver Glens Falls area (040 nanograms per cubic meter) but that the PCB concentrations in the Hudson FallsFort Edward area were lower than those in other communities with known PCB-contaminated sites and similar to levels reported in other locations in the northeastern United States A statistically significant reduction in the total PCB concentrations in outdoor air samples was observed at a distance of 1200 meters (34 mile) from the river from 0760 ngm3 to 0497 ngm3 (Fitzgerald et al 2007 Palmer et al 2008)

The average PCB concentrations in serum for the study and comparison populations did not differ significantly (307 ppb wet weight versus 323 ppb) The average toxic equivalency quantities (TEQ) for dioxin-like PCB congeners for the study and comparison area populations

6

also did not differ In addition the study found no detectable differences in study subjectsrsquo serum PCB levels associated with proximity to contaminated portions of the river or other PCB sources The study did find that lifetime consumption of Hudson River fish was associated with higher serum PCB levels (Fitzgerald 2007 Palmer 2008) Another recent biomonitoring study of Brooklyn Manhattan and New Jersey fisherman who consumed versus those who did not consume their catch from the lower Hudson River and New YorkNew Jersey harbor suggests eating fish from these waters is not associated with an increased body burden of PCBs or most other organochlorines This study did not address the inhalation pathway (Morland 2008)

PCBs and Cardiovascular Disease Outcomes

Research studies on adverse health effects associated with PCB exposure have focused on many types of outcomes organs and biological systems including cancer liver reproductive and developmental effects thyroid endocrine immune system and neurological effects as well as specific effects on the skin and eyes Relatively little research has focused to date on associations between PCB exposures and CVD outcomes The majority of human studies have assessed health effects in occupational groups having relatively high-level exposures among people consuming fish from contaminated waters or people exposed to relatively high levels due to accidental contamination of food (USDHHS NIH 2010)

PCBs have been shown to produce a variety of toxic effects in animals These effects include cancer allergies hypersensitivity damage to the central and peripheral nervous systems reproductive disorders and disruption of the immune system (ATSDR 2000) In contrast research to date does not tell us whether specific types of human health effects have occurred from the low-level exposures experienced by the general population Some human epidemiological studies of relatively low levels of PCDDs PCDFs and PCBs suggest possible relationships not only with neurodevelopment of infants but also diabetes and a condition called insulin resistance conditions that increase CVD risk (Longnecker 2001 Carpenter 2006 Arisawa 2005)

In 1998 a study of a population in Seveso Italy exposed to relatively pure TCDD (dioxin) from an industrial accident showed increased mortality from CVD respiratory diseases and diabetes (Bertazzi 1998) These findings suggested the possibility that other dioxin-like chemicals such as dioxin-like PCBs might increase risk for these diseases Some more recent animal studies described below suggest possible mechanisms of action for PCB effects that increase risk for CVD diabetes and hypertension Another relevant source of information is from research using the biological monitoring data from the National Health and Nutrition Examination Survey (NHANES) project to look at the association between PCB levels in blood and self-reported diabetes and heart disease The body of evidence based on analytical human epidemiological studies remains relatively sparse however

Some recent studies using NHANES biomonitoring data show associations between PCB levels and CVD (Lee 2006 Lee 2007 Ha 2007 Everett 2008) These studies are cross-sectional (comparisons at one point in time) evaluations of PCB levels measured in blood samples from individuals and self-reported CVD and diabetes a risk factor for CVD While these studies use individual-level biomonitoring and self-reported health outcome data they are considered to be

7

relatively weak in terms of providing evidence of causality because both exposure and outcome are measured at the same point in time Given the nature of NHANES data collection and reporting practices these studies are not able to control for possible geographical confounding which may occur if urban versus rural residence is associated with higher PCB levels and higher risk for CVD for example The higher PCB levels among African Americans in these data suggest urban residence may be playing a role in the PCB data

The results of animal studies suggest multiple mechanisms by which PCB exposures could affect cardiovascular risk as well as multiple types of adverse health outcomes One hypothesized mechanisms of action specifically for dioxin-like PCBs is that they may have anti-estrogenic effects which impact CVD risk factors such as serum cholesterol levels and blood pressure (Lind 2004) Several studies suggest PCBs interact with a specific receptor (AhR) and increase cellular oxidative stress causing inflammation cell injury and cardiovascular dysfunction (Hennig 2002) Similarly some studies suggest PCB-induced inflammation of fat cells leads to obesity and increased risk for coronary artery disease (atherosclerosis) (Arsenescu 2008)

3 FOLLOW-UP STUDY DATA SOURCES AND METHODS

This follow-up investigation will provide perspective on the 2004-2005 findings (Sergeev and Carpenter 2005 Shcherbatykh et al 2005) by producing results from analyses conducted at a more detailed level of geography the block group rather than ZIP code in order to better control for confounding When the term confounding is used by epidemioogists it refers to the problem that can arise when there is a factor such as income level that is not the factor of interest in the study but that is related to the risk factor being studied (residential location ndash proxy for exposure) and to the disease being studied (CVD) This type of factor is labeled a ldquoconfounderrdquo because it can confound ie interfere with or confuse the evaluation of associations among the variables of interest in the data All types of epidemiological studies face issues of confounding but ecological ie group-level analyses face additional challenges related to potential confounding because analyses can not account for some important individual differences within the groups that are the units of analysis (Morganstern 1998b Hertz-Picciotto 1998) This investigation was able to control for each individualrsquos age sex and race but not for individual-level risk factors such as income or education level

The follow-up investigationrsquos specific objectives are to

improve the ability to control for confounding by socio-economic status through the use of block groups rather than ZIP codes as the unit of analysis

compare the hospitalization rates among residents of block groups nearest the Hudson River to those among residents of block groups farther from the Hudson River

compare the findings of this follow-up study to those of the earlier studies

Study Area and Exposure Areas

The overall study area is comprised of the 12 counties with geographic areas abutting the Hudson River south of the source of PCB contamination from the General Electric facility in Hudson Falls The 12 counties are Westchester Rockland Putnam Orange Dutchess Ulster

8

Columbia Greene Rensselaer Albany Washington and Saratoga US Census boundaries are used to define the study area because census population estimates provide the data for the study area populationrsquos size and the populationrsquos age sex and race distribution Because the study period spanned 1990 through 2005 and populations change over time 1990 and 2000 Census data were used to linearly interpolate the between-year populations from 1991 through 1999 for each block group 2000 data were used without extrapolation to estimate block group populations from 2001 through 2005

Because block group boundaries change between the Census years 1990 block populations were normalized to fit 2000 block group boundaries before the estimation of sex age and race group specific populations In the digital map 2000 block group boundaries were overlaid on the 1990 block centroids to assign 1990 block population counts to the corresponding 2000 block group boundaries Then 1990 block populations were summed to fit the block group boundaries Next population counts by age group (0-14 15-24 25-34 35-44 45-54 55-64 65-74 75+) sex (male female) and race (white black asian other) were estimated for each year at the block group level This investigation includes all ages over 25 years and the study population is divided into four race categories white black or African-American Asian and ldquoOtherrdquo The Pacific Island group was combined with the Asian group and American Indian ldquootherrdquo and ldquomultiple racesrdquo categories are combined into a larger ldquootherrdquo category

Classification of the population into potential exposure categories for this study is based exclusively on location relative to the Hudson River Two types of exposure proxy categories were developed block group adjacency and block group centroid distance to the river shore For the adjacency categories block groups were assigned to the adjacent category if they directly abutted the Hudson River and all other block groups in the 12 counties were assigned to the nonshyadjacent category

For the distance categories a central point (centroid) within each block group was identified and used to assign block group centroid distance to the river shore The block group centroids were population-weighted which means the geographic distribution of households within 2000 census blocks were used to locate block group centroids based on the geographic population distribution By locating a population-weighted block centroid rather than a more straightforward distance centroid the assignment of block group distance to the river shore more accurately reflects distance to the river for the majority of block group residents Buffers of one-half mile and one mile from the Hudson River boundary were created for the assignment of each block group into one of the three distance categories less than frac12 mile (closest) frac12 to one mile (close) and farther than one mile from the river (distant)

Distance categories were selected a priori at the start of the study Because environmental sampling data were not collected along the length of the river included in this investigation we are not able to determine whether these distance categories represent actual differences in exposures As discussed previously a statistically significant reduction in the total PCB concentrations in outdoor air samples was observed at a distance of 1200 meters (34 mile) from the river from 0760 ngm3 to 0497 ngm3 in a study conducted in the Hudson Falls Fort Edward area near the source of PCBs (Fitzgerald 2007 Palmer 2007)

9

For the adjacency categories of the 2371 block groups in the study area 255 (11) were classified as adjacent to the Hudson River and 2116 (89) were classified as non-adjacent (Figure 2) For the distance categories 225 (9) were classified as closest to the river 216 (9) as close and 1930 block groups (81) as distant from the river

Comparison of Figures 1 and 2 show that block groups include smaller geographic areas than ZIP codes and that block groups are relatively smaller in area in urban versus rural areas The study area includes a total of 368 ZIP codes compared to a total of 2371 block groups ZIP code average population in the study area is approximately 8000 compared to block group average population of 1200 Comparison of Figures 2 and 3 shows that using population-weighted centroid distance to the river produces different exposure areas than adjacency Fifty-three (20) of the adjacent block groups were categorized for the distance indicator as distant because the population in the block group was concentrated in the area farther than one mile from the river On the other hand about 11 of the 2116 non-adjacent block groups were classified as closest (77) or close (162)

Hospitalization Data

Hospitalizations for ischemic heart disease (IHD) and cerebrovascular disease (stroke) were included in this study More specifically hospital discharge records that listed as the principal diagnosis either IHD (ICD-9-CM 410-414 and 4297) or cerebrovascular disease (ICD-9-CM 430-436) and subsets of each of these categories similar to the subsets evaluated in the Sergeev et al and Shcherbatykh studies were included The subsets are AMI and angina subsets of IHD and ischemic and hemorrhagic stroke which are subsets of cerebrovascular disease (stroke) (Principal diagnosis represents the illness for which the person was admitted to the hospital) Angina is an outcome not evaluated in the prior studies It was added here for comparative purposes because it represents an outcome appropriately addressed by primary outpatient rather than inpatient hospital care Angina is one of a group of outcomes included in the recently developed national ldquoPrevention Quality Indicatorsrdquo (USDHHS AHRQ 2007) as an indicator for geographic areas or populations for whom primary care is inadequate Table 3 lists the specific ICD-9-CM codes and definitions included in this investigation

While one of the previously published studies (Huang et al 2006) showed elevations in hypertension (high blood pressure) rates this outcome is rarely listed as the principal reason for hospitalization As a secondary diagnosis however hypertension is extremely common with substantial overlap with the primary diagnoses of IHD and stroke In addition hypertension hospitalizations were not as elevated in the ZIP codes adjacent to the Hudson River as the IHD and stroke outcomes As a practical matter geocoding the hypertension hospitalization data (1364000 hospitalizations) would overwhelm the resources for this project For these reasons hypertension was not included in this follow-up investigation

The source of the hospitalization data was the NYS DOH Statewide Planning and Research Cooperative System (SPARCS) established in 1979 to collect detailed records on discharges from hospitals located in New York State Persons seen in the emergency department but not admitted are not included in the SPARCS data Hospitalizations at US Veterans Administration

10

Hospitals are also not included in this dataset We identified the hospital transfers in the data and excluded them so that individuals are only counted once for the same event

The NYS DOH Data Protection Review Board for SPARCS hospitalization data approved our use of hospitalization records for this investigation and provided approximately 460000 records with a principal diagnosis of either IHD or stroke Because of the large number of cardiovascular hospitalizations and this studyrsquos requirement for geocoding each hospitalized personrsquos address we selected a 50 random sample of the hospital discharges with principal diagnoses of IHD or stroke for inclusion in the study (229948 records) Approximately one percent of these records (2737) were eliminated from the study because they contained no residential address resulting in 227211 hospitalization records 24939 of these records were eliminated after geocoding because they were not within the 12 county study area This left 202272 records remaining in the study

For geocoding to the block group level the hospitalization street addresses were modified as needed using US Postal Service standards The addresses were assigned geographic coordinates using commercially available geocoding software (MapMarker 2004) A combination of land parcel data and street centerline files were used to assign geographic coordinates No contact was made with cases parents legal guardians or next of kin of cases to determine residential locations

Once geographic coordinates were assigned to cases through address-matching the case locations were overlaid onto digital maps of the study area using a geographic information system so that the number of observed cases falling within the study area boundaries could be determined and a 2000 Census block group could be assigned to each hospitalization In order to protect confidentiality no maps of individual case locations are provided Geographic coordinates were not able to be assigned for some addresses for example rural routes or post office boxes Internet searches were conducted for addresses that were located in facilities such as nursing homes and trailer parks

While fewer than one percent of records were missing sex or age information approximately 15000 records seven percent were missing race information Evaluation of the geographic and temporal distribution of these records showed they were not randomly geographically distributed 47 of the records with no race information were from one hospital for that hospital race was missing from 100 of the records in all years except one after 1997 mid-way through the study time-frame Evaluation of other records missing race showed similar patterns of 100 of records missing race in specific years In order to prevent biased hospitalization rate estimates records missing race were assigned race randomly assuming a race distribution equal to that of the individualrsquos residential block group

Statistical Analysis

This investigation used negative binomial multivariable regression models the statistical methods used in the previous studies (Sergeev and Carpenter 2005 [IHD] Shcherbatykh et al 2005 [stroke]) The follow-up studyrsquos analyses compared hospitalization rates for populations in block groups adjacent or in close proximity to the Hudson River to rates for populations more

11

distant from the Hudson River while adjusting for the potential influence of other factors Sex age and race-specific hospitalization count data comprise the dependent variable and the statistical methods attempt to predict the counts using the sex age and race distributions from Census data as well as block group-level demographics such as median income Models were first estimated using basic Census information for age sex and race Then more fully adjusted models were estimated using information at the block group level for income education population density Hispanic ethnicity and distance to the nearest hospital

More specifically each hospitalized personrsquos block group was used to assign quartile categories of block group median household income of population with less than a college education population density of the population identified as Hispanic and distance from the block group centroid to the nearest hospital These variables were selected in advance The population density variable was selected to account for potential effects on hospitalization rates of large differences in population density from the relatively rural northern counties to the more urban southern counties in the study area of the population identified as Hispanic was included because Hispanic ethnicity may be associated with CVD risk but Hispanic ethnicity was not available in the hospitalization records Distance to the hospital was included because studies show that shorter distances increase the likelihood that individuals with AMI and stroke survive to be admitted to the hospital

As described previously the literature about risk factors for CVD shows strong associations between lower income and education and higher CVD risk Regarding the effects of income and education on hospitalization rates the literature from analytic studies specifically of hospitalizations for CVD is quite sparse Most analytic studies focus on prevalence incidence or mortality in population-based studies or cohort studies and there are not many of these An example of a study that was useful as background is a study of income level and CVD mortality that showed strong associations of lower income and higher CVD mortality for men and women with the relationship for men being linear throughout the range of income levels while for women the relationship was only strong at levels of income below the median (Rehkopf 2008)

This follow-up studyrsquos approach was to use the same set of variables for the more fully adjusted models using quartile cut points for all the sexrace groups and types of CVD diagnostic subsets The 2000 Census data were used to create the population-weighted quartile categories to be used in the statistical analyses The quartile cut points are shown below

population density(persons per square mile)

median 1999 household income

with less than a college education

with Hispanic ethnicity

Hospital distance

Quartile 1 61 - lt507 $2499-lt$40968 0 - lt31 0 - lt2 lt25 (median) Quartile 2 507 - lt102 $40968 - lt54650 31 - lt43 2 - lt5 25 - lt6 (50-75) Quartile 3 102 - lt6410 $54650 - lt72944 43 - lt55 5 - lt10 6 ndash 11 (75-90) Quartile 4 6410 - 94372 $72944 - 200001 55 - 100 10 - 86 11 ndash 45 (90-100)

SAS version 9 provided the statistical programs used to estimate hospitalization rates for exposure proxy categories and demographic categories The models provide ldquocontrast

12

estimatesrdquo ie rate ratios for one category versus a reference category of a variable and 95 confidence intervals (CI) for these rate ratios For the location categories rate ratios for the areas adjacent are compared to areas not adjacent to the river (reference) and areas closest and close to the river are compared to areas distant from the river (reference) If a rate ratio is greater than 100 the model has estimated an elevation of hospitalizations in that category of the study population compared to the reference population If the rate ratio is less than 100 then there are fewer than expected hospitalizations compared to the reference population

The magnitude of the elevation or deficit is also estimated by the rate ratio For instance if the hospitalization rate is twice as high in the adjacent population it would result in a rate ratio of 200 while a 50 higher rate would result in a rate ratio of 150 If the hospitalization rate in the comparison group was only half as big as in the reference group this would result in a rate ratio of 050 The 95 confidence interval assists with determining whether the observed differences are statistically meaningful ie statistically significant The confidence interval provides the range in which there is a 95 probability of including the true rate ratio

In addition to directly assess potential confounding we examined the associations between income and exposure proxy categories and between income and CVD hospitalization risk Because the modeling process provides rate ratios that provide comparisons among groups but not the rates themselves we also calculated the rates per population for each sex race and income category in order to directly observe the relative magnitudes of CVD hospitalization rates Finally analyses stratified by income quartile were also conducted in order to assess the potential role of confounding by income The stratified analyses consist of negative binomial regressions conducted separately for each quartile of income

4 RESULTS

Demographics Associations with Proximity to the River

Study Population Table 4 provides detailed demographic characteristics for the total study population and the populations of the adjacent versus non-adjacent and nearer versus farther from the river block groups from the 2000 Census The total population of the study area for this one year of the study time period is 2924631 persons The first row of Table 4 shows that approximately 11 percent of the study area population is classified as living in block groups adjacent to the Hudson River and 89 live in non-adjacent block groups Approximately 8 of the studyrsquos population lives in the closest block groups approximately 10 lives in close block groups and approximately 82 in distant block groups The table also shows that approximately 81 of the total study population is identified as white 9 black 3 Asian less than 01 Pacific Islander 02 American Indian 4 ldquootherrdquo and 2 ldquomultiple racesrdquo

The 2000 Census data (Table 4) also show that the populations in block groups adjacent and nonshyadjacent to the river are similar in terms of the percent of minority population but the adjacent block groups include a smaller percentage of persons who have attended college and a higher percentage of households below the poverty level Median income is $47320 for block groups adjacent versus $62668 for block groups not adjacent to the river Data for the study area from the 1990 Census show similar distributions (data not shown)

13

For the distance to the river categories there is a gradient for the three levels of distance from the river with the closest block groups on average having higher percentages of minorities smaller percentages of persons who have attended college and higher percentages of households below the poverty level (Table 4) The populations of the close and closest block groups are more similar to each other than to the populations of the distant block groups The population of the closest block groups is estimated to be 19 black close block groups 14 black and distant block groups 8 black The proportions for the ldquootherrdquo category used in the analyses which includes multiple races and American Indians are approximately 12 in both the closest block groups and in the close block groups compared to 5 in the block groups most distant from the river Median income for the closest block groups is $40748 close block groups $44962 and most distant block groups $64847

The bar chart below illustrates the correlations between US Census 2000 indicators of block group socio-economic status and block group centroid (population weighted) distance from the river for the study population

2000 Census Demographics for Study Population by Distance to River Category

black other hispanic no college below poverty

of pop ulati on

0

10

20

30

40

50

60

lt12 mile

12 to 1 mile

gt1 mile

Table 5 shows selected demographic information for the person-year estimates used in the study analyses for the 1990-2005 study period Person-year data represent individuals from Census estimates counted repeatedly for each year of the study timeframe As described in the Methods section the estimation of person years used interpolation to account for population changes from 1990-2000 and techniques for taking account of block boundary changes The person-years estimates for demographic categories (complete data not shown) show similar patterns as the 1990 Census data (data not shown) and 2000 Census data (Table 5) Table 5 for example shows nearly identical percentages of population in each of the river distance categories as Table 4

Table 5 also shows person year information for income quartiles and exposure proxy categories as well as race and exposure proxy categories that is consistent with the Table 4 data for the

14

2000 population Table 5 row percentages show for example that while approximately 11 of the total study population live in adjacent block groups approximately 19 of the population in the lowest income quartile live in adjacent block groups By design approximately 25 of the total study population is in each of the income quartiles Table 5 column percentages show however that in the adjacent block groups 41 of the population is in the lowest income quartile and only 9 are in the highest income quartile

For the block groups within frac12 mile of the river the distribution among income quartiles is even more skewed towards the lowest income quartile 61 of the population living within frac12 mile of the river is in the lowest income quartile and only 6 is in the highest income quartile Given that approximately 25 of the total study population is within each income quartile these percentages indicate that living within the block groups closest to the river is associated with more than a doubling of the likelihood of being in a block group in the lowest socio-economic status quartile 436 of those living in the block groups close to the river (12 to one mile from the river) are in block groups in the lowest income quartile While less than a doubling this indicates that living in the close block groups is associated with a 75 increase in the likelihood of being in the lowest quartile for socio-economic status

Hospitalizations Table 6 shows the process of removing hospitalization records from the study for a variety of reasons The first column includes the 202272 records that contained a residential address and were located within the 12-county study area as described in the Methods section The next five columns show the records remaining after data cleaning exclusion of transfers elimination of records with missing covariates and exclusions due to poor geocoding results Approximately 33000 records were eliminated because they were transfers from one hospital to another for the same event For approximately 1100 records geocoding was inadequate for assigning block group for imputing race as described in the Methods section 336 records were missing sex (11) or age (325) Of the 167733 records remaining 154220 (92) were able to be adequately geocoded to a street address for assignment in a specific block group While approximately eight percent of hospitalizations were eliminated due to inadequate geocoding Table 6 shows that the percentages of hospitalization records for white black Asian and ldquootherrdquo categories and the distribution among diagnostic categories change very little after these records were dropped

The final column of Table 6 shows that of the 154220 hospitalizations in the analysis 101012 IHD hospitalizations comprise the largest health outcome category and are 65 of the hospitalizations in the study AMI hospitalizations (35811) account for 35 of the IHD hospitalizations and angina (4731) accounts for five percent of the IHD hospitalizations Stroke hospitalizations (53208) comprise 34 of the studyrsquos hospitalizations Approximately 36 of the stroke hospitalizations are for ischemic stroke (19371) and approximately 11 are for hemorrhagic stroke (5995)

Table 7 shows the distributions of hospitalizations among the four income quartiles While each income quartile (by definition) contains approximately 25 of the study population for all types of hospitalizations we see that the lowest income quartile includes more than 25 of hospitalizations from a minimum of 286 (IHD) to a maximum of 356 of hospitalizations (angina) The highest income quartile includes fewer than 25 of hospitalizations for each CVD

15

category ranging from 182 for angina to 224 for hemorrhagic stroke These deviations from the 25 quartile distribution show a strong linear pattern with the largest percentage of hospitalizations coming from residents of block groups in the lowest income quartile and the smallest percentages coming from residents of the most wealthy block groups For AMI the lowest income quartilersquos share of hospitalizations is 50 higher than the highest income quartilersquos share of hospitalizations (30 vs 20) IHD and stroke show similar differences and angina shows a doubling with the lowest income quartile contributing 36 of hospitalizations compared to the highest income quartilersquos 18 share of hospitalizations

Table 7 also shows the distribution of hospitalizations among the race categories and the study population race category distribution for comparison Blacks show lower percentages of IHD and AMI than their population percentages and higher percentages of angina and stroke While full presentation of race and sex findings is beyond the scope of this report the lower than expected percentages of IHD and AMI among blacks who on average have lower socioshyeconomic status and are at greater risk for CVD hospitalizations raised questions that were addressed by calculating hospitalization rates per 1000 population for each sex and race group for each income quartile (Appendix B Table 2) Figures 4 5 and 6 show these rates for the white black and Asian categories (These rates for the ldquootherrdquo category are comparatively very high and are not shown on these graphs This issue is addressed in the Discussion section) Figures 4 5 and 6 show the consistent and strong effects of income quartile on hospitalization rates for AMI and stroke for whites blacks and Asians of both sexes The other outcomes show similar patterns (The ldquootherrdquo group showed an opposite pattern [See Discussion Section])

Demographics Summary

The strong associations of lower income with residence closer to the river (Table 5) and lower income with higher hospitalization rates (Tables 7 amp 8) create a difficult challenge for this investigation The multivariable analyses of hospitalization rates attempt to control for socioshyeconomic differences However because these income differences which strongly affect hospitalization rates are also strongly correlated with the gradations of exposure (river distance) it will be difficult to disentangle the influence of socio-economic status from the influence of potential exposures

Statistical Analyses Proximity to the River and CVD Hospitalization Risk

All the variables described above were included in the multivariable regression models and almost every quartile category for every variable showed statistically significant and expected types of effects for all the CVD hospitalization categories (Appendix B Tables 1a and 1b) The following presentation of results focuses on the rate ratios for the exposure proxy variables Table 8a (IHD) and 8b (stroke) show the regression modeling results Results are shown first for models that include only age sex and race in addition to the exposure proxy categories and second for models that also include median income education population density Hispanic ethnicity and distance to nearest hospital The tables provide numbers of hospitalizations and person-years to assist with interpretation For example for total IHD we can see that while 108 of the study population lives in adjacent block groups 122 of the hospitalizations are among people in adjacent block groups This difference of 14 percentage points is the basis for

16

the finding of a statistically significantly elevated rate ratio (risk) for IHD among people living in the adjacent block groups (RR=133 CI 112-157)

Looking across the columns of Tables 8a and 8b we can see the rate ratios for each of the outcomes for the adjacent versus non-adjacent block groups and the results are similar to those for IHD The elevated rate ratios range from 123 for angina to 136 for AMI The only rate ratio not elevated in these first models adjusting only for age sex and race is for hemorrhagic stroke Going down the rows of the Table the next set of rate ratios are for the distance to the river categories The findings are very similar to those for the adjacent block groups Here however we have two exposure proxy categories allowing comparison of the two These rate ratios show no evidence that living in the block groups closest to the river (lt12 mile to the river) produces greater risk than living in the block groups close to the river (12 to 1 mile from the river)

The next set of rate ratios are for models with additional adjustment for the demographics of each hospitalized personrsquos block group as well as distance to the nearest hospital As expected adjustment for block group median income and other demographics accounts for some of the variability in hospitalization counts and thereby reduces the magnitude of the rate ratios estimated for the river adjacency and distance categories The rate ratio elevation for total IHD for example is reduced from 133 to 107 (RR=107 CI 104-111) The elevated rate ratios in these more fully adjusted models for the adjacency analyses are of similarly small magnitude ranging from 106 for AMI to 110 for ischemic stroke Four of the six outcomes show statistically significant elevations Again only hemorrhagic stroke shows no evidence of an elevated rate ratio

For the river distance categories none of the rate ratios show any evidence that living in the block groups closest to the river (lt12 mile to the river) produces greater risk than living in the block groups close to the river (12 to 1 mile) Of the six types of outcomes only two show statistically significantly elevated rate ratios for living in the closest block groups (total stroke and ischemic stroke) For the close block groups four out of six outcomes show statistically significantly elevated rate ratios The statistically significantly elevated rate ratios vary from 106 (AMI stroke) to 113 (angina)

To address the strong evidence described earlier for confounding in these data ie the association of lower income with both CVD hospitalizations and residence closer to the river we also conducted the analyses separately for each quartile of income These results are presented in Table 9 The first results are from analyses adjusting only for age sex and race and the second set of results are from models including the additional demographic factors For simplicity rate ratios are shown only for block groups closest to the river compared as in the other models to block groups farther than one mile from the river (Results were similar for the other distance categories (data not shown)

For the two lowest income quartiles Table 9 shows results similar to those for the study population as a whole (Table 8a 8b) However for the two highest income quartiles which represent frac12 of the study population Table 9 shows a consistent pattern of reduced rate ratios for residence in the closest block groups within frac12 mile of the river In the highest income quartile

17

these reduced rate ratios are statistically significant for almost all of the outcomes in both the partially and more fully adjusted models In addition the magnitude of the rate ratio estimates ie their departure from 100 is relatively high For the population living in block groups with the highest median income (income gt$73000) living within frac12 mile of the river is associated with having about 70 of the risk for IHD AMI and stroke (CI=068 069 069) as the population in equally high income block groups farther than one mile from the river In other words among the highest income quartile living this close to the river is associated with an estimated 30 reduction in risk for these types of hospitalizations For angina (CI=028) there is an estimated 70 reduction in risk and for ischemic stroke (CI=058) the reduction is estimated to be 40 These stratified analyses show that in one-half of the study population the two higher income quartiles there is no evidence of elevated hospitalization risk for populations in block groups closer to the river On the contrary reduced rate ratios are shown for all outcomes in the higher income quartiles for residents of block groups within frac12 mile of the river compared to those living farther than one mile from the river

Weighted average rate ratio estimates are a standard method for estimating a general population finding from stratified analyses and are also provided in Table 9 These average rate ratios range from 078 to 102 showing no evidence of elevated risk for the overall population associated with residence within frac12 mile of the river compared to living farther than one mile from the river Results for the other categories of river proximity showed similar results (data not shown) (Given the evidence of confounding interfering with analyses of the entire study population and the opposing findings depending on income quartile no confidence intervals are provided here for the overall population-weighted average estimates)

Summary of Results

In summary the multivariable regression results for the entire study population (Tables 8a 8b) show statistically significantly elevated rate ratio estimates for the adjacent versus non-adjacent block groups for four out of six hospitalization diagnosis categories The analyses also produce statistically significantly elevated rate ratio estimates for block groups closest to the river (less than frac12 mile from the river) and close to the river (between 12 to 1 mile from the river) compared to distant block groups (farther than 1 mile from the river) for most of the outcome categories However the findings provide no evidence that the closest block groups have higher hospitalization rates than the close block groups

The most elevated rate ratio estimate is 113 (angina) indicating an estimated 13 percent increase in risk for angina hospitalizations for residents of close versus distant block groups The only outcome with no statistically significant elevations for living near (adjacent closest or close) the river in the more fully adjusted models is hemorrhagic stroke It is noteworthy that the outcome that is considered an indicator of inadequate primary care rather than of incidence or prevalence angina hospitalizations shows the highest elevations associated with residence near the river (Table 8a) In addition the hospitalization outcome with the least association with socioshyeconomic status in our data (Appendix B Table 1b) and the lowest out-of-hospital mortality (NYS DOH2009b) hemorrhagic stroke is the outcome that shows no pattern of elevations associated with residence near the river (Table 8b) This pattern of findings is consistent with the interpretation that the rate ratios for living closer to the river are reflecting the influence of

18

lower socio-economic status and other known risk factors for hospitalizations for CVD rather than any unusual environmental exposures in areas near the river

The stratified results add strong support to this conclusion They suggest that classical confounding of the exposure-disease relationship by socio-economic status produced biased results in the overall multivariable regression analyses While the regression analysis of the entire study population shows elevated hospitalization rates for residents of block groups near the river compared to residents living farther away (Table 8a and 8b) the stratified analyses (Table 9) show this association applies only to the 50 of residents living in block groups with lower median incomes For the other 50 of residents those who live in block groups with higher median incomes hospitalization rate ratios are reduced for those living near the river Classical confounding refers to the type of confounding that can occur in individual-level or group-level (ecological) studies when a factor such as socio-economic status affects both the exposure (residential location) and the health effect In this study of CVD hospitalizations there are disproportionate numbers of outcomes and exposures (residence near the river) among specific levels of the confounder the lower income quartiles (Tables 4 5 7) The disproportionate numbers of hospitalizations and potential exposures (block groups close to the river) among the lower income quartiles create an artifactual exaggerated ie biased effect on the estimates for the overall analysis (Greenland 1999 Greenland 2003)

5 DISCUSSION

The key finding of this investigation is that the results of the multivariable regression modeling for the entire study population the elevated CVD hospitalization rates for populations living closer to the Hudson River are likely to be artifacts of classical confounding Stratifying on the confounding variable ie holding median income level constant the standard method for controlling for confounding showed the decisive role of income as a confounding variable The associations between living closer to the river and having higher hospitalization rates were evident only for people in the two lowest income quartiles For people in the two highest income quartiles the association was completely opposite ie people living closer to the river had lower rates of hospitalizations This variability of effects can be interpreted as evidence that there are no actual effects of the exposure-proxy categories on hospitalization rates

This type of classical confounding due to the strong influence of a variable that precedes causes or strongly affects both the exposure and the health outcome can occur in any type of epidemiological study The ecologic ie group-level analysis in this study is not the primary cause of the confounding Rather the complex associations in the population among income education CVD risk geographic residential patterns and geographic assignment of exposure status are the basis for the confounding However the studyrsquos group-level design contributes to the studyrsquos inability to completely control for confounding using multivariable regression methods and contributes to difficulties of interpreting the study findings

The stratified results shown in Table 9 may also be a reflection of inadequate control for socioshyeconomic status using quartile groupings A possible explanation for the Table 9 findings is that among lower income quartile populations people living in closest proximity to the river may have the lowest incomes while in higher income quartile populations people living in closest

19

proximity to the river may have the highest incomes This potential income difference associated with river distance within quartile groupings would explain why the stratified analyses showed positive associations with hospitalization risk for lower income residents closest to the river and negative associations with hospital risk for higher income residents closest to the river In this example of a possible explanation it is assumed that river proximity categories are capturing relative income (and hospitalization risk) differences rather than exposure Because the exposure proxy river distance categories are assigned at the group level this type of analysis is not able to distinguish whether these categories are capturing income rather than potential exposure in the analyses

When trying to interpret this studyrsquos findings it is important to keep in mind the many factors associated with CVD risk that are not controlled for in this studyrsquos analyses These include tobacco use high blood pressure high serum cholesterol alcohol use obesity low fruit and vegetable intake physical inactivity diabetes mental ill-health psychosocial stress and the use of certain medications Another factor that is not known to be associated with CVD risk but that is relevant for PCB exposure is consumption of contaminated fish It is possible that lower income populations are more exposed to PCBs because they are more likely to eat fish from the Hudson River This however would not help explain why higher income people near the river showed lowered CVD hospitalization risk than their counterparts living farther from the river

In addition to the confounding due to socio-economic statusrsquos effects on both the risk of CVD and residential location (the exposure proxy) there is the possibility that another pattern of effects also associated with socio-economic status called effect modification plays a role in these data Effect modification occurs when different groups people with differing types of health insurance or differing behaviors for example show different patterns or levels of health effects associated with the same exposures The differing effects (CVD hospitalization risks) of residence in block groups closest to the river by income quartile could be interpreted as evidence of effect modification (Table 9) However the magnitude and the change in direction of effects is unlikely to be due solely to differing biological or behavioral factors associated with income level that affect the results of exposures from living near the river Given the strong associations of socio-economic status and CVD hospitalization risk it is more likely that the analyses stratified by income enhance protection against bias compared to the multivariable models and that the stratified analyses represent findings from better control for confounding as opposed to effect modification

Effect modification is a concept used to describe important relationships between exposures and effects (that may differ among sub-groups for example) that the researcher wants to identify and understand Confounding on the other hand refers to associations among the variables that the researcher is trying to control so that the effects of interest (effects from exposures rather than age or sex for example) can be identified and the strength of the effects can be estimated The ecological design of this study contributes to the difficulty of interpreting whether the study shows evidence of effect modification as opposed to confounding (Morgenstern 1998b Hertz-Picciotto 1998)

Another limitation associated with this studyrsquos design is that block-group level socio-economic and environmental attributes that are assigned to individuals in this type of study can be more

20

strongly correlated at the group level than at the individual level In this way the ecological assignment of variables may lead to increasing problems of confounding because of strong associations among indicators of socio-economic status and environmental exposure categories (Morganstern 1998a) This problem is likely to have affected this studyrsquos results As with effect modification by group described above this type of confounding by group cannot be evaluated with ecologic data because individual-level information is needed to check for this type of ldquocross-levelrdquo bias

Ecological analyses are an important tool for environmental epidemiology and there is therefore a large and growing literature on their strengths and limitations and methods to address limitations (Piantadosi 1988 Greenland 1989 Greenland 1992 Greenland 1994 Richardson 2000) One particular concern is the interpretation of ecological study findings showing modest elevations of risk such as in the current study with rate ratios in the more fully adjusted models generally estimated to be below 110 (Wakefield 2003 Morgenstern 1998) Wakefield states that slight elevations need to be interpreted with great caution

in particular for cancer and heart disease (because) there are risk factors that are far more predictive of disease than environmental factors hellip Consequently the potential for confounding is strong since ecological studies do not directly use individual-level risk factor data although stratification by age and gender is routinely carried out (Frequently an area-level measure of socio-economic status is also used as a confounding variable but being an aggregate summary it only provides very crude control) (Wakefield 2003 p 9-10)

A strength of this study is that it used Census block groups which are relatively small average population of 1200 in the study area to assign socio-economic characteristics rather than the larger ZIP codes with average population size 8000 in this area By using block groups this study sought to reduce confounding by socio-economic status because the assignment of median income and other variables at the block group level was expected to be more accurate with block group populations expected to be more homogenous than larger and more geographically diffuse ZIP code populations used in the prior studies

The use of census block groups rather than ZIP codes likely enhanced the accuracy of the group-level category assignments However this type of design that assigns characteristics to hospitalizations using group-level characteristics and distance to the river is limited in comparison to individual-level designs that gather such information from personal interviews for example Most importantly for this investigation the group-level assignment of median income and other demographic variables to each hospitalization limited the studyrsquos ability to distinguish among individuals within the exposure proxy categories and thereby reduced the studyrsquos ability to control for confounding

This type of measurement error or inaccuracy from assigning group-level potential exposure income or educational level to the individual hospitalizations is generally assumed to bias a study towards findings of no associations or no effects But in situations where the measurement error is more severe for some variables than others and these variables are correlated then the study findings can be biased either towards exaggerated effects or no effect In this

21

investigation potential measurement error associated with the assignment of exposure categories based on block group centroid distance to the river is expected to be relatively low because a large proportion of the hospitalization addresses are very clearly in areas distant from the river The measurement error associated with the assignment of group-level demographic characteristics (income and education for example) from the Census is potentially relatively higher because this type of error can occur throughout every block group in the study

Another strength of this study is the use of NYS SPARCS data a relatively comprehensive population-based source of information about hospitalizations These data include individual-level information used in this study for patient address sex age race and diagnosis Of these variables only the address and race categories were missing information for substantial numbers of records and this limitation could have contributed to imprecision sufficient to create bias in the findings These issues are discussed in greater detail below

While geocoding to block groups represents a strength of this study this procedure required high quality address information One percent of the study arearsquos hospitalization records contained no address and eight percent of the addresses could not be accurately geocoded to a block group resulting in nine percent of hospitalizations being eliminated from the study because of address andor geocoding issues The distributions of race categories and distribution among the types of diagnoses did not change after dropping records with no or insufficient address information suggesting that the dropped records did not represent a substantially skewed group of records

Inaccuracy in the studyrsquos assignments of race categories results from limitations associated with both the hospitalization and Census data The SPARCS hospitalization database did not include information on race for seven percent of the studyrsquos hospitalizations We imputed race by assigning race category randomly based on the race distribution of the personrsquos block group This method would have biased the study toward a finding of no effect of race on hospitalization risk

Another limitation of the SPARCS data used in this investigation is that repeat hospitalizations for the same individual were not able to be identified If an individual was admitted to the hospital many times over the course of the study period then that individual would be counted multiple times This limitation may have contributed to an exaggerated association of lower socio-economic status and CVD hospitalizations because some people with lower incomes receive less adequate outpatient care and may have more repeat admissions Because of the association of lower income with residence near the river this limitation may have contributed to the difficulty of adequately controlling for confounding by socio-economic status in this investigation

While the use of Census data at the block group level rather than at the ZIP code level is a strength in terms of relative homogeneity among the population within block groups it is possible that block group population estimation errors contributed to imprecise estimation of hospitalization rates Particularly with regard to the race categories the study finding that the ldquootherrdquo category showed a counter-trend of higher income and higher CVD hospitalization rates (Appendix B Table 2) is evidence that differences between the categories used in the Census race designations and hospitalization race assignments affected the estimated hospitalization

22

rates The evolving choices for identifying onersquos race in the Census are not concurrently reflected in the SPARCS data and these inconsistencies likely contributed to imprecision in the estimates of CVD hospitalization risk associated with specific race categories These issues did not contribute to bias for the overall findings however We modeled the data with a variety of race category groupings combining black and ldquootherrdquo for example and the overall results did not change (data not shown)

Also regarding the Census undercounting of approximately two percent for the US population in 2000 was estimated to be primarily among the lowest socio-economic groups and minorities while over-counting of one percent was estimated to have occurred among the most wealthy who sometimes received and completed forms at multiple residences (United States Censusrsquo Monitoring Board 2001) The undercounting of the lower income population would result in over-estimation of rates in this population Undercounting of recent Asian immigrants for example particularly those of lower socio-economic status is a potential reason for the relatively high rates of CVD hospitalizations estimated for lower income quartiles of the Asian population (Figures 4-6) Models that grouped the Asian population with other race categories were also estimated and these showed the same results as the models reported here (data not shown) Undercounting of lower income groups in general would have contributed to the finding of higher hospitalization rates in these groups and added to the difficulty of adequately controlling for confounding of lower income with residence near the river

This study used a 50 percent sample of the CVD hospitalizations in the study area This was accomplished by using every other record provided in the complete dataset This would be expected to provide a non-biased random sample As a general check for data provision management or analysis errors we compared our estimates of hospitalization rates per population with those from other sources Table 10 shows these comparisons and shows that our estimates are generally consistent with rates from other sources

Actual exposures were not measured in this investigation Rather proximity to the river was used as a proxy for potential exposures associated with the river In addition other potential exposures that may impact CVD were not evaluated or measured Exposures to lead and air pollution would be expected to be higher in more urban areas and areas near heavy traffic which may tend to coincide with proximity to the river These issues were not addressed in this investigation If these exposures were occurring more often near the river and contributing to cardiovascular hospitalization rates this would contribute to findings of elevations in areas closer to the river

It is important to keep in mind the limitations associated with using CVD hospitalizations when incidence andor prevalence of CVD are the actual outcomes of interest As described previously mortality prior to reaching the hospital may reduce hospitalization rates particularly for AMI and stroke for some groups or areas more than others and introduce bias if hospitalizations are being interpreted as a surrogate for incidence or prevalence

23

Addressing Study Objectives

This follow-up report has addressed the studyrsquos first two objectives to increase control for confounding by socio-economic status through the use of block groups rather than ZIP codes as the unit of analysis and to compare the hospitalization rates among residents of block groups nearest the Hudson River to those among residents of block groups farther from the Hudson River Regarding the first objective the use of smaller units of analysis block groups rather than ZIP codes improves the accuracy of the assignment of income and other group-level demographic categories to the hospitalizations

Regarding the studyrsquos second objective to compare hospitalization rates among residents of block groups near to the river to hospitalization rates among residents far from the river this studyrsquos multivariable analyses showed relatively small but statistically significant elevations of CVD hospitalization risk for residents of block groups in close proximity to the Hudson River Stratified analyses that controlled for confounding directly by conducting analyses within each income quartile showed that residence in block groups near the Hudson River was associated with increased risk for CVD hospitalizations among residents of lower income block groups and conversely residence in block groups near the Hudson River was associated with decreased risk for CVD hospitalizations among residents of higher income block groups The estimated increased and decreased risks were of sufficient magnitude to nearly cancel each other out when average rate ratios for the four income quartiles were calculated The average rate ratios may be interpreted as showing no evidence that CVD hospitalization rates are elevated for residents living near the river for the general study population This finding suggests that the multivariable regression findings for the overall population show elevated risks for hospitalizations among residents of block groups near the river due to confounding of the relationship of residential location and CVD risk by socio-economic status

The findings from analyses stratified by income quartile may also be interpreted as showing that residing in a higher versus lower income quartile block group produces different effects from similar exposures (effect modification) As discussed previously the ecological ie group-level analyses are unable to directly evaluate confounding and effect modification Rather information about the strengths and direction of associations among variables in the analyses need to be evaluated in light of known risk factors to draw conclusions about likely explanations and plausible interpretations In this case uncontrolled confounding by socio-economic status appears to be the most likely explanation for the findings This is due to the strong association between residential proximity to the river and socio-economic status

The studyrsquos third objective was to compare the findings of this review to those of the earlier studies that prompted this investigation Because of the similarities in study design and statistical analysis the prior studyrsquos findings can be compared with the findings of the multivariate regression analyses in the current study using Table 1 and Tables 8a and 8b The major difference between the studies is the composition of the comparison areas for the analyses Because of this difference as well as other differences (Table 3) the studies are not entirely comparable

24

Table 1 shows some findings from the prior studies Comparison with Table 8a and 8b from this study shows that the CV hospitalization rate ratios estimated for living in block groups adjacent or near the river in the current study are substantially lower than the rate ratios for residence in ZIP codes adjacent to the Hudson River in the previously published studies In most cases the current analyses show elevations below 10 in contrast to elevations of 20 to 39 in the prior studies The results of the current study are quite similar to the prior study results in terms of the associations reported for age sex race and income (Appendix B Tables 1a and 1b) (Sergeev and Carpenter 2005 Tables 4 amp 5 Shcherbatykh et al 2005 Table 3) These similarities and the general methodological similarities between the current and prior studies suggest the likelihood that the prior studiesrsquo findings also were affected by confounding by socio-economic status

6 CONCLUSION AND RECOMMENDATIONS

While this type of study can neither prove nor disprove causal links among risk factors and health effects it is useful for examining evidence for unusual patterns that might warrant additional investigation This evaluation of hospitalization data residential location and socioshyeconomic factors in block groups in the 12 Counties abutting the Hudson River revealed a closely coinciding pattern of lower socio-economic status and residential proximity to the Hudson River the proxy for potential exposure and a strong association of lower socioshyeconomic status and risk for CVD hospitalizations The strong associations among residential location socio-economic status and CVD risk prevent us from drawing definitive conclusions from the multivariable regression analyses

The multivariable analyses showed relatively small but statistically significant elevations of CVD hospitalization risk for residents of block groups in close proximity to the Hudson River Stratified analyses conducted within each of the four income quartiles showed such elevations only among residents of lower income block groups This studyrsquos data methods and analyses can not provide a conclusive interpretation for this finding However the strong effect of socioshyeconomic status on residential location (exposure proxy) and on CVD hospitalization risk makes it likely that the overall multivariable regression results are affected by uncontrolled confounding by socio-economic status

To draw more definitive conclusions particularly for outcomes such as CVD that are determined by multiple known risk factors studies seeking to investigate the role of PCB or POP exposures will require resource-intensive methods that include gathering individual-level exposure or biomonitoring information as well as individual-level medical histories and information on other risk factors

This current investigation did not include environmental or biological sampling data One study described previously did gather such data in the Hudson River region with the highest potential exposures (Hudson Falls Fort Edward) This study detected a statistically significant reduction in the total PCB concentrations in outdoor air samples at a distance of 1200 meters (34 mile) from the river However the PCB levels in outdoor air in this area were lower than those in other communities with known PCB-contaminated sites and similar to levels reported in other locations in the northeastern United States The study found no detectable differences in study

25

subjectsrsquo serum PCB levels associated with proximity to contaminated portions of the river or other PCB sources (Fitzgerald 2007 Palmer 2008)

It is recommended that the results of the current investigation of CVD hospitalizations and residence near the Hudson River be shared with NYS residents and other stakeholders with interest in this issue and be submitted for publication in a peer-reviewed journal

26

AGENCY INFORMATION

New York State Department of Health Authors Elizabeth Lewis-Michl PhD

Research Scientist Bureau of Environmental and Occupational Epidemiology

June Beckman-Moore MPH Research Scientist Bureau of Environmental and Occupational Epidemiology

Marta Gomez MS Research Scientist Bureau of Environmental and Occupational Epidemiology

Acknowledgments This study was supported by ATSDR and the Centers for Disease Control and Prevention

through a Cooperative Agreement Grant to NYS DOH (3U61TS200002-19W1)

The authors would like to thank the following people for their contribution to this project Alissa Caton PhD of the Bureau of Environmental and Occupational Epidemiology NYS DOH developed the study protocol assembled the preliminary data and conducted preliminary analyses Syni-an Hwang PhD Bureau Director and Shao Lin PhD Research Scientist Bureau of Environmental and Occupational Epidemiology NYS DOH and Edward Fitzgerald PhD Professor Division of Environmental Health and Toxicology SUNY Albany School of Public Health reviewed and provided comments on draft versions

New York State Department of Health Cooperative Agreement Coordinator Don Miles MS

Public Health Specialist Bureau of Environmental Exposure Investigation

ATSDR Technical Project Officer Gregory V Ulirsch PhD

Environmental Health Scientist Cooperative Agreement and Program Evaluation Branch

Division of Health Assessment and Consultation

Mohammed Uddin MD Epidemiologist

Division of Health Studies

ATSDR Regional Representatives Leah Graziano

Senior Regional Representative - Region 2 Office of Regional Operations

Racquel Stephenson Regional Representative - Region 2

Office of Regional Operations

27

REFERENCES Agency for Toxic Substances and Disease Registry (ATSDR) 2000 Toxicological Profile for Polychlorinated Biphenyls (PCBs) ATSDR Division of ToxicologyToxicology Information Branch Atlanta GA

Arisawa K Takeda H Mikasa H 2005 ldquoBackground exposure to PCDDsPCDFsPCBs and its potential health effects a review of epidemiologic studiesrdquo The Journal of Medical Investigation Feb 52(1-2)10-21

Arsenescu V Arsenescu RI King V Swanson H Cassis LA 2008 ldquoPolychlorinated biphenylshy77 induces adipocyte differentiation and proinflammatory adipokines and promotes obesity and atherosclerosisrdquo Environmental Health Perspectives 116 (6) 761-768

Ayala C Croft J Keenan N Neff L Greenlund K Donohoo R Zheng Zhi Mensah G 2003 ldquoIncreasing Trends in Pre-transport stroke deaths- United States 1990-1998rdquo Ethnicity amp Disease 12 Spr S2131-S2137

Barnett E Armstrong D Casper M 1999 Evidence of increasing coronary heart disease mortality among black men of lower social class AEP 98 464-471

Bertazzi PA Bernucci I Brambilla G Consonni D Pesatori AC 1998 ldquoThe Seveso studies on early and long-term effects of dioxin exposure a reviewrdquo Environmental Health Perspectives April 1062 625-633

Carpenter DO 2006 ldquoPolychlorinated biphenyls (PCBs) routes of exposure and effects on human healthrdquo Reviews in Environmental Health Jan-Mar 21(1)1-23

deVreede-Swagemakers JJ Gorgels APDubois-Arbouw WI van Ree JW et al 1997 ldquoOut-of hospital cardiac arrest in the 1990s a population-based study in the Maastricht area on incidence characteristics and survivalrdquo Journal of the American College of Cardiology 30(6) 1500-5

Everett CJ Mainous AG Frithsen IL Player MS Matheson EM 2008 ldquoAssociation of polychlorinated biphenyls with hypertension in the 1999-2002 National Health and Nutrition Examination Surveyrdquo Environmental Research 108 94-97

Fitzgerald EF Belanger EE Gomez MI Hwang S Jansing RL Hicks HE 2007 ldquoEnvironmental exposures to polychlorinated biphenyls (PCBS) among older residents of upper Hudson River communitiesrdquo Environmental Research 104 352-360

Galea S Blaney S Nandi A Silverman R Vlahov D Foltin G Kusick M Tunik M Richmond N 2007 ldquoExplaining racial disparities in incidence of and survival from out-of-hospital cardiac arrestrdquo American Journal of Epidemiology 1665 534-543

Greenland S Morgenstern H 1989 ldquoEcological bias confounding and effect modification International Journal of Epidemiology 18269-74

28

Greenland S 1992 ldquoDivergent biases in ecologic and individual level studiesrdquo Statistics in Medicine 111209-23

Greenland S Robins J 1994 ldquoInvited commentary ecologic studies ndash biases misconceptions and counterexamples American Journal of Epidemiology 139747-60

Greenland S Pearl J Robins JM 1999 Causal diagrams for epidemiologic research Epidemiology 1037-48

Greenland S 2003 ldquoQuantifying biases in causal models classical confounding vs collidershystratification biasrdquo Epidemiology May Vol 143 300-306

Ha M Lee D Jacobs DR 2007 ldquoAssociation between serum concentrations of persistent organic pollutants and self-reported cardiovascular disease prevalence results from the national health and nutrition examination survey 1999-2002rdquo Environmental Health Perspectives 1158 1204-1209

Hennig B Meerarani P Slim R Toborek M Daugherty A Silverstone AE Robertson LW 2002 ldquoProinflammatory properties of coplanar PCBs in vitro and in vivo evidence Toxicology and Applied Pharmacology Jun 15181(3)174-83

Hertz-Picciotto I ldquoEnvironmental Epidemiologyrdquo in Modern Epidemiology Rothman KJ and Greenland S 2nd ed Philadelphia Lippincott Williams amp Williams 1998 pp555-584

Howard G Cushman M Prineas R Howard V Moy C Sullivan L DrsquoAgostino R McClure L Pulley L Safford M 2009 ldquoAdvancing the hypothesis that geographic variations in risk factors contribute relatively little to observbed geographic variations in heart disease and stroke mortalityrdquo Preventive Medicine 49 129-132

Huang X Lessner L Carpenter DO 2006 ldquoExposure to persistent organic pollutants and hypertensive diseaserdquo Environmental Research 102(2005) 101-106

Kannel WB Cupples LA DrsquoAgostino RB 1987 ldquoSudden death risk in overt coronary heart disease the Framingham Studyrdquo American Heart Journal 113(3) 799-804

Ko DT Krumholz HM Wang Y Foody JM Masoudi FA Havranek EP You JJ Alter DA Stukel TA Newman AM Tu JV 2007 ldquoRegional differences in process of care and outcomes for older acute myocardial infarction patients in the United States and Ontario Canadardquo Circulation 115196-203

Lee D Lee I Song K Steffes M Toscano W Baker BA Jacobs DR 2006 ldquoA strong dose-response relation between serum concentrations of persistent organic pollutant and diabetesrdquo Diabetes Care 29 (7) 1638-1644

Lee D Lee I Porta M Steffes M Jacobs DR 2007 ldquoRelationship between serum concentrations of persistent organic pollutants and the prevalence of metabolic syndrome among non-diabetic

29

adults results from the National Health and Nutrition Examination Survey 1999-2002rdquo Diabetologia 50 1841-1851

Lind PM Orberg J Edlund U Sjoblom L Lind L 2004 ldquoThe dioxin-like pollutant PCB126 (33rsquo44rsquo5-pentachlorobiphenyl) affects risk factors for cardiovascular disease in female ratsrdquo Toxicology Letters 150 293-299

Longnecker MP Klebanoff MA Brock JW Zhou H 2001 ldquopolychlorinated biphenyl seum levels in pregnant subjects with diabetesrdquo Diabetes Care 24(6) 1099-1101

Lloyd-Jones D Adams R Carnethon et al (AHA Statistics Committee and Stroke Statstics Subcommittee) 2009 ldquoAHA Statistical Update Heart disease and stroke statisticsmdash2009 updaterdquo Circulation 119e21-e181

MapMarker Plus V 100 MapInfo Corp 2004

Morgenstern H 1998 ldquoEcologic studyrdquo In Armitage P Colton T eds Encyclopedia of biostatistics Vol2 NY NY John Wiley and Sons Inc 1255-76

Morganstern H ldquoEcologic Studiesrdquo Chapter 23 in Modern Epidemiology Rothman KJ and Greenland S 2nd ed Philadelphia Lippincott Williams amp Williams 1998 pp459-480

Morland K Wolff M Bopp R Godbold J Landrigan P 2008 ldquoFish consumption and body burden of organochlorines among Lower Hudson urban anglersrdquo American Journal of Industrial Medicine 51587-594

New York State Department of Health (NYS DOH) 2009a ldquoThe Burden of Cardiovascular Disease in New York Mortality Prevalence Risk Factors Costs and Selected Populationsrdquo Bureau of Chronic Disease Epidemiology and Surveillance Bureau of Health Risk Reduction httpwwwhealthstatenyusnysdohchronic_diseasecardiovascularburdenofcvdinnyspdf

New York State Department of Health (NYS DOH) 2009b ldquoHeart Disease and Stroke Indicatorsrdquo Community health assessment indicators Bureau of Biometrics and Health Statistics httpwwwhealthstatenyusstatisticschacchai

OrsquoNeill L 2003 ldquoEstimating out-of-hospital mortality due to myocardial infarctionrdquo Health Care Management Science 6 147-154

Palmer PM Belanger EE Wilson LR Hwang SA Narang RS Gomez MI Cayo MR Durocher LA Fitzgerald EF ldquoOutdoor air PCB concentrations in three communities along the Upper Hudson River New Yorkrdquo 2008 Archives of Environmental Contamination and Toxicology 54363-371

Piantadosi S Byar DP Green SB 1988 ldquoThe ecological fallacyrdquo American Journal of Epidemiology 127893-904

30

Rehkopf DH Berkman LF Coull B Krieger N 2008 ldquoThe non-linear risk of mortality by income level in a healthy population UgtSgt National Health and Nutrition Examination Survey mortality follow-up cohort 1988-2001rdquo British Medical Journal Public Health 8383

Richardson S Montifort C 2000 ldquoEcological correlation studies In Elliott P Wakefield JC Best NG et al eds Spatial epidemiology methods and applications NY NY Oxford University Press 205-220

SAS version 9 Cary North Carolina

Schoenbach V 2001 ldquoMulticausality Effect Modification wwwepidemiolognet Ch12 381shy413

Schoenbach V 2004 ldquoMulticausality Confounding wwwepidemiolognet Ch11 335-375

Sergeev AV and Carpenter DO 2005 Hospitalization rates for coronary heart disease in relation to residence near areas contaminated with persistent organic pollutants and other pollutants Environmental Health Perspectives 113(6)756-761

Shcherbatykh I Huang X Lessner L Carpenter DO 2005 Hazardous waste sites and stroke in New York State Environmental Health A Global Access Science Source 4(18) httpwwwehjournalnetcontent4118

Siemiatycki J Wacholder S Dewar R Cardis E Greenwood C and Richardson L 1988 ldquoDegree of confounding bias related to smoking ethnic group and socio-economic status in estimates of the associations between occupation and cancerrdquo Journal of Occupational Medicine 30 617-625

Tu YK Gunnell D Gilthorpe MS 2008 ldquoSimpsonrsquos paradox Lordrsquos paradox and suppression effects are the same phenomenon- the reversal paradoxrdquo Emerging Themes in Epidemiology 52

United Kingdom Heart Attack Study Collaborative Group Norris RM 1998 ldquoFatality outside hospital from acute coronary event in three British health districts 1994-1995rdquo British Medical Journal 316(7137) 1065-70

United States Bureau of the Census 2000 Census of population and housing summary file 1 (SF1) United States Department of Commerce 2001

United States Bureau of the Census 2000 Census of population and housing summary file 3 (SF3) United States Department of Commerce 2002

United States Bureau of the Census 1990 Census of population and housing summary file 1 (SF1) United States Department of Commerce 1991

31

United States Bureau of the Census 1990 Census of population and housing summary file 3 (SF3) United States Department of Commerce 1992

United States Censusrsquo Monitoring Board ldquoFinal Report to Congressrdquo September 1 2001 httpgovinfolibraryunteducmbcmbpreportsfinal)report

United States Department of Health and Human Services (USDHHS) Agency for Healthcare Research and Quality 2007 ldquoAHRQ Quality Indicators Guide to Prevention Quality Indicators Hospital Admission for Ambulatory Care Sensitive Conditionsrdquo October 2001 Version 31 (March 12 2007) downloaded 2009 httpwwwqualityindicatorsahrqgov

United States Department of Health and Human Services (USDHHS) Centers for Disease Control 2007 Health United States A Chartbook National Center for Health Statistics wwwcdcgovnchsdatahushus07pdf

United States Department of Health and Human Services (USDHHS) Centers for Disease Control National Center for Health Statistics 2007 ldquo2005 National Hospital Discharge Surveyrdquo No 385 by De Frances C and Hall M Division of Health Care Statistics

United States Department of Health and Human Services (USDHHS) 2009 Centers for Disease Control and Prevention National Center for Environmental Health 4th National Report on Environmental Chemicals httpwwwcdcgovexposurereportpdfFourthReportpdf

United States Department of Health and Human Services (USDHHS) National Institutes of Health 2010 National Institute of Environmental Health Sciences National Toxicology Program 11th Report on Carcinogens httpntpniehsnihgovntproceleventhprofiless149pcbpdf

United States Environmental Protection Agency (USEPA) 2002 EPA Superfund record of decision Hudson River PCBs Site Hudson Falls to New York City New York Washington DC

Wakefield J 2003 Sensitivity Analyses for Ecological Regression Biometrics March 59 9-17

Wakefield J 2004

Wing S 1988 Social inequalities in the decline of coronary mortality American Journal of Public Health 781415-1416

World Health Organization 2010 Atlas of Heart Disease and Stroke Part 2 Ch 3 Cardiovascular Disease Risk Factors httpwwwwhointcardiovascular_diseasesencvd_atlas_03_risk_factorspdf

32

Table 1 Previous Study Results Statewide Analyses and Hudson River Specific Analyses

Principal Diagnosis of Hospitalization

Adjusted Rate Ratioa (Lower 95CI shyUpper 95 CI)

NYS analysis Hudson River Subset

analysis Ischemic Heart Diseaseb

115 (103-129) 136 (118-156) Acute Myocardial Infarction 120 (103-139) 139 (118-163)

Cerebrovascular Diseasec 115 (105-126) 120 (110-132)

Ischemic Stroke 117(104-139) --shy Hemorrhagic Stroke 110 (099-122) ---

Hypertensionc 119 (108-131) 114 (105-123)

Table adapted from Sergeev et al 2005 Shcherbatykh et al 2005 and Huang et al 2006 Ischemic heart disease and stroke hospitalization rates for NYS ZIP codes containing (or

adjacent to) inactive hazardous waste sites with persistent organic pollutants compared to ZIP codes without (or not adjacent to) such sites

a All analyses include a 2nd exposure category for other types of inactive hazardous waste sites and adjust for age race and sex

b Ischemic heart disease analyses include all ages over 25 Ischemic heart disease analyses adjust for quartiles of median household income and health insurance coverage

c Cerebrovascular disease and hypertension analyses include ages 25-64

33

Table 2 Comparison of Design amp Methods for Previously Published Studies and Current Follow-up Study

OUTCOME ASSESSMENT CONFOUNDER ASSESSMENT EXPOSURE ASSESSMENT Previous Study Follow-up Study Previous Study Follow-up Study Previous Study Follow-up Study

Coronary Heart Disease 1993-2000 More years 1990-2005 ZIP code level Block group level ZIP code level Block group level

Ischemic heart disease (ICD-9 410shy Same outcomes with 4 levels of income ndash Same income Exposure = residence Two types of exposure 414) and sequelae of myocardial one addition separate 2000 Census categories within 196 ZIP codes indicator categories within infarction not elsewhere classified analysis of angina (ICDshy quartiles with hazardous waste 12 counties abutting (ICD-9 4297) Separate analysis of 9 413) sites containing POPs Hudson River acute myocardial infarction 6 age groups Same age groups or 222 ZIP codes with (ICD-9 410) age 25 and above hazardous waste sites Adjacency 255 block

without POPs versus groups adjacent versus Principal or other diagnoses (up to 14) Only principal diagnoses 4 race groups Different race residence within 996 2116 not adjacent to the combined AsianPacific categories White ldquocleanrdquo ZIP codes Hudson River

Use the admission Islander White black AsianPacific statewide Reference Sergeev AV Carpenter source field to eliminate black Native Island and Other which Distance Block group DO Hospitalization rates for coronary intrahospital transfers for American includes Native Sub-analyses (focus of population-weighted heart disease in relation to residence the same individual American and Multiple follow-up) Hudson centroid distance from the near areas contaminated with Race River area 78 POP ZIP Hudson River lt12 mile persistent organic pollutants and other codes versus 996 ldquocleanrdquo (225 block groups) pollutants Environ Health Perspect Add education ZIP codes statewide between frac12 and 1 mile 2005 113(6)756-761 population density (216 block groups) farther

hispanic and distance to than 1 mile (1930 block nearest hospital as groups) additional covariates

Stroke 1993-2000 More years 1990-2005 Restricted to 2nd and (same as above) (same as above) (same as above) 3rd quartile income

Cerebrovascular disease excluding Same outcomes ZIP codes other ill-defined and late effects of cerebrovascular disease (ICD9 430shy 4 age groups 25-64 436) Separate analyses of ischemic stroke (433x1 434x1 436) and 2 race groups hemorrhagic stroke ( 430-432) White black

Principal or secondary diagnoses (up Only principal diagnoses to 14) combined

Use the admission Reference Shcherbatykh I Huang X source field to eliminate Lessner L Carpenter DO Hazardous intrahospital transfers for waste sites and stroke in New York the same individual State Environ Health 2005 418

34

Table 3 Study Cardiovascular Disease Outcomes ICD-9-CM Codes and Definitions

Ischemic Heart Disease

ICD-9-CM 410 - ldquoAcute myocardial infarction (AMI)rdquo is defined as a sudden insufficiency of blood supply to an area of the heart muscle usually due to a coronary artery occlusion (obstruction) commonly known as heart attack

ICD-9-CM 411- ldquoOther acute and subacute forms of ischemic heart diseaserdquo refers to complications or symptoms occurring before or after AMI or a coronary artery occlusion (obstruction) interrupting blood flow to the heart without AMI

ICD-9-CM 412 - ldquoOld myocardial infarctionrdquo refers to a past AMI that is currently presenting no symptoms ICD-9-CM 413 - ldquoAngina pectorisrdquo is defined as severe constricting pain in the chest often radiating from the

area immediately over the heart to the left shoulder and down the arm ICD-9-CM 414 - ldquoOther forms of chronic ischemic heart diseaserdquo includes coronary atherosclerosis a chronic

condition marked by thickening and loss of elasticity of the coronary artery caused by deposits of plaque

ICD-9-CM 4297 - ldquoCertain sequelae of myocardial infarction not elsewhere classifiedrdquo refers to other symptoms or complications associated with a prior AMI

Cerebrovascular Disease

ICD-9-CM 430 - ldquoSubarachnoid hemorrhagerdquo is bleeding in the space between the brain and lining ICD-9-CM 431 - ldquoIntracerebral hemorrhagerdquo is bleeding within the brain ICD-9-CM 432 - ldquoOther and unspecified intracranial hemorrhagerdquo is bleeding nontraumatic between the skull

and brain lining ICD-9-CM 433 - ldquoOcclusion and stenosis of precerebral arteriesrdquo is blockage or stricture of the arteries

branching into the brain ICD-9-CM 434 - ldquoOcclusion of cerebral arteriesrdquo includes cerebral thrombosis and cerebral embolism ICD-9-CM 435 - ldquoTransient cerebral ischemiardquo includes cerebrovascular insufficiency (acute) with transient

neurological signs and symptoms ICD-9-CM 436 - ldquoAcute but ill-defined cerebrovascular diseaserdquo

35

Table 4 ndash Census Demographics for Study Area by Proximity to the Hudson River 2000

Census Demographics Study Area Within Block Groups Adjacent to River Block Group Centroid Distance to River Yes No lt frac12 mile frac12-1 mile gt 1 mile

n n n n n n Total Population and distribution within river distance categories

2924631 1000 315662 108 2608969 892 231414 79 283321 97 2409896 824

RaceEthnic Distribution 1000 1000 1000 1000 1000 1000 White 2383274 815 255023 808 2128251 816 154371 667 201457 711 2027446 841 Black 277302 95 34039 108 243263 93 43895 190 38959 138 194448 81 Total ldquoOtherrdquo (listed below)

American Indian 7147 02 882 03 6265 02 975 04 1058 04 5114 02

Asian 86465 30 6233 20 80232 31 4711 20 8979 32 72775 30 Pacific Islander 1058 lt01 111 lt01 947 lt01 104 lt01 145 lt01 809 lt01 Other 105182 36 12492 40 92690 36 19288 83 23338 82 62556 26 Multiple Races 64203 22 6882 22 57321 22 8070 35 9385 33 46748 19

Hispanic 268036 92 28449 90 239587 92 40691 176 46978 166 180367 75

Education lt9th grade 108312 56 13370 64 94942 55 13585 91 14518 81 80209 50 9th-12th grade 200374 103 27023 130 173351 100 23306 155 25161 140 151907 94 High school or GED 519236 267 62727 302 456509 263 44366 296 51585 287 423285 263 Some college 336283 173 36464 175 299819 173 24995 167 31141 173 280147 174 Associates degree 150134 77 16574 80 133560 77 10134 68 12939 72 127061 79 Bachelors degree 342297 176 28578 137 313719 181 18898 126 24584 137 298815 185 Graduate or professional degree

285823 147 23199 112 262624 151 14813 99 19909 111 251101 156

+Other SES Factors Households below poverty 91940 87 12875 108 79065 83 13977 154 14685 140 63278 72

mean SD mean SD mean SD mean SD mean SD mean SD Population density 57466 9589 54496 11391 57826 93468 13501 16071 11068 16248 43763 66909 Median household income 61010 31989 47320 23630 62668 32468 40748 21923 44962 20829 64847 32564

Sources US Bureau of the Census 2000 Census of population and housing summary file 1(SF1) US Department of Commerce 2001 US Bureau of the Census 2000 Census of population and housing summary file 3 (SF3) US Department of Commerce 2001

36

Table 5 Study Population Person Years by River Proximity Category For Income Quartiles and Race Categories

Person Years Total Adjacent Non-Adjacent lt12 mile frac12 to 1 mile gt1 mile n (row)

[column ] n (row)

[column ] n ()

[column ] n ()

[column ] n ()

[column ] n ()

[column ] Total Population 30219001 (1000)

[1000]

3269547 (108)

[1000]

26949454 (892)

[1000]

2380477 (079)

[1000]

2834956 (094)

[1000]

25003568 (827)

[1000]

Income Quartile

Lowest Income 7182853 (1000)

[237]

1345251 (187)

[411]

5837601 (813)

[217]

1462710 (204)

[614]

1236936 (172)

[436]

4483206 (624)

[179]

2nd Income 7778921 (1000)

[257]

1077888 (139)

[330]

6701033 (861

[249]

420361 (054)

[177]

808782 (104)

[285]

6549777 (842)

[262]

3rd Income 7671943 (1000)

[254]

551617 (072)

[169]

7120326 (928)

[264]

344840 (045)

[145]

511237 (067)

[180]

6815866 (888)

[273]

Highest Income 7585283 (1000)

[251]

294790 (039)

[090]

7290493 (961)

[271]

152565 (020)

[064]

278000 (037)

[098]

7154718 (943)

[286]

Race Category

White 25859811 (1000)

[856]

2784914 (108)

[851]

23074897 (892)

[856]

1761910 (68)

[740]

2238348 (87)

[790]

21859554 (845)

[874]

Black 2466185 (1000

[82]

297220 (121)

[91]

2168965 (879)

[81]

389721 (158)

[164]

316704 (128)

[112]

1759760 (714)

[70]

Asian 792737 (1000)

[26]

58007 (73)

[18]

734731 (827)

[27]

45151 (57)

[19]

78161 (99)

[28]

669426 (844)

[27]

Multiple Other 1100269 (1000)

[36]

129407 (118)

[40]

970862 (882)

[36]

183696 (167)

[77]

201743 (183)

[71]

714830 (650)

[29]

37

Table 6 Study Hospitalization Record Exclusions

After After dropping After imputing dropping records with

After excluding race amp poorly missing continuation After excluding eliminating geocoded covariates

Geocoded records transfers missing race records (racesex)

Total 202272 201814 02 169143 162 167733 08 154236 80 154220 001 (Loss ) Demograp hic group

White 155216 849 154805 849 132749 859 144262 860 132343 858 132332 858 Black 12878 70 12849 70 11109 72 12010 72 11328 73 11326 73 OtherNa 12745 70 12741 70 8814 57 9313 55 8647 56 8644 56 AsianPI 2021 11 2020 11 1810 12 2148 13 1918 12 1918 12 Missing Race 19412 96 19399 96 14661 87 (1410) Diagnosis

IHD 137569 680 137303 680 111005 656 110005 656 110000 656 101012 655

AMI 51840 256 51635 256 39338 233 39338 233 39013 233 35811 232

Angina Stroke

5792 64703

29 320

5788 64511

29 320

5316 58138

31 344

5316 58138

31 344

5285 57712

31 344

4731 53208

31 345

Isch str 23673 117 23543 117 21012 124 21012 124 20912 125 19371 126 Hem str 8097 40 8074 40 6544 39 6444 39 6492 39 5995 39

Missing covariates 11 records were missing sex 325 were missing age and 14661 records were missing race Race was imputed by random assignment based on block group of residencersquos race distribution for 13251 records (310 were also missing other covariates For approximately 1100 records with missing race geocoding was inadequate for assigning block group and 310 of those missing race were also missing additional covariates) One discharge can have one or more covariates missing (ie missing covariates are not mutually exclusive

38

Table 7 Income Quartile and Race Distributions for CVD Hospitalizations

Study Population

IHD n=101012

AMI n=35811

Angina n=4731

Stroke n=53208

Ischemic Stroke n=19371

Hemorrhagic Stroke n=5995

Income (Quartiles) Lowest Income 237 286 300 356 298 309 301 Second Income 257 254 265 237 256 256 252 Third Income 254 241 236 224 236 234 223 Highest Income 251 221 201 182 209 201 224

Total 1000 999 999 1000 1000

White 855 859 885 833 858 844 809 Black 82 64 62 102 92 108 110 Asian 26 12 12 11 12 14 19 Other 36 65 41 53 38 34 62

Study population income quartiles are based on census block median income Quartile cut-points were developed by weighting block group median income levels by block group population to identify the cut-points within the continuous income distribution After creating the cut-points the study population of each block group was assigned to the appropriate quartile category This resulted in the percentage of population in each quartile varying slightly from 25 because the entire populations of block groups were assigned to the same quartile

39

Table 8a Exposure Area Distributions and Rate Ratios for Ischemic Heart Disease Hospitalizations

Exposure Person-Years

(N=30219001) in

Ischemic Heart Disease Hospitalizations Total AMI Angina

No () No () RR (95 CI) No () RR (95 CI) No () RR (95 CI)

Adjusted for Age Sex Race Adjacent Block Group Yes 3269547 (108) 12327 (122) 133 (112-157) 4398 (123) 136 (114-163) 591 (125) 123 (106-142) No 26949454 (892) 88611 (878) 100 Reference 31380 (877) 100 Reference 4137 (875) 100 Reference Distance From River lt frac12 mile 2380477 (079) 9817 (97) 120 (103-140) 3399 (095) 126 (109-145) 542 (115) 160 (137-188) frac12 - 1 mile 2834956 (094) 11236 (111) 120 (104-140) 4048 (113) 125 (109-144) 610 (129) 156 (133-183) gt1 mile 25003569 (827) 79885 (791) 100 Reference 28331(792) 100 Reference 3576 (756) 100 Reference

Addrsquol adjustment for SES and other factors Adjacent Block Group Yes 3269547 (108) 12327 (122) 107 (104-111) 4398 (123) 106 (101-110) 591 (125) 106 (096-117) No 26949454 (892) 88611 (878) 100 Reference 31380 (877) 100 Reference 4137 (875) 100 Reference Distance From River lt frac12 mile 2380477 (079) 9817 (97) 102 (098-106) 3399 (095) 103 (098-109) 542 (115) 109 (098-121) frac12 - 1 mile 2834956 (094) 11236 (111) 103 (099-108) 4048 (113) 106 (101-111) 610 (129) 113 (102-125) gt 1 mile 25003569 (827) 79885 (791) 100 Reference 28331(792) 100 Reference 3576 (756) 100 Reference

Agegt25 1990-2005 50 sample Adjusted for age (25-34 35-44 45-54 55-64 65-74 75+ years) race (white black other) and sex (male female) Adjusted for SES factors quartiles of population density (61-5066 5066lt-21020 21020lt-64102 64102lt-943716 persons per square mile) median 1999 household income ($249900-4096800 $4096800lt-5465000 $5465000lt-7294400 $7294400lt-20000100) with less than a college education (0-306 306lt-427 427lt-555 555lt-1000) with Hispanic ethnicity (0-21 21lt-47 47lt- 96 96lt-860) and distance to nearest hospital

40

Table 8b Exposure Area Distributions and Rate Ratios for Stroke Hospitalizations Age gt= 25 1990-2005

Exposure Person-Years

(N=30219001) in Study Area

Stroke Hospitalizations

Total Ischemic Hemorrhagic

No () No () RR (95 CI) No () RR (95 CI) No () RR (95 CI)

Adjusted for Age Sex Race Adjacent Block Group Yes 3269547 (108) 6276 (118) 125 (109-144) 2334 (121) 128 (110-148) 635 (108) 099 (086-113)

No 26949454 (892) 46748 (882) 100 Reference 16989 (879) 100 Reference 5263 (892) 100 Reference

Distance From River lt frac12 mile 2380477 (079) 5280 (099) 116 (104-131) 1925 (100) 120 (105-137) 518 (088) 102 (088-117)

frac12 - 1 mile 2834956 (094) 6023 (114) 118 (106-132) 2227 (115) 127 (112-145) 658 (111) 115 (100-131)

gt1 mile 25003569 (827) 41721 (787) 100 Reference 15171 (785) 100 Reference 4722 (801) 100 Reference

Addl Adjustment for SES and Other Factors Adjacent Block Group Yes 3269547 (108) 6276 (118) 107 (103-112) 2334 (121) 110 (104-116) 635 (108) 097 (088 ndash 106)

No 26949454 (892) 46748 (882) 100 Reference 16989 (879) 100 Reference 5263 (892) 100 Reference Distance From River lt frac12 mile 2380477 (079) 5280 (099) 106 (101-110) 1925 (100) 108 (101-115) 518 (088) 089 (081-099)

frac12 - 1 mile 2834956 (094) 6023 (114) 106 (102-111) 2227 (115) 111 (104-117) 658 (111) 108 (099-119)

gt 1 mile 25003569 (827) 41721 (787) 100 Reference 15171 (785) 100 Reference 4722 (801) 100 Reference

Age gt 25 years 1990-200650 sample Adjusted for age (25-34 35-44 45-54 55-64 65-74 75+ years) race (white black other) and sex (male female) Adjusted for SES factors quartiles of population density (61-5066 5066lt-21020 21020lt-64102 64102lt-943716 persons per square mile) median 1999 household income ($249900-4096800 $4096800lt-5465000 $5465000lt-7294400 $7294400lt-20000100) with less than a college education (0-306 306lt-427 427lt-555 555lt-1000) with Hispanic ethnicity (0-21 21lt-47 47lt- 96 96lt-860) and distance to nearest hospital

41

Table 9 Rate Ratios for CVD Hospitalizations Stratified by Income Quartile

Rate Ratio for lt frac12 mile versus gt 1 mile from river

IHD AMI Angina Stroke Ischemic Stroke Hemorrhagic Stroke

RR (95CI) RR (95CI) RR (95CI) RR (95CI) RR (95CI) RR (95CI)

Adjusted for Age Sex Race Lowest Income 1 123 (108-140) 124 (110-139) 139 (120-160) 120 (109-133) 122 (107-138) 118 (103-137) Income 2 115 (101-131) 118 (101-139) 155 (124-195) 124 (113-135) 127 (109-147) --shyIncome 3 088 (075-103) 083 (068-102) 102 (076-137) 092 (079-107) 093 (076-113) --shyHighest Income 4 080 (064-099) 074 (054-103) 030 (013-070) 078 (065-093) 071 (053-095) 043 (024-076) Weighted average 100 098 091 102 100 071

Addrsquol Adjustment for other demographic factors Lowest Income 1 114 (107-120) 114 (106-123) 112 (097-130) 116 (109-124) 120 (110-131) 109 (095-125) Income 2 110 (102-118) 111 (101-123) 136 (108-172) 110 (101-119) 112 (099-128) 095 (076-119) Income 3 085 (077-094) 086 (075-097) 093 (068-126) 097 (087-108) 096 (082-112) 046 (033-066) Highest Income 4 068 (060-078) 069 (057-085) 028 (012-062) 069 (059-081) 058 (044-077) --shyWeighted average 092 093 079 096 093 078

To reduce the complexity of this Table only one of the distance categories is provided The other distance categories showed similar results Adjusted for age (25-34 35-44 45-54 55-64 65-74 75+ years) race (white black other) and sex (male female) Block group median income quartile ranges $249900-4096800 $4096800lt-5465000 $5465000lt-7294400 $7294400lt-20000100 Adjusted for demographic factors quartiles of population density (61-5066 5066lt-21020 21020lt-64102 64102lt-943716 persons per square mile) with less than a college education (0-306 306lt-427 427lt-555 555lt-1000) with Hispanic ethnicity (0-21 21lt-47 47lt- 96 96lt-860) and distance to nearest hospital

42

Table 10 Hospitalization Rate Estimates for Current Study and from Published Sources (age-adjusted to 2000 US standard per 10000)

Current Study 12 Counties abutting Hudson River 1990-2005

Total US CDC 2005

Northeastern US CDC 2005

SPARCS NYS excluding NYC 2005-2007

IHD 649 619 691

AMI 232 231 278

Angina 31 Stroke 349 303 307 351

Ischemic Stroke 127 39 Hemorrhagic Stroke

Source USDHHS 2005 National Hospital Discharge Survey No 385 July 12 2007 Source NYS DOH SPARCS 2009b Comparable estimates were not able to be provided because published outcome groupings differed

43

Figure 1 ZIP codes by Adjacency to the Hudson River

44

Figure 2 Census Block Groups by Adjacency to the Hudson River

45

Figure 3 Census Block Groups by Population-Weighted Centroid Distance to the Hudson River

46

IHD Hospitalization Rates by Sex Race Income (age-adjusted) (other race

group excluded)

000

200

400

600

800

1000

1200

1400

1600

Low est IncQ

Q2 Q3 Highest Inc Q

Income Quartile

Hos

pita

lizat

ions

per

10

00

w hite men black men

asian men w hite w omen

black w omen asian w omen

AMI Hospitalization Rates by Sex Race Income (age-adjusted) (other race

group excluded)

000

100

200

300

400

500

600

700

Low est IncQ

Q2 Q3 Highest Inc Q Income Quartile

Hos

pita

lizat

ions

per

10

00

w hite men black men

asian men w hite w omen

black w omen asian w omen

Stroke Hospitalization Rates by Sex Race Income (age-adjusted) (other

race group excluded)

000

100

200

300

400

500

600

700

800

900

Low est In Q

Q2 Q3 Highest Inc Q Income quartile

Hos

pita

lizat

ions

per

10

00w hite men black men

asian men w hite w omen

black w omen asian w omen

Figure 4 Figure 5 Figure 6

47

MAURICE D HINCHEY 22NO DISTRICT NEW YORK

COMMITTEE ON APPROPRIATIONS

~lIC()foltIIInas

AGRICUJTlAE RURAL OEElOPWNT rooo AND ORUG ADMINISTRATION

AND RBATEO AGEfjC~S

INTERIOR

~onlJltss of tl)t ~nttn ~tatt5 ~oU5t of ItpttUtlltl1tibeg lHaJington 111laquo 20515-3222

October 4 2006

Howard Frumkin MD DrPH Director Agency for Toxic Substances and

Disease Registty and CDCs National Center for Environmental Health

1600 Clifton Road NE MSE-28 Atlanta GA 30333

Dear Dr Frumkin

First I would like to congratulate you on your appoinbnent as Director of A TSDR and CDCs National Center for Environmental Health llook forward to the opportunity to work with you on important public health issues this year and in the future

My second reason for writing today is to request that ATSDR assist my office in conducting a health consultation study for residents along a 200 mile segment of the Hudson River a Superfund site on the National Priority List containing PCBs in order to measure whether further evaluation may be warranted

As you may know Dr David Carpenter has done a number of studies at SUNY Albany in New York that suggest that resideilcy in proximity to hazardous waste sites with Persistant Organic Pollutants (POPs) may be associated with increased hospitalization rates for Coronary Heart Disease (CHD) when compared to hospitalization rates tor areas with no waste sites or other types of waste not catagorized as POPs I have enclosed a copy of three of these research papers that demonstrate this correlation for heart disease hypertension and stroke

I am very concerned about the health of the residents living along the Hudson River and respectfully request that your agency look into what seems to be a very serious health threat For purposes of my request T would appreciate your agency looking into the subset population Dr Carpenter examined in 78 zip codes along the 200 mile stretch of the Hudson River from Hudson Falls to New York City This subset population studied by Dr Carpenter showed a statistically significant increase in hospital discharges for heart disease hypertension and stroke Interestingly according to CDCs data base the Behavioral Risk Factor Surveillance System this population on average has higher incomes exercises more frequently and conswnes more fruits and vegetables than the average New Yorker Additionally this population has more nonsmokers than the rest of Upstate New Yorks population

WASHINGTON OIFCE

2411 RAYBURN HOUSE OfFICE BUILDING WASH1NcrroN DC 20amp1amp-3222

(Zl2) 225-6335

_fOMetoYht lnchIY

BlNGI-UMTON OfRCE m-wA OfFICE KIGSTCN OFFIepound MIDDLnWJN OffICE MOJmCEllO OffiCE IDDA IfOCML lUllDlNG 123 SOUTH CAYJOA smra U l WAll

m STREET em HALL THlflD flOOR 18 ANWANf LAKf CAD

RIIrlGHAtATON NY 13901 SUITE 201 KINGSTOfl 12401 te JUpoundS STilEET MONTlCfllO NY 12701 (6171773-2768 ITHACA HY 14860 (M5)3~ MlDgtlETOWN NY 109(0 (amptS) 7S1-71e

(607) 273-~388 (845) l4C-3l11

Appendix A Representative Hinchey Letter

Dr Howard Frumkin Page 2 October 4 2006

Since this type of population would typically exhibit better than average cardiac health Im sme you can understand and appreciate my concern and sense of urgency that we must follow up on Dr Carpenters good work and detcnnine what further study is warranted to protect the health of the populatioD living and working aloDg the Hudson River

I than1c you for yom kind attention to my request I look forward to hearing back from you at your earliest convenience

Best regards

Sincerely

~ane~ey Member of Congress

MHlag

49

Appendix B Table 1a IHD Hospitalization Rate Ratios agesexrace adjusted versus additionally adjusted For residents (age 25 and older) of block groups closer versus more distant from the Hudson River in models adjusting for sex age race and block group median household income population density education level percent Hispanic and block group centroid distance to nearest hospital 1990-2005

Adjusted Rate Ratios by Block Group River Proximity (95 Confidence Intervals) IHD (N=101012) AMI (N=35811) Angina (N=4731)

Risk Factor Agesexrace adjusted

Addrsquol adjustment Agesexrace adjusted

Addrsquol adjustment Agesexrace adjusted

Addrsquol adjustment

Block group distance from River

lt frac12 mile vs gt 1 mile 120 (103-140) 102 (098-106) 126 (109-145) 103 (098-109) 160 (137-188) 109 (098-121) frac12 - 1 mile vs gt 1 mile 120 (104-140) 103 (099-107) 125 (109-144) 106 (101-111) 156 (133-183) 113 (102-125)

Sex Male vs female 197 (174-223) 214 (208-220) 211 (187-238) 209 (202-215) 113 (099-129) 113 (106-121) Race Black vs white 120 (102-142) 093 (089-097) 109 (094-127) 092 (087-097) 178 (152-208) 121 (108-136)

AsianPacific Is ldquo 080 (066-096) 073 (069-079) 085 (070-103) 072 (065-081) 075 (055-102) 065 (049-087) Other ldquo 246 (208-291) 377 (363-392) 189 (162-220) 204 (191-218) 257 (215-307) 205 (177-238)

Age 35-44 vs 25-34 809 (630-104) 824 (743-913) 647 (494-849) 658 (568-762) 491 (348-692) 498 (376-660) 45-54 ldquo 333 (261-426) 344 (312-380) 228 (175-296) 230 (200-265) 1539 (1108-214) 157 (120-206) 55-64 ldquo 827 (647-1057) 802 (727-885) 576 (443-748) 516 (449-593) 278 (200-386) 284 (217-371) 65-74 ldquo 1477 (1155-1890) 1384 (1255-1527) 1067 (820-1387) 899 (782-1033) 409 (294-567) 449 (344-586) 75+ ldquo 2124 (1656-2724) 1942 (1760-2142) 2190 (1683-2849) 1849 (1610-2123) 661 (476-919) 695 (533-905)

Population density

2nd lowest vs least dense

126 (121-132) 131 (124-137) 142 (127-159)

more dense ldquo 130 (124-136) 134 (127-141) 131 (116-147) most dense ldquo 120 (114-126) 121 (114-129) 115 (100-131)

Median Household Income

2nd highest vs highest income

105 (101-110) 113 (108-119) 123 (110-137)

lower ldquo 109 (104-113) 123 (117-130) 125 (111-141) lowest ldquo 116 (111-122) 129 (122-137) 162 (142-185)

with less than college

2nd lowest vs least college

114 (109-118) 118 (113-124) 110 (098-122)

higher ldquo 122 (117-127) 126 (119-132) 115(102-129) highest

ldquo

122 (116-128) 128 (120-135) 129 (113-148)

Hispanic 2nd lowest vs least Hispanic

093 (089-097) 094 (0890-098) 101 (091-112)

higher ldquo 092 (088-096) 096 (092-101) 118 (107-131) highest

ldquo

085 (081-088) 088 (084-093) 138 (123-154)

Distance to Hospital

2nd closest vs closest 097 (094-100) 094 (090-098) 107 (098-118) more distant ldquo 090 (087-094) 093 (089-098) 096 (086-107) furthest ldquo 087 (083-091) 091 (087-096) 089 (079-101)

50

Appendix B Table 1b Stroke Hospitalization Rate Ratios agesexrace adjusted versus additionally adjusted For residents (age 25 and older) of block groups closer versus more distant from the Hudson River in models adjusting for sex age race and block group median household income population density education level percent Hispanic and block group centroid distance to nearest hospital 1990-2005

Adjusted Rate Ratios by Block Group River Proximity (95 Confidence Intervals) Total Stroke (N=53208) Ischemic Stroke (N=19371) Hemorrhagic Stroke (N=5995)

Risk Factor Agesexrace adjusted

Addrsquol adjustment Agesexrace adjusted

Addrsquol adjustment Agesexrace adjusted

Addrsquol adjustment

Block group distance from River

lt frac12 mile vs gt 1 mile 116 (104-131) 106 (101-110) 120 (105-137) 108 (101-115) 102 (088-117) 089 (081-099) frac12 - 1 mile vs gt 1 mile 118 (106-132) 106 (102-111) 127 (112-145) 111 (104-117) 115 (100-131) 108 (099-119)

Sex Male vs female 114 (104-125) 127 (123-130) 126 (113-141) 131 (126-136) 114 (103-127) 123 (116-130) Race Black vs white 194 (173-219) 153 (147-160) 233 (204-267) 191 (180-203) 215 (188-246) 190 (173-209)

AsianPacific Isl ldquo 097 (083-113) 088 (080-096) 114 (094-138) 101 (088-116) 142 (114-177) 129 (106-157) Other ldquo 230 (203-260) 232 (219-245) 229 (197-265) 212 (193-232) 360 (310-417) 334 (296-377)

Age 35-44 vs 25-34 309 (251-379) 303 (269-343) 359 (267-483) 362 (289-455) 274 (212-353) 268 (219-329) 45-54 ldquo 114 (938-139) 1087 (970-1217) 149 (113-197) 135 (109-167) 655 (513-836) 605 (500-734) 55-64 ldquo 331 (273-402) 329 (295-368) 433 (329-569) 418 (340-514) 119 (933-152) 109 (904-132) 65-74 ldquo 826 (682-1001 861 (772-959) 1145 (872-1504) 1114 (908-1367) 214 (168-272) 224 (186-270) 75+ ldquo 1890 (1559-2290) 2037 (1828-2269) 2768 (2112-3627) 3060 (2497-3750) 554 (438-700) 587 (490-703)

Population density

2nd lowest vs least dense

126 (120-132) 124 (117-132) 129 (117-142)

more dense ldquo 126 (120-132) 121 (113-129) 142 (128-157) most dense ldquo 124 (118-132) 128 (119-138) 150 (133-169)

Median Household Income

2nd highest vs highest income

106 (101-110) 110 (103-117) 099 (090-108)

lower ldquo 113 (108-118) 113 (106-121) 107 (097-118) lowest ldquo 116 (110-122) 117 (108-126) 114 (102-127)

with less than college

2nd lowest vs least college

111 (107-116) 112 (106-119) 098 (090-107)

higher ldquo 120 (114-125) 120 (113-128) 102 (093-112) highest ldquo 116 (110-122) 119 (110-128) 093 (090-108)

Hispanic 2nd lowest vs least Hispanic

096 (092-100) 094 (088-099) 084 (077-091)

higher ldquo 101 (096-105) 096 (091-102) 084 (077-091) highest ldquo 096 (092-100) 088 (082-093) 078 (070-085)

Distance to Hospital

2nd closest vs closest 095 (091-098) 092 (087-097) 097 (089-105) more distant ldquo 092 (088-096) 091 (086-097) 089 (081-097) furthest ldquo 085 (081-090) 082 (077-088) 088 (079-098)

51

Appendix B Table 2 Sex and Race-Specific Age-Adjusted IHD AMI and Stroke Rates by Income (per 1000)

Men Women white black asian other white black asian other

IHD Lowest IncQ 470 401 492 940 283 354 477 543 Q2 425 339 675 1443 221 236 364 690 Q3 436 301 420 2463 199 233 236 902 Highest Inc Q 399 325 218 4307 160 172 120 1247 AMI Lowest IncQ 180 154 203 269 101 112 221 159 Q2 159 141 300 306 085 088 138 179 Q3 152 114 139 480 078 084 096 214 Highest Inc Q 135 130 046 820 062 065 052 275 Stroke Lowest IncQ 201 292 330 370 166 281 343 261 Q2 182 254 389 466 150 247 274 373 Q3 192 238 231 598 145 245 201 401 Highest Inc Q 177 275 102 905 132 197 070 776

These rates are estimated from the 50 sample of hospitalizations in the 12-county study area included in this investigation The rates are adjusted to the 2000 standard US population

52

CERTIFICATION

The health consultation for the Hudson River PCBs site Residential Proximity to the Hudson River and Hospitalization Rates for Ischemic Heart Disease and Stroke 1990shy2005 was prepared by the New York State Department of Health under a cooperative agreement with the Agency for Toxic Substances and Disease Registry (ATSDR) It is in accordance with approved methodology and procedures existing at the time the health consultation was initiated Editorial review was completed by the cooperative agreement partner

The Division of Health Assessment and Consultation (DHAC) ATSDR has reviewed this public health assessment and concurs with its findings

ATSDR

Page 11: HR CVD Final HCl 2011

economic status are likely not the cause for the elevated hospitalization rates in these Hudson River area ZIP codes

In these previously published reports Behavioral Risk Factor Surveillance System (BRFSS) data for entire counties are provided to support statements about the population of Hudson River ZIP codes having higher income and healthier lifestyles However because the ZIP codes adjacent to the Hudson River include only a proportion of the population of the counties abutting the river these county-level BRFSS data may be misleading regarding characteristics of the ZIP codes adjacent to the river (Figure 1) In fact data from the US Census show that the ZIP codes adjacent to the river have lower median income than the counties within which they are located The median income for the total population of the twelve counties adjacent to the Hudson River was $61010 in 2000 while the median income for the ZIP codes adjacent to the river was $45356 This median income for the population of the ZIP codes adjacent to the river is also slightly lower than the statewide (excluding NYC) median income of $47735 (US Census 2000) Thus the Hudson River ZIP codes analyzed in the Sergeev and Carpenter Shcherbatykh et al and Huang et al studies include a population of lower not higher socio-economic status compared to the rest of NYS (excluding NYC)

Because CVD and many CVD risk factors are associated with lower socio-economic status it is especially important to adjust as much as possible for socio-economic status when conducting comparative analyses of CVD hospitalizations or disease incidence or prevalence Regarding the hypothesis about POPs and CVD research studies described in more detail below have suggested possible biological mechanisms for POP exposuresrsquo effects on the heart There is very little research on human exposures to POPs and potential cardiovascular effects however

In response to Congressman Hincheyrsquos request for follow-up of the 2004-2005 findings ATSDR and NYS DOH agreed to conduct a similar study also ecological but at a different level of geography the Census block group By using a level of geography that is generally smaller in area and population than a ZIP code this follow-up seeks to more accurately adjust for socioshyeconomic and other factors that affect cardiovascular hospitalizations and to improve upon the geographic classification of relative distance to the Hudson River In addition rather than comparing hospitalizations among residents of Hudson River area ZIP codes to hospitalizations among residents of all other NYS (exclusive of NYC) ZIP codes with no waste sites with POPs this follow-up investigationrsquos analyses are restricted to the twelve counties abutting the Hudson river from Washington County south to Westchester County Within the study arearsquos twelve counties cardiovascular hospitalization rates in areas closest to the Hudson River are compared with rates in areas farther from the river This more localized approach may also assist with reducing the potential impact of regional differences in medical care practice or reporting factors that could affect the hospitalization data (Ko et al 2007) (Table 2 shows a more detailed listing of the methods used in the prior studies and follow-up study)

Regarding environmental exposures it is important to note that living near a hazardous waste site that contains PCBs or other POPs does not necessarily mean that human exposure from the site has occurred Depending upon the conditions at a site or the specific chemicals exposures may be highly improbable Information on body burden for example PCB levels in blood or on levels of the chemical in off-site air dust or soil is needed to support a claim of exposure The

2

studies published in 2005-2006 did not provide specific information such as environmental or biomonitoring data in support of the assumption that residence in ZIP codes containing hazardous waste sites with POPs or residence in ZIP codes adjacent to the Hudson River resulted in increased exposures to POPs in the population For the purpose of this follow-up we did not conduct environmental or human biological sampling to address this issue Rather this follow-up replicates the prior studiesrsquo approach with some enhancements to better control for confounding particularly confounding by socio-economic status

This descriptive epidemiologic investigation uses already existing hospitalization data from the Statewide Planning and Research Cooperative System and population data from the United States Census to evaluate comparative rates of cardiovascular hospitalizations While this type of investigation can not show proof of cause and effect it can assist in interpreting relationships among indicators of CVD burden and hypothesized exposures associated with distance from the Hudson River This descriptive investigation may also assist in determining whether additional health investigations of these associations are warranted

2 BACKGROUND

Cardiovascular Disease

The widely known risk factors for cardiovascular disease (CVD) in addition to age sex race and family history include tobacco use high blood pressure high serum cholesterol alcohol use obesity low fruit and vegetable intake physical inactivity diabetes low socio-economic status mental ill-health psychosocial stress and the use of certain medications (WHO 2010 NYS DOH 2009 Wing 1988 NYS DOH 2009a)

CVD Definitions CVD is a general term for any disease of the circulatory system The two main types of CVD are ischemic heart disease also termed coronary heart disease and cerebrovascular disease also called stroke Ischemic heart disease (IHD) results from restricted blood flow through the arteries supplying the heart muscle The most serious risk from IHD is acute myocardial infarction (AMI) another term for heart attack AMI occurs when blood flow to the heart is suddenly reduced or stopped usually due to a blockage in a coronary artery Angina is severe constricting pain in the chest often radiating from the area immediately over the heart to the left shoulder and down the arm Cerebrovascular disease refers primarily to stroke the sudden impairment of brain function resulting from interruption of circulation to the brain Ischemic stroke results from blockage of an artery supplying blood to the brain Hemorrhagic stroke occurs due to the escape of blood from a ruptured artery supplying blood to the brain

CVD Burden and Trends CVD is the leading cause of death for men and women of all ages in the United States and in New York State In 1999 for every death from IHD there were five hospitalizations for ischemic disease in NYS (NYS DOH 2009a) Mortality rates from CVD have been declining steadily since the 1980rsquos Research studies estimate that approximately half of the decline is due to improved treatments for CVD and its risk factors such as high cholesterol and high blood pressure and approximately half is due to decreasing incidence of CVD The decrease in incidence is due to trends in health behaviors primarily reduced smoking rates

3

CVD mortality rates do not appear to be declining for all segments of the population however and may actually be increasing among some groups particularly people of lower socio-economic status (Barnett et al 1999) This may be due in part to higher rates of smoking among people of lower socio-economic status as well as to other factors related to socio-economic status and race A growing literature points to multiple causes for CVD and other health disparities including health practices psychosocial stress limited resources discrimination and access to health care (Health United States 2007) Increases in the prevalence of diabetes and obesity in the general population partially offset advances in treatment and favorable trends for other risk factors An increasing ischemic heart disease mortality rate in women ages 35 to 54 since 1997 is attributed to these unfavorable risk factor trends (Ford 2007 Lloyd-Jones 2009) New York State (NYS) and other northeast states have high IHD and low stroke mortality with NYS having the highest IHD mortality and lowest stroke mortality among the 50 states (Howard et al 2009)

CVD Data Sources CVD mortality rates are more easily measured than incidence prevalence or hospitalizations because mortality data from vital records are a comprehensive and relatively accessible data source Information for estimating the incidence and prevalence of CVD outcomes comes primarily from health interview physical examination and record surveys Hospitalization data are available for NYS and the US but are generally collected and analyzed in order to track trends in hospital care and resources expended Actual CVD incidence data validated by physician diagnosis physician examination or reviews of medical records are not directly available for the entire US or for specific geographic areas of NYS Such information is available from population-based cohort studies that follow people over time in selected communities While the concern underlying this research project is to understand CVD levels in the community ie incidence and prevalence we do not have data that directly provide us with this information In the absence of such data hospitalizations are frequently used as indicators of the burden of disease in communities particularly in surveillance or hypothesis-generating investigations such as this one In NYS the Behavioral Risk Factor Surveillance System (BRFSS) an ongoing monthly telephone survey of adults aged 18 years and older provides information about the burden of CVD by age sex race income and education

CVD Risk Factor Data The BRFSS data for NYS show that over 20 of the population aged 65 and older reported having been diagnosed with CVD versus less than five percent of the population under 65 Males were slightly more likely to report having CVD than females Nonshyhispanic whites compared to other ethnic groups combined were slightly more likely to report having CVD None of these slight differences were statistically significant however The only statistically significant demographic risk factors in the BRFSS data were low income and education levels Statistically significant elevations of reported CVD were observed for respondents with incomes below $25000 and respondents with a high school degree or less education (NYS DOH 2009a)

Another factor that can affect the outcome measure hospitalizations is the likelihood of mortality prior to reaching the hospital Studies show a variety of factors affect whether an individual experiencing a heart attack or stroke outside the hospital survives to be counted in hospitalization data These factors include lack of knowledge about personal risk or prior disease socio-economic status race marital status and distance to the hospital (Galea 2007

4

Barnett 2006 Ayala 2003) Population-based studies report sudden cardiac death usually outside the hospital to be the first overt manifestation of heart disease for 40 to 60 of all cases ( Kannel 1987 deVreede-Swagemakers 1997 UK Heart Attack Study 1998) An analysis of out-of-hospital mortality from AMI in Pennsylvania in 1998 showed that in relatively rural areas where travel time to the hospital was 25 minutes or longer 72 of all AMI deaths occurred outside the hospital compared with a statewide average of 49 of AMI deaths occurring outside of the hospital (OrsquoNeill 2003) In New York State 55 of IHD deaths occur outside the hospital compared to 35 of stroke deaths (NYS DOH 2009b)

Hudson River PCB Contamination

During an approximate 30-year period ending in 1977 the General Electric (GE) plants in Hudson Falls and Fort Edward used PCBs in the manufacture of electrical capacitors and discharged large quantities of PCBs into the upper Hudson River Estimates of the total direct discharges to the river are as high as 1330000 pounds In 1983 the US EPA classified a 200shymile section of the Hudson River from Hudson Falls to New York City as a National Priority List site making it one of the largest Superfund sites in the United States In 2002 the US EPA issued a Record of Decision calling for targeted environmental dredging and removal of PCB-contaminated sediment in the upper River (USEPA 2002) The dredging began in May 2009

The Superfund environmental investigations and planned remedial actions have focused on the areas of the river near Hudson Falls and Fort Edward where the highest levels of PCBs in sediments are located The 2002 Record of Decision provides information about the human health risk assessment conducted for the Upper and Mid-Hudson and the ecological risk assessment conducted for the Upper and Lower Hudson River (USEPA 2002) Fish contamination and fish consumption are the primary concerns at the site

In 1976 fishing was banned in the upper river and in 1995 a catch and release policy replaced the ban NYS DOH continues to recommend people eat no fish from the Upper Hudson River that children under age 15 and women of childbearing age eat no fish from the entire 200 mile stretch of river below Hudson Falls and that the general population eat none of most species of fish caught between the federal dam at Troy and Catskill approximately 40 miles south of Troy The Lower Hudson remains closed to commercial fishing for striped bass and eight other species

The fishing bans and advisories were issued because ingestion of fish is the major potential pathway for exposure for the general population Many previous studies have shown relationships between consumption of fish from contaminated waters and human PCB levels Another potential pathway for exposure less frequently studied to date is inhalation of PCBs NYS DOH and ATSDR launched ldquoThe Hudson River Communities Projectrdquo in 2000 to address this issue (More information on this project is provided below)

POPS and PCBs

The category ldquopersistent organic pollutantsrdquo (POPs) includes man-made chemicals that are by definition persistent in the environment Examples of POPs are polychlorinated biphenyls

5

(PCBs) polychlorinated dibenzo-p-dioxins (PCDDs) polychorinated dibenso-furans (PCDFs) and pprsquo-diphenyldichloroethene (DDE) a breakdown product of the pesticide dichlorodiphenyltrichloroethane (DDT) These chemicals are stored in human fat tissues accumulating over an individualrsquos lifetime They also accumulate in fish and other species thereby concentrating up the food chain Levels of most POPs measured in blood serum of Americans have decreased in the last few decades An exception is polybrominated diphenyl ethers (PBDEs) levels of which have been increasing in recent years While levels of many POPs are declining a large number continue to be detectable in blood for most of the US population (USDHHS 2009)

This follow-up investigation focuses on the Hudson River where the POPs of concern are PCBs PCBs are a group of synthetic chlorinated compounds manufactured in the US through 1977 Production was halted due to concerns about the persistence of PCBs in the environment and the toxicity of PCBs shown in animal models PCBs were used primarily by the electrical industry for capacitors and transformers and were also used in hydraulic fluids fluorescent light fixtures flame retardants inks adhesives carbonless copy paper paints pesticide extenders plasticizers and wire insulators PCBs can be oily liquids or solids and some are volatile and can exist as vapor in air PCBs in the environment are generally mixtures of the many different types known as congeners (ATSDR 2000)

The variety of types of PCBs adds to the complexity of studying adverse animal and human health effects Increasingly PCB research focuses on specific PCB congeners and groups of congeners because specific types of congeners appear to have different mechanisms of action and toxicity Dioxin-like PCBs (DLPCBs) are an important subset of PCBs due to concerns about their toxicity The focus on dioxin-like PCBs is paralleled by increasing research on the group of POPs with dioxin-like properties This type of PCB and POP research assigns toxic potency values relative to dioxin to various PCBs and POPs to provide an overall toxic equivalency value to specific PCB and other POPs

From 2002 through 2007 NYS DOH and SUNY Albany School of Public Health researchers conducted the Hudson River Communities Study among residents of Hudson Falls and Fort Edward the villages where the PCB discharges from two GE electrical capacitor plants occurred The comparison community was upriver from the discharges in Glens Falls This studyrsquos evaluation of exposures included measuring PCBs in outdoor air and in blood serum The study found that PCB levels in outdoor air were statistically significantly higher in the Hudson FallsFort Edward area (072 nanograms per cubic meter) than in the upriver Glens Falls area (040 nanograms per cubic meter) but that the PCB concentrations in the Hudson FallsFort Edward area were lower than those in other communities with known PCB-contaminated sites and similar to levels reported in other locations in the northeastern United States A statistically significant reduction in the total PCB concentrations in outdoor air samples was observed at a distance of 1200 meters (34 mile) from the river from 0760 ngm3 to 0497 ngm3 (Fitzgerald et al 2007 Palmer et al 2008)

The average PCB concentrations in serum for the study and comparison populations did not differ significantly (307 ppb wet weight versus 323 ppb) The average toxic equivalency quantities (TEQ) for dioxin-like PCB congeners for the study and comparison area populations

6

also did not differ In addition the study found no detectable differences in study subjectsrsquo serum PCB levels associated with proximity to contaminated portions of the river or other PCB sources The study did find that lifetime consumption of Hudson River fish was associated with higher serum PCB levels (Fitzgerald 2007 Palmer 2008) Another recent biomonitoring study of Brooklyn Manhattan and New Jersey fisherman who consumed versus those who did not consume their catch from the lower Hudson River and New YorkNew Jersey harbor suggests eating fish from these waters is not associated with an increased body burden of PCBs or most other organochlorines This study did not address the inhalation pathway (Morland 2008)

PCBs and Cardiovascular Disease Outcomes

Research studies on adverse health effects associated with PCB exposure have focused on many types of outcomes organs and biological systems including cancer liver reproductive and developmental effects thyroid endocrine immune system and neurological effects as well as specific effects on the skin and eyes Relatively little research has focused to date on associations between PCB exposures and CVD outcomes The majority of human studies have assessed health effects in occupational groups having relatively high-level exposures among people consuming fish from contaminated waters or people exposed to relatively high levels due to accidental contamination of food (USDHHS NIH 2010)

PCBs have been shown to produce a variety of toxic effects in animals These effects include cancer allergies hypersensitivity damage to the central and peripheral nervous systems reproductive disorders and disruption of the immune system (ATSDR 2000) In contrast research to date does not tell us whether specific types of human health effects have occurred from the low-level exposures experienced by the general population Some human epidemiological studies of relatively low levels of PCDDs PCDFs and PCBs suggest possible relationships not only with neurodevelopment of infants but also diabetes and a condition called insulin resistance conditions that increase CVD risk (Longnecker 2001 Carpenter 2006 Arisawa 2005)

In 1998 a study of a population in Seveso Italy exposed to relatively pure TCDD (dioxin) from an industrial accident showed increased mortality from CVD respiratory diseases and diabetes (Bertazzi 1998) These findings suggested the possibility that other dioxin-like chemicals such as dioxin-like PCBs might increase risk for these diseases Some more recent animal studies described below suggest possible mechanisms of action for PCB effects that increase risk for CVD diabetes and hypertension Another relevant source of information is from research using the biological monitoring data from the National Health and Nutrition Examination Survey (NHANES) project to look at the association between PCB levels in blood and self-reported diabetes and heart disease The body of evidence based on analytical human epidemiological studies remains relatively sparse however

Some recent studies using NHANES biomonitoring data show associations between PCB levels and CVD (Lee 2006 Lee 2007 Ha 2007 Everett 2008) These studies are cross-sectional (comparisons at one point in time) evaluations of PCB levels measured in blood samples from individuals and self-reported CVD and diabetes a risk factor for CVD While these studies use individual-level biomonitoring and self-reported health outcome data they are considered to be

7

relatively weak in terms of providing evidence of causality because both exposure and outcome are measured at the same point in time Given the nature of NHANES data collection and reporting practices these studies are not able to control for possible geographical confounding which may occur if urban versus rural residence is associated with higher PCB levels and higher risk for CVD for example The higher PCB levels among African Americans in these data suggest urban residence may be playing a role in the PCB data

The results of animal studies suggest multiple mechanisms by which PCB exposures could affect cardiovascular risk as well as multiple types of adverse health outcomes One hypothesized mechanisms of action specifically for dioxin-like PCBs is that they may have anti-estrogenic effects which impact CVD risk factors such as serum cholesterol levels and blood pressure (Lind 2004) Several studies suggest PCBs interact with a specific receptor (AhR) and increase cellular oxidative stress causing inflammation cell injury and cardiovascular dysfunction (Hennig 2002) Similarly some studies suggest PCB-induced inflammation of fat cells leads to obesity and increased risk for coronary artery disease (atherosclerosis) (Arsenescu 2008)

3 FOLLOW-UP STUDY DATA SOURCES AND METHODS

This follow-up investigation will provide perspective on the 2004-2005 findings (Sergeev and Carpenter 2005 Shcherbatykh et al 2005) by producing results from analyses conducted at a more detailed level of geography the block group rather than ZIP code in order to better control for confounding When the term confounding is used by epidemioogists it refers to the problem that can arise when there is a factor such as income level that is not the factor of interest in the study but that is related to the risk factor being studied (residential location ndash proxy for exposure) and to the disease being studied (CVD) This type of factor is labeled a ldquoconfounderrdquo because it can confound ie interfere with or confuse the evaluation of associations among the variables of interest in the data All types of epidemiological studies face issues of confounding but ecological ie group-level analyses face additional challenges related to potential confounding because analyses can not account for some important individual differences within the groups that are the units of analysis (Morganstern 1998b Hertz-Picciotto 1998) This investigation was able to control for each individualrsquos age sex and race but not for individual-level risk factors such as income or education level

The follow-up investigationrsquos specific objectives are to

improve the ability to control for confounding by socio-economic status through the use of block groups rather than ZIP codes as the unit of analysis

compare the hospitalization rates among residents of block groups nearest the Hudson River to those among residents of block groups farther from the Hudson River

compare the findings of this follow-up study to those of the earlier studies

Study Area and Exposure Areas

The overall study area is comprised of the 12 counties with geographic areas abutting the Hudson River south of the source of PCB contamination from the General Electric facility in Hudson Falls The 12 counties are Westchester Rockland Putnam Orange Dutchess Ulster

8

Columbia Greene Rensselaer Albany Washington and Saratoga US Census boundaries are used to define the study area because census population estimates provide the data for the study area populationrsquos size and the populationrsquos age sex and race distribution Because the study period spanned 1990 through 2005 and populations change over time 1990 and 2000 Census data were used to linearly interpolate the between-year populations from 1991 through 1999 for each block group 2000 data were used without extrapolation to estimate block group populations from 2001 through 2005

Because block group boundaries change between the Census years 1990 block populations were normalized to fit 2000 block group boundaries before the estimation of sex age and race group specific populations In the digital map 2000 block group boundaries were overlaid on the 1990 block centroids to assign 1990 block population counts to the corresponding 2000 block group boundaries Then 1990 block populations were summed to fit the block group boundaries Next population counts by age group (0-14 15-24 25-34 35-44 45-54 55-64 65-74 75+) sex (male female) and race (white black asian other) were estimated for each year at the block group level This investigation includes all ages over 25 years and the study population is divided into four race categories white black or African-American Asian and ldquoOtherrdquo The Pacific Island group was combined with the Asian group and American Indian ldquootherrdquo and ldquomultiple racesrdquo categories are combined into a larger ldquootherrdquo category

Classification of the population into potential exposure categories for this study is based exclusively on location relative to the Hudson River Two types of exposure proxy categories were developed block group adjacency and block group centroid distance to the river shore For the adjacency categories block groups were assigned to the adjacent category if they directly abutted the Hudson River and all other block groups in the 12 counties were assigned to the nonshyadjacent category

For the distance categories a central point (centroid) within each block group was identified and used to assign block group centroid distance to the river shore The block group centroids were population-weighted which means the geographic distribution of households within 2000 census blocks were used to locate block group centroids based on the geographic population distribution By locating a population-weighted block centroid rather than a more straightforward distance centroid the assignment of block group distance to the river shore more accurately reflects distance to the river for the majority of block group residents Buffers of one-half mile and one mile from the Hudson River boundary were created for the assignment of each block group into one of the three distance categories less than frac12 mile (closest) frac12 to one mile (close) and farther than one mile from the river (distant)

Distance categories were selected a priori at the start of the study Because environmental sampling data were not collected along the length of the river included in this investigation we are not able to determine whether these distance categories represent actual differences in exposures As discussed previously a statistically significant reduction in the total PCB concentrations in outdoor air samples was observed at a distance of 1200 meters (34 mile) from the river from 0760 ngm3 to 0497 ngm3 in a study conducted in the Hudson Falls Fort Edward area near the source of PCBs (Fitzgerald 2007 Palmer 2007)

9

For the adjacency categories of the 2371 block groups in the study area 255 (11) were classified as adjacent to the Hudson River and 2116 (89) were classified as non-adjacent (Figure 2) For the distance categories 225 (9) were classified as closest to the river 216 (9) as close and 1930 block groups (81) as distant from the river

Comparison of Figures 1 and 2 show that block groups include smaller geographic areas than ZIP codes and that block groups are relatively smaller in area in urban versus rural areas The study area includes a total of 368 ZIP codes compared to a total of 2371 block groups ZIP code average population in the study area is approximately 8000 compared to block group average population of 1200 Comparison of Figures 2 and 3 shows that using population-weighted centroid distance to the river produces different exposure areas than adjacency Fifty-three (20) of the adjacent block groups were categorized for the distance indicator as distant because the population in the block group was concentrated in the area farther than one mile from the river On the other hand about 11 of the 2116 non-adjacent block groups were classified as closest (77) or close (162)

Hospitalization Data

Hospitalizations for ischemic heart disease (IHD) and cerebrovascular disease (stroke) were included in this study More specifically hospital discharge records that listed as the principal diagnosis either IHD (ICD-9-CM 410-414 and 4297) or cerebrovascular disease (ICD-9-CM 430-436) and subsets of each of these categories similar to the subsets evaluated in the Sergeev et al and Shcherbatykh studies were included The subsets are AMI and angina subsets of IHD and ischemic and hemorrhagic stroke which are subsets of cerebrovascular disease (stroke) (Principal diagnosis represents the illness for which the person was admitted to the hospital) Angina is an outcome not evaluated in the prior studies It was added here for comparative purposes because it represents an outcome appropriately addressed by primary outpatient rather than inpatient hospital care Angina is one of a group of outcomes included in the recently developed national ldquoPrevention Quality Indicatorsrdquo (USDHHS AHRQ 2007) as an indicator for geographic areas or populations for whom primary care is inadequate Table 3 lists the specific ICD-9-CM codes and definitions included in this investigation

While one of the previously published studies (Huang et al 2006) showed elevations in hypertension (high blood pressure) rates this outcome is rarely listed as the principal reason for hospitalization As a secondary diagnosis however hypertension is extremely common with substantial overlap with the primary diagnoses of IHD and stroke In addition hypertension hospitalizations were not as elevated in the ZIP codes adjacent to the Hudson River as the IHD and stroke outcomes As a practical matter geocoding the hypertension hospitalization data (1364000 hospitalizations) would overwhelm the resources for this project For these reasons hypertension was not included in this follow-up investigation

The source of the hospitalization data was the NYS DOH Statewide Planning and Research Cooperative System (SPARCS) established in 1979 to collect detailed records on discharges from hospitals located in New York State Persons seen in the emergency department but not admitted are not included in the SPARCS data Hospitalizations at US Veterans Administration

10

Hospitals are also not included in this dataset We identified the hospital transfers in the data and excluded them so that individuals are only counted once for the same event

The NYS DOH Data Protection Review Board for SPARCS hospitalization data approved our use of hospitalization records for this investigation and provided approximately 460000 records with a principal diagnosis of either IHD or stroke Because of the large number of cardiovascular hospitalizations and this studyrsquos requirement for geocoding each hospitalized personrsquos address we selected a 50 random sample of the hospital discharges with principal diagnoses of IHD or stroke for inclusion in the study (229948 records) Approximately one percent of these records (2737) were eliminated from the study because they contained no residential address resulting in 227211 hospitalization records 24939 of these records were eliminated after geocoding because they were not within the 12 county study area This left 202272 records remaining in the study

For geocoding to the block group level the hospitalization street addresses were modified as needed using US Postal Service standards The addresses were assigned geographic coordinates using commercially available geocoding software (MapMarker 2004) A combination of land parcel data and street centerline files were used to assign geographic coordinates No contact was made with cases parents legal guardians or next of kin of cases to determine residential locations

Once geographic coordinates were assigned to cases through address-matching the case locations were overlaid onto digital maps of the study area using a geographic information system so that the number of observed cases falling within the study area boundaries could be determined and a 2000 Census block group could be assigned to each hospitalization In order to protect confidentiality no maps of individual case locations are provided Geographic coordinates were not able to be assigned for some addresses for example rural routes or post office boxes Internet searches were conducted for addresses that were located in facilities such as nursing homes and trailer parks

While fewer than one percent of records were missing sex or age information approximately 15000 records seven percent were missing race information Evaluation of the geographic and temporal distribution of these records showed they were not randomly geographically distributed 47 of the records with no race information were from one hospital for that hospital race was missing from 100 of the records in all years except one after 1997 mid-way through the study time-frame Evaluation of other records missing race showed similar patterns of 100 of records missing race in specific years In order to prevent biased hospitalization rate estimates records missing race were assigned race randomly assuming a race distribution equal to that of the individualrsquos residential block group

Statistical Analysis

This investigation used negative binomial multivariable regression models the statistical methods used in the previous studies (Sergeev and Carpenter 2005 [IHD] Shcherbatykh et al 2005 [stroke]) The follow-up studyrsquos analyses compared hospitalization rates for populations in block groups adjacent or in close proximity to the Hudson River to rates for populations more

11

distant from the Hudson River while adjusting for the potential influence of other factors Sex age and race-specific hospitalization count data comprise the dependent variable and the statistical methods attempt to predict the counts using the sex age and race distributions from Census data as well as block group-level demographics such as median income Models were first estimated using basic Census information for age sex and race Then more fully adjusted models were estimated using information at the block group level for income education population density Hispanic ethnicity and distance to the nearest hospital

More specifically each hospitalized personrsquos block group was used to assign quartile categories of block group median household income of population with less than a college education population density of the population identified as Hispanic and distance from the block group centroid to the nearest hospital These variables were selected in advance The population density variable was selected to account for potential effects on hospitalization rates of large differences in population density from the relatively rural northern counties to the more urban southern counties in the study area of the population identified as Hispanic was included because Hispanic ethnicity may be associated with CVD risk but Hispanic ethnicity was not available in the hospitalization records Distance to the hospital was included because studies show that shorter distances increase the likelihood that individuals with AMI and stroke survive to be admitted to the hospital

As described previously the literature about risk factors for CVD shows strong associations between lower income and education and higher CVD risk Regarding the effects of income and education on hospitalization rates the literature from analytic studies specifically of hospitalizations for CVD is quite sparse Most analytic studies focus on prevalence incidence or mortality in population-based studies or cohort studies and there are not many of these An example of a study that was useful as background is a study of income level and CVD mortality that showed strong associations of lower income and higher CVD mortality for men and women with the relationship for men being linear throughout the range of income levels while for women the relationship was only strong at levels of income below the median (Rehkopf 2008)

This follow-up studyrsquos approach was to use the same set of variables for the more fully adjusted models using quartile cut points for all the sexrace groups and types of CVD diagnostic subsets The 2000 Census data were used to create the population-weighted quartile categories to be used in the statistical analyses The quartile cut points are shown below

population density(persons per square mile)

median 1999 household income

with less than a college education

with Hispanic ethnicity

Hospital distance

Quartile 1 61 - lt507 $2499-lt$40968 0 - lt31 0 - lt2 lt25 (median) Quartile 2 507 - lt102 $40968 - lt54650 31 - lt43 2 - lt5 25 - lt6 (50-75) Quartile 3 102 - lt6410 $54650 - lt72944 43 - lt55 5 - lt10 6 ndash 11 (75-90) Quartile 4 6410 - 94372 $72944 - 200001 55 - 100 10 - 86 11 ndash 45 (90-100)

SAS version 9 provided the statistical programs used to estimate hospitalization rates for exposure proxy categories and demographic categories The models provide ldquocontrast

12

estimatesrdquo ie rate ratios for one category versus a reference category of a variable and 95 confidence intervals (CI) for these rate ratios For the location categories rate ratios for the areas adjacent are compared to areas not adjacent to the river (reference) and areas closest and close to the river are compared to areas distant from the river (reference) If a rate ratio is greater than 100 the model has estimated an elevation of hospitalizations in that category of the study population compared to the reference population If the rate ratio is less than 100 then there are fewer than expected hospitalizations compared to the reference population

The magnitude of the elevation or deficit is also estimated by the rate ratio For instance if the hospitalization rate is twice as high in the adjacent population it would result in a rate ratio of 200 while a 50 higher rate would result in a rate ratio of 150 If the hospitalization rate in the comparison group was only half as big as in the reference group this would result in a rate ratio of 050 The 95 confidence interval assists with determining whether the observed differences are statistically meaningful ie statistically significant The confidence interval provides the range in which there is a 95 probability of including the true rate ratio

In addition to directly assess potential confounding we examined the associations between income and exposure proxy categories and between income and CVD hospitalization risk Because the modeling process provides rate ratios that provide comparisons among groups but not the rates themselves we also calculated the rates per population for each sex race and income category in order to directly observe the relative magnitudes of CVD hospitalization rates Finally analyses stratified by income quartile were also conducted in order to assess the potential role of confounding by income The stratified analyses consist of negative binomial regressions conducted separately for each quartile of income

4 RESULTS

Demographics Associations with Proximity to the River

Study Population Table 4 provides detailed demographic characteristics for the total study population and the populations of the adjacent versus non-adjacent and nearer versus farther from the river block groups from the 2000 Census The total population of the study area for this one year of the study time period is 2924631 persons The first row of Table 4 shows that approximately 11 percent of the study area population is classified as living in block groups adjacent to the Hudson River and 89 live in non-adjacent block groups Approximately 8 of the studyrsquos population lives in the closest block groups approximately 10 lives in close block groups and approximately 82 in distant block groups The table also shows that approximately 81 of the total study population is identified as white 9 black 3 Asian less than 01 Pacific Islander 02 American Indian 4 ldquootherrdquo and 2 ldquomultiple racesrdquo

The 2000 Census data (Table 4) also show that the populations in block groups adjacent and nonshyadjacent to the river are similar in terms of the percent of minority population but the adjacent block groups include a smaller percentage of persons who have attended college and a higher percentage of households below the poverty level Median income is $47320 for block groups adjacent versus $62668 for block groups not adjacent to the river Data for the study area from the 1990 Census show similar distributions (data not shown)

13

For the distance to the river categories there is a gradient for the three levels of distance from the river with the closest block groups on average having higher percentages of minorities smaller percentages of persons who have attended college and higher percentages of households below the poverty level (Table 4) The populations of the close and closest block groups are more similar to each other than to the populations of the distant block groups The population of the closest block groups is estimated to be 19 black close block groups 14 black and distant block groups 8 black The proportions for the ldquootherrdquo category used in the analyses which includes multiple races and American Indians are approximately 12 in both the closest block groups and in the close block groups compared to 5 in the block groups most distant from the river Median income for the closest block groups is $40748 close block groups $44962 and most distant block groups $64847

The bar chart below illustrates the correlations between US Census 2000 indicators of block group socio-economic status and block group centroid (population weighted) distance from the river for the study population

2000 Census Demographics for Study Population by Distance to River Category

black other hispanic no college below poverty

of pop ulati on

0

10

20

30

40

50

60

lt12 mile

12 to 1 mile

gt1 mile

Table 5 shows selected demographic information for the person-year estimates used in the study analyses for the 1990-2005 study period Person-year data represent individuals from Census estimates counted repeatedly for each year of the study timeframe As described in the Methods section the estimation of person years used interpolation to account for population changes from 1990-2000 and techniques for taking account of block boundary changes The person-years estimates for demographic categories (complete data not shown) show similar patterns as the 1990 Census data (data not shown) and 2000 Census data (Table 5) Table 5 for example shows nearly identical percentages of population in each of the river distance categories as Table 4

Table 5 also shows person year information for income quartiles and exposure proxy categories as well as race and exposure proxy categories that is consistent with the Table 4 data for the

14

2000 population Table 5 row percentages show for example that while approximately 11 of the total study population live in adjacent block groups approximately 19 of the population in the lowest income quartile live in adjacent block groups By design approximately 25 of the total study population is in each of the income quartiles Table 5 column percentages show however that in the adjacent block groups 41 of the population is in the lowest income quartile and only 9 are in the highest income quartile

For the block groups within frac12 mile of the river the distribution among income quartiles is even more skewed towards the lowest income quartile 61 of the population living within frac12 mile of the river is in the lowest income quartile and only 6 is in the highest income quartile Given that approximately 25 of the total study population is within each income quartile these percentages indicate that living within the block groups closest to the river is associated with more than a doubling of the likelihood of being in a block group in the lowest socio-economic status quartile 436 of those living in the block groups close to the river (12 to one mile from the river) are in block groups in the lowest income quartile While less than a doubling this indicates that living in the close block groups is associated with a 75 increase in the likelihood of being in the lowest quartile for socio-economic status

Hospitalizations Table 6 shows the process of removing hospitalization records from the study for a variety of reasons The first column includes the 202272 records that contained a residential address and were located within the 12-county study area as described in the Methods section The next five columns show the records remaining after data cleaning exclusion of transfers elimination of records with missing covariates and exclusions due to poor geocoding results Approximately 33000 records were eliminated because they were transfers from one hospital to another for the same event For approximately 1100 records geocoding was inadequate for assigning block group for imputing race as described in the Methods section 336 records were missing sex (11) or age (325) Of the 167733 records remaining 154220 (92) were able to be adequately geocoded to a street address for assignment in a specific block group While approximately eight percent of hospitalizations were eliminated due to inadequate geocoding Table 6 shows that the percentages of hospitalization records for white black Asian and ldquootherrdquo categories and the distribution among diagnostic categories change very little after these records were dropped

The final column of Table 6 shows that of the 154220 hospitalizations in the analysis 101012 IHD hospitalizations comprise the largest health outcome category and are 65 of the hospitalizations in the study AMI hospitalizations (35811) account for 35 of the IHD hospitalizations and angina (4731) accounts for five percent of the IHD hospitalizations Stroke hospitalizations (53208) comprise 34 of the studyrsquos hospitalizations Approximately 36 of the stroke hospitalizations are for ischemic stroke (19371) and approximately 11 are for hemorrhagic stroke (5995)

Table 7 shows the distributions of hospitalizations among the four income quartiles While each income quartile (by definition) contains approximately 25 of the study population for all types of hospitalizations we see that the lowest income quartile includes more than 25 of hospitalizations from a minimum of 286 (IHD) to a maximum of 356 of hospitalizations (angina) The highest income quartile includes fewer than 25 of hospitalizations for each CVD

15

category ranging from 182 for angina to 224 for hemorrhagic stroke These deviations from the 25 quartile distribution show a strong linear pattern with the largest percentage of hospitalizations coming from residents of block groups in the lowest income quartile and the smallest percentages coming from residents of the most wealthy block groups For AMI the lowest income quartilersquos share of hospitalizations is 50 higher than the highest income quartilersquos share of hospitalizations (30 vs 20) IHD and stroke show similar differences and angina shows a doubling with the lowest income quartile contributing 36 of hospitalizations compared to the highest income quartilersquos 18 share of hospitalizations

Table 7 also shows the distribution of hospitalizations among the race categories and the study population race category distribution for comparison Blacks show lower percentages of IHD and AMI than their population percentages and higher percentages of angina and stroke While full presentation of race and sex findings is beyond the scope of this report the lower than expected percentages of IHD and AMI among blacks who on average have lower socioshyeconomic status and are at greater risk for CVD hospitalizations raised questions that were addressed by calculating hospitalization rates per 1000 population for each sex and race group for each income quartile (Appendix B Table 2) Figures 4 5 and 6 show these rates for the white black and Asian categories (These rates for the ldquootherrdquo category are comparatively very high and are not shown on these graphs This issue is addressed in the Discussion section) Figures 4 5 and 6 show the consistent and strong effects of income quartile on hospitalization rates for AMI and stroke for whites blacks and Asians of both sexes The other outcomes show similar patterns (The ldquootherrdquo group showed an opposite pattern [See Discussion Section])

Demographics Summary

The strong associations of lower income with residence closer to the river (Table 5) and lower income with higher hospitalization rates (Tables 7 amp 8) create a difficult challenge for this investigation The multivariable analyses of hospitalization rates attempt to control for socioshyeconomic differences However because these income differences which strongly affect hospitalization rates are also strongly correlated with the gradations of exposure (river distance) it will be difficult to disentangle the influence of socio-economic status from the influence of potential exposures

Statistical Analyses Proximity to the River and CVD Hospitalization Risk

All the variables described above were included in the multivariable regression models and almost every quartile category for every variable showed statistically significant and expected types of effects for all the CVD hospitalization categories (Appendix B Tables 1a and 1b) The following presentation of results focuses on the rate ratios for the exposure proxy variables Table 8a (IHD) and 8b (stroke) show the regression modeling results Results are shown first for models that include only age sex and race in addition to the exposure proxy categories and second for models that also include median income education population density Hispanic ethnicity and distance to nearest hospital The tables provide numbers of hospitalizations and person-years to assist with interpretation For example for total IHD we can see that while 108 of the study population lives in adjacent block groups 122 of the hospitalizations are among people in adjacent block groups This difference of 14 percentage points is the basis for

16

the finding of a statistically significantly elevated rate ratio (risk) for IHD among people living in the adjacent block groups (RR=133 CI 112-157)

Looking across the columns of Tables 8a and 8b we can see the rate ratios for each of the outcomes for the adjacent versus non-adjacent block groups and the results are similar to those for IHD The elevated rate ratios range from 123 for angina to 136 for AMI The only rate ratio not elevated in these first models adjusting only for age sex and race is for hemorrhagic stroke Going down the rows of the Table the next set of rate ratios are for the distance to the river categories The findings are very similar to those for the adjacent block groups Here however we have two exposure proxy categories allowing comparison of the two These rate ratios show no evidence that living in the block groups closest to the river (lt12 mile to the river) produces greater risk than living in the block groups close to the river (12 to 1 mile from the river)

The next set of rate ratios are for models with additional adjustment for the demographics of each hospitalized personrsquos block group as well as distance to the nearest hospital As expected adjustment for block group median income and other demographics accounts for some of the variability in hospitalization counts and thereby reduces the magnitude of the rate ratios estimated for the river adjacency and distance categories The rate ratio elevation for total IHD for example is reduced from 133 to 107 (RR=107 CI 104-111) The elevated rate ratios in these more fully adjusted models for the adjacency analyses are of similarly small magnitude ranging from 106 for AMI to 110 for ischemic stroke Four of the six outcomes show statistically significant elevations Again only hemorrhagic stroke shows no evidence of an elevated rate ratio

For the river distance categories none of the rate ratios show any evidence that living in the block groups closest to the river (lt12 mile to the river) produces greater risk than living in the block groups close to the river (12 to 1 mile) Of the six types of outcomes only two show statistically significantly elevated rate ratios for living in the closest block groups (total stroke and ischemic stroke) For the close block groups four out of six outcomes show statistically significantly elevated rate ratios The statistically significantly elevated rate ratios vary from 106 (AMI stroke) to 113 (angina)

To address the strong evidence described earlier for confounding in these data ie the association of lower income with both CVD hospitalizations and residence closer to the river we also conducted the analyses separately for each quartile of income These results are presented in Table 9 The first results are from analyses adjusting only for age sex and race and the second set of results are from models including the additional demographic factors For simplicity rate ratios are shown only for block groups closest to the river compared as in the other models to block groups farther than one mile from the river (Results were similar for the other distance categories (data not shown)

For the two lowest income quartiles Table 9 shows results similar to those for the study population as a whole (Table 8a 8b) However for the two highest income quartiles which represent frac12 of the study population Table 9 shows a consistent pattern of reduced rate ratios for residence in the closest block groups within frac12 mile of the river In the highest income quartile

17

these reduced rate ratios are statistically significant for almost all of the outcomes in both the partially and more fully adjusted models In addition the magnitude of the rate ratio estimates ie their departure from 100 is relatively high For the population living in block groups with the highest median income (income gt$73000) living within frac12 mile of the river is associated with having about 70 of the risk for IHD AMI and stroke (CI=068 069 069) as the population in equally high income block groups farther than one mile from the river In other words among the highest income quartile living this close to the river is associated with an estimated 30 reduction in risk for these types of hospitalizations For angina (CI=028) there is an estimated 70 reduction in risk and for ischemic stroke (CI=058) the reduction is estimated to be 40 These stratified analyses show that in one-half of the study population the two higher income quartiles there is no evidence of elevated hospitalization risk for populations in block groups closer to the river On the contrary reduced rate ratios are shown for all outcomes in the higher income quartiles for residents of block groups within frac12 mile of the river compared to those living farther than one mile from the river

Weighted average rate ratio estimates are a standard method for estimating a general population finding from stratified analyses and are also provided in Table 9 These average rate ratios range from 078 to 102 showing no evidence of elevated risk for the overall population associated with residence within frac12 mile of the river compared to living farther than one mile from the river Results for the other categories of river proximity showed similar results (data not shown) (Given the evidence of confounding interfering with analyses of the entire study population and the opposing findings depending on income quartile no confidence intervals are provided here for the overall population-weighted average estimates)

Summary of Results

In summary the multivariable regression results for the entire study population (Tables 8a 8b) show statistically significantly elevated rate ratio estimates for the adjacent versus non-adjacent block groups for four out of six hospitalization diagnosis categories The analyses also produce statistically significantly elevated rate ratio estimates for block groups closest to the river (less than frac12 mile from the river) and close to the river (between 12 to 1 mile from the river) compared to distant block groups (farther than 1 mile from the river) for most of the outcome categories However the findings provide no evidence that the closest block groups have higher hospitalization rates than the close block groups

The most elevated rate ratio estimate is 113 (angina) indicating an estimated 13 percent increase in risk for angina hospitalizations for residents of close versus distant block groups The only outcome with no statistically significant elevations for living near (adjacent closest or close) the river in the more fully adjusted models is hemorrhagic stroke It is noteworthy that the outcome that is considered an indicator of inadequate primary care rather than of incidence or prevalence angina hospitalizations shows the highest elevations associated with residence near the river (Table 8a) In addition the hospitalization outcome with the least association with socioshyeconomic status in our data (Appendix B Table 1b) and the lowest out-of-hospital mortality (NYS DOH2009b) hemorrhagic stroke is the outcome that shows no pattern of elevations associated with residence near the river (Table 8b) This pattern of findings is consistent with the interpretation that the rate ratios for living closer to the river are reflecting the influence of

18

lower socio-economic status and other known risk factors for hospitalizations for CVD rather than any unusual environmental exposures in areas near the river

The stratified results add strong support to this conclusion They suggest that classical confounding of the exposure-disease relationship by socio-economic status produced biased results in the overall multivariable regression analyses While the regression analysis of the entire study population shows elevated hospitalization rates for residents of block groups near the river compared to residents living farther away (Table 8a and 8b) the stratified analyses (Table 9) show this association applies only to the 50 of residents living in block groups with lower median incomes For the other 50 of residents those who live in block groups with higher median incomes hospitalization rate ratios are reduced for those living near the river Classical confounding refers to the type of confounding that can occur in individual-level or group-level (ecological) studies when a factor such as socio-economic status affects both the exposure (residential location) and the health effect In this study of CVD hospitalizations there are disproportionate numbers of outcomes and exposures (residence near the river) among specific levels of the confounder the lower income quartiles (Tables 4 5 7) The disproportionate numbers of hospitalizations and potential exposures (block groups close to the river) among the lower income quartiles create an artifactual exaggerated ie biased effect on the estimates for the overall analysis (Greenland 1999 Greenland 2003)

5 DISCUSSION

The key finding of this investigation is that the results of the multivariable regression modeling for the entire study population the elevated CVD hospitalization rates for populations living closer to the Hudson River are likely to be artifacts of classical confounding Stratifying on the confounding variable ie holding median income level constant the standard method for controlling for confounding showed the decisive role of income as a confounding variable The associations between living closer to the river and having higher hospitalization rates were evident only for people in the two lowest income quartiles For people in the two highest income quartiles the association was completely opposite ie people living closer to the river had lower rates of hospitalizations This variability of effects can be interpreted as evidence that there are no actual effects of the exposure-proxy categories on hospitalization rates

This type of classical confounding due to the strong influence of a variable that precedes causes or strongly affects both the exposure and the health outcome can occur in any type of epidemiological study The ecologic ie group-level analysis in this study is not the primary cause of the confounding Rather the complex associations in the population among income education CVD risk geographic residential patterns and geographic assignment of exposure status are the basis for the confounding However the studyrsquos group-level design contributes to the studyrsquos inability to completely control for confounding using multivariable regression methods and contributes to difficulties of interpreting the study findings

The stratified results shown in Table 9 may also be a reflection of inadequate control for socioshyeconomic status using quartile groupings A possible explanation for the Table 9 findings is that among lower income quartile populations people living in closest proximity to the river may have the lowest incomes while in higher income quartile populations people living in closest

19

proximity to the river may have the highest incomes This potential income difference associated with river distance within quartile groupings would explain why the stratified analyses showed positive associations with hospitalization risk for lower income residents closest to the river and negative associations with hospital risk for higher income residents closest to the river In this example of a possible explanation it is assumed that river proximity categories are capturing relative income (and hospitalization risk) differences rather than exposure Because the exposure proxy river distance categories are assigned at the group level this type of analysis is not able to distinguish whether these categories are capturing income rather than potential exposure in the analyses

When trying to interpret this studyrsquos findings it is important to keep in mind the many factors associated with CVD risk that are not controlled for in this studyrsquos analyses These include tobacco use high blood pressure high serum cholesterol alcohol use obesity low fruit and vegetable intake physical inactivity diabetes mental ill-health psychosocial stress and the use of certain medications Another factor that is not known to be associated with CVD risk but that is relevant for PCB exposure is consumption of contaminated fish It is possible that lower income populations are more exposed to PCBs because they are more likely to eat fish from the Hudson River This however would not help explain why higher income people near the river showed lowered CVD hospitalization risk than their counterparts living farther from the river

In addition to the confounding due to socio-economic statusrsquos effects on both the risk of CVD and residential location (the exposure proxy) there is the possibility that another pattern of effects also associated with socio-economic status called effect modification plays a role in these data Effect modification occurs when different groups people with differing types of health insurance or differing behaviors for example show different patterns or levels of health effects associated with the same exposures The differing effects (CVD hospitalization risks) of residence in block groups closest to the river by income quartile could be interpreted as evidence of effect modification (Table 9) However the magnitude and the change in direction of effects is unlikely to be due solely to differing biological or behavioral factors associated with income level that affect the results of exposures from living near the river Given the strong associations of socio-economic status and CVD hospitalization risk it is more likely that the analyses stratified by income enhance protection against bias compared to the multivariable models and that the stratified analyses represent findings from better control for confounding as opposed to effect modification

Effect modification is a concept used to describe important relationships between exposures and effects (that may differ among sub-groups for example) that the researcher wants to identify and understand Confounding on the other hand refers to associations among the variables that the researcher is trying to control so that the effects of interest (effects from exposures rather than age or sex for example) can be identified and the strength of the effects can be estimated The ecological design of this study contributes to the difficulty of interpreting whether the study shows evidence of effect modification as opposed to confounding (Morgenstern 1998b Hertz-Picciotto 1998)

Another limitation associated with this studyrsquos design is that block-group level socio-economic and environmental attributes that are assigned to individuals in this type of study can be more

20

strongly correlated at the group level than at the individual level In this way the ecological assignment of variables may lead to increasing problems of confounding because of strong associations among indicators of socio-economic status and environmental exposure categories (Morganstern 1998a) This problem is likely to have affected this studyrsquos results As with effect modification by group described above this type of confounding by group cannot be evaluated with ecologic data because individual-level information is needed to check for this type of ldquocross-levelrdquo bias

Ecological analyses are an important tool for environmental epidemiology and there is therefore a large and growing literature on their strengths and limitations and methods to address limitations (Piantadosi 1988 Greenland 1989 Greenland 1992 Greenland 1994 Richardson 2000) One particular concern is the interpretation of ecological study findings showing modest elevations of risk such as in the current study with rate ratios in the more fully adjusted models generally estimated to be below 110 (Wakefield 2003 Morgenstern 1998) Wakefield states that slight elevations need to be interpreted with great caution

in particular for cancer and heart disease (because) there are risk factors that are far more predictive of disease than environmental factors hellip Consequently the potential for confounding is strong since ecological studies do not directly use individual-level risk factor data although stratification by age and gender is routinely carried out (Frequently an area-level measure of socio-economic status is also used as a confounding variable but being an aggregate summary it only provides very crude control) (Wakefield 2003 p 9-10)

A strength of this study is that it used Census block groups which are relatively small average population of 1200 in the study area to assign socio-economic characteristics rather than the larger ZIP codes with average population size 8000 in this area By using block groups this study sought to reduce confounding by socio-economic status because the assignment of median income and other variables at the block group level was expected to be more accurate with block group populations expected to be more homogenous than larger and more geographically diffuse ZIP code populations used in the prior studies

The use of census block groups rather than ZIP codes likely enhanced the accuracy of the group-level category assignments However this type of design that assigns characteristics to hospitalizations using group-level characteristics and distance to the river is limited in comparison to individual-level designs that gather such information from personal interviews for example Most importantly for this investigation the group-level assignment of median income and other demographic variables to each hospitalization limited the studyrsquos ability to distinguish among individuals within the exposure proxy categories and thereby reduced the studyrsquos ability to control for confounding

This type of measurement error or inaccuracy from assigning group-level potential exposure income or educational level to the individual hospitalizations is generally assumed to bias a study towards findings of no associations or no effects But in situations where the measurement error is more severe for some variables than others and these variables are correlated then the study findings can be biased either towards exaggerated effects or no effect In this

21

investigation potential measurement error associated with the assignment of exposure categories based on block group centroid distance to the river is expected to be relatively low because a large proportion of the hospitalization addresses are very clearly in areas distant from the river The measurement error associated with the assignment of group-level demographic characteristics (income and education for example) from the Census is potentially relatively higher because this type of error can occur throughout every block group in the study

Another strength of this study is the use of NYS SPARCS data a relatively comprehensive population-based source of information about hospitalizations These data include individual-level information used in this study for patient address sex age race and diagnosis Of these variables only the address and race categories were missing information for substantial numbers of records and this limitation could have contributed to imprecision sufficient to create bias in the findings These issues are discussed in greater detail below

While geocoding to block groups represents a strength of this study this procedure required high quality address information One percent of the study arearsquos hospitalization records contained no address and eight percent of the addresses could not be accurately geocoded to a block group resulting in nine percent of hospitalizations being eliminated from the study because of address andor geocoding issues The distributions of race categories and distribution among the types of diagnoses did not change after dropping records with no or insufficient address information suggesting that the dropped records did not represent a substantially skewed group of records

Inaccuracy in the studyrsquos assignments of race categories results from limitations associated with both the hospitalization and Census data The SPARCS hospitalization database did not include information on race for seven percent of the studyrsquos hospitalizations We imputed race by assigning race category randomly based on the race distribution of the personrsquos block group This method would have biased the study toward a finding of no effect of race on hospitalization risk

Another limitation of the SPARCS data used in this investigation is that repeat hospitalizations for the same individual were not able to be identified If an individual was admitted to the hospital many times over the course of the study period then that individual would be counted multiple times This limitation may have contributed to an exaggerated association of lower socio-economic status and CVD hospitalizations because some people with lower incomes receive less adequate outpatient care and may have more repeat admissions Because of the association of lower income with residence near the river this limitation may have contributed to the difficulty of adequately controlling for confounding by socio-economic status in this investigation

While the use of Census data at the block group level rather than at the ZIP code level is a strength in terms of relative homogeneity among the population within block groups it is possible that block group population estimation errors contributed to imprecise estimation of hospitalization rates Particularly with regard to the race categories the study finding that the ldquootherrdquo category showed a counter-trend of higher income and higher CVD hospitalization rates (Appendix B Table 2) is evidence that differences between the categories used in the Census race designations and hospitalization race assignments affected the estimated hospitalization

22

rates The evolving choices for identifying onersquos race in the Census are not concurrently reflected in the SPARCS data and these inconsistencies likely contributed to imprecision in the estimates of CVD hospitalization risk associated with specific race categories These issues did not contribute to bias for the overall findings however We modeled the data with a variety of race category groupings combining black and ldquootherrdquo for example and the overall results did not change (data not shown)

Also regarding the Census undercounting of approximately two percent for the US population in 2000 was estimated to be primarily among the lowest socio-economic groups and minorities while over-counting of one percent was estimated to have occurred among the most wealthy who sometimes received and completed forms at multiple residences (United States Censusrsquo Monitoring Board 2001) The undercounting of the lower income population would result in over-estimation of rates in this population Undercounting of recent Asian immigrants for example particularly those of lower socio-economic status is a potential reason for the relatively high rates of CVD hospitalizations estimated for lower income quartiles of the Asian population (Figures 4-6) Models that grouped the Asian population with other race categories were also estimated and these showed the same results as the models reported here (data not shown) Undercounting of lower income groups in general would have contributed to the finding of higher hospitalization rates in these groups and added to the difficulty of adequately controlling for confounding of lower income with residence near the river

This study used a 50 percent sample of the CVD hospitalizations in the study area This was accomplished by using every other record provided in the complete dataset This would be expected to provide a non-biased random sample As a general check for data provision management or analysis errors we compared our estimates of hospitalization rates per population with those from other sources Table 10 shows these comparisons and shows that our estimates are generally consistent with rates from other sources

Actual exposures were not measured in this investigation Rather proximity to the river was used as a proxy for potential exposures associated with the river In addition other potential exposures that may impact CVD were not evaluated or measured Exposures to lead and air pollution would be expected to be higher in more urban areas and areas near heavy traffic which may tend to coincide with proximity to the river These issues were not addressed in this investigation If these exposures were occurring more often near the river and contributing to cardiovascular hospitalization rates this would contribute to findings of elevations in areas closer to the river

It is important to keep in mind the limitations associated with using CVD hospitalizations when incidence andor prevalence of CVD are the actual outcomes of interest As described previously mortality prior to reaching the hospital may reduce hospitalization rates particularly for AMI and stroke for some groups or areas more than others and introduce bias if hospitalizations are being interpreted as a surrogate for incidence or prevalence

23

Addressing Study Objectives

This follow-up report has addressed the studyrsquos first two objectives to increase control for confounding by socio-economic status through the use of block groups rather than ZIP codes as the unit of analysis and to compare the hospitalization rates among residents of block groups nearest the Hudson River to those among residents of block groups farther from the Hudson River Regarding the first objective the use of smaller units of analysis block groups rather than ZIP codes improves the accuracy of the assignment of income and other group-level demographic categories to the hospitalizations

Regarding the studyrsquos second objective to compare hospitalization rates among residents of block groups near to the river to hospitalization rates among residents far from the river this studyrsquos multivariable analyses showed relatively small but statistically significant elevations of CVD hospitalization risk for residents of block groups in close proximity to the Hudson River Stratified analyses that controlled for confounding directly by conducting analyses within each income quartile showed that residence in block groups near the Hudson River was associated with increased risk for CVD hospitalizations among residents of lower income block groups and conversely residence in block groups near the Hudson River was associated with decreased risk for CVD hospitalizations among residents of higher income block groups The estimated increased and decreased risks were of sufficient magnitude to nearly cancel each other out when average rate ratios for the four income quartiles were calculated The average rate ratios may be interpreted as showing no evidence that CVD hospitalization rates are elevated for residents living near the river for the general study population This finding suggests that the multivariable regression findings for the overall population show elevated risks for hospitalizations among residents of block groups near the river due to confounding of the relationship of residential location and CVD risk by socio-economic status

The findings from analyses stratified by income quartile may also be interpreted as showing that residing in a higher versus lower income quartile block group produces different effects from similar exposures (effect modification) As discussed previously the ecological ie group-level analyses are unable to directly evaluate confounding and effect modification Rather information about the strengths and direction of associations among variables in the analyses need to be evaluated in light of known risk factors to draw conclusions about likely explanations and plausible interpretations In this case uncontrolled confounding by socio-economic status appears to be the most likely explanation for the findings This is due to the strong association between residential proximity to the river and socio-economic status

The studyrsquos third objective was to compare the findings of this review to those of the earlier studies that prompted this investigation Because of the similarities in study design and statistical analysis the prior studyrsquos findings can be compared with the findings of the multivariate regression analyses in the current study using Table 1 and Tables 8a and 8b The major difference between the studies is the composition of the comparison areas for the analyses Because of this difference as well as other differences (Table 3) the studies are not entirely comparable

24

Table 1 shows some findings from the prior studies Comparison with Table 8a and 8b from this study shows that the CV hospitalization rate ratios estimated for living in block groups adjacent or near the river in the current study are substantially lower than the rate ratios for residence in ZIP codes adjacent to the Hudson River in the previously published studies In most cases the current analyses show elevations below 10 in contrast to elevations of 20 to 39 in the prior studies The results of the current study are quite similar to the prior study results in terms of the associations reported for age sex race and income (Appendix B Tables 1a and 1b) (Sergeev and Carpenter 2005 Tables 4 amp 5 Shcherbatykh et al 2005 Table 3) These similarities and the general methodological similarities between the current and prior studies suggest the likelihood that the prior studiesrsquo findings also were affected by confounding by socio-economic status

6 CONCLUSION AND RECOMMENDATIONS

While this type of study can neither prove nor disprove causal links among risk factors and health effects it is useful for examining evidence for unusual patterns that might warrant additional investigation This evaluation of hospitalization data residential location and socioshyeconomic factors in block groups in the 12 Counties abutting the Hudson River revealed a closely coinciding pattern of lower socio-economic status and residential proximity to the Hudson River the proxy for potential exposure and a strong association of lower socioshyeconomic status and risk for CVD hospitalizations The strong associations among residential location socio-economic status and CVD risk prevent us from drawing definitive conclusions from the multivariable regression analyses

The multivariable analyses showed relatively small but statistically significant elevations of CVD hospitalization risk for residents of block groups in close proximity to the Hudson River Stratified analyses conducted within each of the four income quartiles showed such elevations only among residents of lower income block groups This studyrsquos data methods and analyses can not provide a conclusive interpretation for this finding However the strong effect of socioshyeconomic status on residential location (exposure proxy) and on CVD hospitalization risk makes it likely that the overall multivariable regression results are affected by uncontrolled confounding by socio-economic status

To draw more definitive conclusions particularly for outcomes such as CVD that are determined by multiple known risk factors studies seeking to investigate the role of PCB or POP exposures will require resource-intensive methods that include gathering individual-level exposure or biomonitoring information as well as individual-level medical histories and information on other risk factors

This current investigation did not include environmental or biological sampling data One study described previously did gather such data in the Hudson River region with the highest potential exposures (Hudson Falls Fort Edward) This study detected a statistically significant reduction in the total PCB concentrations in outdoor air samples at a distance of 1200 meters (34 mile) from the river However the PCB levels in outdoor air in this area were lower than those in other communities with known PCB-contaminated sites and similar to levels reported in other locations in the northeastern United States The study found no detectable differences in study

25

subjectsrsquo serum PCB levels associated with proximity to contaminated portions of the river or other PCB sources (Fitzgerald 2007 Palmer 2008)

It is recommended that the results of the current investigation of CVD hospitalizations and residence near the Hudson River be shared with NYS residents and other stakeholders with interest in this issue and be submitted for publication in a peer-reviewed journal

26

AGENCY INFORMATION

New York State Department of Health Authors Elizabeth Lewis-Michl PhD

Research Scientist Bureau of Environmental and Occupational Epidemiology

June Beckman-Moore MPH Research Scientist Bureau of Environmental and Occupational Epidemiology

Marta Gomez MS Research Scientist Bureau of Environmental and Occupational Epidemiology

Acknowledgments This study was supported by ATSDR and the Centers for Disease Control and Prevention

through a Cooperative Agreement Grant to NYS DOH (3U61TS200002-19W1)

The authors would like to thank the following people for their contribution to this project Alissa Caton PhD of the Bureau of Environmental and Occupational Epidemiology NYS DOH developed the study protocol assembled the preliminary data and conducted preliminary analyses Syni-an Hwang PhD Bureau Director and Shao Lin PhD Research Scientist Bureau of Environmental and Occupational Epidemiology NYS DOH and Edward Fitzgerald PhD Professor Division of Environmental Health and Toxicology SUNY Albany School of Public Health reviewed and provided comments on draft versions

New York State Department of Health Cooperative Agreement Coordinator Don Miles MS

Public Health Specialist Bureau of Environmental Exposure Investigation

ATSDR Technical Project Officer Gregory V Ulirsch PhD

Environmental Health Scientist Cooperative Agreement and Program Evaluation Branch

Division of Health Assessment and Consultation

Mohammed Uddin MD Epidemiologist

Division of Health Studies

ATSDR Regional Representatives Leah Graziano

Senior Regional Representative - Region 2 Office of Regional Operations

Racquel Stephenson Regional Representative - Region 2

Office of Regional Operations

27

REFERENCES Agency for Toxic Substances and Disease Registry (ATSDR) 2000 Toxicological Profile for Polychlorinated Biphenyls (PCBs) ATSDR Division of ToxicologyToxicology Information Branch Atlanta GA

Arisawa K Takeda H Mikasa H 2005 ldquoBackground exposure to PCDDsPCDFsPCBs and its potential health effects a review of epidemiologic studiesrdquo The Journal of Medical Investigation Feb 52(1-2)10-21

Arsenescu V Arsenescu RI King V Swanson H Cassis LA 2008 ldquoPolychlorinated biphenylshy77 induces adipocyte differentiation and proinflammatory adipokines and promotes obesity and atherosclerosisrdquo Environmental Health Perspectives 116 (6) 761-768

Ayala C Croft J Keenan N Neff L Greenlund K Donohoo R Zheng Zhi Mensah G 2003 ldquoIncreasing Trends in Pre-transport stroke deaths- United States 1990-1998rdquo Ethnicity amp Disease 12 Spr S2131-S2137

Barnett E Armstrong D Casper M 1999 Evidence of increasing coronary heart disease mortality among black men of lower social class AEP 98 464-471

Bertazzi PA Bernucci I Brambilla G Consonni D Pesatori AC 1998 ldquoThe Seveso studies on early and long-term effects of dioxin exposure a reviewrdquo Environmental Health Perspectives April 1062 625-633

Carpenter DO 2006 ldquoPolychlorinated biphenyls (PCBs) routes of exposure and effects on human healthrdquo Reviews in Environmental Health Jan-Mar 21(1)1-23

deVreede-Swagemakers JJ Gorgels APDubois-Arbouw WI van Ree JW et al 1997 ldquoOut-of hospital cardiac arrest in the 1990s a population-based study in the Maastricht area on incidence characteristics and survivalrdquo Journal of the American College of Cardiology 30(6) 1500-5

Everett CJ Mainous AG Frithsen IL Player MS Matheson EM 2008 ldquoAssociation of polychlorinated biphenyls with hypertension in the 1999-2002 National Health and Nutrition Examination Surveyrdquo Environmental Research 108 94-97

Fitzgerald EF Belanger EE Gomez MI Hwang S Jansing RL Hicks HE 2007 ldquoEnvironmental exposures to polychlorinated biphenyls (PCBS) among older residents of upper Hudson River communitiesrdquo Environmental Research 104 352-360

Galea S Blaney S Nandi A Silverman R Vlahov D Foltin G Kusick M Tunik M Richmond N 2007 ldquoExplaining racial disparities in incidence of and survival from out-of-hospital cardiac arrestrdquo American Journal of Epidemiology 1665 534-543

Greenland S Morgenstern H 1989 ldquoEcological bias confounding and effect modification International Journal of Epidemiology 18269-74

28

Greenland S 1992 ldquoDivergent biases in ecologic and individual level studiesrdquo Statistics in Medicine 111209-23

Greenland S Robins J 1994 ldquoInvited commentary ecologic studies ndash biases misconceptions and counterexamples American Journal of Epidemiology 139747-60

Greenland S Pearl J Robins JM 1999 Causal diagrams for epidemiologic research Epidemiology 1037-48

Greenland S 2003 ldquoQuantifying biases in causal models classical confounding vs collidershystratification biasrdquo Epidemiology May Vol 143 300-306

Ha M Lee D Jacobs DR 2007 ldquoAssociation between serum concentrations of persistent organic pollutants and self-reported cardiovascular disease prevalence results from the national health and nutrition examination survey 1999-2002rdquo Environmental Health Perspectives 1158 1204-1209

Hennig B Meerarani P Slim R Toborek M Daugherty A Silverstone AE Robertson LW 2002 ldquoProinflammatory properties of coplanar PCBs in vitro and in vivo evidence Toxicology and Applied Pharmacology Jun 15181(3)174-83

Hertz-Picciotto I ldquoEnvironmental Epidemiologyrdquo in Modern Epidemiology Rothman KJ and Greenland S 2nd ed Philadelphia Lippincott Williams amp Williams 1998 pp555-584

Howard G Cushman M Prineas R Howard V Moy C Sullivan L DrsquoAgostino R McClure L Pulley L Safford M 2009 ldquoAdvancing the hypothesis that geographic variations in risk factors contribute relatively little to observbed geographic variations in heart disease and stroke mortalityrdquo Preventive Medicine 49 129-132

Huang X Lessner L Carpenter DO 2006 ldquoExposure to persistent organic pollutants and hypertensive diseaserdquo Environmental Research 102(2005) 101-106

Kannel WB Cupples LA DrsquoAgostino RB 1987 ldquoSudden death risk in overt coronary heart disease the Framingham Studyrdquo American Heart Journal 113(3) 799-804

Ko DT Krumholz HM Wang Y Foody JM Masoudi FA Havranek EP You JJ Alter DA Stukel TA Newman AM Tu JV 2007 ldquoRegional differences in process of care and outcomes for older acute myocardial infarction patients in the United States and Ontario Canadardquo Circulation 115196-203

Lee D Lee I Song K Steffes M Toscano W Baker BA Jacobs DR 2006 ldquoA strong dose-response relation between serum concentrations of persistent organic pollutant and diabetesrdquo Diabetes Care 29 (7) 1638-1644

Lee D Lee I Porta M Steffes M Jacobs DR 2007 ldquoRelationship between serum concentrations of persistent organic pollutants and the prevalence of metabolic syndrome among non-diabetic

29

adults results from the National Health and Nutrition Examination Survey 1999-2002rdquo Diabetologia 50 1841-1851

Lind PM Orberg J Edlund U Sjoblom L Lind L 2004 ldquoThe dioxin-like pollutant PCB126 (33rsquo44rsquo5-pentachlorobiphenyl) affects risk factors for cardiovascular disease in female ratsrdquo Toxicology Letters 150 293-299

Longnecker MP Klebanoff MA Brock JW Zhou H 2001 ldquopolychlorinated biphenyl seum levels in pregnant subjects with diabetesrdquo Diabetes Care 24(6) 1099-1101

Lloyd-Jones D Adams R Carnethon et al (AHA Statistics Committee and Stroke Statstics Subcommittee) 2009 ldquoAHA Statistical Update Heart disease and stroke statisticsmdash2009 updaterdquo Circulation 119e21-e181

MapMarker Plus V 100 MapInfo Corp 2004

Morgenstern H 1998 ldquoEcologic studyrdquo In Armitage P Colton T eds Encyclopedia of biostatistics Vol2 NY NY John Wiley and Sons Inc 1255-76

Morganstern H ldquoEcologic Studiesrdquo Chapter 23 in Modern Epidemiology Rothman KJ and Greenland S 2nd ed Philadelphia Lippincott Williams amp Williams 1998 pp459-480

Morland K Wolff M Bopp R Godbold J Landrigan P 2008 ldquoFish consumption and body burden of organochlorines among Lower Hudson urban anglersrdquo American Journal of Industrial Medicine 51587-594

New York State Department of Health (NYS DOH) 2009a ldquoThe Burden of Cardiovascular Disease in New York Mortality Prevalence Risk Factors Costs and Selected Populationsrdquo Bureau of Chronic Disease Epidemiology and Surveillance Bureau of Health Risk Reduction httpwwwhealthstatenyusnysdohchronic_diseasecardiovascularburdenofcvdinnyspdf

New York State Department of Health (NYS DOH) 2009b ldquoHeart Disease and Stroke Indicatorsrdquo Community health assessment indicators Bureau of Biometrics and Health Statistics httpwwwhealthstatenyusstatisticschacchai

OrsquoNeill L 2003 ldquoEstimating out-of-hospital mortality due to myocardial infarctionrdquo Health Care Management Science 6 147-154

Palmer PM Belanger EE Wilson LR Hwang SA Narang RS Gomez MI Cayo MR Durocher LA Fitzgerald EF ldquoOutdoor air PCB concentrations in three communities along the Upper Hudson River New Yorkrdquo 2008 Archives of Environmental Contamination and Toxicology 54363-371

Piantadosi S Byar DP Green SB 1988 ldquoThe ecological fallacyrdquo American Journal of Epidemiology 127893-904

30

Rehkopf DH Berkman LF Coull B Krieger N 2008 ldquoThe non-linear risk of mortality by income level in a healthy population UgtSgt National Health and Nutrition Examination Survey mortality follow-up cohort 1988-2001rdquo British Medical Journal Public Health 8383

Richardson S Montifort C 2000 ldquoEcological correlation studies In Elliott P Wakefield JC Best NG et al eds Spatial epidemiology methods and applications NY NY Oxford University Press 205-220

SAS version 9 Cary North Carolina

Schoenbach V 2001 ldquoMulticausality Effect Modification wwwepidemiolognet Ch12 381shy413

Schoenbach V 2004 ldquoMulticausality Confounding wwwepidemiolognet Ch11 335-375

Sergeev AV and Carpenter DO 2005 Hospitalization rates for coronary heart disease in relation to residence near areas contaminated with persistent organic pollutants and other pollutants Environmental Health Perspectives 113(6)756-761

Shcherbatykh I Huang X Lessner L Carpenter DO 2005 Hazardous waste sites and stroke in New York State Environmental Health A Global Access Science Source 4(18) httpwwwehjournalnetcontent4118

Siemiatycki J Wacholder S Dewar R Cardis E Greenwood C and Richardson L 1988 ldquoDegree of confounding bias related to smoking ethnic group and socio-economic status in estimates of the associations between occupation and cancerrdquo Journal of Occupational Medicine 30 617-625

Tu YK Gunnell D Gilthorpe MS 2008 ldquoSimpsonrsquos paradox Lordrsquos paradox and suppression effects are the same phenomenon- the reversal paradoxrdquo Emerging Themes in Epidemiology 52

United Kingdom Heart Attack Study Collaborative Group Norris RM 1998 ldquoFatality outside hospital from acute coronary event in three British health districts 1994-1995rdquo British Medical Journal 316(7137) 1065-70

United States Bureau of the Census 2000 Census of population and housing summary file 1 (SF1) United States Department of Commerce 2001

United States Bureau of the Census 2000 Census of population and housing summary file 3 (SF3) United States Department of Commerce 2002

United States Bureau of the Census 1990 Census of population and housing summary file 1 (SF1) United States Department of Commerce 1991

31

United States Bureau of the Census 1990 Census of population and housing summary file 3 (SF3) United States Department of Commerce 1992

United States Censusrsquo Monitoring Board ldquoFinal Report to Congressrdquo September 1 2001 httpgovinfolibraryunteducmbcmbpreportsfinal)report

United States Department of Health and Human Services (USDHHS) Agency for Healthcare Research and Quality 2007 ldquoAHRQ Quality Indicators Guide to Prevention Quality Indicators Hospital Admission for Ambulatory Care Sensitive Conditionsrdquo October 2001 Version 31 (March 12 2007) downloaded 2009 httpwwwqualityindicatorsahrqgov

United States Department of Health and Human Services (USDHHS) Centers for Disease Control 2007 Health United States A Chartbook National Center for Health Statistics wwwcdcgovnchsdatahushus07pdf

United States Department of Health and Human Services (USDHHS) Centers for Disease Control National Center for Health Statistics 2007 ldquo2005 National Hospital Discharge Surveyrdquo No 385 by De Frances C and Hall M Division of Health Care Statistics

United States Department of Health and Human Services (USDHHS) 2009 Centers for Disease Control and Prevention National Center for Environmental Health 4th National Report on Environmental Chemicals httpwwwcdcgovexposurereportpdfFourthReportpdf

United States Department of Health and Human Services (USDHHS) National Institutes of Health 2010 National Institute of Environmental Health Sciences National Toxicology Program 11th Report on Carcinogens httpntpniehsnihgovntproceleventhprofiless149pcbpdf

United States Environmental Protection Agency (USEPA) 2002 EPA Superfund record of decision Hudson River PCBs Site Hudson Falls to New York City New York Washington DC

Wakefield J 2003 Sensitivity Analyses for Ecological Regression Biometrics March 59 9-17

Wakefield J 2004

Wing S 1988 Social inequalities in the decline of coronary mortality American Journal of Public Health 781415-1416

World Health Organization 2010 Atlas of Heart Disease and Stroke Part 2 Ch 3 Cardiovascular Disease Risk Factors httpwwwwhointcardiovascular_diseasesencvd_atlas_03_risk_factorspdf

32

Table 1 Previous Study Results Statewide Analyses and Hudson River Specific Analyses

Principal Diagnosis of Hospitalization

Adjusted Rate Ratioa (Lower 95CI shyUpper 95 CI)

NYS analysis Hudson River Subset

analysis Ischemic Heart Diseaseb

115 (103-129) 136 (118-156) Acute Myocardial Infarction 120 (103-139) 139 (118-163)

Cerebrovascular Diseasec 115 (105-126) 120 (110-132)

Ischemic Stroke 117(104-139) --shy Hemorrhagic Stroke 110 (099-122) ---

Hypertensionc 119 (108-131) 114 (105-123)

Table adapted from Sergeev et al 2005 Shcherbatykh et al 2005 and Huang et al 2006 Ischemic heart disease and stroke hospitalization rates for NYS ZIP codes containing (or

adjacent to) inactive hazardous waste sites with persistent organic pollutants compared to ZIP codes without (or not adjacent to) such sites

a All analyses include a 2nd exposure category for other types of inactive hazardous waste sites and adjust for age race and sex

b Ischemic heart disease analyses include all ages over 25 Ischemic heart disease analyses adjust for quartiles of median household income and health insurance coverage

c Cerebrovascular disease and hypertension analyses include ages 25-64

33

Table 2 Comparison of Design amp Methods for Previously Published Studies and Current Follow-up Study

OUTCOME ASSESSMENT CONFOUNDER ASSESSMENT EXPOSURE ASSESSMENT Previous Study Follow-up Study Previous Study Follow-up Study Previous Study Follow-up Study

Coronary Heart Disease 1993-2000 More years 1990-2005 ZIP code level Block group level ZIP code level Block group level

Ischemic heart disease (ICD-9 410shy Same outcomes with 4 levels of income ndash Same income Exposure = residence Two types of exposure 414) and sequelae of myocardial one addition separate 2000 Census categories within 196 ZIP codes indicator categories within infarction not elsewhere classified analysis of angina (ICDshy quartiles with hazardous waste 12 counties abutting (ICD-9 4297) Separate analysis of 9 413) sites containing POPs Hudson River acute myocardial infarction 6 age groups Same age groups or 222 ZIP codes with (ICD-9 410) age 25 and above hazardous waste sites Adjacency 255 block

without POPs versus groups adjacent versus Principal or other diagnoses (up to 14) Only principal diagnoses 4 race groups Different race residence within 996 2116 not adjacent to the combined AsianPacific categories White ldquocleanrdquo ZIP codes Hudson River

Use the admission Islander White black AsianPacific statewide Reference Sergeev AV Carpenter source field to eliminate black Native Island and Other which Distance Block group DO Hospitalization rates for coronary intrahospital transfers for American includes Native Sub-analyses (focus of population-weighted heart disease in relation to residence the same individual American and Multiple follow-up) Hudson centroid distance from the near areas contaminated with Race River area 78 POP ZIP Hudson River lt12 mile persistent organic pollutants and other codes versus 996 ldquocleanrdquo (225 block groups) pollutants Environ Health Perspect Add education ZIP codes statewide between frac12 and 1 mile 2005 113(6)756-761 population density (216 block groups) farther

hispanic and distance to than 1 mile (1930 block nearest hospital as groups) additional covariates

Stroke 1993-2000 More years 1990-2005 Restricted to 2nd and (same as above) (same as above) (same as above) 3rd quartile income

Cerebrovascular disease excluding Same outcomes ZIP codes other ill-defined and late effects of cerebrovascular disease (ICD9 430shy 4 age groups 25-64 436) Separate analyses of ischemic stroke (433x1 434x1 436) and 2 race groups hemorrhagic stroke ( 430-432) White black

Principal or secondary diagnoses (up Only principal diagnoses to 14) combined

Use the admission Reference Shcherbatykh I Huang X source field to eliminate Lessner L Carpenter DO Hazardous intrahospital transfers for waste sites and stroke in New York the same individual State Environ Health 2005 418

34

Table 3 Study Cardiovascular Disease Outcomes ICD-9-CM Codes and Definitions

Ischemic Heart Disease

ICD-9-CM 410 - ldquoAcute myocardial infarction (AMI)rdquo is defined as a sudden insufficiency of blood supply to an area of the heart muscle usually due to a coronary artery occlusion (obstruction) commonly known as heart attack

ICD-9-CM 411- ldquoOther acute and subacute forms of ischemic heart diseaserdquo refers to complications or symptoms occurring before or after AMI or a coronary artery occlusion (obstruction) interrupting blood flow to the heart without AMI

ICD-9-CM 412 - ldquoOld myocardial infarctionrdquo refers to a past AMI that is currently presenting no symptoms ICD-9-CM 413 - ldquoAngina pectorisrdquo is defined as severe constricting pain in the chest often radiating from the

area immediately over the heart to the left shoulder and down the arm ICD-9-CM 414 - ldquoOther forms of chronic ischemic heart diseaserdquo includes coronary atherosclerosis a chronic

condition marked by thickening and loss of elasticity of the coronary artery caused by deposits of plaque

ICD-9-CM 4297 - ldquoCertain sequelae of myocardial infarction not elsewhere classifiedrdquo refers to other symptoms or complications associated with a prior AMI

Cerebrovascular Disease

ICD-9-CM 430 - ldquoSubarachnoid hemorrhagerdquo is bleeding in the space between the brain and lining ICD-9-CM 431 - ldquoIntracerebral hemorrhagerdquo is bleeding within the brain ICD-9-CM 432 - ldquoOther and unspecified intracranial hemorrhagerdquo is bleeding nontraumatic between the skull

and brain lining ICD-9-CM 433 - ldquoOcclusion and stenosis of precerebral arteriesrdquo is blockage or stricture of the arteries

branching into the brain ICD-9-CM 434 - ldquoOcclusion of cerebral arteriesrdquo includes cerebral thrombosis and cerebral embolism ICD-9-CM 435 - ldquoTransient cerebral ischemiardquo includes cerebrovascular insufficiency (acute) with transient

neurological signs and symptoms ICD-9-CM 436 - ldquoAcute but ill-defined cerebrovascular diseaserdquo

35

Table 4 ndash Census Demographics for Study Area by Proximity to the Hudson River 2000

Census Demographics Study Area Within Block Groups Adjacent to River Block Group Centroid Distance to River Yes No lt frac12 mile frac12-1 mile gt 1 mile

n n n n n n Total Population and distribution within river distance categories

2924631 1000 315662 108 2608969 892 231414 79 283321 97 2409896 824

RaceEthnic Distribution 1000 1000 1000 1000 1000 1000 White 2383274 815 255023 808 2128251 816 154371 667 201457 711 2027446 841 Black 277302 95 34039 108 243263 93 43895 190 38959 138 194448 81 Total ldquoOtherrdquo (listed below)

American Indian 7147 02 882 03 6265 02 975 04 1058 04 5114 02

Asian 86465 30 6233 20 80232 31 4711 20 8979 32 72775 30 Pacific Islander 1058 lt01 111 lt01 947 lt01 104 lt01 145 lt01 809 lt01 Other 105182 36 12492 40 92690 36 19288 83 23338 82 62556 26 Multiple Races 64203 22 6882 22 57321 22 8070 35 9385 33 46748 19

Hispanic 268036 92 28449 90 239587 92 40691 176 46978 166 180367 75

Education lt9th grade 108312 56 13370 64 94942 55 13585 91 14518 81 80209 50 9th-12th grade 200374 103 27023 130 173351 100 23306 155 25161 140 151907 94 High school or GED 519236 267 62727 302 456509 263 44366 296 51585 287 423285 263 Some college 336283 173 36464 175 299819 173 24995 167 31141 173 280147 174 Associates degree 150134 77 16574 80 133560 77 10134 68 12939 72 127061 79 Bachelors degree 342297 176 28578 137 313719 181 18898 126 24584 137 298815 185 Graduate or professional degree

285823 147 23199 112 262624 151 14813 99 19909 111 251101 156

+Other SES Factors Households below poverty 91940 87 12875 108 79065 83 13977 154 14685 140 63278 72

mean SD mean SD mean SD mean SD mean SD mean SD Population density 57466 9589 54496 11391 57826 93468 13501 16071 11068 16248 43763 66909 Median household income 61010 31989 47320 23630 62668 32468 40748 21923 44962 20829 64847 32564

Sources US Bureau of the Census 2000 Census of population and housing summary file 1(SF1) US Department of Commerce 2001 US Bureau of the Census 2000 Census of population and housing summary file 3 (SF3) US Department of Commerce 2001

36

Table 5 Study Population Person Years by River Proximity Category For Income Quartiles and Race Categories

Person Years Total Adjacent Non-Adjacent lt12 mile frac12 to 1 mile gt1 mile n (row)

[column ] n (row)

[column ] n ()

[column ] n ()

[column ] n ()

[column ] n ()

[column ] Total Population 30219001 (1000)

[1000]

3269547 (108)

[1000]

26949454 (892)

[1000]

2380477 (079)

[1000]

2834956 (094)

[1000]

25003568 (827)

[1000]

Income Quartile

Lowest Income 7182853 (1000)

[237]

1345251 (187)

[411]

5837601 (813)

[217]

1462710 (204)

[614]

1236936 (172)

[436]

4483206 (624)

[179]

2nd Income 7778921 (1000)

[257]

1077888 (139)

[330]

6701033 (861

[249]

420361 (054)

[177]

808782 (104)

[285]

6549777 (842)

[262]

3rd Income 7671943 (1000)

[254]

551617 (072)

[169]

7120326 (928)

[264]

344840 (045)

[145]

511237 (067)

[180]

6815866 (888)

[273]

Highest Income 7585283 (1000)

[251]

294790 (039)

[090]

7290493 (961)

[271]

152565 (020)

[064]

278000 (037)

[098]

7154718 (943)

[286]

Race Category

White 25859811 (1000)

[856]

2784914 (108)

[851]

23074897 (892)

[856]

1761910 (68)

[740]

2238348 (87)

[790]

21859554 (845)

[874]

Black 2466185 (1000

[82]

297220 (121)

[91]

2168965 (879)

[81]

389721 (158)

[164]

316704 (128)

[112]

1759760 (714)

[70]

Asian 792737 (1000)

[26]

58007 (73)

[18]

734731 (827)

[27]

45151 (57)

[19]

78161 (99)

[28]

669426 (844)

[27]

Multiple Other 1100269 (1000)

[36]

129407 (118)

[40]

970862 (882)

[36]

183696 (167)

[77]

201743 (183)

[71]

714830 (650)

[29]

37

Table 6 Study Hospitalization Record Exclusions

After After dropping After imputing dropping records with

After excluding race amp poorly missing continuation After excluding eliminating geocoded covariates

Geocoded records transfers missing race records (racesex)

Total 202272 201814 02 169143 162 167733 08 154236 80 154220 001 (Loss ) Demograp hic group

White 155216 849 154805 849 132749 859 144262 860 132343 858 132332 858 Black 12878 70 12849 70 11109 72 12010 72 11328 73 11326 73 OtherNa 12745 70 12741 70 8814 57 9313 55 8647 56 8644 56 AsianPI 2021 11 2020 11 1810 12 2148 13 1918 12 1918 12 Missing Race 19412 96 19399 96 14661 87 (1410) Diagnosis

IHD 137569 680 137303 680 111005 656 110005 656 110000 656 101012 655

AMI 51840 256 51635 256 39338 233 39338 233 39013 233 35811 232

Angina Stroke

5792 64703

29 320

5788 64511

29 320

5316 58138

31 344

5316 58138

31 344

5285 57712

31 344

4731 53208

31 345

Isch str 23673 117 23543 117 21012 124 21012 124 20912 125 19371 126 Hem str 8097 40 8074 40 6544 39 6444 39 6492 39 5995 39

Missing covariates 11 records were missing sex 325 were missing age and 14661 records were missing race Race was imputed by random assignment based on block group of residencersquos race distribution for 13251 records (310 were also missing other covariates For approximately 1100 records with missing race geocoding was inadequate for assigning block group and 310 of those missing race were also missing additional covariates) One discharge can have one or more covariates missing (ie missing covariates are not mutually exclusive

38

Table 7 Income Quartile and Race Distributions for CVD Hospitalizations

Study Population

IHD n=101012

AMI n=35811

Angina n=4731

Stroke n=53208

Ischemic Stroke n=19371

Hemorrhagic Stroke n=5995

Income (Quartiles) Lowest Income 237 286 300 356 298 309 301 Second Income 257 254 265 237 256 256 252 Third Income 254 241 236 224 236 234 223 Highest Income 251 221 201 182 209 201 224

Total 1000 999 999 1000 1000

White 855 859 885 833 858 844 809 Black 82 64 62 102 92 108 110 Asian 26 12 12 11 12 14 19 Other 36 65 41 53 38 34 62

Study population income quartiles are based on census block median income Quartile cut-points were developed by weighting block group median income levels by block group population to identify the cut-points within the continuous income distribution After creating the cut-points the study population of each block group was assigned to the appropriate quartile category This resulted in the percentage of population in each quartile varying slightly from 25 because the entire populations of block groups were assigned to the same quartile

39

Table 8a Exposure Area Distributions and Rate Ratios for Ischemic Heart Disease Hospitalizations

Exposure Person-Years

(N=30219001) in

Ischemic Heart Disease Hospitalizations Total AMI Angina

No () No () RR (95 CI) No () RR (95 CI) No () RR (95 CI)

Adjusted for Age Sex Race Adjacent Block Group Yes 3269547 (108) 12327 (122) 133 (112-157) 4398 (123) 136 (114-163) 591 (125) 123 (106-142) No 26949454 (892) 88611 (878) 100 Reference 31380 (877) 100 Reference 4137 (875) 100 Reference Distance From River lt frac12 mile 2380477 (079) 9817 (97) 120 (103-140) 3399 (095) 126 (109-145) 542 (115) 160 (137-188) frac12 - 1 mile 2834956 (094) 11236 (111) 120 (104-140) 4048 (113) 125 (109-144) 610 (129) 156 (133-183) gt1 mile 25003569 (827) 79885 (791) 100 Reference 28331(792) 100 Reference 3576 (756) 100 Reference

Addrsquol adjustment for SES and other factors Adjacent Block Group Yes 3269547 (108) 12327 (122) 107 (104-111) 4398 (123) 106 (101-110) 591 (125) 106 (096-117) No 26949454 (892) 88611 (878) 100 Reference 31380 (877) 100 Reference 4137 (875) 100 Reference Distance From River lt frac12 mile 2380477 (079) 9817 (97) 102 (098-106) 3399 (095) 103 (098-109) 542 (115) 109 (098-121) frac12 - 1 mile 2834956 (094) 11236 (111) 103 (099-108) 4048 (113) 106 (101-111) 610 (129) 113 (102-125) gt 1 mile 25003569 (827) 79885 (791) 100 Reference 28331(792) 100 Reference 3576 (756) 100 Reference

Agegt25 1990-2005 50 sample Adjusted for age (25-34 35-44 45-54 55-64 65-74 75+ years) race (white black other) and sex (male female) Adjusted for SES factors quartiles of population density (61-5066 5066lt-21020 21020lt-64102 64102lt-943716 persons per square mile) median 1999 household income ($249900-4096800 $4096800lt-5465000 $5465000lt-7294400 $7294400lt-20000100) with less than a college education (0-306 306lt-427 427lt-555 555lt-1000) with Hispanic ethnicity (0-21 21lt-47 47lt- 96 96lt-860) and distance to nearest hospital

40

Table 8b Exposure Area Distributions and Rate Ratios for Stroke Hospitalizations Age gt= 25 1990-2005

Exposure Person-Years

(N=30219001) in Study Area

Stroke Hospitalizations

Total Ischemic Hemorrhagic

No () No () RR (95 CI) No () RR (95 CI) No () RR (95 CI)

Adjusted for Age Sex Race Adjacent Block Group Yes 3269547 (108) 6276 (118) 125 (109-144) 2334 (121) 128 (110-148) 635 (108) 099 (086-113)

No 26949454 (892) 46748 (882) 100 Reference 16989 (879) 100 Reference 5263 (892) 100 Reference

Distance From River lt frac12 mile 2380477 (079) 5280 (099) 116 (104-131) 1925 (100) 120 (105-137) 518 (088) 102 (088-117)

frac12 - 1 mile 2834956 (094) 6023 (114) 118 (106-132) 2227 (115) 127 (112-145) 658 (111) 115 (100-131)

gt1 mile 25003569 (827) 41721 (787) 100 Reference 15171 (785) 100 Reference 4722 (801) 100 Reference

Addl Adjustment for SES and Other Factors Adjacent Block Group Yes 3269547 (108) 6276 (118) 107 (103-112) 2334 (121) 110 (104-116) 635 (108) 097 (088 ndash 106)

No 26949454 (892) 46748 (882) 100 Reference 16989 (879) 100 Reference 5263 (892) 100 Reference Distance From River lt frac12 mile 2380477 (079) 5280 (099) 106 (101-110) 1925 (100) 108 (101-115) 518 (088) 089 (081-099)

frac12 - 1 mile 2834956 (094) 6023 (114) 106 (102-111) 2227 (115) 111 (104-117) 658 (111) 108 (099-119)

gt 1 mile 25003569 (827) 41721 (787) 100 Reference 15171 (785) 100 Reference 4722 (801) 100 Reference

Age gt 25 years 1990-200650 sample Adjusted for age (25-34 35-44 45-54 55-64 65-74 75+ years) race (white black other) and sex (male female) Adjusted for SES factors quartiles of population density (61-5066 5066lt-21020 21020lt-64102 64102lt-943716 persons per square mile) median 1999 household income ($249900-4096800 $4096800lt-5465000 $5465000lt-7294400 $7294400lt-20000100) with less than a college education (0-306 306lt-427 427lt-555 555lt-1000) with Hispanic ethnicity (0-21 21lt-47 47lt- 96 96lt-860) and distance to nearest hospital

41

Table 9 Rate Ratios for CVD Hospitalizations Stratified by Income Quartile

Rate Ratio for lt frac12 mile versus gt 1 mile from river

IHD AMI Angina Stroke Ischemic Stroke Hemorrhagic Stroke

RR (95CI) RR (95CI) RR (95CI) RR (95CI) RR (95CI) RR (95CI)

Adjusted for Age Sex Race Lowest Income 1 123 (108-140) 124 (110-139) 139 (120-160) 120 (109-133) 122 (107-138) 118 (103-137) Income 2 115 (101-131) 118 (101-139) 155 (124-195) 124 (113-135) 127 (109-147) --shyIncome 3 088 (075-103) 083 (068-102) 102 (076-137) 092 (079-107) 093 (076-113) --shyHighest Income 4 080 (064-099) 074 (054-103) 030 (013-070) 078 (065-093) 071 (053-095) 043 (024-076) Weighted average 100 098 091 102 100 071

Addrsquol Adjustment for other demographic factors Lowest Income 1 114 (107-120) 114 (106-123) 112 (097-130) 116 (109-124) 120 (110-131) 109 (095-125) Income 2 110 (102-118) 111 (101-123) 136 (108-172) 110 (101-119) 112 (099-128) 095 (076-119) Income 3 085 (077-094) 086 (075-097) 093 (068-126) 097 (087-108) 096 (082-112) 046 (033-066) Highest Income 4 068 (060-078) 069 (057-085) 028 (012-062) 069 (059-081) 058 (044-077) --shyWeighted average 092 093 079 096 093 078

To reduce the complexity of this Table only one of the distance categories is provided The other distance categories showed similar results Adjusted for age (25-34 35-44 45-54 55-64 65-74 75+ years) race (white black other) and sex (male female) Block group median income quartile ranges $249900-4096800 $4096800lt-5465000 $5465000lt-7294400 $7294400lt-20000100 Adjusted for demographic factors quartiles of population density (61-5066 5066lt-21020 21020lt-64102 64102lt-943716 persons per square mile) with less than a college education (0-306 306lt-427 427lt-555 555lt-1000) with Hispanic ethnicity (0-21 21lt-47 47lt- 96 96lt-860) and distance to nearest hospital

42

Table 10 Hospitalization Rate Estimates for Current Study and from Published Sources (age-adjusted to 2000 US standard per 10000)

Current Study 12 Counties abutting Hudson River 1990-2005

Total US CDC 2005

Northeastern US CDC 2005

SPARCS NYS excluding NYC 2005-2007

IHD 649 619 691

AMI 232 231 278

Angina 31 Stroke 349 303 307 351

Ischemic Stroke 127 39 Hemorrhagic Stroke

Source USDHHS 2005 National Hospital Discharge Survey No 385 July 12 2007 Source NYS DOH SPARCS 2009b Comparable estimates were not able to be provided because published outcome groupings differed

43

Figure 1 ZIP codes by Adjacency to the Hudson River

44

Figure 2 Census Block Groups by Adjacency to the Hudson River

45

Figure 3 Census Block Groups by Population-Weighted Centroid Distance to the Hudson River

46

IHD Hospitalization Rates by Sex Race Income (age-adjusted) (other race

group excluded)

000

200

400

600

800

1000

1200

1400

1600

Low est IncQ

Q2 Q3 Highest Inc Q

Income Quartile

Hos

pita

lizat

ions

per

10

00

w hite men black men

asian men w hite w omen

black w omen asian w omen

AMI Hospitalization Rates by Sex Race Income (age-adjusted) (other race

group excluded)

000

100

200

300

400

500

600

700

Low est IncQ

Q2 Q3 Highest Inc Q Income Quartile

Hos

pita

lizat

ions

per

10

00

w hite men black men

asian men w hite w omen

black w omen asian w omen

Stroke Hospitalization Rates by Sex Race Income (age-adjusted) (other

race group excluded)

000

100

200

300

400

500

600

700

800

900

Low est In Q

Q2 Q3 Highest Inc Q Income quartile

Hos

pita

lizat

ions

per

10

00w hite men black men

asian men w hite w omen

black w omen asian w omen

Figure 4 Figure 5 Figure 6

47

MAURICE D HINCHEY 22NO DISTRICT NEW YORK

COMMITTEE ON APPROPRIATIONS

~lIC()foltIIInas

AGRICUJTlAE RURAL OEElOPWNT rooo AND ORUG ADMINISTRATION

AND RBATEO AGEfjC~S

INTERIOR

~onlJltss of tl)t ~nttn ~tatt5 ~oU5t of ItpttUtlltl1tibeg lHaJington 111laquo 20515-3222

October 4 2006

Howard Frumkin MD DrPH Director Agency for Toxic Substances and

Disease Registty and CDCs National Center for Environmental Health

1600 Clifton Road NE MSE-28 Atlanta GA 30333

Dear Dr Frumkin

First I would like to congratulate you on your appoinbnent as Director of A TSDR and CDCs National Center for Environmental Health llook forward to the opportunity to work with you on important public health issues this year and in the future

My second reason for writing today is to request that ATSDR assist my office in conducting a health consultation study for residents along a 200 mile segment of the Hudson River a Superfund site on the National Priority List containing PCBs in order to measure whether further evaluation may be warranted

As you may know Dr David Carpenter has done a number of studies at SUNY Albany in New York that suggest that resideilcy in proximity to hazardous waste sites with Persistant Organic Pollutants (POPs) may be associated with increased hospitalization rates for Coronary Heart Disease (CHD) when compared to hospitalization rates tor areas with no waste sites or other types of waste not catagorized as POPs I have enclosed a copy of three of these research papers that demonstrate this correlation for heart disease hypertension and stroke

I am very concerned about the health of the residents living along the Hudson River and respectfully request that your agency look into what seems to be a very serious health threat For purposes of my request T would appreciate your agency looking into the subset population Dr Carpenter examined in 78 zip codes along the 200 mile stretch of the Hudson River from Hudson Falls to New York City This subset population studied by Dr Carpenter showed a statistically significant increase in hospital discharges for heart disease hypertension and stroke Interestingly according to CDCs data base the Behavioral Risk Factor Surveillance System this population on average has higher incomes exercises more frequently and conswnes more fruits and vegetables than the average New Yorker Additionally this population has more nonsmokers than the rest of Upstate New Yorks population

WASHINGTON OIFCE

2411 RAYBURN HOUSE OfFICE BUILDING WASH1NcrroN DC 20amp1amp-3222

(Zl2) 225-6335

_fOMetoYht lnchIY

BlNGI-UMTON OfRCE m-wA OfFICE KIGSTCN OFFIepound MIDDLnWJN OffICE MOJmCEllO OffiCE IDDA IfOCML lUllDlNG 123 SOUTH CAYJOA smra U l WAll

m STREET em HALL THlflD flOOR 18 ANWANf LAKf CAD

RIIrlGHAtATON NY 13901 SUITE 201 KINGSTOfl 12401 te JUpoundS STilEET MONTlCfllO NY 12701 (6171773-2768 ITHACA HY 14860 (M5)3~ MlDgtlETOWN NY 109(0 (amptS) 7S1-71e

(607) 273-~388 (845) l4C-3l11

Appendix A Representative Hinchey Letter

Dr Howard Frumkin Page 2 October 4 2006

Since this type of population would typically exhibit better than average cardiac health Im sme you can understand and appreciate my concern and sense of urgency that we must follow up on Dr Carpenters good work and detcnnine what further study is warranted to protect the health of the populatioD living and working aloDg the Hudson River

I than1c you for yom kind attention to my request I look forward to hearing back from you at your earliest convenience

Best regards

Sincerely

~ane~ey Member of Congress

MHlag

49

Appendix B Table 1a IHD Hospitalization Rate Ratios agesexrace adjusted versus additionally adjusted For residents (age 25 and older) of block groups closer versus more distant from the Hudson River in models adjusting for sex age race and block group median household income population density education level percent Hispanic and block group centroid distance to nearest hospital 1990-2005

Adjusted Rate Ratios by Block Group River Proximity (95 Confidence Intervals) IHD (N=101012) AMI (N=35811) Angina (N=4731)

Risk Factor Agesexrace adjusted

Addrsquol adjustment Agesexrace adjusted

Addrsquol adjustment Agesexrace adjusted

Addrsquol adjustment

Block group distance from River

lt frac12 mile vs gt 1 mile 120 (103-140) 102 (098-106) 126 (109-145) 103 (098-109) 160 (137-188) 109 (098-121) frac12 - 1 mile vs gt 1 mile 120 (104-140) 103 (099-107) 125 (109-144) 106 (101-111) 156 (133-183) 113 (102-125)

Sex Male vs female 197 (174-223) 214 (208-220) 211 (187-238) 209 (202-215) 113 (099-129) 113 (106-121) Race Black vs white 120 (102-142) 093 (089-097) 109 (094-127) 092 (087-097) 178 (152-208) 121 (108-136)

AsianPacific Is ldquo 080 (066-096) 073 (069-079) 085 (070-103) 072 (065-081) 075 (055-102) 065 (049-087) Other ldquo 246 (208-291) 377 (363-392) 189 (162-220) 204 (191-218) 257 (215-307) 205 (177-238)

Age 35-44 vs 25-34 809 (630-104) 824 (743-913) 647 (494-849) 658 (568-762) 491 (348-692) 498 (376-660) 45-54 ldquo 333 (261-426) 344 (312-380) 228 (175-296) 230 (200-265) 1539 (1108-214) 157 (120-206) 55-64 ldquo 827 (647-1057) 802 (727-885) 576 (443-748) 516 (449-593) 278 (200-386) 284 (217-371) 65-74 ldquo 1477 (1155-1890) 1384 (1255-1527) 1067 (820-1387) 899 (782-1033) 409 (294-567) 449 (344-586) 75+ ldquo 2124 (1656-2724) 1942 (1760-2142) 2190 (1683-2849) 1849 (1610-2123) 661 (476-919) 695 (533-905)

Population density

2nd lowest vs least dense

126 (121-132) 131 (124-137) 142 (127-159)

more dense ldquo 130 (124-136) 134 (127-141) 131 (116-147) most dense ldquo 120 (114-126) 121 (114-129) 115 (100-131)

Median Household Income

2nd highest vs highest income

105 (101-110) 113 (108-119) 123 (110-137)

lower ldquo 109 (104-113) 123 (117-130) 125 (111-141) lowest ldquo 116 (111-122) 129 (122-137) 162 (142-185)

with less than college

2nd lowest vs least college

114 (109-118) 118 (113-124) 110 (098-122)

higher ldquo 122 (117-127) 126 (119-132) 115(102-129) highest

ldquo

122 (116-128) 128 (120-135) 129 (113-148)

Hispanic 2nd lowest vs least Hispanic

093 (089-097) 094 (0890-098) 101 (091-112)

higher ldquo 092 (088-096) 096 (092-101) 118 (107-131) highest

ldquo

085 (081-088) 088 (084-093) 138 (123-154)

Distance to Hospital

2nd closest vs closest 097 (094-100) 094 (090-098) 107 (098-118) more distant ldquo 090 (087-094) 093 (089-098) 096 (086-107) furthest ldquo 087 (083-091) 091 (087-096) 089 (079-101)

50

Appendix B Table 1b Stroke Hospitalization Rate Ratios agesexrace adjusted versus additionally adjusted For residents (age 25 and older) of block groups closer versus more distant from the Hudson River in models adjusting for sex age race and block group median household income population density education level percent Hispanic and block group centroid distance to nearest hospital 1990-2005

Adjusted Rate Ratios by Block Group River Proximity (95 Confidence Intervals) Total Stroke (N=53208) Ischemic Stroke (N=19371) Hemorrhagic Stroke (N=5995)

Risk Factor Agesexrace adjusted

Addrsquol adjustment Agesexrace adjusted

Addrsquol adjustment Agesexrace adjusted

Addrsquol adjustment

Block group distance from River

lt frac12 mile vs gt 1 mile 116 (104-131) 106 (101-110) 120 (105-137) 108 (101-115) 102 (088-117) 089 (081-099) frac12 - 1 mile vs gt 1 mile 118 (106-132) 106 (102-111) 127 (112-145) 111 (104-117) 115 (100-131) 108 (099-119)

Sex Male vs female 114 (104-125) 127 (123-130) 126 (113-141) 131 (126-136) 114 (103-127) 123 (116-130) Race Black vs white 194 (173-219) 153 (147-160) 233 (204-267) 191 (180-203) 215 (188-246) 190 (173-209)

AsianPacific Isl ldquo 097 (083-113) 088 (080-096) 114 (094-138) 101 (088-116) 142 (114-177) 129 (106-157) Other ldquo 230 (203-260) 232 (219-245) 229 (197-265) 212 (193-232) 360 (310-417) 334 (296-377)

Age 35-44 vs 25-34 309 (251-379) 303 (269-343) 359 (267-483) 362 (289-455) 274 (212-353) 268 (219-329) 45-54 ldquo 114 (938-139) 1087 (970-1217) 149 (113-197) 135 (109-167) 655 (513-836) 605 (500-734) 55-64 ldquo 331 (273-402) 329 (295-368) 433 (329-569) 418 (340-514) 119 (933-152) 109 (904-132) 65-74 ldquo 826 (682-1001 861 (772-959) 1145 (872-1504) 1114 (908-1367) 214 (168-272) 224 (186-270) 75+ ldquo 1890 (1559-2290) 2037 (1828-2269) 2768 (2112-3627) 3060 (2497-3750) 554 (438-700) 587 (490-703)

Population density

2nd lowest vs least dense

126 (120-132) 124 (117-132) 129 (117-142)

more dense ldquo 126 (120-132) 121 (113-129) 142 (128-157) most dense ldquo 124 (118-132) 128 (119-138) 150 (133-169)

Median Household Income

2nd highest vs highest income

106 (101-110) 110 (103-117) 099 (090-108)

lower ldquo 113 (108-118) 113 (106-121) 107 (097-118) lowest ldquo 116 (110-122) 117 (108-126) 114 (102-127)

with less than college

2nd lowest vs least college

111 (107-116) 112 (106-119) 098 (090-107)

higher ldquo 120 (114-125) 120 (113-128) 102 (093-112) highest ldquo 116 (110-122) 119 (110-128) 093 (090-108)

Hispanic 2nd lowest vs least Hispanic

096 (092-100) 094 (088-099) 084 (077-091)

higher ldquo 101 (096-105) 096 (091-102) 084 (077-091) highest ldquo 096 (092-100) 088 (082-093) 078 (070-085)

Distance to Hospital

2nd closest vs closest 095 (091-098) 092 (087-097) 097 (089-105) more distant ldquo 092 (088-096) 091 (086-097) 089 (081-097) furthest ldquo 085 (081-090) 082 (077-088) 088 (079-098)

51

Appendix B Table 2 Sex and Race-Specific Age-Adjusted IHD AMI and Stroke Rates by Income (per 1000)

Men Women white black asian other white black asian other

IHD Lowest IncQ 470 401 492 940 283 354 477 543 Q2 425 339 675 1443 221 236 364 690 Q3 436 301 420 2463 199 233 236 902 Highest Inc Q 399 325 218 4307 160 172 120 1247 AMI Lowest IncQ 180 154 203 269 101 112 221 159 Q2 159 141 300 306 085 088 138 179 Q3 152 114 139 480 078 084 096 214 Highest Inc Q 135 130 046 820 062 065 052 275 Stroke Lowest IncQ 201 292 330 370 166 281 343 261 Q2 182 254 389 466 150 247 274 373 Q3 192 238 231 598 145 245 201 401 Highest Inc Q 177 275 102 905 132 197 070 776

These rates are estimated from the 50 sample of hospitalizations in the 12-county study area included in this investigation The rates are adjusted to the 2000 standard US population

52

CERTIFICATION

The health consultation for the Hudson River PCBs site Residential Proximity to the Hudson River and Hospitalization Rates for Ischemic Heart Disease and Stroke 1990shy2005 was prepared by the New York State Department of Health under a cooperative agreement with the Agency for Toxic Substances and Disease Registry (ATSDR) It is in accordance with approved methodology and procedures existing at the time the health consultation was initiated Editorial review was completed by the cooperative agreement partner

The Division of Health Assessment and Consultation (DHAC) ATSDR has reviewed this public health assessment and concurs with its findings

ATSDR

Page 12: HR CVD Final HCl 2011

studies published in 2005-2006 did not provide specific information such as environmental or biomonitoring data in support of the assumption that residence in ZIP codes containing hazardous waste sites with POPs or residence in ZIP codes adjacent to the Hudson River resulted in increased exposures to POPs in the population For the purpose of this follow-up we did not conduct environmental or human biological sampling to address this issue Rather this follow-up replicates the prior studiesrsquo approach with some enhancements to better control for confounding particularly confounding by socio-economic status

This descriptive epidemiologic investigation uses already existing hospitalization data from the Statewide Planning and Research Cooperative System and population data from the United States Census to evaluate comparative rates of cardiovascular hospitalizations While this type of investigation can not show proof of cause and effect it can assist in interpreting relationships among indicators of CVD burden and hypothesized exposures associated with distance from the Hudson River This descriptive investigation may also assist in determining whether additional health investigations of these associations are warranted

2 BACKGROUND

Cardiovascular Disease

The widely known risk factors for cardiovascular disease (CVD) in addition to age sex race and family history include tobacco use high blood pressure high serum cholesterol alcohol use obesity low fruit and vegetable intake physical inactivity diabetes low socio-economic status mental ill-health psychosocial stress and the use of certain medications (WHO 2010 NYS DOH 2009 Wing 1988 NYS DOH 2009a)

CVD Definitions CVD is a general term for any disease of the circulatory system The two main types of CVD are ischemic heart disease also termed coronary heart disease and cerebrovascular disease also called stroke Ischemic heart disease (IHD) results from restricted blood flow through the arteries supplying the heart muscle The most serious risk from IHD is acute myocardial infarction (AMI) another term for heart attack AMI occurs when blood flow to the heart is suddenly reduced or stopped usually due to a blockage in a coronary artery Angina is severe constricting pain in the chest often radiating from the area immediately over the heart to the left shoulder and down the arm Cerebrovascular disease refers primarily to stroke the sudden impairment of brain function resulting from interruption of circulation to the brain Ischemic stroke results from blockage of an artery supplying blood to the brain Hemorrhagic stroke occurs due to the escape of blood from a ruptured artery supplying blood to the brain

CVD Burden and Trends CVD is the leading cause of death for men and women of all ages in the United States and in New York State In 1999 for every death from IHD there were five hospitalizations for ischemic disease in NYS (NYS DOH 2009a) Mortality rates from CVD have been declining steadily since the 1980rsquos Research studies estimate that approximately half of the decline is due to improved treatments for CVD and its risk factors such as high cholesterol and high blood pressure and approximately half is due to decreasing incidence of CVD The decrease in incidence is due to trends in health behaviors primarily reduced smoking rates

3

CVD mortality rates do not appear to be declining for all segments of the population however and may actually be increasing among some groups particularly people of lower socio-economic status (Barnett et al 1999) This may be due in part to higher rates of smoking among people of lower socio-economic status as well as to other factors related to socio-economic status and race A growing literature points to multiple causes for CVD and other health disparities including health practices psychosocial stress limited resources discrimination and access to health care (Health United States 2007) Increases in the prevalence of diabetes and obesity in the general population partially offset advances in treatment and favorable trends for other risk factors An increasing ischemic heart disease mortality rate in women ages 35 to 54 since 1997 is attributed to these unfavorable risk factor trends (Ford 2007 Lloyd-Jones 2009) New York State (NYS) and other northeast states have high IHD and low stroke mortality with NYS having the highest IHD mortality and lowest stroke mortality among the 50 states (Howard et al 2009)

CVD Data Sources CVD mortality rates are more easily measured than incidence prevalence or hospitalizations because mortality data from vital records are a comprehensive and relatively accessible data source Information for estimating the incidence and prevalence of CVD outcomes comes primarily from health interview physical examination and record surveys Hospitalization data are available for NYS and the US but are generally collected and analyzed in order to track trends in hospital care and resources expended Actual CVD incidence data validated by physician diagnosis physician examination or reviews of medical records are not directly available for the entire US or for specific geographic areas of NYS Such information is available from population-based cohort studies that follow people over time in selected communities While the concern underlying this research project is to understand CVD levels in the community ie incidence and prevalence we do not have data that directly provide us with this information In the absence of such data hospitalizations are frequently used as indicators of the burden of disease in communities particularly in surveillance or hypothesis-generating investigations such as this one In NYS the Behavioral Risk Factor Surveillance System (BRFSS) an ongoing monthly telephone survey of adults aged 18 years and older provides information about the burden of CVD by age sex race income and education

CVD Risk Factor Data The BRFSS data for NYS show that over 20 of the population aged 65 and older reported having been diagnosed with CVD versus less than five percent of the population under 65 Males were slightly more likely to report having CVD than females Nonshyhispanic whites compared to other ethnic groups combined were slightly more likely to report having CVD None of these slight differences were statistically significant however The only statistically significant demographic risk factors in the BRFSS data were low income and education levels Statistically significant elevations of reported CVD were observed for respondents with incomes below $25000 and respondents with a high school degree or less education (NYS DOH 2009a)

Another factor that can affect the outcome measure hospitalizations is the likelihood of mortality prior to reaching the hospital Studies show a variety of factors affect whether an individual experiencing a heart attack or stroke outside the hospital survives to be counted in hospitalization data These factors include lack of knowledge about personal risk or prior disease socio-economic status race marital status and distance to the hospital (Galea 2007

4

Barnett 2006 Ayala 2003) Population-based studies report sudden cardiac death usually outside the hospital to be the first overt manifestation of heart disease for 40 to 60 of all cases ( Kannel 1987 deVreede-Swagemakers 1997 UK Heart Attack Study 1998) An analysis of out-of-hospital mortality from AMI in Pennsylvania in 1998 showed that in relatively rural areas where travel time to the hospital was 25 minutes or longer 72 of all AMI deaths occurred outside the hospital compared with a statewide average of 49 of AMI deaths occurring outside of the hospital (OrsquoNeill 2003) In New York State 55 of IHD deaths occur outside the hospital compared to 35 of stroke deaths (NYS DOH 2009b)

Hudson River PCB Contamination

During an approximate 30-year period ending in 1977 the General Electric (GE) plants in Hudson Falls and Fort Edward used PCBs in the manufacture of electrical capacitors and discharged large quantities of PCBs into the upper Hudson River Estimates of the total direct discharges to the river are as high as 1330000 pounds In 1983 the US EPA classified a 200shymile section of the Hudson River from Hudson Falls to New York City as a National Priority List site making it one of the largest Superfund sites in the United States In 2002 the US EPA issued a Record of Decision calling for targeted environmental dredging and removal of PCB-contaminated sediment in the upper River (USEPA 2002) The dredging began in May 2009

The Superfund environmental investigations and planned remedial actions have focused on the areas of the river near Hudson Falls and Fort Edward where the highest levels of PCBs in sediments are located The 2002 Record of Decision provides information about the human health risk assessment conducted for the Upper and Mid-Hudson and the ecological risk assessment conducted for the Upper and Lower Hudson River (USEPA 2002) Fish contamination and fish consumption are the primary concerns at the site

In 1976 fishing was banned in the upper river and in 1995 a catch and release policy replaced the ban NYS DOH continues to recommend people eat no fish from the Upper Hudson River that children under age 15 and women of childbearing age eat no fish from the entire 200 mile stretch of river below Hudson Falls and that the general population eat none of most species of fish caught between the federal dam at Troy and Catskill approximately 40 miles south of Troy The Lower Hudson remains closed to commercial fishing for striped bass and eight other species

The fishing bans and advisories were issued because ingestion of fish is the major potential pathway for exposure for the general population Many previous studies have shown relationships between consumption of fish from contaminated waters and human PCB levels Another potential pathway for exposure less frequently studied to date is inhalation of PCBs NYS DOH and ATSDR launched ldquoThe Hudson River Communities Projectrdquo in 2000 to address this issue (More information on this project is provided below)

POPS and PCBs

The category ldquopersistent organic pollutantsrdquo (POPs) includes man-made chemicals that are by definition persistent in the environment Examples of POPs are polychlorinated biphenyls

5

(PCBs) polychlorinated dibenzo-p-dioxins (PCDDs) polychorinated dibenso-furans (PCDFs) and pprsquo-diphenyldichloroethene (DDE) a breakdown product of the pesticide dichlorodiphenyltrichloroethane (DDT) These chemicals are stored in human fat tissues accumulating over an individualrsquos lifetime They also accumulate in fish and other species thereby concentrating up the food chain Levels of most POPs measured in blood serum of Americans have decreased in the last few decades An exception is polybrominated diphenyl ethers (PBDEs) levels of which have been increasing in recent years While levels of many POPs are declining a large number continue to be detectable in blood for most of the US population (USDHHS 2009)

This follow-up investigation focuses on the Hudson River where the POPs of concern are PCBs PCBs are a group of synthetic chlorinated compounds manufactured in the US through 1977 Production was halted due to concerns about the persistence of PCBs in the environment and the toxicity of PCBs shown in animal models PCBs were used primarily by the electrical industry for capacitors and transformers and were also used in hydraulic fluids fluorescent light fixtures flame retardants inks adhesives carbonless copy paper paints pesticide extenders plasticizers and wire insulators PCBs can be oily liquids or solids and some are volatile and can exist as vapor in air PCBs in the environment are generally mixtures of the many different types known as congeners (ATSDR 2000)

The variety of types of PCBs adds to the complexity of studying adverse animal and human health effects Increasingly PCB research focuses on specific PCB congeners and groups of congeners because specific types of congeners appear to have different mechanisms of action and toxicity Dioxin-like PCBs (DLPCBs) are an important subset of PCBs due to concerns about their toxicity The focus on dioxin-like PCBs is paralleled by increasing research on the group of POPs with dioxin-like properties This type of PCB and POP research assigns toxic potency values relative to dioxin to various PCBs and POPs to provide an overall toxic equivalency value to specific PCB and other POPs

From 2002 through 2007 NYS DOH and SUNY Albany School of Public Health researchers conducted the Hudson River Communities Study among residents of Hudson Falls and Fort Edward the villages where the PCB discharges from two GE electrical capacitor plants occurred The comparison community was upriver from the discharges in Glens Falls This studyrsquos evaluation of exposures included measuring PCBs in outdoor air and in blood serum The study found that PCB levels in outdoor air were statistically significantly higher in the Hudson FallsFort Edward area (072 nanograms per cubic meter) than in the upriver Glens Falls area (040 nanograms per cubic meter) but that the PCB concentrations in the Hudson FallsFort Edward area were lower than those in other communities with known PCB-contaminated sites and similar to levels reported in other locations in the northeastern United States A statistically significant reduction in the total PCB concentrations in outdoor air samples was observed at a distance of 1200 meters (34 mile) from the river from 0760 ngm3 to 0497 ngm3 (Fitzgerald et al 2007 Palmer et al 2008)

The average PCB concentrations in serum for the study and comparison populations did not differ significantly (307 ppb wet weight versus 323 ppb) The average toxic equivalency quantities (TEQ) for dioxin-like PCB congeners for the study and comparison area populations

6

also did not differ In addition the study found no detectable differences in study subjectsrsquo serum PCB levels associated with proximity to contaminated portions of the river or other PCB sources The study did find that lifetime consumption of Hudson River fish was associated with higher serum PCB levels (Fitzgerald 2007 Palmer 2008) Another recent biomonitoring study of Brooklyn Manhattan and New Jersey fisherman who consumed versus those who did not consume their catch from the lower Hudson River and New YorkNew Jersey harbor suggests eating fish from these waters is not associated with an increased body burden of PCBs or most other organochlorines This study did not address the inhalation pathway (Morland 2008)

PCBs and Cardiovascular Disease Outcomes

Research studies on adverse health effects associated with PCB exposure have focused on many types of outcomes organs and biological systems including cancer liver reproductive and developmental effects thyroid endocrine immune system and neurological effects as well as specific effects on the skin and eyes Relatively little research has focused to date on associations between PCB exposures and CVD outcomes The majority of human studies have assessed health effects in occupational groups having relatively high-level exposures among people consuming fish from contaminated waters or people exposed to relatively high levels due to accidental contamination of food (USDHHS NIH 2010)

PCBs have been shown to produce a variety of toxic effects in animals These effects include cancer allergies hypersensitivity damage to the central and peripheral nervous systems reproductive disorders and disruption of the immune system (ATSDR 2000) In contrast research to date does not tell us whether specific types of human health effects have occurred from the low-level exposures experienced by the general population Some human epidemiological studies of relatively low levels of PCDDs PCDFs and PCBs suggest possible relationships not only with neurodevelopment of infants but also diabetes and a condition called insulin resistance conditions that increase CVD risk (Longnecker 2001 Carpenter 2006 Arisawa 2005)

In 1998 a study of a population in Seveso Italy exposed to relatively pure TCDD (dioxin) from an industrial accident showed increased mortality from CVD respiratory diseases and diabetes (Bertazzi 1998) These findings suggested the possibility that other dioxin-like chemicals such as dioxin-like PCBs might increase risk for these diseases Some more recent animal studies described below suggest possible mechanisms of action for PCB effects that increase risk for CVD diabetes and hypertension Another relevant source of information is from research using the biological monitoring data from the National Health and Nutrition Examination Survey (NHANES) project to look at the association between PCB levels in blood and self-reported diabetes and heart disease The body of evidence based on analytical human epidemiological studies remains relatively sparse however

Some recent studies using NHANES biomonitoring data show associations between PCB levels and CVD (Lee 2006 Lee 2007 Ha 2007 Everett 2008) These studies are cross-sectional (comparisons at one point in time) evaluations of PCB levels measured in blood samples from individuals and self-reported CVD and diabetes a risk factor for CVD While these studies use individual-level biomonitoring and self-reported health outcome data they are considered to be

7

relatively weak in terms of providing evidence of causality because both exposure and outcome are measured at the same point in time Given the nature of NHANES data collection and reporting practices these studies are not able to control for possible geographical confounding which may occur if urban versus rural residence is associated with higher PCB levels and higher risk for CVD for example The higher PCB levels among African Americans in these data suggest urban residence may be playing a role in the PCB data

The results of animal studies suggest multiple mechanisms by which PCB exposures could affect cardiovascular risk as well as multiple types of adverse health outcomes One hypothesized mechanisms of action specifically for dioxin-like PCBs is that they may have anti-estrogenic effects which impact CVD risk factors such as serum cholesterol levels and blood pressure (Lind 2004) Several studies suggest PCBs interact with a specific receptor (AhR) and increase cellular oxidative stress causing inflammation cell injury and cardiovascular dysfunction (Hennig 2002) Similarly some studies suggest PCB-induced inflammation of fat cells leads to obesity and increased risk for coronary artery disease (atherosclerosis) (Arsenescu 2008)

3 FOLLOW-UP STUDY DATA SOURCES AND METHODS

This follow-up investigation will provide perspective on the 2004-2005 findings (Sergeev and Carpenter 2005 Shcherbatykh et al 2005) by producing results from analyses conducted at a more detailed level of geography the block group rather than ZIP code in order to better control for confounding When the term confounding is used by epidemioogists it refers to the problem that can arise when there is a factor such as income level that is not the factor of interest in the study but that is related to the risk factor being studied (residential location ndash proxy for exposure) and to the disease being studied (CVD) This type of factor is labeled a ldquoconfounderrdquo because it can confound ie interfere with or confuse the evaluation of associations among the variables of interest in the data All types of epidemiological studies face issues of confounding but ecological ie group-level analyses face additional challenges related to potential confounding because analyses can not account for some important individual differences within the groups that are the units of analysis (Morganstern 1998b Hertz-Picciotto 1998) This investigation was able to control for each individualrsquos age sex and race but not for individual-level risk factors such as income or education level

The follow-up investigationrsquos specific objectives are to

improve the ability to control for confounding by socio-economic status through the use of block groups rather than ZIP codes as the unit of analysis

compare the hospitalization rates among residents of block groups nearest the Hudson River to those among residents of block groups farther from the Hudson River

compare the findings of this follow-up study to those of the earlier studies

Study Area and Exposure Areas

The overall study area is comprised of the 12 counties with geographic areas abutting the Hudson River south of the source of PCB contamination from the General Electric facility in Hudson Falls The 12 counties are Westchester Rockland Putnam Orange Dutchess Ulster

8

Columbia Greene Rensselaer Albany Washington and Saratoga US Census boundaries are used to define the study area because census population estimates provide the data for the study area populationrsquos size and the populationrsquos age sex and race distribution Because the study period spanned 1990 through 2005 and populations change over time 1990 and 2000 Census data were used to linearly interpolate the between-year populations from 1991 through 1999 for each block group 2000 data were used without extrapolation to estimate block group populations from 2001 through 2005

Because block group boundaries change between the Census years 1990 block populations were normalized to fit 2000 block group boundaries before the estimation of sex age and race group specific populations In the digital map 2000 block group boundaries were overlaid on the 1990 block centroids to assign 1990 block population counts to the corresponding 2000 block group boundaries Then 1990 block populations were summed to fit the block group boundaries Next population counts by age group (0-14 15-24 25-34 35-44 45-54 55-64 65-74 75+) sex (male female) and race (white black asian other) were estimated for each year at the block group level This investigation includes all ages over 25 years and the study population is divided into four race categories white black or African-American Asian and ldquoOtherrdquo The Pacific Island group was combined with the Asian group and American Indian ldquootherrdquo and ldquomultiple racesrdquo categories are combined into a larger ldquootherrdquo category

Classification of the population into potential exposure categories for this study is based exclusively on location relative to the Hudson River Two types of exposure proxy categories were developed block group adjacency and block group centroid distance to the river shore For the adjacency categories block groups were assigned to the adjacent category if they directly abutted the Hudson River and all other block groups in the 12 counties were assigned to the nonshyadjacent category

For the distance categories a central point (centroid) within each block group was identified and used to assign block group centroid distance to the river shore The block group centroids were population-weighted which means the geographic distribution of households within 2000 census blocks were used to locate block group centroids based on the geographic population distribution By locating a population-weighted block centroid rather than a more straightforward distance centroid the assignment of block group distance to the river shore more accurately reflects distance to the river for the majority of block group residents Buffers of one-half mile and one mile from the Hudson River boundary were created for the assignment of each block group into one of the three distance categories less than frac12 mile (closest) frac12 to one mile (close) and farther than one mile from the river (distant)

Distance categories were selected a priori at the start of the study Because environmental sampling data were not collected along the length of the river included in this investigation we are not able to determine whether these distance categories represent actual differences in exposures As discussed previously a statistically significant reduction in the total PCB concentrations in outdoor air samples was observed at a distance of 1200 meters (34 mile) from the river from 0760 ngm3 to 0497 ngm3 in a study conducted in the Hudson Falls Fort Edward area near the source of PCBs (Fitzgerald 2007 Palmer 2007)

9

For the adjacency categories of the 2371 block groups in the study area 255 (11) were classified as adjacent to the Hudson River and 2116 (89) were classified as non-adjacent (Figure 2) For the distance categories 225 (9) were classified as closest to the river 216 (9) as close and 1930 block groups (81) as distant from the river

Comparison of Figures 1 and 2 show that block groups include smaller geographic areas than ZIP codes and that block groups are relatively smaller in area in urban versus rural areas The study area includes a total of 368 ZIP codes compared to a total of 2371 block groups ZIP code average population in the study area is approximately 8000 compared to block group average population of 1200 Comparison of Figures 2 and 3 shows that using population-weighted centroid distance to the river produces different exposure areas than adjacency Fifty-three (20) of the adjacent block groups were categorized for the distance indicator as distant because the population in the block group was concentrated in the area farther than one mile from the river On the other hand about 11 of the 2116 non-adjacent block groups were classified as closest (77) or close (162)

Hospitalization Data

Hospitalizations for ischemic heart disease (IHD) and cerebrovascular disease (stroke) were included in this study More specifically hospital discharge records that listed as the principal diagnosis either IHD (ICD-9-CM 410-414 and 4297) or cerebrovascular disease (ICD-9-CM 430-436) and subsets of each of these categories similar to the subsets evaluated in the Sergeev et al and Shcherbatykh studies were included The subsets are AMI and angina subsets of IHD and ischemic and hemorrhagic stroke which are subsets of cerebrovascular disease (stroke) (Principal diagnosis represents the illness for which the person was admitted to the hospital) Angina is an outcome not evaluated in the prior studies It was added here for comparative purposes because it represents an outcome appropriately addressed by primary outpatient rather than inpatient hospital care Angina is one of a group of outcomes included in the recently developed national ldquoPrevention Quality Indicatorsrdquo (USDHHS AHRQ 2007) as an indicator for geographic areas or populations for whom primary care is inadequate Table 3 lists the specific ICD-9-CM codes and definitions included in this investigation

While one of the previously published studies (Huang et al 2006) showed elevations in hypertension (high blood pressure) rates this outcome is rarely listed as the principal reason for hospitalization As a secondary diagnosis however hypertension is extremely common with substantial overlap with the primary diagnoses of IHD and stroke In addition hypertension hospitalizations were not as elevated in the ZIP codes adjacent to the Hudson River as the IHD and stroke outcomes As a practical matter geocoding the hypertension hospitalization data (1364000 hospitalizations) would overwhelm the resources for this project For these reasons hypertension was not included in this follow-up investigation

The source of the hospitalization data was the NYS DOH Statewide Planning and Research Cooperative System (SPARCS) established in 1979 to collect detailed records on discharges from hospitals located in New York State Persons seen in the emergency department but not admitted are not included in the SPARCS data Hospitalizations at US Veterans Administration

10

Hospitals are also not included in this dataset We identified the hospital transfers in the data and excluded them so that individuals are only counted once for the same event

The NYS DOH Data Protection Review Board for SPARCS hospitalization data approved our use of hospitalization records for this investigation and provided approximately 460000 records with a principal diagnosis of either IHD or stroke Because of the large number of cardiovascular hospitalizations and this studyrsquos requirement for geocoding each hospitalized personrsquos address we selected a 50 random sample of the hospital discharges with principal diagnoses of IHD or stroke for inclusion in the study (229948 records) Approximately one percent of these records (2737) were eliminated from the study because they contained no residential address resulting in 227211 hospitalization records 24939 of these records were eliminated after geocoding because they were not within the 12 county study area This left 202272 records remaining in the study

For geocoding to the block group level the hospitalization street addresses were modified as needed using US Postal Service standards The addresses were assigned geographic coordinates using commercially available geocoding software (MapMarker 2004) A combination of land parcel data and street centerline files were used to assign geographic coordinates No contact was made with cases parents legal guardians or next of kin of cases to determine residential locations

Once geographic coordinates were assigned to cases through address-matching the case locations were overlaid onto digital maps of the study area using a geographic information system so that the number of observed cases falling within the study area boundaries could be determined and a 2000 Census block group could be assigned to each hospitalization In order to protect confidentiality no maps of individual case locations are provided Geographic coordinates were not able to be assigned for some addresses for example rural routes or post office boxes Internet searches were conducted for addresses that were located in facilities such as nursing homes and trailer parks

While fewer than one percent of records were missing sex or age information approximately 15000 records seven percent were missing race information Evaluation of the geographic and temporal distribution of these records showed they were not randomly geographically distributed 47 of the records with no race information were from one hospital for that hospital race was missing from 100 of the records in all years except one after 1997 mid-way through the study time-frame Evaluation of other records missing race showed similar patterns of 100 of records missing race in specific years In order to prevent biased hospitalization rate estimates records missing race were assigned race randomly assuming a race distribution equal to that of the individualrsquos residential block group

Statistical Analysis

This investigation used negative binomial multivariable regression models the statistical methods used in the previous studies (Sergeev and Carpenter 2005 [IHD] Shcherbatykh et al 2005 [stroke]) The follow-up studyrsquos analyses compared hospitalization rates for populations in block groups adjacent or in close proximity to the Hudson River to rates for populations more

11

distant from the Hudson River while adjusting for the potential influence of other factors Sex age and race-specific hospitalization count data comprise the dependent variable and the statistical methods attempt to predict the counts using the sex age and race distributions from Census data as well as block group-level demographics such as median income Models were first estimated using basic Census information for age sex and race Then more fully adjusted models were estimated using information at the block group level for income education population density Hispanic ethnicity and distance to the nearest hospital

More specifically each hospitalized personrsquos block group was used to assign quartile categories of block group median household income of population with less than a college education population density of the population identified as Hispanic and distance from the block group centroid to the nearest hospital These variables were selected in advance The population density variable was selected to account for potential effects on hospitalization rates of large differences in population density from the relatively rural northern counties to the more urban southern counties in the study area of the population identified as Hispanic was included because Hispanic ethnicity may be associated with CVD risk but Hispanic ethnicity was not available in the hospitalization records Distance to the hospital was included because studies show that shorter distances increase the likelihood that individuals with AMI and stroke survive to be admitted to the hospital

As described previously the literature about risk factors for CVD shows strong associations between lower income and education and higher CVD risk Regarding the effects of income and education on hospitalization rates the literature from analytic studies specifically of hospitalizations for CVD is quite sparse Most analytic studies focus on prevalence incidence or mortality in population-based studies or cohort studies and there are not many of these An example of a study that was useful as background is a study of income level and CVD mortality that showed strong associations of lower income and higher CVD mortality for men and women with the relationship for men being linear throughout the range of income levels while for women the relationship was only strong at levels of income below the median (Rehkopf 2008)

This follow-up studyrsquos approach was to use the same set of variables for the more fully adjusted models using quartile cut points for all the sexrace groups and types of CVD diagnostic subsets The 2000 Census data were used to create the population-weighted quartile categories to be used in the statistical analyses The quartile cut points are shown below

population density(persons per square mile)

median 1999 household income

with less than a college education

with Hispanic ethnicity

Hospital distance

Quartile 1 61 - lt507 $2499-lt$40968 0 - lt31 0 - lt2 lt25 (median) Quartile 2 507 - lt102 $40968 - lt54650 31 - lt43 2 - lt5 25 - lt6 (50-75) Quartile 3 102 - lt6410 $54650 - lt72944 43 - lt55 5 - lt10 6 ndash 11 (75-90) Quartile 4 6410 - 94372 $72944 - 200001 55 - 100 10 - 86 11 ndash 45 (90-100)

SAS version 9 provided the statistical programs used to estimate hospitalization rates for exposure proxy categories and demographic categories The models provide ldquocontrast

12

estimatesrdquo ie rate ratios for one category versus a reference category of a variable and 95 confidence intervals (CI) for these rate ratios For the location categories rate ratios for the areas adjacent are compared to areas not adjacent to the river (reference) and areas closest and close to the river are compared to areas distant from the river (reference) If a rate ratio is greater than 100 the model has estimated an elevation of hospitalizations in that category of the study population compared to the reference population If the rate ratio is less than 100 then there are fewer than expected hospitalizations compared to the reference population

The magnitude of the elevation or deficit is also estimated by the rate ratio For instance if the hospitalization rate is twice as high in the adjacent population it would result in a rate ratio of 200 while a 50 higher rate would result in a rate ratio of 150 If the hospitalization rate in the comparison group was only half as big as in the reference group this would result in a rate ratio of 050 The 95 confidence interval assists with determining whether the observed differences are statistically meaningful ie statistically significant The confidence interval provides the range in which there is a 95 probability of including the true rate ratio

In addition to directly assess potential confounding we examined the associations between income and exposure proxy categories and between income and CVD hospitalization risk Because the modeling process provides rate ratios that provide comparisons among groups but not the rates themselves we also calculated the rates per population for each sex race and income category in order to directly observe the relative magnitudes of CVD hospitalization rates Finally analyses stratified by income quartile were also conducted in order to assess the potential role of confounding by income The stratified analyses consist of negative binomial regressions conducted separately for each quartile of income

4 RESULTS

Demographics Associations with Proximity to the River

Study Population Table 4 provides detailed demographic characteristics for the total study population and the populations of the adjacent versus non-adjacent and nearer versus farther from the river block groups from the 2000 Census The total population of the study area for this one year of the study time period is 2924631 persons The first row of Table 4 shows that approximately 11 percent of the study area population is classified as living in block groups adjacent to the Hudson River and 89 live in non-adjacent block groups Approximately 8 of the studyrsquos population lives in the closest block groups approximately 10 lives in close block groups and approximately 82 in distant block groups The table also shows that approximately 81 of the total study population is identified as white 9 black 3 Asian less than 01 Pacific Islander 02 American Indian 4 ldquootherrdquo and 2 ldquomultiple racesrdquo

The 2000 Census data (Table 4) also show that the populations in block groups adjacent and nonshyadjacent to the river are similar in terms of the percent of minority population but the adjacent block groups include a smaller percentage of persons who have attended college and a higher percentage of households below the poverty level Median income is $47320 for block groups adjacent versus $62668 for block groups not adjacent to the river Data for the study area from the 1990 Census show similar distributions (data not shown)

13

For the distance to the river categories there is a gradient for the three levels of distance from the river with the closest block groups on average having higher percentages of minorities smaller percentages of persons who have attended college and higher percentages of households below the poverty level (Table 4) The populations of the close and closest block groups are more similar to each other than to the populations of the distant block groups The population of the closest block groups is estimated to be 19 black close block groups 14 black and distant block groups 8 black The proportions for the ldquootherrdquo category used in the analyses which includes multiple races and American Indians are approximately 12 in both the closest block groups and in the close block groups compared to 5 in the block groups most distant from the river Median income for the closest block groups is $40748 close block groups $44962 and most distant block groups $64847

The bar chart below illustrates the correlations between US Census 2000 indicators of block group socio-economic status and block group centroid (population weighted) distance from the river for the study population

2000 Census Demographics for Study Population by Distance to River Category

black other hispanic no college below poverty

of pop ulati on

0

10

20

30

40

50

60

lt12 mile

12 to 1 mile

gt1 mile

Table 5 shows selected demographic information for the person-year estimates used in the study analyses for the 1990-2005 study period Person-year data represent individuals from Census estimates counted repeatedly for each year of the study timeframe As described in the Methods section the estimation of person years used interpolation to account for population changes from 1990-2000 and techniques for taking account of block boundary changes The person-years estimates for demographic categories (complete data not shown) show similar patterns as the 1990 Census data (data not shown) and 2000 Census data (Table 5) Table 5 for example shows nearly identical percentages of population in each of the river distance categories as Table 4

Table 5 also shows person year information for income quartiles and exposure proxy categories as well as race and exposure proxy categories that is consistent with the Table 4 data for the

14

2000 population Table 5 row percentages show for example that while approximately 11 of the total study population live in adjacent block groups approximately 19 of the population in the lowest income quartile live in adjacent block groups By design approximately 25 of the total study population is in each of the income quartiles Table 5 column percentages show however that in the adjacent block groups 41 of the population is in the lowest income quartile and only 9 are in the highest income quartile

For the block groups within frac12 mile of the river the distribution among income quartiles is even more skewed towards the lowest income quartile 61 of the population living within frac12 mile of the river is in the lowest income quartile and only 6 is in the highest income quartile Given that approximately 25 of the total study population is within each income quartile these percentages indicate that living within the block groups closest to the river is associated with more than a doubling of the likelihood of being in a block group in the lowest socio-economic status quartile 436 of those living in the block groups close to the river (12 to one mile from the river) are in block groups in the lowest income quartile While less than a doubling this indicates that living in the close block groups is associated with a 75 increase in the likelihood of being in the lowest quartile for socio-economic status

Hospitalizations Table 6 shows the process of removing hospitalization records from the study for a variety of reasons The first column includes the 202272 records that contained a residential address and were located within the 12-county study area as described in the Methods section The next five columns show the records remaining after data cleaning exclusion of transfers elimination of records with missing covariates and exclusions due to poor geocoding results Approximately 33000 records were eliminated because they were transfers from one hospital to another for the same event For approximately 1100 records geocoding was inadequate for assigning block group for imputing race as described in the Methods section 336 records were missing sex (11) or age (325) Of the 167733 records remaining 154220 (92) were able to be adequately geocoded to a street address for assignment in a specific block group While approximately eight percent of hospitalizations were eliminated due to inadequate geocoding Table 6 shows that the percentages of hospitalization records for white black Asian and ldquootherrdquo categories and the distribution among diagnostic categories change very little after these records were dropped

The final column of Table 6 shows that of the 154220 hospitalizations in the analysis 101012 IHD hospitalizations comprise the largest health outcome category and are 65 of the hospitalizations in the study AMI hospitalizations (35811) account for 35 of the IHD hospitalizations and angina (4731) accounts for five percent of the IHD hospitalizations Stroke hospitalizations (53208) comprise 34 of the studyrsquos hospitalizations Approximately 36 of the stroke hospitalizations are for ischemic stroke (19371) and approximately 11 are for hemorrhagic stroke (5995)

Table 7 shows the distributions of hospitalizations among the four income quartiles While each income quartile (by definition) contains approximately 25 of the study population for all types of hospitalizations we see that the lowest income quartile includes more than 25 of hospitalizations from a minimum of 286 (IHD) to a maximum of 356 of hospitalizations (angina) The highest income quartile includes fewer than 25 of hospitalizations for each CVD

15

category ranging from 182 for angina to 224 for hemorrhagic stroke These deviations from the 25 quartile distribution show a strong linear pattern with the largest percentage of hospitalizations coming from residents of block groups in the lowest income quartile and the smallest percentages coming from residents of the most wealthy block groups For AMI the lowest income quartilersquos share of hospitalizations is 50 higher than the highest income quartilersquos share of hospitalizations (30 vs 20) IHD and stroke show similar differences and angina shows a doubling with the lowest income quartile contributing 36 of hospitalizations compared to the highest income quartilersquos 18 share of hospitalizations

Table 7 also shows the distribution of hospitalizations among the race categories and the study population race category distribution for comparison Blacks show lower percentages of IHD and AMI than their population percentages and higher percentages of angina and stroke While full presentation of race and sex findings is beyond the scope of this report the lower than expected percentages of IHD and AMI among blacks who on average have lower socioshyeconomic status and are at greater risk for CVD hospitalizations raised questions that were addressed by calculating hospitalization rates per 1000 population for each sex and race group for each income quartile (Appendix B Table 2) Figures 4 5 and 6 show these rates for the white black and Asian categories (These rates for the ldquootherrdquo category are comparatively very high and are not shown on these graphs This issue is addressed in the Discussion section) Figures 4 5 and 6 show the consistent and strong effects of income quartile on hospitalization rates for AMI and stroke for whites blacks and Asians of both sexes The other outcomes show similar patterns (The ldquootherrdquo group showed an opposite pattern [See Discussion Section])

Demographics Summary

The strong associations of lower income with residence closer to the river (Table 5) and lower income with higher hospitalization rates (Tables 7 amp 8) create a difficult challenge for this investigation The multivariable analyses of hospitalization rates attempt to control for socioshyeconomic differences However because these income differences which strongly affect hospitalization rates are also strongly correlated with the gradations of exposure (river distance) it will be difficult to disentangle the influence of socio-economic status from the influence of potential exposures

Statistical Analyses Proximity to the River and CVD Hospitalization Risk

All the variables described above were included in the multivariable regression models and almost every quartile category for every variable showed statistically significant and expected types of effects for all the CVD hospitalization categories (Appendix B Tables 1a and 1b) The following presentation of results focuses on the rate ratios for the exposure proxy variables Table 8a (IHD) and 8b (stroke) show the regression modeling results Results are shown first for models that include only age sex and race in addition to the exposure proxy categories and second for models that also include median income education population density Hispanic ethnicity and distance to nearest hospital The tables provide numbers of hospitalizations and person-years to assist with interpretation For example for total IHD we can see that while 108 of the study population lives in adjacent block groups 122 of the hospitalizations are among people in adjacent block groups This difference of 14 percentage points is the basis for

16

the finding of a statistically significantly elevated rate ratio (risk) for IHD among people living in the adjacent block groups (RR=133 CI 112-157)

Looking across the columns of Tables 8a and 8b we can see the rate ratios for each of the outcomes for the adjacent versus non-adjacent block groups and the results are similar to those for IHD The elevated rate ratios range from 123 for angina to 136 for AMI The only rate ratio not elevated in these first models adjusting only for age sex and race is for hemorrhagic stroke Going down the rows of the Table the next set of rate ratios are for the distance to the river categories The findings are very similar to those for the adjacent block groups Here however we have two exposure proxy categories allowing comparison of the two These rate ratios show no evidence that living in the block groups closest to the river (lt12 mile to the river) produces greater risk than living in the block groups close to the river (12 to 1 mile from the river)

The next set of rate ratios are for models with additional adjustment for the demographics of each hospitalized personrsquos block group as well as distance to the nearest hospital As expected adjustment for block group median income and other demographics accounts for some of the variability in hospitalization counts and thereby reduces the magnitude of the rate ratios estimated for the river adjacency and distance categories The rate ratio elevation for total IHD for example is reduced from 133 to 107 (RR=107 CI 104-111) The elevated rate ratios in these more fully adjusted models for the adjacency analyses are of similarly small magnitude ranging from 106 for AMI to 110 for ischemic stroke Four of the six outcomes show statistically significant elevations Again only hemorrhagic stroke shows no evidence of an elevated rate ratio

For the river distance categories none of the rate ratios show any evidence that living in the block groups closest to the river (lt12 mile to the river) produces greater risk than living in the block groups close to the river (12 to 1 mile) Of the six types of outcomes only two show statistically significantly elevated rate ratios for living in the closest block groups (total stroke and ischemic stroke) For the close block groups four out of six outcomes show statistically significantly elevated rate ratios The statistically significantly elevated rate ratios vary from 106 (AMI stroke) to 113 (angina)

To address the strong evidence described earlier for confounding in these data ie the association of lower income with both CVD hospitalizations and residence closer to the river we also conducted the analyses separately for each quartile of income These results are presented in Table 9 The first results are from analyses adjusting only for age sex and race and the second set of results are from models including the additional demographic factors For simplicity rate ratios are shown only for block groups closest to the river compared as in the other models to block groups farther than one mile from the river (Results were similar for the other distance categories (data not shown)

For the two lowest income quartiles Table 9 shows results similar to those for the study population as a whole (Table 8a 8b) However for the two highest income quartiles which represent frac12 of the study population Table 9 shows a consistent pattern of reduced rate ratios for residence in the closest block groups within frac12 mile of the river In the highest income quartile

17

these reduced rate ratios are statistically significant for almost all of the outcomes in both the partially and more fully adjusted models In addition the magnitude of the rate ratio estimates ie their departure from 100 is relatively high For the population living in block groups with the highest median income (income gt$73000) living within frac12 mile of the river is associated with having about 70 of the risk for IHD AMI and stroke (CI=068 069 069) as the population in equally high income block groups farther than one mile from the river In other words among the highest income quartile living this close to the river is associated with an estimated 30 reduction in risk for these types of hospitalizations For angina (CI=028) there is an estimated 70 reduction in risk and for ischemic stroke (CI=058) the reduction is estimated to be 40 These stratified analyses show that in one-half of the study population the two higher income quartiles there is no evidence of elevated hospitalization risk for populations in block groups closer to the river On the contrary reduced rate ratios are shown for all outcomes in the higher income quartiles for residents of block groups within frac12 mile of the river compared to those living farther than one mile from the river

Weighted average rate ratio estimates are a standard method for estimating a general population finding from stratified analyses and are also provided in Table 9 These average rate ratios range from 078 to 102 showing no evidence of elevated risk for the overall population associated with residence within frac12 mile of the river compared to living farther than one mile from the river Results for the other categories of river proximity showed similar results (data not shown) (Given the evidence of confounding interfering with analyses of the entire study population and the opposing findings depending on income quartile no confidence intervals are provided here for the overall population-weighted average estimates)

Summary of Results

In summary the multivariable regression results for the entire study population (Tables 8a 8b) show statistically significantly elevated rate ratio estimates for the adjacent versus non-adjacent block groups for four out of six hospitalization diagnosis categories The analyses also produce statistically significantly elevated rate ratio estimates for block groups closest to the river (less than frac12 mile from the river) and close to the river (between 12 to 1 mile from the river) compared to distant block groups (farther than 1 mile from the river) for most of the outcome categories However the findings provide no evidence that the closest block groups have higher hospitalization rates than the close block groups

The most elevated rate ratio estimate is 113 (angina) indicating an estimated 13 percent increase in risk for angina hospitalizations for residents of close versus distant block groups The only outcome with no statistically significant elevations for living near (adjacent closest or close) the river in the more fully adjusted models is hemorrhagic stroke It is noteworthy that the outcome that is considered an indicator of inadequate primary care rather than of incidence or prevalence angina hospitalizations shows the highest elevations associated with residence near the river (Table 8a) In addition the hospitalization outcome with the least association with socioshyeconomic status in our data (Appendix B Table 1b) and the lowest out-of-hospital mortality (NYS DOH2009b) hemorrhagic stroke is the outcome that shows no pattern of elevations associated with residence near the river (Table 8b) This pattern of findings is consistent with the interpretation that the rate ratios for living closer to the river are reflecting the influence of

18

lower socio-economic status and other known risk factors for hospitalizations for CVD rather than any unusual environmental exposures in areas near the river

The stratified results add strong support to this conclusion They suggest that classical confounding of the exposure-disease relationship by socio-economic status produced biased results in the overall multivariable regression analyses While the regression analysis of the entire study population shows elevated hospitalization rates for residents of block groups near the river compared to residents living farther away (Table 8a and 8b) the stratified analyses (Table 9) show this association applies only to the 50 of residents living in block groups with lower median incomes For the other 50 of residents those who live in block groups with higher median incomes hospitalization rate ratios are reduced for those living near the river Classical confounding refers to the type of confounding that can occur in individual-level or group-level (ecological) studies when a factor such as socio-economic status affects both the exposure (residential location) and the health effect In this study of CVD hospitalizations there are disproportionate numbers of outcomes and exposures (residence near the river) among specific levels of the confounder the lower income quartiles (Tables 4 5 7) The disproportionate numbers of hospitalizations and potential exposures (block groups close to the river) among the lower income quartiles create an artifactual exaggerated ie biased effect on the estimates for the overall analysis (Greenland 1999 Greenland 2003)

5 DISCUSSION

The key finding of this investigation is that the results of the multivariable regression modeling for the entire study population the elevated CVD hospitalization rates for populations living closer to the Hudson River are likely to be artifacts of classical confounding Stratifying on the confounding variable ie holding median income level constant the standard method for controlling for confounding showed the decisive role of income as a confounding variable The associations between living closer to the river and having higher hospitalization rates were evident only for people in the two lowest income quartiles For people in the two highest income quartiles the association was completely opposite ie people living closer to the river had lower rates of hospitalizations This variability of effects can be interpreted as evidence that there are no actual effects of the exposure-proxy categories on hospitalization rates

This type of classical confounding due to the strong influence of a variable that precedes causes or strongly affects both the exposure and the health outcome can occur in any type of epidemiological study The ecologic ie group-level analysis in this study is not the primary cause of the confounding Rather the complex associations in the population among income education CVD risk geographic residential patterns and geographic assignment of exposure status are the basis for the confounding However the studyrsquos group-level design contributes to the studyrsquos inability to completely control for confounding using multivariable regression methods and contributes to difficulties of interpreting the study findings

The stratified results shown in Table 9 may also be a reflection of inadequate control for socioshyeconomic status using quartile groupings A possible explanation for the Table 9 findings is that among lower income quartile populations people living in closest proximity to the river may have the lowest incomes while in higher income quartile populations people living in closest

19

proximity to the river may have the highest incomes This potential income difference associated with river distance within quartile groupings would explain why the stratified analyses showed positive associations with hospitalization risk for lower income residents closest to the river and negative associations with hospital risk for higher income residents closest to the river In this example of a possible explanation it is assumed that river proximity categories are capturing relative income (and hospitalization risk) differences rather than exposure Because the exposure proxy river distance categories are assigned at the group level this type of analysis is not able to distinguish whether these categories are capturing income rather than potential exposure in the analyses

When trying to interpret this studyrsquos findings it is important to keep in mind the many factors associated with CVD risk that are not controlled for in this studyrsquos analyses These include tobacco use high blood pressure high serum cholesterol alcohol use obesity low fruit and vegetable intake physical inactivity diabetes mental ill-health psychosocial stress and the use of certain medications Another factor that is not known to be associated with CVD risk but that is relevant for PCB exposure is consumption of contaminated fish It is possible that lower income populations are more exposed to PCBs because they are more likely to eat fish from the Hudson River This however would not help explain why higher income people near the river showed lowered CVD hospitalization risk than their counterparts living farther from the river

In addition to the confounding due to socio-economic statusrsquos effects on both the risk of CVD and residential location (the exposure proxy) there is the possibility that another pattern of effects also associated with socio-economic status called effect modification plays a role in these data Effect modification occurs when different groups people with differing types of health insurance or differing behaviors for example show different patterns or levels of health effects associated with the same exposures The differing effects (CVD hospitalization risks) of residence in block groups closest to the river by income quartile could be interpreted as evidence of effect modification (Table 9) However the magnitude and the change in direction of effects is unlikely to be due solely to differing biological or behavioral factors associated with income level that affect the results of exposures from living near the river Given the strong associations of socio-economic status and CVD hospitalization risk it is more likely that the analyses stratified by income enhance protection against bias compared to the multivariable models and that the stratified analyses represent findings from better control for confounding as opposed to effect modification

Effect modification is a concept used to describe important relationships between exposures and effects (that may differ among sub-groups for example) that the researcher wants to identify and understand Confounding on the other hand refers to associations among the variables that the researcher is trying to control so that the effects of interest (effects from exposures rather than age or sex for example) can be identified and the strength of the effects can be estimated The ecological design of this study contributes to the difficulty of interpreting whether the study shows evidence of effect modification as opposed to confounding (Morgenstern 1998b Hertz-Picciotto 1998)

Another limitation associated with this studyrsquos design is that block-group level socio-economic and environmental attributes that are assigned to individuals in this type of study can be more

20

strongly correlated at the group level than at the individual level In this way the ecological assignment of variables may lead to increasing problems of confounding because of strong associations among indicators of socio-economic status and environmental exposure categories (Morganstern 1998a) This problem is likely to have affected this studyrsquos results As with effect modification by group described above this type of confounding by group cannot be evaluated with ecologic data because individual-level information is needed to check for this type of ldquocross-levelrdquo bias

Ecological analyses are an important tool for environmental epidemiology and there is therefore a large and growing literature on their strengths and limitations and methods to address limitations (Piantadosi 1988 Greenland 1989 Greenland 1992 Greenland 1994 Richardson 2000) One particular concern is the interpretation of ecological study findings showing modest elevations of risk such as in the current study with rate ratios in the more fully adjusted models generally estimated to be below 110 (Wakefield 2003 Morgenstern 1998) Wakefield states that slight elevations need to be interpreted with great caution

in particular for cancer and heart disease (because) there are risk factors that are far more predictive of disease than environmental factors hellip Consequently the potential for confounding is strong since ecological studies do not directly use individual-level risk factor data although stratification by age and gender is routinely carried out (Frequently an area-level measure of socio-economic status is also used as a confounding variable but being an aggregate summary it only provides very crude control) (Wakefield 2003 p 9-10)

A strength of this study is that it used Census block groups which are relatively small average population of 1200 in the study area to assign socio-economic characteristics rather than the larger ZIP codes with average population size 8000 in this area By using block groups this study sought to reduce confounding by socio-economic status because the assignment of median income and other variables at the block group level was expected to be more accurate with block group populations expected to be more homogenous than larger and more geographically diffuse ZIP code populations used in the prior studies

The use of census block groups rather than ZIP codes likely enhanced the accuracy of the group-level category assignments However this type of design that assigns characteristics to hospitalizations using group-level characteristics and distance to the river is limited in comparison to individual-level designs that gather such information from personal interviews for example Most importantly for this investigation the group-level assignment of median income and other demographic variables to each hospitalization limited the studyrsquos ability to distinguish among individuals within the exposure proxy categories and thereby reduced the studyrsquos ability to control for confounding

This type of measurement error or inaccuracy from assigning group-level potential exposure income or educational level to the individual hospitalizations is generally assumed to bias a study towards findings of no associations or no effects But in situations where the measurement error is more severe for some variables than others and these variables are correlated then the study findings can be biased either towards exaggerated effects or no effect In this

21

investigation potential measurement error associated with the assignment of exposure categories based on block group centroid distance to the river is expected to be relatively low because a large proportion of the hospitalization addresses are very clearly in areas distant from the river The measurement error associated with the assignment of group-level demographic characteristics (income and education for example) from the Census is potentially relatively higher because this type of error can occur throughout every block group in the study

Another strength of this study is the use of NYS SPARCS data a relatively comprehensive population-based source of information about hospitalizations These data include individual-level information used in this study for patient address sex age race and diagnosis Of these variables only the address and race categories were missing information for substantial numbers of records and this limitation could have contributed to imprecision sufficient to create bias in the findings These issues are discussed in greater detail below

While geocoding to block groups represents a strength of this study this procedure required high quality address information One percent of the study arearsquos hospitalization records contained no address and eight percent of the addresses could not be accurately geocoded to a block group resulting in nine percent of hospitalizations being eliminated from the study because of address andor geocoding issues The distributions of race categories and distribution among the types of diagnoses did not change after dropping records with no or insufficient address information suggesting that the dropped records did not represent a substantially skewed group of records

Inaccuracy in the studyrsquos assignments of race categories results from limitations associated with both the hospitalization and Census data The SPARCS hospitalization database did not include information on race for seven percent of the studyrsquos hospitalizations We imputed race by assigning race category randomly based on the race distribution of the personrsquos block group This method would have biased the study toward a finding of no effect of race on hospitalization risk

Another limitation of the SPARCS data used in this investigation is that repeat hospitalizations for the same individual were not able to be identified If an individual was admitted to the hospital many times over the course of the study period then that individual would be counted multiple times This limitation may have contributed to an exaggerated association of lower socio-economic status and CVD hospitalizations because some people with lower incomes receive less adequate outpatient care and may have more repeat admissions Because of the association of lower income with residence near the river this limitation may have contributed to the difficulty of adequately controlling for confounding by socio-economic status in this investigation

While the use of Census data at the block group level rather than at the ZIP code level is a strength in terms of relative homogeneity among the population within block groups it is possible that block group population estimation errors contributed to imprecise estimation of hospitalization rates Particularly with regard to the race categories the study finding that the ldquootherrdquo category showed a counter-trend of higher income and higher CVD hospitalization rates (Appendix B Table 2) is evidence that differences between the categories used in the Census race designations and hospitalization race assignments affected the estimated hospitalization

22

rates The evolving choices for identifying onersquos race in the Census are not concurrently reflected in the SPARCS data and these inconsistencies likely contributed to imprecision in the estimates of CVD hospitalization risk associated with specific race categories These issues did not contribute to bias for the overall findings however We modeled the data with a variety of race category groupings combining black and ldquootherrdquo for example and the overall results did not change (data not shown)

Also regarding the Census undercounting of approximately two percent for the US population in 2000 was estimated to be primarily among the lowest socio-economic groups and minorities while over-counting of one percent was estimated to have occurred among the most wealthy who sometimes received and completed forms at multiple residences (United States Censusrsquo Monitoring Board 2001) The undercounting of the lower income population would result in over-estimation of rates in this population Undercounting of recent Asian immigrants for example particularly those of lower socio-economic status is a potential reason for the relatively high rates of CVD hospitalizations estimated for lower income quartiles of the Asian population (Figures 4-6) Models that grouped the Asian population with other race categories were also estimated and these showed the same results as the models reported here (data not shown) Undercounting of lower income groups in general would have contributed to the finding of higher hospitalization rates in these groups and added to the difficulty of adequately controlling for confounding of lower income with residence near the river

This study used a 50 percent sample of the CVD hospitalizations in the study area This was accomplished by using every other record provided in the complete dataset This would be expected to provide a non-biased random sample As a general check for data provision management or analysis errors we compared our estimates of hospitalization rates per population with those from other sources Table 10 shows these comparisons and shows that our estimates are generally consistent with rates from other sources

Actual exposures were not measured in this investigation Rather proximity to the river was used as a proxy for potential exposures associated with the river In addition other potential exposures that may impact CVD were not evaluated or measured Exposures to lead and air pollution would be expected to be higher in more urban areas and areas near heavy traffic which may tend to coincide with proximity to the river These issues were not addressed in this investigation If these exposures were occurring more often near the river and contributing to cardiovascular hospitalization rates this would contribute to findings of elevations in areas closer to the river

It is important to keep in mind the limitations associated with using CVD hospitalizations when incidence andor prevalence of CVD are the actual outcomes of interest As described previously mortality prior to reaching the hospital may reduce hospitalization rates particularly for AMI and stroke for some groups or areas more than others and introduce bias if hospitalizations are being interpreted as a surrogate for incidence or prevalence

23

Addressing Study Objectives

This follow-up report has addressed the studyrsquos first two objectives to increase control for confounding by socio-economic status through the use of block groups rather than ZIP codes as the unit of analysis and to compare the hospitalization rates among residents of block groups nearest the Hudson River to those among residents of block groups farther from the Hudson River Regarding the first objective the use of smaller units of analysis block groups rather than ZIP codes improves the accuracy of the assignment of income and other group-level demographic categories to the hospitalizations

Regarding the studyrsquos second objective to compare hospitalization rates among residents of block groups near to the river to hospitalization rates among residents far from the river this studyrsquos multivariable analyses showed relatively small but statistically significant elevations of CVD hospitalization risk for residents of block groups in close proximity to the Hudson River Stratified analyses that controlled for confounding directly by conducting analyses within each income quartile showed that residence in block groups near the Hudson River was associated with increased risk for CVD hospitalizations among residents of lower income block groups and conversely residence in block groups near the Hudson River was associated with decreased risk for CVD hospitalizations among residents of higher income block groups The estimated increased and decreased risks were of sufficient magnitude to nearly cancel each other out when average rate ratios for the four income quartiles were calculated The average rate ratios may be interpreted as showing no evidence that CVD hospitalization rates are elevated for residents living near the river for the general study population This finding suggests that the multivariable regression findings for the overall population show elevated risks for hospitalizations among residents of block groups near the river due to confounding of the relationship of residential location and CVD risk by socio-economic status

The findings from analyses stratified by income quartile may also be interpreted as showing that residing in a higher versus lower income quartile block group produces different effects from similar exposures (effect modification) As discussed previously the ecological ie group-level analyses are unable to directly evaluate confounding and effect modification Rather information about the strengths and direction of associations among variables in the analyses need to be evaluated in light of known risk factors to draw conclusions about likely explanations and plausible interpretations In this case uncontrolled confounding by socio-economic status appears to be the most likely explanation for the findings This is due to the strong association between residential proximity to the river and socio-economic status

The studyrsquos third objective was to compare the findings of this review to those of the earlier studies that prompted this investigation Because of the similarities in study design and statistical analysis the prior studyrsquos findings can be compared with the findings of the multivariate regression analyses in the current study using Table 1 and Tables 8a and 8b The major difference between the studies is the composition of the comparison areas for the analyses Because of this difference as well as other differences (Table 3) the studies are not entirely comparable

24

Table 1 shows some findings from the prior studies Comparison with Table 8a and 8b from this study shows that the CV hospitalization rate ratios estimated for living in block groups adjacent or near the river in the current study are substantially lower than the rate ratios for residence in ZIP codes adjacent to the Hudson River in the previously published studies In most cases the current analyses show elevations below 10 in contrast to elevations of 20 to 39 in the prior studies The results of the current study are quite similar to the prior study results in terms of the associations reported for age sex race and income (Appendix B Tables 1a and 1b) (Sergeev and Carpenter 2005 Tables 4 amp 5 Shcherbatykh et al 2005 Table 3) These similarities and the general methodological similarities between the current and prior studies suggest the likelihood that the prior studiesrsquo findings also were affected by confounding by socio-economic status

6 CONCLUSION AND RECOMMENDATIONS

While this type of study can neither prove nor disprove causal links among risk factors and health effects it is useful for examining evidence for unusual patterns that might warrant additional investigation This evaluation of hospitalization data residential location and socioshyeconomic factors in block groups in the 12 Counties abutting the Hudson River revealed a closely coinciding pattern of lower socio-economic status and residential proximity to the Hudson River the proxy for potential exposure and a strong association of lower socioshyeconomic status and risk for CVD hospitalizations The strong associations among residential location socio-economic status and CVD risk prevent us from drawing definitive conclusions from the multivariable regression analyses

The multivariable analyses showed relatively small but statistically significant elevations of CVD hospitalization risk for residents of block groups in close proximity to the Hudson River Stratified analyses conducted within each of the four income quartiles showed such elevations only among residents of lower income block groups This studyrsquos data methods and analyses can not provide a conclusive interpretation for this finding However the strong effect of socioshyeconomic status on residential location (exposure proxy) and on CVD hospitalization risk makes it likely that the overall multivariable regression results are affected by uncontrolled confounding by socio-economic status

To draw more definitive conclusions particularly for outcomes such as CVD that are determined by multiple known risk factors studies seeking to investigate the role of PCB or POP exposures will require resource-intensive methods that include gathering individual-level exposure or biomonitoring information as well as individual-level medical histories and information on other risk factors

This current investigation did not include environmental or biological sampling data One study described previously did gather such data in the Hudson River region with the highest potential exposures (Hudson Falls Fort Edward) This study detected a statistically significant reduction in the total PCB concentrations in outdoor air samples at a distance of 1200 meters (34 mile) from the river However the PCB levels in outdoor air in this area were lower than those in other communities with known PCB-contaminated sites and similar to levels reported in other locations in the northeastern United States The study found no detectable differences in study

25

subjectsrsquo serum PCB levels associated with proximity to contaminated portions of the river or other PCB sources (Fitzgerald 2007 Palmer 2008)

It is recommended that the results of the current investigation of CVD hospitalizations and residence near the Hudson River be shared with NYS residents and other stakeholders with interest in this issue and be submitted for publication in a peer-reviewed journal

26

AGENCY INFORMATION

New York State Department of Health Authors Elizabeth Lewis-Michl PhD

Research Scientist Bureau of Environmental and Occupational Epidemiology

June Beckman-Moore MPH Research Scientist Bureau of Environmental and Occupational Epidemiology

Marta Gomez MS Research Scientist Bureau of Environmental and Occupational Epidemiology

Acknowledgments This study was supported by ATSDR and the Centers for Disease Control and Prevention

through a Cooperative Agreement Grant to NYS DOH (3U61TS200002-19W1)

The authors would like to thank the following people for their contribution to this project Alissa Caton PhD of the Bureau of Environmental and Occupational Epidemiology NYS DOH developed the study protocol assembled the preliminary data and conducted preliminary analyses Syni-an Hwang PhD Bureau Director and Shao Lin PhD Research Scientist Bureau of Environmental and Occupational Epidemiology NYS DOH and Edward Fitzgerald PhD Professor Division of Environmental Health and Toxicology SUNY Albany School of Public Health reviewed and provided comments on draft versions

New York State Department of Health Cooperative Agreement Coordinator Don Miles MS

Public Health Specialist Bureau of Environmental Exposure Investigation

ATSDR Technical Project Officer Gregory V Ulirsch PhD

Environmental Health Scientist Cooperative Agreement and Program Evaluation Branch

Division of Health Assessment and Consultation

Mohammed Uddin MD Epidemiologist

Division of Health Studies

ATSDR Regional Representatives Leah Graziano

Senior Regional Representative - Region 2 Office of Regional Operations

Racquel Stephenson Regional Representative - Region 2

Office of Regional Operations

27

REFERENCES Agency for Toxic Substances and Disease Registry (ATSDR) 2000 Toxicological Profile for Polychlorinated Biphenyls (PCBs) ATSDR Division of ToxicologyToxicology Information Branch Atlanta GA

Arisawa K Takeda H Mikasa H 2005 ldquoBackground exposure to PCDDsPCDFsPCBs and its potential health effects a review of epidemiologic studiesrdquo The Journal of Medical Investigation Feb 52(1-2)10-21

Arsenescu V Arsenescu RI King V Swanson H Cassis LA 2008 ldquoPolychlorinated biphenylshy77 induces adipocyte differentiation and proinflammatory adipokines and promotes obesity and atherosclerosisrdquo Environmental Health Perspectives 116 (6) 761-768

Ayala C Croft J Keenan N Neff L Greenlund K Donohoo R Zheng Zhi Mensah G 2003 ldquoIncreasing Trends in Pre-transport stroke deaths- United States 1990-1998rdquo Ethnicity amp Disease 12 Spr S2131-S2137

Barnett E Armstrong D Casper M 1999 Evidence of increasing coronary heart disease mortality among black men of lower social class AEP 98 464-471

Bertazzi PA Bernucci I Brambilla G Consonni D Pesatori AC 1998 ldquoThe Seveso studies on early and long-term effects of dioxin exposure a reviewrdquo Environmental Health Perspectives April 1062 625-633

Carpenter DO 2006 ldquoPolychlorinated biphenyls (PCBs) routes of exposure and effects on human healthrdquo Reviews in Environmental Health Jan-Mar 21(1)1-23

deVreede-Swagemakers JJ Gorgels APDubois-Arbouw WI van Ree JW et al 1997 ldquoOut-of hospital cardiac arrest in the 1990s a population-based study in the Maastricht area on incidence characteristics and survivalrdquo Journal of the American College of Cardiology 30(6) 1500-5

Everett CJ Mainous AG Frithsen IL Player MS Matheson EM 2008 ldquoAssociation of polychlorinated biphenyls with hypertension in the 1999-2002 National Health and Nutrition Examination Surveyrdquo Environmental Research 108 94-97

Fitzgerald EF Belanger EE Gomez MI Hwang S Jansing RL Hicks HE 2007 ldquoEnvironmental exposures to polychlorinated biphenyls (PCBS) among older residents of upper Hudson River communitiesrdquo Environmental Research 104 352-360

Galea S Blaney S Nandi A Silverman R Vlahov D Foltin G Kusick M Tunik M Richmond N 2007 ldquoExplaining racial disparities in incidence of and survival from out-of-hospital cardiac arrestrdquo American Journal of Epidemiology 1665 534-543

Greenland S Morgenstern H 1989 ldquoEcological bias confounding and effect modification International Journal of Epidemiology 18269-74

28

Greenland S 1992 ldquoDivergent biases in ecologic and individual level studiesrdquo Statistics in Medicine 111209-23

Greenland S Robins J 1994 ldquoInvited commentary ecologic studies ndash biases misconceptions and counterexamples American Journal of Epidemiology 139747-60

Greenland S Pearl J Robins JM 1999 Causal diagrams for epidemiologic research Epidemiology 1037-48

Greenland S 2003 ldquoQuantifying biases in causal models classical confounding vs collidershystratification biasrdquo Epidemiology May Vol 143 300-306

Ha M Lee D Jacobs DR 2007 ldquoAssociation between serum concentrations of persistent organic pollutants and self-reported cardiovascular disease prevalence results from the national health and nutrition examination survey 1999-2002rdquo Environmental Health Perspectives 1158 1204-1209

Hennig B Meerarani P Slim R Toborek M Daugherty A Silverstone AE Robertson LW 2002 ldquoProinflammatory properties of coplanar PCBs in vitro and in vivo evidence Toxicology and Applied Pharmacology Jun 15181(3)174-83

Hertz-Picciotto I ldquoEnvironmental Epidemiologyrdquo in Modern Epidemiology Rothman KJ and Greenland S 2nd ed Philadelphia Lippincott Williams amp Williams 1998 pp555-584

Howard G Cushman M Prineas R Howard V Moy C Sullivan L DrsquoAgostino R McClure L Pulley L Safford M 2009 ldquoAdvancing the hypothesis that geographic variations in risk factors contribute relatively little to observbed geographic variations in heart disease and stroke mortalityrdquo Preventive Medicine 49 129-132

Huang X Lessner L Carpenter DO 2006 ldquoExposure to persistent organic pollutants and hypertensive diseaserdquo Environmental Research 102(2005) 101-106

Kannel WB Cupples LA DrsquoAgostino RB 1987 ldquoSudden death risk in overt coronary heart disease the Framingham Studyrdquo American Heart Journal 113(3) 799-804

Ko DT Krumholz HM Wang Y Foody JM Masoudi FA Havranek EP You JJ Alter DA Stukel TA Newman AM Tu JV 2007 ldquoRegional differences in process of care and outcomes for older acute myocardial infarction patients in the United States and Ontario Canadardquo Circulation 115196-203

Lee D Lee I Song K Steffes M Toscano W Baker BA Jacobs DR 2006 ldquoA strong dose-response relation between serum concentrations of persistent organic pollutant and diabetesrdquo Diabetes Care 29 (7) 1638-1644

Lee D Lee I Porta M Steffes M Jacobs DR 2007 ldquoRelationship between serum concentrations of persistent organic pollutants and the prevalence of metabolic syndrome among non-diabetic

29

adults results from the National Health and Nutrition Examination Survey 1999-2002rdquo Diabetologia 50 1841-1851

Lind PM Orberg J Edlund U Sjoblom L Lind L 2004 ldquoThe dioxin-like pollutant PCB126 (33rsquo44rsquo5-pentachlorobiphenyl) affects risk factors for cardiovascular disease in female ratsrdquo Toxicology Letters 150 293-299

Longnecker MP Klebanoff MA Brock JW Zhou H 2001 ldquopolychlorinated biphenyl seum levels in pregnant subjects with diabetesrdquo Diabetes Care 24(6) 1099-1101

Lloyd-Jones D Adams R Carnethon et al (AHA Statistics Committee and Stroke Statstics Subcommittee) 2009 ldquoAHA Statistical Update Heart disease and stroke statisticsmdash2009 updaterdquo Circulation 119e21-e181

MapMarker Plus V 100 MapInfo Corp 2004

Morgenstern H 1998 ldquoEcologic studyrdquo In Armitage P Colton T eds Encyclopedia of biostatistics Vol2 NY NY John Wiley and Sons Inc 1255-76

Morganstern H ldquoEcologic Studiesrdquo Chapter 23 in Modern Epidemiology Rothman KJ and Greenland S 2nd ed Philadelphia Lippincott Williams amp Williams 1998 pp459-480

Morland K Wolff M Bopp R Godbold J Landrigan P 2008 ldquoFish consumption and body burden of organochlorines among Lower Hudson urban anglersrdquo American Journal of Industrial Medicine 51587-594

New York State Department of Health (NYS DOH) 2009a ldquoThe Burden of Cardiovascular Disease in New York Mortality Prevalence Risk Factors Costs and Selected Populationsrdquo Bureau of Chronic Disease Epidemiology and Surveillance Bureau of Health Risk Reduction httpwwwhealthstatenyusnysdohchronic_diseasecardiovascularburdenofcvdinnyspdf

New York State Department of Health (NYS DOH) 2009b ldquoHeart Disease and Stroke Indicatorsrdquo Community health assessment indicators Bureau of Biometrics and Health Statistics httpwwwhealthstatenyusstatisticschacchai

OrsquoNeill L 2003 ldquoEstimating out-of-hospital mortality due to myocardial infarctionrdquo Health Care Management Science 6 147-154

Palmer PM Belanger EE Wilson LR Hwang SA Narang RS Gomez MI Cayo MR Durocher LA Fitzgerald EF ldquoOutdoor air PCB concentrations in three communities along the Upper Hudson River New Yorkrdquo 2008 Archives of Environmental Contamination and Toxicology 54363-371

Piantadosi S Byar DP Green SB 1988 ldquoThe ecological fallacyrdquo American Journal of Epidemiology 127893-904

30

Rehkopf DH Berkman LF Coull B Krieger N 2008 ldquoThe non-linear risk of mortality by income level in a healthy population UgtSgt National Health and Nutrition Examination Survey mortality follow-up cohort 1988-2001rdquo British Medical Journal Public Health 8383

Richardson S Montifort C 2000 ldquoEcological correlation studies In Elliott P Wakefield JC Best NG et al eds Spatial epidemiology methods and applications NY NY Oxford University Press 205-220

SAS version 9 Cary North Carolina

Schoenbach V 2001 ldquoMulticausality Effect Modification wwwepidemiolognet Ch12 381shy413

Schoenbach V 2004 ldquoMulticausality Confounding wwwepidemiolognet Ch11 335-375

Sergeev AV and Carpenter DO 2005 Hospitalization rates for coronary heart disease in relation to residence near areas contaminated with persistent organic pollutants and other pollutants Environmental Health Perspectives 113(6)756-761

Shcherbatykh I Huang X Lessner L Carpenter DO 2005 Hazardous waste sites and stroke in New York State Environmental Health A Global Access Science Source 4(18) httpwwwehjournalnetcontent4118

Siemiatycki J Wacholder S Dewar R Cardis E Greenwood C and Richardson L 1988 ldquoDegree of confounding bias related to smoking ethnic group and socio-economic status in estimates of the associations between occupation and cancerrdquo Journal of Occupational Medicine 30 617-625

Tu YK Gunnell D Gilthorpe MS 2008 ldquoSimpsonrsquos paradox Lordrsquos paradox and suppression effects are the same phenomenon- the reversal paradoxrdquo Emerging Themes in Epidemiology 52

United Kingdom Heart Attack Study Collaborative Group Norris RM 1998 ldquoFatality outside hospital from acute coronary event in three British health districts 1994-1995rdquo British Medical Journal 316(7137) 1065-70

United States Bureau of the Census 2000 Census of population and housing summary file 1 (SF1) United States Department of Commerce 2001

United States Bureau of the Census 2000 Census of population and housing summary file 3 (SF3) United States Department of Commerce 2002

United States Bureau of the Census 1990 Census of population and housing summary file 1 (SF1) United States Department of Commerce 1991

31

United States Bureau of the Census 1990 Census of population and housing summary file 3 (SF3) United States Department of Commerce 1992

United States Censusrsquo Monitoring Board ldquoFinal Report to Congressrdquo September 1 2001 httpgovinfolibraryunteducmbcmbpreportsfinal)report

United States Department of Health and Human Services (USDHHS) Agency for Healthcare Research and Quality 2007 ldquoAHRQ Quality Indicators Guide to Prevention Quality Indicators Hospital Admission for Ambulatory Care Sensitive Conditionsrdquo October 2001 Version 31 (March 12 2007) downloaded 2009 httpwwwqualityindicatorsahrqgov

United States Department of Health and Human Services (USDHHS) Centers for Disease Control 2007 Health United States A Chartbook National Center for Health Statistics wwwcdcgovnchsdatahushus07pdf

United States Department of Health and Human Services (USDHHS) Centers for Disease Control National Center for Health Statistics 2007 ldquo2005 National Hospital Discharge Surveyrdquo No 385 by De Frances C and Hall M Division of Health Care Statistics

United States Department of Health and Human Services (USDHHS) 2009 Centers for Disease Control and Prevention National Center for Environmental Health 4th National Report on Environmental Chemicals httpwwwcdcgovexposurereportpdfFourthReportpdf

United States Department of Health and Human Services (USDHHS) National Institutes of Health 2010 National Institute of Environmental Health Sciences National Toxicology Program 11th Report on Carcinogens httpntpniehsnihgovntproceleventhprofiless149pcbpdf

United States Environmental Protection Agency (USEPA) 2002 EPA Superfund record of decision Hudson River PCBs Site Hudson Falls to New York City New York Washington DC

Wakefield J 2003 Sensitivity Analyses for Ecological Regression Biometrics March 59 9-17

Wakefield J 2004

Wing S 1988 Social inequalities in the decline of coronary mortality American Journal of Public Health 781415-1416

World Health Organization 2010 Atlas of Heart Disease and Stroke Part 2 Ch 3 Cardiovascular Disease Risk Factors httpwwwwhointcardiovascular_diseasesencvd_atlas_03_risk_factorspdf

32

Table 1 Previous Study Results Statewide Analyses and Hudson River Specific Analyses

Principal Diagnosis of Hospitalization

Adjusted Rate Ratioa (Lower 95CI shyUpper 95 CI)

NYS analysis Hudson River Subset

analysis Ischemic Heart Diseaseb

115 (103-129) 136 (118-156) Acute Myocardial Infarction 120 (103-139) 139 (118-163)

Cerebrovascular Diseasec 115 (105-126) 120 (110-132)

Ischemic Stroke 117(104-139) --shy Hemorrhagic Stroke 110 (099-122) ---

Hypertensionc 119 (108-131) 114 (105-123)

Table adapted from Sergeev et al 2005 Shcherbatykh et al 2005 and Huang et al 2006 Ischemic heart disease and stroke hospitalization rates for NYS ZIP codes containing (or

adjacent to) inactive hazardous waste sites with persistent organic pollutants compared to ZIP codes without (or not adjacent to) such sites

a All analyses include a 2nd exposure category for other types of inactive hazardous waste sites and adjust for age race and sex

b Ischemic heart disease analyses include all ages over 25 Ischemic heart disease analyses adjust for quartiles of median household income and health insurance coverage

c Cerebrovascular disease and hypertension analyses include ages 25-64

33

Table 2 Comparison of Design amp Methods for Previously Published Studies and Current Follow-up Study

OUTCOME ASSESSMENT CONFOUNDER ASSESSMENT EXPOSURE ASSESSMENT Previous Study Follow-up Study Previous Study Follow-up Study Previous Study Follow-up Study

Coronary Heart Disease 1993-2000 More years 1990-2005 ZIP code level Block group level ZIP code level Block group level

Ischemic heart disease (ICD-9 410shy Same outcomes with 4 levels of income ndash Same income Exposure = residence Two types of exposure 414) and sequelae of myocardial one addition separate 2000 Census categories within 196 ZIP codes indicator categories within infarction not elsewhere classified analysis of angina (ICDshy quartiles with hazardous waste 12 counties abutting (ICD-9 4297) Separate analysis of 9 413) sites containing POPs Hudson River acute myocardial infarction 6 age groups Same age groups or 222 ZIP codes with (ICD-9 410) age 25 and above hazardous waste sites Adjacency 255 block

without POPs versus groups adjacent versus Principal or other diagnoses (up to 14) Only principal diagnoses 4 race groups Different race residence within 996 2116 not adjacent to the combined AsianPacific categories White ldquocleanrdquo ZIP codes Hudson River

Use the admission Islander White black AsianPacific statewide Reference Sergeev AV Carpenter source field to eliminate black Native Island and Other which Distance Block group DO Hospitalization rates for coronary intrahospital transfers for American includes Native Sub-analyses (focus of population-weighted heart disease in relation to residence the same individual American and Multiple follow-up) Hudson centroid distance from the near areas contaminated with Race River area 78 POP ZIP Hudson River lt12 mile persistent organic pollutants and other codes versus 996 ldquocleanrdquo (225 block groups) pollutants Environ Health Perspect Add education ZIP codes statewide between frac12 and 1 mile 2005 113(6)756-761 population density (216 block groups) farther

hispanic and distance to than 1 mile (1930 block nearest hospital as groups) additional covariates

Stroke 1993-2000 More years 1990-2005 Restricted to 2nd and (same as above) (same as above) (same as above) 3rd quartile income

Cerebrovascular disease excluding Same outcomes ZIP codes other ill-defined and late effects of cerebrovascular disease (ICD9 430shy 4 age groups 25-64 436) Separate analyses of ischemic stroke (433x1 434x1 436) and 2 race groups hemorrhagic stroke ( 430-432) White black

Principal or secondary diagnoses (up Only principal diagnoses to 14) combined

Use the admission Reference Shcherbatykh I Huang X source field to eliminate Lessner L Carpenter DO Hazardous intrahospital transfers for waste sites and stroke in New York the same individual State Environ Health 2005 418

34

Table 3 Study Cardiovascular Disease Outcomes ICD-9-CM Codes and Definitions

Ischemic Heart Disease

ICD-9-CM 410 - ldquoAcute myocardial infarction (AMI)rdquo is defined as a sudden insufficiency of blood supply to an area of the heart muscle usually due to a coronary artery occlusion (obstruction) commonly known as heart attack

ICD-9-CM 411- ldquoOther acute and subacute forms of ischemic heart diseaserdquo refers to complications or symptoms occurring before or after AMI or a coronary artery occlusion (obstruction) interrupting blood flow to the heart without AMI

ICD-9-CM 412 - ldquoOld myocardial infarctionrdquo refers to a past AMI that is currently presenting no symptoms ICD-9-CM 413 - ldquoAngina pectorisrdquo is defined as severe constricting pain in the chest often radiating from the

area immediately over the heart to the left shoulder and down the arm ICD-9-CM 414 - ldquoOther forms of chronic ischemic heart diseaserdquo includes coronary atherosclerosis a chronic

condition marked by thickening and loss of elasticity of the coronary artery caused by deposits of plaque

ICD-9-CM 4297 - ldquoCertain sequelae of myocardial infarction not elsewhere classifiedrdquo refers to other symptoms or complications associated with a prior AMI

Cerebrovascular Disease

ICD-9-CM 430 - ldquoSubarachnoid hemorrhagerdquo is bleeding in the space between the brain and lining ICD-9-CM 431 - ldquoIntracerebral hemorrhagerdquo is bleeding within the brain ICD-9-CM 432 - ldquoOther and unspecified intracranial hemorrhagerdquo is bleeding nontraumatic between the skull

and brain lining ICD-9-CM 433 - ldquoOcclusion and stenosis of precerebral arteriesrdquo is blockage or stricture of the arteries

branching into the brain ICD-9-CM 434 - ldquoOcclusion of cerebral arteriesrdquo includes cerebral thrombosis and cerebral embolism ICD-9-CM 435 - ldquoTransient cerebral ischemiardquo includes cerebrovascular insufficiency (acute) with transient

neurological signs and symptoms ICD-9-CM 436 - ldquoAcute but ill-defined cerebrovascular diseaserdquo

35

Table 4 ndash Census Demographics for Study Area by Proximity to the Hudson River 2000

Census Demographics Study Area Within Block Groups Adjacent to River Block Group Centroid Distance to River Yes No lt frac12 mile frac12-1 mile gt 1 mile

n n n n n n Total Population and distribution within river distance categories

2924631 1000 315662 108 2608969 892 231414 79 283321 97 2409896 824

RaceEthnic Distribution 1000 1000 1000 1000 1000 1000 White 2383274 815 255023 808 2128251 816 154371 667 201457 711 2027446 841 Black 277302 95 34039 108 243263 93 43895 190 38959 138 194448 81 Total ldquoOtherrdquo (listed below)

American Indian 7147 02 882 03 6265 02 975 04 1058 04 5114 02

Asian 86465 30 6233 20 80232 31 4711 20 8979 32 72775 30 Pacific Islander 1058 lt01 111 lt01 947 lt01 104 lt01 145 lt01 809 lt01 Other 105182 36 12492 40 92690 36 19288 83 23338 82 62556 26 Multiple Races 64203 22 6882 22 57321 22 8070 35 9385 33 46748 19

Hispanic 268036 92 28449 90 239587 92 40691 176 46978 166 180367 75

Education lt9th grade 108312 56 13370 64 94942 55 13585 91 14518 81 80209 50 9th-12th grade 200374 103 27023 130 173351 100 23306 155 25161 140 151907 94 High school or GED 519236 267 62727 302 456509 263 44366 296 51585 287 423285 263 Some college 336283 173 36464 175 299819 173 24995 167 31141 173 280147 174 Associates degree 150134 77 16574 80 133560 77 10134 68 12939 72 127061 79 Bachelors degree 342297 176 28578 137 313719 181 18898 126 24584 137 298815 185 Graduate or professional degree

285823 147 23199 112 262624 151 14813 99 19909 111 251101 156

+Other SES Factors Households below poverty 91940 87 12875 108 79065 83 13977 154 14685 140 63278 72

mean SD mean SD mean SD mean SD mean SD mean SD Population density 57466 9589 54496 11391 57826 93468 13501 16071 11068 16248 43763 66909 Median household income 61010 31989 47320 23630 62668 32468 40748 21923 44962 20829 64847 32564

Sources US Bureau of the Census 2000 Census of population and housing summary file 1(SF1) US Department of Commerce 2001 US Bureau of the Census 2000 Census of population and housing summary file 3 (SF3) US Department of Commerce 2001

36

Table 5 Study Population Person Years by River Proximity Category For Income Quartiles and Race Categories

Person Years Total Adjacent Non-Adjacent lt12 mile frac12 to 1 mile gt1 mile n (row)

[column ] n (row)

[column ] n ()

[column ] n ()

[column ] n ()

[column ] n ()

[column ] Total Population 30219001 (1000)

[1000]

3269547 (108)

[1000]

26949454 (892)

[1000]

2380477 (079)

[1000]

2834956 (094)

[1000]

25003568 (827)

[1000]

Income Quartile

Lowest Income 7182853 (1000)

[237]

1345251 (187)

[411]

5837601 (813)

[217]

1462710 (204)

[614]

1236936 (172)

[436]

4483206 (624)

[179]

2nd Income 7778921 (1000)

[257]

1077888 (139)

[330]

6701033 (861

[249]

420361 (054)

[177]

808782 (104)

[285]

6549777 (842)

[262]

3rd Income 7671943 (1000)

[254]

551617 (072)

[169]

7120326 (928)

[264]

344840 (045)

[145]

511237 (067)

[180]

6815866 (888)

[273]

Highest Income 7585283 (1000)

[251]

294790 (039)

[090]

7290493 (961)

[271]

152565 (020)

[064]

278000 (037)

[098]

7154718 (943)

[286]

Race Category

White 25859811 (1000)

[856]

2784914 (108)

[851]

23074897 (892)

[856]

1761910 (68)

[740]

2238348 (87)

[790]

21859554 (845)

[874]

Black 2466185 (1000

[82]

297220 (121)

[91]

2168965 (879)

[81]

389721 (158)

[164]

316704 (128)

[112]

1759760 (714)

[70]

Asian 792737 (1000)

[26]

58007 (73)

[18]

734731 (827)

[27]

45151 (57)

[19]

78161 (99)

[28]

669426 (844)

[27]

Multiple Other 1100269 (1000)

[36]

129407 (118)

[40]

970862 (882)

[36]

183696 (167)

[77]

201743 (183)

[71]

714830 (650)

[29]

37

Table 6 Study Hospitalization Record Exclusions

After After dropping After imputing dropping records with

After excluding race amp poorly missing continuation After excluding eliminating geocoded covariates

Geocoded records transfers missing race records (racesex)

Total 202272 201814 02 169143 162 167733 08 154236 80 154220 001 (Loss ) Demograp hic group

White 155216 849 154805 849 132749 859 144262 860 132343 858 132332 858 Black 12878 70 12849 70 11109 72 12010 72 11328 73 11326 73 OtherNa 12745 70 12741 70 8814 57 9313 55 8647 56 8644 56 AsianPI 2021 11 2020 11 1810 12 2148 13 1918 12 1918 12 Missing Race 19412 96 19399 96 14661 87 (1410) Diagnosis

IHD 137569 680 137303 680 111005 656 110005 656 110000 656 101012 655

AMI 51840 256 51635 256 39338 233 39338 233 39013 233 35811 232

Angina Stroke

5792 64703

29 320

5788 64511

29 320

5316 58138

31 344

5316 58138

31 344

5285 57712

31 344

4731 53208

31 345

Isch str 23673 117 23543 117 21012 124 21012 124 20912 125 19371 126 Hem str 8097 40 8074 40 6544 39 6444 39 6492 39 5995 39

Missing covariates 11 records were missing sex 325 were missing age and 14661 records were missing race Race was imputed by random assignment based on block group of residencersquos race distribution for 13251 records (310 were also missing other covariates For approximately 1100 records with missing race geocoding was inadequate for assigning block group and 310 of those missing race were also missing additional covariates) One discharge can have one or more covariates missing (ie missing covariates are not mutually exclusive

38

Table 7 Income Quartile and Race Distributions for CVD Hospitalizations

Study Population

IHD n=101012

AMI n=35811

Angina n=4731

Stroke n=53208

Ischemic Stroke n=19371

Hemorrhagic Stroke n=5995

Income (Quartiles) Lowest Income 237 286 300 356 298 309 301 Second Income 257 254 265 237 256 256 252 Third Income 254 241 236 224 236 234 223 Highest Income 251 221 201 182 209 201 224

Total 1000 999 999 1000 1000

White 855 859 885 833 858 844 809 Black 82 64 62 102 92 108 110 Asian 26 12 12 11 12 14 19 Other 36 65 41 53 38 34 62

Study population income quartiles are based on census block median income Quartile cut-points were developed by weighting block group median income levels by block group population to identify the cut-points within the continuous income distribution After creating the cut-points the study population of each block group was assigned to the appropriate quartile category This resulted in the percentage of population in each quartile varying slightly from 25 because the entire populations of block groups were assigned to the same quartile

39

Table 8a Exposure Area Distributions and Rate Ratios for Ischemic Heart Disease Hospitalizations

Exposure Person-Years

(N=30219001) in

Ischemic Heart Disease Hospitalizations Total AMI Angina

No () No () RR (95 CI) No () RR (95 CI) No () RR (95 CI)

Adjusted for Age Sex Race Adjacent Block Group Yes 3269547 (108) 12327 (122) 133 (112-157) 4398 (123) 136 (114-163) 591 (125) 123 (106-142) No 26949454 (892) 88611 (878) 100 Reference 31380 (877) 100 Reference 4137 (875) 100 Reference Distance From River lt frac12 mile 2380477 (079) 9817 (97) 120 (103-140) 3399 (095) 126 (109-145) 542 (115) 160 (137-188) frac12 - 1 mile 2834956 (094) 11236 (111) 120 (104-140) 4048 (113) 125 (109-144) 610 (129) 156 (133-183) gt1 mile 25003569 (827) 79885 (791) 100 Reference 28331(792) 100 Reference 3576 (756) 100 Reference

Addrsquol adjustment for SES and other factors Adjacent Block Group Yes 3269547 (108) 12327 (122) 107 (104-111) 4398 (123) 106 (101-110) 591 (125) 106 (096-117) No 26949454 (892) 88611 (878) 100 Reference 31380 (877) 100 Reference 4137 (875) 100 Reference Distance From River lt frac12 mile 2380477 (079) 9817 (97) 102 (098-106) 3399 (095) 103 (098-109) 542 (115) 109 (098-121) frac12 - 1 mile 2834956 (094) 11236 (111) 103 (099-108) 4048 (113) 106 (101-111) 610 (129) 113 (102-125) gt 1 mile 25003569 (827) 79885 (791) 100 Reference 28331(792) 100 Reference 3576 (756) 100 Reference

Agegt25 1990-2005 50 sample Adjusted for age (25-34 35-44 45-54 55-64 65-74 75+ years) race (white black other) and sex (male female) Adjusted for SES factors quartiles of population density (61-5066 5066lt-21020 21020lt-64102 64102lt-943716 persons per square mile) median 1999 household income ($249900-4096800 $4096800lt-5465000 $5465000lt-7294400 $7294400lt-20000100) with less than a college education (0-306 306lt-427 427lt-555 555lt-1000) with Hispanic ethnicity (0-21 21lt-47 47lt- 96 96lt-860) and distance to nearest hospital

40

Table 8b Exposure Area Distributions and Rate Ratios for Stroke Hospitalizations Age gt= 25 1990-2005

Exposure Person-Years

(N=30219001) in Study Area

Stroke Hospitalizations

Total Ischemic Hemorrhagic

No () No () RR (95 CI) No () RR (95 CI) No () RR (95 CI)

Adjusted for Age Sex Race Adjacent Block Group Yes 3269547 (108) 6276 (118) 125 (109-144) 2334 (121) 128 (110-148) 635 (108) 099 (086-113)

No 26949454 (892) 46748 (882) 100 Reference 16989 (879) 100 Reference 5263 (892) 100 Reference

Distance From River lt frac12 mile 2380477 (079) 5280 (099) 116 (104-131) 1925 (100) 120 (105-137) 518 (088) 102 (088-117)

frac12 - 1 mile 2834956 (094) 6023 (114) 118 (106-132) 2227 (115) 127 (112-145) 658 (111) 115 (100-131)

gt1 mile 25003569 (827) 41721 (787) 100 Reference 15171 (785) 100 Reference 4722 (801) 100 Reference

Addl Adjustment for SES and Other Factors Adjacent Block Group Yes 3269547 (108) 6276 (118) 107 (103-112) 2334 (121) 110 (104-116) 635 (108) 097 (088 ndash 106)

No 26949454 (892) 46748 (882) 100 Reference 16989 (879) 100 Reference 5263 (892) 100 Reference Distance From River lt frac12 mile 2380477 (079) 5280 (099) 106 (101-110) 1925 (100) 108 (101-115) 518 (088) 089 (081-099)

frac12 - 1 mile 2834956 (094) 6023 (114) 106 (102-111) 2227 (115) 111 (104-117) 658 (111) 108 (099-119)

gt 1 mile 25003569 (827) 41721 (787) 100 Reference 15171 (785) 100 Reference 4722 (801) 100 Reference

Age gt 25 years 1990-200650 sample Adjusted for age (25-34 35-44 45-54 55-64 65-74 75+ years) race (white black other) and sex (male female) Adjusted for SES factors quartiles of population density (61-5066 5066lt-21020 21020lt-64102 64102lt-943716 persons per square mile) median 1999 household income ($249900-4096800 $4096800lt-5465000 $5465000lt-7294400 $7294400lt-20000100) with less than a college education (0-306 306lt-427 427lt-555 555lt-1000) with Hispanic ethnicity (0-21 21lt-47 47lt- 96 96lt-860) and distance to nearest hospital

41

Table 9 Rate Ratios for CVD Hospitalizations Stratified by Income Quartile

Rate Ratio for lt frac12 mile versus gt 1 mile from river

IHD AMI Angina Stroke Ischemic Stroke Hemorrhagic Stroke

RR (95CI) RR (95CI) RR (95CI) RR (95CI) RR (95CI) RR (95CI)

Adjusted for Age Sex Race Lowest Income 1 123 (108-140) 124 (110-139) 139 (120-160) 120 (109-133) 122 (107-138) 118 (103-137) Income 2 115 (101-131) 118 (101-139) 155 (124-195) 124 (113-135) 127 (109-147) --shyIncome 3 088 (075-103) 083 (068-102) 102 (076-137) 092 (079-107) 093 (076-113) --shyHighest Income 4 080 (064-099) 074 (054-103) 030 (013-070) 078 (065-093) 071 (053-095) 043 (024-076) Weighted average 100 098 091 102 100 071

Addrsquol Adjustment for other demographic factors Lowest Income 1 114 (107-120) 114 (106-123) 112 (097-130) 116 (109-124) 120 (110-131) 109 (095-125) Income 2 110 (102-118) 111 (101-123) 136 (108-172) 110 (101-119) 112 (099-128) 095 (076-119) Income 3 085 (077-094) 086 (075-097) 093 (068-126) 097 (087-108) 096 (082-112) 046 (033-066) Highest Income 4 068 (060-078) 069 (057-085) 028 (012-062) 069 (059-081) 058 (044-077) --shyWeighted average 092 093 079 096 093 078

To reduce the complexity of this Table only one of the distance categories is provided The other distance categories showed similar results Adjusted for age (25-34 35-44 45-54 55-64 65-74 75+ years) race (white black other) and sex (male female) Block group median income quartile ranges $249900-4096800 $4096800lt-5465000 $5465000lt-7294400 $7294400lt-20000100 Adjusted for demographic factors quartiles of population density (61-5066 5066lt-21020 21020lt-64102 64102lt-943716 persons per square mile) with less than a college education (0-306 306lt-427 427lt-555 555lt-1000) with Hispanic ethnicity (0-21 21lt-47 47lt- 96 96lt-860) and distance to nearest hospital

42

Table 10 Hospitalization Rate Estimates for Current Study and from Published Sources (age-adjusted to 2000 US standard per 10000)

Current Study 12 Counties abutting Hudson River 1990-2005

Total US CDC 2005

Northeastern US CDC 2005

SPARCS NYS excluding NYC 2005-2007

IHD 649 619 691

AMI 232 231 278

Angina 31 Stroke 349 303 307 351

Ischemic Stroke 127 39 Hemorrhagic Stroke

Source USDHHS 2005 National Hospital Discharge Survey No 385 July 12 2007 Source NYS DOH SPARCS 2009b Comparable estimates were not able to be provided because published outcome groupings differed

43

Figure 1 ZIP codes by Adjacency to the Hudson River

44

Figure 2 Census Block Groups by Adjacency to the Hudson River

45

Figure 3 Census Block Groups by Population-Weighted Centroid Distance to the Hudson River

46

IHD Hospitalization Rates by Sex Race Income (age-adjusted) (other race

group excluded)

000

200

400

600

800

1000

1200

1400

1600

Low est IncQ

Q2 Q3 Highest Inc Q

Income Quartile

Hos

pita

lizat

ions

per

10

00

w hite men black men

asian men w hite w omen

black w omen asian w omen

AMI Hospitalization Rates by Sex Race Income (age-adjusted) (other race

group excluded)

000

100

200

300

400

500

600

700

Low est IncQ

Q2 Q3 Highest Inc Q Income Quartile

Hos

pita

lizat

ions

per

10

00

w hite men black men

asian men w hite w omen

black w omen asian w omen

Stroke Hospitalization Rates by Sex Race Income (age-adjusted) (other

race group excluded)

000

100

200

300

400

500

600

700

800

900

Low est In Q

Q2 Q3 Highest Inc Q Income quartile

Hos

pita

lizat

ions

per

10

00w hite men black men

asian men w hite w omen

black w omen asian w omen

Figure 4 Figure 5 Figure 6

47

MAURICE D HINCHEY 22NO DISTRICT NEW YORK

COMMITTEE ON APPROPRIATIONS

~lIC()foltIIInas

AGRICUJTlAE RURAL OEElOPWNT rooo AND ORUG ADMINISTRATION

AND RBATEO AGEfjC~S

INTERIOR

~onlJltss of tl)t ~nttn ~tatt5 ~oU5t of ItpttUtlltl1tibeg lHaJington 111laquo 20515-3222

October 4 2006

Howard Frumkin MD DrPH Director Agency for Toxic Substances and

Disease Registty and CDCs National Center for Environmental Health

1600 Clifton Road NE MSE-28 Atlanta GA 30333

Dear Dr Frumkin

First I would like to congratulate you on your appoinbnent as Director of A TSDR and CDCs National Center for Environmental Health llook forward to the opportunity to work with you on important public health issues this year and in the future

My second reason for writing today is to request that ATSDR assist my office in conducting a health consultation study for residents along a 200 mile segment of the Hudson River a Superfund site on the National Priority List containing PCBs in order to measure whether further evaluation may be warranted

As you may know Dr David Carpenter has done a number of studies at SUNY Albany in New York that suggest that resideilcy in proximity to hazardous waste sites with Persistant Organic Pollutants (POPs) may be associated with increased hospitalization rates for Coronary Heart Disease (CHD) when compared to hospitalization rates tor areas with no waste sites or other types of waste not catagorized as POPs I have enclosed a copy of three of these research papers that demonstrate this correlation for heart disease hypertension and stroke

I am very concerned about the health of the residents living along the Hudson River and respectfully request that your agency look into what seems to be a very serious health threat For purposes of my request T would appreciate your agency looking into the subset population Dr Carpenter examined in 78 zip codes along the 200 mile stretch of the Hudson River from Hudson Falls to New York City This subset population studied by Dr Carpenter showed a statistically significant increase in hospital discharges for heart disease hypertension and stroke Interestingly according to CDCs data base the Behavioral Risk Factor Surveillance System this population on average has higher incomes exercises more frequently and conswnes more fruits and vegetables than the average New Yorker Additionally this population has more nonsmokers than the rest of Upstate New Yorks population

WASHINGTON OIFCE

2411 RAYBURN HOUSE OfFICE BUILDING WASH1NcrroN DC 20amp1amp-3222

(Zl2) 225-6335

_fOMetoYht lnchIY

BlNGI-UMTON OfRCE m-wA OfFICE KIGSTCN OFFIepound MIDDLnWJN OffICE MOJmCEllO OffiCE IDDA IfOCML lUllDlNG 123 SOUTH CAYJOA smra U l WAll

m STREET em HALL THlflD flOOR 18 ANWANf LAKf CAD

RIIrlGHAtATON NY 13901 SUITE 201 KINGSTOfl 12401 te JUpoundS STilEET MONTlCfllO NY 12701 (6171773-2768 ITHACA HY 14860 (M5)3~ MlDgtlETOWN NY 109(0 (amptS) 7S1-71e

(607) 273-~388 (845) l4C-3l11

Appendix A Representative Hinchey Letter

Dr Howard Frumkin Page 2 October 4 2006

Since this type of population would typically exhibit better than average cardiac health Im sme you can understand and appreciate my concern and sense of urgency that we must follow up on Dr Carpenters good work and detcnnine what further study is warranted to protect the health of the populatioD living and working aloDg the Hudson River

I than1c you for yom kind attention to my request I look forward to hearing back from you at your earliest convenience

Best regards

Sincerely

~ane~ey Member of Congress

MHlag

49

Appendix B Table 1a IHD Hospitalization Rate Ratios agesexrace adjusted versus additionally adjusted For residents (age 25 and older) of block groups closer versus more distant from the Hudson River in models adjusting for sex age race and block group median household income population density education level percent Hispanic and block group centroid distance to nearest hospital 1990-2005

Adjusted Rate Ratios by Block Group River Proximity (95 Confidence Intervals) IHD (N=101012) AMI (N=35811) Angina (N=4731)

Risk Factor Agesexrace adjusted

Addrsquol adjustment Agesexrace adjusted

Addrsquol adjustment Agesexrace adjusted

Addrsquol adjustment

Block group distance from River

lt frac12 mile vs gt 1 mile 120 (103-140) 102 (098-106) 126 (109-145) 103 (098-109) 160 (137-188) 109 (098-121) frac12 - 1 mile vs gt 1 mile 120 (104-140) 103 (099-107) 125 (109-144) 106 (101-111) 156 (133-183) 113 (102-125)

Sex Male vs female 197 (174-223) 214 (208-220) 211 (187-238) 209 (202-215) 113 (099-129) 113 (106-121) Race Black vs white 120 (102-142) 093 (089-097) 109 (094-127) 092 (087-097) 178 (152-208) 121 (108-136)

AsianPacific Is ldquo 080 (066-096) 073 (069-079) 085 (070-103) 072 (065-081) 075 (055-102) 065 (049-087) Other ldquo 246 (208-291) 377 (363-392) 189 (162-220) 204 (191-218) 257 (215-307) 205 (177-238)

Age 35-44 vs 25-34 809 (630-104) 824 (743-913) 647 (494-849) 658 (568-762) 491 (348-692) 498 (376-660) 45-54 ldquo 333 (261-426) 344 (312-380) 228 (175-296) 230 (200-265) 1539 (1108-214) 157 (120-206) 55-64 ldquo 827 (647-1057) 802 (727-885) 576 (443-748) 516 (449-593) 278 (200-386) 284 (217-371) 65-74 ldquo 1477 (1155-1890) 1384 (1255-1527) 1067 (820-1387) 899 (782-1033) 409 (294-567) 449 (344-586) 75+ ldquo 2124 (1656-2724) 1942 (1760-2142) 2190 (1683-2849) 1849 (1610-2123) 661 (476-919) 695 (533-905)

Population density

2nd lowest vs least dense

126 (121-132) 131 (124-137) 142 (127-159)

more dense ldquo 130 (124-136) 134 (127-141) 131 (116-147) most dense ldquo 120 (114-126) 121 (114-129) 115 (100-131)

Median Household Income

2nd highest vs highest income

105 (101-110) 113 (108-119) 123 (110-137)

lower ldquo 109 (104-113) 123 (117-130) 125 (111-141) lowest ldquo 116 (111-122) 129 (122-137) 162 (142-185)

with less than college

2nd lowest vs least college

114 (109-118) 118 (113-124) 110 (098-122)

higher ldquo 122 (117-127) 126 (119-132) 115(102-129) highest

ldquo

122 (116-128) 128 (120-135) 129 (113-148)

Hispanic 2nd lowest vs least Hispanic

093 (089-097) 094 (0890-098) 101 (091-112)

higher ldquo 092 (088-096) 096 (092-101) 118 (107-131) highest

ldquo

085 (081-088) 088 (084-093) 138 (123-154)

Distance to Hospital

2nd closest vs closest 097 (094-100) 094 (090-098) 107 (098-118) more distant ldquo 090 (087-094) 093 (089-098) 096 (086-107) furthest ldquo 087 (083-091) 091 (087-096) 089 (079-101)

50

Appendix B Table 1b Stroke Hospitalization Rate Ratios agesexrace adjusted versus additionally adjusted For residents (age 25 and older) of block groups closer versus more distant from the Hudson River in models adjusting for sex age race and block group median household income population density education level percent Hispanic and block group centroid distance to nearest hospital 1990-2005

Adjusted Rate Ratios by Block Group River Proximity (95 Confidence Intervals) Total Stroke (N=53208) Ischemic Stroke (N=19371) Hemorrhagic Stroke (N=5995)

Risk Factor Agesexrace adjusted

Addrsquol adjustment Agesexrace adjusted

Addrsquol adjustment Agesexrace adjusted

Addrsquol adjustment

Block group distance from River

lt frac12 mile vs gt 1 mile 116 (104-131) 106 (101-110) 120 (105-137) 108 (101-115) 102 (088-117) 089 (081-099) frac12 - 1 mile vs gt 1 mile 118 (106-132) 106 (102-111) 127 (112-145) 111 (104-117) 115 (100-131) 108 (099-119)

Sex Male vs female 114 (104-125) 127 (123-130) 126 (113-141) 131 (126-136) 114 (103-127) 123 (116-130) Race Black vs white 194 (173-219) 153 (147-160) 233 (204-267) 191 (180-203) 215 (188-246) 190 (173-209)

AsianPacific Isl ldquo 097 (083-113) 088 (080-096) 114 (094-138) 101 (088-116) 142 (114-177) 129 (106-157) Other ldquo 230 (203-260) 232 (219-245) 229 (197-265) 212 (193-232) 360 (310-417) 334 (296-377)

Age 35-44 vs 25-34 309 (251-379) 303 (269-343) 359 (267-483) 362 (289-455) 274 (212-353) 268 (219-329) 45-54 ldquo 114 (938-139) 1087 (970-1217) 149 (113-197) 135 (109-167) 655 (513-836) 605 (500-734) 55-64 ldquo 331 (273-402) 329 (295-368) 433 (329-569) 418 (340-514) 119 (933-152) 109 (904-132) 65-74 ldquo 826 (682-1001 861 (772-959) 1145 (872-1504) 1114 (908-1367) 214 (168-272) 224 (186-270) 75+ ldquo 1890 (1559-2290) 2037 (1828-2269) 2768 (2112-3627) 3060 (2497-3750) 554 (438-700) 587 (490-703)

Population density

2nd lowest vs least dense

126 (120-132) 124 (117-132) 129 (117-142)

more dense ldquo 126 (120-132) 121 (113-129) 142 (128-157) most dense ldquo 124 (118-132) 128 (119-138) 150 (133-169)

Median Household Income

2nd highest vs highest income

106 (101-110) 110 (103-117) 099 (090-108)

lower ldquo 113 (108-118) 113 (106-121) 107 (097-118) lowest ldquo 116 (110-122) 117 (108-126) 114 (102-127)

with less than college

2nd lowest vs least college

111 (107-116) 112 (106-119) 098 (090-107)

higher ldquo 120 (114-125) 120 (113-128) 102 (093-112) highest ldquo 116 (110-122) 119 (110-128) 093 (090-108)

Hispanic 2nd lowest vs least Hispanic

096 (092-100) 094 (088-099) 084 (077-091)

higher ldquo 101 (096-105) 096 (091-102) 084 (077-091) highest ldquo 096 (092-100) 088 (082-093) 078 (070-085)

Distance to Hospital

2nd closest vs closest 095 (091-098) 092 (087-097) 097 (089-105) more distant ldquo 092 (088-096) 091 (086-097) 089 (081-097) furthest ldquo 085 (081-090) 082 (077-088) 088 (079-098)

51

Appendix B Table 2 Sex and Race-Specific Age-Adjusted IHD AMI and Stroke Rates by Income (per 1000)

Men Women white black asian other white black asian other

IHD Lowest IncQ 470 401 492 940 283 354 477 543 Q2 425 339 675 1443 221 236 364 690 Q3 436 301 420 2463 199 233 236 902 Highest Inc Q 399 325 218 4307 160 172 120 1247 AMI Lowest IncQ 180 154 203 269 101 112 221 159 Q2 159 141 300 306 085 088 138 179 Q3 152 114 139 480 078 084 096 214 Highest Inc Q 135 130 046 820 062 065 052 275 Stroke Lowest IncQ 201 292 330 370 166 281 343 261 Q2 182 254 389 466 150 247 274 373 Q3 192 238 231 598 145 245 201 401 Highest Inc Q 177 275 102 905 132 197 070 776

These rates are estimated from the 50 sample of hospitalizations in the 12-county study area included in this investigation The rates are adjusted to the 2000 standard US population

52

CERTIFICATION

The health consultation for the Hudson River PCBs site Residential Proximity to the Hudson River and Hospitalization Rates for Ischemic Heart Disease and Stroke 1990shy2005 was prepared by the New York State Department of Health under a cooperative agreement with the Agency for Toxic Substances and Disease Registry (ATSDR) It is in accordance with approved methodology and procedures existing at the time the health consultation was initiated Editorial review was completed by the cooperative agreement partner

The Division of Health Assessment and Consultation (DHAC) ATSDR has reviewed this public health assessment and concurs with its findings

ATSDR

Page 13: HR CVD Final HCl 2011

CVD mortality rates do not appear to be declining for all segments of the population however and may actually be increasing among some groups particularly people of lower socio-economic status (Barnett et al 1999) This may be due in part to higher rates of smoking among people of lower socio-economic status as well as to other factors related to socio-economic status and race A growing literature points to multiple causes for CVD and other health disparities including health practices psychosocial stress limited resources discrimination and access to health care (Health United States 2007) Increases in the prevalence of diabetes and obesity in the general population partially offset advances in treatment and favorable trends for other risk factors An increasing ischemic heart disease mortality rate in women ages 35 to 54 since 1997 is attributed to these unfavorable risk factor trends (Ford 2007 Lloyd-Jones 2009) New York State (NYS) and other northeast states have high IHD and low stroke mortality with NYS having the highest IHD mortality and lowest stroke mortality among the 50 states (Howard et al 2009)

CVD Data Sources CVD mortality rates are more easily measured than incidence prevalence or hospitalizations because mortality data from vital records are a comprehensive and relatively accessible data source Information for estimating the incidence and prevalence of CVD outcomes comes primarily from health interview physical examination and record surveys Hospitalization data are available for NYS and the US but are generally collected and analyzed in order to track trends in hospital care and resources expended Actual CVD incidence data validated by physician diagnosis physician examination or reviews of medical records are not directly available for the entire US or for specific geographic areas of NYS Such information is available from population-based cohort studies that follow people over time in selected communities While the concern underlying this research project is to understand CVD levels in the community ie incidence and prevalence we do not have data that directly provide us with this information In the absence of such data hospitalizations are frequently used as indicators of the burden of disease in communities particularly in surveillance or hypothesis-generating investigations such as this one In NYS the Behavioral Risk Factor Surveillance System (BRFSS) an ongoing monthly telephone survey of adults aged 18 years and older provides information about the burden of CVD by age sex race income and education

CVD Risk Factor Data The BRFSS data for NYS show that over 20 of the population aged 65 and older reported having been diagnosed with CVD versus less than five percent of the population under 65 Males were slightly more likely to report having CVD than females Nonshyhispanic whites compared to other ethnic groups combined were slightly more likely to report having CVD None of these slight differences were statistically significant however The only statistically significant demographic risk factors in the BRFSS data were low income and education levels Statistically significant elevations of reported CVD were observed for respondents with incomes below $25000 and respondents with a high school degree or less education (NYS DOH 2009a)

Another factor that can affect the outcome measure hospitalizations is the likelihood of mortality prior to reaching the hospital Studies show a variety of factors affect whether an individual experiencing a heart attack or stroke outside the hospital survives to be counted in hospitalization data These factors include lack of knowledge about personal risk or prior disease socio-economic status race marital status and distance to the hospital (Galea 2007

4

Barnett 2006 Ayala 2003) Population-based studies report sudden cardiac death usually outside the hospital to be the first overt manifestation of heart disease for 40 to 60 of all cases ( Kannel 1987 deVreede-Swagemakers 1997 UK Heart Attack Study 1998) An analysis of out-of-hospital mortality from AMI in Pennsylvania in 1998 showed that in relatively rural areas where travel time to the hospital was 25 minutes or longer 72 of all AMI deaths occurred outside the hospital compared with a statewide average of 49 of AMI deaths occurring outside of the hospital (OrsquoNeill 2003) In New York State 55 of IHD deaths occur outside the hospital compared to 35 of stroke deaths (NYS DOH 2009b)

Hudson River PCB Contamination

During an approximate 30-year period ending in 1977 the General Electric (GE) plants in Hudson Falls and Fort Edward used PCBs in the manufacture of electrical capacitors and discharged large quantities of PCBs into the upper Hudson River Estimates of the total direct discharges to the river are as high as 1330000 pounds In 1983 the US EPA classified a 200shymile section of the Hudson River from Hudson Falls to New York City as a National Priority List site making it one of the largest Superfund sites in the United States In 2002 the US EPA issued a Record of Decision calling for targeted environmental dredging and removal of PCB-contaminated sediment in the upper River (USEPA 2002) The dredging began in May 2009

The Superfund environmental investigations and planned remedial actions have focused on the areas of the river near Hudson Falls and Fort Edward where the highest levels of PCBs in sediments are located The 2002 Record of Decision provides information about the human health risk assessment conducted for the Upper and Mid-Hudson and the ecological risk assessment conducted for the Upper and Lower Hudson River (USEPA 2002) Fish contamination and fish consumption are the primary concerns at the site

In 1976 fishing was banned in the upper river and in 1995 a catch and release policy replaced the ban NYS DOH continues to recommend people eat no fish from the Upper Hudson River that children under age 15 and women of childbearing age eat no fish from the entire 200 mile stretch of river below Hudson Falls and that the general population eat none of most species of fish caught between the federal dam at Troy and Catskill approximately 40 miles south of Troy The Lower Hudson remains closed to commercial fishing for striped bass and eight other species

The fishing bans and advisories were issued because ingestion of fish is the major potential pathway for exposure for the general population Many previous studies have shown relationships between consumption of fish from contaminated waters and human PCB levels Another potential pathway for exposure less frequently studied to date is inhalation of PCBs NYS DOH and ATSDR launched ldquoThe Hudson River Communities Projectrdquo in 2000 to address this issue (More information on this project is provided below)

POPS and PCBs

The category ldquopersistent organic pollutantsrdquo (POPs) includes man-made chemicals that are by definition persistent in the environment Examples of POPs are polychlorinated biphenyls

5

(PCBs) polychlorinated dibenzo-p-dioxins (PCDDs) polychorinated dibenso-furans (PCDFs) and pprsquo-diphenyldichloroethene (DDE) a breakdown product of the pesticide dichlorodiphenyltrichloroethane (DDT) These chemicals are stored in human fat tissues accumulating over an individualrsquos lifetime They also accumulate in fish and other species thereby concentrating up the food chain Levels of most POPs measured in blood serum of Americans have decreased in the last few decades An exception is polybrominated diphenyl ethers (PBDEs) levels of which have been increasing in recent years While levels of many POPs are declining a large number continue to be detectable in blood for most of the US population (USDHHS 2009)

This follow-up investigation focuses on the Hudson River where the POPs of concern are PCBs PCBs are a group of synthetic chlorinated compounds manufactured in the US through 1977 Production was halted due to concerns about the persistence of PCBs in the environment and the toxicity of PCBs shown in animal models PCBs were used primarily by the electrical industry for capacitors and transformers and were also used in hydraulic fluids fluorescent light fixtures flame retardants inks adhesives carbonless copy paper paints pesticide extenders plasticizers and wire insulators PCBs can be oily liquids or solids and some are volatile and can exist as vapor in air PCBs in the environment are generally mixtures of the many different types known as congeners (ATSDR 2000)

The variety of types of PCBs adds to the complexity of studying adverse animal and human health effects Increasingly PCB research focuses on specific PCB congeners and groups of congeners because specific types of congeners appear to have different mechanisms of action and toxicity Dioxin-like PCBs (DLPCBs) are an important subset of PCBs due to concerns about their toxicity The focus on dioxin-like PCBs is paralleled by increasing research on the group of POPs with dioxin-like properties This type of PCB and POP research assigns toxic potency values relative to dioxin to various PCBs and POPs to provide an overall toxic equivalency value to specific PCB and other POPs

From 2002 through 2007 NYS DOH and SUNY Albany School of Public Health researchers conducted the Hudson River Communities Study among residents of Hudson Falls and Fort Edward the villages where the PCB discharges from two GE electrical capacitor plants occurred The comparison community was upriver from the discharges in Glens Falls This studyrsquos evaluation of exposures included measuring PCBs in outdoor air and in blood serum The study found that PCB levels in outdoor air were statistically significantly higher in the Hudson FallsFort Edward area (072 nanograms per cubic meter) than in the upriver Glens Falls area (040 nanograms per cubic meter) but that the PCB concentrations in the Hudson FallsFort Edward area were lower than those in other communities with known PCB-contaminated sites and similar to levels reported in other locations in the northeastern United States A statistically significant reduction in the total PCB concentrations in outdoor air samples was observed at a distance of 1200 meters (34 mile) from the river from 0760 ngm3 to 0497 ngm3 (Fitzgerald et al 2007 Palmer et al 2008)

The average PCB concentrations in serum for the study and comparison populations did not differ significantly (307 ppb wet weight versus 323 ppb) The average toxic equivalency quantities (TEQ) for dioxin-like PCB congeners for the study and comparison area populations

6

also did not differ In addition the study found no detectable differences in study subjectsrsquo serum PCB levels associated with proximity to contaminated portions of the river or other PCB sources The study did find that lifetime consumption of Hudson River fish was associated with higher serum PCB levels (Fitzgerald 2007 Palmer 2008) Another recent biomonitoring study of Brooklyn Manhattan and New Jersey fisherman who consumed versus those who did not consume their catch from the lower Hudson River and New YorkNew Jersey harbor suggests eating fish from these waters is not associated with an increased body burden of PCBs or most other organochlorines This study did not address the inhalation pathway (Morland 2008)

PCBs and Cardiovascular Disease Outcomes

Research studies on adverse health effects associated with PCB exposure have focused on many types of outcomes organs and biological systems including cancer liver reproductive and developmental effects thyroid endocrine immune system and neurological effects as well as specific effects on the skin and eyes Relatively little research has focused to date on associations between PCB exposures and CVD outcomes The majority of human studies have assessed health effects in occupational groups having relatively high-level exposures among people consuming fish from contaminated waters or people exposed to relatively high levels due to accidental contamination of food (USDHHS NIH 2010)

PCBs have been shown to produce a variety of toxic effects in animals These effects include cancer allergies hypersensitivity damage to the central and peripheral nervous systems reproductive disorders and disruption of the immune system (ATSDR 2000) In contrast research to date does not tell us whether specific types of human health effects have occurred from the low-level exposures experienced by the general population Some human epidemiological studies of relatively low levels of PCDDs PCDFs and PCBs suggest possible relationships not only with neurodevelopment of infants but also diabetes and a condition called insulin resistance conditions that increase CVD risk (Longnecker 2001 Carpenter 2006 Arisawa 2005)

In 1998 a study of a population in Seveso Italy exposed to relatively pure TCDD (dioxin) from an industrial accident showed increased mortality from CVD respiratory diseases and diabetes (Bertazzi 1998) These findings suggested the possibility that other dioxin-like chemicals such as dioxin-like PCBs might increase risk for these diseases Some more recent animal studies described below suggest possible mechanisms of action for PCB effects that increase risk for CVD diabetes and hypertension Another relevant source of information is from research using the biological monitoring data from the National Health and Nutrition Examination Survey (NHANES) project to look at the association between PCB levels in blood and self-reported diabetes and heart disease The body of evidence based on analytical human epidemiological studies remains relatively sparse however

Some recent studies using NHANES biomonitoring data show associations between PCB levels and CVD (Lee 2006 Lee 2007 Ha 2007 Everett 2008) These studies are cross-sectional (comparisons at one point in time) evaluations of PCB levels measured in blood samples from individuals and self-reported CVD and diabetes a risk factor for CVD While these studies use individual-level biomonitoring and self-reported health outcome data they are considered to be

7

relatively weak in terms of providing evidence of causality because both exposure and outcome are measured at the same point in time Given the nature of NHANES data collection and reporting practices these studies are not able to control for possible geographical confounding which may occur if urban versus rural residence is associated with higher PCB levels and higher risk for CVD for example The higher PCB levels among African Americans in these data suggest urban residence may be playing a role in the PCB data

The results of animal studies suggest multiple mechanisms by which PCB exposures could affect cardiovascular risk as well as multiple types of adverse health outcomes One hypothesized mechanisms of action specifically for dioxin-like PCBs is that they may have anti-estrogenic effects which impact CVD risk factors such as serum cholesterol levels and blood pressure (Lind 2004) Several studies suggest PCBs interact with a specific receptor (AhR) and increase cellular oxidative stress causing inflammation cell injury and cardiovascular dysfunction (Hennig 2002) Similarly some studies suggest PCB-induced inflammation of fat cells leads to obesity and increased risk for coronary artery disease (atherosclerosis) (Arsenescu 2008)

3 FOLLOW-UP STUDY DATA SOURCES AND METHODS

This follow-up investigation will provide perspective on the 2004-2005 findings (Sergeev and Carpenter 2005 Shcherbatykh et al 2005) by producing results from analyses conducted at a more detailed level of geography the block group rather than ZIP code in order to better control for confounding When the term confounding is used by epidemioogists it refers to the problem that can arise when there is a factor such as income level that is not the factor of interest in the study but that is related to the risk factor being studied (residential location ndash proxy for exposure) and to the disease being studied (CVD) This type of factor is labeled a ldquoconfounderrdquo because it can confound ie interfere with or confuse the evaluation of associations among the variables of interest in the data All types of epidemiological studies face issues of confounding but ecological ie group-level analyses face additional challenges related to potential confounding because analyses can not account for some important individual differences within the groups that are the units of analysis (Morganstern 1998b Hertz-Picciotto 1998) This investigation was able to control for each individualrsquos age sex and race but not for individual-level risk factors such as income or education level

The follow-up investigationrsquos specific objectives are to

improve the ability to control for confounding by socio-economic status through the use of block groups rather than ZIP codes as the unit of analysis

compare the hospitalization rates among residents of block groups nearest the Hudson River to those among residents of block groups farther from the Hudson River

compare the findings of this follow-up study to those of the earlier studies

Study Area and Exposure Areas

The overall study area is comprised of the 12 counties with geographic areas abutting the Hudson River south of the source of PCB contamination from the General Electric facility in Hudson Falls The 12 counties are Westchester Rockland Putnam Orange Dutchess Ulster

8

Columbia Greene Rensselaer Albany Washington and Saratoga US Census boundaries are used to define the study area because census population estimates provide the data for the study area populationrsquos size and the populationrsquos age sex and race distribution Because the study period spanned 1990 through 2005 and populations change over time 1990 and 2000 Census data were used to linearly interpolate the between-year populations from 1991 through 1999 for each block group 2000 data were used without extrapolation to estimate block group populations from 2001 through 2005

Because block group boundaries change between the Census years 1990 block populations were normalized to fit 2000 block group boundaries before the estimation of sex age and race group specific populations In the digital map 2000 block group boundaries were overlaid on the 1990 block centroids to assign 1990 block population counts to the corresponding 2000 block group boundaries Then 1990 block populations were summed to fit the block group boundaries Next population counts by age group (0-14 15-24 25-34 35-44 45-54 55-64 65-74 75+) sex (male female) and race (white black asian other) were estimated for each year at the block group level This investigation includes all ages over 25 years and the study population is divided into four race categories white black or African-American Asian and ldquoOtherrdquo The Pacific Island group was combined with the Asian group and American Indian ldquootherrdquo and ldquomultiple racesrdquo categories are combined into a larger ldquootherrdquo category

Classification of the population into potential exposure categories for this study is based exclusively on location relative to the Hudson River Two types of exposure proxy categories were developed block group adjacency and block group centroid distance to the river shore For the adjacency categories block groups were assigned to the adjacent category if they directly abutted the Hudson River and all other block groups in the 12 counties were assigned to the nonshyadjacent category

For the distance categories a central point (centroid) within each block group was identified and used to assign block group centroid distance to the river shore The block group centroids were population-weighted which means the geographic distribution of households within 2000 census blocks were used to locate block group centroids based on the geographic population distribution By locating a population-weighted block centroid rather than a more straightforward distance centroid the assignment of block group distance to the river shore more accurately reflects distance to the river for the majority of block group residents Buffers of one-half mile and one mile from the Hudson River boundary were created for the assignment of each block group into one of the three distance categories less than frac12 mile (closest) frac12 to one mile (close) and farther than one mile from the river (distant)

Distance categories were selected a priori at the start of the study Because environmental sampling data were not collected along the length of the river included in this investigation we are not able to determine whether these distance categories represent actual differences in exposures As discussed previously a statistically significant reduction in the total PCB concentrations in outdoor air samples was observed at a distance of 1200 meters (34 mile) from the river from 0760 ngm3 to 0497 ngm3 in a study conducted in the Hudson Falls Fort Edward area near the source of PCBs (Fitzgerald 2007 Palmer 2007)

9

For the adjacency categories of the 2371 block groups in the study area 255 (11) were classified as adjacent to the Hudson River and 2116 (89) were classified as non-adjacent (Figure 2) For the distance categories 225 (9) were classified as closest to the river 216 (9) as close and 1930 block groups (81) as distant from the river

Comparison of Figures 1 and 2 show that block groups include smaller geographic areas than ZIP codes and that block groups are relatively smaller in area in urban versus rural areas The study area includes a total of 368 ZIP codes compared to a total of 2371 block groups ZIP code average population in the study area is approximately 8000 compared to block group average population of 1200 Comparison of Figures 2 and 3 shows that using population-weighted centroid distance to the river produces different exposure areas than adjacency Fifty-three (20) of the adjacent block groups were categorized for the distance indicator as distant because the population in the block group was concentrated in the area farther than one mile from the river On the other hand about 11 of the 2116 non-adjacent block groups were classified as closest (77) or close (162)

Hospitalization Data

Hospitalizations for ischemic heart disease (IHD) and cerebrovascular disease (stroke) were included in this study More specifically hospital discharge records that listed as the principal diagnosis either IHD (ICD-9-CM 410-414 and 4297) or cerebrovascular disease (ICD-9-CM 430-436) and subsets of each of these categories similar to the subsets evaluated in the Sergeev et al and Shcherbatykh studies were included The subsets are AMI and angina subsets of IHD and ischemic and hemorrhagic stroke which are subsets of cerebrovascular disease (stroke) (Principal diagnosis represents the illness for which the person was admitted to the hospital) Angina is an outcome not evaluated in the prior studies It was added here for comparative purposes because it represents an outcome appropriately addressed by primary outpatient rather than inpatient hospital care Angina is one of a group of outcomes included in the recently developed national ldquoPrevention Quality Indicatorsrdquo (USDHHS AHRQ 2007) as an indicator for geographic areas or populations for whom primary care is inadequate Table 3 lists the specific ICD-9-CM codes and definitions included in this investigation

While one of the previously published studies (Huang et al 2006) showed elevations in hypertension (high blood pressure) rates this outcome is rarely listed as the principal reason for hospitalization As a secondary diagnosis however hypertension is extremely common with substantial overlap with the primary diagnoses of IHD and stroke In addition hypertension hospitalizations were not as elevated in the ZIP codes adjacent to the Hudson River as the IHD and stroke outcomes As a practical matter geocoding the hypertension hospitalization data (1364000 hospitalizations) would overwhelm the resources for this project For these reasons hypertension was not included in this follow-up investigation

The source of the hospitalization data was the NYS DOH Statewide Planning and Research Cooperative System (SPARCS) established in 1979 to collect detailed records on discharges from hospitals located in New York State Persons seen in the emergency department but not admitted are not included in the SPARCS data Hospitalizations at US Veterans Administration

10

Hospitals are also not included in this dataset We identified the hospital transfers in the data and excluded them so that individuals are only counted once for the same event

The NYS DOH Data Protection Review Board for SPARCS hospitalization data approved our use of hospitalization records for this investigation and provided approximately 460000 records with a principal diagnosis of either IHD or stroke Because of the large number of cardiovascular hospitalizations and this studyrsquos requirement for geocoding each hospitalized personrsquos address we selected a 50 random sample of the hospital discharges with principal diagnoses of IHD or stroke for inclusion in the study (229948 records) Approximately one percent of these records (2737) were eliminated from the study because they contained no residential address resulting in 227211 hospitalization records 24939 of these records were eliminated after geocoding because they were not within the 12 county study area This left 202272 records remaining in the study

For geocoding to the block group level the hospitalization street addresses were modified as needed using US Postal Service standards The addresses were assigned geographic coordinates using commercially available geocoding software (MapMarker 2004) A combination of land parcel data and street centerline files were used to assign geographic coordinates No contact was made with cases parents legal guardians or next of kin of cases to determine residential locations

Once geographic coordinates were assigned to cases through address-matching the case locations were overlaid onto digital maps of the study area using a geographic information system so that the number of observed cases falling within the study area boundaries could be determined and a 2000 Census block group could be assigned to each hospitalization In order to protect confidentiality no maps of individual case locations are provided Geographic coordinates were not able to be assigned for some addresses for example rural routes or post office boxes Internet searches were conducted for addresses that were located in facilities such as nursing homes and trailer parks

While fewer than one percent of records were missing sex or age information approximately 15000 records seven percent were missing race information Evaluation of the geographic and temporal distribution of these records showed they were not randomly geographically distributed 47 of the records with no race information were from one hospital for that hospital race was missing from 100 of the records in all years except one after 1997 mid-way through the study time-frame Evaluation of other records missing race showed similar patterns of 100 of records missing race in specific years In order to prevent biased hospitalization rate estimates records missing race were assigned race randomly assuming a race distribution equal to that of the individualrsquos residential block group

Statistical Analysis

This investigation used negative binomial multivariable regression models the statistical methods used in the previous studies (Sergeev and Carpenter 2005 [IHD] Shcherbatykh et al 2005 [stroke]) The follow-up studyrsquos analyses compared hospitalization rates for populations in block groups adjacent or in close proximity to the Hudson River to rates for populations more

11

distant from the Hudson River while adjusting for the potential influence of other factors Sex age and race-specific hospitalization count data comprise the dependent variable and the statistical methods attempt to predict the counts using the sex age and race distributions from Census data as well as block group-level demographics such as median income Models were first estimated using basic Census information for age sex and race Then more fully adjusted models were estimated using information at the block group level for income education population density Hispanic ethnicity and distance to the nearest hospital

More specifically each hospitalized personrsquos block group was used to assign quartile categories of block group median household income of population with less than a college education population density of the population identified as Hispanic and distance from the block group centroid to the nearest hospital These variables were selected in advance The population density variable was selected to account for potential effects on hospitalization rates of large differences in population density from the relatively rural northern counties to the more urban southern counties in the study area of the population identified as Hispanic was included because Hispanic ethnicity may be associated with CVD risk but Hispanic ethnicity was not available in the hospitalization records Distance to the hospital was included because studies show that shorter distances increase the likelihood that individuals with AMI and stroke survive to be admitted to the hospital

As described previously the literature about risk factors for CVD shows strong associations between lower income and education and higher CVD risk Regarding the effects of income and education on hospitalization rates the literature from analytic studies specifically of hospitalizations for CVD is quite sparse Most analytic studies focus on prevalence incidence or mortality in population-based studies or cohort studies and there are not many of these An example of a study that was useful as background is a study of income level and CVD mortality that showed strong associations of lower income and higher CVD mortality for men and women with the relationship for men being linear throughout the range of income levels while for women the relationship was only strong at levels of income below the median (Rehkopf 2008)

This follow-up studyrsquos approach was to use the same set of variables for the more fully adjusted models using quartile cut points for all the sexrace groups and types of CVD diagnostic subsets The 2000 Census data were used to create the population-weighted quartile categories to be used in the statistical analyses The quartile cut points are shown below

population density(persons per square mile)

median 1999 household income

with less than a college education

with Hispanic ethnicity

Hospital distance

Quartile 1 61 - lt507 $2499-lt$40968 0 - lt31 0 - lt2 lt25 (median) Quartile 2 507 - lt102 $40968 - lt54650 31 - lt43 2 - lt5 25 - lt6 (50-75) Quartile 3 102 - lt6410 $54650 - lt72944 43 - lt55 5 - lt10 6 ndash 11 (75-90) Quartile 4 6410 - 94372 $72944 - 200001 55 - 100 10 - 86 11 ndash 45 (90-100)

SAS version 9 provided the statistical programs used to estimate hospitalization rates for exposure proxy categories and demographic categories The models provide ldquocontrast

12

estimatesrdquo ie rate ratios for one category versus a reference category of a variable and 95 confidence intervals (CI) for these rate ratios For the location categories rate ratios for the areas adjacent are compared to areas not adjacent to the river (reference) and areas closest and close to the river are compared to areas distant from the river (reference) If a rate ratio is greater than 100 the model has estimated an elevation of hospitalizations in that category of the study population compared to the reference population If the rate ratio is less than 100 then there are fewer than expected hospitalizations compared to the reference population

The magnitude of the elevation or deficit is also estimated by the rate ratio For instance if the hospitalization rate is twice as high in the adjacent population it would result in a rate ratio of 200 while a 50 higher rate would result in a rate ratio of 150 If the hospitalization rate in the comparison group was only half as big as in the reference group this would result in a rate ratio of 050 The 95 confidence interval assists with determining whether the observed differences are statistically meaningful ie statistically significant The confidence interval provides the range in which there is a 95 probability of including the true rate ratio

In addition to directly assess potential confounding we examined the associations between income and exposure proxy categories and between income and CVD hospitalization risk Because the modeling process provides rate ratios that provide comparisons among groups but not the rates themselves we also calculated the rates per population for each sex race and income category in order to directly observe the relative magnitudes of CVD hospitalization rates Finally analyses stratified by income quartile were also conducted in order to assess the potential role of confounding by income The stratified analyses consist of negative binomial regressions conducted separately for each quartile of income

4 RESULTS

Demographics Associations with Proximity to the River

Study Population Table 4 provides detailed demographic characteristics for the total study population and the populations of the adjacent versus non-adjacent and nearer versus farther from the river block groups from the 2000 Census The total population of the study area for this one year of the study time period is 2924631 persons The first row of Table 4 shows that approximately 11 percent of the study area population is classified as living in block groups adjacent to the Hudson River and 89 live in non-adjacent block groups Approximately 8 of the studyrsquos population lives in the closest block groups approximately 10 lives in close block groups and approximately 82 in distant block groups The table also shows that approximately 81 of the total study population is identified as white 9 black 3 Asian less than 01 Pacific Islander 02 American Indian 4 ldquootherrdquo and 2 ldquomultiple racesrdquo

The 2000 Census data (Table 4) also show that the populations in block groups adjacent and nonshyadjacent to the river are similar in terms of the percent of minority population but the adjacent block groups include a smaller percentage of persons who have attended college and a higher percentage of households below the poverty level Median income is $47320 for block groups adjacent versus $62668 for block groups not adjacent to the river Data for the study area from the 1990 Census show similar distributions (data not shown)

13

For the distance to the river categories there is a gradient for the three levels of distance from the river with the closest block groups on average having higher percentages of minorities smaller percentages of persons who have attended college and higher percentages of households below the poverty level (Table 4) The populations of the close and closest block groups are more similar to each other than to the populations of the distant block groups The population of the closest block groups is estimated to be 19 black close block groups 14 black and distant block groups 8 black The proportions for the ldquootherrdquo category used in the analyses which includes multiple races and American Indians are approximately 12 in both the closest block groups and in the close block groups compared to 5 in the block groups most distant from the river Median income for the closest block groups is $40748 close block groups $44962 and most distant block groups $64847

The bar chart below illustrates the correlations between US Census 2000 indicators of block group socio-economic status and block group centroid (population weighted) distance from the river for the study population

2000 Census Demographics for Study Population by Distance to River Category

black other hispanic no college below poverty

of pop ulati on

0

10

20

30

40

50

60

lt12 mile

12 to 1 mile

gt1 mile

Table 5 shows selected demographic information for the person-year estimates used in the study analyses for the 1990-2005 study period Person-year data represent individuals from Census estimates counted repeatedly for each year of the study timeframe As described in the Methods section the estimation of person years used interpolation to account for population changes from 1990-2000 and techniques for taking account of block boundary changes The person-years estimates for demographic categories (complete data not shown) show similar patterns as the 1990 Census data (data not shown) and 2000 Census data (Table 5) Table 5 for example shows nearly identical percentages of population in each of the river distance categories as Table 4

Table 5 also shows person year information for income quartiles and exposure proxy categories as well as race and exposure proxy categories that is consistent with the Table 4 data for the

14

2000 population Table 5 row percentages show for example that while approximately 11 of the total study population live in adjacent block groups approximately 19 of the population in the lowest income quartile live in adjacent block groups By design approximately 25 of the total study population is in each of the income quartiles Table 5 column percentages show however that in the adjacent block groups 41 of the population is in the lowest income quartile and only 9 are in the highest income quartile

For the block groups within frac12 mile of the river the distribution among income quartiles is even more skewed towards the lowest income quartile 61 of the population living within frac12 mile of the river is in the lowest income quartile and only 6 is in the highest income quartile Given that approximately 25 of the total study population is within each income quartile these percentages indicate that living within the block groups closest to the river is associated with more than a doubling of the likelihood of being in a block group in the lowest socio-economic status quartile 436 of those living in the block groups close to the river (12 to one mile from the river) are in block groups in the lowest income quartile While less than a doubling this indicates that living in the close block groups is associated with a 75 increase in the likelihood of being in the lowest quartile for socio-economic status

Hospitalizations Table 6 shows the process of removing hospitalization records from the study for a variety of reasons The first column includes the 202272 records that contained a residential address and were located within the 12-county study area as described in the Methods section The next five columns show the records remaining after data cleaning exclusion of transfers elimination of records with missing covariates and exclusions due to poor geocoding results Approximately 33000 records were eliminated because they were transfers from one hospital to another for the same event For approximately 1100 records geocoding was inadequate for assigning block group for imputing race as described in the Methods section 336 records were missing sex (11) or age (325) Of the 167733 records remaining 154220 (92) were able to be adequately geocoded to a street address for assignment in a specific block group While approximately eight percent of hospitalizations were eliminated due to inadequate geocoding Table 6 shows that the percentages of hospitalization records for white black Asian and ldquootherrdquo categories and the distribution among diagnostic categories change very little after these records were dropped

The final column of Table 6 shows that of the 154220 hospitalizations in the analysis 101012 IHD hospitalizations comprise the largest health outcome category and are 65 of the hospitalizations in the study AMI hospitalizations (35811) account for 35 of the IHD hospitalizations and angina (4731) accounts for five percent of the IHD hospitalizations Stroke hospitalizations (53208) comprise 34 of the studyrsquos hospitalizations Approximately 36 of the stroke hospitalizations are for ischemic stroke (19371) and approximately 11 are for hemorrhagic stroke (5995)

Table 7 shows the distributions of hospitalizations among the four income quartiles While each income quartile (by definition) contains approximately 25 of the study population for all types of hospitalizations we see that the lowest income quartile includes more than 25 of hospitalizations from a minimum of 286 (IHD) to a maximum of 356 of hospitalizations (angina) The highest income quartile includes fewer than 25 of hospitalizations for each CVD

15

category ranging from 182 for angina to 224 for hemorrhagic stroke These deviations from the 25 quartile distribution show a strong linear pattern with the largest percentage of hospitalizations coming from residents of block groups in the lowest income quartile and the smallest percentages coming from residents of the most wealthy block groups For AMI the lowest income quartilersquos share of hospitalizations is 50 higher than the highest income quartilersquos share of hospitalizations (30 vs 20) IHD and stroke show similar differences and angina shows a doubling with the lowest income quartile contributing 36 of hospitalizations compared to the highest income quartilersquos 18 share of hospitalizations

Table 7 also shows the distribution of hospitalizations among the race categories and the study population race category distribution for comparison Blacks show lower percentages of IHD and AMI than their population percentages and higher percentages of angina and stroke While full presentation of race and sex findings is beyond the scope of this report the lower than expected percentages of IHD and AMI among blacks who on average have lower socioshyeconomic status and are at greater risk for CVD hospitalizations raised questions that were addressed by calculating hospitalization rates per 1000 population for each sex and race group for each income quartile (Appendix B Table 2) Figures 4 5 and 6 show these rates for the white black and Asian categories (These rates for the ldquootherrdquo category are comparatively very high and are not shown on these graphs This issue is addressed in the Discussion section) Figures 4 5 and 6 show the consistent and strong effects of income quartile on hospitalization rates for AMI and stroke for whites blacks and Asians of both sexes The other outcomes show similar patterns (The ldquootherrdquo group showed an opposite pattern [See Discussion Section])

Demographics Summary

The strong associations of lower income with residence closer to the river (Table 5) and lower income with higher hospitalization rates (Tables 7 amp 8) create a difficult challenge for this investigation The multivariable analyses of hospitalization rates attempt to control for socioshyeconomic differences However because these income differences which strongly affect hospitalization rates are also strongly correlated with the gradations of exposure (river distance) it will be difficult to disentangle the influence of socio-economic status from the influence of potential exposures

Statistical Analyses Proximity to the River and CVD Hospitalization Risk

All the variables described above were included in the multivariable regression models and almost every quartile category for every variable showed statistically significant and expected types of effects for all the CVD hospitalization categories (Appendix B Tables 1a and 1b) The following presentation of results focuses on the rate ratios for the exposure proxy variables Table 8a (IHD) and 8b (stroke) show the regression modeling results Results are shown first for models that include only age sex and race in addition to the exposure proxy categories and second for models that also include median income education population density Hispanic ethnicity and distance to nearest hospital The tables provide numbers of hospitalizations and person-years to assist with interpretation For example for total IHD we can see that while 108 of the study population lives in adjacent block groups 122 of the hospitalizations are among people in adjacent block groups This difference of 14 percentage points is the basis for

16

the finding of a statistically significantly elevated rate ratio (risk) for IHD among people living in the adjacent block groups (RR=133 CI 112-157)

Looking across the columns of Tables 8a and 8b we can see the rate ratios for each of the outcomes for the adjacent versus non-adjacent block groups and the results are similar to those for IHD The elevated rate ratios range from 123 for angina to 136 for AMI The only rate ratio not elevated in these first models adjusting only for age sex and race is for hemorrhagic stroke Going down the rows of the Table the next set of rate ratios are for the distance to the river categories The findings are very similar to those for the adjacent block groups Here however we have two exposure proxy categories allowing comparison of the two These rate ratios show no evidence that living in the block groups closest to the river (lt12 mile to the river) produces greater risk than living in the block groups close to the river (12 to 1 mile from the river)

The next set of rate ratios are for models with additional adjustment for the demographics of each hospitalized personrsquos block group as well as distance to the nearest hospital As expected adjustment for block group median income and other demographics accounts for some of the variability in hospitalization counts and thereby reduces the magnitude of the rate ratios estimated for the river adjacency and distance categories The rate ratio elevation for total IHD for example is reduced from 133 to 107 (RR=107 CI 104-111) The elevated rate ratios in these more fully adjusted models for the adjacency analyses are of similarly small magnitude ranging from 106 for AMI to 110 for ischemic stroke Four of the six outcomes show statistically significant elevations Again only hemorrhagic stroke shows no evidence of an elevated rate ratio

For the river distance categories none of the rate ratios show any evidence that living in the block groups closest to the river (lt12 mile to the river) produces greater risk than living in the block groups close to the river (12 to 1 mile) Of the six types of outcomes only two show statistically significantly elevated rate ratios for living in the closest block groups (total stroke and ischemic stroke) For the close block groups four out of six outcomes show statistically significantly elevated rate ratios The statistically significantly elevated rate ratios vary from 106 (AMI stroke) to 113 (angina)

To address the strong evidence described earlier for confounding in these data ie the association of lower income with both CVD hospitalizations and residence closer to the river we also conducted the analyses separately for each quartile of income These results are presented in Table 9 The first results are from analyses adjusting only for age sex and race and the second set of results are from models including the additional demographic factors For simplicity rate ratios are shown only for block groups closest to the river compared as in the other models to block groups farther than one mile from the river (Results were similar for the other distance categories (data not shown)

For the two lowest income quartiles Table 9 shows results similar to those for the study population as a whole (Table 8a 8b) However for the two highest income quartiles which represent frac12 of the study population Table 9 shows a consistent pattern of reduced rate ratios for residence in the closest block groups within frac12 mile of the river In the highest income quartile

17

these reduced rate ratios are statistically significant for almost all of the outcomes in both the partially and more fully adjusted models In addition the magnitude of the rate ratio estimates ie their departure from 100 is relatively high For the population living in block groups with the highest median income (income gt$73000) living within frac12 mile of the river is associated with having about 70 of the risk for IHD AMI and stroke (CI=068 069 069) as the population in equally high income block groups farther than one mile from the river In other words among the highest income quartile living this close to the river is associated with an estimated 30 reduction in risk for these types of hospitalizations For angina (CI=028) there is an estimated 70 reduction in risk and for ischemic stroke (CI=058) the reduction is estimated to be 40 These stratified analyses show that in one-half of the study population the two higher income quartiles there is no evidence of elevated hospitalization risk for populations in block groups closer to the river On the contrary reduced rate ratios are shown for all outcomes in the higher income quartiles for residents of block groups within frac12 mile of the river compared to those living farther than one mile from the river

Weighted average rate ratio estimates are a standard method for estimating a general population finding from stratified analyses and are also provided in Table 9 These average rate ratios range from 078 to 102 showing no evidence of elevated risk for the overall population associated with residence within frac12 mile of the river compared to living farther than one mile from the river Results for the other categories of river proximity showed similar results (data not shown) (Given the evidence of confounding interfering with analyses of the entire study population and the opposing findings depending on income quartile no confidence intervals are provided here for the overall population-weighted average estimates)

Summary of Results

In summary the multivariable regression results for the entire study population (Tables 8a 8b) show statistically significantly elevated rate ratio estimates for the adjacent versus non-adjacent block groups for four out of six hospitalization diagnosis categories The analyses also produce statistically significantly elevated rate ratio estimates for block groups closest to the river (less than frac12 mile from the river) and close to the river (between 12 to 1 mile from the river) compared to distant block groups (farther than 1 mile from the river) for most of the outcome categories However the findings provide no evidence that the closest block groups have higher hospitalization rates than the close block groups

The most elevated rate ratio estimate is 113 (angina) indicating an estimated 13 percent increase in risk for angina hospitalizations for residents of close versus distant block groups The only outcome with no statistically significant elevations for living near (adjacent closest or close) the river in the more fully adjusted models is hemorrhagic stroke It is noteworthy that the outcome that is considered an indicator of inadequate primary care rather than of incidence or prevalence angina hospitalizations shows the highest elevations associated with residence near the river (Table 8a) In addition the hospitalization outcome with the least association with socioshyeconomic status in our data (Appendix B Table 1b) and the lowest out-of-hospital mortality (NYS DOH2009b) hemorrhagic stroke is the outcome that shows no pattern of elevations associated with residence near the river (Table 8b) This pattern of findings is consistent with the interpretation that the rate ratios for living closer to the river are reflecting the influence of

18

lower socio-economic status and other known risk factors for hospitalizations for CVD rather than any unusual environmental exposures in areas near the river

The stratified results add strong support to this conclusion They suggest that classical confounding of the exposure-disease relationship by socio-economic status produced biased results in the overall multivariable regression analyses While the regression analysis of the entire study population shows elevated hospitalization rates for residents of block groups near the river compared to residents living farther away (Table 8a and 8b) the stratified analyses (Table 9) show this association applies only to the 50 of residents living in block groups with lower median incomes For the other 50 of residents those who live in block groups with higher median incomes hospitalization rate ratios are reduced for those living near the river Classical confounding refers to the type of confounding that can occur in individual-level or group-level (ecological) studies when a factor such as socio-economic status affects both the exposure (residential location) and the health effect In this study of CVD hospitalizations there are disproportionate numbers of outcomes and exposures (residence near the river) among specific levels of the confounder the lower income quartiles (Tables 4 5 7) The disproportionate numbers of hospitalizations and potential exposures (block groups close to the river) among the lower income quartiles create an artifactual exaggerated ie biased effect on the estimates for the overall analysis (Greenland 1999 Greenland 2003)

5 DISCUSSION

The key finding of this investigation is that the results of the multivariable regression modeling for the entire study population the elevated CVD hospitalization rates for populations living closer to the Hudson River are likely to be artifacts of classical confounding Stratifying on the confounding variable ie holding median income level constant the standard method for controlling for confounding showed the decisive role of income as a confounding variable The associations between living closer to the river and having higher hospitalization rates were evident only for people in the two lowest income quartiles For people in the two highest income quartiles the association was completely opposite ie people living closer to the river had lower rates of hospitalizations This variability of effects can be interpreted as evidence that there are no actual effects of the exposure-proxy categories on hospitalization rates

This type of classical confounding due to the strong influence of a variable that precedes causes or strongly affects both the exposure and the health outcome can occur in any type of epidemiological study The ecologic ie group-level analysis in this study is not the primary cause of the confounding Rather the complex associations in the population among income education CVD risk geographic residential patterns and geographic assignment of exposure status are the basis for the confounding However the studyrsquos group-level design contributes to the studyrsquos inability to completely control for confounding using multivariable regression methods and contributes to difficulties of interpreting the study findings

The stratified results shown in Table 9 may also be a reflection of inadequate control for socioshyeconomic status using quartile groupings A possible explanation for the Table 9 findings is that among lower income quartile populations people living in closest proximity to the river may have the lowest incomes while in higher income quartile populations people living in closest

19

proximity to the river may have the highest incomes This potential income difference associated with river distance within quartile groupings would explain why the stratified analyses showed positive associations with hospitalization risk for lower income residents closest to the river and negative associations with hospital risk for higher income residents closest to the river In this example of a possible explanation it is assumed that river proximity categories are capturing relative income (and hospitalization risk) differences rather than exposure Because the exposure proxy river distance categories are assigned at the group level this type of analysis is not able to distinguish whether these categories are capturing income rather than potential exposure in the analyses

When trying to interpret this studyrsquos findings it is important to keep in mind the many factors associated with CVD risk that are not controlled for in this studyrsquos analyses These include tobacco use high blood pressure high serum cholesterol alcohol use obesity low fruit and vegetable intake physical inactivity diabetes mental ill-health psychosocial stress and the use of certain medications Another factor that is not known to be associated with CVD risk but that is relevant for PCB exposure is consumption of contaminated fish It is possible that lower income populations are more exposed to PCBs because they are more likely to eat fish from the Hudson River This however would not help explain why higher income people near the river showed lowered CVD hospitalization risk than their counterparts living farther from the river

In addition to the confounding due to socio-economic statusrsquos effects on both the risk of CVD and residential location (the exposure proxy) there is the possibility that another pattern of effects also associated with socio-economic status called effect modification plays a role in these data Effect modification occurs when different groups people with differing types of health insurance or differing behaviors for example show different patterns or levels of health effects associated with the same exposures The differing effects (CVD hospitalization risks) of residence in block groups closest to the river by income quartile could be interpreted as evidence of effect modification (Table 9) However the magnitude and the change in direction of effects is unlikely to be due solely to differing biological or behavioral factors associated with income level that affect the results of exposures from living near the river Given the strong associations of socio-economic status and CVD hospitalization risk it is more likely that the analyses stratified by income enhance protection against bias compared to the multivariable models and that the stratified analyses represent findings from better control for confounding as opposed to effect modification

Effect modification is a concept used to describe important relationships between exposures and effects (that may differ among sub-groups for example) that the researcher wants to identify and understand Confounding on the other hand refers to associations among the variables that the researcher is trying to control so that the effects of interest (effects from exposures rather than age or sex for example) can be identified and the strength of the effects can be estimated The ecological design of this study contributes to the difficulty of interpreting whether the study shows evidence of effect modification as opposed to confounding (Morgenstern 1998b Hertz-Picciotto 1998)

Another limitation associated with this studyrsquos design is that block-group level socio-economic and environmental attributes that are assigned to individuals in this type of study can be more

20

strongly correlated at the group level than at the individual level In this way the ecological assignment of variables may lead to increasing problems of confounding because of strong associations among indicators of socio-economic status and environmental exposure categories (Morganstern 1998a) This problem is likely to have affected this studyrsquos results As with effect modification by group described above this type of confounding by group cannot be evaluated with ecologic data because individual-level information is needed to check for this type of ldquocross-levelrdquo bias

Ecological analyses are an important tool for environmental epidemiology and there is therefore a large and growing literature on their strengths and limitations and methods to address limitations (Piantadosi 1988 Greenland 1989 Greenland 1992 Greenland 1994 Richardson 2000) One particular concern is the interpretation of ecological study findings showing modest elevations of risk such as in the current study with rate ratios in the more fully adjusted models generally estimated to be below 110 (Wakefield 2003 Morgenstern 1998) Wakefield states that slight elevations need to be interpreted with great caution

in particular for cancer and heart disease (because) there are risk factors that are far more predictive of disease than environmental factors hellip Consequently the potential for confounding is strong since ecological studies do not directly use individual-level risk factor data although stratification by age and gender is routinely carried out (Frequently an area-level measure of socio-economic status is also used as a confounding variable but being an aggregate summary it only provides very crude control) (Wakefield 2003 p 9-10)

A strength of this study is that it used Census block groups which are relatively small average population of 1200 in the study area to assign socio-economic characteristics rather than the larger ZIP codes with average population size 8000 in this area By using block groups this study sought to reduce confounding by socio-economic status because the assignment of median income and other variables at the block group level was expected to be more accurate with block group populations expected to be more homogenous than larger and more geographically diffuse ZIP code populations used in the prior studies

The use of census block groups rather than ZIP codes likely enhanced the accuracy of the group-level category assignments However this type of design that assigns characteristics to hospitalizations using group-level characteristics and distance to the river is limited in comparison to individual-level designs that gather such information from personal interviews for example Most importantly for this investigation the group-level assignment of median income and other demographic variables to each hospitalization limited the studyrsquos ability to distinguish among individuals within the exposure proxy categories and thereby reduced the studyrsquos ability to control for confounding

This type of measurement error or inaccuracy from assigning group-level potential exposure income or educational level to the individual hospitalizations is generally assumed to bias a study towards findings of no associations or no effects But in situations where the measurement error is more severe for some variables than others and these variables are correlated then the study findings can be biased either towards exaggerated effects or no effect In this

21

investigation potential measurement error associated with the assignment of exposure categories based on block group centroid distance to the river is expected to be relatively low because a large proportion of the hospitalization addresses are very clearly in areas distant from the river The measurement error associated with the assignment of group-level demographic characteristics (income and education for example) from the Census is potentially relatively higher because this type of error can occur throughout every block group in the study

Another strength of this study is the use of NYS SPARCS data a relatively comprehensive population-based source of information about hospitalizations These data include individual-level information used in this study for patient address sex age race and diagnosis Of these variables only the address and race categories were missing information for substantial numbers of records and this limitation could have contributed to imprecision sufficient to create bias in the findings These issues are discussed in greater detail below

While geocoding to block groups represents a strength of this study this procedure required high quality address information One percent of the study arearsquos hospitalization records contained no address and eight percent of the addresses could not be accurately geocoded to a block group resulting in nine percent of hospitalizations being eliminated from the study because of address andor geocoding issues The distributions of race categories and distribution among the types of diagnoses did not change after dropping records with no or insufficient address information suggesting that the dropped records did not represent a substantially skewed group of records

Inaccuracy in the studyrsquos assignments of race categories results from limitations associated with both the hospitalization and Census data The SPARCS hospitalization database did not include information on race for seven percent of the studyrsquos hospitalizations We imputed race by assigning race category randomly based on the race distribution of the personrsquos block group This method would have biased the study toward a finding of no effect of race on hospitalization risk

Another limitation of the SPARCS data used in this investigation is that repeat hospitalizations for the same individual were not able to be identified If an individual was admitted to the hospital many times over the course of the study period then that individual would be counted multiple times This limitation may have contributed to an exaggerated association of lower socio-economic status and CVD hospitalizations because some people with lower incomes receive less adequate outpatient care and may have more repeat admissions Because of the association of lower income with residence near the river this limitation may have contributed to the difficulty of adequately controlling for confounding by socio-economic status in this investigation

While the use of Census data at the block group level rather than at the ZIP code level is a strength in terms of relative homogeneity among the population within block groups it is possible that block group population estimation errors contributed to imprecise estimation of hospitalization rates Particularly with regard to the race categories the study finding that the ldquootherrdquo category showed a counter-trend of higher income and higher CVD hospitalization rates (Appendix B Table 2) is evidence that differences between the categories used in the Census race designations and hospitalization race assignments affected the estimated hospitalization

22

rates The evolving choices for identifying onersquos race in the Census are not concurrently reflected in the SPARCS data and these inconsistencies likely contributed to imprecision in the estimates of CVD hospitalization risk associated with specific race categories These issues did not contribute to bias for the overall findings however We modeled the data with a variety of race category groupings combining black and ldquootherrdquo for example and the overall results did not change (data not shown)

Also regarding the Census undercounting of approximately two percent for the US population in 2000 was estimated to be primarily among the lowest socio-economic groups and minorities while over-counting of one percent was estimated to have occurred among the most wealthy who sometimes received and completed forms at multiple residences (United States Censusrsquo Monitoring Board 2001) The undercounting of the lower income population would result in over-estimation of rates in this population Undercounting of recent Asian immigrants for example particularly those of lower socio-economic status is a potential reason for the relatively high rates of CVD hospitalizations estimated for lower income quartiles of the Asian population (Figures 4-6) Models that grouped the Asian population with other race categories were also estimated and these showed the same results as the models reported here (data not shown) Undercounting of lower income groups in general would have contributed to the finding of higher hospitalization rates in these groups and added to the difficulty of adequately controlling for confounding of lower income with residence near the river

This study used a 50 percent sample of the CVD hospitalizations in the study area This was accomplished by using every other record provided in the complete dataset This would be expected to provide a non-biased random sample As a general check for data provision management or analysis errors we compared our estimates of hospitalization rates per population with those from other sources Table 10 shows these comparisons and shows that our estimates are generally consistent with rates from other sources

Actual exposures were not measured in this investigation Rather proximity to the river was used as a proxy for potential exposures associated with the river In addition other potential exposures that may impact CVD were not evaluated or measured Exposures to lead and air pollution would be expected to be higher in more urban areas and areas near heavy traffic which may tend to coincide with proximity to the river These issues were not addressed in this investigation If these exposures were occurring more often near the river and contributing to cardiovascular hospitalization rates this would contribute to findings of elevations in areas closer to the river

It is important to keep in mind the limitations associated with using CVD hospitalizations when incidence andor prevalence of CVD are the actual outcomes of interest As described previously mortality prior to reaching the hospital may reduce hospitalization rates particularly for AMI and stroke for some groups or areas more than others and introduce bias if hospitalizations are being interpreted as a surrogate for incidence or prevalence

23

Addressing Study Objectives

This follow-up report has addressed the studyrsquos first two objectives to increase control for confounding by socio-economic status through the use of block groups rather than ZIP codes as the unit of analysis and to compare the hospitalization rates among residents of block groups nearest the Hudson River to those among residents of block groups farther from the Hudson River Regarding the first objective the use of smaller units of analysis block groups rather than ZIP codes improves the accuracy of the assignment of income and other group-level demographic categories to the hospitalizations

Regarding the studyrsquos second objective to compare hospitalization rates among residents of block groups near to the river to hospitalization rates among residents far from the river this studyrsquos multivariable analyses showed relatively small but statistically significant elevations of CVD hospitalization risk for residents of block groups in close proximity to the Hudson River Stratified analyses that controlled for confounding directly by conducting analyses within each income quartile showed that residence in block groups near the Hudson River was associated with increased risk for CVD hospitalizations among residents of lower income block groups and conversely residence in block groups near the Hudson River was associated with decreased risk for CVD hospitalizations among residents of higher income block groups The estimated increased and decreased risks were of sufficient magnitude to nearly cancel each other out when average rate ratios for the four income quartiles were calculated The average rate ratios may be interpreted as showing no evidence that CVD hospitalization rates are elevated for residents living near the river for the general study population This finding suggests that the multivariable regression findings for the overall population show elevated risks for hospitalizations among residents of block groups near the river due to confounding of the relationship of residential location and CVD risk by socio-economic status

The findings from analyses stratified by income quartile may also be interpreted as showing that residing in a higher versus lower income quartile block group produces different effects from similar exposures (effect modification) As discussed previously the ecological ie group-level analyses are unable to directly evaluate confounding and effect modification Rather information about the strengths and direction of associations among variables in the analyses need to be evaluated in light of known risk factors to draw conclusions about likely explanations and plausible interpretations In this case uncontrolled confounding by socio-economic status appears to be the most likely explanation for the findings This is due to the strong association between residential proximity to the river and socio-economic status

The studyrsquos third objective was to compare the findings of this review to those of the earlier studies that prompted this investigation Because of the similarities in study design and statistical analysis the prior studyrsquos findings can be compared with the findings of the multivariate regression analyses in the current study using Table 1 and Tables 8a and 8b The major difference between the studies is the composition of the comparison areas for the analyses Because of this difference as well as other differences (Table 3) the studies are not entirely comparable

24

Table 1 shows some findings from the prior studies Comparison with Table 8a and 8b from this study shows that the CV hospitalization rate ratios estimated for living in block groups adjacent or near the river in the current study are substantially lower than the rate ratios for residence in ZIP codes adjacent to the Hudson River in the previously published studies In most cases the current analyses show elevations below 10 in contrast to elevations of 20 to 39 in the prior studies The results of the current study are quite similar to the prior study results in terms of the associations reported for age sex race and income (Appendix B Tables 1a and 1b) (Sergeev and Carpenter 2005 Tables 4 amp 5 Shcherbatykh et al 2005 Table 3) These similarities and the general methodological similarities between the current and prior studies suggest the likelihood that the prior studiesrsquo findings also were affected by confounding by socio-economic status

6 CONCLUSION AND RECOMMENDATIONS

While this type of study can neither prove nor disprove causal links among risk factors and health effects it is useful for examining evidence for unusual patterns that might warrant additional investigation This evaluation of hospitalization data residential location and socioshyeconomic factors in block groups in the 12 Counties abutting the Hudson River revealed a closely coinciding pattern of lower socio-economic status and residential proximity to the Hudson River the proxy for potential exposure and a strong association of lower socioshyeconomic status and risk for CVD hospitalizations The strong associations among residential location socio-economic status and CVD risk prevent us from drawing definitive conclusions from the multivariable regression analyses

The multivariable analyses showed relatively small but statistically significant elevations of CVD hospitalization risk for residents of block groups in close proximity to the Hudson River Stratified analyses conducted within each of the four income quartiles showed such elevations only among residents of lower income block groups This studyrsquos data methods and analyses can not provide a conclusive interpretation for this finding However the strong effect of socioshyeconomic status on residential location (exposure proxy) and on CVD hospitalization risk makes it likely that the overall multivariable regression results are affected by uncontrolled confounding by socio-economic status

To draw more definitive conclusions particularly for outcomes such as CVD that are determined by multiple known risk factors studies seeking to investigate the role of PCB or POP exposures will require resource-intensive methods that include gathering individual-level exposure or biomonitoring information as well as individual-level medical histories and information on other risk factors

This current investigation did not include environmental or biological sampling data One study described previously did gather such data in the Hudson River region with the highest potential exposures (Hudson Falls Fort Edward) This study detected a statistically significant reduction in the total PCB concentrations in outdoor air samples at a distance of 1200 meters (34 mile) from the river However the PCB levels in outdoor air in this area were lower than those in other communities with known PCB-contaminated sites and similar to levels reported in other locations in the northeastern United States The study found no detectable differences in study

25

subjectsrsquo serum PCB levels associated with proximity to contaminated portions of the river or other PCB sources (Fitzgerald 2007 Palmer 2008)

It is recommended that the results of the current investigation of CVD hospitalizations and residence near the Hudson River be shared with NYS residents and other stakeholders with interest in this issue and be submitted for publication in a peer-reviewed journal

26

AGENCY INFORMATION

New York State Department of Health Authors Elizabeth Lewis-Michl PhD

Research Scientist Bureau of Environmental and Occupational Epidemiology

June Beckman-Moore MPH Research Scientist Bureau of Environmental and Occupational Epidemiology

Marta Gomez MS Research Scientist Bureau of Environmental and Occupational Epidemiology

Acknowledgments This study was supported by ATSDR and the Centers for Disease Control and Prevention

through a Cooperative Agreement Grant to NYS DOH (3U61TS200002-19W1)

The authors would like to thank the following people for their contribution to this project Alissa Caton PhD of the Bureau of Environmental and Occupational Epidemiology NYS DOH developed the study protocol assembled the preliminary data and conducted preliminary analyses Syni-an Hwang PhD Bureau Director and Shao Lin PhD Research Scientist Bureau of Environmental and Occupational Epidemiology NYS DOH and Edward Fitzgerald PhD Professor Division of Environmental Health and Toxicology SUNY Albany School of Public Health reviewed and provided comments on draft versions

New York State Department of Health Cooperative Agreement Coordinator Don Miles MS

Public Health Specialist Bureau of Environmental Exposure Investigation

ATSDR Technical Project Officer Gregory V Ulirsch PhD

Environmental Health Scientist Cooperative Agreement and Program Evaluation Branch

Division of Health Assessment and Consultation

Mohammed Uddin MD Epidemiologist

Division of Health Studies

ATSDR Regional Representatives Leah Graziano

Senior Regional Representative - Region 2 Office of Regional Operations

Racquel Stephenson Regional Representative - Region 2

Office of Regional Operations

27

REFERENCES Agency for Toxic Substances and Disease Registry (ATSDR) 2000 Toxicological Profile for Polychlorinated Biphenyls (PCBs) ATSDR Division of ToxicologyToxicology Information Branch Atlanta GA

Arisawa K Takeda H Mikasa H 2005 ldquoBackground exposure to PCDDsPCDFsPCBs and its potential health effects a review of epidemiologic studiesrdquo The Journal of Medical Investigation Feb 52(1-2)10-21

Arsenescu V Arsenescu RI King V Swanson H Cassis LA 2008 ldquoPolychlorinated biphenylshy77 induces adipocyte differentiation and proinflammatory adipokines and promotes obesity and atherosclerosisrdquo Environmental Health Perspectives 116 (6) 761-768

Ayala C Croft J Keenan N Neff L Greenlund K Donohoo R Zheng Zhi Mensah G 2003 ldquoIncreasing Trends in Pre-transport stroke deaths- United States 1990-1998rdquo Ethnicity amp Disease 12 Spr S2131-S2137

Barnett E Armstrong D Casper M 1999 Evidence of increasing coronary heart disease mortality among black men of lower social class AEP 98 464-471

Bertazzi PA Bernucci I Brambilla G Consonni D Pesatori AC 1998 ldquoThe Seveso studies on early and long-term effects of dioxin exposure a reviewrdquo Environmental Health Perspectives April 1062 625-633

Carpenter DO 2006 ldquoPolychlorinated biphenyls (PCBs) routes of exposure and effects on human healthrdquo Reviews in Environmental Health Jan-Mar 21(1)1-23

deVreede-Swagemakers JJ Gorgels APDubois-Arbouw WI van Ree JW et al 1997 ldquoOut-of hospital cardiac arrest in the 1990s a population-based study in the Maastricht area on incidence characteristics and survivalrdquo Journal of the American College of Cardiology 30(6) 1500-5

Everett CJ Mainous AG Frithsen IL Player MS Matheson EM 2008 ldquoAssociation of polychlorinated biphenyls with hypertension in the 1999-2002 National Health and Nutrition Examination Surveyrdquo Environmental Research 108 94-97

Fitzgerald EF Belanger EE Gomez MI Hwang S Jansing RL Hicks HE 2007 ldquoEnvironmental exposures to polychlorinated biphenyls (PCBS) among older residents of upper Hudson River communitiesrdquo Environmental Research 104 352-360

Galea S Blaney S Nandi A Silverman R Vlahov D Foltin G Kusick M Tunik M Richmond N 2007 ldquoExplaining racial disparities in incidence of and survival from out-of-hospital cardiac arrestrdquo American Journal of Epidemiology 1665 534-543

Greenland S Morgenstern H 1989 ldquoEcological bias confounding and effect modification International Journal of Epidemiology 18269-74

28

Greenland S 1992 ldquoDivergent biases in ecologic and individual level studiesrdquo Statistics in Medicine 111209-23

Greenland S Robins J 1994 ldquoInvited commentary ecologic studies ndash biases misconceptions and counterexamples American Journal of Epidemiology 139747-60

Greenland S Pearl J Robins JM 1999 Causal diagrams for epidemiologic research Epidemiology 1037-48

Greenland S 2003 ldquoQuantifying biases in causal models classical confounding vs collidershystratification biasrdquo Epidemiology May Vol 143 300-306

Ha M Lee D Jacobs DR 2007 ldquoAssociation between serum concentrations of persistent organic pollutants and self-reported cardiovascular disease prevalence results from the national health and nutrition examination survey 1999-2002rdquo Environmental Health Perspectives 1158 1204-1209

Hennig B Meerarani P Slim R Toborek M Daugherty A Silverstone AE Robertson LW 2002 ldquoProinflammatory properties of coplanar PCBs in vitro and in vivo evidence Toxicology and Applied Pharmacology Jun 15181(3)174-83

Hertz-Picciotto I ldquoEnvironmental Epidemiologyrdquo in Modern Epidemiology Rothman KJ and Greenland S 2nd ed Philadelphia Lippincott Williams amp Williams 1998 pp555-584

Howard G Cushman M Prineas R Howard V Moy C Sullivan L DrsquoAgostino R McClure L Pulley L Safford M 2009 ldquoAdvancing the hypothesis that geographic variations in risk factors contribute relatively little to observbed geographic variations in heart disease and stroke mortalityrdquo Preventive Medicine 49 129-132

Huang X Lessner L Carpenter DO 2006 ldquoExposure to persistent organic pollutants and hypertensive diseaserdquo Environmental Research 102(2005) 101-106

Kannel WB Cupples LA DrsquoAgostino RB 1987 ldquoSudden death risk in overt coronary heart disease the Framingham Studyrdquo American Heart Journal 113(3) 799-804

Ko DT Krumholz HM Wang Y Foody JM Masoudi FA Havranek EP You JJ Alter DA Stukel TA Newman AM Tu JV 2007 ldquoRegional differences in process of care and outcomes for older acute myocardial infarction patients in the United States and Ontario Canadardquo Circulation 115196-203

Lee D Lee I Song K Steffes M Toscano W Baker BA Jacobs DR 2006 ldquoA strong dose-response relation between serum concentrations of persistent organic pollutant and diabetesrdquo Diabetes Care 29 (7) 1638-1644

Lee D Lee I Porta M Steffes M Jacobs DR 2007 ldquoRelationship between serum concentrations of persistent organic pollutants and the prevalence of metabolic syndrome among non-diabetic

29

adults results from the National Health and Nutrition Examination Survey 1999-2002rdquo Diabetologia 50 1841-1851

Lind PM Orberg J Edlund U Sjoblom L Lind L 2004 ldquoThe dioxin-like pollutant PCB126 (33rsquo44rsquo5-pentachlorobiphenyl) affects risk factors for cardiovascular disease in female ratsrdquo Toxicology Letters 150 293-299

Longnecker MP Klebanoff MA Brock JW Zhou H 2001 ldquopolychlorinated biphenyl seum levels in pregnant subjects with diabetesrdquo Diabetes Care 24(6) 1099-1101

Lloyd-Jones D Adams R Carnethon et al (AHA Statistics Committee and Stroke Statstics Subcommittee) 2009 ldquoAHA Statistical Update Heart disease and stroke statisticsmdash2009 updaterdquo Circulation 119e21-e181

MapMarker Plus V 100 MapInfo Corp 2004

Morgenstern H 1998 ldquoEcologic studyrdquo In Armitage P Colton T eds Encyclopedia of biostatistics Vol2 NY NY John Wiley and Sons Inc 1255-76

Morganstern H ldquoEcologic Studiesrdquo Chapter 23 in Modern Epidemiology Rothman KJ and Greenland S 2nd ed Philadelphia Lippincott Williams amp Williams 1998 pp459-480

Morland K Wolff M Bopp R Godbold J Landrigan P 2008 ldquoFish consumption and body burden of organochlorines among Lower Hudson urban anglersrdquo American Journal of Industrial Medicine 51587-594

New York State Department of Health (NYS DOH) 2009a ldquoThe Burden of Cardiovascular Disease in New York Mortality Prevalence Risk Factors Costs and Selected Populationsrdquo Bureau of Chronic Disease Epidemiology and Surveillance Bureau of Health Risk Reduction httpwwwhealthstatenyusnysdohchronic_diseasecardiovascularburdenofcvdinnyspdf

New York State Department of Health (NYS DOH) 2009b ldquoHeart Disease and Stroke Indicatorsrdquo Community health assessment indicators Bureau of Biometrics and Health Statistics httpwwwhealthstatenyusstatisticschacchai

OrsquoNeill L 2003 ldquoEstimating out-of-hospital mortality due to myocardial infarctionrdquo Health Care Management Science 6 147-154

Palmer PM Belanger EE Wilson LR Hwang SA Narang RS Gomez MI Cayo MR Durocher LA Fitzgerald EF ldquoOutdoor air PCB concentrations in three communities along the Upper Hudson River New Yorkrdquo 2008 Archives of Environmental Contamination and Toxicology 54363-371

Piantadosi S Byar DP Green SB 1988 ldquoThe ecological fallacyrdquo American Journal of Epidemiology 127893-904

30

Rehkopf DH Berkman LF Coull B Krieger N 2008 ldquoThe non-linear risk of mortality by income level in a healthy population UgtSgt National Health and Nutrition Examination Survey mortality follow-up cohort 1988-2001rdquo British Medical Journal Public Health 8383

Richardson S Montifort C 2000 ldquoEcological correlation studies In Elliott P Wakefield JC Best NG et al eds Spatial epidemiology methods and applications NY NY Oxford University Press 205-220

SAS version 9 Cary North Carolina

Schoenbach V 2001 ldquoMulticausality Effect Modification wwwepidemiolognet Ch12 381shy413

Schoenbach V 2004 ldquoMulticausality Confounding wwwepidemiolognet Ch11 335-375

Sergeev AV and Carpenter DO 2005 Hospitalization rates for coronary heart disease in relation to residence near areas contaminated with persistent organic pollutants and other pollutants Environmental Health Perspectives 113(6)756-761

Shcherbatykh I Huang X Lessner L Carpenter DO 2005 Hazardous waste sites and stroke in New York State Environmental Health A Global Access Science Source 4(18) httpwwwehjournalnetcontent4118

Siemiatycki J Wacholder S Dewar R Cardis E Greenwood C and Richardson L 1988 ldquoDegree of confounding bias related to smoking ethnic group and socio-economic status in estimates of the associations between occupation and cancerrdquo Journal of Occupational Medicine 30 617-625

Tu YK Gunnell D Gilthorpe MS 2008 ldquoSimpsonrsquos paradox Lordrsquos paradox and suppression effects are the same phenomenon- the reversal paradoxrdquo Emerging Themes in Epidemiology 52

United Kingdom Heart Attack Study Collaborative Group Norris RM 1998 ldquoFatality outside hospital from acute coronary event in three British health districts 1994-1995rdquo British Medical Journal 316(7137) 1065-70

United States Bureau of the Census 2000 Census of population and housing summary file 1 (SF1) United States Department of Commerce 2001

United States Bureau of the Census 2000 Census of population and housing summary file 3 (SF3) United States Department of Commerce 2002

United States Bureau of the Census 1990 Census of population and housing summary file 1 (SF1) United States Department of Commerce 1991

31

United States Bureau of the Census 1990 Census of population and housing summary file 3 (SF3) United States Department of Commerce 1992

United States Censusrsquo Monitoring Board ldquoFinal Report to Congressrdquo September 1 2001 httpgovinfolibraryunteducmbcmbpreportsfinal)report

United States Department of Health and Human Services (USDHHS) Agency for Healthcare Research and Quality 2007 ldquoAHRQ Quality Indicators Guide to Prevention Quality Indicators Hospital Admission for Ambulatory Care Sensitive Conditionsrdquo October 2001 Version 31 (March 12 2007) downloaded 2009 httpwwwqualityindicatorsahrqgov

United States Department of Health and Human Services (USDHHS) Centers for Disease Control 2007 Health United States A Chartbook National Center for Health Statistics wwwcdcgovnchsdatahushus07pdf

United States Department of Health and Human Services (USDHHS) Centers for Disease Control National Center for Health Statistics 2007 ldquo2005 National Hospital Discharge Surveyrdquo No 385 by De Frances C and Hall M Division of Health Care Statistics

United States Department of Health and Human Services (USDHHS) 2009 Centers for Disease Control and Prevention National Center for Environmental Health 4th National Report on Environmental Chemicals httpwwwcdcgovexposurereportpdfFourthReportpdf

United States Department of Health and Human Services (USDHHS) National Institutes of Health 2010 National Institute of Environmental Health Sciences National Toxicology Program 11th Report on Carcinogens httpntpniehsnihgovntproceleventhprofiless149pcbpdf

United States Environmental Protection Agency (USEPA) 2002 EPA Superfund record of decision Hudson River PCBs Site Hudson Falls to New York City New York Washington DC

Wakefield J 2003 Sensitivity Analyses for Ecological Regression Biometrics March 59 9-17

Wakefield J 2004

Wing S 1988 Social inequalities in the decline of coronary mortality American Journal of Public Health 781415-1416

World Health Organization 2010 Atlas of Heart Disease and Stroke Part 2 Ch 3 Cardiovascular Disease Risk Factors httpwwwwhointcardiovascular_diseasesencvd_atlas_03_risk_factorspdf

32

Table 1 Previous Study Results Statewide Analyses and Hudson River Specific Analyses

Principal Diagnosis of Hospitalization

Adjusted Rate Ratioa (Lower 95CI shyUpper 95 CI)

NYS analysis Hudson River Subset

analysis Ischemic Heart Diseaseb

115 (103-129) 136 (118-156) Acute Myocardial Infarction 120 (103-139) 139 (118-163)

Cerebrovascular Diseasec 115 (105-126) 120 (110-132)

Ischemic Stroke 117(104-139) --shy Hemorrhagic Stroke 110 (099-122) ---

Hypertensionc 119 (108-131) 114 (105-123)

Table adapted from Sergeev et al 2005 Shcherbatykh et al 2005 and Huang et al 2006 Ischemic heart disease and stroke hospitalization rates for NYS ZIP codes containing (or

adjacent to) inactive hazardous waste sites with persistent organic pollutants compared to ZIP codes without (or not adjacent to) such sites

a All analyses include a 2nd exposure category for other types of inactive hazardous waste sites and adjust for age race and sex

b Ischemic heart disease analyses include all ages over 25 Ischemic heart disease analyses adjust for quartiles of median household income and health insurance coverage

c Cerebrovascular disease and hypertension analyses include ages 25-64

33

Table 2 Comparison of Design amp Methods for Previously Published Studies and Current Follow-up Study

OUTCOME ASSESSMENT CONFOUNDER ASSESSMENT EXPOSURE ASSESSMENT Previous Study Follow-up Study Previous Study Follow-up Study Previous Study Follow-up Study

Coronary Heart Disease 1993-2000 More years 1990-2005 ZIP code level Block group level ZIP code level Block group level

Ischemic heart disease (ICD-9 410shy Same outcomes with 4 levels of income ndash Same income Exposure = residence Two types of exposure 414) and sequelae of myocardial one addition separate 2000 Census categories within 196 ZIP codes indicator categories within infarction not elsewhere classified analysis of angina (ICDshy quartiles with hazardous waste 12 counties abutting (ICD-9 4297) Separate analysis of 9 413) sites containing POPs Hudson River acute myocardial infarction 6 age groups Same age groups or 222 ZIP codes with (ICD-9 410) age 25 and above hazardous waste sites Adjacency 255 block

without POPs versus groups adjacent versus Principal or other diagnoses (up to 14) Only principal diagnoses 4 race groups Different race residence within 996 2116 not adjacent to the combined AsianPacific categories White ldquocleanrdquo ZIP codes Hudson River

Use the admission Islander White black AsianPacific statewide Reference Sergeev AV Carpenter source field to eliminate black Native Island and Other which Distance Block group DO Hospitalization rates for coronary intrahospital transfers for American includes Native Sub-analyses (focus of population-weighted heart disease in relation to residence the same individual American and Multiple follow-up) Hudson centroid distance from the near areas contaminated with Race River area 78 POP ZIP Hudson River lt12 mile persistent organic pollutants and other codes versus 996 ldquocleanrdquo (225 block groups) pollutants Environ Health Perspect Add education ZIP codes statewide between frac12 and 1 mile 2005 113(6)756-761 population density (216 block groups) farther

hispanic and distance to than 1 mile (1930 block nearest hospital as groups) additional covariates

Stroke 1993-2000 More years 1990-2005 Restricted to 2nd and (same as above) (same as above) (same as above) 3rd quartile income

Cerebrovascular disease excluding Same outcomes ZIP codes other ill-defined and late effects of cerebrovascular disease (ICD9 430shy 4 age groups 25-64 436) Separate analyses of ischemic stroke (433x1 434x1 436) and 2 race groups hemorrhagic stroke ( 430-432) White black

Principal or secondary diagnoses (up Only principal diagnoses to 14) combined

Use the admission Reference Shcherbatykh I Huang X source field to eliminate Lessner L Carpenter DO Hazardous intrahospital transfers for waste sites and stroke in New York the same individual State Environ Health 2005 418

34

Table 3 Study Cardiovascular Disease Outcomes ICD-9-CM Codes and Definitions

Ischemic Heart Disease

ICD-9-CM 410 - ldquoAcute myocardial infarction (AMI)rdquo is defined as a sudden insufficiency of blood supply to an area of the heart muscle usually due to a coronary artery occlusion (obstruction) commonly known as heart attack

ICD-9-CM 411- ldquoOther acute and subacute forms of ischemic heart diseaserdquo refers to complications or symptoms occurring before or after AMI or a coronary artery occlusion (obstruction) interrupting blood flow to the heart without AMI

ICD-9-CM 412 - ldquoOld myocardial infarctionrdquo refers to a past AMI that is currently presenting no symptoms ICD-9-CM 413 - ldquoAngina pectorisrdquo is defined as severe constricting pain in the chest often radiating from the

area immediately over the heart to the left shoulder and down the arm ICD-9-CM 414 - ldquoOther forms of chronic ischemic heart diseaserdquo includes coronary atherosclerosis a chronic

condition marked by thickening and loss of elasticity of the coronary artery caused by deposits of plaque

ICD-9-CM 4297 - ldquoCertain sequelae of myocardial infarction not elsewhere classifiedrdquo refers to other symptoms or complications associated with a prior AMI

Cerebrovascular Disease

ICD-9-CM 430 - ldquoSubarachnoid hemorrhagerdquo is bleeding in the space between the brain and lining ICD-9-CM 431 - ldquoIntracerebral hemorrhagerdquo is bleeding within the brain ICD-9-CM 432 - ldquoOther and unspecified intracranial hemorrhagerdquo is bleeding nontraumatic between the skull

and brain lining ICD-9-CM 433 - ldquoOcclusion and stenosis of precerebral arteriesrdquo is blockage or stricture of the arteries

branching into the brain ICD-9-CM 434 - ldquoOcclusion of cerebral arteriesrdquo includes cerebral thrombosis and cerebral embolism ICD-9-CM 435 - ldquoTransient cerebral ischemiardquo includes cerebrovascular insufficiency (acute) with transient

neurological signs and symptoms ICD-9-CM 436 - ldquoAcute but ill-defined cerebrovascular diseaserdquo

35

Table 4 ndash Census Demographics for Study Area by Proximity to the Hudson River 2000

Census Demographics Study Area Within Block Groups Adjacent to River Block Group Centroid Distance to River Yes No lt frac12 mile frac12-1 mile gt 1 mile

n n n n n n Total Population and distribution within river distance categories

2924631 1000 315662 108 2608969 892 231414 79 283321 97 2409896 824

RaceEthnic Distribution 1000 1000 1000 1000 1000 1000 White 2383274 815 255023 808 2128251 816 154371 667 201457 711 2027446 841 Black 277302 95 34039 108 243263 93 43895 190 38959 138 194448 81 Total ldquoOtherrdquo (listed below)

American Indian 7147 02 882 03 6265 02 975 04 1058 04 5114 02

Asian 86465 30 6233 20 80232 31 4711 20 8979 32 72775 30 Pacific Islander 1058 lt01 111 lt01 947 lt01 104 lt01 145 lt01 809 lt01 Other 105182 36 12492 40 92690 36 19288 83 23338 82 62556 26 Multiple Races 64203 22 6882 22 57321 22 8070 35 9385 33 46748 19

Hispanic 268036 92 28449 90 239587 92 40691 176 46978 166 180367 75

Education lt9th grade 108312 56 13370 64 94942 55 13585 91 14518 81 80209 50 9th-12th grade 200374 103 27023 130 173351 100 23306 155 25161 140 151907 94 High school or GED 519236 267 62727 302 456509 263 44366 296 51585 287 423285 263 Some college 336283 173 36464 175 299819 173 24995 167 31141 173 280147 174 Associates degree 150134 77 16574 80 133560 77 10134 68 12939 72 127061 79 Bachelors degree 342297 176 28578 137 313719 181 18898 126 24584 137 298815 185 Graduate or professional degree

285823 147 23199 112 262624 151 14813 99 19909 111 251101 156

+Other SES Factors Households below poverty 91940 87 12875 108 79065 83 13977 154 14685 140 63278 72

mean SD mean SD mean SD mean SD mean SD mean SD Population density 57466 9589 54496 11391 57826 93468 13501 16071 11068 16248 43763 66909 Median household income 61010 31989 47320 23630 62668 32468 40748 21923 44962 20829 64847 32564

Sources US Bureau of the Census 2000 Census of population and housing summary file 1(SF1) US Department of Commerce 2001 US Bureau of the Census 2000 Census of population and housing summary file 3 (SF3) US Department of Commerce 2001

36

Table 5 Study Population Person Years by River Proximity Category For Income Quartiles and Race Categories

Person Years Total Adjacent Non-Adjacent lt12 mile frac12 to 1 mile gt1 mile n (row)

[column ] n (row)

[column ] n ()

[column ] n ()

[column ] n ()

[column ] n ()

[column ] Total Population 30219001 (1000)

[1000]

3269547 (108)

[1000]

26949454 (892)

[1000]

2380477 (079)

[1000]

2834956 (094)

[1000]

25003568 (827)

[1000]

Income Quartile

Lowest Income 7182853 (1000)

[237]

1345251 (187)

[411]

5837601 (813)

[217]

1462710 (204)

[614]

1236936 (172)

[436]

4483206 (624)

[179]

2nd Income 7778921 (1000)

[257]

1077888 (139)

[330]

6701033 (861

[249]

420361 (054)

[177]

808782 (104)

[285]

6549777 (842)

[262]

3rd Income 7671943 (1000)

[254]

551617 (072)

[169]

7120326 (928)

[264]

344840 (045)

[145]

511237 (067)

[180]

6815866 (888)

[273]

Highest Income 7585283 (1000)

[251]

294790 (039)

[090]

7290493 (961)

[271]

152565 (020)

[064]

278000 (037)

[098]

7154718 (943)

[286]

Race Category

White 25859811 (1000)

[856]

2784914 (108)

[851]

23074897 (892)

[856]

1761910 (68)

[740]

2238348 (87)

[790]

21859554 (845)

[874]

Black 2466185 (1000

[82]

297220 (121)

[91]

2168965 (879)

[81]

389721 (158)

[164]

316704 (128)

[112]

1759760 (714)

[70]

Asian 792737 (1000)

[26]

58007 (73)

[18]

734731 (827)

[27]

45151 (57)

[19]

78161 (99)

[28]

669426 (844)

[27]

Multiple Other 1100269 (1000)

[36]

129407 (118)

[40]

970862 (882)

[36]

183696 (167)

[77]

201743 (183)

[71]

714830 (650)

[29]

37

Table 6 Study Hospitalization Record Exclusions

After After dropping After imputing dropping records with

After excluding race amp poorly missing continuation After excluding eliminating geocoded covariates

Geocoded records transfers missing race records (racesex)

Total 202272 201814 02 169143 162 167733 08 154236 80 154220 001 (Loss ) Demograp hic group

White 155216 849 154805 849 132749 859 144262 860 132343 858 132332 858 Black 12878 70 12849 70 11109 72 12010 72 11328 73 11326 73 OtherNa 12745 70 12741 70 8814 57 9313 55 8647 56 8644 56 AsianPI 2021 11 2020 11 1810 12 2148 13 1918 12 1918 12 Missing Race 19412 96 19399 96 14661 87 (1410) Diagnosis

IHD 137569 680 137303 680 111005 656 110005 656 110000 656 101012 655

AMI 51840 256 51635 256 39338 233 39338 233 39013 233 35811 232

Angina Stroke

5792 64703

29 320

5788 64511

29 320

5316 58138

31 344

5316 58138

31 344

5285 57712

31 344

4731 53208

31 345

Isch str 23673 117 23543 117 21012 124 21012 124 20912 125 19371 126 Hem str 8097 40 8074 40 6544 39 6444 39 6492 39 5995 39

Missing covariates 11 records were missing sex 325 were missing age and 14661 records were missing race Race was imputed by random assignment based on block group of residencersquos race distribution for 13251 records (310 were also missing other covariates For approximately 1100 records with missing race geocoding was inadequate for assigning block group and 310 of those missing race were also missing additional covariates) One discharge can have one or more covariates missing (ie missing covariates are not mutually exclusive

38

Table 7 Income Quartile and Race Distributions for CVD Hospitalizations

Study Population

IHD n=101012

AMI n=35811

Angina n=4731

Stroke n=53208

Ischemic Stroke n=19371

Hemorrhagic Stroke n=5995

Income (Quartiles) Lowest Income 237 286 300 356 298 309 301 Second Income 257 254 265 237 256 256 252 Third Income 254 241 236 224 236 234 223 Highest Income 251 221 201 182 209 201 224

Total 1000 999 999 1000 1000

White 855 859 885 833 858 844 809 Black 82 64 62 102 92 108 110 Asian 26 12 12 11 12 14 19 Other 36 65 41 53 38 34 62

Study population income quartiles are based on census block median income Quartile cut-points were developed by weighting block group median income levels by block group population to identify the cut-points within the continuous income distribution After creating the cut-points the study population of each block group was assigned to the appropriate quartile category This resulted in the percentage of population in each quartile varying slightly from 25 because the entire populations of block groups were assigned to the same quartile

39

Table 8a Exposure Area Distributions and Rate Ratios for Ischemic Heart Disease Hospitalizations

Exposure Person-Years

(N=30219001) in

Ischemic Heart Disease Hospitalizations Total AMI Angina

No () No () RR (95 CI) No () RR (95 CI) No () RR (95 CI)

Adjusted for Age Sex Race Adjacent Block Group Yes 3269547 (108) 12327 (122) 133 (112-157) 4398 (123) 136 (114-163) 591 (125) 123 (106-142) No 26949454 (892) 88611 (878) 100 Reference 31380 (877) 100 Reference 4137 (875) 100 Reference Distance From River lt frac12 mile 2380477 (079) 9817 (97) 120 (103-140) 3399 (095) 126 (109-145) 542 (115) 160 (137-188) frac12 - 1 mile 2834956 (094) 11236 (111) 120 (104-140) 4048 (113) 125 (109-144) 610 (129) 156 (133-183) gt1 mile 25003569 (827) 79885 (791) 100 Reference 28331(792) 100 Reference 3576 (756) 100 Reference

Addrsquol adjustment for SES and other factors Adjacent Block Group Yes 3269547 (108) 12327 (122) 107 (104-111) 4398 (123) 106 (101-110) 591 (125) 106 (096-117) No 26949454 (892) 88611 (878) 100 Reference 31380 (877) 100 Reference 4137 (875) 100 Reference Distance From River lt frac12 mile 2380477 (079) 9817 (97) 102 (098-106) 3399 (095) 103 (098-109) 542 (115) 109 (098-121) frac12 - 1 mile 2834956 (094) 11236 (111) 103 (099-108) 4048 (113) 106 (101-111) 610 (129) 113 (102-125) gt 1 mile 25003569 (827) 79885 (791) 100 Reference 28331(792) 100 Reference 3576 (756) 100 Reference

Agegt25 1990-2005 50 sample Adjusted for age (25-34 35-44 45-54 55-64 65-74 75+ years) race (white black other) and sex (male female) Adjusted for SES factors quartiles of population density (61-5066 5066lt-21020 21020lt-64102 64102lt-943716 persons per square mile) median 1999 household income ($249900-4096800 $4096800lt-5465000 $5465000lt-7294400 $7294400lt-20000100) with less than a college education (0-306 306lt-427 427lt-555 555lt-1000) with Hispanic ethnicity (0-21 21lt-47 47lt- 96 96lt-860) and distance to nearest hospital

40

Table 8b Exposure Area Distributions and Rate Ratios for Stroke Hospitalizations Age gt= 25 1990-2005

Exposure Person-Years

(N=30219001) in Study Area

Stroke Hospitalizations

Total Ischemic Hemorrhagic

No () No () RR (95 CI) No () RR (95 CI) No () RR (95 CI)

Adjusted for Age Sex Race Adjacent Block Group Yes 3269547 (108) 6276 (118) 125 (109-144) 2334 (121) 128 (110-148) 635 (108) 099 (086-113)

No 26949454 (892) 46748 (882) 100 Reference 16989 (879) 100 Reference 5263 (892) 100 Reference

Distance From River lt frac12 mile 2380477 (079) 5280 (099) 116 (104-131) 1925 (100) 120 (105-137) 518 (088) 102 (088-117)

frac12 - 1 mile 2834956 (094) 6023 (114) 118 (106-132) 2227 (115) 127 (112-145) 658 (111) 115 (100-131)

gt1 mile 25003569 (827) 41721 (787) 100 Reference 15171 (785) 100 Reference 4722 (801) 100 Reference

Addl Adjustment for SES and Other Factors Adjacent Block Group Yes 3269547 (108) 6276 (118) 107 (103-112) 2334 (121) 110 (104-116) 635 (108) 097 (088 ndash 106)

No 26949454 (892) 46748 (882) 100 Reference 16989 (879) 100 Reference 5263 (892) 100 Reference Distance From River lt frac12 mile 2380477 (079) 5280 (099) 106 (101-110) 1925 (100) 108 (101-115) 518 (088) 089 (081-099)

frac12 - 1 mile 2834956 (094) 6023 (114) 106 (102-111) 2227 (115) 111 (104-117) 658 (111) 108 (099-119)

gt 1 mile 25003569 (827) 41721 (787) 100 Reference 15171 (785) 100 Reference 4722 (801) 100 Reference

Age gt 25 years 1990-200650 sample Adjusted for age (25-34 35-44 45-54 55-64 65-74 75+ years) race (white black other) and sex (male female) Adjusted for SES factors quartiles of population density (61-5066 5066lt-21020 21020lt-64102 64102lt-943716 persons per square mile) median 1999 household income ($249900-4096800 $4096800lt-5465000 $5465000lt-7294400 $7294400lt-20000100) with less than a college education (0-306 306lt-427 427lt-555 555lt-1000) with Hispanic ethnicity (0-21 21lt-47 47lt- 96 96lt-860) and distance to nearest hospital

41

Table 9 Rate Ratios for CVD Hospitalizations Stratified by Income Quartile

Rate Ratio for lt frac12 mile versus gt 1 mile from river

IHD AMI Angina Stroke Ischemic Stroke Hemorrhagic Stroke

RR (95CI) RR (95CI) RR (95CI) RR (95CI) RR (95CI) RR (95CI)

Adjusted for Age Sex Race Lowest Income 1 123 (108-140) 124 (110-139) 139 (120-160) 120 (109-133) 122 (107-138) 118 (103-137) Income 2 115 (101-131) 118 (101-139) 155 (124-195) 124 (113-135) 127 (109-147) --shyIncome 3 088 (075-103) 083 (068-102) 102 (076-137) 092 (079-107) 093 (076-113) --shyHighest Income 4 080 (064-099) 074 (054-103) 030 (013-070) 078 (065-093) 071 (053-095) 043 (024-076) Weighted average 100 098 091 102 100 071

Addrsquol Adjustment for other demographic factors Lowest Income 1 114 (107-120) 114 (106-123) 112 (097-130) 116 (109-124) 120 (110-131) 109 (095-125) Income 2 110 (102-118) 111 (101-123) 136 (108-172) 110 (101-119) 112 (099-128) 095 (076-119) Income 3 085 (077-094) 086 (075-097) 093 (068-126) 097 (087-108) 096 (082-112) 046 (033-066) Highest Income 4 068 (060-078) 069 (057-085) 028 (012-062) 069 (059-081) 058 (044-077) --shyWeighted average 092 093 079 096 093 078

To reduce the complexity of this Table only one of the distance categories is provided The other distance categories showed similar results Adjusted for age (25-34 35-44 45-54 55-64 65-74 75+ years) race (white black other) and sex (male female) Block group median income quartile ranges $249900-4096800 $4096800lt-5465000 $5465000lt-7294400 $7294400lt-20000100 Adjusted for demographic factors quartiles of population density (61-5066 5066lt-21020 21020lt-64102 64102lt-943716 persons per square mile) with less than a college education (0-306 306lt-427 427lt-555 555lt-1000) with Hispanic ethnicity (0-21 21lt-47 47lt- 96 96lt-860) and distance to nearest hospital

42

Table 10 Hospitalization Rate Estimates for Current Study and from Published Sources (age-adjusted to 2000 US standard per 10000)

Current Study 12 Counties abutting Hudson River 1990-2005

Total US CDC 2005

Northeastern US CDC 2005

SPARCS NYS excluding NYC 2005-2007

IHD 649 619 691

AMI 232 231 278

Angina 31 Stroke 349 303 307 351

Ischemic Stroke 127 39 Hemorrhagic Stroke

Source USDHHS 2005 National Hospital Discharge Survey No 385 July 12 2007 Source NYS DOH SPARCS 2009b Comparable estimates were not able to be provided because published outcome groupings differed

43

Figure 1 ZIP codes by Adjacency to the Hudson River

44

Figure 2 Census Block Groups by Adjacency to the Hudson River

45

Figure 3 Census Block Groups by Population-Weighted Centroid Distance to the Hudson River

46

IHD Hospitalization Rates by Sex Race Income (age-adjusted) (other race

group excluded)

000

200

400

600

800

1000

1200

1400

1600

Low est IncQ

Q2 Q3 Highest Inc Q

Income Quartile

Hos

pita

lizat

ions

per

10

00

w hite men black men

asian men w hite w omen

black w omen asian w omen

AMI Hospitalization Rates by Sex Race Income (age-adjusted) (other race

group excluded)

000

100

200

300

400

500

600

700

Low est IncQ

Q2 Q3 Highest Inc Q Income Quartile

Hos

pita

lizat

ions

per

10

00

w hite men black men

asian men w hite w omen

black w omen asian w omen

Stroke Hospitalization Rates by Sex Race Income (age-adjusted) (other

race group excluded)

000

100

200

300

400

500

600

700

800

900

Low est In Q

Q2 Q3 Highest Inc Q Income quartile

Hos

pita

lizat

ions

per

10

00w hite men black men

asian men w hite w omen

black w omen asian w omen

Figure 4 Figure 5 Figure 6

47

MAURICE D HINCHEY 22NO DISTRICT NEW YORK

COMMITTEE ON APPROPRIATIONS

~lIC()foltIIInas

AGRICUJTlAE RURAL OEElOPWNT rooo AND ORUG ADMINISTRATION

AND RBATEO AGEfjC~S

INTERIOR

~onlJltss of tl)t ~nttn ~tatt5 ~oU5t of ItpttUtlltl1tibeg lHaJington 111laquo 20515-3222

October 4 2006

Howard Frumkin MD DrPH Director Agency for Toxic Substances and

Disease Registty and CDCs National Center for Environmental Health

1600 Clifton Road NE MSE-28 Atlanta GA 30333

Dear Dr Frumkin

First I would like to congratulate you on your appoinbnent as Director of A TSDR and CDCs National Center for Environmental Health llook forward to the opportunity to work with you on important public health issues this year and in the future

My second reason for writing today is to request that ATSDR assist my office in conducting a health consultation study for residents along a 200 mile segment of the Hudson River a Superfund site on the National Priority List containing PCBs in order to measure whether further evaluation may be warranted

As you may know Dr David Carpenter has done a number of studies at SUNY Albany in New York that suggest that resideilcy in proximity to hazardous waste sites with Persistant Organic Pollutants (POPs) may be associated with increased hospitalization rates for Coronary Heart Disease (CHD) when compared to hospitalization rates tor areas with no waste sites or other types of waste not catagorized as POPs I have enclosed a copy of three of these research papers that demonstrate this correlation for heart disease hypertension and stroke

I am very concerned about the health of the residents living along the Hudson River and respectfully request that your agency look into what seems to be a very serious health threat For purposes of my request T would appreciate your agency looking into the subset population Dr Carpenter examined in 78 zip codes along the 200 mile stretch of the Hudson River from Hudson Falls to New York City This subset population studied by Dr Carpenter showed a statistically significant increase in hospital discharges for heart disease hypertension and stroke Interestingly according to CDCs data base the Behavioral Risk Factor Surveillance System this population on average has higher incomes exercises more frequently and conswnes more fruits and vegetables than the average New Yorker Additionally this population has more nonsmokers than the rest of Upstate New Yorks population

WASHINGTON OIFCE

2411 RAYBURN HOUSE OfFICE BUILDING WASH1NcrroN DC 20amp1amp-3222

(Zl2) 225-6335

_fOMetoYht lnchIY

BlNGI-UMTON OfRCE m-wA OfFICE KIGSTCN OFFIepound MIDDLnWJN OffICE MOJmCEllO OffiCE IDDA IfOCML lUllDlNG 123 SOUTH CAYJOA smra U l WAll

m STREET em HALL THlflD flOOR 18 ANWANf LAKf CAD

RIIrlGHAtATON NY 13901 SUITE 201 KINGSTOfl 12401 te JUpoundS STilEET MONTlCfllO NY 12701 (6171773-2768 ITHACA HY 14860 (M5)3~ MlDgtlETOWN NY 109(0 (amptS) 7S1-71e

(607) 273-~388 (845) l4C-3l11

Appendix A Representative Hinchey Letter

Dr Howard Frumkin Page 2 October 4 2006

Since this type of population would typically exhibit better than average cardiac health Im sme you can understand and appreciate my concern and sense of urgency that we must follow up on Dr Carpenters good work and detcnnine what further study is warranted to protect the health of the populatioD living and working aloDg the Hudson River

I than1c you for yom kind attention to my request I look forward to hearing back from you at your earliest convenience

Best regards

Sincerely

~ane~ey Member of Congress

MHlag

49

Appendix B Table 1a IHD Hospitalization Rate Ratios agesexrace adjusted versus additionally adjusted For residents (age 25 and older) of block groups closer versus more distant from the Hudson River in models adjusting for sex age race and block group median household income population density education level percent Hispanic and block group centroid distance to nearest hospital 1990-2005

Adjusted Rate Ratios by Block Group River Proximity (95 Confidence Intervals) IHD (N=101012) AMI (N=35811) Angina (N=4731)

Risk Factor Agesexrace adjusted

Addrsquol adjustment Agesexrace adjusted

Addrsquol adjustment Agesexrace adjusted

Addrsquol adjustment

Block group distance from River

lt frac12 mile vs gt 1 mile 120 (103-140) 102 (098-106) 126 (109-145) 103 (098-109) 160 (137-188) 109 (098-121) frac12 - 1 mile vs gt 1 mile 120 (104-140) 103 (099-107) 125 (109-144) 106 (101-111) 156 (133-183) 113 (102-125)

Sex Male vs female 197 (174-223) 214 (208-220) 211 (187-238) 209 (202-215) 113 (099-129) 113 (106-121) Race Black vs white 120 (102-142) 093 (089-097) 109 (094-127) 092 (087-097) 178 (152-208) 121 (108-136)

AsianPacific Is ldquo 080 (066-096) 073 (069-079) 085 (070-103) 072 (065-081) 075 (055-102) 065 (049-087) Other ldquo 246 (208-291) 377 (363-392) 189 (162-220) 204 (191-218) 257 (215-307) 205 (177-238)

Age 35-44 vs 25-34 809 (630-104) 824 (743-913) 647 (494-849) 658 (568-762) 491 (348-692) 498 (376-660) 45-54 ldquo 333 (261-426) 344 (312-380) 228 (175-296) 230 (200-265) 1539 (1108-214) 157 (120-206) 55-64 ldquo 827 (647-1057) 802 (727-885) 576 (443-748) 516 (449-593) 278 (200-386) 284 (217-371) 65-74 ldquo 1477 (1155-1890) 1384 (1255-1527) 1067 (820-1387) 899 (782-1033) 409 (294-567) 449 (344-586) 75+ ldquo 2124 (1656-2724) 1942 (1760-2142) 2190 (1683-2849) 1849 (1610-2123) 661 (476-919) 695 (533-905)

Population density

2nd lowest vs least dense

126 (121-132) 131 (124-137) 142 (127-159)

more dense ldquo 130 (124-136) 134 (127-141) 131 (116-147) most dense ldquo 120 (114-126) 121 (114-129) 115 (100-131)

Median Household Income

2nd highest vs highest income

105 (101-110) 113 (108-119) 123 (110-137)

lower ldquo 109 (104-113) 123 (117-130) 125 (111-141) lowest ldquo 116 (111-122) 129 (122-137) 162 (142-185)

with less than college

2nd lowest vs least college

114 (109-118) 118 (113-124) 110 (098-122)

higher ldquo 122 (117-127) 126 (119-132) 115(102-129) highest

ldquo

122 (116-128) 128 (120-135) 129 (113-148)

Hispanic 2nd lowest vs least Hispanic

093 (089-097) 094 (0890-098) 101 (091-112)

higher ldquo 092 (088-096) 096 (092-101) 118 (107-131) highest

ldquo

085 (081-088) 088 (084-093) 138 (123-154)

Distance to Hospital

2nd closest vs closest 097 (094-100) 094 (090-098) 107 (098-118) more distant ldquo 090 (087-094) 093 (089-098) 096 (086-107) furthest ldquo 087 (083-091) 091 (087-096) 089 (079-101)

50

Appendix B Table 1b Stroke Hospitalization Rate Ratios agesexrace adjusted versus additionally adjusted For residents (age 25 and older) of block groups closer versus more distant from the Hudson River in models adjusting for sex age race and block group median household income population density education level percent Hispanic and block group centroid distance to nearest hospital 1990-2005

Adjusted Rate Ratios by Block Group River Proximity (95 Confidence Intervals) Total Stroke (N=53208) Ischemic Stroke (N=19371) Hemorrhagic Stroke (N=5995)

Risk Factor Agesexrace adjusted

Addrsquol adjustment Agesexrace adjusted

Addrsquol adjustment Agesexrace adjusted

Addrsquol adjustment

Block group distance from River

lt frac12 mile vs gt 1 mile 116 (104-131) 106 (101-110) 120 (105-137) 108 (101-115) 102 (088-117) 089 (081-099) frac12 - 1 mile vs gt 1 mile 118 (106-132) 106 (102-111) 127 (112-145) 111 (104-117) 115 (100-131) 108 (099-119)

Sex Male vs female 114 (104-125) 127 (123-130) 126 (113-141) 131 (126-136) 114 (103-127) 123 (116-130) Race Black vs white 194 (173-219) 153 (147-160) 233 (204-267) 191 (180-203) 215 (188-246) 190 (173-209)

AsianPacific Isl ldquo 097 (083-113) 088 (080-096) 114 (094-138) 101 (088-116) 142 (114-177) 129 (106-157) Other ldquo 230 (203-260) 232 (219-245) 229 (197-265) 212 (193-232) 360 (310-417) 334 (296-377)

Age 35-44 vs 25-34 309 (251-379) 303 (269-343) 359 (267-483) 362 (289-455) 274 (212-353) 268 (219-329) 45-54 ldquo 114 (938-139) 1087 (970-1217) 149 (113-197) 135 (109-167) 655 (513-836) 605 (500-734) 55-64 ldquo 331 (273-402) 329 (295-368) 433 (329-569) 418 (340-514) 119 (933-152) 109 (904-132) 65-74 ldquo 826 (682-1001 861 (772-959) 1145 (872-1504) 1114 (908-1367) 214 (168-272) 224 (186-270) 75+ ldquo 1890 (1559-2290) 2037 (1828-2269) 2768 (2112-3627) 3060 (2497-3750) 554 (438-700) 587 (490-703)

Population density

2nd lowest vs least dense

126 (120-132) 124 (117-132) 129 (117-142)

more dense ldquo 126 (120-132) 121 (113-129) 142 (128-157) most dense ldquo 124 (118-132) 128 (119-138) 150 (133-169)

Median Household Income

2nd highest vs highest income

106 (101-110) 110 (103-117) 099 (090-108)

lower ldquo 113 (108-118) 113 (106-121) 107 (097-118) lowest ldquo 116 (110-122) 117 (108-126) 114 (102-127)

with less than college

2nd lowest vs least college

111 (107-116) 112 (106-119) 098 (090-107)

higher ldquo 120 (114-125) 120 (113-128) 102 (093-112) highest ldquo 116 (110-122) 119 (110-128) 093 (090-108)

Hispanic 2nd lowest vs least Hispanic

096 (092-100) 094 (088-099) 084 (077-091)

higher ldquo 101 (096-105) 096 (091-102) 084 (077-091) highest ldquo 096 (092-100) 088 (082-093) 078 (070-085)

Distance to Hospital

2nd closest vs closest 095 (091-098) 092 (087-097) 097 (089-105) more distant ldquo 092 (088-096) 091 (086-097) 089 (081-097) furthest ldquo 085 (081-090) 082 (077-088) 088 (079-098)

51

Appendix B Table 2 Sex and Race-Specific Age-Adjusted IHD AMI and Stroke Rates by Income (per 1000)

Men Women white black asian other white black asian other

IHD Lowest IncQ 470 401 492 940 283 354 477 543 Q2 425 339 675 1443 221 236 364 690 Q3 436 301 420 2463 199 233 236 902 Highest Inc Q 399 325 218 4307 160 172 120 1247 AMI Lowest IncQ 180 154 203 269 101 112 221 159 Q2 159 141 300 306 085 088 138 179 Q3 152 114 139 480 078 084 096 214 Highest Inc Q 135 130 046 820 062 065 052 275 Stroke Lowest IncQ 201 292 330 370 166 281 343 261 Q2 182 254 389 466 150 247 274 373 Q3 192 238 231 598 145 245 201 401 Highest Inc Q 177 275 102 905 132 197 070 776

These rates are estimated from the 50 sample of hospitalizations in the 12-county study area included in this investigation The rates are adjusted to the 2000 standard US population

52

CERTIFICATION

The health consultation for the Hudson River PCBs site Residential Proximity to the Hudson River and Hospitalization Rates for Ischemic Heart Disease and Stroke 1990shy2005 was prepared by the New York State Department of Health under a cooperative agreement with the Agency for Toxic Substances and Disease Registry (ATSDR) It is in accordance with approved methodology and procedures existing at the time the health consultation was initiated Editorial review was completed by the cooperative agreement partner

The Division of Health Assessment and Consultation (DHAC) ATSDR has reviewed this public health assessment and concurs with its findings

ATSDR

Page 14: HR CVD Final HCl 2011

Barnett 2006 Ayala 2003) Population-based studies report sudden cardiac death usually outside the hospital to be the first overt manifestation of heart disease for 40 to 60 of all cases ( Kannel 1987 deVreede-Swagemakers 1997 UK Heart Attack Study 1998) An analysis of out-of-hospital mortality from AMI in Pennsylvania in 1998 showed that in relatively rural areas where travel time to the hospital was 25 minutes or longer 72 of all AMI deaths occurred outside the hospital compared with a statewide average of 49 of AMI deaths occurring outside of the hospital (OrsquoNeill 2003) In New York State 55 of IHD deaths occur outside the hospital compared to 35 of stroke deaths (NYS DOH 2009b)

Hudson River PCB Contamination

During an approximate 30-year period ending in 1977 the General Electric (GE) plants in Hudson Falls and Fort Edward used PCBs in the manufacture of electrical capacitors and discharged large quantities of PCBs into the upper Hudson River Estimates of the total direct discharges to the river are as high as 1330000 pounds In 1983 the US EPA classified a 200shymile section of the Hudson River from Hudson Falls to New York City as a National Priority List site making it one of the largest Superfund sites in the United States In 2002 the US EPA issued a Record of Decision calling for targeted environmental dredging and removal of PCB-contaminated sediment in the upper River (USEPA 2002) The dredging began in May 2009

The Superfund environmental investigations and planned remedial actions have focused on the areas of the river near Hudson Falls and Fort Edward where the highest levels of PCBs in sediments are located The 2002 Record of Decision provides information about the human health risk assessment conducted for the Upper and Mid-Hudson and the ecological risk assessment conducted for the Upper and Lower Hudson River (USEPA 2002) Fish contamination and fish consumption are the primary concerns at the site

In 1976 fishing was banned in the upper river and in 1995 a catch and release policy replaced the ban NYS DOH continues to recommend people eat no fish from the Upper Hudson River that children under age 15 and women of childbearing age eat no fish from the entire 200 mile stretch of river below Hudson Falls and that the general population eat none of most species of fish caught between the federal dam at Troy and Catskill approximately 40 miles south of Troy The Lower Hudson remains closed to commercial fishing for striped bass and eight other species

The fishing bans and advisories were issued because ingestion of fish is the major potential pathway for exposure for the general population Many previous studies have shown relationships between consumption of fish from contaminated waters and human PCB levels Another potential pathway for exposure less frequently studied to date is inhalation of PCBs NYS DOH and ATSDR launched ldquoThe Hudson River Communities Projectrdquo in 2000 to address this issue (More information on this project is provided below)

POPS and PCBs

The category ldquopersistent organic pollutantsrdquo (POPs) includes man-made chemicals that are by definition persistent in the environment Examples of POPs are polychlorinated biphenyls

5

(PCBs) polychlorinated dibenzo-p-dioxins (PCDDs) polychorinated dibenso-furans (PCDFs) and pprsquo-diphenyldichloroethene (DDE) a breakdown product of the pesticide dichlorodiphenyltrichloroethane (DDT) These chemicals are stored in human fat tissues accumulating over an individualrsquos lifetime They also accumulate in fish and other species thereby concentrating up the food chain Levels of most POPs measured in blood serum of Americans have decreased in the last few decades An exception is polybrominated diphenyl ethers (PBDEs) levels of which have been increasing in recent years While levels of many POPs are declining a large number continue to be detectable in blood for most of the US population (USDHHS 2009)

This follow-up investigation focuses on the Hudson River where the POPs of concern are PCBs PCBs are a group of synthetic chlorinated compounds manufactured in the US through 1977 Production was halted due to concerns about the persistence of PCBs in the environment and the toxicity of PCBs shown in animal models PCBs were used primarily by the electrical industry for capacitors and transformers and were also used in hydraulic fluids fluorescent light fixtures flame retardants inks adhesives carbonless copy paper paints pesticide extenders plasticizers and wire insulators PCBs can be oily liquids or solids and some are volatile and can exist as vapor in air PCBs in the environment are generally mixtures of the many different types known as congeners (ATSDR 2000)

The variety of types of PCBs adds to the complexity of studying adverse animal and human health effects Increasingly PCB research focuses on specific PCB congeners and groups of congeners because specific types of congeners appear to have different mechanisms of action and toxicity Dioxin-like PCBs (DLPCBs) are an important subset of PCBs due to concerns about their toxicity The focus on dioxin-like PCBs is paralleled by increasing research on the group of POPs with dioxin-like properties This type of PCB and POP research assigns toxic potency values relative to dioxin to various PCBs and POPs to provide an overall toxic equivalency value to specific PCB and other POPs

From 2002 through 2007 NYS DOH and SUNY Albany School of Public Health researchers conducted the Hudson River Communities Study among residents of Hudson Falls and Fort Edward the villages where the PCB discharges from two GE electrical capacitor plants occurred The comparison community was upriver from the discharges in Glens Falls This studyrsquos evaluation of exposures included measuring PCBs in outdoor air and in blood serum The study found that PCB levels in outdoor air were statistically significantly higher in the Hudson FallsFort Edward area (072 nanograms per cubic meter) than in the upriver Glens Falls area (040 nanograms per cubic meter) but that the PCB concentrations in the Hudson FallsFort Edward area were lower than those in other communities with known PCB-contaminated sites and similar to levels reported in other locations in the northeastern United States A statistically significant reduction in the total PCB concentrations in outdoor air samples was observed at a distance of 1200 meters (34 mile) from the river from 0760 ngm3 to 0497 ngm3 (Fitzgerald et al 2007 Palmer et al 2008)

The average PCB concentrations in serum for the study and comparison populations did not differ significantly (307 ppb wet weight versus 323 ppb) The average toxic equivalency quantities (TEQ) for dioxin-like PCB congeners for the study and comparison area populations

6

also did not differ In addition the study found no detectable differences in study subjectsrsquo serum PCB levels associated with proximity to contaminated portions of the river or other PCB sources The study did find that lifetime consumption of Hudson River fish was associated with higher serum PCB levels (Fitzgerald 2007 Palmer 2008) Another recent biomonitoring study of Brooklyn Manhattan and New Jersey fisherman who consumed versus those who did not consume their catch from the lower Hudson River and New YorkNew Jersey harbor suggests eating fish from these waters is not associated with an increased body burden of PCBs or most other organochlorines This study did not address the inhalation pathway (Morland 2008)

PCBs and Cardiovascular Disease Outcomes

Research studies on adverse health effects associated with PCB exposure have focused on many types of outcomes organs and biological systems including cancer liver reproductive and developmental effects thyroid endocrine immune system and neurological effects as well as specific effects on the skin and eyes Relatively little research has focused to date on associations between PCB exposures and CVD outcomes The majority of human studies have assessed health effects in occupational groups having relatively high-level exposures among people consuming fish from contaminated waters or people exposed to relatively high levels due to accidental contamination of food (USDHHS NIH 2010)

PCBs have been shown to produce a variety of toxic effects in animals These effects include cancer allergies hypersensitivity damage to the central and peripheral nervous systems reproductive disorders and disruption of the immune system (ATSDR 2000) In contrast research to date does not tell us whether specific types of human health effects have occurred from the low-level exposures experienced by the general population Some human epidemiological studies of relatively low levels of PCDDs PCDFs and PCBs suggest possible relationships not only with neurodevelopment of infants but also diabetes and a condition called insulin resistance conditions that increase CVD risk (Longnecker 2001 Carpenter 2006 Arisawa 2005)

In 1998 a study of a population in Seveso Italy exposed to relatively pure TCDD (dioxin) from an industrial accident showed increased mortality from CVD respiratory diseases and diabetes (Bertazzi 1998) These findings suggested the possibility that other dioxin-like chemicals such as dioxin-like PCBs might increase risk for these diseases Some more recent animal studies described below suggest possible mechanisms of action for PCB effects that increase risk for CVD diabetes and hypertension Another relevant source of information is from research using the biological monitoring data from the National Health and Nutrition Examination Survey (NHANES) project to look at the association between PCB levels in blood and self-reported diabetes and heart disease The body of evidence based on analytical human epidemiological studies remains relatively sparse however

Some recent studies using NHANES biomonitoring data show associations between PCB levels and CVD (Lee 2006 Lee 2007 Ha 2007 Everett 2008) These studies are cross-sectional (comparisons at one point in time) evaluations of PCB levels measured in blood samples from individuals and self-reported CVD and diabetes a risk factor for CVD While these studies use individual-level biomonitoring and self-reported health outcome data they are considered to be

7

relatively weak in terms of providing evidence of causality because both exposure and outcome are measured at the same point in time Given the nature of NHANES data collection and reporting practices these studies are not able to control for possible geographical confounding which may occur if urban versus rural residence is associated with higher PCB levels and higher risk for CVD for example The higher PCB levels among African Americans in these data suggest urban residence may be playing a role in the PCB data

The results of animal studies suggest multiple mechanisms by which PCB exposures could affect cardiovascular risk as well as multiple types of adverse health outcomes One hypothesized mechanisms of action specifically for dioxin-like PCBs is that they may have anti-estrogenic effects which impact CVD risk factors such as serum cholesterol levels and blood pressure (Lind 2004) Several studies suggest PCBs interact with a specific receptor (AhR) and increase cellular oxidative stress causing inflammation cell injury and cardiovascular dysfunction (Hennig 2002) Similarly some studies suggest PCB-induced inflammation of fat cells leads to obesity and increased risk for coronary artery disease (atherosclerosis) (Arsenescu 2008)

3 FOLLOW-UP STUDY DATA SOURCES AND METHODS

This follow-up investigation will provide perspective on the 2004-2005 findings (Sergeev and Carpenter 2005 Shcherbatykh et al 2005) by producing results from analyses conducted at a more detailed level of geography the block group rather than ZIP code in order to better control for confounding When the term confounding is used by epidemioogists it refers to the problem that can arise when there is a factor such as income level that is not the factor of interest in the study but that is related to the risk factor being studied (residential location ndash proxy for exposure) and to the disease being studied (CVD) This type of factor is labeled a ldquoconfounderrdquo because it can confound ie interfere with or confuse the evaluation of associations among the variables of interest in the data All types of epidemiological studies face issues of confounding but ecological ie group-level analyses face additional challenges related to potential confounding because analyses can not account for some important individual differences within the groups that are the units of analysis (Morganstern 1998b Hertz-Picciotto 1998) This investigation was able to control for each individualrsquos age sex and race but not for individual-level risk factors such as income or education level

The follow-up investigationrsquos specific objectives are to

improve the ability to control for confounding by socio-economic status through the use of block groups rather than ZIP codes as the unit of analysis

compare the hospitalization rates among residents of block groups nearest the Hudson River to those among residents of block groups farther from the Hudson River

compare the findings of this follow-up study to those of the earlier studies

Study Area and Exposure Areas

The overall study area is comprised of the 12 counties with geographic areas abutting the Hudson River south of the source of PCB contamination from the General Electric facility in Hudson Falls The 12 counties are Westchester Rockland Putnam Orange Dutchess Ulster

8

Columbia Greene Rensselaer Albany Washington and Saratoga US Census boundaries are used to define the study area because census population estimates provide the data for the study area populationrsquos size and the populationrsquos age sex and race distribution Because the study period spanned 1990 through 2005 and populations change over time 1990 and 2000 Census data were used to linearly interpolate the between-year populations from 1991 through 1999 for each block group 2000 data were used without extrapolation to estimate block group populations from 2001 through 2005

Because block group boundaries change between the Census years 1990 block populations were normalized to fit 2000 block group boundaries before the estimation of sex age and race group specific populations In the digital map 2000 block group boundaries were overlaid on the 1990 block centroids to assign 1990 block population counts to the corresponding 2000 block group boundaries Then 1990 block populations were summed to fit the block group boundaries Next population counts by age group (0-14 15-24 25-34 35-44 45-54 55-64 65-74 75+) sex (male female) and race (white black asian other) were estimated for each year at the block group level This investigation includes all ages over 25 years and the study population is divided into four race categories white black or African-American Asian and ldquoOtherrdquo The Pacific Island group was combined with the Asian group and American Indian ldquootherrdquo and ldquomultiple racesrdquo categories are combined into a larger ldquootherrdquo category

Classification of the population into potential exposure categories for this study is based exclusively on location relative to the Hudson River Two types of exposure proxy categories were developed block group adjacency and block group centroid distance to the river shore For the adjacency categories block groups were assigned to the adjacent category if they directly abutted the Hudson River and all other block groups in the 12 counties were assigned to the nonshyadjacent category

For the distance categories a central point (centroid) within each block group was identified and used to assign block group centroid distance to the river shore The block group centroids were population-weighted which means the geographic distribution of households within 2000 census blocks were used to locate block group centroids based on the geographic population distribution By locating a population-weighted block centroid rather than a more straightforward distance centroid the assignment of block group distance to the river shore more accurately reflects distance to the river for the majority of block group residents Buffers of one-half mile and one mile from the Hudson River boundary were created for the assignment of each block group into one of the three distance categories less than frac12 mile (closest) frac12 to one mile (close) and farther than one mile from the river (distant)

Distance categories were selected a priori at the start of the study Because environmental sampling data were not collected along the length of the river included in this investigation we are not able to determine whether these distance categories represent actual differences in exposures As discussed previously a statistically significant reduction in the total PCB concentrations in outdoor air samples was observed at a distance of 1200 meters (34 mile) from the river from 0760 ngm3 to 0497 ngm3 in a study conducted in the Hudson Falls Fort Edward area near the source of PCBs (Fitzgerald 2007 Palmer 2007)

9

For the adjacency categories of the 2371 block groups in the study area 255 (11) were classified as adjacent to the Hudson River and 2116 (89) were classified as non-adjacent (Figure 2) For the distance categories 225 (9) were classified as closest to the river 216 (9) as close and 1930 block groups (81) as distant from the river

Comparison of Figures 1 and 2 show that block groups include smaller geographic areas than ZIP codes and that block groups are relatively smaller in area in urban versus rural areas The study area includes a total of 368 ZIP codes compared to a total of 2371 block groups ZIP code average population in the study area is approximately 8000 compared to block group average population of 1200 Comparison of Figures 2 and 3 shows that using population-weighted centroid distance to the river produces different exposure areas than adjacency Fifty-three (20) of the adjacent block groups were categorized for the distance indicator as distant because the population in the block group was concentrated in the area farther than one mile from the river On the other hand about 11 of the 2116 non-adjacent block groups were classified as closest (77) or close (162)

Hospitalization Data

Hospitalizations for ischemic heart disease (IHD) and cerebrovascular disease (stroke) were included in this study More specifically hospital discharge records that listed as the principal diagnosis either IHD (ICD-9-CM 410-414 and 4297) or cerebrovascular disease (ICD-9-CM 430-436) and subsets of each of these categories similar to the subsets evaluated in the Sergeev et al and Shcherbatykh studies were included The subsets are AMI and angina subsets of IHD and ischemic and hemorrhagic stroke which are subsets of cerebrovascular disease (stroke) (Principal diagnosis represents the illness for which the person was admitted to the hospital) Angina is an outcome not evaluated in the prior studies It was added here for comparative purposes because it represents an outcome appropriately addressed by primary outpatient rather than inpatient hospital care Angina is one of a group of outcomes included in the recently developed national ldquoPrevention Quality Indicatorsrdquo (USDHHS AHRQ 2007) as an indicator for geographic areas or populations for whom primary care is inadequate Table 3 lists the specific ICD-9-CM codes and definitions included in this investigation

While one of the previously published studies (Huang et al 2006) showed elevations in hypertension (high blood pressure) rates this outcome is rarely listed as the principal reason for hospitalization As a secondary diagnosis however hypertension is extremely common with substantial overlap with the primary diagnoses of IHD and stroke In addition hypertension hospitalizations were not as elevated in the ZIP codes adjacent to the Hudson River as the IHD and stroke outcomes As a practical matter geocoding the hypertension hospitalization data (1364000 hospitalizations) would overwhelm the resources for this project For these reasons hypertension was not included in this follow-up investigation

The source of the hospitalization data was the NYS DOH Statewide Planning and Research Cooperative System (SPARCS) established in 1979 to collect detailed records on discharges from hospitals located in New York State Persons seen in the emergency department but not admitted are not included in the SPARCS data Hospitalizations at US Veterans Administration

10

Hospitals are also not included in this dataset We identified the hospital transfers in the data and excluded them so that individuals are only counted once for the same event

The NYS DOH Data Protection Review Board for SPARCS hospitalization data approved our use of hospitalization records for this investigation and provided approximately 460000 records with a principal diagnosis of either IHD or stroke Because of the large number of cardiovascular hospitalizations and this studyrsquos requirement for geocoding each hospitalized personrsquos address we selected a 50 random sample of the hospital discharges with principal diagnoses of IHD or stroke for inclusion in the study (229948 records) Approximately one percent of these records (2737) were eliminated from the study because they contained no residential address resulting in 227211 hospitalization records 24939 of these records were eliminated after geocoding because they were not within the 12 county study area This left 202272 records remaining in the study

For geocoding to the block group level the hospitalization street addresses were modified as needed using US Postal Service standards The addresses were assigned geographic coordinates using commercially available geocoding software (MapMarker 2004) A combination of land parcel data and street centerline files were used to assign geographic coordinates No contact was made with cases parents legal guardians or next of kin of cases to determine residential locations

Once geographic coordinates were assigned to cases through address-matching the case locations were overlaid onto digital maps of the study area using a geographic information system so that the number of observed cases falling within the study area boundaries could be determined and a 2000 Census block group could be assigned to each hospitalization In order to protect confidentiality no maps of individual case locations are provided Geographic coordinates were not able to be assigned for some addresses for example rural routes or post office boxes Internet searches were conducted for addresses that were located in facilities such as nursing homes and trailer parks

While fewer than one percent of records were missing sex or age information approximately 15000 records seven percent were missing race information Evaluation of the geographic and temporal distribution of these records showed they were not randomly geographically distributed 47 of the records with no race information were from one hospital for that hospital race was missing from 100 of the records in all years except one after 1997 mid-way through the study time-frame Evaluation of other records missing race showed similar patterns of 100 of records missing race in specific years In order to prevent biased hospitalization rate estimates records missing race were assigned race randomly assuming a race distribution equal to that of the individualrsquos residential block group

Statistical Analysis

This investigation used negative binomial multivariable regression models the statistical methods used in the previous studies (Sergeev and Carpenter 2005 [IHD] Shcherbatykh et al 2005 [stroke]) The follow-up studyrsquos analyses compared hospitalization rates for populations in block groups adjacent or in close proximity to the Hudson River to rates for populations more

11

distant from the Hudson River while adjusting for the potential influence of other factors Sex age and race-specific hospitalization count data comprise the dependent variable and the statistical methods attempt to predict the counts using the sex age and race distributions from Census data as well as block group-level demographics such as median income Models were first estimated using basic Census information for age sex and race Then more fully adjusted models were estimated using information at the block group level for income education population density Hispanic ethnicity and distance to the nearest hospital

More specifically each hospitalized personrsquos block group was used to assign quartile categories of block group median household income of population with less than a college education population density of the population identified as Hispanic and distance from the block group centroid to the nearest hospital These variables were selected in advance The population density variable was selected to account for potential effects on hospitalization rates of large differences in population density from the relatively rural northern counties to the more urban southern counties in the study area of the population identified as Hispanic was included because Hispanic ethnicity may be associated with CVD risk but Hispanic ethnicity was not available in the hospitalization records Distance to the hospital was included because studies show that shorter distances increase the likelihood that individuals with AMI and stroke survive to be admitted to the hospital

As described previously the literature about risk factors for CVD shows strong associations between lower income and education and higher CVD risk Regarding the effects of income and education on hospitalization rates the literature from analytic studies specifically of hospitalizations for CVD is quite sparse Most analytic studies focus on prevalence incidence or mortality in population-based studies or cohort studies and there are not many of these An example of a study that was useful as background is a study of income level and CVD mortality that showed strong associations of lower income and higher CVD mortality for men and women with the relationship for men being linear throughout the range of income levels while for women the relationship was only strong at levels of income below the median (Rehkopf 2008)

This follow-up studyrsquos approach was to use the same set of variables for the more fully adjusted models using quartile cut points for all the sexrace groups and types of CVD diagnostic subsets The 2000 Census data were used to create the population-weighted quartile categories to be used in the statistical analyses The quartile cut points are shown below

population density(persons per square mile)

median 1999 household income

with less than a college education

with Hispanic ethnicity

Hospital distance

Quartile 1 61 - lt507 $2499-lt$40968 0 - lt31 0 - lt2 lt25 (median) Quartile 2 507 - lt102 $40968 - lt54650 31 - lt43 2 - lt5 25 - lt6 (50-75) Quartile 3 102 - lt6410 $54650 - lt72944 43 - lt55 5 - lt10 6 ndash 11 (75-90) Quartile 4 6410 - 94372 $72944 - 200001 55 - 100 10 - 86 11 ndash 45 (90-100)

SAS version 9 provided the statistical programs used to estimate hospitalization rates for exposure proxy categories and demographic categories The models provide ldquocontrast

12

estimatesrdquo ie rate ratios for one category versus a reference category of a variable and 95 confidence intervals (CI) for these rate ratios For the location categories rate ratios for the areas adjacent are compared to areas not adjacent to the river (reference) and areas closest and close to the river are compared to areas distant from the river (reference) If a rate ratio is greater than 100 the model has estimated an elevation of hospitalizations in that category of the study population compared to the reference population If the rate ratio is less than 100 then there are fewer than expected hospitalizations compared to the reference population

The magnitude of the elevation or deficit is also estimated by the rate ratio For instance if the hospitalization rate is twice as high in the adjacent population it would result in a rate ratio of 200 while a 50 higher rate would result in a rate ratio of 150 If the hospitalization rate in the comparison group was only half as big as in the reference group this would result in a rate ratio of 050 The 95 confidence interval assists with determining whether the observed differences are statistically meaningful ie statistically significant The confidence interval provides the range in which there is a 95 probability of including the true rate ratio

In addition to directly assess potential confounding we examined the associations between income and exposure proxy categories and between income and CVD hospitalization risk Because the modeling process provides rate ratios that provide comparisons among groups but not the rates themselves we also calculated the rates per population for each sex race and income category in order to directly observe the relative magnitudes of CVD hospitalization rates Finally analyses stratified by income quartile were also conducted in order to assess the potential role of confounding by income The stratified analyses consist of negative binomial regressions conducted separately for each quartile of income

4 RESULTS

Demographics Associations with Proximity to the River

Study Population Table 4 provides detailed demographic characteristics for the total study population and the populations of the adjacent versus non-adjacent and nearer versus farther from the river block groups from the 2000 Census The total population of the study area for this one year of the study time period is 2924631 persons The first row of Table 4 shows that approximately 11 percent of the study area population is classified as living in block groups adjacent to the Hudson River and 89 live in non-adjacent block groups Approximately 8 of the studyrsquos population lives in the closest block groups approximately 10 lives in close block groups and approximately 82 in distant block groups The table also shows that approximately 81 of the total study population is identified as white 9 black 3 Asian less than 01 Pacific Islander 02 American Indian 4 ldquootherrdquo and 2 ldquomultiple racesrdquo

The 2000 Census data (Table 4) also show that the populations in block groups adjacent and nonshyadjacent to the river are similar in terms of the percent of minority population but the adjacent block groups include a smaller percentage of persons who have attended college and a higher percentage of households below the poverty level Median income is $47320 for block groups adjacent versus $62668 for block groups not adjacent to the river Data for the study area from the 1990 Census show similar distributions (data not shown)

13

For the distance to the river categories there is a gradient for the three levels of distance from the river with the closest block groups on average having higher percentages of minorities smaller percentages of persons who have attended college and higher percentages of households below the poverty level (Table 4) The populations of the close and closest block groups are more similar to each other than to the populations of the distant block groups The population of the closest block groups is estimated to be 19 black close block groups 14 black and distant block groups 8 black The proportions for the ldquootherrdquo category used in the analyses which includes multiple races and American Indians are approximately 12 in both the closest block groups and in the close block groups compared to 5 in the block groups most distant from the river Median income for the closest block groups is $40748 close block groups $44962 and most distant block groups $64847

The bar chart below illustrates the correlations between US Census 2000 indicators of block group socio-economic status and block group centroid (population weighted) distance from the river for the study population

2000 Census Demographics for Study Population by Distance to River Category

black other hispanic no college below poverty

of pop ulati on

0

10

20

30

40

50

60

lt12 mile

12 to 1 mile

gt1 mile

Table 5 shows selected demographic information for the person-year estimates used in the study analyses for the 1990-2005 study period Person-year data represent individuals from Census estimates counted repeatedly for each year of the study timeframe As described in the Methods section the estimation of person years used interpolation to account for population changes from 1990-2000 and techniques for taking account of block boundary changes The person-years estimates for demographic categories (complete data not shown) show similar patterns as the 1990 Census data (data not shown) and 2000 Census data (Table 5) Table 5 for example shows nearly identical percentages of population in each of the river distance categories as Table 4

Table 5 also shows person year information for income quartiles and exposure proxy categories as well as race and exposure proxy categories that is consistent with the Table 4 data for the

14

2000 population Table 5 row percentages show for example that while approximately 11 of the total study population live in adjacent block groups approximately 19 of the population in the lowest income quartile live in adjacent block groups By design approximately 25 of the total study population is in each of the income quartiles Table 5 column percentages show however that in the adjacent block groups 41 of the population is in the lowest income quartile and only 9 are in the highest income quartile

For the block groups within frac12 mile of the river the distribution among income quartiles is even more skewed towards the lowest income quartile 61 of the population living within frac12 mile of the river is in the lowest income quartile and only 6 is in the highest income quartile Given that approximately 25 of the total study population is within each income quartile these percentages indicate that living within the block groups closest to the river is associated with more than a doubling of the likelihood of being in a block group in the lowest socio-economic status quartile 436 of those living in the block groups close to the river (12 to one mile from the river) are in block groups in the lowest income quartile While less than a doubling this indicates that living in the close block groups is associated with a 75 increase in the likelihood of being in the lowest quartile for socio-economic status

Hospitalizations Table 6 shows the process of removing hospitalization records from the study for a variety of reasons The first column includes the 202272 records that contained a residential address and were located within the 12-county study area as described in the Methods section The next five columns show the records remaining after data cleaning exclusion of transfers elimination of records with missing covariates and exclusions due to poor geocoding results Approximately 33000 records were eliminated because they were transfers from one hospital to another for the same event For approximately 1100 records geocoding was inadequate for assigning block group for imputing race as described in the Methods section 336 records were missing sex (11) or age (325) Of the 167733 records remaining 154220 (92) were able to be adequately geocoded to a street address for assignment in a specific block group While approximately eight percent of hospitalizations were eliminated due to inadequate geocoding Table 6 shows that the percentages of hospitalization records for white black Asian and ldquootherrdquo categories and the distribution among diagnostic categories change very little after these records were dropped

The final column of Table 6 shows that of the 154220 hospitalizations in the analysis 101012 IHD hospitalizations comprise the largest health outcome category and are 65 of the hospitalizations in the study AMI hospitalizations (35811) account for 35 of the IHD hospitalizations and angina (4731) accounts for five percent of the IHD hospitalizations Stroke hospitalizations (53208) comprise 34 of the studyrsquos hospitalizations Approximately 36 of the stroke hospitalizations are for ischemic stroke (19371) and approximately 11 are for hemorrhagic stroke (5995)

Table 7 shows the distributions of hospitalizations among the four income quartiles While each income quartile (by definition) contains approximately 25 of the study population for all types of hospitalizations we see that the lowest income quartile includes more than 25 of hospitalizations from a minimum of 286 (IHD) to a maximum of 356 of hospitalizations (angina) The highest income quartile includes fewer than 25 of hospitalizations for each CVD

15

category ranging from 182 for angina to 224 for hemorrhagic stroke These deviations from the 25 quartile distribution show a strong linear pattern with the largest percentage of hospitalizations coming from residents of block groups in the lowest income quartile and the smallest percentages coming from residents of the most wealthy block groups For AMI the lowest income quartilersquos share of hospitalizations is 50 higher than the highest income quartilersquos share of hospitalizations (30 vs 20) IHD and stroke show similar differences and angina shows a doubling with the lowest income quartile contributing 36 of hospitalizations compared to the highest income quartilersquos 18 share of hospitalizations

Table 7 also shows the distribution of hospitalizations among the race categories and the study population race category distribution for comparison Blacks show lower percentages of IHD and AMI than their population percentages and higher percentages of angina and stroke While full presentation of race and sex findings is beyond the scope of this report the lower than expected percentages of IHD and AMI among blacks who on average have lower socioshyeconomic status and are at greater risk for CVD hospitalizations raised questions that were addressed by calculating hospitalization rates per 1000 population for each sex and race group for each income quartile (Appendix B Table 2) Figures 4 5 and 6 show these rates for the white black and Asian categories (These rates for the ldquootherrdquo category are comparatively very high and are not shown on these graphs This issue is addressed in the Discussion section) Figures 4 5 and 6 show the consistent and strong effects of income quartile on hospitalization rates for AMI and stroke for whites blacks and Asians of both sexes The other outcomes show similar patterns (The ldquootherrdquo group showed an opposite pattern [See Discussion Section])

Demographics Summary

The strong associations of lower income with residence closer to the river (Table 5) and lower income with higher hospitalization rates (Tables 7 amp 8) create a difficult challenge for this investigation The multivariable analyses of hospitalization rates attempt to control for socioshyeconomic differences However because these income differences which strongly affect hospitalization rates are also strongly correlated with the gradations of exposure (river distance) it will be difficult to disentangle the influence of socio-economic status from the influence of potential exposures

Statistical Analyses Proximity to the River and CVD Hospitalization Risk

All the variables described above were included in the multivariable regression models and almost every quartile category for every variable showed statistically significant and expected types of effects for all the CVD hospitalization categories (Appendix B Tables 1a and 1b) The following presentation of results focuses on the rate ratios for the exposure proxy variables Table 8a (IHD) and 8b (stroke) show the regression modeling results Results are shown first for models that include only age sex and race in addition to the exposure proxy categories and second for models that also include median income education population density Hispanic ethnicity and distance to nearest hospital The tables provide numbers of hospitalizations and person-years to assist with interpretation For example for total IHD we can see that while 108 of the study population lives in adjacent block groups 122 of the hospitalizations are among people in adjacent block groups This difference of 14 percentage points is the basis for

16

the finding of a statistically significantly elevated rate ratio (risk) for IHD among people living in the adjacent block groups (RR=133 CI 112-157)

Looking across the columns of Tables 8a and 8b we can see the rate ratios for each of the outcomes for the adjacent versus non-adjacent block groups and the results are similar to those for IHD The elevated rate ratios range from 123 for angina to 136 for AMI The only rate ratio not elevated in these first models adjusting only for age sex and race is for hemorrhagic stroke Going down the rows of the Table the next set of rate ratios are for the distance to the river categories The findings are very similar to those for the adjacent block groups Here however we have two exposure proxy categories allowing comparison of the two These rate ratios show no evidence that living in the block groups closest to the river (lt12 mile to the river) produces greater risk than living in the block groups close to the river (12 to 1 mile from the river)

The next set of rate ratios are for models with additional adjustment for the demographics of each hospitalized personrsquos block group as well as distance to the nearest hospital As expected adjustment for block group median income and other demographics accounts for some of the variability in hospitalization counts and thereby reduces the magnitude of the rate ratios estimated for the river adjacency and distance categories The rate ratio elevation for total IHD for example is reduced from 133 to 107 (RR=107 CI 104-111) The elevated rate ratios in these more fully adjusted models for the adjacency analyses are of similarly small magnitude ranging from 106 for AMI to 110 for ischemic stroke Four of the six outcomes show statistically significant elevations Again only hemorrhagic stroke shows no evidence of an elevated rate ratio

For the river distance categories none of the rate ratios show any evidence that living in the block groups closest to the river (lt12 mile to the river) produces greater risk than living in the block groups close to the river (12 to 1 mile) Of the six types of outcomes only two show statistically significantly elevated rate ratios for living in the closest block groups (total stroke and ischemic stroke) For the close block groups four out of six outcomes show statistically significantly elevated rate ratios The statistically significantly elevated rate ratios vary from 106 (AMI stroke) to 113 (angina)

To address the strong evidence described earlier for confounding in these data ie the association of lower income with both CVD hospitalizations and residence closer to the river we also conducted the analyses separately for each quartile of income These results are presented in Table 9 The first results are from analyses adjusting only for age sex and race and the second set of results are from models including the additional demographic factors For simplicity rate ratios are shown only for block groups closest to the river compared as in the other models to block groups farther than one mile from the river (Results were similar for the other distance categories (data not shown)

For the two lowest income quartiles Table 9 shows results similar to those for the study population as a whole (Table 8a 8b) However for the two highest income quartiles which represent frac12 of the study population Table 9 shows a consistent pattern of reduced rate ratios for residence in the closest block groups within frac12 mile of the river In the highest income quartile

17

these reduced rate ratios are statistically significant for almost all of the outcomes in both the partially and more fully adjusted models In addition the magnitude of the rate ratio estimates ie their departure from 100 is relatively high For the population living in block groups with the highest median income (income gt$73000) living within frac12 mile of the river is associated with having about 70 of the risk for IHD AMI and stroke (CI=068 069 069) as the population in equally high income block groups farther than one mile from the river In other words among the highest income quartile living this close to the river is associated with an estimated 30 reduction in risk for these types of hospitalizations For angina (CI=028) there is an estimated 70 reduction in risk and for ischemic stroke (CI=058) the reduction is estimated to be 40 These stratified analyses show that in one-half of the study population the two higher income quartiles there is no evidence of elevated hospitalization risk for populations in block groups closer to the river On the contrary reduced rate ratios are shown for all outcomes in the higher income quartiles for residents of block groups within frac12 mile of the river compared to those living farther than one mile from the river

Weighted average rate ratio estimates are a standard method for estimating a general population finding from stratified analyses and are also provided in Table 9 These average rate ratios range from 078 to 102 showing no evidence of elevated risk for the overall population associated with residence within frac12 mile of the river compared to living farther than one mile from the river Results for the other categories of river proximity showed similar results (data not shown) (Given the evidence of confounding interfering with analyses of the entire study population and the opposing findings depending on income quartile no confidence intervals are provided here for the overall population-weighted average estimates)

Summary of Results

In summary the multivariable regression results for the entire study population (Tables 8a 8b) show statistically significantly elevated rate ratio estimates for the adjacent versus non-adjacent block groups for four out of six hospitalization diagnosis categories The analyses also produce statistically significantly elevated rate ratio estimates for block groups closest to the river (less than frac12 mile from the river) and close to the river (between 12 to 1 mile from the river) compared to distant block groups (farther than 1 mile from the river) for most of the outcome categories However the findings provide no evidence that the closest block groups have higher hospitalization rates than the close block groups

The most elevated rate ratio estimate is 113 (angina) indicating an estimated 13 percent increase in risk for angina hospitalizations for residents of close versus distant block groups The only outcome with no statistically significant elevations for living near (adjacent closest or close) the river in the more fully adjusted models is hemorrhagic stroke It is noteworthy that the outcome that is considered an indicator of inadequate primary care rather than of incidence or prevalence angina hospitalizations shows the highest elevations associated with residence near the river (Table 8a) In addition the hospitalization outcome with the least association with socioshyeconomic status in our data (Appendix B Table 1b) and the lowest out-of-hospital mortality (NYS DOH2009b) hemorrhagic stroke is the outcome that shows no pattern of elevations associated with residence near the river (Table 8b) This pattern of findings is consistent with the interpretation that the rate ratios for living closer to the river are reflecting the influence of

18

lower socio-economic status and other known risk factors for hospitalizations for CVD rather than any unusual environmental exposures in areas near the river

The stratified results add strong support to this conclusion They suggest that classical confounding of the exposure-disease relationship by socio-economic status produced biased results in the overall multivariable regression analyses While the regression analysis of the entire study population shows elevated hospitalization rates for residents of block groups near the river compared to residents living farther away (Table 8a and 8b) the stratified analyses (Table 9) show this association applies only to the 50 of residents living in block groups with lower median incomes For the other 50 of residents those who live in block groups with higher median incomes hospitalization rate ratios are reduced for those living near the river Classical confounding refers to the type of confounding that can occur in individual-level or group-level (ecological) studies when a factor such as socio-economic status affects both the exposure (residential location) and the health effect In this study of CVD hospitalizations there are disproportionate numbers of outcomes and exposures (residence near the river) among specific levels of the confounder the lower income quartiles (Tables 4 5 7) The disproportionate numbers of hospitalizations and potential exposures (block groups close to the river) among the lower income quartiles create an artifactual exaggerated ie biased effect on the estimates for the overall analysis (Greenland 1999 Greenland 2003)

5 DISCUSSION

The key finding of this investigation is that the results of the multivariable regression modeling for the entire study population the elevated CVD hospitalization rates for populations living closer to the Hudson River are likely to be artifacts of classical confounding Stratifying on the confounding variable ie holding median income level constant the standard method for controlling for confounding showed the decisive role of income as a confounding variable The associations between living closer to the river and having higher hospitalization rates were evident only for people in the two lowest income quartiles For people in the two highest income quartiles the association was completely opposite ie people living closer to the river had lower rates of hospitalizations This variability of effects can be interpreted as evidence that there are no actual effects of the exposure-proxy categories on hospitalization rates

This type of classical confounding due to the strong influence of a variable that precedes causes or strongly affects both the exposure and the health outcome can occur in any type of epidemiological study The ecologic ie group-level analysis in this study is not the primary cause of the confounding Rather the complex associations in the population among income education CVD risk geographic residential patterns and geographic assignment of exposure status are the basis for the confounding However the studyrsquos group-level design contributes to the studyrsquos inability to completely control for confounding using multivariable regression methods and contributes to difficulties of interpreting the study findings

The stratified results shown in Table 9 may also be a reflection of inadequate control for socioshyeconomic status using quartile groupings A possible explanation for the Table 9 findings is that among lower income quartile populations people living in closest proximity to the river may have the lowest incomes while in higher income quartile populations people living in closest

19

proximity to the river may have the highest incomes This potential income difference associated with river distance within quartile groupings would explain why the stratified analyses showed positive associations with hospitalization risk for lower income residents closest to the river and negative associations with hospital risk for higher income residents closest to the river In this example of a possible explanation it is assumed that river proximity categories are capturing relative income (and hospitalization risk) differences rather than exposure Because the exposure proxy river distance categories are assigned at the group level this type of analysis is not able to distinguish whether these categories are capturing income rather than potential exposure in the analyses

When trying to interpret this studyrsquos findings it is important to keep in mind the many factors associated with CVD risk that are not controlled for in this studyrsquos analyses These include tobacco use high blood pressure high serum cholesterol alcohol use obesity low fruit and vegetable intake physical inactivity diabetes mental ill-health psychosocial stress and the use of certain medications Another factor that is not known to be associated with CVD risk but that is relevant for PCB exposure is consumption of contaminated fish It is possible that lower income populations are more exposed to PCBs because they are more likely to eat fish from the Hudson River This however would not help explain why higher income people near the river showed lowered CVD hospitalization risk than their counterparts living farther from the river

In addition to the confounding due to socio-economic statusrsquos effects on both the risk of CVD and residential location (the exposure proxy) there is the possibility that another pattern of effects also associated with socio-economic status called effect modification plays a role in these data Effect modification occurs when different groups people with differing types of health insurance or differing behaviors for example show different patterns or levels of health effects associated with the same exposures The differing effects (CVD hospitalization risks) of residence in block groups closest to the river by income quartile could be interpreted as evidence of effect modification (Table 9) However the magnitude and the change in direction of effects is unlikely to be due solely to differing biological or behavioral factors associated with income level that affect the results of exposures from living near the river Given the strong associations of socio-economic status and CVD hospitalization risk it is more likely that the analyses stratified by income enhance protection against bias compared to the multivariable models and that the stratified analyses represent findings from better control for confounding as opposed to effect modification

Effect modification is a concept used to describe important relationships between exposures and effects (that may differ among sub-groups for example) that the researcher wants to identify and understand Confounding on the other hand refers to associations among the variables that the researcher is trying to control so that the effects of interest (effects from exposures rather than age or sex for example) can be identified and the strength of the effects can be estimated The ecological design of this study contributes to the difficulty of interpreting whether the study shows evidence of effect modification as opposed to confounding (Morgenstern 1998b Hertz-Picciotto 1998)

Another limitation associated with this studyrsquos design is that block-group level socio-economic and environmental attributes that are assigned to individuals in this type of study can be more

20

strongly correlated at the group level than at the individual level In this way the ecological assignment of variables may lead to increasing problems of confounding because of strong associations among indicators of socio-economic status and environmental exposure categories (Morganstern 1998a) This problem is likely to have affected this studyrsquos results As with effect modification by group described above this type of confounding by group cannot be evaluated with ecologic data because individual-level information is needed to check for this type of ldquocross-levelrdquo bias

Ecological analyses are an important tool for environmental epidemiology and there is therefore a large and growing literature on their strengths and limitations and methods to address limitations (Piantadosi 1988 Greenland 1989 Greenland 1992 Greenland 1994 Richardson 2000) One particular concern is the interpretation of ecological study findings showing modest elevations of risk such as in the current study with rate ratios in the more fully adjusted models generally estimated to be below 110 (Wakefield 2003 Morgenstern 1998) Wakefield states that slight elevations need to be interpreted with great caution

in particular for cancer and heart disease (because) there are risk factors that are far more predictive of disease than environmental factors hellip Consequently the potential for confounding is strong since ecological studies do not directly use individual-level risk factor data although stratification by age and gender is routinely carried out (Frequently an area-level measure of socio-economic status is also used as a confounding variable but being an aggregate summary it only provides very crude control) (Wakefield 2003 p 9-10)

A strength of this study is that it used Census block groups which are relatively small average population of 1200 in the study area to assign socio-economic characteristics rather than the larger ZIP codes with average population size 8000 in this area By using block groups this study sought to reduce confounding by socio-economic status because the assignment of median income and other variables at the block group level was expected to be more accurate with block group populations expected to be more homogenous than larger and more geographically diffuse ZIP code populations used in the prior studies

The use of census block groups rather than ZIP codes likely enhanced the accuracy of the group-level category assignments However this type of design that assigns characteristics to hospitalizations using group-level characteristics and distance to the river is limited in comparison to individual-level designs that gather such information from personal interviews for example Most importantly for this investigation the group-level assignment of median income and other demographic variables to each hospitalization limited the studyrsquos ability to distinguish among individuals within the exposure proxy categories and thereby reduced the studyrsquos ability to control for confounding

This type of measurement error or inaccuracy from assigning group-level potential exposure income or educational level to the individual hospitalizations is generally assumed to bias a study towards findings of no associations or no effects But in situations where the measurement error is more severe for some variables than others and these variables are correlated then the study findings can be biased either towards exaggerated effects or no effect In this

21

investigation potential measurement error associated with the assignment of exposure categories based on block group centroid distance to the river is expected to be relatively low because a large proportion of the hospitalization addresses are very clearly in areas distant from the river The measurement error associated with the assignment of group-level demographic characteristics (income and education for example) from the Census is potentially relatively higher because this type of error can occur throughout every block group in the study

Another strength of this study is the use of NYS SPARCS data a relatively comprehensive population-based source of information about hospitalizations These data include individual-level information used in this study for patient address sex age race and diagnosis Of these variables only the address and race categories were missing information for substantial numbers of records and this limitation could have contributed to imprecision sufficient to create bias in the findings These issues are discussed in greater detail below

While geocoding to block groups represents a strength of this study this procedure required high quality address information One percent of the study arearsquos hospitalization records contained no address and eight percent of the addresses could not be accurately geocoded to a block group resulting in nine percent of hospitalizations being eliminated from the study because of address andor geocoding issues The distributions of race categories and distribution among the types of diagnoses did not change after dropping records with no or insufficient address information suggesting that the dropped records did not represent a substantially skewed group of records

Inaccuracy in the studyrsquos assignments of race categories results from limitations associated with both the hospitalization and Census data The SPARCS hospitalization database did not include information on race for seven percent of the studyrsquos hospitalizations We imputed race by assigning race category randomly based on the race distribution of the personrsquos block group This method would have biased the study toward a finding of no effect of race on hospitalization risk

Another limitation of the SPARCS data used in this investigation is that repeat hospitalizations for the same individual were not able to be identified If an individual was admitted to the hospital many times over the course of the study period then that individual would be counted multiple times This limitation may have contributed to an exaggerated association of lower socio-economic status and CVD hospitalizations because some people with lower incomes receive less adequate outpatient care and may have more repeat admissions Because of the association of lower income with residence near the river this limitation may have contributed to the difficulty of adequately controlling for confounding by socio-economic status in this investigation

While the use of Census data at the block group level rather than at the ZIP code level is a strength in terms of relative homogeneity among the population within block groups it is possible that block group population estimation errors contributed to imprecise estimation of hospitalization rates Particularly with regard to the race categories the study finding that the ldquootherrdquo category showed a counter-trend of higher income and higher CVD hospitalization rates (Appendix B Table 2) is evidence that differences between the categories used in the Census race designations and hospitalization race assignments affected the estimated hospitalization

22

rates The evolving choices for identifying onersquos race in the Census are not concurrently reflected in the SPARCS data and these inconsistencies likely contributed to imprecision in the estimates of CVD hospitalization risk associated with specific race categories These issues did not contribute to bias for the overall findings however We modeled the data with a variety of race category groupings combining black and ldquootherrdquo for example and the overall results did not change (data not shown)

Also regarding the Census undercounting of approximately two percent for the US population in 2000 was estimated to be primarily among the lowest socio-economic groups and minorities while over-counting of one percent was estimated to have occurred among the most wealthy who sometimes received and completed forms at multiple residences (United States Censusrsquo Monitoring Board 2001) The undercounting of the lower income population would result in over-estimation of rates in this population Undercounting of recent Asian immigrants for example particularly those of lower socio-economic status is a potential reason for the relatively high rates of CVD hospitalizations estimated for lower income quartiles of the Asian population (Figures 4-6) Models that grouped the Asian population with other race categories were also estimated and these showed the same results as the models reported here (data not shown) Undercounting of lower income groups in general would have contributed to the finding of higher hospitalization rates in these groups and added to the difficulty of adequately controlling for confounding of lower income with residence near the river

This study used a 50 percent sample of the CVD hospitalizations in the study area This was accomplished by using every other record provided in the complete dataset This would be expected to provide a non-biased random sample As a general check for data provision management or analysis errors we compared our estimates of hospitalization rates per population with those from other sources Table 10 shows these comparisons and shows that our estimates are generally consistent with rates from other sources

Actual exposures were not measured in this investigation Rather proximity to the river was used as a proxy for potential exposures associated with the river In addition other potential exposures that may impact CVD were not evaluated or measured Exposures to lead and air pollution would be expected to be higher in more urban areas and areas near heavy traffic which may tend to coincide with proximity to the river These issues were not addressed in this investigation If these exposures were occurring more often near the river and contributing to cardiovascular hospitalization rates this would contribute to findings of elevations in areas closer to the river

It is important to keep in mind the limitations associated with using CVD hospitalizations when incidence andor prevalence of CVD are the actual outcomes of interest As described previously mortality prior to reaching the hospital may reduce hospitalization rates particularly for AMI and stroke for some groups or areas more than others and introduce bias if hospitalizations are being interpreted as a surrogate for incidence or prevalence

23

Addressing Study Objectives

This follow-up report has addressed the studyrsquos first two objectives to increase control for confounding by socio-economic status through the use of block groups rather than ZIP codes as the unit of analysis and to compare the hospitalization rates among residents of block groups nearest the Hudson River to those among residents of block groups farther from the Hudson River Regarding the first objective the use of smaller units of analysis block groups rather than ZIP codes improves the accuracy of the assignment of income and other group-level demographic categories to the hospitalizations

Regarding the studyrsquos second objective to compare hospitalization rates among residents of block groups near to the river to hospitalization rates among residents far from the river this studyrsquos multivariable analyses showed relatively small but statistically significant elevations of CVD hospitalization risk for residents of block groups in close proximity to the Hudson River Stratified analyses that controlled for confounding directly by conducting analyses within each income quartile showed that residence in block groups near the Hudson River was associated with increased risk for CVD hospitalizations among residents of lower income block groups and conversely residence in block groups near the Hudson River was associated with decreased risk for CVD hospitalizations among residents of higher income block groups The estimated increased and decreased risks were of sufficient magnitude to nearly cancel each other out when average rate ratios for the four income quartiles were calculated The average rate ratios may be interpreted as showing no evidence that CVD hospitalization rates are elevated for residents living near the river for the general study population This finding suggests that the multivariable regression findings for the overall population show elevated risks for hospitalizations among residents of block groups near the river due to confounding of the relationship of residential location and CVD risk by socio-economic status

The findings from analyses stratified by income quartile may also be interpreted as showing that residing in a higher versus lower income quartile block group produces different effects from similar exposures (effect modification) As discussed previously the ecological ie group-level analyses are unable to directly evaluate confounding and effect modification Rather information about the strengths and direction of associations among variables in the analyses need to be evaluated in light of known risk factors to draw conclusions about likely explanations and plausible interpretations In this case uncontrolled confounding by socio-economic status appears to be the most likely explanation for the findings This is due to the strong association between residential proximity to the river and socio-economic status

The studyrsquos third objective was to compare the findings of this review to those of the earlier studies that prompted this investigation Because of the similarities in study design and statistical analysis the prior studyrsquos findings can be compared with the findings of the multivariate regression analyses in the current study using Table 1 and Tables 8a and 8b The major difference between the studies is the composition of the comparison areas for the analyses Because of this difference as well as other differences (Table 3) the studies are not entirely comparable

24

Table 1 shows some findings from the prior studies Comparison with Table 8a and 8b from this study shows that the CV hospitalization rate ratios estimated for living in block groups adjacent or near the river in the current study are substantially lower than the rate ratios for residence in ZIP codes adjacent to the Hudson River in the previously published studies In most cases the current analyses show elevations below 10 in contrast to elevations of 20 to 39 in the prior studies The results of the current study are quite similar to the prior study results in terms of the associations reported for age sex race and income (Appendix B Tables 1a and 1b) (Sergeev and Carpenter 2005 Tables 4 amp 5 Shcherbatykh et al 2005 Table 3) These similarities and the general methodological similarities between the current and prior studies suggest the likelihood that the prior studiesrsquo findings also were affected by confounding by socio-economic status

6 CONCLUSION AND RECOMMENDATIONS

While this type of study can neither prove nor disprove causal links among risk factors and health effects it is useful for examining evidence for unusual patterns that might warrant additional investigation This evaluation of hospitalization data residential location and socioshyeconomic factors in block groups in the 12 Counties abutting the Hudson River revealed a closely coinciding pattern of lower socio-economic status and residential proximity to the Hudson River the proxy for potential exposure and a strong association of lower socioshyeconomic status and risk for CVD hospitalizations The strong associations among residential location socio-economic status and CVD risk prevent us from drawing definitive conclusions from the multivariable regression analyses

The multivariable analyses showed relatively small but statistically significant elevations of CVD hospitalization risk for residents of block groups in close proximity to the Hudson River Stratified analyses conducted within each of the four income quartiles showed such elevations only among residents of lower income block groups This studyrsquos data methods and analyses can not provide a conclusive interpretation for this finding However the strong effect of socioshyeconomic status on residential location (exposure proxy) and on CVD hospitalization risk makes it likely that the overall multivariable regression results are affected by uncontrolled confounding by socio-economic status

To draw more definitive conclusions particularly for outcomes such as CVD that are determined by multiple known risk factors studies seeking to investigate the role of PCB or POP exposures will require resource-intensive methods that include gathering individual-level exposure or biomonitoring information as well as individual-level medical histories and information on other risk factors

This current investigation did not include environmental or biological sampling data One study described previously did gather such data in the Hudson River region with the highest potential exposures (Hudson Falls Fort Edward) This study detected a statistically significant reduction in the total PCB concentrations in outdoor air samples at a distance of 1200 meters (34 mile) from the river However the PCB levels in outdoor air in this area were lower than those in other communities with known PCB-contaminated sites and similar to levels reported in other locations in the northeastern United States The study found no detectable differences in study

25

subjectsrsquo serum PCB levels associated with proximity to contaminated portions of the river or other PCB sources (Fitzgerald 2007 Palmer 2008)

It is recommended that the results of the current investigation of CVD hospitalizations and residence near the Hudson River be shared with NYS residents and other stakeholders with interest in this issue and be submitted for publication in a peer-reviewed journal

26

AGENCY INFORMATION

New York State Department of Health Authors Elizabeth Lewis-Michl PhD

Research Scientist Bureau of Environmental and Occupational Epidemiology

June Beckman-Moore MPH Research Scientist Bureau of Environmental and Occupational Epidemiology

Marta Gomez MS Research Scientist Bureau of Environmental and Occupational Epidemiology

Acknowledgments This study was supported by ATSDR and the Centers for Disease Control and Prevention

through a Cooperative Agreement Grant to NYS DOH (3U61TS200002-19W1)

The authors would like to thank the following people for their contribution to this project Alissa Caton PhD of the Bureau of Environmental and Occupational Epidemiology NYS DOH developed the study protocol assembled the preliminary data and conducted preliminary analyses Syni-an Hwang PhD Bureau Director and Shao Lin PhD Research Scientist Bureau of Environmental and Occupational Epidemiology NYS DOH and Edward Fitzgerald PhD Professor Division of Environmental Health and Toxicology SUNY Albany School of Public Health reviewed and provided comments on draft versions

New York State Department of Health Cooperative Agreement Coordinator Don Miles MS

Public Health Specialist Bureau of Environmental Exposure Investigation

ATSDR Technical Project Officer Gregory V Ulirsch PhD

Environmental Health Scientist Cooperative Agreement and Program Evaluation Branch

Division of Health Assessment and Consultation

Mohammed Uddin MD Epidemiologist

Division of Health Studies

ATSDR Regional Representatives Leah Graziano

Senior Regional Representative - Region 2 Office of Regional Operations

Racquel Stephenson Regional Representative - Region 2

Office of Regional Operations

27

REFERENCES Agency for Toxic Substances and Disease Registry (ATSDR) 2000 Toxicological Profile for Polychlorinated Biphenyls (PCBs) ATSDR Division of ToxicologyToxicology Information Branch Atlanta GA

Arisawa K Takeda H Mikasa H 2005 ldquoBackground exposure to PCDDsPCDFsPCBs and its potential health effects a review of epidemiologic studiesrdquo The Journal of Medical Investigation Feb 52(1-2)10-21

Arsenescu V Arsenescu RI King V Swanson H Cassis LA 2008 ldquoPolychlorinated biphenylshy77 induces adipocyte differentiation and proinflammatory adipokines and promotes obesity and atherosclerosisrdquo Environmental Health Perspectives 116 (6) 761-768

Ayala C Croft J Keenan N Neff L Greenlund K Donohoo R Zheng Zhi Mensah G 2003 ldquoIncreasing Trends in Pre-transport stroke deaths- United States 1990-1998rdquo Ethnicity amp Disease 12 Spr S2131-S2137

Barnett E Armstrong D Casper M 1999 Evidence of increasing coronary heart disease mortality among black men of lower social class AEP 98 464-471

Bertazzi PA Bernucci I Brambilla G Consonni D Pesatori AC 1998 ldquoThe Seveso studies on early and long-term effects of dioxin exposure a reviewrdquo Environmental Health Perspectives April 1062 625-633

Carpenter DO 2006 ldquoPolychlorinated biphenyls (PCBs) routes of exposure and effects on human healthrdquo Reviews in Environmental Health Jan-Mar 21(1)1-23

deVreede-Swagemakers JJ Gorgels APDubois-Arbouw WI van Ree JW et al 1997 ldquoOut-of hospital cardiac arrest in the 1990s a population-based study in the Maastricht area on incidence characteristics and survivalrdquo Journal of the American College of Cardiology 30(6) 1500-5

Everett CJ Mainous AG Frithsen IL Player MS Matheson EM 2008 ldquoAssociation of polychlorinated biphenyls with hypertension in the 1999-2002 National Health and Nutrition Examination Surveyrdquo Environmental Research 108 94-97

Fitzgerald EF Belanger EE Gomez MI Hwang S Jansing RL Hicks HE 2007 ldquoEnvironmental exposures to polychlorinated biphenyls (PCBS) among older residents of upper Hudson River communitiesrdquo Environmental Research 104 352-360

Galea S Blaney S Nandi A Silverman R Vlahov D Foltin G Kusick M Tunik M Richmond N 2007 ldquoExplaining racial disparities in incidence of and survival from out-of-hospital cardiac arrestrdquo American Journal of Epidemiology 1665 534-543

Greenland S Morgenstern H 1989 ldquoEcological bias confounding and effect modification International Journal of Epidemiology 18269-74

28

Greenland S 1992 ldquoDivergent biases in ecologic and individual level studiesrdquo Statistics in Medicine 111209-23

Greenland S Robins J 1994 ldquoInvited commentary ecologic studies ndash biases misconceptions and counterexamples American Journal of Epidemiology 139747-60

Greenland S Pearl J Robins JM 1999 Causal diagrams for epidemiologic research Epidemiology 1037-48

Greenland S 2003 ldquoQuantifying biases in causal models classical confounding vs collidershystratification biasrdquo Epidemiology May Vol 143 300-306

Ha M Lee D Jacobs DR 2007 ldquoAssociation between serum concentrations of persistent organic pollutants and self-reported cardiovascular disease prevalence results from the national health and nutrition examination survey 1999-2002rdquo Environmental Health Perspectives 1158 1204-1209

Hennig B Meerarani P Slim R Toborek M Daugherty A Silverstone AE Robertson LW 2002 ldquoProinflammatory properties of coplanar PCBs in vitro and in vivo evidence Toxicology and Applied Pharmacology Jun 15181(3)174-83

Hertz-Picciotto I ldquoEnvironmental Epidemiologyrdquo in Modern Epidemiology Rothman KJ and Greenland S 2nd ed Philadelphia Lippincott Williams amp Williams 1998 pp555-584

Howard G Cushman M Prineas R Howard V Moy C Sullivan L DrsquoAgostino R McClure L Pulley L Safford M 2009 ldquoAdvancing the hypothesis that geographic variations in risk factors contribute relatively little to observbed geographic variations in heart disease and stroke mortalityrdquo Preventive Medicine 49 129-132

Huang X Lessner L Carpenter DO 2006 ldquoExposure to persistent organic pollutants and hypertensive diseaserdquo Environmental Research 102(2005) 101-106

Kannel WB Cupples LA DrsquoAgostino RB 1987 ldquoSudden death risk in overt coronary heart disease the Framingham Studyrdquo American Heart Journal 113(3) 799-804

Ko DT Krumholz HM Wang Y Foody JM Masoudi FA Havranek EP You JJ Alter DA Stukel TA Newman AM Tu JV 2007 ldquoRegional differences in process of care and outcomes for older acute myocardial infarction patients in the United States and Ontario Canadardquo Circulation 115196-203

Lee D Lee I Song K Steffes M Toscano W Baker BA Jacobs DR 2006 ldquoA strong dose-response relation between serum concentrations of persistent organic pollutant and diabetesrdquo Diabetes Care 29 (7) 1638-1644

Lee D Lee I Porta M Steffes M Jacobs DR 2007 ldquoRelationship between serum concentrations of persistent organic pollutants and the prevalence of metabolic syndrome among non-diabetic

29

adults results from the National Health and Nutrition Examination Survey 1999-2002rdquo Diabetologia 50 1841-1851

Lind PM Orberg J Edlund U Sjoblom L Lind L 2004 ldquoThe dioxin-like pollutant PCB126 (33rsquo44rsquo5-pentachlorobiphenyl) affects risk factors for cardiovascular disease in female ratsrdquo Toxicology Letters 150 293-299

Longnecker MP Klebanoff MA Brock JW Zhou H 2001 ldquopolychlorinated biphenyl seum levels in pregnant subjects with diabetesrdquo Diabetes Care 24(6) 1099-1101

Lloyd-Jones D Adams R Carnethon et al (AHA Statistics Committee and Stroke Statstics Subcommittee) 2009 ldquoAHA Statistical Update Heart disease and stroke statisticsmdash2009 updaterdquo Circulation 119e21-e181

MapMarker Plus V 100 MapInfo Corp 2004

Morgenstern H 1998 ldquoEcologic studyrdquo In Armitage P Colton T eds Encyclopedia of biostatistics Vol2 NY NY John Wiley and Sons Inc 1255-76

Morganstern H ldquoEcologic Studiesrdquo Chapter 23 in Modern Epidemiology Rothman KJ and Greenland S 2nd ed Philadelphia Lippincott Williams amp Williams 1998 pp459-480

Morland K Wolff M Bopp R Godbold J Landrigan P 2008 ldquoFish consumption and body burden of organochlorines among Lower Hudson urban anglersrdquo American Journal of Industrial Medicine 51587-594

New York State Department of Health (NYS DOH) 2009a ldquoThe Burden of Cardiovascular Disease in New York Mortality Prevalence Risk Factors Costs and Selected Populationsrdquo Bureau of Chronic Disease Epidemiology and Surveillance Bureau of Health Risk Reduction httpwwwhealthstatenyusnysdohchronic_diseasecardiovascularburdenofcvdinnyspdf

New York State Department of Health (NYS DOH) 2009b ldquoHeart Disease and Stroke Indicatorsrdquo Community health assessment indicators Bureau of Biometrics and Health Statistics httpwwwhealthstatenyusstatisticschacchai

OrsquoNeill L 2003 ldquoEstimating out-of-hospital mortality due to myocardial infarctionrdquo Health Care Management Science 6 147-154

Palmer PM Belanger EE Wilson LR Hwang SA Narang RS Gomez MI Cayo MR Durocher LA Fitzgerald EF ldquoOutdoor air PCB concentrations in three communities along the Upper Hudson River New Yorkrdquo 2008 Archives of Environmental Contamination and Toxicology 54363-371

Piantadosi S Byar DP Green SB 1988 ldquoThe ecological fallacyrdquo American Journal of Epidemiology 127893-904

30

Rehkopf DH Berkman LF Coull B Krieger N 2008 ldquoThe non-linear risk of mortality by income level in a healthy population UgtSgt National Health and Nutrition Examination Survey mortality follow-up cohort 1988-2001rdquo British Medical Journal Public Health 8383

Richardson S Montifort C 2000 ldquoEcological correlation studies In Elliott P Wakefield JC Best NG et al eds Spatial epidemiology methods and applications NY NY Oxford University Press 205-220

SAS version 9 Cary North Carolina

Schoenbach V 2001 ldquoMulticausality Effect Modification wwwepidemiolognet Ch12 381shy413

Schoenbach V 2004 ldquoMulticausality Confounding wwwepidemiolognet Ch11 335-375

Sergeev AV and Carpenter DO 2005 Hospitalization rates for coronary heart disease in relation to residence near areas contaminated with persistent organic pollutants and other pollutants Environmental Health Perspectives 113(6)756-761

Shcherbatykh I Huang X Lessner L Carpenter DO 2005 Hazardous waste sites and stroke in New York State Environmental Health A Global Access Science Source 4(18) httpwwwehjournalnetcontent4118

Siemiatycki J Wacholder S Dewar R Cardis E Greenwood C and Richardson L 1988 ldquoDegree of confounding bias related to smoking ethnic group and socio-economic status in estimates of the associations between occupation and cancerrdquo Journal of Occupational Medicine 30 617-625

Tu YK Gunnell D Gilthorpe MS 2008 ldquoSimpsonrsquos paradox Lordrsquos paradox and suppression effects are the same phenomenon- the reversal paradoxrdquo Emerging Themes in Epidemiology 52

United Kingdom Heart Attack Study Collaborative Group Norris RM 1998 ldquoFatality outside hospital from acute coronary event in three British health districts 1994-1995rdquo British Medical Journal 316(7137) 1065-70

United States Bureau of the Census 2000 Census of population and housing summary file 1 (SF1) United States Department of Commerce 2001

United States Bureau of the Census 2000 Census of population and housing summary file 3 (SF3) United States Department of Commerce 2002

United States Bureau of the Census 1990 Census of population and housing summary file 1 (SF1) United States Department of Commerce 1991

31

United States Bureau of the Census 1990 Census of population and housing summary file 3 (SF3) United States Department of Commerce 1992

United States Censusrsquo Monitoring Board ldquoFinal Report to Congressrdquo September 1 2001 httpgovinfolibraryunteducmbcmbpreportsfinal)report

United States Department of Health and Human Services (USDHHS) Agency for Healthcare Research and Quality 2007 ldquoAHRQ Quality Indicators Guide to Prevention Quality Indicators Hospital Admission for Ambulatory Care Sensitive Conditionsrdquo October 2001 Version 31 (March 12 2007) downloaded 2009 httpwwwqualityindicatorsahrqgov

United States Department of Health and Human Services (USDHHS) Centers for Disease Control 2007 Health United States A Chartbook National Center for Health Statistics wwwcdcgovnchsdatahushus07pdf

United States Department of Health and Human Services (USDHHS) Centers for Disease Control National Center for Health Statistics 2007 ldquo2005 National Hospital Discharge Surveyrdquo No 385 by De Frances C and Hall M Division of Health Care Statistics

United States Department of Health and Human Services (USDHHS) 2009 Centers for Disease Control and Prevention National Center for Environmental Health 4th National Report on Environmental Chemicals httpwwwcdcgovexposurereportpdfFourthReportpdf

United States Department of Health and Human Services (USDHHS) National Institutes of Health 2010 National Institute of Environmental Health Sciences National Toxicology Program 11th Report on Carcinogens httpntpniehsnihgovntproceleventhprofiless149pcbpdf

United States Environmental Protection Agency (USEPA) 2002 EPA Superfund record of decision Hudson River PCBs Site Hudson Falls to New York City New York Washington DC

Wakefield J 2003 Sensitivity Analyses for Ecological Regression Biometrics March 59 9-17

Wakefield J 2004

Wing S 1988 Social inequalities in the decline of coronary mortality American Journal of Public Health 781415-1416

World Health Organization 2010 Atlas of Heart Disease and Stroke Part 2 Ch 3 Cardiovascular Disease Risk Factors httpwwwwhointcardiovascular_diseasesencvd_atlas_03_risk_factorspdf

32

Table 1 Previous Study Results Statewide Analyses and Hudson River Specific Analyses

Principal Diagnosis of Hospitalization

Adjusted Rate Ratioa (Lower 95CI shyUpper 95 CI)

NYS analysis Hudson River Subset

analysis Ischemic Heart Diseaseb

115 (103-129) 136 (118-156) Acute Myocardial Infarction 120 (103-139) 139 (118-163)

Cerebrovascular Diseasec 115 (105-126) 120 (110-132)

Ischemic Stroke 117(104-139) --shy Hemorrhagic Stroke 110 (099-122) ---

Hypertensionc 119 (108-131) 114 (105-123)

Table adapted from Sergeev et al 2005 Shcherbatykh et al 2005 and Huang et al 2006 Ischemic heart disease and stroke hospitalization rates for NYS ZIP codes containing (or

adjacent to) inactive hazardous waste sites with persistent organic pollutants compared to ZIP codes without (or not adjacent to) such sites

a All analyses include a 2nd exposure category for other types of inactive hazardous waste sites and adjust for age race and sex

b Ischemic heart disease analyses include all ages over 25 Ischemic heart disease analyses adjust for quartiles of median household income and health insurance coverage

c Cerebrovascular disease and hypertension analyses include ages 25-64

33

Table 2 Comparison of Design amp Methods for Previously Published Studies and Current Follow-up Study

OUTCOME ASSESSMENT CONFOUNDER ASSESSMENT EXPOSURE ASSESSMENT Previous Study Follow-up Study Previous Study Follow-up Study Previous Study Follow-up Study

Coronary Heart Disease 1993-2000 More years 1990-2005 ZIP code level Block group level ZIP code level Block group level

Ischemic heart disease (ICD-9 410shy Same outcomes with 4 levels of income ndash Same income Exposure = residence Two types of exposure 414) and sequelae of myocardial one addition separate 2000 Census categories within 196 ZIP codes indicator categories within infarction not elsewhere classified analysis of angina (ICDshy quartiles with hazardous waste 12 counties abutting (ICD-9 4297) Separate analysis of 9 413) sites containing POPs Hudson River acute myocardial infarction 6 age groups Same age groups or 222 ZIP codes with (ICD-9 410) age 25 and above hazardous waste sites Adjacency 255 block

without POPs versus groups adjacent versus Principal or other diagnoses (up to 14) Only principal diagnoses 4 race groups Different race residence within 996 2116 not adjacent to the combined AsianPacific categories White ldquocleanrdquo ZIP codes Hudson River

Use the admission Islander White black AsianPacific statewide Reference Sergeev AV Carpenter source field to eliminate black Native Island and Other which Distance Block group DO Hospitalization rates for coronary intrahospital transfers for American includes Native Sub-analyses (focus of population-weighted heart disease in relation to residence the same individual American and Multiple follow-up) Hudson centroid distance from the near areas contaminated with Race River area 78 POP ZIP Hudson River lt12 mile persistent organic pollutants and other codes versus 996 ldquocleanrdquo (225 block groups) pollutants Environ Health Perspect Add education ZIP codes statewide between frac12 and 1 mile 2005 113(6)756-761 population density (216 block groups) farther

hispanic and distance to than 1 mile (1930 block nearest hospital as groups) additional covariates

Stroke 1993-2000 More years 1990-2005 Restricted to 2nd and (same as above) (same as above) (same as above) 3rd quartile income

Cerebrovascular disease excluding Same outcomes ZIP codes other ill-defined and late effects of cerebrovascular disease (ICD9 430shy 4 age groups 25-64 436) Separate analyses of ischemic stroke (433x1 434x1 436) and 2 race groups hemorrhagic stroke ( 430-432) White black

Principal or secondary diagnoses (up Only principal diagnoses to 14) combined

Use the admission Reference Shcherbatykh I Huang X source field to eliminate Lessner L Carpenter DO Hazardous intrahospital transfers for waste sites and stroke in New York the same individual State Environ Health 2005 418

34

Table 3 Study Cardiovascular Disease Outcomes ICD-9-CM Codes and Definitions

Ischemic Heart Disease

ICD-9-CM 410 - ldquoAcute myocardial infarction (AMI)rdquo is defined as a sudden insufficiency of blood supply to an area of the heart muscle usually due to a coronary artery occlusion (obstruction) commonly known as heart attack

ICD-9-CM 411- ldquoOther acute and subacute forms of ischemic heart diseaserdquo refers to complications or symptoms occurring before or after AMI or a coronary artery occlusion (obstruction) interrupting blood flow to the heart without AMI

ICD-9-CM 412 - ldquoOld myocardial infarctionrdquo refers to a past AMI that is currently presenting no symptoms ICD-9-CM 413 - ldquoAngina pectorisrdquo is defined as severe constricting pain in the chest often radiating from the

area immediately over the heart to the left shoulder and down the arm ICD-9-CM 414 - ldquoOther forms of chronic ischemic heart diseaserdquo includes coronary atherosclerosis a chronic

condition marked by thickening and loss of elasticity of the coronary artery caused by deposits of plaque

ICD-9-CM 4297 - ldquoCertain sequelae of myocardial infarction not elsewhere classifiedrdquo refers to other symptoms or complications associated with a prior AMI

Cerebrovascular Disease

ICD-9-CM 430 - ldquoSubarachnoid hemorrhagerdquo is bleeding in the space between the brain and lining ICD-9-CM 431 - ldquoIntracerebral hemorrhagerdquo is bleeding within the brain ICD-9-CM 432 - ldquoOther and unspecified intracranial hemorrhagerdquo is bleeding nontraumatic between the skull

and brain lining ICD-9-CM 433 - ldquoOcclusion and stenosis of precerebral arteriesrdquo is blockage or stricture of the arteries

branching into the brain ICD-9-CM 434 - ldquoOcclusion of cerebral arteriesrdquo includes cerebral thrombosis and cerebral embolism ICD-9-CM 435 - ldquoTransient cerebral ischemiardquo includes cerebrovascular insufficiency (acute) with transient

neurological signs and symptoms ICD-9-CM 436 - ldquoAcute but ill-defined cerebrovascular diseaserdquo

35

Table 4 ndash Census Demographics for Study Area by Proximity to the Hudson River 2000

Census Demographics Study Area Within Block Groups Adjacent to River Block Group Centroid Distance to River Yes No lt frac12 mile frac12-1 mile gt 1 mile

n n n n n n Total Population and distribution within river distance categories

2924631 1000 315662 108 2608969 892 231414 79 283321 97 2409896 824

RaceEthnic Distribution 1000 1000 1000 1000 1000 1000 White 2383274 815 255023 808 2128251 816 154371 667 201457 711 2027446 841 Black 277302 95 34039 108 243263 93 43895 190 38959 138 194448 81 Total ldquoOtherrdquo (listed below)

American Indian 7147 02 882 03 6265 02 975 04 1058 04 5114 02

Asian 86465 30 6233 20 80232 31 4711 20 8979 32 72775 30 Pacific Islander 1058 lt01 111 lt01 947 lt01 104 lt01 145 lt01 809 lt01 Other 105182 36 12492 40 92690 36 19288 83 23338 82 62556 26 Multiple Races 64203 22 6882 22 57321 22 8070 35 9385 33 46748 19

Hispanic 268036 92 28449 90 239587 92 40691 176 46978 166 180367 75

Education lt9th grade 108312 56 13370 64 94942 55 13585 91 14518 81 80209 50 9th-12th grade 200374 103 27023 130 173351 100 23306 155 25161 140 151907 94 High school or GED 519236 267 62727 302 456509 263 44366 296 51585 287 423285 263 Some college 336283 173 36464 175 299819 173 24995 167 31141 173 280147 174 Associates degree 150134 77 16574 80 133560 77 10134 68 12939 72 127061 79 Bachelors degree 342297 176 28578 137 313719 181 18898 126 24584 137 298815 185 Graduate or professional degree

285823 147 23199 112 262624 151 14813 99 19909 111 251101 156

+Other SES Factors Households below poverty 91940 87 12875 108 79065 83 13977 154 14685 140 63278 72

mean SD mean SD mean SD mean SD mean SD mean SD Population density 57466 9589 54496 11391 57826 93468 13501 16071 11068 16248 43763 66909 Median household income 61010 31989 47320 23630 62668 32468 40748 21923 44962 20829 64847 32564

Sources US Bureau of the Census 2000 Census of population and housing summary file 1(SF1) US Department of Commerce 2001 US Bureau of the Census 2000 Census of population and housing summary file 3 (SF3) US Department of Commerce 2001

36

Table 5 Study Population Person Years by River Proximity Category For Income Quartiles and Race Categories

Person Years Total Adjacent Non-Adjacent lt12 mile frac12 to 1 mile gt1 mile n (row)

[column ] n (row)

[column ] n ()

[column ] n ()

[column ] n ()

[column ] n ()

[column ] Total Population 30219001 (1000)

[1000]

3269547 (108)

[1000]

26949454 (892)

[1000]

2380477 (079)

[1000]

2834956 (094)

[1000]

25003568 (827)

[1000]

Income Quartile

Lowest Income 7182853 (1000)

[237]

1345251 (187)

[411]

5837601 (813)

[217]

1462710 (204)

[614]

1236936 (172)

[436]

4483206 (624)

[179]

2nd Income 7778921 (1000)

[257]

1077888 (139)

[330]

6701033 (861

[249]

420361 (054)

[177]

808782 (104)

[285]

6549777 (842)

[262]

3rd Income 7671943 (1000)

[254]

551617 (072)

[169]

7120326 (928)

[264]

344840 (045)

[145]

511237 (067)

[180]

6815866 (888)

[273]

Highest Income 7585283 (1000)

[251]

294790 (039)

[090]

7290493 (961)

[271]

152565 (020)

[064]

278000 (037)

[098]

7154718 (943)

[286]

Race Category

White 25859811 (1000)

[856]

2784914 (108)

[851]

23074897 (892)

[856]

1761910 (68)

[740]

2238348 (87)

[790]

21859554 (845)

[874]

Black 2466185 (1000

[82]

297220 (121)

[91]

2168965 (879)

[81]

389721 (158)

[164]

316704 (128)

[112]

1759760 (714)

[70]

Asian 792737 (1000)

[26]

58007 (73)

[18]

734731 (827)

[27]

45151 (57)

[19]

78161 (99)

[28]

669426 (844)

[27]

Multiple Other 1100269 (1000)

[36]

129407 (118)

[40]

970862 (882)

[36]

183696 (167)

[77]

201743 (183)

[71]

714830 (650)

[29]

37

Table 6 Study Hospitalization Record Exclusions

After After dropping After imputing dropping records with

After excluding race amp poorly missing continuation After excluding eliminating geocoded covariates

Geocoded records transfers missing race records (racesex)

Total 202272 201814 02 169143 162 167733 08 154236 80 154220 001 (Loss ) Demograp hic group

White 155216 849 154805 849 132749 859 144262 860 132343 858 132332 858 Black 12878 70 12849 70 11109 72 12010 72 11328 73 11326 73 OtherNa 12745 70 12741 70 8814 57 9313 55 8647 56 8644 56 AsianPI 2021 11 2020 11 1810 12 2148 13 1918 12 1918 12 Missing Race 19412 96 19399 96 14661 87 (1410) Diagnosis

IHD 137569 680 137303 680 111005 656 110005 656 110000 656 101012 655

AMI 51840 256 51635 256 39338 233 39338 233 39013 233 35811 232

Angina Stroke

5792 64703

29 320

5788 64511

29 320

5316 58138

31 344

5316 58138

31 344

5285 57712

31 344

4731 53208

31 345

Isch str 23673 117 23543 117 21012 124 21012 124 20912 125 19371 126 Hem str 8097 40 8074 40 6544 39 6444 39 6492 39 5995 39

Missing covariates 11 records were missing sex 325 were missing age and 14661 records were missing race Race was imputed by random assignment based on block group of residencersquos race distribution for 13251 records (310 were also missing other covariates For approximately 1100 records with missing race geocoding was inadequate for assigning block group and 310 of those missing race were also missing additional covariates) One discharge can have one or more covariates missing (ie missing covariates are not mutually exclusive

38

Table 7 Income Quartile and Race Distributions for CVD Hospitalizations

Study Population

IHD n=101012

AMI n=35811

Angina n=4731

Stroke n=53208

Ischemic Stroke n=19371

Hemorrhagic Stroke n=5995

Income (Quartiles) Lowest Income 237 286 300 356 298 309 301 Second Income 257 254 265 237 256 256 252 Third Income 254 241 236 224 236 234 223 Highest Income 251 221 201 182 209 201 224

Total 1000 999 999 1000 1000

White 855 859 885 833 858 844 809 Black 82 64 62 102 92 108 110 Asian 26 12 12 11 12 14 19 Other 36 65 41 53 38 34 62

Study population income quartiles are based on census block median income Quartile cut-points were developed by weighting block group median income levels by block group population to identify the cut-points within the continuous income distribution After creating the cut-points the study population of each block group was assigned to the appropriate quartile category This resulted in the percentage of population in each quartile varying slightly from 25 because the entire populations of block groups were assigned to the same quartile

39

Table 8a Exposure Area Distributions and Rate Ratios for Ischemic Heart Disease Hospitalizations

Exposure Person-Years

(N=30219001) in

Ischemic Heart Disease Hospitalizations Total AMI Angina

No () No () RR (95 CI) No () RR (95 CI) No () RR (95 CI)

Adjusted for Age Sex Race Adjacent Block Group Yes 3269547 (108) 12327 (122) 133 (112-157) 4398 (123) 136 (114-163) 591 (125) 123 (106-142) No 26949454 (892) 88611 (878) 100 Reference 31380 (877) 100 Reference 4137 (875) 100 Reference Distance From River lt frac12 mile 2380477 (079) 9817 (97) 120 (103-140) 3399 (095) 126 (109-145) 542 (115) 160 (137-188) frac12 - 1 mile 2834956 (094) 11236 (111) 120 (104-140) 4048 (113) 125 (109-144) 610 (129) 156 (133-183) gt1 mile 25003569 (827) 79885 (791) 100 Reference 28331(792) 100 Reference 3576 (756) 100 Reference

Addrsquol adjustment for SES and other factors Adjacent Block Group Yes 3269547 (108) 12327 (122) 107 (104-111) 4398 (123) 106 (101-110) 591 (125) 106 (096-117) No 26949454 (892) 88611 (878) 100 Reference 31380 (877) 100 Reference 4137 (875) 100 Reference Distance From River lt frac12 mile 2380477 (079) 9817 (97) 102 (098-106) 3399 (095) 103 (098-109) 542 (115) 109 (098-121) frac12 - 1 mile 2834956 (094) 11236 (111) 103 (099-108) 4048 (113) 106 (101-111) 610 (129) 113 (102-125) gt 1 mile 25003569 (827) 79885 (791) 100 Reference 28331(792) 100 Reference 3576 (756) 100 Reference

Agegt25 1990-2005 50 sample Adjusted for age (25-34 35-44 45-54 55-64 65-74 75+ years) race (white black other) and sex (male female) Adjusted for SES factors quartiles of population density (61-5066 5066lt-21020 21020lt-64102 64102lt-943716 persons per square mile) median 1999 household income ($249900-4096800 $4096800lt-5465000 $5465000lt-7294400 $7294400lt-20000100) with less than a college education (0-306 306lt-427 427lt-555 555lt-1000) with Hispanic ethnicity (0-21 21lt-47 47lt- 96 96lt-860) and distance to nearest hospital

40

Table 8b Exposure Area Distributions and Rate Ratios for Stroke Hospitalizations Age gt= 25 1990-2005

Exposure Person-Years

(N=30219001) in Study Area

Stroke Hospitalizations

Total Ischemic Hemorrhagic

No () No () RR (95 CI) No () RR (95 CI) No () RR (95 CI)

Adjusted for Age Sex Race Adjacent Block Group Yes 3269547 (108) 6276 (118) 125 (109-144) 2334 (121) 128 (110-148) 635 (108) 099 (086-113)

No 26949454 (892) 46748 (882) 100 Reference 16989 (879) 100 Reference 5263 (892) 100 Reference

Distance From River lt frac12 mile 2380477 (079) 5280 (099) 116 (104-131) 1925 (100) 120 (105-137) 518 (088) 102 (088-117)

frac12 - 1 mile 2834956 (094) 6023 (114) 118 (106-132) 2227 (115) 127 (112-145) 658 (111) 115 (100-131)

gt1 mile 25003569 (827) 41721 (787) 100 Reference 15171 (785) 100 Reference 4722 (801) 100 Reference

Addl Adjustment for SES and Other Factors Adjacent Block Group Yes 3269547 (108) 6276 (118) 107 (103-112) 2334 (121) 110 (104-116) 635 (108) 097 (088 ndash 106)

No 26949454 (892) 46748 (882) 100 Reference 16989 (879) 100 Reference 5263 (892) 100 Reference Distance From River lt frac12 mile 2380477 (079) 5280 (099) 106 (101-110) 1925 (100) 108 (101-115) 518 (088) 089 (081-099)

frac12 - 1 mile 2834956 (094) 6023 (114) 106 (102-111) 2227 (115) 111 (104-117) 658 (111) 108 (099-119)

gt 1 mile 25003569 (827) 41721 (787) 100 Reference 15171 (785) 100 Reference 4722 (801) 100 Reference

Age gt 25 years 1990-200650 sample Adjusted for age (25-34 35-44 45-54 55-64 65-74 75+ years) race (white black other) and sex (male female) Adjusted for SES factors quartiles of population density (61-5066 5066lt-21020 21020lt-64102 64102lt-943716 persons per square mile) median 1999 household income ($249900-4096800 $4096800lt-5465000 $5465000lt-7294400 $7294400lt-20000100) with less than a college education (0-306 306lt-427 427lt-555 555lt-1000) with Hispanic ethnicity (0-21 21lt-47 47lt- 96 96lt-860) and distance to nearest hospital

41

Table 9 Rate Ratios for CVD Hospitalizations Stratified by Income Quartile

Rate Ratio for lt frac12 mile versus gt 1 mile from river

IHD AMI Angina Stroke Ischemic Stroke Hemorrhagic Stroke

RR (95CI) RR (95CI) RR (95CI) RR (95CI) RR (95CI) RR (95CI)

Adjusted for Age Sex Race Lowest Income 1 123 (108-140) 124 (110-139) 139 (120-160) 120 (109-133) 122 (107-138) 118 (103-137) Income 2 115 (101-131) 118 (101-139) 155 (124-195) 124 (113-135) 127 (109-147) --shyIncome 3 088 (075-103) 083 (068-102) 102 (076-137) 092 (079-107) 093 (076-113) --shyHighest Income 4 080 (064-099) 074 (054-103) 030 (013-070) 078 (065-093) 071 (053-095) 043 (024-076) Weighted average 100 098 091 102 100 071

Addrsquol Adjustment for other demographic factors Lowest Income 1 114 (107-120) 114 (106-123) 112 (097-130) 116 (109-124) 120 (110-131) 109 (095-125) Income 2 110 (102-118) 111 (101-123) 136 (108-172) 110 (101-119) 112 (099-128) 095 (076-119) Income 3 085 (077-094) 086 (075-097) 093 (068-126) 097 (087-108) 096 (082-112) 046 (033-066) Highest Income 4 068 (060-078) 069 (057-085) 028 (012-062) 069 (059-081) 058 (044-077) --shyWeighted average 092 093 079 096 093 078

To reduce the complexity of this Table only one of the distance categories is provided The other distance categories showed similar results Adjusted for age (25-34 35-44 45-54 55-64 65-74 75+ years) race (white black other) and sex (male female) Block group median income quartile ranges $249900-4096800 $4096800lt-5465000 $5465000lt-7294400 $7294400lt-20000100 Adjusted for demographic factors quartiles of population density (61-5066 5066lt-21020 21020lt-64102 64102lt-943716 persons per square mile) with less than a college education (0-306 306lt-427 427lt-555 555lt-1000) with Hispanic ethnicity (0-21 21lt-47 47lt- 96 96lt-860) and distance to nearest hospital

42

Table 10 Hospitalization Rate Estimates for Current Study and from Published Sources (age-adjusted to 2000 US standard per 10000)

Current Study 12 Counties abutting Hudson River 1990-2005

Total US CDC 2005

Northeastern US CDC 2005

SPARCS NYS excluding NYC 2005-2007

IHD 649 619 691

AMI 232 231 278

Angina 31 Stroke 349 303 307 351

Ischemic Stroke 127 39 Hemorrhagic Stroke

Source USDHHS 2005 National Hospital Discharge Survey No 385 July 12 2007 Source NYS DOH SPARCS 2009b Comparable estimates were not able to be provided because published outcome groupings differed

43

Figure 1 ZIP codes by Adjacency to the Hudson River

44

Figure 2 Census Block Groups by Adjacency to the Hudson River

45

Figure 3 Census Block Groups by Population-Weighted Centroid Distance to the Hudson River

46

IHD Hospitalization Rates by Sex Race Income (age-adjusted) (other race

group excluded)

000

200

400

600

800

1000

1200

1400

1600

Low est IncQ

Q2 Q3 Highest Inc Q

Income Quartile

Hos

pita

lizat

ions

per

10

00

w hite men black men

asian men w hite w omen

black w omen asian w omen

AMI Hospitalization Rates by Sex Race Income (age-adjusted) (other race

group excluded)

000

100

200

300

400

500

600

700

Low est IncQ

Q2 Q3 Highest Inc Q Income Quartile

Hos

pita

lizat

ions

per

10

00

w hite men black men

asian men w hite w omen

black w omen asian w omen

Stroke Hospitalization Rates by Sex Race Income (age-adjusted) (other

race group excluded)

000

100

200

300

400

500

600

700

800

900

Low est In Q

Q2 Q3 Highest Inc Q Income quartile

Hos

pita

lizat

ions

per

10

00w hite men black men

asian men w hite w omen

black w omen asian w omen

Figure 4 Figure 5 Figure 6

47

MAURICE D HINCHEY 22NO DISTRICT NEW YORK

COMMITTEE ON APPROPRIATIONS

~lIC()foltIIInas

AGRICUJTlAE RURAL OEElOPWNT rooo AND ORUG ADMINISTRATION

AND RBATEO AGEfjC~S

INTERIOR

~onlJltss of tl)t ~nttn ~tatt5 ~oU5t of ItpttUtlltl1tibeg lHaJington 111laquo 20515-3222

October 4 2006

Howard Frumkin MD DrPH Director Agency for Toxic Substances and

Disease Registty and CDCs National Center for Environmental Health

1600 Clifton Road NE MSE-28 Atlanta GA 30333

Dear Dr Frumkin

First I would like to congratulate you on your appoinbnent as Director of A TSDR and CDCs National Center for Environmental Health llook forward to the opportunity to work with you on important public health issues this year and in the future

My second reason for writing today is to request that ATSDR assist my office in conducting a health consultation study for residents along a 200 mile segment of the Hudson River a Superfund site on the National Priority List containing PCBs in order to measure whether further evaluation may be warranted

As you may know Dr David Carpenter has done a number of studies at SUNY Albany in New York that suggest that resideilcy in proximity to hazardous waste sites with Persistant Organic Pollutants (POPs) may be associated with increased hospitalization rates for Coronary Heart Disease (CHD) when compared to hospitalization rates tor areas with no waste sites or other types of waste not catagorized as POPs I have enclosed a copy of three of these research papers that demonstrate this correlation for heart disease hypertension and stroke

I am very concerned about the health of the residents living along the Hudson River and respectfully request that your agency look into what seems to be a very serious health threat For purposes of my request T would appreciate your agency looking into the subset population Dr Carpenter examined in 78 zip codes along the 200 mile stretch of the Hudson River from Hudson Falls to New York City This subset population studied by Dr Carpenter showed a statistically significant increase in hospital discharges for heart disease hypertension and stroke Interestingly according to CDCs data base the Behavioral Risk Factor Surveillance System this population on average has higher incomes exercises more frequently and conswnes more fruits and vegetables than the average New Yorker Additionally this population has more nonsmokers than the rest of Upstate New Yorks population

WASHINGTON OIFCE

2411 RAYBURN HOUSE OfFICE BUILDING WASH1NcrroN DC 20amp1amp-3222

(Zl2) 225-6335

_fOMetoYht lnchIY

BlNGI-UMTON OfRCE m-wA OfFICE KIGSTCN OFFIepound MIDDLnWJN OffICE MOJmCEllO OffiCE IDDA IfOCML lUllDlNG 123 SOUTH CAYJOA smra U l WAll

m STREET em HALL THlflD flOOR 18 ANWANf LAKf CAD

RIIrlGHAtATON NY 13901 SUITE 201 KINGSTOfl 12401 te JUpoundS STilEET MONTlCfllO NY 12701 (6171773-2768 ITHACA HY 14860 (M5)3~ MlDgtlETOWN NY 109(0 (amptS) 7S1-71e

(607) 273-~388 (845) l4C-3l11

Appendix A Representative Hinchey Letter

Dr Howard Frumkin Page 2 October 4 2006

Since this type of population would typically exhibit better than average cardiac health Im sme you can understand and appreciate my concern and sense of urgency that we must follow up on Dr Carpenters good work and detcnnine what further study is warranted to protect the health of the populatioD living and working aloDg the Hudson River

I than1c you for yom kind attention to my request I look forward to hearing back from you at your earliest convenience

Best regards

Sincerely

~ane~ey Member of Congress

MHlag

49

Appendix B Table 1a IHD Hospitalization Rate Ratios agesexrace adjusted versus additionally adjusted For residents (age 25 and older) of block groups closer versus more distant from the Hudson River in models adjusting for sex age race and block group median household income population density education level percent Hispanic and block group centroid distance to nearest hospital 1990-2005

Adjusted Rate Ratios by Block Group River Proximity (95 Confidence Intervals) IHD (N=101012) AMI (N=35811) Angina (N=4731)

Risk Factor Agesexrace adjusted

Addrsquol adjustment Agesexrace adjusted

Addrsquol adjustment Agesexrace adjusted

Addrsquol adjustment

Block group distance from River

lt frac12 mile vs gt 1 mile 120 (103-140) 102 (098-106) 126 (109-145) 103 (098-109) 160 (137-188) 109 (098-121) frac12 - 1 mile vs gt 1 mile 120 (104-140) 103 (099-107) 125 (109-144) 106 (101-111) 156 (133-183) 113 (102-125)

Sex Male vs female 197 (174-223) 214 (208-220) 211 (187-238) 209 (202-215) 113 (099-129) 113 (106-121) Race Black vs white 120 (102-142) 093 (089-097) 109 (094-127) 092 (087-097) 178 (152-208) 121 (108-136)

AsianPacific Is ldquo 080 (066-096) 073 (069-079) 085 (070-103) 072 (065-081) 075 (055-102) 065 (049-087) Other ldquo 246 (208-291) 377 (363-392) 189 (162-220) 204 (191-218) 257 (215-307) 205 (177-238)

Age 35-44 vs 25-34 809 (630-104) 824 (743-913) 647 (494-849) 658 (568-762) 491 (348-692) 498 (376-660) 45-54 ldquo 333 (261-426) 344 (312-380) 228 (175-296) 230 (200-265) 1539 (1108-214) 157 (120-206) 55-64 ldquo 827 (647-1057) 802 (727-885) 576 (443-748) 516 (449-593) 278 (200-386) 284 (217-371) 65-74 ldquo 1477 (1155-1890) 1384 (1255-1527) 1067 (820-1387) 899 (782-1033) 409 (294-567) 449 (344-586) 75+ ldquo 2124 (1656-2724) 1942 (1760-2142) 2190 (1683-2849) 1849 (1610-2123) 661 (476-919) 695 (533-905)

Population density

2nd lowest vs least dense

126 (121-132) 131 (124-137) 142 (127-159)

more dense ldquo 130 (124-136) 134 (127-141) 131 (116-147) most dense ldquo 120 (114-126) 121 (114-129) 115 (100-131)

Median Household Income

2nd highest vs highest income

105 (101-110) 113 (108-119) 123 (110-137)

lower ldquo 109 (104-113) 123 (117-130) 125 (111-141) lowest ldquo 116 (111-122) 129 (122-137) 162 (142-185)

with less than college

2nd lowest vs least college

114 (109-118) 118 (113-124) 110 (098-122)

higher ldquo 122 (117-127) 126 (119-132) 115(102-129) highest

ldquo

122 (116-128) 128 (120-135) 129 (113-148)

Hispanic 2nd lowest vs least Hispanic

093 (089-097) 094 (0890-098) 101 (091-112)

higher ldquo 092 (088-096) 096 (092-101) 118 (107-131) highest

ldquo

085 (081-088) 088 (084-093) 138 (123-154)

Distance to Hospital

2nd closest vs closest 097 (094-100) 094 (090-098) 107 (098-118) more distant ldquo 090 (087-094) 093 (089-098) 096 (086-107) furthest ldquo 087 (083-091) 091 (087-096) 089 (079-101)

50

Appendix B Table 1b Stroke Hospitalization Rate Ratios agesexrace adjusted versus additionally adjusted For residents (age 25 and older) of block groups closer versus more distant from the Hudson River in models adjusting for sex age race and block group median household income population density education level percent Hispanic and block group centroid distance to nearest hospital 1990-2005

Adjusted Rate Ratios by Block Group River Proximity (95 Confidence Intervals) Total Stroke (N=53208) Ischemic Stroke (N=19371) Hemorrhagic Stroke (N=5995)

Risk Factor Agesexrace adjusted

Addrsquol adjustment Agesexrace adjusted

Addrsquol adjustment Agesexrace adjusted

Addrsquol adjustment

Block group distance from River

lt frac12 mile vs gt 1 mile 116 (104-131) 106 (101-110) 120 (105-137) 108 (101-115) 102 (088-117) 089 (081-099) frac12 - 1 mile vs gt 1 mile 118 (106-132) 106 (102-111) 127 (112-145) 111 (104-117) 115 (100-131) 108 (099-119)

Sex Male vs female 114 (104-125) 127 (123-130) 126 (113-141) 131 (126-136) 114 (103-127) 123 (116-130) Race Black vs white 194 (173-219) 153 (147-160) 233 (204-267) 191 (180-203) 215 (188-246) 190 (173-209)

AsianPacific Isl ldquo 097 (083-113) 088 (080-096) 114 (094-138) 101 (088-116) 142 (114-177) 129 (106-157) Other ldquo 230 (203-260) 232 (219-245) 229 (197-265) 212 (193-232) 360 (310-417) 334 (296-377)

Age 35-44 vs 25-34 309 (251-379) 303 (269-343) 359 (267-483) 362 (289-455) 274 (212-353) 268 (219-329) 45-54 ldquo 114 (938-139) 1087 (970-1217) 149 (113-197) 135 (109-167) 655 (513-836) 605 (500-734) 55-64 ldquo 331 (273-402) 329 (295-368) 433 (329-569) 418 (340-514) 119 (933-152) 109 (904-132) 65-74 ldquo 826 (682-1001 861 (772-959) 1145 (872-1504) 1114 (908-1367) 214 (168-272) 224 (186-270) 75+ ldquo 1890 (1559-2290) 2037 (1828-2269) 2768 (2112-3627) 3060 (2497-3750) 554 (438-700) 587 (490-703)

Population density

2nd lowest vs least dense

126 (120-132) 124 (117-132) 129 (117-142)

more dense ldquo 126 (120-132) 121 (113-129) 142 (128-157) most dense ldquo 124 (118-132) 128 (119-138) 150 (133-169)

Median Household Income

2nd highest vs highest income

106 (101-110) 110 (103-117) 099 (090-108)

lower ldquo 113 (108-118) 113 (106-121) 107 (097-118) lowest ldquo 116 (110-122) 117 (108-126) 114 (102-127)

with less than college

2nd lowest vs least college

111 (107-116) 112 (106-119) 098 (090-107)

higher ldquo 120 (114-125) 120 (113-128) 102 (093-112) highest ldquo 116 (110-122) 119 (110-128) 093 (090-108)

Hispanic 2nd lowest vs least Hispanic

096 (092-100) 094 (088-099) 084 (077-091)

higher ldquo 101 (096-105) 096 (091-102) 084 (077-091) highest ldquo 096 (092-100) 088 (082-093) 078 (070-085)

Distance to Hospital

2nd closest vs closest 095 (091-098) 092 (087-097) 097 (089-105) more distant ldquo 092 (088-096) 091 (086-097) 089 (081-097) furthest ldquo 085 (081-090) 082 (077-088) 088 (079-098)

51

Appendix B Table 2 Sex and Race-Specific Age-Adjusted IHD AMI and Stroke Rates by Income (per 1000)

Men Women white black asian other white black asian other

IHD Lowest IncQ 470 401 492 940 283 354 477 543 Q2 425 339 675 1443 221 236 364 690 Q3 436 301 420 2463 199 233 236 902 Highest Inc Q 399 325 218 4307 160 172 120 1247 AMI Lowest IncQ 180 154 203 269 101 112 221 159 Q2 159 141 300 306 085 088 138 179 Q3 152 114 139 480 078 084 096 214 Highest Inc Q 135 130 046 820 062 065 052 275 Stroke Lowest IncQ 201 292 330 370 166 281 343 261 Q2 182 254 389 466 150 247 274 373 Q3 192 238 231 598 145 245 201 401 Highest Inc Q 177 275 102 905 132 197 070 776

These rates are estimated from the 50 sample of hospitalizations in the 12-county study area included in this investigation The rates are adjusted to the 2000 standard US population

52

CERTIFICATION

The health consultation for the Hudson River PCBs site Residential Proximity to the Hudson River and Hospitalization Rates for Ischemic Heart Disease and Stroke 1990shy2005 was prepared by the New York State Department of Health under a cooperative agreement with the Agency for Toxic Substances and Disease Registry (ATSDR) It is in accordance with approved methodology and procedures existing at the time the health consultation was initiated Editorial review was completed by the cooperative agreement partner

The Division of Health Assessment and Consultation (DHAC) ATSDR has reviewed this public health assessment and concurs with its findings

ATSDR

Page 15: HR CVD Final HCl 2011

(PCBs) polychlorinated dibenzo-p-dioxins (PCDDs) polychorinated dibenso-furans (PCDFs) and pprsquo-diphenyldichloroethene (DDE) a breakdown product of the pesticide dichlorodiphenyltrichloroethane (DDT) These chemicals are stored in human fat tissues accumulating over an individualrsquos lifetime They also accumulate in fish and other species thereby concentrating up the food chain Levels of most POPs measured in blood serum of Americans have decreased in the last few decades An exception is polybrominated diphenyl ethers (PBDEs) levels of which have been increasing in recent years While levels of many POPs are declining a large number continue to be detectable in blood for most of the US population (USDHHS 2009)

This follow-up investigation focuses on the Hudson River where the POPs of concern are PCBs PCBs are a group of synthetic chlorinated compounds manufactured in the US through 1977 Production was halted due to concerns about the persistence of PCBs in the environment and the toxicity of PCBs shown in animal models PCBs were used primarily by the electrical industry for capacitors and transformers and were also used in hydraulic fluids fluorescent light fixtures flame retardants inks adhesives carbonless copy paper paints pesticide extenders plasticizers and wire insulators PCBs can be oily liquids or solids and some are volatile and can exist as vapor in air PCBs in the environment are generally mixtures of the many different types known as congeners (ATSDR 2000)

The variety of types of PCBs adds to the complexity of studying adverse animal and human health effects Increasingly PCB research focuses on specific PCB congeners and groups of congeners because specific types of congeners appear to have different mechanisms of action and toxicity Dioxin-like PCBs (DLPCBs) are an important subset of PCBs due to concerns about their toxicity The focus on dioxin-like PCBs is paralleled by increasing research on the group of POPs with dioxin-like properties This type of PCB and POP research assigns toxic potency values relative to dioxin to various PCBs and POPs to provide an overall toxic equivalency value to specific PCB and other POPs

From 2002 through 2007 NYS DOH and SUNY Albany School of Public Health researchers conducted the Hudson River Communities Study among residents of Hudson Falls and Fort Edward the villages where the PCB discharges from two GE electrical capacitor plants occurred The comparison community was upriver from the discharges in Glens Falls This studyrsquos evaluation of exposures included measuring PCBs in outdoor air and in blood serum The study found that PCB levels in outdoor air were statistically significantly higher in the Hudson FallsFort Edward area (072 nanograms per cubic meter) than in the upriver Glens Falls area (040 nanograms per cubic meter) but that the PCB concentrations in the Hudson FallsFort Edward area were lower than those in other communities with known PCB-contaminated sites and similar to levels reported in other locations in the northeastern United States A statistically significant reduction in the total PCB concentrations in outdoor air samples was observed at a distance of 1200 meters (34 mile) from the river from 0760 ngm3 to 0497 ngm3 (Fitzgerald et al 2007 Palmer et al 2008)

The average PCB concentrations in serum for the study and comparison populations did not differ significantly (307 ppb wet weight versus 323 ppb) The average toxic equivalency quantities (TEQ) for dioxin-like PCB congeners for the study and comparison area populations

6

also did not differ In addition the study found no detectable differences in study subjectsrsquo serum PCB levels associated with proximity to contaminated portions of the river or other PCB sources The study did find that lifetime consumption of Hudson River fish was associated with higher serum PCB levels (Fitzgerald 2007 Palmer 2008) Another recent biomonitoring study of Brooklyn Manhattan and New Jersey fisherman who consumed versus those who did not consume their catch from the lower Hudson River and New YorkNew Jersey harbor suggests eating fish from these waters is not associated with an increased body burden of PCBs or most other organochlorines This study did not address the inhalation pathway (Morland 2008)

PCBs and Cardiovascular Disease Outcomes

Research studies on adverse health effects associated with PCB exposure have focused on many types of outcomes organs and biological systems including cancer liver reproductive and developmental effects thyroid endocrine immune system and neurological effects as well as specific effects on the skin and eyes Relatively little research has focused to date on associations between PCB exposures and CVD outcomes The majority of human studies have assessed health effects in occupational groups having relatively high-level exposures among people consuming fish from contaminated waters or people exposed to relatively high levels due to accidental contamination of food (USDHHS NIH 2010)

PCBs have been shown to produce a variety of toxic effects in animals These effects include cancer allergies hypersensitivity damage to the central and peripheral nervous systems reproductive disorders and disruption of the immune system (ATSDR 2000) In contrast research to date does not tell us whether specific types of human health effects have occurred from the low-level exposures experienced by the general population Some human epidemiological studies of relatively low levels of PCDDs PCDFs and PCBs suggest possible relationships not only with neurodevelopment of infants but also diabetes and a condition called insulin resistance conditions that increase CVD risk (Longnecker 2001 Carpenter 2006 Arisawa 2005)

In 1998 a study of a population in Seveso Italy exposed to relatively pure TCDD (dioxin) from an industrial accident showed increased mortality from CVD respiratory diseases and diabetes (Bertazzi 1998) These findings suggested the possibility that other dioxin-like chemicals such as dioxin-like PCBs might increase risk for these diseases Some more recent animal studies described below suggest possible mechanisms of action for PCB effects that increase risk for CVD diabetes and hypertension Another relevant source of information is from research using the biological monitoring data from the National Health and Nutrition Examination Survey (NHANES) project to look at the association between PCB levels in blood and self-reported diabetes and heart disease The body of evidence based on analytical human epidemiological studies remains relatively sparse however

Some recent studies using NHANES biomonitoring data show associations between PCB levels and CVD (Lee 2006 Lee 2007 Ha 2007 Everett 2008) These studies are cross-sectional (comparisons at one point in time) evaluations of PCB levels measured in blood samples from individuals and self-reported CVD and diabetes a risk factor for CVD While these studies use individual-level biomonitoring and self-reported health outcome data they are considered to be

7

relatively weak in terms of providing evidence of causality because both exposure and outcome are measured at the same point in time Given the nature of NHANES data collection and reporting practices these studies are not able to control for possible geographical confounding which may occur if urban versus rural residence is associated with higher PCB levels and higher risk for CVD for example The higher PCB levels among African Americans in these data suggest urban residence may be playing a role in the PCB data

The results of animal studies suggest multiple mechanisms by which PCB exposures could affect cardiovascular risk as well as multiple types of adverse health outcomes One hypothesized mechanisms of action specifically for dioxin-like PCBs is that they may have anti-estrogenic effects which impact CVD risk factors such as serum cholesterol levels and blood pressure (Lind 2004) Several studies suggest PCBs interact with a specific receptor (AhR) and increase cellular oxidative stress causing inflammation cell injury and cardiovascular dysfunction (Hennig 2002) Similarly some studies suggest PCB-induced inflammation of fat cells leads to obesity and increased risk for coronary artery disease (atherosclerosis) (Arsenescu 2008)

3 FOLLOW-UP STUDY DATA SOURCES AND METHODS

This follow-up investigation will provide perspective on the 2004-2005 findings (Sergeev and Carpenter 2005 Shcherbatykh et al 2005) by producing results from analyses conducted at a more detailed level of geography the block group rather than ZIP code in order to better control for confounding When the term confounding is used by epidemioogists it refers to the problem that can arise when there is a factor such as income level that is not the factor of interest in the study but that is related to the risk factor being studied (residential location ndash proxy for exposure) and to the disease being studied (CVD) This type of factor is labeled a ldquoconfounderrdquo because it can confound ie interfere with or confuse the evaluation of associations among the variables of interest in the data All types of epidemiological studies face issues of confounding but ecological ie group-level analyses face additional challenges related to potential confounding because analyses can not account for some important individual differences within the groups that are the units of analysis (Morganstern 1998b Hertz-Picciotto 1998) This investigation was able to control for each individualrsquos age sex and race but not for individual-level risk factors such as income or education level

The follow-up investigationrsquos specific objectives are to

improve the ability to control for confounding by socio-economic status through the use of block groups rather than ZIP codes as the unit of analysis

compare the hospitalization rates among residents of block groups nearest the Hudson River to those among residents of block groups farther from the Hudson River

compare the findings of this follow-up study to those of the earlier studies

Study Area and Exposure Areas

The overall study area is comprised of the 12 counties with geographic areas abutting the Hudson River south of the source of PCB contamination from the General Electric facility in Hudson Falls The 12 counties are Westchester Rockland Putnam Orange Dutchess Ulster

8

Columbia Greene Rensselaer Albany Washington and Saratoga US Census boundaries are used to define the study area because census population estimates provide the data for the study area populationrsquos size and the populationrsquos age sex and race distribution Because the study period spanned 1990 through 2005 and populations change over time 1990 and 2000 Census data were used to linearly interpolate the between-year populations from 1991 through 1999 for each block group 2000 data were used without extrapolation to estimate block group populations from 2001 through 2005

Because block group boundaries change between the Census years 1990 block populations were normalized to fit 2000 block group boundaries before the estimation of sex age and race group specific populations In the digital map 2000 block group boundaries were overlaid on the 1990 block centroids to assign 1990 block population counts to the corresponding 2000 block group boundaries Then 1990 block populations were summed to fit the block group boundaries Next population counts by age group (0-14 15-24 25-34 35-44 45-54 55-64 65-74 75+) sex (male female) and race (white black asian other) were estimated for each year at the block group level This investigation includes all ages over 25 years and the study population is divided into four race categories white black or African-American Asian and ldquoOtherrdquo The Pacific Island group was combined with the Asian group and American Indian ldquootherrdquo and ldquomultiple racesrdquo categories are combined into a larger ldquootherrdquo category

Classification of the population into potential exposure categories for this study is based exclusively on location relative to the Hudson River Two types of exposure proxy categories were developed block group adjacency and block group centroid distance to the river shore For the adjacency categories block groups were assigned to the adjacent category if they directly abutted the Hudson River and all other block groups in the 12 counties were assigned to the nonshyadjacent category

For the distance categories a central point (centroid) within each block group was identified and used to assign block group centroid distance to the river shore The block group centroids were population-weighted which means the geographic distribution of households within 2000 census blocks were used to locate block group centroids based on the geographic population distribution By locating a population-weighted block centroid rather than a more straightforward distance centroid the assignment of block group distance to the river shore more accurately reflects distance to the river for the majority of block group residents Buffers of one-half mile and one mile from the Hudson River boundary were created for the assignment of each block group into one of the three distance categories less than frac12 mile (closest) frac12 to one mile (close) and farther than one mile from the river (distant)

Distance categories were selected a priori at the start of the study Because environmental sampling data were not collected along the length of the river included in this investigation we are not able to determine whether these distance categories represent actual differences in exposures As discussed previously a statistically significant reduction in the total PCB concentrations in outdoor air samples was observed at a distance of 1200 meters (34 mile) from the river from 0760 ngm3 to 0497 ngm3 in a study conducted in the Hudson Falls Fort Edward area near the source of PCBs (Fitzgerald 2007 Palmer 2007)

9

For the adjacency categories of the 2371 block groups in the study area 255 (11) were classified as adjacent to the Hudson River and 2116 (89) were classified as non-adjacent (Figure 2) For the distance categories 225 (9) were classified as closest to the river 216 (9) as close and 1930 block groups (81) as distant from the river

Comparison of Figures 1 and 2 show that block groups include smaller geographic areas than ZIP codes and that block groups are relatively smaller in area in urban versus rural areas The study area includes a total of 368 ZIP codes compared to a total of 2371 block groups ZIP code average population in the study area is approximately 8000 compared to block group average population of 1200 Comparison of Figures 2 and 3 shows that using population-weighted centroid distance to the river produces different exposure areas than adjacency Fifty-three (20) of the adjacent block groups were categorized for the distance indicator as distant because the population in the block group was concentrated in the area farther than one mile from the river On the other hand about 11 of the 2116 non-adjacent block groups were classified as closest (77) or close (162)

Hospitalization Data

Hospitalizations for ischemic heart disease (IHD) and cerebrovascular disease (stroke) were included in this study More specifically hospital discharge records that listed as the principal diagnosis either IHD (ICD-9-CM 410-414 and 4297) or cerebrovascular disease (ICD-9-CM 430-436) and subsets of each of these categories similar to the subsets evaluated in the Sergeev et al and Shcherbatykh studies were included The subsets are AMI and angina subsets of IHD and ischemic and hemorrhagic stroke which are subsets of cerebrovascular disease (stroke) (Principal diagnosis represents the illness for which the person was admitted to the hospital) Angina is an outcome not evaluated in the prior studies It was added here for comparative purposes because it represents an outcome appropriately addressed by primary outpatient rather than inpatient hospital care Angina is one of a group of outcomes included in the recently developed national ldquoPrevention Quality Indicatorsrdquo (USDHHS AHRQ 2007) as an indicator for geographic areas or populations for whom primary care is inadequate Table 3 lists the specific ICD-9-CM codes and definitions included in this investigation

While one of the previously published studies (Huang et al 2006) showed elevations in hypertension (high blood pressure) rates this outcome is rarely listed as the principal reason for hospitalization As a secondary diagnosis however hypertension is extremely common with substantial overlap with the primary diagnoses of IHD and stroke In addition hypertension hospitalizations were not as elevated in the ZIP codes adjacent to the Hudson River as the IHD and stroke outcomes As a practical matter geocoding the hypertension hospitalization data (1364000 hospitalizations) would overwhelm the resources for this project For these reasons hypertension was not included in this follow-up investigation

The source of the hospitalization data was the NYS DOH Statewide Planning and Research Cooperative System (SPARCS) established in 1979 to collect detailed records on discharges from hospitals located in New York State Persons seen in the emergency department but not admitted are not included in the SPARCS data Hospitalizations at US Veterans Administration

10

Hospitals are also not included in this dataset We identified the hospital transfers in the data and excluded them so that individuals are only counted once for the same event

The NYS DOH Data Protection Review Board for SPARCS hospitalization data approved our use of hospitalization records for this investigation and provided approximately 460000 records with a principal diagnosis of either IHD or stroke Because of the large number of cardiovascular hospitalizations and this studyrsquos requirement for geocoding each hospitalized personrsquos address we selected a 50 random sample of the hospital discharges with principal diagnoses of IHD or stroke for inclusion in the study (229948 records) Approximately one percent of these records (2737) were eliminated from the study because they contained no residential address resulting in 227211 hospitalization records 24939 of these records were eliminated after geocoding because they were not within the 12 county study area This left 202272 records remaining in the study

For geocoding to the block group level the hospitalization street addresses were modified as needed using US Postal Service standards The addresses were assigned geographic coordinates using commercially available geocoding software (MapMarker 2004) A combination of land parcel data and street centerline files were used to assign geographic coordinates No contact was made with cases parents legal guardians or next of kin of cases to determine residential locations

Once geographic coordinates were assigned to cases through address-matching the case locations were overlaid onto digital maps of the study area using a geographic information system so that the number of observed cases falling within the study area boundaries could be determined and a 2000 Census block group could be assigned to each hospitalization In order to protect confidentiality no maps of individual case locations are provided Geographic coordinates were not able to be assigned for some addresses for example rural routes or post office boxes Internet searches were conducted for addresses that were located in facilities such as nursing homes and trailer parks

While fewer than one percent of records were missing sex or age information approximately 15000 records seven percent were missing race information Evaluation of the geographic and temporal distribution of these records showed they were not randomly geographically distributed 47 of the records with no race information were from one hospital for that hospital race was missing from 100 of the records in all years except one after 1997 mid-way through the study time-frame Evaluation of other records missing race showed similar patterns of 100 of records missing race in specific years In order to prevent biased hospitalization rate estimates records missing race were assigned race randomly assuming a race distribution equal to that of the individualrsquos residential block group

Statistical Analysis

This investigation used negative binomial multivariable regression models the statistical methods used in the previous studies (Sergeev and Carpenter 2005 [IHD] Shcherbatykh et al 2005 [stroke]) The follow-up studyrsquos analyses compared hospitalization rates for populations in block groups adjacent or in close proximity to the Hudson River to rates for populations more

11

distant from the Hudson River while adjusting for the potential influence of other factors Sex age and race-specific hospitalization count data comprise the dependent variable and the statistical methods attempt to predict the counts using the sex age and race distributions from Census data as well as block group-level demographics such as median income Models were first estimated using basic Census information for age sex and race Then more fully adjusted models were estimated using information at the block group level for income education population density Hispanic ethnicity and distance to the nearest hospital

More specifically each hospitalized personrsquos block group was used to assign quartile categories of block group median household income of population with less than a college education population density of the population identified as Hispanic and distance from the block group centroid to the nearest hospital These variables were selected in advance The population density variable was selected to account for potential effects on hospitalization rates of large differences in population density from the relatively rural northern counties to the more urban southern counties in the study area of the population identified as Hispanic was included because Hispanic ethnicity may be associated with CVD risk but Hispanic ethnicity was not available in the hospitalization records Distance to the hospital was included because studies show that shorter distances increase the likelihood that individuals with AMI and stroke survive to be admitted to the hospital

As described previously the literature about risk factors for CVD shows strong associations between lower income and education and higher CVD risk Regarding the effects of income and education on hospitalization rates the literature from analytic studies specifically of hospitalizations for CVD is quite sparse Most analytic studies focus on prevalence incidence or mortality in population-based studies or cohort studies and there are not many of these An example of a study that was useful as background is a study of income level and CVD mortality that showed strong associations of lower income and higher CVD mortality for men and women with the relationship for men being linear throughout the range of income levels while for women the relationship was only strong at levels of income below the median (Rehkopf 2008)

This follow-up studyrsquos approach was to use the same set of variables for the more fully adjusted models using quartile cut points for all the sexrace groups and types of CVD diagnostic subsets The 2000 Census data were used to create the population-weighted quartile categories to be used in the statistical analyses The quartile cut points are shown below

population density(persons per square mile)

median 1999 household income

with less than a college education

with Hispanic ethnicity

Hospital distance

Quartile 1 61 - lt507 $2499-lt$40968 0 - lt31 0 - lt2 lt25 (median) Quartile 2 507 - lt102 $40968 - lt54650 31 - lt43 2 - lt5 25 - lt6 (50-75) Quartile 3 102 - lt6410 $54650 - lt72944 43 - lt55 5 - lt10 6 ndash 11 (75-90) Quartile 4 6410 - 94372 $72944 - 200001 55 - 100 10 - 86 11 ndash 45 (90-100)

SAS version 9 provided the statistical programs used to estimate hospitalization rates for exposure proxy categories and demographic categories The models provide ldquocontrast

12

estimatesrdquo ie rate ratios for one category versus a reference category of a variable and 95 confidence intervals (CI) for these rate ratios For the location categories rate ratios for the areas adjacent are compared to areas not adjacent to the river (reference) and areas closest and close to the river are compared to areas distant from the river (reference) If a rate ratio is greater than 100 the model has estimated an elevation of hospitalizations in that category of the study population compared to the reference population If the rate ratio is less than 100 then there are fewer than expected hospitalizations compared to the reference population

The magnitude of the elevation or deficit is also estimated by the rate ratio For instance if the hospitalization rate is twice as high in the adjacent population it would result in a rate ratio of 200 while a 50 higher rate would result in a rate ratio of 150 If the hospitalization rate in the comparison group was only half as big as in the reference group this would result in a rate ratio of 050 The 95 confidence interval assists with determining whether the observed differences are statistically meaningful ie statistically significant The confidence interval provides the range in which there is a 95 probability of including the true rate ratio

In addition to directly assess potential confounding we examined the associations between income and exposure proxy categories and between income and CVD hospitalization risk Because the modeling process provides rate ratios that provide comparisons among groups but not the rates themselves we also calculated the rates per population for each sex race and income category in order to directly observe the relative magnitudes of CVD hospitalization rates Finally analyses stratified by income quartile were also conducted in order to assess the potential role of confounding by income The stratified analyses consist of negative binomial regressions conducted separately for each quartile of income

4 RESULTS

Demographics Associations with Proximity to the River

Study Population Table 4 provides detailed demographic characteristics for the total study population and the populations of the adjacent versus non-adjacent and nearer versus farther from the river block groups from the 2000 Census The total population of the study area for this one year of the study time period is 2924631 persons The first row of Table 4 shows that approximately 11 percent of the study area population is classified as living in block groups adjacent to the Hudson River and 89 live in non-adjacent block groups Approximately 8 of the studyrsquos population lives in the closest block groups approximately 10 lives in close block groups and approximately 82 in distant block groups The table also shows that approximately 81 of the total study population is identified as white 9 black 3 Asian less than 01 Pacific Islander 02 American Indian 4 ldquootherrdquo and 2 ldquomultiple racesrdquo

The 2000 Census data (Table 4) also show that the populations in block groups adjacent and nonshyadjacent to the river are similar in terms of the percent of minority population but the adjacent block groups include a smaller percentage of persons who have attended college and a higher percentage of households below the poverty level Median income is $47320 for block groups adjacent versus $62668 for block groups not adjacent to the river Data for the study area from the 1990 Census show similar distributions (data not shown)

13

For the distance to the river categories there is a gradient for the three levels of distance from the river with the closest block groups on average having higher percentages of minorities smaller percentages of persons who have attended college and higher percentages of households below the poverty level (Table 4) The populations of the close and closest block groups are more similar to each other than to the populations of the distant block groups The population of the closest block groups is estimated to be 19 black close block groups 14 black and distant block groups 8 black The proportions for the ldquootherrdquo category used in the analyses which includes multiple races and American Indians are approximately 12 in both the closest block groups and in the close block groups compared to 5 in the block groups most distant from the river Median income for the closest block groups is $40748 close block groups $44962 and most distant block groups $64847

The bar chart below illustrates the correlations between US Census 2000 indicators of block group socio-economic status and block group centroid (population weighted) distance from the river for the study population

2000 Census Demographics for Study Population by Distance to River Category

black other hispanic no college below poverty

of pop ulati on

0

10

20

30

40

50

60

lt12 mile

12 to 1 mile

gt1 mile

Table 5 shows selected demographic information for the person-year estimates used in the study analyses for the 1990-2005 study period Person-year data represent individuals from Census estimates counted repeatedly for each year of the study timeframe As described in the Methods section the estimation of person years used interpolation to account for population changes from 1990-2000 and techniques for taking account of block boundary changes The person-years estimates for demographic categories (complete data not shown) show similar patterns as the 1990 Census data (data not shown) and 2000 Census data (Table 5) Table 5 for example shows nearly identical percentages of population in each of the river distance categories as Table 4

Table 5 also shows person year information for income quartiles and exposure proxy categories as well as race and exposure proxy categories that is consistent with the Table 4 data for the

14

2000 population Table 5 row percentages show for example that while approximately 11 of the total study population live in adjacent block groups approximately 19 of the population in the lowest income quartile live in adjacent block groups By design approximately 25 of the total study population is in each of the income quartiles Table 5 column percentages show however that in the adjacent block groups 41 of the population is in the lowest income quartile and only 9 are in the highest income quartile

For the block groups within frac12 mile of the river the distribution among income quartiles is even more skewed towards the lowest income quartile 61 of the population living within frac12 mile of the river is in the lowest income quartile and only 6 is in the highest income quartile Given that approximately 25 of the total study population is within each income quartile these percentages indicate that living within the block groups closest to the river is associated with more than a doubling of the likelihood of being in a block group in the lowest socio-economic status quartile 436 of those living in the block groups close to the river (12 to one mile from the river) are in block groups in the lowest income quartile While less than a doubling this indicates that living in the close block groups is associated with a 75 increase in the likelihood of being in the lowest quartile for socio-economic status

Hospitalizations Table 6 shows the process of removing hospitalization records from the study for a variety of reasons The first column includes the 202272 records that contained a residential address and were located within the 12-county study area as described in the Methods section The next five columns show the records remaining after data cleaning exclusion of transfers elimination of records with missing covariates and exclusions due to poor geocoding results Approximately 33000 records were eliminated because they were transfers from one hospital to another for the same event For approximately 1100 records geocoding was inadequate for assigning block group for imputing race as described in the Methods section 336 records were missing sex (11) or age (325) Of the 167733 records remaining 154220 (92) were able to be adequately geocoded to a street address for assignment in a specific block group While approximately eight percent of hospitalizations were eliminated due to inadequate geocoding Table 6 shows that the percentages of hospitalization records for white black Asian and ldquootherrdquo categories and the distribution among diagnostic categories change very little after these records were dropped

The final column of Table 6 shows that of the 154220 hospitalizations in the analysis 101012 IHD hospitalizations comprise the largest health outcome category and are 65 of the hospitalizations in the study AMI hospitalizations (35811) account for 35 of the IHD hospitalizations and angina (4731) accounts for five percent of the IHD hospitalizations Stroke hospitalizations (53208) comprise 34 of the studyrsquos hospitalizations Approximately 36 of the stroke hospitalizations are for ischemic stroke (19371) and approximately 11 are for hemorrhagic stroke (5995)

Table 7 shows the distributions of hospitalizations among the four income quartiles While each income quartile (by definition) contains approximately 25 of the study population for all types of hospitalizations we see that the lowest income quartile includes more than 25 of hospitalizations from a minimum of 286 (IHD) to a maximum of 356 of hospitalizations (angina) The highest income quartile includes fewer than 25 of hospitalizations for each CVD

15

category ranging from 182 for angina to 224 for hemorrhagic stroke These deviations from the 25 quartile distribution show a strong linear pattern with the largest percentage of hospitalizations coming from residents of block groups in the lowest income quartile and the smallest percentages coming from residents of the most wealthy block groups For AMI the lowest income quartilersquos share of hospitalizations is 50 higher than the highest income quartilersquos share of hospitalizations (30 vs 20) IHD and stroke show similar differences and angina shows a doubling with the lowest income quartile contributing 36 of hospitalizations compared to the highest income quartilersquos 18 share of hospitalizations

Table 7 also shows the distribution of hospitalizations among the race categories and the study population race category distribution for comparison Blacks show lower percentages of IHD and AMI than their population percentages and higher percentages of angina and stroke While full presentation of race and sex findings is beyond the scope of this report the lower than expected percentages of IHD and AMI among blacks who on average have lower socioshyeconomic status and are at greater risk for CVD hospitalizations raised questions that were addressed by calculating hospitalization rates per 1000 population for each sex and race group for each income quartile (Appendix B Table 2) Figures 4 5 and 6 show these rates for the white black and Asian categories (These rates for the ldquootherrdquo category are comparatively very high and are not shown on these graphs This issue is addressed in the Discussion section) Figures 4 5 and 6 show the consistent and strong effects of income quartile on hospitalization rates for AMI and stroke for whites blacks and Asians of both sexes The other outcomes show similar patterns (The ldquootherrdquo group showed an opposite pattern [See Discussion Section])

Demographics Summary

The strong associations of lower income with residence closer to the river (Table 5) and lower income with higher hospitalization rates (Tables 7 amp 8) create a difficult challenge for this investigation The multivariable analyses of hospitalization rates attempt to control for socioshyeconomic differences However because these income differences which strongly affect hospitalization rates are also strongly correlated with the gradations of exposure (river distance) it will be difficult to disentangle the influence of socio-economic status from the influence of potential exposures

Statistical Analyses Proximity to the River and CVD Hospitalization Risk

All the variables described above were included in the multivariable regression models and almost every quartile category for every variable showed statistically significant and expected types of effects for all the CVD hospitalization categories (Appendix B Tables 1a and 1b) The following presentation of results focuses on the rate ratios for the exposure proxy variables Table 8a (IHD) and 8b (stroke) show the regression modeling results Results are shown first for models that include only age sex and race in addition to the exposure proxy categories and second for models that also include median income education population density Hispanic ethnicity and distance to nearest hospital The tables provide numbers of hospitalizations and person-years to assist with interpretation For example for total IHD we can see that while 108 of the study population lives in adjacent block groups 122 of the hospitalizations are among people in adjacent block groups This difference of 14 percentage points is the basis for

16

the finding of a statistically significantly elevated rate ratio (risk) for IHD among people living in the adjacent block groups (RR=133 CI 112-157)

Looking across the columns of Tables 8a and 8b we can see the rate ratios for each of the outcomes for the adjacent versus non-adjacent block groups and the results are similar to those for IHD The elevated rate ratios range from 123 for angina to 136 for AMI The only rate ratio not elevated in these first models adjusting only for age sex and race is for hemorrhagic stroke Going down the rows of the Table the next set of rate ratios are for the distance to the river categories The findings are very similar to those for the adjacent block groups Here however we have two exposure proxy categories allowing comparison of the two These rate ratios show no evidence that living in the block groups closest to the river (lt12 mile to the river) produces greater risk than living in the block groups close to the river (12 to 1 mile from the river)

The next set of rate ratios are for models with additional adjustment for the demographics of each hospitalized personrsquos block group as well as distance to the nearest hospital As expected adjustment for block group median income and other demographics accounts for some of the variability in hospitalization counts and thereby reduces the magnitude of the rate ratios estimated for the river adjacency and distance categories The rate ratio elevation for total IHD for example is reduced from 133 to 107 (RR=107 CI 104-111) The elevated rate ratios in these more fully adjusted models for the adjacency analyses are of similarly small magnitude ranging from 106 for AMI to 110 for ischemic stroke Four of the six outcomes show statistically significant elevations Again only hemorrhagic stroke shows no evidence of an elevated rate ratio

For the river distance categories none of the rate ratios show any evidence that living in the block groups closest to the river (lt12 mile to the river) produces greater risk than living in the block groups close to the river (12 to 1 mile) Of the six types of outcomes only two show statistically significantly elevated rate ratios for living in the closest block groups (total stroke and ischemic stroke) For the close block groups four out of six outcomes show statistically significantly elevated rate ratios The statistically significantly elevated rate ratios vary from 106 (AMI stroke) to 113 (angina)

To address the strong evidence described earlier for confounding in these data ie the association of lower income with both CVD hospitalizations and residence closer to the river we also conducted the analyses separately for each quartile of income These results are presented in Table 9 The first results are from analyses adjusting only for age sex and race and the second set of results are from models including the additional demographic factors For simplicity rate ratios are shown only for block groups closest to the river compared as in the other models to block groups farther than one mile from the river (Results were similar for the other distance categories (data not shown)

For the two lowest income quartiles Table 9 shows results similar to those for the study population as a whole (Table 8a 8b) However for the two highest income quartiles which represent frac12 of the study population Table 9 shows a consistent pattern of reduced rate ratios for residence in the closest block groups within frac12 mile of the river In the highest income quartile

17

these reduced rate ratios are statistically significant for almost all of the outcomes in both the partially and more fully adjusted models In addition the magnitude of the rate ratio estimates ie their departure from 100 is relatively high For the population living in block groups with the highest median income (income gt$73000) living within frac12 mile of the river is associated with having about 70 of the risk for IHD AMI and stroke (CI=068 069 069) as the population in equally high income block groups farther than one mile from the river In other words among the highest income quartile living this close to the river is associated with an estimated 30 reduction in risk for these types of hospitalizations For angina (CI=028) there is an estimated 70 reduction in risk and for ischemic stroke (CI=058) the reduction is estimated to be 40 These stratified analyses show that in one-half of the study population the two higher income quartiles there is no evidence of elevated hospitalization risk for populations in block groups closer to the river On the contrary reduced rate ratios are shown for all outcomes in the higher income quartiles for residents of block groups within frac12 mile of the river compared to those living farther than one mile from the river

Weighted average rate ratio estimates are a standard method for estimating a general population finding from stratified analyses and are also provided in Table 9 These average rate ratios range from 078 to 102 showing no evidence of elevated risk for the overall population associated with residence within frac12 mile of the river compared to living farther than one mile from the river Results for the other categories of river proximity showed similar results (data not shown) (Given the evidence of confounding interfering with analyses of the entire study population and the opposing findings depending on income quartile no confidence intervals are provided here for the overall population-weighted average estimates)

Summary of Results

In summary the multivariable regression results for the entire study population (Tables 8a 8b) show statistically significantly elevated rate ratio estimates for the adjacent versus non-adjacent block groups for four out of six hospitalization diagnosis categories The analyses also produce statistically significantly elevated rate ratio estimates for block groups closest to the river (less than frac12 mile from the river) and close to the river (between 12 to 1 mile from the river) compared to distant block groups (farther than 1 mile from the river) for most of the outcome categories However the findings provide no evidence that the closest block groups have higher hospitalization rates than the close block groups

The most elevated rate ratio estimate is 113 (angina) indicating an estimated 13 percent increase in risk for angina hospitalizations for residents of close versus distant block groups The only outcome with no statistically significant elevations for living near (adjacent closest or close) the river in the more fully adjusted models is hemorrhagic stroke It is noteworthy that the outcome that is considered an indicator of inadequate primary care rather than of incidence or prevalence angina hospitalizations shows the highest elevations associated with residence near the river (Table 8a) In addition the hospitalization outcome with the least association with socioshyeconomic status in our data (Appendix B Table 1b) and the lowest out-of-hospital mortality (NYS DOH2009b) hemorrhagic stroke is the outcome that shows no pattern of elevations associated with residence near the river (Table 8b) This pattern of findings is consistent with the interpretation that the rate ratios for living closer to the river are reflecting the influence of

18

lower socio-economic status and other known risk factors for hospitalizations for CVD rather than any unusual environmental exposures in areas near the river

The stratified results add strong support to this conclusion They suggest that classical confounding of the exposure-disease relationship by socio-economic status produced biased results in the overall multivariable regression analyses While the regression analysis of the entire study population shows elevated hospitalization rates for residents of block groups near the river compared to residents living farther away (Table 8a and 8b) the stratified analyses (Table 9) show this association applies only to the 50 of residents living in block groups with lower median incomes For the other 50 of residents those who live in block groups with higher median incomes hospitalization rate ratios are reduced for those living near the river Classical confounding refers to the type of confounding that can occur in individual-level or group-level (ecological) studies when a factor such as socio-economic status affects both the exposure (residential location) and the health effect In this study of CVD hospitalizations there are disproportionate numbers of outcomes and exposures (residence near the river) among specific levels of the confounder the lower income quartiles (Tables 4 5 7) The disproportionate numbers of hospitalizations and potential exposures (block groups close to the river) among the lower income quartiles create an artifactual exaggerated ie biased effect on the estimates for the overall analysis (Greenland 1999 Greenland 2003)

5 DISCUSSION

The key finding of this investigation is that the results of the multivariable regression modeling for the entire study population the elevated CVD hospitalization rates for populations living closer to the Hudson River are likely to be artifacts of classical confounding Stratifying on the confounding variable ie holding median income level constant the standard method for controlling for confounding showed the decisive role of income as a confounding variable The associations between living closer to the river and having higher hospitalization rates were evident only for people in the two lowest income quartiles For people in the two highest income quartiles the association was completely opposite ie people living closer to the river had lower rates of hospitalizations This variability of effects can be interpreted as evidence that there are no actual effects of the exposure-proxy categories on hospitalization rates

This type of classical confounding due to the strong influence of a variable that precedes causes or strongly affects both the exposure and the health outcome can occur in any type of epidemiological study The ecologic ie group-level analysis in this study is not the primary cause of the confounding Rather the complex associations in the population among income education CVD risk geographic residential patterns and geographic assignment of exposure status are the basis for the confounding However the studyrsquos group-level design contributes to the studyrsquos inability to completely control for confounding using multivariable regression methods and contributes to difficulties of interpreting the study findings

The stratified results shown in Table 9 may also be a reflection of inadequate control for socioshyeconomic status using quartile groupings A possible explanation for the Table 9 findings is that among lower income quartile populations people living in closest proximity to the river may have the lowest incomes while in higher income quartile populations people living in closest

19

proximity to the river may have the highest incomes This potential income difference associated with river distance within quartile groupings would explain why the stratified analyses showed positive associations with hospitalization risk for lower income residents closest to the river and negative associations with hospital risk for higher income residents closest to the river In this example of a possible explanation it is assumed that river proximity categories are capturing relative income (and hospitalization risk) differences rather than exposure Because the exposure proxy river distance categories are assigned at the group level this type of analysis is not able to distinguish whether these categories are capturing income rather than potential exposure in the analyses

When trying to interpret this studyrsquos findings it is important to keep in mind the many factors associated with CVD risk that are not controlled for in this studyrsquos analyses These include tobacco use high blood pressure high serum cholesterol alcohol use obesity low fruit and vegetable intake physical inactivity diabetes mental ill-health psychosocial stress and the use of certain medications Another factor that is not known to be associated with CVD risk but that is relevant for PCB exposure is consumption of contaminated fish It is possible that lower income populations are more exposed to PCBs because they are more likely to eat fish from the Hudson River This however would not help explain why higher income people near the river showed lowered CVD hospitalization risk than their counterparts living farther from the river

In addition to the confounding due to socio-economic statusrsquos effects on both the risk of CVD and residential location (the exposure proxy) there is the possibility that another pattern of effects also associated with socio-economic status called effect modification plays a role in these data Effect modification occurs when different groups people with differing types of health insurance or differing behaviors for example show different patterns or levels of health effects associated with the same exposures The differing effects (CVD hospitalization risks) of residence in block groups closest to the river by income quartile could be interpreted as evidence of effect modification (Table 9) However the magnitude and the change in direction of effects is unlikely to be due solely to differing biological or behavioral factors associated with income level that affect the results of exposures from living near the river Given the strong associations of socio-economic status and CVD hospitalization risk it is more likely that the analyses stratified by income enhance protection against bias compared to the multivariable models and that the stratified analyses represent findings from better control for confounding as opposed to effect modification

Effect modification is a concept used to describe important relationships between exposures and effects (that may differ among sub-groups for example) that the researcher wants to identify and understand Confounding on the other hand refers to associations among the variables that the researcher is trying to control so that the effects of interest (effects from exposures rather than age or sex for example) can be identified and the strength of the effects can be estimated The ecological design of this study contributes to the difficulty of interpreting whether the study shows evidence of effect modification as opposed to confounding (Morgenstern 1998b Hertz-Picciotto 1998)

Another limitation associated with this studyrsquos design is that block-group level socio-economic and environmental attributes that are assigned to individuals in this type of study can be more

20

strongly correlated at the group level than at the individual level In this way the ecological assignment of variables may lead to increasing problems of confounding because of strong associations among indicators of socio-economic status and environmental exposure categories (Morganstern 1998a) This problem is likely to have affected this studyrsquos results As with effect modification by group described above this type of confounding by group cannot be evaluated with ecologic data because individual-level information is needed to check for this type of ldquocross-levelrdquo bias

Ecological analyses are an important tool for environmental epidemiology and there is therefore a large and growing literature on their strengths and limitations and methods to address limitations (Piantadosi 1988 Greenland 1989 Greenland 1992 Greenland 1994 Richardson 2000) One particular concern is the interpretation of ecological study findings showing modest elevations of risk such as in the current study with rate ratios in the more fully adjusted models generally estimated to be below 110 (Wakefield 2003 Morgenstern 1998) Wakefield states that slight elevations need to be interpreted with great caution

in particular for cancer and heart disease (because) there are risk factors that are far more predictive of disease than environmental factors hellip Consequently the potential for confounding is strong since ecological studies do not directly use individual-level risk factor data although stratification by age and gender is routinely carried out (Frequently an area-level measure of socio-economic status is also used as a confounding variable but being an aggregate summary it only provides very crude control) (Wakefield 2003 p 9-10)

A strength of this study is that it used Census block groups which are relatively small average population of 1200 in the study area to assign socio-economic characteristics rather than the larger ZIP codes with average population size 8000 in this area By using block groups this study sought to reduce confounding by socio-economic status because the assignment of median income and other variables at the block group level was expected to be more accurate with block group populations expected to be more homogenous than larger and more geographically diffuse ZIP code populations used in the prior studies

The use of census block groups rather than ZIP codes likely enhanced the accuracy of the group-level category assignments However this type of design that assigns characteristics to hospitalizations using group-level characteristics and distance to the river is limited in comparison to individual-level designs that gather such information from personal interviews for example Most importantly for this investigation the group-level assignment of median income and other demographic variables to each hospitalization limited the studyrsquos ability to distinguish among individuals within the exposure proxy categories and thereby reduced the studyrsquos ability to control for confounding

This type of measurement error or inaccuracy from assigning group-level potential exposure income or educational level to the individual hospitalizations is generally assumed to bias a study towards findings of no associations or no effects But in situations where the measurement error is more severe for some variables than others and these variables are correlated then the study findings can be biased either towards exaggerated effects or no effect In this

21

investigation potential measurement error associated with the assignment of exposure categories based on block group centroid distance to the river is expected to be relatively low because a large proportion of the hospitalization addresses are very clearly in areas distant from the river The measurement error associated with the assignment of group-level demographic characteristics (income and education for example) from the Census is potentially relatively higher because this type of error can occur throughout every block group in the study

Another strength of this study is the use of NYS SPARCS data a relatively comprehensive population-based source of information about hospitalizations These data include individual-level information used in this study for patient address sex age race and diagnosis Of these variables only the address and race categories were missing information for substantial numbers of records and this limitation could have contributed to imprecision sufficient to create bias in the findings These issues are discussed in greater detail below

While geocoding to block groups represents a strength of this study this procedure required high quality address information One percent of the study arearsquos hospitalization records contained no address and eight percent of the addresses could not be accurately geocoded to a block group resulting in nine percent of hospitalizations being eliminated from the study because of address andor geocoding issues The distributions of race categories and distribution among the types of diagnoses did not change after dropping records with no or insufficient address information suggesting that the dropped records did not represent a substantially skewed group of records

Inaccuracy in the studyrsquos assignments of race categories results from limitations associated with both the hospitalization and Census data The SPARCS hospitalization database did not include information on race for seven percent of the studyrsquos hospitalizations We imputed race by assigning race category randomly based on the race distribution of the personrsquos block group This method would have biased the study toward a finding of no effect of race on hospitalization risk

Another limitation of the SPARCS data used in this investigation is that repeat hospitalizations for the same individual were not able to be identified If an individual was admitted to the hospital many times over the course of the study period then that individual would be counted multiple times This limitation may have contributed to an exaggerated association of lower socio-economic status and CVD hospitalizations because some people with lower incomes receive less adequate outpatient care and may have more repeat admissions Because of the association of lower income with residence near the river this limitation may have contributed to the difficulty of adequately controlling for confounding by socio-economic status in this investigation

While the use of Census data at the block group level rather than at the ZIP code level is a strength in terms of relative homogeneity among the population within block groups it is possible that block group population estimation errors contributed to imprecise estimation of hospitalization rates Particularly with regard to the race categories the study finding that the ldquootherrdquo category showed a counter-trend of higher income and higher CVD hospitalization rates (Appendix B Table 2) is evidence that differences between the categories used in the Census race designations and hospitalization race assignments affected the estimated hospitalization

22

rates The evolving choices for identifying onersquos race in the Census are not concurrently reflected in the SPARCS data and these inconsistencies likely contributed to imprecision in the estimates of CVD hospitalization risk associated with specific race categories These issues did not contribute to bias for the overall findings however We modeled the data with a variety of race category groupings combining black and ldquootherrdquo for example and the overall results did not change (data not shown)

Also regarding the Census undercounting of approximately two percent for the US population in 2000 was estimated to be primarily among the lowest socio-economic groups and minorities while over-counting of one percent was estimated to have occurred among the most wealthy who sometimes received and completed forms at multiple residences (United States Censusrsquo Monitoring Board 2001) The undercounting of the lower income population would result in over-estimation of rates in this population Undercounting of recent Asian immigrants for example particularly those of lower socio-economic status is a potential reason for the relatively high rates of CVD hospitalizations estimated for lower income quartiles of the Asian population (Figures 4-6) Models that grouped the Asian population with other race categories were also estimated and these showed the same results as the models reported here (data not shown) Undercounting of lower income groups in general would have contributed to the finding of higher hospitalization rates in these groups and added to the difficulty of adequately controlling for confounding of lower income with residence near the river

This study used a 50 percent sample of the CVD hospitalizations in the study area This was accomplished by using every other record provided in the complete dataset This would be expected to provide a non-biased random sample As a general check for data provision management or analysis errors we compared our estimates of hospitalization rates per population with those from other sources Table 10 shows these comparisons and shows that our estimates are generally consistent with rates from other sources

Actual exposures were not measured in this investigation Rather proximity to the river was used as a proxy for potential exposures associated with the river In addition other potential exposures that may impact CVD were not evaluated or measured Exposures to lead and air pollution would be expected to be higher in more urban areas and areas near heavy traffic which may tend to coincide with proximity to the river These issues were not addressed in this investigation If these exposures were occurring more often near the river and contributing to cardiovascular hospitalization rates this would contribute to findings of elevations in areas closer to the river

It is important to keep in mind the limitations associated with using CVD hospitalizations when incidence andor prevalence of CVD are the actual outcomes of interest As described previously mortality prior to reaching the hospital may reduce hospitalization rates particularly for AMI and stroke for some groups or areas more than others and introduce bias if hospitalizations are being interpreted as a surrogate for incidence or prevalence

23

Addressing Study Objectives

This follow-up report has addressed the studyrsquos first two objectives to increase control for confounding by socio-economic status through the use of block groups rather than ZIP codes as the unit of analysis and to compare the hospitalization rates among residents of block groups nearest the Hudson River to those among residents of block groups farther from the Hudson River Regarding the first objective the use of smaller units of analysis block groups rather than ZIP codes improves the accuracy of the assignment of income and other group-level demographic categories to the hospitalizations

Regarding the studyrsquos second objective to compare hospitalization rates among residents of block groups near to the river to hospitalization rates among residents far from the river this studyrsquos multivariable analyses showed relatively small but statistically significant elevations of CVD hospitalization risk for residents of block groups in close proximity to the Hudson River Stratified analyses that controlled for confounding directly by conducting analyses within each income quartile showed that residence in block groups near the Hudson River was associated with increased risk for CVD hospitalizations among residents of lower income block groups and conversely residence in block groups near the Hudson River was associated with decreased risk for CVD hospitalizations among residents of higher income block groups The estimated increased and decreased risks were of sufficient magnitude to nearly cancel each other out when average rate ratios for the four income quartiles were calculated The average rate ratios may be interpreted as showing no evidence that CVD hospitalization rates are elevated for residents living near the river for the general study population This finding suggests that the multivariable regression findings for the overall population show elevated risks for hospitalizations among residents of block groups near the river due to confounding of the relationship of residential location and CVD risk by socio-economic status

The findings from analyses stratified by income quartile may also be interpreted as showing that residing in a higher versus lower income quartile block group produces different effects from similar exposures (effect modification) As discussed previously the ecological ie group-level analyses are unable to directly evaluate confounding and effect modification Rather information about the strengths and direction of associations among variables in the analyses need to be evaluated in light of known risk factors to draw conclusions about likely explanations and plausible interpretations In this case uncontrolled confounding by socio-economic status appears to be the most likely explanation for the findings This is due to the strong association between residential proximity to the river and socio-economic status

The studyrsquos third objective was to compare the findings of this review to those of the earlier studies that prompted this investigation Because of the similarities in study design and statistical analysis the prior studyrsquos findings can be compared with the findings of the multivariate regression analyses in the current study using Table 1 and Tables 8a and 8b The major difference between the studies is the composition of the comparison areas for the analyses Because of this difference as well as other differences (Table 3) the studies are not entirely comparable

24

Table 1 shows some findings from the prior studies Comparison with Table 8a and 8b from this study shows that the CV hospitalization rate ratios estimated for living in block groups adjacent or near the river in the current study are substantially lower than the rate ratios for residence in ZIP codes adjacent to the Hudson River in the previously published studies In most cases the current analyses show elevations below 10 in contrast to elevations of 20 to 39 in the prior studies The results of the current study are quite similar to the prior study results in terms of the associations reported for age sex race and income (Appendix B Tables 1a and 1b) (Sergeev and Carpenter 2005 Tables 4 amp 5 Shcherbatykh et al 2005 Table 3) These similarities and the general methodological similarities between the current and prior studies suggest the likelihood that the prior studiesrsquo findings also were affected by confounding by socio-economic status

6 CONCLUSION AND RECOMMENDATIONS

While this type of study can neither prove nor disprove causal links among risk factors and health effects it is useful for examining evidence for unusual patterns that might warrant additional investigation This evaluation of hospitalization data residential location and socioshyeconomic factors in block groups in the 12 Counties abutting the Hudson River revealed a closely coinciding pattern of lower socio-economic status and residential proximity to the Hudson River the proxy for potential exposure and a strong association of lower socioshyeconomic status and risk for CVD hospitalizations The strong associations among residential location socio-economic status and CVD risk prevent us from drawing definitive conclusions from the multivariable regression analyses

The multivariable analyses showed relatively small but statistically significant elevations of CVD hospitalization risk for residents of block groups in close proximity to the Hudson River Stratified analyses conducted within each of the four income quartiles showed such elevations only among residents of lower income block groups This studyrsquos data methods and analyses can not provide a conclusive interpretation for this finding However the strong effect of socioshyeconomic status on residential location (exposure proxy) and on CVD hospitalization risk makes it likely that the overall multivariable regression results are affected by uncontrolled confounding by socio-economic status

To draw more definitive conclusions particularly for outcomes such as CVD that are determined by multiple known risk factors studies seeking to investigate the role of PCB or POP exposures will require resource-intensive methods that include gathering individual-level exposure or biomonitoring information as well as individual-level medical histories and information on other risk factors

This current investigation did not include environmental or biological sampling data One study described previously did gather such data in the Hudson River region with the highest potential exposures (Hudson Falls Fort Edward) This study detected a statistically significant reduction in the total PCB concentrations in outdoor air samples at a distance of 1200 meters (34 mile) from the river However the PCB levels in outdoor air in this area were lower than those in other communities with known PCB-contaminated sites and similar to levels reported in other locations in the northeastern United States The study found no detectable differences in study

25

subjectsrsquo serum PCB levels associated with proximity to contaminated portions of the river or other PCB sources (Fitzgerald 2007 Palmer 2008)

It is recommended that the results of the current investigation of CVD hospitalizations and residence near the Hudson River be shared with NYS residents and other stakeholders with interest in this issue and be submitted for publication in a peer-reviewed journal

26

AGENCY INFORMATION

New York State Department of Health Authors Elizabeth Lewis-Michl PhD

Research Scientist Bureau of Environmental and Occupational Epidemiology

June Beckman-Moore MPH Research Scientist Bureau of Environmental and Occupational Epidemiology

Marta Gomez MS Research Scientist Bureau of Environmental and Occupational Epidemiology

Acknowledgments This study was supported by ATSDR and the Centers for Disease Control and Prevention

through a Cooperative Agreement Grant to NYS DOH (3U61TS200002-19W1)

The authors would like to thank the following people for their contribution to this project Alissa Caton PhD of the Bureau of Environmental and Occupational Epidemiology NYS DOH developed the study protocol assembled the preliminary data and conducted preliminary analyses Syni-an Hwang PhD Bureau Director and Shao Lin PhD Research Scientist Bureau of Environmental and Occupational Epidemiology NYS DOH and Edward Fitzgerald PhD Professor Division of Environmental Health and Toxicology SUNY Albany School of Public Health reviewed and provided comments on draft versions

New York State Department of Health Cooperative Agreement Coordinator Don Miles MS

Public Health Specialist Bureau of Environmental Exposure Investigation

ATSDR Technical Project Officer Gregory V Ulirsch PhD

Environmental Health Scientist Cooperative Agreement and Program Evaluation Branch

Division of Health Assessment and Consultation

Mohammed Uddin MD Epidemiologist

Division of Health Studies

ATSDR Regional Representatives Leah Graziano

Senior Regional Representative - Region 2 Office of Regional Operations

Racquel Stephenson Regional Representative - Region 2

Office of Regional Operations

27

REFERENCES Agency for Toxic Substances and Disease Registry (ATSDR) 2000 Toxicological Profile for Polychlorinated Biphenyls (PCBs) ATSDR Division of ToxicologyToxicology Information Branch Atlanta GA

Arisawa K Takeda H Mikasa H 2005 ldquoBackground exposure to PCDDsPCDFsPCBs and its potential health effects a review of epidemiologic studiesrdquo The Journal of Medical Investigation Feb 52(1-2)10-21

Arsenescu V Arsenescu RI King V Swanson H Cassis LA 2008 ldquoPolychlorinated biphenylshy77 induces adipocyte differentiation and proinflammatory adipokines and promotes obesity and atherosclerosisrdquo Environmental Health Perspectives 116 (6) 761-768

Ayala C Croft J Keenan N Neff L Greenlund K Donohoo R Zheng Zhi Mensah G 2003 ldquoIncreasing Trends in Pre-transport stroke deaths- United States 1990-1998rdquo Ethnicity amp Disease 12 Spr S2131-S2137

Barnett E Armstrong D Casper M 1999 Evidence of increasing coronary heart disease mortality among black men of lower social class AEP 98 464-471

Bertazzi PA Bernucci I Brambilla G Consonni D Pesatori AC 1998 ldquoThe Seveso studies on early and long-term effects of dioxin exposure a reviewrdquo Environmental Health Perspectives April 1062 625-633

Carpenter DO 2006 ldquoPolychlorinated biphenyls (PCBs) routes of exposure and effects on human healthrdquo Reviews in Environmental Health Jan-Mar 21(1)1-23

deVreede-Swagemakers JJ Gorgels APDubois-Arbouw WI van Ree JW et al 1997 ldquoOut-of hospital cardiac arrest in the 1990s a population-based study in the Maastricht area on incidence characteristics and survivalrdquo Journal of the American College of Cardiology 30(6) 1500-5

Everett CJ Mainous AG Frithsen IL Player MS Matheson EM 2008 ldquoAssociation of polychlorinated biphenyls with hypertension in the 1999-2002 National Health and Nutrition Examination Surveyrdquo Environmental Research 108 94-97

Fitzgerald EF Belanger EE Gomez MI Hwang S Jansing RL Hicks HE 2007 ldquoEnvironmental exposures to polychlorinated biphenyls (PCBS) among older residents of upper Hudson River communitiesrdquo Environmental Research 104 352-360

Galea S Blaney S Nandi A Silverman R Vlahov D Foltin G Kusick M Tunik M Richmond N 2007 ldquoExplaining racial disparities in incidence of and survival from out-of-hospital cardiac arrestrdquo American Journal of Epidemiology 1665 534-543

Greenland S Morgenstern H 1989 ldquoEcological bias confounding and effect modification International Journal of Epidemiology 18269-74

28

Greenland S 1992 ldquoDivergent biases in ecologic and individual level studiesrdquo Statistics in Medicine 111209-23

Greenland S Robins J 1994 ldquoInvited commentary ecologic studies ndash biases misconceptions and counterexamples American Journal of Epidemiology 139747-60

Greenland S Pearl J Robins JM 1999 Causal diagrams for epidemiologic research Epidemiology 1037-48

Greenland S 2003 ldquoQuantifying biases in causal models classical confounding vs collidershystratification biasrdquo Epidemiology May Vol 143 300-306

Ha M Lee D Jacobs DR 2007 ldquoAssociation between serum concentrations of persistent organic pollutants and self-reported cardiovascular disease prevalence results from the national health and nutrition examination survey 1999-2002rdquo Environmental Health Perspectives 1158 1204-1209

Hennig B Meerarani P Slim R Toborek M Daugherty A Silverstone AE Robertson LW 2002 ldquoProinflammatory properties of coplanar PCBs in vitro and in vivo evidence Toxicology and Applied Pharmacology Jun 15181(3)174-83

Hertz-Picciotto I ldquoEnvironmental Epidemiologyrdquo in Modern Epidemiology Rothman KJ and Greenland S 2nd ed Philadelphia Lippincott Williams amp Williams 1998 pp555-584

Howard G Cushman M Prineas R Howard V Moy C Sullivan L DrsquoAgostino R McClure L Pulley L Safford M 2009 ldquoAdvancing the hypothesis that geographic variations in risk factors contribute relatively little to observbed geographic variations in heart disease and stroke mortalityrdquo Preventive Medicine 49 129-132

Huang X Lessner L Carpenter DO 2006 ldquoExposure to persistent organic pollutants and hypertensive diseaserdquo Environmental Research 102(2005) 101-106

Kannel WB Cupples LA DrsquoAgostino RB 1987 ldquoSudden death risk in overt coronary heart disease the Framingham Studyrdquo American Heart Journal 113(3) 799-804

Ko DT Krumholz HM Wang Y Foody JM Masoudi FA Havranek EP You JJ Alter DA Stukel TA Newman AM Tu JV 2007 ldquoRegional differences in process of care and outcomes for older acute myocardial infarction patients in the United States and Ontario Canadardquo Circulation 115196-203

Lee D Lee I Song K Steffes M Toscano W Baker BA Jacobs DR 2006 ldquoA strong dose-response relation between serum concentrations of persistent organic pollutant and diabetesrdquo Diabetes Care 29 (7) 1638-1644

Lee D Lee I Porta M Steffes M Jacobs DR 2007 ldquoRelationship between serum concentrations of persistent organic pollutants and the prevalence of metabolic syndrome among non-diabetic

29

adults results from the National Health and Nutrition Examination Survey 1999-2002rdquo Diabetologia 50 1841-1851

Lind PM Orberg J Edlund U Sjoblom L Lind L 2004 ldquoThe dioxin-like pollutant PCB126 (33rsquo44rsquo5-pentachlorobiphenyl) affects risk factors for cardiovascular disease in female ratsrdquo Toxicology Letters 150 293-299

Longnecker MP Klebanoff MA Brock JW Zhou H 2001 ldquopolychlorinated biphenyl seum levels in pregnant subjects with diabetesrdquo Diabetes Care 24(6) 1099-1101

Lloyd-Jones D Adams R Carnethon et al (AHA Statistics Committee and Stroke Statstics Subcommittee) 2009 ldquoAHA Statistical Update Heart disease and stroke statisticsmdash2009 updaterdquo Circulation 119e21-e181

MapMarker Plus V 100 MapInfo Corp 2004

Morgenstern H 1998 ldquoEcologic studyrdquo In Armitage P Colton T eds Encyclopedia of biostatistics Vol2 NY NY John Wiley and Sons Inc 1255-76

Morganstern H ldquoEcologic Studiesrdquo Chapter 23 in Modern Epidemiology Rothman KJ and Greenland S 2nd ed Philadelphia Lippincott Williams amp Williams 1998 pp459-480

Morland K Wolff M Bopp R Godbold J Landrigan P 2008 ldquoFish consumption and body burden of organochlorines among Lower Hudson urban anglersrdquo American Journal of Industrial Medicine 51587-594

New York State Department of Health (NYS DOH) 2009a ldquoThe Burden of Cardiovascular Disease in New York Mortality Prevalence Risk Factors Costs and Selected Populationsrdquo Bureau of Chronic Disease Epidemiology and Surveillance Bureau of Health Risk Reduction httpwwwhealthstatenyusnysdohchronic_diseasecardiovascularburdenofcvdinnyspdf

New York State Department of Health (NYS DOH) 2009b ldquoHeart Disease and Stroke Indicatorsrdquo Community health assessment indicators Bureau of Biometrics and Health Statistics httpwwwhealthstatenyusstatisticschacchai

OrsquoNeill L 2003 ldquoEstimating out-of-hospital mortality due to myocardial infarctionrdquo Health Care Management Science 6 147-154

Palmer PM Belanger EE Wilson LR Hwang SA Narang RS Gomez MI Cayo MR Durocher LA Fitzgerald EF ldquoOutdoor air PCB concentrations in three communities along the Upper Hudson River New Yorkrdquo 2008 Archives of Environmental Contamination and Toxicology 54363-371

Piantadosi S Byar DP Green SB 1988 ldquoThe ecological fallacyrdquo American Journal of Epidemiology 127893-904

30

Rehkopf DH Berkman LF Coull B Krieger N 2008 ldquoThe non-linear risk of mortality by income level in a healthy population UgtSgt National Health and Nutrition Examination Survey mortality follow-up cohort 1988-2001rdquo British Medical Journal Public Health 8383

Richardson S Montifort C 2000 ldquoEcological correlation studies In Elliott P Wakefield JC Best NG et al eds Spatial epidemiology methods and applications NY NY Oxford University Press 205-220

SAS version 9 Cary North Carolina

Schoenbach V 2001 ldquoMulticausality Effect Modification wwwepidemiolognet Ch12 381shy413

Schoenbach V 2004 ldquoMulticausality Confounding wwwepidemiolognet Ch11 335-375

Sergeev AV and Carpenter DO 2005 Hospitalization rates for coronary heart disease in relation to residence near areas contaminated with persistent organic pollutants and other pollutants Environmental Health Perspectives 113(6)756-761

Shcherbatykh I Huang X Lessner L Carpenter DO 2005 Hazardous waste sites and stroke in New York State Environmental Health A Global Access Science Source 4(18) httpwwwehjournalnetcontent4118

Siemiatycki J Wacholder S Dewar R Cardis E Greenwood C and Richardson L 1988 ldquoDegree of confounding bias related to smoking ethnic group and socio-economic status in estimates of the associations between occupation and cancerrdquo Journal of Occupational Medicine 30 617-625

Tu YK Gunnell D Gilthorpe MS 2008 ldquoSimpsonrsquos paradox Lordrsquos paradox and suppression effects are the same phenomenon- the reversal paradoxrdquo Emerging Themes in Epidemiology 52

United Kingdom Heart Attack Study Collaborative Group Norris RM 1998 ldquoFatality outside hospital from acute coronary event in three British health districts 1994-1995rdquo British Medical Journal 316(7137) 1065-70

United States Bureau of the Census 2000 Census of population and housing summary file 1 (SF1) United States Department of Commerce 2001

United States Bureau of the Census 2000 Census of population and housing summary file 3 (SF3) United States Department of Commerce 2002

United States Bureau of the Census 1990 Census of population and housing summary file 1 (SF1) United States Department of Commerce 1991

31

United States Bureau of the Census 1990 Census of population and housing summary file 3 (SF3) United States Department of Commerce 1992

United States Censusrsquo Monitoring Board ldquoFinal Report to Congressrdquo September 1 2001 httpgovinfolibraryunteducmbcmbpreportsfinal)report

United States Department of Health and Human Services (USDHHS) Agency for Healthcare Research and Quality 2007 ldquoAHRQ Quality Indicators Guide to Prevention Quality Indicators Hospital Admission for Ambulatory Care Sensitive Conditionsrdquo October 2001 Version 31 (March 12 2007) downloaded 2009 httpwwwqualityindicatorsahrqgov

United States Department of Health and Human Services (USDHHS) Centers for Disease Control 2007 Health United States A Chartbook National Center for Health Statistics wwwcdcgovnchsdatahushus07pdf

United States Department of Health and Human Services (USDHHS) Centers for Disease Control National Center for Health Statistics 2007 ldquo2005 National Hospital Discharge Surveyrdquo No 385 by De Frances C and Hall M Division of Health Care Statistics

United States Department of Health and Human Services (USDHHS) 2009 Centers for Disease Control and Prevention National Center for Environmental Health 4th National Report on Environmental Chemicals httpwwwcdcgovexposurereportpdfFourthReportpdf

United States Department of Health and Human Services (USDHHS) National Institutes of Health 2010 National Institute of Environmental Health Sciences National Toxicology Program 11th Report on Carcinogens httpntpniehsnihgovntproceleventhprofiless149pcbpdf

United States Environmental Protection Agency (USEPA) 2002 EPA Superfund record of decision Hudson River PCBs Site Hudson Falls to New York City New York Washington DC

Wakefield J 2003 Sensitivity Analyses for Ecological Regression Biometrics March 59 9-17

Wakefield J 2004

Wing S 1988 Social inequalities in the decline of coronary mortality American Journal of Public Health 781415-1416

World Health Organization 2010 Atlas of Heart Disease and Stroke Part 2 Ch 3 Cardiovascular Disease Risk Factors httpwwwwhointcardiovascular_diseasesencvd_atlas_03_risk_factorspdf

32

Table 1 Previous Study Results Statewide Analyses and Hudson River Specific Analyses

Principal Diagnosis of Hospitalization

Adjusted Rate Ratioa (Lower 95CI shyUpper 95 CI)

NYS analysis Hudson River Subset

analysis Ischemic Heart Diseaseb

115 (103-129) 136 (118-156) Acute Myocardial Infarction 120 (103-139) 139 (118-163)

Cerebrovascular Diseasec 115 (105-126) 120 (110-132)

Ischemic Stroke 117(104-139) --shy Hemorrhagic Stroke 110 (099-122) ---

Hypertensionc 119 (108-131) 114 (105-123)

Table adapted from Sergeev et al 2005 Shcherbatykh et al 2005 and Huang et al 2006 Ischemic heart disease and stroke hospitalization rates for NYS ZIP codes containing (or

adjacent to) inactive hazardous waste sites with persistent organic pollutants compared to ZIP codes without (or not adjacent to) such sites

a All analyses include a 2nd exposure category for other types of inactive hazardous waste sites and adjust for age race and sex

b Ischemic heart disease analyses include all ages over 25 Ischemic heart disease analyses adjust for quartiles of median household income and health insurance coverage

c Cerebrovascular disease and hypertension analyses include ages 25-64

33

Table 2 Comparison of Design amp Methods for Previously Published Studies and Current Follow-up Study

OUTCOME ASSESSMENT CONFOUNDER ASSESSMENT EXPOSURE ASSESSMENT Previous Study Follow-up Study Previous Study Follow-up Study Previous Study Follow-up Study

Coronary Heart Disease 1993-2000 More years 1990-2005 ZIP code level Block group level ZIP code level Block group level

Ischemic heart disease (ICD-9 410shy Same outcomes with 4 levels of income ndash Same income Exposure = residence Two types of exposure 414) and sequelae of myocardial one addition separate 2000 Census categories within 196 ZIP codes indicator categories within infarction not elsewhere classified analysis of angina (ICDshy quartiles with hazardous waste 12 counties abutting (ICD-9 4297) Separate analysis of 9 413) sites containing POPs Hudson River acute myocardial infarction 6 age groups Same age groups or 222 ZIP codes with (ICD-9 410) age 25 and above hazardous waste sites Adjacency 255 block

without POPs versus groups adjacent versus Principal or other diagnoses (up to 14) Only principal diagnoses 4 race groups Different race residence within 996 2116 not adjacent to the combined AsianPacific categories White ldquocleanrdquo ZIP codes Hudson River

Use the admission Islander White black AsianPacific statewide Reference Sergeev AV Carpenter source field to eliminate black Native Island and Other which Distance Block group DO Hospitalization rates for coronary intrahospital transfers for American includes Native Sub-analyses (focus of population-weighted heart disease in relation to residence the same individual American and Multiple follow-up) Hudson centroid distance from the near areas contaminated with Race River area 78 POP ZIP Hudson River lt12 mile persistent organic pollutants and other codes versus 996 ldquocleanrdquo (225 block groups) pollutants Environ Health Perspect Add education ZIP codes statewide between frac12 and 1 mile 2005 113(6)756-761 population density (216 block groups) farther

hispanic and distance to than 1 mile (1930 block nearest hospital as groups) additional covariates

Stroke 1993-2000 More years 1990-2005 Restricted to 2nd and (same as above) (same as above) (same as above) 3rd quartile income

Cerebrovascular disease excluding Same outcomes ZIP codes other ill-defined and late effects of cerebrovascular disease (ICD9 430shy 4 age groups 25-64 436) Separate analyses of ischemic stroke (433x1 434x1 436) and 2 race groups hemorrhagic stroke ( 430-432) White black

Principal or secondary diagnoses (up Only principal diagnoses to 14) combined

Use the admission Reference Shcherbatykh I Huang X source field to eliminate Lessner L Carpenter DO Hazardous intrahospital transfers for waste sites and stroke in New York the same individual State Environ Health 2005 418

34

Table 3 Study Cardiovascular Disease Outcomes ICD-9-CM Codes and Definitions

Ischemic Heart Disease

ICD-9-CM 410 - ldquoAcute myocardial infarction (AMI)rdquo is defined as a sudden insufficiency of blood supply to an area of the heart muscle usually due to a coronary artery occlusion (obstruction) commonly known as heart attack

ICD-9-CM 411- ldquoOther acute and subacute forms of ischemic heart diseaserdquo refers to complications or symptoms occurring before or after AMI or a coronary artery occlusion (obstruction) interrupting blood flow to the heart without AMI

ICD-9-CM 412 - ldquoOld myocardial infarctionrdquo refers to a past AMI that is currently presenting no symptoms ICD-9-CM 413 - ldquoAngina pectorisrdquo is defined as severe constricting pain in the chest often radiating from the

area immediately over the heart to the left shoulder and down the arm ICD-9-CM 414 - ldquoOther forms of chronic ischemic heart diseaserdquo includes coronary atherosclerosis a chronic

condition marked by thickening and loss of elasticity of the coronary artery caused by deposits of plaque

ICD-9-CM 4297 - ldquoCertain sequelae of myocardial infarction not elsewhere classifiedrdquo refers to other symptoms or complications associated with a prior AMI

Cerebrovascular Disease

ICD-9-CM 430 - ldquoSubarachnoid hemorrhagerdquo is bleeding in the space between the brain and lining ICD-9-CM 431 - ldquoIntracerebral hemorrhagerdquo is bleeding within the brain ICD-9-CM 432 - ldquoOther and unspecified intracranial hemorrhagerdquo is bleeding nontraumatic between the skull

and brain lining ICD-9-CM 433 - ldquoOcclusion and stenosis of precerebral arteriesrdquo is blockage or stricture of the arteries

branching into the brain ICD-9-CM 434 - ldquoOcclusion of cerebral arteriesrdquo includes cerebral thrombosis and cerebral embolism ICD-9-CM 435 - ldquoTransient cerebral ischemiardquo includes cerebrovascular insufficiency (acute) with transient

neurological signs and symptoms ICD-9-CM 436 - ldquoAcute but ill-defined cerebrovascular diseaserdquo

35

Table 4 ndash Census Demographics for Study Area by Proximity to the Hudson River 2000

Census Demographics Study Area Within Block Groups Adjacent to River Block Group Centroid Distance to River Yes No lt frac12 mile frac12-1 mile gt 1 mile

n n n n n n Total Population and distribution within river distance categories

2924631 1000 315662 108 2608969 892 231414 79 283321 97 2409896 824

RaceEthnic Distribution 1000 1000 1000 1000 1000 1000 White 2383274 815 255023 808 2128251 816 154371 667 201457 711 2027446 841 Black 277302 95 34039 108 243263 93 43895 190 38959 138 194448 81 Total ldquoOtherrdquo (listed below)

American Indian 7147 02 882 03 6265 02 975 04 1058 04 5114 02

Asian 86465 30 6233 20 80232 31 4711 20 8979 32 72775 30 Pacific Islander 1058 lt01 111 lt01 947 lt01 104 lt01 145 lt01 809 lt01 Other 105182 36 12492 40 92690 36 19288 83 23338 82 62556 26 Multiple Races 64203 22 6882 22 57321 22 8070 35 9385 33 46748 19

Hispanic 268036 92 28449 90 239587 92 40691 176 46978 166 180367 75

Education lt9th grade 108312 56 13370 64 94942 55 13585 91 14518 81 80209 50 9th-12th grade 200374 103 27023 130 173351 100 23306 155 25161 140 151907 94 High school or GED 519236 267 62727 302 456509 263 44366 296 51585 287 423285 263 Some college 336283 173 36464 175 299819 173 24995 167 31141 173 280147 174 Associates degree 150134 77 16574 80 133560 77 10134 68 12939 72 127061 79 Bachelors degree 342297 176 28578 137 313719 181 18898 126 24584 137 298815 185 Graduate or professional degree

285823 147 23199 112 262624 151 14813 99 19909 111 251101 156

+Other SES Factors Households below poverty 91940 87 12875 108 79065 83 13977 154 14685 140 63278 72

mean SD mean SD mean SD mean SD mean SD mean SD Population density 57466 9589 54496 11391 57826 93468 13501 16071 11068 16248 43763 66909 Median household income 61010 31989 47320 23630 62668 32468 40748 21923 44962 20829 64847 32564

Sources US Bureau of the Census 2000 Census of population and housing summary file 1(SF1) US Department of Commerce 2001 US Bureau of the Census 2000 Census of population and housing summary file 3 (SF3) US Department of Commerce 2001

36

Table 5 Study Population Person Years by River Proximity Category For Income Quartiles and Race Categories

Person Years Total Adjacent Non-Adjacent lt12 mile frac12 to 1 mile gt1 mile n (row)

[column ] n (row)

[column ] n ()

[column ] n ()

[column ] n ()

[column ] n ()

[column ] Total Population 30219001 (1000)

[1000]

3269547 (108)

[1000]

26949454 (892)

[1000]

2380477 (079)

[1000]

2834956 (094)

[1000]

25003568 (827)

[1000]

Income Quartile

Lowest Income 7182853 (1000)

[237]

1345251 (187)

[411]

5837601 (813)

[217]

1462710 (204)

[614]

1236936 (172)

[436]

4483206 (624)

[179]

2nd Income 7778921 (1000)

[257]

1077888 (139)

[330]

6701033 (861

[249]

420361 (054)

[177]

808782 (104)

[285]

6549777 (842)

[262]

3rd Income 7671943 (1000)

[254]

551617 (072)

[169]

7120326 (928)

[264]

344840 (045)

[145]

511237 (067)

[180]

6815866 (888)

[273]

Highest Income 7585283 (1000)

[251]

294790 (039)

[090]

7290493 (961)

[271]

152565 (020)

[064]

278000 (037)

[098]

7154718 (943)

[286]

Race Category

White 25859811 (1000)

[856]

2784914 (108)

[851]

23074897 (892)

[856]

1761910 (68)

[740]

2238348 (87)

[790]

21859554 (845)

[874]

Black 2466185 (1000

[82]

297220 (121)

[91]

2168965 (879)

[81]

389721 (158)

[164]

316704 (128)

[112]

1759760 (714)

[70]

Asian 792737 (1000)

[26]

58007 (73)

[18]

734731 (827)

[27]

45151 (57)

[19]

78161 (99)

[28]

669426 (844)

[27]

Multiple Other 1100269 (1000)

[36]

129407 (118)

[40]

970862 (882)

[36]

183696 (167)

[77]

201743 (183)

[71]

714830 (650)

[29]

37

Table 6 Study Hospitalization Record Exclusions

After After dropping After imputing dropping records with

After excluding race amp poorly missing continuation After excluding eliminating geocoded covariates

Geocoded records transfers missing race records (racesex)

Total 202272 201814 02 169143 162 167733 08 154236 80 154220 001 (Loss ) Demograp hic group

White 155216 849 154805 849 132749 859 144262 860 132343 858 132332 858 Black 12878 70 12849 70 11109 72 12010 72 11328 73 11326 73 OtherNa 12745 70 12741 70 8814 57 9313 55 8647 56 8644 56 AsianPI 2021 11 2020 11 1810 12 2148 13 1918 12 1918 12 Missing Race 19412 96 19399 96 14661 87 (1410) Diagnosis

IHD 137569 680 137303 680 111005 656 110005 656 110000 656 101012 655

AMI 51840 256 51635 256 39338 233 39338 233 39013 233 35811 232

Angina Stroke

5792 64703

29 320

5788 64511

29 320

5316 58138

31 344

5316 58138

31 344

5285 57712

31 344

4731 53208

31 345

Isch str 23673 117 23543 117 21012 124 21012 124 20912 125 19371 126 Hem str 8097 40 8074 40 6544 39 6444 39 6492 39 5995 39

Missing covariates 11 records were missing sex 325 were missing age and 14661 records were missing race Race was imputed by random assignment based on block group of residencersquos race distribution for 13251 records (310 were also missing other covariates For approximately 1100 records with missing race geocoding was inadequate for assigning block group and 310 of those missing race were also missing additional covariates) One discharge can have one or more covariates missing (ie missing covariates are not mutually exclusive

38

Table 7 Income Quartile and Race Distributions for CVD Hospitalizations

Study Population

IHD n=101012

AMI n=35811

Angina n=4731

Stroke n=53208

Ischemic Stroke n=19371

Hemorrhagic Stroke n=5995

Income (Quartiles) Lowest Income 237 286 300 356 298 309 301 Second Income 257 254 265 237 256 256 252 Third Income 254 241 236 224 236 234 223 Highest Income 251 221 201 182 209 201 224

Total 1000 999 999 1000 1000

White 855 859 885 833 858 844 809 Black 82 64 62 102 92 108 110 Asian 26 12 12 11 12 14 19 Other 36 65 41 53 38 34 62

Study population income quartiles are based on census block median income Quartile cut-points were developed by weighting block group median income levels by block group population to identify the cut-points within the continuous income distribution After creating the cut-points the study population of each block group was assigned to the appropriate quartile category This resulted in the percentage of population in each quartile varying slightly from 25 because the entire populations of block groups were assigned to the same quartile

39

Table 8a Exposure Area Distributions and Rate Ratios for Ischemic Heart Disease Hospitalizations

Exposure Person-Years

(N=30219001) in

Ischemic Heart Disease Hospitalizations Total AMI Angina

No () No () RR (95 CI) No () RR (95 CI) No () RR (95 CI)

Adjusted for Age Sex Race Adjacent Block Group Yes 3269547 (108) 12327 (122) 133 (112-157) 4398 (123) 136 (114-163) 591 (125) 123 (106-142) No 26949454 (892) 88611 (878) 100 Reference 31380 (877) 100 Reference 4137 (875) 100 Reference Distance From River lt frac12 mile 2380477 (079) 9817 (97) 120 (103-140) 3399 (095) 126 (109-145) 542 (115) 160 (137-188) frac12 - 1 mile 2834956 (094) 11236 (111) 120 (104-140) 4048 (113) 125 (109-144) 610 (129) 156 (133-183) gt1 mile 25003569 (827) 79885 (791) 100 Reference 28331(792) 100 Reference 3576 (756) 100 Reference

Addrsquol adjustment for SES and other factors Adjacent Block Group Yes 3269547 (108) 12327 (122) 107 (104-111) 4398 (123) 106 (101-110) 591 (125) 106 (096-117) No 26949454 (892) 88611 (878) 100 Reference 31380 (877) 100 Reference 4137 (875) 100 Reference Distance From River lt frac12 mile 2380477 (079) 9817 (97) 102 (098-106) 3399 (095) 103 (098-109) 542 (115) 109 (098-121) frac12 - 1 mile 2834956 (094) 11236 (111) 103 (099-108) 4048 (113) 106 (101-111) 610 (129) 113 (102-125) gt 1 mile 25003569 (827) 79885 (791) 100 Reference 28331(792) 100 Reference 3576 (756) 100 Reference

Agegt25 1990-2005 50 sample Adjusted for age (25-34 35-44 45-54 55-64 65-74 75+ years) race (white black other) and sex (male female) Adjusted for SES factors quartiles of population density (61-5066 5066lt-21020 21020lt-64102 64102lt-943716 persons per square mile) median 1999 household income ($249900-4096800 $4096800lt-5465000 $5465000lt-7294400 $7294400lt-20000100) with less than a college education (0-306 306lt-427 427lt-555 555lt-1000) with Hispanic ethnicity (0-21 21lt-47 47lt- 96 96lt-860) and distance to nearest hospital

40

Table 8b Exposure Area Distributions and Rate Ratios for Stroke Hospitalizations Age gt= 25 1990-2005

Exposure Person-Years

(N=30219001) in Study Area

Stroke Hospitalizations

Total Ischemic Hemorrhagic

No () No () RR (95 CI) No () RR (95 CI) No () RR (95 CI)

Adjusted for Age Sex Race Adjacent Block Group Yes 3269547 (108) 6276 (118) 125 (109-144) 2334 (121) 128 (110-148) 635 (108) 099 (086-113)

No 26949454 (892) 46748 (882) 100 Reference 16989 (879) 100 Reference 5263 (892) 100 Reference

Distance From River lt frac12 mile 2380477 (079) 5280 (099) 116 (104-131) 1925 (100) 120 (105-137) 518 (088) 102 (088-117)

frac12 - 1 mile 2834956 (094) 6023 (114) 118 (106-132) 2227 (115) 127 (112-145) 658 (111) 115 (100-131)

gt1 mile 25003569 (827) 41721 (787) 100 Reference 15171 (785) 100 Reference 4722 (801) 100 Reference

Addl Adjustment for SES and Other Factors Adjacent Block Group Yes 3269547 (108) 6276 (118) 107 (103-112) 2334 (121) 110 (104-116) 635 (108) 097 (088 ndash 106)

No 26949454 (892) 46748 (882) 100 Reference 16989 (879) 100 Reference 5263 (892) 100 Reference Distance From River lt frac12 mile 2380477 (079) 5280 (099) 106 (101-110) 1925 (100) 108 (101-115) 518 (088) 089 (081-099)

frac12 - 1 mile 2834956 (094) 6023 (114) 106 (102-111) 2227 (115) 111 (104-117) 658 (111) 108 (099-119)

gt 1 mile 25003569 (827) 41721 (787) 100 Reference 15171 (785) 100 Reference 4722 (801) 100 Reference

Age gt 25 years 1990-200650 sample Adjusted for age (25-34 35-44 45-54 55-64 65-74 75+ years) race (white black other) and sex (male female) Adjusted for SES factors quartiles of population density (61-5066 5066lt-21020 21020lt-64102 64102lt-943716 persons per square mile) median 1999 household income ($249900-4096800 $4096800lt-5465000 $5465000lt-7294400 $7294400lt-20000100) with less than a college education (0-306 306lt-427 427lt-555 555lt-1000) with Hispanic ethnicity (0-21 21lt-47 47lt- 96 96lt-860) and distance to nearest hospital

41

Table 9 Rate Ratios for CVD Hospitalizations Stratified by Income Quartile

Rate Ratio for lt frac12 mile versus gt 1 mile from river

IHD AMI Angina Stroke Ischemic Stroke Hemorrhagic Stroke

RR (95CI) RR (95CI) RR (95CI) RR (95CI) RR (95CI) RR (95CI)

Adjusted for Age Sex Race Lowest Income 1 123 (108-140) 124 (110-139) 139 (120-160) 120 (109-133) 122 (107-138) 118 (103-137) Income 2 115 (101-131) 118 (101-139) 155 (124-195) 124 (113-135) 127 (109-147) --shyIncome 3 088 (075-103) 083 (068-102) 102 (076-137) 092 (079-107) 093 (076-113) --shyHighest Income 4 080 (064-099) 074 (054-103) 030 (013-070) 078 (065-093) 071 (053-095) 043 (024-076) Weighted average 100 098 091 102 100 071

Addrsquol Adjustment for other demographic factors Lowest Income 1 114 (107-120) 114 (106-123) 112 (097-130) 116 (109-124) 120 (110-131) 109 (095-125) Income 2 110 (102-118) 111 (101-123) 136 (108-172) 110 (101-119) 112 (099-128) 095 (076-119) Income 3 085 (077-094) 086 (075-097) 093 (068-126) 097 (087-108) 096 (082-112) 046 (033-066) Highest Income 4 068 (060-078) 069 (057-085) 028 (012-062) 069 (059-081) 058 (044-077) --shyWeighted average 092 093 079 096 093 078

To reduce the complexity of this Table only one of the distance categories is provided The other distance categories showed similar results Adjusted for age (25-34 35-44 45-54 55-64 65-74 75+ years) race (white black other) and sex (male female) Block group median income quartile ranges $249900-4096800 $4096800lt-5465000 $5465000lt-7294400 $7294400lt-20000100 Adjusted for demographic factors quartiles of population density (61-5066 5066lt-21020 21020lt-64102 64102lt-943716 persons per square mile) with less than a college education (0-306 306lt-427 427lt-555 555lt-1000) with Hispanic ethnicity (0-21 21lt-47 47lt- 96 96lt-860) and distance to nearest hospital

42

Table 10 Hospitalization Rate Estimates for Current Study and from Published Sources (age-adjusted to 2000 US standard per 10000)

Current Study 12 Counties abutting Hudson River 1990-2005

Total US CDC 2005

Northeastern US CDC 2005

SPARCS NYS excluding NYC 2005-2007

IHD 649 619 691

AMI 232 231 278

Angina 31 Stroke 349 303 307 351

Ischemic Stroke 127 39 Hemorrhagic Stroke

Source USDHHS 2005 National Hospital Discharge Survey No 385 July 12 2007 Source NYS DOH SPARCS 2009b Comparable estimates were not able to be provided because published outcome groupings differed

43

Figure 1 ZIP codes by Adjacency to the Hudson River

44

Figure 2 Census Block Groups by Adjacency to the Hudson River

45

Figure 3 Census Block Groups by Population-Weighted Centroid Distance to the Hudson River

46

IHD Hospitalization Rates by Sex Race Income (age-adjusted) (other race

group excluded)

000

200

400

600

800

1000

1200

1400

1600

Low est IncQ

Q2 Q3 Highest Inc Q

Income Quartile

Hos

pita

lizat

ions

per

10

00

w hite men black men

asian men w hite w omen

black w omen asian w omen

AMI Hospitalization Rates by Sex Race Income (age-adjusted) (other race

group excluded)

000

100

200

300

400

500

600

700

Low est IncQ

Q2 Q3 Highest Inc Q Income Quartile

Hos

pita

lizat

ions

per

10

00

w hite men black men

asian men w hite w omen

black w omen asian w omen

Stroke Hospitalization Rates by Sex Race Income (age-adjusted) (other

race group excluded)

000

100

200

300

400

500

600

700

800

900

Low est In Q

Q2 Q3 Highest Inc Q Income quartile

Hos

pita

lizat

ions

per

10

00w hite men black men

asian men w hite w omen

black w omen asian w omen

Figure 4 Figure 5 Figure 6

47

MAURICE D HINCHEY 22NO DISTRICT NEW YORK

COMMITTEE ON APPROPRIATIONS

~lIC()foltIIInas

AGRICUJTlAE RURAL OEElOPWNT rooo AND ORUG ADMINISTRATION

AND RBATEO AGEfjC~S

INTERIOR

~onlJltss of tl)t ~nttn ~tatt5 ~oU5t of ItpttUtlltl1tibeg lHaJington 111laquo 20515-3222

October 4 2006

Howard Frumkin MD DrPH Director Agency for Toxic Substances and

Disease Registty and CDCs National Center for Environmental Health

1600 Clifton Road NE MSE-28 Atlanta GA 30333

Dear Dr Frumkin

First I would like to congratulate you on your appoinbnent as Director of A TSDR and CDCs National Center for Environmental Health llook forward to the opportunity to work with you on important public health issues this year and in the future

My second reason for writing today is to request that ATSDR assist my office in conducting a health consultation study for residents along a 200 mile segment of the Hudson River a Superfund site on the National Priority List containing PCBs in order to measure whether further evaluation may be warranted

As you may know Dr David Carpenter has done a number of studies at SUNY Albany in New York that suggest that resideilcy in proximity to hazardous waste sites with Persistant Organic Pollutants (POPs) may be associated with increased hospitalization rates for Coronary Heart Disease (CHD) when compared to hospitalization rates tor areas with no waste sites or other types of waste not catagorized as POPs I have enclosed a copy of three of these research papers that demonstrate this correlation for heart disease hypertension and stroke

I am very concerned about the health of the residents living along the Hudson River and respectfully request that your agency look into what seems to be a very serious health threat For purposes of my request T would appreciate your agency looking into the subset population Dr Carpenter examined in 78 zip codes along the 200 mile stretch of the Hudson River from Hudson Falls to New York City This subset population studied by Dr Carpenter showed a statistically significant increase in hospital discharges for heart disease hypertension and stroke Interestingly according to CDCs data base the Behavioral Risk Factor Surveillance System this population on average has higher incomes exercises more frequently and conswnes more fruits and vegetables than the average New Yorker Additionally this population has more nonsmokers than the rest of Upstate New Yorks population

WASHINGTON OIFCE

2411 RAYBURN HOUSE OfFICE BUILDING WASH1NcrroN DC 20amp1amp-3222

(Zl2) 225-6335

_fOMetoYht lnchIY

BlNGI-UMTON OfRCE m-wA OfFICE KIGSTCN OFFIepound MIDDLnWJN OffICE MOJmCEllO OffiCE IDDA IfOCML lUllDlNG 123 SOUTH CAYJOA smra U l WAll

m STREET em HALL THlflD flOOR 18 ANWANf LAKf CAD

RIIrlGHAtATON NY 13901 SUITE 201 KINGSTOfl 12401 te JUpoundS STilEET MONTlCfllO NY 12701 (6171773-2768 ITHACA HY 14860 (M5)3~ MlDgtlETOWN NY 109(0 (amptS) 7S1-71e

(607) 273-~388 (845) l4C-3l11

Appendix A Representative Hinchey Letter

Dr Howard Frumkin Page 2 October 4 2006

Since this type of population would typically exhibit better than average cardiac health Im sme you can understand and appreciate my concern and sense of urgency that we must follow up on Dr Carpenters good work and detcnnine what further study is warranted to protect the health of the populatioD living and working aloDg the Hudson River

I than1c you for yom kind attention to my request I look forward to hearing back from you at your earliest convenience

Best regards

Sincerely

~ane~ey Member of Congress

MHlag

49

Appendix B Table 1a IHD Hospitalization Rate Ratios agesexrace adjusted versus additionally adjusted For residents (age 25 and older) of block groups closer versus more distant from the Hudson River in models adjusting for sex age race and block group median household income population density education level percent Hispanic and block group centroid distance to nearest hospital 1990-2005

Adjusted Rate Ratios by Block Group River Proximity (95 Confidence Intervals) IHD (N=101012) AMI (N=35811) Angina (N=4731)

Risk Factor Agesexrace adjusted

Addrsquol adjustment Agesexrace adjusted

Addrsquol adjustment Agesexrace adjusted

Addrsquol adjustment

Block group distance from River

lt frac12 mile vs gt 1 mile 120 (103-140) 102 (098-106) 126 (109-145) 103 (098-109) 160 (137-188) 109 (098-121) frac12 - 1 mile vs gt 1 mile 120 (104-140) 103 (099-107) 125 (109-144) 106 (101-111) 156 (133-183) 113 (102-125)

Sex Male vs female 197 (174-223) 214 (208-220) 211 (187-238) 209 (202-215) 113 (099-129) 113 (106-121) Race Black vs white 120 (102-142) 093 (089-097) 109 (094-127) 092 (087-097) 178 (152-208) 121 (108-136)

AsianPacific Is ldquo 080 (066-096) 073 (069-079) 085 (070-103) 072 (065-081) 075 (055-102) 065 (049-087) Other ldquo 246 (208-291) 377 (363-392) 189 (162-220) 204 (191-218) 257 (215-307) 205 (177-238)

Age 35-44 vs 25-34 809 (630-104) 824 (743-913) 647 (494-849) 658 (568-762) 491 (348-692) 498 (376-660) 45-54 ldquo 333 (261-426) 344 (312-380) 228 (175-296) 230 (200-265) 1539 (1108-214) 157 (120-206) 55-64 ldquo 827 (647-1057) 802 (727-885) 576 (443-748) 516 (449-593) 278 (200-386) 284 (217-371) 65-74 ldquo 1477 (1155-1890) 1384 (1255-1527) 1067 (820-1387) 899 (782-1033) 409 (294-567) 449 (344-586) 75+ ldquo 2124 (1656-2724) 1942 (1760-2142) 2190 (1683-2849) 1849 (1610-2123) 661 (476-919) 695 (533-905)

Population density

2nd lowest vs least dense

126 (121-132) 131 (124-137) 142 (127-159)

more dense ldquo 130 (124-136) 134 (127-141) 131 (116-147) most dense ldquo 120 (114-126) 121 (114-129) 115 (100-131)

Median Household Income

2nd highest vs highest income

105 (101-110) 113 (108-119) 123 (110-137)

lower ldquo 109 (104-113) 123 (117-130) 125 (111-141) lowest ldquo 116 (111-122) 129 (122-137) 162 (142-185)

with less than college

2nd lowest vs least college

114 (109-118) 118 (113-124) 110 (098-122)

higher ldquo 122 (117-127) 126 (119-132) 115(102-129) highest

ldquo

122 (116-128) 128 (120-135) 129 (113-148)

Hispanic 2nd lowest vs least Hispanic

093 (089-097) 094 (0890-098) 101 (091-112)

higher ldquo 092 (088-096) 096 (092-101) 118 (107-131) highest

ldquo

085 (081-088) 088 (084-093) 138 (123-154)

Distance to Hospital

2nd closest vs closest 097 (094-100) 094 (090-098) 107 (098-118) more distant ldquo 090 (087-094) 093 (089-098) 096 (086-107) furthest ldquo 087 (083-091) 091 (087-096) 089 (079-101)

50

Appendix B Table 1b Stroke Hospitalization Rate Ratios agesexrace adjusted versus additionally adjusted For residents (age 25 and older) of block groups closer versus more distant from the Hudson River in models adjusting for sex age race and block group median household income population density education level percent Hispanic and block group centroid distance to nearest hospital 1990-2005

Adjusted Rate Ratios by Block Group River Proximity (95 Confidence Intervals) Total Stroke (N=53208) Ischemic Stroke (N=19371) Hemorrhagic Stroke (N=5995)

Risk Factor Agesexrace adjusted

Addrsquol adjustment Agesexrace adjusted

Addrsquol adjustment Agesexrace adjusted

Addrsquol adjustment

Block group distance from River

lt frac12 mile vs gt 1 mile 116 (104-131) 106 (101-110) 120 (105-137) 108 (101-115) 102 (088-117) 089 (081-099) frac12 - 1 mile vs gt 1 mile 118 (106-132) 106 (102-111) 127 (112-145) 111 (104-117) 115 (100-131) 108 (099-119)

Sex Male vs female 114 (104-125) 127 (123-130) 126 (113-141) 131 (126-136) 114 (103-127) 123 (116-130) Race Black vs white 194 (173-219) 153 (147-160) 233 (204-267) 191 (180-203) 215 (188-246) 190 (173-209)

AsianPacific Isl ldquo 097 (083-113) 088 (080-096) 114 (094-138) 101 (088-116) 142 (114-177) 129 (106-157) Other ldquo 230 (203-260) 232 (219-245) 229 (197-265) 212 (193-232) 360 (310-417) 334 (296-377)

Age 35-44 vs 25-34 309 (251-379) 303 (269-343) 359 (267-483) 362 (289-455) 274 (212-353) 268 (219-329) 45-54 ldquo 114 (938-139) 1087 (970-1217) 149 (113-197) 135 (109-167) 655 (513-836) 605 (500-734) 55-64 ldquo 331 (273-402) 329 (295-368) 433 (329-569) 418 (340-514) 119 (933-152) 109 (904-132) 65-74 ldquo 826 (682-1001 861 (772-959) 1145 (872-1504) 1114 (908-1367) 214 (168-272) 224 (186-270) 75+ ldquo 1890 (1559-2290) 2037 (1828-2269) 2768 (2112-3627) 3060 (2497-3750) 554 (438-700) 587 (490-703)

Population density

2nd lowest vs least dense

126 (120-132) 124 (117-132) 129 (117-142)

more dense ldquo 126 (120-132) 121 (113-129) 142 (128-157) most dense ldquo 124 (118-132) 128 (119-138) 150 (133-169)

Median Household Income

2nd highest vs highest income

106 (101-110) 110 (103-117) 099 (090-108)

lower ldquo 113 (108-118) 113 (106-121) 107 (097-118) lowest ldquo 116 (110-122) 117 (108-126) 114 (102-127)

with less than college

2nd lowest vs least college

111 (107-116) 112 (106-119) 098 (090-107)

higher ldquo 120 (114-125) 120 (113-128) 102 (093-112) highest ldquo 116 (110-122) 119 (110-128) 093 (090-108)

Hispanic 2nd lowest vs least Hispanic

096 (092-100) 094 (088-099) 084 (077-091)

higher ldquo 101 (096-105) 096 (091-102) 084 (077-091) highest ldquo 096 (092-100) 088 (082-093) 078 (070-085)

Distance to Hospital

2nd closest vs closest 095 (091-098) 092 (087-097) 097 (089-105) more distant ldquo 092 (088-096) 091 (086-097) 089 (081-097) furthest ldquo 085 (081-090) 082 (077-088) 088 (079-098)

51

Appendix B Table 2 Sex and Race-Specific Age-Adjusted IHD AMI and Stroke Rates by Income (per 1000)

Men Women white black asian other white black asian other

IHD Lowest IncQ 470 401 492 940 283 354 477 543 Q2 425 339 675 1443 221 236 364 690 Q3 436 301 420 2463 199 233 236 902 Highest Inc Q 399 325 218 4307 160 172 120 1247 AMI Lowest IncQ 180 154 203 269 101 112 221 159 Q2 159 141 300 306 085 088 138 179 Q3 152 114 139 480 078 084 096 214 Highest Inc Q 135 130 046 820 062 065 052 275 Stroke Lowest IncQ 201 292 330 370 166 281 343 261 Q2 182 254 389 466 150 247 274 373 Q3 192 238 231 598 145 245 201 401 Highest Inc Q 177 275 102 905 132 197 070 776

These rates are estimated from the 50 sample of hospitalizations in the 12-county study area included in this investigation The rates are adjusted to the 2000 standard US population

52

CERTIFICATION

The health consultation for the Hudson River PCBs site Residential Proximity to the Hudson River and Hospitalization Rates for Ischemic Heart Disease and Stroke 1990shy2005 was prepared by the New York State Department of Health under a cooperative agreement with the Agency for Toxic Substances and Disease Registry (ATSDR) It is in accordance with approved methodology and procedures existing at the time the health consultation was initiated Editorial review was completed by the cooperative agreement partner

The Division of Health Assessment and Consultation (DHAC) ATSDR has reviewed this public health assessment and concurs with its findings

ATSDR

Page 16: HR CVD Final HCl 2011
Page 17: HR CVD Final HCl 2011
Page 18: HR CVD Final HCl 2011
Page 19: HR CVD Final HCl 2011
Page 20: HR CVD Final HCl 2011
Page 21: HR CVD Final HCl 2011
Page 22: HR CVD Final HCl 2011
Page 23: HR CVD Final HCl 2011
Page 24: HR CVD Final HCl 2011
Page 25: HR CVD Final HCl 2011
Page 26: HR CVD Final HCl 2011
Page 27: HR CVD Final HCl 2011
Page 28: HR CVD Final HCl 2011
Page 29: HR CVD Final HCl 2011
Page 30: HR CVD Final HCl 2011
Page 31: HR CVD Final HCl 2011
Page 32: HR CVD Final HCl 2011
Page 33: HR CVD Final HCl 2011
Page 34: HR CVD Final HCl 2011
Page 35: HR CVD Final HCl 2011
Page 36: HR CVD Final HCl 2011
Page 37: HR CVD Final HCl 2011
Page 38: HR CVD Final HCl 2011
Page 39: HR CVD Final HCl 2011
Page 40: HR CVD Final HCl 2011
Page 41: HR CVD Final HCl 2011
Page 42: HR CVD Final HCl 2011
Page 43: HR CVD Final HCl 2011
Page 44: HR CVD Final HCl 2011
Page 45: HR CVD Final HCl 2011
Page 46: HR CVD Final HCl 2011
Page 47: HR CVD Final HCl 2011
Page 48: HR CVD Final HCl 2011
Page 49: HR CVD Final HCl 2011
Page 50: HR CVD Final HCl 2011
Page 51: HR CVD Final HCl 2011
Page 52: HR CVD Final HCl 2011
Page 53: HR CVD Final HCl 2011
Page 54: HR CVD Final HCl 2011
Page 55: HR CVD Final HCl 2011
Page 56: HR CVD Final HCl 2011
Page 57: HR CVD Final HCl 2011
Page 58: HR CVD Final HCl 2011
Page 59: HR CVD Final HCl 2011
Page 60: HR CVD Final HCl 2011
Page 61: HR CVD Final HCl 2011
Page 62: HR CVD Final HCl 2011
Page 63: HR CVD Final HCl 2011