Understanding the Data

Methodology

How we identify communities that thrive beyond what statistics predict.

Dataset Statistics

83,117 Census Tracts
50 States + DC
330M+ People
2020 Census Boundaries

Score Reference

Thriving (≥+2.0) Exceptional resilience despite challenges
Strong (+1.0 to +2.0) Notable community strength
Steady (0 to +1.0) Meeting expectations
Challenged (-1.0 to 0) Facing headwinds
Struggling (<-1.0) Needs support

Score Distribution

-4-20+2+4
Struggling Average Thriving

Methodology

We're looking for neighborhoods that are healthier than they "should" be based on their income, education, and access to resources. What makes them special?

Which communities demonstrate better health outcomes than predicted by their material circumstances?

Traditional metrics like income, education, and access to healthcare explain much of the variation in community health. But some communities do significantly better than these factors would predict. We call this "community resilience" — the capacity for a community to thrive despite challenging conditions.

This project identifies these communities by looking at the gap between predicted and actual health outcomes, potentially revealing communities with strong social ties, cultural assets, or other protective factors that aren't captured by standard demographic measures.

We use publicly available government data on health conditions, food access, and demographics. All data is standardized to 2020 Census tract boundaries for consistency.
  • CDC PLACES 2024 — Census tract-level health outcomes (BRFSS 2022) including obesity, diabetes, hypertension, heart disease, and physical inactivity. Kentucky and Pennsylvania use PLACES 2023 data (BRFSS 2021) as a fallback.
  • USDA Food Access Research Atlas 2019 — Low-income, low-access (LILA) classification identifying food deserts (crosswalked to 2020 boundaries)
  • Census Bureau 2020 — Population data and tract boundary relationship files for geographic crosswalking
We build a model that predicts how healthy a neighborhood "should" be based on income, education, and similar factors. Then we compare prediction to reality. Neighborhoods that are healthier than predicted get a positive score—the bigger the number, the more the community is beating the odds.
Data CDC, USDA, Census
Model Predict health from factors
Score Actual − Predicted
Better than expected = Positive score

Step 1: Measure Health Burden

We construct a composite health burden index (0-1 scale) from CDC PLACES data, combining:

  • Chronic disease prevalence (diabetes, obesity, heart disease)
  • Mental health indicators (depression, poor mental health days)
  • Preventive care gaps (lack of health insurance, missed checkups)
  • Behavioral risk factors (smoking, physical inactivity)

Step 2: Predict Expected Health

We use OLS regression with state fixed effects to predict health burden based on socioeconomic and access factors:

Health Burden ~ Poverty Rate + Education + Insurance
              + Food Access + Housing Cost Burden
              + State Fixed Effects

Step 3: Calculate Resilience Score

The resilience score is the standardized residual from this model (z-score). A positive score means the community has better health outcomes than predicted; a negative score means worse outcomes than predicted.

Example: A tract with score +2.0 means the community is 2 standard deviations healthier than predicted — roughly in the top 2% of "positive outliers."

Note on Group Quarters

Tracts where more than 10% of the population lives in group quarters (college dorms, military barracks, correctional facilities, nursing homes) are flagged but not excluded. These populations may have health outcomes driven by institutional factors rather than community characteristics.

This data has real limitations. It's an estimate, not a definitive answer. Use it as a starting point for asking questions, not as proof of anything.
  • Kentucky & Pennsylvania Data Vintage — These states use CDC PLACES 2023 data (BRFSS 2021) instead of 2024 data, as they couldn't collect enough BRFSS data in 2023. Their health outcomes are approximately one year older than other states.
  • Temporal Gap — Health data (2024/BRFSS 2022), food access data (2019), and Census data (2020) come from different time periods.
  • Model-Based Estimates — CDC PLACES uses small area estimation, not direct measurement. Some tract-level estimates have high uncertainty.
  • Ecological Fallacy — Tract-level patterns do not describe individual outcomes. A "resilient" tract may still contain individuals with poor health.
  • Unmeasured Confounders — The model cannot capture all factors affecting health. Positive residuals may reflect unmeasured socioeconomic advantages rather than community resilience.
  • Static Snapshot — This is a cross-sectional analysis and cannot show trends or causation.
Use this data to ask interesting questions and find communities worth learning more about. Don't use it to make decisions that affect people's lives without doing more research first.

Appropriate

  • Exploratory research into community health patterns
  • Identifying communities for qualitative follow-up research
  • Asset-based community development planning
  • Understanding geographic variation in health outcomes
  • Educational purposes and data literacy

Inappropriate

  • Justifying disinvestment from "resilient" communities
  • Making predictions about individual health
  • Punitive comparisons between communities
  • Replacing qualitative community input
  • Definitive claims about community characteristics
Access the data programmatically through these endpoints, or download the full dataset as CSV.
GET /api/stats Aggregate statistics for the dataset
GET /api/tracts?limit=100&offset=0 Paginated list of tracts with resilience scores
GET /api/tracts?format=csv Full dataset as CSV download
GET /api/tracts/:fips Single tract by 11-digit FIPS code
GET /api/geocode?address=... Geocode address and return tract with resilience data
If you use this data, please cite it.
@misc{resilience-mapping,
  title = {Community Resilience Mapping},
  year = {2025},
  url = {https://resilience-mapping.fly.dev},
  note = {Census tract-level analysis of community
         health resilience using CDC PLACES 2024
         and USDA Food Access Research Atlas data,
         standardized to 2020 Census boundaries}
}