Abstract

Background

Current and future pandemics will require informatics solutions to assess the risks, resources and policies to guide better public health decision-making.

Methods

We conducted a cross-sectional study of all reported COVID-19 cases and deaths in the USA (as of 24 April 2020) on a population- and resource-adjusted basis, applying biomedical informatics and data visualization tools to several public and federal government datasets and analyzing the impact of statewide stay-at-home orders.

Results

There were 2753.2 cases and 158.0 deaths per million residents, respectively, in the USA with variable distributions throughout divisions, regions and states. Forty-two states and Washington, DC, (84.3%) had statewide stay-at-home orders, with the remaining states having population-adjusted characteristics in the highest risk quartile.

Conclusions

Effective national preparedness requires clearly understanding states’ ability to predict, manage and balance public health needs through all stages of a pandemic. This will require leveraging data quickly, correctly and responsibly into sound public health policies.

Background

The coronavirus disease 2019 (COVID-19) pandemic caused by severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) presents complex challenges to health professionals, researchers and policymakers.1 There has been a global effort to make relevant technologies, resources and information available that would accelerate data-driven solutions for all aspects of this pandemic.2,3 However, the current focus on raw counts of cases and deaths in the USA, while necessary, is not sufficient to fully assess the risks, resources and policies to guide better public health decision-making, now and in the future.1,4

Our study analyzes important pandemic characteristics in the USA on a population- and resource-adjusted basis using several publicly available datasets and visualization tools in order to provide deeper insight into critical issues for the current pandemic.

Methods

Study population, data collection and definitions

This was a cross-sectional study of daily report and time series data for the USA as of the study date (24 April 2020). The data were downloaded in comma-separated values (CSV) format from the GitHub COVID-19 data repository hosted by the Center for Systems Science and Engineering at Johns Hopkins University.5 Specific fields from the daily report file used for analysis include the name of the state (Province_State), country (Country_Region), total number of COVID-19 cases (Confirmed) and total number of COVID-19 deaths (Deaths). Similarly, specific fields from the time series file include the name of the state (Province_State) and country (Country_Region), along with a single field for each date from 22 January 2020 up to the study date (24 April 2020) for confirmed COVID-19 cases and deaths.
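As an illustration of the field selection described above, the following minimal sketch reads a daily report in CSV form and keeps only US rows. The column names are those listed in the text; the sample rows, the state names and the helper `load_state_totals` are hypothetical and are not taken from the study pipeline.

```python
import csv
import io

# Hypothetical excerpt in the daily report layout. The counts are made up
# for illustration and are not values from the study.
SAMPLE_DAILY_REPORT = """Province_State,Country_Region,Confirmed,Deaths
Alpha,US,1200,30
Beta,US,450,5
Gamma,Elsewhere,999,9
"""


def load_state_totals(csv_text, country="US"):
    """Return {state: (confirmed, deaths)} for rows matching `country`."""
    totals = {}
    for row in csv.DictReader(io.StringIO(csv_text)):
        if row["Country_Region"] != country:
            continue  # skip rows for other countries
        totals[row["Province_State"]] = (int(row["Confirmed"]), int(row["Deaths"]))
    return totals


state_totals = load_state_totals(SAMPLE_DAILY_REPORT)
```

In practice the same function could be applied to the text of a downloaded daily report file, provided that its header row uses these field names.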

Population-adjusted characteristics were calculated by dividing US state-level totals for COVID-19 (i) cases and (ii) deaths, respectively, by 2019 state population estimates from the US Census Bureau (https://data.census.gov/).6 Resource-adjusted characteristics were calculated by dividing state-level cases by (i) estimated state-level physician totals from the Agency for Healthcare Research and Quality 2018 Compendium of US Health Systems (https://www.ahrq.gov/chsp), and (ii) published state-level estimates for mechanical ventilators as described in the Society of Critical Care Medicine report on US ICU Resource Availability for COVID-19.7–9
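The adjustments described above are simple ratios. A minimal sketch follows, assuming a hypothetical state whose counts, population and resource totals are invented for illustration; the helper names `per_million` and `per_hundred` are likewise assumptions rather than names from the study code.

```python
def per_million(count, population):
    """Population-adjusted rate: events per million residents."""
    return count * 1_000_000 / population


def per_hundred(count, resource_total):
    """Resource-adjusted rate: cases per hundred units of a resource."""
    return count * 100 / resource_total


# Hypothetical state-level inputs (not values from the study).
state = {
    "cases": 12_000,
    "deaths": 300,
    "population": 4_000_000,
    "physicians": 8_000,
    "ventilators": 1_500,
}

adjusted = {
    "cases_per_million": per_million(state["cases"], state["population"]),
    "deaths_per_million": per_million(state["deaths"], state["population"]),
    "cases_per_hundred_physicians": per_hundred(state["cases"], state["physicians"]),
    "cases_per_hundred_ventilators": per_hundred(state["cases"], state["ventilators"]),
}
```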

Each state in the USA is responsible for setting its own policies regarding pandemic risk mitigation. Using information from available publication and news sources, we identified states with and without stay-at-home or similar nonpharmaceutical intervention (NPI) orders that were implemented statewide as of the study date.10–12 For each state with a statewide stay-at-home order, we calculated the number of days between the effective date of the order and (i) the date of a state’s first reported case, (ii) the date of a state’s first reported death and (iii) the study date.
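The three intervals can be sketched as follows. The cumulative series, the order date and the `m/d/yy` format of the date keys are assumptions made for illustration; only the study date comes from the text.

```python
from datetime import date, datetime

STUDY_DATE = date(2020, 4, 24)


def first_reported(series):
    """Return the earliest date with a nonzero cumulative count, or None."""
    dated = sorted(
        (datetime.strptime(key, "%m/%d/%y").date(), count)
        for key, count in series.items()
    )
    for day, count in dated:
        if count > 0:
            return day
    return None


# Hypothetical cumulative counts for one state, keyed by date column.
cases = {"3/1/20": 0, "3/2/20": 1, "3/3/20": 4, "3/10/20": 40}
deaths = {"3/1/20": 0, "3/2/20": 0, "3/3/20": 0, "3/10/20": 2}
order_effective = date(2020, 3, 24)  # hypothetical effective date

days_after_first_case = (order_effective - first_reported(cases)).days
days_after_first_death = (order_effective - first_reported(deaths)).days
days_in_effect = (STUDY_DATE - order_effective).days
```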

Statistical analysis and data visualization

Summary statistics were reported as medians with interquartile range (IQR). Each state was assigned to a quartile for each COVID-19 case and death characteristic, and the quartiles were visualized as choropleth maps generated in Plotly (Version 4.5.2). All data were integrated using an informatics pipeline built via a Jupyter notebook running Python (Release 3.7.6), and analyses were performed using Microsoft Excel Version 16.16.19 (Redmond, WA).
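A minimal sketch of the summary statistics and quartile assignment is given below. It uses the Python standard library (`statistics.quantiles`, which requires Python 3.8 or later) rather than the spreadsheet software used in the study. The "inclusive" quartile method and the rule that a value equal to a cut point falls in the lower quartile are assumptions, because the text does not state which conventions were applied. The rates are hypothetical.

```python
import statistics
from bisect import bisect_left


def median_iqr(values):
    """Return (median, first quartile, third quartile)."""
    q1, q2, q3 = statistics.quantiles(values, n=4, method="inclusive")
    return q2, q1, q3


def quartile_rank(value, cut_points):
    """Return 1 (lowest) to 4 (highest) given the three quartile cut points.

    A value equal to a cut point is placed in the lower quartile.
    """
    return bisect_left(cut_points, value) + 1


# Hypothetical cases per million residents for five states.
rates = {"A": 100.0, "B": 200.0, "C": 300.0, "D": 400.0, "E": 500.0}

median, q1, q3 = median_iqr(list(rates.values()))
ranks = {name: quartile_rank(rate, [q1, median, q3]) for name, rate in rates.items()}
```

The resulting `ranks` mapping is the kind of per-state value that a choropleth map would color by.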

Results

As of 24 April 2020, there were 903 696 reported COVID-19 cases and 51 859 deaths in the USA.5 Broken down by US Census region, there were 133 094 cases and 7367 deaths in the Midwest, 499 110 cases and 33 510 deaths in the Northeast, 182 254 cases and 7196 deaths in the South and 89 238 cases and 3786 deaths in the West. Overall, there were 2753.2 cases and 158.0 deaths per million residents, respectively. Population- and resource-adjusted characteristics by US Census region and division are shown in Table 1. Trends in cases and deaths adjusted for state populations and resource estimates are visualized in Figure 1.
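The regional counts above can be checked against the national totals and rates with a few lines of arithmetic. The counts below are those reported in the text. The population denominator is an assumption: it is the US Census Bureau 2019 national estimate for the 50 states and Washington, DC, which the text does not state explicitly.

```python
# (cases, deaths) by US Census region, as reported in the text.
REGION_COUNTS = {
    "Midwest": (133_094, 7_367),
    "Northeast": (499_110, 33_510),
    "South": (182_254, 7_196),
    "West": (89_238, 3_786),
}

# Assumed denominator: 2019 Census estimate for the 50 states plus DC.
US_POPULATION_2019 = 328_239_523

total_cases = sum(cases for cases, _ in REGION_COUNTS.values())
total_deaths = sum(deaths for _, deaths in REGION_COUNTS.values())

cases_per_million = round(total_cases * 1_000_000 / US_POPULATION_2019, 1)
deaths_per_million = round(total_deaths * 1_000_000 / US_POPULATION_2019, 1)
```

Under this assumed denominator, the regional counts sum to the reported national totals and reproduce the reported rates of 2753.2 cases and 158.0 deaths per million residents.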

Table 1

Population- and resource-adjusted characteristics of reported COVID-19 cases and deaths by US region and division, as of 24 April 2020

| US region and division | Number of cases per million residents | Number of deaths per million residents | Number of cases per hundred physicians | Number of cases per hundred ventilators |
| --- | --- | --- | --- | --- |
| Northeast | | | | |
| New England | 5733.6 | 314.5 | 215.3 | 3036.6 |
| Middle Atlantic | 10063.6 | 701.1 | 449.3 | 4596.9 |
| Total | 8915.4 | 598.6 | 379.1 | 4226.5 |
| Midwest | | | | |
| East North Central | 2356.1 | 140.2 | 124.2 | 1198.1 |
| West North Central | 1054.3 | 37.0 | 39.4 | 551.1 |
| Total | 1947.8 | 107.8 | 91.0 | 999.1 |
| South | | | | |
| South Atlantic | 1559.2 | 59.1 | 113.0 | 812.3 |
| East South Central | 1249.8 | 41.0 | 57.5 | 576.8 |
| West South Central | 1371.6 | 62.0 | 127.1 | 705.1 |
| Total | 1451.3 | 57.3 | 103.3 | 738.4 |
| West | | | | |
| Mountain | 1279.0 | 53.7 | 123.6 | 739.1 |
| Pacific | 1074.0 | 45.8 | 51.9 | 694.4 |
| Total | 1139.0 | 48.3 | 65.4 | 709.7 |

Fig. 1

COVID-19 cases (a) and deaths (b) per million residents, and cases per hundred doctors (c) and ventilators (d) by state and quartile, as of 24 April 2020.

Forty-two states and Washington, DC (43 of 51 jurisdictions, 84.3%), had statewide stay-at-home orders for all residents, implemented at a median (IQR) of 22.0 (15.5–27.0) days after first reported case and 8.0 (4.0–14.5) days after first death. These orders had been in place for a median of 28.0 (23.5–31.0) days as of the study date. At the time that their respective orders became effective, states had a median of 168.2 (88.4–310.4) cases per million residents and 2.5 (0.8–6.8) deaths per million residents. At these thresholds, all states without statewide orders (Arkansas, Iowa, Nebraska, North Dakota, Oklahoma, South Dakota, Utah, Wyoming) would be ranked in the highest quartile for cases per million residents, as well as in the highest quartile for deaths per million residents.
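One reading of this comparison is sketched below: a state without an order is placed in the highest quartile if its current rates exceed the upper quartile bounds observed when other states' orders took effect. The list of states and the bounds come from the text. The helper `exceeds_order_time_q3` and the example rates are hypothetical, and the interpretation itself is an assumption rather than a description of the study code.

```python
STATES_WITHOUT_ORDERS = [
    "Arkansas", "Iowa", "Nebraska", "North Dakota",
    "Oklahoma", "South Dakota", "Utah", "Wyoming",
]
JURISDICTIONS = 51  # 50 states plus Washington, DC

with_orders = JURISDICTIONS - len(STATES_WITHOUT_ORDERS)
share_with_orders = round(100 * with_orders / JURISDICTIONS, 1)

# Upper quartile bounds at the time orders became effective (from the text).
Q3_CASES_PER_MILLION = 310.4
Q3_DEATHS_PER_MILLION = 6.8


def exceeds_order_time_q3(cases_per_million, deaths_per_million):
    """Return (cases flag, deaths flag) against the order-time upper quartile."""
    return (
        cases_per_million > Q3_CASES_PER_MILLION,
        deaths_per_million > Q3_DEATHS_PER_MILLION,
    )


# Hypothetical current rates for a state without a statewide order.
flags = exceeds_order_time_q3(cases_per_million=900.0, deaths_per_million=20.0)
```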

Discussion

Main finding of this study

Most statewide stay-at-home orders were implemented roughly 2–4 weeks after a state’s outbreak was first detected, and had been in place for only a few weeks when this study was performed. The 1918 influenza pandemic triggered NPI orders in many US cities lasting several months, produced two or more waves of infection, and killed more than half a million people in the USA and tens of millions worldwide.10,13,14 Yet even for that pandemic, US cities with NPI orders that were (i) started soon after an outbreak was detected, (ii) longer in duration, and (iii) broader in scope had better overall outcomes than cities without those characteristics.10,13 It remains concerning that most states currently without stay-at-home orders have population-adjusted case and death metrics that are just as grave as the rest of the country. We strongly recommend that all states implement or continue to implement responsible mitigation strategies focused on protecting their most vulnerable populations.

What is already known on this topic

The US Food and Drug Administration (FDA) is responsible for regulating medical devices and laboratory developed tests (LDTs) essential to address the current pandemic, including products like mechanical ventilators and diagnostic tests for COVID-19.15 The traditional regulatory approval process requires that companies provide evidence of a product’s safety, effectiveness and performance; however, the FDA recently issued several Emergency Use Authorizations which lower regulatory standards in order to quickly address urgent supply shortages.15,16 As the pandemic continues to intensify, it will be critical to improve the quality of all medical devices (including LDTs) reaching patients. COVID-19 diagnostic tests with low specificity (and high false positive rates) could lead to unnecessary quarantines, mental stress and wasted hospital resources while low sensitivity (high false negative) tests could lead to multiple waves of uncontrolled community transmission.17 Until there is a robust supply of accurate, validated diagnostic tests to gauge community spread, the high-risk (upper quartile) states in our figure highlight where greater healthcare resource capacity is needed to prevent overwhelmed hospital systems from increasing patient mortality and putting healthcare employees at risk of infection.1,7,18,19
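The relationship between sensitivity, specificity and the two error rates mentioned above can be made concrete with a small worked example. The confusion-matrix counts are hypothetical and do not describe any authorized COVID-19 test; the helper name `diagnostic_metrics` is also an assumption.

```python
def diagnostic_metrics(true_pos, false_pos, true_neg, false_neg):
    """Compute standard accuracy measures from confusion-matrix counts."""
    infected = true_pos + false_neg      # everyone who truly has the disease
    uninfected = true_neg + false_pos    # everyone who truly does not
    return {
        "sensitivity": true_pos / infected,
        "specificity": true_neg / uninfected,
        "false_negative_rate": false_neg / infected,
        "false_positive_rate": false_pos / uninfected,
    }


# Hypothetical evaluation: 100 infected and 1000 uninfected people tested.
metrics = diagnostic_metrics(true_pos=90, false_pos=50, true_neg=950, false_neg=10)
```

In this example the test misses 10% of infected people (false negatives) and flags 5% of uninfected people (false positives), which corresponds to the two failure modes discussed in the paragraph above.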

What this study adds

Effective national preparedness requires clearly understanding states’ ability to predict, manage and balance public health needs through all stages of a pandemic.1,10,20 The rapid spread of SARS-CoV-2 has exposed the limited availability of key resources, from personal protective equipment to mechanical ventilators to the diverse healthcare providers at the front lines of clinical care.1 Looking beyond raw case and death counts by adjusting for publicly accessible data on populations and resource estimates can help clarify risks and inform public health policy.4,21 Choropleth maps, for example, can help policymakers visually gauge which states are at risk for negative outcomes, where public health strategies should shift from containment to risk mitigation, and for how long policies should remain in place.1,10,18

Limitations of this study

Our study had several limitations. First, the numbers of COVID-19 cases and deaths are likely underestimated, given the current shortage of adequate diagnostic tests across the USA. Second, provider and ventilator estimates are not updated as frequently as the pandemic counts, and thus only provide general guidance on resource availability. Third, our study focuses primarily on state-level data but future research should leverage finer levels of granularity, including data about counties, population density, race, age and social determinants of health. Finally, our work integrates and harmonizes pandemic reports with fragmented data from various federal agencies and published outlets. However, effective public health solutions for COVID-19 and future pandemics will require access to interoperable public health data at all levels.22

Conclusion

The COVID-19 pandemic is neither the first nor the last major health challenge that the world will face, but lasting success will depend on synthesizing data and information quickly, correctly and responsibly into sound national and international public health policies.

Acknowledgements

None.

Authors’ contributions

All authors included in the manuscript provided substantial contribution to (i) conception and design, acquisition of data, or analysis and interpretation of data, (ii) drafting the article or revising it critically for important intellectual content and (iii) final approval of the completed manuscript. JGR had full access to all of the data in the study and takes responsibility for the integrity of the data and the accuracy of the data analysis. The funders had no role in the design and conduct of the study; collection, management, analysis and interpretation of the data; preparation, review or approval of the manuscript; or decision to submit the manuscript for publication.

Funding

None.

Conflict of interest

Dr. Ronquillo reports working for Syapse, which had no role in the study, has received cloud research grants from Google and Microsoft during previous work as a medical school faculty member and has received cloud research funding from the Google Cloud for Startups Program. The authors have no other competing interests to declare.

Jay G. Ronquillo, MD, MPH, MMSc, MEng

William T. Lester, MD, MS

Diana M. Zuckerman, PhD

References

1. Adalja AA, Toner E, Inglesby TV. Priorities for the US health community responding to COVID-19. JAMA 2020.

2. Kamel Boulos MN, Geraghty EM. Geographical tracking and mapping of coronavirus disease COVID-19/SARS-CoV-2 epidemic and associated events around the world: how 21st century GIS technologies are supporting the global fight against outbreaks and epidemics. Int J Health Geogr 2020;19(8).

3. Keesara S, Jonas A, Schulman K. Covid-19 and health care’s digital revolution. N Engl J Med 2020;382(23):e82.

4. Lipsitch M, Swerdlow DL, Finelli L. Defining the epidemiology of Covid-19—studies needed. N Engl J Med 2020.

5. Dong E, Du H, Gardner L. An interactive web-based dashboard to track COVID-19 in real time. Lancet Infect Dis 2020.

6. U.S. Census Bureau. Population, population change, and estimated components of population change: April 1, 2010 to July 1, 2019 (NST-EST2019-alldata). 2019. https://www.census.gov/data/tables/time-series/demo/popest/2010s-state-total.html (26 March 2020, date last accessed).

7. Rubinson L, Vaughn F, Nelson S et al. Mechanical ventilators in US acute care hospitals. Disaster Med Public Health Prep 2010;4(3):199–206.

8. Halpern NA, Tan KS, SCCM ventilator taskforce. U.S. ICU resource availability for COVID-19. 2020. https://www.sccm.org/Blog/March-2020/United-States-Resource-Availability-for-COVID-19 (20 April 2020, date last accessed).

9. Agency for Healthcare Research and Quality. Compendium of U.S. health systems, 2018. https://www.ahrq.gov/chsp/data-resources/compendium-2018.html (26 March 2020, date last accessed).

10. Hatchett RJ, Mecher CE, Lipsitch M. Public health interventions and epidemic intensity during the 1918 influenza pandemic. Proc Natl Acad Sci USA 2007;104(18):7582–7.

11. Mervosh S, Lu D. See Which States and Cities Have Told Residents to Stay at Home. The New York Times, 2020. https://www.nytimes.com/interactive/2020/us/coronavirus-stay-at-home-order.html (20 April 2020, date last accessed).

12. Haffajee RL, Mello MM. Thinking globally, acting locally—the U.S. response to Covid-19. N Engl J Med 2020;382(22):e75.

13. Markel H, Lipman HB, Navarro JA et al. Nonpharmaceutical interventions implemented by US cities during the 1918-1919 influenza pandemic. JAMA 2007;298(6):644–54.

14. Mills CE, Robins JM, Lipsitch M. Transmissibility of 1918 pandemic influenza. Nature 2004;432(7019):904–6.

15. Ronquillo JG, Zuckerman DM. Software-related recalls of health information technology and other medical devices: implications for FDA regulation of digital health. Milbank Q 2017;95(3):535–53.

16. U.S. Food and Drug Administration. Policy for diagnostic tests for coronavirus Disease-2019 during the public health emergency: immediately in effect guidance for clinical laboratories, commercial manufacturers, and Food and Drug Administration staff. 2020. https://www.fda.gov/regulatory-information/search-fda-guidance-documents/policy-diagnostic-tests-coronavirus-disease-2019-during-public-health-emergency (18 April 2020, date last accessed).

17. U.S. Food and Drug Administration. Guidance for industry and FDA staff: statistical guidance on reporting results from studies evaluating diagnostic tests. 2007. https://www.fda.gov/regulatory-information/search-fda-guidance-documents/statistical-guidance-reporting-results-studies-evaluating-diagnostic-tests-guidance-industry-and-fda (18 April 2020, date last accessed).

18. Fauci AS, Lane HC, Redfield RR. Covid-19—navigating the uncharted. N Engl J Med 2020;382(13):1268–9.

19. Ji Y, Ma Z, Peppelenbosch MP, Pan Q. Potential association between COVID-19 mortality and health-care resource availability. Lancet Glob Health 2020;8(4):e480.

20. Gostin LO, Wiley LF. Governmental public health powers during the COVID-19 pandemic: stay-at-home orders, business closures, and travel restrictions. JAMA 2020.

21. Ronquillo JG, Winterholler JE, Cwikla K et al. Health IT, hacking, and cybersecurity: national trends in data breaches of protected health information. JAMIA Open 2018;1(1):15–9.

22. Dixon B, Kharrazi H, Lehmann H. Public health and epidemiology informatics: recent research and trends in the United States. Yearb Med Inform 2015;10:199–206.

This article is published and distributed under the terms of the Oxford University Press, Standard Journals Publication Model (https://academic.oup.com/journals/pages/open_access/funder_policies/chorus/standard_publication_model)