Model Input and Output#
Input#
The Bucky model uses two main sources of input: the input graph and CDC-recommended parameters.
Input Graph#
The input graph contains data regarding demographics, geographic information, and historical case information. For details, see Graph Information.
CDC-Recommended Parameters#
The Centers for Disease Control and Prevention (CDC) has published pandemic planning scenarios [fDCP+20] which contain recommended parameters describing biological and epidemiological parameters. Of these five planning scenarios, Bucky utilizes scenario 5, which contains the CDC’s current best estimates for disease severity and transmission. These parameters are described in detail, based on information available from the CDC, and summarized in the table below. CDC-recommended parameters are controlled by parameter files located in the par
directory.
Parameter Description |
Bucky Variable Name |
Value (Interquartile Range) |
---|---|---|
Mean generation interval |
T_g |
7.5 (5.5, 8.5) |
Mean serial interval |
T_s |
6 (5, 7) |
Fraction of infections that are asymptomatic |
asym_frac |
0.4 |
Relative infectiousness of asymptomatic individuals |
rel_inf_asym |
0.75 |
Percentage of transmission prior to symptom onset |
frac_trans_before_sym |
0.5 |
Case fatality ratio |
F |
|
Case hospitalization ratio |
H |
|
TIme from symptom onset to hospitalization |
I_TO_H_TIME |
|
Duration of hospitalization |
H_TIME |
|
Time between death and reporting |
D_REPORT_TIME |
|
Disease Transmission#
The following parameters describe the transmissibility of the virus. The percentage of infections that are asymptomatic, asym_frac
, refers to the percentage of infections that will never develop symptoms. This is a difficult parameter to estimate due to logistical complications (individuals would need to be tested to ensure they remain asymptomatic while infectious) and because the level of asymptomatic infections varies by age. The best estimate for this parameter is the midpoint between the lower bound of [BCB+20], the upper bound of [PTC+20], which corresponds to the estimates from [OT20].
The relative infectiousness of asymptomatic individuals compared to symptomatic individuals rel_inf_asym
is calculated using upper and lower bounds on the difference in viral dynamics between asymptomatic and symptomatic cases. The lower bound is derived from data indicating that more severe cases have higher viral loads [LYW+20] and a study that indicates symptomatic cases shed for longer and have higher viral loads than asymptomatic cases [NYS+20]. Other studies indicate that both symptomatic and asymptomatic cases have similar duration and viral shedding [LKL+20], which is used as the upper bound.
The final parameter relating to disease transmission is the fraction of transmission prior to symptom onset frac_trans_before_sym
which corresponds to the percentage of new cases that were caused by transmission from an individual before they become symptomatic. The lower bound is derived from [HLW+20], with the upper bound derived from [CGM+20].
Disease Characteristics and Severity#
The mean serial interval, Ts
, is the time in days from exposure to onset of symptoms and is taken from [MCH+20]. The mean generation interval, Tg
, is the period of time (in days) between symptom onset for one individual and symptom onset for a person they have infected. This value is from [HLW+20].
The case fatality ratio (CFR) is the number of individuals who will die of the disease; the case hospitalization-severity ratio (CHR) corresponds to the number of cases that are severe and necessitate hospitalization. Within the context of the United States, this ratio corresponds to the individuals admitted to a hospital. In a context where access to medical care is limited, this ratio corresponds to the ratio of individuals who exhibit severe disease symptoms.
Hospital-related parameters are derived using data from COVID-Net [CDC] and the CDC’s Data Collation and Integration for Public Health Event Response (DCIPHER). All data is taken from the period between March 1, 2020 to July 15, 2020 unless otherwise noted. The time it takes from symptom onset to hospitalization in days is denoted by I_to_H_time
. The number of days an individual will be hospitalized is H_TIME
. Finally, the number of days between death and reporting is D_REPORT_TIME
.
Output#
The Bucky model generates one file per Monte Carlo run. This data is post-processed to combine data across all dates and simulations. It can then be aggregated at desired geographic levels. A separate file is created for each requested administrative level, with each row indexed by data, admin ID, and quantile. The columns of this output file are described in the tables below.
Aggregated files are placed in subfolder named using the Monte Carlo ID within the specified output directory. Filenames are constructed by appending the aggregation level with the aggregation type (quantiles vs mean). For example, the following file contains quantiles at the national level:
/output/2020-06-10__14_13_04/adm0_quantiles.csv
An example output directory structure is shown below:
2020-07-28__15_21_52/
├── adm0_quantiles.csv
├── adm1_quantiles.csv
├── adm2_quantiles.csv
├── maps
│ └── ADM1
│ ├── adm1_AlabamaDailyReportedCases2020-07-26.png
│ ├── adm1_AlabamaDailyReportedCases2020-08-02.png
│ ├── ...
└── plots
├── ADM1
│ ├── DailyReportedCases_Alabama.png
│ ├── ...
├── US.csv
└── US.png
Column and Index Names#
Index name |
Description |
---|---|
adm* |
The adm ID corresponding to the geographic level (i.e. adm2 ID) |
date |
The date |
quantile |
Quantile value |
Column name |
Description |
---|---|
case_reporting_rate |
Case reporting rate |
active_asymptomatic_cases |
Current number of actively infectious but asymptomatic cases |
cumulative_cases |
Cumulative number of cumulative cases (including unreported) |
cumulative_deaths |
Cumulative number of deaths |
cumulative_deaths_per_100k |
Cumulative number of deaths per 100,000 people |
cumulative_reported_cases |
Cumulative number of reported cases |
cumulative_reported_cases_per_100k |
Number of reported cumulative cases per 100,000 people |
current_hospitalizations |
Current number of hospitalizations |
current_hospitalizations_per_100k |
Number of current hospitalizations per 100,000 people |
current_icu_usage |
ICU bed usage |
current_vent_usage |
Current ventilator usage |
total_population |
Population |
daily_cases |
Number of daily new cases (including unreported) |
daily_deaths |
Number of daily new deaths |
daily_hospitalizations |
Number of daily new hospitalizations |
daily_reported_cases |
Number of reported daily new cases |
doubling_t |
Local doubling time as estimated from the historical data |
R_eff |
Local effective reproductive number |