Introduction to the SIMPLE Macro, a Tool to Increase the Accessibility of 24-Hour Dietary Recall Analysis and Modeling

ABSTRACT Background Information on long-term dietary intake is often required for research or program planning, but surveys routinely use short-term assessments such as 24-h recalls (24HRs). Methods to reduce the impact of within-person variation in 24HRs, such as the National Cancer Institute (NCI) method, typically require extensive training and skill. Objectives We introduce the Simulating Intake of Micronutrients for Policy Learning and Engagement (SIMPLE) macro, a new tool to increase the accessibility of 24HR analysis. We explain the underlying theory behind the tool and provide examples of potential applications. Methods The SIMPLE macro connects the core NCI statistical code to estimate usual intake distributions and includes additional code to enable advanced analyses such as predictive modeling. The related SIMPLE-Iron macro applies the full probability method to estimate inadequate iron intake, and the SIMPLE-1D macro is used for descriptive or modeling analyses of data with a single 24HR per person. The macros and associated documentations are freely available. We analyzed data from the US National Health and Nutrition Examination Survey (NHANES) and the Cameroon National Micronutrient Survey to compare the SIMPLE macro to 1) the core NCI code using the Estimated Average Requirement cut point method, and 2) the IMAPP software for iron only, and to demonstrate the applications of the SIMPLE macro for estimating usual intake and predictive modeling. Results The SIMPLE macro generates identical results to the core NCI code. The SIMPLE-Iron macro also produces estimates of inadequate iron intake comparable to the IMAPP software. The examples demonstrate application of the SIMPLE macro to 1) descriptive analyses of nutrient intake from food and supplements (NHANES), and 2) analyses accounting for breast-milk nutrient intake and modeling fortification and supplementation programs (Cameroon). Conclusions The SIMPLE macros may facilitate the analysis and modeling of dietary data to inform nutrition research, programs, and policy.


Introduction
Poor diet contributes to each of the diverse forms of malnutrition that now coexist across the globe, from wasting and micronutrient deficiencies to diet-related noncommunicable disease (1,2). High-quality information on usual dietary intake is a critical input to plan effective and efficient programs to improve nutritional status and related outcomes (3). Such data are needed to understand food consumption patterns, assess which nutrients may be of public health concern (either because intakes are low or because they are high), and determine which population subgroups are most affected. Information on the predicted effect of hypothetical nutrition intervention programs on dietary intake and other outcomes may also guide decisions regarding how to address population-specific nutrition problems.
Analysts of dietary intake data are typically interested in information on usual, or long-term average, nutrient intake, including parameters such as the mean, median, and interquartile range (IQR) of the distribution, or the prevalence of inadequate or excessive usual nutrient intakes. In contrast, surveys routinely rely on short-term assessments of dietary intake, such as 24-h recalls (24HRs), to collect information on population intake. Estimation of usual intake is challenging when such methods are used, however, because the distribution of dietary intake measured on ≥1 d is subject to substantial within-person variation (due largely to the day-to-day variation of a person's dietary intake in relation to their usual intake). To address this problem, researchers have developed methods to mathematically adjust for this day-to-day variation to estimate the distribution of usual intake using short-term data (4)(5)(6)(7)(8). Among these methods, the National Cancer Institute (NCI) method has been widely used for analysis of foods or nutrients in 24HR data (8), using freely available SAS macro code. Although the NCI method is very flexible and powerful, it has a fairly steep learning curve. Applying the method requires understanding the theoretical basis for analyzing dietary intake data, mastery of multiple SAS macros, and a high proficiency in SAS coding to enable data-set-specific modifications.
Researchers and policymakers are also often interested in knowing how a population's diet can be improved by hypothetical nutrition-related interventions, such as micronutrient fortification, dietary supplementation, or interventions intended to improve diet quality or diversity (9)(10)(11)(12)(13)(14)(15). Modeling the effect of nutrition interventions involves complex dietary analysis and can be conducted with the NCI macro code, but further effort and technical skill are necessary to modify the input data or code to meet these objectives. Organizations that are willing and able to invest in building this capacity must also consider the time required to do so; this time lag may result in lost opportunities to make dietary data available during critical decision-making periods. The need for global efforts to standardize guidance for dietary data analysis and interpretation has been repeatedly emphasized by recent nutrition initiatives (3,16,17).
To improve the utilization of dietary data by helping researchers and analysts conquer the difficulties of applying the NCI method, the University of California, Davis (UCD) and NCI jointly developed the UCD/NCI Simulating Intake of Micronutrients for Policy Learning and Engagement (SIMPLE) macro. The SIMPLE macro provides a single tool that can be used for both basic and advanced dietary analysis and modeling. Two primary advantages of this tool are that 1) it combines the several existing NCI macros that need to be used jointly to estimate distributions of usual intake for any dietary components consumed by nearly everyone nearly every day ("nearly-daily"), which reduces the time required for analysts to do this linking themselves; and 2) in addition to typical descriptive analyses, the tool allows for advanced dietary analysis and modeling, such as incorporating nutrient intake from supplements and breast milk into total nutrient intake or simulating the effect of micronutrient fortification of foods. The dietary components that can be analyzed with the SIMPLE macro include nutrients, energy, food, or other dietary components, but, for simplicity, in this article we use "nutrients" to refer to all dietary components consumed nearly daily.
Variations of this macro, the SIMPLE-Iron and SIMPLE-1D, are available to address specific challenges, such as the need to apply the full probability method to assess nutrient adequacy for iron and the availability of only a single day of 24HR per person (18). The SIMPLE, SIMPLE-Iron, and SIMPLE-1D macros and example codes are available online (Open Science Framework link: https://osf.io/ytd34/); the corresponding SIMPLE macro user manual and a detailed description of the method used in the SIMPLE-Iron are provided with this article (Supplemental User Manual 1, Supplemental Method 1). We hope the detailed user manual and examples will allow users to modify the example codes for their own dietary research.
The objectives of this article are to describe the general structure and features of the SIMPLE macro, explain the underlying theory, and present example applications of the SIMPLE macro for descriptive analyses and dietary modeling. We first provide an overview of the NCI macros, which form the underlying analytical method of the SIMPLE macro. We then describe the general structure and features of the SIMPLE macro (including the SIMPLE-1D and SIMPLE-Iron variations) and compare the usual intake distribution estimated by the SIMPLE macro with that generated by the standard NCI macros; we also compare the results from applying the full probability method for estimating inadequate iron intake using the SIMPLE-Iron macro with the same analysis conducted with the IMAPP software (5). Next, we describe the core methods applied within the SIMPLE macro to implement selected advanced analyses and modeling techniques. Finally, we present example applications of the SIMPLE macro to national survey data from the United States (19) and Cameroon (20) and comment on the strengths and limitations of the SIMPLE macro with respect to other approaches to dietary analysis.

Overview of the NCI method for estimating usual nutrient intake
The NCI method for estimating usual intake distributions is implemented in 2 macros written in the SAS programming language: MIXTRAN and DISTRIB (21). Figure 1A shows an overview of how the NCI method estimates usual intake distributions. The MIXTRAN macro transforms dietary intake data to approximate normality, models the transformed data as the sum of a covariate-based prediction function and 2 kinds of deviations from that function (between-and withinindividual deviations), and produces parameter estimates and covariatebased predictions used as inputs for the DISTRIB macro. In turn, the DISTRIB macro generates a simulated data set of estimated usual intakes based on the 24HRs by, first, multiply imputing the betweenperson deviations and then adding them to each sampled person's prediction, and, second, converting the imputed values back to the original scale. By only imputing between-person deviations and by using a numerical integration for the back-transformation, this approach analytically corrects for within-person variation. Percentiles of usual intake and proportions of the population with usual intake below or above specified values can be estimated from the simulated data set. Advanced analyses are also supported by using the option of saving the data set. The MIXTRAN macro will not run on data sets without repeated recalls on at least a subsample of individuals, but using the approach of Luo et al. (22), the MIXTRAN macro can be replaced with a different macro (TRAN1) that uses an external ratio of within-and between-person variance and more restrictive assumptions to perform the modeling step with only one 24HR per person.
Obtaining Standard Errors (SEs) essential to inference for parameter estimates from MIXTRAN and usual intake percentiles or proportions from DISTRIB is not straightforward ( Figure 1A). For some complex survey designs, SEs can be calculated using the Balanced Repeated Replication (BRR) method (23); for other survey designs or in simple randomized samples, a bootstrap approach is required. Both methods entail running the MIXTRAN/DISTRIB combination many times on modified input data, saving the desired output each time, then FIGURE 1 Comparison of the procedures between the SIMPLE macro and NCI method. BRR, Balanced Repeated Replication; NCI, National Cancer Institute; SE, Standard Errors; SIMPLE, Simulating Intake of Micronutrients for Policy Learning and Engagement; UCD, University of California, Davis; 24HR, 24-h recall. The shaded boxes show that the SIMPLE macro condenses steps [1] to [7] of the NCI method. For data sets with a single 24HR per person, the NCI MIXTRAN macro will be replaced with the UCD/NCI TRAN1 macro and an external variance ratio to estimate the distribution of usual intake, and the SIMPLE macro will be replaced with the SIMPLE-1D macro and an external variance ratio to estimate the distribution of usual intake.
estimating SEs from the replicated output using code such as that provided by the NCI BRR_PVALUE_CI macro (21). A substantial amount of SAS programming is required to automate this approach for application to a variety of nutrients and/or data sets, with careful attention paid to properly combining the MIXTRAN and DISTRIB macros.

Overview of the NCI/UCD SIMPLE macro
The SIMPLE macro is a single macro that links 3 NCI macros, the MIXTRAN, DISTRIB, and BRR_PVALUE_CI, to facilitate estimation of usual intake distributions for food and nutrients consumed "nearlydaily." The SIMPLE macro also includes additional features, such as carrying out checks on the input data sets and supporting a variety of specific analyses, including modeling nutrition-related interventions (13,15,24,25); however, before this article, the associated codes and analytical details have not been published, which may have limited the extent to which other researchers could then apply these methods to their own data. Figure 1 shows the relation between the SIMPLE macro and the core NCI macros. Because the SIMPLE macro serves as a connector of the existing NCI macros without modifying any core part of the NCI macros, the results of using the SIMPLE macro are exactly the same as those of using several NCI macros jointly. We applied the SIMPLE macro to the example data provided by the NCI website for estimating the usual intake distribution of added sugars (21). The results generated by the SIMPLE macro were identical to the results provided on the NCI site that correspond to the example data and codes for the MIXTRAN, DISTRIB, and BRR_PVALUE_CI macros ( Table 1). This observation has been confirmed through extensive testing of the SIMPLE macro on 3 nationally representative surveys: multiple cycles of the National Health and Nutrition  (20), and the Ethiopian National Food Consumption Survey (26). Figure 2 shows an overview of the SIMPLE macro inputs, processes, and outputs. Data sets and macro options are checked for formatting and model specification errors before starting analysis; if any errors are detected, the macro provides guidance for correcting them. The output from a successful run includes a formatted spreadsheet (suitable for use in reports or manuscripts) containing estimates and associated SEs for characteristics of the usual intake distribution, such as the mean, median, IQR, and proportions of the population with inadequate or excessive nutrient intake. The core SIMPLE macro estimates inadequate nutrient intake using the Institute of Medicine Estimated Average Requirement (EAR) cut-point method, defining prevalence of inadequate intake as the proportion of individuals with intake below each individual's age-and sex-specific EAR for a given nutrient; likewise, excessive intake is defined as intake above the corresponding Tolerable Upper Intake Level (UL) (27). The SIMPLE macro also outputs model parameters such as within-and between-person variance components and coefficients for included covariates that influence the distribution of usual intake.

SIMPLE-Iron
The EAR cut-point method requires several assumptions, including the assumption that the distributions of nutrient requirements are symmetrical (27), which is known to be violated in the case of iron requirements for populations such as menstruating women. Although the SIMPLE macro can be used to estimate some parameters of iron intake distribution (mean, median, and percentiles), the SIMPLE macro, which uses the EAR cut-point method, is not appropriate for computing the prevalence of inadequate usual iron intake. To enable estimation of the prevalence of inadequate iron intake, we developed the SIMPLE-Iron macro, which is a variation of the SIMPLE macro that applies the full probability method instead of the EAR cut-point method to estimate inadequate iron intake (18). Although the SIMPLE-Iron macro was specifically developed to analyze usual iron intake, it can also be modified to apply the full probability method to other nutrients of interest. A detailed description of the methods and assumptions used in the SIMPLE-Iron macro is provided with this article (Supplemental Method 1). We compared the results of the SIMPLE-Iron macro with the prevalence of inadequate iron intake as estimated by the IMAPP software (5), which also applies the full probability method for iron (18) among children and adolescents 9-18 y of age using NHANES 2011-2014 ( Table 2). When the SIMPLE macro is applied in a way that is analogous to the way IMAPP functions (stratifying analyses of different age and sex groups, without applying any other covariates), the results from the IMAPP software and SIMPLE macro are similar, and the estimates from IMAPP are within the CIs of the results generated by the SIMPLE macro (by default, IMAPP only produces SEs for percentiles when estimating iron usual intake; therefore, we only present the CIs for percentiles).

SIMPLE-1D
Although it is always recommended to collect repeated 24HRs per person, some food consumption surveys and research studies have included only 1 single 24HR per person, owing to logistical challenges or limited resources (25,28). Therefore, the SIMPLE-1D macro was developed as an extension to the SIMPLE macro and can be used to perform the same types of descriptive and advanced analyses on these "single-day" data sets. Technically, just as the SIMPLE macro leverages the ability to manipulate the output from the NCI DISTRIB macro, the SIMPLE-1D macro leverages the ability to manipulate the input to DISTRIB by replacing the MIXTRAN with the TRAN1 macro, which uses external data on the ratio of within-and between-person variance (22).

Application of the SIMPLE macros for advanced analyses and modeling
In the following paragraphs, we explain how the SIMPLE macro modifies the NCI macros to perform dietary analysis and modeling beyond the estimation of usual intake distributions. The SIMPLE macro leverages the ability to manipulate the simulated data set produced by DISTRIB to easily perform some advanced analyses. The SIMPLE macro expands on this technique by in addition incorporating variables that add specified amounts to the simulated usual intakes themselves. This option is intended to account for sources of usual nutrient intake not routinely captured by 24HRs, such as dietary supplements and breast milk, which are often quantified via longer-term assessments (29,30). For dietary supplements in particular, these values may cluster around a small number of dosages that are extremely high compared with the continuous distribution of usual intakes from food sources. For example, iron supplements commonly provide 30 mg Fe/d, which is likely to be substantially higher than usual food iron intake. In the United States, >50% of the adult population regularly consumes dietary supplements, and dietary supplements substantially contribute to total nutrient intake (31,32). Thus, when evaluating the prevalence of inadequate or excessive intake, estimating usual nutrient intake from food sources only provides an incomplete picture of dietary patterns (29). Similarly, for infants and young children, breast milk is an important source of nutrients. Whereas individual-level data on usual breast milk intake and nutrient composition are rarely available, published estimates can be used to approximate the contribution of breast milk to total nutrient intake. To capture nutrient intake from dietary supplements (reported or hypothetical) or breast milk intake, the SIMPLE macro adopts the "shrink-then-add" (30) approach to estimate distributions of usual intake from multiple data sources with different temporal scopes. The macro can use either individual-level data (e.g., from a dietary supplement questionnaire or data on an individual child's breast milk intake measured using isotope dilution) or grouplevel data (e.g., estimated average nutrient intake from breast milk across age groups or geographic strata). The latter case is also relevant for simulating the effects of national dietary supplement distribution programs, where a fixed dosage of a dietary supplement is proposed (e.g., 30 mg/d of iron supplements for pregnant women) (33) and

FIGURE 2
Overview of the SIMPLE macro inputs, processes, and outputs. IQR, interquartile range; SE, Standard Errors; SIMPLE, Simulating Intake of Micronutrients for Policy Learning and Engagement. Input parameters means that users should specify the relevant variable names, such as the names of nutrient intake, survey weight parameters, and unique identification as inputs to the SIMPLE macro.
coverage of dietary supplements is hypothesized or estimated from external data sources (14,15,25).
Like supplementation, fortification of staple foods, condiments, or other products can be an effective means to reduce micronutrient deficiency. Modeling is useful to assess the likely impact of such a strategy before investing in a national program (14,15,25). Because fortification involves changes to the nutrient content of foods which are captured by a 24HR, it is appropriate to modify 24HR observations to reflect hypothetical fortification before applying the NCI method (i.e., the "add-then-shrink" method) (30). This modeling can be accomplished by adjusting the food composition table value for the hypothetical fortified product, or in the 24HR database by multiplying the amount of fortifiable food consumption by the fortification level and adding this to the total nutrient intake calculated without fortification.
In all, the extended advanced modeling capabilities of the SIMPLE macro allow evaluation of the effects of proposed combinations of fortification and supplementation programs on inadequate or excessive nutrient intake. Researchers and policy advocates can use this information to answer questions relevant to program or policy design, such as comparing the effectiveness and safety of alternative fortification scenarios or supplement distribution strategies.

Results
In this section, we present 4 example analyses for the purpose of illustrating the potential applications of the SIMPLE macro. The code, sample data, and a detailed user guide are available online (Open Science Framework: https://osf.io/ytd34/). Users can download additional results from all the examples online (https://osf.io/8ygh3/download). The first example uses data from adult women in the NHANES 2011-2014 (19) to perform a descriptive analysis of usual calcium intake from food sources alone and from the combination of food sources and dietary supplements. The second example uses the same data from the NHANES to estimate inadequate iron intake using the full probability method among children and adolescents 9-18 y old with the SIMPLE-Iron macro. The third example uses data from the CNMS (20) to model the effects of hypothetical vitamin A fortification and supplementation programs among preschool children, accounting for the contribution of breast milk intake to vitamin A intake. The fourth uses the first 24HR per person from the CNMS and implements the same analyses as the third example, to illustrate use of the SIMPLE-1D macro. We strongly recommend that users thoroughly review the NCI measurement error webinar series (34) and the measurement error webpage (35) that explains the theory underpinning the SIMPLE macro. We also recommend that users personally work through the provided examples to gain practical experience before applying the SIMPLE macro to their own studies.

Example 1: Descriptive analysis using the EAR cut-point method and data from NHANES 2011-2014
This example uses data on women 19 y of age and older from NHANES 2011-2014 (19). In this population, individual women have different corresponding EAR and UL values owing to their physiological status, such as pregnancy or lactation. In this example, we applied the SIMPLE macro to estimate 1) the distribution of usual calcium intake from food sources and the proportions of the population with inadequate and excessive usual intakes using a single cutoff for all individuals; 2) the distribution of usual calcium intake and prevalence of inadequate and excessive intakes from food sources using individual-specific EAR and UL cutoffs (i.e., applying appropriate cutoffs to account for the age of each individual in the data set); and 3) the distribution of usual calcium intake and prevalence of inadequate and Values generated from the SIMPLE macro are mean (95% CI). IMAPP only produces SEs for percentiles by default when estimating iron usual intake; therefore, we only present the CIs for percentiles. Estimates from IMAPP are within the CIs of the estimates from the SIMPLE macro. Usual iron intake was estimated based on the assumption of 18% iron absorption and mixed oral contraceptive use among females. The SIMPLE macro was applied in a way that is analogous to the way IMAPP functions (stratifying analyses of different age and sex groups, without applying any other covariates). NCI, National Cancer Institute; NHANES, National Health and Nutrition Examination Survey; SIMPLE, Simulating Intake of Micronutrients for Policy Learning and Engagement; UCD, University of California, Davis.
excessive intakes accounting for both food sources and dietary supplements, and using individual-specific EAR and UL cutoffs.
The BRR method was used to estimate SEs for these results.
The results for Example 1 are presented in Table 3, which illustrates the direct output of the SIMPLE macro (although formatted to adhere to journal policy). Our analysis shows that whereas the mean and percentiles of total nutrient intake were the same in the first 2 analyses, the analysis using individual-specific EAR cutoffs estimated a substantially higher prevalence of inadequate calcium intake among adult women (57.1%) than the analysis using a single EAR value (45.8%). After including calcium intake from supplements, the mean calcium intake was estimated to increase from 862 mg/d to 1071 mg/d, the inadequate calcium intake to decrease from 57.1% to 40.1%, and the excessive intake to increase from 0% to 4.2%. Therefore, applying individual-specific EAR and UL values may change the prevalence of inadequate or excessive nutrient intakes (theoretically increasing the accuracy of the analysis), and including nutrient intake from supplements to capture all sources of the nutrient can change the usual nutrient distribution and prevalence of inadequacy.

Example 2: Descriptive analysis using the full probability approach and data from NHANES 2011-2014
In this example, we used data on children and adolescents 9-18 y old from NHANES 2011-2014 (19) to demonstrate estimation of the distribution of usual iron intakes and the prevalence of inadequate iron intake using the SIMPLE-Iron macro. The SIMPLE-Iron macro applies the full probability method using age-and sex-appropriate reference values to estimate inadequate intake among children and adolescents, for whom the distribution of iron requirements has been defined by the US Institute of Medicine (IOM) Panel on Micronutrients in 2001. The default iron absorption value in the SIMPLE-Iron macro is 18%, consistent with the estimate of fractional iron absorption that was assumed by the IOM for a population with mixed diets (vegetarian and nonvegetarian) in calculating the recommended dietary iron intakes. However, the SIMPLE-Iron macro also allows users to input their own estimate of iron absorption, including the use of algorithms to estimate "absorbable iron" (Supplemental Method 1). Table 4 presents the results of Example 2.

Example 3: Descriptive analysis and modeling of supplementation and fortification programs using the CNMS
In this example, we analyzed 24HRs from children 1-5 y of age from the CNMS (15,20) to demonstrate use of the SIMPLE macro to estimate 1) the distribution of usual vitamin A intake from food sources and the associated prevalence of inadequate vitamin A intake using age-specific EAR cutoffs (i.e., for children 1-3 y of age compared with 4-5 y of age); 2) the distribution of usual vitamin A intake and associated prevalence of inadequacy accounting for both food sources and estimated breast milk consumption (among children reported to be breastfeeding at the time of the survey); 3) the distribution of usual vitamin A intake and associated prevalence of inadequacy accounting for food sources, estimated breast milk consumption, and a simulated dietary supplementation program (assumed to deliver a daily dose of vitamin A containing 167 μg retinol to a random sample of 90% of the target population); and 4) the distribution of usual vitamin A intake and associated prevalence of inadequacy accounting for food  sources, estimated breast milk consumption, a simulated dietary supplementation program, and a simulated oil fortification program (assumed to fortify all industrially refined cooking oil with 12 μg retinol/g of oil). The BRR method is used to estimate SEs for these results. The results of Example 3 show the predicted increase in total vitamin A intake among children after including vitamin A intake from estimated breast milk consumption and simulated vitamin A supplementation and fortification programs ( Table 5). Figure 3 shows the predicted change in shape of the distribution of vitamin A intake when vitamin A intake from breast milk and dietary supplements is included, due to the relatively high amounts of vitamin A in these sources compared with the vitamin A intake from food.

Example 4: Descriptive analysis and modeling of supplementation and fortification programs using the CNMS with only a single 24HR per person
To demonstrate application of the SIMPLE-1D macro, we used the same data set as the previous example (20) but retained only the first 24HR per person. The example includes all 4 scenarios of vitamin A analysis and modeling among children 1-5 y of age as shown in Example 3. For single-day dietary data, users cannot estimate the ratio of within-and betweenperson variance components internally (i.e., from the same data set), which they can when they analyze dietary data that include repeated recalls on the same person. Thus, an external variance ratio, defined as a ratio estimated from a similar population from a different study, is often used (22,36). In this example, we used the ratio of within-and between-person variance components of vitamin A intake estimated from Example 3 (i.e., an internal variance components ratio from the same data set) and, therefore, our results for Example 4 are similar to those of Example 3 (Table 5). However, if the assumed external variance ratio deviates from the true variance ratio, the resulting usual intake estimates can be biased (22,36); therefore, users should be cautious when applying an external variance ratio to their single-day dietary data and should always conduct sensitivity analyses with a range of external ratios.

Discussion
The UCD/NCI SIMPLE macro provides a streamlined structure for 24HR analysis that builds on the existing core NCI macros for estimating usual or long-term dietary intakes from multiple 24HRs. As for other similar dietary analysis tools, users need to understand the theoretical basis for analysis of 24HR data and have familiarity with basic computer code. With this background, the new, freely available tool simplifies both basic descriptive analyses and complex analyses of 24HR data, including accounting for supplement and breast milk intake, and modeling the impact of hypothetical fortification or supplementation programs, and their combinations. Extensions of the tool permit analyses of special cases when the full probability method is required to assess dietary adequacy, and when only a single day of dietary data is available for each person. With these in place, the tool allows "shortcuts" in programming capacity that can dramatically shorten the time from data cleaning to presentation of a final results table.
To strengthen data and information systems for nutrition and provide timely information for policy decision making, the Global Nutrition Report in 2017 proposed the concept of the "nutrition data value chain," which has 5 critical processes-data prioritization, creation and collection, curation, analysis, and interpretation/recommendation-before the final step, decision making (16,37). To date, enormous effort has been spent overcoming the early challenges in this chain: dietary data collection and curation. Open-source and standardized dietary data collection tools, such as ASA24 (38) and Intake24 (39), have been developed, validated, and used in multiple national surveys and research studies. For low-income countries, standardized data collection methods were proposed >2 decades ago (40). More recently, mobile-or tablet-based dietary data collection tools have been independently developed by nutrition researchers, such as the mobile application-and web-based dietary database by the International Dietary Data Expansion (INDDEX) (3) and the mobile-based, open-source dietary collection tool by Caswell et al. (41). In contrast, less progress has been achieved in streamlining the analysis of detailed dietary intake data, such as those collected using 24HR methods (17). Although technical guidelines exist for best practices in analysis of usual nutrient intake (34,42), 24HR analysis still relies primarily on certain statistical packages that require both a high level of proficiency in computer coding and a deep understanding of the theoretical bases of 24HR analysis (6,8) or software that is accessible to new users but has more limited features (5,7).
The SIMPLE macro is a multifunctional dietary software tool that can be applied to diverse global contexts to help fill the gap in the nutrition data value chain between data collection and interpretation for decision making. Because the SIMPLE macro has simplified procedures for data processing using the  2 We assumed that breastfed children in the North, South, and Cities consumed 550.6, 232.2, and 473.2 μg RAE/d of vitamin A from breast milk, respectively. The regional variation in vitamin A intake from breast milk reflects differing estimates of breast-milk vitamin A concentrations by region. 3 The simulated dietary supplementation program was assumed to deliver a daily dose of 167 μg retinol to a random sample of 90% of the target population. 4 The simulated oil fortification program was assumed to fortify all industrially refined cooking oil with 12 μg retinol/g of oil.
core NCI method as well as the ability to automatically output a spreadsheet containing the information typically requested on usual intake (e.g., mean, median, inadequate or excessive intake), timely reporting of results can be achieved more easily.
It is important to note that, because the SIMPLE macro uses the existing NCI macros, the SIMPLE macro does not shorten the computation time. Instead, the SIMPLE macro reduces the time spent on 1) learning the in-depth theory of advanced dietary analysis and the NCI approach in particular, to allow appropriate manipulation of the NCI macros; and 2) creating the code to modify several NCI macros and the output data sets for each NCI macro. Using the existing NCI macros for descriptive analyses requires implementing a minimum of 6 steps to apply the MIXTRAN, DISTRIB, and NCI BRR_PVALUE_CI macros, and additional steps may be required to conduct any advanced modeling work. In contrast, the SIMPLE macro serves as an "all-in-one" macro "shell" that can be used to carry out a wide range of analyses in a single step. The capability to model nutrition intervention programs, such as food fortification programs, is important to assess the likely impacts of such programs on dietary adequacy or excess; this information is useful both when generated in advance to guide the planning process, and after programs are in place, to guide later program management decisions. In the United States, scientists and policymakers rely heavily on the scientific evidence generated from analyzing NHANES (43) to develop the Dietary Guidelines for Americans. Our extensive testing demonstrates that the SIMPLE macro can be successfully applied to NHANES data, so researchers could use the SIMPLE macro to explore new research questions or to make the results of the latest cycles of NHANES available in a speedy manner. In addition, there is an increasing push to improve research capacity at institutions in low-and middle-income countries. The SIMPLE macro can facilitate these efforts by providing a user-friendly platform for researchers to conduct advanced dietary analysis and modeling, as well as report relevant findings to local governments and the international community, with substantial reductions in the time and cost required to do so. The SIMPLE macro also has several limitations. First, the SIMPLE macro was developed to analyze "nearly-daily" consumed dietary components. The tool was designed to estimate population distributions of usual intake and does not incorporate the NCI method's approach to predict the relation between usual dietary intake and health outcomes (e.g., the relation between whole wheat consumption and cardiovascular health). Second, the tool cannot analyze dietary components that are episodically consumed by a population (e.g., whole wheat products in the United States, or preformed retinol intakes in some populations with very low animal-sourced food consumption). Third, the SIMPLE macro can only generate unbiased results when the dietary data are of high quality and represent the population's dietary intake. It cannot correct for errors in data collection or misreporting, such as extreme or unreasonable values of food or nutrient intake, that can exist in 24HR data. Careful data cleaning and mobile-or tabletbased dietary data collection applications that have the built-in feature of automatically cleaning the dietary data (e.g., range SIMPLE macro to simplify 24-h analysis 1337 FIGURE 3 Distribution of vitamin A intake (μg RAE/d) from food, breast milk, simulated supplementation program and oil fortification program among children 1 to 5 years of age from the Cameroon National Micronutrient Survey 2009, estimated using the UCD/NCI SIMPLE macro. NCI, National Cancer Institute; RAE, Retinol Activity Equivalent; UCD, University of California, Davis. We assumed that breastfed children in the North, South, and Cities consumed 550.6, 232.2, and 473.2 μg RAE/d of vitamin A from breast milk. The regional variation in vitamin A intake from the breast milk are due to varying vitamin A concentrations in mothers' breast milk by region. The simulated dietary supplementation program was assumed to deliver a daily dose of 167 μg retinol to a random sample of 90% of the target population. The simulated oil fortification program was assumed to fortify all industrially refined cooking oil with 12 μg retinol/g of oil.
checks on portion sizes to avoid entering improbable values), in addition to rigorous, standardized training of interviewers, may help to minimize misreporting of dietary intake. Fourth, the SIMPLE macro has specific technical requirements. Users need to have both SAS (SAS Institute) and Microsoft Excel installed. As the non-academic-based SAS license is expensive, resourceconstrained analysts or those who are not affiliated with an academic institution may need to partner with universities to carry out dietary analysis with this tool. In addition, users are required to have basic SAS programming skills, which requires time and access to appropriate training materials. These user requirements may block the usage of the SIMPLE macro from a wider audience. In the future, we aim to improve the use of the SIMPLE macro and address these limitations. To facilitate use of the current macro version, we have designed training materials to enhance the usage of the SIMPLE macro and build the capacity of researchers and policy analysts (training materials are available upon request). Plans to expand the current SIMPLE macro tool include the addition of new functionalities, such as analysis of episodically consumed nutrients and foods. In the longer term, we envision development of a web-based tool based on the SIMPLE macro to further increase the accessibility of the method and decrease the time and resources required to make dietary data results available.
In conclusion, although the value of dietary analysis for nutrition research and program planning is widely recognized (3), such data may be underutilized owing to the complexity of appropriate statistical analyses and the corresponding capacity needs. By providing a more user-friendly structure around the existing core NCI macros, the SIMPLE macro streamlines basic descriptive analyses of 24HR data while also providing extra functionality to enable more complex analyses. Specifically, the tool permits users to estimate the observed contributions of supplements or breast milk to usual nutrient intakes and model the effects of hypothetical nutrition intervention programs, such as supplementation or food fortification. Variations of the tool help to overcome other technical challenges, such as applying the full probability method to estimate inadequate iron intake and estimating usual intake distributions with only a single day of data per person. The SIMPLE macro has the potential to make analysis of 24HR data more accessible by reducing the time and resources needed to conduct high-quality dietary analyses, facilitating the availability of dietary intake data for research and to inform nutrition policies and programs.