-
PDF
- Split View
-
Views
-
Cite
Cite
Steve Gutreuter, Michael A. Boogaard, Prediction of lethal/effective concentration/dose in the presence of multiple auxiliary covariates and components of variance, Environmental Toxicology and Chemistry, Volume 26, Issue 9, 1 September 2007, Pages 1978–1986, https://doi.org/10.1897/06-630R.1
Close - Share Icon Share
Abstract
Predictors of the percentile lethal/effective concentration/dose are commonly used measures of efficacy and toxicity. Typically such quantal‐response predictors (e.g., the exposure required to kill 50% of some population) are estimated from simple bioassays wherein organisms are exposed to a gradient of several concentrations of a single agent. The toxicity of an agent may be influenced by auxiliary covariates, however, and more complicated experimental designs may introduce multiple variance components. Prediction methods lag examples of those cases. A conventional two‐stage approach consists of multiple bivariate predictions of, say, medial lethal concentration followed by regression of those predictions on the auxiliary covariates. We propose a more effective and parsimonious class of generalized nonlinear mixed‐effects models for prediction of lethal/effective dose/concentration from auxiliary covariates. We demonstrate examples using data from a study regarding the effects of pH and additions of variable quantities 2',5'‐dichloro‐4'‐nitrosalicylanilide (niclosamide) on the toxicity of 3‐trifluoromethyl‐4‐nitrophenol to larval sea lamprey (Petromyzon marinus). The new models yielded unbiased predictions and root‐mean‐squared errors (RMSEs) of prediction for the exposure required to kill 50 and 99.9% of some population that were 29 to 82% smaller, respectively, than those from the conventional two‐stage procedure. The model class is flexible and easily implemented using commonly available software.
INTRODUCTION
Quantities such as the percentile lethal/effective dose/concentration predict the exposure required to evoke some quantal response (e.g., death) in 100p*% of some population, where 0 < p* < 1. Such prevalence‐predicting exposures, herein denoted LC100p* for simplicity, are commonly used end points in toxicity studies and regulatory assessments. The median lethal concentration (LC50), for example, is defined by p* = 0.5. Values of LC100p* are not observable as primary data. Rather, they must be predicted from the inversion of models that predict the probability of the quantal response from exposure dose or concentration. Typically, LC100p* is predicted from relatively simple bivariate bioassays wherein organisms are exposed, often groupwise, to one of several concentrations of a single agent, and much attention has been devoted to that prediction [1–6].
The toxicity or efficacy of an agent also might be affected by multiple factors, including the presence of bivalent cations [7], pH [8], or dissolved organic carbon [9]. In such cases, it is desirable to predict LC100p* over the ranges of those auxiliary covariates, but the development of methods has lagged behind need. Conventionally, an intuitively reasonable two‐stage approach has been used wherein multiple LC100p* values—each from a specific level of an ancillary covariate—are predicted using conventional bivariate models and subsequently treated as observed data in second‐stage regression models that include the auxiliary covariates [9–11]. That conventional two‐stage approach lacks parsimony and coherence, however, because estimation of LC100p* and prediction over the auxiliary covariates are separated. Furthermore, the uncertainty of the LC100p* estimates may be misrepresented if the designs of the studies entail variance components that cannot be accommodated using that conventional two‐stage approach. We propose a simple and coherent class of models to overcome those limitations.
The present study was motivated by the need to provide improved estimates of the effects of 2',5'‐dichloro‐4'‐nitrosalicylanilide (niclosamide) on the toxicity of 3‐trifluoromethyl‐4‐nitrophenol (TFM) to larval sea lamprey (Petromyzon marinus) over a wide range of ambient pH. Adult sea lamprey are parasitic on other fishes, and repeated attacks can be lethal. The Laurentian Great Lakes ecosystem was radically altered by high mortality of large‐bodied native fishes following the invasion of sea lamprey after the opening of the Welland Canal, and TFM has been used to control sea lamprey in the Great Lakes of North America since approximately 1959 [12].
The uptake of TFM decreases with increasing pH, because the ionized form of the TFM molecule, which becomes increasingly dominant as ambient pH increases beyond its dissociation constant (pKa) of 6.07, transfers less readily across the gill membrane than the unionized form [13]. This sensitivity to pH is critically important, because the pH of spawning tributaries varies spatially and temporally. Additions of small amounts of niclosamide are used to boost toxicity, especially in high‐pH streams. Therefore, successful treatment with these agents requires prediction of toxicant formulation and concentration, based on pH, for each application to a spawning tributary.
The treatment of sea lamprey spawning areas is expensive and can have untoward effects. Although TFM and niclosamide are relatively selective for larval sea lamprey, juvenile lake sturgeon (Acipenser fulvescens) also are vulnerable [14]. Therefore, current stream‐treatment protocols require application of the minimum formulation needed to kill 99.9% of the larval sea lamprey (LC99.9) while sparing juvenile lake sturgeon. Accuracy and precision are so important that streamside bioassays have been used to estimate toxicant requirements. The standard practice, however, is to predict treatment requirements based on the results of laboratory toxicity tests.
Conventionally, laboratory estimates of LC100p* have been treated as observed data and regressed on niclosamide concentration and either pH or alkalinity to produce tables of pH/alkalinity and niclosamide‐specific treatment concentrations [10]. Treatment managers measure pH or alkalinity in situ and then use the standard tables to identify suitable toxicant formulations and concentrations.
We propose a parsimonious class of models for prediction of LC100p* and associated measures of uncertainty from complex studies that include multiple auxiliary variables and variance components. The model class is broad and, therefore, requires presentation using a rather high degree of mathematical abstraction. However, we provide SAS® (SAS Institute, Cary, NC, USA) code (see Appendix [http://dx.doi.org/10.1897/06–630.S1]) to demonstrate some examples.
We demonstrate models using data from a study regarding the toxicity of TFM and niclosamide combinations on larval sea lamprey at multiple values of pH. Estimates of LC100p* may vary among models, and we demonstrate inclusion of that model uncertainty using model averaging. Measures of uncertainty are particularly important for that application, because historically, stream‐treatment managers have made small, ad hoc adjustments to formulations based on intuition and experience. Prediction intervals provide data‐based limits on the range of those ad hoc adjustments.
MATERIALS AND METHODS
Review of bivariate quantal‐response models for estimation of LC100p*
A typical quantal‐response bioassay test consists of some number I of experimental units (EUs), each consisting of ni ⩾ 1 organisms, that are exposed to an increasing gradient of some agent. Let Xi represent the concentration/dose of the agent to which EU i was exposed, such that Xi ⩾ 0 for all i, and let Yi represent the number of test organisms out of each ni that exhibited some quantal response (e.g., death) during the test. Control EUs have Xi = 0. Under the condition that the Yi represent the sums of independent homogeneous Bernoulli random variables, each follows a binomial distribution having parameter pi, which represents the proportion of organisms that exhibit the quantal response in the population that produced EU i. Therefore, LC100p* is traditionally predicted from binomial models of the form

where T−1(·) is the inverse of some tolerance function T(·), and β0 and β1 are parameters describing the relation between pi and hΛ(Xi). We demonstrate the commonly used cumulative standard Gaussian, logistic, and extreme‐value tolerance functions (Table 1). The function hΛ(·) is a device that expands the range of shapes that T(·) can assume, and is equivalent to transformation of exposure X. Prior data transformation is unnecessary, and instead, we incorporate the Box‐Cox power function

to mimic the effect of a broad class of transformations on T(·). Note that hΛ(X) = X for Λ = 1. The single component of variance is Var(Yi) = nipi(1 — pi). Conventional predictions of LC100p*, denoted X̂CONV(p*), are obtained by estimating β0 and β1, then setting pi = p* and solving Equation 1 for X to predict the concentration/dose that is expected to produce the quantal‐response rate of p* and given by

For the Gaussian tolerance function and Λ = 1, for example, the conventional estimates of LC50 are then given by

These are essentially the Litchfield‐Wilcoxon type estimates produced by specialized software such as TOXSTAT® [15] and ToxCalc<35> [16].
Commonly used tolerance functions T(w) = p and their inverses, T−1(p) = w, used in quantal response bioassaysa
| Tolerance function/inverse | T(w) | T−1(p) |
![]() | ||
| Tolerance function/inverse | T(w) | T−1(p) |
![]() | ||
a Here, p and w represent the probability of some quantal response and exposure, respectively. These functions can be generalized by defining w = β0 + β1hΛ(x), where hΛ(x) is any monotonic transformation function in x.
Commonly used tolerance functions T(w) = p and their inverses, T−1(p) = w, used in quantal response bioassaysa
| Tolerance function/inverse | T(w) | T−1(p) |
![]() | ||
| Tolerance function/inverse | T(w) | T−1(p) |
![]() | ||
a Here, p and w represent the probability of some quantal response and exposure, respectively. These functions can be generalized by defining w = β0 + β1hΛ(x), where hΛ(x) is any monotonic transformation function in x.
Conventional two‐stage prediction of LC100p* from auxiliary covariates
We consider the more general case where the efficacy or toxicity of an agent may be affected by one or more auxiliary covariates that are not intrinsically toxic or are confounded with the primary agent by design. Typically, the data are generated from multiple clusters, which are aggregations of EUs that might reasonably be correlated. For example, tests—separated in space or time—might be treated as clusters. The auxiliary covariates may vary among clusters. Let j index the J clusters. Conventional ad hoc prediction of LC100p* from the auxiliary covariates is done in two stages. The first stage requires the fitting of J independent conventional quantal‐response prevalence models, given by

to produce J estimates of LC100p*, denoted by X̂CONVj(p*). Note that those first‐stage models require estimation of a total of 2J parameters. For the second stage, the J estimates of LC100p* are treated as observed data in generalized linear models of the form

where g2(·) is some variance function [17] defining any heteroscedasticity in Var[X̂CONVj(p*)] = σ2g2(·) and Aij(θ0 + z'ijθ) is a function of parameter θ0, a vector of M ancillary covariates zij = [Zij, …, ZMj]' and parameter‐vector θ, where z' denotes the transpose of z. For example, if g2(·) = 1 and A(·) is the identity function, then Equation 5b reduces to the ordinary linear regression model:

where Var[X̂j(p*)] = Var(εj) = σ2. The conventional two‐stage predictor of LC100p*, denoted X̂2STG(p*), is obtained from the fitting of Equation 5b as

Note that the X̂CONVj(p*) are predicted with error from Equation 5a; therefore, Equation 5b is an errors‐in‐response model. The parameters of such models are not universally identifiable without additional information, but the necessary conditions are well known [18] and satisfied by ordinary linear regression. The second‐stage model requires the estimation of the M + 1 structural parameters in A(·), σ2, plus any parameters in g2(·). Therefore, the conventional two‐stage prediction of LC100p* from ancillary covariates requires the estimation of a total of at least 2J + M + 2 parameters.
Formulas for the marginal expectation of LC100p*, denoted X̂MARG (p*), obtained from solutions to Equation 9 for h1(Xij) = Xij and h0(Xij) = log(Xij + 1)a
| Tolerance function | h1(Xij) = Xij | h0(Xij) = log(Xij + 1) |
![]() | ||
| Tolerance function | h1(Xij) = Xij | h0(Xij) = log(Xij + 1) |
![]() | ||
a For a single random effect, uj = uj, v'ij∑vij = σ2.
Formulas for the marginal expectation of LC100p*, denoted X̂MARG (p*), obtained from solutions to Equation 9 for h1(Xij) = Xij and h0(Xij) = log(Xij + 1)a
| Tolerance function | h1(Xij) = Xij | h0(Xij) = log(Xij + 1) |
![]() | ||
| Tolerance function | h1(Xij) = Xij | h0(Xij) = log(Xij + 1) |
![]() | ||
a For a single random effect, uj = uj, v'ij∑vij = σ2.
Careful scrutiny also reveals that this superficially intuitive two‐stage model does not seem to represent any coherent and natural underlying data‐generating process. The first stage mimics various binomial data‐generating processes that are explicitly independent of any auxiliary covariates. In contrast, the second stage represents data generation by auxiliary covariates, thereby contradicting the first‐stage processes. The two‐stage model might yield satisfactory results in some cases, but it apparently lacks a logical theoretical basis.
New models for prediction of LC100p* from ancillary covariates
We propose a general and more parsimonious class of normal‐binomial models given by

where i indexes EUs, j indexes clusters, uj is a q × 1 vector of random effects for the jth cluster with design vector vij; and ∑ denotes the variance‐covariance matrix of the uj. The uj are estimated from the data. Here, θ0 is excluded from the argument of Aij(·), where it would be confounded with β0. These models contain multiple components of variance; the conditional variance within clusters is Var(Yij|uj) = nijpij (1 — pij) and the among‐cluster variances and covariances are contained in ∑. Furthermore, they require estimation of only M + 2 + rank(∑) parameters, which generally will be far fewer than the conventional two‐stage approach.
The model given by Equation 7 is conditional on the uj and produces cluster‐specific predictions P̂ij. The resulting conditional predictor of LC100p* is given by

Note that if ûj = 0 for all j, then the argument of h−1Λ(·) in Equation 8 simplifies to the customary estimate of LC100p* (Eqn. 3) multiplied by Aij(z'ijθ). The conditional model is the fundamental representation of the data [19], but the resulting cluster‐specific predictions are not useful for prediction of stream‐treatment requirements. For that purpose, the marginal (integrated over all of the cluster‐specific random effects) expectation is needed for prediction of LC100p* outside the specific set of tests and is given by

where F(û) is the multivariate normal cumulative distribution function of û. Algebraic solutions to Equation 9 exist for the most common choices of h1(X) = X and h0(X) = log(X + 1) (Table 2). Otherwise, the marginal expected LC100p* must be obtained using numerical integration.
The model defined by Equation 7 is general and can be adapted to accommodate complex study designs. For example, variance might be partitioned across three or more hierarchical levels (e.g., individuals within tanks, among tanks within tests, and among tests) [20] that requires only modification of the specification of the random‐effects formulation v'ijuj and covariance matrix ∑. Other tolerance functions (e.g., cumulative Weibull) also might be incorporated but are not considered further here.
Alternative formulations
Our model includes random effects on the scale of β0. Random effects may enter at the scale of other parameters (i.e., β1). We did not consider those alternatives further for lack of need in our data. The parameters β0 and β1 tend to be highly correlated in tolerance functions; therefore, there may be little need or ability to model random effects for both parameters.
Note, from Equation 7, the multiplicative effect of the auxiliary variables zij through the product Xij × [Aij(z'ijθ)]−1. Therefore, the auxiliary variables do not exert any identifiable effect on pij when Xij = 0. That formulation is sensible for auxiliary variables that affect the potency of the primary agent X but that otherwise are inert within the range of the data or are confounded with X by design. In contrast, an additive conditional inverse tolerance function, defined by T−1 (pij) = β0 + v'ijuj + β1Xij + A(z'ijθ), permits the auxiliary variables to exert effects independently of Xij so that pij may vary systematically even when Xij 0. An additive or hybrid formulation may be an appropriate alternative for studies of multiple toxins. For example, suppose that the concentrations of two toxins X1ij and X2ij, each affected by nontoxic covariates zij, are crossed in a factorial design. A plausible hybrid formulation for quantal‐response function is

which yields two nonunique marginal LC100p* estimates given by

This formulation is easily extended to the case where X1 and X2 are affected by different covariates. We did not consider such additive or hybrid formulations for our application, because TFM and niclosamide concentrations were confounded by the design and pH is nontoxic within the range of the data.
Identification of a form for the effect of the ancillary covariates
An appropriate form for Aij(z'ijθ) may be identified from existing knowledge or the relation between the conventional LC50 estimates and the Zmij in the data. Note that from Equation 8, hΛ[X̂j(p* | zij, uj)] is proportional to Aij(z'ijθ). The choice p* = 0.5 is convenient for preliminary identification of Aij(z'ijθ), but other choices also are viable. The X̂CONVj(0.5), computed independently from each test using Equation 4, can be viewed as näive preliminary estimates of the X̂j(0.5 | zij,uj) in Equation 8 to identify a plausible form for Aij(z'ijθ). For example where h1(X) = X and plots of the log[X̂CONVj(0.5)] versus each of the M ancillary covariates Zmij reveal linear scatters, then

Careful exploratory analyses will be needed to identify an appropriate form for Aij(z'ijθ) in each study.
Parameter estimation
The model given by Equation 7 is an example of a generalized nonlinear mixed‐effects model and can be fitted using maximum pseudolikelihood estimation [21], Bayesian Markov‐Chain Monte Carlo sampling [22], or maximum likelihood estimation [23] using commonly available software. Maximum likelihood estimates can be obtained using SAS NLMIXED software or J.K. Lindsey's repeated package for R [24] (opensource software: http://cran.r‐project.org/ and http://popgen.unimaas.nl/∼jlindsey/rcode.html). We demonstrate model fitting using the SAS NLMIXED package, which deploys the method of Pinheiro and Bates using Gaussian quadrature to integrate the likelihood over the random effects. Standard errors and Wald‐type frequentist prediction intervals on the marginal LC100p* estimates Equation 9 are approximated by NLMIXED using the delta method [25] and prediction standard errors [26].
Acute toxicity tests on sea lamprey
The motivating example for the present study was a set of 38 tests designed to assess the effects of pH and niclosamide augmentation on the toxicity of TFM. The test agents were isopropanol solution of the sodium salt formulation of TFM, with an average concentration of 38% active ingredient (Lamrecid®; Farbwerke Hoechst AG, Frankfurt, Germany), and the aminoethanol salt formulation of niclosamide, with an average niclosamide concentration of 59% (Bayluscide®; ProServe, Menphis, TN, USA). Sea lamprey ammocetes (80–120 mm) were collected from the Ford River (MI, USA) and held for a minimum of 14 d before testing to eliminate any effect of delayed handling mortality from the toxicity trials.
All toxicity testing followed standard protocols [27,28]. Thirty‐eight tests were performed as 19 paired runs using a single, continuous‐flow, serially diluting delivery system similar to that described by Garton [29]. The delivery system consisted of two parallel but independently controllable series, each consisting of 10 glass exposure tanks. Tanks were the EUs and, in each series, were independently randomized to treatments. Each test consisted of gradients of TFM concentration sufficient to produce mortalities from 0 to 100% at each pH (7, 8, and 9) (Table 3). For each run, one series was randomly selected for augmentation with niclosamide, and the other served as a niclosamide‐free test. The niclosamide augmentations ranged from 0.125 to 2.000% of the TFM mass concentration. Therefore, TFM concentration and niclosamide addition were multiplicatively confounded in this design. Ten healthy sea lamprey ammocetes were randomized to each tank 24 h before the start of each test. The 19 matched pairs of tests were performed in a random temporal sequence. Each test consisted of 12 h of exposure, followed by 12 h of postexposure observation for delayed mortality. Dead sea lamprey were removed hourly from each tank.
Well water was used for all exposures. Water temperature (mean, 12.5°C; range, 11.9–13.4°C), pH, and dissolved O2 concentration (mean, 9.1 mg/L; range, 8.4–9.6 mg/L) were measured hourly in the tanks during the exposures. Total alkalinity (mean, 108 mg/L as CaCO3; range, 82–129 mg/L of CaCO3) and total hardness (mean, 150 mg/L as CaCO3; range, 145–159 mg/L as CaCO3) were measured every 4 h in the control tanks [30]. The desired pH of the test water was obtained by metering dilute acid (0.5 M HCl) or base (0.5 M NaOH) into the diluter headbox using a Masterflex® peristaltic pump (Cole‐Parmer, Vernon Hills, IL, USA). Water samples were collected hourly, and TFM concentrations were measured using a Beckman DU‐640 spectrophotometer (Beckman Coulter, Fullerton, CA, USA). Niclosamide concentrations were measured every 2 h by high‐performance liquid chromatography using a Millennium Series 2010 LC Module I Plus (Waters, Milford, MA, USA).
Realized 3‐trifluoromethyl‐4‐nitrophenol (TFM) concentrations to which sea lamprey (Petromyzon marinus) ammocetes were exposeda
| TFM (mg/L) | |||
| Experimental unit | Gradient 1 (pH 7) | Gradient 2 (pH 8) | Gradient 3 (pH 9) |
| 1 | 0.00 | 0.00 | 0.00 |
| 2 | 0.09 | 0.19 | 0.26 |
| 3 | 0.37 | 0.26 | 0.48 |
| 4 | 0.41 | 0.54 | 0.75 |
| 5 | 0.80 | 0.82 | 1.11 |
| 6 | 1.06 | 1.02 | 2.25 |
| 7 | 1.35 | 1.70 | |
| 8 | 1.88 | 2.15 | |
| TFM (mg/L) | |||
| Experimental unit | Gradient 1 (pH 7) | Gradient 2 (pH 8) | Gradient 3 (pH 9) |
| 1 | 0.00 | 0.00 | 0.00 |
| 2 | 0.09 | 0.19 | 0.26 |
| 3 | 0.37 | 0.26 | 0.48 |
| 4 | 0.41 | 0.54 | 0.75 |
| 5 | 0.80 | 0.82 | 1.11 |
| 6 | 1.06 | 1.02 | 2.25 |
| 7 | 1.35 | 1.70 | |
| 8 | 1.88 | 2.15 | |
a Tests were defined as combinations of pH and niclosamide treatment. Each test required a series of six or eight tanks comprising pH‐specific TFM gradients. At each pH, paired tests were conducted in which niclosamide was added to each tank of one series, and the other series served as a niclosamide‐free treatment. The target niclosamide additions were 0.125, 0.250, 0.500, 0.750, 1.000, 1.500, and 2.000% of the mass of the TFM concentration; however, the 0.125 and 1.500% niclosamide additions were omitted at pH 9 for lack of ammocetes. Therefore, 19 combinations of pH and niclosamide augmentation plus 19 niclosamide‐free tests were used. Actual niclosamide additions varied somewhat from the nominal targets and were used in all analyses. The numbering of experimental units (tanks) is arbitrary; treatments were randomized to tanks in each test.
Realized 3‐trifluoromethyl‐4‐nitrophenol (TFM) concentrations to which sea lamprey (Petromyzon marinus) ammocetes were exposeda
| TFM (mg/L) | |||
| Experimental unit | Gradient 1 (pH 7) | Gradient 2 (pH 8) | Gradient 3 (pH 9) |
| 1 | 0.00 | 0.00 | 0.00 |
| 2 | 0.09 | 0.19 | 0.26 |
| 3 | 0.37 | 0.26 | 0.48 |
| 4 | 0.41 | 0.54 | 0.75 |
| 5 | 0.80 | 0.82 | 1.11 |
| 6 | 1.06 | 1.02 | 2.25 |
| 7 | 1.35 | 1.70 | |
| 8 | 1.88 | 2.15 | |
| TFM (mg/L) | |||
| Experimental unit | Gradient 1 (pH 7) | Gradient 2 (pH 8) | Gradient 3 (pH 9) |
| 1 | 0.00 | 0.00 | 0.00 |
| 2 | 0.09 | 0.19 | 0.26 |
| 3 | 0.37 | 0.26 | 0.48 |
| 4 | 0.41 | 0.54 | 0.75 |
| 5 | 0.80 | 0.82 | 1.11 |
| 6 | 1.06 | 1.02 | 2.25 |
| 7 | 1.35 | 1.70 | |
| 8 | 1.88 | 2.15 | |
a Tests were defined as combinations of pH and niclosamide treatment. Each test required a series of six or eight tanks comprising pH‐specific TFM gradients. At each pH, paired tests were conducted in which niclosamide was added to each tank of one series, and the other series served as a niclosamide‐free treatment. The target niclosamide additions were 0.125, 0.250, 0.500, 0.750, 1.000, 1.500, and 2.000% of the mass of the TFM concentration; however, the 0.125 and 1.500% niclosamide additions were omitted at pH 9 for lack of ammocetes. Therefore, 19 combinations of pH and niclosamide augmentation plus 19 niclosamide‐free tests were used. Actual niclosamide additions varied somewhat from the nominal targets and were used in all analyses. The numbering of experimental units (tanks) is arbitrary; treatments were randomized to tanks in each test.
Application of the new models to the sea lamprey toxicity tests
Each test was sufficient to independently compute X̂CONVj(p*) for each particular combination of pH and niclosamide augmentation. Preliminary analyses of the 38 X̂CONVj(0.5) indicated that the LC50 of TFM varied approximately exponentially with a quadratic polynomial in niclosamide augmentation Z1 and pH Z2 so that

was a reasonable form. Combining Equations 7 and 10, appropriate models that include a single random effect (q = 1) are given by

where σ2 = ∑ denotes among‐cluster variation in β0. From Equation 9, the formulas for X̂MARG(p*)for our choices of T(·) and hΛ(Xij) are given by substitution of the right‐hand side of Equation 10 for A(z'ijθ) in Table 2. For computational convenience, we centered pH at 8 so that Z2 = pH — 8. In preliminary analyses, we fitted the nonlinear mixed‐effects model using tests, runs (matched pairs of tests), and run × tank combinations as clusters. Based on Akaike's information criterion (AIC; smaller values are better) [31], the choice of tests for cluster definition produced, by far, the best models, and we adopted that choice for subsequent analyses. We fitted logistic, probit, and cumulative extreme‐value model variations for h1(Xij) = Xij and h0(Xij) = log(Xij + 1). Example SAS NLMIXED code for fitting the models given by Equation 11 and marginal prediction of LC100p* is given in the online‐only Appendix (http://dx.doi.org/10.1897/06–630.S1).
In practice, uncertainty about model choice cannot be eliminated. We accommodated model uncertainty by frequentist AIC‐weighted averaging [32,33] of the resulting six marginal predictors X̂MARG(p*)and their 95% prediction limits.
Model comparison
For comparison we mimicked the conventional two‐stage practice used in the sea lamprey control program. We first obtained the conventional estimates of LC50 and LC99.9 from Equation 5a. For the second stage, we fitted the generalized linear model given by

to obtain X̂2STG(p*). Each of the 38 X̂CONVj(0.5) required estimation of two parameters, and the secondary model based on Equation 10 required estimation of six additional parameters (θ0, …, θ4, and σ2). The resulting 39 models can be viewed as a single, 82‐parameter composite model. We obtained the AIC of the 82 parameter model as the sum of the AIC scores of the 38 fittings of Equation 1 plus the AIC from the nonlinear regression of LC50 estimates on pH and niclosamide augmentation. We compared the performance of that 82 parameter composite model with our seven‐parameter, nonlinear mixed‐effects probit model using AIC; differences larger than 10 may be interpreted as strong evidence [33] of the superiority of our nonlinear mixed‐effects model.
We compared the predictive performance of X̂MARG(p*)and X̂2STG(p*) using Monte Carlo simulation. We defined a fully determined hypothetical population based on the predictor variables and parameter estimates from the sea lamprey study. Therefore, the true population LC100p* values for each cluster, denoted Xj(p*), were fixed and known in the simulation. We used Equation 11 with Λ = 1 and the Gaussian tolerance function as our mortality‐generating model. For each of σ2 = 0 and σ2 = 1.24, we sampled 1,000 replicate sets of mortality counts Yij, i = 1, …, 10, and j = 1, …, 38, using independent normal and binomial random‐number generators for uij and Yij, respectively. The simulation in which σ2 was fixed at zero enables comparison without risk of misspecification of amongcluster random variation. We predicted LC50, LC99.9, and their Wald‐type prediction intervals for each cluster within each replicate using X̂MARG(p*) and X̂2STG(p*). We estimated the predictive accuracy of X̂MARG(p*) and X̂2STG(p*) using BiasE = (1/J) ∑Jj=1 [X̂Ej(p*) — Xj(p*)], where E ε {MARG, 2STG}, and we estimated their precision using RMSE of prediction (RMSEE =
for each Monte Carlo replicate. We estimated coverage probabilities of the 95% Wald‐type prediction intervals as Pr(coverage) = (1/1,000J) ∑1,000r=1 ∑Jj=1 I[Xjr(p*) ε {SEjr}] where I(·) ε {0, 1} is an indicator function for its argument and {SEjr} is the estimated Wald‐type prediction interval for X̂Ejr(p*). Perfect prediction intervals have coverage probabilities that achieve their nominal values.
Parameter estimates and Akaike's information criterion (AIC) scores for the six nonlinear mixed‐effects models of effects of 3‐trifluoromethyl‐4‐nitrophenol (TFM) concentration, niclosamide fraction, and pH on mortality of juvenile sea lampreys (Petromyzon marinus)a
| hΛ(X) = [TFM] | hΛ(X) = log([TFM] + 1) | |||||
| Parameter | Logistic | Gaussian | CEV | Logistic | Gaussian | CEV |
| β0 | ‐18.53 (1.12) | ‐10.13 (0.56) | ‐12.75 (0.71) | ‐29.29 (1.76) | ‐15.66 (0.83) | ‐19.27 (1.08) |
| β1 | 9.55 (0.62) | 5.22 (0.31) | 6.29 (0.38) | 27.25 (1.68) | 14.61 (0.81) | 17.47 (1.03) |
| θ1 | −0.85 (0.09) | −0.84 (0.09) | −0.83 (0.09) | −0.56 (0.06) | −0.55 (0.06) | −0.55 (0.06) |
| θ2 | 1.30 (0.02) | 1.30 (0.02) | 1.30 (0.02) | 0.79 (0.01) | 0.79 (0.02) | 0.78 (0.02) |
| θ3 | 0.22 (0.04) | 0.22 (0.04) | 0.22 (0.04) | 0.16 (0.03) | 0.15 (0.03) | 0.15 (0.03) |
| θ4 | 0.18 (0.04) | 0.18 (0.04) | 0.18 (0.04) | −0.03 (0.02) | −0.03 (0.02) | −0.02 (0.02) |
| σ2 | 4.12 (1.16) | 1.24 (0.34) | 2.01 (0.55) | 4.20 (1.21) | 1.25 (0.36) | 1.97 (0.56) |
| AIC | 332.0 | 332.6 | 362.4 | 347.9 | 354.4 | 374.2 |
| hΛ(X) = [TFM] | hΛ(X) = log([TFM] + 1) | |||||
| Parameter | Logistic | Gaussian | CEV | Logistic | Gaussian | CEV |
| β0 | ‐18.53 (1.12) | ‐10.13 (0.56) | ‐12.75 (0.71) | ‐29.29 (1.76) | ‐15.66 (0.83) | ‐19.27 (1.08) |
| β1 | 9.55 (0.62) | 5.22 (0.31) | 6.29 (0.38) | 27.25 (1.68) | 14.61 (0.81) | 17.47 (1.03) |
| θ1 | −0.85 (0.09) | −0.84 (0.09) | −0.83 (0.09) | −0.56 (0.06) | −0.55 (0.06) | −0.55 (0.06) |
| θ2 | 1.30 (0.02) | 1.30 (0.02) | 1.30 (0.02) | 0.79 (0.01) | 0.79 (0.02) | 0.78 (0.02) |
| θ3 | 0.22 (0.04) | 0.22 (0.04) | 0.22 (0.04) | 0.16 (0.03) | 0.15 (0.03) | 0.15 (0.03) |
| θ4 | 0.18 (0.04) | 0.18 (0.04) | 0.18 (0.04) | −0.03 (0.02) | −0.03 (0.02) | −0.02 (0.02) |
| σ2 | 4.12 (1.16) | 1.24 (0.34) | 2.01 (0.55) | 4.20 (1.21) | 1.25 (0.36) | 1.97 (0.56) |
| AIC | 332.0 | 332.6 | 362.4 | 347.9 | 354.4 | 374.2 |
a The models are the combinations of two functions of TFM concentration and the logistic, Gaussian, and cumulative extreme‐value (CEV) tolerance functions. Values in parentheses represent the standard error.
Parameter estimates and Akaike's information criterion (AIC) scores for the six nonlinear mixed‐effects models of effects of 3‐trifluoromethyl‐4‐nitrophenol (TFM) concentration, niclosamide fraction, and pH on mortality of juvenile sea lampreys (Petromyzon marinus)a
| hΛ(X) = [TFM] | hΛ(X) = log([TFM] + 1) | |||||
| Parameter | Logistic | Gaussian | CEV | Logistic | Gaussian | CEV |
| β0 | ‐18.53 (1.12) | ‐10.13 (0.56) | ‐12.75 (0.71) | ‐29.29 (1.76) | ‐15.66 (0.83) | ‐19.27 (1.08) |
| β1 | 9.55 (0.62) | 5.22 (0.31) | 6.29 (0.38) | 27.25 (1.68) | 14.61 (0.81) | 17.47 (1.03) |
| θ1 | −0.85 (0.09) | −0.84 (0.09) | −0.83 (0.09) | −0.56 (0.06) | −0.55 (0.06) | −0.55 (0.06) |
| θ2 | 1.30 (0.02) | 1.30 (0.02) | 1.30 (0.02) | 0.79 (0.01) | 0.79 (0.02) | 0.78 (0.02) |
| θ3 | 0.22 (0.04) | 0.22 (0.04) | 0.22 (0.04) | 0.16 (0.03) | 0.15 (0.03) | 0.15 (0.03) |
| θ4 | 0.18 (0.04) | 0.18 (0.04) | 0.18 (0.04) | −0.03 (0.02) | −0.03 (0.02) | −0.02 (0.02) |
| σ2 | 4.12 (1.16) | 1.24 (0.34) | 2.01 (0.55) | 4.20 (1.21) | 1.25 (0.36) | 1.97 (0.56) |
| AIC | 332.0 | 332.6 | 362.4 | 347.9 | 354.4 | 374.2 |
| hΛ(X) = [TFM] | hΛ(X) = log([TFM] + 1) | |||||
| Parameter | Logistic | Gaussian | CEV | Logistic | Gaussian | CEV |
| β0 | ‐18.53 (1.12) | ‐10.13 (0.56) | ‐12.75 (0.71) | ‐29.29 (1.76) | ‐15.66 (0.83) | ‐19.27 (1.08) |
| β1 | 9.55 (0.62) | 5.22 (0.31) | 6.29 (0.38) | 27.25 (1.68) | 14.61 (0.81) | 17.47 (1.03) |
| θ1 | −0.85 (0.09) | −0.84 (0.09) | −0.83 (0.09) | −0.56 (0.06) | −0.55 (0.06) | −0.55 (0.06) |
| θ2 | 1.30 (0.02) | 1.30 (0.02) | 1.30 (0.02) | 0.79 (0.01) | 0.79 (0.02) | 0.78 (0.02) |
| θ3 | 0.22 (0.04) | 0.22 (0.04) | 0.22 (0.04) | 0.16 (0.03) | 0.15 (0.03) | 0.15 (0.03) |
| θ4 | 0.18 (0.04) | 0.18 (0.04) | 0.18 (0.04) | −0.03 (0.02) | −0.03 (0.02) | −0.02 (0.02) |
| σ2 | 4.12 (1.16) | 1.24 (0.34) | 2.01 (0.55) | 4.20 (1.21) | 1.25 (0.36) | 1.97 (0.56) |
| AIC | 332.0 | 332.6 | 362.4 | 347.9 | 354.4 | 374.2 |
a The models are the combinations of two functions of TFM concentration and the logistic, Gaussian, and cumulative extreme‐value (CEV) tolerance functions. Values in parentheses represent the standard error.
RESULTS AND DISCUSSION
Partial kills of larval sea lamprey were observed in fewer than 20% of the EUs. The remaining EUs were nearly equally split between 0 and 100% mortality rates. The large proportions of data at the survival extremes aided precision and the ability to estimate LC100p* for extreme values of p*. These data, however, also challenge estimation of the variance of X̂CONVj(p*). For example, Fieller's theorem, which is the commonly used basis for estimation of confidence intervals for X̂CONVj(p*), lacked real‐valued solutions in 16 of the 38 tests and, therefore, was not useful for these data. The delta‐method approximation also failed to produce confidence intervals for X̂CONVj(p*) from 8 of the 38 tests, because positive‐definite estimates of covariance matrices could not be obtained.
Predictions of LC100p* of TFM for sea lamprey
Our nonlinear mixed‐effects models based on TFM concentration fit better than models based on log(TFM), and the logistic and Gaussian tolerance functions fit better than the cumulative extreme‐value function (Table 4). The AIC‐weighted marginal predictor X̂MARG(p*) was dominated by the logistic and Gaussian tolerance functions, because those were vastly superior for our data based on AIC. All seven parameters were important and estimated with reasonable precision. The among‐cluster variances, σ2, ranged from 1.2 to 4.2, and their confidence intervals never included zero, confirming the importance of modeling that source of variation. No evidence of any systematic departure from the data was found
Although pH in the range of 7 to 9 is not toxic to larval sea lamprey, pH exerts a strong influence on the toxicity of TFM. Niclosamide augmentation, within the range used in the present study, has relatively little effect on the toxicity of TFM to larval sea lamprey at pH 7, but such augmentation substantially enhances toxicity at pH values greater than 8 (Fig. 1). Generally, little difference was found between LC50 and LC99.9 and, hence, little margin for error in the application of those formulations. These results underscore the importance of precise in situ measurement of pH before each treatment of sea lamprey spawning tributaries to identify the most efficient and effective combination of TFM and niclosamide. Estimates of precision are critical to both regulatory requirements [8] and identification of the plausible range of outcomes from any treatment application. The widths of the pointwise 95% intervals on our marginal predictions of LC50 and LC99.9 were small relative to those predictions (Fig. 1). Our model provides precise predictions of LC50 and LC99.9 at any pH from 7 through 9 and any addition of niclosamide from 0 through 2% of the mass of TFM.
Comparison of the new and conventional predictors
The two‐stage predictor based on the Gaussian tolerance function X̂2STG(0.5) required estimation of 82 parameters and had an AIC score of 373.0. Based on the resulting AIC difference of 40.4, X̂MARG(p*) for the Gaussian tolerance function (AIC = 332.6) (Table 4) outperformed X̂2STG(p*) by a wide margin. The superior AIC of the generalized nonlinear mixed model was entirely the result of its parsimony. The maximized log‐likelihood of the two‐stage model was −104.5 and exceeded the value of −186.5 from the generalized nonlinear mixed model. It does not matter, however, whether an AIC difference between two models is caused by the number of parameters or by the values of the maximized log‐likelihoods, because AIC is an omnibus measure that approximates the Kullback‐Liebler distance between an approximating model and the unknown truth it represents. We should expect better predictions from those models that have smaller AIC values.
The marginal predictor based on the generalized nonlinear mixed probit model, X̂MARG(p*), clearly outperformed the conventional two‐stage predictor X̂2STG(0.5) in the Monte Carlo simulations. The 95% prediction intervals for X̂2STG(p*) were too narrow (Table 5). In contrast, the 95% intervals for X̂MARG(p*) achieved the nominal coverage for σ2 = 0 but were somewhat too narrow for σ2 = 1.24. The median RMSEs of prediction for X̂MARG(p*) were 43 and 83% smaller than those for X̂2STG(p*) for p* = 0.5 and p* = 0.999, respectively, when σ2 = 0 (Fig. 2). The marginal predictor X̂MARG(p*) was unbiased, whereas X̂2STG(0.999) underestimated the LC99.9 of the TFM‐niclosamide formulations by approximately 0.26 mg/L. The RMSEs of prediction forX̂2STG(p*) were more variable and, sometimes, unacceptably large at σ2 = 1.24 (Fig. 3). Again, the median RMSEs of prediction were 29 and 45% smaller for p* = 0.5 and p* = 0.999, respectively, for X̂MARG(p*) than for X̂2STG(p*). The biases of both X̂2STG(p*) and X̂MARG(p*) tended to be negligibly small, but X̂2STG(0.999) usually underestimated LC99.9. Furthermore, the small RMSE and negligible bias of X̂MARG(0.999) in the Monte Carlo simulation for σ2 = 1.24 clearly indicate that LC99.9 can be predicted with good accuracy and precision from trials similar to the sea lamprey study.
![Predictions of concentrations lethal to 50 and 99.9% of exposed organisms (LC50 and LC99.9, respectively) of 3‐trifluoromethyl‐4‐nitrophenol (TFM) for larval sea lamprey (Petromyzon marinus) based on Akaike's information criterion (AIC)‐weighted averages of marginal predictors X̂MARG(p*) and the widths of their pointwise 95% prediction intervals (PIs; 95% PI width). The model averaging spanned the six combinations of the logistic, Gaussian, and cumulative extreme‐value tolerance functions each based on [TFM] and log([TFM] + 1).](https://oup.silverchair-cdn.com/oup/backfile/Content_public/Journal/etc/26/9/10.1897_06-630R.1/3/m_5620260925_ftp_mfig001.gif?Expires=1743020247&Signature=yz3~1NL77X5xKB3sjUaconuHtfp8ZN0jNS6OZseFtnrblcXityi6py6k3-NeHLmo-zf8tr3sx5kZ0GAcs2M40ALIkvbafkGLiRVSISbonA-g9639Na4dRZxQPkCkNHPSLcZyhMy~H-JigkNDEB1bGfr1tyG9vT5LZLVF~Qyj0CIbdscaTMx6aCL3ibapOMM~Da80k5hcN-MV7Sb0Mim5nYbshy1taq0tt0fF5uc7r9LkRYl~xCvQxP9S-l-aDk8b-XEbJ0tMqk-ibNvAqknxOHxMaLbblKdN17ZA7U4Oh8KGH-eppfy9BnYLK-3bc6U~R2jXc9gih9YM7XkCPyDgfQ__&Key-Pair-Id=APKAIE5G5CRDK6RD3PGA)
Predictions of concentrations lethal to 50 and 99.9% of exposed organisms (LC50 and LC99.9, respectively) of 3‐trifluoromethyl‐4‐nitrophenol (TFM) for larval sea lamprey (Petromyzon marinus) based on Akaike's information criterion (AIC)‐weighted averages of marginal predictors X̂MARG(p*) and the widths of their pointwise 95% prediction intervals (PIs; 95% PI width). The model averaging spanned the six combinations of the logistic, Gaussian, and cumulative extreme‐value tolerance functions each based on [TFM] and log([TFM] + 1).
Coverage probabilities for 95% prediction intervals on the concentrations expected to kill 50 and 99.9% of the organisms (LC50 and LC99.9, respectively) in arbitrary populations based on X̂2STG(p*) and X̂MARG(p*)a
| Predictor | |||
| Among‐cluster variance | Predicted exposure | X̂2STG(p*) | X̂MARG(p*) |
| σ2 = 0 | LC50 (p* = 0.5) | 0.88 | 0.95 |
| LC99.9 (p* = 0.999) | 0.74 | 0.94 | |
| σ2 = 1.24 | LC50 (p* = 0.5) | 0.88 | 0.90 |
| LC99.9 (p* = 0.999) | 0.84 | 0.90 | |
| Predictor | |||
| Among‐cluster variance | Predicted exposure | X̂2STG(p*) | X̂MARG(p*) |
| σ2 = 0 | LC50 (p* = 0.5) | 0.88 | 0.95 |
| LC99.9 (p* = 0.999) | 0.74 | 0.94 | |
| σ2 = 1.24 | LC50 (p* = 0.5) | 0.88 | 0.90 |
| LC99.9 (p* = 0.999) | 0.84 | 0.90 | |
a Coverage probabilities were estimated from Monte Carlo simulations consisting of 1,000 random replicates of the sea lamprey (Petromyzon marinus) trials for each choice of σ2. The correct coverage probability for a 95% prediction interval is 0.95. Nominal 95% prediction intervals having coverage probability less than 0.95 err by being too narrow.
Coverage probabilities for 95% prediction intervals on the concentrations expected to kill 50 and 99.9% of the organisms (LC50 and LC99.9, respectively) in arbitrary populations based on X̂2STG(p*) and X̂MARG(p*)a
| Predictor | |||
| Among‐cluster variance | Predicted exposure | X̂2STG(p*) | X̂MARG(p*) |
| σ2 = 0 | LC50 (p* = 0.5) | 0.88 | 0.95 |
| LC99.9 (p* = 0.999) | 0.74 | 0.94 | |
| σ2 = 1.24 | LC50 (p* = 0.5) | 0.88 | 0.90 |
| LC99.9 (p* = 0.999) | 0.84 | 0.90 | |
| Predictor | |||
| Among‐cluster variance | Predicted exposure | X̂2STG(p*) | X̂MARG(p*) |
| σ2 = 0 | LC50 (p* = 0.5) | 0.88 | 0.95 |
| LC99.9 (p* = 0.999) | 0.74 | 0.94 | |
| σ2 = 1.24 | LC50 (p* = 0.5) | 0.88 | 0.90 |
| LC99.9 (p* = 0.999) | 0.84 | 0.90 | |
a Coverage probabilities were estimated from Monte Carlo simulations consisting of 1,000 random replicates of the sea lamprey (Petromyzon marinus) trials for each choice of σ2. The correct coverage probability for a 95% prediction interval is 0.95. Nominal 95% prediction intervals having coverage probability less than 0.95 err by being too narrow.
Additional considerations
We relied on the asymptotic properties of the delta‐method approximation of 95% prediction intervals. Based on our Monte Carlo simulation, that approximate method performed extremely well for X̂MARG(p*) when σ2 = 0, but it was unlikely to achieve the nominal coverage for the sea lamprey data for which σ = 1.24. The delta‐method approximation also performed reasonably well for estimation of confidence intervals for the median lethal dose in simulation studies for which J ⩾ 30 [34] and may be adequate for studies that include relatively large numbers of EUs. Clearly, however, room exists for improvement, and smaller studies may require other approaches [35,36]. A further refinement or evaluation of prediction intervals was beyond the scope of the present study.
Bias also is a critical issue. We examined estimation bias, assuming correct model specification, in our Monte Carlo simulations, and X̂MARG(p*) performed well. Bias also can result from model misspecification, particularly the form for A(z'θ) in Equations 5b and 7. We explored many parametric alternatives for A(z'θ) that did not perform as well as our final choice. Those alternatives included variations of the simple linear form Aij (z'θ) = θ1Z1ij + θ2Z2ij + θ3Z1ijZ2ij, exponential functions of those forms, and exponentials of a broad set of fractional polynomials [37]. The P̂ij produced by our choice for A(z'θ) showed no evidence of systematic bias in our data. The choice of A(z'θ), however, merits careful investigation in each study, and the value of careful exploratory analyses cannot be understated. Room exists here for additional development. For example, incorporation of nonparametric smoothers [38] for estimation of A(z) might more automatically minimize the risk of misspecification bias.
![Performance based on precision (root‐mean‐squared error [RMSE]) and accuracy (Bias) of the conventional two‐stage (X̂2STG) and the marginal mean (X̂MARG) predictors of concentrations lethal to 50 and 99.9% of exposed organisms (LC50 and LC99.9, respectively). Performance was estimated from 1,000 Monte Carlo replications of the 38 clusters of the sea lamprey (Petromyzon marinus) trials for which the among‐cluster variance was fixed at σ2 = 0. Boxes enclose the 25th and 75th percentiles, within which the medians are indicated by a horizontal line. The bars at the ends of the tails mark the extremes.](https://oup.silverchair-cdn.com/oup/backfile/Content_public/Journal/etc/26/9/10.1897_06-630R.1/3/m_5620260925_ftp_mfig002.gif?Expires=1743020247&Signature=ebyRIw1t7LjK2IKDgT8MWrT1nFY6xwnsNDw8gwY9RJPx2SxPAET8hfvmeMWuy2FxndaG6Pz8XoRjwpQG-1NcRnZSM9Kqd4Kd4OYuJC6SqXf3Q862ShI1wejNAPBI3QihvEN62cETZ92Uce9H0Cjl6fff9I2JM8djKNWDfDOvflQt2tHIvCKiOpxPSS6iju9RPLoa895CaPtUUMo4JfOxJExjttv70bIiYycr5lAeKHUTFS9~WFvRzbVvaUQGT6t7t1TcapjgroAr~yF2JWTSO3YzNq9UxLSY-9pz26r8-Jken3d3utx6skN3xXCI8sONDmS~oB~2xonUDy9J0G27vA__&Key-Pair-Id=APKAIE5G5CRDK6RD3PGA)
Performance based on precision (root‐mean‐squared error [RMSE]) and accuracy (Bias) of the conventional two‐stage (X̂2STG) and the marginal mean (X̂MARG) predictors of concentrations lethal to 50 and 99.9% of exposed organisms (LC50 and LC99.9, respectively). Performance was estimated from 1,000 Monte Carlo replications of the 38 clusters of the sea lamprey (Petromyzon marinus) trials for which the among‐cluster variance was fixed at σ2 = 0. Boxes enclose the 25th and 75th percentiles, within which the medians are indicated by a horizontal line. The bars at the ends of the tails mark the extremes.
CONCLUSION
We formulated a class of nonlinear mixed‐effects statistical models for the estimation of LC100p* from complex studies. Marginal predictions of LC100p*, which support inference over the range of the predictor variables, are easily obtained using only commonly available commercial or open‐source statistical software. We demonstrated six models from that class with the inclusion of two auxiliary predictor variables and two variance components for prediction of effective lampricide formulations from pH. The models outperformed the conventional practice of groupwise estimation of LC100p* followed by secondary modeling of effects of covariates on those estimates by producing more accurate and precise AIC‐weighted averaged predictions of LC50 and LC99.9 in our simulations.
The model class is flexible and is easily extended to accommodate additional features. For example, in other studies, variance might be partitioned across three or more hierarchical levels (e.g., individuals within tanks, among tanks within tests, and among tests). Such additional components of variance require only modification of the specification of the random‐effects formulation. Also, the choice of a function that adequately represents the effect of the covariates is critical to minimization of bias in estimates of LC100p*. That choice should be carefully evaluated for each study.
![Performance based on precision (root‐mean‐squared error [RMSE]) and accuracy (Bias) of the conventional two‐stage (X̂2STG) and the marginal mean (X̂MARG) predictors of (concentrations lethal to 50 and 99.9% of organisms (LC50 and LC99.9, respectively). Performance was estimated from 1,000 Monte Carlo replications of the 38 clusters of the sea lamprey (Petromyzon marinus) trials for which the among‐cluster variance was fixed at σ2 = 1.24. Boxes enclose the 25th and 75th percentiles, within which the medians are indicated by a horizontal line. The bars at the ends of the tails mark the extremes. Note that RMSE is scaled logarithmically for visual clarity.](https://oup.silverchair-cdn.com/oup/backfile/Content_public/Journal/etc/26/9/10.1897_06-630R.1/3/m_5620260925_ftp_mfig003.gif?Expires=1743020247&Signature=UNywoyt5wP0Lg0xHpxrzR2VhFH7Co6yTDlO-VFPZr0IJ2tsi2IYAu4xj221zMV3HRCGGU2Kk77dz7BXfAJU0eQbBNTqHfpLR60LHAGuEIRG60Kd4wMsgUT1wjKOU35BOCkmtgjuMymdrSaD5~e61eAlhQoK1LYnma-HZcZ3EvM1pJrRV2fUWyUaycG5jmNeankEYT2hXcjQBkPSceSBHkXE65KPswuWdtAle2vgZoFsZy~Hw2ls1tu-e6TtcL8rSznVyCdoFovKOUBioPppnRs5NVax2BES1UtAdKHvJ5-tW1SA1SwhEmzgoQIgiet65c5~wfxk3dOUc~eyqz3NtNg__&Key-Pair-Id=APKAIE5G5CRDK6RD3PGA)
Performance based on precision (root‐mean‐squared error [RMSE]) and accuracy (Bias) of the conventional two‐stage (X̂2STG) and the marginal mean (X̂MARG) predictors of (concentrations lethal to 50 and 99.9% of organisms (LC50 and LC99.9, respectively). Performance was estimated from 1,000 Monte Carlo replications of the 38 clusters of the sea lamprey (Petromyzon marinus) trials for which the among‐cluster variance was fixed at σ2 = 1.24. Boxes enclose the 25th and 75th percentiles, within which the medians are indicated by a horizontal line. The bars at the ends of the tails mark the extremes. Note that RMSE is scaled logarithmically for visual clarity.
SUPPORTING INFORMATION
Appendix S1. Example SAS® NLMIXED (Ver 9.1) code for estimation in Equation 11.
Found at DOI: 10.1897/06–630.S1 (23 KB PDF).
Acknowledgements
This work was supported by the Great Lakes Fishery Commission, the U.S. Geological Survey Fisheries and Endangered Aquatic Resources Program in collaboration with the U.S. Fish and Wildlife Service, Hammond Bay Biological Station, and Fisheries and Oceans Canada. We thank Jean V. Adams, Brian R. Gray, and the anonymous reviewers.

