Abstract

Stand structures from a combined density manipulation and even- to uneven-aged conversion experiment on the Bartlett Experimental Forest (New Hampshire, USA) were examined 25 years after initial treatment for rotated sigmoidal diameter distributions. A comparison was made on these stands between two probability density functions for fitting these residual structures: the Burr Type III and a finite mixture of Weibulls. All stands exhibited some form of uneven-aged structure (either reverse J or rotated sigmoid) after 25 years. The Burr distribution fit the final distributions as well as the more complicated finite mixture model in all cases.

Introduction

As we learn more about the structure of uneven-aged forests, we find that often the diameter distribution structure encountered may not fit the widely accepted, quintessential reverse J-shaped model. An example of such a departure is the so-called rotated sigmoid form. Rotated sigmoid-shaped tree diameter distributions have a slight to pronounced plateau or even a mild hump in the mid-diameter range of the distribution, often associated with small sawtimber size classes. These structures, when plotted with the logarithm of number of trees vs diameter, show a more exaggerated plateau or bump in the mid-diameters. In contrast, reverse J-shaped distributions plot as a straight line on the log scale and have a smooth, constant negative exponential decrease in number of trees from small to large diameters. Such distributions are often characterized by de Liocourt's q quotient (de Liocourt, 1898).

A consequence of the departure from the theoretically reverse J condition as found in stands characterized by rotated sigmoid structures is that traditional diameter distribution models are generally not flexible enough to model the plateau or hump in the mid-diameter classes (e.g. Zhang et al., 2001). Traditional models such as the negative exponential (Meyer, 1952; Leak, 1964) do well with reverse J-shaped stands following a given q. Weibull distributions (Bailey and Dell, 1973) contain the negative exponential as a special case, but offer a wider range of reverse J and unimodal shapes. Both of these models are relatively easy to work with and have been used extensively in forestry in the past several decades. Other distribution models, such as Johnson's SB have been put forth (Hafley and Schreuder, 1977), but mostly as general models, not specifically aimed at uneven-aged stands, and their use is somewhat more complicated. Recently, Zhang et al. (2001) have shown that finite mixtures of Weibull distributions may be useful in modelling rotated sigmoid distributions. Finite mixture distributions can take on a wide variety of shapes, including bimodal structures (Liu et al., 2001), but they can also be more demanding to work with, at least in the estimation phase.

While rotated sigmoid distributions seem to turn up with increasing frequency in uneven-aged management, the cause for their genesis still appears to be open to research. Apparently open also, is the question of whether or not they are characteristic of an equilibrium, or steady-state, condition. For example, Goff and West (1975) argued that the rotated sigmoid form is a characteristic equilibrium condition in old-growth stands. These authors supported their hypothesis by examining 49 small old-growth stands, which they found to be characteristic of one of several rotated sigmoid forms. This evidence also seems to be supported by various modelling exercises. For example, growth models for northern hardwoods that have been set in the context of a non-linear optimization have produced rotated sigmoid structures when they have not been constrained otherwise (e.g. Adams and Ek, 1974; Gove and Fairweather, 1992). Similarly, Hansen and Nyland (1987) used simulations to arrive at a diameter distribution of pure sugar maple (Acer saccharum Marsh.) that was of the rotated sigmoid form. Both the optimization and simulation methods resulted in target stands that were sustainable in these modelling studies.

Other authors report that stands that appear as rotated sigmoid have been caused by disturbance of one form or another. Lorimer and Frelich (1984), for example, contend that the existence of a plateau or hump in such stands is due to past disturbance and provide simulations that support their contention that the plateau will disappear in time. Leak (1996), who examined several managed and old-growth stands, also notes that the existence of a rotated sigmoid structure may be due to past disturbance; however, he does not conjecture as to the equilibrium status of such stands. In a 20-year simulation study with several combinations of growth and mortality schedules acting on different initial stand structures, Leak (2002) found that different combinations would lead to, or maintain, rotated sigmoidal structures over time, while others would maintain or create a balanced negative exponential distribution. These simulations were designed to address the origins and maintenance of rotated sigmoid stands over the period of a cutting cycle, rather than the long-run steady state.

In Europe, the wisdom of strong management adherence to reverse J conditions is also being questioned, while rotated sigmoid structures are evidently beginning to be considered (Kerr and O'Hara, 2000; Westphal et al., 2006). An example of this is the plenter management system, which often prescribes rotated sigmoid structures (O'Hara and Gersonde, 2004). Plentering has been around for over a century, and is described as a fully irregular forest, characterized by stems of all sizes and age classes (Schütz, 1997a). Anthropogenic in origin, plenter stands begin with a reverse J structure as the initial aim, which is then adjusted based on a simple growth model biased towards large-tree retention, with the overall goal of sustainability and stand volume maximization (Schütz, 1997b). Schütz (1997a, b) contends that there must be continuous surveillance to establish and realize sustainability of the target structure. Evidently, however, once established, it is largely self-regulating; therefore, the rotated sigmoid structures defined by plentering appear to be disturbance-oriented initially, with apparent longer-term steady-state properties (Schütz, 1997a).

Whatever the cause of rotated sigmoid stand structure—and there may be many—it is clear that these are important distributional forms in uneven-aged management and silviculture. Equally important then, is the ability to model such structures by relatively straightforward methods. In this paper, we apply a particular form of the Burr distribution, known as the Type III Burr for use in fitting rotated sigmoid diameter distributions. It is compared against a more flexible, but more complicated, finite mixture model. Both models are applied to 25-year remeasurements from a single long-term density management case study located in the White Mountains of central New Hampshire, to help assess the usefulness of the Burr Type III distribution in modelling rotated sigmoid stand structures. We look at the untransformed rather than the log scale as this is what is used for management and growth simulation models. Although distributions that appear reverse J, but with steep initial slope in the smaller classes, and nearly level numbers in the larger classes also plot as rotated sigmoid on the log scale, we restrict attention here to the stand structures found in this study, all of which appear rotated sigmoid on the untransformed scale.

Study area and methods

The Bartlett density study

The Bartlett density study was established in 1964 in a second-growth northern hardwood stand on the Bartlett Experimental Forest in Bartlett, New Hampshire, USA. The stand originated in the late 1800s from clearcutting. At study inception, it was a typical even-aged northern hardwood stand composed primarily of American beech (Fagus grandifolia Ehrh.), red maple (Acer rubrum L.) and paper birch (Betula papyrifera Marsh), with minor components of sugar maple (A. saccharum Marsh.), white ash (Fraxinus americana L.) and yellow birch (Betula alleghaniensis Britton); scattered stems and localized clusters of eastern white pine (Pinus strobus, L.), red spruce (Picea rubens Sarg.) and eastern hemlock (Tsuga canadensis (L.) Carr.) also occur. The soils in the study area were predominantly well-drained, loamy sands derived from granite; over time, beech with an admixture of hemlock has become the predominant climax species on these soils (Leak, 1982). The results from this study apply particularly to stand/site conditions similar to these. The original purpose for the study was to twofold: (1) to determine treatment combinations that would efficiently convert even-aged northern hardwood stands to an uneven-aged structure and (2) to determine the level of stand density that produced the best residual growth.

Prior to treatment, 48, 1/3-acre plots, (0.13 ha) plots were established, with 50 ft (∼15 m) wide buffers on all sides. The plots and adjoining buffers were treated conformably. Twelve treatment levels were applied in a randomized design resulting in four replications per treatment. The treatment levels correspond to four residual basal area density levels of 40, 60, 80 and 100 ft2 acre−1 (9.2, 13.8, 18.4, 23 m2 ha−1, respectively) and three levels of residual stand structure at 30, 45 and 60 per cent sawtimber. The residual per cent sawtimber levels were in terms of basal area of trees larger than 10.5 in (26.7 cm) diameter at breast height (d.b.h.). Unfortunately, no pre-treatment inventory was undertaken on the study area, hence we have no information on the pre-treatment stand structure. However, post-treatment (1964) measurements and several subsequent remeasurements of all trees greater than 4.5 in (11.4 cm) d.b.h. have been conducted. Initial post-treatment stand structures for the 12 treatments are shown in Figure 1. Note that some of the stands show a clear rotated sigmoidal structure for the merchantable stems, while others are unimodal, or even approaching bimodal. Note also that the maximum d.b.h. in the stand post-treatment was 19 in (48 cm). The stand was treated a second time in 1990, but not consistently with the first treatment. Therefore, we concentrate on the stand structures after the first 25 years of growth (the 1989 remeasurement), just prior to retreatment, and attempt to address the success of the conversion strategy to uneven-aged structure.

Figure 1.

Residual 1964 post-treatment stand structures by treatment (basal area density:per cent sawtimber) on the Bartlett density study. (Note: 1 in equals 2.54 cm.)

The Burr distribution

A flexible family of probability distributions, which can be derived from a single differential equation, was developed by Burr (1942). Two members of the family, the Burr Types III and XII have been introduced to forestry by Lindsay et al. (1996). These distributions are important because they are inherently more flexible than the Weibull, which is often used in forestry applications. Both the Burr Types III and XII cover a much larger area of the skewness–kurtosis plane than the Weibull (Rodriguez, 1977; Tadikamalla, 1980; Fry, 1993; Lindsay et al., 1996), with the Type III being the most flexible of the three. In this paper, we therefore restrict attention to the Type III Burr.

To derive several members of this family of distributions, Burr (1942) considered the simple differential equation on the cumulative distribution function (CDF) and a shaping function g(x), with basic form
graphic
where 0 ≤ F ≤ 1 is the CDF. When this expression is integrated, the solution becomes
graphic
Letting g(x) = cx−1 and generalizing the inverse exponent above to –k produces the CDF for the Type III form of the distribution presented by Burr (1942). Burr (1942) suggests that the CDF can be further modified by adding scale and location parameters for greater flexibility. Doing so yields the form of the Burr Type III CDF given by Lindsay et al. (1996) as
graphic
with shape parameters c, k > 0, location parameter a < x and scale parameter b such that θ = (a,b,c,k)′; the CDF is zero otherwise. The corresponding probability density function (PDF) is
graphic

This particular version of the Burr distribution is notable, not only in the general flexibility of form mentioned above, but specifically because it is able to fit certain rotated sigmoid distributional forms. Figure 2 presents a set of Type III PDFs with scale parameters 5 ≤ c ≤ 9 and 0.09 ≤ k ≤ 0.13, illustrating a number of possible rotated sigmoid and reverse J shapes over a scaled region of the random variable x. Given this range of k, distributions with c < 5 tend more to reverse J shaped, while those with c > 9 tend to be more unimodal.

Figure 2.

Examples of possible rotated sigmoid structures for the Burr Type III distribution with shape parameters 5 ≤ c ≤ 9 and 0.09 ≤ k ≤ 0.13, over the scaled random variable x on the abscissa.

Parameter estimation was accomplished via maximum likelihood (ML). The log-likelihood for the Burr Type III is particularly simple and is given as
graphic
where n is the sample size. This expression can be maximized directly for the parameter estimatesforumla In addition, the location parameter, a, can be held fixed and the expression l(x;θ) is then maximized for the remaining three parameters.

The Burr Type III distribution has been used in other fields and goes by various synonyms. For example, in economics it is known as the Dagum distribution (Dagum, 1977), and McDonald (1984) noted that it was related to the generalized beta of the second kind. Mielke (1973) and Mielke and Johnson (1974) have used this in meteorology and water resources applications and shown it to be a special case of their Kappa distribution. (Mielke and Johnson, 1973; Tadikamalla, 1980).

Finite mixtures of Weibull distributions

In general, finite mixtures are composed of g individual PDFs, not necessarily all of the same form or family, which are individually ‘mixed’, or weighted by proportions 0 ≤ πi ≤ 1 such that forumla and . Here, we use Weibull densities as the components of the mixture and take g = 2 as in Zhang et al. (2001). The three-parameter Weibull PDF is given as
graphic
where a < x is the location parameter, with scale and shape parameters b > 0 and c > 0, respectively, and ξ = (a,b,c)′. Note that these parameters are distinct from those in the Burr distribution presented earlier and we assume in what follows, that the context will allow differentiation between the different models. The corresponding CDF is simply
graphic
In general, the g-component mixture density is defined as
graphic
where fi(x;ξi) is the ith component density, i = 1, …, g. Note that each component density is parameterized in terms of an m-dimensional vector of parameters ξi such that Ξ = (ξ1,ξ2, …,ξg). Therefore, collecting all parameters, Ψ = (π,Ξ). The log-likelihood follows naturally as (McLachlan and Peel, 2000, p. 47)
graphic
where random vector x = (x1, …, xn) denotes a sample of diameters. Conceptually, parameter estimation for FMMs is straightforward, but tedious, as it is complicated by multimodality of the likelihood surface (McLachlan and Peel, 2000, p. 54). We maximize the log-likelihood indirectly by driving the sums of squares of the gradient equations to zero using the following objective
graphic
(1)
where in the second term, the derivatives with respect to ξi,k are the individual parameter derivatives with respect to the kth parameter, summed over all g mixture components. As with the Burr Type III, the location parameter, a, for each Weibull component density may be fixed rather than estimated, in which case, the respective gradient components do not contribute to the second term above. One constraint was applied to this minimization problem to ensure that the means for each component were ordered in ascending order; that is, forumla This constraint is merely a convenience to assure that the component PDFs appear to increase in X and does not affect the resulting solutions.
For the two-component mixture used in this study, π2 = 1 – π1. Therefore, our mixture density is simply
graphic
In general, estimating all g proportions makes for an overdetermined system; therefore, only π1 is estimated in (1) with this two-component mixture. Hence, in the following, we simply drop the subscript and refer to π as the known proportion, or forumla as its MLE. The full vector of estimated parameters under the two-component Weibull mixture is forumla Therefore, if both location parameters are to be estimated, this FMM requires seven total parameters to be estimated whereas the Burr only requires four.

Results

Figure 3 presents the histograms and fitted Burr and FMM PDFs for the Bartlett density study in 1989; the histograms may be compared directly with Figure 1. The first point to notice is with regard to the stand structural evolution over the 25-year growth period from 1964 to 1989. As previously mentioned, the initial stand diameter distributions after treatment ranged from unimodal rotated sigmoid to bimodal, to a few that do not fall readily into any apparent smooth form. However, over the 25-year growth period, growth and mortality have smoothed the stand structures such that the terminal stand conditions fall into one of two reasonably well-defined structures; i.e. either rotated sigmoid or approximate reverse J shaped with the former predominating. In addition, the maximum tree d.b.h., has increased from 19 in to 26.3 in (66.8 cm) over the 25-year period.

Figure 3.

Stand structures with fitted Burr (solid) and FMM (dashed, component PDFs light dot dashes) by treatment (basal area density:per cent sawtimber) after 25 years of growth on the Bartlett density study. (Note: 1 in equals 2.54 cm.)

The well-defined stand structures at the end of the growth period facilitated the estimation of the proposed PDF models in Figure 3. Perhaps the most striking result of the model fits is that the Burr and FMM curves are so similar for every treatment level. Indeed, in a few of the treatments, such as the 100-ft2 level, the curves appear to be almost coincident. For the other stands, the trend seems to be that the Burr decreases more rapidly in the smaller diameter classes, and does not reach as well-defined a ‘plateau’ in the larger classes as the FMMs. The only exception to this is the 80:45 treatment, where the appearance of a small spike in the number of trees in the 9-inch class is enough to cause the FMM to actually develop a mode in this region.

In general, the higher basal area treatments tend to all have developed well-defined rotated sigmoidal diameter distributions 25 years after treatment (Figure 3). However, the low-density (40 ft2) treatments have a somewhat different characteristic. Note that in the 40:30 treatment, the Burr tends to fit a slight ‘hump’ in the mid-diameter range, but in the FMM this is less pronounced. This slight difference between the two model fits may point to a stand in transition to a reverse J-shaped form. The other two residual sawtimber treatments in this low-density set of treatments show a clearly defined transition to approximate reverse J in both of the model fits. Note that the stand is not in the quintessential reverse J condition since the FMM is still composed of two components: if the stand were exactly negative exponential, then the best fit would be a single Weibull density with unity shape parameter. Instead, both the Burr and the FMM PDFs tend to have a slightly smaller negative slope in the mid-diameter classes than a reverse J.

The best fits for both the Burr (Table 1) and FMMs (Table 2) were with fixed location parameters. In the case of the Burr distribution, parameter estimates converged with the location parameter free in all cases, but the Akaike Information Criteria (AIC) (Burnham and Anderson, 1998) was minimized in the limit as this parameter is shifted towards the minimum diameter value. Thus, a value close to the minimum was fixed as shown in the table. The minimum diameter value itself cannot be used due to numerical considerations in the likelihood equations in both the Burr and FMMs. It is well-known that a simple three-parameter Weibull distribution, with location, shape and scale parameters is more challenging to fit than the two-parameter version without the location parameter (Murthy et al., 2004, p. 81). Therefore, it should not be surprising that, when desiring to estimate two free Weibull location parameters in a FMM context, the challenge becomes somewhat Herculean. Indeed, estimates for only a few of the stands converged after numerous attempts with differing starting values when estimating the location parameters as well. It should be noted that in those cases where the free location fits did converge, the corresponding fixed location fits proved better, as in the case with the Burr. Thus, the location parameters for the FMMs were also fixed at the same value as the Burr (Table 2). Even with this constraint, the resulting FMM likelihoods can be multimodal, as will be discussed subsequently.

Table 1:

ML parameter estimates for the Burr Type III distribution by treatment in 1989 (location parameters fixed)

Treatmentaforumlaforumlaforumla
40:304.4950.1086.6668.801
40:454.4950.1783.6998.088
40:604.4950.1823.8287.895
60:304.4950.0848.4229.703
60:454.4950.05810.80211.291
60:604.4950.0738.83811.492
80:304.4950.1176.9069.634
80:454.4950.0779.20610.422
80:604.4950.0729.50512.206
100:304.4950.1386.1888.887
100:454.4950.1296.0779.993
100:604.4950.0938.25213.015
Treatmentaforumlaforumlaforumla
40:304.4950.1086.6668.801
40:454.4950.1783.6998.088
40:604.4950.1823.8287.895
60:304.4950.0848.4229.703
60:454.4950.05810.80211.291
60:604.4950.0738.83811.492
80:304.4950.1176.9069.634
80:454.4950.0779.20610.422
80:604.4950.0729.50512.206
100:304.4950.1386.1888.887
100:454.4950.1296.0779.993
100:604.4950.0938.25213.015
Table 1:

ML parameter estimates for the Burr Type III distribution by treatment in 1989 (location parameters fixed)

Treatmentaforumlaforumlaforumla
40:304.4950.1086.6668.801
40:454.4950.1783.6998.088
40:604.4950.1823.8287.895
60:304.4950.0848.4229.703
60:454.4950.05810.80211.291
60:604.4950.0738.83811.492
80:304.4950.1176.9069.634
80:454.4950.0779.20610.422
80:604.4950.0729.50512.206
100:304.4950.1386.1888.887
100:454.4950.1296.0779.993
100:604.4950.0938.25213.015
Treatmentaforumlaforumlaforumla
40:304.4950.1086.6668.801
40:454.4950.1783.6998.088
40:604.4950.1823.8287.895
60:304.4950.0848.4229.703
60:454.4950.05810.80211.291
60:604.4950.0738.83811.492
80:304.4950.1176.9069.634
80:454.4950.0779.20610.422
80:604.4950.0729.50512.206
100:304.4950.1386.1888.887
100:454.4950.1296.0779.993
100:604.4950.0938.25213.015
Table 2:

ML parameter estimates for the FMM distribution by treatment in 1989 (location parameters fixed)

Treatmentforumlaa1forumla1forumla1a2forumla2forumla2
40:300.5714.4950.9932.2344.4952.2466.609
40:450.1934.4950.6320.9634.4951.1754.569
40:600.1664.4950.6951.2074.4951.1664.339
60:300.4824.4950.9561.7794.4952.5747.100
60:450.6324.4951.0182.2094.4952.9468.484
60:600.6794.4950.9762.6614.4953.2559.265
80:300.5614.4950.9303.0824.4952.8137.156
80:450.2654.4950.7011.1854.4951.9046.280
80:600.6024.4951.0252.5414.4953.1829.615
100:300.3724.4950.9992.0884.4952.0506.421
100:450.4754.4950.9962.4764.4952.0997.425
100:600.4784.4950.9892.9264.4952.4369.452
Treatmentforumlaa1forumla1forumla1a2forumla2forumla2
40:300.5714.4950.9932.2344.4952.2466.609
40:450.1934.4950.6320.9634.4951.1754.569
40:600.1664.4950.6951.2074.4951.1664.339
60:300.4824.4950.9561.7794.4952.5747.100
60:450.6324.4951.0182.2094.4952.9468.484
60:600.6794.4950.9762.6614.4953.2559.265
80:300.5614.4950.9303.0824.4952.8137.156
80:450.2654.4950.7011.1854.4951.9046.280
80:600.6024.4951.0252.5414.4953.1829.615
100:300.3724.4950.9992.0884.4952.0506.421
100:450.4754.4950.9962.4764.4952.0997.425
100:600.4784.4950.9892.9264.4952.4369.452
Table 2:

ML parameter estimates for the FMM distribution by treatment in 1989 (location parameters fixed)

Treatmentforumlaa1forumla1forumla1a2forumla2forumla2
40:300.5714.4950.9932.2344.4952.2466.609
40:450.1934.4950.6320.9634.4951.1754.569
40:600.1664.4950.6951.2074.4951.1664.339
60:300.4824.4950.9561.7794.4952.5747.100
60:450.6324.4951.0182.2094.4952.9468.484
60:600.6794.4950.9762.6614.4953.2559.265
80:300.5614.4950.9303.0824.4952.8137.156
80:450.2654.4950.7011.1854.4951.9046.280
80:600.6024.4951.0252.5414.4953.1829.615
100:300.3724.4950.9992.0884.4952.0506.421
100:450.4754.4950.9962.4764.4952.0997.425
100:600.4784.4950.9892.9264.4952.4369.452
Treatmentforumlaa1forumla1forumla1a2forumla2forumla2
40:300.5714.4950.9932.2344.4952.2466.609
40:450.1934.4950.6320.9634.4951.1754.569
40:600.1664.4950.6951.2074.4951.1664.339
60:300.4824.4950.9561.7794.4952.5747.100
60:450.6324.4951.0182.2094.4952.9468.484
60:600.6794.4950.9762.6614.4953.2559.265
80:300.5614.4950.9303.0824.4952.8137.156
80:450.2654.4950.7011.1854.4951.9046.280
80:600.6024.4951.0252.5414.4953.1829.615
100:300.3724.4950.9992.0884.4952.0506.421
100:450.4754.4950.9962.4764.4952.0997.425
100:600.4784.4950.9892.9264.4952.4369.452
Many indexes of fit have been used for judging the adequateness of PDF models. For example, Zhang et al. (2001) used both root mean square error and likelihood ratio χ2 statistics. Information–theoretic approaches, however, provide another, perhaps better, alternative as detailed in Burnham and Anderson (1998). These authors suggest several such criteria and provide a very cogent discussion of their strengths. In general, these criteria are applicable to both nested and non-nested models, which is often not the case with other statistics (Burnham and Anderson, 1998, p. 63). They also have an appealing interpretation of providing a method for selecting the best model from a pool of candidates based on the ‘strength of evidence’ from the data supporting a given model. We use AIC since the ratio of n/K is large for each treatment (Burnham and Anderson, 1998, p. 76); the model with the minimum AIC value is the best of the two candidate models. AIC takes a very simple form that is easy to calculate from the optimization results (Burnham and Anderson, 1998, p. 60).
graphic
where K is the number of estimated model parameters in each case. Because we have fixed the location parameters, these do not count in the determination of AIC for either model. Therefore, K = 3 for the Burr Type III, while K = 5 for the Weibull FMM. Notice that the number of parameters acts as a type of penalty on the likelihood, so that models with fewer parameters benefit. Table 3 shows the AIC values for both models and for each of the treatments. This table also indicates the ‘best’ model based on choosing the model with lowest AIC value. It is interesting to note, that the simpler Burr model tends to be preferred in all but two cases. However, the AIC values for the two models are quite close in many of the treatments—in fact all—suggesting that either model could be used. This conclusion is validated by the fitted curves in Figure 3, where, for most practical purposes, the results are quite similar.
Table 3:

AIC and RMSE (in trees per acre) values for the Burr and FMM fits by treatment in 1989

AICRMSE
TreatmentFMMBurrBest fitFMMBurr
40:301399.31394.2Burr2.63.3
40:451349.51345.7Burr1.82.2
40:601410.41411.3FMM2.73.1
60:301439.61434.8Burr2.02.6
60:451571.81577.2FMM2.03.0
60:601384.41380.4Burr1.82.6
80:301525.21517.5Burr2.32.5
80:451540.91532.1Burr2.32.1
80:601352.21351.0Burr1.82.5
100:301905.81901.0Burr3.03.2
100:451678.91673.5Burr2.63.0
100:601382.21374.4Burr1.21.4
AICRMSE
TreatmentFMMBurrBest fitFMMBurr
40:301399.31394.2Burr2.63.3
40:451349.51345.7Burr1.82.2
40:601410.41411.3FMM2.73.1
60:301439.61434.8Burr2.02.6
60:451571.81577.2FMM2.03.0
60:601384.41380.4Burr1.82.6
80:301525.21517.5Burr2.32.5
80:451540.91532.1Burr2.32.1
80:601352.21351.0Burr1.82.5
100:301905.81901.0Burr3.03.2
100:451678.91673.5Burr2.63.0
100:601382.21374.4Burr1.21.4
Table 3:

AIC and RMSE (in trees per acre) values for the Burr and FMM fits by treatment in 1989

AICRMSE
TreatmentFMMBurrBest fitFMMBurr
40:301399.31394.2Burr2.63.3
40:451349.51345.7Burr1.82.2
40:601410.41411.3FMM2.73.1
60:301439.61434.8Burr2.02.6
60:451571.81577.2FMM2.03.0
60:601384.41380.4Burr1.82.6
80:301525.21517.5Burr2.32.5
80:451540.91532.1Burr2.32.1
80:601352.21351.0Burr1.82.5
100:301905.81901.0Burr3.03.2
100:451678.91673.5Burr2.63.0
100:601382.21374.4Burr1.21.4
AICRMSE
TreatmentFMMBurrBest fitFMMBurr
40:301399.31394.2Burr2.63.3
40:451349.51345.7Burr1.82.2
40:601410.41411.3FMM2.73.1
60:301439.61434.8Burr2.02.6
60:451571.81577.2FMM2.03.0
60:601384.41380.4Burr1.82.6
80:301525.21517.5Burr2.32.5
80:451540.91532.1Burr2.32.1
80:601352.21351.0Burr1.82.5
100:301905.81901.0Burr3.03.2
100:451678.91673.5Burr2.63.0
100:601382.21374.4Burr1.21.4

It is also instructive to look at the so-called residuals from each of the model fits. These are defined as simply the predicted minus the actual number of trees for each model and treatment, by diameter class and are shown in Figure 4. These plots essentially echo what can be gleaned from the differences between the histograms and the fitted curves in Figure 3. They do, however, help to highlight the fact that the major differences between the model predictions and the actual data are in the smaller diameter classes. They also show that, with few exceptions, there are only minor differences between the two models in all treatments, representing only a few trees per acre at worst in any given diameter class.

Figure 4.

Residuals from fitted Burr (solid) and FMM (dashed) by treatment (basal area density:per cent sawtimber). (Note: 1 in equals 2.54 cm.)

For completeness, we also present the root mean square error (RMSE) statistic used by Zhang et al. (2001). It is simply the square root of the mean-squared residuals and so, may be thought of as a summary index of the residual graphs, a standard deviation of sorts. The RMSE values are also presented in Table 3. In almost all cases, the RMSEs are slightly lower for the FMM. However, it should be noted that RMSE is in terms of trees per acre; therefore, the overall error is quite small. Indeed, if we average the RMSE values over all treatments, we find a difference of only 0.46 trees per acre (∼1 tree ha−1) between the two models. Again, these results support the conclusion that there is very little difference—none in practical terms—between the two models.

Discussion

Residual stand structure

The treatments applied in this study have resulted in two fairly distinct stand structures over time: those evidently approaching reverse J, and rotated sigmoid. The quasi-reverse J-shaped structures are characteristic of the low-density treatments, although the 40:30 treatment with low residual sawtimber component still has retained some of its sigmoidal form. As previously mentioned, all of the initial stand structures were somewhat similar post-treatment in 1964 (Figure 1). Therefore, the question arises as to why the low-density treatments have tended to the reverse J structure? Unfortunately, while density may play a role, it is undoubtedly not the sole driver, but may contribute subtly to changes in species composition in certain diameter classes whose growth rates could push the stand in one structural direction or the other. Trends from the remeasurements over time do, however, show that the 40- and 60-ft2 treatments have had a very large amount of ingrowth into the smallest diameter class, inflating the distribution in these lower diameter classes, while the higher density treatments show little change in these classes over time. Indeed, Leak and Gove (2007) have reported that the ingrowth in the 40- and 60-ft2 classes are two to three times that of the 80- and 100-ft2 classes in terms of basal area. However, no measurements of sapling or seedling size classes were taken until after the second treatment in 1990, so we have no early quantitative information on the structure and dynamics of these smaller trees.

Another interesting question is whether the rotated sigmoid structure may only be transitional in northern hardwoods, and that given enough time, they will eventually move to a more reverse J form regardless of initial density. Leak (1996) looked at a number of uneven-aged compartment studies on the Bartlett Experimental Forest, plus one old-growth stand in New Hampshire's White Mountains known as the Bowl. The Bowl showed a stand distribution that could only be characterized as reverse J, while the other stands had either reverse J or rotated sigmoidal structures. Leak (1996) concluded, in accordance with Schmelz and Lindsey (1965), that the rotated sigmoid form was caused by disturbance and that something close to reverse J was the steady-state condition for old-growth northern hardwoods. Unfortunately, we may not be able to answer this question with any degree of confidence based on remeasurements from the current study, since the retreatment in 1990 left only one replicate for long-term tracking of the original treatments. We can, however, address whether the initial treatments were successful in structural conversion from even- to uneven-aged stands. Whether transitional, or steady-state, both the reverse J and rotated sigmoid diameter distributions are now generally regarded as being indicative of the uneven-aged condition and all of our stands in the current study fit into one of these two structures. This, coupled with the fact that there are at least three extant age classes on the study (Leak and Solomon, 1975; Leak, 2004), allows us to conclude that at least one of the initial goals of the study was indeed successful after 25 years.

Stand structural modelling

The 25-year residual stand structures encountered in this study offer a good range of at least quasi steady-state structures encountered in uneven-aged stands. The simplicity and flexibility of the Burr Type III distribution commends it as useful model for reverse J and rotated sigmoidal stand structures, as well as many unimodal structures often found in even-aged stands. However, the Burr Type III does not possess the ability to contort, for example, to multiple modes like the FMMs do, making it less than suitable for some of the initial structures found, for example, in Figure 1. The ease with which the Burr Type III is able to be fit using simple optimization methods directly on the log-likelihood is also a very important feature of this distribution.

FMMs allow added flexibility in general when compared with simple probability models like the Burr. For one, as we have seen here (e.g. the 80:45 treatment), and has been shown more lucidly in other studies (Liu et al., 2001; Zhang et al., 2001), the ability to model bimodality in the diameter distribution is often useful, especially in mixed-species stands. In addition, Zasada and Cieszewski (2005) indicate that the component distributions might be used to discriminate other biologically relevant information, such as tree height classes. This may indeed be possible, as these authors demonstrate, depending on the component PDFs used and the type of information desired. It should be noted, however, that Zasada and Cieszewski (2005) were unable to use the Weibull distribution as a component model for their unimodal stands because the estimation method used, which attempts to estimate the means and standard deviations of the component PDFs rather than estimate the parameters directly (Macdonald and Pitcher, 1979), can fail when there is strong overlap in the component densities.

It would be interesting to think that the weight parameter played an intuitive role in the final form of the mixture distribution. For example, one would think that π would weight more heavily towards the reverse J-shaped component in the 40:45 and 40:60 treatments since that seems to be the predominant structure, but exactly the opposite is the case: the J-shaped component density (reverse J PDFs are those whose shape parameter is in the range 0 < c ≤ 1 for Weibulls) receives little weight in these cases (Table 2). This may be in part due to the multimodality of the likelihood—perhaps we did not find the overall optimum—but is more likely due to the fact that the model does not necessarily translate as well to biological interpretation as one would like to hope. In other words, it is fitting the data as best it can, with no constraints or preconceptions with respect to foreknowledge of inherent reverse J conditions.

The added flexibility of the FMM approach does also bring with it some associated concerns to be aware of that may make estimation more difficult than with simple non-mixture PDF models such as the Burr. First, as we have already mentioned, with distribution models like the Weibull, where there is an inherent difficulty fitting one of the parameters in the single-PDF setting, these results will be magnified in the mixture setting. With the three-parameter Weibull model, the concern is with the location parameters of the mixture. The inability of a time-tested non-linear solver such as GRG2 (Lasdon et al., 1978) to converge in all but a few treatments when the location parameters were free speaks to the difficulty of estimation in such situations. In those treatments where it did converge, only one of the components was selected (π ≡ 0 or 1). As mentioned earlier, Zasada and Cieszewski (2005) had similar problems fitting component Weibulls with the Macdonald and Pitcher (1979) approach.

A second potential concern with the mixture approach, which has also been mentioned previously, is the evident multimodality of the log-likelihood surface. Based on the experience of fitting the Weibull mixtures to the Bartlett density study data, it is important to try multiple starting values when endeavouring to find the optimum, as the likelihood surface appears to be multimodal, at least in some cases. In fact, it might not be a bad idea to embed the optimization within another directed search algorithm to try to find the global maximum; however, in such a scenario, it should be kept in mind that the embedded mixture optimization may not always converge from every starting point, and non-convergent results must be rejected. Finally, it is often the case that a likelihood solution mode is coincident with the boundary case of π = 0 or 1 and such solutions are easily distinguished. This is illustrated in Figure 5a, along with three other convergent solutions. These four solutions illustrate that there are indeed local optima in the likelihood surface for these data. Figure 5b shows the final solution with minimum AIC as reported in Table 2. Note that it is relatively easy to choose the best fitting FMM by eye based on these results. However, consider the case where the ML algorithm was run in a batch mode for fitting FMMs to many stands, say in a large modelling effort. This example illustrates the need for testing AIC values at multiple starting points in such a setting. If the likelihood surface were not suitably explored, one could end up with one of the less desirable fits shown in this figure. Again, to our knowledge, the Burr Type III likelihood does not suffer from such multimodality concerns.

Figure 5.

Four ML solutions for the FMM (heavy dashes, component densities light dot dash) in the 80:45 treatment illustrating the multimodality of the likelihood with AIC values (a) 1594.8, (b) 1540.9, (c) 1587.4 and (d) 1593.4. (Note: 1 in equals 2.54 cm.)

In all FMMs, the first component was some form of reverse J (or very close, e.g. the 60:45 and 80:60 treatments have forumla1>1, but only slightly) and the second possessed a clear mode as indicated by the respective shape parameters forumla1 and forumla2 (Table 2). These components are clearly visible in the component densities shown in Figure 3. It is interesting to note that correlation among the parameters of both distributions is high not only within the given distribution but also between the FMMs and Burrs in some cases (Table 4). For example, all three of the Burr parameters are fairly strongly correlated with both the scale and shape parameters of the second Weibull component (forumla2 and forumla2). These two Weibull parameters are also correlated to a lesser extent with the corresponding parameters for the first component in the mixture. In addition, the second component's shape parameter, forumla2, is very highly correlated with the FMM weight, forumla. It is difficult to draw any conclusions with regard to causality from these correlations. However, it might be inferred that the reason for the high correlations between the Burr and the second FMM component parameters goes back to Figure 2, where we note that the Burr is rotated sigmoidal in this range of parameters (compare with Table 1), and that the second Weibull component defines the associated plateau in the fitted CDF. Because the plateau in the rotated sigmoid fits is a strong definer of the overall distribution shape, it is also plausible that this might explain the correlation between the estimated FMM weight and forumla2. Regardless of the possible explanations, the relatively high correlations may be useful in subsequent modelling contexts where parameters are predicted from a combination of both stand variables and associated parameter estimates from previously fitted stands.

Table 4:

Correlations between parameter estimates over all treatments

FMMBurr
forumlaforumla1forumla1forumla2forumla2forumlaforumlaforumla
forumla1.00.860.820.940.84–0.750.690.62
forumla11.00.820.790.78–0.590.560.55
forumla11.00.770.79–0.470.420.62
forumla21.00.89–0.840.810.73
forumla21.0–0.810.790.93
forumla1.0–0.980.79
forumla1.00.80
forumla1.0
FMMBurr
forumlaforumla1forumla1forumla2forumla2forumlaforumlaforumla
forumla1.00.860.820.940.84–0.750.690.62
forumla11.00.820.790.78–0.590.560.55
forumla11.00.770.79–0.470.420.62
forumla21.00.89–0.840.810.73
forumla21.0–0.810.790.93
forumla1.0–0.980.79
forumla1.00.80
forumla1.0
Table 4:

Correlations between parameter estimates over all treatments

FMMBurr
forumlaforumla1forumla1forumla2forumla2forumlaforumlaforumla
forumla1.00.860.820.940.84–0.750.690.62
forumla11.00.820.790.78–0.590.560.55
forumla11.00.770.79–0.470.420.62
forumla21.00.89–0.840.810.73
forumla21.0–0.810.790.93
forumla1.0–0.980.79
forumla1.00.80
forumla1.0
FMMBurr
forumlaforumla1forumla1forumla2forumla2forumlaforumlaforumla
forumla1.00.860.820.940.84–0.750.690.62
forumla11.00.820.790.78–0.590.560.55
forumla11.00.770.79–0.470.420.62
forumla21.00.89–0.840.810.73
forumla21.0–0.810.790.93
forumla1.0–0.980.79
forumla1.00.80
forumla1.0

Conclusions

The diameter distributions for the northern hardwood stands in this study were equally well fit by both the Burr Type III and a finite mixture of Weibull distributions. We have shown that while the FMM distribution is a more flexible distribution for modelling overall, it is significantly more difficult to find correct parameter estimates for it than for the Burr. Therefore, while either method could be used for fitting reverse J-shaped to rotated sigmoidal stand structures similar to those found in this study, the Burr Type III distribution can be recommended in such stands based on parsimony and ease of fit for the modeller. If, however, a pronounced hump or other mode occurs in the mid range of the diameter distribution, finite mixtures will probably be a better choice.

The authors would like to thank Dr Kevin O’Hara for his helpful review of the manuscript. We would also like to thank Dr Dale S. Solomon (deceased) for his foresight and untiring work on establishing and running this study for almost 40 years.

Conflict of Interest Statement

None declared.

References

Adams
DM
Ek
AR
Optimizing the management of uneven-aged forest stands
Can. J. For. Res.
1974
, vol. 
4
 (pg. 
274
-
286
)
Bailey
RL
Dell
TR
Quantifying diameter distributions with the Weibull function
For. Sci.
1973
, vol. 
19
 (pg. 
97
-
104
)
Burnham
KP
Anderson
DR
Model Selection and Inference: a Practical Information-Theoretic Approach
1998
Springer
Burr
IW
Cumulative frequency functions
Ann. Math. Stat.
1942
, vol. 
13
 (pg. 
215
-
232
)
Dagum
C
A new model of personal income distribution: specifications and estimation
Econ. Apligue
1977
, vol. 
30
 (pg. 
413
-
436
)
de Liocourt
F
De l’amenagement des sapinières
1898
 
Technical Report, Bulletin trimestriel, Société forestière de Franche-Comtè et Belfort (English translation: http://www.snr.missouri.edu/forestry/larsen.html)
Fry
TRL
Univariate and multivariate Burr distributions: a survey
Pak. J. Stat.
1993
, vol. 
9
 (pg. 
1
-
24
)
Goff
FG
West
D
Canopy-understory interaction effects on forest population structure
For. Sci.
1975
, vol. 
21
 (pg. 
98
-
108
)
Gove
JH
Fairweather
SE
Optimizing the management of uneven-aged stands: a stochastic approach
For. Sci.
1992
, vol. 
38
 (pg. 
623
-
640
)
Hafley
WL
Schreuder
HT
Statistical distributions for fitting diameter and height data in even-aged stands
Can. J. For. Res.
1977
, vol. 
7
 (pg. 
481
-
487
)
Hansen
GD
Nyland
RD
Effects of diameter distribution on the growth of simulated uneven-aged sugar maple stands
Can. J. For. Res.
1987
, vol. 
17
 (pg. 
1
-
8
)
Kerr
G
O’Hara
KL
Uneven-aged silviculture: common myths explored
Q. J. For.
2000
, vol. 
94
 (pg. 
145
-
150
)
Lasdon
LS
Warren
AD
Jain
A
Ratner
M
Design and testing of a generalized reduced gradient code for nonlinear programming
ACM Trans. Math. Softw.
1978
, vol. 
4
 (pg. 
34
-
50
)
Leak
WB
The J-shaped probability distribution
For. Sci.
1964
, vol. 
11
 (pg. 
405
-
409
)
Leak
WB
Habitat mapping and interpretation in New England. Research Paper NE-496.
1982
pg. 
28
 
Leak
WB
Long-term structural change in uneven-aged northern hardwoods
For. Sci.
1996
, vol. 
42
 (pg. 
160
-
165
)
Leak
WB
Origin of sigmoid diameter distributions. Research Paper NE-718.
2002
Newtown Square, PA
USDA Forest Service, Northeastern Research Station
Leak
WB
Diameter growth in even- and uneven-aged northern hardwoods in New Hampshire under partial cutting
North. J. Appl. For.
2004
, vol. 
21
 (pg. 
160
-
163
)
Leak
WB
Gove
JH
Growth of northern hardwoods in New England—a 25-year update
North. J. Appl. For.
2007
 
(in press)
Leak
WB
Solomon
DS
Influence of residual stand density on regeneration of northern hardwoods. Research Paper NE-310.
1975
Upper Darby, PA
USDA Forest Service, Northeastern Research Station
Lindsay
SR
Wood
GR
Woollons
RC
Modelling the diameter distribution of forest stands using the Burr distribution
J. Appl. Stat.
1996
, vol. 
23
 (pg. 
609
-
619
)
Liu
C
Zhang
L
Davis
CJ
Solomon
DS
Gove
JH
A finite mixture model for characterizing the diameter distributions of mixed-species forest stands
For. Sci.
2001
, vol. 
48
 (pg. 
653
-
661
)
Lorimer
CG
Frelich
LE
A simulation of equilibrium diameter distributions of sugar maple (Acer saccharum)
Bull. Torrey Bot. Club
1984
, vol. 
111
 (pg. 
193
-
199
)
Macdonald
PDM
Pitcher
TJ
Age-groups from size-frequency data: a versatile and efficient method of analyzing distribution mixtures
J. Fish. Res. Board Can
1979
, vol. 
36
 (pg. 
987
-
1001
)
McDonald
JB
Some generalized functions for the size distribution of income
Econometrica
1984
, vol. 
52
 (pg. 
647
-
664
)
McLachlan
G
Peel
D
Finite Mixture Models
2000
Wiley
Meyer
HA
Structure, growth, and drain in balanced uneven-aged forests
J. For.
1952
, vol. 
50
 (pg. 
85
-
92
)
Mielke
PW
Another family of distributions for describing and analyzing precipitation data
J. Appl. Meteorol.
1973
, vol. 
12
 (pg. 
275
-
280
)
Mielke
PW
Johnson
ES
Three-parameter kappa distribution maximum likelihood estimates and likelihood ratio tests
Mon. Weather Rev.
1973
, vol. 
101
 (pg. 
701
-
707
)
Mielke
PW
Johnson
ES
Some generalized beta distributions of the second kind having desirable application in hydrology and meteorology
Water Resour. Res.
1974
, vol. 
10
 (pg. 
223
-
226
)
Murthy
DNP
Xie
M
Jiang
R
Weibull Models
2004
Wiley
O’Hara
KT
Gersonde
RF
Stocking control concepts in uneven-aged silviculture
Forestry
2004
, vol. 
77
 (pg. 
131
-
143
)
Rodriguez
RN
A guide to the Burr type XII distributions
Biometrika
1977
, vol. 
64
 (pg. 
129
-
134
)
Schmelz
DV
Lindsey
AA
Size-class structure of old-growth forests in Indiana
For. Sci.
1965
, vol. 
11
 (pg. 
258
-
264
)
Schütz
J-P
Emmingham
WH
The Swiss experience: more than one hundred years of experience with a single-tree selection management system in mountainous mixed forests of spruce, fir and beech from an empirically developed utilization in small-scale private forest to an elaborate and original concept of silviculture
Proceedings of the IUFRO Interdisciplinary Uneven-Aged Management Symposium
1997
(pg. 
21
-
34
)
Schütz
J-P
Emmingham
WH
Conditions of equilibrium in fully irregular, uneven-aged forests: the state-of-the-art in European plenter forests
Proceedings of the IUFRO Interdisciplinary Uneven-Aged Management Symposium
1997
(pg. 
455
-
467
)
Tadikamalla
PR
A look at the Burr and related distributions
Int. Stat. Rev.
1980
, vol. 
48
 (pg. 
337
-
344
)
Westphal
C
Tremer
N
von Oheimb
G
Hansen
J
von Gadow
K
Härdtle
W
Is the reverse J-shaped diameter distribution universally applicable in European virgin beech forests?
For. Ecol. Manage.
2006
, vol. 
223
 (pg. 
75
-
83
)
Zasada
M
Cieszewski
CJ
A finite mixture distribution approach for characterizing tree diameter distributions by natural social class in pure even-aged Scots pine stands in Poland
For. Ecol. Manage.
2005
, vol. 
204
 (pg. 
145
-
158
)
Zhang
L
Gove
JH
Liu
C
Leak
WB
A finite mixture of two Weibull distributions for modeling the diameter distributions of rotated-sigmoid, uneven-aged stands
Can. J. For. Res.
2001
, vol. 
31
 (pg. 
1654
-
1659
)