© 2005 International Council for the Exploration of the Sea
Censored catch data in fisheries stock assessment
a 9 Grove Street, PO Box 1012, Dartmouth, N.S., Canada, B2Y 3Z7
b Département EMH, IFREMER Rue de l'Ile d'Yeu, BP 21105, 44311 Nantes, France
*Correspondence to T. R. Hammond: tel: +1 902 426 3100 287; fax: +1 902 426 9654. e-mail: Tim.Hammond{at}drdc-rddc.gc.ca.
Landings statistics can be lower than true catches because many fish are discarded or landed illegally. Since many discards do not survive, treating landings as true catches can lead to biased stock assessments. This paper proposes treating catch as censored by bounding it below by the landings, L, and above by cL (for scalar c > 1). We demonstrate the approach with a simulation study, using a Schaefer surplus production model. Parameters were estimated in a Bayesian framework with BUGS software using two sets of priors. Both the traditional true-catch method and a survey-and-effort method (which was landings free) performed worse on average than the censored approach, as measured by the Bayes risk associated with estimates of maximum sustainable yield (MSY) and of an index of depletion (X). Recursive partitioning (regression trees) was used to associate simulation parameters to best-performing methods, showing that higher commercial fish catchability favoured the censored method at estimating X. In conclusion, censored methods provide a means of dealing with discarding and misreporting that can outperform some traditional alternatives.
Keywords: censored data, discarding, misreporting, stock assessment
Received 10 September 2004; accepted 27 April 2005.
| Introduction |
|---|
|
|
|---|
While most fisheries stock assessments rely on reported landings, assessment methods differ in their treatment of these data. Methods derived from virtual population analysis (VPA), like extended survivor analysis (Shepherd, 1999), a standard ICES tool, assume that catches are known without error. For many stocks, however, this assumption does not hold, primarily because of discarding and misreporting. Observer studies suggest that the number of fish discarded is sometimes greater than the number landed for Northeast Atlantic megrim, whiting, and Norway lobster (Rochet et al., 2002). While the importance of misreported (and unreported) landings is more difficult to assess, anecdotal reports suggest these can be substantial. Thus, some models admit random errors in landings reports by assuming various error distributions centred on the observed landings (e.g. Kimura, 1990; Patterson, 1998). Still other models (e.g. Cook, 1997) have dispensed with the landings data altogether, preferring to base results on survey data alone. This paper proposes a new approach: treating catch as censored data.
We do not mean the word "censored" in the more familiar sense, as applied to material deemed inappropriate, but in the sense it is used in survival analysis. In the clinical trials typical of this field, a patient might still be alive at the end of the study period and thus his actual post-treatment survival time remains unknown (though, of course, it must be greater than the time from treatment to study's end). In such a case, we say the patient's survival time is censored. Thus, in calling the catch censored, we mean that we have observed bounds for the catch, but not the value itself.
In a previous comparison of catch-modelling approaches using simulated data, Patterson (1998) found that, in terms of mean square errors, a random catch error model provided better results than ADAPT (a VPA-derived model) for a range of underreporting levels and survey index variances. It also outperformed a survey-only method. This paper also uses simulation to compare approaches, but deals with surplus production models rather than age-structured ones.
The most comprehensive study to date comparing the performance of various stock-assessment models using simulated data was published by the National Research Council (1998). One of the scenarios tested was a 30% underreporting of catches for which the surplus production model performed comparably well, in terms of the relative error of estimated exploitable biomass at the end of the study period (year 30). Curiously, the relative error was lower when only commercial catch data were used (6%) compared with both commercial and survey data (24%). TAC estimates were completely wrong (366179%), however, and other assessment methods performed much better.
Most other studies of the performance of surplus production models allowing for errors in commercial catches treat those errors as normal or lognormal. A notable exception is Koonce and Shuter (1987), who assumed fixed under-reporting of 20% but did not compare scenarios with and without under-reporting.
Fisheries science has tried to deal with the discarding problem by placing observers on fishing vessels. By assuming that the fishermen behave no differently when they are not being observed, the observed discards are then extrapolated to fishing trips taken without observers and even to trips taken on vessels that never had any observers at all. The discards are said to be "raised to the fleet" by means of such extrapolation. It is common practice to add the raised discards to the landings and to treat the sum as the true catch (or more recently as the "observed" catch) for the fishery. Moreover, discards in years without observer data are estimated using discards-to-landings ratios observed in different years (e.g. ICES, 2001).
This paper does not evaluate the merits of the above procedures. Instead, it focuses on a situation where the stock-assessment working group has in hand a statistic, L(t), which can safely be considered a lower bound for the true catch in year t. We will call L(t) "the landings" because we tend to imagine a situation where, despite considerable discarding, there is a total lack of observer data, but L(t) could also be interpreted in other ways. It might be, for example, that there are so many anecdotal reports of illegal (black) landings that even the sum of observed landings and raised discards (now L(t)) is a lower bound for true catch. Alternatively, the working group might suspect that observer data are not representative of the whole fleet, so it is not prepared to raise the discards. In that case, L(t) might simply represent the sum of observed discards (unraised) and landings. Thus, there are several possible scenarios for the situation we consider.
In addition to the lower bound, L(t), we also assumed that the working group would be able to place a reasonable upper bound on the catches, which has important repercussions for the usefulness of the approach. It is convenient if this upper bound is expressed as a multiple of L(t). If the working group was prepared to accept the premise that fishermen were not discarding more fish than they were landing, for example, then a reasonable upper bound would be 2L(t). This is the value used throughout this paper.
Surplus production models are commonly used for stock assessments in situations where landings and effort data are available, but detailed age (or length) structure data are not (Hilborn and Walters, 1992). For this class of models, at least, we expected that treating landings as equal to catches, when catches are actually under-reported, should adversely affect stock biomass estimates. Thus, we used simulated data to evaluate the performance of three assessment methods: survey-and-effort, true-catch, and censored-catch. The survey-and-effort method tried to estimate catches with fishing effort data (assumed to be known without error), but relied primarily on surveys to estimate abundance. The true-catch method supplemented the survey index by treating recorded landings as true catch, and the last method also used the survey index but treated catch as censored data. All estimation methods were implemented using Bayesian techniques.
Bayesian analysis has become frequent in fisheries stock assessment (Walters and Ludwig, 1994; McAllister and Ianelli, 1997; Punt and Hilborn, 1997; Hilborn and Liermann, 1998; Meyer and Millar, 1999b) because of its clear, probabilistic portrayal of uncertainty and its natural extension to decision analysis (Punt and Hilborn, 1997). Bayesian probability, however, may be subjective in nature, and usually does not have the common interpretation that involves the frequency of an event. Various methods for obtaining posterior parameter distributions exist, one of which is Markov Chain Monte Carlo (MCMC). MCMC can be implemented with relatively little programming effort using freely available software called BUGS (http://www.mrc-bsu.cam.ac.uk/bugs/). Meyer and Millar (1999a) provide an application of BUGS to a surplus production model, and we used both their model and code as a template. The BUGS environment allowed us to change a standard Schaefer model into a censored data version by modifying only a few lines of code.
While BUGS facilitates the implementation of censored-data models, and the Bayesian approach may aid in interpreting their results, there is nothing inherently Bayesian about censored data. In fact, many models in survival analysis do not follow this paradigm. Nor are the possible applications of this idea in fisheries restricted to surplus production models. A key point of this paper is that any statistical assessment model can be modified to treat catch as censored data. Whether a given model should be so modified, however, should depend on the degree to which landings underestimate the true impact of fishing.
| Methods |
|---|
|
|
|---|
Censored data
For a given random variable Y, with density f and distribution function F, a standard data set might consist of two realizations, y1 = 3 and y2 = 5, of Y. A set of censored data, on the other hand, might have observations of the form y1 > 3 and y2 < 5. Whereas the likelihood (l) for the standard data set would be computed using the density function (l = f(3)f(5)), in this simple example the censored likelihood would be computed from the distribution function (l = (1 F(3))F(5)). Maximum likelihood (or Bayesian) estimation of the parameters of f (and of F) could proceed much as before, though the likelihood function would be less informative. Of course, if there were other random variables correlated with Y, then the situation becomes more complicated, as the censored data for Y would be informative about them. In the frequentist context, special algorithms are used to handle censored data problems. Even the most common of these, the Estimation Maximization (EM) algorithm (Tanner, 1996; McLachlan and Krishnan, 1997), is very rarely applied in fisheries. In the Bayesian context, however, the same algorithms that are typically used in fisheries stock assessment, namely MCMC (Meyer and Millar, 1999b) or SIR (McAllister and Ianelli, 1997), can readily be modified to handle censored data, as demonstrated here for MCMC. This paper uses censored observations of the form L(t) < C(t) < 2L(t), where C(t) is catch in year t and L(t) represents reported landings (but see above).
Experimental design
The simulation experiment in this paper was designed to compare the performance of three different stock-assessment approaches, all based on a discrete variant of the Schaefer surplus production model (Meyer and Millar, 1999a):
|
| (1) |
We took a decision theoretic approach for the comparison of estimation approaches, grading performance in terms of a number called the Bayes risk (e.g. Casella and Berger, 1990). In fact, the paper's main objective is to determine which approach has the lowest such risk, but this simple comparison is complicated by the fact that the Bayes risk can be computed for any model parameter. Moreover, the computation can be made with a variety of estimation loss functions. To constrain the computations, therefore, we chose just two parameters that we considered particularly important to stock assessment: maximum sustainable yield (MSY = rK/4) and depletion (X), the latter defined as the final-year biomass divided by the biomass that yields MSY (namely K/2, which implies that X = 2B'(10), year 10 being the final year). We also chose two loss functions: squared-error loss (
SE) and absolute-error loss (
AE). These decisions gave us four Bayes risk measures to consider for each estimation method.
Loss functions are designed to score estimation performance by comparing an estimation result (
p(x), for a particular parameter p given data vector x) with the "true" value of that parameter (which, for the sake of generality, we take to be a scalar valued function of the "true" parameter vector
denoted p(
)). In other words, loss is just a function of
and x that estimators should try to minimize. As we consider Bayesian estimation, we take
p(x) to be the marginal posterior density for p given the data x denoted here by
p(
|x) (for dummy variable
). In this notation, our two estimation loss functions are defined as follows:
|
| (2) |
|
| (3) |
The loss functions are difficult to evaluate analytically, but they are very well approximated by the results of MCMC. To approximate these equations using n sample values p1, p2, ..., pn from the marginal posterior distribution
p(
|x) given by MCMC, we used the following:
|
| (4) |
|
| (5) |
Now the basic idea of the Bayes risk is to obtain an average value of the estimation loss by integrating over
(the parameter space of possible values for
) and over X (the sample space for random vectors x). For squared-error loss, the Bayes risk formula (after Casella and Berger, 1990) is as follows:
|
| (6) |
) is the likelihood function and
(
) is the prior. Naturally,
and x.
In practice, Equation (6) presents us with a very daunting integration problem. We certainly do not recommend that analytical solutions be attempted. Instead, we propose to approximate by Monte Carlo integration, just as in Equations (4) and (5). If we first notice that the term f(x|
)
(
) in Equation (6) is just the joint distribution of x and
, then it will come as no surprise that, given m vectors (x1,
1), (x2,
2), ..., (xm,
m) drawn randomly from that joint distribution, the Bayes risk can be approximated as follows:
|
| (7) |
Our approximation to the Bayes risk affected the design of our simulation experiment, and as a result it differs considerably from the usual frequentist approach. A frequentist simulation study would normally select a set of Schaefer model parameters and simulate many data sets with them. Such a study would consider several scenarios with different model parameters. In this study, on the other hand, the two scenarios were defined by different prior distributions. For each simulation within a scenario, we first drew a set of model parameters (
) from the priors and then constructed a data set (x) consisting of survey, landings, or effort data by drawing randomly from f(x|
). In other words, both parameters and data varied over the simulations. This simulation scheme provides the joint vectors (x1,
1), (x2,
2), ..., (xm,
m) in just the form needed to approximate the Bayes risk by Equation (7). We did m = 40 simulations for each scenario.
It is natural to wonder at this point whether the 40 simulations we did are sufficient to provide a valid assessment of the approach we propose. We contend that we do not need to evaluate the Bayes risk to any particular degree of precision that might be chosen a priori. Instead, we need only to determine, with a high degree of probability, which assessment method had the lowest risk. This implies that the number of simulations (m) required is closely linked to the actual performance difference between methods: the more they differ in performance, the fewer the simulations required. So, to determine whether the observed difference between the top two performing methods was "significant", we ran a paired, Wilcoxon rank sum test (two-sided, level
= 0.0001) comparing the 40 calculations of
. We repeated the test with
,
, and
. If all these tests are significant at that low level, then 40 is enough.
The Bayes risk is effective at evaluating the overall performance of the various methods, but hides the effects of different parameter values. Therefore, we used regression trees to investigate the link between simulation model parameters and which method was optimal for them. Regression tree is an exploratory tool suitable for modelling categorical response variables (here, which model performed best in terms of
) using categorical or continuous explanatory variables (Chambers and Hastie, 1992; Clark and Pregibon, 1992). Individuals (here, simulated data sets) are recursively split into two groups using the explanatory variables (here, model parameters plus average ratio of catch to landings). At each node the split leading to the greatest difference in the response variable is selected, and these splits are represented as branches of a tree. The leaf nodes of this tree are clusters of similar individuals. By looking at the values of the variables characterizing each branch, a description of the final cluster is obtained. It is possible to set a minimum group size for clusters (we used 8) in order to avoid clusters with only few individuals.
Simulation
We started each simulation by generating a new 10-year time-series of fishing effort, E(t). These time-series were constructed by simulating three numbers from a uniform distribution on the interval from 0.2 to 1.2. These three numbers represented the effort in years 1, 6, and 10. The other effort values were obtained by linear interpolation. In other words, the effort time-series was not affected by any trend in the size of the fished population. We justify this simplification by the fact that, in reality, the effect of biomass trends on effort can be quite weak, given the many other factors at play (fuel prices, other stocks, other jobs, government subsidies, etc.).
The next step was to draw a set of Schaefer assessment model parameters from the prior distributions. We used two scenarios representing different levels of prior information about the stock. These two scenarios are described in Table 1, which also gives a description of each model parameter. The principal difference between the scenarios is that, in the second, both the survey data and the priors were considerably less informative. These priors were chosen so as to simulate population trajectories that we considered realistic.
|
The simulations created fish stocks that were already depleted below the carrying capacity, K, in year 1. This was done by drawing the initial proportion of K, denoted P0, from a Beta distribution:
|
| (8) |
2:
|
| (9) |
,
|
| (10) |
|
| (11) |
|
| (12) |
We generated a mean landings rate (pL) for our simulated fishery:
|
| (13) |
|
| (14) |
The survey catch rates, denoted I(t) in year t, were generated next. We assumed a lognormal distribution for the survey index:
|
| (15) |
Estimation
The three estimation models, survey-and-effort, true-catch, and censored-catch, closely resemble the simulation model in that they share all its equations except Equations (10), (11), (13), and (14). Equation (10) defines parameter
, which was used to create a non-linear relationship between commercial catch rates and biomass in the simulated data sets. This parameter was not used at all in estimation. For the true-catch method, which we consider traditional because it is like the one in Meyer and Millar (1999a), catches were assumed equal to landings, C(t) = L(t), so there was no need for Equations (11), (13), and (14). For the survey-and-effort and censored-catch methods, catches were predicted from product qcE(t)B(t), where qc is the catchability coefficient for commercial fishing and E(t) is the corresponding effort in year t. This is equivalent to assuming that catch rates are proportional to biomass. Thus Equations (13) and (14) were eliminated, and Equation (11) was replaced with the following (wherein the
has been deleted):
|
| (16) |
The likelihood function for the data involves a factor for the survey index:
|
| (17) |
|
| (18) |
|
| (19) |
Fortunately, within the BUGS programming environment, the full joint likelihood need not be specified, as the program can deduce it from the conditional distributions and from the conditional independence relationships implied by their specification. The Appendix provides the BUGS code that we used to implement the true-catch method. It also indicates the modifications required to adapt this code to the survey-and-effort and censored-catch models.
For each simulation we ran two MCMC chains for one million iterations, sampling every 40th iteration, after a burn-in of 500 000 iterations. These two chains were started from different, over-dispersed starting locations, as specified in the Appendix. Running two chains allowed us to compute the relative difference between the marginal posterior quantiles for each parameter. This provides an intuitive measure of MCMC-induced error, more reliable at examining many chains than statistical convergence tests. All results in this paper were based on the combined samples in both chains.
| Results |
|---|
|
|
|---|
In order to assess the MCMC-induced error, we took our two MCMC chains and computed the difference between estimates of the first, second, and third quartiles of every parameter, and divided that difference by the mean parameter value (this mean was taken over both chains). The maximum absolute value of this relative difference, over all parameters and over the three quartiles, provides an index of the relative difference between the chains. We looked at the average of that index over all 40 simulations. For the censored method (in scenario 1) the average was 0.89%; for the survey-and-effort method it was 1.87%, and for the true catch it was 1.09%. The worst case for this relative difference index (over the simulations) was 2.37% for censored, 4.72% for survey-and-effort, and 2.93% for true catch. This analysis revealed that late-year catches mixed most slowly in the MCMC chains. The MCMC-induced error was small compared with the relative difference between results across estimation methods (using the same MCMC start values where possible), which averaged over 50% (over the parameters and quantiles in scenario 1). Thus, MCMC performance was deemed acceptable for the purpose of comparing methods on simulated data.
The Bayes risk for depletion (X) under squared-error loss
was smallest for the censored-catch method in scenario 1, at 0.077. For the survey-and-effort method this statistic was 0.098 and for the true-catch method it was 0.108. For
the results were 2279 for the censored method, 3098 for the survey-and-effort, and 2728 for the true catch in the same scenario. With absolute-error loss in scenario 1, the results were 0.162 for the censored, 0.178 for the survey-and-effort, and 0.214 for the true-catch methods for
. For
the results were 20.25 for the censored method, 25.75 for the survey-and-effort, and 23.35 for the true catch.
Figure 1 gives a graphical representation of these estimation results. The first row of this figure provides marginal posterior results from estimating the depletion index X, while the second row gives results for MSY. These marginal distributions have been shifted left by the true parameter value (i.e. the one generated in simulation), so the results should be clustered around zero, if all is going well. All methods are a bit over-optimistic about the stock, but the true-catch method is particularly so.
|
Results were much the same in scenario 2:
|
All the Wilcoxon rank sum tests comparing values of
= 0.000001, showing that the m = 40 simulations used in each scenario were sufficient to discern that the censored method's superior performance is unlikely to have occurred by chance.
Thus, the overall performance of the censored approach was superior, but it was decidedly not optimal in several situations. With recursive partitioning we attempted to characterize the situations in which each method performed best. For this purpose, squared-error loss was chosen as the optimality criterion. For each simulation (40 for each scenario) the dependent variable was coded by the method with the minimum
(censored, survey-and-effort, or true-catch). The resulting trees are shown in Figures 36, for MSY and X in scenarios 1 and 2, respectively. The dominating method with the smallest
is marked at the end of each branch. For example, branch 1 in Figure 3 consists of 20 simulations for which the censored method generally performed best. Some misclassification can happen; hence, the percentage of simulations classified correctly is also given in the figures. Looking at these figures for patterns that are consistent between scenarios, we find that higher fish catchability (qc) values and lower precision for the survey index (
2 in scenario 1) favoured the censored method at estimating the depletion index X compared with the other two methods. Higher ratios of catch to landings (ratio), i.e. less discarding, favoured it at estimating MSY in scenario 2, but not in scenario 1. Note that the survey-and-effort method was relatively effective at estimating depletion, but poor at estimating MSY.
|
|
|
|
| Discussion |
|---|
|
|
|---|
This paper introduces the idea of treating catch as censored data in stock-assessment models. Though censored data have been used for decades in survival analysis, the concept seems to be new to fisheries. In this paper, it was shown that censored-data models can lead to improved performance (on an average) relative to the traditional true-catch and survey-and-effort methods on simulated data sets, both at estimating depletion and at estimating MSY. This was true regardless of whether performance was measured by squared-error or absolute-error loss, and it did not appear to be sensitive to the prior distributions. The censored method was not, however, always the best performer.
Although we explored which parameter values were associated with good performance of the censored method, few patterns emerged that were consistent across scenarios. The only such pattern that we uncovered suggested that the censored method was favoured by high catchability values when estimating depletion. We had expected that the relative performance of the censored and true-catch methods would depend on the average ratio of catch to landings, with higher ratios favouring the censored approach. This turned out to be the case in estimating MSY in scenario 2, but not in scenario 1. Without clear indications of parameter values that give the censored method special difficulty, we decided that its superior average performance might be expected in general.
Fisheries science has increasingly relied on scientific surveys in stock assessment because of the difficulties with commercial fishery data. Some recommend dispensing with landings data altogether in certain situations (e.g. Cook, 1997), preferring landings-free methods. We showed here that our landings-free method, the survey-and-effort one, was relatively effective at estimating depletion, but not at estimating MSY. Nonetheless, our results suggest that landings-free methods throw away too much information as the survey-and-effort method was bettered by the censored approach even at estimating depletion.
The censored-catch idea can only prove useful, however, if real situations arise where catches are surely greater than landings. When catches are greater than landings, it is most often because fishermen discard many of the fish they catch at sea. Fish that are thrown back into the ocean after being hauled on board ship by commercial fishing gear are usually injured, if not dead. In many fisheries worldwide, it is not unusual for more fish to be discarded than landed (e.g. Alverson et al., 1994; Stratoudakis et al., 1999; Rochet et al., 2002). Though fishermen may regret the waste involved, market forces and fisheries regulations interact to compel them to discard. Indeed, the use of landings quotas in mixed fisheries creates an incentive to discard fish because it is difficult to avoid catching a mixture of species (Pascoe, 1997). A vessel that has quota for cod and other stocks, for example, will not usually stop fishing when she fills her cod quota, but will keep going until she fills all quotas, discarding as many cod as necessary in the process. To ignore discarding in stock assessment, therefore, is usually to underestimate the true impact of fishing.
Fishing gears also kill many fish that they fail to capture. Though fisheries science has a long tradition of using trawl mesh-size regulations to allow unwanted small fish to slip through nets, video footage of gear performance suggests that many of the fish escaping capture in this way are still injured or killed (e.g. Suuronen et al., 1996). Many gear types also lose fish in the process of hauling them on board, drift nets being a prominent example. Ghost fishing, by which we mean lost fishing gear that continues to kill fish (Kaiser et al., 1996), provides yet another mechanism by which landings may underestimate fishing impact.
Despite compelling reasons for thinking that landings underestimate fishing impact, there are mechanisms that can lead to the very opposite. The herring fishery on the west coast of Scotland (Area VIa) provides an interesting example. According to the stock-assessment working group reports for that area, fishermen have been reporting fish that they caught in the North Sea (Area IVa) in Area VIa (to some degree also in Areas IIIa and IIa), in order to take advantage of the less restrictive quota in VIa (ICES, 2002). This might mean that landings in VIa are greater than catches.
Such examples being exceptional, a lower bound for the catch could likely be found in most fisheries; however, the censored method also calls for an upper bound. This is a weak point because it is hard to imagine how such a bound could be observed directly; it would have to be set somewhat arbitrarily. Nonetheless, we feel that a stock-assessment working group could set a level beyond which the catch would be incredible. We showed in this paper that, even if this upper bound is set too low (at 2 times landings rather than the 2.5 possible in simulated data), censored-model performance can still be better than that of traditional approaches. Future use of censored-data models may well depend on how this arbitrary aspect is viewed by working groups.
| Appendix 1 |
|---|
|
|
|---|
In this section, we provide the BUGS code used to estimate parameters of the true-catch model specified in Methods (for scenario 1). To map this code onto the equations of that section, it is important to know that "isigma2" corresponds to 1/
2, "itau2" to 1/
2, "P[t]" to "B'(t)", "itheta2" to 1/
2, "q" to qs, and "qcomm" to qc. Note that lines starting with "#" are comments. BUGS also uses an unconventional parameterization for its normal and lognormal distributions: the second parameter is the reciprocal of the variance, also known as the precision of the distribution.
The "I" symbols following distribution declarations are used to signal the presence of censored data in BUGS. For example, "Y
dnorm(0,1)I(2,)" would signify that Y has a standard normal distribution, and that there has been a censored observation "Y > 2". Just by way of clarification, it would not mean that Y has a truncated normal distribution, which is subtly different. Changing that suffix to read "I(,3)" would imply an observation of "Y < 3", while "I(2,3)" would mean an observation of "2 < Y < 3". These I symbols are the key to implementing censored-data models in BUGS.
model;
{
- #priors
- r
dlnorm(1.4,4)
- isigma2
dgamma(100.05)
- K
dlnorm(6.551,4)
- r
- # _ A _
- # itheta2
dgamma(70,1)
- # _ A' _
- # itheta2
- itau2
dgamma(44,2)
- qcomm
dbeta(5.5,44.5)
- q
dbeta(4,8)
- Pinit
dbeta(20,6)
- qcomm
- #Dynamics
- Pmed[1] <Pinit
- Pmed[1] <Pinit
- for(t in 1: N) {
- P[t]
dnorm(Pmed[t],isigma2)I(0.001,)
- # _ B _
- C[t] <Landings[t]
- # _ B' _
- C[t] <Landings[t]
- Pmed[t + 1] < (r + 1) * P[t] (r * P[t]) * P[t] C[t]/K
- Imed[t] < log((K * P[t]) * q)
- I[t]
dlnorm(Imed[t],itau2)
- I[t]
- }
- P[t]
- Pcur
dnorm(Pmed[N+1],isigma2)I(0.001,)
- X<-Pcur/.5
- MSY< r * K/4
- MSY< r * K/4
}
BUGS needs data and inputs to start an MCMC chain. A typical data file would look as follows:
- list(N = 10, Landings = c(5.94,6.36,7.65,7.47,8.86,11.95,8.17,5.52,6.35,3.96),
- I = c(51.17,61.25,55.67,60.07,58.31,70.99,53.8,47.54,39.73,49.09))
- I = c(51.17,61.25,55.67,60.07,58.31,70.99,53.8,47.54,39.73,49.09))
The values for "Landings" were taken from a simulated data set, as were the values in the "I" vector, which represents the survey index values. N is the number of years of data.
A typical input file might look as follows:
- list(q = 0.332, qcomm = 0.13, itau2 = 25.2, isigma2 = 2508, Pcur = 0.5, Pinit = 0.537, K = 193, r = 0.265,
- P = c(0.54,0.55,0.55,0.54,0.53,0.53,0.48,0.47,0.45,0.5))
- P = c(0.54,0.55,0.55,0.54,0.53,0.53,0.48,0.47,0.45,0.5))
An input file can be created by multiplying the true parameters by either 0.7 or 1.3. In this paper, we made two input files for every set of simulation data by using one of these two multipliers on every parameter, while making sure to constrain the parameter values in the feasible range.
The true-catch model can be changed into the censored-catch model in four steps:
- Remove the comment symbol "#" before "itheta2" between points "_ A _" and "_A'_"
- Replace the code between "_ B _" and "_B'_" with the following:
- Cmed[t] < log((E[t] * qcomm) * P[t] * K)
- L2[t]<= 2* Landings[t]
- C[t]
dlnorm(Cmed[t],itheta2)I(Landings[t],L2[t])
- L2[t]<= 2* Landings[t]
- Add the effort data (from the same data set) to the data file:
- list(N = 10, E = c(0.42,0.49,0.55,0.62,0.68,0.75,0.66,0.57,0.48,0.39),
- Landings = c(5.94,6.36,7.65,7.47,8.86,11.95,8.17,5.52,6.35,3.96),
- I = c(51.17,61.25,55.67,60.07,58.31,70.99,53.8,47.54,39.73,49.09))
- Landings = c(5.94,6.36,7.65,7.47,8.86,11.95,8.17,5.52,6.35,3.96),
- Add initial values for catch vector C for the input file:
- list(q = 0.332, qcomm = 0.13, itau2 = 25.2, isigma2 = 2717,
- itheta2 = 71, Pcur = 0.5, Pinit = 0.54, K = 193.4, r = 0.265, P = c(0.54,0.55,0.55,0.54,0.53,0.53,0.48,0.47,0.45,0.5),
- C = c(7.9,8.4,9.7,9.5,10.9,13.9,10.2,7.5,8.3,6))
- itheta2 = 71, Pcur = 0.5, Pinit = 0.54, K = 193.4, r = 0.265, P = c(0.54,0.55,0.55,0.54,0.53,0.53,0.48,0.47,0.45,0.5),
- Replace the code between "_ B _" and "_B'_" with the following:
The true-catch model can also be changed into the survey-and-effort model in four steps:
- Remove the comment symbol "#" before "itheta2" between points "_ A _" and "_A'_"
- Replace the code between "_ B _" and "_B'_" with the following:
- Cmed[t] < log((E[t] * qcomm) * P[t] * K)
- C[t]
dlnorm(Cmed[t],itheta2)
- C[t]
- Add the effort data (from the same data set) to the data file and remove the landings:
- list(N = 10, E = c(0.42,0.49,0.55,0.62,0.68,0.75,0.66,0.57,0.48,0.39),
- I = c(51.17,61.25,55.67,60.07,58.31,70.99,53.8,47.54,39.73,49.09))
- I = c(51.17,61.25,55.67,60.07,58.31,70.99,53.8,47.54,39.73,49.09))
- Add initial values for catch vector C for the input file
- list(q = 0.332, qcomm = 0.13, itau2 = 25.2, isigma2 = 2717,
- itheta2 = 71, Pcur = 0.5, Pinit = 0.54, K = 193.4, r = 0.265, P = c(0.54,0.55,0.55,0.54,0.53,0.53,0.48,0.47,0.45,0.5),
- C = c(7.9,8.4,9.7,9.5,10.9,13.9,10.2,7.5,8.3,6))
- itheta2 = 71, Pcur = 0.5, Pinit = 0.54, K = 193.4, r = 0.265, P = c(0.54,0.55,0.55,0.54,0.53,0.53,0.48,0.47,0.45,0.5),
- Replace the code between "_ B _" and "_B'_" with the following:
| Acknowledgements |
|---|
This work was partly funded by the European Commission under Framework V, project contract QLRT-1999-01609. It was also partly funded by Defence R&D Atlantic.
| References |
|---|
|
|
|---|
-
Alverson D. L., Freeberg M. H., Murawski S. A., Pope J. G. (1994) A global assessment of bycatch and discards. FAO Fisheries Technical Paper No. 339: 233 pp.
Anon. (2001) Report of the working group on the assessment of Southern Shelf demersal stocks. ICES CM 2001/ACFM:5. 735 pp.
Casella G. and Berger R. (1990) Statistical Inference(Wadsworth and Brooks, Belmont) 650 pp.
In Chambers J. and Hastie T. (Eds.). Statistical Models (1992) (Wadsworth and Brooks, Pacific Grove, CA) 608 pp.
Clark L. and Pregibon D. (1992) Tree-based models. In Chambers J.M. and Hastie T.J. (Eds.). Statistical Models(Wadsworth and Brooks, Pacific Grove, CA) pp. 377417 608 pp.
Cook R.M. (1997) Stock trends in six North Sea stocks as revealed by an analysis of research vessel surveys. ICES Journal of Marine Science 54:924933.
Hilborn R. and Liermann M. (1998) Standing on the shoulders of giants: learning from experience in fisheries. Reviews in Fish Biology and Fisheries 8:273283.[CrossRef][Web of Science]
Hilborn R. and Walters C.J. (1992) Quantitative Fisheries Stock Assessment: Choice, Dynamics and Uncertainty(Chapman and Hall, New York).
ICES. (2002) Report of the herring assessment working group for the area South of 62° N. ICES CM 2002/ACFM:12. 418 pp.
Kaiser M.J., Bullimore B., Newman P., Lock K., Gilbert S. (1996) Catches in "ghost fishing" set nets. Marine Ecology Progress Series 145:1116.[Web of Science]
Kimura D.K. (1990) Approaches to age-structured separable sequential population analysis. Canadian Journal of Fisheries and Aquatic Sciences 47:23642374.
Koonce J.F. and Shuter B.J. (1987) Influence of various sources of error and community interactions on quota management of fish stocks. Canadian Journal of Fisheries and Aquatic Sciences 44:6167.
McAllister M.K. and Ianelli J.N. (1997) Bayesian stock assessment using catch-age data and the sampling importance resampling algorithm. Canadian Journal of Fisheries and Aquatic Sciences 54:284300.
McLachlan G.J. and Krishnan T. (1997) The EM Algorithm and Extensions(John Wiley and Sons, New York) 275 pp.
Meyer R. and Millar R. (1999) BUGS in Bayesian stock assessments. Canadian Journal of Fisheries and Aquatic Sciences 56:10781086.
Meyer R. and Millar R.B. (1999) Bayesian stock assessment using a state space implementation of the delay difference model. Canadian Journal of Fisheries and Aquatic Sciences 56:3752.
NRC (National Research Council). (1998) Improving Fish Stock Assessments(National Academy Press, Washington, DC) 177 pp.
Pascoe S. (1997) Bycatch management and the economics of discarding. FAO Fisheries Technical Paper, 370. FAO, Rome.
Patterson K. (1998) Assessing fish stocks when catches are misreported: model, simulation tests, and application to cod haddock, and whiting in the ICES area. ICES Journal of Marine Science 55:878891.
Punt A.E. and Hilborn R. (1997) Fisheries stock assessment and decision analysis: the Bayesian approach. Reviews in Fish Biology and Fisheries 7:3563.[CrossRef][Web of Science]
Rochet M.J., Péronnet I., Trenkel V.M. (2002) An analysis of discards from the French trawler fleet in the Celtic Sea. ICES Journal of Marine Science 59:538552.
Shepherd J.G. (1999) Extended survivors analysis: an improved method for the analysis of catch-at-age data and abundance indices. ICES Journal of Marine Science 56:584591.
Stratoudakis Y., Fryer R.J., Cook R.M., Pierce G.J. (1999) Fish discarded from Scottish demersal vessels: estimators of total discards and annual estimates of targeted gadoids. ICES Journal of Marine Science 56:592605.
Suuronen P., Perez-Comas J., Lethonen E., Tschernijj V. (1996) Size-related mortality of herring (Clupea harengus L.) escaping through a rigid sorting grid and trawl codend meshes. ICES Journal of Marine Science 53:691700.
Tanner M.A. (1996) Tools for Statistical Inference(Springer-Verlag, New York) 207 pp.
Walters C. and Ludwig D. (1994) Calculation of Bayes posterior probability distributions for key population parameters. Canadian Journal of Fisheries and Aquatic Sciences 51:713722.
This article has been cited by other articles:
![]() |
J. A. Bogaards, S. B. M. Kraak, and A. D. Rijnsdorp Bayesian survey-based assessment of North Sea plaice (Pleuronectes platessa): extracting integrated signals from multiple surveys ICES J. Mar. Sci., May 1, 2009; 66(4): 665 - 679. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||







