© 2004 International Council for the Exploration of the Sea
An application of two techniques for the analysis of short, multivariate non-stationary time-series of Mauritanian trawl survey data
a Centro de Ciências do Mar (CCMAR), Universidade do Algarve 8000-117 Faro, Portugal
b Institut Mauritanien de Recherches Océanographiques et des Pêches (IMROP) B.P. 22, Nouadhibou, Mauritania
c Instituto Nacional de Investigação Agrária e das Pescas (IPIMAR) Av. Brasília, 1449-006 Lisboa, Portugal
*Correspondence to K. Erzini: tel: +351 289 800100; fax: +351 289 818353. e-mail: kerzini{at}ualg.pt.
Min/max autocorrelation factor analysis (MAFA) and dynamic factor analysis (DFA) are complementary techniques for analysing short (>1525 y), non-stationary, multivariate data sets. We illustrate the two techniques using catch rate (cpue) time-series (19822001) for 17 species caught during trawl surveys off Mauritania, with the NAO index, an upwelling index, sea surface temperature, and an index of fishing effort as explanatory variables. Both techniques gave coherent results, the most important common trend being a decrease in cpue during the latter half of the time-series, and the next important being an increase during the first half. A DFA model with SST and UPW as explanatory variables and two common trends gave good fits to most of the cpue time-series.
Keywords: dynamic factor analysis, indicators, Mauritania, metrics, min/max autocorrelation factor analysis, multispecies, time-series
Received 1 April 2004; accepted 10 November 2004.
| Introduction |
|---|
|
|
|---|
The main use of ecosystem indicators is to summarize trends in ecosystem structure or functioning that can be related to fishing and environmental variability. A variety of metrics based on size and trophic structure, relative abundance, and diversity have been developed (Bianchi et al., 2000; Pauly et al., 2000; Rice, 2000; Rochet and Trenkel, 2003; Trenkel and Rochet, 2003), many of which are based on time-series derived from regular monitoring surveys.
Rice (2000) discusses the merits and limitations of different classes of metrics as indicators of ecosystem state. Aggregate, community-level indicators have clear merits, because they integrate over the individual components, but their disadvantage is that effects on individual species cannot be traced. This may be a crucial shortcoming, because the primary response of a species to exploitation (as well as to environmental change) is largely determined by its life history characteristics (Rochet, 1998). Also, the effects of fishing and environmental variables cannot easily be evaluated using aggregate indicators (Rice, 2000). While techniques such as canonical correspondence analysis, redundancy analysis, and canonical correlation may be used to analyse multivariate data with explanatory variables, their utility for analysing trends and changes over time is limited (Bull et al., 2001; Zuur et al., 2003b).
Trend, seasonality, and explanatory variables can be handled by alternative techniques such as vector auto-regressions, auto-regressive integrated moving average (ARIMA) models, multivariate ARIMA, and dynamic regression models (Stergiou, 1991; Rothschild et al., 1996; Stergiou and Christou, 1996; Stergiou et al., 1997; Lloret, 2003). However, such techniques require long, stationary, and complete time-series, and are not efficient for handling common trends (Stergiou and Christou, 1996). Solow (1994), Zuur et al. (2003a, b) and Zuur and Pierce (2004) propose two statistical techniques that are particularly suitable for relatively short (>1525 y), non-stationary multivariate time-series: min/max autocorrelation factor analysis (MAFA) and dynamic factor analysis (DFA). They can be used to extract and identify common trends from multiple time-series, to estimate index functions, to evaluate interactions between response variables, and to determine the effects of explanatory variables. Moreover, the effects of introducing lags in the explanatory variables can also be evaluated (Zuur et al., 2003b; Zuur and Pierce, 2004).
Fisheries applications of DFA have been limited to multiple time-series of catch per unit effort (cpue) for Nephrops norvegicus (Zuur et al., 2003b) and Loligo forbesi (Zuur and Pierce, 2004), with sea surface temperature (SST) and NAO index as explanatory variables, and an analysis of landings from southern Portugal with NAO index, river run-off, and an upwelling index as explanatory variables (Erzini, 1995). Here, we apply the two techniques to multispecies cpue time-series (19822001) of 15 fish and two cephalopod species taken in trawl surveys off Mauritania, with indices of NAO, upwelling, SST, and fishing effort as explanatory variables. The aim is to identify important underlying patterns in the data and to evaluate their relative importance, as well as to investigate the effects of introducing lags in the explanatory variables for individual species.
| Material and methods |
|---|
|
|
|---|
Response and explanatory variables
Mauritanian trawl survey data were available for the years 19822001 (IMROP, 2002). Species were selected on the basis of their overall abundance and importance to local demersal fisheries. Pelagic species were excluded because their catches by bottom trawls are likely not representative. The 15 fish species (Table 1) selected, along with octopus (Octupus vulgaris) and cuttlefish (Sepia officinalis), accounted for 43% of survey catches over the entire period.
|
Trawl surveys were stratified by season (warm, JuneOctober; cold, NovemberMay), zone (north, >19.25°N; south, <19.25°N), and depth (stratum 1, 020 m; 2, 2050 m; 3, 50100 m; 4, 100227 m). Deeper stations were eliminated because they were sampled less frequently and species composition was very different. Generalized Linear Models (GLM; McCullagh and Nelder, 1992) were used to remove the effects of these factors and thus to estimate standardized annual abundance. GLM analysis was carried out on log-transformed mean survey catch (kg per 30 min) by species, accounting for the factors year, season, zone, and depth. Zero values were eliminated, because the objective was to determine relative change over time, not absolute abundance.
Four explanatory variables were used: mean SST during the cold season (NovemberMay), based on data available on a 1° x 1° grid from COADS (Comprehensive Ocean-Atmosphere Data Set; http://www.cdc.noaa.gov/coads/), an upwelling index based on wind data at latitude 20.5°N (Sedik, 1975), the annual mean NAO index based on the difference of normalized sea level pressures between Ponta Delgada, Azores, and Stykkisholmur/Reykjavík, Iceland (http://www.cgd.ucar.edu/~jhurrell/nao.html; Hurrell, 1995), and an index of fishing effort for the cephalopod trawl fleet in Mauritanian waters (IMROP, 2002). To facilitate the interpretation of factor loadings and canonical correlations, the time-series of cpue and explanatory variables were standardized by dividing annual values by the overall mean.
Min/max autocorrelation factor analysis (MAFA)
MAFA (Solow, 1994) is a type of principal component analysis (PCA) in which the axes represent a measure of autocorrelation and give an indication of the association between variable Yt and Yt+k, where k is a time-lag (k = 1, 2,...). Unlike PCA, in which the first axis explains most of the variance, the first MAFA axis has the highest autocorrelation, and because trends are associated with high autocorrelation, it represents the main trend in the data. MAFA can be used to extract trends from multiple time-series, to estimate index functions, and for smoothing (Solow, 1994).
The loadings determine the relationship of individual response variables to particular MAFA axes. Cross-correlations between MAFA axes and response variables, also known as canonical correlations, are a measure of the relationship between Yt and Xtk, and allow identification of significant relationships between trends and explanatory variables. We used the Brodgar (http://www.brodgar.com) software package to carry out MAFA of the time-series of response variables and explanatory variables.
Dynamic factor analysis (DFA)
DFA is a dimension reduction technique that can be used to identify underlying common patterns in a multivariate time-series, evaluate interactions between response variables, and determine effects of explanatory variables (Zuur et al., 2003a). Time-series are modelled as a function of a linear combination of common trends, a constant level parameter, one or more explanatory variables, and noise (Zuur and Pierce, 2004).
Using the Brodgar software package, we fitted a series of DFA models, ranging from the simplest (with one common trend plus noise) to the most complex (with four common trends, two explanatory variables, plus noise). Models were fitted with both a diagonal covariance matrix and a symmetric positive-definite covariance matrix. Models with explanatory variables were fitted at lag = 0, 1, and 2, and Akaike's information criterion (AIC) was used as a measure of goodness-of-fit and to compare models (Zuur et al., 2003b). Factor loadings were used to make inferences regarding the importance of particular trends, representing underlying common patterns over time, both to specific response variables, and to different groups of response variables.
| Results |
|---|
|
|
|---|
The cpue time-series showed considerable variability, and trends cannot easily be identified (Figure 1a). This also largely applies to the explanatory variables, although fishing effort clearly peaked in 1987 and 1996, and (as expected) SST and UPW were negatively correlated.
|
Although autocorrelations were high (0.97 and 0.92, for first and second MAFA axis, respectively), neither was significant (p > 0.59). The scores for the first axis showed a steady increase from 1982 to 1992, followed by a sharp decline, whereas the second axis displayed a sharp increase to 1988, followed by a slight decrease and levelling off thereafter (Figure 2a). The canonical correlations illustrating the relationships between species and the first axis (Figure 3) indicate positive correlations for 15 of the 17 species, but only four of these were significant (p < 0.05; Brotula barbata, Citharus linguatula, Epinephelus aeneus, and Umbrina canariensis), corresponding to a main trend of decreasing cpue in recent years. The significant (p < 0.05) negative correlation for Mustelus mustelus indicates an opposite trend. In addition, four species were significantly (p < 0.05) and positively correlated with the second axis (Brachydeuterus auritus, C. linguatula, Pomadasys incisus, and Pseudopeneus prayensis). None of the correlations between the first two MAFA axes and the explanatory variables was significant.
|
|
The best DFA fits were obtained for non-lagged explanatory variables and a symmetric, non-diagonal matrix (Table 2). Of the models with no explanatory variables, the best fit was for three common trends (AIC = 740). When explanatory variables were included, AIC values were smaller, the model with two common trends and SST and UPW being the best (AIC = 573). The main common trend showed an increase to a maximum in 1993, followed by a decline, whereas the second common trend depicted a general decrease from 1982 to 2001 (Figure 2b). The two main common trends were similar to those for the first two MAFA axes (Figure 2a), especially for the second half of the period.
|
The estimated t-values for the regressions for individual species and SST and UPW, respectively, were relatively large for Diplodus bellottii (SST) and Brotula barbata (UPW), indicating strong relationships (Table 3). Relatively large diagonal elements of the error covariance matrix (R > 0.55) were obtained for M. mustelus, O. vulgaris, S. officinalis, Sparus caeruleostictus, and U. canariensis, indicating that these cpue series did not fit well. However, the relatively small covariance values (range: 0.19 to +0.42; with only 9 of 136 >0.3) indicate that the common pattern generally fitted well.
|
The biplot of factor loadings (Figure 4) shows that, with one exception (M. mustelus), all species are positively correlated with at least one of the two principal trends, often with two. Observed and fitted cpue series for the seven species best correlated with one of the two main common trends (B. auritus, B. barbata, C. linguatula, E. aeneus, Plectorhyncus mediterraneus, P. prayensis, and U. canariensis) and the one negatively correlated with both (M. mustelus) are given in Figure 5. For all 17 species, the observed cpue and fitted cpue were significantly positively correlated (
= 0.05), indicating that the model adequately described the trend, despite considerable interannual variability in cpue. With the exception of M. mustelus showing an increase in later years, all other species showed a marked decline in cpue during the second half of the period.
|
|
| Discussion |
|---|
|
|
|---|
The two techniques applied, MAFA and DFA, apparently provided coherent results, indicating that the two most important trends in the response variables are (i) a decline in cpue during the second half of the time-series, and (ii) a general decrease in cpue over time. What drove these trends is less clear, but the available evidence suggests that the environment (SST, and to a lesser extent UPW; the two factors are obviously related, in that a strong UPW should lead to lower SST) has been more important than fishing. The importance of the pelagic environment is not surprising, given the life history characteristics of the species studied (Table 1). Both octopus and cuttlefish live for about one year (Sobrino et al., 2002), and approximately half the fish species are relatively small, short-lived, and fast-growing, and recruit to the fishery in their first year of life. Moreover, most species spend part of their early life off the bottom, and thus are subjected to the pelagic environment. In the northwestern Mediterranean, catch and cpue of most fish species is significantly positively correlated with river flow and a wind-mixing index for lags <1 year (Lloret et al., 2001).
Notwithstanding, the index of trawl effort used may not be the best to characterize exploitation in the area. The cephalopod fishery is only one of several trawl fisheries operating in Mauritanian waters, and may not be relevant for some of the species caught in a survey covering a large area and depth range.
In conclusion, we have demonstrated two useful techniques for evaluating the state of multispecies fisheries, and the environmental and fisheries factors affecting them. The techniques have a number of advantages compared with other methods used to evaluate the effects of fishing on ecosystems (Rice, 2000), especially in terms of their ability to handle trends, explanatory variables, missing values, and interactions between trends, response, and explanatory variables.
| Acknowledgements |
|---|
We thank Alain Zuur for statistical advice.
| References |
|---|
|
|
|---|
-
Bianchi G., Gislason H., Graham K., Hill L., Jin X., Koranteng K., Manickchand-Heileman S., Payá I., Sainsbury K., Sanchez F., Zwanenburg K. (2000) Impact of fishing on size composition and diversity of demersal fish communities. ICES Journal of Marine Science 57:558571.
Bull B., Livingston M.E., Hurst R., Bagley N. (2001) Upper-slope fish communities on the Chatham Rise, New Zealand, 199299. New Zealand Journal of Marine and Freshwater Research 35:795815.[Web of Science]
Erzini K. (2005) Trends in NE Atlantic landings (southern Portugal: identifying the relative importance of fisheries and environmental variables. Fisheries Oceanography 14: (in press).
Hurrell J.W. (1995) Decadal trends in the North Atlantic Oscillation: regional temperatures and precipitation. Science 269:676679.
IMROP. (2002) Report of the 5th Stock Assessment Working Group for Mauritania, December 2002(IMROP Institut Mauritanien de Recherches Océanographiques et des Pêches, Nouadhibou, Mauritania) (draft version).
Lloret J. (2003) In CIESM J. (Ed.). Use of Box-Jenkins models to analyse time series of fisheries data in relation to environmental factors in the northwestern Mediterranean. Mediterranean Biological Time Series 3742 CIESM Workshop Monographs, 22.
Lloret J., Lleonart J., Solé I., Fromentin J-M. (2001) Fluctuations of landings and environmental conditions in the North-Western Mediterranean sea. Fisheries Oceanography 10:3350.[CrossRef][Web of Science]
McCullagh P. and Nelder J.A. (1992) Generalized Linear Models. Monographs on Statistics and Probability(Chapman and Hall, London) 37: 511 pp.
Pauly D., Christensen V., Walters C. (2000) Ecopath, Ecosim, and Ecospace as tools for evaluating ecosystem impact of fisheries. ICES Journal of Marine Science 57:697706.
Rice J.C. (2000) Evaluating fishery impacts using metrics of community structure. ICES Journal of Marine Science 57:682688.
Rochet M-J. (1998) Short-term effects of fishing on life history traits of fishes. ICES Journal of Marine Science 55:371391.
Rochet M-J. and Trenkel V.M. (2003) Which community indicators can measure the impacts of fishing? A review and proposals. Canadian Journal of Fisheries and Aquatic Sciences 60:8699.
Rothschild B.J., Smith S.G., Li H. (1996) The application of time series analysis to fisheries population assessment and modeling. In Gallucci V.F., Saila S.B., Gustafson D.J., Rothschild B.J. (Eds.). Stock Assessment: Quantitative Methods and Applications for Small-Scale Fisheries(CRC Press, Boca Raton, Florida) pp. 354402.
Sedik K.A. (1975) La circulation verticale des eaux dans la partie nord-est de l'Atlantique tropicale en septembre 1971mai 1972. Océanographie industriel des zones de l'océan Atlantique, Travaux ATLANTNIRO 61:319.
Sobrino I., Silva L., Bellido J.M., Ramos F. (2002) Rainfall, river discharges and sea temperature as factors affecting abundance of two coastal benthic cephalopod species in the Gulf of Cádiz (SW Spain). Bulletin of Marine Science 71:851865.[Web of Science]
Solow A.R. (1994) Detecting changes in the composition of a multispecies community. Biometrics 50:556565.[Medline]
Stergiou K.I. (1991) Short-term fisheries forecasting: comparison of smoothing, ARIMA and regression techniques. Journal of Applied Ichthyology 7:193204.
Stergiou K.I. and Christou E.D. (1996) Modelling and forecasting annual fisheries catches: comparison of regression, univariate and multivariate time series methods. Fisheries Research 25:105138.[CrossRef][Web of Science]
Stergiou K.I., Christou E., Petrakis G. (1997) Modelling and forecasting monthly fisheries catches: comparison of regression, univariate and multivariate time series methods. Fisheries Research 29:5595.[CrossRef][Web of Science]
Trenkel V.M. and Rochet M-J. (2003) Performance of indicators derived from abundance estimates for detecting the impact of fishing on a fish community. Canadian Journal of Fisheries and Aquatic Sciences 60:6785.
Zuur A.F., Fryer R.J., Jolliffe I.T., Dekker R., Beukema J.J. (2003) Estimating common trends in multivariate time series using dynamic factor analysis. Environmetrics 15:665685.
Zuur A.F. and Pierce G.J. (2004) Common trends in Northeast Atlantic squid time series. Journal of Sea Research 52:5772.[CrossRef]
Zuur A.F., Tuck I.D., Bailey N. (2003) Dynamic factor analysis to estimate common trends in fisheries time series. Canadian Journal of Fisheries and Aquaculture Sciences 60:542552.[CrossRef]
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||




