| Methods Home |
Sampling Procedures for Volunteer Programs Version: 06.19.2006 |
||||||||||||||||||||||||||||||||||||||||||||||||
| Introduction
Location What Waterbodies to Sample Depth Length of Sampling Season Sampling Frequency Temporal Sampling Strategies Statistical Accuracy and Volunteer Reality
Comments?
Suggestions? Please write us at
|
Portions of the following has been excerpted with permission from copyrighted material. Material in this section should not be reproduced without specific permission of the North American Lake Management Society (www.NALMS.org). The material should be cited as:
Carlson, R.E. and J.
Simpson. 1996. A Coordinator’s Guide to Volunteer Lake Monitoring
Methods. North American Lake Management Society. 96 pp.
|
||||||||||||||||||||||||||||||||||||||||||||||||
|
When many limnologists think about sampling design and sampling statistics, it sometimes seems that thoughts about the sampling design are perfunctory and statistics sometimes only become a concern only after the project is completed and the researcher is trying to make some sense of the data. Despite numerous warnings from statisticians and limnologists with some statistical background, statistical considerations seem to be the last concern of projects rather than the first. In volunteer monitoring projects, statistics may even more be shoved to the side as the control of lake, site, date, or even technique is removed from the coordinator's grasp. The coordinator is faced with the dilemma of obtaining meaningful data despite the variability inherent in volunteer monitoring. High quality data will greatly facilitate all subsequent predictive and diagnostic efforts. This chapter cannot be a complete review of sampling statistics, nor is another such review necessary. A number of manuals as well as statistics books exist that can give the coordinator information on sampling design. Manuals such as Gaugush (1986, 1987), Reckhow (1979), Reckhow and Chapra (1983) are specifically deal with the sampling of lakes and reservoirs. Bias: What Will Compromise Your Data? Statistics books put a great deal of emphasis on the concepts of accuracy, precision, and bias. Accuracy is the closeness of a measured or computed value to its true value, while precision is the closeness of repeated measurements of the same quantity (Sokol and Rolf (1981). Standard Methods (APHA, 1989) describes bias is a systematic error introduced into the estimate by either the method (site selection, sampling, storage, or analysis) or the use of the method (how the volunteer or the laboratory interprets the method). The goal of any sampling program should be to have low bias, high precision, and high accuracy. Precision is maximized by careful instructions to the volunteers, a standardized sample collection and storage procedure, minimization of the need for the volunteer to handle samples, and, of course, careful analytical laboratory procedures. Quality assurance (QA) is the set of operating principles that should be set up at the beginning of the program to guide both the volunteer and the laboratory in this pursuit of precision. Once a quality assurance plan is formulated, then quality control (QC) procedures can be formulated and used to assure that the volunteer and the laboratory are maintaining the necessary degree of precision. After the coordinator has done all that is possible to train the volunteers in sample collection, handling, and storage, and is assured that the laboratory can do the analyses carefully, then the problem will lie with the reduction of bias. Bias is perhaps the most invidious of the possible errors in a sampling/analysis program because it is the unseen, the unplanned, the unintentional error that sneaks into the program when everything seems to be so well-planned. Bias is that systematic error caused by a laboratory carefully, precisely, and with appropriate quality control procedures, using the inappropriate analytical procedure for total phosphorus or chlorophyll. Bias is the volunteer carefully, precisely, and with several replications, measuring Secchi depth off the end of their dock or in a bed of weeds. Bias is having a colorblind volunteer estimating the water color. Bias is depending on the data from volunteer-chosen lakes, rather than randomly-chosen lakes to be used as an indicator of a state's lake water quality. Bias is the error that can invalidate the data of the most well-planned monitoring program. Bias can be dealt with in two very different ways. The first technique attempts to eliminate the bias. The second, acknowledges that bias exists and shapes the expectation and interpretation of the data accordingly. Both techniques can be employed in volunteer monitoring programs. The best, and most difficult, method to deal with bias is to try to eliminate it. A great deal is known about sampling and analytical bias, and an informed coordinator can minimize the bias. Much of the discussion below deals with identifying forms of bias describing methods that can eliminate it. However, sometimes bias cannot be easily removed. In some instances, the roadblock to the elimination of bias is the cost of doing so. Characterization of a lake's water quality may require samples taken at numerous locations in the lake, but the cost of doing so would be too large. In this case, the coordinator must deal with the problem of too few samples at too few locations. In other instances, it is known that laboratory procedures produce data that is highly dependent on the method used. Accurate data may cost too much or may be impossible to attain under any circumstance. How does the coordinator deal with analytical bias? Probably one the first and most helpful techniques for maximizing the collection of accurate and useful data is to begin with a careful and thoughtful consideration of the goals of the project. Goals certainly help to define the expectations of the project, but, more importantly, also guide the coordinator in choosing appropriate sampling strategies. In some cases, the goals might suggest that a volunteer program is not appropriate. An example of goal-setting is the State of Vermont's lake monitoring program (Smeltzer, et al., 1989). Their stated goals were to: 1. Provide a perspective on the range of eutrophication conditions found in the state's lakes 2. Describe water quality conditions in individual lakes 3. Provide data useful in developing regional empirical eutrophication models 4. Detect trends of deteriorating water quality caused by nutrient pollution. Once goals are formulated, then the methods to best realize these goals can be devised. The possible biases accompanying each method can be listed and either eliminated, minimized, or managed. In the case of Vermont's programs, one bias was that lakes selected for the program were biased toward the large and more heavily used lakes within the state (Smeltzer, et al., 1989). The bias was recognized, could not be eliminated, but instead was included in the description of the sampled lakes. The program described the water quality in "Vermont's major recreational lakes," rather than "Vermont's lakes. Whether the initial selection bias was intentional or not, the program was able to provide useful information on the lakes thought to be of greatest importance to the State rather than on a more unbiased, yet perhaps less useful, estimate of the overall lake water quality in the state. There are three commonly-used sampling strategies: random, stratified random, and sequential (Fig. 1). A random sampling procedure requires that the investigator generate a sampling scheme using a random selection of the sample. The details of these techniques are described in many statistics books, and will not be re-described here. In limnology, a truly random sample would require a random selection of site, time, date, and/or depth (Table 1). A random sampling procedure is the simplest to analyze, but it does not account for known gradients within lakes, depths, or geographical regions, which may have differing mean values and variances. If gradients do occur, or regions of the state or lake might be best examined separately, then it is useful to use a stratified random sampling procedure. In this procedure, the pool of all possible samples (lakes, locations, or dates) are split into smaller, more homogeneous entities. A state may be divided into ecoregions, a lake divided according to areas such as main lake or embayment, depth into epilimnion and hypolimnion, and the year into seasons. Within each of these divisions, samples are then taken randomly.
A third sampling procedure is sequential sampling. In this case a starting point is chosen randomly, and samples are then taken a set intervals (distance, depth, or time) from that point (Cochran, 1977). Sequential sampling may or may not produce more accurate estimates than do random sampling, but in limnological sampling it is often the easiest. However, samples drawn in sequence may be autocorrelated; the next sample in the series may be predicted to some extent by the preceding sample. Autocorrelation poses a problem because samples are not be drawn independently and randomly from a pool of possible variable, and therefore, the statistical number of samples being taken is less than the actual number of samples (Cochran, 1977; Reckhow and Chapra, 1983). Autocorrelation may affect random and stratified sampling procedures as well. A third drawback of sequential sampling is that, as illustrated if Fig. 1A, if there are periodic fluctuations in the variable of interest, it is possible with sequential sampling to always sample the same part of the curve (Cochran, 1977). For example, chlorophyll may rise and fall regularly during a 24 hour period because the algae migrate towards the surface at night and descend during the day. If surface samples were taken every day at noon, the chlorophyll concentrations in the lake would be underestimated. If they were taken at midnight, the estimates would be exaggerated. If a single yearly sample were taken in December or June or August, different estimates of annual chlorophyll might be found. As illustrated in Fig. 1B, samples can be taken more frequently, so that over time, they sample all portions of the periodic function. In Fig. 1C, a random sampling design is also illustrated. In this case, an excellent estimate of the mean might be obtained over the course of several periods, but certain portions of the period may be missed. If this curve represented an annual chlorophyll curve, in some years there would be no samples taken at the highest months, thus under-representing chlorophyll in those years. An alternative to either the sequential or completely random design would be to use a stratified design. This design would allow for each period to be partitioned into relevant smaller periods, yet still provide the statistical rigor required. Table 1. Possible sampling strategies for lakes and reservoirs. The "Typical Sampling" scheme is not necessarily recommended.
Bias can creep into a monitoring program when lakes are chosen for the program. If the purpose of the program is to provide information on, for example, the trophic state of the state's lakes, then the lakes of the state must be sampled without bias; every lake must have the equal potential to be included in the program. The coordinator might wish to stratify the sampling within geographic, political, or ecological regions within the state, but, within each division, the lakes still must be sampled randomly. Rarely do volunteer monitoring programs have the luxury of choosing volunteers on randomly selected lakes. More often, the lakes are selected based on the availability of volunteers. It is quite possible that the lakes will not be chosen randomly, and, quite probably, will produce data that will not be unbiased estimates of the state's lakes. For example, volunteers may be more available for recreational lakes, for lakes where they have cabins, or for lakes that are easily accessible. Private lakes, lakes in state parks, small, shallow, or weedy lakes may be under-represented. A solution to this dilemma is to reexamine the goals in light of the realities of the volunteers and their lakes. The coordinator could either randomly match chosen lakes with volunteers and fill in the non-volunteer lakes with your agency staff or could change the goals of the program. Perhaps the expectations can be modified so that the recreational lakes of the state are adequately sampled. As a last resort, accept the fact that you are estimating the trophic state of a potentially biased set of volunteer-sampled lakes. The important consideration may not be that the data are biased, but how the data are interpreted and used once they are gathered. The location of the sampling point in a monitoring program is heavily dependent on the goals of the program. In the past, many limnologists would sample natural lakes at a single site over the deepest part of the lake. There is a good reason for using the point over the deepest part of the lake; it allows the sampling of every possible depth. However, using this point as the single site to characterize a lake assumes that there are no horizontal gradients in the lake; each depth at that single sampling site represents the conditions at that depth throughout the lake. As emphasized by Peters (1990), the use of this sampling site also ignores the characterization and influence of the littoral zone, which can constitute a significant percentage of the total lake area and volume. Peters also suggests that marked horizontal differences should occur over horizontal distances of only 1500 meters (0.9 miles) in a 15 meter deep lake. It is difficult to imagine the immensity of the influence of this single point, deep water sampling procedure on our perspective of lakes. It not only rejects the possibility of spatial gradients in the lake, but ignores the littoral zone entirely, both as a component and its functional role in the lake system. The characterization of reservoirs adds an entirely new dimension to the problem of selecting a sampling site. Many reservoirs are long and narrow, with a majority of the water flowing into the reservoir at a single point. Others are highly dendritic, with many bays and inlets, each often with their own contributing stream. Spatial differences in water quality in reservoirs are the rule, not the exception, and assumptions about homogeneity at a given depth stratum simply cannot be made. To attempt the characterization the entire reservoir by a single sample would be almost meaningless. Two different approaches have been suggested to characterize heterogeneous bodies of water. The first, recommended by Thornton, et al. (1982) and Gaugush (1987), suggests that sampling stations should be more numerous where concentration gradients are the highest, such as near stream and river inflows. Before initiation of a sampling program, the lake or reservoir would be sampled intensively throughout. Replicate surface samples are taken at each site, and a statistical multiple range test used to identify significant differences between station means. If the means between two different stations are not significant, then only a single sampling station would probably be sufficient. Where station differences are significant, separate stations would be established. In contrast, Walker (1987) suggests that sometimes these areas of greatest change, such as near inputs and in narrow embayments, do not constitute a large proportion of the reservoir's volume. More emphasis should be placed in sampling the areas of a reservoir where most of the volume can be found. Walker's approach would concentrate on sampling the bulk of the reservoir's volume. In a sense these differ rather than intensively sampling areas where greatest change occur. These two different methods of sampling reflect the need to characterize the entire surface area of the reservoir (or lake) versus the need to represent the condition of the bulk (not entirety) of the water body. A third approach does not require such extensive sampling effort. If the goal of the program is to monitor changes in water quality over seasons and years, then complete characterization of the lake or reservoir may not necessary. In this case, sampling at a single site might be appropriate. The important factor in single site monitoring is that the site be clearly marked, so that anyone, not only the initial volunteer, can find the site again. The sites should documented using either a satellite-based GPS device to establish the exact latitude and longitude or triangulation with permanent shoreline markers, such as houses, water towers, etc. Several sites should be monitored on large or dendritic lakes in order to keep track of local changes. Sampling at Depth Limnologists probably use one of several methods when sampling with depth: they may sample only at the surface or they may take a series of samples at pre-set intervals from the surface to the bottom of the lake (0 m, 1 m, 2 m, etc.). A "surface" sample can be taken just below the surface (0.5 meters or 1 foot) to avoid surface scums. Another approach is to take a single integrated sample with a tube or hose sampler from the surface to some pre-determined depth (euphotic depth or thermocline). All of these techniques have advantages and drawbacks, either because of limnological, logistical, or statistical considerations. The advantage of a single sample taken at the surface is that it requires only one analysis, an important consideration to a program on a budget. It also does not require any special sampling apparatus; the volunteer can just hold a sampling container under the surface of the water. Because it is simple and inexpensive, it is an attractive sampling technique for volunteer programs. Taking a single sample at the surface assumes that the lake, or at least the epilimnion, is completely mixed, and that the surface sample is an adequate estimator of the lake or epilimnetic concentration. In many cases, the assumption of a completely mixed body of water is incorrect. Many lakes have vertical gradients of temperature, oxygen, nutrients, and algae which will be missed by the surface sample. Most certainly, deeper lakes establish a thermal gradient in the summer and the lake will have at least two thermal regions, and, possibly, at least two chemical regions as well. Even in shallow, usually well-mixed lakes, chemical gradients will develop during calm periods. Gradients can even occur in the upper waters of lakes. Metalimnetic populations of algae will develop in many stratified lakes (Hanna and Peters, 1991; Knowlton and Jones, 1989). Blue-green algae also migrate deeper into the epilimnion during the day, and a surface sample would miss them. On the other hand, on calm days, blue-green algae may float to the surface, forming surface scums. If sampled on these days, the amount of phosphorus and chlorophyll would be highly exaggerated. Sampling at intervals through the entire water column is the standard alternative to the single surface sample. If sampled at sufficiently close intervals, the technique detects gradients within the water column. The technique is also used when constructing a nutrient budget to estimate the total content of a nutrient, such as phosphorus, in the water column. As it is commonly done, however, interval sampling violates one of the requirements of sequential sampling: it is usually started at the same depth each time. Sequential sampling require that the starting depth be randomly selected. For example, sampling might be chosen randomly to begin a 0 meters on one day and 0.4 meters on the next visit. If a randomly chosen starting point is not selected each time, there is the possible introduction of bias into the estimate. For example, the predetermined sampling points might continue to miss a thin, metalimnetic algal population or a near-sediment phosphorus or oxygen gradient. Interval sampling also requires a larger number of analyses than the single sample. The coordinator should consider whether the information gained , or the error minimized, by interval sampling can be justified by the added expense. The
hose or tube sampler collects and integrates a column of water into a single
sample. The volunteer lowers the weighted end of a plastic hose (even garden
hose has been used) to a given depth. The other end of the hose, which is in
the boat, is then corked. The weighted end pulled to the surface by an
attached line. If the top end of the hose is well-sealed, all or most of the
water will remain in the hose as it is raised. Once the weighted end is in the
boat, it is placed inside a sample container, the other end uncorked, and the
entire volume of the hose emptied into the container. A rigid tube can be used
instead of a hose, but a longer tube becomes cumbersome to lift into the boat,
and it is difficult, especially for a single person, to pour the sample into a
bottle. The hose sample has several disadvantages, especially for volunteer monitoring. Someone, probably the volunteer, is going to have to decide the maximum depth to which the sampler will be lowered. The hose could be lowered to the bottom of the lake, to the bottom of the epilimnion, to the bottom of the euphotic zone (1% light), or to a fixed depth. It makes little sense that these depths should be fixed from lake to lake nor necessarily constant from visit to visit. The volunteer would have to make a decision about the depth prior to lowering the sampler.
Table 2. Models that could be used to estimate thermocline depth (THER) based on maximum effective width (MEL), area (A) and shoreline length (L).
Where C(zn-zm) is the averaged concentration between depths zn and zm and V(zn-zm) is the volume of water in the same interval (Fig. 2). The bulk of the water is at the surface, therefore, the concentration of a substance at the surface usually contributes most to the average concentration in the lake. The volume of water in the metalimnion and hypolimnion is much less, and concentrations in this region should contribute less to the average concentration.
When a tube sampler is used, the concentrations at each depth are weighted equally rather than in proportion to the contribution of each volume to the total volume of the lake. This equal sampling increases the influence of the deeper concentrations on the calculated average concentration. If concentrations at these lower depths are much higher than at the surface, their contribution to the lake's average concentration is higher as well. Large metalimnetic chlorophyll peaks or hypolimnetic phosphorus concentrations would have an exaggerated impact on the estimate of the average concentration. There is no guarantee that a hose sampler lowered to the depth of the epilimnion or thermocline increases the accuracy of an estimate of epilimnetic or lake concentration. Increased accuracy will depend on the morphometry of the lake and the nature of the vertical distribution of the variable concentrations. If a program does not have the funds to sample each depth independently, yet considers that the hose sampler inaccurately measures the average concentration, an alternative would be to have the volunteer produce a composite sample. This would entail having the volunteer pour a volume of sample proportional to the volume of water at each depth into a sample container. The resulting concentration in the sample container should be very close to the average lake concentration. However, the volume of each stratum would have to be known, and the volunteer would have to carefully measure volumes appropriate for each depth. This may be more than can be expected from the average volunteer. Several studies have evaluated the effect of vertical sampling strategies on estimates of lake concentrations. Hanna and Peters (1991) compared several strategies, including weighted-composite samples to the trophogenic depth (estimated as 2X Secchi depth), euphotic depth (estimated as1% light), and maximum depth, hose samples to the epilimnetic and trophogenic depths, and subsurface samples. They found that subsurface samples had the lowest phosphorus concentrations, epilimnetic hose samples somewhat higher, samples of the trophogenic and euphotic still more, and water column composites (total water column) the highest. In their lakes the maximum difference of the means was only slightly more than 20%. Chlorophyll concentrations were also smallest in the subsurface samples, higher in the epilimnetic hoses, higher yet in the volume-weighted samples, and highest in the hose samples of the trophogenic zone. Although there were differences between techniques, the inter-lake differences were even greater, meaning that any of the techniques could distinguish between lakes. However, they thought that subsurface samples yielded low values and were subject to greater daily variation and therefore should be avoided in inter-lake surveys. They recommended the use of the hose sample over integrated composite samples because the hose sample requires less effort. They recommended using 2X Secchi depth because such samples represent lake productivity better in principle, although they could not demonstrate this advantage in practice.Knowlton and Jones (1989) compared subsurface (10 cm) samples with samples continuously drawn with a pump from the surface to the oxycline, or, when no oxycline was present, from the surface through the entire mixed layer. Differences between surface and composite samples in chlorophyll and total phosphorus were usually small, but differences increased when phytoplankton formed subsurface layers. Chlorophyll concentrations tended to increase with depth, seldom being maximal at the surface. However, they considered the error obtained from differences between the two sampling techniques to be much less than temporal variation and the sampling technique would add negligible error in inter-lake surveys. The difference between this study and that of Hanna and Peters (1991) may be the stability of the algal layers. Knowlton and Jones found subsurface chlorophyll layers to be the major cause of differences between the techniques, but these layers were transient, common only on calm days. Metalimnetic maxima were also transient, occurring infrequently in the lakes. More importantly, the concentrations in the peaks were less than 20% higher than surrounding layers. It may be that metalimnetic peaks accounted for some of the differences in found by Hanna and Peters (1991). In their lakes the peaks may have been more prevalent and more stable. This would explain why hose samples actually had higher chlorophyll concentrations than did weighted composite samples. The final word on the best method to sample with depth has yet to be written. Surface samples are the easiest to measure and require the least equipment. However, they may be more variable and under-represent the concentrations in the lake. Hose samplers are marginally more expensive and may sample more of the water column, thus reducing variability caused by vertical heterogeneity. However, if high concentrations are found deeper in the sampled water column, the hose sampler may over-represent the epilimnetic or lake concentration. Also, there is no agreement as to the appropriate depth the hose should be lowered. A volume-weighted composite sample may be best, but it requires the knowledge of the lake volume at each stratum and a great deal more effort on the part of the volunteer. Deciding on when, how long, and how frequently it is necessary for the volunteers to take samples is as fraught with statistical decisions was the case of deciding on sampling location and depth. There seems to be no statistically acceptable method that coincides well with the realities of using volunteers to collect samples. Because of this, some compromises may have to be made. These are discussed below. Length of Sampling Season The length of time that the volunteers are asked to sample is dependent on the goals of the program. If for example, the goal is to obtain annual mean concentrations or to examine the annual fluctuations in the target variables, then the calendar year is the appropriate duration of the sampling season. If ice-free or summer mean concentrations are what is desired, then the sampling period becomes obvious. If a whole-lake springtime phosphorus concentration is needed for a nutrient budget or to calibrate a loading model, then sampling should include the period of spring turnover. If that spring turnover total phosphorus concentration were used in a model to predict summer chlorophyll (Dillon and Rigler, 1973), then calibrating that model would require summer chlorophyll concentrations as well. It may be a little more difficult to select a sampling season if the goals are to determine concentrations or parameters that may not be dependent on a certain season, i.e. maximum concentrations. Peters (1990) points out that the typical limnological preoccupation with the summer season may miss spring and fall peak chlorophyll concentrations, which, according to Marshall and Peters (1989) may be higher than peak concentrations found in the summer (Fig. 2), especially in eutrophic lakes. Marshall and Peters (1989) point out that most models predicting mean chlorophyll concentrations ignore the temporal distribution of chlorophyll. The result is that when such models are used, the resulting predictions may be irrelevant to the program goals. Does the program need to know the annual or seasonal mean of chlorophyll or phosphorus, or is the peak concentration of more interest? Will a single number (the mean) be all that is used of the volunteer data, or are seasonal fluctuations of importance as well? Does it matter to the goals of the program that the maximum comes in April or is the peak concentration in July or August of more interest? There is no single recommendation that can be made regarding sampling duration. The duration depends on the goals of the program. The better these goals are defined, the easier it will be for the coordinator to choose an sampling period. Once this is done, and the ideal period chosen, probably the reality of volunteer cooperation and capability should be considered. An annual mean necessitates that someone must visit the lake throughout the calendar year. Will volunteers be willing to do so? In the northern states, being on the lake in the winter or early spring or late fall can be cold and dangerous; some volunteers may not launch their boats or go to their cabins until May or June. The coordinator may have to modify the program expectations to incorporate the schedules of the majority of the volunteers. Sampling Frequency The frequency of sampling is another important consideration in the design of the volunteer program. If the coordinator has the volunteers sample too infrequently, then there is a good chance that the variation will be so large that it will be impossible to statistically identify differences between lakes or between sampling years. If the volunteers sample too frequently, their enthusiasm may wane and the added statistical reliability may not be justified. Oversampling will also waste money that could have been used to fund more lake monitoring. When sampling frequency translates into dollars and volunteer time, it becomes necessary to balance the need for statistical confidence against the monetary and human costs incurred in obtaining that level of confidence. Most volunteer programs probably have as one of their goals the estimation of the annual or seasonal statistic of central tendency (mean or median) for each of the monitored lakes. Probably this statistic will be used to compare lakes and to monitor year-to-year changes in each lake. In order to do either, a sufficient number of samples must be taken so that a reliable estimate of the true central tendency can be obtained. The term "reliable" in this case is relative; it depends on the level of uncertainty that the coordinator will accept. The less uncertainty (the more precision) desired, the more samples will have to be taken. There are two methods that have been used to estimate the number of samples needed in a monitoring program; both are discussed in Marshall et al. (1988). The first, also discussed in Gaugush, (1987), Reckhow and Chapra, (1983), and Thornton et al., (1982), uses the equation
Where n = number of samples The term d can also be written as r, where r represents the desired precision as expressed as a fraction and , the mean of the sample (Gaugush, 1987). For example, if the mean of the chlorophyll concentration was estimated at 20 ug/L, and the desired level of precision is + 10 percent, d would have the value 0.2 (0.1 x 20). The equation then looks like
To use this equation, the coordinator first decides what level of precision is desired, and have some estimate of the mean and variance of the target variable. The desired precision in most experiments is commonly 5%, but to obtain this level of precision is very difficult in limnological monitoring programs without a very large number of samples. More often, values of 10 or 20% are found to be more possible (Gaugush, 1987; Marshall, et al., 1988). There are several commonly used methods used for estimating the mean and variance of a particular variable (Gaugush, 1987). First, the coordinator can conduct a pilot study, gathering sufficient data to estimate mean and variance. Second, the results of previous studies on the lake or on similar lakes can be used. The third method, according to Gaugush (1987), would be to make an educated guess. A fourth method has been reported by Marshall, et al., (1988), for chlorophyll and by France and Peters (1992) for total phosphorus. This method relies on the reported relationship between the log variance of an annual or seasonal sample and the log mean value of the same sample (Walker, 1985). Marshall, et al., (1988) report the relationship between variance (s2) and mean () for chlorophyll to be log(s2) = =0.53 + 2.10 log for annual data, and log(s2) = = -1.03 + 2.50 log for seasonal data. France and Peters (1992) report the relationship for total phosphorus to be log(s2) = = -0.563 + 1.65 log for seasonal data (April - October). Marshall, et al., (1988) used these values in Equation (2) above to obtain estimates of the variance based solely on the mean value. Knowledge of the mean is still necessary, but again it can be estimated from similar lakes or previous studies, or from information on Secchi depth which can be related to total phosphorus or chlorophyll. The most difficult part of estimating n is the determination of t, the value from the Student's t distribution for the degrees of freedom (n-1). This value depends on a knowledge of n, the value you are seeking. Therefore an iterative approach, where an initial guess at the value of n is used to obtain t, and then this calculated value on n is used to estimate t, and so on until n converges on the best estimate of n (Thornton, et al., 1982; Gaugush, 1987). Marshall, et al., (1988) estimates that 10 observations per year would be needed to estimate the seasonal mean of chlorophyll with a coefficient of variation (r) of 20 percent. Between 30 and 40 observations would be needed to obtain a coefficient of variation of 10 percent. Total phosphorus has a lower variance around the mean than does chlorophyll, and fewer samples would be needed (Knowlton, et al., 1984; France and Peters, 1992). However, if a relationship between chlorophyll and phosphorus is desired, then the larger number of samples might be desirable for each. Since the variance apparently increases as the mean of the sample increases, oligotrophic lakes would not need as many samples taken as would a eutrophic lake (Marshall, et al., 1988). Temporal Sampling Strategies Sampling the lake over the year could be done by randomly pre-selecting sampling dates for the volunteers. Having the volunteers sample the entire calendar year may have some virtues, but the needs of most programs, at least in the northern states, might consider using the ice-free period as more important and logistically possible. Within the summer or ice-free period, there is rarely a random assortment of possible concentrations and conditions. Daylength, temperature, and, in some lakes, algal chlorophyll increases to a maximum and then decreases. Inputs of phosphorus may he highest during spring runoff and these may translate into higher lake concentrations in the spring. On the other hand, internal loading in some lakes may cause a continual increase in nutrient concentrations throughout the summer. Because of these non-random fluctuations in limnologically-interesting variables, a simple random selection of sampling dates during the summer may not adequately sample these systematic fluctuations. It may be more appropriate to stratify the sampling dates or to use a sequential sampling technique. Stratification could be done within months (June, July, etc.), within limnological periods (Ice out to stratification, Stratification to turnover, turnover to ice-over) or hydrologic periods (high flows in the spring, summer, low flow inputs in the summer, autumnal high flows). In any of these cases, the coordinator would supply the volunteer with a list of randomly-selected sampling dates within each period, and the volunteer would be expected to sample on that date, rain or shine. If a sequential sampling technique were used, the coordinator would first decide on the period of interest and on an appropriate time segment (7 days, 14 days, 30 days, etc) with which to divide that period. An initial sampling day would be chosen randomly from the first time segment. Sampling would be initiated on that randomly-chosen date and at equal time segments thereafter. For example, a coordinator might choose to sample every 15 days from May 1 to September 30. To do so, a date would be chosen at random between May 1 and May 15 (a 15 day interval). If May 6 were chosen, sampling would commence on May 6 and then at 15 day intervals (May 21, June 5, etc.) throughout the summer. New starting dates would be chosen for each lake or volunteer. A sequential sampling program is the easiest to organize because the volunteer would know when they sample, but since the initial sampling times are randomly pre-determined, it might mean that the volunteer would have to sample every Thursday, even though they go to the lake only on weekends. Although a random or a sequential design could be designed for a volunteer program, the reality may be that it will be difficult to get volunteers to agree to go out to sample on the assigned days. On some lakes it may be dangerous to go out on days when the boat traffic is highest. Certainly no volunteer should go out if there is a threat of lightening or high winds. Rainstorms may discourage others. Volunteers may only go to their cabins on weekends, and sampling would therefore be limited to certain dates. Any of these instances would violate one or more assumptions of the sampling regime, and therefore increase the possibility of biased estimates. It is the question of bias that should be of concern. The goal of the program should be to obtain the least unbiased estimates of the condition of the lake. A sampling date chosen by the volunteers could introduce bias. For example, if a lake is only sampled on a weekend, boat traffic may temporarily increase turbidity, and therefore lower Secchi depths would be recorded. On the other hand, increased numbers of visitors at the lake during the weekend might increase nutrient inputs from septic tanks, which, in turn, might translate into higher algal densities during the next week. In the first instance sampling, for example, every Wednesday would have a larger Secchi depth than sampling on Sunday; in the other, it might be smaller. Consider the validity of comparing the chlorophyll concentrations in two lakes, one sampled on Sunday, the other on Wednesday. Without evidence that there are no regular weekly fluctuations, the comparison could be invalid. In some lakes, sampling during or after a rainstorm may produce Secchi depths and nutrient concentrations altered by inputs of silt and clay from runoff. Three days later, the lake may be clear or it may have higher algal populations. Statistically valid sampling techniques assume that each of the events has a equal chance of being sampled, and each event contributes equally to the final estimate of central tendency. A range of temporal sampling strategies seem possible. At one extreme, the coordinator provides a rigid sampling scheme that is statistically correct, either using a random, a stratified random, or a sequential sampling design. Volunteers would be required to adhere to the sampling regime in order to be a part of the program. Although it might sound rigid, it would provide the needed statistical rigor. The other extreme would be to ignore the whole business of sampling strategy, hoping that there is sufficient randomness and minimum bias in the volunteer sampling that the numbers will still approximate the population mean. For example, volunteers could be urged to sample twice a month, suggesting that the two dates not be close together. No more guidance would given as to possible sampling dates, relying on the judgment of the volunteer. Between these two extremes is the possibility that the coordinator could establish a sampling "concept," in which volunteers attempt to sample either within certain time periods. For example, volunteers could be asked to sample within the 2 week period, July 1 to 15, or plus or minus a given number of days (June 1 + 1 week). The central day or the time period would be chosen at random by the coordinator. The actual sampling date would not be randomly selected, but selected at the convenience of the volunteer. This scheme would provide some random structure to the sampling regime, but allow some flexibility to the volunteer. It should be obvious that the first extreme may be too rigid for a volunteer program and the other too haphazard. The third still may contain the weekly biases mentioned above. If, however, an effort was made to determine if there were or were not weekly sampling biases, the procedure might have more validity. It could be suggested to the volunteers that they measure Secchi depth several times in a single week. From these multiple measurements, some estimate of within-week variability might be obtained. If there was a strong weekly pattern, then some accommodation may have to be made. If not, then there would be some confidence that the sampling "concept" may also be statistically valid. In the case of sampling during or after rainstorms, it may be that it would be easier to filter the data after collection rather than to urge the volunteer to endanger themselves adhering to a rigid schedule. Volunteers could be urged to write the weather conditions for the past 3 days on their note card. When the data are analyzed, the coordinator could test as to whether prior precipitation had an effect on the Secchi depth or any other variable. Then, if necessary, the coordinator could eliminate all data in which precipitation occurred within a 24, 48, or 72 hour period prior to sampling. Reconciling Statistical Accuracy With Volunteer Reality This chapter has attempted to deal with the problems of obtaining statistically reliable data from a volunteer program. In a real sense, the problems discussed in the chapter are not limited to volunteer programs, but are commonly encountered in professional limnological investigations. Solutions vary, but too often the solution is to ignore rather than to deal with the problems. Using volunteers instead of trained professionals only adds to the problem because it is more difficult to control when, where, or how samples are taken. The coordinator must rely on non-professional eyes to avoid non-normal situations and on non-professional integrity that shortcuts in technique, location, or sampling dates are not used. The volunteer is being asked to shoulder a considerable burden, and may not fully realize the importance of that burden. The solutions offered in this chapter often center around two themes: that statistical validity is important, and that the realities of volunteer sampling must be recognized. If the coordinator and the volunteer both realize that it is important to sample on certain days, at certain sites, and careful, repeatable, defined procedures, then a first step has been made towards statistical validity of the data. The second step is made when the coordinator recognizes, and deals with, the known or potential biases included in the sampling procedures used by the volunteers. The second theme reduces to one of acknowledging the potential biases remaining in procedures. In the same sense that one would not make pronouncements about the annual mean concentration of phosphorus based on an intensive summer sampling procedure, one should recognize that non-randomly selected lakes do not necessarily allow unrestricted statements about the condition of the state's lakes. Samples taken at the surface may or may not represent the average content of the lake, and samples taken at a single point do not necessarily represent the entire extent of variation within the lake. One can, however, make statements regarding the average summer concentration of phosphorus at the surface at a location near the center of the lake. If that station is well-marked and could be returned to repeatedly, then weekly, monthly, or yearly fluctuations of the surface water phosphorus at that site could be measured and compared. 1. The coordinator should clearly and completely define the goals of the monitoring program. These goals should be used to guide decisions as to sampling times, locations, etc. 2. It may be that availability of volunteers, rather than the coordinator's needs and desires, will decide the lakes that are sampled and the locations and times that samples are taken. The coordinator should carefully consider the biases introduced by these non-random elements in the sampling program. Efforts should be made to eliminate biases, filter the data, or assure oneself that the amount of bias introduced is not significant, relative to the needs of the program. 3. At least one permanent sampling site should be chosen for each lake. This site should be over the deepest portion of the reservoir or lake, so that vertical samples can be taken of the entire water column. This site should be well-marked, so that it could be used in the future by other volunteers or professionals. This primary site may not be used to characterize the entire lake basin, but can serve as a point of comparison from year to year. 4. Volunteers should be urged to sample at more than one site. In dendritic or flow-through reservoirs or lakes, samples should be taken at several sites. These sites should also be well-marked. 5. No recommendation is made as to how to best sample a water column. Surface samples are inexpensive but may have high variability and underestimate epilimnetic and lake concentrations. Hose samples may integrate the waters of the lake, but may cause overestimation of lake concentrations. A volume-weighted composite sample or individual samples at several depths may be the best method, but may involve too much labor and laboratory expense for volunteer programs. 6. The seasonal period of sampling may depend on the goals of the project and the latitude of the sample lakes. For many purposes, seasonal or ice-free sampling seasons may be acceptable. Shorter sampling periods may reduce variability, but also reduce information on seasonal variation. 7. It is recommended that some random structure be given to deciding on sampling days, while still giving the volunteers some degree of freedom for selecting convenient days. Final sampling schemes can be tested for randomness. Within-week sampling bias should be considered prior to analysis of the data and the data filtered if necessary. American Public Health Association. 1989. Standard Methods for the Examination of Water and Wastewater. 17th Edition. APHA. Cochran, W.G. 1977. Sampling Techniques. John Wiley and Sons. Davies-Colley, R.J. and W.N. Vant. 1988. Estimation of optical properties of water from Secchi disk depths. Wat. Res. Bull. 24: 1329-1335. Dillon, P.J. and F.H. Rigler. 1973. The phosphorus-chlorophyll relationship in lakes. Limnol. Oceanogr. 17: 767-773. France, R.L. and R.H. Peters. 1992. Temporal variance function for total phosphorus concentration. Can. J. Fish. Aquat. Sci. 49: 975-977. Gaugush, R.F. 1986. Statistical methods for reservoir water quality investigations. Instruction Report E-86-2, U.S. Army Engineer Waterways Experiment Station, Vicksburg, Miss. Gaugush, R.F. 1987. Sampling design for reservoir water quality investigations. Instruction Report E-87-1, U.S. Army Engineer Waterways Experiment Station, Vicksburg, Miss. Hanna, M. Evaluation of models predicting mixing depth. Can J. Fish. Aquat. Sci. 47: 940-947. Hanna, M. and R.H. Peters. 1991. Effect of sampling protocol on estimates of phosphorus and chlorophyll concentrations in lakes of low to moderate trophic status. Can. J. Fish. Aquat. Sci. 48: 1979-1986. Knowlton, M.F. and J.R. Jones. 1989. Comparison of surface and depth-integrated composite samples for estimating algal biomass and phosphorus values and notes on the vertical distribution of algae and photosynthetic bacteria in midwestern lakes. Arch. Hydrobiol./Suppl 83. 2: 175-196. Hakanson, L. 1981. A manual of lake morphometry. Springer-Verlag. Hutchinson, G.E. 1957. A Treatise on Limnology. Vol. I. John Wiley and Sons, Inc. New York. Lind, O.T. 1979. Handbook of Common Methods in Limnology. St. Louis: Mosby. Marshall, C.T. and R.H. Peters. 1989. General patterns in the seasonal development of chlorophyll a for temperate lakes. Limnol. Oceanogr. 34: 856-867. Marshall, C.T., a. Morin, and R.H. Peters. 1988. Estimates of mean chlorophyll-a concentration: precision, accuracy, and sampling design. Water Res. Bull. 24: 1027-1034. Osgood, R.A. 1988. Lake mixis and internal phosphorus dynamics. Arch. Hydrobiol. 113: 629-638 Peters, R.H. 1990. Pathologies in limnology. In: de Bernardi, R., G. Giussani, and L. Barbanti (Eds). Scientific Perspectives in Theoretical and Applied Limnology. Mem. Ist. Ital. Idrobiol. 47: 181-217. Reckhow, K.H. 1979. Quantitative techniques for the assessment of lake quality. U.S. E.P.A. Office of Water Planning and Standards. EPA-440/5-79-015. Reckhow, K.H. and S.C. Chapra. 1983. Engineering Approaches for Lake Management. Volume 1: Data Analysis and Empirical Modeling. Butterworth Publishers (Ann Arbor Science). Thornton, K.W., R.H. Kennedy, A.D. Magoun, and G.E. Saul. 1982. Reservoir water quality design. Water Res. Bull. 18: 471-480. Smeltzer, E., V. Garrison, and W.W. Walker, Jr. 1989. Eleven years of lake eutrophication monitoring in Vermont: a critical evaluation. Enhancing State's Lake Management Programs. 1989: Walker, W.W. Jr. 1985. Statistical bases for mean chlorophyll-a criteria. J. Lake Reserv. Manag. 4: 57-62. Walker, W.W. Jr. 1987. Empirical methods for prediction eutrophication in impoundments. Report 4. Phase III: Applications Manual. Technical Report E-81-9. US Army Engineer Waterways Experiment Station, Vicksburg, Mississippi.53-62.
|
|||||||||||||||||||||||||||||||||||||||||||||||||