David B. Herbst, Erik L. Silldorff
Journal of the North American Benthological Society 25 (2), 513-530, (1 June 2006) https://doi.org/10.1899/0887-3593(2006)25[513:COTPOD]2.0.CO;2
KEYWORDS: bioassessment, impairment detection, methods comparison, metric precision, multimetric IBI, RIVPACS, Sierra Nevada, Stream macroinvertebrates
Regional bioassessment programs of states, various federal agencies, and other governmental and private groups often use different methods to collect and analyze stream invertebrate samples. This lack of uniformity has created concern and confusion over the comparability of disparate sources of data, but few studies have attempted to evaluate differences in performance between methods or to reconcile the results produced from different methods. We conducted concurrent sampling at 40 sites in the eastern Sierra Nevada of California using 3 bioassessment methods to obtain directly comparable data sets. The riffle-based methods (University of California Sierra Nevada Aquatic Research Laboratory [UC-SNARL, Lahontan Water Board], California Stream Bioassessment Protocol, and US Forest Service Region 5) differed at each stage from field sample collection to laboratory processing and data analysis. We used a performance-based methods system to compare precision, uniformity, discrimination, accuracy, and correlations among multimetric Index of Biotic Integrity (IBI) scores and multivariate River Invertebrate Prediction and Classification System (RIVPACS)-type observed/expected (O/E) ratios. Reference and test sites were identified using local and upstream-watershed disturbance criteria, and invertebrate community measures and models were then developed to discriminate between reference and test sites. The more-intensive UC-SNARL method showed slightly, but consistently, greater sensitivity for discriminating impairment than the other 2 methods. The UC-SNARL method produced greater differences between reference- and test-site means relative to lower reference-site standard deviations than the other 2 methods. However, assessment scores were highly correlated among methods and distinguished reference from test sites with similar accuracy among methods despite the slight differences in performance. Our results show that differing bioassessment methods can yield very similar, effective discrimination of impaired biological condition even though they have multiple differences in field and laboratory protocols (mesh size, replication, area sampled, taxonomic resolution, total counts). Moreover, this conclusion did not depend on the approach taken to data analysis because both multimetric IBIs and multivariate RIVPACS-type O/Es were in close agreement. Methodological uniformity is important when coordinating monitoring programs, but our results suggest that data from multiple sources could potentially be used interchangeably and for cross-validation of assessments of stream biological integrity.