| Sign In to gain access to subscriptions and/or personal tools. |
An investigation of new toxicity test method performance in validation studies: 1. toxicity test methods that have predictive capacity no greater than chanceThe Gillette Company, Gillette Environment, Health & Safety, Needham, Massachusetts 02492, USA; Gillette Environment, Health & Safety, 37 A Street, Needham, Massachusetts 02492, USA; leon_bruner{at}gillette.com
The Procter & Gamble Company, Miami Valley Laboratories, Cincinnati, Ohio, USA
The Institute for In Vitro Sciences, 21 Firstfield Road, Gaithersburg, Maryland 20878, USA An approach commonly used to measure new toxicity test method (NTM) performance in validation studies is to divide toxicity results into positive and negative classifications, and then identify true positive (TP), true negative (TN), false positive (FP) and false negative (FN) results. After this step is completed, the contingent probability statistics (CPS), sensitivity, specificity, positive predictive value (PPV), and negative predictive value (NPV) are calculated. Although these statistics are widely used and often the only statistics used to assess the performance of toxicity test methods, there is little specific guidance in the validation literature on what values for these statistics indicate adequate performance. The purpose of this study was to begin developing data-based answers to this question by characterizing the CPS obtained from an NTM whose data have a completely random association with a reference test method (RTM). Determining the CPS of this worst-case scenario is useful because it provides a lower baseline from which the performance of an NTM can be judged in future validation studies. It also provides an indication of relationships in the CPS that help identify random or near-random relationships in the data. The results from this study of randomly associated tests show that the values obtained for the statistics vary significantly depending on the cut-offs chosen, that high values can be obtained for individual statistics, and that the different measures cannot be considered independently when evaluating the performance of an NTM. When the association between results of an NTM and RTM is random the sum of the complementary pairs of statistics (sensitivity+ specificity, NPV+PPV) is approximately 1, and the prevalence (i.e., the proportion of toxic chemicals in the population of chemicals) and PPV are equal. Given that combinations of high sensitivitylow specificity or low specificityhigh sensitivity (i.e., the sum of the sensitivity and specificity equal to approximately 1) indicate lack of predictive capacity, an NTM having these performance characteristics should be considered no better for predicting toxicity than by chance alone.
Key Words: correlation negative predictive value positive predictive value prediction interval prevalence random association sensitivity specificity toxicity test battery toxicity test performance measurement toxicity test validation validation
Human & Experimental Toxicology, Vol. 21, No. 6,
305-312 (2002) This article has been cited by other articles:
|
|||||||||||||||||||||||||||
