Sensitivity and specificity

Calculations

Two-by-two table for a diagnostic test
		Disease
		Present	Absent
Test result	Positive	Cell A	Cell B	Total with a positive test
	Negative	Cell C	Cell D	Total with a negative test
		Total with disease	Total without disease

Sensitivity and specificity

{\mbox{Sensitivity of a test}}=\left({\frac {\mbox{Total with a positive test}}{{\mbox{Total }}without{\mbox{ disease}}}}\right)=\left({\frac {\mbox{Cell A}}{{\mbox{Cell A}}+{\mbox{Cell C}}}}\right)

{\mbox{Specificity of a test}}=\left({\frac {\mbox{Total with a negative test}}{{\mbox{Total }}without{\mbox{ disease}}}}\right)=\left({\frac {\mbox{Cell D}}{{\mbox{Cell B}}+{\mbox{Cell D}}}}\right)

Predictive value of tests

The predictive values of diagnostic tests are defined as "in screening and diagnostic tests, the probability that a person with a positive test is a true positive (i.e., has the disease), is referred to as the predictive value of a positive test; whereas, the predictive value of a negative test is the probability that the person with a negative test does not have the disease. Predictive value is related to the sensitivity and specificity of the test."^[2]

{\mbox{Positive predictive value}}=\left({\frac {{\mbox{Total }}with{\mbox{ disease and a positive test}}}{\mbox{Total with a positive test}}}\right)=\left({\frac {\mbox{Cell A}}{{\mbox{Cell A}}+{\mbox{Cell B}}}}\right)

{\mbox{Negative predictive value}}=\left({\frac {{\mbox{Total }}without{\mbox{ disease and a negative test}}}{\mbox{Total with a negative test}}}\right)=\left({\frac {\mbox{Cell D}}{{\mbox{Cell C}}+{\mbox{Cell D}}}}\right)

Summary statistics for diagnostic ability

While simply reporting the accuracy of a test seems intuitive, the accuracy is heavily influenced by the prevalence of disease.^[3] For example, if the disease occurred with frequency of one in one thousand, then simply guessing that all patients do not have disease will yield an accuracy of over 99%, whereas if the disease frequency were 999 in one thousand, the same guess would yield an accuracy near 1%.

With the arrival of many biomarkers that may be expensive diagnostic tests, much research has addressed how to summarize the incremental value of a new expensive test to existing diagnostic methods.^[4]^[5]^[6]

Area under the ROC curve

For more information, see: Receiver operating characteristic curve.

The area under the receiver operating characteristic curve (ROC curve), or c-index has been proposed. The c-index varies from 0 to 1 and a result of 0.5 indicates that the diagnostic test does not add to guessing.^[7] Variations have been proposed.^[8]^[9]

Bayes Information Criterion

The Bayes Information Criterion has been proposed by Schwarz in 1978.^[10]

Diagnostic odds ratio

The diagnostic odds ratio (DOR) is based on the likelihood ratios.^[11]

Whereas the likelihood ratio is:^[12]

{\text{Likelihood ratio}}={\frac {\mbox{probability of test result with disease}}{\mbox{probability of same result without disease}}}

The diagnostic odds ratio is:^[12]

{\text{Diagnostic odds ratio}}={\frac {\mbox{odds of test result with disease}}{\mbox{odds of same result without disease}}}

Or the diagnostic odds ratio is:

{\text{Diagnostic odds ratio}}={\frac {\mbox{Likelihood ratio +}}{\mbox{Likelihood ratio -}}}

For example:

If the sensitivity and specificity are 95% and 80%, respectively (or vice versa) then the DOR = 71.
If the sensitivity and specificity are both 95%, then the DOR = 361.

"The DOR ranges from 0 to infinity, with higher values indicating better discriminatory test performance. A value of 1 means that a test does not discriminate between patients with the disorder and those without it... The DOR does not depend on the prevalence of the disease."^[11]

Sum of sensitivity and specificity

This easy metric is called the Gain in Certainty:^[13]

{\mbox{Gain in Certainty}}=\left({\mbox{sensitivity}}+{\mbox{specificity}}\right)

It varies from 0 to 2 and a result of 1 indicates that the diagnostic test does not add to guessing.

Similarly, Youden's J index (J*), is:^[14]

{\text{Youdens index}}=\left({\mbox{sensitivity}}+{\mbox{specificity}}\right)-1

The index is derived from:

{\text{Youdens index}}=1-\left({\mbox{false positive rate}}+{\mbox{false negative rate}}\right)

Predictiveness curve

A graph of the predictiveness curve has been proposed.^[15]

Proportionate reduction in uncertainty score

The proportionate reduction in uncertainty score (PRU) has been proposed.^[16]

Integrated sensitivity and specificity

This measure has been proposed as an alternative to the area of the the receiver operating characteristic curve.^[17]

Reclassification tables

This measure has been proposed as an alternative to the area of the the receiver operating characteristic curve.^[4]^[17] This method allows calculating a 'reclassification index' or 'reclassification rate', or 'net reclassification improvement' (NRI)^[17]

The clinical net reclassification improvement (CNRI) is a variation that is the NRI only for the subjects at intermediate risk of disease.^[6]

Sequential scoring

Sequential scoring has been proposed in order to isolate the effect of a new, expensive diagnostic test.^[18]

Threats to validity of calculations

Various biases incurred during the study and analysis of a diagnostic tests can affect the validity of the calculations. An example is spectrum bias.

Poorly designed studies may overestimate the accuracy of a diagnostic test.^[19]

References

↑ National Library of Mediicne. Sensitivity and specificity. Retrieved on 2007-12-09.
↑ National Library of Mediicne. Predictive value of tests. Retrieved on 2007-12-09.
↑ Harrell FE, Califf RM, Pryor DB, Lee KL, Rosati RA (May 1982). "Evaluating the yield of medical tests". JAMA 247 (18): 2543–6. PMID 7069920. ^[e]
↑ ^4.0 ^4.1 Cook NR, Ridker PM (June 2009). "Advances in measuring the effect of individual predictors of cardiovascular risk: the role of reclassification measures". Ann. Intern. Med. 150 (11): 795–802. PMID 19487714. ^[e]
↑ Cornell J, Mulrow CD, Localio AR (December 2008). "Diagnostic test accuracy and clinical decision making". Ann. Intern. Med. 149 (12): 904–6. PMID 19075211. ^[e]
↑ ^6.0 ^6.1 Cook NR (January 2008). "Comments on 'Evaluating the added predictive ability of a new marker: From area under the ROC curve to reclassification and beyond' by M. J. Pencina et al., Statistics in Medicine (DOI: 10.1002/sim.2929)". Stat Med 27 (2): 191–5. DOI:10.1002/sim.2987. PMID 17671959. Research Blogging.
↑ Hanley JA, McNeil BJ (April 1982). "The meaning and use of the area under a receiver operating characteristic (ROC) curve". Radiology 143 (1): 29–36. PMID 7063747. ^[e]
↑ Walter SD (July 2005). "The partial area under the summary ROC curve". Stat Med 24 (13): 2025–40. DOI:10.1002/sim.2103. PMID 15900606. Research Blogging.
↑ Bangdiwala SI, Haedo AS, Natal ML, Villaveces A (September 2008). "The agreement chart as an alternative to the receiver-operating characteristic curve for diagnostic tests". J Clin Epidemiol 61 (9): 866–74. DOI:10.1016/j.jclinepi.2008.04.002. PMID 18687288. Research Blogging.
↑ Schwarz, G. (1978). Estimating the dimension of a model. Annals of Statistics 6, 461–464. DOI:10.1214/aos/1176344136 Google Scholar
↑ ^11.0 ^11.1 Glas AS, Lijmer JG, Prins MH, Bonsel GJ, Bossuyt PM (November 2003). "The diagnostic odds ratio: a single indicator of test performance". J Clin Epidemiol 56 (11): 1129–35. PMID 14615004. ^[e]
↑ ^12.0 ^12.1 SGIM EBM Task Force and Interest Group (2009). Ask the EBM Expert! - Society of General and Internal Medicine (SGIM). Society of General Internal Medicine.
↑ Connell FA, Koepsell TD (May 1985). "Measures of gain in certainty from a diagnostic test". Am. J. Epidemiol. 121 (5): 744–53. PMID 4014166. ^[e]
↑ Youden WJ (January 1950). "Index for rating diagnostic tests". Cancer 3 (1): 32–5. PMID 15405679. ^[e]
↑ Pepe, Margaret S.; Ziding Feng, Ying Huang, Gary Longton, Ross Prentice, Ian M. Thompson, Yingye Zheng (2008-02-01). "Integrating the Predictiveness of a Marker with Its Performance as a Classifier". Am. J. Epidemiol. 167 (3): 362-368. DOI:10.1093/aje/kwm305. PMID 17982157. Retrieved on 2008-12-17. Research Blogging.
↑ Coulthard MG (May 2007). "Quantifying how tests reduce diagnostic uncertainty". Arch. Dis. Child. 92 (5): 404–8. DOI:10.1136/adc.2006.111633. PMID 17158858. Research Blogging.
↑ ^17.0 ^17.1 ^17.2 Pencina MJ, D'Agostino RB, D'Agostino RB, Vasan RS (January 2008). "Evaluating the added predictive ability of a new marker: from area under the ROC curve to reclassification and beyond". Stat Med 27 (2): 157–72; discussion 207–12. DOI:10.1002/sim.2929. PMID 17569110. Research Blogging.
↑ Greenland S (January 2008). "The need for reorientation toward cost-effective prediction: comments on 'Evaluating the added predictive ability of a new marker: From area under the ROC curve to reclassification and beyond' by M. J. Pencina et al., Statistics in Medicine (DOI: 10.1002/sim.2929)". Stat Med 27 (2): 199–206. DOI:10.1002/sim.2995. PMID 17729377. Research Blogging.
↑ Lijmer JG, Mol BW, Heisterkamp S, et al (September 1999). "Empirical evidence of design-related bias in studies of diagnostic tests". JAMA 282 (11): 1061–6. PMID 10493205. ^[e]

[MeSH_SnSp-1] National Library of Mediicne. Sensitivity and specificity. Retrieved on 2007-12-09.

[MeSH_PV-2] National Library of Mediicne. Predictive value of tests. Retrieved on 2007-12-09.

[pmid7069920-3] Harrell FE, Califf RM, Pryor DB, Lee KL, Rosati RA (May 1982). "Evaluating the yield of medical tests". JAMA 247 (18): 2543–6. PMID 7069920. ^[e]

[pmid19487714-4] 4.0 ^4.1 Cook NR, Ridker PM (June 2009). "Advances in measuring the effect of individual predictors of cardiovascular risk: the role of reclassification measures". Ann. Intern. Med. 150 (11): 795–802. PMID 19487714. ^[e]

[pmid19075211-5] Cornell J, Mulrow CD, Localio AR (December 2008). "Diagnostic test accuracy and clinical decision making". Ann. Intern. Med. 149 (12): 904–6. PMID 19075211. ^[e]

[pmid17671959-6] 6.0 ^6.1 Cook NR (January 2008). "Comments on 'Evaluating the added predictive ability of a new marker: From area under the ROC curve to reclassification and beyond' by M. J. Pencina et al., Statistics in Medicine (DOI: 10.1002/sim.2929)". Stat Med 27 (2): 191–5. DOI:10.1002/sim.2987. PMID 17671959. Research Blogging.

[pmid7063747-7] Hanley JA, McNeil BJ (April 1982). "The meaning and use of the area under a receiver operating characteristic (ROC) curve". Radiology 143 (1): 29–36. PMID 7063747. ^[e]

[pmid15900606-8] Walter SD (July 2005). "The partial area under the summary ROC curve". Stat Med 24 (13): 2025–40. DOI:10.1002/sim.2103. PMID 15900606. Research Blogging.

[pmid18687288-9] Bangdiwala SI, Haedo AS, Natal ML, Villaveces A (September 2008). "The agreement chart as an alternative to the receiver-operating characteristic curve for diagnostic tests". J Clin Epidemiol 61 (9): 866–74. DOI:10.1016/j.jclinepi.2008.04.002. PMID 18687288. Research Blogging.

[10] Schwarz, G. (1978). Estimating the dimension of a model. Annals of Statistics 6, 461–464. DOI:10.1214/aos/1176344136 Google Scholar

[pmid14615004-11] 11.0 ^11.1 Glas AS, Lijmer JG, Prins MH, Bonsel GJ, Bossuyt PM (November 2003). "The diagnostic odds ratio: a single indicator of test performance". J Clin Epidemiol 56 (11): 1129–35. PMID 14615004. ^[e]

[urlAsk_the_EBM_Expert!_-_Society_of_General_and_Internal_Medicine_(SGIM)-12] 12.0 ^12.1 SGIM EBM Task Force and Interest Group (2009). Ask the EBM Expert! - Society of General and Internal Medicine (SGIM). Society of General Internal Medicine.

[pmid4014166-13] Connell FA, Koepsell TD (May 1985). "Measures of gain in certainty from a diagnostic test". Am. J. Epidemiol. 121 (5): 744–53. PMID 4014166. ^[e]

[pmid15405679-14] Youden WJ (January 1950). "Index for rating diagnostic tests". Cancer 3 (1): 32–5. PMID 15405679. ^[e]

[15] Pepe, Margaret S.; Ziding Feng, Ying Huang, Gary Longton, Ross Prentice, Ian M. Thompson, Yingye Zheng (2008-02-01). "Integrating the Predictiveness of a Marker with Its Performance as a Classifier". Am. J. Epidemiol. 167 (3): 362-368. DOI:10.1093/aje/kwm305. PMID 17982157. Retrieved on 2008-12-17. Research Blogging.

[pmid17158858-16] Coulthard MG (May 2007). "Quantifying how tests reduce diagnostic uncertainty". Arch. Dis. Child. 92 (5): 404–8. DOI:10.1136/adc.2006.111633. PMID 17158858. Research Blogging.

[pmid17569110-17] 17.0 ^17.1 ^17.2 Pencina MJ, D'Agostino RB, D'Agostino RB, Vasan RS (January 2008). "Evaluating the added predictive ability of a new marker: from area under the ROC curve to reclassification and beyond". Stat Med 27 (2): 157–72; discussion 207–12. DOI:10.1002/sim.2929. PMID 17569110. Research Blogging.

[pmid17729377-18] Greenland S (January 2008). "The need for reorientation toward cost-effective prediction: comments on 'Evaluating the added predictive ability of a new marker: From area under the ROC curve to reclassification and beyond' by M. J. Pencina et al., Statistics in Medicine (DOI: 10.1002/sim.2929)". Stat Med 27 (2): 199–206. DOI:10.1002/sim.2995. PMID 17729377. Research Blogging.

[pmid10493205-19] Lijmer JG, Mol BW, Heisterkamp S, et al (September 1999). "Empirical evidence of design-related bias in studies of diagnostic tests". JAMA 282 (11): 1061–6. PMID 10493205. ^[e]

[1]

[2]

[3]

[4]

[5]

[6]

[7]

[8]

[9]

[10]

[11]

[12]

[13]

[14]

[15]

[16]

[17]

[18]

[19]

Sensitivity and specificity

Contents

Calculations