Publication Abstract

Authors: Eide MJ, Tuthill JM, Krajenta RJ, Jacobsen GR, Levine M, Johnson CC

Title: Validation of claims data algorithms to identify nonmelanoma skin cancer.

Journal: J Invest Dermatol 132(8):2005-9

Date: 2012 Aug

Abstract: Health maintenance organization (HMO) administrative databases have been used as sampling frames for ascertaining nonmelanoma skin cancer (NMSC). However, because tumor registry information on these cancers is lacking, these ascertainment methods have not previously been validated. NMSC cases arising in patients served by a staff model medical group and diagnosed between 1 January 2007 and 31 December 2008 were identified from claims data using three ascertainment strategies. These claims data cases were then compared with NMSC cases identified using natural language processing (NLP) of electronic pathology reports (EPRs), and sensitivity, specificity, and positive and negative predictive values were calculated. Comparison of claims data-ascertained cases with NLP-identified cases demonstrated sensitivities ranging from 48 to 65% and specificities from 85 to 98%, with ICD-9-CM ascertainment demonstrating the highest sensitivity but the lowest specificity. HMO health plan claims data had a higher specificity than all-payer claims data. A comparison of EPR and clinic log registry cases showed a sensitivity of 98% and a specificity of 99%. Validation of administrative data to ascertain NMSC demonstrates respectable sensitivity and specificity, although NLP ascertainment was superior. There is a substantial difference between the cases identified by NLP and those identified by claims data, suggesting that formal surveillance efforts should be considered.
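
The four validation measures named in the abstract follow from a 2x2 comparison of a claims algorithm against a reference standard (here assumed to be the NLP-identified cases). The sketch below shows the standard definitions only. The class and function names are illustrative, and the counts in the usage lines are hypothetical placeholders, not data from the study.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ConfusionCounts:
    """2x2 counts for a claims algorithm versus a reference standard."""
    tp: int  # flagged by claims and confirmed by the reference
    fp: int  # flagged by claims but not confirmed by the reference
    fn: int  # missed by claims but present in the reference
    tn: int  # not flagged by claims and absent from the reference


def _ratio(numerator: int, denominator: int) -> float:
    """Return numerator / denominator, or NaN when the denominator is zero."""
    return numerator / denominator if denominator else float("nan")


def validation_metrics(c: ConfusionCounts) -> dict[str, float]:
    """Compute sensitivity, specificity, PPV, and NPV from 2x2 counts."""
    return {
        "sensitivity": _ratio(c.tp, c.tp + c.fn),
        "specificity": _ratio(c.tn, c.tn + c.fp),
        "ppv": _ratio(c.tp, c.tp + c.fp),
        "npv": _ratio(c.tn, c.tn + c.fn),
    }


# Hypothetical counts for illustration only (NOT taken from the study).
example = ConfusionCounts(tp=60, fp=20, fn=40, tn=880)
metrics = validation_metrics(example)
for name, value in metrics.items():
    print(f"{name}: {value:.3f}")
```

Returning NaN for an empty denominator is a design choice for this sketch, so that a strategy with no flagged cases yields an undefined PPV rather than an exception.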

Last Modified: 03 Sep 2013