National Cancer Institute Home at the National Institutes of Health |
Please wait while this form is being loaded....
The Applied Research Program Web site is no longer maintained. ARP's former staff have moved to the new Healthcare Delivery Research Program, the Behavioral Research Program, or the Epidemiology & Genomics Research Program, and the content from this Web site is being moved to one of those sites as appropriate. Please update your links and bookmarks!

Publication Abstract

Authors: Eide MJ, Tuthill JM, Krajenta RJ, Jacobsen GR, Levine M, Johnson CC

Title: Validation of claims data algorithms to identify nonmelanoma skin cancer.

Journal: J Invest Dermatol 132(8):2005-9

Date: 2012 Aug

Abstract: Health maintenance organization (HMO) administrative databases have been used as sampling frames for ascertaining nonmelanoma skin cancer (NMSC). However, because of the lack of tumor registry information on these cancers, these ascertainment methods have not been previously validated. NMSC cases arising from patients served by a staff model medical group and diagnosed between 1 January 2007 and 31 December 2008 were identified from claims data using three ascertainment strategies. These claims data cases were then compared with NMSC identified using natural language processing (NLP) of electronic pathology reports (EPRs), and sensitivity, specificity, positive and negative predictive values were calculated. Comparison of claims data-ascertained cases with the NLP demonstrated sensitivities ranging from 48 to 65% and specificities from 85 to 98%, with ICD-9-CM ascertainment demonstrating the highest case sensitivity, although the lowest specificity. HMO health plan claims data had a higher specificity than all-payer claims data. A comparison of EPR and clinic log registry cases showed a sensitivity of 98% and a specificity of 99%. Validation of administrative data to ascertain NMSC demonstrates respectable sensitivity and specificity, although NLP ascertainment was superior. There is a substantial difference in cases identified by NLP compared with claims data, suggesting that formal surveillance efforts should be considered.

Last Modified: 03 Sep 2013