Callaghan F, Jackson MT, Demner-Fushman D, Abhyankar S, McDonald C. Analysis of data that has been extracted from free-text using natural language processing: a likelihood model for misclassification with an application to medical informatics. International Conference on Advances in Interdisciplinary Statistics and Combinatorics (AISC2012), Greensboro, NC, October 2012.