Mao S, Kim J, Thoma G.A Dynamic Feature Generation System for Automated Metadata Extraction in Preservation of Digital Materials Proc. International Workshop on Document Image Analysis for Libraries (DIAL2004). 2004 Jan;: 225-32.
Mao S, Kanungo T.Empirical Performance Evaluation Methodology and its Application to Page Segmentation Algorithms IEEE Transactions on Pattern Analysis and Machine Intelligence. 2001 Mar;23(3): 242-256.
Mao S, Kim J, Le DX, Thoma GR.Generating Robust Features for Style-Independent Labeling of Bibliographic Fields in Medical Journal Articles Proc. 7th World Multiconference on Systemics, Cybernetics and Informatics.2003 July;III:53-6.
Misra D, Thoma GR.Use of descriptive metadata as a knowledgebase for analyzing data in large textual collections. Proc. IS&T Archiving 2013. Washington D.C. Proc. IS&T Archiving 2013. Washington D.C. pg 193-199.
Misra D, Hall RH, Payne SM, Thoma GR.Digital preservation and knowledge discovery based on documents from an international health science program. Proc. 12th ACM/IEEE-CS JCDL, pg 23-26 (2012). doi: 10.1145/2232817.2232823.
Mrabet Y, Kilicoglu H, Demner-Fushman D.TextFlow: A Text Similarity Measure based on Continuous Sequences. Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, 2017, Vancouver, Canada, July 30 - August 4, Volume
Rae A, Kim J, Le DX, Thoma GR.Main Content Detection in HTML Journal Articles. DocEng ’18: ACM Symposium on Document Engineering 2018, August 28–31, 2018, Halifax, NS, Canada. ACM, New York, NY, USA, 4 pages. https://doi.org/10.1145/3209280.3229115
Simpson M, Ford G, Antani S, Demner-Fushman D, Thoma GR.A Lightweight Statistics Package for Interactive Publications Poster at 20th NIH Research Festival (TECH-15), September 2007, National Institutes of Health
Szolovits P, Aberdeen J, Meystre S, Kayaalp M.Panel on: State of the Art of Clinical Narrative Report De-Identification and Its Future [Poster]. Proceedings of the Annual American Medical Informatics Association Fall Symposium: 240–242.
Thoma GR, Antani S, Ford GL, Chung M, Vasudevan K.Interactive Publications Research: A Report to the Board of Scientific Counselors September 2005 Technical Report to the LHNCBC Board of Scientific Counselors.
Thoma GR, Ford G, Le DX, Li Z.Text Verification in an Automated System for the Extraction of Bibliographic Data Proc. 5th International Workshop on Document Analysis Systems, Springer-Verlag: Berlin. 2002 Aug;: 423-32.
Xue Z, Rahman M, Antani SK, Long LR, Demner-Fushman D, Thoma GR.Modality Classification for Searching Figures in Biomedical Literature. Proceedings of the IEEE 29th International Symposium on Computer-Based Medical Systems, pp. 152-157, 2016. doi:10.1109/CBMS.2016.29.
Zhang X, Zou J, Le DX, Thoma GR.Investigator Name Recognition From Medical Journal Articles: A Comparative Study of SVM and Structural SVM International Workshop on Document Analysis Systems. June 2010:121-8
Zhang X, Zou J, Le DX, Thoma GR.A Stacked Sequential Learning Method For Investigator Name Recognition From Web-based Medical Articles 17th Document Recognition and Retrieval Conference (SPIE-DR&R). San Jose, CA. January 2010;7534:753404-7
Zhang X, Zou J, Le DX, Thoma GR.A Semi-supervised Learning Method to Classify Grant Support Zone in Web-based Medical Articles Proc SPIE Electronic Imaging Science and Technology, Document Recognition and Retrieval. January 2009;7247:7247 OW(1-8)
Zou J, Antani SK, Thoma GR.Localizing and Recognizing Labels for Multi-Panel Figures in Biomedical Journals. Proceedings of International Conference on Document Analysis and Recognition, November 13, 2017
Zou J, Le DX, Thoma GR.Extracting a Sparsely-Located Named Entity from Online HTML Medical Articles Using Support Vector Machine Proc SPIE-IS/T Electronic Imaging. San Jose, CA. January 2008;6815:6815OP(1-10)