Misra D, Thoma GR.Use of descriptive metadata as a knowledgebase for analyzing data in large textual collections. Proc. IS&T Archiving 2013. Washington D.C. Proc. IS&T Archiving 2013. Washington D.C. pg 193-199.
Misra D, Hall RH, Payne SM, Thoma GR.Digital preservation and knowledge discovery based on documents from an international health science program. Proc. 12th ACM/IEEE-CS JCDL, pg 23-26 (2012). doi: 10.1145/2232817.2232823.
Mrabet Y, Kilicoglu H, Demner-Fushman D.TextFlow: A Text Similarity Measure based on Continuous Sequences. Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, 2017, Vancouver, Canada, July 30 - August 4, Volume
Rae A, Kim J, Le DX, Thoma GR.Main Content Detection in HTML Journal Articles. DocEng ’18: ACM Symposium on Document Engineering 2018, August 28–31, 2018, Halifax, NS, Canada. ACM, New York, NY, USA, 4 pages. https://doi.org/10.1145/3209280.3229115
Xue Z, Rahman M, Antani SK, Long LR, Demner-Fushman D, Thoma GR.Modality Classification for Searching Figures in Biomedical Literature. Proceedings of the IEEE 29th International Symposium on Computer-Based Medical Systems, pp. 152-157, 2016. doi:10.1109/CBMS.2016.29.
Zhang X, Zou J, Le DX, Thoma GR.Investigator Name Recognition From Medical Journal Articles: A Comparative Study of SVM and Structural SVM International Workshop on Document Analysis Systems. June 2010:121-8
Zhang X, Zou J, Le DX, Thoma GR.A Stacked Sequential Learning Method For Investigator Name Recognition From Web-based Medical Articles 17th Document Recognition and Retrieval Conference (SPIE-DR&R). San Jose, CA. January 2010;7534:753404-7
Zhang X, Zou J, Le DX, Thoma GR.A Semi-supervised Learning Method to Classify Grant Support Zone in Web-based Medical Articles Proc SPIE Electronic Imaging Science and Technology, Document Recognition and Retrieval. January 2009;7247:7247 OW(1-8)
Zou J, Antani SK, Thoma GR.Localizing and Recognizing Labels for Multi-Panel Figures in Biomedical Journals. Proceedings of International Conference on Document Analysis and Recognition, November 13, 2017
Zou J, Le DX, Thoma GR.Extracting a Sparsely-Located Named Entity from Online HTML Medical Articles Using Support Vector Machine Proc SPIE-IS/T Electronic Imaging. San Jose, CA. January 2008;6815:6815OP(1-10)