  • Mao S, Kim J, Thoma G. A Dynamic Feature Generation System for Automated Metadata Extraction in Preservation of Digital Materials Proc. International Workshop on Document Image Analysis for Libraries (DIAL2004). 2004 Jan;: 225-32.
  • Chachra S, Ben Abacha A, Shooshan SE, Rodriguez L, Demner-Fushman D. A Hybrid Approach to Generation of Missing Abstracts in Biomedical Literature. Proceedings of COLING 2016, the 26th International Conference on Computational Linguistics: Technical Papers: 1093-1100.
  • Simpson M, Ford G, Antani S, Demner-Fushman D, Thoma GR. A Lightweight Statistics Package for Interactive Publications Poster at 20th NIH Research Festival (TECH-15), September 2007, National Institutes of Health
  • Zhang X, Zou J, Le DX, Thoma GR. A Semi-supervised Learning Method to Classify Grant Support Zone in Web-based Medical Articles Proc SPIE Electronic Imaging Science and Technology, Document Recognition and Retrieval. January 2009;7247:7247 OW(1-8)
  • Zhang X, Zou J, Le DX, Thoma GR. A Stacked Sequential Learning Method For Investigator Name Recognition From Web-based Medical Articles 17th Document Recognition and Retrieval Conference (SPIE-DR&R). San Jose, CA. January 2010;7534:753404-7
  • Kayaalp M, Browne AC, Dodd ZA, Sagan P, McDonald CJ. An Easy-to-Use Clinical Text De-identification Tool for Clinical Scientists: NLM Scrubber [Poster]. Proceedings of the Annual American Medical Informatics Association Fall Symposium: 1522.
  • Demner-Fushman D, Lin J. Answering Clinical Questions with Knowledge-based and Statistical Techniques Computational Linguistics. 2007 Jan;33(1):63-103
  • Lasko TA, Hauser SE. Approximate String Matching Algorithms for Limited-Vocabulary OCR Output Correction Proc. SPIE, Document Recognition and Retrieval VIII. 2001 Jan;4307:232-40.
  • Misra D, Mao S, Rees J, Thoma GR. Archiving a Historic Medico-legal Collection: Automation and Workflow Customization Proc IS&T Archiving 2007. Arlington, Virginia, May 2007; 157-61
  • Kim I, Thoma GR. Automated Classification of Author’s Sentiments in Citation Using Machine Learning Techniques: A Preliminary Study. Proc. the 2015 IEEE Conf. Computational Intelligence in Bioinformatics and Computational Biology (CIBCB 2015), Niagara Falls, Canada, Aug. 12-15, 2015.
  • Kim I, Le DX, Thoma GR. Automated Cleanup Processing for Extracting Bibliographic Data from Biomedical Online Journals In: Callaos N, Lesso W, editors. SCI 2005. Proc. 9th World Multiconference on Systemics, Cybernetics and Informatics; 2005 Jul 10-13; Vol. 4; Orlando (FL): International Institute of Informatics and Systemics; c2005. 401-5
  • Thoma GR, Ford G. Automated Data Entry System: Performance Issues Proc. SPIE: Document Recognition and Retrieval IX. 2002 Jan;4670: 181-90.
  • Le DX, Thoma GR. Automated Document Labeling for Web-Based Online Medical Journals Proc. 7th World Multiconference on Systemics, Cybernetics and Informatics. 2003 July;II: 411-15.
  • Kim I, Le DX, Thoma GR. Automated identification of biomedical article type using support vector machines. Proc. 18th SPIE Document Recognition and Retrieval, 7874:787403 (1-9), San Francisco, January 2011.
  • Kim I, Thoma GR. Automated Identification of Potential Conflict-of-Interest in Biomedical Articles Using Hybrid Deep Neural Network. Proc. 14th Int’l Conf. Machine Learning and Data Mining (MLDM 2018), LNAI 10934, pp. 99-112, Newark, NJ, July 2018.
  • Kim J, Le DX, Thoma GR. Automated Labeling Algorithms for Biomedical Document Images Proc. 7th World Multiconference on Systemics, Cybernetics and Informatics. 2003 July;V: 352-57.
  • Kim J, Le DX, Thoma GR. Automated Labeling in Document Images Proc. SPIE, Document Recognition and Retrieval VIII. 2001 Jan;4307:111-22.
  • Kim J, Le DX, Thoma GR. Automated Labeling of Bibliographic Data Extracted from Biomedical Online Journals Proc. SPIE Electronic Imaging. 2003 Jan;5010: 47-56.
  • Kim J, Le DX, Thoma GR. Automated Labeling Of Biomedical Online Journal Articles In: Callaos N, Lesso W, editors. SCI 2005. Proc 9th World Multiconference on Systemics, Cybernetics and Informatics; 2005 Jul 10-13; Vol. 4; Orlando (FL): International Institute of Informatics and Systemics; c2005. 406-11
  • Le DX, Tran LQ, Chow J, Kim J, Hauser SE, Moon CW, Thoma GR. Automated Medical Citation Records Creation for Web-Based Online Journals Proc. 14th IEEE Symposium on Computer-Based Medical Systems: IEEE Computer Society. 2001.
  • Thoma GR, Mao S, Misra D. Automated Metadata Extraction to Preserve the Digital Contents of Biomedical Collections Proc VIIP 2005. September 2005. Benidorm, Spain; 214-19
  • Kim I, Le DX, Thoma GR. Automated method for extracting "citation sentences" from online biomedical articles using SVM-based text summarization technique. Proc. the 2014 IEEE Int'l Conf. on Systems, Man, and Cybernetics (SMC 2014), pp. 2006-2011, San Diego, October, 2014
  • Hauser SE, Le DX, Thoma GR. Automated Zone Correction in Bitmapped Document Images SPIE: Document Recognition and Retrieval VII. 2000 Jan;3976: 248-58.
  • Kim J, Le DX, Thoma GR. Automatic Extraction of Bibliographic Information from Biomedical Online Journal Articles Using a String Matching Algorithm Proc IEEE CBMS, June 2006, Salt Lake City, Utah; 905-10
  • Mao S, Kanungo T. Automatic Training of Page Segmentation Algorithms: An Optimization Approach International Conference on Pattern Recognition. 2000 Sept.;:531-534.
  • Le DX, Thoma GR. Automatically Creating Biomedical Bibliographic Records from Printed Volumes of Old Indexes In: Callaos N, Lesso W, editors. SCI 2005. Proc 9th World Multiconference on Systemics, Cybernetics and Informatics; 2005 Jul 10-13; Vol. 3, Computer Science and Engineering. Orlando (FL): International Institute of Informatics and Systemics; c2005. 267-74
  • Demner-Fushman D, Few B, Hauser SE, Thoma GR. Automatically Identifying Health Outcome Information in MEDLINE Records J Am Med Inform Assoc. 2006 Jan-Feb;13(1):52-60. Epub 2005 Oct 12.
  • Pearson G, Moon CW. Bridging Two Biomedical Journal Databases with XML - A Case Study. Proc. 14th IEEE Symposium on Computer-Based Medical Systems: IEEE Computer Society. 2001 Jul;:309-14.
  • Zou J, Le DX, Thoma GR. Combining DOM Tree and Geometric Layout Analysis for Online Medical Journal Article Segmentation Proc JCDL, June 2006, Chapel Hill, NC; 119-28
  • Kim J, Le DX, Thoma GR. Combining SVM Classifiers to Identify Investigator Name Zones in Biomedical Articles. IS&T/SPIE’s 22nd Annual Symposium on Electronic Imaging. San Francisco, CA, January 2012; 8297.
  • Hauser SE, Schlaifer J, Sabir TF, Demner-Fushman D, Thoma GR. Correcting OCR Text by Association with Historic Datasets Proc. SPIE Electronic Imaging. 2003 Jan;5010: 84-93.
  • Bennett A, Liu J, Van Ryk D, Bliss D, Arthos J, Henderson RM, Subramaniam S. Cryoelectron Tomographic Analysis of an HIV-neutralizing Protein and Its Complex with Native Viral gp120 J Biol Chem. 2007 Sep 21;282(38):27754-9. Epub 2007 Jun 28
  • Thoma GR, Mao S, Misra D, Rees J. Design of a Digital Library for Early 20th Century Medico-legal Documents Proc ECDL 2006. Eds: Gonzalo J et al. Berlin: Springer-Verlag; LNCS 4172: 147-57
  • Mao S, Misra D, Seamans J, Thoma GR. Design Strategies for a Prototype Electronic Preservation System for Biomedical Documents IS&T Archiving 2005 Conference, April 2005; 48-53.
  • Misra D, Hall RH, Payne SM, Thoma GR. Digital preservation and knowledge discovery based on documents from an international health science program. Proc. 12th ACM/IEEE-CS JCDL, pg 23-26 (2012). doi: 10.1145/2232817.2232823.
  • Mao S, Rosenfeld A, Kanungo T. Document Structure Analysis Algorithms: A Literature Survey Proc. SPIE Electronic Imaging. 2003 Jan;5010:197-207.
  • Chen S, Misra D, Thoma GR. Efficient Automatic OCR Word Validation Using Word Partial Format Derivation and Language Model Document Recognition and Retrieval XVII. Proceedings of the SPIE. San Jose, CA. January 2010;7534:75340O-75340O-8
  • Mao S, Kanungo T. Empirical Performance Evaluation Methodology and its Application to Page Segmentation Algorithms IEEE Transactions on Pattern Analysis and Machine Intelligence. 2001 Mar;23(3): 242-256.
  • Mao S, Kanungo T. Empirical Performance Evaluation of Page Segmentation Algorithms SPIE conference on Document Recognition and Retrieval. 2000 Jan.;:303-314.
  • Ide NC, Loane RF, Demner-Fushman D. Essie: A Concept-based Search Engine for Structured Biomedical Text J Am Med Inform Assoc. 2007 May-Jun;14(3):253-63. Epub 2007 Feb 28
  • Zou J, Le DX, Thoma GR. Extracting a Sparsely-Located Named Entity from Online HTML Medical Articles Using Support Vector Machine Proc SPIE-IS/T Electronic Imaging. San Jose, CA. January 2008;6815:6815OP(1-10)
  • Mao S, Kim J, Le DX, Thoma GR. Generating Robust Features for Style-Independent Labeling of Bibliographic Fields in Medical Journal Articles Proc. 7th World Multiconference on Systemics, Cybernetics and Informatics. 2003 July;III:53-6.
  • Lin J, Karakos D, Demner-Fushman D, Khudanpur S. Generative Content Models for Structural Analysis of Medical Abstracts Proc 2006 BioNLP'06. June 2006, New York City, New York
  • Le DX, Straughan SR, Thoma GR. Greek Alphabet Recognition Technique for Biomedical Documents Proc. 6th World Multiconference on Systemics, Cybernetics and Informatics, eds: Callaos N, et al. 2002 July;III: 86-91.
  • Ford G, Thoma GR. Ground Truth Data for Document Image Analysis Proceedings of 2003 Symposium on Document Image Understanding and Technology. 2003 April 9-11;: 199-205.
  • Sabir TF, Hauser SE, Thoma GR. Historical Author Affiliations Assist Verification of Automatically Generated MEDLINE Citations AMIA Annu Symp Proc. 2006:1082
  • Kim IC, Le DX, Thoma GR. Hybrid approach combining contextual and statistical information for identifying MEDLINE citation terms. Proc. SPIE-IS/T Electronic Imaging. San Jose, CA. January 2008;6815:68150P(1-9)
  • Kim IC, Le DX, Thoma GR. Identification of "comment-on sentences" in online biomedical documents using support vector machines. Proc. SPIE conference on Document Recognition and Retrieval, 6500:65000O (1-8), San Jose, January 2007.
  • Kim J, Le DX, Thoma GR. Identification of Investigator Name Zones Using SVM Classifiers and Heuristic Rules. 12th International Conference on Document Analysis and Recognition (ICDAR). Washington D.C., August 2013.