You are here

Exploring Adjectival Modification in Biomedical Discourse Across Two Genres

Printer-friendly versionPrinter-friendly version
Bodenreider O, Pakhomov S
Proc. of the ACL, 2003 Workshop, Natural Language Processing in Biomedicine. 2003;:105-112.
Abstract: 

To explore the phenomenon of adjectival modification in biomedical discourse across two genres: the biomedical literature and patient records. Methods: Adjectival modifiers are removed from phrases extracted from two corpora (three million noun phrases extracted from MEDLINE, on the one hand, and clinical notes from the Mayo Clinic, on the other). The original phrases, the adjectives extracted, and the resulting demodified phrases are compared across the two corpora after normalization. Quantitative comparisons (frequency of occurrence) are performed on the whole domain. Qualitative comparisons are performed on the two subdomains (disorders and procedures). Results: Although the average number of adjectives per phrase is equivalent in the two corpora (1.4), there are more adjective types in MAYO than in MEDLINE for disorders and procedures. For disorder phrases, the 38% of adjective types common to the two corpora account for 85% of the occurrences. The predominance of adjectives in one corpus is analyzed. Discussion: Potential applications of this approach are discussed, namely terminology acquisition, information retrieval, and genre characterization.

Bodenreider O, Pakhomov S. Exploring Adjectival Modification in Biomedical Discourse Across Two Genres Proc. of the ACL, 2003 Workshop, Natural Language Processing in Biomedicine. 2003;:105-112.