Consumer Data (From Dina)
I. Introduction
This page describes the consumer data used in the baseline dictionary. There are four files in this data set, listed with their counts in the table under III. Analysis.
II. Algorithm
The above 4 files are generated from UMLS (2013AB?) by the following steps:
III. Analysis
File Name | Semantic Types | Terms | Not UMLS (No CUI) |
---|---|---|---|
umls_anatomy_merged.txt | 9 | 295,932 | 0 |
umls_interventions_merged.txt | 65 | 528,668 | expo: 5,457 |
umls_population_merged.txt | 4 | 5,898 | 0 |
umls_problem_merged.txt | 68 | 644,839 | prob: 1,643, (from Gopher Terms) |
Total Terms | 147 | 1,475,204 | all.txt.1 |
Total Unique Terms | 97 | 1,469,339 | all.txt.1.uSort |
Total Tokens | N/A | 299,669 | medDic.data |
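The three summary rows above (total terms, total unique terms, total tokens) can be illustrated with a minimal sketch. It assumes, without confirmation from this page, that each line of the merged file is pipe-delimited with the term in the first field, that "unique terms" means exact-duplicate removal, and that "tokens" means distinct whitespace-separated words. The function name `summarize` and the duplicated sample line are hypothetical.

```python
def summarize(lines):
    """Return (total_terms, unique_terms, unique_tokens).

    Assumption: the term is the first pipe-delimited field of each line.
    """
    # Take the first field of every non-empty line as the term.
    terms = [line.split("|", 1)[0].strip() for line in lines if line.strip()]
    # Exact-match de-duplication (assumed meaning of "unique terms").
    unique_terms = set(terms)
    # Distinct whitespace-separated words (assumed meaning of "tokens").
    tokens = {tok for term in unique_terms for tok in term.split()}
    return len(terms), len(unique_terms), len(tokens)


sample = [
    "liver is small|C0577047|small liver|fndg",
    "spleen is enlarged|C0038002|Splenomegaly|fndg",
    "liver is small|C0577047|small liver|fndg",  # hypothetical duplicate
]
total, unique, tokens = summarize(sample)
print(total, unique, tokens)
```

The actual pipeline may normalize case or punctuation before counting; this sketch does neither.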
IV. Others
ST abb | Source File (term no) |
---|---|
alga | |
invt | |
rich | |
V. Other Resources
Other resources are merged into the above 4 files:
The above two files are used as the source for the interventions and problem lists:
liver is small|C0577047|small liver|fndg
spleen is enlarged|C0038002|Splenomegaly|fndg
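The two sample lines above can be read with a short parser. This is a sketch only: the field meanings (term, CUI, name, semantic type abbreviation) are inferred from the samples and are an assumption, and the names `TermRecord` and `parse_line` are hypothetical.

```python
from typing import NamedTuple


class TermRecord(NamedTuple):
    # Field names are assumptions inferred from the sample lines.
    term: str       # consumer term as it appears in the file
    cui: str        # UMLS concept identifier
    name: str       # concept name associated with the CUI
    sem_type: str   # semantic type abbreviation, e.g. "fndg"


def parse_line(line: str) -> TermRecord:
    """Split one pipe-delimited line into its four fields."""
    fields = line.rstrip("\n").split("|")
    if len(fields) != 4:
        raise ValueError(f"expected 4 fields, got {len(fields)}: {line!r}")
    return TermRecord(*fields)


sample = [
    "liver is small|C0577047|small liver|fndg",
    "spleen is enlarged|C0038002|Splenomegaly|fndg",
]
records = [parse_line(line) for line in sample]
for rec in records:
    print(rec.term, "->", rec.cui, rec.sem_type)
```

Rejecting lines that do not have exactly four fields makes malformed entries visible early; terms without a CUI would need a separate rule that this page does not specify.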
File Name | Semantic Types | Terms | Not UMLS (No CUI) |
---|---|---|---|
interventions.txt (PICO) | 76 | 30,492 | expo: 6,344 |
umls_problem_list.txt (UMLS) | 71 | 254,420 | prob: 1,792 |