UMLS-Core: Normalized term to CUI Mapping
- Descriptions:
- A simple mapping from normalized term to CUI (concept)
- File: MRXNS_ENG.RRF (fields 2 and 3)
- Examples - Test Cases:
| Input | Output | Notes
|
|---|
| disease's | C0012634 | Genitive
|
| disease(s) | C0012634 | parenthetical plural forms
|
| :disease.&* | C0012634 | Punctuation
|
| of disease | C0012634 | Stop Words
|
| DisEASE | C0012634 | cases
|
| diseases | C0012634 | inflectional variants
|
| :Disease(S).&* | C0012634 | Combination
|
| Colours | C0009393 | spelling variants
|
| Cortical Cystic Diseases | C0431713 | Multi words
|
- Algorithm:
- normalized input term by Lvg Norm
- Mapping
- It is one to many mapping, such as:
- tree trumpet|C1460803
- tree trumpet|C1492035
- Use Java Hashtable
| | key | Values
|
|---|
| Type | String | Vector<String>
|
|---|
| Description | Normalized term | Mapped CUIs
|
|---|
- Removed duplicated key|value (same normTerm|CUI with different LUI and SUI) in MRXNS_ENG.RRF.2011AB
- MRXNS_ENG.RRF.2011AB has 9,537,805 lines. It uses lots of memory. The alternative is to use database
- This is a one to many mapping