Because of a lapse in government funding, the information on this website may not be up to date, transactions submitted via the website may not be processed, and the agency may not be able to respond to inquiries until appropriations are enacted. The NIH Clinical Center (the research hospital of NIH) is open. For more details about its operating status, please visit cc.nih.gov. Updates regarding government operating status and resumption of normal operations can be found at OPM.gov
Computer-Aided Revision
A set of computer-aided program is developed to validate and revise the reconciled Brat annotation data. They are described follows:
${C_SPELL}/PostProcess
${C_SPELL}/PostProcess/bin
${C_SPELL}/PostProcess/data/Brat/NewTestRevised/
${C_SPELL}/PostProcess/data/Brat/NewTestRevised/brat
${C_SPELL}/PostProcess/bin/PostBratNewTest
2
1
2
Tag | Check Items |
---|---|
ToSplit |
|
ToSplitOnPunct |
|
ToMerge |
|
Misspelling |
|
Informal |
|
RealWord |
|
OutOfVocabulary |
|
WordExists |
|
Punctuation |
|
Garbage |
|
Unknown |
|
From our experience, there are two types of errors that commonly seen in spelling annotation.
3
Check Brat Tags spans - the purpose of this check is to ensure generate gold standard correctly for the cases of contain, multi-tag and overlap for both non-word and real-word