Because of a lapse in government funding, the information on this website may not be up to date, transactions submitted via the website may not be processed, and the agency may not be able to respond to inquiries until appropriations are enacted. The NIH Clinical Center (the research hospital of NIH) is open. For more details about its operating status, please visit cc.nih.gov. Updates regarding government operating status and resumption of normal operations can be found at OPM.gov
Non-word Spelling (1-To-1)
I. Introduction
This page describes the processes for non-word spelling (1-to-1) detection and correction.
II. Processes
NonWordDetector.java
OneToOneCandidates.java
CS_CAN_NW_1TO1_WORD_MAX_LENGTH
)
RankNonWordByMode.java
,
CS_RANKER_NW_S1_RANK_RANGE_FAC
)
CS_NW_1TO1_CONTEXT_RADIUS
)
CS_RANKER_NW_S1_MIN_OSCORE
)
OneToONeCorrector.java
III. Development Test
Id | Source | Original Word | Corrected Word |
---|---|---|---|
TP-1 | 10023 | knoledge | knowledge |
TP-2 | 10040 | truely | truly |
TP-3 | 10475 | diagnost | diagnosed |
TP-4 | 6 | diagnosised | diagnosed |
... | ... | ... | ... |
Id | Source | Original Word | Corrected Word | Correct Word |
---|---|---|---|---|
FP-1 | 10058 | B | be | B |
FP-2 | 10084 | i.e. | ice. | i.e. |
FP-3 | 11144 | clancy | chancy | clumsy |
FP-4 | 11588 | baging | bagging | begging |
... | ... | ... | ... | ... |
Id | Source | Original Word | Corrected Word | Correct Word |
---|---|---|---|---|
FN-1 | 10285 | hitiala | hitiala | hiatal |
FN-2 | 10714 | havy | have | heavy |
FN-3 | 10 | ewings | ewings | ewing's |
FN-4 | 11144 | traumatologo | traumatologo | traumatologist |
FN-5 | 11186 | segmens | segment | segments |