Because of a lapse in government funding, the information on this website may not be up to date, transactions submitted via the website may not be processed, and the agency may not be able to respond to inquiries until appropriations are enacted. The NIH Clinical Center (the research hospital of NIH) is open. For more details about its operating status, please visit cc.nih.gov. Updates regarding government operating status and resumption of normal operations can be found at OPM.gov
Ensemble Source Code Analysis
The ensemble spelling correction (by Halil) is used as the baseline for this project. The reviewed status and suggesting plan for the original source code are described as follows:
Original Java Code | Notes | Module | Status | Plan | |
---|---|---|---|---|---|
1 | SpellingPreProcessor.java |
| PreProcessor |
| Rewrite or refactoring |
2 | DictionaryBasedSpellChecker.java |
| Dictionary |
| Rewrite |
3 | SpellingCandidateGenerator.java |
| Candidate |
| Rewrite or refactoring |
4 | CorpusFrequencyCounts.java |
| Ranking |
| Rewrite |
5 | Word2Vector.java |
| Ranking |
| TBD |
6 | SpellCorrectionEvaluator.java |
| Evaluator |
| Rewrite |
7 | diff_match_patch.java |
| Evaluator |
| Rewrite |
8 | SpellCorrection.java | Interface, might not need it | System |
| Remove or redesign |
9 | LinearWeightedEnsembleSpellCorrection.java |
| System |
| Rewrite |
10 | JazzySpellCorrection.java | Use ASpell (Jazzy) to correct text | System |
| Remove |
11 | ESpellCorrection.java | Use ESpell to correct text | System |
| Remove |
where,
Plan Summary:
Performance | Maintenance | Thread Safe | Distributable | Configurable | |
---|---|---|---|---|---|
OO Design | X | X | |||
Coding standard | X | X | |||
Limited comment | X | ||||
static global variable | X | ||||
Algorithm (exhausted) | X | ||||
Package Dependency | X | X | X |