LMW Candidate Post-Processes - Results
The performance results of previous tagged LMW candidate files are in the output file of ${MULTIWORDS}/data/Candidates/DataLog/${YEAR}/${YYYY}_${MM}_${DD}/prevCand.lmw.rpt. The results shown below is a snapshot on the completion of the latest candidate list and is based on the above file report, results might be slightly different over the time due to the updates on Lexicon (when valid words become invalid words and vise versa).
Year | Acronym Expansions | Abbreviation Expansions | ||||
---|---|---|---|---|---|---|
Total | Valid | Invalid | Total | Valid | Invalid | |
2015 | 908 | 881 (97.03%) | 27 (2.97%) | 62 | 40 (64.52%) | 22 (35.48%) |
2016 | 59 | 59 (100.00%) | 0 (0.00%) | 183 | 180 (98.36%) | 3 (1.64%) |
2017 | 39 | 39 (100.00%) | 0 (0.00%) | 22 | 19 (86.36%) | 3 (13.64%) |
2018 | 17 | 16 (94.12%) | 1 (5.88) | 28 | 26 (92.86%) | 2 (7.14%) |
2019 | 151 | 142 (94.04%) | 9 (5.96%) | 13 | 12 (92.31%) | 1 (7.69%) |
Year | Total | Valid | Invalid | |||
2020 | 148 | 112 (75.68%) | 36 (24.32%) | |||
2021 | 158 | 129 (81.65%) | 29 (18.35%) | |||
2022 | 94 | 53 (56.38%) | 41 (43.62%) | |||
2023 | 2 | 2 (100.00%) | 0 (0.00%) | |||
2024 | 2 | 1 (50.00%) | 1 (50.00%) | |||
2025 | 7 | 3 (42.86%) | 4 (57.14%) | |||
Accu. | Total: 1816 | Valid: 1640 (90.31%) | Invalid: 176 (9.69%) |
Year | Total | Valid | Invalid | Notes |
---|---|---|---|---|
2015 | 4994 | 3681(73.71%) | 1313 (26.29%) | |
2016 | 360 | 200 (55.56%) | 160 (44.44%) | |
2017 | 1855 | 1317 (71.00%) | 538 (29.00%) |
|
2018 | 808 | 604 (74.75%) | 204 (25.25%) |
|
2019 | 1081 | 663 (61.33%) | 418 (38.67%) |
|
2020 | 1061 | 787 (74.18%) | 274 (25.82%) |
|
Accu. | 9816 | 7060 (71.92%) | 2756 (28.08%) |
Year | Total | Valid | Invalid | Notes |
---|---|---|---|---|
2018 | 557 | 39 (7.00%) | 518 (93.00%) | 7.00% became valid |
2019 | 2533 | 236 (9.32%) | 2297 (90.68%) | 9.32% became valid |
2020 | 2771 | 58 (2.09%) | 2713 (97.91%) | 2.09% became valid consistent: small percentage. |
Year | Total | Valid | Invalid | Notes |
---|---|---|---|---|
2016 | 6370 | 5725 (89.87%) | 645 (10.13%) |
|
2017 | 1945 | 1764 (90.69%) | 181 (9.31%) |
|
2018 | 819 | 703 (85.84%) | 116 (14.16%) |
|
2019 | 2918 | 2588 (88.69%) | 330 (11.31%) |
|
2020 | 2846 | 2489 (87.46%) | 357 (12.54%) |
|
Accu. | 14898 | 13269 (89.07%) | 1629 (10.93%) |
Year | Total | Valid | Invalid | Notes |
---|---|---|---|---|
2017 | 1034 | 393 (38.01%) | 641 (61.99%) | 38.01% become valid Main reason is some candidates were not tagged |
2018 | 953 | 133 (13.96%) | 820 (86.04%) | 13.96% become valid Clean up |
2019 | 984 | 50 (5.08%) | 934 (94.92%) | 5.08% become valid consistent: small percentage |
2020 | 1291 | 24 (1.86%) | 1267 (98.14%) | 1.86% become valid consistent: small percentage |
Year | Word Count | Total | Valid | Invalid | Accu. P |
---|---|---|---|---|---|
2015 | 1000000 | 3368 | 2397 (71.17%) | 971 (28.83%) | 71.17% |
100000 | 2218 | 1520 (68.53%) | 698 (31.47%) | 70.12% | |
10000 | 895 | 605 (67.60%) | 290 (32.40%) | 69.77% | |
1000 | 588 | 249 (42.35%) | 339 (57.65%) | 67.49% | |
100 | 538 | 119 (22.12%) | 419 (77.88%) | 64.33% | |
Accu. | Accu. | 7602 | 4890 (64.33%) | 2712 (35.67%) | 64.33% |
Models | Total | Valid | Invalid | Notes |
---|---|---|---|---|
zeroD, CUI | 322 | 322 (100.00%) | 0 (0.00%) | WordNetCand.ZD.cui.2021 |
zeroD, no CUI | 626 | 601 (96.01%) | 25 (3.99%) | WordNetCand.ZD.noCui.2021 |
aPairs | 1912 | 1413 (73.90%) | 499 (26.10%) | WordNetCand.AP.2021 |
suffixD | 3654 | 3428 (93.81%) | 226 (6.19%) | WordNetCand.SD.2021 |
Accu. | 6508 | 5758 (88.48%) | 750 (11.52%) |
Date | Total | Valid | Invalid | Notes - completed candList |
---|---|---|---|---|
2018-11-15 | 21955 | 16096 (73.31%) | 5859 (26.69%) | 2.MNSMatcherParAcr, 2017 |
2019-01-03 | 22763 | 16687 (73.31%) | 6076 (26.69%) | 2.MNSMatcherParAcr, 2018 |
2019-07-19 | 24856 | 18915 (76.10%) | 5941 (23.90%) | 1.LexiconAbbAcrExpansion, 2020 |
2019-08-02 | 25675 | 19608 (76.37%) | 6067 (23.63%) | 3.DMNSMatcherCuiEndWord, 2018 |
2019-10-16 | 26756 | 20429 (76.35%) | 6327 (23.65%) | 2.MNSMatcherParAcr, 2019 |
2020-06-12 | 29674 | 23041 (77.65%) | 6633 (22.35%) | 3.DMNSMatcherCuiEndWord, 2019 |
2020-07-17 | 29832 | 23192 (77.74%) | 6640 (22.26%) | 1.LexiconAbbAcrExpansion, 2021 |
2020-08-18 | 30892 | 23999 (77.69%) | 6893 (22.32%) | 2.MNSMatcherParAcr, 2020 |
2021-03-01 | 33737 | 26512 (78.58%) | 7225 (21.42%) | 3.DMNSMatcherCuiEndWord, 2020 |
2021-07-13 | 33831 | 26571 (78.54%) | 7260 (21.46%) | 1.LexiconAbbAcrExpansion, 2022 |
2022-01-10 | 34128 | 26868 (78.73%) | 7260 (21.27%) | 8.WordNetCand.ZD.cui.2021 |
2022-01-10 | 34754 | 27466 (79.03%) | 7288 (20.97%) | 8.WordNetCand.ZD.noCui.2021 |
2022-07-06 | 34756 | 27471 (79.04%) | 7285 (20.96%) | 1.LexiconAbbAcrExpansion, 2023 |
2022-09-27 | 36649 | 28865 (78.76%) | 7784 (21.24%) | 8.WordNetCand.AP.2021 |
2024-07-10 | 40307 | 32298 (80.13%) | 8009 (19.87%) |
8.WordNetCand.SD.2021
1.LexiconAbbAcrExpansion, 2024 & 2025 |