Wikipedia:Typo Team/moss: Difference between revisions
Sun Creator (talk | contribs) →Notification of new dumps: Remove self. Other priorities for now. |
|||
Line 1,688: | Line 1,688: | ||
* (add your username to this list) |
* (add your username to this list) |
||
* [[User: Jake The Great 908|<span style="color:blue;text-shadow:2px 2px 3px rgba(17,189,172,1)">Jake The Great!]][[User talk:Jake The Great 908|📞talk!]]</span> 01:40, 18 December 2019 (UTC) |
* [[User: Jake The Great 908|<span style="color:blue;text-shadow:2px 2px 3px rgba(17,189,172,1)">Jake The Great!]][[User talk:Jake The Great 908|📞talk!]]</span> 01:40, 18 December 2019 (UTC) |
||
* [[User:Sun Creator|Sun Creator]]<sup>([[User talk:Sun Creator|talk]])</sup> 22:21, 19 November 2019 (UTC) |
|||
*[[User:Puddleglum2.0|Puddleglum2.0]] ([[User talk:Puddleglum2.0|talk]]) 20:31, 13 October 2019 (UTC) |
*[[User:Puddleglum2.0|Puddleglum2.0]] ([[User talk:Puddleglum2.0|talk]]) 20:31, 13 October 2019 (UTC) |
||
*[[User:Schazjmd|Schazjmd]] ([[User talk:Schazjmd|talk]]) 18:25, 21 December 2018 (UTC) |
*[[User:Schazjmd|Schazjmd]] ([[User talk:Schazjmd|talk]]) 18:25, 21 December 2018 (UTC) |
Revision as of 08:00, 3 August 2021
This page has a backlog that requires the attention of willing editors. Please remove this notice when the backlog is cleared. |
The moss project seeks to find and remove the furry green typos that have been growing on Wikipedia articles. It uses software written by User:Beland to automatically find misspellings, mistakes in English grammar, violations of the Wikipedia:Manual of Style, and confusing or broken wiki markup.
QUICK LINK TO THE BEST PAGE FOR NEW PARTICIPANTS
About misspellings
How the lists are made
The moss spell checker is run against a recent set of database dumps, which are generated on the 1st and 20th of every month (but take a few days to process). All the articles in the English Wikipedia are examined. The following are ignored:
- Text inside references, templates, tables, quotation marks, sections like "External links" and "Works", and some other weird places.
- Capitalized words (which are presumed to be correctly-spelled proper nouns)
- Words that appear in titles in the English Wiktionary (which has definitions of all words in all languages, excluding proper nouns and systematic words like chemical names and large numbers)
- Words that appear in titles in the English Wikipedia (which explains some things that don't appear in the dictionary)
- Words that appear in titles in the Wikispecies (which has many technical words that don't appear in the dictionary or encyclopedia)
Many mistakes are not (yet) caught:
- Improper addition of 's (possessives are not added to Wiktionary, so these are excluded systematically)
- Incorrect capitalization
- Incorrect multi-word phrases
- Wrong word used in context
- Non-English language words not tagged with {{lang}} or where an English misspelling happens to be the same as a word in another language. (These are counted as correct spellings if they are in the English Wiktionary, which lists words in all languages – only the definitions are restricted to English.)
- Other situations listed in #False negatives below
2020 statistics
- See also: Older statistics
In the year from March 2019 to March 2020, moss volunteers fixed over 94,000 typos! The most impressive progress is in the T1 category (single-letter misspellings), where we eliminated about half from the English Wikipedia. During this period we also started fixing missing spaces (focusing on those around punctuation) and those have dropped by about one-fifth. As we make progress, clear misspellings are increasingly mixed in with unclear cases; I'll be doing some more work on separation algorithms to keep the typo reports useful, so you'll probably see some more changes to typo classifications. Thanks to everyone who has been helping out! -- Beland (talk) 16:54, 28 April 2020 (UTC)
Reporting symbol | Explanation | Change from 2019-03-01 to 2020-02-20 | Instances, 2020-04-01 dump (9f6d726) | Instances, 2020-04-20 dump (5ff589d) | Instances, 2020-05-01 dump (1a96ded) | Instances, 2020-05-20 dump (e511f74) | Instances, 2020-06-01 dump (509f79a) | Instances, 2020-06-20 dump (825ceb4) | Instances, 2020-07-01 dump (db9db23) | Instances, 2020-07-20 dump (caa619f) | Instances, 2020-08-01 dump (cf76e8c) | Instances, 2020-08-20 dump (f104e58) | Instances, 2020-09-01 dump (4654d88) | Instances, 2020-09-20 dump (a26ccca) | Instances, 2020-10-01 dump (686f5db) | Instances, 2020-10-20 dump (4f90810) | Instances, 2020-11-01 dump (ac54580) | Instances, 2020-11-20 dump (6dbd61d) | Instances, 2020-12-01 dump (917bcc8) | Instances, 2020-12-20 dump (0b3409d) |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
TS | Missing or extra whitespace or dash (or new compound) | -39368 (-21%) | 145297 | 144673 | 331658** | 330624 | 328249 | 325399 | 324179 | 322282 | 321801 | 318621 | 317183 | 315825 | 314747 | 312110 | 310537 | 309386 | 308280 | 308977 |
T1 | Edit distance 1 from common English word | -36192 (-48%) | 41090 | 41081 | 39967 | 39452 | 38783 | 38379 | 38436 | 38271 | 37803 | 36783 | 35976 | 34036 | 33539 | 33764 | 32347 | 33097 | 33559 | 33427 |
T2 | Edit distance 2 from common English word | -7560 (-10%) | 64526 | 63263 | 60690 | 60321 | 59589 | 58603 | 58649 | 58521 | 58200 | 58085 | 57845 | 57329 | 57152 | 57487 | 57387 | 57511 | 57386 | 57348 |
T3 | Edit distance 3 from common English word | -5276 (-7%) | 74396 | 73255 | 70516 | 70039 | 68887 | 68192 | 68149 | 68020 | 67769 | 67788 | 67482 | 67226 | 67025 | 67101 | 67002 | 67213 | 67298 | 67399 |
R | Regular word (A-Z only) not near a common English word | -3525 (-3%) | 97726 | 96916 | 94793 | 93855 | 93252 | 91537 | 91489 | 91746 | 91521 | 91729 | 91513 | 91613 | 91339 | 91813 | 92329 | 93246 | 93377 | 93493 |
I | Definitely not English (International) due to accents or mixed with punctuation (other than hyphen) | -22196 (-24%) | 72151 | 69118 | 65842 | 64827 | 63630 | 61844 | 61888 | 61782 | 61899 | 62113 | 61916 | 62003 | 62049 | 62274 | 62287 | 62390 | 62234 | 62471 |
W | Not in English Wiktionary, in non-English Wiktionary | -6764 (-8%) | 75913 | 74351 | 86935 | 85604 | 83173 | 81894 | 81946 | 82173 | 81943 | 82170 | 81912 | 81968 | 81792 | 81256 | 81052 | 81224 | 81131 | 81192 |
L | Probable Romanization (transLiteration) | +81 (+2%) | 4435 | 4486 | 4266 | 4199 | 4120 | 4122 | 4104 | 4113 | 4137 | 4140 | 4151 | 4164 | 4165 | 4207 | 4203 | 4234 | 4240 | 4260 |
ME | Probable coMpound, English (with and without dash) | +976 (+2%) | 52269 | 48761 | 47187 | 47153 | 46830 | 46856 | 46967 | 47163 | 47052 | 47170 | 47009 | 47070 | 47066 | 47045 | 47023 | 47193 | 47142 | 47302 |
MI | Probable coMpound, non-English (International) in English Wiktionary (both A-Z and non-ASCII characters, with and without dash) | -18475 (-9%) | 177646 | 176929 | 171484 | 169592 | 166216 | 164828 | 165140 | 165351 | 165605 | 166016 | 166208 | 166499 | 166572 | 167349 | 167961 | 169044 | 168953 | 169409 |
MW | Probable coMpound, found in non-English Wiktionary | -5544 (-11%) | 46113 | 45103 | 43501 | 42931 | 40436 | 41383 | 41325 | 41440 | 41173 | 41234 | 40990 | 40956 | 40795 | 40353 | 40272 | 40454 | 40411 | 40338 |
ML | Probable coMpound, transLiteration | -124 (-3%) | 3909 | 3874 | 3707 | 3663 | 3672 | 3575 | 3589 | 3593 | 3628 | 3639 | 3658 | 3717 | 3724 | 3779 | 3769 | 3825 | 3830 | 3822 |
C | Chemistry words | -176 (-9%) | 1782 | 7564 | 7530 | 7644 | 7640 | 7655 | 7658 | 7659 | 7660 | 7662 | 7654 | 7644 | 7659 | 7661 | 7665 | 7659 | 7674 | 7700 |
N | A-Z plus numbers and hyphens | -1391 (-5%) | 25209 | 23813 | 22650 | 22511 | 22290 | 22020 | 22052 | 22053 | 21971 | 22009 | 21960 | 21923 | 21879 | 21856 | 21885 | 21898 | 21893 | 21943 |
Z | Decimal fraction missing leading Zero | - | 47* | 0* | 11405** | 11418 | 11414 | 11398 | 11402 | 11421 | 11455 | 11530 | 11546 | 11578 | 11598 | 11669 | 11683 | 11703 | 11728 | 11762 |
P | Patterns (e.g. rhyme schemes) | -20 (-43%) | 27 | 28 | 7 | 9 | 7 | 7 | 3 | 2 | 2 | 4 | 5 | 4 | 5 | 5 | 4 | 5 | 5 | 5 |
H | HTML/XML/SGML tag | -539 (-15%) | 3010 | 2886 | 2938 | 2903 | 2904 | 2848 | 2693 | 2697 | 2680 | 2747 | 2757 | 2729 | 2565 | 2569 | 2542 | 2538 | 2540 | 2572 |
HB | Known bad HTML tag, like <font> | -1080 (-7%) | 14465 | 14121 | 12903 | 13928 | 12919 | 14733 | 14022 | 11428 | 11670 | 11198 | 10191 | 8860 | 8756 | 8842 | 9725 | 11088 | 10164 | 10556 |
HL | Bad HTML-like linking, like <http://...> | -98 (-19%) | 414 | 418 | 377 | 394 | 394 | 421 | 408 | 425 | 420 | 413 | 373 | 359 | 356 | 329 | 324 | 315 | 318 | 328 |
U | URL | -94 (-7%, from 2019-03-20) | 1179 | 1152 | 1118 | 1134 | 1117 | 1122 | 1129 | 1124 | 1120 | 1124 | 1124 | 1103 | 1101 | 1099 | 1091 | 1096 | 1050 | 1055 |
BC | Bad characters | -12678 (-6%, from 2019-09-01) | 192230 | 190482 | 186651 | 186517 | 185572 | 178698 | 175325 | 166116 | 159095 | 124158 | 112959 | 112755 | 112695 | 112633 | 112479 | 110608 | 110025 | 109808 |
BW | Bad words | -6542 (-5%, from 2019-09-20) | 113682 | 106327 | 381288** | 380259 | 378710 | 374982 | 375107 | 375206 | 375431 | 375306 | 374622 | 374740 | 374560 | 375010 | 375008 | 375557 | 374989 | 375663 |
Total | -39115 (-3%, from 2019-09-20) | 1207516 instances | 1188601 instances | 1647413** instances | 1638977 instances | 1619804 instances | 1600496 instances | 1595660 instances | 1582586 instances | 1574035 instances | 1535639 instances | 1519034 instances | 1514101 instances | 1511139 instances | 1510211 instances | 1508575 instances | 1511284 instances | 1508227 instances | 1510830 instances | |
Parse failure | Mismatched punctuation | -5145 (-3%) | 154084 articles + 40705 MOS:STRAIGHT violations | 153033 articles + 40838 MOS:STRAIGHT violations | 214365 articles + 37697 MOS:STRAIGHT violations | 214463 articles + 37667 MOS:STRAIGHT violations | 214101 articles + 37607 MOS:STRAIGHT violations | 214465 articles + 37767 MOS:STRAIGHT violations | 214732 articles + 37849 MOS:STRAIGHT violations | 215081 articles + 37993 MOS:STRAIGHT violations | 215447 articles + 38067 MOS:STRAIGHT violations | 215915 articles + 38169 MOS:STRAIGHT violations | 216227 articles + 38210 MOS:STRAIGHT violations | 216472 articles + 38205 MOS:STRAIGHT violations | 216738 articles + 38213 MOS:STRAIGHT violations | 216991 articles + 38246 MOS:STRAIGHT violations | 217192 articles + 38338 MOS:STRAIGHT violations | 217660 articles + 38498 MOS:STRAIGHT violations | 217861 articles + 38625 MOS:STRAIGHT violations | 218207 articles + 38789 MOS:STRAIGHT violations |
- red = Probably need to fix
- yellow = Unsorted
- blue = Probably OK (but may need to verify)
- bold = actively working on fixing
* Identification of Z was broken
** Affected by major bug fix for counting inter-word typos (e.g. involving punctuation)
2021 statistics
Dump (moss version) | Parse failures (articles + articles with MOS:STRAIGHT violations) | TOTAL (instances) | BC | BW | C | H | HB | HL | I | L | ME | MI | ML | MW | N | P | R | T1 | T2 | T3 | TS | U | W | Z |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
2021-01-01 (b4af24a) | 218317 + 38841 | 1505808 | 108661 | 375875 | 7705 | 2550 | 10726 | 311 | 62583 | 4262 | 47274 | 169504 | 3841 | 40131 | 21954 | 4 | 93373 | 32968 | 56903 | 66819 | 306445 | 1054 | 81112 | 11753 |
2021-01-20 (a249b2d) | 218455 + 38930 | 1506940 | 108030 | 376079 | 7679 | 2616 | 11036 | 298 | 62746 | 4298 | 47044 | 170234 | 3885 | 39960 | 21959 | 4 | 93467 | 33598 | 56688 | 66688 | 306776 | 1042 | 81049 | 11764 |
2021-02-01 (8279235) | 218833 + 38960 | 1506004 | 107000 | 375979 | 7677 | 2595 | 11729 | 298 | 62829 | 4305 | 47053 | 171005 | 3888 | 39771 | 21971 | 2 | 93726 | 33237 | 56822 | 66707 | 305573 | 1035 | 81079 | 11723 |
2021-02-20 (2f00c51) | 218991 + 39035 | 1504064 | 106534 | 375909 | 7682 | 2602 | 11697 | 275 | 62942 | 4342 | 47036 | 171313 | 3897 | 39732 | 22009 | 3 | 93959 | 32705 | 56529 | 66617 | 304463 | 1020 | 81041 | 11757 |
2021-03-01 (248159a) | 219198 + 39155 | 1494162 | 106421 | 376305 | 7669 | 2624 | 9291 | 281 | 62978 | 4328 | 46830 | 169666 | 3876 | 39189 | 21936 | 4 | 92221 | 32762 | 56197 | 66069 | 302377 | 1020 | 80338 | 11780 |
2021-03-20 (57aaae7) | 219556 + 39371 | 1492923 | 106284 | 375853 | 7695 | 2610 | 9965 | 278 | 63055 | 4331 | 47064 | 170453 | 3880 | 39172 | 21998 | 2 | 92721 | 32523 | 56052 | 66087 | 299751 | 1002 | 80305 | 11842 |
2021-04-01 (d47c725) | 219692 + 39478 | 1484879 | 105670 | 375757 | 7697 | 2620 | 8857 | 205 | 62842 | 4309 | 46966 | 170369 | 3884 | 38886 | 21964 | 0 | 92575 | 32160 | 55810 | 65706 | 296009 | 995 | 79736 | 11862 |
2021-04-20 (d169566) | 220014 + 39634 | 1476477 | 104505 | 374548 | 7686 | 2648 | 8863 | 199 | 62668 | 4327 | 47036 | 170547 | 3878 | 38644 | 21973 | 4 | 92336 | 30560 | 55284 | 65191 | 293170 | 985 | 79487 | 11938 |
2021-05-01 (7719363) | 219292 + 39601 | 1445819 | 103253 | 367236 | 7661 | 2387 | 7682 | 178 | 59749 | 3966 | 44397 | 165787 | 3774 | 38591 | 21697 | 4 | 91448 | 30666 | 56556 | 65257 | 283967 | 980 | 78634 | 11949 |
2021-05-20 (c6359fc) | 219284 + 39761 | 1444570 | 102794 | 368258 | 7678 | 2271 | 7878 | 176 | 59913 | 3978 | 44514 | 166538 | 3804 | 38629 | 21725 | 4 | 91887 | 29205 | 56341 | 65171 | 282093 | 983 | 78651 | 12079 |
2021-06-01 (076f14c) | 219111 + 39759 | 1441769 | 102409 | 368046 | 7689 | 2275 | 7827 | 166 | 59876 | 3943 | 44658 | 166622 | 3818 | 38567 | 21755 | 5 | 92077 | 28507 | 56157 | 64919 | 280645 | 975 | 78682 | 12151 |
2021-06-20 (ffbc72f) | 219625 + 39935 | 1435330 | 101926 | 367522 | 7694 | 2276 | 7108 | 162 | 59650 | 3964 | 44692 | 167038 | 3819 | 38298 | 21687 | 8 | 92365 | 28020 | 55983 | 64688 | 276538 | 955 | 78621 | 12316 |
2021-07-01 (cb3d5e8) | 219791 + 39990 | 1433415 | 101916 | 367581 | 7704 | 2263 | 6921 | 169 | 59663 | 3960 | 44770 | 167508 | 3837 | 38299 | 21674 | 8 | 92600 | 27369 | 55755 | 64301 | 275024 | 946 | 78720 | 12427 |
2021-07-20 (5c3b9e9) | 220086 + 40132 | 1429627 | 101518 | 367954 | 7688 | 2136 | 6702 | 137 | 59995 | 3955 | 44805 | 167818 | 3824 | 38179 | 21646 | 7 | 92660 | 26469 | 55565 | 64171 | 272147 | 950 | 78624 | 12677 |
Instructions for editors
Just like a regular spell checker, sometimes a word that's highlighted is really a misspelling and should be changed, but sometimes it is a correct spelling that needs to be added to the spell checker's dictionary (which in this case is the English Wiktionary and Wikispecies). For the below lists, here's how you can help:
- For spelling mistakes: Click on the links to the individual Wikipedia articles, and edit them to correct the misspelling. Make sure this is actually a misspelling, and not a technical term that needs to be better explained, or an alternate spelling (possibly from a different regional variety of English).
- For non-English words (including words from Old English and Middle English, since they are pronounced differently): Edit the article and use the {{lang}} or {{transl}} templates to mark all non-English passages. Template contents are ignored, so they will not show up in the next report. If you can define the word, it would still be helpful to add the non-English word to the English Wiktionary or the same-language Wiktionary if you speak that language. As of the March 20, 2019 dump, only words not found in any Wiktionary are reported by moss as misspellings. (The "home" Wiktionary for Old and Middle English words is the modern English one.)
- If you don't know which language is being used, you can tag it with {{which lang}}. If you add a "reason=" parameter, that will change the pop-up tooltip text readers will see when they hover over "what language is this?". If you have a guess as to which language it might be, or any other question or comment, you can leave that here to help future editors. If you use this tag, you can delete the article from the moss listing; the article will be added to Category:Articles with unidentified words instead, and ignored by future runs of moss until the mystery is solved.
- For languages that don't have a code (often happens with historical languages), use "mis" and add an HTML comment indicating the language. For example: {{lang|mis|sharbe do kin ratz}}<!-- Old Runish -->
- For incorrect spellings in direct quotes:
- These shouldn't be picked up by the spell checker, as text in double quotes "" is ignored. The article probably has incorrect punctuation.
- Regardless of punctuation problems, you can add {{sic}} around the word or phrase. See Wikipedia:Manual of Style#Quotations for guidance.
- For correct spellings that belong in the dictionary: Click on the word to add it to the English Wiktionary. Remember the word might not be English (though the definition must be) and be sure to check capitalization!
- For correct spellings already in the dictionary: Delete from the list or strike through; these have been added in the meantime since the database dump by other editors. They do not automatically turn red as internal Wikipedia links do.
- For correct spellings not appropriate for Wiktionary:
- For complicated chemical names:
- If there is an article about this chemical, it's best to make a redirect. You may want to tag it {{R from systematic name}} or {{R from technical name}} if appropriate.
- If there is no Wikipedia article, you can either {{chem name}}; for example:
- For complicated chemical names:
- {{chem name|poly(1-phenylethene)}}
- This should not be used for chemical formulas like H
2O, for which {{chem}} and {{chem2}} are appropriate.
- For DNA sequences, add {{DNA sequence}} around it.
- For species, add the whole name to Wikispecies:Wikispecies:Requested articles#From_Wikipedia and it will be suppressed from future runs.
- For proper nouns and (including non-English titles) that aren't capitalized, put inside a {{proper name}} tag.
- Use <code></code> or similar tags for computer programs; see Wikipedia:WikiProject_Computer_science/Manual_of_style#Code_samples.
- For terms that are only relevant to one Wikipedia article (and for which the article makes clear the definition) consider creating a redirect to the article. As long as the "typo" word is in the title (as a whole word), it won't show up as a mistake in future spell checks.
- {{IPA}} or {{respell}} can be used for word pronunciations. See Wikipedia:Manual of Style/Pronunciation for details.
- For bird calls: Treat these as foreign-language words or words-as-words and put them in italics, following MOS:ITALICS. Put the call inside {{not a typo}} so it won't show up on moss spell check reports. (It doesn't matter if the double apostrophes that make the italics go inside or outside the template.)
- Anything else, add {{not a typo}} around it (for example, nonsense series of letters used as examples in puzzles).
- Correct or incorrect, when finished delete or
strike outthe entry for the word from the lists on this page (or subpages), so work won't be duplicated. It is preferred to delete the entry for sections that rotate through specific letters, and strikethrough for sections where the whole thing gets updated (to prevent duplicating work done while the dumps were being processed, which can take more than a week). - If an article or section has generally bad grammar, and you don't have time to fix the whole thing, just add {{copyedit}} at the top of the article or {{copyedit|section}} at the top of the affected section. If it's just a sentence or two, {{copy edit inline}} or {{incomprehensible inline}} can go at the end of the problem passage.
- If you see errors being reported from footnotes or bibliographies, check to make sure the section is titled with a standard name following MOS:APPENDIX conventions. Standard end-matter sections like "References" and "Further reading" and "Works" are ignored.
- If it helps to leave a message on the article's talk page asking if the word is correct or incorrect, you can use Template:Typo help like this when editing the bottom of the talk page (leave the section header blank; it will automatically be added):
- {{subst:typo help|PUT WORD HERE}} -- ~~~~
- If you are uncertain whether a word is spelt correctly or not, you can add {{typo help inline}} immediately after it. If you add a "reason=" parameter, that will change the pop-up tooltip text readers will see when they hover over "check spelling". You can add a specific question or comment that may help identification. If you use this tag, you can delete the article from the moss listing; the article will be added to Category:Articles with unidentified words instead, and ignored by future runs of moss until the mystery is solved.
Don't worry if you miss something; it will reappear in a future report if there are still mistakes.
Suggested edit summaries
If you want to help publicize this project, you can copy-and-paste these into your edit summary, if appropriate.
For Wikipedia edits:
- Fix misspelling found by [[Wikipedia:Typo Team/moss]] – you can help!
- Tag non-English text found by [[Wikipedia:Typo Team/moss]] – you can help!
- Tag correct text as {{not a typo}} for automated spell checkers (including [[Wikipedia:Typo Team/moss]])
- Fix mismatched quote marks found by [[Wikipedia:Typo Team/moss]] – you can help!
For Wiktionary edits:
- Add word identified by [[w:Wikipedia:Typo Team/moss]] – you can help!
Wiktionary cheat sheet
Need to add a word to Wiktionary? The Wiktionary cheat sheet has copy-and-paste templates that make it easy for the types of words commonly encountered here, even if you've never done it before.
Misspellings - lists of things to fix
Likely misspellings by article (main listing)
The most efficient list to work on if all you want to do is fix misspellings. These listings try to list all the typos from a given article, so they can be fixed all at once. It also tries to only show typos that legitimately need fixing. It's not perfect, so a few words found need to be added to Wiktionary or tagged as not English, not a typo, etc. Only a few letters are updated on each run, to avoid stale listings as the whole list takes far longer than two weeks to work through. (This also avoids duplicating recent work when listings are refreshed.)
See subpages due to length:
- Wikipedia:Typo Team/moss/before A - Completed 2020-04-20 dump (except for cases that need tagging or investigation)
- Wikipedia:Typo Team/moss/A - Completed 2020-05-20 dump (except for cases that need tagging or investigation)
- Wikipedia:Typo Team/moss/B - Completed 2020-06-20 dump (except for cases that need tagging or investigation)
- Wikipedia:Typo Team/moss/C - Completed 2020-07-20 dump (except for cases that need tagging or investigation)
- Wikipedia:Typo Team/moss/D - Completed 2020-08-01 dump (except for cases that need tagging or investigation)
- Wikipedia:Typo Team/moss/E - Completed 2020-08-20 dump (except for cases that need tagging or investigation)
- Wikipedia:Typo Team/moss/F - Completed 2020-09-01 dump (except for cases that need tagging or investigation)
- Wikipedia:Typo Team/moss/G - Completed 2020-09-20 dump (except for cases that need tagging or investigation)
- Wikipedia:Typo Team/moss/H - Completed 2020-10-01 dump (except for cases that need tagging or investigation)
- Wikipedia:Typo Team/moss/I - Completed 2020-10-20 dump (except for cases that need tagging or investigation)
- Wikipedia:Typo Team/moss/J - Completed 2021-01-01 dump (except for cases that need tagging or investigation)
- Wikipedia:Typo Team/moss/K - Completed 2021-02-01 dump (except for cases that need tagging or investigation)
- Wikipedia:Typo Team/moss/L - Completed 2021-03-01 dump (except for cases that need tagging or investigation)
- Wikipedia:Typo Team/moss/M - Completed 2021-04-01 dump (except for cases that need tagging or investigation)
- Wikipedia:Typo Team/moss/N - Completed 2021-05-01 dump (except for cases that need tagging or investigation)
- Wikipedia:Typo Team/moss/O - Completed 2021-05-01 dump (except for cases that need tagging or investigation)
- Wikipedia:Typo Team/moss/P - Completed 2021-05-20 dump (except for cases that need tagging or investigation)
- Wikipedia:Typo Team/moss/Q - Completed 2021-06-01 dump (except for cases that need tagging or investigation)
- Wikipedia:Typo Team/moss/R - Typos from 2021-06-20 dump ready for fixing
- Wikipedia:Typo Team/moss/S - Typos from 2021-07-20 dump ready for fixing
- Wikipedia:Typo Team/moss/T - Completed 2019-08-01 dump (except for cases that need tagging or investigation)
- Wikipedia:Typo Team/moss/U - Completed 2019-08-20 dump (except for cases that need tagging or investigation)
- Wikipedia:Typo Team/moss/V - Completed 2019-08-20 dump (except for cases that need tagging or investigation)
- Wikipedia:Typo Team/moss/W - Completed 2019-08-01 dump (except for cases that need tagging or investigation)
- Wikipedia:Typo Team/moss/X - Completed 2019-03-20 dump, currently empty
- Wikipedia:Typo Team/moss/Y - Completed 2019-05-01 dump, currently empty
- Wikipedia:Typo Team/moss/Z - Completed 2019-03-20 dump, currently empty
- Wikipedia:Typo Team/moss/after Z - Completed 2019-03-20 dump, currently empty
Notes:
- For more cases that require investigation, see Category:Articles with unidentified words.
- Due to length and an increased number of false positives, typo reports for dumps 2020-05-20 and later don't include T2+, T3+, and TS+BRACKET+.
Likely misspellings by frequency (n-z)
The best list to work on if you want to eliminate all instances of a specific typo. Only typos that are very close to known words are shown. The algorithm is not perfect, so some of these may still be words that need to be added to Wiktionary. For each run, only words from half of the alphabet are shown, to avoid duplicate work from when new dumps are being processed.
Legitimate misspellings are candidates for Wikipedia:Lists of common misspellings. If there is an obvious correction, adding that to Wikipedia:Lists of common misspellings/For machines will help editors who use automated tools to fix cases faster.
- 37 - wikt:rayonı - Administrative and municipal divisions of Sevastopol, Administrative divisions of Crimea, Autonomous Republic of Crimea, Blagoveshchensky District, Bashkortostan, Buinsky District ... find all
- 33 - wikt:officialy - ABS-CBN Broadcasting Center, Arab separatism in Khuzestan, Aramean-Syriac flag, Birendranagar, Bryan Danesi ... find all
- 18 - wikt:sibyan - Burmese cuisine, Burmese curry, Kuttab, Shrimp curry ... find all
- 13 - wikt:suchs - Altitudinal migration, Balneário Camboriú, Eduardo Varela Pezzano, Italy, John Bluett ... find all
- 10 - wikt:pronouced - Battlefield Palette, Dang Le Nguyen Vu, SMS Beowulf, SMS Frithjof, SMS Hagen ... find all
- 10 - wikt:pnish - Eiji Moriyama, Mizuki Sano, Yuichi Tsuchiya ... find all
- 10 - wikt:performaing - List of Billboard Hot 100 top-ten singles in 2005, List of Billboard Hot 100 top-ten singles in 2006, List of Billboard Hot 100 top-ten singles in 2007, List of Billboard Hot 100 top-ten singles in 2008, List of Billboard Hot 100 top-ten singles in 2010 ... find all
- 10 - wikt:particulary - Augusto Barros, Dragon Ball Z: Idainaru Dragon Ball Densetsu, Franz Stassen, Harehills, John Thomas Micklethwaite ... find all
- 9 - wikt:zecs - Sainte-Anne River (Les Chenaux), Zec de Cap-Chat, Zec de la Bessonne, Zec de la Maison-de-Pierre, Zec de la Rivière-Cap-Chat ... find all
- 9 - wikt:rulling - 2000 Lithuanian parliamentary election, Baharia, Bangladesh, Broad Front (Uruguay), Duchy of Wiślica, Mangkunegara II ... find all
- 9 - wikt:reph - Assamese rô, Gujarati script, Malayalam script, Marthandavarma (novel), Mojibake ... find all
- 9 - wikt:protogonist - Inferno (Brown novel), Mission: Impossible (film), Mission: Impossible 2, Mission: Impossible III, Mission: Impossible – Fallout ... find all
- 8 - wikt:whichs - Alpár Jegenyés, Charles Brigham, Duisburg Hauptbahnhof, Early-May 1933 tornado outbreak sequence, Ecuadorian women's football championship ... find all
- 8 - wikt:suceeded - Françoise d'Eaubonne et l'écoféminisme, James St Clair, 18th Baron of Roslin, Jürgen Bogs, List of mayors of Ipswich Borough, Suffolk, Rolando Mosca Moschini ... find all
- 8 - wikt:preighters - Cargo airline, Emirates (airline), Impact of the COVID-19 pandemic on aviation, Preighter ... find all
- 8 - wikt:predominantely - Cutibacterium acnes, East Tennessee, Hart Sheik, Heis (town), Parasemionotiformes ... find all
- 8 - wikt:officals - 1970 Formula One season, 2019 Swiss ePrix, 845, Brigade Speciale Beveiligingsopdrachten, Kearns High School ... find all
- 8 - wikt:occour - Adenylosuccinate lyase deficiency, Encephalocraniocutaneous lipomatosis, Hyaluronidase deficiency, Hydrops-ectopic calcification-moth-eaten skeletal dysplasia, Langer–Giedion syndrome ... find all
- 7 - wikt:wıth - Alberto Salazar, Arbab Ghulam Rahim, Aydın Province, Cuisine of Corsica, Jim McGinlay ... find all
- 7 - wikt:wheep - American avocet, Golden-olive woodpecker, Squirrel cuckoo ... find all
- 7 - wikt:ucool - Dota 2, Valve Corporation ... find all
- 7 - wikt:themselwes - Banafsha bint Abdullah al-Rumiyyah, Black Death in the Middle East, Countess Charlotte Brabantina of Nassau, Countess Elisabeth of Nassau, Madeleine-Françoise Calais ... find all
- 7 - wikt:reguardless - The Church of Jesus Christ of Latter-day Saints in Angola, The Church of Jesus Christ of Latter-day Saints in Botswana, The Church of Jesus Christ of Latter-day Saints in Cambodia, The Church of Jesus Christ of Latter-day Saints in Indonesia, The Church of Jesus Christ of Latter-day Saints in Ireland ... find all
- 7 - wikt:ramaining - Daporijo, Koloriang, Nacho, Arunachal Pradesh, Seppa, Shi Yomi district ... find all
- 7 - wikt:promient - Aamir Rashadi Madni, Francis Bedford (photographer), James Austen, Muhammadzai (Hashtnagar), St. Petersburg green benches ... find all
- 7 - wikt:predominanty - Chatra (community development block), Gidhour block, Itkhori block, Landeskirche, Shaligram Ramnarayanpur (community development block) ... find all
- 7 - wikt:potrayed - Alamgir II, Cary Elwes, Chandralekha (TV series), Guanyin, Gurumayum Bonny ... find all
- 7 - wikt:potrayal - Chinglen Thiyam, Leishangthem Tonthoingambi Devi, List of awards and nominations received by Christian Bale, Maachis, Shougrakpam Hemanta ... find all
- 6 - wikt:villege - Itamati, Kotij, Mirza Huseyn Afandi Qayibov, Sarang Kheda Dam, Sukhdev Singh Dhindsa ... find all
- 6 - wikt:tournment - Chris Ridgeway, Goalkeeper (association football), Jorginho (footballer, born December 1991), NKP Salve Challenger Trophy 2009-10, Northeastern University ... find all
- 6 - wikt:tifas - Tifa (drum), Tifa totobuang ... find all
- 6 - wikt:temporaily - 2019 in rail transport, Gerry House, House of Heroes, Tom Clancy's Rainbow Six Extraction, Wilhelm Batz ... find all
- 6 - wikt:sthe - 2017–18 Colchester United F.C. season, Artemis Fowl and the Lost Colony, Kelly Lai Chen, Lejwana, Botswana, San Leandro Unified School District ... find all
- 6 - wikt:stereids - Physiological Plant Anatomy, Plagiomnium venustum, Pogonatum urnigerum ... find all
- 6 - wikt:servied - 44th Infantry Brigade (United Kingdom), Adendro, Antoine I de Gramont, Bizerte, Charles Luke (politician) ... find all
- 6 - wikt:secretely - 1592 papal conclave, Dario Fo, List of Power Rangers Megaforce characters, The Flame of Iridar, The Thirteenth Night ... find all
- 6 - wikt:remaked - & (Loona EP), Battle-Girl, Darling (2010 film), Junglee (1961 film), Udai Singh of Marwar ... find all
- 6 - wikt:reauthorised - 2020–2021 Ukrainian constitutional crisis, Albanian Armed Forces, Chlorpropham, Condor Legion Tank Badge, Spanish Cross ... find all
- 6 - wikt:proprieter - Addison Motor Company, Bhiwapur, Continental Iron Works, Rowena Granice Steele, The Trip to Biarritz ... find all
- 6 - wikt:prinicpal - Itkhori, La clemenza di Tito, North West Wales, SISTINE, Tulare Western High School ... find all
- 6 - wikt:pratically - Battle of Benevento, Classe préparatoire aux grandes écoles, Education in France, Heym SR 30, Stanley Rossiter Benedict ... find all
- 6 - wikt:prasing - Another Code: R – A Journey into Lost Memories, Ghost Pilots, Mandela (2021 film), Mors Praematura, Razgovor ... find all
- 6 - wikt:posthmously - Fyodor Shebanov, Here We Go, Phạm Phú Quốc, Tosia Altman, Vladimir Bobrov (pilot) ... find all
- 6 - wikt:poored - Bea Alonzo, Trinity College, Toronto ... find all
- 6 - wikt:oppostion - 100th Anniversary of the Chinese Communist Party, Fausto Gullo, Haleem Adil Sheikh, Kpandai (Ghana parliament constituency), Talbiseh ... find all
- 6 - wikt:neqe - Central Alaskan Yup'ik language, Teip, Toʼabaita language ... find all
Likely new English compounds by frequency (n-z)
The best list to work on if you want to add variations of known words to Wiktionary, mostly compound words. The algorithm is not perfect, so some of these might be common mistakes that need to corrected. For each run, only words from half of the alphabet are shown, to avoid duplicate work from when new dumps are being processed.
- 50 - wikt:timeschedule - 1975 Formula One season, 2007–08 Biathlon World Cup – World Cup 2, 2008–09 Biathlon World Cup – World Cup 1, 2008–09 Biathlon World Cup – World Cup 2, 2008–09 Biathlon World Cup – World Cup 3 ... find all
- 44 - wikt:resbaker - Superstar Duets, Tawag ng Tanghalan, Tawag ng Tanghalan (season 2), Tawag ng Tanghalan (season 3), Tawag ng Tanghalan (season 3, quarter I) ... find all
- 42 - wikt:shipname - Admiral Gorshkov, Adonis (disambiguation), Burdekin (disambiguation), Esquimalt (disambiguation), Gamston ... find all
- 37 - wikt:shrikethrushes - List of bird genera, List of birds of Asia, List of birds of Australia, List of birds of Bangladesh, List of birds of Brunei ... find all
- 31 - wikt:traplining - Amethyst-throated mountaingem, Black inca, Bombus hortorum, Bombus impatiens, Eufriesea surinamensis ... find all
- 28 - wikt:subassociations - 1927 Yugoslav Football Championship, 1928 Yugoslav Football Championship, 1929 Yugoslav Football Championship, Banja Luka City Stadium, Belgrade Football Subassociation ... find all
- 28 - wikt:railtracks - 2019–2020 United Kingdom floods, Beeches Light Railway, Caloocan station, Carrer d'Aragó, Barcelona, Centquatre-Paris ... find all
- 28 - wikt:pseudotentacles - Lebrunia coralligens, Lebrunia neglecta, Mithraculus cinctimanus, Phyllodiscus, Pseudobiceros bedfordi ... find all
- 26 - wikt:treecats - A Beautiful Friendship (novel), A Rising Thunder, Ashes of Victory, Fire Season, Honorverse ... find all
- 26 - wikt:treecat - A Beautiful Friendship (novel), Beginnings (Honorverse), Fire Season, Honorverse ... find all
- 26 - wikt:tangse - Chinese shamanism, Manchu shamanism, Shamanism in the Qing dynasty ... find all
- 26 - wikt:subname - BEXCO station, Black Pearl (disambiguation), Deokcheon station, Dongjak station, Gaebong station ... find all
- 26 - wikt:sidedraft - 2019 CircuitCity.com 250, Alfa Romeo 2000, BMC A-series engine, CVCC, Carburetor ... find all
- 26 - wikt:pulingaws - Puyuma Pulingaw ... find all
- 26 - wikt:pronggill - Acanthophlebia, Atalophlebia albiterminata, Atalophlebia aurata, Atalophlebia australasica, Atalophlebia australis ... find all
- 25 - wikt:underfolding - AK-74, Halcón M-1943, K31, Lehnar submachine gun, Nicaraguan Armed Forces ... find all
- 25 - wikt:railline - Akrata railway station, Alcântara (Lisbon), Bairabi–Sairang line, Bajkul Milani Mahavidyalaya, Bhadrak railway station ... find all
- 24 - wikt:trainstation - Athens Airport–Patras railway, Attnang-Puchheim, Bandar Putra Kulai, Buthidaung, Friedberg, Styria ... find all
- 24 - wikt:thedove - Cascade Christian High School, KCMX (AM), KDOV (FM), KGEC-LD, KQSL ... find all
- 24 - wikt:tanods - Adiangao, Ang Probinsyano (season 6), Ang Probinsyano (season 7), Barangay, Barangay hall ... find all
- 24 - wikt:sportsclub - Al Kharaitiyat, Eintracht Duisburg 1848, FC Ingolstadt 04, Gammelin, Haimhausen ... find all
- 24 - wikt:roadbridge - Bache railway station, Barton railway station, Camelon railway station, Carterton railway station (England), Cheltenham Racecourse railway station ... find all
- 24 - wikt:polysilazanes - Gurpreet Singh (professor), Inorganic polymer, Polysilazane, Silicon carbide ... find all
- 23 - wikt:stylonurines - Adelophthalmidae, Alkenopterus, Eurypterid, Eurypterina, Kokomopteroidea ... find all
- 23 - wikt:showcourts - 2011 Australian Open, 2011 French Open, 2012 French Open, 2013 French Open, 2013 US Open (tennis) ... find all
- 23 - wikt:shoplots - Bandar Sri Permaisuri, Bandar Tun Razak, Bongawan, Bukit Badak Komuter station, Bukit Jambul ... find all
- 23 - wikt:provicar - Apostolic Vicariate of Northern Germany, Coptic Catholic Patriarchate of Alexandria, Jean Basset (died 1707), Lazar Lazarević, Marko Perić ... find all
- 22 - wikt:workunit - 3G Bridge, BOINC client–server technology, LHC@home, QMC@Home, Rosetta@home ... find all
- 22 - wikt:treemail - Survivor Philippines (season 1), Survivor Philippines: Celebrity Doubles Showdown, Survivor Philippines: Celebrity Showdown, Survivor Philippines: Palau ... find all
- 22 - wikt:screenplayed - 180° Rule (film), Chhota Sa Ghar, Cuatro balazos, Cy Chermak, Farnoosh Samadi ... find all
- 22 - wikt:pruinosed - Agriocnemis pygmaea, Bradinopyga konkanensis, Diplacodes trivialis, Disparoneura apicalis, Elattoneura tetrica ... find all
- 22 - wikt:primarch - Aurelian (disambiguation), Chaos (Warhammer), Lion (name), Mass Effect 3, Morlock ... find all
- 22 - wikt:pitwall - 1981 Belgian Grand Prix, 1982 Formula One World Championship, 1990 Portuguese Grand Prix, 1995 Australian Grand Prix, 1995 Hungarian Grand Prix ... find all
- 22 - wikt:parahaoma - Ab-Zohr, Frashokereti, Soma (drink) ... find all
- 22 - wikt:overram - A Glimpse of Hell (book), Fred Moosally, USS Iowa turret explosion ... find all
- 21 - wikt:umhos - Barnes Brook, Burgess Brook, Clarks Creek (Lackawanna River tributary), Cranberry Run, Crooked Run (Catawissa Creek tributary) ... find all
- 21 - wikt:statline - 1995–96 Chicago Bulls season, 2015–16 Phoenix Suns season, Brandon Dickson, Chris Ross (basketball), Dez Wells ... find all
- 21 - wikt:runchase - 2017 Indian Premier League Final, Billy Stelling, Fifth Test, 1948 Ashes series, Fourth Test, 1948 Ashes series, Ian Johnson with the Australian cricket team in England in 1948 ... find all
- 21 - wikt:regazetted - Bemboka, Bundala National Park, Burtville, Western Australia, City of Ipswich, Elgin Vale Sawmill ... find all
- 21 - wikt:rahaya - Addi Azmera, Addi Walka, Amanit, Arebay, Aregen ... find all
- 21 - wikt:perhumid - ABC Islands bear, Acid sulfate soil, Appalachian temperate rainforest, Chalcorana eschatia, Climate change in Alaska ... find all
- 20 - wikt:watchpost - Aspö, Avandha Fort, Cardiff town walls, Castle Dracula, Dartford Library ... find all
- 20 - wikt:toneholes - C.G. Conn, Charles Nicholson (flautist), Contraforte, Feliks Ravdonikas, Flabiol ... find all
- 20 - wikt:supermodifieds - 1996 Indy Racing League, Bentley Warren, Doug Heveron, Habe Haberling, Knoxville Raceway ... find all
- 20 - wikt:subsequents - 2017–2018 Spanish constitutional crisis, Adriatic sturgeon, Burning of the Midnight Lamp, Carlos Nilson, Casi Justicia Social ... find all
- 20 - wikt:stakeswins - Alycidon, Biscay (horse), Blue Peter (British horse), Chatham (horse), Chester (horse) ... find all
- 20 - wikt:skilifts - Argentière, Beerfelden, Bukovel, Culture of Lebanon, Engstligenalp ... find all
- 20 - wikt:shiploaders - Bulk material handling, Finnpusku, Hunter Valley Coal Chain, Lambert's Point, Lanqiao ... find all
- 20 - wikt:sanjakbeys - Administrative divisions of the Ottoman Empire, Albanian nobility, Beylerbey, Eqrem Vlora, Feridun Ahmed Bey ... find all
- 20 - wikt:reintensification - 1909 Atlantic hurricane season, 1984 Pacific typhoon season, 1993–94 South Pacific cyclone season, Cyclone Ann, Cyclone Bonita ... find all
- 20 - wikt:puthis - Abdul Karim Sahitya Bisharad, Ashraf Hussain, Bengali poetry, Dobhashi, History of printing and publishing in Dhaka ... find all
- 20 - wikt:produsers - Axel Bruns (scholar), Produsage ... find all
- 20 - wikt:polydodecahedron - 120-cell, Cantellated 120-cell, Grand 120-cell, Grand stellated 120-cell, Great 120-cell ... find all
- 20 - wikt:opmask - AVX-512, Advanced Vector Extensions, EVEX prefix ... find all
- 19 - wikt:willowfly - Garden Mountain Cluster, Oemopteryx, Oemopteryx contorta, Oemopteryx glacialis, Strophopteryx ... find all
- 19 - wikt:warsuit - Lex Luthor, Lex Luthor (Smallville), Lex Luthor in other media, List of Earth One characters, Web (character) ... find all
- 19 - wikt:warjacks - Grind (board game), Hordes (game), Warmachine ... find all
- 19 - wikt:waistlock - Facebuster, Lou Thesz, Pin (professional wrestling), Powerbomb, Professional wrestling aerial techniques ... find all
- 19 - wikt:vicecomital - Aicard, Aimery II of Narbonne, Antonio Pessagno, Aubusson, Creuse, Bernard II, Count of Toulouse ... find all
- 19 - wikt:upperbody - Adrastus rachifer, Agriotes lineatus, Albert's lyrebird, Australian pratincole, Bare-shanked screech owl ... find all
- 19 - wikt:undée - Devon heraldry, Hawkridge, Chittlehampton ... find all
- 19 - wikt:translase - Alanine—tRNA ligase, Arginine—tRNA ligase, Asparagine—tRNA ligase, Aspartate—tRNA ligase, Cysteine—tRNA ligase ... find all
- 19 - wikt:threadlets - Callistele calliston, Comitas wynyardensis, Daphnella aulacoessa, Daphnella stiphra, Guraleus flaccidus ... find all
- 19 - wikt:taikodai - Doi taikomatsuri, Niihama, Saijō, Ehime ... find all
- 19 - wikt:swimoff - Australia at the 2013 World Aquatics Championships, Kara Lynn Joyce, Netherlands at the 2013 World Aquatics Championships, Robbie Taylor, Rules of water polo ... find all
- 19 - wikt:subperforate - Calliostoma fragum, Calthalotia mundula, Cantharidus antipodum, Chlorodiloma, Desmoulin's whorl snail ... find all
- 19 - wikt:subchart - Hot Latin Songs, Inalcanzable (song), Latin Airplay, List of Billboard Latin Pop Airplay number ones of 1994 and 1995, List of Billboard Latin Pop Airplay number ones of 1996 ... find all
- 19 - wikt:shatkarmas - Complete Illustrated Book of Yoga, Hatha yoga, Haṭha Ratnāvalī, Kriyā, Nadi (yoga) ... find all
- 19 - wikt:rivername - Fluberg, Fådalen, Fåvang, Gutulia National Park, Iwerne Courtney ... find all
- 19 - wikt:refranchising - Essex Thameside, Gatwick Express, Great North Eastern Railway, Greater Anglia (train operating company), Greater Western franchise ... find all
- 19 - wikt:redpots - 1999 Aggie Bonfire collapse, Aggie Bonfire, Aggie Bonfire leadership, Elephant Walk (Texas A&M) ... find all
- 19 - wikt:qplus - Atomic force microscopy, Franz Josef Giessibl, Non-contact atomic force microscopy ... find all
- 19 - wikt:predough - Croissant ... find all
- 18 - wikt:warbeasts - Hordes (game), List of Star Wars creatures, No Game No Life, Warhammer Age of Sigmar ... find all
- 18 - wikt:wallpaintings - Albertus Pictor, Ancient South Arabian art, Bromma, El Kab, Empúries ... find all
- 18 - wikt:tshang - 10th Dalai Lama, Jaisang Depa, Jonê County, Keutsang Hermitage, Mönlam Legpa Lodrö ... find all
- 18 - wikt:subdiaphanous - Cirsonella densilirata, Guraleus delicatulus, Iolaea eucosmia, Ividella navisa, Menestho felix ... find all
- 18 - wikt:selectboard - Alex Morse, Board of selectmen, Brookline, New Hampshire, Donald Milne, Dummerston, Vermont ... find all
- 18 - wikt:rootword - Chapan, Gohonzon, Ha (Javanese), Ka (Javanese), Kocot ... find all
- 18 - wikt:racedays - 1894–1987 New Zealand alcohol licensing referendums, 2021 Prada Cup, Aintree Central railway station, Anthony DeSpirito, Ascot Racecourse ... find all
- 18 - wikt:opencasting - Barrow Hill railway station, Bolsover Castle railway station, Brecon Forest Tramroad, Brookhouse Colliery, Castleford–Garforth line ... find all
- 17 - wikt:workunits - 3G Bridge, BOINC client–server technology, QMC@Home, Rosetta@home, Verifiable computing ... find all
- 17 - wikt:wheated - Bourbon whiskey, Buffalo Trace Distillery, Chattanooga Whiskey Company, Lincoln County Process, List of whisky brands ... find all
- 17 - wikt:unirank - Al-Beroni University, Altınbaş University, Ateneo de Davao University, Chreso University, Dawat University ... find all
- 17 - wikt:terminii - Bessbrook and Newry Tramway, Bobby Porritt, Business routes of Interstate 10, City of London, Georgia State Route 120 ... find all
- 17 - wikt:superfinalists - 4Fun, Denmark in the Eurovision Song Contest 2012, Denmark in the Eurovision Song Contest 2013, Denmark in the Eurovision Song Contest 2014, Denmark in the Eurovision Song Contest 2016 ... find all
- 17 - wikt:subciphers - 3-subset meet-in-the-middle attack, Biclique attack, Grand Cru (cipher), Hasty Pudding cipher, New Data Seal ... find all
- 17 - wikt:simsubs - Fee-for-carriage, Global Television Network, Simultaneous substitution, Super Bowl commercials ... find all
- 17 - wikt:shoreworkers - United Fishermen and Allied Workers' Union ... find all
- 17 - wikt:santris - Abdullah Gymnastiar, Bahar bin Smith, Burhanuddin Ulakan, Indonesian Muslim Council, Indonesians in Saudi Arabia ... find all
- 17 - wikt:refenestrated - Alfoxton House, All Saints Church, Dodington, Blackdown Hills, Brompton Ralph, Chastleton ... find all
- 17 - wikt:railworkers - Altenkirchener SG, Anarchism in Argentina, Argentine Regional Workers' Federation, BC Express (sternwheeler), Barrmill, North Ayrshire ... find all
- 17 - wikt:pronggilled - Furcatergalia, Habrophlebia, Habrophlebia vibrans, Leptophlebia cupida, Leptophlebia intermedia ... find all
- 17 - wikt:precalciner - Calcium looping, Cement kiln, Tire recycling ... find all
- 17 - wikt:ppcl - Plasma cell leukemia ... find all
- 17 - wikt:polykinetids - Ciliate, Colpodea, Gellertia, Halofolliculina corallasia, Heterotrich ... find all
- 17 - wikt:outloading - Allied logistics in the Southern France campaign, Anniston Munitions Center, Demountable Rack Offload and Pickup System, Railway stations on the Eyre Peninsula, UN offensive into North Korea ... find all
- 17 - wikt:outbent - Lygropia joasharia, Phaedropsis maritzalis, Phaedropsis venadialis, Phostria cleodalis, Pilocrocis bastalis ... find all
- 17 - wikt:oilhouse - Doubling Point Light, Fort Point Light (Maine), Grindel Point Light, Hillsboro Inlet Light, Kabetogama Ranger Station District ... find all
- 17 - wikt:nokoti - Fore people ... find all
- 17 - wikt:nagaiya - Fore people ... find all
- 43 - wikt:subscalar - Aforia goniodes, Aforia trilix, Anatoma alta, Benthomangelia macra, Bolma aureola ... find all → need a malacological definition
- 21 - wikt:peaktime - 1977 in British television, 1982 in British television, 1983 in British radio, Blockbusters (British game show), Bus Éireann Route 101 ... find all -it does get used in sources eg, but dictionaries use "peak time". I went through and changed all uses within WP prose to the dictionary spelling, but it maybe should get added to wikt.
- 19 - wikt:pstg - Language center, Language processing in the brain ... find all - okay so this is an acronym (pSTG) for an area in the brain. I would suggest a redirect of both the term (posterior superior temporal gyrus) and the acronym to either Brodmann area 22 or Superior temporal gyrus. --Xurizuri (talk) 04:43, 2 February 2021 (UTC)
- 19 - wikt:soundsystems - Acetate disc, Boiler Room (music broadcaster), Caribbean music in the United Kingdom, David Rodigan, Hopelessly in Love ... find all
- apparently this is a reggae term, I found a few instances of it w/o a space like this. The wp article is Sound system (Jamaican). --Xurizuri (talk) 05:35, 4 February 2021 (UTC)
Likely new words by frequency, all languages (n-z)
Good candidates for words to add to the English Wiktionary (which provides English definitions for words in all languages), as it seems English Wikipedia readers will frequently encounter them. For each run, only words from half of the alphabet are shown, to avoid duplicate work from when new dumps are being processed.
Most of the words are not from English. To get them off this list, you can either add an entry to the English Wiktionary (which provides English definitions for words in all languages) or tag all instances of the word on the English Wikipedia with {{lang}}. Wiktionary does not accept Romanizations for some languages, so those cases must be tagged as {{transl}} or {{lang}}.
- 156 - wikt:rishabham - Abhogi, Ahiri, Amritavarshini, Anandabhairavi, Andolika ... find all
- 134 - wikt:стрелковая - 109th Rifle Division (Soviet Union), 10th Guards Motor Rifle Division, 114th Rifle Division (Soviet Union), 118th Rifle Division, 121st Guards Rifle Division ... find all
- 112 - wikt:neivethanam - Adhirangam Ranganathaswamy temple, Adi Jagannatha Perumal Temple, Adikesava Perumal temple, Sriperumpudur, Alwarthirunagari Temple, Amirthakadeswarar Temple, Sakkottai ... find all
- 85 - wikt:sathurthi - Abirameswarar Temple, Adi Kumbeswarar Temple, Kumbakonam, Agastheeswar Temple, Agnipureeswarar Temple, Thirupugalur, Aiyarappar Temple ... find all
- 80 - wikt:panchamam - Abhogi, Ahiri, Amritavarshini, Andolika, Asaveri ... find all
- 79 - wikt:гвардейская - 100th Guards Rifle Division, 10th Guards Motor Rifle Division, 10th Guards Uralsko-Lvovskaya Tank Division, 121st Guards Rifle Division, 126th Guards Rifle Division ... find all
- 74 - wikt:pabhaga - Akhadachandi Temple, Arjunesvara Siva Temple, Astasambhu Siva Temples, Belesvara Siva Temple, Bhringesvara Siva Temple ... find all
- 72 - wikt:отдельная - 10th Guards Uralsko-Lvovskaya Tank Division, 15th Independent Special Forces Brigade, 203rd Rifle Division, 22nd Separate Guards Special Purpose Brigade, 407th Rifle Division ... find all
- 71 - wikt:navaranga - Arakeshvara Temple, Hole Alur, Architecture of Karnataka, Athani, Belagavi, Bherya, Bhoga Nandeeshwara Temple ... find all
- 69 - wikt:ongkan - Amnat Charoen Province, Ang Thong Province, Ban Khlong, Bueng Kan Province, Chachoengsao Province ... find all
- 65 - wikt:naivethanam - Abirameswarar Temple, Adi Kumbeswarar Temple, Kumbakonam, Agastheeswar Temple, Agnipureeswarar Temple, Thirupugalur, Aiyarappar Temple ... find all
- 58 - wikt:īlābād - Boneh-ye Esmail, Khuzestan, Eshqabad, West Azerbaijan, Esmailabad (north), Dorudzan, Esmailabad (north), Gowhar Kuh, Esmailabad (south), Dorudzan ... find all
- 56 - wikt:nawāḥī - Abu Kamal District, Al-Bab District, Al-Haffah District, Al-Hasakah District, Al-Malikiyah District ... find all
- 49 - wikt:zvenya - Zveno (Soviet collective farming) ... find all
- 49 - wikt:rúdo - Black Warrior (wrestler), CMLL 80th Anniversary Show, CMLL Super Viernes (April 2010), CMLL Super Viernes (May 2010), CMLL Torneo Nacional de Parejas Increíbles (2010) ... find all
- 46 - wikt:wijnmeester - Abraham Teniers, Adriaen Collaert, Albert Xavery, Alexander Casteels the Elder, Alexander Casteels the Younger ... find all
- 46 - wikt:radiodonts - Amplectobelua, Amplectobeluidae, Anomalocaris, Diania, Dinocaridida ... find all
- 45 - wikt:thaats - Asavari (thaat), Bhairav (thaat), Bhairavi (thaat), Bhoopeshwari, Bilaval (thaat) ... find all
- 45 - wikt:sołectwos - Adamów, Łuków County, Borzęcin, Lesser Poland Voivodeship, Dmosin, Dmosin Drugi, Dmosin Pierwszy ... find all
- 44 - wikt:īdābād - Aqeh Kheyl, Gorgabad, Ardabil, Kalateh-ye Seyyed Ali, South Khorasan, Mohammadabad-e Saidabad, Nematabad-e Ghar ... find all
- 44 - wikt:ssmcs - Genetics of infertility, Liposarcoma, Marker chromosome, Pallister–Killian syndrome, Small supernumerary marker chromosome ... find all
- 44 - wikt:paasurams - Alwarthirunagari Temple, Aravindalochanar temple, Azhagiyasingar temple, Thiruvali, Devapiran temple, Irattai Thiruppathy ... find all
- 43 - wikt:мотострелковая - 10th Guards Uralsko-Lvovskaya Tank Division, 13th Motor Rifle Division NKVD, 21st Guards Motor Rifle Division (Russia), 295th Motor Rifle Division, 4th Army (Soviet Union) ... find all
- 43 - wikt:nishadam - Abhogi, Asaveri, Atana, Bahudari, Bhupalam ... find all
- 41 - wikt:võmm - Ain Mäeots, Alo Kõrve, Anne Margiste, Carmen Mikiver, Der letzte Bulle ... find all
- 41 - wikt:osiedles - Administrative division of Poznań, Białołęka, Bieńczyce (Kraków), Bieżanów-Prokocim, Bronowice (Kraków) ... find all
- 40 - wikt:προσευχη - Codex Augiensis, Codex Claromontanus, Codex Porphyrianus, Minuscule 1739, Minuscule 181 ... find all
- 40 - wikt:ŭnbyŏng - Goryeo coinage, Korean currency, Korean mun ... find all
- 40 - wikt:sukhanasi - Arakeshvara Temple, Hole Alur, Brahmeshvara Temple, Kikkeri, Chalukya dynasty, Chennakeshava Temple, Hullekere, Chennakeshava Temple, Turuvekere ... find all
- 40 - wikt:sangatya - Hoysala literature, Kannada literature, Kingdom of Mysore, List of Yakshagana plays in the Kannada language, Medieval Kannada literature ... find all
- 40 - wikt:plăṣi - Alba County, Arad County, Argeș County, Bacău County, Bihor County ... find all
- 39 - wikt:stamhus - Aastrup (manor house), Amaliegade 8, Benzonsdal, Birkendegård, Christen Lindencrone ... find all
- 39 - wikt:propodus - Actaea savignii, Agonistic behaviour, Arthropod leg, Calcinus laevimanus, Cancer pagurus ... find all
- 39 - wikt:nymotypical - Aglais ichnusa, Aulocera magica, Autumn ringlet, Boloria thore, Boloria titania ... find all
- 37 - wikt:tapedar - Akai, Sindh, Alipur, Tando Muhammad Khan, Bhadmi, Bokhi, Chanesri ... find all
- 37 - wikt:sengekanten - A/S Palladium, Bedroom Mazurka, Bedside-films, Cinema of Denmark, Edelgave ... find all
- 37 - wikt:paḻaya - Ba (Indic), Bha (Indic), Ca (Indic), Da (Indic), Dha (Indic) ... find all
- 35 - wikt:srpskog - 1838 Constitution of Serbia, Anta Protić, Bože pravde, Božidar Zečević, Coronation of the Serbian monarch ... find all
- 35 - wikt:smafs - MAFF (gene), MAFG, MAFK, Small Maf ... find all
- 34 - wikt:საკრებულო - Abasha Municipality, Borjomi, Borjomi Municipality, Chkhorotsqu Municipality, Chokhatauri Municipality ... find all
- 34 - wikt:wakagashira - Kaneyoshi Kuwata, Kenichi Yamamoto (yakuza), Kiyoshi Takayama, Kodo-kai, Masaru Takumi ... find all
- 34 - wikt:posadniks - Administrative divisions of the Novgorod Republic, Arkazhsky Monastery, Cathedral of St. Sophia, Novgorod, Chiliarch, Economy of the Pskov Republic ... find all
- 33 - wikt:risalits - Adolf Foehr, Amaliegade 45, Bishop's Ordinariate, Charlottenlund Palace, Döbling Synagogue ... find all
- 33 - wikt:phytomorphic - Acquaviva delle Fonti Cathedral, Angra do Heroísmo City Hall, Campanhã railway station, Casa Fenoglio-Lafleur, Castello Normanno-Svevo (Gioia del Colle) ... find all
- 32 - wikt:āgamas - Anekantavada, Antakrddaasah, Anuttaraupapātikadaśāh, Aupapatika, Bhairava ... find all
- 32 - wikt:zmizelých - Stolpersteine in Hradec Králové Region, Stolpersteine in Hranice na Moravě, Stolpersteine in Karlovy Vary Region, Stolpersteine in Kolín, Stolpersteine in Lomnice u Tišnova ... find all
- 32 - wikt:storozhevoi - Kashin-class destroyer, Ocean escort, Project Hula, Tacoma-class frigate, USS Albuquerque (PF-7) ... find all
- 31 - wikt:изд - Albena Stambolova, Church of St Demetrius, Boboshevo, Church of St Elijah, Boboshevo, Daniel Kluger, Igor Birman ... find all
- 31 - wikt:vidwans - Evoor, Gururajulu Naidu, Hemavati (raga), Hyderabad Brothers, K. Bhaskaran ... find all
- 31 - wikt:uluuha - Abyysky District, Aldansky District, Allaikhovsky District, Amginsky District, Anabarsky District ... find all
- 31 - wikt:shachô - Cure (film), Eijirō Tōno, Frankie Sakai, Hideo Murota, Isao Yamagata ... find all
- 31 - wikt:rashmis - Rashmi (Hindu astrology) ... find all
- 31 - wikt:prōtosebastos - Alexios II Komnenos, Alexios Komnenos (protosebastos), Hrelja, Megas logothetes, Pinkernes ... find all
- 31 - wikt:paaltjasker - Tjasker, Tjaskers in Drenthe, Tjaskers in Friesland, Tjaskers in Germany, Tjaskers in Overijssel ... find all
- 31 - wikt:nnnn - Caversham Heights, Interleaved 2 of 5, List of Special Characters for Passwords, List of Unicode characters, List of XML and HTML character entity references ... find all
- 30 - wikt:đồngs - Bình Phước Province, Bình Định Province, Bắc Giang Province, Bắc Kạn Province, Cao Bằng Province ... find all
- 30 - wikt:stūpas - Bihari culture, Buddhism, Buddhist devotion, Buddhist symbolism, Faith in Buddhism ... find all
- 30 - wikt:sawnwork - Achmester, Bullock-Dew House, Campbell Farm (Edinburg, Virginia), Claymont Hill, Edgar Allan Poe House (Fayetteville, North Carolina) ... find all
- 30 - wikt:planigons - 3-4-3-12 tiling, 3-4-6-12 tiling, 33344-33434 tiling, List of Euclidean uniform tilings, Planigon ... find all
- 30 - wikt:phénakisticope - 3D film, Animation, Anorthoscope, Film, History of animation ... find all
- 29 - wikt:shākiriyya - Al-Mu'tasim, Jund, Shakiriyya ... find all
- 29 - wikt:ryttmästare - Adolf Ludvig Stierneld, Alexis Aminoff, Archibald Douglas (1883–1960), Axel Ståhle, Bror Munck (Swedish general) ... find all
- 29 - wikt:rofchade - TCEC Season 14, TCEC Season 15, TCEC Season 17, TCEC Season 18, TCEC Season 19 ... find all
- 29 - wikt:quæ - Coat of arms of Belfast, Constance Vella, Ecclesiastical letter, Evan Evans (poet), Ficus aurea ... find all
- 29 - wikt:paneláks - Housing estate, Kobylisy Shooting Range, Loučovice, Luděk Sekyra, Panelák ... find all
- 28 - wikt:usuiensis - Buchnera (plant), Deudorix jacksoni, Iolaus aequatorialis, Iolaus australis, Iolaus crawshayi ... find all
- 28 - wikt:urtekræmmer - Admiralgade 23, Brolæggerstræde 2, Frisch House, Gråbrødretorv 11, Gråbrødretorv 14 ... find all
- 28 - wikt:tropeiros - Araranguá, Caetité, Estrada Real, Itapevi–Butantã Metropolitan Corridor, Piraí do Norte ... find all
- 28 - wikt:protagonized - Acompañame A Estar Solo, Alicia Giménez Bartlett, Arquímedes Puccio, Braian Toledo, Carlos Capriles Ayala ... find all
- 28 - wikt:posthernsteini - Misikella, Oncodella, Rhaetian ... find all
- 27 - wikt:ḍākinīs - Daikokuten, Dakini, Kora (pilgrimage), Oddiyana, Suchandra ... find all
- 27 - wikt:дивизија - 11th Air Defense Division, 13th Air Defense Division, 15th Air Defense Division, 21st Aviation Division, 29th Aviation Division (Socialist Yugoslavia) ... find all
- 27 - wikt:νηστεια - Codex Augiensis, Codex Claromontanus, Codex Porphyrianus, Minuscule 1739, Minuscule 181 ... find all
- 27 - wikt:δij - Archimedes' principle, Borel–de Siebenthal theory, Brownian motion, Buoyancy, Cartesian tensor ... find all
- 27 - wikt:ādatābād - Dashtabad, Narmashir, Saadatabad, Abadeh, Saadatabad, Arsanjan, Saadatabad, Bardsir, Saadatabad, Darab ... find all
- 27 - wikt:sāg - Saag ... find all
- 27 - wikt:skaranikon - Akolouthos, Allagion, Despot (court title), Droungarios of the Fleet, Droungarios of the Watch ... find all
- 27 - wikt:recreotourism - Americaine River, Bouchard River, Bras de Ross (Brébeuf Lake), Cachée River (Mauvaise River tributary), Demers Island ... find all
- 26 - wikt:äkıms - 2019 Kazakh presidential election, El Tıregı, July 2021 Kazakh local elections, Kazakh democracy movement, List of Äkims of Atyrau Region ... find all
- 26 - wikt:vratas - Brihaddharma Purana, Golapurva, Index of Jainism-related articles, Jain monasticism, Jainism ... find all
- 26 - wikt:skiadion - Akolouthos, Allagion, Despot (court title), Droungarios of the Fleet, Droungarios of the Watch ... find all
- 26 - wikt:pulingaws - Puyuma Pulingaw ... find all
- 26 - wikt:owdava - Asaveri, Bhupalam, Dhanyasi, Gambhiranata, Garudadhvani ... find all
- 25 - wikt:ваздухопловна - 1st Air Command, 21st Aviation Division, 29th Aviation Division (Socialist Yugoslavia), 32nd Aviation Division, 37th Aviation Division (Socialist Yugoslavia) ... find all
- 25 - wikt:tabulars - Acherontiscus, Anthodon (reptile), Bulbasaurus, Callistomordax, Doswellia ... find all
- 25 - wikt:shatsruthi - Chalanata, Dhatuvardhani, Divyamani, Gangeyabhushani, Hatakambari ... find all
- 25 - wikt:pallavis - Chingleput Ranganathan, D. K. Pattammal, Dhaneswar Swain, K. Bhaskaran, Kalathur Kannamma ... find all
- 24 - wikt:äkım - 2020 Dungan–Kazakh ethnic clashes, July 2021 Kazakh local elections, Äkim of Almaty, Äkim of Shymkent ... find all
- 24 - wikt:rhynchos - Banded stilt, Brasinorhynchus, Doratorhynchus, Eucalyptus dolichorhyncha, Flammulated flycatcher ... find all
- 24 - wikt:reengined - Airspeed Ambassador, Airspeed Envoy, Armstrong Whitworth Argosy, Bell 430, Burt Rutan ... find all
- 24 - wikt:punctuosus - Agraulos, Condylopyge, Glyptagnostus reticulatus, John William Salter, Penarosa ... find all
- 24 - wikt:polysilazanes - Gurpreet Singh (professor), Inorganic polymer, Polysilazane, Silicon carbide ... find all
- 24 - wikt:phagors - Binary stars in fiction, Helliconia, Phagor ... find all
- 24 - wikt:orogene - Fatra-Tatra Area, The Fifth Season (novel), The Obelisk Gate ... find all
- 23 - wikt:ọgbanje - Ogbanje ... find all
- 23 - wikt:зрдн - Structure of the Bulgarian Air Force ... find all
- 23 - wikt:yonipitha - Aisanyesvara Siva Temple, Bharateswar Temple, Bhrukutesvar Siva Temple, Budha Deula, Byamokesvara Temple ... find all
- 23 - wikt:xvrieslandsia - Tillandsia australis, Tillandsia baliophylla, Tillandsia deppeana, Tillandsia imperialis, Tillandsia lampropoda ... find all
- 23 - wikt:workholding - Adam Koppy, Centerless grinding, Chuck (engineering), Collet, Dip soldering ... find all
- 23 - wikt:vezatin - GPR56, VEZT ... find all
- 23 - wikt:ucdn - Content delivery network interconnection ... find all
- 23 - wikt:tourmai - Aegean Sea (theme), Bucellarian Theme, Byzantine Crete, Byzantine army, Cibyrrhaeot Theme ... find all
- 23 - wikt:talmide - Dor Daim ... find all
- 23 - wikt:suraus - 2014 in Malaysia, Bandar Tun Razak, Bangsar, Islam in West Sumatra, Ismail al-Khalidi al-Minangkabawi ... find all
- 23 - wikt:stylonurines - Adelophthalmidae, Alkenopterus, Eurypterid, Eurypterina, Kokomopteroidea ... find all
- 26 - wikt:vrijburgers -
Afrikaners, Asafo, Boers, Cape Dutch, Dutch people... find all -> only one left for this is South African Wars (1879–1915), but I get too angry at its tone to finish it. The article has Dutch (iso nl) and Afrikaans (iso af) pretty often, but they are hard to tell apart. From memory, there is some Xhosa (iso xh) and Zulu (iso zu), and possibly other Black languages. Quick summary of how to tell: the relevant Black language is generally clear from context. Bantu languages may have proper nouns in camel case (e.g., isiZulu). Indigenous languages may have non-alphabetic symbols (Khoekhoe language is the one I've seen the most, iso naq). The "ij" letter combination is Dutch. Strings of vowels are usually Afrikaans. Anything about Boers is likely Afrikaans. Anything about the VOC is likely Dutch. By the time the Boer Wars started, it will generally be Afrikaans, not Dutch. --Xurizuri (talk) 03:50, 23 May 2021 (UTC) - 24 -
wikt:premiére - Atandwa Kani, Cigano (film), De Lafontaine, Figaro-Polka, Franz Christian Gau ... find all→ Use in English text fixed, but there are a few unconfirmable references with this spelling. Note this does also appear to be a Slovak spelling - not using the è as French does.- A new search comes up with more misspellings in English and French. -- Beland (talk) 02:47, 3 August 2021 (UTC)
Likely misspellings by frequency (a-m)
The best list to work on if you want to eliminate all instances of a specific typo. Only typos that are very close to known words are shown. The algorithm is not perfect, so some of these may still be words that need to be added to Wiktionary. For each run, only words from half of the alphabet are shown, to avoid duplicate work from when new dumps are being processed.
Legitimate misspellings are candidates for Wikipedia:Lists of common misspellings. If there is an obvious correction, adding that to Wikipedia:Lists of common misspellings/For machines will help editors who use automated tools to fix cases faster.
- 23 - wikt:eunited - Arcitys, Call of Duty Championship 2019, Giants Gaming, Samsora ... find all -> eUnited is a name. --Xurizuri (talk) 04:37, 23 May 2021 (UTC)
- 14 - wikt:deuls - Bardhaman, Bengal temple architecture, Chapari, Gumut, Bankura, Kanki, Purulia ... find all -> it may be a typo of deula or may be non-English. --Xurizuri (talk) 04:37, 23 May 2021 (UTC)
- 11 - wikt:laparascopic - Cholecystectomy, Epiploic appendagitis, Falloposcopy, Median arcuate ligament syndrome, Ovarian cancer ... find all -> typo of laparoscopic. Added to the common misspellings lists, but haven't corrected any. --Xurizuri (talk) 04:37, 23 May 2021 (UTC)
- 10 - wikt:eurotic - Anna Pletnyova, Vintage (band) ... find all -> Eurotic/eUrotic is a name. --Xurizuri (talk) 04:37, 23 May 2021 (UTC)
- 10 - wikt:daees - List of Dai of the Dawoodi Bohra, Sulaymani ... find all -> Daee may be an alternate transliteration of Da'i. Du'at appears to be the Arabic plural (based on Da'i al-Mutlaq), but da'is/daees are an anglicized plural. So the question is whether da'i/daee is now an English loanword which can be pluralised with an s, or if it's Arabic only and cannot. --Xurizuri (talk) 04:37, 23 May 2021 (UTC)
- 8 - wikt:multiplayers - Dummy hand, Gamestudio, Metal Slug (1996 video game), Photon: The Ultimate Game on Planet Earth, PlayAlong ... find all -> sometimes it's a clear typo, sometimes it's a name, sometimes it's used to refer to a group. For the last, I have no idea if it's a valid term. --Xurizuri (talk) 04:37, 23 May 2021 (UTC)
- 8 - wikt:garhs - Garhwal division, Lunahar, Panchagarh District, Rongpa, Sangram Shah ... find all
- 8 - wikt:econoımic - Ayaskent, İzmir, Elbeyli, Erdemli, Gücüş, Kargıcak, Keşlitürkmenli, Silifke ... find all -> typo of economic. Added to the common misspellings lists, but haven't corrected any. --Xurizuri (talk) 04:37, 23 May 2021 (UTC)
- 8 - wikt:developped - Cephalometric analysis, Code-division multiple access, Daniel Peter, Datagram Congestion Control Protocol, Joseph Romain-Desfossés ... find all -> typo of developed. Already on both common misspellings lists. I haven't corrected any. --Xurizuri (talk) 04:37, 23 May 2021 (UTC)
- 7 - wikt:meaing - Caloundra, Caloundra West, Queensland, Chaetodon falcula, Muirlea, Queensland, Orang Asli ... find all -> typo of meaning. Added to the common misspellings lists, but haven't corrected any. --Xurizuri (talk) 04:37, 23 May 2021 (UTC)
- 7 - wikt:lthe - Elizabeth, New Jersey, Goth (novel), Kalmyk Project, NASA Exceptional Scientific Achievement Medal, Plaza Bridge ... find all -> usually a typo of "the". Added to the main common misspellings list, but not machine readable due to the occasional correct usage --Xurizuri (talk) 04:37, 23 May 2021 (UTC)
Likely new words by frequency, all languages (a-m)
(Waiting for new dump; only cases with manual notes are shown below.)
- 6 (down from 53) - wikt:οτι - Lectionary 12, Minuscule 2444, Papyrus 63, Papyrus 91, Uncial 0170, Uncial 0308
- These all appear to be the Greek word 'οτι', which does not appear in wikt without breath marks. That is, see wikt:ότι, which then mentions forms wikt:τι, wikt:ὅτι, wikt:ό,τι.
- It would appear then that the proper action is to mark all these quoted Greek texts with {{lang}}? ::Also, I think I'll ask over at wikt if it would be reasonable for them to have an entry for wikt:οτι. They do have an entry for wikt:oti, which mentions at least wikt:ότι and wikt:ό,τι, but not wikt:ὅτι. (sigh) Oh what a tangled web we wind, when first we endeavor these defined. Shenme (talk) 04:30, 13 October 2019 (UTC)
- Additionally, many (all?) of these appear to be 'biblical' == classical == ancient Greek, which has ISO 639-2 code 'grc'. Modern Greek is ISO 639-1 code 'el', ISO 639-2 code 'gre'. Shenme (talk) 04:49, 18 October 2019 (UTC)
- Ah, but not all. Some found with search are modern Greek, so lang|el, and some 'oti' found having breath marks. Currently searching using "οτι" -insource:"lang|grc" -insource:"lang|el" -insource:"lang|gre" and working on labelling any form of Greek. Shenme (talk) 02:27, 20 October 2019 (UTC)
- 18 (down from 230) - wikt:æftiʀ - Princes Street Gardens Runestone, Runestones of Högby, Sjörup Runestone, Tjängvide image stone, Uppland Rune Inscriptions 101, 143 and 147, Östergötland Runic Inscription 179, Østermarie
- I don't think Old Norse entries with ʀ are allowed (they are either presented in Runic or normalized to r) on Wiktionary; the solution is to language-tag instances on here (generally as Old Norse although glancing at a few, it seems the articles/infoboxes helpfully specify which language it is in each case). -sche (talk) 20:13, 18 November 2018 (UTC)
For Wiktionary
This is a special section; putting a Wiktionary link here will cause a word to be ignored by the spell checker everywhere it appears (on the assumption it will soon be added to Wiktionary.)
Rejected
(These will need {{not a typo}} and maybe an HTML comment.)
- moved out from the list for English words first attested in Chaucer, this is apparently a misspelling of prentishood as Chaucer spelled it (wikt:prenticehood) --Xurizuri (talk) 12:55, 7 January 2021 (UTC)
- wikt:scorkle (from English words first attested in Chaucer list) apparently used to be on wikt then got RfD'd
Vocab pages
- 61 - Boontling - wikt:bahlness, wikt:beelch, wikt:beemsch, wikt:beeljeck, wikt:belhoon, wikt:blooch, wikt:bloocher, wikt:breggo, wikt:borp, wikt:bowgley, wikt:burlapping, wikt:chigrel, wikt:cloddies, wikt:comoshe, wikt:condeal, wikt:crazeek, wikt:deeger, wikt:deejy, wikt:dehigged,wikt:dissies, wikt:donicker, wikt:donagher, wikt:dreek, wikt:dreeked, wikt:dreeking, wikt:dulcey, wikt:eeld, wikt:eesole, wikt:haireem, wikt:heelch, wikt:pockety, wikt:higged, wikt:higgied, wikt:hobneelch, wikt:keishbook, wikt:kimoshe, wikt:kingster, wikt:madging, wikt:modocker, wikt:moldune, wikt:moldunes, wikt:nettied, wikt:nonch, wikt:oshtook, wikt:peeril, wikt:pusseek, wikt:rawncher, wikt:seertail, wikt:sirtle, wikt:sharkin, wikt:shoveltooth, wikt:somersetting, wikt:steedos, wikt:teebow, wikt:tuddies, wikt:tuddish
- (?) - English words first attested in Chaucer - wikt:attourne, wikt:feminie, wikt:gigge, wikt:louke, wikt:emprent, wikt:enbaissing, wikt:ensampler, wikt:entach, wikt:entech, wikt:entalent, wikt:eschaufe, wikt:festivally, wikt:foleye, wikt:forline, wikt:formly, wikt:fortunel, wikt:fortunous, wikt:habitacule, wikt:hustlement, wikt:necess, wikt:overwhelve, wikt:plungy, wikt:portionable, wikt:presentary, wikt:previdence, wikt:purveyable, wikt:rhetorian, wikt:slead, wikt:troublabla, wikt:unbetide, wikt:undoubtous, wikt:unleeful, wikt:unmovablety, wikt:unparegal, wikt:unplite, wikt:unweened, wikt:vengeress, wikt:weeply, wikt:witnessfully, wikt:begeth, wikt:anoyful, wikt:chincher, wikt:chinchery, wikt:counterwait, wikt:customance, wikt:custumance, wikt:humblehede, wikt:laureole, wikt:rackleness, wikt:clotheless, wikt:mistrest, wikt:nigromancian, wikt:sustenant, wikt:disfigurate, wikt:messagery, wikt:communably, wikt:jacounce, wikt:jagounce, wikt:mendience, wikt:miscoveting, wikt:misway, wikt:outwine, wikt:outsling, wikt:papelardy, wikt:ravisable, wikt:recreandise, wikt:ribanding, wikt:rideled, wikt:roinous, wikt:suckeny, wikt:timbester, wikt:villainsly, wikt:wyndre, wikt:minstrelly, wikt:sweynt, wikt:adjoust, wikt:annoyously, wikt:arbitry, wikt:asperness, wikt:celebrable, wikt:coetern, wikt:definish, wikt:delye, wikt:distempre, wikt:whaped, wikt:whaped, wikt:advocary, wikt:amphilbology, wikt:asfast, wikt:avaunter, wikt:betrend, wikt:calculing, wikt:circumscrive, wikt:defeit, wikt:defet, wikt:desespeir, wikt:desesperaunce, wikt:disblame, wikt:enterpart, wikt:estately, wikt:executrice, wikt:forbysen, wikt:forlose, wikt:grufe, wikt:howne, wikt:inhelde, wikt:kankedort, wikt:ounded, wikt:palaceward, wikt:palaceward, wikt:palaestrial, wikt:refreid, wikt:reheting, wikt:resport, wikt:saluing, wikt:scrivenliche, wikt:tempestous, wikt:unbroided, wikt:untroth, wikt:yfled, wikt:agrote, wikt:bedote, wikt:betraising, wikt:browd, wikt:radevore, wikt:renownee, wikt:tidive, wikt:tuteler, wikt:toteler, wikt:almury, wikt:embelif, wikt:solsticion, wikt:forloin, wikt:tickleness, wikt:uncorven, wikt:ungrubbed
- 9 - Longest word in English - wikt:broughammed, wikt:subdermatoglyphic, wikt:gravedinously, wikt:shakalshas, wikt:galahads, wikt:leucocytozoans, wikt:quiaquia
0-9
- 1 - 2008 Dublin Senior Football Championship - wikt:sline: Gaelic Football notation ('sideline') \\ this is on wikt but not for this meaning --Xurizuri (talk) 13:06, 7 January 2021 (UTC)
- 1 - 1830–1831 papal conclave - wikt:unvetoed - not a typo
- 3 - 1842 Wallachian princely election - wikt:sortitioned - past tense verb form of sortition
- 1 - 2000 and Whatever - wikt:auspOp: the name of a web site. Ira Leviton (talk) 16:21, 24 September 2019 (UTC)
- 1 - 2018 in Germany - wikt:indiologist - seems to be a real word
1 - 42 (dominoes) - wikt:renegger: a term used in the game and defined in the article. Ira Leviton (talk) 20:59, 26 September 2019 (UTC)- I think this belongs in the dictionary, as a derivation of wikt:reneg -- Beland (talk) 01:48, 24 March 2020 (UTC)
- I can only find two examples in books (wikt:Citations:renegger); if it were a valid alternative spelling, it would need three in order to have an entry (but some editors might argue it's just a rare misspelling). I've changed the Wikipedia entry to use the usual spelling, reneger, which already has a Wiktionary entry. -sche (talk) 19:33, 18 April 2021 (UTC)
- I think this belongs in the dictionary, as a derivation of wikt:reneg -- Beland (talk) 01:48, 24 March 2020 (UTC)
- 1 - 2015 African Youth Athletics Championships - wikt:octathlete - competitor in an octathlon*
1 - 1607 - wikt:pallisadoed- a real word- Added to Wiktionary, although it seems to be an obsolete spelling of palisaded. (The use in the Wikipedia article is in a quote, though, not in wikivoice, so it's OK.) -sche (talk) 19:41, 18 April 2021 (UTC)
- 1 - 17th Armored Engineer Battalion - wikt:chespaling- "chespaling mat" is a real term for a type of field matting
- 2 - 1980 Quebec referendum - (probably OK: wikt:regroupments) - conscious adaptation of a French word specifically in the context of the politics of Quebec
- 3 - 1854 Broad Street cholera outbreak - wikt:vibriones, wikt:vibriones, wikt:vibriones - a real word
- 1 - 1894 United States House of Representatives elections - wikt:silverist - if this is a real word, it means an American political faction
- 1 - 1938 NSWRFL season - wikt:trygetters - conceivably a real word (Australian)
- 2 - 1957 in jazz - wikt:sazabo - a Turkish musical instrument
- 2 - 1968–69 Mersin İdmanyurdu season - (probably OK: wikt:maçkolik) = maçkolik.com (Turkish sports website)
2 - 1st Aeromedical Evacuation Squadron - wikt:aeromedically, wikt:aeromedically- if "aeromedical" is an adjective, no reason why "aeromedically" can't be an adverb- Added. -sche (talk) 20:02, 18 April 2021 (UTC)
- 1 - 2001 Taiwan legislative election - wikt:reunificationist - must surely be a real word = "supporter of reunification"
- 1 - 2003 Somaliland presidential election - wikt:mistabulation - a real word
- 1 - 2NU - wikt:synclaver - this is a common spelling, so I've left it, but it may nevertheless be a typo for "synclavier"
- 1 - 2014 in Costa Rica - wikt:unjournalistic: this word seems OK. Ira Leviton (talk) 02:05, 30 September 2019 (UTC)
- 2 - 2016 PSOE crisis - wikt:officialists, wikt:officialists: a name given to a faction in this crisis (opposed to the "critics". Ira Leviton (talk) 21:17, 2 October 2019 (UTC)
- 6 - 2008 Murshidabad beheading - wikt:shalishi, wikt:shalishi, wikt:shalishi, wikt:shalishi, wikt:shalishi, wikt:shalishi = "shalishi court", which is a kangaroo court in India
- 1 - 2008 TC3 - wikt:polymict - a real word
- 2 - 2010–11 Reading F.C. season - wikt:backheeler - never seen it as a noun but no reason why not
- 1 - 2012 Ingleside, San Francisco homicide - wikt:undeportable - a real word
- 1 - 2006 Iranian Assembly of Experts election - wikt:provisionist: seems to be a term in Iranian politics. It comes up on Internet searches, but the citation in the article is in Persian. Ira Leviton (talk) 23:53, 29 September 2019 (UTC)
- 1 - 2006 Oregon Ballot Measures 46 and 47 - wikt:unobligated: OK, in an arcane financial way. Ira Leviton (talk) 23:53, 29 September 2019 (UTC)
- 1 - 20th Lancers (British Indian Army) - wikt:risallahs - word for an Indian cavalry unit - please add to Wikt
- 1 - 24/7 service - wikt:rehumanisation - apparently a word in the service sector
- 1 - 251st Cyberspace Engineering Installation Group - wikt:remissioning - cyberspace jargon
- 1 - 2017–18 Taça da Liga - wikt:repechaged - "repechage" is fine as a noun; "repechaged" is occasionally used, but is not necessarily correct, so leaving this here for a second opinion
- 4 - 3D cell culturing by magnetic levitation - wikt:adipospheres - real scientific term
- 1 - 1973 Soviet economic reform - wikt:derationalisation: seems OK in context and with British spelling. Ira Leviton (talk) 15:17, 29 September 2019 (UTC)
- 4 - 009-1 - wikt:cybernetized - intended for "adapted into a cybernetic form" but probably not a real word
- 1 - 2003 in Afghanistan - wikt:telekiosks: a legit term (plural). Ira Leviton (talk) 14:41, 26 May 2020 (UTC)
- 1 - 2018 Selangor state election - wikt:agropolitan: apparently a legitimate word. Ira Leviton (talk) 20:03, 25 May 2020 (UTC)
- 1 - 1962 Burmese coup d'état - wikt:intramilitary: seems OK. Ira Leviton (talk) 15:16, 27 May 2020 (UTC)
- the correct spelling is wikt:intra-military, but this isn't in wikt either so I'm leaving this entry here as a reminder. I have fixed it in the article.
- 1 - 1923 Madras Presidency Legislative Council election - wikt:diarchical - might be okay Strickesel (talk)
- its the adjective form for diarchy. Xurizuri (talk) 09:47, 28 December 2020 (UTC)
- 1 - 1 Corinthians 10 - wikt:eulogesas - a theological term, might be correct Strickesel (talk)
- 1 - 116th Operational Maneuvers Regiment - wikt:brelage: a type of belt or ropework. Ira Leviton (talk) 01:26, 25 May 2020 (UTC)
- 2 - 1946 Romanian general election - wikt:guardists: a term applied to members of the Iron Guard in Romania. Ira Leviton (talk) 00:08, 25 May 2020 (UTC)
- 2 - 35 mm movie film - wikt:anamorphically, wikt:anamorphosing: photography terms. Ira Leviton (talk) 14:41, 26 May 2020 (UTC)
- 2 - 86th Scripps National Spelling Bee - wikt:kneydel, wikt:knadel - different spellings of 'knaidel' Strickesel (talk)
- 1 - 2400-series (CTA) - wikt:unrehabbed - unsure if okay Strickesel (talk)
- 11 (number) - wikt:octiamonds - the (probably correct) name for a certain mathematical shape Strickesel (talk)
- 1 - 2019 in paleontology - wikt:cololites - intestinal casts
- 5 - 2016 in archosaur paleontology - wikt:oosp, wikt:oofam: two abbreviations. Ira Leviton (talk) 14:25, 24 May 2020 (UTC)
- 4 - 2017 in paleontology - wikt:eucladid: a legit paleontology term. Ira Leviton (talk) 14:25, 24 May 2020 (UTC)
- 8 - 2018 in mammal paleontology - wikt:monachine, wikt:pongines: wikt:monachine is the word for a member of the genus Monachinae. wikt:pongines (pl. of pongine) are members of Ponginae, i.e. orangutans. Xurizuri (talk) 10:43, 28 December 2020 (UTC)
- 2 - 2019 in amphibian paleontology - wikt:pipimorph: a frog term Ira Leviton (talk) 14:25, 24 May 2020 (UTC)
- 2 - 2019 in insect paleontology - wikt:stenine: an insect term Ira Leviton (talk) 14:25, 24 May 2020 (UTC)
- 5 - 2019 in mammal paleontology - wikt:anancines: anancines (pl.) are of the genus Anancus Xurizuri (talk) 10:43, 28 December 2020 (UTC)
- 1 - 2020 in arthropod paleontology - wikt:seroloid - probably okay Strickesel (talk)
- 1 - 9 (2009 animated film) - wikt:tarantulid - a real word Strickesel (talk)
- 1 - 2018 in echinoderm paleontology - wikt:disparid: a paleontology term. Ira Leviton (talk) 20:03, 25 May 2020 (UTC)
- 3 - 2018 in paleoichthyology - wikt:batomorphs: a scientific term. Ira Leviton (talk) 19:11, 25 May 2020 (UTC)
- 2 - 2019 in arthropod paleontology - wikt:agnostinids: a zoology term. Ira Leviton (talk) 19:11, 25 May 2020 (UTC)
- 1 - 1,2,3,4-Cyclohexanetetrol - wikt:inososes: chemistry term (plural). Ira Leviton (talk) 01:26, 25 May 2020 (UTC)
- 1 - 6-Nonenal - wikt:nonenals - might be correct Strickesel (talk)
- 1 - 1,1'-Bis(diphenylphosphino)ferrocene - wikt:dilithiation - chemistry term Strickesel (talk)
- 1 -
1,2,4,5-Cyclohexanetetrol - wikt:betitol- chemistry term Strickesel (talk) - 1 -
1,3-Cyclohexanedione - wikt:cycloxydim- chemicals Xurizuri (talk) 09:47, 28 December 2020 (UTC) - 5 -
1,3-Dipolar cycloaddition - wikt:oxacycle, wikt:oxacycles, wikt:oxacycle, wikt:oxacycle, wikt:oxacycle- chemistry term Strickesel (talk) - 1 - 1,4-dihydroxy-2-naphthoate polyprenyltransferase - wikt:naphthalenoid - chemistry term Strickesel (talk)
- 2 - 2,2'-Biphenol - wikt:diphosphite: a chemistry term. Ira Leviton (talk) 14:41, 26 May 2020 (UTC)
- 1 - 2,2'-Biphenylene phosphorochloridite - wikt:diphosphite: a chemistry term. Ira Leviton (talk) 14:41, 26 May 2020 (UTC)
- 1 - 2-Norbornyl cation - wikt:ethynium: a chemistry term. Ira Leviton (talk) 14:41, 26 May 2020 (UTC)
- 1 - 2-Phosphoglycolate - wikt:endiolate: a chemistry term. Ira Leviton (talk) 14:41, 26 May 2020 (UTC)
- 2 - 3-Methylcatechol - wikt:calopin: a chemistry term. Ira Leviton (talk) 14:41, 26 May 2020 (UTC)
- 1 - 4,7-Dihydroisoindole - wikt:tosylacetylene: a chemistry term. Ira Leviton (talk) 14:41, 26 May 2020 (UTC)
- 2000 AD crossovers - wikt:megalopolized, wikt:megalopolized: Inflection of the verb form of megalopolis. -- Beland (talk) 04:51, 7 June 2021 (UTC)
A
- 1 - A. carbonaria - wikt:varay: seems like a real word.
- This is part of the common name for the species, so I think that belongs in Wiktionary? If not, we would normally create a redirect from cotton varay to Albizia carbonaria and that would take care of it, but the latter hasn't been created yet. -- Beland (talk) 17:27, 13 October 2018 (UTC)
- 1 - Adana Center for Arts and Culture - wikt:ampire: possible Ottoman architectural style?
- 1 - Adeline's Dream - wikt:soddle - Possibly means 'soddy' a nickname for houses made of sod (earth and grass), potentially also Germanic slang for the same.
- 1 - African-American Vernacular English - wikt:fixina: dialect-specific
- Yes, but possibly too rare to meet Wiktionary Criteria For Inclusion in this specific spelling; we do have wikt:fixing to, wikt:finna and others. -sche (talk) 03:06, 28 November 2018 (UTC)
- 1 - Aina Wifalk - wikt:manuped: name for an invention.
- 1 - Alash Ensemble - wikt:limpi: old musical instrument?
- 1 -
Allelopathy - wikt:allelo-:used as a prefix to explain a word root. (Wiktionary does have prefixes and suffixes.) → lang marked - 1 - Alice Holt Forest - wikt:hangra: an Old English word.
- 1 - Antas de Ulla - wikt:liscos: a Spanish word, possibly regional, for a type of dish of bacon bits.
- Is that also the word in English for this specific preparation? -- Beland (talk) 07:28, 29 March 2019 (UTC)
- Given that has an English inflection, maybe this is actually now an English word? - Beland (talk) 07:28, 29 March 2019 (UTC)
- 2 - Amsterdamseweg - wikt:banpole, wikt:banpole: a type of monument or marker to indicate how far criminals were allowed to approach city. Western Europe, Netherlands. Not sure if it should be one or two words.
- 1 -
Andrew Glover (composer) - wikt:aleotory: evidently a real word, although I can't define it. See https://aleacounterpoint.wordpress.com/2010/06/08/orpheus/ → was corrected to "aleatory" - 1 - ArmSCII - wikt:yiwn - normal Armenian ech (yech) and yiwn (vyun) small letters pair
- 1 - Arto Tunçboyacıyan - wikt:blul: an Armenian musical instrument, described as the same as or similar to a sring
- 1 - Arts and Science Center for Southeast Arkansas - wikt:seriographs: seems like a legitimate word. -> There's a Wikipedia article; Wiktionary needs the word and its plural.
These were checked for misspellings and determined to be OK. They need to be added to Wiktionary or an exclusion list:
- Most of these seem like other language words --Xurizuri (talk) 13:24, 7 January 2021 (UTC)
- Apo (drink) - wikt:wiyu - checked, OK
- Apocalypse (Star Wars novel) - wikt:drochs - checked, OK
- Apodemia mormo langei - wikt:psychicola - checked, OK
- Apolinar's wren - wikt:twii (probably OK: wikt:tchorr) - checked, OK \\ w/o having checked, I'll bet all my savings that these are bird sounds --Xurizuri (talk) 13:24, 7 January 2021 (UTC)
- Aporia hippia - wikt:taupingi - checked, OK
Aposthia - wikt:aposthic, wikt:aposthic, wikt:aposthic, wikt:aposthic- checked, OK- Apotomops rhampha - wikt:rhamphos - checked, OK
- Apotropaic mark - wikt:trepein (probably OK: wikt:apotrepein) - checked, OK
- Appendix Probi - wikt:denasalised, wikt:numqua - checked, OK
- Appendix Vergiliana - wikt:keirein - checked, OK
- Appias ada - wikt:thasia (probably OK: wikt:tindalti) - checked, OK
- Apple Blossom Handicap - wikt:distaffers - checked, OK
- Apple of my eye - wikt:iyshown, wikt:iyshown, wikt:iyshown - checked, OK
- Application Enhancer - wikt:haxies, wikt:haxies, wikt:haxies - checked, OK
- April 2009 Moldovan parliamentary election protests - wikt:episodul - checked, OK
- April Daniels - wikt:andrn - checked, OK
- Aprosphylosoma - wikt:julidan - checked, OK
- Aptamer - wikt:aptabodies (probably OK: wikt:postranslational, wikt:trxA) - checked, OK
- Aptenia - wikt:ptenos - checked, OK
- Apulet - wikt:apulettes - checked, OK
- Aqraba, Nablus - wikt:khirbets - checked, OK
- Aqua Virgo - wikt:vinustas - checked, OK
- Aquatic garter snake - wikt:zaxanthus - checked, OK
- Aquilarhinus - wikt:palimentus - checked, OK
- Aquilino Ribeiro - wikt:encoiradas - checked, OK
- Arab Street - wikt:pukadai, wikt:sadkku - checked, OK
- Arabana people - wikt:wadlu, wikt:wagka (probably OK: wikt:woqka) - checked, OK
- Araeosoma - wikt:dactylous (probably OK: wikt:brunnichi) - checked, OK
- Aralez (mythology) - wikt:aralezes, wikt:aralezes, wikt:aralezes, wikt:aralezes - checked, OK
- Arancini - wikt:bburru - checked, OK
- Arapian - wikt:kavourma, wikt:loutza, wikt:caseri, wikt:chasapaki - checked, OK
- Araripedactylus - wikt:daktylos - checked, OK
- Araucaria biramulata - wikt:biramule - checked, OK
- Arbore people - wikt:kyrnat, wikt:qawots (probably OK: wikt:chirnan, wikt:morqo, wikt:qawot) - checked, OK
- Arboretum La Alfaguara - wikt:manleb (probably OK: wikt:euromericana) - checked, OK
- Arbostola - wikt:heuritica - checked, OK
- Arbutus unedo - wikt:kocimare (probably OK: wikt:komaròs) - checked, OK
- Arbuzov - wikt:arbooz - checked, OK
- Arca (bivalve) - wikt:kauaia (probably OK: wikt:koumaci) - checked, OK
- Arcadian League - wikt:myrioi - checked, OK
- Archaefructus - wikt:eoflora - checked, OK
- Archaeocyon - wikt:leptodus (probably OK: wikt:falkenbachi)- checked, OK
- Archaeocyte - (probably OK: wikt:collencytes) - checked, OK
- Archaeognatha - (probably OK: wikt:koryphē) - checked, OK
- Archaeoindris - (probably OK: wikt:collodiaphyseal) - checked, OK
- Archaeology of Qatar - wikt:rawdas - checked, OK
- wikt:archaios (from Archaeornithomimus, Archaeornithoides, Archaeoistiodactylus, Archaeoindris, Archaeognatha, Archaeocyte) used to be in wikt but was deleted for being a frivolous entry.
B
- 1 -
Balloon light - wikt:tuboid:legitimate word, meaning resembling or like a tube. - 1 - Batog - wikt:batogs: legitimate word.
- 1 - Battered (band) - wikt:burkies: probable legitimate slang use of a word.
- 1 -
Bathford - wikt:drayning: old English spelling of draining.→ it is in a quote so should not reappear. - 1 - Baths of Agrippa - wikt:quadran: a Roman bronze coin worth one quarter of an as.
- 1 - Bathtub boat - wikt:tubbers: somebody who races in bathtubs; a bathtub racer.
- 2 - Baju Kurung - wikt:sampin, wikt:sampin: probably a Malay word.
- I don't know if there's a different English word for this, so it might just be a borrowing. -- Beland (talk) 16:39, 29 May 2019 (UTC)
- 1 - Bruce Kiskaddon - wikt:creakin - many in' word endings: can they be special-cased?
- I think these should be added to Wiktionary as variant spellings? -- Beland (talk) 16:39, 29 May 2019 (UTC)
- 1 - Buhler Group - wikt:gristing - needs to be in wikt (it's what grist mills do)
- 1 - Butts Up - wikt:savies - defined in artile. Slang/sports term. Elfabet (talk)
- 1 - Béton brut - wikt:shetting - defined in article with source, should be added, probably. Elfabet (talk)
- 1 - Brisket - wikt:brusket: Middle English.
- 1 - Bourgueticrinida - wikt:cirrals: part of a sea lily (plural).
- 1 - Bible and Orient Museum - wikt:ethnologica: old-fashioned, but OK.
- 2 - Big-bang firing order - wikt:twingles, wikt:twingles: plural for twingle, a type of engine re-engineered to have cylinders fire simultaneously instead of alternately.
- 1 - Birstall, West Yorkshire - wikt:byrh: Old English.
- 1 - Black Shuck - wikt:skuh: Old English.
- 1 - Blagdon - wikt:bloec: Old English, meaning 'black' or 'bleak'.
- 1 - Blasius Merrem - wikt:carinates: an term for flying birds, with a keeled sternum.
- 1 - Bellwether - wikt:bellewether: Middle English spelling, used as an example in the article.
- 1 - Berberis canadensis - wikt:glaucose: used correctly in the article - it's a word.
- 1 - Bergstedt - wikt:stedt: used as a suffix, as in -stedt.
- it's in wikt, but as a Danish word --Xurizuri (talk) 02:52, 8 January 2021 (UTC)
- 1 - Bertram de Criol - wikt:constabularie: Old English spelling.
- 2 - Band government - wikt:treatied, wikt:untreatied: I'm not sure if this is proper use of the word treaty.
- 1 - Barbiturate - wikt:tooties: slang for barbiturate, as mentioned in the article.
- 1 - Bahaba - wikt:chaptis: a common name taken from a species name.
- 1 - Businessman (film) - wikt:flexies - unsure
- Seems to be a type of posable doll, possibly a genericized trademark. -- Beland (talk) 21:13, 24 June 2019 (UTC)
- it's in wikt, but as a Dutch word --Xurizuri (talk) 02:52, 8 January 2021 (UTC)
- 1 -
Breakscore - wikt:zouching: zouch is a term for a scoreless roll in the game, maybe in other games too. - 3 - Book signing - wikt:ereading, wikt:ereading, wikt:ereading: short for "electronic reading".
- 1 - Barney Berlinger - wikt:septathlon: used in the newspaper article used as a reference.
- 2 - Bathtub racing - wikt:tubbers, wikt:tubbers: a bathtub racer.
- 1 - Battle of Byczyna - wikt:elears: a pretty obscure word. A type of cavalry fighter (and plural).
- 2 - Bay of Sielmönken - wikt:warfts, wikt:warfts: a type of Northern European artificial dwelling mound - see Terp
- 2 - Blood and Thunder (comics) - wikt:squigs, wikt:squigs: a type of character in a game or comic. Short for "squiggly beast".
- 1 - Belmond Las Casitas - wikt:colcas: Spanish or a local Indian word for the mud and stone granaries built into the cliffs or caves, and for which the Colca Valley is named. Plural.
- 1 - Battle-Pieces and Aspects of the War - wikt:outly: this appears to be a typo in a source copied online. I can't locate the original source, nor figure out what the word should be. I don't want to mark it with [sic] because I don't know if there's an error in the original source.
- I assume this is the verb form of outlier, and it's simply archaic. ("So that's where that went!") -- Beland (talk) 19:19, 25 June 2019 (UTC)
- 27 - wikt:adobong - Cabalen, Ipomoea aquatica, Kapamilya, Deal or No Deal, Philippine adobo, Squid as food ... find all
- A Filipino word derived from adobo meaning cooked in a marinade. I redirected, but if someone knows how to add these to Wiktionary, pleaes do it.
C
- 2 - Capriccio (art) - wikt:quadratture: may be mispelled (one 't'). The same word applied to math and other fields has one 't', but I'm not sure about this. I'll leave it to the art experts.
- Shows up on Google Books, probably should be added to Wikitionary, possibly in both English and Italian or Latin. -- Beland (talk) 18:04, 14 September 2019 (UTC)
- 1 - Catherine de' Medici's building projects - wikt:priants: correct, people kneeling in prayer.
- Added. -sche (talk) 02:37, 13 April 2021 (UTC)
- 1 - Chattanooga Choo Choo (film) - wikt:chuggin: in a movie tagline, as "chuggin' "
- 1 - Cheese on toast - wikt:choast: slang for cheese and toast, explained in the article.
- 1 - Cheltenham - wikt:cilta: word root explaining the etymology of the town name.
- it used to have a wikt but got moved to be included in an appendix. not sure what that means or how it affects the bot. --Xurizuri (talk) 03:14, 8 January 2021 (UTC)
- If you look at the page it was moved to, what was moved was a Lojban entry, not a "pre-British" word. But in order to include the pre-British word, more information would be needed on what language it would be... -sche (talk) 02:37, 13 April 2021 (UTC)
- it used to have a wikt but got moved to be included in an appendix. not sure what that means or how it affects the bot. --Xurizuri (talk) 03:14, 8 January 2021 (UTC)
- 1 - Catopta saldaitisi - wikt:stroky: quoted correctly from a citation.
- 1 - Cusec - wikt:cufm - unit of flow rate. stands for 'cubit feet per minute'. Includable, though uncommon.
- 1 -
Cyclist fatality rate in U.S. by year - wikt:trikkes- A company's name for their 3-wheeled, body-powered transport device that is not copyright, so isn't a proper noun, but almost should be. - 1 - Cremlingen - wikt:deestablishment - "Until its deestablishment in 1974"
- Rare, sometimes shows up as de-establishment, but is in Google Books. -- Beland (talk) 19:29, 14 September 2019 (UTC)
- Added. (Plural seems to exist only as a scanno, though, combining de- and -establishments or deestablish- and -ments from different columns of text.) -sche (talk) 02:37, 13 April 2021 (UTC)
- Rare, sometimes shows up as de-establishment, but is in Google Books. -- Beland (talk) 19:29, 14 September 2019 (UTC)
- 1 - Cybermind - wikt:asence - unsure. A made-up word by one author about the presence or lack there-of of people online. Not in common usage. Probably just {notatypo} it and call it day? Elfabet (talk)
- Seems to have been used by multiple people on a discussion list? Should probably be investigated to see if it meets criteria for inclusion. -- Beland (talk) 19:29, 14 September 2019 (UTC)
- 1 - Current (mathematics) - wikt:comass - unsure. Mathematical term? Defined in article? -- seems to be 'co-mass' Jkgree (talk) 15:46, 20 February 2019 (UTC)
- 1 - Crime in Iran - wikt:toumans - "amounts to 10 trillion toumans a year (1 touman equals 10 rials)"
- 1 - Cernach mac Fergusa - wikt:subsept: a subdivision of a tribe. Legitimate use.
- 1 - Chaos (genus) - wikt:uroid: a subcellular portion of an amoeba.
- 1 - Conulariida - wikt:conulate: seems like a legitimate word, although restricted to science.
- 1 -
Coregonus maraena - wikt:whitfish:appears to be spelled correctlybobdog54 (talk) 19:44, 14 December 2018 (UTC) - 4 - Coat of arms of Barcelona - wikt:paletts, wikt:paletts, wikt:fomer, wikt:paletts: diminutive of Pale (heraldry)?
- 1 - Coat of arms of the London Borough of Hammersmith and Fulham - wikt:pomels: a heraldic term.
- used to exist but got deleted with the reasoning "ME not ModE" which I do not understand --Xurizuri (talk) 03:14, 8 January 2021 (UTC)
- The edit summary means "[this word only exists as] Middle English not Modern English", though the entry shouldn't have been deleted, just the language header/codes should have been changed from presenting it as modern English to Middle English. It may, in fact, also exist in modern English in heraldic descriptions, in which case even the modern English entry could be restored, but for now I've at least restored it as Middle English. -sche (talk) 21:53, 12 April 2021 (UTC)
- used to exist but got deleted with the reasoning "ME not ModE" which I do not understand --Xurizuri (talk) 03:14, 8 January 2021 (UTC)
- 1 - Coat of arms of the Prince of Asturias - wikt:bordured: a heraldic term - bordure
- Added. -sche (talk) 02:47, 13 April 2021 (UTC)
- 2 - Coat of arms of the Valencian Community - wikt:paletts, wikt:paletts: diminutive of Pale (heraldry)? - i.e., "pallets" (usual spelling)
- 1 - Ciconiae Nixae - wikt:regionaries: seems legit (plural).
- 1 - Cox Green, Tyne and Wear - wikt:coccs: Old English cocc, crest of a hill.
- I'm not sure if this plural is actually attested in Old English, though; the standard plural is coccas. -sche (talk) 02:47, 13 April 2021 (UTC)
- 1 - Coagulin - wikt:proxins: a type of protein.
- 1 - Craver Farmstead - wikt:skipples - "In the 1790 lease The Millers agreed to a yearly rent of 24 1/2 skipples of winter wheat"
- Merriam-Webster says this an alternate spelling of wikt:schepel, a Dutch unit -- Beland (talk) 20:56, 14 September 2019 (UTC)
- 1 - Champion (apple) - wikt:shampion: alternative spelling of the Champion type of apple.
- 1 - Chest (furniture) - wikt:wakis short for "wagon-kist".
- 2 - Chester Zoo - wikt:mantellas, wikt:mantellas: plural of mantella, a type of frog.
- 2 - Cheuksin - wikt:jesas, wikt:jesas: a type of Korean ritual.
- 1 - Chhurpi - wikt:durkha: a Nepali type of cheese.
- 1 - Chimantaea - wikt:paramoid: correctly used, according to the page, meaning "like paramo."
- 1 - Chorus line - wikt:twirlies: a term used for girls in a chorus line.
- it's in wikt as a plural, but the entry for the singular doesn't include this meaning --Xurizuri (talk) 03:14, 8 January 2021 (UTC)
- 2 - Christopher O'Hare - wikt:cremain: short for cremated remains.
- 1 - Chub (disambiguation) - wikt:chubbing: a legislative discussion among several members to waste time and/or block action. Similar to filibustering.
- 2 - Chung Do Kwan - wikt:guep: a level in Tang Soo Do, a Korean martial art.
- 1 - Church of St. James (Brno) - wikt:flanning: architectural term meaning "the internal splay or bevel of a window-jamb."
- 2 - Coat of arms of Nuuk - wikt:siminar, wikt:siminar: the name of a type of building in Nuuk.
- 3 - Cobbler (food) - wikt:cobeler, wikt:sonker, wikt:sonker. Cobeler explains the etymology of cobbler. Sonker is a local North Carolina variation, a cross between a cobbler and a pie.
- 1 - Cipriani Potter - wikt:valzers: in the title of a piece written for piano.
- 1 - Cornish currency - wikt:dynar: an old Cornish currencey.
- 1 - Cornish jack - wikt:labeos: a type of fish (plural).
- 1 - Creedmoor Branch - wikt:demapped - "finally being torn up and demapped in the early 1970s."
- 1 - Chain conveyor - wikt:multiflexing
- 1 -
Charles Dallas - wikt:mcrt: it's an abbreviation but I don't know for what. (It even has a period.)- The Internet thinks this means "marriage certificate". It may be a common abbreviation in geneology so might be eligible for Wiktionary. But it's obscure enough not to use in Wikipedia articles. -- Beland (talk) 23:11, 11 October 2019 (UTC)
- 1 - Cinema of the United States - wikt:photogenia: nots sure if it's a legit word. It's used to mean "the desire to make everything photogenic for social media impact".
- 2 - Cigu Niru - wikt:nirus: transliteration of a Chinese word (plural) of an army unit.
- It looks like "niru" has been borrowed (at least in this article) into English, since it's getting an English plural, so "niru" and "nirus" should probably be added to Wiktionary. Though I'm unclear on the precise definition of "niru". -- Beland (talk) 00:52, 3 December 2019 (UTC)
- 1 - Cleobury Mortimer - wikt:clifu: an Old English word, meaning a steep place.
- Old English words should definitely be in English Wiktionary, but I also tagged it since Old English pronunciation is different. -- Beland (talk) 01:28, 3 December 2019 (UTC)
F
H
- 3 - HAL (software) - wikt:devd
- 1 -
Himura Kenshin - wikt:mangazine-- Word needs to be added to wikt - 1 - Hindsiclava paraconsors - wikt:costals -- Word needs to be added to wikt, I think \\ it's on wikt but in Catalan. The English for costal does exist, so it would be fairly easy to make the English one for costals
- 1 - Hereford United F.C. - wikt:kitwear: a local Britishism.
- 1 - Han Bwee Kong - wikt:koyans: an old Korean measure for grain.
- 1 - Hand-colouring of photographs - wikt:brittling: seems legitimate, the process of becoming brittle, like an old photograph.
1 - Helena (wife of Inge the Elder) - wikt:attungs: not English. Possibly Swedish.Swedish - a mediaeval land measure - tagged, but should be in Wikt- 1 - Helene J. Kantor - wikt:amortous: unclear. This is not a word, but it's apparently in a source.
- Might be from Greek? "Not fatal"? -- Beland (talk) 05:45, 10 June 2021 (UTC)
- 1 - Helietta - wikt:barettas: variant spelling of a common name for the plant.
- 1 - Henryk Baranowski - wikt:sensuation: unclear. Considering that it involves James Joyce, it might be a correct word, and I left it alone in the hopes that somebody can clarify.
- 3 - Heterolithic bedding - wikt:flasers, wikt:flasers, wikt:flasers: a type of pattern in a rock (plural).
- 4 - Hiligaynon people - wikt:hablon, wikt:hablon, wikt:hablon, wikt:hablon -- Word needs to be added to wikt
- 1 - History of Seattle - wikt:fringies: a local Seattle term.
- 1 - History of Seattle since 1940 - wikt:fringies: ditto.
- 1 - History of rail transport in Luxembourg - wikt:brakers: workers who braked trains before the introduction of air brakes.
1 - Hippodrome of Olympia - wikt:kalpe -- UnsureAnct Greek for a type of horse race in the ancient Olympics - tagged; ought to be in Wikt- 1 - Historic Lac qui Parle County, Minnesota - wikt:deorganized: as in a county being de-incorporated and absorbed into other counties.
- 1 - Hornsea - wikt:morraines: plural of morraine.
- 1 - Horse harness - wikt:britchen: seems like a legit word.
- 1 - Hot Coffee mod - wikt:unpatch: software jargon.
- 1 - Hotel Artemis - wikt:dowdied: quoted from a movie review.
- 3 - Hogback (sculpture) - wikt:tegulation, wikt:tegulation, wikt:tegulation - this is not a typo but a real word
- 1 - Hermann Ebbinghaus - wikt:mnemometers: an obsolete device that measured memory (plural).
- 1 - Hestock - wikt:bablets: architectural term of unknown meaning
- 1 - Horbury Hunt Hall - wikt:simplate: Architectural term of unknown meaning.
J
- 1 -
Joadja, New South Wales - wikt:oilworks: seems legit. - 1 - JotForm - wikt:esigning: short for "electronic signing" like email for "electronic mail".
- 1 -
John Austin Victoreen - wikt:otometry:an old but legitimate word. Not optometry.
O
- Open fracture - wikt:unreamed: a term used in orthopedic operations.
- Osteoradionecrosis - wikt:decoronation: a dental term.
- Okishio's theorem - wikt:temporalist: According to checked on Merriam-Webster, it is a word.
- wikt:beatmap, wikt:beatmaps (see beatmap)
P
- 1 - Pachylemur - wikt:anticliny - the state of having wikt:anticlines – Normal Name (talk) 22:54, 24 June 2021 (UTC)
- 1 - Pacific oyster - wikt:cavortin - the name of a gene – Normal Name (talk) 22:54, 24 June 2021 (UTC)
- 1 - Paelya - wikt:paelyas - plural of paelya. paelya is an alternate name for wikt:paella – Normal Name (talk) 22:54, 24 June 2021 (UTC)
- 2 - Palaeopsychops - wikt:nygmata, wikt:trichorsors - nygmata is the plural of wikt:nygma, neither are in Wiktionary yet. trichosors is plural and synonymous with seta/setae (zoological term) – Normal Name (talk) 02:46, 25 June 2021 (UTC)
- 1 - Palatalization (sound change) - wikt:deaffricated - past tense of wikt:deaffrication – Normal Name (talk) 02:46, 25 June 2021 (UTC)
- 1 - Palazzo Sciarra - wikt:prudity - quality or state of being prudish/a prude – Normal Name (talk) 02:46, 25 June 2021 (UTC)
- 1 - Paleobiota of the Posidonia Shale - wikt:barytized - geological term, in addition to wikt:barytization – Normal Name (talk) 02:46, 25 June 2021 (UTC)
- 2 - Panoramic photography - wikt:panograph, wikt:panoramists - panograph: a single image constructed from overlapping photographs; panoramists: plural of wikt:panoramist, a photographer who uses the technique of panoramic photography (Merriam-Webster[1] also gives an alternate definition as "one who paints panoramas") – Normal Name (talk) 00:49, 26 June 2021 (UTC)
- 1 - Pastisset - wikt:anissette - also known as anis from wikt:anise, "anise-flavored liqueur that is consumed in most Mediterranean countries" – Normal Name (talk) 23:22, 28 June 2021 (UTC)
- 1 - Penders - wikt:sisalation - foil insulation paper used on roofing – Normal Name (talk) 05:50, 29 June 2021 (UTC)
- 1 - Paubha - wikt:paubhas - plural of wikt:paubha, a traditional religious painting made by the Newar people of Nepal – Normal Name (talk) 05:50, 29 June 2021 (UTC)
- 1 - Peering - wikt:depeer - Negation of peer – Normal Name (talk) 20:19, 29 June 2021 (UTC)
- 1 - Petasis reaction - wikt:racemically - 'racemically synthesized', to be formed via racemization – Normal Name (talk) 05:14, 30 June 2021 (UTC)
Q
- 2 - Quantum cloning - wikt:teleclone: seems like a legit term. Ira Leviton (talk) 18:59, 28 May 2020 (UTC) \\ telecloning is in wikt, and has red links to teleclone. -- Xurizuri
- 1 - Quidditch (sport) - wikt:bludgering - verb form of 'bludger' Strickesel (talk)
- 1 - Qutbuddin Mubarak Shah - wikt:wizarat - might be correct Strickesel (talk)
- Quantum dot - wikt:antiflection
- 19 - Qanat - wikt:konait, wikt:ghundat, wikt:kahrezes, wikt:kahrezes, wikt:kahrezes, wikt:kahrizes, wikt:kahrizes, wikt:kahrizes, wikt:kahrizes, wikt:kahrizes, wikt:kahrizes, wikt:kahrizes, wikt:kahrizes, wikt:kahrizes, wikt:kahrizes, wikt:issadar, wikt:issadar, wikt:hangams, wikt:hangams - alternate spellings, alternate names for this and the last two words seem to be correct terms Strickesel (talk)
Y
- 1 - Yacambú National Park - wikt:caramerudo - this is the common name used in Venezuela for Odocoileus virginianus deer (see here) - I'm not sure what to do with it. DferDaisy (talk) 15:55, 4 August 2018 (UTC)
- I think it just needs an entry in Wiktionary, then. -- Beland (talk) 19:34, 20 September 2018 (UTC)
By word
- 90 - wikt:bellcast - Acadia National Park carriage paths, bridges and gatehouses, Administration Building, Missouri State Fruit Experiment Station, Asbury United Methodist Church (Knoxville, Tennessee), Avondale, Parramatta, Benjamin Franklin Prescott House ... find all
- 83 - wikt:buyrate - Backlash (2003), Badd Blood: In Your House, Beach Brawl, Campbell McLaren, Chuck Liddell vs. Tito Ortiz ... find all
- 72 - wikt:interbrand - 2019 WWE Draft, Akam (wrestler), Alexa Bliss, Becky Lynch, Bragging Rights (2009) ... find all
- 52 - wikt:headstroke - Ba (Indic), Bengali language, Bha (Indic), Ca (Indic), Da (Indic) ... find all
- 50 - wikt:cornerboards - Abraham Hall, Babson-Alling House, Bangor Elevator, Benjamin Franklin Prescott House, Benoit Apartments ... find all
- 48 - wikt:cellspot - Apamea anceps, Apamea furva, Apamea oblonga, Apamea ophiogramma, Apamea remissa ... find all
- 44 - wikt:lagums - BIP Brewery, Building of the Patriarchate, Belgrade, Gardoš, Gardoš Tower, House at 10 Cara Dušana Street ... find all
- 39 - wikt:flushboarding - Binks Hess House and Barn, Burt Henry Covered Bridge, Call-Bartlett House, Capt. William McGilvery House, Casey House (Mountain Home, Arkansas) ... find all
- 21 - wikt:headstrokes - Bha (Indic), Ca (Indic), Da (Indic), Dha (Indic), Ga (Indic) ... find all
- 21 - wikt:divebombers - 105th Light Anti-Aircraft Regiment, Royal Artillery, 1st Flintshire Rifle Volunteers, 32nd Light Anti-Aircraft Regiment, Royal Artillery, 61st Light Anti-Aircraft Regiment, Royal Artillery, Battle of Fort Eben-Emael ... find all
- 20 - wikt:jetpipe - Airbreathing jet engine, Bristol Proteus, CFE CFE738, Components of jet engines, De Havilland Sea Venom ... find all
- 20 - wikt:droneless - Bosnia and Herzegovina, Bousine, Great Highland bagpipe, Greek bagpipes, Gudastviri ... find all
- 19 - wikt:drillhead - History of ice drilling, Ice core, Ice drilling ... find all
- 17 - wikt:ingassing - Circumstellar habitable zone, Decompression equipment, Decompression illness, Decompression practice, History of decompression research and development ... find all
- 17 - wikt:handcut - Archie Granot, Architecture of Bermuda, Buffalo Grove Lime Kiln, Covenant First Presbyterian Church, Culture of Mysore ... find all
- 17 - wikt:cycletracks - Baltimore, Bikeway and legislation, Frontage road, Sean Chu ... find all
- 17 - wikt:cyclepaths - A2 road (England), Benelux, Bike lane, De Marne, Doncaster iPort ... find all
- 17 - wikt:corniceline - Albert Palmer House, Belcrest Apartments (Detroit, Michigan), Benjamin Williams House, Christian-Ellis House, Daniel Gould House ... find all
- wikt:degradated - Southern American English
- 9 - wikt:lavwa - Bélé, Chanté mas, Chouval bwa, Music of Dominica, Music of Martinique ... find all →means "voice" in several; Caribbean creoles.
- wikt:vlně - Czeck (wool)
- wikt:vlne - Slovak
- According to User:Palmyrah on Template talk:Which lang#Patna, used in Horton Plains National Park:
- wikt:patna, (Sri Lankan) English, noun: a plain or, more usually, a hillside covered with patna grass
- wikt:patna grass - a particular kind of grass
- Possible etymology: a similar kind of grass grows in Patna, India, and brooms made from it are used all over India
- wikt:patana, Sinhalese, noun: patna
- Possible etymology: English patna
- patna and patana are both in wikt, but not with these meanings
- wikt:subdeletion - Comparative#Comparative subdeletion, [2]
- 28 (down from 45) - wikt:groundcolour - Anaxyrina cyanopa, Asura euprepioides, Carposina maritima, Dioryctria caesirufella, Euchromius aris ... find all
- In books, it seems like this only occurs as a typo for "ground-colour" (e.g. in books that also use that hyphenated form). -sche (talk) 07:32, 29 March 2020 (UTC)
- 32 (down from 39) - wikt:āgamas - Anekantavada, Antakrddaasah, Anuttaraupapātikadaśāh, Aupapatika, Bhairava ... find all - see agama. It is normal to add s to pluralize many Sanscrit words. Johnbod (talk) 12:22, 29 January 2020 (UTC)
- 26 - wikt:zeitlose - Count of St. Germain, Martin Werhand, Martin Werhand Verlag, St. Germain (Theosophy) ... find all
- This is German, and should be language-tagged as such. -sche (talk) 23:15, 29 May 2019 (UTC)
- Tagged, should presumably be added to Wiktionary? -- Beland (talk) 00:43, 31 March 2021 (UTC)
- This is German, and should be language-tagged as such. -sche (talk) 23:15, 29 May 2019 (UTC)
Most common non-English, missing from English Wiktionary
These words are commonly found in English Wikipedia, are present in a non-English Wiktionary, but are missing from English Wiktionary. Word counts are from English Wikipedia. This is a special report from the 2019-08-20 dump.
- 106 - wikt:стрелковая - 109th Rifle Division (Soviet Union), 10th Guards Motor Rifle Division, 114th Rifle Division (Soviet Union), 121st Guards Rifle Division, 137th Rifle Division (Soviet Union) ... find all
- 101 - wikt:īn - Ab Barik-e Sofla, Kermanshah, Ab Garmak-e Sofla, Khuzestan, Abbasabad-e Moin, Abd al-Mu'in ibn Musa'id, Alhashem-e Sofla ... find all
- 80 - wikt:ει - Ancient Greek verbs, Attic Greek, Axiotta, Celtic deities, Cernunnos ... find all
- 67 - wikt:pasangan - Ba (Javanese), Ca (Javanese), Da (Javanese), Dha (Javanese), Ga (Javanese) ... find all
- 52 - wikt:književnosti - August Kovačec, Bogdan Popović, Borisav Stanković, Božidar Petranović, Bratoljub Klaić ... find all
- 48 - wikt:jazyka - 1818 in literature, Aleš Klégr, Andrey Korolev, Andrey Zaliznyak, Bohemian ... find all
- 46 - wikt:изд - Albena Stambolova, Boris Koyalovich, Church of St Demetrius, Boboshevo, Demyanka River, Fedor Kapelyush ... find all
- 42 - wikt:гвардейская - 100th Guards Rifle Division, 104th Guards Airborne Division, 10th Guards Motor Rifle Division, 10th Guards Uralsko-Lvovskaya Tank Division, 121st Guards Rifle Division ... find all
- 40 - wikt:eskadrila - 122nd Hydroplane Liaison Squadron, 461st Light Combat Aviation Squadron, 462nd Light Combat Aviation Squadron, 463rd Light Combat Aviation Squadron, 464th Light Combat Aviation Squadron ... find all
- 38 - wikt:tunjos - Aguazuque, Bacatá, Eastern Hills, Bogotá, El Dorado, Epítome de la conquista del Nuevo Reino de Granada ... find all
- 38 - wikt:serambi - Al-Wustho Mangkunegaran Mosque, Grand Mosque of Bandung, Great Mosque of Banten, Great Mosque of Malang, Great Mosque of Surakarta ... find all
- 36 - wikt:ουκ - Codex Vaticanus 2061, Lectionary 12, Lectionary 225, Lectionary 240, Matthew 27:6 ... find all
- 31 - wikt:ombiasy - Andrianampoinimerina, Antambahoaka, Antemoro people, Bara people, Culture of Madagascar ... find all
- 31 - wikt:kozane - Dō (armour), Japanese armour, Kimura Shigenari, Lamellar armour, Scale armour ... find all
Mineral words
Several pages with lists of minerals are showing up as some of the pages with the most detected typos. Below is a list of words from these pages. I'm pretty sure some of them are misspelled, so they all require verification. I don't see anything in wikt:Wiktionary:CFI that would exclude these names; some but not all of them are IUPAC systematic. We could also add Wikipedia stubs or redirects as needed if Wiktionary doesn't want them. -- Beland (talk) 15:36, 30 May 2019 (UTC)
- @Beland: Wiktionary does want them. We just haven't gotten around to them, as there are tens of thousands of terms along these lines. BD2412 T 01:04, 29 March 2021 (UTC)
- Problem is the extremely rare ones, or terms only ever used on Wikipedia. These are unwanted on Wiktionary. There are an infinite number of possible chemical names, so there are some criteria for inclusion. Graeme Bartlett (talk) 05:18, 9 April 2021 (UTC)
- wikt:aluminodecaoxotrisilicate
- wikt:aluminodecaoxytetrasilicate
- wikt:aluminodisilicate
- wikt:aluminohexaoxodisilicate
- wikt:aluminohexaoxosilicate
- wikt:aluminotetraoxosilicate
- wikt:aluminotrisilicate
- wikt:alumotrisilicate
- wikt:berylloalumotrisilicate
- wikt:chloro-potassichastingsite
- wikt:decaoxodihydroxy
- wikt:decaoxydihydroxy
- wikt:dialuminiosilicate
- wikt:dialuminoctaoxodisilicate
- wikt:dialuminodecaoxodisilicate
- wikt:dialuminodisilicate
- wikt:dialuminohexasilicate
- wikt:dialuminopentaoxosilicate
- wikt:dialuminotrisilicate
- wikt:dialumodisilicate
- wikt:diboro
- wikt:dihydroxoarsenate
- wikt:dihydroxophosphate
- wikt:dihydroxotellurate
- wikt:dioxoarsenate
- wikt:dioxoborate
- wikt:dioxochloride
- wikt:dioxodiarsenate
- wikt:dioxodichloride
- wikt:dioxodifluorine
- wikt:dioxodiphosphate
- wikt:dioxodiselenite
- wikt:dioxofluorine
- wikt:dioxohydroxy
- wikt:dioxophosphate
- wikt:dioxoselenite
- wikt:dioxosilicate
- wikt:dioxosulfate
- wikt:dioxotetrasulfate
- wikt:dioxotriarsenate
- wikt:dioxotriphosphate
- wikt:dioxydecahydroxy
- wikt:dioxydifluorine
- wikt:dioxydihydroxy
- wikt:dioxydodecahydroxy
- wikt:dioxyhydroxy
- wikt:diREE
- wikt:disulfa
- wikt:disulfarsenide
- wikt:ditetraoxosilicate
- wikt:docosahydroxy
- wikt:docosaoxide
- wikt:docosaoxotetrasilicate
- wikt:dodecahydroxy
- wikt:dodecaoxotrisilicate
- wikt:dodecaoxychloride
- wikt:dodecaoxytetrasilicate
- wikt:fluoro-potassichastingsite
- wikt:fluoro-potassicrichterite
- wikt:henicosahydrate
- wikt:heptaicosahydrate
- wikt:heptaicosaoxodisilicate
- wikt:heptaoxodivanadate
- wikt:heptaoxopentaborate
- wikt:heptaoxosilicate
- wikt:heptasilicon
- wikt:heptasulfadiarsenide
- wikt:heptawater
- wikt:hexaaluminotetraicosaoxohexasilicate
- wikt:hexacontaoxide
- wikt:hexahydrogen
- wikt:hexahydroxide
- wikt:hexaicosahydroxy
- wikt:hexaoxodiborate
- wikt:hexaoxodisilicate
- wikt:hexaoxopentaborate
- wikt:hexaoxtellurate
- wikt:hexaoxydihydroxy
- wikt:hexasulfa
- wikt:hexatricontahydrate
- wikt:hydrodioxoarsenate
- wikt:hydroheptaoxide
- wikt:hydrohexaoxodisilicate
- wikt:hydrophosphate
- wikt:hydrotrioxosilicate
- wikt:hydroxoarsenate
- wikt:hydroxophosphate
- wikt:hydroxyarsenate
- wikt:hydroxyhexaoxodisilicate
- wikt:hydroxypentaoxide
- wikt:hydroxytriborate
- wikt:hydroxytridecaoxodisilicate
- wikt:hydroxytrioxosilicate
- wikt:icosahydrate
- wikt:icosalead
- wikt:icosaoxide
- wikt:icosaoxo
- wikt:icosaoxooctasilicate
- wikt:icosaoxopentasilicate
- wikt:nonadecaoxoctasilicate
- wikt:nonaoxodiarsenate
- wikt:nonaoxodiborate
- wikt:nonaoxodivanadate
- wikt:nonaoxohexaborate
- wikt:nonaoxopentaborate
- wikt:nonaoxosilicate
- wikt:nonaoxotetravanadate
- wikt:nonaoxotrisilicate
- wikt:nonaoxyhydroxytetrasilicate
- wikt:octadecaoxide
- wikt:octadecaoxoheptasilicate
- wikt:octadecaoxohexasilicate
- wikt:octadecaoxopentasilicate
- wikt:octaoxodiborodisilicate
- wikt:octaoxoicosahydroxy
- wikt:octaoxopentaborate
- wikt:octaoxotetraborate
- wikt:octaoxotetrasilicate
- wikt:octaoxotrisilicate
- wikt:octaoxotritellurate
- wikt:octaoxydihydroxy
- wikt:octasulfa
- wikt:octasulfadiantimonide
- wikt:octatelluride
- wikt:octatriacontaoxide
- wikt:octauranyl
- wikt:oxoarsenate
- wikt:oxocarbonate
- wikt:oxochromate
- wikt:oxodecachloride
- wikt:oxodiarsenate
- wikt:oxodiborate
- wikt:oxodiphosphate
- wikt:oxodisulfate
- wikt:oxodisulfide
- wikt:oxohydrophosphate
- wikt:oxosulfate
- wikt:oxotrisulfate
- wikt:oxydihydroxy
- wikt:oxydinitride
- wikt:oxyhydroxy
- wikt:oxyphosphate
- wikt:oxytrivanadate
- wikt:pentadecaoxohexasilicate
- wikt:pentaicosahydro
- wikt:pentaicosamanganese
- wikt:pentaoxodiarsenate
- wikt:pentaoxodiborate
- wikt:pentaoxodisilicate
- wikt:pentaoxotellurate
- wikt:pentaoxotetraborate
- wikt:pentaoxotrivanadate
- wikt:pentaoxoundecaborate
- wikt:pentasulfa
- wikt:pentatetracontaoxooctadecasilicate
- wikt:polytypoids
- wikt:potassic-aluminosadanagaite
- wikt:potassic-aluminotaramite
- wikt:potassicarfvedsonite
- wikt:potassic-chloropargasite
- wikt:potassic-ferrisadanagaite
- wikt:Potassic-jeanlouisite
- wikt:potassium-fluorrichterite
- wikt:protoferro-anthophyllite
- wikt:proto-ferro-suenoite
- wikt:stannotrisilicate
- wikt:stewardite
- wikt:sulfantimonide
- wikt:surkhobite
- wikt:tengerite
- wikt:tetrabismuthide
- wikt:tetradecaborate
- wikt:tetradecalead
- wikt:tetradecaoxopentasilicate
- wikt:tetrahydroxoarsenate
- wikt:tetraicosaoxide
- wikt:tetraicosaoxodecasilicate
- wikt:tetraicosaoxotrisilicate
- wikt:tetraluminotetrasilicate
- wikt:tetraoxoarsenate
- wikt:tetraoxoborate
- wikt:tetraoxodichloride
- wikt:tetraoxodiphosphate
- wikt:tetraoxodisulfate
- wikt:tetraoxogermanate
- wikt:tetraoxomolybdate
- wikt:tetraoxoselenate
- wikt:tetraoxosulfate
- wikt:tetraoxotellurate
- wikt:tetraoxotetraphosphate
- wikt:tetraoxovanadate
- wikt:tetraoxozincate
- wikt:tetraoxy
- wikt:tetraoxysilicate
- wikt:tetraoxytetrabismuth
- wikt:tetraoxytriborate
- wikt:tetraselenite
- wikt:tetrastannide
- wikt:tetrasulfa
- wikt:tetrawater
- wikt:triacontaoxide
- wikt:triacontaoxoctasilicate
- wikt:triacontaoxydodecasilicate
- wikt:trialuminotrisilicate
- wikt:triberylohexasilicate
- wikt:triborododecasilicate
- wikt:tridecaoxoditellurate
- wikt:tridecaoxoheptaborate
- wikt:tridecasulfa
- wikt:trihydronium
- wikt:triicosaoxotetrasilicate
- wikt:trilithiododecasilicate
- wikt:trioxoarsenate
- wikt:trioxoborate
- wikt:trioxosilicate
- wikt:trioxotellurate
- wikt:trioxotriborate
- wikt:triREE
- wikt:trisulfa
- wikt:triwater
- wikt:undecaoxoheptasilicate
- wikt:undecaoxohexahydrohexaborate
- wikt:undecaoxotetrasilicate
- wikt:undecaoxotitanotetrasilicate
- wikt:zircono
Incorrect or rare mineral words
- wikt:aluminoctaoxotrisilicate try aluminum trisilicate octaoxide
- wikt:docosatantalum → valid but too rare for wiktionary
- wikt:hexaoxy → needs changing
- wikt:heptatelluride → too rare for wiktionary but appears valid
- wikt:undecalead → too rare for Wiktionary
Needs Wikipedia article instead?
- 2 - Anacron - wikt:cronie, wikt:cronie
- 2 - Carpathian Large Carnivore Project - wikt:cntours, wikt:cntours - redlinked company in Romania needs an article.
- 1 - Club Penguin (franchise) - wikt:puffles: plural of a type of character in an online game.
- 33 hexipentisteriruncicantitruncated - a nest of specialized geometrical form names; what to do? → since this is a part of several compound names, it may need a set index or disambig page. If it has use in books, it could go in Wiktionary, but Wikipedia seems to be the source of these geometric terms.
- the articles it's currently (Jan 2021) used in are: Uniform 9-polytope, A8 polytope, Uniform 8-polytope, B8 polytope, Hexicated 7-cubes, Hexicated 7-simplexes. There's also Hexipentisteriruncicantic 7-cube which redirects to a section in (and is a form of) Hexic 7-cubes which is a "convex uniform 7-polytope". Hexipentisteriruncicantic itself has a few pages that it's in: Uniform 8-polytope which is in the other list, D7 polytope, Uniform 7-polytope. And let me just say, what the hell does any of this mean. So hopefully, any of those help with figuring out what to do. --Xurizuri (talk) 12:07, 21 January 2021 (UTC)
- wikt:boxel and wikt:boxels - used and cursorily defined on 2.5D (visual perception). -- Beland (talk) 00:05, 9 April 2021 (UTC)
Possible typos by length
Longest or shortest in certain categories are shown, sometimes just for fun and sometimes because they form a useful group. Please use strikethrough (or leave a note) for this section rather than removing lines, to avoid repeating work done while the dumps were being processed. Thanks!
Likely chemistry words
(updated from 2021-07-20 dump)
These need to be checked by a chemist and marked as {{not a typo}}.
- 84 - wikt:trans-2-hydroxyisoxypropyl-3-hydroxy-7-isopentene-2,3-dihydrobenzofuran-5-carboxylic - Cāng zhú
- 84 - wikt:ethyl-8-azido-5,6-dihydro-5-methyl-6-oxo-4h-imidazo-1,4-benzodiazepine-3-carboxylate - Ro15-4513
- 79 - wikt:d-1,2,3,9,10,10a-hexahydro-6-methoxy-11-methyl-4h-10,4a-iminoethano-phenanthren - Controlled Drugs and Substances Act
- 73 - wikt:d-1,2,3,9,10,10a-hexahydro-11-methyl-4h-10,4a-iminoethanophenanthren-6-ol - Controlled Drugs and Substances Act
- 72 - wikt:l-11-allyl-1,2,3,9,10,10a-hexahydro-4h-10,4a-iminoethanophenanthren-6-ol - Controlled Drugs and Substances Act
- 70 - wikt:n-2'-hydroxyoctadecanoyl-2-amino-9-methyl-4,8-heptade-cadiene-1,3-diol - Ramaria botrytis
- 69 - wikt:n-methyl-l-alanyl-l-leucyl-n-methyl-trans-dehydrophenyl-alanyl-glycyl - Tetrapeptide
- 65 - wikt:dl-1,2-anhydro-4,5-o-cyclohexylidene-1,2,3/4,5-cyclopentanepentol - 1,2,3,4,5-Cyclopentanepentol
- 63 - wikt:uridine-5'-diphospho-n-acetyl-2-amino-2-deoxy-3-o-lactylglucose - UDP-N-acetylmuramate dehydrogenase
- 59 - wikt:octachloro-3a,4,7,7a-tetrahydro-4,7-methanoindene-1,8-dione - Hexachlorocyclopentadiene
- 59 - wikt:dihydroxy-21-oxa-21-chloromethylpregna-1,4-diene-3,20-dione - List of corticosteroids
- 59 - wikt:cis-5,6-dihydroxy-4-isopropylcyclohexa-1,3-dienecarboxylate - 2,3-dihydroxy-2,3-dihydro-p-cumate dehydrogenase
- 58 - wikt:decahydro-10-methoxy-3,6,9-trimethyl-3,12-epoxy-12h-pyrano - Artemether
- 58 - wikt:anti-7β,8α-dihydroxy-9α,10α-epoxy-7,8,9,10-tetrahydrobenzo - Benzo(j)fluoranthene
- 56 - wikt:hydroxy-17α,21-dimethyl-19-norpregna-4,9-dien-3,20-dione - Trimegestone
- 54 - wikt:s-adenosyl-l-methionine:3-hexaprenyl-4,5-dihydroxylate - Hexaprenyldihydroxybenzoate methyltransferase
- 54 - wikt:cis-11,12-dichloro-9,10-dihydro-9,10-ethano-2-anthroic - Field effect (chemistry)
- 52 - wikt:hexahydro-1,3-dimethyl-4-phenylazepine-4-carboxylate - Controlled Drugs and Substances Act
- 52 - wikt:hexahydro-1,2-dimethyl-4-phenylazepine-4-carboxylate - Controlled Drugs and Substances Act
- 52 - wikt:cis-2-methyl-4-trimethylammoniummethyl-1,3-dioxolane - Dioxolane
Chemical formulas
Chemical formulas should be written with HTML subscripts or {{chem2}}.
Chemical formulas that use Unicode subscripts (which is against MOS:SUBSCRIPT) will be detected automatically by moss_entity_check.py.
Chemical formulas that use <sub>...</sub>
are allowed by MOS:CHEM, but may show up in the main typo listings above. They can be converted to use {{chem2}} to be accepted by the spell checker.
Articles with a large number of chemical formulas triggering the spell checker are listed here (updated from 2020-03-20 dump):
- 851 - Classification of non-silicate minerals - wikt:Nb,Ta, wikt:Fe,Ni, wikt:Fe,Ni, wikt:Ni,Fe, wikt:Fe,Ni, wikt:Ni,Fe, wikt:Au,Ag, wikt:Fe,Ni, wikt:Fe,Ni, wikt:Fe,Ni, wikt:Ir,Os, wikt:Ru,Pt, wikt:Rh,Pt, wikt:Pd,Pt, wikt:Os,Ir, wikt:Ru,Ir, wikt:Ir,Os, wikt:Fe,Os, wikt:Ru,Ir, wikt:Mo,Ru, wikt:Fe,Ir, wikt:Ni,Fe, wikt:Pt,Pd, wikt:Fe,Cu, wikt:Pt,Pd, wikt:Pd,Pt, wikt:Bi,Pb, wikt:Fe,Ni...
- 272 - Classification of silicate minerals - wikt:Al,Fe3, wikt:Al,Fe3, wikt:Na,Ce, wikt:Ca,Ce, wikt:OH,Cl, wikt:OH,H2O, wikt:Fe,Mn, wikt:OH,H2O, wikt:Mn,Sr, wikt:Na,Ca, wikt:Na,H3O, wikt:Ca,Mn...
- Several "List of minerals" articles currently posted at Wikipedia:Typo Team/moss/L
- 90 - Nickel compounds - wikt:hexaaquanickel(II), wikt:Ni(H, wikt:Ni(NH, wikt:Nickel(IV), wikt:Ni(N, wikt:Ni(BF, wikt:Ni(AsF, wikt:Ni(SbF, wikt:Ni(BiF, wikt:Ni(AsF, wikt:Ni(SbF, wikt:Ni(ICl, wikt:Ni(N, wikt:Ni(N, wikt:Ni(NH...
- 80 - Silicate mineral - wikt:Hf,Zr, wikt:Mg,Fe, wikt:Fe,Mg, wikt:Ca,Fe, wikt:Al,Fe, wikt:Ce,La, wikt:FeII,FeIII, wikt:Mg,Fe, wikt:Fe,Mn, wikt:Na,Ca, wikt:Al,Li, wikt:Al,Fe, wikt:Fe,Mg, wikt:Al,Fe, wikt:Si,Al, wikt:Mg,Fe...
- 67 - N-heterocyclic silylene - [[wikt:tBuN]Si]], [[wikt:tBuN]Si]], [[wikt:tBuN]Si]]... wikt:Ru(MeCN), wikt:RuCl(MeCN), wikt:NHSi)OTf...
- 50 - Phosphor - wikt:Ca,Sr, wikt:Cu,Mg, wikt:Cu,Al, wikt:Zn,Cd...
- 33 - Silicon - wikt:Mg,Fe, wikt:Mg,Fe, wikt:SiH(OMe), wikt:Si(OMe)...
- 31 - Pyroxene - wikt:Si,Al, wikt:Mg,Fe, wikt:Mg,Fe, wikt:Mg,Fe...
- 28 - Thorium - wikt:OH,Cl, wikt:Ca,Fe, wikt:ThO(OH, wikt:Cl)H...
I'm refining the below report to be more useful; will probably be updated in a few days. -- Beland (talk) 03:17, 1 April 2021 (UTC)
Chemical formulas that don't use subscripts (which is incorrect notation) are listed below. These are found with:
grep -P ' ((H|He|Li|Be|B|C|N|O|F|Ne|Na|Mg|Al|Si|P|S|Cl|Ar|K|Ca|Sc|Ti|V|Cr|Mn|Fe|Co|Ni|Cu|Zn|Ga|Ge|As|Se|Br|Kr|Rb|Sr|Y|Zr|Nb|Mo|Tc|Ru|Rh|Pd|Ag|Cd|In|Sn|Sb|Te|I|Xe|Cs|Ba|La|Ce|Pr|Nd|Pm|Sm|Eu|Gd|Tb|Dy|Ho|Er|Tm|Yb|Lu|Hf|Ta|W|Re|Os|Ir|Pt|Au|Hg|Tl|Pb|Bi|Po|At|Rn|Fr|Ra|Ac|Th|Pa|U|Np|Pu|Am|Cm|Bk|Cf|Es|Fm|Md|No|Lr|Rf|Db|Sg|Bh|Hs|Mt|Ds|Rg|Cn|Nh|Fl|Mc|Lv|Ts|Og|R)([2-9]|[1-9][0-9]))+$' debug-spellcheck-ignored.txt | head -100
These should be converted to use {{chem2}}. If there is a suitable target, it would also be nice to create a redirect that is tagged with {{R from molecular formula}}. (A redirect is also a good way to clear sequences that aren't actually chemical formulas.)
TODO: Search for incorrect instances of chemical formulas listed in Category:Molecular formulas.
(Updated from 2021-01-01 dump.)
- 78/28 - C6F5
- 76/50 - Nb6 → Chess notation
74/35 - H3K4- 71/44 - Nh5 → Chess notation
- 68/38 - Rf8 → Chess notation
- 52/25 - Nb3 → Chess notation
- 46/29 - Nb5 → Chess notation
- 44/15 - Fe4S4
- 36/20 - Rh8
36/13 - Si8O22- 30/11 - Si2O6
- 27/13 - Y32
- 25/15 - Y30
- 22/12 - Y33
- 21/19 - Si3O9
- 21/17 - Rg5 → chess notation
- 19/10 - Cu6
- 18/7 - Y34
17/7 - H4K16- 17/17 - K52
17/17 - F6F6F6- 17/17 - C5H7O2
- 17/16 - V82
- 16/7 - Si6Al2
- 16/1 - Ti3C2
- 16/16 - No8
- 15/9 - Si6O18
15/7 - H3K79- 15/6 - Y36 → bus routes, page number, database reference, mutation
- 15/15 - Si9O27
- 14/9 - V31
- 13/9 - V51
- 13/8 - Y13
- 13/8 - W61
- 13/6 - Y93
13/4 - Ga2S3- 13/13 - K65
13/12 - C6H1113/11 - H4K20- 12/7 - V69
12/4 - Al2Br612/2 - H3K1412/10 - Al2Cl6- 11/9 - K54
- 11/7 - Mn5
11/5 - B6N2→ Japanese bomber type- 11/1 - V3I2
- 11/10 - Y95
- 10/9 - Si4O11
- 10/7 - Na7
10/7 - H4K12- 10/7 - C8H17
- 10/6 - Th9
- 10/6 - Pr6O11
10/5 - Rb9O210/5 - H4K510/5 - Bi2Se310/4 - In2S3- 10/10 - R88
9/8 - W439/8 - V58- 9/8 - V53
9/8 - O2C5H7- 9/8 - C2R2
- 9/8 - Al2Si2O5
- 9/6 - Mg3Al2
9/6 - C2B10H12- 9/4 - Fe7C3
9/3 - Tb4O7- 9/1 - V2I3 → stands for volume 2 issue 3; this is a pattern found in dois
- 9/1 - Si25O73
- 8/8 - V79
- 8/8 - K49
- 8/8 - Cu5
- 8/7 - V76
- 8/6 - H56
8/6 - Dy2Ti2O7- 8/5 - V67
- 8/3 - Si2H2
- 8/3 - C6R6
- 8/2 - Al2Si4O12
- 8/1 - V3R5
- 7/7 - Si4O10 → only parts of formula -- false positive?
- 7/7 - Pb4
- 7/7 - O35
- 7/6 - Na8
7/6 - K2Mg2- 7/6 - In20
- 7/6 - Bh8
- 7/5 - W93
- 7/5 - R94
7/5 - Cu31S167/4 - Cs11O37/3 - Sn6O47/3 - S50B30→ BMW engine7/3 - Li6→mostly isotope of lithium, but LI6 something to do with Buick engines7/3 - H2W12O42- 7/3 - Dy4 → Dy4 Systems Inc. a Canadian company probably notable based on how many times it is mentioned; DY4 also used for designations of minor planets
7/3 - Cu6Sn5
Repeating patterns
For rhyme schemes, they probably need to be re-styled to follow Wikipedia:WikiProject Poetry#Style for rhyme schemes. If this ends up making them all-caps, they won't show up here on the next run. For mixed-case rhyme scheme notations, use {{not a typo}} after making sure dashes, commas, and spaces follow the recommended style.
From 2021-07-20 dump:
- all done
For Beland todo
- Rhyme scheme hunting:
- Sync style for articles in Category:Stanzaic form and Category:Rhyme and add to rhyme scheme list if appropriate.
- Sync annotation style for articles that mark up poems line-by-line (use tables, not column divs or parens)
- Manually search for patterns like:
- a-b-a-b-a-b-c-c
- AB,CD,AB (internal rhyme)
- "aa", "ab", "aaa", "aab", "aba", "abb", "abc", "aaaa", "aaba", "aabb", "aabc", "abaa", "abab", "abba", "abca", "abcb", "abcc", "abcd" - probable rhyme sequences where there's an article present so it's not detected as a misspelling
False positives
Is there a word that is correctly used in an article, but which shouldn't be added to Wiktionary? List it here, and Beland will fix the problem.
Archived solutions: Wikipedia:Typo Team/moss/Archive
- wikt:singer(s), wikt:composer(s), etc. Found in Kanto (music).
False negatives
Is there a misspelled word in an article mentioned here that was not reported? Feel free to list it below and Beland will try to improve the code if appropriate.
These are currently over-ignored, but could be used to suggest correct spellings:
- Wikipedia articles with {{R from misspelling}}, {{R from incorrect name}}, {{R from miscapitalisation}}, and redirects to these templates
- Wiktionary entries that are known misspellings (e.g. wikt:anticiliary)
- In cases where there are variant spellings of the same word or phrase, Wikipedia should probably pick one and stick to it except to mention the variants. This happens with:
- Compound words - whether to use a space, dash, or nothing, as in "junebug" vs. "june bug" or "email" vs. "e-mail".
- Words with multiple transliterations from another language (often there are multiple systems, no particular system, or a modern system different from historical systems).
- Redirects with {{R from alternate spelling}} and redirects to that template.
- Article Ana Recio Harvey | detected misspelling: appoinment | additional, undetected misspelling: enterpreneur
- Looks like this was because of redirects with "enterpreneur" in the title. I have tagged them all {{R from misspelling}}, but I'll have to change the code to ignore those, as noted above. Thanks for catching that! -- Beland (talk) 23:52, 18 October 2018 (UTC)
- 1 - Jack Beckitt - wikt:monacled -> also had "whow" in place of who --Xurizuri (talk) 05:10, 5 February 2021 (UTC)
- 1 - Jack Jenkins (rugby player) - wikt:scummage -> "forst" instead of "first" --Xurizuri (talk) 05:43, 5 February 2021 (UTC)
- 1 - Johan Christian Drewsen - wikt:cultication -> "Rogether" instead of "Together", at the start of a sentence. "Copenahgen" instead of "Copenhagen". They obviously didn't get picked up because of capitalisation, but thought I'd list them here anyway just in case it helps. -Xurizuri (talk) 11:09, 13 February 2021 (UTC)
Archived notes
See Wikipedia:Typo Team/moss/Archive.
Mismatched markup and punctuation
Errors in punctuation (mostly quotation marks) and wiki markup generally cause confusion for readers, and also prevent the spell checker from running on these articles.
Inches and feet should not use " and ', per Wikipedia:Manual of Style/Dates and numbers#Specific units; use letters instead. (See MOS:UNITS for general guidance.) Where conversions are needed, use {{convert}}, for example: 2 feet 3 inches (69 cm)
WORK IN PROGRESS
- Integrating these with main listings
- Filter only unmatched " for now
- Filter articles with non-ASCII quote marks to a separate list for JWB processing
- Filter \d" and \d' to a separate sublist for inch/feet style conversion
- Explain ✂ or skip snippets showing this
- Bracketbot web UI seems to be down
-- Beland (talk) 19:03, 4 September 2019 (UTC)
Gender-neutral language
Manned
The word "manned" and related forms like "unmanned" are used in many articles, but is not gender-neutral as required by MOS:S/HE and the NASA style guide. Gender-neutral alternatives include:
- Crewed, uncrewed
- Staffed, unstaffed
- Human spaceflight
- Defended
Not all instances need to be changed.
- Proper nouns should remain the same, like Manned Orbiting Laboratory
- Titles of sources and quotes should remain unchanged.
- If the term itself is being discussed, for example to say that "manned spaceflight" is another way of saying human spaceflight.
- There seems to be consensus on unmanned aerial vehicle that this and related phrases (like unmanned aerial system) should remain intact, since it is much more frequent than "uncrewed aerial vehicle" at the moment. However, when using Wikipedia's voice it is preferred to describe a UAV as "uncrewed" when not using the whole phrase.
- Non-article pages that are retained for historical interest shouldn't be modified if they won't be visible to readers.
- Redirects with this title should be left alone if they are redirecting readers to a gender-neutral title
If the word is found the names of articles and categories (except those with names directly related to UAVs), those should be renamed, and the links changed. Many articles have already been renamed, and the links just need to be updated. (Remember that to rename a category, all the articles in that category must be edited to change their pointers.)
- Coming soon: moss report on "manned" that ignores references, page titles, proper nouns, and consensus-OK phrases.
- Find all instances of "manned" in articles
- Find all instances of "unmanned" in articles
- Find all instances of "manned" in Wikipedia:, File:, Category:, and Portal: (recommended for advanced editors only)
- Find all instances of "unmanned" in Wikipedia:, File:, Category:, and Portal: (recommended for advanced editors only)
Borderline cases
These may need to be discussed before being potentially renamed.
These are generic terms, like Human mission to Mars, as opposed to proper names like Manned Orbiting Laboratory. -- Beland (talk) 19:41, 21 May 2019 (UTC)
- Manned Venus flyby - Based on the NASA style guide, NASA probably would now refer to this as "human Venus flyby" but historical sources say "manned Venus flyby" so that's what the majority of editors commenting on the talk page currently favor. There is some question as to whether the scope of the article concerns a specific mission or this type of mission in general, which is related to the proper name exception (but then the title would be "Manned Venus Flyby"). Compare Colonization of Venus and Human mission to Mars. -- Beland (talk) 19:41, 21 May 2019 (UTC)
Objections in specific cases:
Marriage
Wikipedia:Writing about women § Marriage points out:
- "is the wife of" is less neutral than "is married to" - find all "is the wife of"
- "born to X and his wife Y" is less neutral than "born to X and Y" - approximate search
- "man and wife" is less neutral than "husband and wife", and to be fully neutral the order should be varied - find all "man and wife"
Ladies
Wikipedia:Writing about women § Girls, ladies prefers "women" to "ladies" except where part of set phrases or traditional titles (like first lady). find all lowercase "ladies"
Instructional and presumptuous language
MOS:NOTE says to avoid the following phrases when they address the reader directly. Not all instances are problematic, such as those in direct quotations.
- remember that - find all "remember that"
- note that - find all "note that"
- of course - find all "of course"
- naturally - find all "naturally" (the meaning "related to nature" is not problematic)
- obviously - find all "obviously"
- clearly - find all "clearly"
- actually - find all "actually"
- rhetorical questions, especially in headings - find all questions in headings (some cases, like the names of works, are not problematic)
Internationally comprehensible spelling and vocabulary
MOS:COMMONALITY advises the use of vocabulary and spellings that are shared across national varieties of English, where possible. This section collects instances where an unshared term is being used which could be improved. For proper nouns and direct quotes, a translation or re-spelling into another dialect may be helpful.
- "gaol" should be "jail"
- Disputed, discussion underway at Wikipedia talk:Manual of Style#Gaol vs. jail
- looks like its wrapped up, with jail preferred except in proper nouns Xurizuri (talk) 15:36, 21 December 2020 (UTC)
Currency style
Per MOS:CURRENCY:
- For the UK, Irish, Australian, New Zealand, and South African pound, ₤ should be changed to £
- ₤ is OK to use with Italian lira. Changing e.g. ₤100,000 to [[Italian lira|₤]]100,000 will prevent legitimate uses from showing up in automated reports, and also help readers understand that this is not British pounds. (Mentions of Italian lira are increasingly rare because it has been replaced by the Euro.)
Caution: Not all problem pages show up reliably; if you do a search, fix all the pages in the results, and then do another search, you will probably get a fresh batch of problem pages. It may also take a minute or two for fixed pages to disappear from the results, due to lag updating the search index.
Work is in progress on detecting and fixing other MOS-related issues with numbers and currencies.
Small caps
Per MOS:BCE, smallcaps are not to be used for years like "400 BC". Find all instances of known smallcaps issues...
HTML tags
Updated from 2021-07-20 dump.
You can do one of two things for these articles:
- Remove, repair, or convert the HTML markup to wiki markup yourself.
- Tag the article {{cleanup HTML}} and it will show up under Category:Articles with HTML markup but not on this list. Use the "tags" parameter to indicate which tags are present on the page; many editors find it hard to locate the offending HTML. For example: {{cleanup HTML|tags=table, cite}}
How to clean up
See Category:Articles with HTML markup for instructions on how to find the offending tags and what to do about them.
Find all articles by tag
Can't wait for the next database dump? Want to look for or fix all instances of a specific tag? Use the links below!
- <tt> - find all
- <li>, <ol>, and <ul> - find all
- <table>, <tr>, <td>, <th>, <caption> - find all
- <i> or <em> - find all
- <dd>, <dt>, and <dl> - find all
- <cite> - find all
- <p> - find all
- <strong> and <b> - find all
- <name=> - find all
- </br> - find all
- <hr> and <hr/> - find all
- <font> - find all
- <ins> - find all
- <samp> - find all
- <q> - find all
- <wbr> - find all and find ­
- <ruby>, <rt>, and <rp> - find all
- Elements and attributes obsoleted in HTML 5 have prefab searches linked from Wikipedia:HTML 5
Additional HTML problems are listed at Special:LintErrors.
Sometimes editors use angle brackets (< and >) for other purposes. Though these are not HTML markup, they often need to be fixed.
<<...>> find all can indicate:
- French quotation marks rendered as <<quoted text>>. These should be normalized to "quoted text" or 'quoted text', even in quotations, per MOS:CONFORM.
- A broken citation that should be converted to {{cite web}})
Other weirdness:
- <the> - find all - More French quoting style, bad linking, bad citation style, etc.
- <blockquote> sometimes shows up on the reports if it is capitalized or all-caps on the article page. It should be all lowercase.
Known bad HTML tags (HB)
These are also included in the main listings.
- 1243 - <li> - 1988 Tripura Legislative Assembly election, 1993 Tripura Legislative Assembly election, 2007 UST Growling Tigers men's basketball team, 2019 European Parliament election in Romania, 2019 U.S. Open Polo Championship ... find all
- 917 - <tt> - 23 skidoo (phrase), Chargaff's rules, Content Assembly Mechanism, Cyclometer, DFA minimization ... find all
- 529 - <i> - 1926 Colored World Series, 2022 United States House of Representatives elections in Florida, 3-MeO-PCP, ?:, A Walk in the Spring Rain ... find all
- 439 - <td> - Akiyama Station, Amyloid-related imaging abnormalities, Attention (machine learning), Baraki-Nakayama Station, Brenda Lindiwe Mabaso ... find all
- 290 - <p> - 2020 NASCAR Cup Series, ASTM A193/A193M, Alan Walsh (physicist), Ambiguities in Chinese character simplification, Automatic identification system ... find all
- 250 - <em> - ALTO (protocol), Adam Curtis, Agitu Ideo Gudeta, Albert Sidney Johnston, Alchemy Film & Moving Image Festival ... find all
- 225 - <b> - 1988 Tripura Legislative Assembly election, 1993 Tripura Legislative Assembly election, 1998 Tripura Legislative Assembly election, 2003 Tripura Legislative Assembly election, 2020 Missouri lieutenant gubernatorial election ... find all
- 133 - <ol> - 1988 Tripura Legislative Assembly election, 1993 Tripura Legislative Assembly election, 2019 European Parliament election in Romania, Absolutely convex set, Adjective ... find all
- 101 - <tr> - Akiyama Station, Amyloid-related imaging abnormalities, Attention (machine learning), Baraki-Nakayama Station, Brenda Lindiwe Mabaso ... find all
- 66 - <cite> - 5th of December Party, Cape Collinson, Conic Island, David Lewis (philosopher), Iris recognition ... find all
- 59 - <hr> - Europe '72: The Complete Recordings, Kladorachi, List of aircraft of the Hellenic Air Force, Nonconvex great rhombicosidodecahedron, Octahemioctahedron ... find all
- 52 - <strong> - 2020–21 Úrvalsdeild kvenna (basketball), Altınşehir (Istanbul Metro), Daprodustat, Join Java, Kings of Israel and Judah ... find all
- 52 - <ins> - Bank Hey, Bhelsar, Bobby Hoying, Catchment area, Chameleon ... find all
- 48 - <table> - 1967 Tripura Legislative Assembly election, 1972 Tripura Legislative Assembly election, 1977 Tripura Legislative Assembly election, 1983 Tripura Legislative Assembly election, 1988 Tripura Legislative Assembly election ... find all
- 48 - <hr/> - Al-Asr, Al-Humazah, Al-Ma'un, Al-Masad, Linha do Alentejo ... find all
- 41 - </table> - Akiyama Station, Amyloid-related imaging abnormalities, Ang Probinsyano (season 1), Ang Probinsyano (season 2), Ang Probinsyano (season 3) ... find all
- 26 - <q> - Aberedw railway station, Alfred Delvau, Aramaic Uruk incantation, British Rail Passenger Timetable, Canonical link element ... find all
- 13 - <font> - Barbara Carle, Clockwork (disambiguation), Michaela Pavlátová, Milyan language, Siddique Salik ... find all
- 10 - <th> - Amyloid-related imaging abnormalities, National Tainan Junior College of Nursing ... find all
- 8 - <dd> - 1988 Tripura Legislative Assembly election, 1993 Tripura Legislative Assembly election, HTML element, Linear map ... find all
- 7 - <dt> - 1988 Tripura Legislative Assembly election, 1993 Tripura Legislative Assembly election, Linear map ... find all
- 4 - <wbr/> - Agnosia, List of COVID-19 vaccine authorizations, Viral vector vaccine ... find all
- 3 - <dl> - 1988 Tripura Legislative Assembly election, 1993 Tripura Legislative Assembly election, Linear map ... find all
Bad link formatting (HL)
These are also included in the main listings. Angle brackets are not used for external links (per Wikipedia:Manual of Style/Computing § Exposed URLs); "tags" like <https> and <www> are actually just bad link formatting. See Wikipedia:External links#How to link for external link syntax; use {{cite web}} for footnotes.
- 56 - <https> - 342nd Infantry Division (Wehrmacht), AccuWeather, Akkaldhamayile Pennu, Annette Gordon-Reed, Bentinck family ... find all
- 32 - <http> - Bangladesh Agricultural Development Corporation, Charles Arthur Bissonette, Doris Haddock, Enoch H. Pardee, Keheewin, Edmonton ... find all
- 25 - <http/> - Eakly, Oklahoma, Luisa Morgantini, Lutterworth, Moisture meter, Rietvlei Wetland Reserve ... find all
- 21 - <https/> - Alfred Scharf, CHF Entertainment, Friedrich von Frankenberg, Haplogroup E-M2, Lutterworth ... find all
- 3 - <www> - Wally Hunter ... find all
Unsorted (H)
Many of these can be replaced by {{var}} (for text to be replaced) or {{angbr}} (e.g. for linguistic notation).
- 17 - <m> - Godfried-Willem Raes, Itô calculus, Kaingang language, Lee You-cheong, Lwów subdialect ... find all
- 17 - <c> - Italians in the United Kingdom, List of Schedule 1 substances (CWC), Marzullo's algorithm, Notes on Muscovite Affairs, Old Saxon Baptismal Vow ... find all
- 16 - <no> - 430 Space Shuttle, 57th NHK Kōhaku Uta Gassen, Confederate privateer, Cricket statistics, Kinnikuman Muscle Grand Prix ... find all
- 16 - <e> - Iraya language, Is-a, Khoe–Kwadi languages, Litema, ... find all
- 15 - <gallery> - Kathua, Keluri, Malus, Nassau County Police Department, Petrapole ... find all
- 15 - <encore> - 10th Anniversary Tour Lead Upturn 2012: Now or Never, Lead 15th Anniversary Live Box, Lead Live Tour Upturn 2005, Lead Upturn 2009: Summer Day & Night Fever, Lead Upturn 2010: I'll Be Around ... find all
- 14 - <y> - Andoque language, Elementary function arithmetic, History of Proto-Slavic, Languages of Argentina, ... find all
- 13 - <number> - Fasti Ostienses, GRAU, Geom raid5, Time control, Trash (computing) ... find all
- 13 - <l> - Anatolian hieroglyphs, Blind deconvolution, Colonia Tovar, Geometric design of roads, Litema ... find all
- 13 - <k> - Gompertz function, Hub (network science), List of soft drinks by country, Old Saxon Baptismal Vow ... find all
- 13 - <d> - DT-Manie, Dataphor, Division algorithm, Experix, John (given name) ... find all
- 12 - <x> - Davenport chained rotations, Ethernet over SDH, Mitla Zapotec, Portuguese dialects ... find all
- 12 - <pv> - Bamboo Collage, Love Paradox, Softly (song), Sympathy (Hitomi Takahashi album), Vanilla (Leah Dizon song) ... find all
- 12 - <operate> - Keihin Kyuko Bus ... find all
- 11 - <the> - American Dairy Association, Charlie Luxton, Fan (surname), Haeeunlee, Lee You-cheong ... find all
- 11 - <link> - Ars Magica, GIO General, IELTS Life Skills, MTELP Series, New Dutch Academy ... find all
- 11 - <j> - County Kilkenny, Fast wavelet transform, Glenmore, County Kilkenny, John (given name), Mo Bangfu ... find all
- 10 - <citation> - Abū Isḥāq Ibrāhīm al-Zarqālī, Body image, Chatham Manor, Dunsmuir, California, Frontal lobe injury ... find all
- 10 - <ch> - Basel German, Cuban Spanish, New Rumi Spelling, Nivaclé language, Old Saxon Baptismal Vow ... find all
- 9 - <o> - Alias (TV series), Immortal Beloved, Kagate language, List of Alias characters, ... find all
- 9 - <lol> - Ala Boratyn, Before I'll Die..., Blog 27, LOL (Blog 27 album), Tola Szlagowska ... find all
- 9 - <interlude> - Hall Tour 2014: Bon Voyage, Live Tour 2007: Black Cherry, Live Tour 2015: Walk of My Life ... find all
- 9 - <in> - Alexander Bogomazov, Ilocano numbers, Karl Spencer Lashley Award, Kathy Flores, Nikita Lobanov ... find all
- 9 - <cr> - Carriage return, HTTP message body, Hayes command set, NMEA 0183, Simplified Message Desk Interface ... find all
- 8 - <personal> - Andrei Katkov, Doc Cheatham, George Air Force Base, Hangar Theatre, Jimmy Knepper ... find all
- 8 - <ll> - Languages of Argentina, Literary Welsh morphology, Lj (digraph), Paraguayan Spanish, Spanish verbs ... find all
- 7 - <string> - C++ Standard Library, Generic programming, Is-a, Java collections framework ... find all
- 7 - <random> - C++ Standard Library, Sality, Swen (computer worm), Voyager (computer worm) ... find all
- 7 - <lf> - HTTP message body, Hayes command set, NMEA 0183, Simplified Message Desk Interface ... find all
- 7 - </gallery> - Dhoni, Palakkad, Ectobius lapponicus, Fajara, Mihăilești explosion, Saint-Denis–Porte de Paris (Paris Métro) ... find all
- 6 - <year> - AMD Radeon Software, Animecon (Netherlands), Constitutional Court of Korea, Date and time notation in Catalonia, Madras High Court ... find all
- 6 - <w> - Myles Goodwyn, Old Saxon Baptismal Vow, South-West Irish English, Uyghur phonology ... find all
- 6 - <video> - Firefox 3.5, List of features in Android, Love Paradox, Unreal Media Server, Vanilla (Leah Dizon song) ... find all
- 6 - <username> - Home directory, MobileMe ... find all
- 6 - <tl> - Belizean Spanish, Costa Rican Spanish, Guatemalan Spanish ... find all
- 6 - <space> - Bitboard, Netaji Subhas Mahavidyalaya, Panos (operating system), Resistor, Unicode character property ... find all
- 6 - <reference> - 1982 in art, Anthony Banzi, Olympus Guardian ... find all
- 6 - <object> - Applet, Internet Explorer 9, Named graph, SWFObject ... find all
- 6 - <musical> - Lee You-cheong ... find all
- 6 - <more> - 2004 in Australian literature, 2004 in poetry, 2005 in literature, 2005 in poetry, Adelaide Festival Awards for Literature ... find all
- 6 - <gml> - Geography Markup Language ... find all
- 6 - <game> - SafeDisc ... find all
- 6 - <filename> - Cross File Transfer, Data source name, Ddoc, Leet (programming language), PowerHouse (programming language) ... find all
- 6 - <date> - Battle honour, Carus Publishing Company, Charles E. Fraser, Opera Dragonfly ... find all
- 6 - <dancers> - 10th Anniversary Tour Lead Upturn 2012: Now or Never, Lead Upturn 2011: Sun x You, Lead Upturn 2013: Leap ... find all
Need debugging
- 19 - <pre> - Arena (web browser) (ASCII art breaking parsing?), Back-to-back user agent (ASCII art breaking parsing), BagIt, Call graph, Code folding ... find all
- (These look legit, probably a moss bug. Beland note to self: Run these on wikitext_util functions in an interactive window to find parse breakage.)
Notification of new dumps
"Most likely misspellings by articles" should always have work to do (if not, ping Beland to add more from the current dump). Some of the other sections are occasionally waiting for a new dump to get a useful list, either because they are ranked by frequency or a code change has been made to clean up noise in the next run. New runs are generally posted twice a month. The database snapshot from the first day of the month generally takes about 9-13 days to process, and the snapshot from the twentieth day of the month might take 4-6 days until it can be posted.
All that said, if you want to get a ping when results from a new dump are posted, you can add your name to the list below. If you are only interested in a particular section, include a note to that effect.
- (add your username to this list)
- Jake The Great!📞talk! 01:40, 18 December 2019 (UTC)
- Puddleglum2.0 (talk) 20:31, 13 October 2019 (UTC)
- Schazjmd (talk) 18:25, 21 December 2018 (UTC)
- bradleyagin (talk) 04:08, 12 January 2019 (UTC)
- Darylgolden(talk) Ping when replying 00:50, 11 February 2019 (UTC)
- MarkZusab (talk) 03:52, 15 February 2019 (UTC)
- Amiodarone talk 20:52, 2 April 2019 (UTC)
- Zojomars (talk) 17:48, 31 May 2019 (UTC)
- Anarhistička Maca (talk) 06:25, 30 June 2019 (UTC)
- Clovermoss (talk) 00:46, 27 October 2019 (UTC)
- JaAlDo (talk) 14:18, 11 March 2020 (UTC)
- Creativecreatr Creativecreatr (talk) 09:56, 26 May 2020 (UTC)
- Voidify (talk) 06:12, 9 June 2020 (UTC)
- Doghouse09 (talk) 20:52, 8 September 2020 (UTC)
- -- spazure (contribs) 09:24, 2 December 2020 (UTC)
- Idell (talk) 21:26, 23 October 2020 (UTC)
- --
- Fehufangą ♮ ✉ Talk page ♮ 12:16, 28 December 2020 (UTC)
- Triethylborane (talk) 03:23, 19 May 2021 (UTC)
- littleb2009 (talk · contribs)
- Normal Name (talk) 20:28, 29 June 2021 (UTC)
- Amazomagisto (talk) 02:36, 6 July 2021 (UTC)
- TreeReader (talk) 09:17, 1 August 2021 (UTC)
moss source code
moss is written in Python, and is available on github at: https://github.com/cdbeland/moss
Data is obtained from XML database backup dumps.