Wikipedia:Reference desk/Science: Difference between revisions

Content deleted Content added
Line 500:
== Unicode CLDR for filtering mixed scripts ==
 
The last time I checked the English Wikipedia's regex filters for page titles, a lot of them were designed to prevent mixing of more than one non-Latin script. Could this be done more efficiently by a MediaWiki function that would look up each character's script in the Unicode CLDR and reject the title if more than one non-Latin language-specific script was found this way, or if a Latin and a non-Latin letter were in the same word (delimited by a normal space (\x20) or a printable punctuation mark)? [[User:NeonMerlin|<span style="background:#000;color:red;border:#0f0 solid;border-width:1px 0">Neon</span>]][[User talk:NeonMerlin|<span style="background:#0f0;color:#000;border:red solid;border-width:1px 0">Merlin</span>]] 21:57, 10 March 2024 (UTC)