User:Artoria2e5

Babel user information

zh-N	中文是这位用户的母语。

en-5	This user has professional knowledge of English.

I am Artoria2e5 on The Test Wiki. My global user page contains a few more userboxes, so check them out if you are looking for social pages.

❤️

This user is in love with User:Tsumikiria

Artoria2e5 contributes to OSM as artoria2e5

This user resists the POV pushing of lunatic charlatans.

This user is a socialist.

0

This user has made more than no edits to the English language Wikipedia.

ve

VisualEditor is pretty good for fixing tables, you know.

This user is a member of WikiProject Molecular and Cell Biology.

This user scored 697 on the Wikipediholic test (revision 1182993729).

⬜⬜⬜⬜⬜⬜⬜⬜⬜⬜⬜⬜⬜⬜⬜⬜⬜⬜⬜⬜⬜
⬜⬛⬛⬛⬜⬛⬜⬛⬜⬛⬜⬜⬜⬛⬜⬛⬛⬛⬛⬜⬜
⬜⬜⬜⬛⬜⬛⬜⬛⬜⬛⬜⬛⬜⬛⬜⬛⬜⬜⬜⬛⬜
⬜⬜⬛⬜⬜⬛⬛⬛⬜⬛⬜⬛⬜⬛⬜⬛⬛⬛⬛⬜⬜ <-- Emoj-ixel layout experiment; See zh:表情包
⬜⬛⬜⬜⬜⬛⬜⬛⬜⬛⬜⬛⬜⬛⬜⬛⬜⬜⬜⬜⬜
⬜⬛⬛⬛⬜⬛⬜⬛⬜⬜⬛⬜⬛⬜⬜⬛⬜⬜⬜⬜⬜
⬜⬜⬜⬜⬜⬜⬜⬜⬜⬜⬜⬜⬜⬜⬜⬜⬜⬜⬜⬜⬜

＼／＼／＼／＼／＼／＼／＼／＼／
ｃｏｒｎｅｒ　ｒｅｆｌｅｃｔｏｒ

Projects

I forget about things. Why not run me on XTools?

CJKV PUA

Document PUA code points used by old decoders for old CJK(V?) encodings, as normalizing PUA artifacts is as important as Unicode normalization. Most of the problem should arise in Chinese and Han Nom characters, as there are more characters to screw up.

Done:
- GB 18030#PUA
- Big5 (& -HKSCS) — Code page 950
  - MS PUA allocation: Kanji Database: Big5
  - Comprehensive history: MozTW: Mozilla & the Big5 family of encodings
  - Modern dataset: WHATWG Encoding Spec: Big5

Useful references:

Microsoft mappings (reference implementation)
commit logs from ICU and glibc (same)
WHATWG
"CJKV Information Processing" by Ken Lunde

Other plans:

Add the charts (or Python scripts with str.translate) to stanfordnlp/CoreNLP wiki, then open an issue to suggest inclusion in [1].
- Wiki ready w/ Python script and chart links: [2]

VecFool

What if we write something that automatically generates bad jokes by substituting random words in a Wikipedia article for some boring:INPUT → funny:OUTPUT analogies? Word vectors can do that pretty well.

Infobox gene

There are some good stuff I can backport to here from the zh adaptions. Some messy "refactor" diffs coming up someday.

insert breaks to loops
replace ad-hoc string ops with not-very-ad-hoc ones (aliases, etc.)
probably go for a string table like CS1 is doing?
indents. let's face it we don't care about dirty diffs if it's fixed once and for all.
some styles like actual table literals.
and yeah we don't need to write chrTextTable out like that.
do early returns. why nest it when you can jump out of the wrong ones
use long string literals.

img tools

Hugin, stitching, surprisingly cool shite
GIMP
- G'MIC for signal-processing magic
- resynthesizer for redaction

misc

some hrt translation to zh.wp i guess
gcj-02, damn it: User:Artoria2e5/coord-notice, phab:T162331
idk, maybe some madgenderscience-level fuckup
use Zhang to trace some shit from DigiGlobe to OSM
Special:PrefixIndex/Prokaryotic is a good source of bacteria-centric fuckups
Special:Search/"bundled in GNU coreutils" -POSIX: is part of the [[X/Open]] Portability Guide since issue 2 of 1987. It was inherited into the first version of POSIX and the [[Single Unix Specification]].<ref>{{man|cu|df|SUS}}</ref> It first appeared in <ref>{{man|1|df|FreeBSD}}</ref>

Todo bucket:

teichoic acid