Romanization: Difference between revisions

Browse history interactively

← Previous edit Next edit →

Content deleted Content added

VisualWikitext

Revision as of 05:51, 21 December 2022 edit 212.192.228.47 (talk) No edit summary Tags: Reverted Mobile edit Mobile web edit ← Previous edit		Revision as of 21:10, 9 August 2024 edit undo Alexysun (talk \| contribs) Extended confirmed users 2,223 edits No edit summary Tag: Visual edit Next edit →
(41 intermediate revisions by 33 users not shown)
Line 1: {{Short description\|Transliteration or transcription to Latin ~~characters~~letters}} {{Redirect\|Romanised\|the racehorse\|Romanised (horse)}} {{Other uses\|~~Romanization~~Romanisation (disambiguation)\|Latinisation (disambiguation)}} {{Distinguish\|Romanianization}} [[File:Gwoyu.svg\|thumb\|upright=0.7\|[[Standard Chinese\|Mandarin Chinese]], like many languages, can be romanized in a number of ways; ~~abпЦид~~above: [[Traditional Chinese characters\|Traditional]] and [[Simplified Chinese characters\|Simplified Chinese]] characters meaning Chinese, and romanization systems[[Hanyu pinyin\|Hanyu Pinyin]], [[Gwoyeu Romatzyh]], [[Wade-Giles]] and [[Yale romanization of Mandarin\|Yale]] for those characters.]] ove: [[Traditional Chinese characters\|Traditional]] and [[Simplified Chinese characters\|Simplified Chinese]], and [[Hanyu pinyin\|Hanyu Pinyin]], [[Gwoyeu Romatzyh]], [[Wade-Giles]] and [[Yale romanization of Mandarin\|Yale]]]] ~~'''Romanization'''~~In or[[linguistics]], '''~~romanisation~~romanization'''~~, in [[linguistics]],~~ is the conversion of text from a different [[writing system]] to the [[Latin script\|Roman (Latin) script]], or a system for doing so. Methods of romanization include [[transliteration]], for representing written text, and [[transcription (linguistics)\|transcription]], for representing the spoken word, and combinations of both. Transcription methods can be subdivided into ''[[phonemic orthography\|phonemic]] transcription'', which records the [[phoneme]]s or units of [[semantic]] meaning in speech, and more strict ''[[phonetic transcription]]'', which records speech sounds with precision. ==Methods== There are many consistent or [[standardized]] romanization systems. They can be classified by their characteristics. A particular ~~system’s~~system's characteristics may make it better-suited for various, sometimes contradictory applications, including document retrieval, linguistic analysis, easy readability, faithful representation of pronunciation. * Source, or donor language – A system may be tailored to romanize text from a particular language, or a series of languages, or for any language in a particular writing system. A language-specific system typically preserves language features like pronunciation, while the general one may be better for cataloguing international texts. * Target, or receiver language – Most systems are intended for an audience that speaks or reads a particular language. (So-called ''international'' romanization systems for Cyrillic text are based on central-European alphabets like the [[Czech alphabet\|Czech]] and [[Croatian alphabet]].) Line 30 ⟶ 29: A [[phonetic]] conversion goes one step further and attempts to depict all [[Phone (phonetics)\|phones]] in the source language, sacrificing legibility if necessary by using characters or conventions not found in the target script. In practice such a representation almost never tries to represent ''every'' possible allophone—especially those that occur naturally due to [[coarticulation]] effects—and instead limits itself to the most significant allophonic distinctions. The [[International Phonetic Alphabet]] is the most common system of phonetic transcription. ===Trade~~-offs~~=== For most language pairs, building a usable romanization involves trade~~-offs~~ between the two extremes. Pure transcriptions are generally not possible, as the source language usually contains sounds and distinctions not found in the target language, but which must be shown for the romanized form to be comprehensible. Furthermore, due to [[wikt:diachronic\|diachronic]] and [[wikt:synchronic\|synchronic]] variance no [[written language]] represents any [[spoken language]] with perfect accuracy and the vocal interpretation of a [[Writing system\|script]] may vary by a great degree among languages. In modern times the chain of transcription is usually spoken foreign language, written foreign language, written native language, spoken (read) native language. Reducing the number of those processes, i.e. removing one or both steps of writing, usually leads to more accurate oral articulations. In general, outside a limited audience of scholars, romanizations tend to lean more towards transcription. As an example, consider the Japanese martial art 柔術: the Nihon-shiki romanization ''zyûzyutu'' may allow someone who knows Japanese to reconstruct the kana syllables {{nowrap\|じゅうじゅつ}}, but most native English speakers, or rather readers, would find it easier to guess the pronunciation from the Hepburn version, ''[[jujutsu\|jūjutsu]]''. ==Romanization of specific writing systems== {{see also\|:Category:Romanization}} {{incomplete list\|date=May 2021}} ===Arabic=== {{Main\|Romanization of Arabic\|Maltese alphabet}} The [[Arabic alphabet]] is used to write [[Arabic language\|Arabic]], [[Persian language\|Persian]], [[Urdu]], [[Pashto language\|Pashto]] and [[Sindhi language\|Sindhi]] as well as numerous other languages in the Muslim world, particularly [[Languages of Africa\|African]] and [[Languages of Asia\|Asian]] languages without alphabets of their own. Romanization standards include the following: * {{Lang\|de\|[[Deutsche Morgenländische Gesellschaft]]\|italic=no}} (1936): Adopted by the International Convention of Orientalist Scholars in Rome. It is the basis for the very influential [[Dictionary of Modern Written Arabic\|Hans Wehr dictionary]] ({{ISBN\|0-87950-003-4}}).<ref>{{cite web\|url=http://www.dmg-web.de/ \|title=Deutsche Morgenländische Gesellschaft \|~~publisher~~work=Dmg-web.de \|access-date=2015-07-02}}</ref> * [[BS 4280]] (1968): Developed by the [[British Standards Institution]]<ref>{{cite web \|url=http://www.bsi-global.com/index.xalter \|title=Standards, Training, Testing, Assessment and Certification ~~\|~~ \|publisher=[[BSI Group]] \|~~publisher~~work=~~Bsi~~BSI-~~global~~Global.com \|access-date=2013-04-25}}</ref> * [[SATTS]] (1970s): A one-for-one substitution system, a legacy from the [[Morse code]] era * [[UNGEGN]] (1972)<ref>{{cite web\|url=http://www.eki.ee/wgrs/rom1_ar.pdf \|title=Arabic \|~~publisher~~work=Eki.ee \|access-date=2015-07-02}}</ref> * [[DIN 31635]] (1982): Developed by the {{lang\|de\|[[Deutsches Institut für Normung]]\|italic=no}} (German Institute for Standardization) * [[ISO 233]] (1984). Transliteration. * [[Qalam]] (1985): A system that focuses upon preserving the spelling, rather than the pronunciation, and uses mixed case<ref>{{cite web \|url=http://eserver.org/langs/qalam.txt \|format=TXT \|title=Qalam: A Convention for Morphological ~~rabic~~Arabic-Latin-Arabic Transliteration \|~~publisher~~work=~~Eserver~~EServer.org \|access-date=2015-07-02 \|url-status=dead \|archive-url=https://web.archive.org/web/20090208143407/http://eserver.org/langs/qalam.txt \|archive-date=2009-02-08 }}</ref> * [[ISO 233-2]] (1993): Simplified transliteration. * [[Buckwalter transliteration]] (1990s): Developed at [[Xerox]] by [[Tim Buckwalter]];<ref>{{cite web \|url=http://www.qamus.org/transliteration.htm \|title=Buckwalter Arabic Transliteration \|~~publisher~~work=Qamus.org \|access-date=2013-04-25}}</ref> ~~doesn't~~does not require unusual [[diacritic]]s<ref>{{cite web \|first=Ken \|last=Beesley \|url=http://www.xrce.xerox.com/competencies/content-analysis/arabic/info/buckwalter-about.html \|archive-url=https://web.archive.org/web/20020424194140/http://www.xrce.xerox.com/competencies/content-analysis/arabic/info/buckwalter-about.html \|url-status=dead \|archive-date=2002-04-24 \|title=~~Open~~The ~~Xerox:~~Buckwalter ~~arabic-morphology~~Transliteration ~~Service Home Page~~\|work= \|publisher=~~Xrce.xerox.com~~[[Xerox Research Centre Europe]] \|date=2010-11-22 \|access-date=2013-04-25 }}</ref> * [[ALA-LC romanization\|ALA-LC]] (1997)<ref>{{cite web\|url=https://www.loc.gov/catdir/cpso/romanization/arabic.pdf \|title=Arabic \|publisher=~~Loc.gov~~[[Library of Congress]] \|access-date=2015-07-02}}</ref> * [[Arabic chat alphabet]] Line 54: {{Main\|Romanization of Persian}} {{See also\|:Category:Persian orthography}} {\| class="wikitable" style="text-align: center;" \|+ Consonants ! Unicode \|\| Persian<br />letter \|\| IPA \|\| DMG (1969) \|\| ALA-LC (1997) \|\| BGN/PCGN (1958) \|\| EI (1960) \|\| EI (2012) \|\| UN (1967) \|\| UN (2012) !Pronunciation \|- \| U+0627 \|\| style="font-size: 190%;" \| {{lang\|fa\|ا}} \|\| {{IPA\|ʔ, ∅}}{{efn\|name=vowel}} \|\| ʾ, —{{efn\|name=hamza-ayn}} \|\| colspan="4" \| ʼ, —{{efn\|name=hamza-ayn}} \|\| colspan="2" \| ʾ \| - as in uh-oh \|- \| U+0628 \|\| style="font-size: 190%;" \| {{lang\|fa\|ب}} \|\| {{IPA\|b}} \|\| colspan="7" \| b \|B as in Bob \|- \| U+067E \|\| style="font-size: 190%;" \| {{lang\|fa\|پ}} \|\| {{IPA\|p}} \|\| colspan="7" \| p \|P as in pet \|- \| U+062A \|\| style="font-size: 190%;" \| {{lang\|fa\|ت}} \|\| {{IPA\|t}} \|\| colspan="7" \| t \|T as in tall \|- \| U+062B \|\| style="font-size: 190%;" \| {{lang\|fa\|ث}} \|\| {{IPA\|s}} \|\| s̱ \|\| s̱ \|\| s̄ \|\| t͟h \|\| ṯ \|\| s̄ \|\| s \|S as in sand \|- \| U+062C \|\| style="font-size: 190%;" \| {{lang\|fa\|ج}} \|\| {{IPA\|dʒ}} \|\| ǧ \|\| j \|\| j \|\| d͟j \|\| j \|\| colspan="2" \| j \|J as in jam \|- \| U+0686 \|\| style="font-size: 190%;" \| {{lang\|fa\|چ}} \|\| {{IPA\|tʃ}} \|\| č \|\| ch \|\| ch \|\| colspan="2" \| č \|\| ch \|\| č \|Ch as in Charlie \|- \| U+062D \|\| style="font-size: 190%;" \| {{lang\|fa\|ح}} \|\| {{IPA\|h}} \|\| ḥ \|\| ḥ \|\| ḩ/ḥ{{efn\|name=dot}} \|\| colspan="2" \| ḥ \|\| ḩ \|\| h \|H as in holiday \|- \| U+062E \|\| style="font-size: 190%;" \| {{lang\|fa\|خ}} \|\| {{IPA\|x}} \|\| ḫ \|\| kh \|\| kh \|\| k͟h \|\| ḵ \|\| kh \|\| x \|somewhat resembling German Ch \|- \| U+062F \|\| style="font-size: 190%;" \| {{lang\|fa\|د}} \|\| {{IPA\|d}} \|\| colspan="7" \| d \|D as in Dave \|- \| U+0630 \|\| style="font-size: 190%;" \| {{lang\|fa\|ذ}} \|\| {{IPA\|z}} \|\| ẕ \|\| ẕ \|\| z̄ \|\| d͟h \|\| ḏ \|\| z̄ \|\| z \|Z as in zero \|- \| U+0631 \|\| style="font-size: 190%;" \| {{lang\|fa\|ر}} \|\| {{IPA\|r}} \|\| colspan="7" \| r \|R as in rabbit \|- \| U+0632 \|\| style="font-size: 190%;" \| {{lang\|fa\|ز}} \|\| {{IPA\|z}} \|\| colspan="7" \| z \|Z as in zero \|- \| U+0698 \|\| style="font-size: 190%;" \| {{lang\|fa\|ژ}} \|\| {{IPA\|ʒ}} \|\| ž \|\| zh \|\| zh \|\| z͟h \|\| ž \|\| zh \|\| ž \|S as in television or G as in genre \|- \| U+0633 \|\| style="font-size: 190%;" \| {{lang\|fa\|س}} \|\| {{IPA\|s}} \|\| colspan="7" \| s \|S as in Sam \|- \| U+0634 \|\| style="font-size: 190%;" \| {{lang\|fa\|ش}} \|\| {{IPA\|ʃ}} \|\| š \|\| sh \|\| sh \|\| s͟h \|\| š \|\| sh \|\| š \|Sh as in sheep \|- \| U+0635 \|\| style="font-size: 190%;" \| {{lang\|fa\|ص}} \|\| {{IPA\|s}} \|\| ṣ \|\| ṣ \|\| ş/ṣ{{efn\|name=dot}} \|\| colspan="2" \| ṣ \|\| ş \|\| s \|S as in Sam \|- \| U+0636 \|\| style="font-size: 190%;" \| {{lang\|fa\|ض}} \|\| {{IPA\|z}} \|\| ż \|\| z̤ \|\| ẕ \|\| ḍ \|\| ż \|\| ẕ \|\| z \|Z as in zero \|- \| U+0637 \|\| style="font-size: 190%;" \| {{lang\|fa\|ط}} \|\| {{IPA\|t}} \|\| ṭ \|\| ṭ \|\| ţ/ṭ{{efn\|name=dot}} \|\| colspan="2" \| ṭ \|\| ţ \|\| t \|t as in tank \|- \| U+0638 \|\| style="font-size: 190%;" \| {{lang\|fa\|ظ}} \|\| {{IPA\|z}} \|\| ẓ \|\| ẓ \|\| z̧/ẓ{{efn\|name=dot}} \|\| ẓ \|\| ẓ \|\| z̧ \|\| z \|Z as in zero \|- \| U+0639 \|\| style="font-size: 190%;" \| {{lang\|fa\|ع}} \|\| {{IPA\|ʕ}} \|\| ʿ \|\| ʻ \|\| ʼ{{efn\|name=hamza-ayn}} \|\| ʻ \|\| ʻ \|\| ʿ \|\| ʿ \| _____ \|- \| U+063A \|\| style="font-size: 190%;" \| {{lang\|fa\|غ}} \|\| {{IPA\|ɢ~ɣ}} \|\| ġ \|\| gh \|\| gh \|\| g͟h \|\| ḡ \|\| gh \|\| q \|somewhat resembling French R \|- \| U+0641 \|\| style="font-size: 190%;" \| {{lang\|fa\|ف}} \|\| {{IPA\|f}} \|\| colspan="7" \| f \|F as in Fred \|- \| U+0642 \|\| style="font-size: 190%;" \| {{lang\|fa\|ق}} \|\| {{IPA\|ɢ~ɣ}} \|\| colspan="3" \| q \|\| colspan="2" \| ḳ \|\| colspan="2" \| q \|somewhat resembling French R \|- \| U+06A9 \|\| style="font-size: 190%;" \| {{lang\|fa\|ک}} \|\| {{IPA\|k}} \|\| colspan="7" \| k \|C as in card \|- \| U+06AF \|\| style="font-size: 190%;" \| {{lang\|fa\|گ}} \|\| {{IPA\|ɡ}} \|\| colspan="7" \| g \|G as in go \|- \| U+0644 \|\| style="font-size: 190%;" \| {{lang\|fa\|ل}} \|\| {{IPA\|l}} \|\| colspan="7" \| l \|L as in lamp \|- \| U+0645 \|\| style="font-size: 190%;" \| {{lang\|fa\|م}} \|\| {{IPA\|m}} \|\| colspan="7" \| m \|M as in Michael \|- \| U+0646 \|\| style="font-size: 190%;" \| {{lang\|fa\|ن}} \|\| {{IPA\|n}} \|\| colspan="7" \| n \|N as in name \|- \| U+0648 \|\| style="font-size: 190%;" \| {{lang\|fa\|و}} \|\| {{IPA\|v~w}}{{efn\|name=vowel}}{{efn\|name=xw}} \|\| colspan="4" \| v \|\| v, w{{efn\|name=Dari}} \|\| colspan="2" \| v \|V as in vision \|- \| U+0647 \|\| style="font-size: 190%;" \| {{lang\|fa\|ه}} \|\| {{IPA\|h}}{{efn\|name=vowel}} \|\| h \|\| h \|\| h{{efn\|name=h-final}} \|\| colspan="2" \| h \|\| h{{efn\|name=h-final}} \|\| h{{efn\|name=h-final}} \|H as in hot \|- \| U+0629 \|\| style="font-size: 190%;" \| {{lang\|fa\|ة}} \|\| {{IPA\|∅, t}} \|\| — \|\| h{{efn\|name=yh}} \|\| — \|\| t{{efn\|name=marbuta}} \|\| h{{efn\|name=yh}} \|\| — \|\| — \| \|- \| U+06CC \|\| style="font-size: 190%;" \| {{lang\|fa\|ی}} \|\| {{IPA\|j}}{{efn\|name=vowel}} \|\| colspan="7" \| y \|Y as in Yale \|- \| U+0621 \|\| style="font-size: 190%;" \| {{lang\|fa\|ء}} \|\| {{IPA\|ʔ, ∅}} \|\| ʾ \|\| colspan="4" \| ʼ \|\| colspan="2" \| ʾ \| \|- \| U+0623 \|\| style="font-size: 190%;" \| {{lang\|fa\|أ}} \|\| {{IPA\|ʔ, ∅}} \|\| ʾ \|\| colspan="4" \| ʼ \|\| colspan="2" \| ʾ \| \|- \| U+0624 \|\| style="font-size: 190%;" \| {{lang\|fa\|ؤ}} \|\| {{IPA\|ʔ, ∅}} \|\| ʾ \|\| colspan="4" \| ʼ \|\| colspan="2" \| ʾ \| \|- \| U+0626 \|\| style="font-size: 190%;" \| {{lang\|fa\|ئ}} \|\| {{IPA\|ʔ, ∅}} \|\| ʾ \|\| colspan="4" \| ʼ \|\| colspan="2" \| ʾ \| \|} {\| class="wikitable" style="text-align: center;" \|+ Vowels{{efn\|name=diacritics}} \|- ! Unicode \|\| Final \|\| Medial \|\| Initial \|\| Isolated \|\| IPA \|\| DMG (1969) \|\| ALA-LC (1997) \|\| BGN/PCGN (1958) \|\| EI (2012) \|\| UN (1967) \|\| UN (2012) !Pronunciation \|- \| U+064E \|\| style="font-size: 160%;" \| {{lang\|fa\|ـَ}} \|\| style="font-size: 160%;" \| {{lang\|fa\|ـَ}} \|\| style="font-size: 160%;" \| {{lang\|fa\|اَ}} \|\| style="font-size: 160%;" \| {{lang\|fa\|اَ}} \|\| {{IPA\|æ}} \|\| a \|\| a \|\| a \|\| a \|\| a \|\| a \|A as in cat \|- \| U+064F \|\| style="font-size: 160%;" \| {{lang\|fa\|ـُ}} \|\| style="font-size: 160%;" \| {{lang\|fa\|ـُ}} \|\| style="font-size: 160%;" \| {{lang\|fa\|اُ}} \|\| style="font-size: 160%;" \| {{lang\|fa\|اُ}} \|\| {{IPA\|o}} \|\| o \|\| o \|\| o \|\| u \|\| o \|\| o \|O as in go \|- \| U+0648 U+064F \|\| style="font-size: 160%;" \| {{lang\|fa\|ـوَ}} \|\| style="font-size: 160%;" \| {{lang\|fa\|ـوَ}} \|\| style="font-size: 160%;" \| — \|\| style="font-size: 160%;" \| — \|\| {{IPA\|o}}{{efn\|name=wo}} \|\| o \|\| o \|\| o \|\| u \|\| o \|\| o \|O as in go \|- \| U+0650 \|\| style="font-size: 160%;" \| {{lang\|fa\|ـِ}} \|\| style="font-size: 160%;" \| {{lang\|fa\|ـِ}} \|\| style="font-size: 160%;" \| {{lang\|fa\|اِ}} \|\| style="font-size: 160%;" \| {{lang\|fa\|اِ}} \|\| {{IPA\|e}} \|\| e \|\| i \|\| e \|\| e \|\| e \|\| e \|E as in ten \|- \| U+064E U+0627 \|\| style="font-size: 160%;" \| {{lang\|fa\|ـَا}} \|\| style="font-size: 160%;" \| {{lang\|fa\|ـَا}} \|\| style="font-size: 160%;" \| {{lang\|fa\|آ}} \|\| style="font-size: 160%;" \| {{lang\|fa\|آ}} \|\| {{IPA\|ɑː~ɒː}} \|\| ā \|\| ā \|\| ā \|\| ā \|\| ā \|\| ā \|O as in hot \|- \| U+0622 \|\| style="font-size: 160%;" \| {{lang\|fa\|ـآ}} \|\| style="font-size: 160%;" \| {{lang\|fa\|ـآ}} \|\| style="font-size: 160%;" \| {{lang\|fa\|آ}} \|\| style="font-size: 160%;" \| {{lang\|fa\|آ}} \|\| {{IPA\|ɑː~ɒː}} \|\| ā, ʾā{{efn\|name=aa}} \|\| ā, ʼā{{efn\|name=aa}} \|\| ā \|\| ā \|\| ā \|\| ā \|O as in hot \|- \| U+064E U+06CC \|\| style="font-size: 160%;" \| {{lang\|fa\|ـَی}} \|\| style="font-size: 160%;" \| — \|\| style="font-size: 160%;" \| — \|\| style="font-size: 160%;" \| — \|\| {{IPA\|ɑː~ɒː}} \|\| ā \|\| á \|\| á \|\| ā \|\| á \|\| ā \|O as in hot \|- \| U+06CC U+0670 \|\| style="font-size: 160%;" \| {{lang\|fa\|ـیٰ}} \|\| style="font-size: 160%;" \| — \|\| style="font-size: 160%;" \| — \|\| style="font-size: 160%;" \| — \|\| {{IPA\|ɑː~ɒː}} \|\| ā \|\| á \|\| á \|\| ā \|\| ā \|\| ā \|O as in hot \|- \| U+064F U+0648 \|\| style="font-size: 160%;" \| {{lang\|fa\|ـُو}} \|\| style="font-size: 160%;" \| {{lang\|fa\|ـُو}} \|\| style="font-size: 160%;" \| {{lang\|fa\|اُو}} \|\| style="font-size: 160%;" \| {{lang\|fa\|اُو}} \|\| {{IPA\|uː, oː}}{{efn\|name=Dari}} \|\| ū \|\| ū \|\| ū \|\| u, ō{{efn\|name=Dari}} \|\| ū \|\| u \|U as in actual \|- \| U+0650 U+06CC \|\| style="font-size: 160%;" \| {{lang\|fa\|ـِی}} \|\| style="font-size: 160%;" \| {{lang\|fa\|ـِیـ}} \|\| style="font-size: 160%;" \| {{lang\|fa\|اِیـ}} \|\| style="font-size: 160%;" \| {{lang\|fa\|اِی}} \|\| {{IPA\|iː, eː}}{{efn\|name=Dari}} \|\| ī \|\| ī \|\| ī \|\| i, ē{{efn\|name=Dari}} \|\| ī \|\| i \|Y as in happy \|- \| U+064E U+0648 \|\| style="font-size: 160%;" \| {{lang\|fa\|ـَو}} \|\| style="font-size: 160%;" \| {{lang\|fa\|ـَو}} \|\| style="font-size: 160%;" \| {{lang\|fa\|اَو}} \|\| style="font-size: 160%;" \| {{lang\|fa\|اَو}} \|\| {{IPA\|ow~aw}}{{efn\|name=Dari}} \|\| au \|\| aw \|\| ow \|\| ow, aw{{efn\|name=Dari}} \|\| ow \|\| ow \|O as in go \|- \| U+064E U+06CC \|\| style="font-size: 160%;" \| {{lang\|fa\|ـَی}} \|\| style="font-size: 160%;" \| {{lang\|fa\|ـَیـ}} \|\| style="font-size: 160%;" \| {{lang\|fa\|اَیـ}} \|\| style="font-size: 160%;" \| {{lang\|fa\|اَی}} \|\| {{IPA\|ej~aj}}{{efn\|name=Dari}} \|\| ai \|\| ay \|\| ey \|\| ey, ay{{efn\|name=Dari}} \|\| ey \|\| ey \|Ay as in play \|- \| U+064E U+06CC \|\| style="font-size: 160%;" \| {{lang\|fa\|ـیِ}} \|\| style="font-size: 160%;" \| — \|\| style="font-size: 160%;" \| — \|\| style="font-size: 160%;" \| — \|\| {{IPA\|–e, –je}} \|\| –e, –ye \|\| –i, –yi \|\| –e, –ye \|\| –e, –ye \|\| –e, –ye \|\| –e, –ye \|Ye as in yes \|- \| U+06C0 \|\| style="font-size: 160%;" \| {{lang\|fa\|ـهٔ}} \|\| style="font-size: 160%;" \| — \|\| style="font-size: 160%;" \| — \|\| style="font-size: 160%;" \| — \|\| {{IPA\|–je}} \|\| –ye \|\| –ʼi \|\| –ye \|\| –ye \|\| –ye \|\| –ye \|Ye as in yes \|} '''Notes''': {{notelist\|refs= {{efn\|name=hamza-ayn\|Hamza and ayn are not transliterated at the beginning of words.}} {{efn\|name=dot\|The dot below may be used instead of cedilla.}} {{efn\|name=Dari\|In Dari.}} {{efn\|name=yh\|In the combination {{angbr\|{{lang\|fa\|یة}}}} at the end of words.}} {{efn\|name=marbuta\|When used instead of {{angbr\|{{lang\|fa\|ت}}}} at the end of words.}} {{efn\|name=vowel\|Used as a vowel as well.}} {{efn\|name=xw\|At the beginning of words the combination {{angbr\|{{lang\|fa\|خو}}}} was pronounced {{IPA\|/xw/}} or {{IPA\|/xʷ/}} in Classical Persian. In modern varieties the glide {{IPA\|/ʷ/}} has been lost, though the spelling has not been changed. It may be still heard in Dari as a relict pronunciation. The combination {{IPA\|/xʷa/}} was changed to {{IPA\|/xo/}} (see below).}} {{efn\|name=h-final\|Not transliterated at the end of words.}} {{efn\|name=diacritics\|Diacritical signs (''[[Arabic diacritics\|harakat]]'') are rarely written.}} {{efn\|name=aa\|After vowels.}} {{efn\|name=wo\|After {{angbr\|{{lang\|fa\|خ}}}} from the earlier {{IPA\|/xʷa/}}. Often transliterated as ''xwa'' or ''xva''. For example, {{lang\|fa\|خور}} {{IPA\|/xor/}} "sun" was {{IPA\|/xʷar/}} in Classical Persian.}} }} ===Armenian=== Line 62 ⟶ 244: {{Main\|Romanization of Georgian}} {{expand section\|date=June 2015}} {\| class="wikitable" style="text-align:center;" ! <small>[[Georgian scripts\|Georgian letter]]</small> ! <small>[[International Phonetic Alphabet\|IPA]]</small> ! <small>National system<br> (2002)</small> ! <small>[[BGN/PCGN romanization\|BGN/PCGN]]<br> (1981—2009)</small> ! <small>ISO 9984<br> (1996)</small> ! <small>[[ALA-LC romanization\|ALA-LC]]<br> (1997)</small> ! <small>Unofficial system</small> ! <small>Kartvelo translit</small> ! <small>NGR2</small> \|- \| <span style="font-size: 170%; line-height:1em">ა</span> \| {{IPA\|/ɑ/}} \|\| a \| a \| a \| a \| a \| a \| a \|- \| <span style="font-size: 170%; line-height:1em">ბ</span> \| {{IPA\|/b/}} \|\| b \| b \| b \| b \| b \| b \| b \|- \| <span style="font-size: 170%; line-height:1em">გ</span> \| {{IPA\|/ɡ/}} \|\| g \| g \| g \| g \| g \| g \| g \|- \| <span style="font-size: 170%; line-height:1em">დ</span> \| {{IPA\|/d/}} \|\| d \| d \| d \| d \| d \| d \| d \|- \| <span style="font-size: 170%; line-height:1em">ე</span> \| {{IPA\|/ɛ/}} \|\| e \| e \| e \| e \| e \| e \| e \|- \| <span style="font-size: 170%; line-height:1em">ვ</span> \| {{IPA\|/v/}} \|\| v \| v \| v \| v \| v \| v \| v \|- \| <span style="font-size: 170%; line-height:1em">ზ</span> \| {{IPA\|/z/}} \|\| z \| z \| z \| z \| z \| z \| z \|-style="background:#FFCCCC" \| <span style="font-size: 170%; line-height:1em">ჱ</span>{{efn\|name=archaic\|Archaic letters.}} \| {{IPA\|/eɪ/}} \|\| \| ey \| ē \| ē \| é \| ej \| ẽ \|- \| <span style="font-size: 170%; line-height:1em">თ</span> \| {{IPA\|/tʰ/}} \|\| t \| tʼ \| t̕ \| tʻ \| T{{efn\|name=IbL\|These are influenced by aforementioned layout, and are preferred to avoid ambiguity, as an expressions: t, j, g, ch can mean two letters.}} or t \| t \| t / t̊ \|- \| <span style="font-size: 170%; line-height:1em">ი</span> \| {{IPA\|/i/}} \|\| i \| i \| i \| i \| i \| i \| i \|- \| <span style="font-size: 170%; line-height:1em">კ</span> \| {{IPA\|/kʼ/}} \|\| kʼ \| k \| k \| k \| k \| ǩ \| k̉ \|- \| <span style="font-size: 170%; line-height:1em">ლ</span> \| {{IPA\|/l/}} \|\| l \| l \| l \| l \| l \| l \| l \|- \| <span style="font-size: 170%; line-height:1em">მ</span> \| {{IPA\|/m/}} \|\| m \| m \| m \| m \| m \| m \| m \|- \| <span style="font-size: 170%; line-height:1em">ნ</span> \| {{IPA\|/n/}} \|\| n \| n \| n \| n \| n \| n \| n \|-style="background:#FFCCCC" \| <span style="font-size: 170%; line-height:1em">ჲ</span>{{efn\|name=archaic}} \| {{IPA\|/i/, /j/}} \|\| \| j \| y \| y \| \| j \| ĩ \|- \|<span style="font-size: 170%; line-height:1em">ო</span> \|{{IPA\|/ɔ/}} \| o \|\| o \| o \| o \| o \| o \| o \|- \| <span style="font-size: 170%; line-height:1em">პ</span> \| {{IPA\|/pʼ/}} \|\| pʼ \| p \| p \| p \| p \| p̌ \| p̉ \|- \| <span style="font-size: 170%; line-height:1em">ჟ</span> \| {{IPA\|/ʒ/}} \|\| zh \| zh \| ž \| ž \| J,{{efn\|name=IbL}} zh or j \| ž \| g̃ \|- \| <span style="font-size: 170%; line-height:1em">რ</span> \| {{IPA\|/r/}} \|\| r \| r \| r \| r \| r \| r \| r \|- \| <span style="font-size: 170%; line-height:1em">ს</span> \| {{IPA\|/s/}} \|\| s \| s \| s \| s \| s \| s \| s \|- \| <span style="font-size: 170%; line-height:1em">ტ</span> \| {{IPA\|/tʼ/}} \|\| tʼ \| t \| t \| t \| t \| t̆ \| t̉ \|-style="background:#FFCCCC" \| <span style="font-size: 170%; line-height:1em">ჳ</span>{{efn\|name=archaic}} \| {{IPA\|/w/}} \|\| \| \| w \| w \| \| ŭ \| f̃ \|- \|<span style="font-size: 170%; line-height:1em">უ</span> \|{{IPA\|/u/}} \| u \|\| u \| u \| u \| u \| u \| u \|- \| <span style="font-size: 170%; line-height:1em">ფ</span> \| {{IPA\|/pʰ/}} \|\| p \| pʼ \| p̕ \| pʻ \| p or f \| p \| p / p̊ \|- \|<span style="font-size: 170%; line-height:1em">ქ</span> \|{{IPA\|/kʰ/}} \| k \|\| kʼ \| k̕ \| kʻ \| q or k \| q or k \| k / k̊ \|- \|<span style="font-size: 170%; line-height:1em">ღ</span> \|{{IPA\|/ʁ/}} \| gh \|\| gh \| ḡ \| ġ \| g, gh or R{{efn\|name=IbL}} \| g, gh or R{{efn\|name=IbL}} \| q̃ \|- \| <span style="font-size: 170%; line-height:1em">ყ</span> \| {{IPA\|/qʼ/}} \|\| qʼ \| q \| q \| q \| y{{efn\|Initially, the use of letter y for ყ is most probably due to their resemblance to each other.}} \| q \| q \|- \|<span style="font-size: 170%; line-height:1em">შ</span> \|{{IPA\|/ʃ/}} \| sh \|\| sh \| š \| š \| sh or S{{efn\|name=IbL}} \| š \| x \|- \|<span style="font-size: 170%; line-height:1em">ჩ</span> \|{{IPA\|/t͡ʃ(ʰ)/}} \| ch \|\| chʼ \| č̕ \| čʻ \| ch or C{{efn\|name=IbL}} \| č \| c̃ \|- \|<span style="font-size: 170%; line-height:1em">ც</span> \|{{IPA\|/t͡s(ʰ)/}} \| ts \|\| tsʼ \| c̕ \| cʻ \| c or ts \| c \| c \|- \| <span style="font-size: 170%; line-height:1em">ძ</span> \| {{IPA\|/d͡z/}} \|\| dz \| dz \| j \| ż \| dz or Z{{efn\|name=IbL}} \| ʒ \| d̃ \|- \| <span style="font-size: 170%; line-height:1em">წ</span> \| {{IPA\|/t͡sʼ/}} \|\| tsʼ \| ts \| c \| c \| w, c or ts \| ʃ \| c̉ \|- \| <span style="font-size: 170%; line-height:1em">ჭ</span> \| {{IPA\|/t͡ʃʼ/}} \|\| chʼ \| ch \| č \| č \| W,{{efn\|name=IbL}} ch or tch \| ʃ̌ \| j̉ \|- \| <span style="font-size: 170%; line-height:1em">ხ</span> \| {{IPA\|/χ/}} \|\| kh \| kh \| x \| x \| x or kh (rarely) \| x \| k̃ \|-style="background:#FFCCCC" \| <span style="font-size: 170%; line-height:1em">ჴ</span>{{efn\|name=archaic}} \| {{IPA\|/q/, /qʰ/}} \|\| \| qʼ \| ẖ \| x̣ \| \| q̌ \| q̊ \|- \|<span style="font-size: 170%; line-height:1em">ჯ</span> \|{{IPA\|/d͡ʒ/}} \| j \|\| j \| ǰ \| j \| j \| ǯ\| - \| j \|- \|<span style="font-size: 170%; line-height:1em">ჰ</span> \|{{IPA\|/h/}} \| h \| h \| h \| h \| h \| h \| h \|- style="background:#FFCCCC" \|<span style="font-size: 170%; line-height:1em">ჵ</span>{{efn\|name=archaic}} \|{{IPA\|/oː/}}\|\| \| \| ō \| ō \| \| ȯ \| h̃ \|}<br /> '''Notes''': {{reflist\|group=lower-alpha}} ===Greek=== {{Main\|Romanization of Greek}} There are romanization systems for both [[Modern Greek\|Modern]] and [[Ancient Greek]]. * [[ALA-LC romanization\|ALA-LC]]<ref>{{cite web\|url=https://www.loc.gov/catdir/cpso/romanization/greek.pdf \|title=Greek \|publisher=~~Loc.gov~~[[Library of Congress]] \|access-date=2015-07-02}}</ref> * [[Beta Code]]<ref>{{cite web\|url=http://www.tlg.uci.edu/BCM2004.pdf \|title=The TLG<sup>®</sup> Beta Code Manual 2004 \|date=June 23, 2004 \|website=~~tlg.uci.edu~~Thesaurus Linguae Graecae \|publisher=[[University of California, Irvine~~\|UCI~~]] \|archive-url=https://web.archive.org/web/20060129172716/http://www.tlg.uci.edu/BCM2004.pdf \|archive-date=January 29, 2006 \|url-status=dead}}</ref> * [[Greeklish]] * [[ISO 843]] (1997)<ref>{{cite web\|url=http://www.biology.uoc.gr/gvd/contents/databases/01c.htm \|title=Transliteration scheme ISO 843 \|website=biology.uoc.gr \|publisher=[[University of Crete~~\|UoC~~]] \|last1=Lefort \|first1=Francois \|last2=Roubelakis-Angelakis \|first2=Kalliopi A. \|archive-url=https://web.archive.org/web/20041210021506/http://www.biology.uoc.gr/gvd/contents/databases/01c.htm \|archive-date=December 10, 2004 \|url-status=dead}}</ref> ===Hebrew=== Line 75 ⟶ 620: The [[Hebrew alphabet]] is romanized using several standards: * [[American National Standards Institute\|ANSI]] Z39.25 (1975) * [[UNGEGN]] (1977)<ref>{{cite web\|url=http://www.eki.ee/wgrs/rom1_he.pdf \|title=Hebrew \|~~publisher~~work=Eki.ee \|access-date=2015-07-02}}</ref> * [[ISO 259]] (1984): Transliteration. * [[ISO 259-2]] (1994): Simplified transliteration. * ISO/DIS 259-3: Phonemic transcription. * [[ALA-LC romanization\|ALA-LC]]<ref>{{cite web\|url=https://www.loc.gov/catdir/cpso/romanization/hebrew.pdf \|title=Hebrew and Yiddish \|publisher=~~Loc.gov~~[[Library of Congress]] \|access-date=2015-07-02}}</ref> ===Indic (Brahmic) scripts=== {{See also\|Devanagari transliteration\|Romanization of Bengali\|~~Romanization~~Romanisation of Malayalam}} The [[Brahmic family]] of [[abugida]]s is used for languages of the Indian subcontinent and south-east Asia. There is a long tradition in the west to study [[Sanskrit]] and other Indic texts in Latin transliteration. Various transliteration conventions have been used for Indic scripts since the time of Sir William Jones.<ref>{{cite web \|author=Gabriel Pradīpaka \|url=http://www.sanskrit-sanscrito.com.ar/english/sanskrit/sanskrit3part2.html \|archive-url=https://web.archive.org/web/20040315080622/http://www.sanskrit-sanscrito.com.ar/english/sanskrit/sanskrit3part2.html \|url-status=dead \|archive-date=2004-03-15 \|title=ASanskrit ~~comparison~~3: ofcomparing ~~some~~transliteration ~~of them~~systems \|~~publisher~~work=Sanskrit-~~sanscrito~~Sanscrito.com.ar \|access-date=2013-04-25 }}</ref> * [[ISO 15919]] (2001): A standard [[transliteration]] convention was codified in the ISO 15919 standard. It uses [[diacritic]]s to map the much larger set of Brahmic [[consonant]]s and [[vowel]]s to the Latin script. The Devanagari-specific portion is very similar to the academic standard, [[IAST]]: "International Alphabet of Sanskrit Transliteration", and to the United States Library of Congress standard, [[ALA-LC romanization\|ALA-LC]],<ref>{{cite web\|url=https://www.loc.gov/catdir/cpso/romanization/hindi.pdf \|title=Hindi \|publisher=~~Loc.gov~~[[Library of Congress]] \|access-date=2015-07-02}}</ref> although there are a few differences * The [[National Library at Kolkata romanization]], intended for the romanization of all [[Brahmic family\|Indic scripts]], is an extension of [[IAST]] * [[Harvard-Kyoto]]: Uses upper and lower case and doubling of letters, to avoid the use of diacritics, and to restrict the range to 7-bit ASCII. Line 93 ⟶ 638: ==== Devanagari–nastaʿlīq (Hindustani) ==== [[Hindustani language\|Hindustani]] is an [[Indo-Aryan language]] with extreme [[digraphia]] and [[diglossia]] resulting from the [[Hindi–Urdu controversy]] starting in the 1800s. Technically, Hindustani itself is recognized by neither the language community nor any governments. Two [[Standard language\|standardized]] [[Register (sociolinguistics)\|registers]], [[Standard Hindi]] and [[Standard Urdu]], are recognized as [[official language]]s in India and Pakistan. However, in practice the situation is, * In Pakistan: Standard (Saaf or Khaalis) Urdu is the "high" variety, whereas Hindustani is the "low" variety used by the masses (called Urdu, written in [[nastaʿlīq script]]). * In India, both Standard (Shuddh) Hindi and Standard (Saaf or Khaalis) Urdu are the "H" varieties (written in [[devanagari]] and nastaʿlīq respectively), whereas Hindustani is the "L" variety used by the masses and written in either devanagari or nastaʿlīq (and called 'Hindi' or 'Urdu' respectively). The digraphia renders any work in either script largely inaccessible to users of the other script, though otherwise Hindustani is a perfectly mutually intelligible language, essentially meaning that any kind of text-based [[open source]] collaboration is impossible among devanagari and nastaʿlīq readers. Initiated in 2011, the Hamari Boli Initiative<ref>{{cite web \|url=http://www.hamariboli.com \|title=~~hamariboli.co~~What is HamariBoli? \|~~publisher~~work=~~Hamariboli.com~~HamariBoli \|date=2011-06-15 \|access-date=2013-04-25 \|url-status=dead \|archive-url=https://web.archive.org/web/20130601231148/http://www.hamariboli.com/ \|archive-date=2013-06-01 }}</ref> is a full-scale open-source [[language planning]] initiative aimed at Hindustani script, style, status & lexical reform and modernization. One of primary stated objectives of Hamari Boli is to relieve Hindustani of the crippling devanagari–nastaʿlīq digraphia by way of romanization.<ref>[http://www.thenews.com.pk/Todays-News-4-84689-Hamari-boli-angraizon-ke-rasmul-khat-mein ''The News International'' - Dec 29, 2011] {{Webarchive\|url=https://web.archive.org/web/20130616173554/http://www.thenews.com.pk/Todays-News-4-84689-Hamari-boli-angraizon-ke-rasmul-khat-mein \|date=June 16, 2013 }} --– "Hamari Boli (our language) is perhaps one of the very first serious undertakings to explore, develop and encourage the growth of Roman script in the use of Urdu/Hindi language."</ref> ===Chinese=== Line 106 ⟶ 651: ====Mandarin==== * [[ALA-LC romanization\|ALA-LC]]: Used to be similar to Wade–Giles,<ref>{{cite web\|url=https://www.loc.gov/catdir/cpso/romanization/chinese.pdf \|title=Chinese \|publisher=~~Loc.gov~~[[Library of Congress]] \|access-date=2015-07-02}}</ref> but converted to [[Hanyu Pinyin]] in 2000<ref>{{cite web\|url=https://www.loc.gov/catdir/pinyin/romcover.html \|title=New Chinese Romanization Guidelines \|publisher=~~Loc.gov~~[[Library of Congress]] \|date=1998-11-03 \|access-date=2013-04-25}}</ref> * [[French School of the Far East#EFEO romanization system\|EFEO]]. Developed by [[École française d'Extrême-Orient]] in the 19th century, used mainly in France. * [[Latinxua Sin Wenz]] (1926): Omitted tone sounds. Used mainly in the [[Soviet Union]] and [[Xinjiang]] in the 1930s. Predecessor of [[Hanyu Pinyin]]. * [[Lessing-Othmer]]: Used mainly in Germany. Line 124 ⟶ 669: # [[Gwoyeu Romatzyh]] (GR, 1928–1986, in Taiwan 1945–1986; Taiwan used Japanese Romaji before 1945), # [[Mandarin Phonetic Symbols II]] (MPS II, 1986–2002), # [[Tongyong Pinyin]] (2002–2008),<ref name="taipei_times_2002-07-11">{{cite news \|~~publisher~~work=Taipei Times \|date=2002-07-11 \|title=Tongyong Pinyin the new system for romanization \|url=http://www.taipeitimes.com/News/taiwan/archives/2002/07/11/147813}}</ref><ref name="peoples_daily_2002-07-12">{{cite news \|work=People's Daily Online \|date=2002-07-12 \|title=Taiwan Authority Concerned Passes Tongyong Pinyin Scheme \|url=http://english.peopledaily.com.cn/200207/12/print20020712_99598.html}}</ref> and # [[Hanyu Pinyin]] (since January 1, 2009).<ref name="taipei_times_2008-09-18">{{cite news \|~~publisher~~work=Taipei Times \|date=2008-09-18 \|title=Hanyu Pinyin to be standard system in 2009 \|url=http://www.taipeitimes.com/News/taiwan/archives/2008/09/18/2003423528}}</ref><ref name="china_post_2008-09-18">{{cite news \|work=The China Post \|date=2008-09-18 \|title=Gov't to improve English-friendly environment \|url=http://www.chinapost.com.tw/taiwan/national/national%20news/2008/09/18/175155/Gov%27t-to.htm \|url-status=dead \|archive-url=https://web.archive.org/web/20080919054355/http://www.chinapost.com.tw/taiwan/national/national%20news/2008/09/18/175155/Gov%27t%2Dto.htm \|archive-date=2008-09-19 }}</ref> =====Singapore===== Line 140 ⟶ 685: * [[Yale romanization of Cantonese\|Yale]] (1942) * [[Cantonese Pinyin]] ====Wu==== {{see also\|Romanization of Wu Chinese}} ====Min Nan or Hokkien==== Line 163 ⟶ 711: * [[Kunrei-shiki romanization\|Kunrei-shiki]] (1937): phonemic transcription. Also adopted as ([[ISO 3602]]). * [[JSL romanization\|JSL]] (1987): phonemic transcription. Named after the book ''Japanese: The Spoken Language'' by Eleanor Jorden. * [[ALA-LC romanization\|ALA-LC]]: Similar to Modified Hepburn<ref>{{cite web \|url=https://www.loc.gov/catdir/cpso/romanization/japanese.pdf \|title=Japanese \|publisher=[[Library of Congress]] \|access-date=2014-09-28}}</ref> * [[Wāpuro rōmaji\|Wāpuro]]: ("word processor romanization") transliteration. Not strictly a system, but a collection of common practices that enables input of Japanese text. Line 171 ⟶ 719: While romanization has taken various and at times seemingly unstructured forms, some sets of rules do exist: * [[McCune–Reischauer]] (MR; 1937?), the first transcription to gain some acceptance. A slightly changed version of MR was the official system for [[Korean language\|Korean]] in [[South Korea]] from 1984 to 2000, and yet a different modification is still the official system in [[North Korea]]. Uses [[breve]]s, [[apostrophe]]s and [[Diaeresis (diacritic)\|diereses]], the latter two indicating orthographic syllable boundaries in cases that would otherwise be ambiguous.<br/>What is called MR may in many cases be any of a number of systems that differ from each other and from the original MR mostly in whether word endings are separated from the stem by a space, a hyphen or – according to McCune's and Reischauer's system – not at all; and if a hyphen or space is used, whether sound change is reflected in a stem's last and an ending's first consonant letter (e.g. ''pur-i'' vs. ''pul-i''). Although mostly irrelevant when transcribing uninflected words, these aberrations are so widespread that any mention of "McCune-Reischauer romanization" may not necessarily refer to the original system as published in the 1930s. ** There is, for example, the [[ALA-LC romanization\|ALA-LC]] / U.S. Library of Congress system, based on MR but with some deviations. Word division is addressed in detail, with a generous use of spaces to separate word endings from stems that is not seen in MR. Syllables of given names are always separated with a hyphen, which is expressly never done by MR. Sound changes are ignored more often than in MR. Distinguishes between '''‘''' and '''’'''.<ref>{{cite web\|url=https://www.loc.gov/catdir/cpso/romanization/korean.pdf \|title=Korean \|publisher=~~Loc.gov~~[[Library of Congress]] \|access-date=2015-07-02}}</ref> Several problems with MR led to the development of the newer systems: * [[Yale romanization of Korean\|Yale]] (1942): This system has become the established standard romanization for Korean among [[linguistics\|linguists]]. Vowel length in old or dialectal pronunciation is indicated by a [[Macron (diacritic)\|macron]]. In cases that would otherwise be ambiguous, orthographic syllable boundaries are indicated with a period. This system also indicates consonants that have disappeared from a word's [[Hangul orthography\|South Korean orthography]] and standard pronunciation. * [[Revised Romanization of Korean]] (RR; 2000): Includes rules both for transcription and for transliteration. South Korea now officially uses this system that was approved in 2000. Road signs and textbooks were required to follow these rules as soon as possible, at a cost estimated by the government to be at least US$20 million. All road signs, names of railway and subway stations on line maps and signs etc. have been changed. The change has been either ignored or [[grandfather clause\|grandfathered]] in some cases, notably the romanization of names and existing companies. RR is generally similar to MR, but uses no diacritics or apostrophes, and uses distinct letters for ㅌ/ㄷ (t/d), ㅋ/ㄱ (k/g), ㅊ/ㅈ (ch/j) and ㅍ/ㅂ (p/b). In cases of ambiguity, orthographic syllable boundaries were intended to be indicated with a [[hyphen]], but this is inconsistently applied in practice. * [[ISO/TR 11941]] (1996): This actually is two different standards under one name: one for North Korea (DPRK) and the other for South Korea (ROK). The initial submission to the ISO was based heavily on Yale and was a joint effort between both states, but they could not agree on the final draft.<ref>{{cite web\|url=http://www.sori.org/hangul/romanizations.html#Roman_Intro \|title=A superficial comparison between the two \|~~publisher~~work=Sori.org \|access-date=2013-04-25}}</ref> * [[Fred Lukoff\|Lukoff]] romanization, developed 1945–47 for his ''Spoken Korean'' coursebooks<ref>{{cite web\|url=http://www.glossika.com/en/dict/korpin.html \|title=Korean Romanization Reference \|website=Glossika.com \|archive-url=https://web.archive.org/web/20060214042244/http://www.glossika.com/en/dict/korpin.html \|archive-date=February 14, 2006 \|url-status=dead}}</ref> Line 185 ⟶ 733: * [[ISO 11940]] 1998 Transliteration * [[ISO 11940-2]] 2007 Transcription * [[ALA-LC romanization\|ALA-LC]]<ref>{{cite web\|url=https://www.loc.gov/catdir/cpso/romanization/thai.pdf \|title=Thai \|publisher=~~Loc.gov~~[[Library of Congress]] \|access-date=2015-07-02}}</ref> ===Nuosu=== The [[Nuosu language]], spoken in southern China, is written with its own script, the [[Yi script]]. The only existing romanisation system is [[Nuosu language#Phonology\|YYPY]] (Yi Yu Pin Yin), which represents tone with letters attached to the end of syllables, as Nuosu forbids codas. It does not use diacritics, and as such due to the large phonemic inventory of Nuosu, it requires frequent use of digraphs, including for monophthong vowels. ===Tibetan=== The [[Tibetan script]] has two official romanization systems: [[Tibetan Pinyin]] (for [[Lhasa Tibetan]]) and [[Roman Dzongkha]] (for [[Dzongkha]]). ===Cyrillic=== Line 201 ⟶ 752: * [[BGN/PCGN romanization of Belarusian]], 1979 ([[United States Board on Geographic Names]] and [[Permanent Committee on Geographical Names for British Official Use]]) * [[Scientific transliteration]], or the ''International Scholarly System'' for [[linguistics]] * [[ALA-LC romanization]], 1997 (American Library Association and Library of Congress):<ref>{{cite web\|url=https://www.loc.gov/catdir/cpso/romanization/beloruss.pdf \|title=Belarusian \|publisher=~~Loc.gov~~[[Library of Congress]] \|access-date=2015-07-02}}</ref> * [[ISO 9]]:1995 * ''[[Instruction on transliteration of Belarusian geographical names with letters of Latin script]]'', 2000 Line 210 ⟶ 761: A system based on [[scientific transliteration]] and [[ISO/R 9:1968]] was considered official in Bulgaria since the 1970s. Since the late 1990s, Bulgarian authorities have switched to the so-called [[Romanization of Bulgarian#Streamlined System\|Streamlined System]] avoiding the use of diacritics and optimized for compatibility with English. This system became mandatory for public use with a law passed in 2009.<ref>State Gazette # 19, Sofia, 13 March 2009. (in Bulgarian)</ref> Where the old system uses <č,š,ž,št,c,j,ă>, the new system uses <ch,sh,zh,sht,ts,y,a>. The new Bulgarian system was endorsed for official use also by UN in 2012,<ref>{{cite web \|url=http://www.eki.ee/wgrs/rom1_bg.htm \|title=UN Romanization of Bulgarian for Geographical Names (1977) \|work=Eki.ee \|access-date=2015-06-27}}</ref> and by [[United States Board on Geographic Names\|BGN]] and [[Permanent Committee on Geographical Names for British Official Use\|PCGN]] in 2013.<ref>{{cite web\|url=http://earth-info.nga.mil/gns/html/Romanization/Romanization_Bulgarian.pdf \|title=Romanization System for Bulgarian, BGN/PCGN 1952 System \|website=earth-info.nga.mil \|publisher=[[National Geospatial-Intelligence Agency~~\|NGA~~]] \|archive-url=https://web.archive.org/web/20071219024011/http://earth-info.nga.mil/gns/html/Romanization/Romanization_Bulgarian.pdf \|archive-date=December 19, 2007 \|url-status=dead}}</ref> ====Kyrgyz==== Line 224 ⟶ 775: There is no single universally accepted system of writing [[Russian language\|Russian]] using the Latin script—in fact there are a huge number of such systems: some are adjusted for a particular target language (e.g. German or French), some are designed as a librarian's transliteration, some are prescribed for Russian travellers' passports; the transcription of some names is purely traditional.   All this has resulted in great reduplication of names.   E.g. the name of the Russian composer [[Pyotr Ilyich Tchaikovsky\|Tchaikovsky]] may also be written as ''Tchaykovsky'', ''Tchajkovskij'', ''Tchaikowski'', ''Tschaikowski'', ''Czajkowski'', ''Čajkovskij'', ''Čajkovski'', ''Chajkovskij'', ''Çaykovski'', ''Chaykovsky'', ''Chaykovskiy'', ''Chaikovski'', ''Tshaikovski'', ''Tšaikovski'', ''Tsjajkovskij'' etc. Systems include: * [[BGN/PCGN romanization\|BGN/PCGN]] (1947): Transliteration system (United States Board on Geographic Names & Permanent Committee on Geographical Names for British Official Use).<ref>{{cite web \|url=http://dspace.dial.pipex.com/town/avenue/vy75/cyrillic.htm \|title=Cyrillic Translations \|~~publisher~~work=~~Dspace~~DSpace.~~dial~~Dial.~~pipex~~Pipex.com \|access-date=2013-04-25 \|url-status=dead \|archive-url=https://web.archive.org/web/20120716180905/http://dspace.dial.pipex.com/town/avenue/vy75/cyrillic.htm \|archive-date=2012-07-16 }}</ref> * [[GOST 16876-71]] (1971): A now defunct Soviet transliteration standard. Replaced by GOST 7.79, which is an [[ISO 9]] equivalent. * [[United Nations]] romanization system for geographical names (1987): Based on [[GOST 16876-71]]. * [[ISO 9]] (1995): Transliteration. From the [[International Organization for Standardization]]. * [[ALA-LC romanization\|ALA-LC]] (1997)<ref>{{cite web\|url=https://www.loc.gov/catdir/cpso/romanization/russian.pdf \|title=Russian \|publisher=~~Loc.gov~~[[Library of Congress]] \|access-date=2015-07-02}}</ref> * ~~[[Volapuk encoding\|~~"Volapuk" encoding]] (1990s): Slang term (it's is not really [[Volapük]]) for a writing method that's is not truly a transliteration, but used for similar goals (see article). * Conventional English transliteration is based to BGN/PCGN, but ~~doesn't~~does not follow a particular standard. Described in detail at [[Romanization of Russian]]. * Streamlined System<ref>{{cite web\|author=Dimiter Dobrev \|url=http://www.metodii.com/ru_Russian_Translit.html \|title=Транслитерация \|~~publisher~~trans-title=Transliteration \|language=ru \|work=Metodii.com \|access-date=2013-04-25}}</ref><ref>[https://web.archive.org/web/20070930163656/http://members.lycos.co.uk/rre/Russian.html Basic] and [http://russian-romanization.weebly.com/ Optimized] {{Webarchive\|url=https://web.archive.org/web/20160412140953/http://russian-romanization.weebly.com/ \|date=2016-04-12 }} Romanization of Russian. 2006–2016.</ref><ref>L. Ivanov. [https://web.archive.org/web/20170707114819/http://russian-romanization.weebly.com/uploads/5/5/3/4/55341127/2017-paper.pdf "Streamlined Romanization of Russian Cyrillic."]. ''Contrastive Linguistics''. XLII (2017) No. 2. pp. 66-73. {{ISSN\|0204-8701}}</ref><ref>[https://interscript.org/systems/bas-rus-Cyrl-Latn-2017-bss Interscript.] Streamlined Romanization of Russian Cyrillic (Basic Streamlined System).</ref><ref>[https://interscript.org/systems/bas-rus-Cyrl-Latn-2017-oss Interscript.] Streamlined Romanization of Russian Cyrillic (Optimized Streamlined System).</ref> for the romanization of Russian. * Comparative transliteration of Russian<ref>{{cite web\|url=http://www.russki-mat.net/trans.htm \|title=Транслитерация русского алфавита \|~~publisher~~trans-title=Transliteration of the Russian alphabet \|language=ru \|work=Russki-mat.net \|access-date=2013-04-25}}</ref> in different languages (Western European, Arabic, Georgian, Braille, Morse) ====Syriac==== Line 242 ⟶ 793: {{See also\|Ukrainian Latin alphabet}} The 2010 Ukrainian National system has been adopted by the UNGEGN in 2012 and by the BGN/PCGN in 2020. It is also very close to the modified (simplified) ALA-LC system, which has remained unchanged since 1941. * [[ALA-LC romanization\|ALA-LC]]<ref>{{cite web\|url=https://www.loc.gov/catdir/cpso/romanization/ukrainia.pdf \|title=Ukrainian \|publisher=~~Loc.gov~~[[Library of Congress]] \|access-date=2015-07-02}}</ref> * [[ISO 9]] * Ukrainian National transliteration<ref>{{cite web\|url=http://www.hostmaster.net.ua/docs/translit/tab_01.jpg \|title=Додаток до рішення № 9 \|website=hostmaster.net.ua \|archive-url=https://web.archive.org/web/20050307110826/http://www.hostmaster.net.ua/docs/translit/tab_01.jpg \|archive-date=March 7, 2005 \|url-status=dead}}</ref> * Ukrainian National and BGN/PCGN systems, at the UN Working Group on Romanization Systems<ref>{{cite web\|url=http://www.eki.ee/wgrs/rom1_uk.pdf \|title=Ukrainian \|~~publisher~~work=Eki.ee \|access-date=2015-07-02}}</ref> * Thomas T. Pedersen's comparison of five systems<ref>{{cite web\|url=http://transliteration.eki.ee/pdf/Ukrainian.pdf \|title=Ukrainian \|~~publisher~~work=Transliteration.~~eki~~Eki.ee \|access-date=2015-07-02}}</ref> ==Overview and summary== Line 288 ⟶ 839: \|- \| E \|e/ɛ\|\| Ε, ΑΙ \|\| Э \|\| ⴻ \|\| , ֱ, י ֵֶ, ֵ, י ֶ \|\| \|\| \|\| エ~~, ヱ~~ \|\| ㅔ \|\|ㄟ \|- \| EO Line 324 ⟶ 875: \|- \| I \| i/ɪ\|\| Η, Ι, Υ, ΕΙ, ΟΙ \|\| И, І \|\| ⵉ \|\| ִ, י ִ \|\| دِ \|\| \|\| イ~~, ヰ~~ \|\| ㅣ \|\|ㄧ \|- \| IY Line 510 ⟶ 1,061: \|- \| WE \| we\|\| \|\| \|\| \|\| \|\| \|\| \|\| ヱ \|\| ㅞ \|\| \|- \| WI \| y/ɥi\|\| \|\| \|\| \|\| \|\| \|\| \|\| ヰ \|\| ㅟ \|\| \|- \| WO Line 557 ⟶ 1,108: * [[Francization]] * [[Gairaigo]] * [[~~Transcription_into_Chinese_characters~~Transcription into Chinese characters#~~Transcription_table~~Transcription table\|Transcription into Chinese]], though standards vary by polity. * [[Sinicization]], specifically [[~~Adoption of Chinese literary culture\|~~adoption of Chinese literary culture]] * [[Latinisation of names]] * [[Semitic romanization]] Line 570 ⟶ 1,121: ; About romanization * [http://www.theelp.org/library.htm IPA for Urdu and Roman Urdu for Mobile and Internet Users (Download)] {{Webarchive\|url=https://web.archive.org/web/20081223115838/http://www.theelp.org/library.htm \|date=2008-12-23 }} * [http://www.microsoft.com/globaldev/tools/translit.mspx Microsoft Transliteration Utility] – A tool for creating, debugging and using transliteration modules from any script to any other script. * Randall Barry (ed.) ''ALA-LC Romanization Tables'' U.S. Library of Congress, 1997, {{ISBN\|0-8444-0940-5}}. (One of the few printed books with lists of romanizations) * [https://www.loc.gov/catdir/cpso/roman.html U.S. Library of Congress Romanization Tables] in PDF format * [http://www.eki.ee/wgrs/ UNGEGN Working Group on Romanization Systems] * [http://unicode.org/cldr/transliteration_guidelines.html Unicode Transliteration Guidelines] {{Webarchive\|url=https://web.archive.org/web/20090328043053/http://unicode.org/cldr/transliteration_guidelines.html \|date=2009-03-28 }} ; Romanization online * [http://ctext.org/pinyin.pl?if=en Chinese Phonetic Conversion Tool] – Converts between Pinyin and other formats * [http://www.cesty.in/transliteration Cyrillic Transliteration and Transcription ONLINE (Cyrillic -> Latin)] * [http://www.eiktub.com eiktub] {{Webarchive\|url=https://web.archive.org/web/20191025200939/http://www.eiktub.com/ \|date=2019-10-25 }} – An Arabic Transliteration Pad * [http://netzum-sorglos.de/software/lingua-translit/ Lingua::Translit] – [[Perl]] module covering a variety of writing systems e.g. Cyrillic or Greek. Provides a lot of standards as well as common transliteration schemes. * [http://www.arabeasy.net/wp-content/uploads/ae.htm Arabeasy] – Arabic Transliteration (free chrome extension exists, also works for Persian, Urdu) * [http://russianeasy.net/aeRUS1.htm Russianeasy] {{Webarchive\|url=https://web.archive.org/web/20160308102351/http://russianeasy.net/aeRUS1.htm \|date=2016-03-08 }} – Russian Transliteration (free chrome extension exists) For Persian Romanization * [https://laatingar.com/dashboard} For Cantonese Romanization * [https://hongkongvision.com/tool/cc_py_conv_en] {{-}}