Machine Transliteration
1,409 Followers
Recent papers in Machine Transliteration
Language transliteration is one of the important areas in NLP. Transliteration is very useful for converting the named entities (NEs) written in one script to another script in NLP applications like Cross Lingual Information Retrieval... more
У монографії подано, обґрунтовано та розвинено комплексний пiдxiд до вживання і творення українських тeрмiнiв, заснований на використанні методів природничих наук і мовознавчих аспектів дослідження. Такий підхід створює передумови для... more
Let me present to you my study “Word Order Errors in English and Bulgarian Machine-Translated Texts”. My work contains Introduction, Three chapters, Conclusion, Appendix and Bibliography. The object, the main goal and tasks of the... more
This paper proposes a novel approach for multilingual query processing, wherein we propose a phonetic distance based measure, for searching proper name data in Indian language scripts. The systems allows query in a language of user's... more
AWEJ Volume.5 Number.2, 2014 Pp. 142-152 The Impact of... more
Globalization and growth of Internet users truly demands for almost all internet based applications to support local languages. Support of local languages can be given in all internet based applications by means of Machine Transliteration... more
In Sanskrit study and research in Thailand, transliteration from Sanskrit documents and proofreading is a difficult and time-consuming task since Sanskrit documents contain a great deal of content. To find solutions, this study presented... more
This paper describes our submission for FIRE 2014 Shared Task on Transliterated Search. The shared task features two subtasks: Query word labeling and Mixed-script Ad hoc retrieval for Hindi Song Lyrics. Query Word Labeling is on token... more
This paper presents a design for rule-based machine translation system for English to Marathi language pair. The machine translation system will take input script as English sentence and parse with the help of Stanford parser. The... more
This article discusses basic approaches to transcribe foreign and borrowed words in Ukrainian, and Russian, Belarusian, and Ukrainian words in Latin script. It is argued that the adopted and foreign words should be rendered on different... more
e-Governance and Web based online commercial multilingual applications has given utmost importance to the task of translation and transliteration. The Named Entities and Technical Terms occur in the source language of translation are... more
Transliteration may be defined as the alteration of text from one language to another. In this paper, we have discussed how transliteration is useful in handling unknown words in Named Entity Recognition (NER). We have shown some of our... more
Hindi is the lingua-franca of India. Although all non-native speakers can communicate well in Hindi, there are only a few who can read and write in it. In this work, we aim to bridge this gap by building transliteration systems that could... more
We have developed a finite state transducer based transliteration engine called Konkanverter that performs statistical machine transliteration between three different scripts used to write the Konkani language. The statistical machine... more
The Web provides the largest, exploitable collection of language use. If we can mine the Web to build abstract models of language use, these models may have many applications. Here we present one example of using the implicit intelligence... more
Translatability in natural languages today supports, and can only be understood in the context of, economic globalization, and the universalization of market logic. ‘Untranslatability’, as it is most often construed, does not provide a... more
Transliteration may be defined as the process of mapping sounds in a text written in one language to another language. Current paper discusses about transliteration and its use in Named Entity Recognition. We have designed a code that... more
Much of the user generated content on the internet is written in their transliterated form instead of in their indigenous script. Due to this search engines receive a large number of transliterated search queries. This paper presents... more
Machine Translation for Indian languages is an emerging research area. Transliteration is one such module that we design while designing a translation system. Transliteration means mapping of source language text into the target language.... more
Translatability in natural languages today supports, and can only be understood in the context of, economic globalization, and the universalization of market logic. ‘Untranslatability’, as it is most often construed, does not provide a... more
We describe ScriptTranscriber, an open source toolkit for extracting transliterations in comparable corpora from lan- guages written in different scripts. The system includes various methods for extracting potential terms of interest from... more
The present article investigates the influence of concept transfer and translation on the political discourse used in Morocco. It has been attested that the Moroccan political register suffers from an acute lexical gap to express a number... more
Transliteration may be defined as the alteration of text from one language to another. In this paper, we have discussed how transliteration is useful in handling unknown words in Named Entity Recognition (NER). We have shown some of our... more
Computational LFG grammar in development in Konstanz Aim: large-scale LFG grammar for parsing Urdu/Hindi Grammar is part of the ParGram project Collaborative, world-wide research project Devoted to developing parallel LFG grammars for a... more
В настоящия доклад се прави опит да се отчетат и опишат особеностите в словореда на българския и английския език, които биха довели до допускане на грешки при превод на текстове, както от човек, така и чрез машинен превод.
In this paper, we propose efficient and less resource-intensive strategies for parsing of code-mixed data. These strategies are not constrained by in-domain annotations , rather they leverage pre-existing monolingual annotated resources... more
Computational LFG grammar in development in Konstanz Aim: large-scale LFG grammar for parsing Urdu/Hindi Grammar is part of the ParGram project Collaborative, world-wide research project Devoted to developing parallel LFG grammars for a... more
Machine Transliteration has come out to be an emerging and a very important research area in the field of machine translation. Transliteration basically aims to preserve the phonological structure of words. Proper transliteration of name... more
e-Governance and Web based online commercial multilingual applications has given utmost importance to the task of translation and transliteration. The Named Entities and Technical Terms occur in the source language of translation are... more
Our work is based on the system “Transscriba” [11]. This is a rule-based system, that means that transliteration model is a set of rules, constructed manually by expert. Such approach provides very high accuracy of transliteration, but it... more
An important prerequisite for data-driven Machine Translation (MT) systems is the availability of high-quality training data. Corpus-based MT systems requiring domain specificity additionally require the selection of a large training... more
Phonetic typing using the English alphabet has become widely popular nowadays for social media and chat services. As a result, a text containing various English and Bangla words and phrases has become increasingly common. Existing... more