Exploiting Sentential Context for Neural Machine Translation

Wang, Xing; Tu, Zhaopeng; Wang, Longyue; Shi, Shuming

arXiv:1906.01268 (cs)

[Submitted on 4 Jun 2019]

Abstract: In this work, we present novel approaches to exploit sentential context for neural machine translation (NMT). Specifically, we first show that a shallow sentential context, extracted from the top encoder layer only, can improve translation performance by contextualizing the encoding representations of individual words. Next, we introduce a deep sentential context, which aggregates the sentential context representations from all the internal layers of the encoder to form a more comprehensive context representation. Experimental results on the WMT14 English-to-German and English-to-French benchmarks show that our model consistently improves performance over the strong TRANSFORMER model (Vaswani et al., 2017), demonstrating the necessity and effectiveness of exploiting sentential context for NMT.
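
The difference between the shallow and the deep sentential context can be illustrated with a minimal sketch. The abstract does not state how a layer is summarized into a sentence vector or how the per-layer vectors are aggregated, so the mean pooling and the plain averaging below are assumptions, and all function names are hypothetical. This is not the authors' implementation.

```python
from typing import List

Vector = List[float]
Layer = List[Vector]  # one hidden vector per source token


def mean_pool(states: Layer) -> Vector:
    """Average the token vectors of one layer into one sentence vector.

    Mean pooling is an assumption; the abstract does not name the operator.
    """
    n = len(states)
    dim = len(states[0])
    return [sum(tok[d] for tok in states) / n for d in range(dim)]


def shallow_context(layers: List[Layer]) -> Vector:
    """Sentence vector taken from the top encoder layer only."""
    return mean_pool(layers[-1])


def deep_context(layers: List[Layer]) -> Vector:
    """Sentence vector aggregated over all encoder layers.

    A plain average over layers is an assumption made for illustration.
    """
    per_layer = [mean_pool(layer) for layer in layers]
    dim = len(per_layer[0])
    return [sum(v[d] for v in per_layer) / len(per_layer) for d in range(dim)]


# Toy encoder output: 2 layers, 2 tokens, 2 dimensions (made-up values).
layers = [
    [[1.0, 2.0], [3.0, 4.0]],  # bottom layer
    [[5.0, 6.0], [7.0, 8.0]],  # top layer
]
shallow = shallow_context(layers)
deep = deep_context(layers)
```

In this toy setup the shallow vector depends only on the top layer, while the deep vector changes whenever any layer changes, which is the distinction the abstract draws between the two variants.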

Comments:	Accepted by ACL 2019
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:1906.01268 [cs.CL]
	(or arXiv:1906.01268v1 [cs.CL] for this version)
 	https://doi.org/10.48550/arXiv.1906.01268

Submission history

From: Xing Wang
[v1] Tue, 4 Jun 2019 08:29:33 UTC (453 KB)
