Only 5% Attention Is All You Need: Efficient Long-range Document-level Neural Machine Translation
Zihan Liu, Zewei Sun, Shanbo Cheng, Shujian Huang, Mingxuan Wang. arXiv, Sep 25, 2023.
Experimental results show that the method reaches approximately 95% sparsity, meaning only 5% of tokens are attended to, and saves about 93% of the attention computation. It preserves translation quality while gaining a 20% speedup by introducing an extra selection layer, based on lightweight attention, that selects the small subset of document-context tokens worth attending to.
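As a rough illustration of how such a selection layer could work, here is a minimal PyTorch sketch (not the authors' released code): a cheap low-dimensional attention scorer ranks the document-context tokens, the top ~5% are kept, and full multi-head attention runs only over that subset. All names (`SelectiveAttention`, `d_light`, `keep_ratio`) and the mean-pooled scoring rule are illustrative assumptions, and the sketch ignores how the selector itself would be trained.

```python
import torch
import torch.nn as nn


class SelectiveAttention(nn.Module):
    """Sketch of sparse document-context attention via a lightweight selector."""

    def __init__(self, d_model: int, n_heads: int, d_light: int = 32,
                 keep_ratio: float = 0.05):
        super().__init__()
        # Cheap low-dimensional projections used only for scoring tokens.
        self.score_q = nn.Linear(d_model, d_light)
        self.score_k = nn.Linear(d_model, d_light)
        # Standard multi-head attention, applied to the selected subset only.
        self.attn = nn.MultiheadAttention(d_model, n_heads, batch_first=True)
        self.keep_ratio = keep_ratio

    def forward(self, query: torch.Tensor, context: torch.Tensor) -> torch.Tensor:
        # query:   (batch, q_len, d_model) current-sentence states
        # context: (batch, c_len, d_model) long document context
        _, c_len, d_model = context.shape
        k = max(1, int(c_len * self.keep_ratio))  # keep ~5% of context tokens

        # Lightweight attention: one importance score per context token,
        # mean-pooled over the query positions.
        q = self.score_q(query)                      # (b, q_len, d_light)
        kk = self.score_k(context)                   # (b, c_len, d_light)
        importance = torch.bmm(q, kk.transpose(1, 2)).mean(dim=1)  # (b, c_len)

        # Select the top-k tokens; the remaining ~95% are never attended to.
        top_idx = importance.topk(k, dim=-1).indices              # (b, k)
        gather_idx = top_idx.unsqueeze(-1).expand(-1, -1, d_model)
        selected = context.gather(1, gather_idx)                  # (b, k, d_model)

        # Full attention now costs O(q_len * k) rather than O(q_len * c_len).
        out, _ = self.attn(query, selected, selected)
        return out


# Example: attend to only ~50 of 1,000 document-context tokens.
layer = SelectiveAttention(d_model=512, n_heads=8)
out = layer(torch.randn(2, 20, 512), torch.randn(2, 1000, 512))
print(out.shape)  # torch.Size([2, 20, 512])
```

Because attention over the selected subset scales with the number of kept tokens rather than the full context length, a 5% keep ratio is what yields the roughly 93% computation saving the paper reports.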
The title nods to "Attention Is All You Need" (Jun 12, 2017), which proposed the Transformer, a simple network architecture based solely on attention mechanisms, dispensing with recurrence and convolutions entirely.