skip to main content
10.1145/3428757.3429118acmotherconferencesArticle/Chapter ViewAbstractPublication PagesiiwasConference Proceedingsconference-collections
short-paper

Effect of Semantic Content Generalization on Pointer Generator Network in Text Summarization

Published: 27 January 2021 Publication History

Abstract

Semantic content generalization is a method for text summarization that reduces the difficulty of training of neural networks by replacing some phrases such as named entities with generalized terms. The semantic content generalization has achieved remarkable results in enhancing the performance of the sequence to sequence attention model. Besides that, the pointer generator network could ease the training of the summarization based on a mechanism that copies words from the original text, which shares a similar idea with semantic content generalization. The purpose of this work is to test and verify the effect of semantic content generalization on the pointer generator network. Therefore, we use the preprocessing of semantic content generalization and then combine it with the pointer generator network. We examine the performance through an experiment using CNN/DailyMail dataset. From the experiment, we found that the semantic content generalization can improve the performance of the pointer generator network.

References

[1]
Christiane Fellbaum (Ed.). 1998. WordNet: An Electronic Lexical Database. MIT Press, Cambridge, MA.
[2]
Jenny Rose Finkel, Trond Grenager, and Christopher Manning. 2005. Incorporating Non-local Information into Information Extraction Systems by Gibbs Sampling. In Proceedings of the 43rd Annual Meeting of the Association for Computational Linguistics (ACL'05). 363--370.
[3]
Fadi Hassan, Josep Domingo-Ferrer, and Jordi Soria-Comas. 2018. Anonymization of Unstructured Data via Named-Entity Recognition. In Modeling Decisions for Artificial Intelligence. 296--305.
[4]
Sébastien Jean, Kyunghyun Cho, Roland Memisevic, and Yoshua Bengio. 2015. On Using Very Large Target Vocabulary for Neural Machine Translation. In Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing (Volume 1: Long Papers). 1--10.
[5]
Panagiotis Kouris, Georgios Alexandridis, and Andreas Stafylopatis. 2019. Abstractive Text Summarization Based on Deep Learning and Semantic Content Generalization. In Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics. 5082--5092.
[6]
Chin-Yew Lin. 2004. ROUGE: A Package for Automatic Evaluation of Summaries. In Text Summarization Branches Out. 74--81.
[7]
George A. Miller. 1995. WordNet: A Lexical Database for English. Commun. ACM 38, 11 (1995), 39--41.
[8]
Ramesh Nallapati, Bowen Zhou, Cicero dos Santos, Çağlar Gülçehre, and Bing Xiang. 2016. Abstractive Text Summarization using Sequence-to-sequence RNNs and Beyond. In Proceedings of The 20th SIGNLL Conference on Computational Natural Language Learning. 280--290.
[9]
Abigail See, Peter J. Liu, and Christopher D. Manning. 2017. Get To The Point: Summarization with Pointer-Generator Networks. In Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). 1073--1083.
[10]
Jingbo Shang, Liyuan Liu, Xiaotao Gu, Xiang Ren, Teng Ren, and Jiawei Han. 2018. Learning Named Entity Tagger using Domain-Specific Dictionary. In Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing. 2054--2064.
[11]
SongShengli, HuangHaitao, and RuanTongxiao. 2019. Abstractive text summarization using LSTM-CNN based deep learning. Multimedia Tools and Applications 78 (2019), 857--875.
[12]
Ilya Sutskever, Oriol Vinyals, and Quoc V Le. 2014. Sequence to Sequence Learning with Neural Networks. In Advances in Neural Information Processing Systems 27. 3104--3112.
[13]
Kristina Toutanova, Dan Klein, Christopher D. Manning, and Yoram Singer. 2003. Feature-Rich Part-of-Speech Tagging with a Cyclic Dependency Network. In Proceedings of the 2003 Human Language Technology Conference of the North American Chapter of the Association for Computational Linguistics. 252--259.
[14]
Kristina Toutanvoa and Christopher D. Manning. 2000. Enriching the Knowledge Sources Used in a Maximum Entropy Part-of-Speech Tagger. In 2000 Joint SIGDAT Conference on Empirical Methods in Natural Language Processing and Very Large Corpora. 63--70.
[15]
Zhaopeng Tu, Zhengdong Lu, Yang Liu, Xiaohua Liu, and Hang Li. 2016. Modeling Coverage for Neural Machine Translation. In Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). 76--85.

Cited By

View all

Index Terms

  1. Effect of Semantic Content Generalization on Pointer Generator Network in Text Summarization

        Recommendations

        Comments

        Information & Contributors

        Information

        Published In

        cover image ACM Other conferences
        iiWAS '20: Proceedings of the 22nd International Conference on Information Integration and Web-based Applications & Services
        November 2020
        492 pages
        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

        In-Cooperation

        • Johannes Kepler University, Linz, Austria

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        Published: 27 January 2021

        Permissions

        Request permissions for this article.

        Check for updates

        Author Tags

        1. Nature language processing
        2. Neural network
        3. Semantic content
        4. Text summarization

        Qualifiers

        • Short-paper
        • Research
        • Refereed limited

        Conference

        iiWAS '20

        Contributors

        Other Metrics

        Bibliometrics & Citations

        Bibliometrics

        Article Metrics

        • Downloads (Last 12 months)2
        • Downloads (Last 6 weeks)1
        Reflects downloads up to 14 Jan 2025

        Other Metrics

        Citations

        Cited By

        View all

        View Options

        Login options

        View options

        PDF

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader

        Media

        Figures

        Other

        Tables

        Share

        Share

        Share this Publication link

        Share on social media