skip to main content
research-article

A context-aware citation recommendation model with BERT and graph convolutional networks

Published: 01 September 2020 Publication History

Abstract

With the tremendous growth in the number of scientific papers being published, searching for references while writing a scientific paper is a time-consuming process. A technique that could add a reference citation at the appropriate place in a sentence will be beneficial. In this perspective, the context-aware citation recommendation has been researched for around two decades. Many researchers have utilized the text data called the context sentence, which surrounds the citation tag, and the metadata of the target paper to find the appropriate cited research. However, the lack of well-organized benchmarking datasets, and no model that can attain high performance has made the research difficult. In this paper, we propose a deep learning-based model and well-organized dataset for context-aware paper citation recommendation. Our model comprises a document encoder and a context encoder. For this, we use graph convolutional networks layer, and bidirectional encoder representations from transformers, a pre-trained model of textual data. By modifying the related PeerRead dataset, we propose a new dataset called FullTextPeerRead containing context sentences to cited references and paper metadata. To the best of our knowledge, this dataset is the first well-organized dataset for a context-aware paper recommendation. The results indicate that the proposed model with the proposed datasets can attain state-of-the-art performance and achieve a more than 28% improvement in mean average precision and recall@k.

References

[1]
Bai X, Zhang F, and Lee I Predicting the citations of scholarly paper Journal of Informetrics 2019 13 1 407
[2]
Devlin, J., Chang, M. W., Lee, K., & Toutanova, K. (2018). BERT: Pre-training of deep bidirectional transformers for language understanding. ArXiv e-prints
[3]
Dragomir, B. G. P. M., Radev, R., & Thomas, J. M. (2009). A bibliometric and network analysis of the field of computational linguistics. Journal of the American Society for Information Science and Technology.
[4]
Ebesu, T., Fang, Y. (2017). In Proceedings of the 40th international ACM SIGIR conference on research and development in information retrieval, SIGIR ’17 (pp. 1093–1096). New York, NY: ACM. 10.1145/3077136.3080730.
[5]
He, Q., Kifer, D., Pei, J., Mitra, P., & Giles, C. L. (2011). In Proceedings of the fourth ACM international conference on web search and data mining (pp. 755–764). ACM
[6]
He, Q., Pei, J., Kifer, D., Mitra, P., & Giles, L. (2010). In Proceedings of the 19th international conference on World wide web (pp. 421–430). ACM
[7]
Huang, W., Wu, Z., Liang, C., Mitra, P., & Giles, C. L. (2015). In Proceedings of the twenty-ninth AAAI conference on artificial intelligence (pp. 2404–2410). AAAI Press.
[8]
Kang, D., Ammar, W., Dalvi, B., van Zuylen, M., Kohlmeier, S., Hovy, E., & Schwartz, R. (2018). In Proceedings of the 2018 conference of the North American chapter of the association for computational linguistics: Human language technologies, Volume 1 (Long Papers) (Vol. 1, pp. 1647–1661).
[9]
Kim MC and Chen C A scientometric review of emerging trends and new developments in recommendation systems Scientometrics 2015 104 1 239
[10]
Kingma, D. P., & Ba, J. (2014). Adam: A method for stochastic optimization. arXiv e-prints
[11]
Kipf, T. N., & Welling, M. (2016). Semi-supervised classification with graph convolutional networks. arXiv preprint arXiv:1609.02907
[12]
Kipf, T. N., & Welling, M. (2016). Variational graph auto-encoders, NIPS Workshop on Bayesian Deep Learning.
[13]
Le, Q., & Mikolov, T. (2014). Distributed representations of sentences and documents. arXiv preprint arXiv:1405.4053
[14]
Liu H, Kong X, Bai X, Wang W, Bekele TM, and Xia F Context-based collaborative filtering for citation recommendation IEEE Access 2015 3 1695
[15]
Moed HF Measuring contextual citation impact of scientific journals Journal of Informetrics 2010 4 3 265
[16]
Niepert, M., Ahmed, M., & Kutzkov, K. (2016). Learning convolutional neural networks for graphs. arXiv:1605.05273
[17]
Radev, D. R., Muthukrishnan, P., & Qazvinian, V. (2009). In Proceedings, ACL workshop on natural language processing and information retrieval for digital libraries. Singapore
[18]
Radev, D., Muthukrishnan, P., Qazvinian, V., & Abu-Jbara, A. (2013). The ACL anthology network corpus, language resources and evaluation pp. 1–26. 10.1007/s10579-012-9211-2.
[19]
Rezende, D. J., Mohamed, S., & Wierstra, D. (2014). In Proceedings of the 31st international conference on international conference on machine learning—Volume 32 (JMLR.org), ICML’14 (pp. II–1278–II–1286). http://dl.acm.org/citation.cfm?id=3044805.3045035
[20]
Rokach, L., Mitra, P., Kataria, S., Huang, W., & Giles, L. (2013). A supervised learning method for context-aware citation recommendation in a large corpus. INVITED SPEAKER: Analyzing the Performance of Top-K Retrieval Algorithms, 1978.
[21]
Tang, X., Wan, X., & Zhang, X. (2014). In Proceedings of the 37th international ACM SIGIR conference on research & development in information retrieval, SIGIR ’14 (pp. 817–826). New York, NY: ACM. 10.1145/2600428.2609564.
[22]
Tan J, Wan X, Liu H, and Xiao J Quoterec: Toward quote recommendation for writing ACM Transactions on Information Systems 2018 36 34 1
[23]
Yang L, Zheng Y, Cai X, Dai H, Mu D, Guo L, and Dai T A LSTM based model for personalized context-aware citation recommendation IEEE Access 2018 6 59618

Cited By

View all

Index Terms

  1. A context-aware citation recommendation model with BERT and graph convolutional networks
            Index terms have been assigned to the content through auto-classification.

            Recommendations

            Comments

            Information & Contributors

            Information

            Published In

            cover image Scientometrics
            Scientometrics  Volume 124, Issue 3
            Sep 2020
            987 pages

            Publisher

            Springer-Verlag

            Berlin, Heidelberg

            Publication History

            Published: 01 September 2020
            Received: 06 September 2019

            Author Tags

            1. Paper citation
            2. Citation recommendation
            3. BERT
            4. Deep learning
            5. Transformer
            6. Graph convolution network

            Author Tag

            1. 68U15

            Qualifiers

            • Research-article

            Funding Sources

            Contributors

            Other Metrics

            Bibliometrics & Citations

            Bibliometrics

            Article Metrics

            • Downloads (Last 12 months)0
            • Downloads (Last 6 weeks)0
            Reflects downloads up to 09 Jan 2025

            Other Metrics

            Citations

            Cited By

            View all

            View Options

            View options

            Media

            Figures

            Other

            Tables

            Share

            Share

            Share this Publication link

            Share on social media