skip to main content
10.1145/3627915.3628091acmotherconferencesArticle/Chapter ViewAbstractPublication PagescsaeConference Proceedingsconference-collections
research-article

Research on Construction of Chinese Technology Literature Question Answering System Based on Knowledge Graph

Published: 21 December 2023 Publication History

Abstract

In order to improve the knowledge service ability of the library and meet the user's demand for querying Chinese technology literature in natural language. This article proposes a two-stage method to implement the question answering system for Chinese technology literature. In the first stage, the Chinese question classification system based on sentence pattern is designed, and the stacking framework of ensemble learning algorithm is used to classify Chinese question. In the second stage, the pipeline is employed to parse the natural language question sentences and transfer them into Cypher statement. The field of "competitive intelligence" is selected for experiment. The experimental results show that the proposed stacking classification model improves the metric F1 by 10.81% compared with the single model average. Overall, the accuracy of translating Chinese questions into Cypher statements reached 83.96%. The proposed method can effectively convert Chinese questions into Cypher statements and directly return the answer to users. It provides a reference scheme for the application of the question answering based on knowledge map, which can improve the knowledge service ability of the library.

References

[1]
Bornmann L, Mutz R (2015). Growth rates of modern science: A bibliometric analysis based on the number of publications and cited references . Journal of the Association for Information Science & Technology, 66(11), 2215-2222.
[2]
Lu Wei, Qi Yue, Hu Xiaoge, Huang Yongkai and Cheng Qikai (2019). Design and Implementation of Library Automatic Question Answering System. Technology Intelligence Engineering, 5(2): 5-16.
[3]
Guo Jingang (2010). Design and implementation of Chinese automatic question answering system for digital library. Sichuan: University of Electronic Science and Technology of China.
[4]
Qiu X, Sun T, Xu Y, (2020). Pre-trained Models for Natural Language Processing: A Survey. Science China: Technological Sciences, 63(10): 1871-1897.
[5]
Wang Xinlei, Li Shuaichi, Yang Zhihao, (2020). Chinese Knowledge Base Question Answering System Based on Pre-trained Language Model . Journal of Shanxi University (Nat. Sci. Ed.), 43(4): 955-962.
[6]
Xu Li, Wang Daling and Xia Xiufeng (2010). Question Features Exaction Method Based on Syntactic and Semantic Information. Computer Engineering, 36(21): 65-66.
[7]
Metzler D and Croft W B (2005). Analysis of statistical question classification for fact-based questions. Information Retrieval, 8(3): 481-504.
[8]
Blunsom P, Kocik K and Curran J R (2006). Question classification with log-linear models. Research and Development in Information Retrieval. New York:Association for Computing Machinery, 615-616.
[9]
Mishra M, Mishra V K and Sharma H R (2013). Question classification using semantic, syntactic and lexical features. International Journal of Web & Semantic Technology, 4(3): 39-47.
[10]
Wu Yongliang and Zhao Shuliang (2021). Community answer generation based on knowledge graph. Information Sciences, 545: 132-152.
[11]
Ding K, Han HQ, (2021). Research on Question Answering System for COVID-19 Based on Knowledge Graph. 2021 40th Chinese Control Conference (CCC). IEEE, 2021: 4659-4664.
[12]
Li X and Roth D (2006). Learning question classifiers: the role of semantic information. Natural Language Engineering, 12(3): 229-249.
[13]
Bordes A, Chopra S and Weston J (2014). Question answering with subgraph embeddings. Conference on Empirical Methods in Natural Language Processing. Lisbon: Association for Computational Linguistics, 615-620.
[14]
Bao J W, Duan N, Yan Z, (2016). Constraint based question answering with knowledge graph. Proceedings of COLING 2016. Osaka:COLING 2016 Organizing Committee, 2503-2514.
[15]
Cheng K, Shen G H, Huang Z Q, (2021). Improved entity linking for simple question answering over knowledge graph. International Journal of Software Engineering and Knowledge Engineering, 31(1): 55-80.
[16]
Hovy E, Gerber L, Hermjakob U, (2003). Toward Semantics-based answer pinpointing. Proceedings of the First International Conference on Human Language Technology Research. New York: Association for Computing Machinery, 245-256.
[17]
Zhang Yu, Liu Ting and Wen Xu (2015). Modified bayesian model question classification. Journal of Chinese Information Processing, 19(2): 100-105.
[18]
Ali H, Michel B, Kristine L, (2011). Enhanced semantic expansion for question classification. International Journal of Internet Technology and Secured Transactions, 3(2): 134-148.
[19]
Bo Q, Gao C, Li CP, (2012). An evaluation of classification models for question topic categorization. Journal of the American Society for Information Science and Technology, 63(5): 889-903.
[20]
Kalchbrebber N, Grefenstette E and Blunsom P (2014). A convolutional neural network for modelling sentences. Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics. New York: Association for Computing Machinery, 655-665.
[21]
Li L, Zhou Z, Wang B, (2021). A Novel CNN-based Method for Accurate Ship Detection in HR Optical Remote Sensing Images via Rotated Bounding Box. IEEE Transactions on Geoscience and Remote Sensing, 59(1): 686-699.
[22]
Zhang J C and Li J (2018). Topic memory networks for short text classification. Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing. Lisbon: Association for Computational Linguistics, 3120-3131.
[23]
Ujith R and Zornitsa K (2018). Self-governing neural networks for on-device short text classification. Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing. Lisbon: Association for Computational Linguistics, 804-810.
[24]
Du Yue, Chang Zhijun, Dong Mei, (2022). A large-scale knowledge graph construction method for massive sci-tech literature data . Data Analysis and Knowledge Discovery, 6(7): 1-14.
[25]
Wang Li, Han Hongqi, Gao Xiong, (2021). Research on the Application of Relational Database to Neo4j Graph Database: Taking the Engineering Science Word System as an Example. China Science & Technology Resource Review, 53(5): 55-65.
[26]
Ran YX, Han HQ, Zhang YL, (2020). Large scale text hierarchical classification method based on stacking ensemble learning. Information studies: Theory & Application, 43(10): 171-176, 182.
[27]
Dori S and Landau G M (2006). Construction of Aho Corasick automaton in linear time for integer alphabets. Information Processing Letters, 98(2): 66-72.
[28]
Lample G, Ballesteros M, Subramanian S, (2016). Neural Architectures for Entity Recognition. Proceedings of the 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies.
[29]
Devlin J, Chang M W, Lee K, (2019). BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies. Stroudsburg, PA: Association for Computational Linguistics, 4171-4186.
[30]
Fu Yonglin (2014). Research and Implementation of Chinese Digital Library Question Answering System based on Data Driven. Beijing: Beijing University.

Index Terms

  1. Research on Construction of Chinese Technology Literature Question Answering System Based on Knowledge Graph

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Other conferences
    CSAE '23: Proceedings of the 7th International Conference on Computer Science and Application Engineering
    October 2023
    358 pages
    ISBN:9798400700590
    DOI:10.1145/3627915
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 21 December 2023

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. Cypher Statement
    2. Ensemble Learning
    3. Graph Database
    4. Knowledge Graph
    5. Q&A System

    Qualifiers

    • Research-article
    • Research
    • Refereed limited

    Conference

    CSAE 2023

    Acceptance Rates

    Overall Acceptance Rate 368 of 770 submissions, 48%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • 0
      Total Citations
    • 15
      Total Downloads
    • Downloads (Last 12 months)15
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 14 Sep 2024

    Other Metrics

    Citations

    View Options

    Get Access

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    HTML Format

    View this article in HTML Format.

    HTML Format

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media