skip to main content
10.1145/2872427.2883068acmotherconferencesArticle/Chapter ViewAbstractPublication PagesthewebconfConference Proceedingsconference-collections
research-article
Public Access

Entity Disambiguation with Linkless Knowledge Bases

Published: 11 April 2016 Publication History

Abstract

Named Entity Disambiguation is the task of disambiguating named entity mentions in natural language text and link them to their corresponding entries in a reference knowledge base (e.g. Wikipedia). Such disambiguation can help add semantics to plain text and distinguish homonymous entities. Previous research has tackled this problem by making use of two types of context-aware features derived from the reference knowledge base, namely, the context similarity and the semantic relatedness. Both features heavily rely on the cross-document hyperlinks within the knowledge base: the semantic relatedness feature is directly measured via those hyperlinks, while the context similarity feature implicitly makes use of those hyperlinks to expand entity candidates' descriptions and then compares them against the query context. Unfortunately, cross-document hyperlinks are rarely available in many closed domain knowledge bases and it is very expensive to manually add such links. Therefore few algorithms can work well on linkless knowledge bases. In this work, we propose the challenging Named Entity Disambiguation with Linkless Knowledge Bases (LNED) problem and tackle it by leveraging the useful disambiguation evidences scattered across the reference knowledge base. We propose a generative model to automatically mine such evidences out of noisy information. The mined evidences can mimic the role of the missing links and help boost the LNED performance. Experimental results show that our proposed method substantially improves the disambiguation accuracy over the baseline approaches.

References

[1]
http://en.wikipedia.org/wiki/wikipedia:wikipedians.
[2]
D. Blei, A. Ng, and M. Jordan. Latent dirichlet allocation. Journal of Machine Learning Research, 3:993--1022, 2003.
[3]
R. Bunescu and M. Pasca. Using encyclopedic knowledge for named entity disambiguation. In Proceedings of EACL, pages 9--16, 2006.
[4]
Z. Cai, K. Zhao, K. Q. Zhu, and H. Wang. Wikification via link co-occurrence. In Proceedings of CIKM, pages 1087--1096, 2013.
[5]
A. Chisholm and B. Hachey. Entity disambiguation with web links. Transactions of the Association for Computational Linguistics, 3:145--156, 2015.
[6]
R. Cilibrasi and P. Vitanyi. The google similarity distance. IEEE Transactions on Knowledge and Data Engineering, 19(3):370--383, 2007.
[7]
S. Cucerzan. Large-scale named entity disambiguation based on wikipedia data. In Proceedings of EMNLP-CoNLL, pages 708--716, 2007.
[8]
N. Dalvi, R. Kumar, and B. Pang. Object matching in tweets with spatial models. In Proceedings of WSDM, pages 43--52, 2012.
[9]
M. Dredze, P. McNamee, D. Rao, A. Gerber, and T. Finin. Entity disambiguation for knowledge base population. In Proceedings of ICCL, pages 277--285, 2010.
[10]
S. Gottipati and J. Jiang. Linking entities to a knowledge base with query expansion. In Proceedings of EMNLP, pages 804--813, 2011.
[11]
X. Han and L. Sun. A generative entity-mention model for linking entities with knowledge base. In Proceedings of ACL-HLT, pages 945--954, 2011.
[12]
X. Han and L. Sun. An entity-topic model for entity linking. In Proceedings of EMNLP, pages 105--115, 2012.
[13]
X. Han, L. Sun, and J. Zhao. Collective entity linking in web text: a graph-based method. In Proceedings of SIGIR, pages 765--774, 2011.
[14]
J. Hoffart, S. Seufert, D. B. Nguyen, M. Theobald, and G. Weikum. Kore: keyphrase overlap relatedness for entity disambiguation. In Proceedings of CIKM, pages 545--554, 2012.
[15]
J. Hoffart, M. Yosef, I. Bordino, H. Fürstenau, M. Pinkal, M. Spaniol, B. Taneva, S. Thater, and G. Weikum. Robust disambiguation of named entities in text. In Proceedings of EMNLP, pages 782--792, 2011.
[16]
Y. Jin, E. Kıcıman, K. Wang, and R. Loynd. Entity linking at the tail: Sparse signals, unknown entities and phrase models. In Proceedings of WSDM, 2014.
[17]
S. Kataria, K. Kumar, R. Rastogi, P. Sen, and S. Sengamedu. Entity disambiguation with hierarchical topic models. In Proceedings of SIGKDD, pages 1037--1045, 2011.
[18]
Y. Li, C. Wang, F. Han, J. Han, D. Roth, and X. Yan. Mining evidences for named entity disambiguation. In Proceedings of SIGKDD, pages 1070--1078, 2013.
[19]
D. Milne and I. Witten. Learning to link with wikipedia. In Proceedings of CIKM, pages 509--518, 2008.
[20]
A. A. Morgan, Z. Lu, X. Wang, A. M. Cohen, J. Fluck, P. Ruch, A. Divoli, K. Fundel, R. Leaman, J. Hakenberg, et al. Overview of biocreative ii gene normalization. Genome biology, 9(Suppl 2):S3, 2008.
[21]
P. Pantel and A. Fuxman. Jigs and lures: Associating web queries with structured entities. In Proceedings of ACL-HLT, pages 83--92, 2011.
[22]
D. Ramage, D. Hall, R. Nallapati, and C. Manning. Labeled lda: A supervised topic model for credit attribution in multi-labeled corpora. In Proceedings EMNLP, pages 248--256, 2009.
[23]
L. Ratinov, D. Roth, D. Downey, and M. Anderson. Local and global algorithms for disambiguation to wikipedia. In Proceedings of ACL, pages 1375--1384, 2011.
[24]
P. Sen. Collective context-aware topic models for entity disambiguation. In Proceedings of WWW, pages 729--738, 2012.
[25]
W. Shen, J. Han, and J. Wang. A probabilistic model for linking named entities in web text with heterogeneous information networks. In Proceedings of SIGMOD, 2014.
[26]
W. Shen, J. Wang, P. Luo, and M. Wang. Linden: linking named entities with knowledge base via semantic knowledge. In Proceedings of WWW, pages 449--458, 2012.
[27]
A. Sil, E. Cronin, P. Nie, Y. Yang, A.-M. Popescu, and A. Yates. Linking named entities to any database. In Proceedings of EMNLP/CoNLL, pages 116--127, 2012.
[28]
V. Varma, V. Bharat, S. Kovelamudi, P. Bysani, G. Santosh, K. Kumar, K. Reddy, K. Kumar, and N. Maganti. Iiit hyderabad at tac 2009. In Proceedings of TAC, 2009.
[29]
W. Zhang, Y. Sim, J. Su, and C. Tan. Entity linking with effective acronym expansion, instance selection and topic modeling. In Proceedings of IJCAI, pages 1909--1914, 2011.
[30]
Z. Zheng, X. Si, F. Li, E. Y. Chang, and X. Zhu. Entity disambiguation with freebase. In Proceedings of WI-IAT, pages 82--89, 2012.

Cited By

View all

Index Terms

  1. Entity Disambiguation with Linkless Knowledge Bases

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Other conferences
    WWW '16: Proceedings of the 25th International Conference on World Wide Web
    April 2016
    1482 pages
    ISBN:9781450341431

    Sponsors

    • IW3C2: International World Wide Web Conference Committee

    In-Cooperation

    Publisher

    International World Wide Web Conferences Steering Committee

    Republic and Canton of Geneva, Switzerland

    Publication History

    Published: 11 April 2016

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. entity disambiguation
    2. evidence mining
    3. generative model
    4. linkless knowledge bases

    Qualifiers

    • Research-article

    Funding Sources

    • NSF
    • Army Research Laboratory

    Conference

    WWW '16
    Sponsor:
    • IW3C2
    WWW '16: 25th International World Wide Web Conference
    April 11 - 15, 2016
    Québec, Montréal, Canada

    Acceptance Rates

    WWW '16 Paper Acceptance Rate 115 of 727 submissions, 16%;
    Overall Acceptance Rate 1,899 of 8,196 submissions, 23%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)62
    • Downloads (Last 6 weeks)6
    Reflects downloads up to 21 Dec 2024

    Other Metrics

    Citations

    Cited By

    View all

    View Options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Login options

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media