skip to main content
10.5555/1596374.1596406dlproceedingsArticle/Chapter ViewAbstractPublication PagesconllConference Proceedingsconference-collections
research-article
Free access

Fine-grained classification of named entities exploiting latent semantic kernels

Published: 04 June 2009 Publication History

Abstract

We present a kernel-based approach for fine-grained classification of named entities. The only training data for our algorithm is a few manually annotated entities for each class. We defined kernel functions that implicitly map entities, represented by aggregating all contexts in which they occur, into a latent semantic space derived from Wikipedia. Our method achieves a significant improvement over the state of the art for the task of populating an ontology of people, although requiring considerably less training instances than previous approaches.

References

[1]
Javier Artiles, Julio Gonzalo, and Satoshi Sekine. 2007. The semeval-2007 weps evaluation: Establishing a benchmark for the web people search task. In Proceedings of the Fourth International Workshop on Semantic Evaluations (SemEval-2007), pages 64--69, Prague, Czech Republic, June. Association for Computational Linguistics.
[2]
Javier Artiles, Julio Gonzalo, and Satoshi Sekine. 2009. Weps 2 evaluation campaign: overview of the web people search clustering task. In 2nd Web People Search Evaluation Workshop (WePS 2009), 18th WWW Conference, Madrid, Spain, April.
[3]
Thorsten Brants and Alex Franz. 2006. Web 1T 5-gram corpus version 1, Linguistic Data Consortium, Philadelphia.
[4]
Philipp Cimiano and Johanna Völker. 2005. Towards large-scale, open-domain and ontology-based named entity classification. In Proceedings of RANLP'05, pages 66-- 166--172, Borovets, Bulgaria.
[5]
Scott C. Deerwester, Susan T. Dumais, Thoms K. Landauer, George W. Furnas, and Richard A. Harshman. 1990. Indexing by latent semantic analysis. Journal of the American Society of Information Science, 41(6):391--407.
[6]
Michael Fleischman and Eduard Hovy. 2002. Fine grained classification of named entities. In Proceedings of the 19th International Conference on Computational Linguistics, Taipei, Taiwan.
[7]
Aldo Gangemi, Roberto Navigli, and Paola Velardi. 2003. Axiomatizing WordNet glosses in the On-toWordNet project. In Proocedings of the Workshop on Human Language Technology for the Semantic Web and Web Services at ISWC 2003, Sanibel Island, Florida.
[8]
Venkatesh Ganti, Arnd C. König, and Rares Vernica. 2008. Entity categorization over large document collections. In KDD '08: Proceeding of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pages 274--282, New York, NY, USA. ACM.
[9]
Alfio Giozzo and Carlo Strapparava. 2005. Domain kernels for text categorization. In Ninth Conference on Computational Natural Language Learning (CoNLL-2005), pages 56--63, Ann Arbor, Michigan, June.
[10]
Claudio Giuliano and Alfio Gliozzo. 2007. Instance based lexical entailment for ontology population. In Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning (EMNLP-CoNLL), pages 248--256.
[11]
Claudio Giuliano and Alfio Gliozzo. 2008. Instance-based ontology population exploiting named-entity substitution. In Proceedings of the 22nd International Conference on Computational Linguistics (Coling 2008), pages 265--272, Manchester, UK, August.
[12]
Alfio Massimiliano Gliozzo, Claudio Giuliano, and Carlo Strapparava. 2005. Domain kernels for word sense disambiguation. In Proceedings of the 43rd Annual Meeting of the Association for Computational Linguistics (ACL'05), pages 403--410, Ann Arbor, Michigan, June.
[13]
Zornitsa Kozareva, Sonia Vazquez, and Andres Montoyo. 2008. Domain information for fine-grained person name categorization. In 9th International Conference on Intelligent Text Processing and Computational Linguistics (CICLing 2008), pages 311--321, Haifa, Israel, 17--23 February.
[14]
Xin Li and Dan Roth. 2005. Learning question classifiers: the role of semantic information. Natural Language Engineering, 12(3):229--249.
[15]
I. Dan Melamed and Philip Resnik. 2000. Tagger evaluation given hierarchical tag sets. Computers and the Humanities, pages 79--84.
[16]
Davide Picca, Alfio Gliozzo, and Massimiliano Ciaramita. 2007. Semantic domains and supersense tagging for domain-specific ontology learning. In David Evans, Sadaoki Furui, and Chantal Soulé-Dupuy, editors, Recherche d'Information Assistée par Ordinateur (RIAO), Pittsburgh, PA, USA, May.
[17]
B. Schölkopf and A. Smola. 2002. Learning with Kernels. MIT Press, Cambridge, Massachusetts.
[18]
J. Shawe-Taylor and N. Cristianini. 2004. Kernel Methods for Pattern Analysis. Cambridge University Press.
[19]
Fabian M. Suchanek, Gjergji Kasneci, and Gerhard Weikum. 2008. YAGO: A large ontology from wikipedia and wordnet. Elsevier Journal of Web Semantics.
[20]
Partha Pratim Talukdar, Joseph Reisinger, Marius Pasca, Deepak Ravichandran, Rahul Bhagat, and Fernando Pereira. 2008. Weakly-supervised acquisition of labeled class instances using graph random walks. In Proceedings of the conference on Empirical Methods in Natural Language Processing (EMNLP), Waikiki, Honolulu, Hawaii, October 25--27.
[21]
Hristo Tanev and Bernardo Magnini. 2006. Weakly supervised approaches for ontology population. In Proceedings of the Eleventh Conference of the European Chapter of the Association for Computational Linguistics (EACL-2006), Trento, Italy.

Cited By

View all

Recommendations

Comments

Information & Contributors

Information

Published In

cover image DL Hosted proceedings
CoNLL '09: Proceedings of the Thirteenth Conference on Computational Natural Language Learning
June 2009
243 pages
ISBN:9781932432299

Publisher

Association for Computational Linguistics

United States

Publication History

Published: 04 June 2009

Qualifiers

  • Research-article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)59
  • Downloads (Last 6 weeks)6
Reflects downloads up to 17 Jan 2025

Other Metrics

Citations

Cited By

View all

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Login options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media