skip to main content
10.1007/11766247_23guideproceedingsArticle/Chapter ViewAbstractPublication PagesConference Proceedingsacm-pubtype
Article

Unsupervised named-entity recognition: generating gazetteers and resolving ambiguity

Published: 07 June 2006 Publication History

Abstract

In this paper, we propose a named-entity recognition (NER) system that addresses two major limitations frequently discussed in the field. First, the system requires no human intervention such as manually labeling training data or creating gazetteers. Second, the system can handle more than the three classical named-entity types (person, location, and organization). We describe the system's architecture and compare its performance with a supervised system. We experimentally evaluate the system on a standard corpus, with the three classical named-entity types, and also on a new corpus, with a new named-entity type (car brands).

References

[1]
Chinchor, N. (1998) MUC-7 Named Entity Task Definition, version 3.5. Proc. of the Seventh Message Understanding Conference.
[2]
Cohen, W. and Fan, W. (1999) Learning Page-Independent Heuristics for Extracting Data from Web Page, Proc. of the International World Wide Web Conference.
[3]
Collins M. and Singer, Y. (1999) Unsupervised Models for Named Entity Classification. Proc. of the Joint SIGDAT Conference on Empirical Methods in Natural Language Processing and Very Large Corpora.
[4]
Etzioni, O., Cafarella, M., Downey, D., Popescu, A.-M., Shaked, T., Soderland, S., Weld, D. S. and Yates, A. (2005) Unsupervised Named-Entity Extraction from the Web: An Experimental Study. Artificial Intelligence, 165, pp. 91-134.
[5]
Evans, R. (2003) A Framework for Named Entity Recognition in the Open Domain. Proc. Recent Advances in Natural Language Processing.
[6]
Hearst, M. (1992) Automatic Acquisition of Hyponyms from Large Text Corpora. Proc. of International Conference on Computational Linguistics.
[7]
Lin, D. and Pantel, P. (2001) Induction of Semantic Classes from Natural Language Text. Proc. of ACM SIGKDD Conference on Knowledge Discovery and Data Mining.
[8]
Ling, C., and Li, C. (1998). Data Mining for Direct Marketing: Problems and Solutions. Proc. International Conference on Knowledge Discovery and Data Mining.
[9]
Mikheev, A. (1999) A Knowledge-free Method for Capitalized Word Disambiguation. Proc. Conference of Association for Computational Linguistics.
[10]
Mikheev, A., Moens, M. and Grover, C. (1999) Named Entity Recognition without Gazetteers. Proc. Conference of European Chapter of the Association for Computational Linguistics.
[11]
Nadeau, D. (2005) Création de surcouche de documents hypertextes et traitement du langage naturel, Proc. Computational Linguistics in the North-East.
[12]
Palmer, D. D. and Day, D. S. (1997) A Statistical Profile of the Named Entity Task. Proc. ACL Conference for Applied Natural Language Processing.
[13]
Petasis, G., Vichot, F., Wolinski, F., Paliouras, G., Karkaletsis, V. and Spyropoulos, C. D. (2001) Using Machine Learning to Maintain Rule-based Named-Entity Recognition and Classification Systems. Proc. Conference of Association for Computational Linguistics.
[14]
Riloff, E. and Jones, R (1999) Learning Dictionaries for Information Extraction using Multi-level Bootstrapping. Proc. of National Conference on Artificial Intelligence.
[15]
Sekine, S., Sudo, K., Nobata, C. (2002) Extended Named Entity Hierarchy, Proc. of the Language Resource and Evaluation Conference.
[16]
Zhu, X., Wu, X. and Chen Q. (2003) Eliminating Class Noise in Large Data-Sets, Proc. of the International Conference on Machine Learning.

Cited By

View all

Recommendations

Comments

Information & Contributors

Information

Published In

cover image Guide Proceedings
AI'06: Proceedings of the 19th international conference on Advances in Artificial Intelligence: Canadian Society for Computational Studies of Intelligence
June 2006
561 pages
ISBN:3540346287
  • Editors:
  • Luc Lamontagne,
  • Mario Marchand

Sponsors

  • CSCSI: Canadian Society for the Computional Studies of Intelligence
  • SCEIO: Soc can pour l'etude de l'intelligence par ordinateur

Publisher

Springer-Verlag

Berlin, Heidelberg

Publication History

Published: 07 June 2006

Qualifiers

  • Article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 31 Jan 2025

Other Metrics

Citations

Cited By

View all

View Options

View options

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media