skip to main content
10.3115/974557.974590dlproceedingsArticle/Chapter ViewAbstractPublication PagesanlcConference Proceedingsconference-collections
Article
Free access

Building a generation knowledge source using Internet-accessible newswire

Published: 31 March 1997 Publication History

Abstract

In this paper, we describe a method for automatic creation of a knowledge source for text generation using information extraction over the Internet. We present a prototype system called PROFILE which uses a client-server architecture to extract noun-phrase descriptions of entities such as people, places, and organizations. The system serves two purposes: as an information extraction tool, it allows users to search for textual descriptions of entities; as a utility to generate functional descriptions (FD), it is used in a functional-unification based generation system. We present an evaluation of the approach and its applications to natural language generation and summarization.

References

[1]
John Aberdeen, John Burger, Dennis Connolly, Susan Roberts, and Marc Vilain. 1992. Mitrebedford: Description of the alembic system as used for muc-4. In Proceedings of the Fourth Message Understanding Conference (MUC-4), pages 215--222, McLean, Virginia, June.]]
[2]
Damaris Ayuso, Sean Boisen, Heidi Fox, Herb Gish, Robert Ingria, and Ralph Weischedel. 1992. Bbn: Description of the plum system as used for muc-4. In Proceedings of the Fourth Message Understanding Conference (MUC-4), pages 169--176, McLean, Virginia, June.]]
[3]
Kenneth W. Church. 1988. A stochastic parts program and noun phrase parser for unrestricted text. In Proceedings of the Second Conference on Applied Natural Language Processing (ANLP-88), pages 136--143, Austin, Texas, February. Association for Computational Linguistics.]]
[4]
Sam Coates-Stephens. 1991. Automatic lexical acquisition using within-text descriptions of proper nouns. In Proceedings of the Seventh Annual Conference of the UW Centre for the New OED and Text Research, pages 154--169.]]
[5]
Jim Cowie, Louise Guthrie, Yorick Wilks, James Pustejovsky, and Scott Waterman. 1992. Crl/nmsu and brandeis: Description of the mucbruce system as used for muc-4. In Proceedings of the Fourth Message Understanding Conference (MUC-4), pages 223--232, McLean, Virginia, June.]]
[6]
Darrin Duford. 1993. Crep: a regular expression-matching textual corpus tool. Technical Report CUCS-005--93, Columbia University.]]
[7]
Michael Elhadad. 1991. Fuf: The universal unifier - user manual, version 5.0. Technical Report CUCS-038--91, Columbia University.]]
[8]
Michael Elhadad. 1993. Using argumentation to control lexical choice: a unification-based implementation. Ph.D. thesis, Computer Science Department, Columbia University.]]
[9]
Tim Finin, Rich Fritzson, Don McKay, and Robin McEntire. 1994. KQML - a language and protocol for knowledge and information exchange. Technical Report CS-94--02, Computer Science Department, University of Maryland and Valley Forge Engineering Center, Unisys Corporation.]]
[10]
R. Grishman, C. Macleod, and J. Sterling. 1992. New york university: Description of the proteus system as used for muc-4. In Proceedings of the Fourth Message Understanding Conference, June.]]
[11]
Julian M. Kupiec. 1993. Murax: A robust linguistic approach for question answering using an on-line encyclopedia. In Proceedings, 16th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval.]]
[12]
W. Lehnert, J. McCarthy, S. Soderlan, E. Riloff, C. Cardie, J. Peterson, and F. Feng. 1993. Umass/hughes: Description of the circus system used for muc-5. In Proceedings of the Fifth Message Understanding Conference (MUC-5), pages 277--291, Baltimore, Md., August.]]
[13]
Inderjeet Mani, Richard T. Macmillan, Susann Luperfoy, Elaine Lusher, and Sharon Laskowski. 1993. Identifying unknown proper names in newswire text. In Proceedings of the Workshop on Acquisition of Lexical Knowledge from Text, pages 44--54, Columbus, Ohio, June. Special Interest Group on the Lexicon of the Association for Computational Linguistics.]]
[14]
David D. McDonald. 1993. Internal and external evidence in the identification and semantic cateogrization of proper names. In Proceedings of the Workshop on Acquisition of Lexical Knowledge from Text, pages 32--43, Columbus, Ohio, June. Special Interest Group on the Lexicon of the Association for Computational Linguistics.]]
[15]
Kathleen R. McKeown and Dragomir R. Radev. 1995. Generating summaries of multiple news articles. In Proceedings, 18th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pages 74--82, Seattle, Washington, July.]]
[16]
George A. Miller, Richard Beckwith, Christiane Fellbaum, Derek Gross, and Katherine J. Miller. 1990. Introduction to WordNet: An on-line lexical database. International Journal of Lexicography (special issue), 3(4):235--312.]]
[17]
Message Understanding Conference MUC. 1992. Proceedings of the Fourth Message Understanding Conference (MUC-4). DARPA Software and Intelligent Systems Technology Office.]]
[18]
Woojin Paik, Elizabeth D. Liddy, Edmund Yu, and Mary McKenna. 1994. Interpretation of proper nouns for information retrieval. In Proceedings of the Human Language Technology Workshop, pages 309--313, Plainsboro, New Jersey, March. ARPA Software and Intelligent Systems Technology Office, Morgan Kaufmann.]]
[19]
Dragomir R. Radev. 1996. An architecture for distributed natural language summarization. In Proceedings of the 8th International Workshop on Natural Language Generation: Demonstrations and Posters, pages 45--48, Herstmonceaux, England, June.]]
[20]
Jacques Robin. 1994. Revision-Based Generation of Natural Language Summaries Providing Historical Background. Ph.D. thesis, Computer Science Department, Columbia University.]]
[21]
Frank Smadja and Kathleen R. McKeown. 1991. Using collocations for language generation. Computational Intelligence, 7(4), December.]]
[22]
Ralph Weischedel, Damaris Ayuso, Sean Boisen, Heidi Fox, Robert Ingria, Tomoyoshi Matsukawa, Constantine Papageorgiou, Dawn MacLaughlin, Masaichiro Kitagawa, Tsutomu Sakai, June Abe, Hiroto Hosihi, Yoichi Miyamoto, and Scott Miller. 1993. Bbn: Description of the plum system as used for muc-5. In Proceedings of the Fifth Message Understanding Conference (MUC-5), pages 93--108, Baltimore, Md., August.]]

Cited By

View all
  1. Building a generation knowledge source using Internet-accessible newswire

      Recommendations

      Comments

      Information & Contributors

      Information

      Published In

      cover image DL Hosted proceedings
      ANLC '97: Proceedings of the fifth conference on Applied natural language processing
      March 1997
      417 pages

      Publisher

      Association for Computational Linguistics

      United States

      Publication History

      Published: 31 March 1997

      Qualifiers

      • Article

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)16
      • Downloads (Last 6 weeks)0
      Reflects downloads up to 27 Jan 2025

      Other Metrics

      Citations

      Cited By

      View all

      View Options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Login options

      Figures

      Tables

      Media

      Share

      Share

      Share this Publication link

      Share on social media