skip to main content
10.3115/981823.981855dlproceedingsArticle/Chapter ViewAbstractPublication PagesaclConference Proceedingsconference-collections
Article
Free access

Automatically extracting and representing collocations for language generation

Published: 06 June 1990 Publication History

Abstract

Collocational knowledge is necessary for language generation. The problem is that collocations come in a large variety of forms. They can involve two, three or more words, these words can be of different syntactic categories and they can be involved in more or less rigid ways. This leads to two main difficulties: collocational knowledge has to be acquired and it must be represented flexibly so that it can be used for language generation. We address both problems in this paper, focusing on the acquisition problem. We describe a program, Xtract, that automatically acquires a range of collocations from large textual corpora and we describe how they can be represented in a flexible lexicon using a unification based formalism.

References

[1]
{Abney 89} S. Abney, "Parsing by Chunks" in C. Tenny, ed., The MIT Parsing Volume, 1989, to appear.
[2]
{Amsler 89} R. Amsler, "Research Towards the Development of a Lexical Knowledge Base for Natural Language Processing" Proceedings of the 1989 SIGIR Conference, Association for Computing Machinery. Cambridge, Ma, June 1989.
[3]
{Benson 86} M. Benson, E. Benson and R. Ilson, Lexicographic Description of English. John Benjamins Publishing Company, Philadelphia, 1986.
[4]
{Boguraev & Briscoe 89} B. Boguraev & T. Briscoe, in Computational Lexicography for natural language processing. B. Boguraev and T. Briscoe editors. Longmans, NY 1989.
[5]
{Choueka 88} Y. Choueka, Looking for Needles in a Haystack. In Proceedings of the RIAO, p:609--623, 1988.
[6]
{Church 88} K. Church, A Stochastic Parts Program and Noun Phrase Parser for Unrestricted Text In Proceedings of the Second Conference on Applied Natural Language Processing, Austin, Texas, 1988.
[7]
{Church 89} K. Church & K. Hanks, Word Association Norms, Mutual Information, and Lexicography. In Proceedings of the 27th meeting of the Association for Computational Linguistics, Vancouver, B.C, 1989.
[8]
{Cruse 86} D. A. Cruse, Lexical Semantics. Cambridge University Press, 1986.
[9]
{Danlos 87} L. Danlos, The Linguistic Basis of Text Generation. Cambridge University Press, 1987.
[10]
Desemer & Jabobs 87} D. Desemer & P. Jacobs, FLUSH: A Flexible Lexicon Design. In proceedings of the 25th Annual Meeting of the ACL, Stanford University, CA, 1987.
[11]
{Elhadad 90} M. Elhadad, Types in Functional Unification Grammars, Proceedings of the 28th meeting of the Association for Computational Linguistics, Pittsburgh, PA, 1990.
[12]
{Garside 87} R. Garside, G. Leech & G. Sampson, editors, The computational Analysis of English, a corpus based approach. Longmans, NY 1987.
[13]
{Gross 75} M. Gross, Méthodes en Syntaxe. Hermann, Paris, France, 1975.
[14]
{Halliday 66} M. A. K. Halliday, Lexis as a Linguistic Level. In C. E. Bazell, J. C. Catford, M. A.K Halliday and R. H. Robins (eds.), In memory of J. R. Firth London: Longmans Linguistics Ila Library, 1966, pp: 148--162.
[15]
{Iordanskaja 88} L. Iordanskaja, R. Kittredge, A. Polguere, Lexical Selection and Paraphrase in a Meaning-Text Generation Model. Presented at the fourth International Workshop on Language Generation, Catalina Island, CA, 1988.
[16]
{Jacobs 85} P. Jacobs, PHRED: a generator for natural language interfaces, Computational Linguistics, volume 11--4, 1985
[17]
{Kay 79} M. Kay, Functional Grammar, in Proceedings of the 5th Meeting of the Berkeley Linguistic Society, Berkeley Linguistic Society, 1979.
[18]
{Klavans 88} J. Klavans, "COMPLEX: a computational lexicon for natural language systems." In proceeding of the 12th International Conference on Computational Linguistics, Budapest, Hungary, 1988.
[19]
{Kukich 83} K. Kukich, Knowledge-Based Report Generation: A Technique for Automatically Generating Natural Language Reports from Databases. Proceedings of the 6th International ACM SIGIR Conference, Washington, DC, 1983.
[20]
{Maarek & Smadja 89} Y.S Maarek & F. A. Smadja, Full Text Indexing Based on Lexical Relations, An Application: Software Libraries, Proceedings of the 12th International ACM SIGIR Conference, Cambridge, Ma, June 1989.
[21]
{Mel'čuk, 81} I. A. Mel'čuk, Meaning-Text Models: a Recent Trend in Soviet Linguistics. The annual review of anthropology, 1981.
[22]
{Nirenburg 88} S. Nirenburg et. al., Lexicon building in natural language processing. In program and abstracts of the 15th International ALLC, Conference of the Association for Literary and Linguistic Computing, Jerusalem, Israel, 1988.
[23]
{Smadja 88} F. A. Smadja, Lexical Co-occurrence: The Missing link. In program and abstracts of the 15th International ALLC, Conference of the Association for Literary and Linguistic Computing, Jerusalem, Israel, 1988. Also in the Journal for Literary and Linguistic computing, Vol. 4, No. 3, 1989, Oxford University Press.
[24]
{Smadja 89a} F. A. Smadja, Microcoding the Lexicon for Language Generation, First International Workshop on Lexical Acquisition, IJCAI'89, Detroit, Mi, August 89. Also in "Lexical Acquisition: Using on-line resources to build a lexicon", MIT press, Uri Zernik editor, to appear.
[25]
{Smadja 89b} F. A. Smadja, On the Use of Flexible Collocations for Language Generation. Columbia University, technical report, TR# CUCS-507-89.

Cited By

View all
  1. Automatically extracting and representing collocations for language generation

      Recommendations

      Comments

      Information & Contributors

      Information

      Published In

      cover image DL Hosted proceedings
      ACL '90: Proceedings of the 28th annual meeting on Association for Computational Linguistics
      June 1990
      324 pages

      Publisher

      Association for Computational Linguistics

      United States

      Publication History

      Published: 06 June 1990

      Qualifiers

      • Article

      Acceptance Rates

      Overall Acceptance Rate 85 of 443 submissions, 19%

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)45
      • Downloads (Last 6 weeks)12
      Reflects downloads up to 03 Feb 2025

      Other Metrics

      Citations

      Cited By

      View all

      View Options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Login options

      Figures

      Tables

      Media

      Share

      Share

      Share this Publication link

      Share on social media