DOI: 10.1145/99412.99453

An evaluation of type-10 homograph discrimination at the semi-colon level in Roget's International Thesaurus

Published: 01 February 1990

Abstract

This paper reports the results of evaluating a large sample of the 23,858 type-10 homographs, as defined by the Bryan model of abstract thesauri, found in Roget's International Thesaurus (3rd ed.), which is an instantiation of that model. According to the Bryan model, two different entries in a thesaurus that have the same spelling are homographs (semantically unrelated) if and only if they cannot be the endpoints of a sequence of entries called a type-10 chain. Until recently, the Bryan definition of a type-10 homograph had not been tested thoroughly because of the combinatorial complexity of applying the definition directly to a large instantiation such as Roget's. In 1989, however, the authors were able to decompose Roget's into its type-10 components and, as a result, generate all 23,858 type-10 homographs at the semi-colon category level.
The principal result is that Bryan's definition of homographs by type-10 semantic disjunction does not appear to work uniformly over a broad range of entries in Roget's when the selected semantic category is the semi-colon group. Although there are many cases where type-10 homographs agree with conventional classifications, type-10 discrimination at the semi-colon level generally "over-discriminates": it generates many more homographs than are found in standard English-language dictionaries.
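
The homograph criterion stated in the abstract can be illustrated with a minimal sketch. It does not reproduce the authors' decomposition method, and it does not define a type-10 chain: the `links` argument is a hypothetical input standing in for pairs of entries already known to be joined by such a chain. The sketch also assumes, as a simplification, that chain connectivity is symmetric and transitive, so that components can be built with a union-find structure. The entry identifiers and sample words are invented for illustration.

```python
from collections import defaultdict
from itertools import combinations


def find(parent, x):
    """Return the component representative of x, with path halving."""
    while parent[x] != x:
        parent[x] = parent[parent[x]]
        x = parent[x]
    return x


def homograph_pairs(entries, links):
    """List same-spelling entry pairs that fall in different components.

    entries: dict mapping a hypothetical entry id to its spelling.
    links:   iterable of (id, id) pairs assumed to be chain-connected;
             how such links are derived is NOT modeled here.
    """
    parent = {e: e for e in entries}
    for a, b in links:
        ra, rb = find(parent, a), find(parent, b)
        if ra != rb:
            parent[ra] = rb  # merge the two components

    # Group entry ids by spelling; only same-spelling entries are candidates.
    by_spelling = defaultdict(list)
    for entry_id, word in entries.items():
        by_spelling[word].append(entry_id)

    pairs = []
    for word, ids in by_spelling.items():
        for a, b in combinations(sorted(ids), 2):
            # Different components: no chain can join the pair.
            if find(parent, a) != find(parent, b):
                pairs.append((word, a, b))
    return sorted(pairs)


# Invented toy data: three entries spelled "bank" in two components.
entries = {1: "bank", 2: "bank", 3: "bank", 4: "shore", 5: "depository"}
links = [(1, 4), (2, 5), (3, 5)]
result = homograph_pairs(entries, links)
```

In this toy input, entries 2 and 3 share a component and so are not reported as a pair, while entry 1 is paired with each of them. Grouping by component first avoids testing every pair of entries for a connecting chain, which is the kind of combinatorial cost the abstract attributes to applying the definition directly.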


Published In

SIGSMALL '90: Proceedings of the 1990 ACM SIGSMALL/PC symposium on Small systems
February 1990
304 pages
ISBN:0897913477
DOI:10.1145/99412

Publisher

Association for Computing Machinery

New York, NY, United States


Conference

Small/PC 90: Conference on Small Systems
March 28 - 30, 1990
Crystal City, Virginia, USA
