Partial-match retrieval using indexed descriptor files

Published: 01 September 1980

Abstract

In this paper we describe a practical method of partial-match retrieval in very large data files. A binary code word, called a descriptor, is associated with each record of the file. These record descriptors are then used to form a derived descriptor for a block of several records, which will serve as an index for the block as a whole; hence, the name “indexed descriptor files.”
First the structure of these files is described and a simple, efficient retrieval algorithm is presented. Then its expected behavior, in terms of storage accesses, is analyzed in detail. Two different file creation procedures are sketched, and a number of ways in which the file organization can be “tuned” to a particular application are suggested.
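
The abstract does not spell out how descriptors are encoded or combined, so the following is only a minimal sketch of the general idea under stated assumptions: each field value is hashed to a few bit positions of a fixed-width code word (superimposed coding), a block descriptor is taken to be the bitwise OR of its record descriptors, and the index has a single level. The names `WIDTH`, `BITS_PER_VALUE`, `value_bits`, `Block`, `build_blocks`, and `partial_match` are hypothetical and do not come from the paper.

```python
import hashlib
from dataclasses import dataclass
from typing import Dict, List

WIDTH = 64          # assumed descriptor width in bits
BITS_PER_VALUE = 2  # assumed number of bits set per field value


def value_bits(field: str, value: str) -> int:
    """Map one (field, value) pair to a few bit positions (hypothetical encoding)."""
    mask = 0
    for i in range(BITS_PER_VALUE):
        digest = hashlib.sha256(f"{field}={value}#{i}".encode()).digest()
        mask |= 1 << (int.from_bytes(digest[:4], "big") % WIDTH)
    return mask


def descriptor(fields: Dict[str, str]) -> int:
    """Superimpose (OR together) the bit codes of every given field value."""
    d = 0
    for field, value in fields.items():
        d |= value_bits(field, value)
    return d


@dataclass
class Block:
    records: List[Dict[str, str]]
    record_descriptors: List[int]
    block_descriptor: int  # assumed here to be the OR of the record descriptors


def build_blocks(records: List[Dict[str, str]], block_size: int) -> List[Block]:
    """Group records into fixed-size blocks and derive one descriptor per block."""
    blocks = []
    for start in range(0, len(records), block_size):
        chunk = records[start:start + block_size]
        descs = [descriptor(r) for r in chunk]
        block_desc = 0
        for d in descs:
            block_desc |= d
        blocks.append(Block(chunk, descs, block_desc))
    return blocks


def covers(candidate: int, query: int) -> bool:
    """True when every bit set in the query descriptor is also set in the candidate."""
    return candidate & query == query


def partial_match(blocks: List[Block], query: Dict[str, str]) -> List[Dict[str, str]]:
    """Return records agreeing with the query on every specified field."""
    q = descriptor(query)
    hits = []
    for block in blocks:
        if not covers(block.block_descriptor, q):
            continue  # no record in this block can match, so skip it entirely
        for record, d in zip(block.records, block.record_descriptors):
            # Descriptors can give false positives, so confirm on the real values.
            if covers(d, q) and all(record.get(k) == v for k, v in query.items()):
                hits.append(record)
    return hits
```

A query such as `partial_match(blocks, {"color": "red"})` leaves the other fields unspecified. A block is read only when its descriptor covers the query descriptor, which is where the saving in storage accesses would come from in this sketch.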

    Published In

    Communications of the ACM, Volume 23, Issue 9
    Sept. 1980
    38 pages
    ISSN: 0001-0782
    EISSN: 1557-7317
    DOI: 10.1145/359007

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Author Tags

    1. bit codes
    2. multiattribute
    3. partial-match
    4. retrieval
    5. storage access cost
