skip to main content
10.1145/342009.335415acmconferencesArticle/Chapter ViewAbstractPublication PagesmodConference Proceedingsconference-collections
Article
Free access

Influence sets based on reverse nearest neighbor queries

Published: 16 May 2000 Publication History

Abstract

Inherent in the operation of many decision support and continuous referral systems is the notion of the “influence” of a data point on the database. This notion arises in examples such as finding the set of customers affected by the opening of a new store outlet location, notifying the subset of subscribers to a digital library who will find a newly added document most relevant, etc. Standard approaches to determining the influence set of a data point involve range searching and nearest neighbor queries.
In this paper, we formalize a novel notion of influence based on reverse neighbor queries and its variants. Since the nearest neighbor relation is not symmetric, the set of points that are closest to a query point (i.e., the nearest neighbors) differs from the set of points that have the query point as their nearest neighbor (called the reverse nearest neighbors). Influence sets based on reverse nearest neighbor (RNN) queries seem to capture the intuitive notion of influence from our motivating examples.
We present a general approach for solving RNN queries and an efficient R-tree based method for large data sets, based on this approach. Although the RNN query appears to be natural, it has not been studied previously. RNN queries are of independent interest, and as such should be part of the suite of available queries for processing spatial and multimedia data. In our experiments with real geographical data, the proposed method appears to scale logarithmically, whereas straightforward sequential scan scales linearly. Our experimental study also shows that approaches based on range searching or nearest neighbors are ineffective at finding influence sets of our interest.

References

[1]
P. Agrawal. Range searching. In E. Goodman and J. O'Rourke, editors, Handbook of Discrete and Computational Geometry, pages 575-598. CRC Press, Boca Raton, FL, 1997.
[2]
N. Beckmann, H.-P. Kriegel, R. Schneider, and B. Seeger. The R*-tree: An efficient and robust access method for points and rectangles. A CM SIGMOD, pages 322-331, May 23-25 1990.
[3]
T. Brinkhoff, H.-P. Kriegel, and B. Seeger. Efficient processing of spatial joins using R-trees. In Proc. of A CM SIGMOD, pages 237-246, Washington, D.C., May 26-28 1993.
[4]
B. Chazelle and L. J. Guibas. Fractional cascading: I. A data structuring technique. Algorithmica, 1:133-162, 1986.
[5]
K. Fukunaga and P. M. Narendra. A branch and bound algorithm for computing k-nearest neighbors. IEEE Trans. on Computers (TOC), C- 24(7):750-753, July 1975.
[6]
V. Gaede and O. Gunther. Multidimensional access methods. A CM Computing Surveys, 30(2):170- 231, June 1998.
[7]
A. Guttman. R-trees: A dynamic index structure for spatial searching. In Proc. A CM SIGMOD, pages 47-57, Boston, Mass, June 1984.
[8]
G. R. Hjaltason and H. Samet. Incremental distance join algorithms for spatial databases. ACM SIGMOD '98, pages 237-248, June 1998.
[9]
G. R. Hjaltason and H. Samet. Distance browsing in spatial databases. A CM TODS, 24(2):265-318, June 1999.
[10]
K. Jain and V. Vazirani. Primal-dual approximation algorithms for metric facility location and kmedian problems. Proc. /tOth IEEE Foundations of Computer Science (FOCS '99), pages 2-13, 1999.
[11]
D. G. Kirkpatrick. Optimal search in planar subdivisions. SIAM J. Comput., 12:28-35, 1983.
[12]
F. Korn, N. Sidiropoulos, C. Faloutsos, E. Siegel, and Z. Protopapas. Fast nearest-neighbor search in medical image databases. Conf. on Very Large Data Bases (VLDB), pages 215-226, September 1996.
[13]
B. Pagel, H. Six, H. Toben, and P. Widmayer. Towards an analysis of range query performance. In Proe. of A CM SIGA CT-SIGMOD-SIGART Symposium on Principles of Database Systems (PODS), pages 214-221, Washington, D.C., May 1993.
[14]
R. Rajaraman, M. Korupolu, and G. Plaxton. Analysis of a local search heuristic for facility location problems. Proceedings of A CM-SIAM Symposium on Discrete Algorithms (SODA '98), pages 1-10, 1998.
[15]
N. Roussopoulos, S. Kelley, and F. Vincent. Nearest neighbor queries. In Proe. of A CM-SIGMOD, pages 71-79, San Jose, CA, May 1995.
[16]
T. Sellis, N. Roussopoulos, and C. Faloutsos. The R+ tree: A dynamic index for multi-dimensional objects. In Proc. 13th International Co@fence on VLDB, pages 507-518, England, September 1987.
[17]
M. Staid. Closest point problems in computational geometry. In J.-R. Sack and J. Urrutia, editors, Handbook on Computational Geometry. Elsevier Science Publishing, 1997.

Cited By

View all

Index Terms

  1. Influence sets based on reverse nearest neighbor queries

      Recommendations

      Comments

      Information & Contributors

      Information

      Published In

      cover image ACM Conferences
      SIGMOD '00: Proceedings of the 2000 ACM SIGMOD international conference on Management of data
      May 2000
      604 pages
      ISBN:1581132174
      DOI:10.1145/342009
      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Sponsors

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 16 May 2000

      Permissions

      Request permissions for this article.

      Check for updates

      Qualifiers

      • Article

      Conference

      SIGMOD/PODS00
      Sponsor:

      Acceptance Rates

      SIGMOD '00 Paper Acceptance Rate 42 of 248 submissions, 17%;
      Overall Acceptance Rate 785 of 4,003 submissions, 20%

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)199
      • Downloads (Last 6 weeks)33
      Reflects downloads up to 27 Dec 2024

      Other Metrics

      Citations

      Cited By

      View all

      View Options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Login options

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media