DOI: 10.1145/342009
SIGMOD '00: Proceedings of the 2000 ACM SIGMOD international conference on Management of data
ACM 2000 Proceeding
Publisher:
  • Association for Computing Machinery
  • New York
  • NY
  • United States
Conference:
SIGMOD/PODS00: ACM International Conference on Management of Data and Symposium on Principles of Database Systems, Dallas, Texas, USA, May 15-18, 2000
ISBN:
978-1-58113-217-5
Published:
16 May 2000

Article
Free
Mining frequent patterns without candidate generation

Mining frequent patterns in transaction databases, time-series databases, and many other kinds of databases has been studied popularly in data mining research. Most of the previous studies adopt an Apriori-like candidate set generation-and-test ...

Article
Free
Data mining on an OLTP system (nearly) for free

This paper proposes a scheme for scheduling disk requests that takes advantage of the ability of high-level functions to operate directly at individual disk drives. We show that such a scheme makes it possible to support a Data Mining workload on an ...

Article
Free
Turbo-charging vertical mining of large databases

In a vertical representation of a market-basket database, each item is associated with a column of values representing the transactions in which it is present. The association-rule mining algorithms that have been recently proposed for this ...

Article
Free
High speed on-line backup when using logical log operations

Media recovery protects a database from failures of the stable medium by maintaining an extra copy of the database, called the backup, and a media recovery log. When a failure occurs, the database is “restored” from the backup, and the media recovery ...

Article
Free
Efficient resumption of interrupted warehouse loads

Data warehouses collect large quantities of data from distributed sources into a single repository. A typical load to create or maintain a warehouse processes GBs of data, takes hours or even days to execute, and involves many complex and user-defined ...

Article
Free
On-line reorganization in object databases

Reorganization of objects in an object database is an important component of several operations like compaction, clustering, and schema evolution. The high availability requirements (24 × 7 operation) of certain application domains require ...

Article
Free
Finding generalized projected clusters in high dimensional spaces

High dimensional data has always been a challenge for clustering algorithms because of the inherent sparsity of the points. Recent research results indicate that in high dimensional data, even the concept of proximity or clustering may not be ...

Article
Free
Density biased sampling: an improved method for data mining and clustering

Data mining in large data sets often requires a sampling or summarization step to form an in-core representation of the data that can be processed more efficiently. Uniform random sampling is frequently used in practice and also frequently criticized ...

Article
Free
LOF: identifying density-based local outliers

For many KDD applications, such as detecting criminal activities in E-commerce, finding the rare instances or the outliers can be more interesting than finding the common patterns. Existing work in outlier detection regards being an outlier as a binary ...

Article
Free
Answering complex SQL queries using automatic summary tables

We investigate the problem of using materialized views to answer SQL queries. We focus on modern decision-support queries, which involve joins, arithmetic operations and other (possibly user-defined) functions, aggregation (often along multiple ...

Article
Free
Synchronizing a database to improve freshness

In this paper we study how to refresh a local copy of an autonomous data source to maintain the copy up-to-date. As the size of the data grows, it becomes more difficult to maintain the copy “fresh,” making it crucial to synchronize the copy ...

Article
Free
How to roll a join: asynchronous incremental view maintenance

Incremental refresh of a materialized join view is often less expensive than a full, non-incremental refresh. However, it is still a potentially costly atomic operation. This paper presents an algorithm that performs incremental view maintenance as a ...

Article
Free
On wrapping query languages and efficient XML integration

Modern applications (Web portals, digital libraries, etc.) require integrated access to various information sources (from traditional DBMS to semistructured Web repositories), fast deployment and low maintenance cost in a rapidly evolving environment. ...

Article
Free
XMill: an efficient compressor for XML data

We describe a tool for compressing XML data, with applications in data exchange and archiving, which usually achieves about twice the compression ratio of gzip at roughly the same speed. The compressor, called XMill, incorporates and combines existing ...

Article
Free
XTRACT: a system for extracting document type descriptors from XML documents

XML is rapidly emerging as the new standard for data representation and exchange on the Web. An XML document can be accompanied by a Document Type Descriptor (DTD) which plays the role of a schema for an XML data collection. DTDs contain valuable ...

Article
Free
Spatial join selectivity using power laws

We discovered a surprising law governing the spatial join selectivity across two sets of points. An example of such a spatial join is “find the libraries that are within 10 miles of schools”. Our law dictates that the number of such qualifying pairs ...

Article
Free
Closest pair queries in spatial databases

This paper addresses the problem of finding the K closest pairs between two spatial data sets, where each set is stored in a structure belonging to the R-tree family. Five different algorithms (four recursive and one iterative) are presented for solving ...

Article
Free
Influence sets based on reverse nearest neighbor queries

Inherent in the operation of many decision support and continuous referral systems is the notion of the “influence” of a data point on the database. This notion arises in examples such as finding the set of customers affected by the opening of a new ...

Article
Free
MOCHA: a self-extensible database middleware system for distributed data sources

We present MOCHA, a new self-extensible database middleware system designed to interconnect distributed data sources. MOCHA is designed to scale to large environments and is based on the idea that some of the user-defined functionality in the system ...

Article
Free
Towards self-tuning data placement in parallel database systems

Parallel database systems are increasingly being deployed to support the performance demands of end-users. While declustering data across multiple nodes facilitates parallelism, initial data placement may not be optimal due to skewed workloads and ...

Article
Free
LH*RS: a high-availability scalable distributed data structure using Reed Solomon Codes

LH*RS is a new high-availability Scalable Distributed Data Structure (SDDS). The data storage scheme and the search performance of LH*RS are basically those of LH*. LH*RS additionally manages the parity information to tolerate the unavailability of k ≥ 1 ...

Article
Free
Efficient and extensible algorithms for multi query optimization

Complex queries are becoming commonplace, with the growing use of decision support systems. These complex queries often have a lot of common sub-expressions, either within a single query, or across multiple such queries run as a batch. Multiquery ...

Article
Free
Eddies: continuously adaptive query processing

In large federated and shared-nothing databases, resources can exhibit widely fluctuating characteristics. Assumptions made at the time a query is submitted will rarely hold throughout the duration of query processing. As a result, traditional static ...

Article
Free
A chase too far?

In a previous paper we proposed a novel method for generating alternative query plans that uses chasing (and back-chasing) with logical constraints. The method brings together use of indexes, use of materialized views, semantic optimization and join ...

Article
Free
WSQ/DSQ: a practical approach for combined querying of databases and the Web

We present WSQ/DSQ (pronounced “wisk-disk”), a new approach for combining the query facilities of traditional databases with existing search engines on the Web. WSQ, for Web-Supported (Database) Queries, leverages results from Web searches to enhance ...

Article
Free
A framework for expressing and combining preferences

The advent of the World Wide Web has created an explosion in the available on-line information. As the range of potential choices expands, the time and effort required to sort through them also expand. We propose a formal framework for expressing and ...

Article
Free
Microsoft TerraServer: a spatial data warehouse

Microsoft® TerraServer stores aerial, satellite, and topographic images of the earth in a SQL database available via the Internet. It is the world's largest online atlas, combining eight terabytes of image data from the United States Geological Survey (...

Article
Free
A data model and data structures for moving objects databases

We consider spatio-temporal databases supporting spatial objects with continuously changing position and extent, termed moving objects databases. We formally define a data model for such databases that includes complex evolving spatial structures such ...

Article
Free
Indexing the positions of continuously moving objects

The coming years will witness dramatic advances in wireless communications as well as positioning technologies. As a result, tracking the changing positions of objects capable of continuous movement is becoming increasingly feasible and necessary. The ...

Article
Free
Adaptive multi-stage distance join processing

A spatial distance join is a relatively new type of operation introduced for spatial and multimedia database applications. Additional requirements for ranking and stopping cardinality are often combined with the spatial distance join in on-line query ...

Contributors
  • Southern Methodist University
  • University of Toronto

Acceptance Rates

SIGMOD '00 Paper Acceptance Rate: 42 of 248 submissions, 17%
Overall Acceptance Rate: 785 of 4,003 submissions, 20%

Year         Submitted  Accepted  Rate
SIGMOD '19   430        88        20%
SIGMOD '18   461        90        20%
SIGMOD '15   415        106       26%
SIGMOD '14   421        107       25%
SIGMOD '13   372        76        20%
SIGMOD '12   289        48        17%
SIGMOD '03   342        53        15%
SIGMOD '02   240        42        18%
SIGMOD '01   293        44        15%
SIGMOD '00   248        42        17%
SIGMOD '97   202        42        21%
SIGMOD '96   290        47        16%
Overall      4,003      785       20%