Proceedings of the 2001 ACM SIGMOD international conference on Management of data

SIGMOD '01: Proceedings of the 2001 ACM SIGMOD international conference on Management of data

May 2001

2001 Proceeding

Editors:
Timos Sellis,
Sharad Mehrotra

Publisher:

Association for Computing Machinery
New York
NY
United States

Conference:

SIGMOD/PODS01: ACM SIGMOD International Conference on Management of Data Santa Barbara California USA May 21 - 24, 2001

ISBN:

978-1-58113-332-5

Published:

01 May 2001

Sponsors:

SIGMOD

Recommend ACM DL

ALREADY A SUBSCRIBER?SIGN IN

Get Alerts for this ConferenceAlerts Save to BinderBinder

Save to Binder

Create a New Binder

Name

Export CitationCitation

Share on

Reflects downloads up to 27 Dec 2024Bibliometrics

Citation Count

8,355

Downloads (6 weeks)

192

Downloads (12 months)

1,654

Downloads (cumulative)

87,347

Sections

SIGMOD '01: Proceedings of the 2001 ACM SIGMOD international conference on Management of data

2001

Previous Next

Abstract

No abstract available.

Select All

Export Citations Save to Binder

Article

Efficient computation of Iceberg cubes with complex measures

Jiawei Han,
Jian Pei,
Guozhu Dong,
Ke Wang

Pages 1–12https://doi.org/10.1145/375663.375664

It is often too expensive to compute and materialize a complete high-dimensional data cube. Computing an iceberg cube, which contains only aggregates above certain thresholds, is an effective way to derive nontrivial multi-dimensional aggregations for ...

- 167
- 2,836
Metrics
Total Citations167
Total Downloads2,836
Last 12 Months32
Last 6 weeks0

Abstract
Get Access

Article

On computing correlated aggregates over continual data streams

Johannes Gehrke,
Flip Korn,
Divesh Srivastava

Pages 13–24https://doi.org/10.1145/375663.375665

In many applications from telephone fraud detection to network management, data arrives in a stream, and there is a need to maintain a variety of statistical summary information about a large number of customers in an online fashion. At present, such ...

- 208
- 1,160
Metrics
Total Citations208
Total Downloads1,160
Last 12 Months10
Last 6 weeks1

Abstract
Get Access

Article

Iceberg-cube computation with PC clusters

Raymond T. Ng,
Alan Wagner,
Yu Yin

Pages 25–36https://doi.org/10.1145/375663.375666

In this paper, we investigate the approach of using low cost PC cluster to parallelize the computation of iceberg-cube queries. We concentrate on techniques directed towards online querying of large, high-dimensional datasets where it is assumed that ...

- 60
- 945
Metrics
Total Citations60
Total Downloads945
Last 12 Months6
Last 6 weeks0

Abstract
Get Access

Article

Outlier detection for high dimensional data

Charu C. Aggarwal,
Philip S. Yu

Pages 37–46https://doi.org/10.1145/375663.375668

The outlier detection problem has important applications in the field of fraud detection, network robustness analysis, and intrusion detection. Most such applications are high dimensional domains in which the data can contain hundreds of dimensions. ...

- 824
- 8,502
Metrics
Total Citations824
Total Downloads8,502
Last 12 Months298
Last 6 weeks35

Abstract
Get Access

Article

Bit-sliced index arithmetic

Denis Rinfret,
Patrick O'Neil,
Elizabeth O'Neil

Pages 47–57https://doi.org/10.1145/375663.375669

The bit-sliced index (BSI) was originally defined in [ONQ97]. The current paper introduces the concept of BSI arithmetic. For any two BSI's X and Y on a table T, we show how to efficiently generate new BSI's Z, V, and W, such that Z = X + Y, V = X - Y, ...

- 47
- 833
Metrics
Total Citations47
Total Downloads833
Last 12 Months18
Last 6 weeks1

Abstract
Get Access

Article

Space-efficient online computation of quantile summaries

Michael Greenwald,
Sanjeev Khanna

Pages 58–66https://doi.org/10.1145/375663.375670

An ∈-approximate quantile summary of a sequence of N elements is a data structure that can answer quantile queries about the sequence to within a precision of ∈N.

We present a new online algorithm for computing∈-approximate quantile summaries of very ...

- 451
- 2,808
Metrics
Total Citations451
Total Downloads2,808
Last 12 Months151
Last 6 weeks24

Abstract
Get Access

Article

Probe, count, and classify: categorizing hidden web databases

Panagiotis G. Ipeirotis,
Luis Gravano,
Mehran Sahami

Pages 67–78https://doi.org/10.1145/375663.375671

The contents of many valuable web-accessible databases are only accessible through search interfaces and are hence invisible to traditional web “crawlers.” Recent studies have estimated the size of this “hidden web” to be 500 billion pages, while the ...

- 111
- 1,109
Metrics
Total Citations111
Total Downloads1,109
Last 12 Months7
Last 6 weeks0

Abstract
Get Access

Article

Data bubbles: quality preserving performance boosting for hierarchical clustering

Markus M. Breunig,
Hans-Peter Kriegel,
Peer Kröger,
Jörg Sander

Pages 79–90https://doi.org/10.1145/375663.375672

In this paper, we investigate how to scale hierarchical clustering methods (such as OPTICS) to extremely large databases by utilizing data compression methods (such as BIRCH or random sampling). We propose a three step procedure: 1) compress the data ...

- 54
- 1,183
Metrics
Total Citations54
Total Downloads1,183
Last 12 Months8
Last 6 weeks1

Abstract
Get Access

Article

Mining needle in a haystack: classifying rare classes via two-phase rule induction

Mahesh V. Joshi,
Ramesh C. Agarwal,
Vipin Kumar

Pages 91–102https://doi.org/10.1145/375663.375673

Learning models to classify rarely occurring target classes is an important problem with applications in network intrusion detection, fraud detection, or deviation detection in general. In this paper, we analyze our previously proposed two-phase rule ...

- 80
- 1,176
Metrics
Total Citations80
Total Downloads1,176
Last 12 Months19
Last 6 weeks4

Abstract
Get Access

Article

Efficient evaluation of XML middle-ware queries

Mary Fernandez,
Atsuyuki Morishima,
Dan Suciu

Pages 103–114https://doi.org/10.1145/375663.375674

We address the problem of efficiently constructing materialized XML views of relational databases. In our setting, the XML view is specified by a query in the declarative query language of a middle-ware system, called SilkRoute. The middle-ware system ...

- 104
- 1,023
Metrics
Total Citations104
Total Downloads1,023
Last 12 Months4
Last 6 weeks0

Abstract
Get Access

Article

Filtering algorithms and implementation for very fast publish/subscribe systems

Françoise Fabret,
H. Arno Jacobsen,
François Llirbat,
Joăo Pereira,
Kenneth A. Ross,
Dennis Shasha

Pages 115–126https://doi.org/10.1145/375663.375677

Publish/Subscribe is the paradigm in which users express long-term interests (“subscriptions”) and some agent “publishes” events (e.g., offers). The job of Publish/Subscribe software is to send events to the owners of subscriptions satisfied by those ...

- 456
- 1,922
Metrics
Total Citations456
Total Downloads1,922
Last 12 Months22
Last 6 weeks0

Abstract
Get Access

Article

Adaptable query optimization and evaluation in temporal middleware

Giedrius Slivinskas,
Christian S. Jensen,
Richard Thomas Snodgrass

Pages 127–138https://doi.org/10.1145/375663.375678

Time-referenced data are pervasive in most real-world databases. Recent advances in temporal query languages show that such database applications may benefit substantially from built-in temporal support in the DBMS. To achieve this, temporal query ...

- 14
- 807
Metrics
Total Citations14
Total Downloads807
Last 12 Months4
Last 6 weeks0

Abstract
Get Access

Article

Optimizing multidimensional index trees for main memory access

Kihong Kim,
Sang K. Cha,
Keunjoo Kwon

Pages 139–150https://doi.org/10.1145/375663.375679

Recent studies have shown that cache-conscious indexes such as the CSB+-tree outperform conventional main memory indexes such as the T-tree. The key idea of these cache-conscious indexes is to eliminate most of child pointers from a node to increase the ...

- 104
- 1,419
Metrics
Total Citations104
Total Downloads1,419
Last 12 Months16
Last 6 weeks1

Abstract
Get Access

Article

Locally adaptive dimensionality reduction for indexing large time series databases

Eamonn Keogh,
Kaushik Chakrabarti,
Michael Pazzani,
Sharad Mehrotra

Pages 151–162https://doi.org/10.1145/375663.375680

Similarity search in large time series databases has attracted much research interest recently. It is a difficult problem because of the typically high dimensionality of the data.. The most promising solutions involve performing dimensionality reduction ...

- 640
- 2,890
Metrics
Total Citations640
Total Downloads2,890
Last 12 Months90
Last 6 weeks5

Abstract
Get Access

Article

Main-memory index structures with fixed-size partial keys

Philip Bohannon,
Peter Mcllroy,
Rajeev Rastogi

Pages 163–174https://doi.org/10.1145/375663.375681

The performance of main-memory index structures is increasingly determined by the number of CPU cache misses incurred when traversing the index. When keys are stored indirectly, as is standard in main-memory databases, the cost of key retrieval in terms ...

- 82
- 1,409
Metrics
Total Citations82
Total Downloads1,409
Last 12 Months23
Last 6 weeks1

Abstract
Get Access

Article

Automatic segmentation of text into structured records

Vinayak Borkar,
Kaustubh Deshmukh,
Sunita Sarawagi

Pages 175–186https://doi.org/10.1145/375663.375682

In this paper we present a method for automatically segmenting unformatted text records into structured elements. Several useful data sources today are human-generated as continuous text whereas convenient usage requires the data to be organized as ...

- 155
- 1,937
Metrics
Total Citations155
Total Downloads1,937
Last 12 Months30
Last 6 weeks4

Abstract
Get Access

Article

Efficient and effective metasearch for text databases incorporating linkages among documents

Clement Yu,
Weiyi Meng,
Wensheng Wu,
King-Lup Liu

Pages 187–198https://doi.org/10.1145/375663.375684

Linkages among documents have a significant impact on the importance of documents, as it can be argued that important documents are pointed to by many documents or by other important documents. Metasearch engines can be used to facilitate ordinary users ...

- 27
- 670
Metrics
Total Citations27
Total Downloads670
Last 12 Months4
Last 6 weeks0

Abstract
Get Access

Article

Independence is good: dependency-based histogram synopses for high-dimensional data

Amol Deshpande,
Minos Garofalakis,
Rajeev Rastogi

Pages 199–210https://doi.org/10.1145/375663.375685

Approximating the joint data distribution of a multi-dimensional data set through a compact and accurate histogram synopsis is a fundamental problem arising in numerous practical scenarios, including query optimization and approximate query answering. ...

- 141
- 880
Metrics
Total Citations141
Total Downloads880
Last 12 Months32
Last 6 weeks7

Abstract
Get Access

Article

STHoles: a multidimensional workload-aware histogram

Nicolas Bruno,
Surajit Chaudhuri,
Luis Gravano

Pages 211–222https://doi.org/10.1145/375663.375686

Attributes of a relation are not typically independent. Multidimensional histograms can be an effective tool for accurate multiattribute query selectivity estimation. In this paper, we introduce STHoles, a “workload-aware” histogram that allows bucket ...

- 265
- 1,168
Metrics
Total Citations265
Total Downloads1,168
Last 12 Months61
Last 6 weeks13

Abstract
Get Access

Article

Global optimization of histograms

H. V. Jagadish,
Hui Jin,
Beng Chin Ooi,
Kian-Lee Tan

Pages 223–234https://doi.org/10.1145/375663.375687

Histograms are frequently used to represent the distribution of data values in an attribute of a relation. Most previous work has focused on identifying the optimal histogram (given a limited number of buckets) for a single attribute independent of ...

- 44
- 660
Metrics
Total Citations44
Total Downloads660
Last 12 Months22
Last 6 weeks2

Abstract
Get Access

Article

Improving index performance through prefetching

Shimin Chen,
Phillip B. Gibbons,
Todd C. Mowry

Pages 235–246https://doi.org/10.1145/375663.375688

This paper proposes and evaluate Prefetching B⁺-Trees (pB⁺-Trees), which use prefetching to accelerate two important operations on B⁺-Tree indices: searches and range scans. To accelerate searches, pB⁺-Trees use prefetching to effectively create wider ...

- 110
- 2,145
Metrics
Total Citations110
Total Downloads2,145
Last 12 Months37
Last 6 weeks4

Abstract
Get Access

Article

Efficient and tumble similar set retrieval

Aristides Gionis,
Dimitrios Gunopulos,
Nick Koudas

Pages 247–258https://doi.org/10.1145/375663.375689

Set value attributes are a concise and natural way to model complex data sets. Modern Object Relational systems support set value attributes and allow various query capabilities on them. In this paper we initiate a formal study of indexing techniques ...

- 49
- 420
Metrics
Total Citations49
Total Downloads420
Last 12 Months3
Last 6 weeks1

Abstract
Get Access

Article

PREFER: a system for the efficient execution of multi-parametric ranked queries

Vagelis Hristidis,
Nick Koudas,
Yannis Papakonstantinou

Pages 259–270https://doi.org/10.1145/375663.375690

Users often need to optimize the selection of objects by appropriately weighting the importance of multiple object attributes. Such optimization problems appear often in operations' research and applied mathematics as well as everyday life; e.g., a ...

- 216
- 710
Metrics
Total Citations216
Total Downloads710
Last 12 Months11
Last 6 weeks1

Abstract
Get Access

Article

Query optimization in compressed database systems

Zhiyuan Chen,
Johannes Gehrke,
Flip Korn

Pages 271–282https://doi.org/10.1145/375663.375692

Over the last decades, improvements in CPU speed have outpaced improvements in main memory and disk access rates by orders of magnitude, enabling the use of data compression techniques to improve the performance of database systems. Previous work ...

- 110
- 1,481
Metrics
Total Citations110
Total Downloads1,481
Last 12 Months57
Last 6 weeks9

Abstract
Get Access

Article

SPARTAN: a model-based semantic compression system for massive data tables

Shivnath Babu,
Minos Garofalakis,
Rajeev Rastogi

Pages 283–294https://doi.org/10.1145/375663.375693

While a variety of lossy compression schemes have been developed for certain forms of digital data (e.g., images, audio, video), the area of lossy compression techniques for arbitrary data tables has been left relatively unexplored. Nevertheless, such ...

- 55
- 770
Metrics
Total Citations55
Total Downloads770
Last 12 Months17
Last 6 weeks1

Abstract
Get Access

Article

A robust, optimization-based approach for approximate answering of aggregate queries

Surajit Chaudhuri,
Gautam Das,
Vivek Narasayya

Pages 295–306https://doi.org/10.1145/375663.375694

The ability to approximately answer aggregation queries accurately and efficiently is of great benefit for decision support and data mining tools. In contrast to previous sampling-based studies, we treat the problem as an optimization problem whose goal ...

- 87
- 1,185
Metrics
Total Citations87
Total Downloads1,185
Last 12 Months12
Last 6 weeks2

Abstract
Get Access

Article

Materialized view selection and maintenance using multi-query optimization

Hoshi Mistry,
Prasan Roy,
S. Sudarshan,
Krithi Ramamritham

Pages 307–318https://doi.org/10.1145/375663.375703

Materialized views have been found to be very effective at speeding up queries, and are increasingly being supported by commercial databases and data warehouse systems. However, whereas the amount of data entering a warehouse and the number of ...

- 150
- 1,800
Metrics
Total Citations150
Total Downloads1,800
Last 12 Months39
Last 6 weeks6

Abstract
Get Access

Article

Generating efficient plans for queries using views

Foto N. Afrati,
Chen Li,
Jeffrey D. Ullman

Pages 319–330https://doi.org/10.1145/375663.375705

We study the problem or generating efficient, equivalent rewritings using views to compute the answer to a query. We take the closed-world assumption, in which views are materialized from base relations, rather than views describing sources in terms of ...

- 50
- 600
Metrics
Total Citations50
Total Downloads600
Last 12 Months6
Last 6 weeks1

Abstract
Get Access

Article

Optimizing queries using materialized views: a practical, scalable solution

Jonathan Goldstein,
Per-Åke Larson

Pages 331–342https://doi.org/10.1145/375663.375706

Materialized views can provide massive improvements in query processing time, especially for aggregation queries over large tables. To realize this potential, the query optimizer must know how and when to exploit materialized views. This paper presents ...

- 208
- 2,541
Metrics
Total Citations208
Total Downloads2,541
Last 12 Months68
Last 6 weeks8

Abstract
Get Access

Article

Dynamic buffer allocation in video-on-demand systems

Sang-Ho Lee,
Kyu-Young Whang,
Yang-Sae Moon,
Il-Yeol Song

Pages 343–354https://doi.org/10.1145/375663.375709

In video-on-demand (VOD) systems, as the size of the buffer allocated to user requests increases, initial latency and memory requirements increase. Hence, the buffer size must be minimized. The existing static buffer allocation scheme, however, ...

- 10
- 318
Metrics
Total Citations10
Total Downloads318
Last 12 Months10
Last 6 weeks1

Abstract
Get Access

Cited By

Farnsworth D and Tang N (2023). Modeling and Fitting Two-Way Tables Containing Outliers, International Journal of Mathematics and Mathematical Sciences, 10.1155/2023/6352058, 2023, (1-6), Online publication date: 11-Feb-2023.
Schlaipfer M, Rajan K, Lal A and Samak M Optimizing Big-Data Queries Using Program Synthesis Proceedings of the 26th Symposium on Operating Systems Principles, (631-646)

Save to Binder

Create a New Binder

Name

Contributors

Timos Sellis
RMIT University
- Publication Years1985 - 2024
- Publication counts231
- Citation count4,083
- Available for Download79
- Downloads (cumulative)42,174
- Downloads (12 months)3,244
- Downloads (6 weeks)466
- Average Downloads per Article534
- Average Citation per Article18
View Full Profile
Sharad Mehrotra
University of California, Irvine
- Publication Years1990 - 2024
- Publication counts209
- Citation count4,976
- Available for Download118
- Downloads (cumulative)79,050
- Downloads (12 months)8,057
- Downloads (6 weeks)962
- Average Downloads per Article670
- Average Citation per Article24
View Full Profile

Index Terms

Proceedings of the 2001 ACM SIGMOD international conference on Management of data
1. Information systems

Comments

Recommendations

SIGMOD '02: Proceedings of the 2002 ACM SIGMOD international conference on Management of data
SIGMOD '03: Proceedings of the 2003 ACM SIGMOD international conference on Management of data
SIGMOD '10: Proceedings of the 2010 ACM SIGMOD International Conference on Management of data

Acceptance Rates

SIGMOD '01 Paper Acceptance Rate 44 of 293 submissions, 15%;

Overall Acceptance Rate 785 of 4,003 submissions, 20%

Year	Submitted	Accepted	Rate
SIGMOD '19	430	88	20%
SIGMOD '18	461	90	20%
SIGMOD '15	415	106	26%
SIGMOD '14	421	107	25%
SIGMOD '13	372	76	20%
SIGMOD '12	289	48	17%
SIGMOD '03	342	53	15%
SIGMOD '02	240	42	18%
SIGMOD '01	293	44	15%
SIGMOD '00	248	42	17%
SIGMOD '97	202	42	21%
SIGMOD '96	290	47	16%
Overall	4,003	785	20%

Export Citations

Select Citation format

Please download or close your previous search result export first before starting a new bulk export.
Preview is not available.
By clicking download,a status dialog will open to start the export process. The process may takea few minutes but once it finishes a file will be downloadable from your browser. You may continue to browse the DL while the export process is in progress.
Download
- Download citation
- Copy citation

Save to Binder

Sections

Cited By

Save to Binder

Index Terms

Recommendations

SIGMOD '02: Proceedings of the 2002 ACM SIGMOD international conference on Management of data

SIGMOD '03: Proceedings of the 2003 ACM SIGMOD international conference on Management of data

SIGMOD '10: Proceedings of the 2010 ACM SIGMOD International Conference on Management of data

Acceptance Rates