skip to main content
10.5555/1316689.1316720dlproceedingsArticle/Chapter ViewAbstractPublication PagesvldbConference Proceedingsconference-collections
Article

Resource sharing in continuous sliding-window aggregates

Published: 31 August 2004 Publication History

Abstract

We consider the problem of resource sharing when processing large numbers of continuous queries. We specifically address sliding-window aggregates over data streams, an important class of continuous operators for which sharing has not been addressed. We present a suite of sharing techniques that cover a wide range of possible scenarios: different classes of aggregation functions (algebraic, distributive, holistic), different window types (time-based, tuple-based, suffix, historical), and different input models (single stream, multiple substreams). We provide precise theoretical performance guarantees for our techniques, and show their practical effectiveness through experimental study.

References

[1]
{1} M. K. Aguilera, R. E. Strom, et al. Matching events in a content-based subscription system. In Proc. of the 18th Annual ACM Symp. on Principles of Distributed Computing, pages 53-61, May 1999.
[2]
{2} M. Altinel and M. J. Franklin. Efficient filtering of XML documents for selective dissemination of information. In Proc. of the 26th Intl. Conf. on Very Large Data Bases, pages 53-64, Sept. 2000.
[3]
{3} A. Arasu, S. Babu, and J. Widom. The CQL Continuous Query Language: Semantic Foundations and Query Execution. Technical report, Stanford University, Oct. 2003. http://dbpubs. stanford.edu/pub/2003-67.
[4]
{4} A. Arasu and G. Manku. Approximate counts and quantiles over sliding windows. In Proc. of the 23rd ACM SIGACT-SIGMOD-SIGART Symp. on Principles of Database Systems, June 2004.
[5]
{5} A. Arasu and J. Widom. Resource sharing in continuous sliding window aggregates. Technical Report http://dbpubs. stanford.edu/pub/2004-15, Stanford University, 2004.
[6]
{6} R. Avnur and J. M. Hellerstein. Eddies: Continuously adaptive query processing. In Proc. of the 2000 ACM SIGMOD Intl. Conf. on Management of Data, pages 261-272, May 2000.
[7]
{7} B. Babcock, S. Babu, et al. Models and issues in data stream systems. In Proc. of the 21st ACM SIGACT-SIGMOD-SIGART Symp. on Principles of Database Systems, pages 1-16, June 2002.
[8]
{8} D. Carney, U. Centintemel, et al. Monitoring streams - a new class of data management applications. In Proc. of the 28th Intl. Conf. on Very Large Data Bases, pages 215-226, Aug. 2002.
[9]
{9} S. Chandrasekharan, O. Cooper, et al. TelegraphCQ: Continuous dataflow processing for an uncertain world. In Proc. of the 1st Conf. on Innovative Data Systems Research, pages 269-280, Jan. 2003.
[10]
{10} S. Chandrasekharan and M. J. Franklin. Streaming queries over streaming data. In Proc. of the 28th Intl. Conf. on Very Large Data Bases, pages 203-214, Aug. 2002.
[11]
{11} J. Chen, D. J. DeWitt, F. Tian, and Y. Wang. NiagaraCQ: A scalable continuous query system for internet databases. In Proc. of the 2000 ACM SIGMOD Intl. Conf. on Management of Data, pages 379-390, May 2000.
[12]
{12} M. Datar, A. Gionis, P. Indyk, and R. Motwani. Maintaining stream statistics over sliding windows. In Proc. of the 13th Annual ACM-SIAM Symp. on Discrete Algorithms, pages 635-644, Jan. 2002.
[13]
{13} Y. Diao, P. M. Fischer, M. J. Franklin, and R. To. YFilter: Efficient and scalable filtering of XML documents. In Proc. of the 18th Intl. Conf. on Data Engineering, pages 341-344, Feb. 2002.
[14]
{14} A. Dobra, M. Garofalakis, J. Gehrke, and R. Rastogi. Sketch-based multi-query processing over data streams. In Proc. of the 9th Intl. Conf. on Extending Database Technology, Mar. 2004.
[15]
{15} C. Estan and G. Varghese. New directions in traffic measurement and accounting: Focusing on the elephants, ignoring the mice. ACM Transactions on Computer Systems, 21(3):270-313, Aug. 2003.
[16]
{16} F. Fabret, H. Jacobsen, et al. Filtering algorithms and implementation for very fast publish/subscribe. In Proc. of the 2000 ACM SIGMOD Intl. Conf. on Management of Data, pages 115-126, May 2001.
[17]
{17} J. Gehrke, F. Korn, and D. Srivastava. On computing correlated aggregates over continual data streams. In Proc. of the 2001 ACM SIGMOD Intl. Conf. on Management of Data, pages 13-24, May 2001.
[18]
{18} P. B. Gibbons and S. Tirthapura. Distributed streams algorithms for sliding windows. In Proc. of the 14th Annual ACM Symp. on Parallel Algs. and Architectures, pages 63-72, Aug. 2002.
[19]
{19} A. C. Gilbert, Y. Kotidis, et al. How to summarize the universe: Dynamic maintenance of quantiles. In Proc. of the 28th Intl. Conf. on Very Large Data Bases, pages 454-465, Aug. 2002.
[20]
{20} J. Gray, S. Chaudhuri, et al. Data cube: A relational aggregation operator generalizing group-by, cross-tab, and sub totals. Data Mining and Knowledge Discovery, 1(1):29-53, Mar. 1997.
[21]
{21} R. E. Gruber, B. Krishnamurthy, and E. Panagos. READY: A high performance event notification system. In Proc. of the 16th Intl. Conf. on Data Engineering, pages 668-669, Mar. 2000.
[22]
{22} A. K. Gupta and D. Suciu. Stream processing of XPath queries with predicates. In Proc. of the 2003 ACM SIGMOD Intl. Conf. on Management of Data, pages 419-430, June 2003.
[23]
{23} libavl: Library for balanced binary trees. Available at http:// www.gnu.org/directory/GNU/libavl.html.
[24]
{24} U. Lindqvist and P. A. Porras. Detecting computer and network misuse through the production-based expert system toolset (P-BEST). In Proc. of the IEEE Symp. on Security and Privacy, pages 146-161, May 1999.
[25]
{25} S. Madden, M. A. Shah, J. M. Hellerstein, and V. Raman. Continuously adaptive continuous queries over streams. In Proc. of the 2002 ACM SIGMOD Intl. Conf. on Management of Data, pages 49-60, June 2002.
[26]
{26} F. Peng and S. S. Chawathe. XPath queries on streaming data. In Proc. of the 2003 ACM SIGMOD Intl. Conf. on Management of Data, pages 431-442, June 2003.
[27]
{27} P. Roy, S. Seshadri, et al. Efficient and extensible algorithms for multi query optimization. In Proc. of the 2000 ACM SIGMOD Intl. Conf. on Management of Data, pages 249-260, May 2000.
[28]
{28} T. K. Sellis. Multiple-query optimization. ACM Trans. on Database Systems, 13(1):23-52, Mar. 1988.
[29]
{29} Traderbot home page. http://www.traderbot.com, 2003.
[30]
{30} G. Vigna and R. A. Kemmerer. NetSTAT: A network-based intrusion detection approach. In Proc. of the 14th Annual Computer Security Appln. Conf., pages 25-38, Dec. 1998.
[31]
{31} Y. Zhu and D. Shasha. StatStream: Statistical monitoring of thousands of data streams in real time. In Proc. of the 28th Intl. Conf. on Very Large Data Bases, pages 358-369, Aug. 2002.

Cited By

View all

Recommendations

Comments

Information & Contributors

Information

Published In

cover image DL Hosted proceedings
VLDB '04: Proceedings of the Thirtieth international conference on Very large data bases - Volume 30
August 2004
1380 pages

Sponsors

  • VLDB Endowment: Very Large Database Endowment

Publisher

VLDB Endowment

Publication History

Published: 31 August 2004

Qualifiers

  • Article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)5
  • Downloads (Last 6 weeks)0
Reflects downloads up to 18 Jan 2025

Other Metrics

Citations

Cited By

View all

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media