skip to main content
article
Free access

Statistical estimators for aggregate relational algebra queries

Published: 01 December 1991 Publication History
First page of PDF

References

[1]
CHAO, A. Nonparametric estimation of the number of classes in a population. Scand. J. Stat. 11 (1984), 265-270.
[2]
COCHRAN, W. Sampling Techniques. Third ed. Wiley, New York, 1977.
[3]
CHRISTODOULAKIS, S. Estimating record selectivities. Inf. System 8, 2 (1983), 105-115.
[4]
DEVORE, J. Probability and Statistics for Engineering and Sciences. Brook/Cole, 1984.
[5]
DATTA, A., FOURNIER, B., Hou, W.-C., AND OZSOYOGLU, G. The implementation of SSDB. In Proceedings of the Third International Workshop on Statistical and Scientific Database Management (Luxembourg, July 1986), pp. 245-260.
[6]
FORTUNATO, E., R~ANELLI, M., RIccI, F., AND SEBASTIO, A. An algebra for statistical data. In Proceedings of the Third International Workshop on Statistical and Scientific Database Management (Luxembourg, July 1986),
[7]
FISHER, R. A. On the mathematical foundations of theoretical statistics. Philosophical Trans. Royal Soc. 222 (1921).
[8]
FISHER, R.A. Theory of statistical estimation. In Proceedings of the Cambridge Philosophical Society. 22, 1925.
[9]
GOODMAN, L. On the estimation of the number of classes in a population. Ann. Math. Stat. 20 (1949), 572-579.
[10]
GHOSH, S. SIAM: Statistical information access method. In Proceedings of the Third International Workshop on Statistical and Scientific Database Management (Luxembourg, July 1986), pp. 286-293.
[11]
G~osH, S. Numerical operation on relational database. IBM Res. Rep. 5605, 1987.
[12]
Hou, W.-C., OZSOYOGLU, G., AND TANEJA, B. Statistical estimators for relational algebra expressions, In Proceedings ACM Symposium on Principles of Database Systems (Texas, March 1988), pp. 276-287.
[13]
Hou, W.-C., OZSOYOGLU, G., ANn TANEJA, B. Processing aggregate relational queries with hard time constraints. In Proceedings of the ACM SIGMOD Conference (Oregon, May 1989), pp. 68-77.
[14]
Hou, W.-C. Relational aggregate query processing techniques for real-time databases. Ph.D. thesis, Case Western Reserve Univ., May 1989.
[15]
HANSEN, M., HURWITZ, W., MADOW, W. Sample Survey Methods and Theory, Vol. 1. Wiley, New York, 1953.
[16]
KISH, L. Survey Sampling. Wiley, New York, 1965.
[17]
KLUG, A. Access paths in the 'ABE' statistical query facility. In Proceedings of the ACM SIGMOD Conference (1982).
[18]
MANNINO, M., CHU, P., ANn SAGER, T. Statistical profile estimation in database systems. ACM Comput. Surv. 20, 3 (Sept. 1988), 191-221.
[19]
MATOS, V. Extensions to the relational data model for statistical database applications. Ph.D. thesis, Case Western Reserve Univ., 1985.
[20]
MORGENSTEIN, J. Computer based management information systems embodying answer accuracy as a user parameter. Ph.D. thesis, Univ. of California, Berkeley, 1981.
[21]
NEYMAN, J. On the two different aspects of the representative method: The method of stratified sampling and the method of purposive selection. J. Royal Stat. Soc. 97 (}934), 558-606.
[22]
NEYMAN, J. On the problem of confidence intervals. Ann. Math. Stat. 6 (1935).
[23]
OLKEN, F. Physical database support for scientific and statistical databases. In Proceedings of the Third International Workshop on Statzstwal and Scientific Database Management (Luxembourg, July 1986).
[24]
OLKEN, F., ANn ROTEM, D. Simple random sampling from relational databases. In Proceedings of the VLDB Conference (Kyoto, Aug. 1986), pp. 160-169.
[25]
Ross, S. Introduction to Probab~lzty Models, 2nd ed Academic Press, 1980
[26]
RowE, N. Antisampling for estimation: An overview. IEEE Trans. Softw. Eng. 11, 10 (Oct. 1985), 1081-1091.
[27]
SROS~AN~, A., OLKEN, F., AND WONO, H. Characteristics of scientific databases. In Proceedings of the VLDB Conference (1984).
[28]
SCHEAFFER, R., MENDE~HALL, W., AND O?% L. Elementary Survey Sampling, 3rd ed. Duxbury Press, Boston, 1986.
[29]
Second International Workshop on Statisttcal and Scientific Database Management. California, 1982.
[30]
Third International Workshop on Statistical and Scientific Database Management. Luxembourg, 1986.
[31]
Fourth International Workshop on Stat~stzcal and Scientzfic Database Management. Rome, 1988.
[32]
SUKH^TME, P., A~ SUKHATME, B Sampling Theory of Survey Applications, third ed. New Delhi, India and Iowa State University Press, Ames, Iowa, 1984.
[33]
TUKE~, J. Exploratory Data Analysis Addison-Wesley, Reading, Mass., 1977.

Cited By

View all

Recommendations

Reviews

Pericles Loucopoulos

The authors describe the derivation of statistical estimators for COUNT(E) queries on a relational database and present algorithms for evaluating these estimators. A brief description of the relational model and the background statistical theory is provided, and this background is supplemented by a comprehensive set of references to related work. The application of statistical theory to the production of COUNT(E) estimators is explained in a thorough but readable form, and the algorithms provide a concise summary of the results. The paper proposes several enhancements to the estimators, and concentrates on cluster-sampling–based estimators in its appraisal of their performance on arbitrary relational databases. The experimental results are described in some detail, but provide useful insights into the type of estimator most appropriate to a variety of situations. The paper is long, but the length is justified by the detailed coverage of the subject area. It provides a valuable contribution to the field of statistical estimators.

Access critical reviews of Computing literature here

Become a reviewer for Computing Reviews.

Comments

Information & Contributors

Information

Published In

cover image ACM Transactions on Database Systems
ACM Transactions on Database Systems  Volume 16, Issue 4
Dec. 1991
176 pages
ISSN:0362-5915
EISSN:1557-4644
DOI:10.1145/115302
Issue’s Table of Contents

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 01 December 1991
Published in TODS Volume 16, Issue 4

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. relational algebra
  2. sampling
  3. selectivity
  4. simple random sampling
  5. statistical estimators

Qualifiers

  • Article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)65
  • Downloads (Last 6 weeks)15
Reflects downloads up to 05 Jan 2025

Other Metrics

Citations

Cited By

View all

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Login options

Full Access

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media