skip to main content
10.1145/584792.584823acmconferencesArticle/Chapter ViewAbstractPublication PagescikmConference Proceedingsconference-collections
Article

Efficient prediction of web accesses on a proxy server

Published: 04 November 2002 Publication History

Abstract

Web access prediction is an active research topic with many applications. Various approaches have been proposed for Web access prediction in the domain of individual Web servers but they have to be tailored to the domain of proxy servers to satisfy its special requirements in prediction efficiency and scalability. In this paper, the design and implementation of proxy-based prediction service (PPS) is presented. For prediction efficiency, PPS applies a new prediction scheme which employs a two-layer navigation model to capture both inter-site and intra-site access patterns, incorporated with a bottom-up prediction mechanism that exploits reference locality in proxy logs. For system scalability, PPS manages the navigation model in disk database and adopts a predictive cache replacement strategy for data shipping between the model database and cache. We show the superiority of our prediction scheme over existing approaches and validate our model management and caching strategies, with a detailed performance study using real-world data.

References

[1]
M.-S. Chen, J. S. Park, and P. S. Yu. Efficient data mining for path traversal patterns. Knowledge and Data Engineering, 10(2):209--221, 1998.]]
[2]
R. Cooley, B. Mobasher, and J. Srivastava. Data preparation for mining world wide web browsing patterns. Knowledge and Information Systems, 1(1):5--32, 1999.]]
[3]
J. Dean and M. R. Henzinger. Finding related pages in the world wide web. WWW8 / Computer Networks, 31(11-16):1467--1479, 1999.]]
[4]
M. Deshpande and G. Karypis. Selective markov models for predicting web-page accesses. In Proc. SIAM Int. Conference on Data Mining, Apr. 2001.]]
[5]
D. J. DeWitt, P. Futtersack, D. Maier, and F. Velez. A study of three alternative workstation-server architectures for object oriented database systems. In The VLDB Journal, pages 107--121, 1990.]]
[6]
L. Fan, P. Cao, W. Lin, and Q. Jacobson. Web prefetching between low-bandwidth clients and proxies: Potential and performance. In Proc. of the SIGMETRICS'99 Conference, Atlanta, May 1999.]]
[7]
J. Griffioen and R. Appleton. Reducing file system latency using a predictive approach. In USENIX Summer, pages 197--207, 1994.]]
[8]
W. Lou, G. Liu, H. Lu, and Q. Yang. Cut-and-pick transactions for proxy log mining. In Proc. of the 8th EDBT Conference, Prague, Czech, March 2002.]]
[9]
W. Lou and H. Lu. Proxy-based web access prediciton: Design and implementation. Technical report, CS Department, Hong Kong University of Science and Technology, May 2002.]]
[10]
A. Nanopoulos, D. Katsaros, and Y. Manolopoulos. Effective prediction of web-user accesses, a data mining approach. In Proc. of Workshop on WebKDD'01, San Francisco, CA, Aug 2001.]]
[11]
V. N. Padmanabhan and J. C. Mogul. Using predictive prefetching to improve World-Wide Web latency. In Proc. of the SIGCOMM'96 conference, 1996.]]
[12]
T. Palpanas and A. Mendelzon. Web prefetching using partial match prediction. Technical report CSRG-376. CS Department, University of Toronto, 1998.]]
[13]
A. Papoulis. Probability, Random Variables, and Stochastic Processes. McGraw Hill, 1991.]]
[14]
J. Pitkow and P. Pirolli. Mining longest repeated subsequences to predict world wide web surfing. In Proc. of Second USENIX Symposium on Internet Technologies and Systems, Oct 1999.]]
[15]
R. Ramakrishnan and J. Gehrke. Database Management Systems (2nd Edt.). McGraw-Hill, 2000.]]
[16]
B. M. Sarwar, G. Karypis, J. A. Konstan, and J. Riedl. Analysis of recommendation algorithms for e-commerce. In ACM Conference on Electronic Commerce, pages 158--167, 2000.]]
[17]
Q. Yang, H. H. Zhang, and T. Li. Mining web logs for prediction models in www caching and prefetching. In Proc. of the 7th ACM SIGKDD'01, San Francisco, California, USA, August 2001.]]

Cited By

View all

Index Terms

  1. Efficient prediction of web accesses on a proxy server

      Recommendations

      Comments

      Information & Contributors

      Information

      Published In

      cover image ACM Conferences
      CIKM '02: Proceedings of the eleventh international conference on Information and knowledge management
      November 2002
      704 pages
      ISBN:1581134924
      DOI:10.1145/584792
      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Sponsors

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 04 November 2002

      Permissions

      Request permissions for this article.

      Check for updates

      Author Tags

      1. navigational model
      2. proxy server
      3. web access prediction

      Qualifiers

      • Article

      Conference

      CIKM02

      Acceptance Rates

      Overall Acceptance Rate 1,861 of 8,427 submissions, 22%

      Upcoming Conference

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)2
      • Downloads (Last 6 weeks)0
      Reflects downloads up to 14 Sep 2024

      Other Metrics

      Citations

      Cited By

      View all

      View Options

      Get Access

      Login options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media