DOI: 10.1145/775152.775234

Make it fresh, make it quick: searching a network of personal webservers

Published: 20 May 2003

Abstract

Personal webservers have proven to be a popular means of file sharing and peer collaboration. Unfortunately, the transient availability and rapidly evolving content on such hosts render centralized, crawl-based search indices stale and incomplete. To address this problem, we propose YouSearch, a distributed search application for personal webservers operating within a shared context (e.g., a corporate intranet). With YouSearch, search results are always fast, fresh and complete -- properties we show arise from an architecture that exploits both the extensive distributed resources available at the peer webservers and a centralized repository of summarized network state. YouSearch extends the concept of a shared context within web communities by enabling peers to aggregate into groups and users to search over specific groups. In this paper, we describe the challenges, design, implementation and experiences with a successful intranet deployment of YouSearch.
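
The abstract does not say how the central repository summarizes network state or how queries are routed. As an illustrative sketch only, the Python below assumes that each peer indexes its own files, publishes a Bloom filter of its terms to a central registrar, and answers queries directly from its current content. All names (`BloomFilter`, `Peer`, `Registrar`, `search`) and parameters (filter size, hash count) are hypothetical and are not taken from YouSearch.

```python
import hashlib


class BloomFilter:
    """Fixed-size Bloom filter: may give false positives, never false negatives."""

    def __init__(self, num_bits=1024, num_hashes=3):
        self.num_bits = num_bits
        self.num_hashes = num_hashes
        self.bits = 0  # a Python int used as a bit vector

    def _positions(self, term):
        # Derive num_hashes bit positions by salting a single hash function.
        for i in range(self.num_hashes):
            digest = hashlib.sha256(f"{i}:{term}".encode("utf-8")).digest()
            yield int.from_bytes(digest[:8], "big") % self.num_bits

    def add(self, term):
        for pos in self._positions(term):
            self.bits |= 1 << pos

    def might_contain(self, term):
        return all((self.bits >> pos) & 1 for pos in self._positions(term))


class Peer:
    """A personal webserver that indexes and searches its own files (assumed role)."""

    def __init__(self, name, documents):
        self.name = name
        self.documents = documents  # {path: text}

    def summary(self):
        # Compact summary of every term this peer currently holds.
        bf = BloomFilter()
        for text in self.documents.values():
            for term in text.lower().split():
                bf.add(term)
        return bf

    def search(self, terms):
        # Exact lookup against current content, so answers are never stale.
        return sorted(
            path for path, text in self.documents.items()
            if all(t in text.lower().split() for t in terms)
        )


class Registrar:
    """Central repository that stores only per-peer summaries, not documents."""

    def __init__(self):
        self.summaries = {}

    def publish(self, peer):
        self.summaries[peer.name] = peer.summary()

    def candidates(self, terms):
        # Peers whose summary might contain every query term.
        return sorted(
            name for name, bf in self.summaries.items()
            if all(bf.might_contain(t) for t in terms)
        )


def search(registrar, peers, query):
    """Ask the registrar for candidate peers, then query only those peers."""
    terms = query.lower().split()
    results = {}
    for name in registrar.candidates(terms):
        hits = peers[name].search(terms)
        if hits:  # discard Bloom-filter false positives
            results[name] = hits
    return results


if __name__ == "__main__":
    peers = {
        "alice": Peer("alice", {"notes.txt": "quarterly budget report"}),
        "bob": Peer("bob", {"todo.txt": "buy milk"}),
    }
    registrar = Registrar()
    for p in peers.values():
        registrar.publish(p)
    print(search(registrar, peers, "budget report"))
```

In this sketch the registrar narrows a query to a few candidate peers, and each candidate answers from its live content, so a file deleted after the last summary was published is not returned. Content added after the last publish stays invisible until the peer republishes its summary.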

Published In

WWW '03: Proceedings of the 12th international conference on World Wide Web
May 2003
772 pages
ISBN: 1581136803
DOI: 10.1145/775152
Publisher

Association for Computing Machinery

New York, NY, United States

Author Tags

  1. P2P
  2. decentralized systems
  3. information communities
  4. intranet search
  5. peer-to-peer networks
  6. web search

Qualifiers

  • Article

Acceptance Rates

Overall Acceptance Rate 1,899 of 8,196 submissions, 23%
