DOI: 10.1145/974044.974079
Article

Using locality of reference to improve performance of peer-to-peer applications

Published: 01 January 2004

Abstract

Peer-to-peer, or simply P2P, systems have recently emerged as a popular paradigm for building distributed applications. One key aspect of the P2P system design is the mechanism used for content location. A number of different approaches are currently in use. In particular, the location algorithm used in Gnutella, a popular and extensively analyzed P2P file sharing application, is based on flooding of messages in the network, which results in significant processing overhead on the participant nodes and thus poor performance.

In this paper, we provide an extensive performance evaluation of alternative algorithms for content location and retrieval in P2P systems, in particular, the Freenet and Gnutella systems. We compare the original Freenet and Gnutella algorithms, a previously proposed interest-based algorithm and two new algorithms which also explore locality of interest among peers to efficiently allow content location. Unlike previous proposals, the new algorithms organize the peers into communities that share interests. Two peers are said to have common interest if they share some of the locally stored files.

In order to evaluate the performance of these algorithms, we use a previously developed Freenet simulator and build a new Gnutella simulator, which includes several realistic system characteristics. We show that the new community-based algorithms improve the original Gnutella content location latency (and thus the system QoS) and system load by up to 31% and 30%, respectively. Our algorithms also reduce the average Freenet request and response path lengths by up to 39% and 31%, respectively. Furthermore, we show that, compared to the previously proposed interest-based algorithm, our new algorithms improve query latency by up to 27% without a significant increase in the load.
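The definition of common interest and the cost of flooding can be illustrated with a small Python sketch. It is not the paper's algorithm: the abstract does not say how communities are formed or how queries are routed inside them, so the function names, the toy peers and the overlay links below are all assumptions made for illustration. The sketch only encodes the stated definition (two peers have a common interest if they store at least one file in common) and a generic TTL-limited flood that counts the messages it sends.

```python
from collections import deque


def have_common_interest(files_a, files_b):
    """True if the two peers store at least one file in common."""
    return not set(files_a).isdisjoint(files_b)


def interest_neighbors(stored):
    """Map each peer to the other peers it shares a common interest with."""
    peers = list(stored)
    return {
        p: {q for q in peers if q != p and have_common_interest(stored[p], stored[q])}
        for p in peers
    }


def flood_search(links, stored, start, wanted, ttl):
    """TTL-limited breadth-first flood.

    Returns (peer that stores `wanted` or None, number of messages sent).
    """
    seen = {start}
    queue = deque([(start, ttl)])
    messages = 0
    while queue:
        peer, hops_left = queue.popleft()
        if wanted in stored[peer]:
            return peer, messages
        if hops_left == 0:
            continue
        for nxt in links[peer]:
            messages += 1  # every forwarded query costs one message
            if nxt not in seen:
                seen.add(nxt)
                queue.append((nxt, hops_left - 1))
    return None, messages


# Hypothetical toy data: files stored at each peer and an arbitrary overlay.
stored = {"A": {"f1", "f2"}, "B": {"f2", "f3"}, "C": {"f4"}, "D": {"f3", "f5"}}
overlay = {"A": ["C"], "C": ["A", "B"], "B": ["C", "D"], "D": ["B"]}

by_interest = interest_neighbors(stored)
plain = flood_search(overlay, stored, "A", "f3", ttl=3)
local = flood_search(by_interest, stored, "A", "f3", ttl=3)
```

In this toy setup, peer A reaches the file through its interest neighbor B with fewer messages than through the arbitrary overlay, which is the intuition behind exploiting locality of interest. The measured gains reported in the abstract come from the authors' simulators, not from a sketch like this.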

Published In

WOSP '04: Proceedings of the 4th international workshop on Software and performance
January 2004
313 pages
ISBN: 1581136730
DOI: 10.1145/974044
  • ACM SIGSOFT Software Engineering Notes, Volume 29, Issue 1
    January 2004
    300 pages
    ISSN: 0163-5948
    DOI: 10.1145/974043

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Author Tags

  1. content location algorithms
  2. interest-based communities
  3. peer-to-peer systems
  4. performance analysis

Qualifiers

  • Article

Conference

WOSP04
WOSP04: Fourth International Workshop on Software and Performance 2004
January 14 - 16, 2004
Redwood Shores, California

Acceptance Rates

WOSP '04 Paper Acceptance Rate: 38 of 70 submissions, 54%
Overall Acceptance Rate: 149 of 241 submissions, 62%
