research-article

Fair sampling across network flow measurements

Published: 11 June 2012

Abstract

Sampling is crucial for controlling resource consumption by internet traffic flow measurements. Routers use Packet Sampled NetFlow, and completed flow records are sampled in the measurement infrastructure. Recent research, motivated by the need of service providers to accurately measure both small and large traffic subpopulations, has focused on distributing a packet sampling budget amongst subpopulations. But long timescales of hardware development and lower bandwidth costs motivate post-measurement analysis of complete flow records at collectors instead. Sampling in collector databases then manages data volumes, yielding general-purpose summaries that are rapidly queried to trigger drill-down analysis on a time-limited window of full data. The summaries are small enough to be archived. This paper addresses the problem of distributing a sampling budget over subpopulations of flow records. Estimation accuracy goals are met by fairly sharing the budget. We establish a correspondence between the type of accuracy goal and the flavor of fair sharing used. A streaming Max-Min Fair Sampling algorithm fairly shares the sampling budget across subpopulations, with sampling as a mechanism to deallocate budget. This provides timely samples and is robust against uncertainties in configuration and demand. We illustrate using flow records from an access router of a large ISP, where rates over interface traffic subpopulations vary over several orders of magnitude. We detail an implementation whose computational cost is no worse than subpopulation-oblivious sampling.
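
As an illustration of what max-min fair sharing of a sampling budget means, the following is a minimal sketch of the textbook offline water-filling computation. It is not the streaming Max-Min Fair Sampling algorithm of the paper, whose details are not given in the abstract. The function name `max_min_fair_share`, the subpopulation labels, and the record counts are hypothetical and chosen only for the example.

```python
def max_min_fair_share(demands, budget):
    """Offline water-filling (an illustrative assumption, not the paper's
    streaming algorithm).

    demands: dict mapping a subpopulation label to its number of flow records.
    budget:  total number of records that may be retained.
    Returns a dict mapping each label to its max-min fair allocation.
    """
    allocation = {}
    remaining = float(budget)
    # Visit subpopulations from the smallest demand to the largest.
    pending = sorted(demands.items(), key=lambda kv: kv[1])
    while pending:
        fair_share = remaining / len(pending)
        key, demand = pending[0]
        if demand <= fair_share:
            # A small subpopulation is kept in full; its unused share
            # is redistributed among the others.
            allocation[key] = float(demand)
            remaining -= demand
            pending.pop(0)
        else:
            # Every remaining subpopulation exceeds the fair share,
            # so each is capped at the same level.
            for other_key, _ in pending:
                allocation[other_key] = fair_share
            break
    return allocation


# Hypothetical record counts spanning several orders of magnitude.
demands = {"dns": 10, "web": 1000, "video": 100000}
shares = max_min_fair_share(demands, budget=600)
rates = {k: shares[k] / demands[k] for k in demands}
print(shares)  # {'dns': 10.0, 'web': 295.0, 'video': 295.0}
print(rates)
```

In this example the smallest subpopulation is retained completely, while the two larger ones are sampled down to the same number of records, so the small subpopulation is not swamped by the large ones as it would be under a single uniform sampling rate.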



Published In

ACM SIGMETRICS Performance Evaluation Review, Volume 40, Issue 1
June 2012
433 pages
ISSN: 0163-5999
DOI: 10.1145/2318857
  • SIGMETRICS '12: Proceedings of the 12th ACM SIGMETRICS/PERFORMANCE joint international conference on Measurement and Modeling of Computer Systems
    June 2012
    450 pages
    ISBN: 9781450310970
    DOI: 10.1145/2254756
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 11 June 2012
Published in SIGMETRICS Volume 40, Issue 1


Author Tags

  1. IP flows
  2. estimation
  3. max-min fairness
  4. sampling
  5. streaming

