skip to main content
research-article

Designing and embedding reliable virtual infrastructures

Published: 15 April 2011 Publication History

Abstract

In a virtualized infrastructure where physical resources are shared, a single physical server failure will terminate several virtual servers and crippling the virtual infrastructures which contained those virtual servers. In the worst case, more failures may cascade from overloading the remaining servers. To guarantee some level of reliability, each virtual infrastructure, at instantiation, should be augmented with backup virtual nodes and links that have sufficient capacities. This ensures that, when physical failures occur, sufficient computing resources are available and the virtual network topology is preserved. However, in doing so, the utilization of the physical infrastructure may be greatly reduced. This can be circumvented if backup resources are pooled and shared across multiple virtual infrastructures, and intelligently embedded in the physical infrastructure. These techniques can reduce the physical footprint of virtual backups while guaranteeing reliability.

References

[1]
M. Ajtai, N. Alon, J. Bruck, R. Cypher, C. Ho, M. Naor, and E. Szemeredi. Fault tolerant graphs, perfect hash functions and disjoint paths. Symposium on Foundations of Computer Science, 0:693--702, 1992.
[2]
D. Atwood and J. G. Miner. Reducing Data Center Cost with an Air Economizer. http://www.intel. com/it/pdf/Reducing_Data_Center_Cost_with_an_ Air_Economizer.pdf, Aug. 2008.
[3]
S. Bhatia, M. Motiwala, W. Muhlbauer, Y. Mundada, V. Valancius, A. Bavier, N. Feamster, L. Peterson, and J. Rexford. Trellis: a platform for building flexible, fast virtual networks on commodity hardware. In ACM CONEXT '08, pages 1--6, 2008.
[4]
T. C. Bressoud and F. B. Schneider. Hypervisor-based fault tolerance. ACM Trans. Comput. Syst., 14(1):80--107, 1996.
[5]
CBC: Coin-or Branch and Cut. https://projects. coin-or.org/Cbc.
[6]
F. Chang, J. Dean, S. Ghemawat, W. C. Hsieh, D. A. Wallach, M. Burrows, T. Chandra, A. Fikes, and R. E. Gruber. Bigtable: A Distributed Storage System for Structured Data. In USENIX OSDI '06, Nov. 2006.
[7]
N. M. M. K. Chowdhury and R. Boutaba. Network virtualization: State of the art and research challenges. IEEE Communication Magazine, 47(7):20--26, July 2009.
[8]
N. M. M. K. Chowdhury, M. R. Rahman, and R. Boutaba. Virtual Network Embedding with Coordinated Node and Link Mapping. In IEEE INFOCOM '09, Apr. 2009.
[9]
C. Clark, K. Fraser, S. Hand, J. G. Hansen, E. Jul, C. Limpach, I. Pratt, and A. Warfield. Live migration of virtual machines. In USENIX NSDI'05, May 2005.
[10]
B. Cully, G. Lefebvre, D. M. M. Feeleyand, and N. Hutchinson. Remus: High availability via asynchronous virtual machine replication. In USENIX NSDI '08, Apr. 2008.
[11]
A. G. Dimakis, V. Prabhakaran, and K. Ramchandran. Decentralized erasure codes for distributed networked storage. IEEE Trans. Inf. Theory, 52(6):2809--2816, June 2006.
[12]
S. Dutt and N. R. Mahapatra. Node-covering, error-correcting codes and multiprocessors with very high average fault tolerance. IEEE Trans. Comput., 46(9):997--1015, 1997.
[13]
C. Guo, G. Lu, D. Li, H. Wu, X. Zhang, Y. Shi, C. Tian, Y. Zhang, and S. Lu. Bcube: A high performance, server-centric network architecture for modular data centers. In SIGCOMM, Aug. 2009.
[14]
F. Harary and J. P. Hayes. Node fault tolerance in graphs. Networks, 27(1):19--23, 1996.
[15]
A. Kangarlou, P. Eugster, and D. Xu. VNsnap: Taking Snapshots of Virtual Networked Environments with Minimal Downtime. In IEEE/IFIP DSN '09, June 2009.
[16]
G. Koslovski, W.-L. Yeow, C. Westphal, T. T. Huu, J. Montagnat, and P. Vicat-Blanc. Reliability support in virtual infrastructures. In IEEE CloudCom, Dec. 2010.
[17]
K. Lee, H.-W. Lee, and E. Modiano. Reliability in Layered Networks with Random Link Failures. In IEEE INFOCOM '10, Mar. 2010.
[18]
J. Lischka and H. Karl. A Virtual Network Mapping Algorithm based on Subgraph Isomorphism Detection. In VISA '09, Aug. 2009.
[19]
R. N. Mysore, A. Pamboris, N. Farrington, N. Huang, P. Miri, S. Radhakrishnan, V. Subramanya, and A. Vahdat. Portland: A scalable fault-tolerant layer 2 data center network fabric. In SIGCOMM, Aug. 2009.
[20]
P. D. T. O'Connor, D. Newton, and R. Bromley. Practical reliability engineering. John Wiley and Sons, fourth edition, 2002.
[21]
Y. Tamura, K. Sato, S. Kihara, and S. Moriai. Kemari: VM Synchronization for Fault Tolerance. In USENIX '08 Poster Session, June 2008.
[22]
Y. Wang, E. Keller, B. Biskeborn, J. van der Merwe, and J. Rexford. Virtual routers on the move: live router migration as a network-management primitive. In SIGCOMM, Aug. 2008.
[23]
A. Wundsam, A. Mehmood, A. Feldmann, and O. Maennel. Network troubleshooting with shadow vnets. In SIGCOMM Posters & Demos, Aug. 2009.
[24]
W.-L. Yeow, C. Westphal, and U. C. Kozat. Designing and Embedding Reliable Virtual Infrastructures. Technical report, Docomo USA Labs, Mar. 2010. arXiv:1005.5367 {cs.NI}.
[25]
W.-L. Yeow, C. Westphal, and U. C. Kozat. Highly Available Virtual Machines with Network Coding. In IEEE INFOCOM '11 Mini-conference, Apr. 2011.
[26]
M. Yu, Y. Yi, J. Rexford, and M. Chiang. Rethinking virtual network embedding: substrate support for path splitting and migration. SIGCOMM Comput. Commun. Rev., 38(2):17--29, 2008.

Cited By

View all

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM SIGCOMM Computer Communication Review
ACM SIGCOMM Computer Communication Review  Volume 41, Issue 2
April 2011
60 pages
ISSN:0146-4833
DOI:10.1145/1971162
Issue’s Table of Contents

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 15 April 2011
Published in SIGCOMM-CCR Volume 41, Issue 2

Check for updates

Author Tag

  1. infrastructure virtualization

Qualifiers

  • Research-article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)14
  • Downloads (Last 6 weeks)1
Reflects downloads up to 15 Sep 2024

Other Metrics

Citations

Cited By

View all

View Options

Get Access

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media