Reliability and Survivability Analysis of Data Center Network Topologies - Sorbonne Université Access content directly
Journal Articles Journal of Network and Systems Management Year : 2016

Reliability and Survivability Analysis of Data Center Network Topologies

Rodrigo de Souza
  • Function : Author
Stefano Secci
Miguel Elias Mitre Campista
  • Function : Author
Luís Henrique Maciel Kosmalski Costa
  • Function : Author
  • PersonId : 1037287

Abstract

The architecture of several data centers have been proposed as alternatives to the conventional three-layer one. Most of them employ commodity equipment for cost reduction. Thus, robustness to failures becomes even more important, because commodity equipment is more failure-prone. Each architecture has a different network topology design with a specific level of redundancy. In this work, we aim at analyzing the benefits of different data center topologies taking the reliability and survivability requirements into account. We consider the topologies of three alternative data center architecture: Fat-tree, BCube, and DCell. Also, we compare these topologies with a conventional three-layer data center topology. Our analysis is independent of specific equipment, traffic patterns, or network protocols, for the sake of generality. We derive closed-form formulas for the Mean Time To Failure of each topology. The results allow us to indicate the best topology for each failure scenario. In particular, we conclude that BCube is more robust to link failures than the other topologies, whereas DCell has the most robust topology when considering switch failures. Additionally, we show that all considered alternative topologies outperform a three-layer topology for both types of failures. We also determine to which extent the robustness of BCube and DCell is influenced by the number of network interfaces per server.

Dates and versions

hal-01217000 , version 1 (18-10-2015)

Identifiers

Cite

Rodrigo de Souza, Stefano Secci, Miguel Elias Mitre Campista, Luís Henrique Maciel Kosmalski Costa. Reliability and Survivability Analysis of Data Center Network Topologies. Journal of Network and Systems Management, 2016, 24 (2), pp.346-392. ⟨10.1007/s10922-015-9354-8⟩. ⟨hal-01217000⟩
128 View
0 Download

Altmetric

Share

Gmail Facebook X LinkedIn More