• Documents
  • Authors
  • Tables
  • Log in
  • Sign up
  • MetaCart
  • DMCA
  • Donate

CiteSeerX logo

Advanced Search Include Citations
Advanced Search Include Citations | Disambiguate

DMCA

F10: A Fault-Tolerant Engineered Network

Cached

  • Download as a PDF

Download Links

  • [homes.cs.washington.edu]
  • [www.usenix.org]
  • [www.cs.rutgers.edu]
  • [homes.cs.washington.edu]
  • [homes.cs.washington.edu]
  • [homes.cs.washington.edu]

  • Save to List
  • Add to Collection
  • Correct Errors
  • Monitor Changes
by Vincent Liu , Daniel Halperin , Arvind Krishnamurthy , Thomas Anderson
Citations:24 - 4 self
  • Summary
  • Citations
  • Active Bibliography
  • Co-citation
  • Clustered Documents
  • Version History

BibTeX

@MISC{Liu_f10:a,
    author = {Vincent Liu and Daniel Halperin and Arvind Krishnamurthy and Thomas Anderson},
    title = {F10: A Fault-Tolerant Engineered Network},
    year = {}
}

Share

Facebook Twitter Reddit Bibsonomy

OpenURL

 

Abstract

The data center network is increasingly a cost, reliability and performance bottleneck for cloud computing. Although multi-tree topologies can provide scalable bandwidth and traditional routing algorithms can provide eventual fault tolerance, we argue that recovery speed can be dramatically improved through the co-design of the network topology, routing algorithm and failure detector. We create an engineered network and routing protocol that directly address the failure characteristics observed in data centers. At the core of our proposal is a novel network topology that has many of the same desirable properties as FatTrees, but with much better fault recovery properties. We then create a series of failover protocols that benefit from this topology and are designed to cascade and complement each other. The resulting system, F10, can almost instantaneously reestablish connectivity and load balance, even in the presence of multiple failures. Our results show that following network link and switch failures, F10 has less than 1/7th the packet loss of current schemes. A trace-driven evaluation of MapReduce performance shows that F10’s lower packet loss yields a median application-level 30 % speedup. 1

Keyphrases

fault-tolerant engineered network    packet loss    data center network    trace-driven evaluation    desirable property    mapreduce performance show    load balance    eventual fault tolerance    network link    multi-tree topology    recovery speed    performance bottleneck    novel network topology    current scheme    fault recovery property    scalable bandwidth    switch failure    reestablish connectivity    traditional routing algorithm    network topology    multiple failure    failover protocol    failure detector    cloud computing    engineered network    data center    failure characteristic   

Powered by: Apache Solr
  • About CiteSeerX
  • Submit and Index Documents
  • Privacy Policy
  • Help
  • Data
  • Source
  • Contact Us

Developed at and hosted by The College of Information Sciences and Technology

© 2007-2019 The Pennsylvania State University