Abstract
To keep up with the continuous growth in demand, cloud providers spend millions of dollars augmenting the capacity of their wide-area backbones and devote significant effort to efficiently utilizing WAN capacity. A key challenge is striking a good balance between network utilization and availability, as these are inherently at odds; a highly utilized network might not be able to withstand unexpected traffic shifts resulting from link/node failures. We advocate a novel approach to this challenge that draws inspiration from financial risk theory: leverage empirical data to generate a probabilistic model of network failures and maximize bandwidth allocation to network users subject to an operator-specified availability target. Our approach enables network operators to strike the utilization-availability balance that best suits their goals and operational reality. We present TeaVaR (Traffic Engineering Applying Value at Risk), a system that realizes this risk management approach to traffic engineering (TE). We compare TeaVaR to state-of-the-art TE solutions through extensive simulations across many network topologies, failure scenarios, and traffic patterns, including benchmarks extrapolated from Microsoft's WAN. Our results show that with TeaVaR, operators can support up to twice as much throughput as state-of-the-art TE schemes, at the same level of availability.
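The Value at Risk idea named in the abstract can be illustrated with a toy calculation. The sketch below is not TeaVaR's optimization; it only evaluates a fixed bandwidth allocation under assumptions made for illustration: links fail independently, each path uses exactly one link, and traffic on a failed path is dropped. All function names, probabilities, and allocations are hypothetical.

```python
from itertools import product


def scenario_distribution(link_fail_probs):
    """Enumerate every up/down link state with its probability.

    Assumption (not from the abstract): links fail independently.
    """
    scenarios = []
    for state in product([True, False], repeat=len(link_fail_probs)):
        prob = 1.0
        for up, q in zip(state, link_fail_probs):
            prob *= (1.0 - q) if up else q
        scenarios.append((state, prob))
    return scenarios


def loss(state, alloc, demand):
    """Fraction of demand lost when traffic on failed paths is dropped."""
    delivered = sum(a for up, a in zip(state, alloc) if up)
    return max(0.0, 1.0 - delivered / demand)


def value_at_risk(link_fail_probs, alloc, demand, beta):
    """Smallest loss level L such that P(loss <= L) >= beta."""
    outcomes = sorted(
        (loss(state, alloc, demand), prob)
        for state, prob in scenario_distribution(link_fail_probs)
    )
    cumulative = 0.0
    for level, prob in outcomes:
        cumulative += prob
        if cumulative >= beta:
            return level
    return outcomes[-1][0]


# Two single-link paths; the second link fails five times as often.
fail_probs = [0.01, 0.05]
even_split = value_at_risk(fail_probs, [5, 5], demand=10, beta=0.98)
skewed_split = value_at_risk(fail_probs, [8, 2], demand=10, beta=0.98)
print(even_split, skewed_split)
```

In this toy setting, shifting traffic toward the more reliable link lowers the loss that is exceeded with probability at most 2% from 0.5 to about 0.2, which is the kind of trade-off an availability target lets an operator reason about.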
Original language | English |
---|---|
Title of host publication | SIGCOMM 2019 - Proceedings of the 2019 Conference of the ACM Special Interest Group on Data Communication |
Publisher | Association for Computing Machinery, Inc |
Pages | 29-43 |
Number of pages | 15 |
ISBN (Electronic) | 9781450359566 |
DOIs | |
State | Published - 19 Aug 2019 |
Event | 50th Conference of the ACM Special Interest Group on Data Communication, SIGCOMM 2019 - Beijing, China; Duration: 19 Aug 2019 → 23 Aug 2019 |
Publication series
Name | SIGCOMM 2019 - Proceedings of the 2019 Conference of the ACM Special Interest Group on Data Communication |
---|
Conference
Conference | 50th Conference of the ACM Special Interest Group on Data Communication, SIGCOMM 2019 |
---|---|
Country/Territory | China |
City | Beijing |
Period | 19/08/19 → 23/08/19 |
Bibliographical note
Publisher Copyright: © 2019 Association for Computing Machinery.
Keywords
- Availability
- Network optimization
- Traffic engineering
- Utilization