Black History Month is here! Discover ERA research focused on Black experiences in Canada and worldwide. Use our general search below to get started!

An Investigation on Data Center Congestion Control Algorithms

Loading...
Thumbnail Image

Institution

http://id.loc.gov/authorities/names/n79058482

Degree Level

Master's

Degree

Master of Science

Department

Department of Electrical and Computer Engineering

Specialization

Software Engineering and Intelligent Systems

Citation for Previous Publication

Link to Related Item

Abstract

With the rapid growth of data-intensive applications, congestion control algorithms for datacenter networks under RDMA over Converged Ethernet protocol have become vital in managing various traffic patterns that demand ultra-low latency and high end-to-end throughput. Although many rule-based and learning-based algorithms have been proposed to enhance throughput and reduce latency, challenges still remain in ensuring fair and efficient control under dynamic, bursty, and variable-sized flows which are prevalent in datacenter networks. In this thesis, we analyze the traffic control mechanism existing in modern data center networks and propose the Fair Datacenter Congestion Control (FDCC) algorithm based on deep reinforcement learning that leverages Long Short-Term Memory (LSTM) and historical memory augmentation to enable predictive control. FDCC operates from a window-based perspective, and introduces a staged reward design to effectively learn from network states based on in-band network telemetry acquired from various sources along the flow path to achieve fair congestion control under dynamic traffic. We perform extensive experiments under a variety of datacenter traffic patterns, including incast flows, long-short flows, and real-world traffic flows. The results suggest that our approach significantly enhances fairness for dynamic flow patterns over a range of state-of-the-art rule-based and learning-based congestion control algorithms, while maintaining comparable flow completion time and goodput.

Item Type

http://purl.org/coar/resource_type/c_46ec

Alternative

License

Other License Text / Link

This thesis is made available by the University of Alberta Libraries with permission of the copyright owner solely for non-commercial purposes. This thesis, or any portion thereof, may not otherwise be copied or reproduced without the written consent of the copyright owner, except to the extent permitted by Canadian copyright law.

Language

en

Location

Time Period

Source