Data Center Paper Reading
Table of Contents
- Some terminologies
- CSE 599
- Course Website
- Reading List
- A Case for NOW (Networks of Workstations)
- A Scalable, Commodity Data Center Network Architecture
- DONE SiP-ML: high-bandwidth optical network interconnects for machine learning training
- Data Center TCP (DCTCP)
- TIMELY: RTT-based Congestion Control for the Datacenter
- Homa: a receiver-driven low-latency transport protocol using network priorities
- PowerTCP: Pushing the Performance Limits of Datacenter Networks
- Backpressure Flow-Control
- TODO HeMem: Scalable Tiered Memory Management for Big Data Applications and Real NVM
- TODO NanoPU: A Nanosecond Network Stack for Data Centers
- Project
- Other Papers
Some terminologies
Intel Optane
CSE 599
Course Website
Reading List
A Case for NOW (Networks of Workstations)
- Background
- Commercial Motivations
Smaller computers offer better cost-performance than larger computer.
- Lessons for Multi Parallel Processor
- Engineering lag
- software incompatibility
- Motivation of Now:
- Emergence of faster network.
- Emergence of more powerful workstations.
- I/O bottleneck: I/O increased in terms of capacity instead of performance, NOW proposes to use a huge pool of memory.
- Commercial Motivations
- Opportunities
- Overview of the NOW project
A Scalable, Commodity Data Center Network Architecture
DONE SiP-ML: high-bandwidth optical network interconnects for machine learning training
- Goal
Reduce the time-to-accuracy metric.
Increase the throughput of data processing does not necessarily reduce time-to-accuracy (weak scaling).
We want strong-scaling: reduce the computation time per worker, but this requires more extensive data exchange, rendering large bandwidth requirement.
The demand of communication bandwidth grows super-linearly.
- Optical Networks
Silicon Photonics (SiP) offers order-of-magnitude higher bandwidth.
Support dedicated bandwidth as long as there is a path between source/destination nodes.
- Optical Solution for ML?
Control over the traffic patterns by choosing the parallelization strategy and device placement.
The two ends of the spectrum are SiP-OCS and SiP-Ring.
Data Center TCP (DCTCP)
TIMELY: RTT-based Congestion Control for the Datacenter
Homa: a receiver-driven low-latency transport protocol using network priorities
PowerTCP: Pushing the Performance Limits of Datacenter Networks
- paper
- https://www.usenix.org/system/files/nsdi22-paper-addanki_3.pdf
- slide
- https://www.usenix.org/system/files/nsdi22_slides_addanki.pdf
Key idea: power = voltage x current
Viewed in networks:
- voltage: BDP(bandwidth-delay product) + queue length
- Current: transmission rate (bits/s)
Backpressure Flow-Control
TODO HeMem: Scalable Tiered Memory Management for Big Data Applications and Real NVM
- DRAM + NVM tiered memory
The property of NVM
- 8x capacity
- 2x latency
- asymmetric read/write bandwidth
- high overhead for small accesses
- Hardware solution
Example: Intel Optane
- :) No need for OS support
- :) Low overhead (why?)
- :( Not visible to apps
- :( Naive memory management
- Software tiered memory
Examples: HeteroOS, Nimble Page Management
- :) insights into apps
- :) complex memory management
- :( evaluated on emulated environment
- do not scale to NVM capacity
- do support for asymmetric read/write
- limited flexibility (why?)
- Scalable Software solution to Tiered Memory
HeMem leverages asynchronous.