Notes on OneFlow paper

Table of Contents

Motivation

Distributed Deep Learning is bottlenecked by data movement. Translation from logical graph to physical graph is neither automatic nor optimal.

Proposed solution:

  • Data movement as first-class member like computations.
  • Manage the dependencies.
  • Actor-based runtime for asynchorous execution of physical graph.

Author: expye(Zihao Ye)

Email: expye@outlook.com

Date:

Last modified: 2022-12-04 Sun 02:08

Licensed under CC BY-NC 4.0