DevilKing's blog

冷灯看剑,剑上几分功名?炉香无需计苍生,纵一穿烟逝,万丈云埋,孤阳还照古陵

0%

Naiad timely dataflow

原文链接

This model enriches dataflow computation with timestamps that represent logical points in the computation and provide the basis for an efficient, lightweight coordination mechanism

However, no existing system satisfies all three requirements:

  • stream processors can produce low-latency results for non-iterative algorithms
  • batch systems can iterate synchronously at the expense of latency
  • trigger-based approaches support iteration with only weak consistency guarantees
  1. structured loops allowing feedback in the dataflow
  2. stateful dataflow vertices capable of consuming and producing records without global coordination, and
  3. notifications for vertices once they have received all records for a given round of input or loop iteration.
image-20200629170545369

利用loops context来解决问题?

Loop Context in the above graph is a cycle with an ingress(I) and an egress(E) node along with a feedback (F) node.

img

Dataflow graphs use logical timestamps [3]. These logical timestamps contain epoch and loop counter. They are used to track the computation, loop number of the input data.

逻辑时间戳代替实际的时间戳,这样就不存在过去时间的事件,

错误处理,恢复起来,比较缓慢