
Configuration

xitep edited this page May 3, 2017 · 7 revisions

Euphoria Flink Executor

  • euphoria.flink.streaming.windowing.only.after.shuffle: Normally, the input elements of the RBK and RSBK (ReduceByKey and ReduceStateByKey) operators are windowed and transformed before the shuffle to reduce the amount of data being transferred. Under certain circumstances, however, it can be beneficial to do this only after shuffling the original input elements. See #47 for more information. Note that the time-sliding windowing mentioned in that ticket has received a very efficient serialization implementation.
  • euphoria.flink.streaming.descriptors.cache.max.size: The Euphoria Flink streaming executor accesses the underlying storage provider through Flink itself and must therefore translate Euphoria storage descriptors into Flink native descriptors. The result of this translation is cached, since creating Flink native storage descriptors is expensive. The number of storage descriptors in a program is typically very small, since they are meant to be statically defined. This configuration option defines a threshold: Euphoria flows fail if the cache grows beyond the defined value. This prevents accidental out-of-memory errors when descriptors are allocated dynamically.
  • euphoria.flink.streaming.allow.early.emitting: States typically accumulate values until they are flushed, at which point they are allowed to produce output. This configuration option enables experimental support for passing the output "channel" to a state at its creation, allowing the state to produce output early, before the flush. This is known to cause trouble with merging windowing and early triggering. Currently, only the Join operator supports this experimental feature on the Flink streaming executor.
  • euphoria.flink.batch.state.max.memory.elements: Defines the number of items the list storage keeps in memory before it starts spilling to disk. Applies only to the Flink batch executor.
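
As a minimal sketch of how the four options above might be collected in one place, the following Java snippet stores them in a plain `java.util.Properties` object. Only the option keys come from this page. The class name `EuphoriaFlinkConfigSketch`, the method `exampleSettings`, and every value shown are illustrative assumptions, not documented defaults, and how the properties are handed to the executor depends on how your application builds its Euphoria settings.

```java
import java.util.Properties;
import java.util.TreeSet;

// Hypothetical helper class; not part of Euphoria.
public class EuphoriaFlinkConfigSketch {

    // Builds a Properties object holding the executor options from this page.
    // All values are illustrative assumptions, not documented defaults.
    public static Properties exampleSettings() {
        Properties props = new Properties();
        // Window and transform RBK/RSBK input only after the shuffle.
        props.setProperty("euphoria.flink.streaming.windowing.only.after.shuffle", "true");
        // Fail the flow if the descriptor cache grows beyond this many entries.
        props.setProperty("euphoria.flink.streaming.descriptors.cache.max.size", "1000");
        // Experimental early emitting; left disabled in this sketch.
        props.setProperty("euphoria.flink.streaming.allow.early.emitting", "false");
        // Batch executor only: in-memory items before list storage spills to disk.
        props.setProperty("euphoria.flink.batch.state.max.memory.elements", "100000");
        return props;
    }

    public static void main(String[] args) {
        Properties props = exampleSettings();
        // Print the options sorted by key for easy inspection.
        for (String key : new TreeSet<>(props.stringPropertyNames())) {
            System.out.println(key + " = " + props.getProperty(key));
        }
    }
}
```

Keeping the keys in a single helper like this makes it easier to see which options apply to the streaming executor and which apply only to the batch executor.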