美文网首页
Flink总结-Parallel配置

Flink总结-Parallel配置

作者: zachary_1db5 | 来源:发表于2018-04-19 20:49 被阅读0次

Parallel Execution

https://ci.apache.org/projects/flink/flink-docs-master/dev/parallel.html

配置Parallel

Operator Level

final StreamExecutionEnvironment env = StreamExecutionEnvironment.getExecutionEnvironment(); DataStream<String> text = [...] DataStream<Tuple2<String, Integer>> wordCounts = text .flatMap(new LineSplitter()) .keyBy(0) .timeWindow(Time.seconds(5)) .sum(1).setParallelism(5); wordCounts.print(); env.execute("Word Count Example");

Execution Environment Level

final StreamExecutionEnvironment env = StreamExecutionEnvironment.getExecutionEnvironment(); env.setParallelism(3);

Client Level

在提交程序的时候设置

./bin/flink run -p 10 ../examples/WordCount-java.jar

System Level

setting the parallelism.default property in ./conf/flink-conf.yaml

Using the parallelism provided by the remote cluster (16). To use another parallelism, set it at the ./bin/flink client.

这个通常表示,集群能够提供的并行度没有达到用户设置的并行度

在运行命令的时候 -yn 4 -ys 4 决定了程序的并行度。

最大并行度= container个数 * 每个container的槽位

在程序中设置的并行度 parallelism 不能大于 最大并行度

相关文章

网友评论

      本文标题:Flink总结-Parallel配置

      本文链接:https://www.haomeiwen.com/subject/ntqgkftx.html