美文网首页
spark 参数

spark 参数

作者: 博瑜 | 来源:发表于2017-06-19 10:45 被阅读0次

spark.default.parallelism Default number of partitions in RDDs returned by transformations like join, reduceByKey, and parallelize when not set by user.
reducer的个数
spark.sql.shuffle.partitions denote the number of reducers in a join/group-by operation (dataframe)

相关文章

网友评论

      本文标题:spark 参数

      本文链接:https://www.haomeiwen.com/subject/uclgqxtx.html