Spark parameters

Author: 博瑜 | Published: 2017-06-19 10:45

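    spark.default.parallelism: the default number of partitions in RDDs returned by transformations such as join, reduceByKey, and parallelize when the user does not set one explicitly.
    For RDD shuffle operations, this is effectively the number of reducers.
    spark.sql.shuffle.partitions: the number of partitions (reducers) used when shuffling data for a join or group-by operation on a DataFrame. The default is 200.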
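
As a rough illustration of how the two settings might be applied, here is a minimal Python sketch. The values 200 and 400, the app name, and the helper names to_submit_args and build_session are assumptions made for this example, not recommendations from the original post. The sketch renders the settings as spark-submit --conf flags and, if pyspark is installed, can apply them to a SparkSession through the builder's config method.

```python
# Hypothetical values; tune them to the cluster's core count and data volume.
SPARK_CONF = {
    "spark.default.parallelism": "200",     # RDD joins, reduceByKey, parallelize
    "spark.sql.shuffle.partitions": "400",  # DataFrame/SQL joins and group-bys
}


def to_submit_args(conf):
    """Render the settings as a list of spark-submit --conf flags."""
    args = []
    for key, value in sorted(conf.items()):
        args += ["--conf", f"{key}={value}"]
    return args


def build_session(conf, app_name="partition-demo"):
    """Create a SparkSession with the settings applied (requires pyspark)."""
    # Imported lazily so the rest of the module loads without pyspark.
    from pyspark.sql import SparkSession

    builder = SparkSession.builder.appName(app_name)
    for key, value in conf.items():
        builder = builder.config(key, value)
    return builder.getOrCreate()
```

Joining the result of to_submit_args(SPARK_CONF) with spaces gives flags that can be appended to a spark-submit command. Note that spark.default.parallelism only affects RDD operations, while spark.sql.shuffle.partitions only affects DataFrame and SQL shuffles, so a job that mixes both APIs may need both set.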


          Article link: https://www.haomeiwen.com/subject/uclgqxtx.html