spark常用参数

作者: scottzcw | 来源:发表于2020-01-07 15:11 被阅读0次

spark-sql \

--master yarn \

--deploy-mode client \

--num-executors "20" \

--executor-cores "2" \

--executor-memory "6g" \

--driver-memory "6g" \

--conf spark.driver.maxResultSize=4g \

--conf spark.kryoserializer.buffer.max=1024m \

--conf spark.debug.maxToStringFields=999 \

--conf spark.sql.broadcastTimeout=2600 \

--conf spark.network.timeout=1200 \

--conf spark.rpc.askTimeout=1200 \

--conf spark.rpc.lookupTimeout=360 \

--conf spark.locality.wait=10 \

--conf spark.memory.fraction=0.80 \

--conf spark.sql.parquet.writeLegacyFormat=true \

--conf spark.sql.crossJoin.enabled=true \

--hiveconf hive.metastore.execute.setugi=true \

--hiveconf hive.exec.dynamic.partition=true \

--hiveconf hive.exec.dynamic.partition.mode=nonstrict \

--hiveconf hive.exec.max.dynamic.partitions=1000000 \

--hiveconf hive.exec.max.dynamic.partitions.pernode=100000 \

--hiveconf hive.mapred.supports.subdirectories=true \

--hiveconf mapreduce.input.fileinputformat.input.dir.recursive=true -S

网友评论

本文标题：spark常用参数

本文链接：https://www.haomeiwen.com/subject/rsgractx.html

spark常用参数