美文网首页
pyspark线性回归(弹性网)

pyspark线性回归(弹性网)

作者: 米斯特芳 | 来源:发表于2021-08-27 16:59 被阅读0次

没什么好解释的,上代码

from pyspark.ml.regression import LinearRegression
from pyspark.sql import SparkSession

spark = SparkSession\
    .builder\
    .appName("LinearRegressionWithElasticNet")\
    .getOrCreate()

training = spark.read.format("libsvm")\
    .load("sample_linear_regression_data.txt")
# loss:squaredError, huber
# solver:auto, normal, l-bfgs
# elasticNetParam:控制L1正则与L2正则的比例,0即L2,1即L1。计算规则:L1参数为regParam*elasticNetParam,L2参数为regParam*(1-elasticNetParam)
lr = LinearRegression(maxIter=10, regParam=0.3, elasticNetParam=0.8)

lrModel = lr.fit(training)

print("Coefficients: %s" % str(lrModel.coefficients))
print("Intercept: %s" % str(lrModel.intercept))

# Summarize the model over the training set and print out some metrics
trainingSummary = lrModel.summary
print("numIterations: %d" % trainingSummary.totalIterations)
print("objectiveHistory: %s" % str(trainingSummary.objectiveHistory))
trainingSummary.residuals.show()
print("RMSE: %f" % trainingSummary.rootMeanSquaredError)
print("r2: %f" % trainingSummary.r2)

相关文章

网友评论

      本文标题:pyspark线性回归(弹性网)

      本文链接:https://www.haomeiwen.com/subject/qhwmiltx.html