Linear Regression
Hypothesis:
$h_\theta(x) = \theta_0 + \theta_1 x$
Parameters:
$\theta_0, \theta_1$
Cost Function:
$J(\theta_0, \theta_1) = \frac{1}{2m}\sum_{i=1}^{m}\left(h_\theta(x^{(i)}) - y^{(i)}\right)^2$
Goal:
minimize $J(\theta_0, \theta_1)$
Gradient Descent
Outline:
- Start with some $\theta_0, \theta_1$
- Keep changing $\theta_0, \theta_1$ to reduce $J(\theta_0, \theta_1)$, until we end up at a minimum
Algorithm:
repeat until convergence {
    $\theta_j := \theta_j - \alpha \frac{\partial}{\partial \theta_j} J(\theta_0, \theta_1)$ (for $j = 0$ and $j = 1$)
}
Tips:
- Simultaneously update $\theta_0$ and $\theta_1$: compute both new values from the old values before assigning either.
- $\alpha$ is the learning rate:
  - If $\alpha$ is too small, gradient descent can be slow.
  - If $\alpha$ is too large, gradient descent can overshoot the minimum; it may fail to converge, or even diverge.
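As a concrete illustration, here is a minimal NumPy sketch of this update for the univariate case; the data x, y and the settings alpha and num_iters are illustrative assumptions, not part of the original notes:

```python
import numpy as np

def gradient_descent(x, y, alpha=0.01, num_iters=1000):
    """Batch gradient descent for h(x) = theta0 + theta1 * x."""
    m = len(y)
    theta0, theta1 = 0.0, 0.0
    for _ in range(num_iters):
        h = theta0 + theta1 * x                  # predictions for all m examples
        # Simultaneous update: both gradients use the old theta values.
        grad0 = (1.0 / m) * np.sum(h - y)
        grad1 = (1.0 / m) * np.sum((h - y) * x)
        theta0 -= alpha * grad0
        theta1 -= alpha * grad1
    return theta0, theta1

# Hypothetical usage: noisy samples of y = 2x + 1 should recover theta ~ (1, 2).
x = np.linspace(0, 10, 50)
y = 2 * x + 1 + 0.5 * np.random.randn(50)
print(gradient_descent(x, y))
```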

Multivariate Linear Regression
Hypothesis:
If the problem has $n$ features, the hypothesis is:
$h_\theta(x) = \theta_0 + \theta_1 x_1 + \theta_2 x_2 + \cdots + \theta_n x_n$
Assume $x_0 = 1$ and write it in matrix form:
$x = \begin{bmatrix} x_0 \\ x_1 \\ \vdots \\ x_n \end{bmatrix}, \quad \theta = \begin{bmatrix} \theta_0 \\ \theta_1 \\ \vdots \\ \theta_n \end{bmatrix}$
Hence:
$h_\theta(x) = \theta^T x$
Cost Function:
$J(\theta) = \frac{1}{2m}\sum_{i=1}^{m}\left(h_\theta(x^{(i)}) - y^{(i)}\right)^2$
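A minimal sketch of this cost computation, assuming a design matrix X whose first column is all ones (the names X, y, theta are illustrative):

```python
import numpy as np

def compute_cost(X, y, theta):
    """J(theta) = (1 / 2m) * sum((X @ theta - y)^2)."""
    m = len(y)
    error = X @ theta - y        # h_theta(x^(i)) - y^(i) for every example
    return (1.0 / (2 * m)) * np.sum(error ** 2)
```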
Gradient Descent for Multiple Variables
Algorithm:
repeat until convergence {
    $\theta_j := \theta_j - \alpha \frac{1}{m}\sum_{i=1}^{m}\left(h_\theta(x^{(i)}) - y^{(i)}\right) x_j^{(i)}$ (simultaneously update $\theta_j$ for $j = 0, \ldots, n$)
}
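The same update can be written in one vectorized step; this sketch again assumes X is an m x (n+1) matrix whose first column is all ones:

```python
import numpy as np

def gradient_descent_multi(X, y, alpha=0.01, num_iters=1000):
    """Vectorized batch gradient descent over all theta_j at once."""
    m = len(y)
    theta = np.zeros(X.shape[1])
    for _ in range(num_iters):
        error = X @ theta - y                       # h_theta(x^(i)) - y^(i)
        theta -= alpha * (1.0 / m) * (X.T @ error)  # simultaneous update of every theta_j
    return theta
```

Updating theta as a single vector is what makes the update simultaneous: no component is overwritten before the others are computed.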
Feature Scaling
Goal:
Get every feature into approximately a $-1 \le x_i \le 1$ range.
Algorithm:
$x_i := \dfrac{x_i}{s_i}$, where $s_i$ is the range ($\max - \min$) of feature $i$.
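A sketch of this scaling applied column-wise to a feature matrix (assuming every column has a nonzero range; the bias column of ones would be excluded):

```python
import numpy as np

def scale_features(X):
    """Divide each feature column by its range (max - min)."""
    s = X.max(axis=0) - X.min(axis=0)   # s_i, assumed nonzero for every feature
    return X / s
```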

Mean Normalization
Goal:
Replace $x_i$ with $x_i - \mu_i$ to make features have approximately zero mean (does not apply to $x_0 = 1$).
Algorithm:
$x_i := \dfrac{x_i - \mu_i}{s_i}$, where $\mu_i$ is the mean of feature $i$ over the training set.
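A sketch combining both steps, column-wise over a feature matrix (again excluding the bias column; the range is used for $s_i$, though the standard deviation also works):

```python
import numpy as np

def mean_normalize(X):
    """Replace x_i with (x_i - mu_i) / s_i so each feature has ~zero mean."""
    mu = X.mean(axis=0)                  # mu_i: per-feature mean
    s = X.max(axis=0) - X.min(axis=0)    # s_i: per-feature range
    return (X - mu) / s
```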
Polynomial Regression
Define new features from powers of an original feature, e.g. $x_1 = x$, $x_2 = x^2$, $x_3 = x^3$, so that
$h_\theta(x) = \theta_0 + \theta_1 x + \theta_2 x^2 + \theta_3 x^3$
can be fit with the same linear-regression machinery. Feature scaling becomes important here, since the powers of $x$ have very different ranges.
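A sketch of building the polynomial design matrix so the multivariate machinery above can be reused (the function name and degree are illustrative):

```python
import numpy as np

def polynomial_features(x, degree=3):
    """Build the design matrix [1, x, x^2, ..., x^degree] from a 1-D input."""
    return np.column_stack([x ** d for d in range(degree + 1)])

x = np.linspace(0, 2, 20)
X = polynomial_features(x)   # columns: 1, x, x^2, x^3
# Feature scaling matters here: x^3 spans a much wider range than x.
```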

Advantages and disadvantages of gradient descent vs. the normal equation:
- Gradient descent: needs to choose the learning rate $\alpha$ and may need many iterations, but works well even when the number of features $n$ is large.
- Normal equation: no need to choose $\alpha$ and no iterations, but it must compute $(X^T X)^{-1}$, which is roughly $O(n^3)$ and becomes slow when $n$ is very large.

Normal Equation
Solve for the optimal $\theta$ analytically in one step:
$\theta = (X^T X)^{-1} X^T y$
where $X$ is the $m \times (n+1)$ design matrix and $y$ is the vector of training targets.
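A sketch of the closed-form solve; using the pseudoinverse (np.linalg.pinv) also covers the case where $X^T X$ is non-invertible, e.g. redundant features:

```python
import numpy as np

def normal_equation(X, y):
    """theta = (X^T X)^(-1) X^T y, via the pseudoinverse for robustness."""
    return np.linalg.pinv(X.T @ X) @ X.T @ y
```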