Logistic Regression Model

Author: shenghaishxt | Published: 2019-03-14 14:33

This article is from my personal blog: https://www.zhangshenghai.com/posts/48429/

Logistic regression is a classical method in statistical learning and belongs to the family of log-linear models.

Binomial Logistic Regression Model

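Composing the linear regression function with the logistic function gives the logistic regression function. The binomial logistic regression model is a classification model, defined by the following conditional probability distribution: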

\begin{align}
P \left( Y = 1 | x \right) &= \dfrac{1}{1+\exp\left(-\left(w \cdot x + b \right)\right)} \\
&= \dfrac{\exp\left(w \cdot x + b \right)}{\left( 1+\exp\left(-\left(w \cdot x + b \right)\right)\right) \cdot \exp\left(w \cdot x + b \right)} \\
&= \dfrac{\exp\left(w \cdot x + b \right)}{1+\exp\left( w \cdot x + b \right)} \\
P \left( Y = 0 | x \right) &= 1- P \left( Y = 1 | x \right) \\
&= 1- \dfrac{\exp\left(w \cdot x + b \right)}{1+\exp\left( w \cdot x + b \right)} \\
&= \dfrac{1}{1+\exp\left( w \cdot x + b \right)}
\end{align}

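Here $x \in R^{n}$ is the input, $Y \in \left\{ 0, 1 \right\}$ is the output, and $w \in R^{n}$ and $b \in R$ are the parameters. $w$ is called the weight vector, $b$ is called the bias, and $w \cdot x$ is the inner product of $w$ and $x$.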

Logistic regression compares the two conditional probabilities and assigns the instance $x$ to the class with the larger probability.
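
As a minimal sketch of the model and decision rule above, the following Python snippet evaluates both conditional probabilities and picks the larger one. The function names `predict_proba` and `classify`, and the values of `w`, `b`, and `x`, are illustrative assumptions and do not come from the original post.

```python
import math

def predict_proba(w, b, x):
    """Return (P(Y=1|x), P(Y=0|x)) for the binomial logistic model."""
    z = sum(wi * xi for wi, xi in zip(w, x)) + b  # w . x + b
    # Branch on the sign of z so exp() never receives a large positive argument.
    if z >= 0:
        p1 = 1.0 / (1.0 + math.exp(-z))
    else:
        ez = math.exp(z)
        p1 = ez / (1.0 + ez)
    return p1, 1.0 - p1

def classify(w, b, x):
    """Assign x to the class with the larger conditional probability.

    Ties go to class 0 here, which is an arbitrary choice.
    """
    p1, p0 = predict_proba(w, b, x)
    return 1 if p1 > p0 else 0

# Hypothetical parameters and input, chosen only for illustration.
w, b = [2.0, -1.0], 0.5
x = [1.0, 3.0]  # w . x + b = 2 - 3 + 0.5 = -0.5
p1, p0 = predict_proba(w, b, x)
label = classify(w, b, x)
```

Because $w \cdot x + b$ is negative for this input, $P(Y=1|x)$ is below one half and the instance is assigned to class 0.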

The weight vector and the input vector can be augmented, i.e. $w = \left( w^{\left(1\right)},w^{\left(2\right)},\cdots,w^{\left(n\right)},b \right)^{T}$ and $x = \left( x^{\left(1\right)},x^{\left(2\right)},\cdots,x^{\left(n\right)},1 \right)^{T}$, so that the logistic regression model becomes:
\begin{align*}
P \left( Y = 1 | x \right) &= \dfrac{\exp\left(w \cdot x \right)}{1+\exp\left( w \cdot x \right)} \\
P \left( Y = 0 | x \right) &= \dfrac{1}{1+\exp\left( w \cdot x \right)}
\end{align*}
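
The augmentation only moves the bias into the inner product, so both forms should give the same probability. A small sketch that checks this numerically, with hypothetical values and helper names (`sigmoid`, `dot`) that are not part of the original post:

```python
import math

def sigmoid(z):
    """Logistic function 1 / (1 + exp(-z)); adequate for moderate z."""
    return 1.0 / (1.0 + math.exp(-z))

def dot(u, v):
    """Inner product of two equal-length sequences."""
    return sum(ui * vi for ui, vi in zip(u, v))

# Hypothetical parameters and input.
w, b = [2.0, -1.0], 0.5
x = [1.0, 3.0]

w_aug = w + [b]    # (w^(1), ..., w^(n), b)
x_aug = x + [1.0]  # (x^(1), ..., x^(n), 1)

p_original = sigmoid(dot(w, x) + b)       # form with an explicit bias
p_augmented = sigmoid(dot(w_aug, x_aug))  # augmented form, no separate bias
```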

Model Parameter Estimation

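When learning a logistic regression model from a given training set $T = \left\{ \left( x_{1}, y_{1} \right), \left( x_{2}, y_{2} \right), \cdots, \left( x_{N}, y_{N} \right) \right\}$, where $x_{i} \in R^{n+1}$, $y_{i} \in \left\{ 0, 1 \right\}$, $i = 1, 2, \cdots, N$, maximum likelihood estimation can be used to estimate the model parameters and thereby obtain the logistic regression model.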

Let:
\begin{align*}
P \left( Y =1 | x \right) = \pi \left( x \right) ,\quad P \left( Y =0 | x \right) = 1 - \pi \left( x \right)
\end{align*}
The likelihood function is:
\begin{align*}
l \left( w \right) &= \prod_{i=1}^{N} P \left( y_{i} | x_{i} \right) \\
&= \prod_{i=1}^{N} \left[ P \left( Y = 1 | x_{i} , w \right) \right]^{y_{i}} \left[ P \left( Y = 0 | x_{i}, w \right) \right]^{1 - y_{i}} \\
&= \prod_{i=1}^{N} \left[ \pi \left( x_{i} \right) \right]^{y_{i}}\left[ 1 - \pi \left( x_{i} \right) \right]^{1 - y_{i}}
\end{align*}
The log-likelihood function is:
\begin{align*}
L \left( w \right) &= \log l \left( w \right) \\
&= \sum_{i=1}^{N} \left[ y_{i} \log \pi \left( x_{i} \right) + \left( 1 - y_{i} \right) \log \left( 1 - \pi \left( x_{i} \right) \right) \right] \\
&= \sum_{i=1}^{N} \left[ y_{i} \log \dfrac{\pi \left( x_{i} \right)}{1- \pi \left( x_{i} \right)} + \log \left( 1 - \pi \left( x_{i} \right) \right) \right] \\
&= \sum_{i=1}^{N} \left[ y_{i} \left( w \cdot x_{i} \right) - \log \left( 1 + \exp \left( w \cdot x_{i} \right) \right) \right]
\end{align*}
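The last step of the derivation uses two identities that follow from the model definition: $\log \dfrac{\pi \left( x_{i} \right)}{1 - \pi \left( x_{i} \right)} = w \cdot x_{i}$ and $\log \left( 1 - \pi \left( x_{i} \right) \right) = - \log \left( 1 + \exp \left( w \cdot x_{i} \right) \right)$. As a sanity check, the sketch below evaluates the log-likelihood in both its direct and simplified forms on a small made-up data set. The data, the weights, and the function names are assumptions for illustration only.

```python
import math

def dot(u, v):
    """Inner product of two equal-length sequences."""
    return sum(ui * vi for ui, vi in zip(u, v))

def log_likelihood_direct(w, X, y):
    """sum_i [ y_i log pi(x_i) + (1 - y_i) log(1 - pi(x_i)) ]"""
    total = 0.0
    for xi, yi in zip(X, y):
        p = 1.0 / (1.0 + math.exp(-dot(w, xi)))  # pi(x_i)
        total += yi * math.log(p) + (1 - yi) * math.log(1.0 - p)
    return total

def log_likelihood_simplified(w, X, y):
    """sum_i [ y_i (w . x_i) - log(1 + exp(w . x_i)) ]"""
    total = 0.0
    for xi, yi in zip(X, y):
        z = dot(w, xi)
        total += yi * z - math.log1p(math.exp(z))
    return total

# Hypothetical augmented inputs (last component is 1), labels, and weights.
X = [[0.5, 1.0, 1.0], [-1.5, 0.2, 1.0], [2.0, -0.3, 1.0], [0.1, 0.1, 1.0]]
y = [1, 0, 1, 0]
w = [0.8, -0.4, 0.1]

direct = log_likelihood_direct(w, X, y)
simplified = log_likelihood_simplified(w, X, y)
```

The two values agree up to floating-point rounding, and both are negative because every factor of the likelihood is a probability below 1.
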
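Maximizing $L(w)$ yields the estimate of $w$. The problem thus becomes an optimization problem with the log-likelihood function as the objective. Gradient descent and quasi-Newton methods are the approaches commonly used in logistic regression learning.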
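
A minimal sketch of the gradient-based approach, written as gradient ascent on $L(w)$ (equivalent to gradient descent on $-L(w)$). It relies on the gradient $\nabla L(w) = \sum_{i=1}^{N} \left( y_{i} - \pi \left( x_{i} \right) \right) x_{i}$, which is not stated in the original post but follows from differentiating the last line of the log-likelihood. The step size `lr`, the iteration count `n_iter`, the function names, and the toy data are all assumptions chosen for illustration.

```python
import math

def dot(u, v):
    """Inner product of two equal-length sequences."""
    return sum(ui * vi for ui, vi in zip(u, v))

def sigmoid(z):
    """Logistic function, branching on the sign of z to avoid overflow."""
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    ez = math.exp(z)
    return ez / (1.0 + ez)

def log_likelihood(w, X, y):
    """L(w) = sum_i [ y_i (w . x_i) - log(1 + exp(w . x_i)) ]"""
    total = 0.0
    for xi, yi in zip(X, y):
        z = dot(w, xi)
        # log(1 + exp(z)) computed as max(z, 0) + log1p(exp(-|z|)) for stability.
        total += yi * z - (max(z, 0.0) + math.log1p(math.exp(-abs(z))))
    return total

def fit(X, y, lr=0.1, n_iter=500):
    """Batch gradient ascent on L(w), starting from w = 0."""
    w = [0.0] * len(X[0])
    for _ in range(n_iter):
        grad = [0.0] * len(w)
        for xi, yi in zip(X, y):
            err = yi - sigmoid(dot(w, xi))  # y_i - pi(x_i)
            for j, xij in enumerate(xi):
                grad[j] += err * xij
        w = [wj + lr * gj for wj, gj in zip(w, grad)]  # ascent step
    return w

# Hypothetical 1-D inputs augmented with a constant 1. The classes overlap,
# so the maximum likelihood estimate is finite.
X = [[-2.0, 1.0], [-1.0, 1.0], [0.0, 1.0], [0.5, 1.0], [1.0, 1.0], [2.0, 1.0]]
y = [0, 0, 1, 1, 0, 1]

w_hat = fit(X, y)
```

A quasi-Newton method would replace the fixed step `lr` with a step derived from an approximation of the Hessian; this sketch uses plain gradient steps only to keep the example short.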

Suppose the maximum likelihood estimate of $w$ is $\hat{w}$. The learned logistic regression model is then:

\begin{align}
P \left( Y = 1 | x \right) &= \dfrac{\exp\left(\hat{w} \cdot x \right)}{1+\exp\left( \hat{w} \cdot x \right)} \\
P \left( Y = 0 | x \right) &= \dfrac{1}{1+\exp\left( \hat{w} \cdot x \right)}
\end{align}

Multinomial Logistic Regression Model

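The binomial logistic regression model described above can be generalized to the multinomial logistic regression model, which is used for multi-class classification.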

Suppose the discrete random variable $Y$ takes values in the set $\left\{ 1, 2, \cdots, K \right\}$. The multinomial logistic regression model is then:
\begin{align*}
P \left( Y = k | x \right) &= \dfrac{\exp\left(w_{k} \cdot x \right)}{1+ \sum_{j=1}^{K-1}\exp\left( w_{j} \cdot x \right)}, \quad k=1,2,\cdots,K-1 \\
P \left( Y = K | x \right) &= 1 - \sum_{k=1}^{K-1} P \left( Y = k | x \right) \\
&= 1 - \sum_{k=1}^{K-1} \dfrac{\exp\left(w_{k} \cdot x \right)}{1+ \sum_{j=1}^{K-1}\exp\left( w_{j} \cdot x \right)} \\
&= \dfrac{1}{1+ \sum_{j=1}^{K-1}\exp\left( w_{j} \cdot x \right)}
\end{align*}
The parameter estimation method for binomial logistic regression also extends to multinomial logistic regression.
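
The multinomial model can be sketched in a few lines of Python: $K-1$ weight vectors produce the first $K-1$ probabilities, and class $K$ receives the remaining mass. The function name `multinomial_proba`, the weights `W`, and the input `x` are hypothetical and chosen only for illustration.

```python
import math

def multinomial_proba(W, x):
    """Return [P(Y=1|x), ..., P(Y=K|x)] given K-1 weight vectors in W.

    No overflow protection is applied, so this is only suitable for
    moderate values of w_k . x.
    """
    scores = [math.exp(sum(wi * xi for wi, xi in zip(wk, x))) for wk in W]
    denom = 1.0 + sum(scores)            # 1 + sum_j exp(w_j . x)
    probs = [s / denom for s in scores]  # classes 1 .. K-1
    probs.append(1.0 / denom)            # class K
    return probs

# Hypothetical weights for K = 3 classes and an augmented input (last entry 1).
W = [[1.0, -0.5, 0.2], [0.3, 0.8, -0.1]]
x = [0.5, 1.5, 1.0]

probs = multinomial_proba(W, x)
predicted_class = probs.index(max(probs)) + 1  # classes are numbered from 1
```

With a single weight vector ($K = 2$) the function reduces to the binomial model, with class 1 playing the role of $Y = 1$.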
