Several Ways to Evaluate a Classifier: Precision, Recall, F

Author: 环境与方法 | Published: 2017-10-09 02:48


Precision and recall are commonly used in information retrieval and text classification.

When we classify a set of samples, each sample ends up in one of four outcomes:

Actual class       Classified Positive   Classified Negative
Actual Positive    TP                    FN
Actual Negative    FP                    TN

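where:
TP = True Positive (an actual positive classified as positive)
FP = False Positive (an actual negative classified as positive)
FN = False Negative (an actual positive classified as negative)
TN = True Negative (an actual negative classified as negative)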

The table above is called the confusion matrix.
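As a minimal sketch of how the four cells can be tallied, the snippet below counts TP, FN, FP, and TN from two label lists. The function name `confusion_counts`, the `positive` parameter, and the sample labels are all hypothetical and made up for illustration; it assumes a binary problem where one label value marks the positive class.

```python
from collections import Counter


def confusion_counts(y_true, y_pred, positive=1):
    """Tally the four confusion-matrix cells for a binary problem.

    Hypothetical helper: `positive` is the label value treated as the
    positive class; every other value counts as negative.
    """
    counts = Counter()
    for actual, predicted in zip(y_true, y_pred):
        if actual == positive and predicted == positive:
            counts["TP"] += 1  # actual positive, classified positive
        elif actual == positive:
            counts["FN"] += 1  # actual positive, classified negative
        elif predicted == positive:
            counts["FP"] += 1  # actual negative, classified positive
        else:
            counts["TN"] += 1  # actual negative, classified negative
    # Return all four cells, including any that stayed at zero
    return {cell: counts[cell] for cell in ("TP", "FN", "FP", "TN")}


# Made-up labels for illustration only
y_true = [1, 1, 1, 0, 0, 0, 1, 0]
y_pred = [1, 0, 1, 0, 1, 0, 1, 0]
counts = confusion_counts(y_true, y_pred)
print(counts)
```

Each sample lands in exactly one cell, so the four counts always sum to the number of samples.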

Next, we introduce three metrics for evaluating a classifier:

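1. Precision (P): the number of correctly classified positive examples divided by the total number of examples classified as positive. In other words, of the samples predicted positive, how many are truly positive. For a retrieval system, it measures how exact the returned results are.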

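2. Recall (R): the number of correctly classified positive examples divided by the total number of actual positive examples in the test set. In other words, of the actual positives, how many were predicted correctly. For a retrieval system, it measures how complete the returned results are.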

3. F1-Score: a single measure that combines precision and recall.

The formulas are as follows:
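In terms of the confusion-matrix counts, the standard definitions are P = TP / (TP + FP), R = TP / (TP + FN), and F1 = 2PR / (P + R), the harmonic mean of P and R. A minimal Python sketch under those definitions follows. The function names and the counts are hypothetical, and returning 0.0 when a denominator is zero is an assumption made here, not something the definitions prescribe.

```python
def precision(tp, fp):
    """P = TP / (TP + FP); returns 0.0 if nothing was classified positive (assumption)."""
    return tp / (tp + fp) if (tp + fp) else 0.0


def recall(tp, fn):
    """R = TP / (TP + FN); returns 0.0 if there are no actual positives (assumption)."""
    return tp / (tp + fn) if (tp + fn) else 0.0


def f1_score(p, r):
    """F1 = 2PR / (P + R), the harmonic mean of precision and recall."""
    return 2 * p * r / (p + r) if (p + r) else 0.0


# Made-up counts for illustration only
tp, fn, fp = 30, 10, 20

p = precision(tp, fp)   # 30 / (30 + 20)
r = recall(tp, fn)      # 30 / (30 + 10)
f1 = f1_score(p, r)

print(f"precision={p:.3f} recall={r:.3f} f1={f1:.3f}")
```

Note that TN does not appear in any of the three formulas, so these metrics say nothing about how well the negative class is handled.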