R Language (1): SVM & LDA

Author: hlyyllyyl | Published 2018-09-18 00:25

Tutorial7

Yangkai Hong

17/09/2018

The R package “e1071” provides an implementation of SVM with a choice of kernels. Use it to classify the Ionosphere dataset from the “mlbench” package:

library(e1071)
library(mlbench)
data(Ionosphere)

(1) Linear kernel, polynomial kernel with different degrees, and radial basis kernel.

# Fit one SVM per kernel on the full data set; column 35 is the class label.
linear.model <- svm(x = Ionosphere[, -35], y = Ionosphere[, 35],
                    kernel = 'linear', type = 'C-classification', scale = FALSE)
poly3.model  <- svm(x = Ionosphere[, -35], y = Ionosphere[, 35],
                    kernel = 'polynomial', degree = 3, type = 'C-classification', scale = FALSE)
poly6.model  <- svm(x = Ionosphere[, -35], y = Ionosphere[, 35],
                    kernel = 'polynomial', degree = 6, type = 'C-classification', scale = FALSE)
radial.model <- svm(x = Ionosphere[, -35], y = Ionosphere[, 35],
                    kernel = 'radial', type = 'C-classification', scale = FALSE)
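
For reference, the three kernel families compared here can be written out by hand. This is a minimal base-R sketch that follows the kernel formulas given in the e1071 documentation (linear: u'v; polynomial: (gamma * u'v + coef0)^degree; radial: exp(-gamma * |u - v|^2)). The helper names and the two example vectors are illustrative assumptions, and the defaults shown (gamma = 1 / number of features, coef0 = 0) mirror the documented svm() defaults rather than anything tuned for Ionosphere.

```r
# Hand-written versions of the kernels, for illustration only.
linear_kernel <- function(u, v) sum(u * v)

poly_kernel <- function(u, v, degree = 3, gamma = 1 / length(u), coef0 = 0) {
  (gamma * sum(u * v) + coef0)^degree
}

rbf_kernel <- function(u, v, gamma = 1 / length(u)) {
  exp(-gamma * sum((u - v)^2))
}

# Two made-up feature vectors with four features each.
u <- c(1, 0.5, -0.25, 0)
v <- c(1, -0.5, 0.75, 0.25)

print(linear_kernel(u, v))            # inner product: 0.5625
print(poly_kernel(u, v, degree = 3))  # (0.25 * 0.5625)^3
print(poly_kernel(u, v, degree = 6))  # same base raised to the 6th power
print(rbf_kernel(u, v))               # similarity in (0, 1]
```

With coef0 = 0 and gamma * u'v below 1 in absolute value, the polynomial kernel value shrinks quickly as the degree grows, which is worth keeping in mind when comparing degree 3 against degree 6 on unscaled features.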

(2) Benchmark your classification accuracy using 10-fold cross-validation.

library(caret)
set.seed(1)
fold <- createFolds(Ionosphere$Class, k = 10)  # list of 10 held-out index vectors

# For each held-out fold, refit the SVM on the other nine folds, so that
# every observation is predicted by a model that never saw it.
cvAccuracy <- function(...) {
  nTrue <- 0
  for (test in fold) {
    model <- svm(x = Ionosphere[-test, -35], y = Ionosphere[-test, 35],
                 type = 'C-classification', scale = FALSE, ...)
    preds <- predict(model, newdata = Ionosphere[test, -35])
    nTrue <- nTrue + sum(preds == Ionosphere$Class[test])
  }
  nTrue / nrow(Ionosphere)  # accuracy pooled over all held-out folds
}
cat("Linear kernel accuracy:", cvAccuracy(kernel = 'linear'), "\n")
cat("Polynomial kernel with degree 3 accuracy:", cvAccuracy(kernel = 'polynomial', degree = 3), "\n")
cat("Polynomial kernel with degree 6 accuracy:", cvAccuracy(kernel = 'polynomial', degree = 6), "\n")
cat("Radial kernel accuracy:", cvAccuracy(kernel = 'radial'), "\n")

# Reported output. These figures came from models fitted once on all 351
# observations and then scored on the folds, so they are training accuracies;
# the per-fold refits above will give different values.
## Linear kernel accuracy: 0.923076923076923
## Polynomial kernel with degree 3 accuracy: 0.689458689458689
## Polynomial kernel with degree 6 accuracy: 0.641025641025641
## Radial kernel accuracy: 0.945868945868946
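
When the outcome is a factor, createFolds samples within each class level, so every fold keeps roughly the overall class mix. The base-R sketch below imitates that idea by hand to show what a stratified 10-fold split looks like. It is not caret's actual implementation: the label vector is a synthetic stand-in with Ionosphere's class counts (126 bad, 225 good), and names such as fold_id are illustrative.

```r
set.seed(1)

# Synthetic labels with the same class counts as Ionosphere$Class.
labels <- factor(rep(c("bad", "good"), times = c(126, 225)))
k <- 10

# Assign fold numbers separately within each class, then shuffle them,
# so each fold receives a near-equal share of every class.
fold_id <- integer(length(labels))
for (cls in levels(labels)) {
  idx <- which(labels == cls)
  fold_id[idx] <- sample(rep(seq_len(k), length.out = length(idx)))
}

# One vector of held-out row indices per fold, like the list createFolds returns.
folds <- split(seq_along(labels), fold_id)

# Class counts per fold: each column should show 12-13 bad and 22-23 good.
print(sapply(folds, function(i) table(labels[i])))
```

Each row index lands in exactly one fold, so looping over the folds and predicting the held-out rows scores every observation once.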

(3) Repeat the above classification using LDA with 10-fold cross-validation.

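library(MASS)
ionosphere <- Ionosphere[, -2]  # drop V2, which is constant
ldaTrue <- 0
for (test in fold) {
  # Fit on the other nine folds only; Class is column 34 once V2 is dropped.
  lda.model <- lda(Class ~ ., data = ionosphere[-test, ])
  # Posterior probability of class 'good' for the held-out fold.
  ldaPreds <- predict(lda.model, ionosphere[test, -34])$posterior[, 'good']
  lda.decision <- ifelse(ldaPreds > 0.5, 'good', 'bad')
  ldaTrue <- ldaTrue + sum(lda.decision == ionosphere$Class[test])
}
cat("LDA accuracy:", ldaTrue / nrow(ionosphere), "\n")

# Reported output. This figure came from one LDA model fitted on all 351
# observations and scored on the folds, so it is a training accuracy;
# the per-fold refits above will give a different value.
## LDA accuracy: 0.9002849002849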
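
To make the LDA decision rule concrete, here is a minimal base-R sketch of two-class linear discriminant analysis written out by hand. It is an illustration under stated assumptions, not the algorithm MASS::lda runs internally: the data are two synthetic Gaussian clouds (not Ionosphere), the classes are equally sized so equal priors are assumed, and every variable name is made up for the example.

```r
set.seed(1)
n <- 50

# Two synthetic classes in two dimensions with a shared unit covariance.
x0 <- matrix(rnorm(2 * n), ncol = 2)            # class "bad", centred at (0, 0)
x1 <- matrix(rnorm(2 * n, mean = 4), ncol = 2)  # class "good", centred at (4, 4)

mu0 <- colMeans(x0)
mu1 <- colMeans(x1)

# Pooled within-class covariance, the quantity LDA assumes both classes share.
Sw <- ((n - 1) * cov(x0) + (n - 1) * cov(x1)) / (2 * n - 2)

# Discriminant direction and the midpoint threshold (equal priors).
w <- solve(Sw, mu1 - mu0)
threshold <- sum(w * (mu0 + mu1) / 2)

# Project every point onto w and classify by which side of the threshold it falls.
scores <- as.vector(rbind(x0, x1) %*% w)
pred <- ifelse(scores > threshold, "good", "bad")
truth <- rep(c("bad", "good"), each = n)
accuracy <- mean(pred == truth)
print(accuracy)
```

The boundary scores == threshold is a straight line (a hyperplane in higher dimensions), which is why LDA accuracy is a reasonable indicator of how well a linear boundary can separate the classes.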

(4) Comment on the linear separability of the data based on the classification result using SVM with different kernels and LDA.

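The data appear to be close to linearly separable. Both linear classifiers score well, with the linear-kernel SVM at about 0.92 and LDA at about 0.90, and the radial kernel improves on them only slightly (about 0.95). The polynomial kernels of degree 3 and degree 6 do much worse (about 0.69 and 0.64), so adding polynomial terms does not help on this dataset.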
