Day6-Gloria-FLY:
1、下载R包:
1.镜像设置:
(1)> options("repos" =c(CRAN="https://mirrors.tuna.tsinghua.edu.cn/CRAN/"))
(2)> options(BioC_mirror="https://mirrors.ustc.edu.cn/bioc/")
2.下载:install.packages(“包名”)(注意符号也应为英文的)
3.加载:library(包名);require(包名)
![](https://img.haomeiwen.com/i25704956/b164120b20e9370e.png)
总结:options("repos" = c(CRAN="https://mirrors.tuna.tsinghua.edu.cn/CRAN/"))
options(BioC_mirror="https://mirrors.ustc.edu.cn/bioc/")
install.packages("dplyr")
library(dplyr)
2、dplyr五个基础函数:
1.mutate(),新增列
![](https://img.haomeiwen.com/i25704956/406c6d4b3b1edc26.png)
2.select(),按列筛选
(1)按列号筛选
![](https://img.haomeiwen.com/i25704956/b5104f0dc2f4aa19.png)
(2)按列名筛选
![](https://img.haomeiwen.com/i25704956/d9ca4522319f3d50.png)
3.filter()筛选行
![](https://img.haomeiwen.com/i25704956/940006cfa7a8dbcc.png)
4.arrange(),按某1列或某几列对整个表格进行排序
![](https://img.haomeiwen.com/i25704956/00f9f6f6229e18f9.png)
5.summarise():汇总
![](https://img.haomeiwen.com/i25704956/0083411119f6ab72.png)
![](https://img.haomeiwen.com/i25704956/6a342f4dadeb8ef3.png)
3、dplyr两个实用技能:
1:管道操作 %>% (cmd/ctr + shift + M)
![](https://img.haomeiwen.com/i25704956/a7941d360634bdc8.png)
2:count统计某列的unique值
![](https://img.haomeiwen.com/i25704956/0316d957c936079d.png)
4、dplyr处理关系数据:
1、将2个表进行连接,注意:不要引入factor
![](https://img.haomeiwen.com/i25704956/65450fb07e828d81.png)
2、內连inner_join,取交集
![](https://img.haomeiwen.com/i25704956/fcdcba720b255611.png)
3、左连left_join
![](https://img.haomeiwen.com/i25704956/49d2f272f4ec8c95.png)
4、全连full_join
![](https://img.haomeiwen.com/i25704956/aa68e7107784e562.png)
5、半连接:返回能够与y表匹配的x表所有记录semi_join
![](https://img.haomeiwen.com/i25704956/bf321c6298126fa9.png)
6、反连接:返回无法与y表匹配的x表的所记录anti_join
![](https://img.haomeiwen.com/i25704956/bc17a776e8c67df4.png)
7、简单合并:在相当于base包里的cbind()函数和rbind()函数;注意,bind_rows()函数需要两个表格列数相同,而bind_cols()函数则需要两个数据框有相同的行数
![](https://img.haomeiwen.com/i25704956/0819bc7a01940b7a.png)
![](https://img.haomeiwen.com/i25704956/71c05e9f58b2f8bb.png)
ps:今天的课操作不难,但需要掌握很多函数的意义。
网友评论