和Python对应,用R如何清理数据呢?
R主要用到两个包:tidyr
和dplyr
library(tidyr)
?gather
?spread
data:image/s3,"s3://crabby-images/d1b6b/d1b6b19e44ac92442e708a0ba474c199f72b612b" alt=""
一、单独表的“长宽互变”
依旧按场景来看。
(一)“省份—年份”式
data:image/s3,"s3://crabby-images/11d30/11d3095782184dfc9f71c151451a834341493912" alt=""
gather(pf, 'year', 'n', 2:4)
data:image/s3,"s3://crabby-images/29618/296180f3ef263e84af74880476a4d419eeba2e9e" alt=""
(二)“省份—分类”式
data:image/s3,"s3://crabby-images/ddc56/ddc5632364b261b7d38382f6f91da433470219d3" alt=""
spread(pf, size, amount)
data:image/s3,"s3://crabby-images/d8e34/d8e348a4c80166dcd5e9455422274b2ab9e77651" alt=""
(三)时间式
类似时间这样的数据分割和合并
data:image/s3,"s3://crabby-images/a2a1c/a2a1ce5606afa88a26e476045ca7e58a974a4aeb" alt=""
data:image/s3,"s3://crabby-images/f2a9f/f2a9f3f604c455446428e9b434757e72b1046761" alt=""
二、多表的“增宽加长”
data:image/s3,"s3://crabby-images/b3706/b3706aa37e9782da51f000127a5427e2edd06569" alt=""
data:image/s3,"s3://crabby-images/8ec34/8ec347d55c5d0000e49c58cb948b05329770c709" alt=""
data:image/s3,"s3://crabby-images/f6cae/f6cae82fcc1ca90edb927fccf2303dcb48059535" alt=""
data:image/s3,"s3://crabby-images/b1d71/b1d71b4a96e594780c7e4305558e50f3926cbc6e" alt=""
data:image/s3,"s3://crabby-images/1d21b/1d21ba9772b0901f952ac5f316660be2d3fca9e5" alt=""
data:image/s3,"s3://crabby-images/4446c/4446c917ca58e9dbd07b5b6d646178f0060112ec" alt=""
data:image/s3,"s3://crabby-images/d2c06/d2c06dc386480dd12984be180031f8c4658786eb" alt=""
data:image/s3,"s3://crabby-images/33d9d/33d9d522c15d54c255ef592b286a7269bd702537" alt=""
和Python对应,用R如何清理数据呢?
R主要用到两个包:tidyr
和dplyr
library(tidyr)
?gather
?spread
依旧按场景来看。
gather(pf, 'year', 'n', 2:4)
spread(pf, size, amount)
类似时间这样的数据分割和合并
本文标题:用R处理数据的结构性问题
本文链接:https://www.haomeiwen.com/subject/cfwyvftx.html
网友评论