数据结构

新手记下来~~

来自生信星球
1、向量
多个元素组成的变量，元素指的是数字或字符串（chr）等。
赋值
给变量定义，赋予该变量一个数值（字符串、数据框等）。
从向量中提取元素
（1）根据元素位置
例：来自生信星球

x[4] #x第4个元素
x[-4]#排除法，除了第4个元素之外剩余的元素
x[2:4]#第2到4个元素
x[-(2:4)]#除了第2-4个元素
x[c(1,5)] #第1个和第5个元素

（2）根据值
例：来自生信星球

x[x==10]#等于10的元素
x[x<0]
x[x %in% c(1,2,5)]#存在于向量c（1，2，5）中的元素

注：提取元素用[]
2、数据框
将示例数据放在工作目录下！
（1）读取本地数据

image.png
read table: Reads a file in table format and creates a data frame from it, with cases corresponding to lines and variables to fields in the file.
sep:the field separator character. Values on each line of the file are separated by this character. If sep = "" (the default for read.table) the separator is ‘white space’, that is one or more spaces, tabs, newlines or carriage returns.
header:a logical value indicating whether the file contains the names of the variables as its first line. If missing, the value is determined from the file format: header is set to TRUE if and only if the first row contains one fewer field than the number of columns.
（2）设置行名和列名

image.png
（3）数据框的导出

write.table(X,file = "yu.txt",sep = ",",quote=F)#分隔符改为逗号，字符串不加双引号（默认格式带由双引号）

（4）变量的保存和重新加载
没有处理完的数据可以保存和重新加载，下次接着用。保存格式是RData。

image.png

（5）提取元素

来自生信星球
- X[x,y]#第x行第y列
- X[x,]#第x行
- X[,y]#第y列
- X[y] #也是第y列
- X[a:b]#第a列到第b列
- X[c(a,b)]#第a列和第b列
- X$列名#也可以提取列（优秀写法，而且这个命令还优秀到不用写括号的地步，并且支持Tab自动补全哦，不过只能提取一列）

image.png