美文网首页
R语言基础作业

R语言基础作业

作者: 民先生 | 来源:发表于2019-04-17 11:28 被阅读0次

    R语言作业

    • 1、打开 Rstudio 告诉我它的工作目录。
    [1] "F:/生物信息学/技能树/生信技能树/3天课程资料/1.R/02-plots"
    
    • 2、建立6个向量,基于不同的原子类型
    a<-c(1,2,3)
    b<-c(z,c,b)
    c<-c("susahn","nimakl","xinjone")
    d<-c(true, true, false,false)
    e<-c(1,a,"ba","bc")
    f<-c(TRUE,"c",2,a)
    
    • 3、告诉我在你打开的rstudio里面 getwd() 代码运行后返回的是什么?
    getwd()
    [1] "F:/生物信息学/技能树/生信技能树/3天课程资料/1.R/02-plots"
    
    • 4、新建一些数据结构,比如矩阵,数组,数据框,列表等重点是数据框,矩阵)
    matrix(seq(1:100),nrow = 5)
      [,1] [,2] [,3] [,4] [,5] [,6] [,7] [,8] [,9] [,10] [,11] [,12] [,13] [,14] [,15] [,16] [,17] [,18] [,19]
    [1,]    1    6   11   16   21   26   31   36   41    46    51    56    61    66    71    76    81    86    91
    [2,]    2    7   12   17   22   27   32   37   42    47    52    57    62    67    72    77    82    87    92
    [3,]    3    8   13   18   23   28   33   38   43    48    53    58    63    68    73    78    83    88    93
    [4,]    4    9   14   19   24   29   34   39   44    49    54    59    64    69    74    79    84    89    94
    [5,]    5   10   15   20   25   30   35   40   45    50    55    60    65    70    75    80    85    90    95
         [,20]
    [1,]    96
    [2,]    97
    [3,]    98
    [4,]    99
    [5,]   100
    as.data.frame(matrix(seq(1:100),nrow = 4))
    V1 V2 V3 V4 V5 V6 V7 V8 V9 V10 V11 V12 V13 V14 V15 V16 V17 V18 V19 V20 V21 V22 V23 V24 V25
    1  1  5  9 13 17 21 25 29 33  37  41  45  49  53  57  61  65  69  73  77  81  85  89  93  97
    2  2  6 10 14 18 22 26 30 34  38  42  46  50  54  58  62  66  70  74  78  82  86  90  94  98
    3  3  7 11 15 19 23 27 31 35  39  43  47  51  55  59  63  67  71  75  79  83  87  91  95  99
    4  4  8 12 16 20 24 28 32 36  40  44  48  52  56  60  64  68  72  76  80  84  88  92  96 100
    
    • 5、在你新建的数据框进行切片操作,比如首先取第1,3行, 然后取第4,6列
    zym<-as.data.frame(matrix(seq(1:100),nrow = 10))
    a<-zym[c(1,3),]
    b<- zym[,c(4,6)]
    
    • 6、使用data函数来加载R内置数据集 rivers 描述它。并且可以查看更多的R语言内置的数据集
    data(rivers)###没有报错,但是显示不出数据,用下面的命令就可以显示。
    rivers
    [1]  735  320  325  392  524  450 1459  135  465  600  330  336  280  315  870  906  202  329  290 1000  600
     [22]  505 1450  840 1243  890  350  407  286  280  525  720  390  250  327  230  265  850  210  630  260  230
     [43]  360  730  600  306  390  420  291  710  340  217  281  352  259  250  470  680  570  350  300  560  900
     [64]  625  332 2348 1171 3710 2315 2533  780  280  410  460  260  255  431  350  760  618  338  981 1306  500
     [85]  696  605  250  411 1054  735  233  435  490  310  460  383  375 1270  545  445 1885  380  300  380  377
    [106]  425  276  210  800  420  350  360  538 1100 1205  314  237  610  360  540 1038  424  310  300  444  301
    [127]  268  620  215  652  900  525  246  360  529  500  720  270  430  671 1770
    
    
    iris[1:5,]
     Sepal.Length Sepal.Width Petal.Length Petal.Width Species
    1          5.1         3.5          1.4         0.2  setosa
    2          4.9         3.0          1.4         0.2  setosa
    3          4.7         3.2          1.3         0.2  setosa
    4          4.6         3.1          1.5         0.2  setosa
    5          5.0         3.6          1.4         0.2  setosa
    
    • 7、下载 https://www.ncbi.nlm.nih.gov/sra?term=SRP133642 里面的 RunInfo Table 文件读入到R里面,了解这个数据框,多少列,每一列都是什么属性的元素。(参考B站生信小技巧获取runinfo table) 这是一个单细胞转录组项目的数据,共768个细胞,如果你找不到RunInfo Table 文件,可以点击下载,然后读入你的R里面也可以
    dim(a)
    [1] 768  31
    
    class(a)
    [1] "data.frame"
    
    summary(a)
      BioSample          Experiment            MBases          MBytes           Run             SRA_Sample       
     Length:768         Length:768         Min.   : 0.00   Min.   : 0.000   Length:768         Length:768        
     Class :character   Class :character   1st Qu.: 8.00   1st Qu.: 4.000   Class :character   Class :character  
     Mode  :character   Mode  :character   Median :12.00   Median : 6.000   Mode  :character   Mode  :character  
                                           Mean   :12.55   Mean   : 6.414                                        
                                           3rd Qu.:16.00   3rd Qu.: 8.000                                        
                                           Max.   :74.00   Max.   :37.000                                        
     Sample_Name         Assay_Type        AssemblyName         AvgSpotLen  BioProject        Center_Name       
     Length:768         Length:768         Length:768         Min.   :43   Length:768         Length:768        
     Class :character   Class :character   Class :character   1st Qu.:43   Class :character   Class :character  
     Mode  :character   Mode  :character   Mode  :character   Median :43   Mode  :character   Mode  :character  
                                                              Mean   :43                                        
                                                              3rd Qu.:43                                        
                                                              Max.   :43                                        
       Consent          DATASTORE_filetype DATASTORE_provider   InsertSize  Instrument        LibraryLayout     
     Length:768         Length:768         Length:768         Min.   :0    Length:768         Length:768        
     Class :character   Class :character   Class :character   1st Qu.:0    Class :character   Class :character  
     Mode  :character   Mode  :character   Mode  :character   Median :0    Mode  :character   Mode  :character  
                                                              Mean   :0                                         
                                                              3rd Qu.:0                                         
                                                              Max.   :0                                         
     LibrarySelection   LibrarySource        LoadDate           Organism           Platform        
     Length:768         Length:768         Length:768         Length:768         Length:768        
     Class :character   Class :character   Class :character   Class :character   Class :character  
     Mode  :character   Mode  :character   Mode  :character   Mode  :character   Mode  :character  
                                                                                                   
                                                                                                   
                                                                                                   
     ReleaseDate         SRA_Study             age             cell_type         marker_genes      
     Length:768         Length:768         Length:768         Length:768         Length:768        
     Class :character   Class :character   Class :character   Class :character   Class :character  
     Mode  :character   Mode  :character   Mode  :character   Mode  :character   Mode  :character  
                                                                                                   
                                                                                                   
                                                                                                   
     source_name           strain             tissue         
     Length:768         Length:768         Length:768        
     Class :character   Class :character   Class :character  
     Mode  :character   Mode  :character   Mode  :character
    
    dim(b)
    [1] 768  12
    sumary(b)
     Accession            Title           Sample.Type          Taxonomy            Channels   Platform        
     Length:768         Length:768         Length:768         Length:768         Min.   :1   Length:768        
     Class :character   Class :character   Class :character   Class :character   1st Qu.:1   Class :character  
     Mode  :character   Mode  :character   Mode  :character   Mode  :character   Median :1   Mode  :character  
                                                                                 Mean   :1                     
                                                                                 3rd Qu.:1                     
                                                                                 Max.   :1                     
        Series          Supplementary.Types Supplementary.Links SRA.Accession        Contact         
     Length:768         Length:768          Length:768          Length:768         Length:768        
     Class :character   Class :character    Class :character    Class :character   Class :character  
     Mode  :character   Mode  :character    Mode  :character    Mode  :character   Mode  :character  
                                                                                                     
                                                                                                     
                                                                                                     
     Release.Date      
     Length:768        
     Class :character  
     Mode  :character
    
    • 9、建立两个数据的关联
    c=merge(a,b,by.x = 'Sample_Name',by.y = 'Accession')
    
    • 10、对前面读取的 RunInfo Table 文件在R里面探索其MBases列,包括 箱线图(boxplot)和五分位数(fivenum),还有频数图(hist),以及密度图(density)
    d<-c[,c("MBases","Title")]
    plate=unlist(lapply(e[,2],function(x){
      x
      strsplit(x,'_')[[1]][3]
      
    }))
    e$plate<-plate
    

    下面的不会。

    相关文章

      网友评论

          本文标题:R语言基础作业

          本文链接:https://www.haomeiwen.com/subject/zgcgwqtx.html