1.本课程导学
2.pandas库的介绍
3.pandas库的Serious类型
4.pandas库的DataFrame类型
5.pandas库的数据类型操作
6.pandas库的数据类型运算
7.单元小结
[网页链接【Python数据分析与展示】.MOOC. 北京理工大学
https://www.bilibili.com/video/av10101509/?from=search&seid=8584212945516406240#page=35)
最近更新:2018-01-29
1.本课程导学
data:image/s3,"s3://crabby-images/c5010/c50105de21dbd318e4de2893089d484331c3fb06" alt=""
2.pandas库的介绍
2.1Pandas库的引用
data:image/s3,"s3://crabby-images/26c92/26c921e0bd68cab6fb5a8496d64b6c67e1e6d413" alt=""
data:image/s3,"s3://crabby-images/0c60d/0c60d3d27de87564e0d081288f2328076534fc25" alt=""
Pandas库小测
左边0-19是索引,右边是值
data:image/s3,"s3://crabby-images/44b3c/44b3cb9dfcabcaa434a1a63d9ae951fd00d7e096" alt=""
import pandas as pd
d=pd.Series(range(20))
d
Out[15]:
0 0
1 1
2 2
3 3
4 4
5 5
6 6
7 7
8 8
9 9
10 10
11 11
12 12
13 13
14 14
15 15
16 16
17 17
18 18
19 19
dtype: int64
d.cumsum()
Out[16]:
0 0
1 1
2 3
3 6
4 10
5 15
6 21
7 28
8 36
9 45
10 55
11 66
12 78
13 91
14 105
15 120
16 136
17 153
18 171
19 190
dtype: int64
2.2Pandas库的理解
data:image/s3,"s3://crabby-images/06bec/06bec1d65953d4943da15a4cb8f297872ae262f4" alt=""
data:image/s3,"s3://crabby-images/de25f/de25f9be798368d4ed0f55609eab5c35dddd0d18" alt=""
3.pandas库的Serious类型
3.1Serious类型
data:image/s3,"s3://crabby-images/6ad3d/6ad3deb7e72a57b2d2c23147113166906a6ababf" alt=""
data:image/s3,"s3://crabby-images/4bc71/4bc71333274afac61f7b9aad57b34988914588d6" alt=""
import pandas as pd
a=pd.Series([9,8,7,6])
a
Out[19]:
0 9
1 8
2 7
3 6
dtype: int64
自定义索引
data:image/s3,"s3://crabby-images/16892/1689256358bd80b6ac0bea8e201a8fba5d5a2c49" alt=""
import pandas as pd
a=pd.Series([9,8,7,6],index=["a","b","c","d"])
a
Out[22]:
a 9
b 8
c 7
d 6
dtype: int64
data:image/s3,"s3://crabby-images/3ebeb/3ebeb25854b4a0974eb49a7222342287b7c7d01c" alt=""
-
从标量值创建
注意:图片上写着不可以省略index,实际是可以省略,a=pd.Series(25,["a","b","c"])
import pandas as pd
a=pd.Series(25,index=["a","b","c"])
a
Out[25]:
a 25
b 25
c 25
dtype: int64
-
从字典类型创建
import pandas as pd
a=pd.Series({"a":9,"b":8,"c":7})
a
Out[30]:
a 9
b 8
c 7
dtype: int64
import pandas as pd
e=pd.Series({"a":9,"b":8,"c":7},index=["c","a","b","d"])
e
Out[33]:
c 7.0
a 9.0
b 8.0
d NaN
dtype: float64
-
从ndarray类型创建
import pandas as pd
n=pd.Series(np.arange(5))
n
Out[36]:
0 0
1 1
2 2
3 3
4 4
dtype: int32
import pandas as pd
m=pd.Series(np.arange(5),index=np.arange(9,4,-1))
m
Out[41]:
9 0
8 1
7 2
6 3
5 4
dtype: int32
-
Serious类型总结
3.2Serious类型的基本操作
data:image/s3,"s3://crabby-images/d5571/d55710856ae7ec7b1143ef4073140904b6285a8b" alt=""
-
Serious类型包括index和values两部分
import pandas as pd
b=pd.Series([9,8,7,6],["a","b","c","d"])
b.index
Out[44]: Index(['a', 'b', 'c', 'd'], dtype='object')
b.values
Out[45]: array([9, 8, 7, 6], dtype=int64)
b["b"]
Out[46]: 8
b[1]
Out[47]: 8
b[["c","d",0]]
Out[49]:
c 7.0
d 6.0
0 NaN
dtype: float64
b[["c","d","a"]]
Out[50]:
c 7
d 6
a 9
dtype: int64
-
Serious类型的操作类似ndarray类型
data:image/s3,"s3://crabby-images/6a262/6a26280582ddc668a155ea2d16c58a847e381ac0" alt=""
import pandas as pd
b=pd.Series([9,8,7,6],["a","b","c","d"])
b
Out[52]:
a 9
b 8
c 7
d 6
dtype: int64
b[3]
Out[53]: 6
b[:3]
Out[54]:
a 9
b 8
c 7
dtype: int64
b[b>b.median()]
Out[55]:
a 9
b 8
dtype: int64
np.exp(b)
Out[56]:
a 8103.083928
b 2980.957987
c 1096.633158
d 403.428793
dtype: float64
- Serious类型的操作类似Python字典类型
0是否在自定义的索引中
data:image/s3,"s3://crabby-images/294a9/294a963c64bb61b775d6fd1e52e4f4858661f312" alt=""
data:image/s3,"s3://crabby-images/6b677/6b677069ac4bcbf85d3a38e35786fc619528d848" alt=""
import pandas as pd
b=pd.Series([9,8,7,6],["a","b","c","d"])
b["b"]
Out[59]: 8
"c" in b
Out[60]: True
0 in b
Out[61]: False
b.get("f",100)
Out[62]: 100
#在b中提取索引"f"'对应的值100,如果对应的值不存在就返回100.
3.3Serious类型对齐操作
data:image/s3,"s3://crabby-images/8e0c6/8e0c620104c48e93e5b2dcfd5729dad3242db0e0" alt=""
import pandas as pd
a=pd.Series([1,2,3],["c","d","e"])
b=pd.Series([9,8,7,6],["a","b","c","d"])
a+b
Out[69]:
a NaN
b NaN
c 8.0
d 8.0
e NaN
dtype: float64
3.4Serious类型的name属性
data:image/s3,"s3://crabby-images/6f0e3/6f0e375b12870688fd2b411a208f7c94087f4993" alt=""
import pandas as pd
b=pd.Series([9,8,7,6],["a","b","c","d"])
b.name
b.name="Serious 对象"
b.index.name="索引列"
b
Out[76]:
索引列
a 9
b 8
c 7
d 6
Name: Serious 对象, dtype: int64
3.5Serious类型的修改
data:image/s3,"s3://crabby-images/b43ae/b43ae19bb2b112958dc909864ee2b340d73d1662" alt=""
import pandas as pd
b=pd.Series([9,8,7,6],["a","b","c","d"])
b["a"]=15
b.name="Serious"
b
Out[81]:
a 15
b 8
c 7
d 6
Name: Serious, dtype: int64
b.name="New Serious"
b["b","c"]=20
b
Out[84]:
a 15
b 20
c 20
d 6
Name: New Serious, dtype: int64
3.6Serious类型的总结
data:image/s3,"s3://crabby-images/e3ec0/e3ec0f362304c5b444bcb3478dc385709bd77366" alt=""
4.pandas库的DataFrame类型
4.1DataFram类型
data:image/s3,"s3://crabby-images/ec08d/ec08d4b1685b5985ea28094683c7cd4f8a7a0af8" alt=""
data:image/s3,"s3://crabby-images/3d2cf/3d2cf8c0325503eadcf1dbb5e600773eb8f90a09" alt=""
data:image/s3,"s3://crabby-images/02a12/02a126ea439c5e75d25ba00d540fbb37e816f242" alt=""
data:image/s3,"s3://crabby-images/b08ba/b08ba23320fbd92b301dcfb8e289f1cd3f9a9934" alt=""
-
二维ndarray对象
import pandas as pd
import numpy as np
d=pd.DataFrame(np.arange(10).reshape(2,5))
d
Out[88]:
0 1 2 3 4
0 0 1 2 3 4
1 5 6 7 8 9
-
由一维ndarray/列表/字典/元组或Serious构成的字典
1)从一维ndarray对象字典创建
import pandas as pd
dt={"one":pd.Series(([1,2,3]),index=["a","b","c"]),"two":pd.Series(([9,8,7,6]),index=["a","b","c","d"])}
d=pd.DataFrame(dt)
d
Out[92]:
one two
a 1.0 9
b 2.0 8
c 3.0 7
d NaN 6
pd.DataFrame(dt,index=["b","c","d"],columns=["two","three"])
Out[93]:
two three
b 8 NaN
c 7 NaN
d 6 NaN
2)从列表类型的字典创建
data:image/s3,"s3://crabby-images/1a2de/1a2defc2b762b25fb31b3570d0127db3197f3d4e" alt=""
import numpy as np
d1={"one":[1,2,3,4],"two":[9,8,7,6]}
d=pd.DataFrame(d1,index=["a","b","c","d"])
d
Out[100]:
one two
a 1 9
b 2 8
c 3 7
d 4 6
data:image/s3,"s3://crabby-images/915b7/915b7e803ba0a4f5af34f25e5f7819da0ba474b6" alt=""
data:image/s3,"s3://crabby-images/bd479/bd4793ae7bb7d00172a1461a8d86fe80ddbc2818" alt=""
4.2DataFrame类型的理解
data:image/s3,"s3://crabby-images/d041d/d041d318b24be50ae6f77a861c629c2222db85d6" alt=""
5.pandas库的数据类型操作
data:image/s3,"s3://crabby-images/f9554/f955434955bdb8760b53f4a89702a0de88686314" alt=""
5.1重新索引
data:image/s3,"s3://crabby-images/ec1fe/ec1fe04c58d80afc89d0522dec0ccfa3b46bea78" alt=""
data:image/s3,"s3://crabby-images/44242/44242e41a7976f109db19440a4efc0ba0b97c8b9" alt=""
data:image/s3,"s3://crabby-images/6c92e/6c92e833189f7c715eaf48a4b727272d9f7430e6" alt=""
5.2索引类型
data:image/s3,"s3://crabby-images/65420/654205123818d0cc09a72cfc5d1456b1b9019d08" alt=""
data:image/s3,"s3://crabby-images/28628/28628627fd4e3c3ff510973b2becd592c1fc3401" alt=""
data:image/s3,"s3://crabby-images/dd8c4/dd8c4c08e27e96d420cbe63e989151394f93474e" alt=""
5.3删除指定索引对象
data:image/s3,"s3://crabby-images/69017/69017bf2feb7cb637851ddd7dd988bbbad821fd8" alt=""
6.pandas库的数据类型运算
6.1算术运算法则
data:image/s3,"s3://crabby-images/c38df/c38df7f382655fe9827cc2b57439d26efa724174" alt=""
data:image/s3,"s3://crabby-images/87e6d/87e6d11bea4947a73c9a2847f73a0075051d9c9b" alt=""
data:image/s3,"s3://crabby-images/44dcb/44dcbe9c35a479c7c94e00e0324ae3517f240713" alt=""
data:image/s3,"s3://crabby-images/ee4a4/ee4a4aee70f8de9ce17433ecd3e47402225879cf" alt=""
data:image/s3,"s3://crabby-images/6194c/6194cea8d99d026ba066b5133e7995a053b67709" alt=""
data:image/s3,"s3://crabby-images/ad263/ad26319f5f41d05ade76264e2521ac25cc5d2999" alt=""
6.2比较运算法则
data:image/s3,"s3://crabby-images/975ff/975ffea4b9bcbed283262aef9d4f73ecbcdb51db" alt=""
data:image/s3,"s3://crabby-images/f4338/f4338c3b3482669ebf2204ad57405bd51db2a9c6" alt=""
data:image/s3,"s3://crabby-images/5a29a/5a29a513600aa38295b28b4ebcb761336e1e5dcf" alt=""
7.单元小结
data:image/s3,"s3://crabby-images/1551c/1551c9c3df39757b6ae2618ce2dd5be742d3ee26" alt=""
网友评论