df = DataFrame({'key1':['a','a','b','b','a'],
'key2':['one','two','one','two','one'],
'data1':np.random.randn(5),
'data2':np.random.randn(5)})
df
![](https://img.haomeiwen.com/i6460608/64867a320faa5839.png)
通用的groupby方法是size,size方法返回一个包含组大小信息的Series:
df.groupby(['key1','key2']).size()
key1 key2
a one 2
two 1
b one 1
two 1
dtype: int64
遍历各分组
GroupBy对象支持迭代,会生成一个包含组名和数据块的2维元组序列。
for name,group in df.groupby('key1'):
print(name)
print(group)
![](https://img.haomeiwen.com/i6460608/3257e94af966b087.png)
网友评论