用happybase进行hbase中数据的增删改查
前提:已经安装happybase库(pip install happybase),已有hbase环境并开启thrift通讯端口(nohup hbase thrift start &),thrift默认端口为9090,10.10.30.200为hbase主机ip
查询
scan方法:
scan(self, row_start=None, row_stop=None, row_prefix=None,
columns=None, filter=None, timestamp=None,
include_timestamp=False, batch_size=1000, scan_batching=None,
limit=None, sorted_columns=False, reverse=False)
参数:
row_start、row_stop:起始和终止rowkey,查询两rowkey间的数据
row_prefix:rowkey前缀。注:使用row_prefix的时候,row_start和row_stop不能使用
filter:要使用的过滤器(hbase 0.92版本及以上生效)
timestamp:按指定时间戳查询
reverse:默认为False。为True时,scan结果按rowkey倒序排列
e.g:
import happybase
connection = happybase.Connection('10.10.30.200',9090)
#获取table实例
t = connection.table('iot_flow_history')
#按row前缀遍历查询数据,因该工程存入hbase的rowkey为id-timestamp,使用rowkey查询十分不方便,故使用row_prefix进行查询
for key,value in t.scan(row_prefix='1100HD72004016A16'):
print key,value
增加&编辑
put方法:
put(self, row, data, timestamp=None, wal=True)
e.g:
import happybase
connection = happybase.Connection('10.10.30.200',9090)
#获取table实例
t = connection.table('iot_flow_history')
#该表已有column familylz,列id和realValue
student = {"lz:id":'test',"lz:realValue":"18"}
t.put(row="test",data = student)
△ 如put中的rowkey已存在,则为修改数据
删除
delete方法:
delete(self, row, columns=None, timestamp=None, wal=True)
row:删除rowkey为row的数据
columns:指定columns参数时,删除
e.g:
import happybase
connection = happybase.Connection('10.10.30.200',9090)
#.delete('test')
t = connection.table('iot_flow_history')
#删除rowkey为test的数据
t.delete('test')
删除rowkey为student2的name数据:
markt = connection.table('students')
t.delete(row='student2',columns=["account:name"])
删除成功:
mark批量处理
batch方法:
batch(self, timestamp=None, batch_size=None, transaction=False,
wal=True)
1、批量操作
import happybase
connection = happybase.Connection('10.10.30.200',9090)
#.delete('test')
t = connection.table('iot_flow_history')
student111 = {"account:name":"xiaohuang","address:zipcode":"222222"}
student222 = {"account:name":"xiaolv","address:zipcode":"333333"}
t = connection.table('students')
bat = t.batch()
#添加student111和student222,也可以进行批量删除
bat.put('student111',data = student111)
bat.put('student222',data = student222)
#bat.delete('student111')
bat.send()
mark
2、使用with管理批量
import happybase
connection = happybase.Connection('10.10.30.200',9090)
#.delete('test')
t = connection.table('iot_flow_history')
student333 = {"account:name":"xiaolan","address:zipcode":"323232"}
t = connection.table('students')
#添加student333并删除student222
with t.batch() as bat:
bat.put('student333',data = student333)
bat.delete('student222')
检索
row方法及rows()方法,检索指定rowkey的数据
检索一条:
r = t.row('student444')
检索多条:
r = t.rows(['student444','student333'])
返回结果:
mark查询指定cell的数据
cells(self, row, column, versions=None, timestamp=None,
include_timestamp=False)
e.g:
import happybase
connection = happybase.Connection('10.10.30.200',9090)
#.delete('test')
t = connection.table('iot_flow_history')
#查询rowkey为student444,column为account:name的cell的数据,包含时间戳
print t.cells('student444','account:name',include_timestamp=True)
结果:
[('xiaolan', 1511343966713L)]
暂时就这些0v0
网友评论