美文网首页数据库
用happybase happy地查询hbase数据

用happybase happy地查询hbase数据

作者: Lacia | 来源:发表于2017-11-22 18:04 被阅读0次

    用happybase进行hbase中数据的增删改查

    前提:已经安装happybase库(pip install happybase),已有hbase环境并开启thrift通讯端口(nohup hbase thrift start &),thrift默认端口为9090,10.10.30.200为hbase主机ip

    查询

    scan方法:

    scan(self, row_start=None, row_stop=None, row_prefix=None,
             columns=None, filter=None, timestamp=None,
             include_timestamp=False, batch_size=1000, scan_batching=None,
             limit=None, sorted_columns=False, reverse=False)
    

    参数:

    row_start、row_stop:起始和终止rowkey,查询两rowkey间的数据

    row_prefix:rowkey前缀。注:使用row_prefix的时候,row_start和row_stop不能使用

    filter:要使用的过滤器(hbase 0.92版本及以上生效)

    timestamp:按指定时间戳查询

    reverse:默认为False。为True时,scan结果按rowkey倒序排列

    e.g:

    import happybase
    
    connection = happybase.Connection('10.10.30.200',9090)
    #获取table实例
    t = connection.table('iot_flow_history')
    #按row前缀遍历查询数据,因该工程存入hbase的rowkey为id-timestamp,使用rowkey查询十分不方便,故使用row_prefix进行查询
    for key,value in t.scan(row_prefix='1100HD72004016A16'):
        print key,value
    
    增加&编辑

    put方法:

    put(self, row, data, timestamp=None, wal=True)
    

    e.g:

    import happybase
    
    connection = happybase.Connection('10.10.30.200',9090)
    #获取table实例
    t = connection.table('iot_flow_history')
    
    #该表已有column familylz,列id和realValue
    student = {"lz:id":'test',"lz:realValue":"18"}
    t.put(row="test",data = student)
    

    △ 如put中的rowkey已存在,则为修改数据

    删除

    delete方法:

    delete(self, row, columns=None, timestamp=None, wal=True)
    

    row:删除rowkey为row的数据

    columns:指定columns参数时,删除

    e.g:

    import happybase
    
    connection = happybase.Connection('10.10.30.200',9090)
    #.delete('test')
    t = connection.table('iot_flow_history')
    #删除rowkey为test的数据
    t.delete('test')
    

    删除rowkey为student2的name数据:

    mark
    t = connection.table('students')
    t.delete(row='student2',columns=["account:name"])
    

    删除成功:

    mark
    批量处理

    batch方法:

    batch(self, timestamp=None, batch_size=None, transaction=False,
              wal=True)
    

    1、批量操作

    import happybase
    
    connection = happybase.Connection('10.10.30.200',9090)
    #.delete('test')
    t = connection.table('iot_flow_history')
    
    student111 = {"account:name":"xiaohuang","address:zipcode":"222222"}
    student222 = {"account:name":"xiaolv","address:zipcode":"333333"}
    t = connection.table('students')
    bat = t.batch()
    
    #添加student111和student222,也可以进行批量删除
    bat.put('student111',data = student111)
    bat.put('student222',data = student222)
    #bat.delete('student111')
    bat.send()
    
    mark

    2、使用with管理批量

    import happybase
    
    connection = happybase.Connection('10.10.30.200',9090)
    #.delete('test')
    t = connection.table('iot_flow_history')
    
    student333 = {"account:name":"xiaolan","address:zipcode":"323232"}
    t = connection.table('students')
    
    #添加student333并删除student222
    with t.batch() as bat:
        bat.put('student333',data = student333)
        bat.delete('student222')
    
    检索

    row方法及rows()方法,检索指定rowkey的数据

    检索一条:

    r = t.row('student444')
    

    检索多条:

    r = t.rows(['student444','student333'])
    

    返回结果:

    mark
    查询指定cell的数据
    cells(self, row, column, versions=None, timestamp=None,
              include_timestamp=False)
    

    e.g:

    import happybase
    
    connection = happybase.Connection('10.10.30.200',9090)
    #.delete('test')
    t = connection.table('iot_flow_history')
    
    #查询rowkey为student444,column为account:name的cell的数据,包含时间戳
    print t.cells('student444','account:name',include_timestamp=True)
    

    结果:

    [('xiaolan', 1511343966713L)]
    

    暂时就这些0v0

    相关文章

      网友评论

        本文标题:用happybase happy地查询hbase数据

        本文链接:https://www.haomeiwen.com/subject/mqeovxtx.html