美文网首页大数据之路0-1
Flink1.9.1写入Hbase1.1.2

Flink1.9.1写入Hbase1.1.2

作者: 嘿嘿hhahaah | 来源:发表于2020-03-11 19:16 被阅读0次

这次试手Flink从kafka读数据写入hbase,遇到了很大的坑

1.我的程序是用Flink 1.9.1从本地kafka读取数据,写到本地hbase,本地zookeeper和kafka服务都起好了,开始运行程序,没有报错信息,就是一直读不到kafka的数据,在kafka生产者命令窗口都输入10条了,我想怎么还没开始读数据,我也没设置时间窗口啊,见鬼了

答:这种问题99%都是因为你的kafka连接依赖版本不对,如果你现在是1.1不妨改成0.9试试,或许可以读出来了,相反也可以试试。

注:别忘了在flink代码addsource时也要用“FlinkKafkaConsumer09”,不过你改完依赖不改这个,IDEA会提示你的,没多大事

        <dependency>
            <groupId>org.apache.flink</groupId>
            <artifactId>flink-connector-kafka-0.9_2.11</artifactId>
            <version>1.9.1</version>
        </dependency>

2.程序运行起来没问题,kafka也读出数据了,但是一直卡在连接hbase步骤,不失败也不报错,这个开始以为是hbase-client引用版本的事情,特意去maven官网去查了查对应支持的版本,发现没问题啊,为啥这样对我呢?

答:这个问题99%是因为没有找到zookeeper的主机,程序在不停的尝试连接你配置的主机,就是连不上,你说气人不?但是像我这种人没有服务器的主,连接的是本地的地址啊“127.0.0.1”,为啥还会这样呢,讲不讲理?本地也找不到了????这个时候看看你有没有连接什么代理工具,你要是老老实实连个WiFi不至于这样,把代理关了,再试试,或许真的连上了。

configuration.set("hbase.zookeeper.quorum", "127.0.0.1");

3.还有一种情况实在本地运行不易发生的,但是我必须说,线上很容易出问题,此时将写入hbase的配置信息的zookeeper连接地址改为服务器的地址,然后运行程序,这个时候读取kafka一点问题没有,写入hbase报空指针,死活写不进去,你说咋办吧,网上有很多博客说这个事,但是很多都不解决问题或者不适合我们的问题。

答:这个可能是我们程序找不到hbase在zookeeper的目录了,跟默认的不一致,我们最好去zk客户端里边找找我们的hbase的目录之后再填写这个参数,保险些。

configuration.set("zookeeper.znode.parent","/hbase-unsecure");

最后附上我的垃圾代码,仅供参考,你要运行起来之后可能会发现Hbase之插进一条记录,那是我的row_key、列族和列名都写死了,导致不断的覆盖value,你可以给row_key一个变量,最常见的当前时间戳。

pom.xml:

<?xml version="1.0" encoding="UTF-8"?>
<project xmlns="http://maven.apache.org/POM/4.0.0"
         xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
         xsi:schemaLocation="http://maven.apache.org/POM/4.0.0 http://maven.apache.org/xsd/maven-4.0.0.xsd">
    <modelVersion>4.0.0</modelVersion>

    <groupId>com.wy</groupId>
    <artifactId>flink2hbase</artifactId>
    <version>1.0-SNAPSHOT</version>

    <dependencies>

        <dependency>
            <groupId>org.apache.hbase</groupId>
            <artifactId>hbase-client</artifactId>
            <version>1.1.2</version>
        </dependency>
        <dependency>
            <groupId>org.apache.phoenix</groupId>
            <artifactId>phoenix-core</artifactId>
            <version>4.14.1-HBase-1.1</version>
        </dependency>


        <dependency>
            <groupId>org.apache.flink</groupId>
            <artifactId>flink-java</artifactId>
            <version>1.9.1</version>
        </dependency>
        <dependency>
            <groupId>org.apache.flink</groupId>
            <artifactId>flink-streaming-java_2.11</artifactId>
            <version>1.9.1</version>
        </dependency>
        <dependency>
            <groupId>org.apache.flink</groupId>
            <artifactId>flink-clients_2.11</artifactId>
            <version>1.9.1</version>
        </dependency>

        <dependency>
            <groupId>org.apache.flink</groupId>
            <artifactId>flink-connector-kafka-0.9_2.11</artifactId>
            <version>1.9.1</version>
        </dependency>

        <dependency>
            <groupId>org.projectlombok</groupId>
            <artifactId>lombok</artifactId>
            <version>1.18.4</version>
        </dependency>


    </dependencies>

</project>

主程序:

import org.apache.flink.api.common.serialization.SimpleStringSchema;
import org.apache.flink.streaming.api.datastream.DataStream;
import org.apache.flink.streaming.api.environment.StreamExecutionEnvironment;
import org.apache.flink.streaming.connectors.kafka.FlinkKafkaConsumer09;
import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.hbase.client.*;

import java.util.Properties;


public class flinkhbase {
    public static Configuration configuration;
    public static Connection connection;
    public static Admin admin;

    public static void main(String[] args) throws Exception {
        StreamExecutionEnvironment env = StreamExecutionEnvironment.getExecutionEnvironment();
        env.setParallelism(1);
        Properties properties = new Properties();
        properties.setProperty("bootstrap.servers", "localhost:9092");
        FlinkKafkaConsumer09<String> consumer = new FlinkKafkaConsumer09<String>("sinkTest", new SimpleStringSchema(), properties);
        //从最早开始消费
        consumer.setStartFromEarliest();
        DataStream<String> stream = env.addSource(consumer);
        stream.print();
        stream.process(new HbaseProcess());
        env.execute();
    }
}

写入Hbase:

import lombok.extern.slf4j.Slf4j;
import org.apache.flink.streaming.api.functions.ProcessFunction;
import org.apache.flink.util.Collector;
import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.hbase.HBaseConfiguration;
import org.apache.hadoop.hbase.TableName;
import org.apache.hadoop.hbase.client.*;
import org.apache.hadoop.hbase.util.Bytes;

@Slf4j
public class HbaseProcess extends ProcessFunction<String, String> {
    private static final long serialVersionUID = 1L;

    private Connection connection = null;
    private Table table = null;

    @Override
    public void open(org.apache.flink.configuration.Configuration parameters) throws Exception {
        try {
            // 加载HBase的配置
            Configuration configuration = HBaseConfiguration.create();

            // 读取配置文件
            configuration.set("hbase.zookeeper.quorum", "127.0.0.1");
            configuration.set("hbase.zookeeper.property.clientPort", "2181");
            configuration.setInt("hbase.rpc.timeout", 30000);
            configuration.setInt("hbase.client.operation.timeout", 30000);
            configuration.setInt("hbase.client.scanner.timeout.period", 30000);
//            configuration.set("zookeeper.znode.parent","/hbase-unsecure");
            configuration.set("hbase.master","localhost:60010");
            connection = ConnectionFactory.createConnection(configuration);

            HBaseAdmin hbaseadmin = new HBaseAdmin(connection);

            TableName tableName = TableName.valueOf("ygc_test");

            // 获取表对象
            table = connection.getTable(tableName);

            System.out.println(hbaseadmin.tableExists(tableName));

            System.out.println("[HbaseSink] : open HbaseSink finished");
        } catch (Exception e) {
            System.out.println(e);
        }
    }

    @Override
    public void close() throws Exception {
        System.out.println("close...");
        if (null != table) table.close();
        if (null != connection) connection.close();
    }

    @Override
    public void processElement(String value, Context ctx, Collector<String> out) throws Exception {
        try {
            System.out.println("输入的值:"+value);

            //row1:cf:a:aaa
            String[] split = value.split(":");

            // 创建一个put请求,用于添加数据或者更新数据
            Put put = new Put(Bytes.toBytes("1002"));
            put.addColumn(Bytes.toBytes("info"), Bytes.toBytes("a"), Bytes.toBytes(value));
            table.put(put);
            System.out.println("插入成功");
        } catch (Exception e) {
            System.out.println(e);
        }
    }
}

相关文章

网友评论

    本文标题:Flink1.9.1写入Hbase1.1.2

    本文链接:https://www.haomeiwen.com/subject/zvmhjhtx.html