美文网首页
Oozie案例运行MapReduce Wordflow讲解

Oozie案例运行MapReduce Wordflow讲解

作者: chengruru | 来源:发表于2018-08-16 14:52 被阅读0次

一、解压oozie目录下的oozie-examples.tar.gz

       
Step1

$ cd oozie
$ sudo tar -zxvf oozie-examples.tar.gz

Step2. 将oozie/examples 拷贝到hdfs 主目录上,且需保证hdfs上不存在文件路径/user/hadoop/examples

$ cd  /opt/cloudera/hadoop
$ ./bin/hadoop fs -put /opt/cloudera/oozie/examples/ examples

二、运行例子

Oozie运行的是mapreduce程序,因而需要配置相关信息。当前的mapreduce程序是运行在yarn上,即resource manager上,我们需要知道其端口号。
查看ResourceManager链接的端口号

$ cd /opt/cloudera/hadoop
$ ./bin/yarn jar share/hadoop/mapreduce/hadoop-mapreduce-examples-2.6.0-cdh5.14.0.jar pi 1 30
运行结果:
18/08/15 23:02:38 INFO client.RMProxy: Connecting to ResourceManager at Master/192.168.1.187:8032
18/08/15 23:02:39 INFO input.FileInputFormat: Total input paths to process : 1
18/08/15 23:02:39 INFO mapreduce.JobSubmitter: number of splits:1
18/08/15 23:02:40 INFO mapreduce.JobSubmitter: Submitting tokens for job: job_1534386415651_0001
 18/08/15 23:02:41 INFO impl.YarnClientImpl: Submitted application application_1534386415651_0001
18/08/15 23:02:41 INFO mapreduce.Job: The url to track the job: http://Master:8088/proxy/application_1534386415651_0001/
18/08/15 23:02:41 INFO mapreduce.Job: Running job: job_1534386415651_0001
18/08/15 23:02:53 INFO mapreduce.Job: Job job_1534386415651_0001 running in uber mode : false
18/08/15 23:02:53 INFO mapreduce.Job:  map 0% reduce 0%
18/08/15 23:02:59 INFO mapreduce.Job:  map 100% reduce 0%
18/08/15 23:03:06 INFO mapreduce.Job:  map 100% reduce 100%
18/08/15 23:03:06 INFO mapreduce.Job: Job job_1534386415651_0001 completed successfully
18/08/15 23:03:06 INFO mapreduce.Job: Counters: 49

端口号:8032

三、mapreduce例子的目录结构

$ cd oozie/examples/apps/map-reduce
$ ll
-rw-r--r--  1 1106 4001 1012 Jan  6  2018 job.properties
-rw-r--r--  1 1106 4001 1028 Jan  6  2018 job-with-config-class.properties
drwxr-xr-x  2 root root 4096 Aug 15 20:32 lib/
-rw-r--r--  1 1106 4001 2274 Jan  6  2018 workflow-with-config-class.xml
-rw-r--r--  1 1106 4001 2559 Jan  6  2018 workflow.xml

修改job.properties配置文件信息

nameNode=hdfs://localhost:8020
jobTracker=localhost:8021
queueName=default
examplesRoot=examples

oozie.wf.application.path=${nameNode}/user/${user.name}/${examplesRoot}/apps/map-reduce/workflow.xml
outputDir=map-reduce

修改成如下配置:

nameNode=hdfs://Master:9000
jobTracker=Master:8032
queueName=default
examplesRoot=examples

# 定义workflow工作的hdfs目录
oozie.wf.application.path=${nameNode}/user/${user.name}/${examplesRoot}/apps/map-reduce/workflow.xml
# mapreduce输出结果的目录
outputDir=map-reduce

输入文件路径(hdfs):
       examples/input-data/
输出文件路径(hdfs):
       examples/output-data/${EXAMPLE_NAME}
Note : job.properties file needs to be a local file during submissions, and not a HDFS path.也就是说在本地需要这个配置文件,即使我们更改了本地的 job.properties,没有提交更新到hdfs上,也是没关系的。

四、如何运行一个应用

$ ./bin/oozie job -oozie http://192.168.1.187:11000/oozie -config examples/apps/map-reduce/job.properties -run

结果显示:

job: 0000001-180815233809319-oozie-hado-W

打开Master:8088监控页面:


yarn.png

打开Oozie监控页面 Master:11000


oozie.png

到这里,Oozie运行MapReduce Wordflow案例讲解完毕!

相关文章

网友评论

      本文标题:Oozie案例运行MapReduce Wordflow讲解

      本文链接:https://www.haomeiwen.com/subject/qionbftx.html