![](https://img.haomeiwen.com/i7007629/d749db556888e048.png)
1. 只使用Spark Shell
这里只需要下载任何Spark版本,不需要作任何配置,直接使用Spark shell(我这里为方便,把spark的bin目录加入到全局path中)
![](https://img.haomeiwen.com/i7007629/be7dc034bf92d97f.png)
![](https://img.haomeiwen.com/i7007629/99721459ad6d12fc.png)
2. 使用Scala写一个独立的Application
2.1 安装SBT
A下载地址:https://sbt-downloads.cdnedge.bluemix.net/releases/v1.3.4/sbt-1.3.4.tgz
然后用tar -zxvf解压缩,解压缩结果如图:
![](https://img.haomeiwen.com/i7007629/bc4ef60dcfde9e8d.png)
B sbt换用华为源
SBT 下载依赖的速度极慢,换用华为源(路径是~/.sbt/repositories
)
![](https://img.haomeiwen.com/i7007629/8bf48a8aed937316.png)
[repositories]
local
huaweicloud-maven: https://repo.huaweicloud.com/repository/maven/
maven-central: https://repo1.maven.org/maven2/
huaweicloud-ivy: https://repo.huaweicloud.com/repository/ivy/, [organization]/[module]/(scala_[scalaVersion]/)(sbt_[sbtVersion]/)[revision]/[type]s/[artifact](-[classifier]).[ext]
C 设置所有项目均使用全局仓库配置,忽略项目自身仓库配置
![](https://img.haomeiwen.com/i7007629/4f1631b348c2a577.png)
-Dsbt.override.build.repos=true
运行命令检查是否可用
![](https://img.haomeiwen.com/i7007629/9413b75c3a9902dc.png)
2.2 例子
2.2.1 目录结构和文件内容
![](https://img.haomeiwen.com/i7007629/a9d526e11a104257.png)
![](https://img.haomeiwen.com/i7007629/e46d3e917b73ec19.png)
import org.apache.spark.sql.SparkSession
object SimpleApp
{
def main(args : Array[String])
{
val logFile ="/home/yay/software/spark-2.4.4-bin-hadoop2.7/README.md"
val spark = SparkSession.builder.appName("Simple Application").getOrCreate()
val logData = spark.read.textFile(logFile).cache()
val numAs = logData.filter(line => line.contains("a")).count()
val numBs = logData.filter(line => line.contains("b")).count()
println(s"Lines with a: $numAs, Lines with b: $numBs")
spark.stop()
}
}
build.sbt文件内容为:
name := "Simple Project"
version := "1.0"
scalaVersion := "2.11.12"
libraryDependencies += "org.apache.spark" %% "spark-sql" % "2.4.4"
2.2.2 编译
![](https://img.haomeiwen.com/i7007629/d28eef94c3ae8e8a.png)
![](https://img.haomeiwen.com/i7007629/dc2ac0dbaa47d2ef.png)
2.2.3 使用 spark-submit script运行程序
![](https://img.haomeiwen.com/i7007629/223d3f631a29040b.png)
![](https://img.haomeiwen.com/i7007629/1aeb4191098251b1.png)
问题说明:image.png
网友评论