Configuring a Spark (Java API) Runtime Environment in IntelliJ IDEA

Posted by Li Baoquan

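1. Create a new Maven project

Once the Maven project has been created, it contains an initial configuration file (pom.xml) holding the project coordinates (groupId, artifactId, version).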

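2. Configure Maven

Add the Spark Core library to the project as a dependency in pom.xml. The _2.11 suffix of the artifactId is the Scala version that this Spark build targets.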

<?xml version="1.0" encoding="UTF-8"?>
<project xmlns="http://maven.apache.org/POM/4.0.0"
         xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
         xsi:schemaLocation="http://maven.apache.org/POM/4.0.0 http://maven.apache.org/xsd/maven-4.0.0.xsd">
    <modelVersion>4.0.0</modelVersion>

    <groupId>net.libaoquan</groupId>
    <artifactId>TestSpark</artifactId>
    <version>1.0-SNAPSHOT</version>

    <dependencies>
        <dependency> <!-- Spark dependency -->
            <groupId>org.apache.spark</groupId>
            <artifactId>spark-core_2.11</artifactId>
            <version>2.2.1</version>
        </dependency>
    </dependencies>
</project>

3. Create a Java class

Create a new Java class and add the following Spark (Java API) code:

import org.apache.spark.SparkConf;
import org.apache.spark.api.java.JavaRDD;
import org.apache.spark.api.java.JavaSparkContext;
import org.apache.spark.api.java.function.Function;

public class TestSparkJava {
    public static void main(String[] args) {
        // Local text file to analyze; adjust the path for your machine.
        String logFile = "D:\\ab.txt";

        // "local" runs Spark inside this JVM with a single worker thread.
        SparkConf conf = new SparkConf().setMaster("local").setAppName("TestSpark");
        JavaSparkContext sc = new JavaSparkContext(conf);

        // Cache the RDD because it is scanned twice below.
        JavaRDD<String> logData = sc.textFile(logFile).cache();

        // Number of lines containing the character "0".
        long numAs = logData.filter(new Function<String, Boolean>() {
            @Override
            public Boolean call(String s) { return s.contains("0"); }
        }).count();

        // Number of lines containing the character "1".
        long numBs = logData.filter(new Function<String, Boolean>() {
            @Override
            public Boolean call(String s) { return s.contains("1"); }
        }).count();

        System.out.println("Lines with 0: " + numAs + ", lines with 1: " + numBs);

        sc.stop();
    }
}
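
To know which counts to expect before running the Spark job, the same filter-and-count logic can be reproduced without Spark. The following is only a minimal sketch in plain Java 8: the class name LineCountCheck and the method countLinesContaining are hypothetical names chosen for illustration, not part of the original project, and the file is assumed to be UTF-8 encoded.

```java
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Paths;
import java.util.List;

// Hypothetical helper for cross-checking the Spark result; not part of the original project.
public class LineCountCheck {

    // Counts the lines that contain the given substring,
    // mirroring logData.filter(...).count() in the Spark example.
    public static long countLinesContaining(List<String> lines, String needle) {
        return lines.stream().filter(s -> s.contains(needle)).count();
    }

    public static void main(String[] args) throws IOException {
        // Defaults to the same input file as the Spark example;
        // pass a different path as the first argument if needed.
        String path = args.length > 0 ? args[0] : "D:\\ab.txt";

        // Assumption: the file is UTF-8 text and small enough to fit in memory.
        List<String> lines = Files.readAllLines(Paths.get(path), StandardCharsets.UTF_8);

        System.out.println("Lines with 0: " + countLinesContaining(lines, "0")
                + ", lines with 1: " + countLinesContaining(lines, "1"));
    }
}
```

If the two programs print different counts for the same file, the input path or the file encoding is the first thing to check.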

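Run the project. Among Spark's log messages, the console shows a line of the form "Lines with 0: N, lines with 1: M", where N and M depend on the contents of the input file.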
