Hadoop自己编译Hadoop安装包

Posted sangumaolu

tags:

篇首语:本文由小常识网(cha138.com)小编为大家整理,主要介绍了Hadoop自己编译Hadoop安装包相关的知识,希望对你有一定的参考价值。

CentOS-6.7 编译 hadoop-2.6.5

(只需输入红色命令即可)

目录

1、阅读编译文档 .............................................................................................................................1

2、准备编译环境 .............................................................................................................................2

3、安装 gcc .....................................................................................................................................2

4、安装 gcc-c++ ..............................................................................................................................2

5、安装 make ..................................................................................................................................3

6、安装 maven(必须) ................................................................................................................. 3

7、安装 jdk(必须) ........................................................................................................................4

8、安装 ant(重要) ........................................................................................................................4

9、安装 findbugs(可选,最好装) ..................................................................................................5

10、安装 cmake(重要) .................................................................................................................5

11、安装 protobuf(重要) ..............................................................................................................5

12、安装 Snappy .............................................................................................................................6

13、编译 hadoop .............................................................................................................................6

1 、 阅读编译文档

  (1)准备一个 hadoop 源码包,我选择的 hadoop 版本是:hadoop-2.6.5-src.tar.gz,在hadoop-2.6.5 的源码包的根目录下有一个文档叫做 BUILDING.txt,这其中说明了编译

hadoop 所需要的一些编译环境相关的东西。不同的 hadoop 版本的要求都不一样。对应的版本参照 BUILDING.txt

    请仔细阅读:

    Requirements:

 

    * Unix System

    * JDK 1.6+

    * Maven 3.0 or later

    * Findbugs 1.3.9 (if running findbugs)

    * ProtocolBuffer 2.5.0

    * CMake 2.6 or newer (if compiling native code), must be 3.0 or newer on Mac

    * Zlib devel (if compiling native code)

    * openssl devel ( if compiling native hadoop-pipes )

    * Internet connection for first build (to fetch all Maven and Hadoop dependencies)

  (2)对应以上需求,我们准备好所要求版本的这些软件

      1. 准备一台 Unix 类型操作系统,在这里我们选用的是 CentOS-6.7,初次编译要求必须

        联网,切记

      以下这些东西都是需要的,详细安装在下面,这里只介绍我准备这些软件所选择的版本

      2. 安装 openssl-devel

      3. 安装 gcc

      4. 安装 gcc-c++

      5. JDK:jdk-7u80-linux-x64.tar.gz

      6. Maven:apache-maven-3.3.3-bin.tar.gz

      7. Ant:apache-ant-1.9.4-bin.tar.gz

      8. FindBugs:findbugs-3.0.0.tar.gz

      9. Cmake:cmake-2.8.12.2.tar.gz

      10. Protobuf:protobuf-2.5.0.tar.gz

      11. Snappy:snappy-1.1.1.tar.gz

2 、 准备编译环境

  安装以下软件:

  yum -y install svn

  yum -y install autoconf automake libtool cmake

  yum -y install ncurses-devel

  yum -y install openssl-devel

3 、 安装 gcc

  先使用命令检测一下看 gcc 是否已经安装过了

  [[email protected]_hadoop soft]# gcc -v

 

  gcc version 4.4.7 20120313 (Red Hat 4.4.7-16) (GCC)

  如果最后一行出现如上的 gcc 版本信息日志,表示已经安装成功过了。

  不然使用命令安装:

  yum install gcc

4 、 安装 gcc-c++

  直接使用命令安装:yum install gcc-c++

 

5 、 安装 make

  先检测以下系统是否安装了 make 工具:make -version

  技术分享图片

  如果没有安装过 make,那么请使用命令安装:yum install -y make

6 、 安装 maven (必须)

  (1) 解压

    tar -zxvf /root/soft/apache-maven-3.3.3-bin.tar.gz -C /root/apps/

  (2)修改配置文件(如果需要更改默认的 maven 仓库路径的话)

    1. 进入到 maven 安装包的 conf 目录:/root/apps/apache-maven-3.3.3/conf

    2. 修改配置文件 settings.xml

      技术分享图片

    在配置文件的中部找到 localRepository 这个标签,它本来是注释了的,并且有一个默认仓库路径,我们最好自己设置一个,

    所以我自己加了一个,我的路径是:<localRepository>/root/mavenrepo/</localRepository>

    3. 配置环境变量

      export M2_HOME=/root/apps/apache-maven-3.3.3

      export PATH=$PATH:$M2_HOME/bin

    4. 检测是否成功

      source /etc/profile

      mvn -version

      技术分享图片

7 、 安装 jdk (必须)

  1. 解压

    tar -zxvf /root/soft/jdk-7u80-linux-x64.tar.gz -C /root/apps/

  2. 配置环境变量

    export JAVA_HOME=/root/apps/jdk1.7.0_80

    export PATH=$PATH:$JAVA_HOME/bin

    export CLASSPATH=.:/root/apps/jdk1.7.0_80/lib/dt.jar:/root/apps/jdk1.7.0_80/lib/tools.jar

  3. 检测是否安装

    source /etc/profile

    java -version

    技术分享图片

8 、 安装 ant (重要)

  1. 解压

    tar -zxvf /root/soft/apache-ant-1.9.4-bin.tar.gz -C /root/apps/

  2. 配置环境变量

    export ANT_HOME=/root/apps/apache-ant-1.9.4

    export PATH=$PATH:$ANT_HOME/bin

  3. 检测是否安装成功

    source /etc/profile

    ant -version

    技术分享图片

9 、 安装 findbugs (可选 ,最好装 )

  1. 解压

    tar -zxvf /root/soft/findbugs-3.0.0.tar.gz -C /root/apps/

  2. 配置环境变量

    export FINDBUGS_HOME=/root/apps/findbugs-3.0.0

    export PATH=$PATH:$FINDBUGS_HOME/bin

  3. 检测是否安装成功

    [[email protected] apps]# findbugs -version

    技术分享图片

10 、 安装 cmake (重要)

  1. 解压

    tar -zxvf /root/soft/cmake-2.8.12.2.tar.gz -C /root/apps/

  2. 安装

    首先进入到根目录:

    cd /root/apps/cmake-2.8.12.2/

    然后依次执行以下命令:

    ./bootstrap

    gmake & gmake install

  3. 检测是否安装成功

    cmake -version

    技术分享图片

11 、 安装 protobuf (重要)

  1. 解压

    tar -zxvf /root/soft/protobuf-2.5.0.tar.gz -C /root/apps/

  2. 安装

    首先进入到根目录:

    cd /root/apps/protobuf-2.5.0/

    然后依次执行以下命令:

    ./configure --prefix=/root/apps/protobuf (表示安装到这个目录)

    make

    make check

    make install

  3. 配置环境变量

    export PROTOBUF_HOME=/root/apps/protobuf

    export PATH=$PATH:$PROTOBUF_HOME/bin

  注意:PROTOBUF_HOME 就是我们在执行 configure 命令的时候指定的目录

  4. 检测是否安装成功

    [[email protected] ~]# protoc --version

    技术分享图片

12 、 安装 Snappy

  1. 使用 root 用户安装 Snappy:

    tar -zxvf /root/soft/snappy-1.1.1.tar.gz -C /root/apps/

  2. 编译安装:

    cd snappy-1.1.1/

    ./configure

    make

    make install

  3. 查看 snappy 库文件

    ls -lh /usr/local/lib |grep snappy

    技术分享图片

13 、 编译 hadoop

  1. 解压 hadoop 源码包

    tar -zxvf /root/soft/hadoop-2.6.5-src.tar.gz -C /root/apps/

  2. 在编译之前防止 java.lang.OutOfMemoryError: Java heap space 堆栈问题,在 centos 系

    统中执行命令:export MAVEN_OPTS="-Xms256m -Xmx512m"

  3. 进入到源码包根目录下

    cd /root/apps/hadoop-2.6.5-src

  4. 执行命令编译

    Create binary distribution with native code and with documentation:

    mvn package -Pdist,native,docs -DskipTests –Dtar

  如果中途编译失败,并且不要文档的话,请使用这个命令:

    mvn clean package -Pdist,native -DskipTests -Dtar -Dsnappy.lib=/usr/local/lib -Dbundle.snappy -Drequire.openssl

  PS :tar dist 表示用 maven 项目管理工具对 hadoop 进行编译,编译好了之后会打成 tar.gz包放到 hadoop-dist 目录下,

    nativedocs 表示编译出来会编译出来本地库,并且把文档打包到该.tar.gz 文件中,skipTests 表示忽略测试

 

  5. 静静等待编译……. 第一次编译预估一个小时左右。

 

  6. 编译成功了的话,最后的日志信息:
  [INFO] Reactor Summary:
  [INFO]
  [INFO] Apache Hadoop Main ................................. SUCCESS [05:52 min]
  [INFO] Apache Hadoop Project POM .......................... SUCCESS [02:23 min]
  [INFO] Apache Hadoop Annotations .......................... SUCCESS [01:11 min]
  [INFO] Apache Hadoop Assemblies ........................... SUCCESS [ 0.267 s]
  [INFO] Apache Hadoop Project Dist POM ..................... SUCCESS [01:08 min]
  [INFO] Apache Hadoop Maven Plugins ........................ SUCCESS [01:13 min]
  [INFO] Apache Hadoop MiniKDC .............................. SUCCESS [06:36 min]
  [INFO] Apache Hadoop Auth ................................. SUCCESS [04:32 min]
  [INFO] Apache Hadoop Auth Examples ........................ SUCCESS [ 22.255 s]
  [INFO] Apache Hadoop Common ............................... SUCCESS [08:12 min]
  [INFO] Apache Hadoop NFS .................................. SUCCESS [ 8.060 s]
  [INFO] Apache Hadoop KMS .................................. SUCCESS [01:37 min]
  [INFO] Apache Hadoop Common Project ....................... SUCCESS [ 0.039 s]
  [INFO] Apache Hadoop HDFS ................................. SUCCESS [05:54 min]
  [INFO] Apache Hadoop HttpFS ............................... SUCCESS [01:24 min]
  [INFO] Apache Hadoop HDFS BookKeeper Journal .............. SUCCESS [01:11 min]
  [INFO] Apache Hadoop HDFS-NFS ............................. SUCCESS [ 4.370 s]
  [INFO] Apache Hadoop HDFS Project ......................... SUCCESS [ 0.048 s]
  [INFO] hadoop-yarn ........................................ SUCCESS [ 0.040 s]
  [INFO] hadoop-yarn-api .................................... SUCCESS [01:26 min]
  [INFO] hadoop-yarn-common ................................. SUCCESS [02:11 min]
  [INFO] hadoop-yarn-server ................................. SUCCESS [ 0.048 s]
  [INFO] hadoop-yarn-server-common .......................... SUCCESS [ 32.832 s]
  [INFO] hadoop-yarn-server-nodemanager ..................... SUCCESS [03:11 min]
  [INFO] hadoop-yarn-server-web-proxy ....................... SUCCESS [ 2.603 s]
  [INFO] hadoop-yarn-server-applicationhistoryservice ....... SUCCESS [ 6.006 s]
  [INFO] hadoop-yarn-server-resourcemanager ................. SUCCESS [ 19.331 s]
  [INFO] hadoop-yarn-server-tests ........................... SUCCESS [ 5.008 s]
  [INFO] hadoop-yarn-client ................................. SUCCESS [ 7.812 s]
  [INFO] hadoop-yarn-applications ........................... SUCCESS [ 0.049 s]
  [INFO] hadoop-yarn-applications-distributedshell .......... SUCCESS [ 2.211 s]
  [INFO] hadoop-yarn-applications-unmanaged-am-launcher ..... SUCCESS [ 1.773 s]
  [INFO] hadoop-yarn-site ................................... SUCCESS [ 0.033 s]
  [INFO] hadoop-yarn-registry ............................... SUCCESS [ 4.927 s]
  [INFO] hadoop-yarn-project ................................ SUCCESS [ 2.988 s]
  [INFO] hadoop-mapreduce-client ............................ SUCCESS [ 0.052 s]
  [INFO] hadoop-mapreduce-client-core ....................... SUCCESS [ 22.567 s]
  [INFO] hadoop-mapreduce-client-common ..................... SUCCESS [ 16.208 s]
  [INFO] hadoop-mapreduce-client-shuffle .................... SUCCESS [ 4.122 s]
  [INFO] hadoop-mapreduce-client-app ........................ SUCCESS [ 9.984 s]
  [INFO] hadoop-mapreduce-client-hs ......................... SUCCESS [ 7.873 s]
  [INFO] hadoop-mapreduce-client-jobclient .................. SUCCESS [ 31.351 s]
  [INFO] hadoop-mapreduce-client-hs-plugins ................. SUCCESS [ 1.769 s]
  [INFO] Apache Hadoop MapReduce Examples ................... SUCCESS [ 5.106 s]
  [INFO] hadoop-mapreduce ................................... SUCCESS [ 2.335 s]
  [INFO] Apache Hadoop MapReduce Streaming .................. SUCCESS [ 11.850 s]
  [INFO] Apache Hadoop Distributed Copy ..................... SUCCESS [ 33.553 s]
  [INFO] Apache Hadoop Archives ............................. SUCCESS [ 1.983 s]
  [INFO] Apache Hadoop Rumen ................................ SUCCESS [ 5.720 s]
  [INFO] Apache Hadoop Gridmix .............................. SUCCESS [ 4.326 s]
  [INFO] Apache Hadoop Data Join ............................ SUCCESS [ 2.572 s]
  [INFO] Apache Hadoop Ant Tasks ............................ SUCCESS [ 1.878 s]
  [INFO] Apache Hadoop Extras ............................... SUCCESS [ 2.936 s]
  [INFO] Apache Hadoop Pipes ................................ SUCCESS [ 6.255 s]
  [INFO] Apache Hadoop OpenStack support .................... SUCCESS [ 4.872 s]
  [INFO] Apache Hadoop Amazon Web Services support .......... SUCCESS [05:32 min]
  [INFO] Apache Hadoop Client ............................... SUCCESS [ 4.333 s]
  [INFO] Apache Hadoop Mini-Cluster ......................... SUCCESS [ 0.094 s]
  [INFO] Apache Hadoop Scheduler Load Simulator ............. SUCCESS [ 3.894 s]
  [INFO] Apache Hadoop Tools Dist ........................... SUCCESS [ 6.618 s]
  [INFO] Apache Hadoop Tools ................................ SUCCESS [ 0.047 s]
  [INFO] Apache Hadoop Distribution ......................... SUCCESS [ 13.077 s]
  [INFO] ------------------------------------------------------------------------
  [INFO] BUILD SUCCESS
  [INFO] ------------------------------------------------------------------------
  [INFO] Total time: 01:01 h
  [INFO] Finished at: 2016-12-31T01:43:51-08:00
  [INFO] Final Memory: 93M/395M
  [INFO] ------------------------------------------------------------------------

  7. 编译成功之后,hadoop-2.6.5.tar.gz 位于/root/apps/hadoop-2.6.5-src/hadoop-dist/target 目录下,这是编译后该文件夹的状态:

    技术分享图片

 

至此,大功告成。恭喜。








































































以上是关于Hadoop自己编译Hadoop安装包的主要内容,如果未能解决你的问题,请参考以下文章

奈学教育:Hadoop源码编译全流程分享

奈学教育:Hadoop源码编译全流程分享

奈学教育:Hadoop源码编译全流程分享

奈学教育:Hadoop源码编译全流程分享

hadoop 源码包编译

Hadoop2.x系列之源码编译