docker中搭建分布式hadoop集群

Posted

tags:

篇首语:本文由小常识网(cha138.com)小编为大家整理,主要介绍了docker中搭建分布式hadoop集群相关的知识,希望对你有一定的参考价值。

1、pull Ubuntu镜像配置Java环境

2、下载hadoop软件包

3、配置JAVA_HOME(hadoop-env.sh、mapred-env.sh、yarn-env.sh)

4、配置core-site.xml

<configuration>
    <property>
        <name>fs.defaultFS</name>
        <value>hdfs://127.17.0.5:8020</value>
    </property>
    <property>
        <name>hadoop.tmp.dir</name>
        <value>/home/root/data/tmp</value>
    </property>
</configuration>

5、配置hdfs-site.xml

<configuration>
    <property>
        <name>dfs.namenode.secondary.http-address</name>
        <value>172.17.0.2:50090</value>
    </property>
</configuration>

6、配置slave

127.17.0.5
127.17.0.6
127.17.0.2

7、配置yarn-site.xml

<configuration>
    <property>
        <name>yarn.nodemanager.aux-services</name>
        <value>mapreduce_shuffle</value>
    </property>
    <property>
        <name>yarn.resourcemanager.hostname</name>
        <value>172.17.0.6</value>
    </property>
    <property>
        <name>yarn.log-aggregation-enable</name>
        <value>true</value>
    </property>
    <property>
        <name>yarn.log-aggregation.retain-seconds</name>
        <value>106800</value>
    </property>
</configuration>

8、配置mapred-site.xml

cp mapred-site.xml.template mapred-site.xml
<configuration>
    <property>
        <name>mapreduce.framework.name</name>
        <value>yarn</value>
    </property>
    <property>
        <name>mapreduce.jobhistory.address</name>
        <value>172.17.0.5:10020</value>
    </property>
    <property>
        <name>mapreduce.jobhistory.webapp.address</name>
        <value>172.17.0.5:19888</value>
    </property>
</configuration>

9、设置ssh登录

安装sshd

apt-get install openssh-server
service ssh start
ps -e | grep ssh

生成秘钥

ssh-keygen -t rsa

设置root密码

passwd

设置root远程登录 PermitRootLogin yes 

vim /etc/ssh/sshd_config
/etc/init.d/ssh restart

分发公钥

ssh-copy-id 172.17.0.5
ssh-copy-id 172.17.0.6
ssh-copy-id 172.17.0.2

NameNode执行格式化

hdfs namenode –format

NodeNode(172.17.0.5)上启动HDFS集群

/sbin/start-dfs.sh

启动出错

The authenticity of host 127.17.0.2 (127.17.0.2) cant be established.
Host key verification failed.
vi /etc/ssh/ssh_config

修改 StrictHostKeyChecking no 

  172.17.0.6  上启动ResourceManager

sbin/yarn-daemon.sh start resourcemanager

 




以上是关于docker中搭建分布式hadoop集群的主要内容,如果未能解决你的问题,请参考以下文章

docker中搭建分布式hadoop集群

用Docker在一台笔记本电脑上搭建一个具有10个节点7种角色的Hadoop集群(下)-搭建Hadoop集群

docker搭建Hadoop集群

Docker搭建Hadoop环境

基于docker的hadoop集群搭建

搭建docker-spark-hadoop-hive-zeppelin分布式集群环境