如何在 Ubuntu 上安装 Impala? [关闭]

Posted

技术标签:

【中文标题】如何在 Ubuntu 上安装 Impala? [关闭]【英文标题】:How to install Impala on Ubuntu? [closed] 【发布时间】:2018-09-25 16:08:46 【问题描述】:

我想在 Ubuntu 实例上安装 Impala。到目前为止,以下方法都没有奏效。

如何在 Ubuntu 中安装稳定版本的 Impala?

方法失败 nr. 1:apt-get

首先我尝试使用

安装二进制文件
sudo apt-get update
sudo apt-get install impala
sudo apt-get install impala-server
sudo apt-get install impala-state-store

但是,Impala 存储库的公钥存在问题:

Err:8 https://archive.cloudera.com/impala/ubuntu/lucid/amd64/impala bionic Release
  404  Not Found [IP: 151.101.192.167 443]
Err:4 http://archive.cloudera.com/impala/ubuntu/lucid/amd64/impala lucid-impala1 InRelease
  The following signatures couldn't be verified because the public key is not available: NO_PUBKEY 327574EE02A818DD
Reading package lists... Done
E: The repository 'https://archive.cloudera.com/impala/ubuntu/lucid/amd64/impala bionic Release' does not have a Release file.
N: Updating from such a repository can't be done securely, and is therefore disabled by defau
lt.
N: See apt-secure(8) manpage for repository creation and user configuration details.
W: GPG error: http://archive.cloudera.com/impala/ubuntu/lucid/amd64/impala lucid-impala1 InRelease: The following signatures couldn't be verified because the public key is not available:
 NO_PUBKEY 327574EE02A818DD
E: The repository 'http://archive.cloudera.com/impala/ubuntu/lucid/amd64/impala lucid-impala1
 InRelease' is not signed.
N: Updating from such a repository can't be done securely, and is therefore disabled by default.
N: See apt-secure(8) manpage for repository creation and user configuration details.

方法失败 nr. 2:构建它

我做了git clone https://git-wip-us.apache.org/repos/asf/impala.git,然后跑了bin/bootstrap_build.sh

但是,文件 bootstrap_build.sh 在最后一行 ./buildall.sh -notests -so 中引用了一个不存在的 shell 脚本。

更新 1: 当我运行 sudo dpkg --install impala_2.11.0+cdh5.14.4+0-1.cdh5.14.4.p0.4~xenial-cdh5.14.4_amd64.deb 时,出现以下依赖项错误:

dpkg: dependency problems prevent configuration of impala:
 impala depends on bigtop-utils (>= 0.7); however:
  Package bigtop-utils is not installed.
 impala depends on hadoop; however:
  Package hadoop is not installed.
 impala depends on hadoop-hdfs; however:
  Package hadoop-hdfs is not installed.
 impala depends on hadoop-yarn; however:
  Package hadoop-yarn is not installed.
 impala depends on hadoop-mapreduce; however:
  Package hadoop-mapreduce is not installed.
 impala depends on hbase; however:
  Package hbase is not installed.
 impala depends on hive (>= 0.12.0+cdh5.1.0); however:
  Package hive is not installed.
 impala depends on zookeeper; however:
  Package zookeeper is not installed.
 impala depends on libhdfs0; however:
  Package libhdfs0 is not installed.
 impala depends on avro-libs; however:
  Package avro-libs is not installed.
 impala depends on parquet; however:
  Package parquet is not installed.
 impala depends on sentry (>= 1.3.0+cdh5.1.0); however:
  Package sentry is not installed.
dpkg: error processing package impala (--install):
 dependency problems - leaving unconfigured
Processing triggers for libc-bin (2.27-3ubuntu1) ...
Errors were encountered while processing:
 impala

更新 2: 按照this tutorial 安装 Hadoop 并没有解决 Hadoop 依赖问题。

【问题讨论】:

您是否计划针对 HBase 或 Hive 表运行 Impala?你打算在 HDFS 上运行它吗? 我想针对 standalone HBase 实例运行 Impala。 你能通过这个命令lsb-release -a 发布你的ubuntu 版本吗,也可以发布sudo apt list --installed 的输出,这将有助于找到安装的确切软件包。 【参考方案1】:

如果您获取与某个版本捆绑在一起的软件包,则安装 Impala 会更容易,因为这些软件包与一组软件版本相关联。选择一个 CDH 版本(例如 cdh5.14.4),然后安装适当的软件包。所以看:

http://archive.cloudera.com/cdh5/ubuntu/xenial/amd64/cdh/pool/contrib/i/impala/

您可以wget 软件包然后运行dpkg install 来安装它们:

wget http://archive.cloudera.com/cdh5/ubuntu/xenial/amd64/cdh/pool/contrib/i/impala/impala-catalog_2.11.0+cdh5.14.4+0-1.cdh5.14.4.p0.4~xenial-cdh5.14.4_amd64.deb
wget http://archive.cloudera.com/cdh5/ubuntu/xenial/amd64/cdh/pool/contrib/i/impala/impala-dbg_2.11.0+cdh5.14.4+0-1.cdh5.14.4.p0.4~xenial-cdh5.14.4_amd64.deb
wget http://archive.cloudera.com/cdh5/ubuntu/xenial/amd64/cdh/pool/contrib/i/impala/impala-server_2.11.0+cdh5.14.4+0-1.cdh5.14.4.p0.4~xenial-cdh5.14.4_amd64.deb
wget http://archive.cloudera.com/cdh5/ubuntu/xenial/amd64/cdh/pool/contrib/i/impala/impala-shell_2.11.0+cdh5.14.4+0-1.cdh5.14.4.p0.4~xenial-cdh5.14.4_amd64.deb
wget http://archive.cloudera.com/cdh5/ubuntu/xenial/amd64/cdh/pool/contrib/i/impala/impala-state-store_2.11.0+cdh5.14.4+0-1.cdh5.14.4.p0.4~xenial-cdh5.14.4_amd64.deb
wget http://archive.cloudera.com/cdh5/ubuntu/xenial/amd64/cdh/pool/contrib/i/impala/impala_2.11.0+cdh5.14.4+0-1.cdh5.14.4.p0.4~xenial-cdh5.14.4_amd64.deb

dpkg --install impal*.deb

要在 Ubuntu16 上设置 Impala,请执行以下操作:

wget 'https://archive.cloudera.com/cdh5/ubuntu/xenial/amd64/cdh/cloudera.list'
mv cloudera.list /etc/apt/sources.list.d/cloudera.list
wget 'https://archive.cloudera.com/cdh5/ubuntu/xenial/amd64/cdh/archive.key'
apt-key add archive.key`

然后执行原指令:

sudo apt-get update
sudo apt-get install impala
sudo apt-get install impala-server
sudo apt-get install impala-state-store

参考文献

Installing the Latest CDH 5 Release - On Ubuntu or Debian Systems Installing the Latest CDH 5 Release - Step 2: Optionally Add a Repository Key

【讨论】:

谢谢。如何自动安装所有必需的依赖项(请参阅我的问题中的更新 1)? 更新答案。【参考方案2】:

我会建议使用 github repo 但稳定的分支,即 2.x

虽然从 dpkg 安装不会安装所有依赖项,但您有两个选择:

    从错误中安装所有依赖项

    我已经使用下面的脚本/heredoc 来生成命令。通过更改heredoc中的错误消息,它可以与其他错误一起使用。

    使用apt-get install ./impala_2.11.0+cdh5.14.4+0-1.cdh5.14.4.p0.4~xenial-cdh5.14.4_amd64.deb

    这是一个更简单的方法,因为一切都得到了照顾。

$ echo "sudo apt-get install " `grep "Package" << Devidas | awk 'print $2' | tr '\r\n' ' ' ; echo ' '
dpkg: dependency problems prevent configuration of impala:
 impala depends on bigtop-utils (>= 0.7); however:
  Package bigtop-utils is not installed.
 impala depends on hadoop; however:
  Package hadoop is not installed.
 impala depends on hadoop-hdfs; however:
  Package hadoop-hdfs is not installed.
 impala depends on hadoop-yarn; however:
  Package hadoop-yarn is not installed.
 impala depends on hadoop-mapreduce; however:
  Package hadoop-mapreduce is not installed.
 impala depends on hbase; however:
  Package hbase is not installed.
 impala depends on hive (>= 0.12.0+cdh5.1.0); however:
  Package hive is not installed.
 impala depends on zookeeper; however:
  Package zookeeper is not installed.
 impala depends on libhdfs0; however:
  Package libhdfs0 is not installed.
 impala depends on avro-libs; however:
  Package avro-libs is not installed.
 impala depends on parquet; however:
  Package parquet is not installed.
 impala depends on sentry (>= 1.3.0+cdh5.1.0); however:
  Package sentry is not installed.
dpkg: error processing package impala (--install):
 dependency problems - leaving unconfigured
Processing triggers for libc-bin (2.27-3ubuntu1) ...
Errors were encountered while processing:
 impala
Devidas`

这应该会产生类似的结果:

sudo apt-get install bigtop-utils hadoop hadoop-hdfs hadoop-yarn hadoop-mapreduce \
    hbase hive zookeeper libhdfs0 avro-libs parquet sentry

【讨论】:

以上是关于如何在 Ubuntu 上安装 Impala? [关闭]的主要内容,如果未能解决你的问题,请参考以下文章

教你如何快速掌握Kudu+Impala

在没有 cloudera manager 的情况下安装 cloudera impala

什么是impala,如何安装使用Impala

什么是impala,如何安装使用Impala

什么是impala,如何安装使用Impala

什么是impala,如何安装使用Impala