如何在 Ubuntu 上安装 Impala? [关闭]
Posted
技术标签:
【中文标题】如何在 Ubuntu 上安装 Impala? [关闭]【英文标题】:How to install Impala on Ubuntu? [closed] 【发布时间】:2018-09-25 16:08:46 【问题描述】:我想在 Ubuntu 实例上安装 Impala。到目前为止,以下方法都没有奏效。
如何在 Ubuntu 中安装稳定版本的 Impala?
方法失败 nr. 1:apt-get
首先我尝试使用
安装二进制文件sudo apt-get update
sudo apt-get install impala
sudo apt-get install impala-server
sudo apt-get install impala-state-store
但是,Impala 存储库的公钥存在问题:
Err:8 https://archive.cloudera.com/impala/ubuntu/lucid/amd64/impala bionic Release
404 Not Found [IP: 151.101.192.167 443]
Err:4 http://archive.cloudera.com/impala/ubuntu/lucid/amd64/impala lucid-impala1 InRelease
The following signatures couldn't be verified because the public key is not available: NO_PUBKEY 327574EE02A818DD
Reading package lists... Done
E: The repository 'https://archive.cloudera.com/impala/ubuntu/lucid/amd64/impala bionic Release' does not have a Release file.
N: Updating from such a repository can't be done securely, and is therefore disabled by defau
lt.
N: See apt-secure(8) manpage for repository creation and user configuration details.
W: GPG error: http://archive.cloudera.com/impala/ubuntu/lucid/amd64/impala lucid-impala1 InRelease: The following signatures couldn't be verified because the public key is not available:
NO_PUBKEY 327574EE02A818DD
E: The repository 'http://archive.cloudera.com/impala/ubuntu/lucid/amd64/impala lucid-impala1
InRelease' is not signed.
N: Updating from such a repository can't be done securely, and is therefore disabled by default.
N: See apt-secure(8) manpage for repository creation and user configuration details.
方法失败 nr. 2:构建它
我做了git clone https://git-wip-us.apache.org/repos/asf/impala.git
,然后跑了bin/bootstrap_build.sh
。
但是,文件 bootstrap_build.sh 在最后一行 ./buildall.sh -notests -so
中引用了一个不存在的 shell 脚本。
更新 1: 当我运行 sudo dpkg --install impala_2.11.0+cdh5.14.4+0-1.cdh5.14.4.p0.4~xenial-cdh5.14.4_amd64.deb
时,出现以下依赖项错误:
dpkg: dependency problems prevent configuration of impala:
impala depends on bigtop-utils (>= 0.7); however:
Package bigtop-utils is not installed.
impala depends on hadoop; however:
Package hadoop is not installed.
impala depends on hadoop-hdfs; however:
Package hadoop-hdfs is not installed.
impala depends on hadoop-yarn; however:
Package hadoop-yarn is not installed.
impala depends on hadoop-mapreduce; however:
Package hadoop-mapreduce is not installed.
impala depends on hbase; however:
Package hbase is not installed.
impala depends on hive (>= 0.12.0+cdh5.1.0); however:
Package hive is not installed.
impala depends on zookeeper; however:
Package zookeeper is not installed.
impala depends on libhdfs0; however:
Package libhdfs0 is not installed.
impala depends on avro-libs; however:
Package avro-libs is not installed.
impala depends on parquet; however:
Package parquet is not installed.
impala depends on sentry (>= 1.3.0+cdh5.1.0); however:
Package sentry is not installed.
dpkg: error processing package impala (--install):
dependency problems - leaving unconfigured
Processing triggers for libc-bin (2.27-3ubuntu1) ...
Errors were encountered while processing:
impala
更新 2: 按照this tutorial 安装 Hadoop 并没有解决 Hadoop 依赖问题。
【问题讨论】:
您是否计划针对 HBase 或 Hive 表运行 Impala?你打算在 HDFS 上运行它吗? 我想针对 standalone HBase 实例运行 Impala。 你能通过这个命令lsb-release -a
发布你的ubuntu 版本吗,也可以发布sudo apt list --installed
的输出,这将有助于找到安装的确切软件包。
【参考方案1】:
如果您获取与某个版本捆绑在一起的软件包,则安装 Impala 会更容易,因为这些软件包与一组软件版本相关联。选择一个 CDH 版本(例如 cdh5.14.4),然后安装适当的软件包。所以看:
http://archive.cloudera.com/cdh5/ubuntu/xenial/amd64/cdh/pool/contrib/i/impala/
您可以wget
软件包然后运行dpkg install
来安装它们:
wget http://archive.cloudera.com/cdh5/ubuntu/xenial/amd64/cdh/pool/contrib/i/impala/impala-catalog_2.11.0+cdh5.14.4+0-1.cdh5.14.4.p0.4~xenial-cdh5.14.4_amd64.deb
wget http://archive.cloudera.com/cdh5/ubuntu/xenial/amd64/cdh/pool/contrib/i/impala/impala-dbg_2.11.0+cdh5.14.4+0-1.cdh5.14.4.p0.4~xenial-cdh5.14.4_amd64.deb
wget http://archive.cloudera.com/cdh5/ubuntu/xenial/amd64/cdh/pool/contrib/i/impala/impala-server_2.11.0+cdh5.14.4+0-1.cdh5.14.4.p0.4~xenial-cdh5.14.4_amd64.deb
wget http://archive.cloudera.com/cdh5/ubuntu/xenial/amd64/cdh/pool/contrib/i/impala/impala-shell_2.11.0+cdh5.14.4+0-1.cdh5.14.4.p0.4~xenial-cdh5.14.4_amd64.deb
wget http://archive.cloudera.com/cdh5/ubuntu/xenial/amd64/cdh/pool/contrib/i/impala/impala-state-store_2.11.0+cdh5.14.4+0-1.cdh5.14.4.p0.4~xenial-cdh5.14.4_amd64.deb
wget http://archive.cloudera.com/cdh5/ubuntu/xenial/amd64/cdh/pool/contrib/i/impala/impala_2.11.0+cdh5.14.4+0-1.cdh5.14.4.p0.4~xenial-cdh5.14.4_amd64.deb
dpkg --install impal*.deb
要在 Ubuntu16 上设置 Impala,请执行以下操作:
wget 'https://archive.cloudera.com/cdh5/ubuntu/xenial/amd64/cdh/cloudera.list'
mv cloudera.list /etc/apt/sources.list.d/cloudera.list
wget 'https://archive.cloudera.com/cdh5/ubuntu/xenial/amd64/cdh/archive.key'
apt-key add archive.key`
然后执行原指令:
sudo apt-get update
sudo apt-get install impala
sudo apt-get install impala-server
sudo apt-get install impala-state-store
参考文献
Installing the Latest CDH 5 Release - On Ubuntu or Debian Systems Installing the Latest CDH 5 Release - Step 2: Optionally Add a Repository Key【讨论】:
谢谢。如何自动安装所有必需的依赖项(请参阅我的问题中的更新 1)? 更新答案。【参考方案2】:我会建议使用 github repo 但稳定的分支,即 2.x
虽然从 dpkg 安装不会安装所有依赖项,但您有两个选择:
从错误中安装所有依赖项
我已经使用下面的脚本/heredoc 来生成命令。通过更改heredoc中的错误消息,它可以与其他错误一起使用。
使用apt-get install ./impala_2.11.0+cdh5.14.4+0-1.cdh5.14.4.p0.4~xenial-cdh5.14.4_amd64.deb
这是一个更简单的方法,因为一切都得到了照顾。
$ echo "sudo apt-get install " `grep "Package" << Devidas | awk 'print $2' | tr '\r\n' ' ' ; echo ' '
dpkg: dependency problems prevent configuration of impala:
impala depends on bigtop-utils (>= 0.7); however:
Package bigtop-utils is not installed.
impala depends on hadoop; however:
Package hadoop is not installed.
impala depends on hadoop-hdfs; however:
Package hadoop-hdfs is not installed.
impala depends on hadoop-yarn; however:
Package hadoop-yarn is not installed.
impala depends on hadoop-mapreduce; however:
Package hadoop-mapreduce is not installed.
impala depends on hbase; however:
Package hbase is not installed.
impala depends on hive (>= 0.12.0+cdh5.1.0); however:
Package hive is not installed.
impala depends on zookeeper; however:
Package zookeeper is not installed.
impala depends on libhdfs0; however:
Package libhdfs0 is not installed.
impala depends on avro-libs; however:
Package avro-libs is not installed.
impala depends on parquet; however:
Package parquet is not installed.
impala depends on sentry (>= 1.3.0+cdh5.1.0); however:
Package sentry is not installed.
dpkg: error processing package impala (--install):
dependency problems - leaving unconfigured
Processing triggers for libc-bin (2.27-3ubuntu1) ...
Errors were encountered while processing:
impala
Devidas`
这应该会产生类似的结果:
sudo apt-get install bigtop-utils hadoop hadoop-hdfs hadoop-yarn hadoop-mapreduce \
hbase hive zookeeper libhdfs0 avro-libs parquet sentry
【讨论】:
以上是关于如何在 Ubuntu 上安装 Impala? [关闭]的主要内容,如果未能解决你的问题,请参考以下文章