Flume环境搭建_五种案例
Posted 日月的弯刀
tags:
篇首语:本文由小常识网(cha138.com)小编为大家整理,主要介绍了Flume环境搭建_五种案例相关的知识,希望对你有一定的参考价值。
Flume环境搭建_五种案例
http://flume.apache.org/FlumeUserGuide.html
A simple example
Here, we give an example configuration file, describing a single-node Flume deployment. This configuration lets a user generate events and subsequently logs them to the console.
# example.conf: A single-node Flume configuration
# Name the components on this agent
a1.sources = r1
a1.sinks = k1
a1.channels = c1
# Describe/configure the source
a1.sources.r1.type = netcat
a1.sources.r1.bind = localhost
a1.sources.r1.port = 44444
# Describe the sink
a1.sinks.k1.type = logger
# Use a channel which buffers events in memory
a1.channels.c1.type = memory
a1.channels.c1.capacity = 1000
a1.channels.c1.transactionCapacity = 100
# Bind the source and sink to the channel
a1.sources.r1.channels = c1
a1.sinks.k1.channel = c1
This configuration defines a single agent named a1. a1 has a source that listens for data on port 44444, a channel that buffers event data in memory, and a sink that logs event data to the console. The configuration file names the various components, then describes their types and configuration parameters. A given configuration file might define several named agents; when a given Flume process is launched a flag is passed telling it which named agent to manifest.
Given this configuration file, we can start Flume as follows:
$ bin/flume-ng agent --conf conf --conf-file example.conf --name a1 -Dflume.root.logger=INFO,console
Note that in a full deployment we would typically include one more option: --conf=<conf-dir>. The <conf-dir> directory would include a shell script flume-env.sh and potentially a log4j properties file. In this example, we pass a Java option to force Flume to log to the console and we go without a custom environment script.
From a separate terminal, we can then telnet port 44444 and send Flume an event:
$ telnet localhost 44444
Trying 127.0.0.1...
Connected to localhost.localdomain (127.0.0.1).
Escape character is \'^]\'.
Hello world! <ENTER>
OK
The original Flume terminal will output the event in a log message.
12/06/19 15:32:19 INFO source.NetcatSource: Source starting
12/06/19 15:32:19 INFO source.NetcatSource: Created serverSocket:sun.nio.ch.ServerSocketChannelImpl[/127.0.0.1:44444]
12/06/19 15:32:34 INFO sink.LoggerSink: Event: { headers:{} body: 48 65 6C 6C 6F 20 77 6F 72 6C 64 21 0D Hello world!. }
Congratulations - you’ve successfully configured and deployed a Flume agent! Subsequent sections cover agent configuration in much more detail.
以下为具体搭建流程
Flume搭建_案例一:单个Flume
安装node2上
1. 上传到/home/tools,解压,解压后移动到/home下
2. 重命名,并修改flume-env.sh
data:image/s3,"s3://crabby-images/b3621/b3621002bd1f4067f1a2bbccb96341e7d1b2fd8c" alt=""
data:image/s3,"s3://crabby-images/6b34d/6b34ddb7fc8730c795325c6573dc34051ea5a16e" alt=""
data:image/s3,"s3://crabby-images/0bd9a/0bd9a6788ee853c275ce123073cedca003b7fe69" alt=""
data:image/s3,"s3://crabby-images/2e80c/2e80c37ce535145d4266e61a93b3173cf0fbcbbc" alt=""
data:image/s3,"s3://crabby-images/384b6/384b6d57b76afcfd6088661915ea4cb1a89df41d" alt=""
data:image/s3,"s3://crabby-images/62551/62551b8bb0c73a7a418151b3cc3ea2e316029e06" alt=""
data:image/s3,"s3://crabby-images/2388a/2388aed7c7aff0576db972f66c7f9b9710d74387" alt=""
data:image/s3,"s3://crabby-images/48ec7/48ec78aeec43c860761ee1ac5cb5236c3bc9ad2b" alt=""
data:image/s3,"s3://crabby-images/29a31/29a31d538aec6cfdd37be461831a6b85d970cc8a" alt=""
data:image/s3,"s3://crabby-images/ab87b/ab87b35d3682abad2f75f71e4e0085995cfbb455" alt=""
data:image/s3,"s3://crabby-images/6b351/6b3515011811856e88759b04a0852fac3fa95d6f" alt=""
Flume搭建_案例二:两个Flume做集群
data:image/s3,"s3://crabby-images/3a28a/3a28ac8ffa83658099dbf53919c777c0c369672e" alt=""
MemoryChanel配置
capacity:默认该通道中最大的可以存储的event数量是100,
trasactionCapacity:每次最大可以从source中拿到或者送到sink中的event数量也是100
keep-alive:event添加到通道中或者移出的允许时间
byte**:即event的字节量的限制,只包括eventbody
data:image/s3,"s3://crabby-images/30f71/30f719955f8a4f87b6c85e3401a9295780ea4318" alt=""
data:image/s3,"s3://crabby-images/686b7/686b7494ede349e950b97364de327d36d4e942ba" alt=""
data:image/s3,"s3://crabby-images/adbfd/adbfd835809c6484ddc0f0db01f0f2650e34cea4" alt=""
data:image/s3,"s3://crabby-images/dece7/dece702f5216518ff885468c12df945a6bc1574e" alt=""
先启动node02的Flume
flume-ng agent -n a1 -c conf -f avro.conf -Dflume.root.logger=INFO,console
flume-ng agent -n a1 -c conf -f /home/test_flume/flume22 -Dflume.root.logger=INFO,console
再启动node01的Flume
flume-ng agent -n a1 -c conf -f simple.conf2 -Dflume.root.logger=INFO,console
flume-ng agent -n a1 -c conf -f /home/test_flume/flume21 -Dflume.root.logger=INFO,console
data:image/s3,"s3://crabby-images/15246/15246dfc5c66c02e41f036b67397eeac3be1be25" alt=""
data:image/s3,"s3://crabby-images/595ef/595ef46a3f0e3ac451f6d9f7abcebe7bdafb5d93" alt=""
data:image/s3,"s3://crabby-images/bbe00/bbe003dbe92163f549aadf1f5950f5847eb50361" alt=""
data:image/s3,"s3://crabby-images/bf99a/bf99a92e6200fe5be0d25b99929985353e471562" alt=""
data:image/s3,"s3://crabby-images/46f30/46f30f68839a3c30d47adc3ed12a4cc901288a55" alt=""
data:image/s3,"s3://crabby-images/384f6/384f65bfcfc170984059a82bcac51f9d86067550" alt=""
Flume搭建_案例三:如何监控一个文件的变化?
data:image/s3,"s3://crabby-images/45922/45922f7f7c348ab0e4ea9d748d9528b6bf5419c8" alt=""
data:image/s3,"s3://crabby-images/4ebc2/4ebc2e0f2d62935c914b08b688682bd658c3406c" alt=""
启动Flume
flume-ng agent -n a1 -c conf -f exec.conf -Dflume.root.logger=INFO,console
flume-ng agent -n a1 -c conf -f /home/test_flume/flume3 -Dflume.root.logger=INFO,console
data:image/s3,"s3://crabby-images/1da18/1da1863650eb4aa0d80dfd2a4c7c7bc1e9a1dd31" alt=""
data:image/s3,"s3://crabby-images/deaf2/deaf2b22f119dcf5e25a24346edea9600aa5457a" alt=""
Flume搭建_案例四: 如何监控一个文件:目录的变化?
data:image/s3,"s3://crabby-images/174e9/174e9173654cf12b02e69621a6cae8c39e2cd93d" alt=""
data:image/s3,"s3://crabby-images/1cf97/1cf970c28faa12160ebece6025735e634093fcc9" alt=""
data:image/s3,"s3://crabby-images/cf362/cf3625ba37d675c351e051bf495d105572ef9d3b" alt=""
data:image/s3,"s3://crabby-images/088ef/088efa3e8d16f4914ed4b8ec8f7dbf61b6801560" alt=""
data:image/s3,"s3://crabby-images/2684a/2684a267ab006d1544f840bda2061fbc7a993293" alt=""
data:image/s3,"s3://crabby-images/ce3fc/ce3fcab3625cf76ef288881c370a3d8aced1e7c2" alt=""
Flume搭建_案例五: 如何定义一个HDFS类型的Sink?
data:image/s3,"s3://crabby-images/2e9f6/2e9f6fc86a486ebd6b8c2a80899da1124bb24b59" alt=""
Flume搭建_案例五_配置项解读
data:image/s3,"s3://crabby-images/00a11/00a11e72a1bf2832f968c9e0edb7a361d0ff7270" alt=""
data:image/s3,"s3://crabby-images/c2f7b/c2f7b972fc41b4d4fa19de909b781eeb3e78f598" alt=""
hdfs.rollInterval | 30 | Number of seconds to wait before rolling current file (0 = never roll based on time interval) |
hdfs.rollSize | 1024 | File size to trigger roll, in bytes (0: never roll based on file size) |
hdfs.rollCount | 10 | Number of events written to file before it rolled (0 = never roll based on number of events) |
4. 多长时间没有操作,Flume将一个临时文件生成新文件?
hdfs.idleTimeout | 0 | Timeout after which inactive files get closed (0 = disable automatic closing of idle files) |
5. 多长时间生成一个新的目录?(比如每10s生成一个新的目录)
四舍五入,没有五入,只有四舍
(比如57分划分为55分,5,6,7,8,9在一个目录,10,11,12,13,14在一个目录)
hdfs.round | false | Should the timestamp be rounded down (if true, affects all time based escape sequences except %t) |
hdfs.roundValue | 1 | Rounded down to the highest multiple of this (in the unit configured using hdfs.roundUnit), less than current time. |
hdfs.roundUnit | second | The unit of the round down value - second, minute or hour. |
data:image/s3,"s3://crabby-images/733cb/733cb8d06815eacf5186d09a149d2d66b1277877" alt=""
data:image/s3,"s3://crabby-images/5961a/5961a30dcbaf451df0c8562477ce5c567af5551f" alt=""
data:image/s3,"s3://crabby-images/b1d1d/b1d1dbc94c55b3103f1502a1a982fb749428f6ff" alt=""
data:image/s3,"s3://crabby-images/8a94b/8a94ba9acd61ab64459d6f0c161146b251425829" alt=""
data:image/s3,"s3://crabby-images/61306/61306318fd71ee753fb3846e20b9621d13da938b" alt=""
data:image/s3,"s3://crabby-images/b5876/b5876296a3e824c78e136b48ded233f49cd0100f" alt=""
以上是关于Flume环境搭建_五种案例的主要内容,如果未能解决你的问题,请参考以下文章