flume_monitor_source is a Flume source plug-in that monitors files under a specified directory. It differs from Flume's built-in 'spooldir' source in the following ways:
- flume_monitor_source reads data incrementally from the specified directory in real time, so files in that directory may still be written to (append operations only). The 'spooldir' source does not support this.
- flume_monitor_source can treat multiple lines, such as a Java call stack or exception, as ONE complete, understandable record, whereas Flume handles only one line at a time.
- After a restart, flume_monitor_source resumes processing each file from the point it had reached when it was last stopped.
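
The multi-line handling described above can be illustrated with a minimal sketch. This is not the plug-in's actual implementation: the class `MultiLineGrouper`, the method `group`, and the rule that a record starts with a date such as `2015-01-31` are all assumptions made for illustration.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.regex.Pattern;

// Hypothetical helper, not part of flume_monitor_source.
class MultiLineGrouper {
    // Assumption: a new record begins with a date like "2015-01-31".
    private static final Pattern RECORD_START =
            Pattern.compile("^\\d{4}-\\d{2}-\\d{2}");

    /** Groups raw lines into records; continuation lines join the preceding record. */
    static List<String> group(List<String> lines) {
        List<String> records = new ArrayList<>();
        StringBuilder current = null;
        for (String line : lines) {
            boolean startsRecord = RECORD_START.matcher(line).find();
            if (startsRecord || current == null) {
                // Flush the record collected so far, then start a new one.
                if (current != null) {
                    records.add(current.toString());
                }
                current = new StringBuilder(line);
            } else {
                // Continuation line (for example a stack frame): append it.
                current.append('\n').append(line);
            }
        }
        if (current != null) {
            records.add(current.toString());
        }
        return records;
    }

    public static void main(String[] args) {
        List<String> lines = Arrays.asList(
                "2015-01-31 12:00:00 ERROR Request failed",
                "java.lang.IllegalStateException: boom",
                "\tat com.example.Service.run(Service.java:42)",
                "2015-01-31 12:00:01 INFO Recovered");
        for (String record : group(lines)) {
            System.out.println("--- record ---");
            System.out.println(record);
        }
    }
}
```

With this input, the exception and its stack frame stay attached to the ERROR line, so two records are produced instead of four separate lines.
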
- Build the jar: `ant jar`
- Copy the jar to Flume's lib directory: `cp dist/flume-monitor-source-0.1.jar ${FLUME_HOME}/lib`
You are expected to already know how to use Flume. See the Flume Documentation.
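
As a rough sketch of how this source might be wired into an agent, the fragment below uses only the `type` and `monitor_dir` properties documented for this plug-in. The agent name `agent`, the component names `monitor`, `mem`, and `log`, and the path `/var/log/myapp` are illustrative assumptions; the memory channel and logger sink are standard Flume components chosen only to make the example complete. Other properties of this source may also be required.

```properties
# Illustrative names: agent, monitor, mem, log
agent.sources = monitor
agent.channels = mem
agent.sinks = log

# The monitor source from this plug-in
agent.sources.monitor.type = com.minsheng.flume.source.MonitorDirectorySource
# Assumed example path; point this at the directory to monitor
agent.sources.monitor.monitor_dir = /var/log/myapp
agent.sources.monitor.channels = mem

# Standard Flume memory channel and logger sink
agent.channels.mem.type = memory
agent.sinks.log.type = logger
agent.sinks.log.channel = mem
```
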
Property Name | Default | Description |
---|---|---|
type | - | The component type name; must be com.minsheng.flume.source.MonitorDirectorySource |
monitor_dir | - | Required. The directory to monitor. Files in it that satisfy the condition are parsed and sent to the Flume channel |
meta_store_dir |