Apache Falcon

Falcon is a feed processing and feed management system aimed at making it easier for end consumers to onboard their feed processing and feed management on hadoop clusters.

Why Apache Falcon?

Dependencies across various data processing pipelines are not easy to establish. Gaps here typically leads to either incorrect/partial processing or expensive reprocessing. Repeated duplicate definition of a single feed multiple times can lead to inconsistencies / issues.
Input data may not arrive always on time and it is required to kick off the processing without waiting for all data to arrive and accommodate late data separately
Feed management services such as feed retention, replications across clusters, archival etc are tasks that are burdensome on individual pipeline owners and better offered as a service for all customers.
It should be easy to onboard new workflows/pipelines
Smoother integration with metastore/catalog
Provide notification to end customer based on availability of feed groups (logical group of related feeds, which are likely to be used together)

Online Documentation

You can find the documentation on Apache Falcon website.

How to Contribute

Before opening a pull request, please go through the Contributing to Apache Falcon wiki. It lists steps that are required before creating a PR and the conventions that we follow. If you are looking for issues to pick up then you can look at starter tasks or open tasks

Release Notes

You can download release notes of previous releases from the following links.

0.8

0.7

Name		Name	Last commit message	Last commit date
Latest commit History 2,227 Commits
acquisition		acquisition
addons		addons
archival		archival
build-tools		build-tools
cli		cli
client		client
common-types		common-types
common		common
distro		distro
docs		docs
examples		examples
extensions		extensions
falcon-regression		falcon-regression
falcon-ui		falcon-ui
hadoop-dependencies		hadoop-dependencies
html5-ui		html5-ui
lifecycle		lifecycle
messaging		messaging
metrics		metrics
monitoring		monitoring
oozie-el-extensions		oozie-el-extensions
oozie		oozie
prism		prism
release-docs		release-docs
replication		replication
rerun		rerun
retention		retention
scheduler		scheduler
shell		shell
src		src
test-tools		test-tools
test-util		test-util
titan		titan
unit		unit
webapp		webapp
.gitignore		.gitignore
.reviewboardrc		.reviewboardrc
CHANGES.txt		CHANGES.txt
Installation-steps.txt		Installation-steps.txt
LICENSE.txt		LICENSE.txt
NOTICE.txt		NOTICE.txt
README.md		README.md
falcon_merge_pr.py		falcon_merge_pr.py
pom.xml		pom.xml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Apache Falcon

Why Apache Falcon?

Online Documentation

How to Contribute

Release Notes

About

Releases

Packages

Contributors 28

Languages

License

apache/falcon

Folders and files

Latest commit

History

Repository files navigation

Apache Falcon

Why Apache Falcon?

Online Documentation

How to Contribute

Release Notes

About

Topics

Resources

License

Code of conduct

Security policy

Stars

Watchers

Forks

Releases

Packages 0

Contributors 28

Languages

Packages