You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Apache Flink is quickly gaining momentum as an alternative to Spark Streaming, Storm, etc.
Apache Flink is an open source platform for distributed stream and batch data processing. Flink’s core is a streaming dataflow engine that provides data distribution, communication, and fault tolerance for distributed computations over data streams.
What are your thoughts on developing a plugin for Flink Streaming in StreamFlow? The rationale is that Flink provides a Storm compatible API:
Flink provides a Storm compatible API (org.apache.flink.storm.api) that offers replacements for the following classes:
TopologyBuilder replaced by FlinkTopologyBuilder
StormSubmitter replaced by FlinkSubmitter
NimbusClient and Client replaced by FlinkClient
LocalCluster replaced by FlinkLocalCluster
In order to submit a Storm topology to Flink, it is sufficient to replace the used Storm classes with their Flink replacements in the Storm client code that assembles the topology. The actual runtime code, ie, Spouts and Bolts, can be uses unmodified. If a topology is executed in a remote cluster, parameters nimbus.host and nimbus.thrift.port are used as jobmanger.rpc.address and jobmanger.rpc.port, respectively. If a parameter is not specified, the value is taken from flink-conf.yaml.
The text was updated successfully, but these errors were encountered:
StreamFlow now uses an external process for deploying a StreamFlow topology to a Storm cluster. It should be relatively straight forward to implement an alternate submitter. More changes will likely be required to provide the hooks for topology status and metrics.
What's the primary motivation for Flink integration? Performance?
I saw the word-count performance comparison, but it was comparing a Storm topology to a Flink DSL approach. Are there any performance comparisons of running an unmodified Storm topology in Flink vs natively in a Storm cluster?
Apache Flink is quickly gaining momentum as an alternative to Spark Streaming, Storm, etc.
What are your thoughts on developing a plugin for Flink Streaming in StreamFlow? The rationale is that Flink provides a Storm compatible API:
The text was updated successfully, but these errors were encountered: