You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Should we discuss if it is time to recreate these types?
I noticed that there is also a 1 class variant of sinks.
So it is time to refactor the "old" 3 class sinks?
There is a category called database flink with an Elasticsearch sink.
Would it be better so rename this and call is distributed sinks?
and add other storage like spark or Hadoop in the future?
Why do want to recreate them? Do you think they are outdated?
Yes, this is the same like for processors. The three-class approach is deprecated and we need to migrate all pipeline elements to the one-class approach. This is just some work to do 😀 Maybe these are suitable tasks as good-first-issues if we provide in example on how to do?
I think this naming is tied to the actual runtime environment, like it is for some processor elements as well.
If we want to support further runtime types like Spark then we should create dedicated modules therefore.
As far as I know, we once supported Spark already here and there, but there was close to zero usage.
Since this increases complexity and maintenance effort significantly, we should carefully think about integration additional runtime environments. @tenthe@dominikriemer please correct me if I'm wrong here
Yes, this is the same like for processors. The three-class approach is deprecated and we need to migrate all pipeline elements to the one-class approach. This is just some work to do 😀 Maybe these are suitable tasks as good-first-issues if we provide in example on how to do?
The DataSinkType refers to the category of the data sink (there is a similar class for processors as well). This is only used in the pipeline editor to better find pipeline elements of a certain group (select group in the pipeline element selection panel). We can extend the list but need to be careful when removing items due to backwards compatibility.
We have some modules with flink appendix which can be only used with the Flink wrapper. Anyways, we haven't maintained the Flink wrapper for quite some time and most recent pipeline elements use the default JVM wrapper. We can rework the distributed wrappers in the future, and then I'd be in favor of reusing the same set of processors for distributed and standalone wrappers instead of having separate implementations.
enhancementNew feature or requestpipeline elementsRelates to pipeline elementsstrategicDiscussions only. Indicates that a discussion refers to a strategic topic.
3 participants
Heading
Bold
Italic
Quote
Code
Link
Numbered list
Unordered list
Task list
Attach files
Mention
Reference
Menu
reacted with thumbs up emoji reacted with thumbs down emoji reacted with laugh emoji reacted with hooray emoji reacted with confused emoji reacted with heart emoji reacted with rocket emoji reacted with eyes emoji
-
in StreamPipes there are different Sink Types
https://github.com/apache/streampipes/blob/dev/streampipes-model/src/main/java/org/apache/streampipes/model/DataSinkType.java
Should we discuss if it is time to recreate these types?
I noticed that there is also a 1 class variant of sinks.
So it is time to refactor the "old" 3 class sinks?
There is a category called database flink with an Elasticsearch sink.
Would it be better so rename this and call is distributed sinks?
and add other storage like spark or Hadoop in the future?
Another topic on sinks?
For documentation there is a new dev section https://cwiki.apache.org/confluence/x/NppbDg
Beta Was this translation helpful? Give feedback.
All reactions