Replies: 6 comments 1 reply
-
In a future iteration - we will move this from |
Beta Was this translation helpful? Give feedback.
-
We might also consider adding More info about
ref: https://stackoverflow.com/a/29515696 |
Beta Was this translation helpful? Give feedback.
-
Hi Emre,
We're already using a key.
Partitioning is achieved using the time window configuration that can be
specified in the stream configuration itself.
We want to consiously stay away from ordering.
The reasons are two-fold
- Our target is the Data Lake and not cloud-based business or analytic
applications, some off which require event ordering
- Event ordering introduces lot of infrastructure-related complexities
which, in the long run, impact the product negatively
Thanks & Regards,
Dipanjan
…On Fri, 18 Mar 2022 at 14:29, Emre Baykal ***@***.***> wrote:
We might also consider adding KAFKA *key* to the MODULES variable the
same way you explained above. Providing function name as a KAFKA *key* to
each function would work if message ordering OR partitioning is needed.
Even if it is not needed now, it may be useful in the future since this is
aimed for real-time events, and real-time events may be requested to be in
order.
More info about KAFKA *key*:
Keys are mostly useful/necessary if you require strong order for a key and
are developing something like a state machine. If you require that messages
with the same key (for instance, a unique id) are always seen in the
correct order, attaching a key to messages will ensure messages with the
same key always go to the same partition in a topic. Kafka guarantees order
within a partition, but not across partitions in a topic, so alternatively
not providing a key - which will result in round-robin distribution across
partitions - will not maintain such order.
ref: https://stackoverflow.com/a/29515696
ref: https://stackoverflow.com/a/61912094
—
Reply to this email directly, view it on GitHub
<#20 (comment)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/AB7GDSZMVTISY2NQWRILEBTVARAWNANCNFSM5RA653MA>
.
Triage notifications on the go with GitHub Mobile for iOS
<https://apps.apple.com/app/apple-store/id1477376905?ct=notification-email&mt=8&pt=524675>
or Android
<https://play.google.com/store/apps/details?id=com.github.android&referrer=utm_campaign%3Dnotification-email%26utm_medium%3Demail%26utm_source%3Dgithub>.
You are receiving this because you authored the thread.Message ID:
***@***.***>
|
Beta Was this translation helpful? Give feedback.
-
Welcome :-)
…On Fri, 18 Mar 2022 at 18:37, Emre Baykal ***@***.***> wrote:
I see. Thank you for the clarification.
—
Reply to this email directly, view it on GitHub
<#20 (reply in thread)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/AB7GDS5YVKP6XTRKFVZ5LGLVAR5ZNANCNFSM5RA653MA>
.
Triage notifications on the go with GitHub Mobile for iOS
<https://apps.apple.com/app/apple-store/id1477376905?ct=notification-email&mt=8&pt=524675>
or Android
<https://play.google.com/store/apps/details?id=com.github.android&referrer=utm_campaign%3Dnotification-email%26utm_medium%3Demail%26utm_source%3Dgithub>.
You are receiving this because you authored the thread.Message ID:
***@***.***>
|
Beta Was this translation helpful? Give feedback.
-
Hi @dipanjanb. Any update regarding this feature? Let me know if anything needed. |
Beta Was this translation helpful? Give feedback.
-
Hi Emre,
Not yet - hope to start from this week.
Thanks & Regards,
Dipanjan
…On Mon, 28 Mar, 2022, 11:28 pm Emre Baykal, ***@***.***> wrote:
Hi @dipanjanb <https://github.com/dipanjanb>. Any update regarding this
feature? Let me know if anything needed.
—
Reply to this email directly, view it on GitHub
<#20 (comment)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/AB7GDSZ5QPGWJDWA4EGML7TVCHXM5ANCNFSM5RA653MA>
.
You are receiving this because you were mentioned.Message ID:
***@***.***>
|
Beta Was this translation helpful? Give feedback.
-
@misterbykl @thtmnisamnstr to farther ease daisy chaining of Stateful Functions in RTDL - I'm contemplating an approach that will help each function in the chain dynamially identify the next stage and write to the ingress topic of the next function in the chain
The chain is intended to look like
ingest -> pii-detection -> some-other-function -> some-more-function -> ingester
The beginning and end would be fixed - everything else would be dynamic.
The way this can be handled is as follows
MODULES : pii-detection, some-other-function, some-more-function
<function name>_in
. e.g. pii-detection_in, some-other-function_in and so on. Somodule.yaml
of these functions will have the fixed names_in
to the same and use the resulting string as the topic name to write to. Presently it is read fromKAFKA_TOPIC
environment variable. This variable would still remain and would be used ifMODULES
is empty i.e. no daisy-chaining is intendedingress
topic - which is what will remain fixed inmodule.yaml
foringester
This way - if a user wants to use multiple functions and add/remove functions as required - they will just need to change the MODULES variable in the
docker-compose.yaml
file and not make changes to a number ofmodule.yaml
files within each function directory.Hope I could explain the idea. Please voice queries if any.
@misterbykl I'll let you know what to do to adopt this for
pii-detection
Beta Was this translation helpful? Give feedback.
All reactions