replace deprecated statefulMapConcat #133

pjfanning · 2024-02-17T14:07:44Z

I'm not sure if this is better but statefulMapConcat is deprecated.

The code is trying the take a Source[String, _] and to remove duplicates.

Both approaches involve building sets and those sets will consume a lot of memory if there is a lot of data. I don't think this is avoidable.

I am not a Cassandra expert but I think we might be able to get Cassandra to run 'DISTINCT' on the query. That could mean that we could remove the statefulMapConcat/statefulMap stage.

nvollmar · 2024-02-23T08:02:08Z

Cassandra only supports distinct queries on the partition key. Since both persistence_id and tag are the partition key we can't distinct query on the tag alone.

Since this is part of the reconciler that has to be manually called and is already marked as a rather expensive operation, I'd think this change is not too risky.

nvollmar

lgtm

core/src/main/scala/org/apache/pekko/persistence/cassandra/reconciler/AllTags.scala

…en adding

pjfanning · 2024-02-23T14:11:43Z

Cassandra only supports distinct queries on the partition key. Since both persistence_id and tag are the partition key we can't distinct query on the tag alone.

Since this is part of the reconciler that has to be manually called and is already marked as a rather expensive operation, I'd think this change is not too risky.

@nvollmar I haven't tried it but if it was possible to sort the query result then the duplicate check would only need to know the last element as opposed to keeping a full set of visited tags. Do you think Cassandra is likely to allow this query to be sorted?

nvollmar · 2024-02-23T20:56:48Z

@pjfanning Cassandra does not allow to sort by arbitrary columns. You can define a cluster ordering of a table, but that also has limitations. A more "Cassandra way" to solve this would be using a dedicated table to keep all unique tags for example.

Roiocam

LGTM, only one style suggestion

core/src/main/scala/org/apache/pekko/persistence/cassandra/reconciler/AllTags.scala

…onciler/AllTags.scala Co-authored-by: AndyChen(Jingzhang) <[email protected]>

replace deprecated statefulMapConcat

00b6d21

pjfanning marked this pull request as draft February 17, 2024 14:07

pjfanning added 3 commits February 17, 2024 15:13

Update AllTags.scala

3ccb13c

Update AllTags.scala

4698d9f

another impl

95bf7d2

pjfanning mentioned this pull request Feb 17, 2024

[DRAFT] try to get cassandra to get distinct tags #134

Closed

pjfanning changed the title ~~[DRAFT] replace deprecated statefulMapConcat~~ replace deprecated statefulMapConcat Feb 22, 2024

pjfanning requested review from raboof, gmethvin, He-Pin, nvollmar, mdedetrich, samueleresca and Roiocam February 22, 2024 17:20

nvollmar approved these changes Feb 23, 2024

View reviewed changes

Roiocam reviewed Feb 23, 2024

View reviewed changes

core/src/main/scala/org/apache/pekko/persistence/cassandra/reconciler/AllTags.scala Outdated Show resolved Hide resolved

pjfanning added 2 commits February 23, 2024 13:11

use mutable Set for seen to avoid allocating extra Set instances wh…

1a33143

…en adding

scala 2.12 compile issue

d0e6b35

pjfanning marked this pull request as ready for review February 23, 2024 12:38

refactor

26b5543

Roiocam approved these changes Feb 24, 2024

View reviewed changes

core/src/main/scala/org/apache/pekko/persistence/cassandra/reconciler/AllTags.scala Outdated Show resolved Hide resolved

Update core/src/main/scala/org/apache/pekko/persistence/cassandra/rec…

e147754

…onciler/AllTags.scala Co-authored-by: AndyChen(Jingzhang) <[email protected]>

nvollmar approved these changes Feb 24, 2024

View reviewed changes

pjfanning merged commit 90d80ad into apache:main Feb 24, 2024
13 checks passed

pjfanning deleted the replace-statefulmap branch February 24, 2024 16:03

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

replace deprecated statefulMapConcat #133

replace deprecated statefulMapConcat #133

pjfanning commented Feb 17, 2024 •

edited

Loading

nvollmar commented Feb 23, 2024

nvollmar left a comment

pjfanning commented Feb 23, 2024

nvollmar commented Feb 23, 2024

Roiocam left a comment

replace deprecated statefulMapConcat #133

replace deprecated statefulMapConcat #133

Conversation

pjfanning commented Feb 17, 2024 • edited Loading

nvollmar commented Feb 23, 2024

nvollmar left a comment

Choose a reason for hiding this comment

pjfanning commented Feb 23, 2024

nvollmar commented Feb 23, 2024

Roiocam left a comment

Choose a reason for hiding this comment

pjfanning commented Feb 17, 2024 •

edited

Loading