Kafka+SQL improve bootstrapping by one-time replication of state between replicas? #1074

Closed
EricWittmann opened this issue Dec 8, 2020 · 0 comments · Fixed by #4665
Labels
type/enhancement New feature or request

Comments

@EricWittmann
Member

A potentially significant problem with the Kafka+SQL storage implementation is the bootstrapping time for a replica. When a replica starts up, it must consume every message in the Kafka topic. We use log compaction to limit the number of messages on the topic, but the number of messages will still grow as the registry grows.

We should explore pulling the current state of the in-memory database directly from a peer replica (if one exists and can be found). Performing a database-specific dump and sending it over the wire to the new replica may take less time than consuming all of the Kafka messages.

An open question is how the new replica would then skip the Kafka messages already reflected in the dump, so that it rejoins normal message consumption at the right point.
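
One possible approach, sketched here under assumptions rather than taken from the actual implementation, is for the peer to record the next offset to consume alongside the dump, so the new replica restores the state and resumes consumption from that offset. The `Replica` class, its methods, and the list-based `log` below are all hypothetical stand-ins: the list simulates a single-partition topic, and no real Kafka client or database is involved.

```python
from dataclasses import dataclass, field


@dataclass
class Replica:
    """Hypothetical registry replica: key/value state built from a log."""
    state: dict = field(default_factory=dict)
    next_offset: int = 0  # offset of the next log message to apply

    def consume(self, log):
        """Apply every message from next_offset to the end of the log."""
        applied = 0
        for offset in range(self.next_offset, len(log)):
            key, value = log[offset]
            if value is None:
                self.state.pop(key, None)  # a None value acts as a tombstone
            else:
                self.state[key] = value
            self.next_offset = offset + 1
            applied += 1
        return applied

    def snapshot(self):
        """Dump the state together with the offset it corresponds to."""
        return {"state": dict(self.state), "next_offset": self.next_offset}

    @classmethod
    def from_snapshot(cls, snap):
        """Restore a new replica from a peer's snapshot."""
        return cls(state=dict(snap["state"]), next_offset=snap["next_offset"])


# Simulated single-partition topic: list index stands in for the offset.
log = [("a", 1), ("b", 2), ("a", 3), ("b", None), ("c", 4)]

peer = Replica()
peer.consume(log)
snap = peer.snapshot()

# More messages arrive after the snapshot was taken.
log.append(("d", 5))
log.append(("a", 6))

# The new replica skips everything the snapshot already covers.
new_replica = Replica.from_snapshot(snap)
applied = new_replica.consume(log)

peer.consume(log)
```

In this sketch the new replica applies only the two messages written after the snapshot and ends up with the same state as the peer. With the real Kafka Java client, the resume step would presumably map to `KafkaConsumer.seek` on each partition; the dump would have to capture the state and the offsets consistently, which this sketch does not address.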

@EricWittmann added the type/enhancement (New feature or request) label Dec 8, 2020
@carlesarnal linked a pull request May 15, 2024 that will close this issue
Projects
Status: Done