Add backup and restore for bootstrapping #75

theganyo · 2017-04-22T01:05:27Z

Submitted for feedback
Untested. Don't merge.

gbrail

This seems like a good approach -- do a backup on a schedule and POST it to an endpoint, which then would take care of the rest. Other than one thing I saw in the code I have a few ideas.

First, I see that the changeservers do the backup on a schedule. What if the backup attempt fails? Can each one log the error and try again in less time using an exponential backoff? That way we might survive a temporary glitch better.

Same on a boot of a new changeserver -- if it can't get the backup, what does it do?

I also see that the backup server is trying to manage storage by having a bunch of separate files. GCP isn't like a regular filesystem -- you can overwrite the same "file" and the whole thing will either change atomically or fail. So maybe we only just need one GCS "file" that we keep overwriting.
(Also maybe we should checksum the file first, so we can skip the GCS update if nothing changed since the last backup. Otherwise we keep paying money to overwrite the file over and over. I think that GCS has a way to do this easily.)

Finally, I'm not sure I like the idea of generating a "secret" and putting it in a parameter. At the very least we'll need a way to rotate it. I'm not sure what else to do though -- we'll have to think about it. (One thing we can do is use GCP IAM for this.)

I think this is a clever usage of Cloud Functions. Is there some way that we can unit-test that code using regular node?

gbrail · 2017-04-25T17:44:41Z

bootstrap/bootstrap.go

+	if err != nil {
+		return err, false
+	}
+	defer req.Body.Close()


Do you mean "res.body.Close()" here?

Probably. :) Thanks.

theganyo · 2017-04-26T00:21:09Z

Yeah, I intentionally just allowed the startup to fail if was told to restore, but can't. It could to a retry... or some external process could do a retry. As for the backup, yes, as backoff would definitely make sense.

What you see with the backup server isn't it using different files for different versions... it will actually overwrite the same file as you suggest. The ability to have multiple file IDs is to allow more than one cluster of change server (ie. with different tenant information) to backup to the same bucket. But, yes, a checksum could potentially be used to avoid rewriting the file... or perhaps the change servers could realize that they have nothing new to backup and not even try.

Finally, I agree the "secret" thing isn't a fantastic solution. It was just a hack to enable some kind of basic level of authz. I'm not sure of the correct solution here.

gbrail · 2017-04-27T03:54:16Z

Still playing with an automated presubmit check. It failed because this branch is based on an older master -- not a real failure yet.

gbrail · 2017-05-05T00:18:59Z

Scott -- I haven't merged this because it says "not ready yet." Do we have a plan to get it ready?

theganyo · 2017-05-05T00:35:32Z

It still needs unit tests at a minimum. Also probably needs a more thoughtful approach to authz. There is no plan to finish this at present.

theganyo added 4 commits April 21, 2017 17:42

transicator bootstrap cloud scripts

bcb551a

transicator bootstrap

1765ad0

added bootstrap and backup logic

44a3b25

conditionally start backup goroutine

ade7b19

gbrail reviewed Apr 26, 2017

View reviewed changes

gbrail added the kokoro:run label Apr 26, 2017

address review comment - close res.Body

f098a8f

kokoro-team removed the kokoro:run label Apr 27, 2017

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add backup and restore for bootstrapping #75

Add backup and restore for bootstrapping #75

theganyo commented Apr 22, 2017 •

edited

Loading

gbrail left a comment

gbrail Apr 25, 2017

theganyo Apr 26, 2017

theganyo commented Apr 26, 2017

gbrail commented Apr 27, 2017

gbrail commented May 5, 2017

theganyo commented May 5, 2017

Add backup and restore for bootstrapping #75

Are you sure you want to change the base?

Add backup and restore for bootstrapping #75

Conversation

theganyo commented Apr 22, 2017 • edited Loading

gbrail left a comment

Choose a reason for hiding this comment

gbrail Apr 25, 2017

Choose a reason for hiding this comment

theganyo Apr 26, 2017

Choose a reason for hiding this comment

theganyo commented Apr 26, 2017

gbrail commented Apr 27, 2017

gbrail commented May 5, 2017

theganyo commented May 5, 2017

theganyo commented Apr 22, 2017 •

edited

Loading