CI AWS account: write scripts to clean used resources #26711

mtojek · 2021-07-05T12:48:18Z

The idea of this issue is to enable some scripting to remove/clean old resources that have been used during tests. We can't always trust Terraform that it will remove all resources. The process running "tf" or the entire CI machine may go down and these resources will stay forever.

Possible solutions:

Lambda function which periodically cleans old resources.

Most likely we'll face same problem in elastic/integrations.

cc @jsoriano @kaiyan-sheng

elasticmachine · 2021-07-05T12:48:19Z

Pinging @elastic/integrations (Team:Integrations)

mtojek · 2021-07-05T13:12:52Z

There are few players in the game:

cloud nuke - https://github.com/gruntwork-io/cloud-nuke
auto cleanup - https://github.com/servian/aws-auto-cleanup
awsweeper - https://github.com/jckuester/awsweeper

jsoriano · 2021-07-05T13:12:54Z

Terraform state is archived by the jenkins pipeline. This could be used to discover resources created but not destroyed. Though this would mean to look through all the jobs that may create these scenarios, and won't work for removed jobs.

mtojek · 2021-07-05T13:18:23Z

Yeah, that's actually the reason, why I personally prefer to simplify the logic and just depend on the timestamp (old enough? nuke it please).

I assume we need it for EC2 instances, DynamoDB databases, SQS queues, SNS topics. Is there anything else? Do we create also other resources?

kaiyan-sheng · 2021-07-16T20:56:39Z

Good point!

I assume we need it for EC2 instances, DynamoDB databases, SQS queues, SNS topics. Is there anything else? Do we create also other resources?

S3 bucket also?

kuisathaverat · 2021-07-19T09:32:01Z

The easy way is to tag everything created from the CI, then nuke everything with those tags every daily. If we add the tag CI and another like created-DAY_OF_YEAR we can nuke resources safely a day after they are created.

botelastic · 2022-07-19T10:18:41Z

Hi!
We just realized that we haven't looked into this issue in a while. We're sorry!

We're labeling this issue as Stale to make it hit our filters and make sure we get back to it as soon as possible. In the meantime, it'd be extremely helpful if you could take a look at it as well and confirm its relevance. A simple comment with a nice emoji will be enough :+1.
Thank you for your contribution!

mtojek · 2022-07-19T11:04:28Z

👍

kuisathaverat · 2022-07-19T12:04:36Z

@v1v is this on your radar?

v1v · 2022-07-19T14:03:17Z

IIRC, all the bits and pieces regarding the tagging/labelling was done with:

[ci][terraform] tags with metadata #31355

There is some automation in place to delete all the leftovers, @amannocci can you confirm if the automation is enabled to delete those resources when needed?

beats/Jenkinsfile

Line 46 in 5d4d48c

    
           string(name: 'awsRegion', defaultValue: 'eu-central-1', description: 'Default AWS region to use for testing.')

is the current AWS region

amannocci · 2022-07-19T16:14:09Z

Currently, only EC2 instances are handled by cloud-reaper.
AFAIK we will need to add support for S3, SNS & SQS services.
It should be easy for S3 and a bit less obvious for SNS and SQS services.
Should we add those items in an iteration? @v1v

v1v · 2022-07-19T16:53:05Z

Should we add those items in an iteration? @v1v

Would you mind raising an issue in our project, so we can prioritise it

botelastic · 2023-07-19T17:28:41Z

Hi!
We just realized that we haven't looked into this issue in a while. We're sorry!

We're labeling this issue as Stale to make it hit our filters and make sure we get back to it as soon as possible. In the meantime, it'd be extremely helpful if you could take a look at it as well and confirm its relevance. A simple comment with a nice emoji will be enough :+1.
Thank you for your contribution!

amannocci · 2023-07-25T08:36:21Z

This issue was addressed with internal tooling.

mtojek added the Team:Integrations Label for the Integrations team label Jul 5, 2021

botelastic bot added the Stalled label Jul 19, 2022

botelastic bot removed the Stalled label Jul 19, 2022

mtojek added Team:Cloud-Monitoring Label for the Cloud Monitoring team and removed Team:Integrations Label for the Integrations team labels Jul 19, 2022

botelastic bot added the Stalled label Jul 19, 2023

amannocci closed this as completed Jul 25, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

CI AWS account: write scripts to clean used resources #26711

CI AWS account: write scripts to clean used resources #26711

mtojek commented Jul 5, 2021

elasticmachine commented Jul 5, 2021

mtojek commented Jul 5, 2021

jsoriano commented Jul 5, 2021

mtojek commented Jul 5, 2021

kaiyan-sheng commented Jul 16, 2021

kuisathaverat commented Jul 19, 2021

botelastic bot commented Jul 19, 2022

mtojek commented Jul 19, 2022

kuisathaverat commented Jul 19, 2022

v1v commented Jul 19, 2022

amannocci commented Jul 19, 2022

v1v commented Jul 19, 2022

botelastic bot commented Jul 19, 2023

amannocci commented Jul 25, 2023

CI AWS account: write scripts to clean used resources #26711

CI AWS account: write scripts to clean used resources #26711

Comments

mtojek commented Jul 5, 2021

elasticmachine commented Jul 5, 2021

mtojek commented Jul 5, 2021

jsoriano commented Jul 5, 2021

mtojek commented Jul 5, 2021

kaiyan-sheng commented Jul 16, 2021

kuisathaverat commented Jul 19, 2021

botelastic bot commented Jul 19, 2022

mtojek commented Jul 19, 2022

kuisathaverat commented Jul 19, 2022

v1v commented Jul 19, 2022

amannocci commented Jul 19, 2022

v1v commented Jul 19, 2022

botelastic bot commented Jul 19, 2023

amannocci commented Jul 25, 2023