Skip to content

How scaling by requests works? #3392

Answered by efekarakus
coopsmoss asked this question in Q&A
Discussion options

You must be logged in to vote

I interpret that to mean simultaneous requests per instance

Apologies for the confusing verbiage, it represents the average number of requests received by each task. So for example, if you specify requests: 5 that means you expect every minute for an ECS task to handle on average 5 requests.

I just don't want to set it too low and then have my application scale to the max right away. Is it 10,000 per minute/hour/etc?

It is per minute! So the load balancer will track on average how many requests per minute an ECS task processes and ECS will autoscale to maintain your specified number. Setting it to a low number like 5 is totally reasonable if your responses take on average 12s for exam…

Replies: 3 comments 2 replies

Comment options

You must be logged in to vote
0 replies
Answer selected by coopsmoss
Comment options

You must be logged in to vote
0 replies
Comment options

You must be logged in to vote
2 replies
@efekarakus
Comment options

@tdsticks
Comment options

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
3 participants