How scaling by requests works? #3392
-
Hi there, I'm just looking for a little bit of clarification regarding scaling based on requests as seen here The docs say "Scale up or down based on the request count handled per tasks."
It has 10,000 which makes me wary that I'm misunderstanding how this operates. I just don't want to set it too low and then have my application scale to the max right away. Is it 10,000 per minute/hour/etc? So just looking for some clarification on that. Thanks a lot in advance. |
Beta Was this translation helpful? Give feedback.
Replies: 3 comments 2 replies
-
Apologies for the confusing verbiage, it represents the average number of requests received by each task. So for example, if you specify
It is per minute! So the load balancer will track on average how many requests per minute an ECS task processes and ECS will autoscale to maintain your specified number. Setting it to a low number like 5 is totally reasonable if your responses take on average 12s for example. |
Beta Was this translation helpful? Give feedback.
-
Thanks for the response, that makes a lot of sense. Cheers. |
Beta Was this translation helpful? Give feedback.
-
Can you also help explain further on cpu_percentage and memory_percentage please? "Scale up or down based on the average CPU/memory your service should maintain". If these are set to a higher percentage, does the service in ECS attempt to keep more tasks running to match the average? I've been looking for further documentation from AWS and I found this document https://docs.aws.amazon.com/autoscaling/ec2/userguide/as-scaling-target-tracking.html, but I it's still a little unclear. |
Beta Was this translation helpful? Give feedback.
Apologies for the confusing verbiage, it represents the average number of requests received by each task. So for example, if you specify
requests: 5
that means you expect every minute for an ECS task to handle on average 5 requests.It is per minute! So the load balancer will track on average how many requests per minute an ECS task processes and ECS will autoscale to maintain your specified number. Setting it to a low number like 5 is totally reasonable if your responses take on average 12s for exam…