How do we scale runner independently of the api server without Yatai? #3303
-
In the docs, it is mentioned that runners are
and that
Based on the above and the architecture diagram, it seems possible to scale the api servers and runners separately in different pods. I am confused about how they communicate and how data is being transferred between them. I understand that the recommended way is using Yatai, but since it requires additional resources and that we already have a horizontal scaling solution in place, I was wondering if there is a more direct way of defining the service and runner resources. |
Beta Was this translation helpful? Give feedback.
Replies: 1 comment 5 replies
-
as far as I know there is no option to increase runner process cnt but here is one way what you want to do bentoml support below cli
It is possible to deploy in the same way as the archiecture (not unix socket) referring to the method below, runner container and http-server can be distributed separately in separate pod,
|
Beta Was this translation helpful? Give feedback.
as far as I know there is no option to increase runner process cnt
but here is one way what you want to do
bentoml support below cli
bentoml start-runner-server
bentoml start-http-server
bentoml start-grpc-server
It is possible to deploy in the same way as the archiecture (not unix socket)
referring to the method below, runner container and http-server can be distributed separately in separate pod,
and runner container can be scaled up