Answer the question
In order to leave comments, you need to log in
How to select computing resources for scale-out cluster nodes?
With horizontal scaling, we add more nodes to the cluster to handle more load. But as before, we can choose how much power (CPU, RAM) each cluster node will have.
When the cluster is managed by Kubernetes, it is very easy to set a limit on CPU and memory for Pods. How to choose the optimal CPU and memory size for cluster nodes (or Pods in Kubernetes)?
For example, in a cluster there are 3 nodes with 1 vCPU and 1GB RAM each. To process more requests there are 2 options:
Answer the question
In order to leave comments, you need to log in
The answer has to do with performance metrics like latency and throughput .
Latency - the time interval between sending a request and receiving a response.
Throughput - request processing speed (requests per second).
Latency affects throughput. More latency - less throughput.
If a business transaction consists of consecutive calls to several services that cannot be parallelized, then the computing resources for the cluster nodes should be chosen in such a way as to achieve the required latency, since adding additional service instances (horizontal scaling) will not have an impact on latency.
Adding additional service instances increases throughput (provided there are no bottlenecks).
That is, choose the CPU and RAM in such a way that the response time of the service is optimal, and to process more requests at the same time, add new service instances.
Didn't find what you were looking for?
Ask your questionAsk a Question
731 491 924 answers to any question