Ara Pulido - A Tale of Tail Latency: Understanding k18n CPU Requests and Limits for Sust. and Profit

Ara Pulido - A Tale of Tail Latency: Understanding k18n CPU Requests and Limits for Sust. and Profit

58 Lượt nghe
Ara Pulido - A Tale of Tail Latency: Understanding k18n CPU Requests and Limits for Sust. and Profit
When deploying an application to Kubernetes, each container in a pod should define CPU requests and limits. It is more commonly understood how CPU requests affect the scheduling of your pod and the future pods in the same node. But outside scheduling, CPU requests and limits have some effects on how your containers are created and can heavily impact their performance and their energy footprint. In this talk we will help clarify some misconceptions about CPU requests and limits by explaining, in a developer friendly way, how they translate to some Linux internals. We will offer some quick tips on how to understand those effects, minimise them, and select good values to reduce your application energy footprint while ensuring its performance.