Handles rolling upgrades for AWS ASGs on EKS
APACHE-2.0 License
Published by TwiN over 4 years ago
While upgrading EKS from 1.14 to 1.15, I ran into an issue where an in-flight request would hang, and thus the execution would never end. Restarting the pod fixed the issue, and while I'm not exactly sure why it happened, what I do know is that since there's no health check that the liveness probe can perform, adding a timeout to each execution and panicking on failure to respect that execution time limit is a safe way to either allow the pod to recover on its own, or if the issue keeps happening, then at least it will increase visibility.
Published by TwiN over 4 years ago
Published by TwiN over 4 years ago
The application now panics after 10 failed executions in a row.
Published by TwiN over 4 years ago
The MixedInstancePolicy available through the usage of a Launch Template is at the ASG level, meaning that if you modify the instance type overrides to allow the usage of multiple instance types on a single ASG, no new launch template version would be created. This update verifies if the instance's instance type is part of the overrides, and if it isn't, then it means that the ASG has been updated, which means that it's outdated.
Published by TwiN over 4 years ago
Published by TwiN over 4 years ago
Published by TwiN over 4 years ago
Published by TwiN over 4 years ago
Published by TwiN over 4 years ago
Published by TwiN over 4 years ago
Published by TwiN over 4 years ago
First release