typhoon

Minimal and free Kubernetes distribution with Terraform

MIT License

Stars
1.9K
Committers
53

Bot releases are hidden (Show)

typhoon - v1.14.0

Published by dghubble over 5 years ago

  • Kubernetes v1.14.0
  • Update Calico from v3.6.0 to v3.6.1
  • Add enable_aggregation option for CNCF conformance (#436)
    • Aggregation is disabled by default to retain our security stance. Extensions should be considered part of the control plane and scrutinized carefully. Favor leaving aggregation disabled.

AWS

  • Add ability to load balance TCP applications (#443)
    • Output the network load balancer ARN as nlb_id
    • Accept a worker_target_groups (ARN) list to which worker instances should be added

Azure

  • Add ability to load balance TCP/UDP applications (#447)
    • Output the load balancer ID as loadbalancer_id
  • Output worker_security_group_name and worker_address_prefix for extending firewall rules (#447)

DigitalOcean

  • Harden internal (node-to-node) firewall rules to align with other platforms (#444)
  • Add ability to load balance TCP applications (#444)
    • Output controller_tag and worker_tag to simplify extending firewall rules

Google Cloud

  • Add ability to load balance TCP/UDP applications (#442)
    • Add worker instances to a target pool, output as worker_target_pool
    • Health check for workers with Ingress controllers. Forward rules don't support differing internal/external ports, but some Ingress controllers support TCP/UDP proxy as a workaround
  • Remove Haswell minimum CPU platform requirement (#439)
    • Google Cloud API implements min_cpu_platform to mean "use exactly this CPU". Revert #405 added in v1.13.4.
    • Fix error creating clusters in new regions without Haswell (e.g. europe-west2) (#438)

Addons

  • Update Prometheus from v2.8.0 to v2.8.1
  • Update Grafana from v6.0.2 to v6.1.1
    • Add dashboard for pods in a workload (deployment/daemonset/statefulset) (#446)
    • Add dashboard for workloads by namespace
typhoon - v1.13.5

Published by dghubble over 5 years ago

  • Kubernetes v1.13.5
  • Resolve in-addr.arpa reverse DNS lookups (PTR) for pod IPv4 addresses (#415)
    • Reverse DNS lookups for service IPv4 addresses unchanged
  • Upgrade Calico from v3.5.2 to v3.6.0 (#430)
    • Change pod IPAM from host-local to calico-ipam. pod_cidr is still divided into /24 subnets per node, but managed as ippools and ipamblocks
  • Suggest updating terraform-provider-ct from v0.3.0 to v0.3.1 (#434)
  • Announce: Fedora Atomic modules will be not be updated beyond Kubernetes v1.13.x (#437)
    • Thank you Project Atomic team and users, please see the deprecation notice

AWS

  • Support terraform-provider-aws v2.0+ (#419)

Bare-Metal

  • Change the default iPXE kernel and initrd download protocol from HTTP to HTTPS (#420)
    • Require an iPXE-enabled network boot environment with support for TLS downloads. PXE clients must chainload to iPXE firmware compiled with DOWNLOAD_PROTO_HTTPS enabled. (action required)
    • Only affects Container Linux and Flatcar Linux install profiles that pull public images (default)
    • Add download_protocol variable. Recognizing boot firmware TLS support is difficult in some environments, set the protocol to "http" for the old behavior (discouraged)

DigitalOcean

  • Fix kubelet hostname-override to set node metadata InternalIP correctly (#424)
    • Uniquely, DigitalOcean does not resolve hostnames to instance private IPs. Kubelet auto-detect mechanisms require the internal IP be set directly.
    • Regressed in v1.12.3 (#337) which aimed to provide friendly hostname-based node names on DigitalOcean

Addons

  • Update Prometheus from v2.7.1 to v2.8.0
    • Refresh rules based on upstreams (#426)
    • Define NetworkPolicy to allow only traffic from the Grafana addon
  • Update Grafana from v6.0.0 to v6.0.2
    • Add liveness and readiness probes
    • Refresh dashboards and organize to stay below ConfigMap size limit (#426)
  • Remove heapster manifests from addons (#427)
    • Heapster addon powers kubectl top (in early Kubernetes, running the addon was expected). Today, there are better monitoring options.
    • kubectl top reliance on a non-core extension means its not in-scope for minimal Kubernetes
    • Look to prior releases if you still wish to apply heapster
typhoon - v1.13.4

Published by dghubble over 5 years ago

  • Kubernetes v1.13.4
  • Update etcd from v3.3.11 to v3.3.12
  • Update Calico from v3.5.0 to v3.5.2
  • Assign priorityClassNames to critical cluster and node components (#406)
    • Inform node out-of-resource eviction and scheduler preemption and ordering
  • Add CoreDNS readiness probe (#410)

Bare-Metal

Google Cloud

  • Support terraform-provider-google v2.0+ (#407)
    • Require terraform-provider-google v1.19+ (action required)
  • Set the minimum CPU platform to Intel Haswell (#405)
    • Haswell or better is available in every zone (no price change)
    • A few zones still default to Sandy/Ivy Bridge (shifts in April 2019)

Addons

  • Modernize Prometheus rules and alerts (#404)
    • Drop extraneous metrics (#397)
    • Add pod name label to metrics discovered via service endpoints
    • Rename kubernetes_namespace label to namespace
  • Modernize Grafana and dashboards, see docs (#403, #404)
    • Upgrade Grafana from v5.4.3 to v6.0.0!
    • Enable Grafana Explore UI as a Viewer (inspect/edit without saving)
  • Update nginx-ingress from v0.22.0 to v0.23.0
    • Raise nginx-ingress liveness/readiness timeout to 5 seconds
    • Remove nginx-ingess default-backend (#401)

Fedora Atomic

  • Build Kubelet system container with buildah. The image is an OCI format and slightly larger.
typhoon - v1.13.3

Published by dghubble over 5 years ago

  • Kubernetes v1.13.3
  • Update etcd from v3.3.10 to v3.3.11
  • Update CoreDNS from v1.3.0 to v1.3.1
    • Switch from the proxy plugin to the faster forward plugin for upsteam resolvers
  • Update Calico from v3.4.0 to v3.5.0
  • Update flannel from v0.10.0 to v0.11.0
  • Reduce pod eviction timeout for deleting pods on unready nodes to 1 minute
    • Respond more quickly to node preemption (previously 5 minutes)
  • Fix automatic worker deletion on shutdown for cloud platforms
    • Lowering Kubelet privileges in #372 dropped a needed node deletion authorization. Scale-in due to manual terraform apply (any cloud), AWS spot termination, or Azure low priority deletion left old nodes registered, requiring manual deletion (kubectl delete node name)

AWS

  • Add ingress_zone_id output with the NLB DNS name's Route53 zone for use in alias records (#380)

Azure

  • Fix azure provider warning, public_ip allocation_method replaces public_ip_address_allocation
    • Require terraform-provider-azurerm v1.21+ (action required)

Addons

  • Update nginx-ingress from v0.21.0 to v0.22.0
  • Update Prometheus from v2.6.0 to v2.7.1
  • Update kube-state-metrics from v1.4.0 to v1.5.0
    • Fix ClusterRole to collect and export PodDisruptionBudget metrics (#383)
  • Update node-exporter from v0.15.2 to v0.17.0
  • Update Grafana from v5.4.2 to v5.4.3
typhoon - v1.13.2

Published by dghubble almost 6 years ago

  • Kubernetes v1.13.2
  • Add ServiceAccounts for kube-apiserver and kube-scheduler (#370)
  • Use lower-privilege TLS client certificates for Kubelets (#372)
  • Use HTTPS liveness probes for kube-scheduler and kube-controller-manager (#377)
  • Update CoreDNS from v1.2.6 to v1.3.0
  • Allow the certificates.k8s.io API to issue certificates signed by the cluster CA (#376)
    • Configure controller manager to sign CSRs that are manually approved by an administrator

AWS

  • Change controller_type and worker_type default from t2.small to t3.small (#365)
    • t3.small is cheaper, provides 2 vCPU (instead of 1), and 5 Gbps of pod-to-pod bandwidth!

Bare-Metal

  • Remove the kubeconfig output variable

Addons

  • Update Prometheus from v2.5.0 to v2.6.0
typhoon - v1.13.1

Published by dghubble almost 6 years ago

  • Kubernetes v1.13.1
  • Update Calico from v3.3.2 to v3.4.0 (#362)
    • Install CNI plugins with an init container rather than a sidecar
    • Improve the calico-node ClusterRole
  • Recommend updating terraform-provider-ct plugin from v0.2.1 to v0.3.0 (#363)
    • Migration instructions for upgrading terraform-provider-ct in-place for v1.12.2+ clusters (action required)
    • Require switching from ~/.terraformrc to the Terraform third-party plugins directory ~/.terraform.d/plugins/
    • Require Container Linux 1688.5.3 or newer

Google Cloud

  • Increase TCP proxy apiserver backend service timeout from 1 minute to 5 minutes (#361)
    • Align port-forward behavior closer to AWS/Azure (no timeout)

Addons

  • Update Grafana from v5.4.0 to v5.4.2
typhoon - v1.13.0

Published by dghubble almost 6 years ago

Addons

  • Update Grafana from v5.3.4 to v5.4.0
  • Disable Grafana login form, since admin user can't be disabled (#352)
    • Example manifests aim to provide a read-only dashboard view
typhoon - v1.12.3

Published by dghubble almost 6 years ago

  • Kubernetes v1.12.3
  • Add enable_reporting variable (default "false") to provide upstreams with usage data (#345)
  • Change kube-apiserver --kubelet-preferred-address-types to InternalIP,ExternalIP,Hostname
  • Update Calico from v3.3.0 to v3.3.1
    • Disable Felix usage reporting by default (#345)
  • Improve flannel manifests
    • Rename kube-flannel DaemonSet to flannel and kube-flannel-cfg ConfigMap to flannel-config
    • Drop unused mounts and add a CPU resource request
  • Update CoreDNS from v1.2.4 to v1.2.6
    • Enable CoreDNS loop and loadbalance plugins (#340)
  • Fix pod-checkpointer log noise and checkpointable pods detection (#346)
  • Use kubernetes-incubator/bootkube v0.14.0
  • Recommend switching from ~/.terraformrc to the Terraform third-party plugins directory ~/.terraform.d/plugins/.
    • Allows pinning terraform-provider-ct and terraform-provider-matchbox versions
    • Improves safety of later plugin version migrations

Azure

  • Use eviction policy Delete for Low priority virtual machine scale set workers (#343)
    • Fix issue where Azure defaults to Deallocate eviction policy, which required manually restarting deallocated instances. Delete policy aligns Azure with AWS and GCP behavior.
    • Require terraform-provider-azurerm v1.19+ (action required)

Bare-Metal

  • Add Kubelet /etc/iscsi and iscsadm mounts on bare-metal for iSCSI (#103)

Addons

  • Update nginx-ingress from v0.20.0 to v0.21.0
  • Update Prometheus from v2.4.3 to v2.5.0
  • Update Grafana from v5.3.2 to v5.3.4
typhoon - v1.12.2

Published by dghubble almost 6 years ago

v1.12.2

  • Kubernetes v1.12.2
  • Update CoreDNS from 1.2.2 to 1.2.4
  • Update Calico from v3.2.3 to v3.3.0
  • Disable Kubelet read-only port (#324)
  • Fix CoreDNS AntiAffinity spec to prefer spreading replicas
  • Ignore controller node user-data changes (#335)
    • Once all managed clusters use v1.12.2, it is possible to update terraform-provider-ct

AWS

  • Add disk_iops variable for EBS volume IOPS (#314)

Azure

  • Use new azurerm_network_interface_backend_address_pool_association (#332)
    • Require terraform-provider-azurerm v1.17+ (action required)
  • Add primary field to ip_configuration needed by v1.17+ (#331)

DigitalOcean

  • Add AAAA DNS records resolving to worker nodes (#333)
    • Hosting IPv6 apps requires editing nginx-ingress with hostNetwork: true

Google Cloud

  • Add an IPv6 address and IPv6 forwarding rules for load balancing IPv6 Ingress (#334)
    • Add ingress_static_ipv6 output variable for use in AAAA DNS records
    • Allow serving IPv6 applications via Kubernetes Ingress

Addons

  • Configure Heapster to scrape Kubelets with bearer token auth (#323)
  • Update Grafana from v5.3.1 to v5.3.2
typhoon - v1.12.1

Published by dghubble about 6 years ago

  • Kubernetes v1.12.1
  • Update etcd from v3.3.9 to v3.3.10
  • Update CoreDNS from 1.1.3 to 1.2.2
  • Update Calico from v3.2.1 to v3.2.3
  • Raise scheduler and controller-manager replicas to the larger of 2 or number of controller nodes (#312)
    • Single-controller clusters continue to run 2 replicas as before
  • Raise default CoreDNS replicas to the larger of 2 or the number of controller nodes (#313)
    • Add AntiAffinity preferred rule to favor spreading CoreDNS pods
  • Annotate control plane and addon containers to use the Docker runtime seccomp profile (#319)
    • Override Kubernetes default behavior that starts containers with seccomp=unconfined

Azure

  • Remove admin_password field (disabled) since it is now optional
    • Require terraform-provider-azurerm v1.16+ (action required)

Bare-Metal

  • Add support for cached_install mode with Flatcar Linux (#315)

DigitalOcean

  • Require terraform-provider-digitalocean v1.0+ (action required)

Addons

  • Update nginx-ingress from v0.19.0 to v0.20.0
  • Update Prometheus from v2.3.2 to v2.4.3
  • Update Grafana from v5.2.4 to v5.3.1
typhoon - v1.11.3

Published by dghubble about 6 years ago

  • Kubernetes v1.11.3
  • Introduce Typhoon for Azure as alpha (#288)
    • Special thanks @justaugustus for an earlier variant
  • Update Calico from v3.1.3 to v3.2.1 (#278)

AWS

  • Remove firewall rule allowing ICMP packets to nodes (#285)

Bare-Metal

  • Remove controller_networkds and worker_networkds variables. Use Container Linux Config snippets #277

Google Cloud

  • Fix firewall to allow etcd client port 2379 traffic between controller nodes (#287)
    • kube-apiservers were only able to connect to their node's local etcd peer. While master node outages were tolerated, reaching a healthy peer took longer than necessary in some cases
    • Reduce time needed to bootstrap the cluster
  • Remove firewall rule allowing workers to access Nginx Ingress health check (#284)
    • Nginx Ingress addon no longer uses hostNetwork, Prometheus scrapes via CNI network

Addons

  • Update nginx-ingress from 0.17.1 to 0.19.0
  • Update kube-state-metrics from v1.3.1 to v1.4.0
  • Update Grafana from 5.2.2 to 5.2.4
typhoon - v1.11.2

Published by dghubble about 6 years ago

  • Kubernetes v1.11.2
  • Update etcd from v3.3.8 to v3.3.9
  • Use kubernetes-incubator/bootkube v0.13.0
  • Fix Fedora Atomic modules' Kubelet version (#270)

Bare-Metal

  • Introduce Container Linux Config snippets on bare-metal
    • Validate and additively merge custom Container Linux Configs during terraform plan
    • Define files, systemd units, dropins, networkd configs, mounts, users, and more
    • Require terraform-provider-ct plugin v0.2.1 (action required!)

Addons

  • Update nginx-ingress from 0.16.2 to 0.17.1
  • Add nginx-ingress manifests for bare-metal
  • Update Grafana from 5.2.1 to 5.2.2
  • Update heapster from v1.5.3 to v1.5.4
typhoon - v1.11.1

Published by dghubble about 6 years ago

  • Kubernetes v1.11.1
    • Defaults now enable the pod Priority admission controller

Addons

  • Update Prometheus from v2.3.1 to v2.3.2

Errata

  • Fedora Atomic modules shipped with Kubelet v1.11.0, instead of v1.11.1. Fixed in #270.
typhoon - v1.11.0

Published by dghubble over 6 years ago

  • Kubernetes v1.11.0
  • Force apiserver to stop listening on 127.0.0.1:8080
  • Replace kube-dns with CoreDNS (#261)
    • Edit the coredns ConfigMap to customize
    • CoreDNS doesn't use a resizer. For large clusters, scaling may be required.

AWS

  • Update from Fedora Atomic 27 to 28 (#258)

Bare-Metal

  • Update from Fedora Atomic 27 to 28 (#263)

Google

  • Promote Google Cloud to stable
  • Update from Fedora Atomic 27 to 28 (#259)
  • Remove ingress_static_ip module output. Use ingress_static_ipv4.
  • Remove controllers_ipv4_public module output.

Addons

  • Update nginx-ingress from 0.15.0 to 0.16.2
  • Update Grafana from 5.1.4 to 5.2.1
  • Update heapster from v1.5.2 to v1.5.3
typhoon - v1.10.5

Published by dghubble over 6 years ago

AWS

  • Switch kube-apiserver port from 443 to 6443 (#248)
  • Combine apiserver and ingress NLBs (#249)
    • Reduce cost by ~$18/month per cluster. Typhoon AWS clusters now use one network load balancer.
    • Ingress addon users may keep using CNAME records to the ingress_dns_name module output (few million RPS)
    • Ingress users with heavy traffic (many million RPS) should create a separate NLB(s)
  • Worker pools no longer include an extraneous load balancer. Remove worker module's ingress_dns_name output
  • Disable detailed (paid) monitoring on worker nodes (#251)
    • Favor Prometheus for cloud-agnostic metrics, aggregation, and alerting
  • Add worker_target_group_http and worker_target_group_https module outputs to allow custom load balancing
  • Add target_group_http and target_group_https worker module outputs to allow custom load balancing

Bare-Metal

  • Switch kube-apiserver port from 443 to 6443 (#248)
    • Users who exposed kube-apiserver on a WAN via their router/load-balancer will need to adjust its configuration (e.g. DNAT 6443). Most apiservers are on a LAN (internal, VPN-only, etc) so if you didn't specially configure network gear for 443, no change is needed. (possible action required)
  • Fix possible deadlock when provisioning clusters larger than 10 nodes (#244)

DigitalOcean

  • Switch kube-apiserver port from 443 to 6443 (#248)
    • Update firewall rules and generated kubeconfig's

Google Cloud

  • Use global HTTP and TCP proxy load balancing for Kubernetes Ingress (#252)
    • Switch Ingress from regional network load balancers to global HTTP/TCP Proxy load balancing
    • Reduce cost by ~$19/month per cluster. Google bills the first 5 global and regional forwarding rules separately. Typhoon clusters now use 3 global and 0 regional forwarding rules.
  • Worker pools no longer include an extraneous load balancer. Remove worker module's ingress_static_ip output
  • Allow using nginx-ingress addon on Fedora Atomic clusters (#200)
  • Add worker_instance_group module output to allow custom global load balancing
  • Add instance_group worker module output to allow custom global load balancing
  • Deprecate ingress_static_ip module output. Add ingress_static_ipv4 module output instead.
  • Deprecate controllers_ipv4_public module output

Addons

  • Update CLUO from v0.6.0 to v0.7.0 (#242)
  • Update Prometheus from v2.3.0 to v2.3.1
  • Update Grafana from 5.1.3 to 5.1.4
  • Drop hostNetwork from nginx-ingress addon
    • Both flannel and Calico support host port via portmap
    • Allows writing NetworkPolicies that reference ingress pods in from or to. HostNetwork pods were difficult to write network policy for since they could circumvent the CNI network to communicate with pods on the same node.
typhoon - v1.10.4

Published by dghubble over 6 years ago

  • Kubernetes v1.10.4
  • Update etcd from v3.3.5 to v3.3.6
  • Update Calico from v3.1.2 to v3.1.3

Addons

  • Update Prometheus from v2.2.1 to v2.3.0
  • Add Prometheus liveness and readiness probes
  • Annotate Grafana service so Prometheus scrapes metrics
  • Label namespaces to ease writing Network Policies
typhoon - v1.10.3

Published by dghubble over 6 years ago

  • Kubernetes v1.10.3
  • Add Flatcar Linux (Container Linux derivative) as an option for AWS and bare-metal (thanks @kinvolk folks)
  • Allow bearer token authentication to the Kubelet (#216)
    • Require Webhook authorization to the Kubelet
    • Switch apiserver X509 client cert org to satisfy new authorization requirement
  • Require Terraform v0.11.x and drop support for v0.10.x (migration guide)
  • Update etcd from v3.3.4 to v3.3.5 (#213)
  • Update Calico from v3.1.1 to v3.1.2

AWS

  • Allow Flatcar Linux by setting os_image to flatcar-stable (default), flatcar-beta, flatcar-alpha (#211)
  • Replace os_channel variable with os_image to align naming across clouds
    • Please change values stable, beta, or alpha to coreos-stable, coreos-beta, coreos-alpha (action required!)
  • Allow preemptible workers via spot instances (#202)
    • Add worker_price to allow worker spot instances. Default to empty string for the worker autoscaling group to use regular on-demand instances
    • Add spot_price to internal workers module for spot worker pools

Bare-Metal

  • Allow Flatcar Linux by setting os_channel to flatcar-stable, flatcar-beta, flatcar-alpha (#220)
  • Replace container_linux_channel variable with os_channel
    • Please change values stable, beta, or alpha to coreos-stable, coreos-beta, coreos-alpha (action required!)
  • Replace container_linux_version variable with os_version
  • Add network_ip_autodetection_method variable for Calico host IPv4 address detection
    • Use Calico's default "first-found" to support single NIC and bonded NIC nodes
    • Allow alternative methods for multi NIC nodes, like can-reach=IP or interface=REGEX
  • Deprecate container_linux_oem variable

DigitalOcean

  • Update Fedora Atomic module to use Fedora Atomic 28 (#225)
    • Fedora Atomic 27 images disappeared from DigitalOcean and forced this early update

Addons

  • Fix Prometheus data directory location (#203)
  • Configure Prometheus to scrape Kubelets directly with bearer token auth instead of proxying through the apiserver (#217)
    • Security improvement: Drop RBAC permission from nodes/proxy to nodes/metrics
    • Scale: Remove per-node proxied scrape load from the apiserver
  • Update Grafana from v5.04 to v5.1.3 (#208)
    • Disable Grafana Google Analytics by default (#214)
  • Update nginx-ingress from 0.14.0 to 0.15.0
  • Annotate nginx-ingress service so Prometheus auto-discovers and scrapes service endpoints (#222)
typhoon - v1.10.2

Published by dghubble over 6 years ago

Google Cloud

  • Add support for multi-controller clusters (i.e. multi-master) (#54, #190)
    • Switch from Google Cloud network load balancer to a TCP proxy load balancer. Avoid Google network load balancer bug that limited clusters to only bootstrapping one controller node.
    • Add TCP health check for apiserver pods on controllers. Replace kubelet check approximation.

Addons

  • Update nginx-ingress from 0.12.0 to 0.14.0
  • Update kube-state-metrics from v1.3.0 to v1.3.1
typhoon - v1.10.1

Published by dghubble over 6 years ago

  • Kubernetes v1.10.1
  • Enable etcd v3.3 metrics endpoint (#175)
  • Use k8s.gcr.io instead of gcr.io/google_containers (#180)
    • Kubernetes recommends using the alias to pull from the nearest regional mirror and to abstract the backing container registry
  • Update kube-dns from v1.14.8 to v1.14.9
  • Update etcd from v3.3.2 to v3.3.3
  • Use kubernetes-incubator/bootkube v0.12.0

Bare-Metal

  • Fix need for multiple terraform apply runs to create a cluster with Terraform v0.11.4 (#181)
    • To SSH during a disk install for debugging, SSH as user "core" with port 2222
    • Remove the old trick of using a user "debug" during disk install

Google Cloud

  • Refactor out the controller internal module

Addons

  • Add Prometheus discovery for etcd peers on controller nodes (#175)
    • Scrape etcd v3.3 --listen-metrics-urls for metrics
    • Enable etcd alerts and populate the etcd Grafana dashboard
  • Update kube-state-metrics from v1.2.0 to v1.3.0
typhoon - v1.10.0

Published by dghubble over 6 years ago

  • Kubernetes v1.10.0
  • Remove unused, unmaintained pxe-worker internal module

AWS

  • Add disk_type optional variable for setting the EBS volume type (#176)
    • Change default type from standard to gp2. Prometheus etcd alerts are tuned for fast disks.

Digital Ocean

  • Ensure etcd secrets are only distributed to controller hosts, not workers.
  • Remove networking optional variable. Only flannel works on Digital Ocean.

Google Cloud

  • Add disk_size optional variable for setting instance disk size in GB
  • Add controller_type optional variable for setting machine type for controllers
  • Add worker_type optional variable for setting machine type for workers
  • Remove machine_type optional variable. Use controller_type and worker_type.

Addons

  • Update Grafana from v4.6.3 to v5.0.4 (#153, #174)
    • Restrict dashboard organization role to Viewer