Commit Graph

1376 Commits

Author SHA1 Message Date
bjhaid 88ed9f68bd Add support for configuring Cilium enable-host-reachable-services.
After upgrading Cilium to 1.8 via kops one of our clusters had a total
outage due to cilium reporting errors as below:

```
level=error msg="endpoint regeneration failed" containerID= datapathPolicyRevision=0 desiredPolicyRevision=1 endpointID=592 error="Failed to load tc filter: exit status 1" identity=40147 ipv4= ipv6= k8sPodName=/ subsys=endpoint
```

upon searching Cilium slack we found the below thread:

https://cilium.slack.com/archives/C1MATJ5U5/p1616400216167600

which recommended setting `enable-host-reachable-services` to true will
address the problems. We set the field and it fixed our issues too,
however we observed that kops does not have a means to configure this
hence this PR.

We will like to have this backported after it has been merged.
2021-04-27 12:49:27 -05:00
Kubernetes Prow Robot 17e46e5a2c
Merge pull request #11322 from johngmyers/warmpool-cluster
Add cluster-level warmPool settings
2021-04-27 02:09:38 -07:00
Kubernetes Prow Robot d90ace058b
Merge pull request #11304 from dntosas/update-aws-csi-ebs
[csi/aws] Bump templates + add support for warm pools
2021-04-26 02:37:01 -07:00
John Gardiner Myers 428041bc0f Add cluster-level warmPool settings 2021-04-25 20:22:04 -07:00
John Gardiner Myers 5ad32230bb Fix typo 2021-04-25 13:42:12 -07:00
Kubernetes Prow Robot fb39ddd0e7
Merge pull request #11316 from johngmyers/warm-enabled
Allow disabling warm pool by setting WarmPool.MaxSize to 0
2021-04-25 09:05:00 -07:00
dntosas af6d4d585f
[csi/aws] Bump templates to latest stable version …
- Update manifests
- Bump components version
- Add API capability of setting Version + VolumeLimit
- Remove snapshot-controller resources as it should be independent from
any CSI driver

Signed-off-by: dntosas <ntosas@gmail.com>
2021-04-25 16:45:15 +03:00
Ciprian Hacman 55e154a526 Update Ubuntu images to latest version 2021-04-25 09:24:20 +03:00
John Gardiner Myers 1948018d1f Disallow negative warmpool sizes 2021-04-24 16:44:36 -07:00
John Gardiner Myers 044b5f6d0d Allow disabling warm pool by setting WarmPool.MaxSize to 0 2021-04-24 16:35:46 -07:00
Kubernetes Prow Robot b0664176bc
Merge pull request #11259 from olemarkus/warm-life-cycle-hook
Make nodeup able to complete the warming life cycle hook
2021-04-24 02:05:15 -07:00
Kubernetes Prow Robot bc20262a47
Merge pull request #11281 from javipolo/cert-manager-default-issuer
Add ability to set a default Issuer in certManager addon
2021-04-24 01:21:14 -07:00
Ole Markus With 1ec0bd18e8 Enable support for the ASG WarmPool lifecycle hook
Update pkg/model/iam/iam_builder.go

Co-authored-by: Ciprian Hacman <ciprianhacman@gmail.com>
2021-04-24 09:40:52 +02:00
Kubernetes Prow Robot 2649cbc598
Merge pull request #10995 from haugenj/release-1.19
Add NTH Queue Processor Mode
2021-04-22 12:15:58 -07:00
Jason Haugen 36722afb0f change casing Asg->ASG 2021-04-22 13:07:01 -05:00
Javi Polo bf20c6a4b7 Add ability to set a default Issuer in certManager addon 2021-04-21 22:39:25 +02:00
Kubernetes Prow Robot 9c46f5046a
Merge pull request #11235 from olemarkus/warm-provision
Make it possible to enable/configure warm pool
2021-04-20 21:44:10 -07:00
Kenji Kaneda 221568a7a6 Add Azure image to alpha/stable channel
Remove the default image hard-coded in populate_instancegroup_spec.go.
2021-04-20 09:47:03 -07:00
Ole Markus With 020652e096 Add ability to enable/configure warm pool for ASG
Apply suggestions from code review

Co-authored-by: John Gardiner Myers <jgmyers@proofpoint.com>

Apply suggestions from code review

Co-authored-by: John Gardiner Myers <jgmyers@proofpoint.com>
2021-04-20 09:02:09 +02:00
Jason Haugen 7e48dad4d2 add ManagedAsgTag, merge templates, improve docs 2021-04-19 16:51:08 -05:00
Jason Haugen d07b067249 Add NTH queue-processor mode 2021-04-19 15:43:05 -05:00
Kubernetes Prow Robot 146f9b4cbb
Merge pull request #11248 from dntosas/cilium-resources
[cilium] Add support for choosing resources
2021-04-19 10:23:45 -07:00
dntosas 55524935d7
[cilium] Add support for choosing resources
Cilium as a CNI is a critical component for the cluster so it would be safe
to have some guaranteed resources as well as allowing the users to
define them based on their needs.

In this commit, we init default requested resources and add the
capability of user-defined values.

Signed-off-by: dntosas <ntosas@gmail.com>
2021-04-19 11:30:02 +03:00
Kubernetes Prow Robot c39f7d652c
Merge pull request #11241 from johngmyers/remove-unused
Remove unused constants
2021-04-15 21:58:21 -07:00
Guangwen Feng f67c9daa96 Fix golint issue caused by typo
Signed-off-by: Guangwen Feng <fenggw-fnst@cn.fujitsu.com>
2021-04-16 08:41:57 +08:00
John Gardiner Myers 46834dcdf7 Remove unused constants 2021-04-15 11:35:45 -07:00
AkiraFukushima 1b0117a088
Remove validations for EBS from cluster validation 2021-04-15 01:41:42 +09:00
Kenji Kaneda baff30d66e Add an option to skip NTP installation
Add NTPConfig to ClusterSpec. NTPConfig has the SkipInstall option.

https://github.com/kubernetes/kops/issues/9661
2021-03-31 12:33:32 -07:00
Kubernetes Prow Robot a01ee1d0fe
Merge pull request #11140 from jurriaanpro/cluster-autoscaler-add-scale-down-delay-after-add
Add scaleDownDelayAfterAdd to clusterAutoscaler spec
2021-03-25 05:57:29 -07:00
jurriaanpro 25955ceb6b Add scaleDownDelayAfterAdd to clusterAutoscaler spec 2021-03-25 11:23:48 +01:00
Ole Markus With b1d106e375 Validate that kube-apiserver has the necessary authz modes set 2021-03-24 10:42:06 +01:00
Justin SB c75e084158 Re-add integration tests for jwks
We removed them from #10756, but they can be re-added.
2021-03-20 22:55:11 -04:00
Kubernetes Prow Robot 15e4028c81
Merge pull request #10722 from olemarkus/apiserver-nodes
Apiserver nodes
2021-03-20 16:43:42 -07:00
Ole Markus With 20bd724f5e Add support for scaling out the control plane with dedicated apiserver nodes
Ensure apiserver role can only be used on AWS (because of firewalling)

Apply api-server label to CP as well

Consolidate node not ready validation message

Guard apiserver nodes with a feature flag

Rename Apiserver role to APIServer

Add an integration test for apiserver nodes

Rename Apiserver role to APIServer

Enumerate all roles in rolling update docs

Apply suggestions from code review

Co-authored-by: Steven E. Harris <seh@panix.com>
2021-03-20 20:57:00 +01:00
Ole Markus With 6e3199bf86 Ensure a publicdatastore exists for jwks and that it can only be s3 2021-03-20 07:18:39 +01:00
Peter Rifel 7c900b7fae Generate and upload keys.json + discovery.json to public store
Generate and upload keys.json + discovery.json to public store

Don't enable anonymous auth on publicjwks

Remove tests that won't work using FS VFS anymore
2021-03-19 20:03:26 +01:00
Ciprian Hacman 55f8c70779 Add channels entries for image architecture 2021-03-15 20:36:37 +02:00
Bharath Vedartham 059c7ef33a Create an environment file for kops-configuration systemd process 2021-03-13 22:20:49 +05:30
Kubernetes Prow Robot ad7c793050
Merge pull request #10913 from seh/scope-os-update-policy-to-instance-group-too
Honor OS update policy at InstanceGroup level too
2021-03-12 22:03:03 -08:00
Kubernetes Prow Robot d1f17c6c1b
Merge pull request #11027 from hakman/fix_docker_insecure_registries
Fix rendering of multiple Docker insecure registries
2021-03-12 08:10:18 -08:00
Ciprian Hacman 79a0720143 Fix rendering of multiple Docker insecure registries 2021-03-12 16:30:15 +02:00
Ole Markus With 8e2e60ae44 Allow cilium 1.10 2021-03-12 14:17:28 +01:00
Steven E. Harris 44061601da Test default StorageClass management choice 2021-03-09 11:08:33 -05:00
Bharath Vedartham 0c0767c0c9 Remove support for launch configurations 2021-03-09 09:04:15 +02:00
Kubernetes Prow Robot fea7589d97
Merge pull request #10574 from bharath-123/feat/protokube-systemd
Run protokube as a systemd service
2021-03-07 22:03:44 -08:00
Bharath Vedartham 5a72ef0f94 Remove protokube image references 2021-03-06 00:33:12 +05:30
Steven E. Harris 70e95fccf5 Copyedit the InstanceGroup-related documentation 2021-03-05 08:53:08 -05:00
Steven E. Harris e39c985ee7 Honor OS update policy at InstanceGroup level too
As with the Cluster-level "spec.updatePolicy" field, add a similar
field at the InstanceGroup level, allowing overriding of the
cluster-level choice in each InstanceGroup.

Introduce a new value for the field ("automatic") as equivalent to the
default value applied when the field is absent. Honoring this new
value allows disabling automatic updates at the cluster level, but
then enabling them again for particular InstanceGroups. Without such a
positive affirmation, it's not possible to override a cluster-level
"external" policy at the InstanceGroup level, as there's no way to
specify positively that you want to recover the default
value. Instead, expressing the explicit "automatic" value is clear and
unambiguous.
2021-03-05 08:53:07 -05:00
Otto Sulin 8fa6be1b03 Add etcd-manager discoveryPollInterval option 2021-03-04 09:35:05 +02:00
Kubernetes Prow Robot 83e1424280
Merge pull request #10957 from adrianmoisey/add_kubelet_arg_enable_cadvisor_json_endpoints
Add support for enable-cadvisor-json-endpoints with Kubelet
2021-03-02 23:47:19 -08:00