Łukasz Osipiuk
17a7bc5164
Ignore NominatedNodeName on Pod if node is gone
...
Change-Id: I4a119f46e55ca2223f9f0fdd3e75ce3f279e293b
2019-11-27 20:26:00 +01:00
Łukasz Osipiuk
8161ca6fdb
Use taint keys from k8s.io/api/core/v1 module
2019-11-22 14:08:18 +01:00
Łukasz Osipiuk
f50d9ecce2
Use MoreImportantPod instead GetPodPriority
2019-11-22 14:07:48 +01:00
Kubernetes Prow Robot
8871f1702d
Merge pull request #2521 from losipiuk/lo/rename-stockout
...
Rename STOCKOUT to RESOURCE_POOL_EXHAUSTED
2019-11-12 06:00:07 -08:00
Łukasz Osipiuk
7b499aa4c9
Rename STOCKOUT to RESOURCE_POOL_EXHAUSTED
...
We came into conclusion that using STOCKOUT as error code is too
specific. Migrating to more general term RESOURCE_POOL_EXHAUSTED.
2019-11-12 14:39:51 +01:00
Vivek Bagade
910e75365c
remove temporary nodes logic
2019-11-12 11:58:29 +01:00
Kubernetes Prow Robot
19dcfbd25e
Merge pull request #2476 from tghartland/fix-scale-down-errorf
...
CA: Make error message in scale down node draining consistent
2019-11-04 01:01:40 -08:00
Jarvis-Zhou
7c9d6e3518
Do not assign return values to variables when not needed
2019-10-25 19:28:00 +08:00
Thomas Hartland
229fc959b4
Make error message in scale down consistent
2019-10-23 15:28:09 +02:00
Łukasz Osipiuk
7f083d2393
Move core/utils.go to separate package and split into multiple files
2019-10-22 14:23:40 +02:00
Łukasz Osipiuk
41e9271b9e
Remove unused GetCandidatesForScaleDown
2019-10-22 14:23:38 +02:00
Kubernetes Prow Robot
3f137fde4f
Merge pull request #2448 from hectorj2f/hectorj2f/chore_typos
...
cluster-autoscaler: fix some typos in the code
2019-10-21 00:33:37 -07:00
Łukasz Osipiuk
288d4107b2
Rename GetCreatedNodesWithOutOfResourcesErrors to GetCreatedNodesWithErrors
2019-10-14 10:56:56 +02:00
Hector Fernandez
24401b373f
cluster-autoscaler: fix some typos in the code
2019-10-13 12:52:53 +02:00
Thomas Hartland
c51b7ee72a
Update TestRemoveOldUnregisteredNodes to pass cluster state registry
2019-09-30 14:29:02 +02:00
Thomas Hartland
474eef6d47
Invalidate node instances cache after deleting unregistered nodes
2019-09-30 14:29:02 +02:00
Thomas Hartland
7c17d52ec8
Invalidate node instances cache after deleting failed nodes
2019-09-30 13:56:33 +02:00
Kubernetes Prow Robot
791f0d8355
Merge pull request #2281 from DataDog/JulienBalestra/mig-block
...
cluster-autoscaler: blocked if an instance is detached from MIG
2019-09-11 05:03:22 -07:00
Julien Balestra
3441f616e1
cluster-autoscaler/skip-node: unblock cluster autoscaler when having a single nodegroup for node error
...
Signed-off-by: Julien Balestra <julien.balestra@datadoghq.com>
2019-09-11 13:40:23 +02:00
Krzysztof Jastrzebski
839cdaaa09
Stop disabling Cluster Autoscaler when there is no ready nodes.
2019-09-06 14:45:34 +02:00
Julien Balestra
6d707a08ac
cluster-autoscaler/metrics: expose the scale down cooldown
...
Signed-off-by: Julien Balestra <julien.balestra@datadoghq.com>
2019-08-27 18:12:33 +02:00
Kubernetes Prow Robot
9aac43e237
Merge pull request #2235 from piontec/fix/aws_spots_squashed
...
correctly handle lack of capacity of AWS spot ASGs
2019-08-19 04:27:30 -07:00
Kubernetes Prow Robot
4c056fb8ba
Merge pull request #2259 from towca/jtuznik/rejected-node-groups-more-info
...
Provide ScaleUpStatusProcessor with info about all rejected node groups
2019-08-19 04:05:31 -07:00
Kubernetes Prow Robot
3f0a5fa3c2
Merge pull request #2233 from vivekbagade/surge
...
Adding ScaleDownNodeProcessor
2019-08-19 03:59:32 -07:00
Jakub Tużnik
43466ff837
Provide ScaleUpStatusProcessor with info about all rejected node groups
...
Previously, it had info only about the ones that actually exist.
The changes to the eventing processor are done to keep its previous
behavior the same.
2019-08-19 12:48:10 +02:00
Łukasz Piątkowski
8d9b81caaa
correctly handle lack of capacity of AWS spot ASGs
2019-08-19 12:43:53 +02:00
Kubernetes Prow Robot
60bdca087d
Merge pull request #2255 from towca/jtuznik/create-node-group-result
...
Provide more info to ScaleUpStatusProcessor
2019-08-13 06:51:41 -07:00
Vivek Bagade
dc64d0aab2
Adding ScaleDownNodeProcessor
2019-08-12 20:19:55 +02:00
Jakub Tużnik
935476a7e2
Provide more info to ScaleUpStatusProcessor
...
Add info about considered and created nodegroups to
ScaleUpStatusProcessor
2019-08-12 17:20:09 +02:00
Jakub Tużnik
44ae89dd09
Communicate the result of RemoveUnneededNodeGroups to ScaleDownStatusProcessor
2019-08-12 17:03:51 +02:00
t-qini
f7c563ab06
Modify the code as the simple solution proposed by MaciekPytel.
2019-07-18 23:58:05 +08:00
t-qini
622a838c2c
Modify nodal similarity rules.
2019-07-09 16:04:40 +08:00
Kubernetes Prow Robot
c6067574c1
Merge pull request #2160 from aleksandra-malinowska/scale-up-events-fix
...
Add resource limit type to NotTriggerScaleUp event
2019-07-05 05:48:38 -07:00
Aleksandra Malinowska
0d0c9440f6
Add no scale up test
2019-07-03 16:38:53 +02:00
Aleksandra Malinowska
7b80f4e8b8
Separate running scale up test from checking results
2019-07-03 16:38:52 +02:00
Aleksandra Malinowska
c27ae4eb24
Add resource limit type to NotTriggerScaleUp event
2019-07-03 16:38:46 +02:00
Aleksandra Malinowska
d01a2392db
Make scale down unit tests faster
2019-07-03 13:12:48 +02:00
Pengfei Ni
d45fee06da
Ensure upcoming nodes are different
2019-07-02 16:52:19 +08:00
silenceper
478660a6bb
fix error
2019-06-28 18:49:58 +08:00
Vivek Bagade
0a75333e1b
Potential performance improvement in bin packing unschedulable pods
2019-06-19 18:39:47 +02:00
Vivek Bagade
90aa28a077
Move pod packing in upcoming nodes to RunOnce from Estimator for performance improvements
2019-06-19 14:48:47 +02:00
Kubernetes Prow Robot
da36677d04
Merge pull request #2108 from losipiuk/lo/other-error-ut
...
Add unit test case for OTHER error handling
2019-06-10 05:29:08 -07:00
Łukasz Osipiuk
0bcf5315a7
Do not fail loop iteration if unregistered nodes cannot be removed
...
The mechanism of unregistered nodes removal is not the first
responsibility of Cluster Autoscaler. We do not want to renderi CA
unsable (disable scale-up and scale-down) if removing unregistered nodes
cannot be done for prolonged period of time.
2019-06-10 13:45:54 +02:00
Łukasz Osipiuk
be68d06b40
Add unit test case for OTHER error handling
2019-06-07 16:54:01 +02:00
Jakub Tużnik
bb382f47f9
Retain information about scale-up failures in CSR
...
This will provide the AutoscalingStatusProcessor with information
about failed scale-ups.
2019-06-05 16:53:30 +02:00
Krzysztof Jastrzebski
22b4a6283e
Optimize building node infos by using map with pods for nodes.
2019-06-03 13:24:09 +02:00
Kubernetes Prow Robot
a0853bcc80
Merge pull request #2071 from losipiuk/lo/predicate-checker-speedup
...
Precompute inter pod equivalence groups in checkPodsSchedulableOnNode
2019-06-03 03:52:16 -07:00
Krzysztof Jastrzebski
4831d76288
Cache cloud provider node instances in cluster state.
2019-05-31 10:11:51 +02:00
Łukasz Osipiuk
a849ead286
Precompute inter pod equivalence groups in checkPodsSchedulableOnNode
2019-05-29 18:05:52 +02:00
Krzysztof Jastrzebski
6944f3fc56
Delete zero values from deletionsInProgress map in NodeDeletionTracker.
2019-05-28 14:34:56 +02:00