k8s-ci-robot
03283328a7
Merge pull request #1306 from losipiuk/lo/fluentd-ds-ready
...
Ignore lo/fluentd-ds-ready when checking node similarity
2018-10-10 03:55:57 -07:00
Łukasz Osipiuk
e3891ba025
Ignore lo/fluentd-ds-ready when checking node similarity
2018-10-10 09:57:47 +02:00
Alexey Ermakov
9e8d026b19
deletetaint: retry on conflicts
...
Signed-off-by: Alexey Ermakov <alexey.ermakov@zalando.de>
2018-10-08 11:22:07 +02:00
Łukasz Osipiuk
52aaac362f
Remove GetGpuRequests function
2018-09-05 11:58:46 +02:00
Krishnakumar R
a6b81a6ca2
Code cleanup - use const from the package.
2018-08-30 22:30:44 -07:00
Aleksandra Malinowska
364e2da764
Check for ready condition not true
2018-08-30 13:43:24 +02:00
Aleksandra Malinowska
f5690aab96
Make CheckPredicates return predicateError
2018-08-28 14:11:35 +02:00
Aleksandra Malinowska
8b89c8f9cd
Merge pull request #1168 from yguo0905/preemptible-tpu
...
Ignore resources with Cloud TPU prefix
2018-08-21 09:56:14 +01:00
Yang Guo
b41c9828d9
Ignore resources with Cloud TPU prefix
2018-08-20 17:20:44 -07:00
Pengfei Ni
74045053f5
Fix potential panic
2018-07-23 11:13:09 +08:00
Arto Jantunen
11402b5ca7
Allow using the PodSafeToEvictKey annotation in reverse
...
Adding the "cluster-autoscaler.kubernetes.io/safe-to-evict": "false"
annotation to a pod prevents the cluster autoscaler from touching it.
2018-07-11 09:32:56 +03:00
Arto Jantunen
9f3c17d153
Add test to use the PodSafeToEvictKey in reverse
...
When this is set to false instead of true, the pod should not be evicted by
the autoscaler.
2018-07-11 09:30:20 +03:00
Aleksandra Malinowska
800ee56b34
Refactor and extend GPU metrics error types
2018-07-05 13:13:11 +02:00
Karol Gołąb
553db2c9fc
Separated errors
2018-07-05 11:30:12 +02:00
Karol Gołąb
aae4d1270a
Make GetGpuTypeForMetrics more robust
2018-06-26 21:35:16 +02:00
Karol Gołąb
5eb7021f82
Add GPU-related scaled_up & scaled_down metrics ( #974 )
...
* Add GPU-related scaled_up & scaled_down metrics
* Fix name to match SD naming convention
* Fix import after master rebase
* Change the logic to include GPU-being-installed nodes
2018-06-22 21:00:52 +02:00
Łukasz Osipiuk
57ea19599e
Explicitly return AutoscalerError from GetNodeTargetGpus
2018-06-14 15:46:58 +02:00
Aleksandra Malinowska
bc526e71e8
Merge pull request #960 from krzysztof-jastrzebski/backoff
...
Move backoff mechanism to utils.
2018-06-14 14:35:33 +02:00
Krzysztof Jastrzebski
dd1db7a0ac
Move backoff mechanism to utils.
2018-06-13 15:32:25 +02:00
Łukasz Osipiuk
087a5cc9a9
Respect GPU limits in scale_down
2018-06-13 14:19:59 +02:00
Pengfei Ni
5fd15fc96b
Set enableEquivalenceClassCache for schedulerConfigFactory and fix GPU resource name for unit tests
2018-06-11 15:17:46 +08:00
Pengfei Ni
be3dd85503
Update scheduler cache package
2018-06-11 13:54:12 +08:00
MaciekPytel
c41dc43704
Merge pull request #495 from aleksandra-malinowska/resource-limiter-bytes
...
Use bytes instead of MB for memory limits
2018-06-08 14:47:22 +02:00
Maciej Pytel
5faa41e683
Move PodListProcessor to new directory
...
It's not really a util and with more processors
coming it makes more sense to keep them in dedicated place.
2018-05-29 12:00:47 +02:00
Clayton Coleman
6146e0dbc1
Autoscaler doesn't drain nodes that have terminal pods
...
Terminal pods (Succeeded or Failed phase) are drained by kubectl drain,
and autoscaler should also drain them.
2018-05-28 22:35:37 -04:00
Karol Gołąb
bada827839
Simplify the code by removing superfluous variable
2018-05-18 09:38:47 +02:00
Aleksandra Malinowska
3ccfa5be23
Move universal constants to separate module
2018-05-17 18:36:43 +02:00
Łukasz Osipiuk
c406da4174
Support gpus in nodes and pods definitions in UT
2018-05-15 22:43:31 +02:00
Aleksandra Malinowska
b2ad790121
Merge pull request #830 from aleksandra-malinowska/stateful-set-drain
...
Add support for rescheduled pods with the same name in drain
2018-05-11 13:47:53 +02:00
Aleksandra Malinowska
44ba1c719f
Fix log message
2018-05-11 12:58:32 +02:00
Karol Gołąb
f877f5a64e
Remove unused error handling
2018-05-10 12:15:42 +02:00
Krzysztof Jastrzebski
88b769b324
Refactor cluster autoscaler builder and add pod list processor.
2018-04-26 12:37:51 +02:00
Aleksandra Malinowska
7e1353a865
Ignore TPU resource in simulations
2018-04-11 12:26:22 +02:00
AdamDang
d4ba9120e3
correct the returned message in ready.go
...
feadiness->readiness
2018-04-08 23:00:02 +08:00
Aleksandra Malinowska
8ae3636ccf
Fix method name
2018-03-28 13:23:38 +02:00
Aleksandra Malinowska
feb4ad9e14
Add utility for limiting logging
2018-03-22 12:57:22 +01:00
Marcin Wielgus
04bec08e84
Compilation fix
2018-03-20 20:11:36 +01:00
Aleksandra Malinowska
4c594db7f8
Run spellchecker
2018-03-15 15:47:49 +01:00
AdamDang
5c4693f95f
Typo fix "typ"->"type"
...
line 31 and line 82: "the typ of AutoscalerError"
here shoule be type
2018-03-13 19:50:19 +08:00
Maciej Pytel
abbc45da2e
Delay scale-up including GPU request
...
Nodes with GPU are expensive and it's likely a bunch of pods
using them will be created in a batch. In this case we can
wait a bit for all pods to be created to make more efficient
scale-up decision.
2018-03-02 15:55:04 +01:00
Maciej Pytel
d876d74912
Ignore unfitness in price expander if using GPU
2018-03-02 15:50:43 +01:00
Maciej Pytel
b7f8622eb2
Create node groups with GPU in scale-up.go
...
This is still not implemented in cloudprovider.
Extended NewNodeGroup inteface to have a way of passing
parameters for more complex resources.
2017-12-11 13:12:22 +01:00
Maciej Pytel
6554919700
Helper function to calculate GPU requests for NAP
2017-12-11 13:12:22 +01:00
Marcin Wielgus
f8c0e20ad9
Source fix after godep update
2017-11-28 14:01:43 +01:00
Marcin Wielgus
26960b49df
Merge pull request #460 from sergeylanzman/replace-depricate-func
...
Replace deprecate kubernetes client functions
2017-11-22 15:58:01 +01:00
Marcin Wielgus
ded016dfd8
Merge pull request #461 from MaciekPytel/gpu_unready_fix
...
Consider GPU nodes unready until allocatable GPU is > 0
2017-11-13 15:29:27 +01:00
Maciej Pytel
d81dca5991
Mark nodes with uninitialized GPUs as unready
2017-11-10 17:56:10 +01:00
Sergey Lanzman
eb546b87a0
Replace deprecate kubernetes client functions
2017-11-09 19:49:41 +02:00
Marcin Wielgus
439fd3c9ec
Merge pull request #411 from krzysztof-jastrzebski/priority
...
Adds priority preemption support to cluster autoscaler.
2017-11-08 09:09:26 +01:00
Beata Skiba
2b28ac1a04
Add a workaround for scaling of VMs with GPUs
...
When a machine with GPU becomes ready it can take
up to 15 minutes before it reports that GPU is allocatable.
This can cause Cluster Autoscaler to trigger a second
unnecessary scale up.
The workaround sets allocatable to capacity for GPU so that
a node that waits for GPUs to become ready to use will be
considered as a place where pods requesting GPUs can be
scheduled.
2017-11-06 16:04:22 +01:00