Commit Graph

60 Commits

Author SHA1 Message Date
Łukasz Osipiuk 29c22c0a3d Store single ScaleUpRequest per node group 2018-10-18 18:27:31 +02:00
Jakub Tużnik b105f28ebd Add a method to determine if a node group is at its its target size to CSR 2018-09-07 20:24:38 +02:00
Aleksandra Malinowska 364e2da764 Check for ready condition not true 2018-08-30 13:43:24 +02:00
Jakub Tużnik 51334f283e Fix GetClusterSize to return actual size in line with the rest of CSR
It returned the number of registered nodes, but should return the number
of provisioned nodes instead.
2018-08-27 14:58:07 +02:00
Jakub Tużnik 054f0b3b90 Add AutoscalingStatusProcessor 2018-08-07 14:47:06 +02:00
Krzysztof Jastrzebski dd1db7a0ac Move backoff mechanism to utils. 2018-06-13 15:32:25 +02:00
GFilipek 4a54745d19 Fix timestamp format in ClusterAutoscaler ConfigMap 2018-05-29 09:42:51 +02:00
Aleksandra Malinowska 820f688d2a Update max unready nodes to 45% 2018-05-17 12:51:45 +02:00
AdamDang cefb7cc36d
Update types.go
line 35-36: "ClusterAutoscalerScaleUp is a condition that explains what is the current status of a node group with regard to scale down activities".
It should be "...up activities"
2018-03-30 00:13:35 +08:00
Aleksandra Malinowska 4c594db7f8 Run spellchecker 2018-03-15 15:47:49 +01:00
AdamDang d96d1ef3ba
Typo fix "record"->"records"
"record" should be replaced with "records" here.
2018-03-12 21:06:31 +08:00
Hang Yan b4713c22d5 Fix various typos in clusterstate package 2018-02-07 16:03:51 +08:00
Aleksandra Malinowska 3894ecb470 Export unregistered node count metric 2018-01-16 16:56:40 +01:00
Maciej Pytel 53603d0a2a Increase MaxNodeStartupTime to 15 minutes. 2017-11-13 15:14:47 +01:00
Edward Tsang 4104a91991 more spelling fixes 2017-11-02 14:21:36 -07:00
Maciej Pytel c376ef3c87 Add metrics for autoprovisioning 2017-10-31 17:42:58 +01:00
Maciej Pytel 02ccba3338 Update clusterstate after scale-up 2017-10-17 16:11:25 +02:00
Marcin Wielgus f658450b16 Merge pull request #379 from MaciekPytel/long_unregistered_node
Keep track of nodes that failed to register for a long time
2017-09-28 15:01:32 +02:00
Maciej Pytel ff21b0b00c Keep track of nodes that failed to register for a long time
Previously a node that failed to register and couldn't be deleted
basically broke CA.
2017-09-27 16:32:04 +02:00
Maciej Pytel e12ee88f5f Add failed scale-up reason in metric 2017-09-26 13:40:34 +02:00
Maciej Pytel 5e05c84cf0 Add metric counting failed scale-ups
A minor refactor was required to avoid cyclic imports
2017-09-22 18:12:50 +02:00
Maciej Pytel a440d92a60 Log event on scale-up timeout 2017-09-01 14:19:14 +02:00
Marcin Wielgus 998b3f1acd Merge pull request #198 from MaciekPytel/support_zone_failures
Backoff for node group after failed scale-up
2017-08-16 20:46:45 +05:30
Marcin Wielgus 9116e4c08c Compilation fix for CA after godeps update 2017-08-11 17:56:47 +02:00
Maciej Pytel 6aacbb5bf7 Backoff for node group after failed scale-up 2017-08-04 15:40:23 +02:00
Ivan Towlson 902d2414b7 Fixed typoes of name 'Kubernetes' 2017-08-03 14:20:23 +12:00
Maciej Pytel 566aa38099 Stop spamming warnings for every 0 size nodegroups 2017-07-28 15:25:58 +02:00
Aleksandra Malinowska c159a90f04 rename test provider package 2017-07-06 16:23:15 +02:00
Marcin Wielgus fc43808149 Godeps bump for CA 2017-07-03 22:05:11 +02:00
Maciej Pytel fe514ed75d Make status configmap respect namespace parameter 2017-06-14 14:07:13 +02:00
Maciej Pytel 7c5679baaf Improve error handling in status configmap 2017-06-14 13:07:06 +02:00
Marcin Wielgus 0a8a88c580 Handle empty node groups in cluster state 2017-05-19 17:46:53 +02:00
Maciej Pytel 83ef3d2be3 Added CA metrics related to cluster state 2017-05-11 13:54:04 +02:00
Marcin Wielgus 34eb4973f8 Fix imports in cluster autoscaler after migrating it from contrib 2017-04-18 15:42:04 +02:00
Maciej Pytel bf9b6ce203 Cluster-Autoscaler: make status less confusing
Previously min and max in status were refering to
non-obvious internal variables, which was pretty confusing.
2017-03-28 13:31:12 +02:00
Maciej Pytel 28b40c45db Cluster-Autoscaler: fix scaleup status reporting
Previously the cluster would move to NoActivity too quickly
(before new nodes were actually ready).
2017-03-28 11:58:56 +02:00
Marcin Wielgus eb3e6173d1 Cluster-autoscaler: Fix isNodeStarting 2017-03-27 23:27:14 +02:00
Maciej Pytel 10d560dae6 Cluster-Autoscaler: handle nil node group
In a few place we assumed it's not-nil, leading
to segfaults.
2017-03-13 14:46:11 +01:00
Maciej Pytel 46d2c66473 Cluster-autoscaler: set timestamps in status configmap 2017-03-08 11:51:20 +01:00
Maciej Pytel d3bf5d3d51 Cluster-Autoscaler: log events on status configmap 2017-03-06 12:21:24 +01:00
Maciej Pytel 585c46419d Cluster-autoscaler: implemented LogCollector 2017-02-23 17:57:59 +01:00
Kubernetes Submit Queue 7fc42e9419 Merge pull request https://github.com/kubernetes/contrib/pull/2409 from MaciekPytel/ca_status_string
Automatic merge from submit-queue

Cluster-autoscaler: readable status printing

Example output (github seems to slightly mess up tabs):
```
Health:      Healthy (ready=0 unready=1 notStarted=0 longNotStarted=0 registered=1)
             LastProbeTime:      0001-01-01 00:00:00 +0000 UTC                     
             LastTransitionTime: 0001-01-01 00:00:00 +0000 UTC                     
ScaleUp:     NoActivity (ready=0 registered=1)                                     
             LastProbeTime:      0001-01-01 00:00:00 +0000 UTC                     
             LastTransitionTime: 0001-01-01 00:00:00 +0000 UTC                     
ScaleDown:   NoCandidates (candidates=0)                                           
             LastProbeTime:      0001-01-01 00:00:00 +0000 UTC                     
             LastTransitionTime: 0001-01-01 00:00:00 +0000 UTC                     
                                                                                   
NodeGroups:                                                                        
  Name:        ng1                                                                 
  Health:      Unhealthy (ready=0 unready=1 notStarted=0 longNotStarted=0 registered=1 cloudProviderTarget=5 (min=5, max=5))
               LastProbeTime:      0001-01-01 00:00:00 +0000 UTC                   
               LastTransitionTime: 0001-01-01 00:00:00 +0000 UTC                   
  ScaleUp:     NoActivity (ready=0 cloudProviderTarget=5)                          
               LastProbeTime:      0001-01-01 00:00:00 +0000 UTC                   
               LastTransitionTime: 0001-01-01 00:00:00 +0000 UTC                   
  ScaleDown:   NoCandidates (candidates=0)                                         
               LastProbeTime:      0001-01-01 00:00:00 +0000 UTC                   
               LastTransitionTime: 0001-01-01 00:00:00 +0000 UTC
```
2017-02-23 06:38:58 -08:00
fate-grand-order 2f3b410844 fix missspell in clusterstate.go 2017-02-23 21:42:03 +08:00
Maciej Pytel b7e79af2d8 Cluster-autoscaler: readable status printing 2017-02-23 13:24:48 +01:00
Marcin Wielgus 8cfed0b474 Cluster-autoscaler: GetStatus - scaleDown 2017-02-21 19:56:07 +01:00
Marcin Wielgus 87f0d62b28 Cluster-autoscaler: scale up status 2017-02-21 16:21:36 +01:00
Marcin Wielgus d9d5a751f5 Cluster-autoscaler: GetState() - health condition 2017-02-21 13:15:19 +01:00
Marcin Wielgus d42ef4e1ea Cluster-autoscaler: api type object for status reporting 2017-02-20 17:15:40 +01:00
Marcin Wielgus ce45c33d29 Cluster-autoscaler: update CA code for godep refresh 2017-01-20 14:46:34 +01:00
Marcin Wielgus 1d19914f93 Merge branch 'master' into enable-unready 2017-01-18 15:10:50 +01:00