Commit Graph

20 Commits

Author SHA1 Message Date
googs1025 14190b2f84 feature: use multi factor for overcommit plugin
Signed-off-by: googs1025 <googs1025@gmail.com>
2024-08-07 19:00:33 +08:00
lowang-bh 3ee5e8ef4f resource compare support only consider the requested resource item
Signed-off-by: lowang-bh <lhui_wang@163.com>
2024-07-06 10:28:59 +08:00
guoqinwill a26b897936 fix panic when the gpu is faulty
Signed-off-by: guoqinwill <guoqinwill@163.com>
2024-03-20 11:27:04 +08:00
Volcano Bot c58d710b6c
Merge pull request #3039 from lowang-bh/little_enhancement
enhancement: copy cluster total resource from ssn, instead of summing up them again
2024-01-18 15:25:49 +08:00
lowang-bh 7a0c6c0637 enhancement: don't need to re-calculate job's allocated resource
Signed-off-by: lowang-bh <lhui_wang@163.com>
2023-08-17 17:36:38 +08:00
lowang-bh ac8fe7454b enhancement: copy cluster total resource from ssn, instead of summing up them again
Signed-off-by: lowang-bh <lhui_wang@163.com>
2023-08-11 11:21:29 +08:00
shaoqiu 80a6064265 ignore no metrics server error log, lower useless log info level;
Signed-off-by: shaoqiu <516595344@qq.com>
2023-03-30 22:45:11 +08:00
shaoqiu 2ab0ecb2b3 upgrade klog package version to v2.80.1;
Signed-off-by: shaoqiu <516595344@qq.com>
2023-02-14 19:47:40 +08:00
jiangkaihua 0c6febf894 Modify format verification by gofmt.
Signed-off-by: jiangkaihua <jiangkaihua1@huawei.com>
2022-10-13 19:16:46 +08:00
Binbin Zou 97e14bb138 fix calculate inqueue resource bug in opensession
Signed-off-by: Binbin Zou <zbbkeepgoing@163.com>
2022-05-07 11:19:35 +08:00
merryzhou 1678d064d4 add events
Signed-off-by: merryzhou <merryzhou0882@gmail.com>
2022-04-27 18:19:38 +08:00
Thor-wl 7dfec97c04 fix the proportion plugin that ignore the inqueue resource in running jobs when podgroup's minResource is not nil
Signed-off-by: Thor-wl <13164644535@163.com>
2022-03-04 21:59:41 +08:00
Thor-wl 3e098ec2d4
rename resource comparision functions
Signed-off-by: Thor-wl <13164644535@163.com>
2021-07-16 11:30:30 +08:00
merryzhou 88f06fd4e9 fix job enqueue issue
Signed-off-by: merryzhou <merryzhou0882@gmail.com>
2021-07-13 17:37:54 +08:00
Thor-wl cb1ee8f566
add LessEqualInAllDimension function and remove LessEqual/LessEqualStrict functions
Signed-off-by: Thor-wl <13164644535@163.com>
2021-07-12 12:04:47 +08:00
WuLei 3d2aa257a1
update dependency to volcano.sh/api (#1401)
* update dependency to volcano.sh/api

Signed-off-by: Thor-wl <13164644535@163.com>

* update CI timeout to five minutes

Signed-off-by: Thor-wl <1187526662@qq.com>

* resovle the conflict

Signed-off-by: Thor-wl <13164644535@163.com>
2021-04-15 04:08:02 +00:00
jiangkaihua 91202a1929 Update overcommit plugin.
Signed-off-by: jiangkaihua <jiangkaihua1@huawei.com>
2021-02-20 18:52:52 +08:00
jiangkaihua e2358304a2 Add sla plugin for job resource reservation feature.
Signed-off-by: jiangkaihua <jiangkaihua1@huawei.com>
2021-02-19 10:59:23 +08:00
jiangkaihua 763fdbff0e Update log for overcommit plugin.
Signed-off-by: jiangkaihua <jiangkaihua1@huawei.com>
2021-02-08 22:34:56 +08:00
jiangkaihua 7014b84787 Add overcommit plugin to limit pending jobs going through enqueue action.
Signed-off-by: jiangkaihua <jiangkaihua1@huawei.com>
2021-02-08 19:57:37 +08:00