googs1025
14190b2f84
feature: use multi factor for overcommit plugin
...
Signed-off-by: googs1025 <googs1025@gmail.com>
2024-08-07 19:00:33 +08:00
lowang-bh
3ee5e8ef4f
resource compare support only consider the requested resource item
...
Signed-off-by: lowang-bh <lhui_wang@163.com>
2024-07-06 10:28:59 +08:00
guoqinwill
a26b897936
fix panic when the gpu is faulty
...
Signed-off-by: guoqinwill <guoqinwill@163.com>
2024-03-20 11:27:04 +08:00
Volcano Bot
c58d710b6c
Merge pull request #3039 from lowang-bh/little_enhancement
...
enhancement: copy cluster total resource from ssn, instead of summing up them again
2024-01-18 15:25:49 +08:00
lowang-bh
7a0c6c0637
enhancement: don't need to re-calculate job's allocated resource
...
Signed-off-by: lowang-bh <lhui_wang@163.com>
2023-08-17 17:36:38 +08:00
lowang-bh
ac8fe7454b
enhancement: copy cluster total resource from ssn, instead of summing up them again
...
Signed-off-by: lowang-bh <lhui_wang@163.com>
2023-08-11 11:21:29 +08:00
shaoqiu
80a6064265
ignore no metrics server error log, lower useless log info level;
...
Signed-off-by: shaoqiu <516595344@qq.com>
2023-03-30 22:45:11 +08:00
shaoqiu
2ab0ecb2b3
upgrade klog package version to v2.80.1;
...
Signed-off-by: shaoqiu <516595344@qq.com>
2023-02-14 19:47:40 +08:00
jiangkaihua
0c6febf894
Modify format verification by gofmt.
...
Signed-off-by: jiangkaihua <jiangkaihua1@huawei.com>
2022-10-13 19:16:46 +08:00
Binbin Zou
97e14bb138
fix calculate inqueue resource bug in opensession
...
Signed-off-by: Binbin Zou <zbbkeepgoing@163.com>
2022-05-07 11:19:35 +08:00
merryzhou
1678d064d4
add events
...
Signed-off-by: merryzhou <merryzhou0882@gmail.com>
2022-04-27 18:19:38 +08:00
Thor-wl
7dfec97c04
fix the proportion plugin that ignore the inqueue resource in running jobs when podgroup's minResource is not nil
...
Signed-off-by: Thor-wl <13164644535@163.com>
2022-03-04 21:59:41 +08:00
Thor-wl
3e098ec2d4
rename resource comparision functions
...
Signed-off-by: Thor-wl <13164644535@163.com>
2021-07-16 11:30:30 +08:00
merryzhou
88f06fd4e9
fix job enqueue issue
...
Signed-off-by: merryzhou <merryzhou0882@gmail.com>
2021-07-13 17:37:54 +08:00
Thor-wl
cb1ee8f566
add LessEqualInAllDimension function and remove LessEqual/LessEqualStrict functions
...
Signed-off-by: Thor-wl <13164644535@163.com>
2021-07-12 12:04:47 +08:00
WuLei
3d2aa257a1
update dependency to volcano.sh/api ( #1401 )
...
* update dependency to volcano.sh/api
Signed-off-by: Thor-wl <13164644535@163.com>
* update CI timeout to five minutes
Signed-off-by: Thor-wl <1187526662@qq.com>
* resovle the conflict
Signed-off-by: Thor-wl <13164644535@163.com>
2021-04-15 04:08:02 +00:00
jiangkaihua
91202a1929
Update overcommit plugin.
...
Signed-off-by: jiangkaihua <jiangkaihua1@huawei.com>
2021-02-20 18:52:52 +08:00
jiangkaihua
e2358304a2
Add sla plugin for job resource reservation feature.
...
Signed-off-by: jiangkaihua <jiangkaihua1@huawei.com>
2021-02-19 10:59:23 +08:00
jiangkaihua
763fdbff0e
Update log for overcommit plugin.
...
Signed-off-by: jiangkaihua <jiangkaihua1@huawei.com>
2021-02-08 22:34:56 +08:00
jiangkaihua
7014b84787
Add overcommit plugin to limit pending jobs going through enqueue action.
...
Signed-off-by: jiangkaihua <jiangkaihua1@huawei.com>
2021-02-08 19:57:37 +08:00