Skip to content

Releases: volcano-sh/volcano

v1.4.0-Beta

04 Sep 11:40
9aed970
Compare
Choose a tag to compare

What's New

1. Support multi-scheduler

In Kubernetes cluster with multiple schedulers, different kinds of workloads should be mapped to certain scheduler sometimes. For example, K8s native workloads such as deployment in namespace kube-system are mapped to default-sheduler while AI and Big data jobs are mapped to Volcano. This feature aims to implements that automaticallty. More details please refer to https://github.com/volcano-sh/volcano/blob/master/docs/design/multi-scheduler.md. (#1576, #1521, @huone1 @william-wang )

2. Support proportion of resources for GPU node

In order to make full use of scarce resources such as GPU, one solution is to bind them with other resources as shares. For example, it is common to see a lot of CPU-intensive workloads are scheduled to GPU nodes. When GPU-intensive workloads come, they cannot be scheduled because of lack of CPU or Memory in GPU nodes. If workloads requiring both GPU, CPU, Memory at certatin range can be scheduled to GPU nodes first, it is possible to make full use of GPUs. More details please refer to https://github.com/volcano-sh/volcano/blob/master/docs/design/proportional.md. (#1527, @king-jingxiang )

3. Support CPU NUMA-Aware scheduling

As to CPU-intensive workloads especially in AI, Big Data and HPC fileds, It will result in a significant performance improvement if CPU NUMA is enabled. More details please refer to https://github.com/volcano-sh/volcano/blob/master/docs/design/numa-aware.md. (#1493, @huone1 )

4. Provide framework of stress test

In this release, A kind of framework for Volcano stress test is provided. (#1516, @rudeigerc )

Other Notable Changes

Bug Fixes

v1.3.0

27 May 08:18
44ec8eb
Compare
Choose a tag to compare

What's New

1. Support minAvailable at task level

Just as the minAvailable at job level, minAvailable at task level will regard replicases at the same task as group and decide whether to schedule pods at the task. Only when the minAvailable is meet will the pods will be scheduled together. More details please refer to https://github.com/volcano-sh/volcano/blob/master/docs/design/task-minavailable.md. (#1459, @shinytang6 )

2. Support minSuccess for Job

Support to configure the least number of pods belonging to the job. It's useful to mark the status of job when minsuccess reaches or not and accelerates the job status judgement. (#1384, @zen-xu )

3. Support task-topology

In big data processing jobs like Tensorflow & Spark, tasks transmitted a large amount of data between each other, causing transmission delay took a large proportion in job execution time. So task topology plugin was proposed to modify scheduling strategy according to transmission topology inside a job, so as to cut the data amount to be transmitted between nodes, decrease transmission delay proportion in job execution time, and improve resource utilization. More details please refer to https://github.com/volcano-sh/volcano/blob/master/docs/design/task-topology-plugin.md. (#1353, @jiangkaihua )

4. Create new repository volcano.sh/apis

Separate apis from volcano.sh/volcanosh. Any downstream projects can introduce the CRD clientset/lister/informer with the K8s version it needs. (https://github.com/volcano-sh/apis, @Thor-wl )

Other Notable Changes

Bug Fixes

  • fix: lose preemptor when considering Preemption between Tasks within same Job (#1453, @lowang-bh )
  • scheduler need configmap role to enable elect funtion(#1443, @wpeng102 )
  • fix(scheduler): use nodeMap to fix anti-affinity problem(#1430. @shinytang6 )
  • fix: use task.Name to make podName in admission(#1412, @merryzhou )
  • add bindingTasks to judge whether adding node to the snapshot.(#1388, @zen-xu )
  • fix reserving for deleted targetJob raise nil pointer(#1371, @zen-xu )
  • fix sla jobOderFn when sla not set(#1365, @merryzhou )
  • fix: it is possible to Occur OutOfCpu, when exist some pods including init container(#1364, @huone1 )
  • fix wrong Pipeline in action allocate(#1360, @yzs981130 )
  • fix: prevent SelectBestNode func arise panic(#1344, @yahaa )
  • fix(scheduler): move JobInfo helper functions to method(#1343, @Thrimbda )

v1.2.0

27 Feb 07:11
Compare
Choose a tag to compare

What's New

1. Add TDM plugin

TDM(Time Division Multiplexing) plugin aims to provide a mechanism for nodes, which can be used for K8S and other cluster(such as Yarn) in separate time.(#1269, @yahaa )

2. Add SLA plugin

SLA(Service Level Agreement) plugin works for job resource reservation feature. Users can set SLA for jobs to ensure specified jobs to be scheduled in time. It provides an better design and implementation for job resource reservation. (#1303, @jiangkaihua )

Other Notable Changes

Bug Fixes

v1.1.2

23 Feb 06:30
Compare
Choose a tag to compare

Changes since v1.1.1

  • bug fix: Use musl-gcc build image, because vc-scheduler default image is alpine, which only has musl-libc(#1225, @zen-xu)

v1.1.1

31 Dec 07:40
acc6a56
Compare
Choose a tag to compare

What's New

1. support vc-scheduler loading custom plugins

Separate plugin implementation with scheduler. Support implement custom plugins and load to vc-scheduler dynamically.(#1218, @zen-xu)

2. add MaxRequeueNum as a controller-manager param

Support configure MaxRequeueNum in config file of vc-scheduler, default to 15 times.(#1087, @shinytang6)

3. add design documentation of CPU careful regulation

Give the design of CPU careful regulation in socket level.(#1051, @ProgramerGu)

Other Notable Changes

Bug Fixes

v1.1.0

30 Oct 15:35
Compare
Choose a tag to compare

What's New

1. Add monitor compontent

Monitor compontent added support display some base metrics about Volcano.(#1066, @alcorj-mizar)

2. Support resource reservation for big job automatically

Reserve resource for pending job which is at highest priority among pending jobs and waits for a long time. The big job is recognized by scheduler automatically.(#1044, @Thor-wl)

3. Support HDRF

Hierarchical dominant resource fairness is configured with a weighted tree, such that each node in the tree has a positive weight value.(#928, @ggaaooppeenngg)

Other Notable Changes

Bug Fixes

v0.4.2

04 Aug 12:58
89c2fa6
Compare
Choose a tag to compare
  • Fix queue capability validation failed when some running jobs finished or deleted (#959, @Thor-wl)

v1.0.1

30 Jul 13:50
68f40e2
Compare
Choose a tag to compare

Changelog since v1.0.0

v0.4.1

16 Jul 01:12
4feb02f
Compare
Choose a tag to compare

v0.4.1 (2020-07-14)

v1.0.0

08 Jul 03:39
Compare
Choose a tag to compare

1.0 What's New

1. GPU Sharing

Volcano now supports gpu sharing between different pods (#852, @tizhou86, @hzxuzhonghu).

2. Preempt and reclaim enhancement

Volcano is now able to support preempt for batch job (#738, @carmark).

3. Dynamic scale up and down

Volcano job now supports dynamically scale up and down (#787, @hzxuzhonghu).

4. Support integrate with flink operator

Users are now able to run flink job with volcano. Follow the instructions here to make use of the feature. @hzxuzhonghu).

5. Support DAG job with argo

Users are now able to run DAG job with volcano. Follow the instructions here to make use of the feature. @alcorf-mizar).

Other Notable Changes

Bug Fixes