You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
In the cases where the appwrapper contains resources that are managed by Kueue and the component implemented ReclaimablePods, we should monitor the workload instances and flow that information through to Kueue.
The text was updated successfully, but these errors were encountered:
In Kueue 0.7 only Job, JobSet, and Pod (for PodGroups) implement this optional interface. Until the interface is more widely adopted, there is limited value to recognizing this situation and flowing it through the AppWrapper to Kueue.
While support for Ray jobs seems problematic given #174, we should be able to implement this interface for wrapped PyTorchJobs, in the following sense: if a PyTorchJob status is failed then we can assume no more pods will be created by the Training operator for this job.
In the cases where the appwrapper contains resources that are managed by Kueue and the component implemented ReclaimablePods, we should monitor the workload instances and flow that information through to Kueue.
The text was updated successfully, but these errors were encountered: