PAL: A Variability-Aware Policy for Scheduling ML Workloads in GPU Clusters
Abstract
References
Index Terms
- PAL: A Variability-Aware Policy for Scheduling ML Workloads in GPU Clusters
Recommendations
Sia: Heterogeneity-aware, goodput-optimized ML-cluster scheduling
SOSP '23: Proceedings of the 29th Symposium on Operating Systems PrinciplesThe Sia scheduler efficiently assigns heterogeneous deep learning (DL) cluster resources to elastic resource-adaptive jobs. Although some recent schedulers address one aspect or another (e.g., heterogeneity or resource-adaptivity), none addresses all ...
A novel GPU resources management and scheduling system based on virtual machines
Virtual machine (VM) technologies offer lots of benefits such as users' isolation, server consolidation and live migration. However, owing to the overhead incurred by indirect access to physical resources such as GPU, IO devices and VM technologies have ...
Cluster scheduling for real-time systems: utilization bounds and run-time overhead
Cluster scheduling, where processors are grouped into clusters and the tasks that are allocated to one cluster are scheduled by a global scheduler, has attracted attention in multiprocessor real-time systems research recently. In this paper, assuming ...
Comments
Information & Contributors
Information
Published In
Sponsors
Publisher
IEEE Press
Publication History
Check for updates
Badges
Author Tags
Qualifiers
- Research-article
- Research
- Refereed limited
Conference
Acceptance Rates
Upcoming Conference
Contributors
Other Metrics
Bibliometrics & Citations
Bibliometrics
Article Metrics
- 0Total Citations
- 156Total Downloads
- Downloads (Last 12 months)156
- Downloads (Last 6 weeks)142
Other Metrics
Citations
View Options
Login options
Check if you have access through your login credentials or your institution to get full access on this article.
Sign in