Sia: Heterogeneity-aware, goodput-optimized ML-cluster scheduling
Published in Proceedings of the 29th Symposium on Operating Systems Principles (SOSP '23), 2023
Recommended citation: Suhas Jayaram Subramanya, Daiyaan Arfeen, Shouxu Lin, Aurick Qiao, Zhihao Jia, and Gregory R. Ganger. 2023. Sia: Heterogeneity-aware, goodput-optimized ML-cluster scheduling. In Proceedings of the 29th Symposium on Operating Systems Principles (SOSP '23). Association for Computing Machinery, New York, NY, USA, 642–657. https://doi.org/10.1145/3600006.3613175
/files/sia-sosp23.pdf