Papers

  • [SOSP ‘23] Sia: Heterogeneity-aware, goodput-optimized ML-cluster scheduling

  • [SC ‘23] Interference-aware Multiplexing for Deep Learning in GPU Clusters: A Middleware Approach A learning-based method to multiplex GPUs with ML tasks, modeled with integer programming problem.