Resource Optimization for ML Inference Serving
EB2 3211 Seminar Room 890 Oval Dr., Raleigh, NC, United StatesTitle: Resource Optimization for ML Inference Serving Abstract: My research focuses on job scheduling and resource management in Machine Learning (ML) and Large Language Model (LLM) systems. With the growing…