mlsys_papers

[WIP] mlsys_papers

A curated list of machine learning papers from recent major system conferences, specifically SOSP and OSDI within the last three years, and high popularity paper. Topics, titles, keywords, and authors that are bolded reflect personal preferences or special relationship to us.

Topics of interest

System for Machine Learning

Distributed Systems

Model training Framework

Model Serving

Model Inference Framework
Scheduling and Resource Management

Fault Tolerance

Accelerate and optimize Machine Learning

AI Compiler and Programming Languages

Parallelism

Database and Storage

GPU Arch

Machine Learning for Systems

Resource Management

Reliability