Apparate: Early-Exit Models for ML Latency and Throughput Optimization - Additional Related Work | HackerNoonApparate offers a solution to latency-throughput tension in model serving with a focus on early-exit strategies.