Netflix Simplified Batch Compute with Kueue
Netflix migrated its batch compute infrastructure to leverage Kueue, an open-source Kubernetes scheduler, to simplify job scheduling and improve resource utilization. By adopting Kueue, Netflix replaced custom-built solutions, achieving better scalability and operational efficiency for its data processing workloads.
Background
Netflix runs massive batch compute jobs (e.g., video encoding, data processing) on Kubernetes. For years they used a custom-built internal scheduler called Titus. Kueue is an open-source Kubernetes job scheduler originally from Google; Netflix has now adopted it alongside their existing GPU infrastructure. This shift lets them handle diverse batch workloads with a single, community-standard tool instead of maintaining their own. The article discusses how they adapted Kueue to work with their GPU fleets and high-throughput encoding pipelines — a concrete example of a broader industry move away from bespoke internal systems toward shared open-source projects in the cloud-native ecosystem.