Kubernetes is the de facto platform for running workloads at scale. This talk will present KWOK (
https://kwok.sigs.k8s.io/), an open-source toolkit that enables the creation and testing of large-scale Kubernetes clusters with minimal resources, even on a laptop.
Shiming Zhang, the creator and maintainer of KWOK, and Yuan Chen, an engineer at NVIDIA GPU Cloud, will outline KWOK's capabilities to generate and manage a large number of virtual nodes that simulate Kubelet APIs and mimic real nodes, allowing for workload deployment and testing. They will discuss practical use cases of KWOK.
The talk will then introduce KWOK's recent enhancements for reliability and fault-tolerance testing, showcasing its ability to simulate failures by injecting targeted faults into nodes and pods. Through examples and demos, the talk will demonstrate how KWOK can be used for reliability testing and evaluating fault-tolerance mechanisms, ultimately improving workload resilience in Kubernetes.
Kubernetes是运行大规模工作负载的事实标准平台。本次演讲将介绍KWOK(
https://kwok.sigs.k8s.io/),这是一个开源工具包,可以利用极少的资源(甚至在笔记本电脑上)创建和测试大规模Kubernetes集群。
KWOK的创始人和维护者张世明,以及NVIDIA GPU Cloud的工程师陈源,将详细阐述KWOK的功能,包括生成和管理大量模拟Kubelet API和真实节点的虚拟节点,从而支持工作负载的部署和测试。他们将讨论KWOK的实际使用案例。
演讲还将介绍KWOK最近针对可靠性和容错性测试的增强功能,展示其通过向节点和Pod注入有针对性的故障来模拟故障的能力。通过示例和演示,演讲将展示如何利用KWOK进行可靠性测试和评估容错机制,从而最终提升Kubernetes中工作负载的弹性能力。