Distributed ML Training and Fine-Tuning on Kubernetes
APACHE-2.0 License
Kubernetes Operator for MPI-based applications (distributed training, HPC, etc.)
A performant and modular runtime for TensorFlow
TensorFlow 最新官方文档中文版
Machine Learning Toolkit for Kubernetes
Machine learning operator & controller for Kubernetes