Stars
Device-plugin for volcano vgpu which support hard resource isolation
AISystem 主要是指AI系统,包括AI芯片、AI编译器、AI推理和训练框架等AI全栈底层技术
kubectl plugin to browse Kubernetes object hierarchies as a tree 🎄 (star the repo if you are using)
Making large AI models cheaper, faster and more accessible
DLRover: An Automatic Distributed Deep Learning System
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
Open source hyperconverged infrastructure (HCI) software
renwenlong-github / client-go
Forked from kubernetes/client-goGo client for Kubernetes.
Leader election for your own services, building on Kubernetes' APIs/etcd.
renwenlong-github / volcano
Forked from volcano-sh/volcanoA Cloud Native Batch System (Project under CNCF)
skytodmoon / cube-studio
Forked from tencentmusic/cube-studio云原生一站式机器学习平台,oa,项目组权限,在线开发,分布式训练,超参搜索,推理服务,多集群调度,多项目组资源组,边缘计算
cube studio开源云原生一站式机器学习/深度学习/大模型AI平台,支持sso登录,多租户,大数据平台对接,notebook在线开发,拖拉拽任务流pipeline编排,多机多卡分布式训练,超参搜索,推理服务VGPU,边缘计算,serverless,标注平台,自动化标注,数据集管理,大模型微调,vllm大模型推理,llmops,私有知识库,AI模型应用商店,支持模型一键开发/推理/微调,…
Using CRDs to manage GPU resources in Kubernetes.
【2021年新鲜出炉】K8s(Kubernetes)的工程师资料合辑,书籍推荐,面试题,精选文章,开源项目,PPT,视频,大厂资料
A basic instruction for Kubernetes setup and understanding.
Kubernetes Native Edge Computing Framework (project under CNCF)
项目使用华为云 ModelArts AI 开发平台进行训练部署,采用 Mask R-CNN 算法模型进行识别预测。
The storageclass-accessor webhook is an HTTP callback which responds to admission requests.