Crater - Cloud-Native AI Platform
A one-stop solution for machine learning based on Kubernetes.
Efficient experience in AI training and serving.

Why Choose Crater
Out-of-the-Box Deep Learning Platform
Provides an intuitive, easy-to-use interface, eliminating the need to master containers or Kubernetes and lowering the barrier to entry.
Open-Source Enhanced, No Vendor Lock-In
Deeply integrated with open-source projects like Volcano, Fluid, and Envd, ensuring compatibility with the K8s ecosystem and technological autonomy.
Intelligent GPU Sharing for Cost Optimization
Increases GPU resource utilization by 12% through an interference-aware intelligent sharing strategy, without impacting the user's experience.
Core Capabilities
Crater provides comprehensive machine learning platform capabilities, from data management to model training, to solve your AI workflow needs in one stop.
Data Management
- Distributed caching system accelerated by Fluid
- Fine-grained data sharing mechanism
- Intelligent data preprocessing pipeline
Environment Setup
- Envd environment templates, no Docker skills required
- Support for JupyterLab/VSCode remote development
- Environment sharing and rapid reuse
Model Training
- Support for distributed training frameworks
- Real-time GPU utilization monitoring
- Automatic scheduling of training tasks
Performance Monitoring
- Real-time loss curve visualization
- Resource Usage Statistics Reports
- Training Progress Tracking
Version Control
- Model version management
- Experiment tracking and comparison
- Configuration history records
Model Deployment
- One-click model serving
- Auto-scaling
- API management and monitoring
Technical Advantages
High-Performance Computing Architecture
A high-performance computing architecture built on Kubernetes, supporting large-scale distributed training and inference to fully leverage the computational potential of GPU clusters.
- Optimized CUDA accelerated computing
- Efficient memory management mechanisms
- Intelligent resource scheduling algorithms
Enterprise-Grade Security
Provides comprehensive security mechanisms to protect your data and model assets, meeting enterprise-level security and compliance requirements.
- Fine-grained access control
- Data-in-transit encryption
- Audit logs and compliance reporting
Open-Source Ecosystem Integration
Deeply integrates with mainstream open-source components to provide a unified user experience and avoid technological fragmentation.
- Volcano job scheduling engine
- Fluid data acceleration system
- Envd environment management tool
Flexible Scalability
A modular design that supports flexible expansion to adapt to the needs of different scales and scenarios.
- Plugin-based architecture
- Custom workflow support
- API integration capabilities
Customer Scenarios
Crater provides customized solutions for different types of organizations to meet various AI computing needs.
University Research
Replaces traditional Slurm clusters to manage private high-performance GPU nodes, offering a more user-friendly experience and higher resource utilization.
- Multi-user resource isolation
- Research project management
- Flexible permission control
Enterprise AI Teams
Provides a unified development and production environment for enterprise AI teams, accelerating the entire model lifecycle from R&D to deployment.
- DevOps integration
- Model version management
- CI/CD pipelines
Cloud Service Providers
Build public or private cloud AI platform services, offering customers elastic, secure, and efficient machine learning infrastructure.
- Multi-tenant architecture
- Metering and billing system
- Service Level Guarantees
Get Started with Crater
Explore the resources below to quickly learn and deploy Crater, starting your journey into cloud-native AI.