Cortex (NVIDIA)

Open-source ML deployment platform for Kubernetes.

Open SourceSelf HostedOffline Capable
0.0 (0)

About

Cortex is an open-source platform for deploying machine learning models in production on Kubernetes and AWS EKS. It supports real-time, async, and batch inference workloads, elastic CPU and GPU autoscaling, spot instance scheduling, and integration with monitoring stacks via Grafana, CloudWatch, and Prometheus. The project is no longer actively maintained by its original authors and has shifted to community-only support.

Reviews (0)

Leave a Review

No reviews yet. Be the first to review!

Details

Price
Free
Platform
Local/Desktop
Difficulty
Intermediate (3/5)
License
Apache-2.0
Added
Apr 3, 2026

Related Tools

Container tool by Replicate for packaging ML models as standard Docker images.

Open SourceSelf HostedOffline
Easy
0.0 (0)

Local AI API platform that runs LLMs on your hardware with OpenAI-compatible API.

Open SourceSelf HostedOffline
Easy
0.0 (0)

Production model serving system for TensorFlow models.

Open SourceSelf HostedOffline
Intermediate
0.0 (0)

PyTorch model serving framework for production deployment.

Open SourceSelf HostedOffline
Intermediate
0.0 (0)
Featured

NVIDIA inference serving platform for deploying AI models at scale.

Open SourceSelf HostedOfflineGPU 8GB+
Advanced
0.0 (0)
Featured

Framework for building production-ready AI application services.

Open SourceSelf HostedOffline
Easy
0.0 (0)
Browse all AI Deployment & MLOps tools