Tools/AI Deployment & MLOps/Cortex (NVIDIA)

Cortex (NVIDIA)

Open-source ML deployment platform for Kubernetes.

Open SourceSelf HostedOffline Capable

0.0 (0)

Visit Website View on GitHub

About

Cortex is an open-source platform for deploying machine learning models in production on Kubernetes and AWS EKS. It supports real-time, async, and batch inference workloads, elastic CPU and GPU autoscaling, spot instance scheduling, and integration with monitoring stacks via Grafana, CloudWatch, and Prometheus. The project is no longer actively maintained by its original authors and has shifted to community-only support.

Reviews (0)

Leave a Review

No reviews yet. Be the first to review!

Details

Category: AI Deployment & MLOps
Price: Free
Platform: Local/Desktop
Difficulty: Intermediate (3/5)
License: Apache-2.0
Added: Apr 3, 2026

Tags

deployment kubernetes scaling monitoring cli

Related Tools

Cortex

AI Deployment & MLOps

Local AI API platform that runs LLMs on your hardware with OpenAI-compatible API.

Open SourceSelf HostedOffline

Easy

0.0 (0)

Bifrost

AI Deployment & MLOps

Self-hosted Go gateway that routes LLM traffic across providers with failover, caching, and guardrails.

Open SourceSelf Hosted

Intermediate

0.0 (0)

dstack

AI Deployment & MLOps

Open-source orchestrator for AI training and inference across clouds, Kubernetes, and bare metal.

Open SourceSelf HostedOffline

Advanced

0.0 (0)

Flyte

AI Deployment & MLOps

Kubernetes-native workflow orchestration platform for machine learning and data pipelines.

Open SourceSelf HostedOffline

Advanced

0.0 (0)

Portkey AI Gateway

AI Deployment & MLOps

Open-source AI gateway that routes requests to more than 1,600 LLMs through one API with guardrails and caching.

Open SourceSelf Hosted

Easy

0.0 (0)

Featured

BentoML

AI Deployment & MLOps

Framework for building production-ready AI application services.

Open SourceSelf HostedOffline

Easy

0.0 (0)

Browse all AI Deployment & MLOps tools