Ray Serve

Scalable model serving library built on Ray for ML applications.

Open SourceSelf HostedOffline Capable
0.0 (0)

About

Ray Serve is a scalable model-serving library built on the Ray distributed computing framework. It composes multiple models and business logic into deployment graphs, autoscales across CPUs and GPUs, and integrates with FastAPI for HTTP endpoints, staying framework-agnostic. It suits serving ML and LLM applications from a laptop to a cluster. Released under the Apache 2.0 license.

Reviews (0)

Leave a Review

No reviews yet. Be the first to review!

Details

Price
Free
Platform
Local/Desktop
Difficulty
Intermediate (3/5)
License
Apache-2.0
Added
Apr 3, 2026

Related Tools

Featured

Framework for building production-ready AI application services.

Open SourceSelf HostedOffline
Easy
0.0 (0)

Container tool by Replicate for packaging ML models as standard Docker images.

Open SourceSelf HostedOffline
Easy
0.0 (0)

Local AI API platform that runs LLMs on your hardware with OpenAI-compatible API.

Open SourceSelf HostedOffline
Easy
0.0 (0)

Production model serving system for TensorFlow models.

Open SourceSelf HostedOffline
Intermediate
0.0 (0)

PyTorch model serving framework for production deployment.

Open SourceSelf HostedOffline
Intermediate
0.0 (0)

Open-source ML deployment platform for Kubernetes.

Open SourceSelf HostedOffline
Intermediate
0.0 (0)
Browse all AI Deployment & MLOps tools