Featured Tool

BentoML

Framework for building production-ready AI application services.

Open Source, Self Hosted, Offline Capable

About

BentoML is a Python framework for building model inference APIs and multi-model serving systems from models built with any ML framework. It packages each model with its dependencies into a standard unit, exposes it as an online service with request batching and GPU support, and containerizes it for deployment anywhere. The aim is reliable, cost-efficient AI services. BentoML is released under the Apache 2.0 license.
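
To make the request batching idea concrete, here is a minimal, library-free sketch in plain Python. It is not BentoML's API or its implementation; the names `run_batched`, `toy_model`, and `max_batch_size` are hypothetical and chosen only for illustration. The point is that several incoming requests are grouped so the model is called once per batch rather than once per request.

```python
from typing import Callable, List, Sequence


def run_batched(
    requests: Sequence[str],
    predict_batch: Callable[[List[str]], List[int]],
    max_batch_size: int,
) -> List[int]:
    """Group requests into batches of at most max_batch_size items
    and call the model once per batch (hypothetical helper)."""
    if max_batch_size < 1:
        raise ValueError("max_batch_size must be at least 1")
    results: List[int] = []
    for start in range(0, len(requests), max_batch_size):
        batch = list(requests[start:start + max_batch_size])
        results.extend(predict_batch(batch))
    return results


# Records the size of every batch the toy model receives.
calls: List[int] = []


def toy_model(batch: List[str]) -> List[int]:
    """Stand-in for a real model: returns the length of each input."""
    calls.append(len(batch))
    return [len(text) for text in batch]


outputs = run_batched(["a", "bb", "ccc", "dddd", "eeeee"], toy_model, max_batch_size=2)
print(outputs)  # one result per request, in request order
print(calls)    # sizes of the batches passed to the model
```

Five requests with a batch size of 2 result in three model calls instead of five. A serving framework would typically also bound how long a request waits for a batch to fill; that timing logic is left out of this sketch.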

Details

Price: Free
Platform: Local/Desktop
Difficulty: Easy (2/5)
License: Apache-2.0
Added: Apr 3, 2026

Related Tools

Container tool by Replicate for packaging ML models as standard Docker images.

Open Source, Self Hosted, Offline
Easy

Local AI API platform that runs LLMs on your hardware with OpenAI-compatible API.

Open Source, Self Hosted, Offline
Easy

Production model serving system for TensorFlow models.

Open Source, Self Hosted, Offline
Intermediate

PyTorch model serving framework for production deployment.

Open Source, Self Hosted, Offline
Intermediate
Featured

NVIDIA inference serving platform for deploying AI models at scale.

Open Source, Self Hosted, Offline, GPU 8GB+
Advanced

Open-source ML deployment platform for Kubernetes.

Open Source, Self Hosted, Offline
Intermediate