Cortex
Local AI API platform that runs LLMs on your hardware with OpenAI-compatible API.
About
Cortex by Jan AI is a local AI platform that runs language and other models on your own hardware behind an OpenAI-compatible API, with llama.cpp and ONNX backends and a model-pull command for the Hugging Face hub. It can run as a background service for integration into other applications. The repository is now archived and development has moved to the Menlo Research fork of llama.cpp. Apache 2.0 licensed.
Reviews (0)
Leave a Review
No reviews yet. Be the first to review!
Details
- Category
- AI Deployment & MLOps
- Price
- Free
- Platform
- Local/Desktop
- Difficulty
- Easy (2/5)
- License
- Apache-2.0
- Added
- Apr 3, 2026
Related Tools
Framework for building production-ready AI application services.
Container tool by Replicate for packaging ML models as standard Docker images.
Production model serving system for TensorFlow models.
PyTorch model serving framework for production deployment.
NVIDIA inference serving platform for deploying AI models at scale.
Open-source ML deployment platform for Kubernetes.