Tools/Vector Databases & Embeddings/Infinity Embedding Server

Infinity Embedding Server

Fast embedding inference server supporting many embedding models.

Open SourceSelf HostedOffline Capable
0.0 (0)

About

Infinity is a high-throughput, low-latency REST API for serving text embedding and reranking models, with added support for CLIP, CLAP, and ColPali multimodal embeddings. It offers dynamic batching, caching, and an OpenAI-compatible interface, and serves many embedding models from Hugging Face. It installs via pip and runs from a CLI for production deployment. Released under the MIT license.

Reviews (0)

Leave a Review

No reviews yet. Be the first to review!

Details

Price
Free
Platform
Local/Desktop
Difficulty
Easy (2/5)
License
MIT
Added
Apr 3, 2026

Related Tools

Python client library for Qdrant vector database.

Open SourceSelf HostedOffline
Beginner
0.0 (0)

Approximate nearest neighbor library by Spotify optimized for memory usage.

Open SourceSelf HostedOffline
Easy
0.0 (0)
Featured

Efficient similarity search library by Meta for dense vector clustering and retrieval.

Open SourceSelf HostedOffline
Intermediate
0.0 (0)

Open-source big data serving engine with built-in vector search and ML inference.

Open SourceSelf HostedOffline
Advanced
0.0 (0)
Featured

Open-source vector similarity search extension for PostgreSQL.

Open SourceSelf HostedOffline
Easy
0.0 (0)

End-to-end vector search engine with built-in model inference.

Open SourceSelf HostedOffline
Easy
0.0 (0)
Browse all Vector Databases & Embeddings tools