TEI (Text Embeddings Inference)
High-performance embedding server by Hugging Face for production deployment.
About
Text Embeddings Inference by Hugging Face is a server for deploying text embedding and sequence-classification models with high throughput. It supports dynamic batching, GPU acceleration, and many model families including FlagEmbedding, GTE, E5, and BERT-style architectures, and serves rerankers as well. It is production-ready with a Docker deployment. Released under the Apache 2.0 license.
Reviews (0)
Leave a Review
No reviews yet. Be the first to review!
Details
- Category
- Vector Databases & Embeddings
- Price
- Free
- Platform
- Local/Desktop
- Difficulty
- Easy (2/5)
- License
- Apache-2.0
- Minimum VRAM
- 4 GB
- Added
- Apr 3, 2026
Related Tools
Python client library for Qdrant vector database.
Approximate nearest neighbor library by Spotify optimized for memory usage.
Efficient similarity search library by Meta for dense vector clustering and retrieval.
Open-source big data serving engine with built-in vector search and ML inference.
Open-source vector similarity search extension for PostgreSQL.
End-to-end vector search engine with built-in model inference.