SigLIP

Improved vision-language model by Google using sigmoid loss for contrastive learning.

Open SourceSelf HostedOffline CapableGPU Required (4GB+ VRAM)
0.0 (0)

About

SigLIP by Google replaces the softmax loss in CLIP with a sigmoid loss, enabling better scaling and performance. Produces strong image-text embeddings for zero-shot classification and retrieval. Available via Hugging Face. Apache 2.0 license.

Reviews (0)

Leave a Review

No reviews yet. Be the first to review!

Details

Price
Free
Platform
Local/Desktop
Difficulty
Intermediate (3/5)
License
Apache-2.0
Minimum VRAM
4 GB
Added
Apr 3, 2026

Similar Tools

Featured

State-of-the-art real-time object detection supporting YOLOv5 through v11.

Open SourceSelf HostedOffline
Easy
0.0 (0)

Open-vocabulary real-time object detection using YOLO with text prompts.

Open SourceSelf HostedOfflineGPU 4GB+
Intermediate
0.0 (0)
Featured

Open-set object detection combining DINO with grounded pre-training.

Open SourceSelf HostedOfflineGPU 4GB+
Intermediate
0.0 (0)