Pixtral

Multimodal vision-language model by Mistral AI for image understanding.

Open SourceSelf HostedOffline CapableGPU Required (12GB+ VRAM)
0.0 (0)

About

Pixtral by Mistral AI is a 12B multimodal model that understands images alongside text. Supports visual QA, chart reading, document understanding, and image reasoning. Natively multimodal architecture. Apache 2.0 license.

Reviews (0)

Leave a Review

No reviews yet. Be the first to review!

Details

Price
Free
Platform
Local/Desktop
Difficulty
Intermediate (3/5)
License
Apache-2.0
Minimum VRAM
12 GB
Added
Apr 3, 2026

Similar Tools

Featured

Open-weight LLM by Meta available in 8B, 70B, and 405B parameter sizes.

Open SourceSelf HostedOfflineGPU 8GB+
Intermediate
0.0 (0)
Featured

Latest Llama model family by Meta with Mixture-of-Experts architecture.

Open SourceSelf HostedOfflineGPU 16GB+
Advanced
0.0 (0)
Featured

High-performance open-weight LLMs by Mistral AI with MoE architecture.

Open SourceSelf HostedOfflineGPU 8GB+
Intermediate
0.0 (0)