LLaVA
Visual instruction tuned multimodal LLM for image understanding and chat.
Open Source | Self Hosted | Offline Capable | GPU Required (8GB+ VRAM)
About
LLaVA (Large Language and Vision Assistant) is a multimodal model that combines a vision encoder with an LLM for visual chat. It understands images, charts, documents, and screenshots, and is available in 7B and 13B sizes. It is released under the Apache 2.0 license.
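As a rough illustration of how the 7B and 13B sizes relate to the 8 GB VRAM figure, the sketch below estimates the memory needed for the language-model weights alone at a few common precisions. It assumes 2, 1, and 0.5 bytes per parameter for fp16, int8, and int4, and it ignores the vision encoder, KV cache, and activations. The function names and the 1.5 GB headroom value are hypothetical choices for illustration, not figures from the LLaVA project.

```python
# Back-of-the-envelope estimate of LLM weight memory.
# Assumption: bytes per parameter for each precision (weights only).
BYTES_PER_PARAM = {"fp16": 2.0, "int8": 1.0, "int4": 0.5}


def weight_memory_gb(params_billions: float, precision: str) -> float:
    """Approximate memory for the weights alone, in GB (10^9 bytes)."""
    return params_billions * BYTES_PER_PARAM[precision]


def fits(params_billions: float, precision: str, vram_gb: float,
         headroom_gb: float = 1.5) -> bool:
    """True if weights plus a hypothetical headroom allowance fit in vram_gb.

    headroom_gb is an illustrative placeholder for everything this sketch
    ignores (vision encoder, KV cache, activations), not a measured value.
    """
    return weight_memory_gb(params_billions, precision) + headroom_gb <= vram_gb


# Print the estimate for both model sizes against an 8 GB card.
for size in (7, 13):
    for precision in BYTES_PER_PARAM:
        gb = weight_memory_gb(size, precision)
        print(f"{size}B {precision}: ~{gb:.1f} GB weights, "
              f"fits in 8 GB: {fits(size, precision, 8.0)}")
```

Under these assumptions, the 7B model needs about 14 GB for fp16 weights, so running it on an 8 GB card would likely depend on a quantized build. Actual usage varies with the runtime, context length, and image resolution.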
Details
- Category: Large Language Models (LLMs)
- Price: Free
- Platform: Local/Desktop
- Difficulty: Intermediate (3/5)
- License: Apache-2.0
- Minimum VRAM: 8 GB
- Added: Apr 3, 2026
Similar Tools
Featured
Open-weight LLM by Meta available in 8B, 70B, and 405B parameter sizes.
Open Source | Self Hosted | Offline | GPU 8GB+
Intermediate
Featured
Latest Llama model family by Meta with Mixture-of-Experts architecture.
Open Source | Self Hosted | Offline | GPU 16GB+
Advanced
Featured
High-performance open-weight LLMs by Mistral AI with MoE architecture.
Open Source | Self Hosted | Offline | GPU 8GB+
Intermediate