InternVL 2.5

Open-source multimodal LLM competitive with commercial models by Shanghai AI Lab.

Open SourceSelf HostedOffline CapableGPU Required (12GB+ VRAM)
0.0 (0)

About

InternVL 2.5 by Shanghai AI Laboratory is an open-source multimodal LLM that pairs a vision encoder with a language model for image and document understanding. It performs strongly on visual question answering, OCR, chart and table reading, and multi-image reasoning, and is released in several sizes to fit different hardware. Later InternVL3 versions extended the line. Released under the MIT license with weights on Hugging Face.

Reviews (0)

Leave a Review

No reviews yet. Be the first to review!

Details

Price
Free
Platform
Local/Desktop
Difficulty
Intermediate (3/5)
License
MIT
Minimum VRAM
12 GB
Added
Apr 3, 2026

Related Tools

Open-weight models by Google in 2B, 9B, and 27B sizes with strong performance.

Open SourceSelf HostedOfflineGPU 4GB+
Easy
0.0 (0)

Open-weight LLM by Meta in 8B and 70B sizes with strong general capabilities.

Open SourceSelf HostedOfflineGPU 8GB+
Intermediate
0.0 (0)
Featured

Open-weight LLM family by Alibaba with strong multilingual and coding abilities.

Open SourceSelf HostedOfflineGPU 8GB+
Intermediate
0.0 (0)

Small language model by Microsoft in 3.8B size with strong benchmark performance.

Open SourceSelf HostedOfflineGPU 4GB+
Easy
0.0 (0)

Open-access 176B parameter multilingual LLM by BigScience supporting 46 languages.

Open SourceSelf HostedOfflineGPU 80GB+
Expert
0.0 (0)

Retrieval-augmented generation optimized LLM by Cohere with 128K context.

Open SourceSelf HostedOfflineGPU 24GB+
Advanced
0.0 (0)
Browse all Large Language Models (LLMs) tools