AI Study Online
📷

Pixtral Large

Mistral AI · 2025

Mistral's vision-language model with 124B parameters for image understanding and generation.

Visit Website

Quick Facts

Parameters

124B

Context Window

128K tokens

Modalities

text, image

Open Source

No

Pricing

API from $3.00/1M tokens

Released

2025

Developer

Mistral AI

About

Pixtral Large is Mistral AI's flagship vision-language model with 124 billion parameters, capable of both understanding and generating content based on visual inputs. It seamlessly integrates image and text processing, enabling tasks like image captioning, visual question answering, document understanding, chart analysis, and multimodal reasoning. Pixtral Large builds on Mistral's architecture with specialized vision encoders and demonstrates strong performance on vision-language benchmarks. Available through Mistral's API platform for multimodal AI applications.

Strengths

  • +124B parameters for strong vision-language performance
  • +European language advantage in multimodal context
  • +Excellent document and chart understanding
  • +Competitive pricing vs GPT-4V alternatives

Weaknesses

  • No native image generation capability
  • Smaller ecosystem than OpenAI vision models
  • Limited third-party integrations

Best For

Multimodal document analysis and digitization

European multilingual vision-language applications

Chart, diagram, and technical drawing understanding

Visual Q&A in enterprise contexts

Pricing

API

From $3.00/1M tokens

  • Vision-language
  • 128K context
  • Document understanding
  • Chart analysis

Technical Specs

Parameters

124B

Context Window

128K tokens

Modalities

text, image

Languages

EnglishFrenchGermanSpanishItalian+3

Open Source

No

Developer

Mistral AI

Released: 2025

Share this article

Related Models