AI Study Online
🎙

Whisper Large v3

Open Source

OpenAI · 2024-09

OpenAI's state-of-the-art speech recognition model with multilingual transcription at high accuracy.

Visit Website

Quick Facts

Parameters

~1.55B

Context Window

N/A

Modalities

audio, text

Open Source

Yes

License

MIT

Pricing

Free (open-source) / API from $0.006/minute

Released

2024-09

Developer

OpenAI

About

Whisper Large v3 is OpenAI's most advanced automatic speech recognition (ASR) model, capable of transcribing and translating audio in 99+ languages with near-human accuracy. Built on a large-scale weakly supervised training approach, it handles diverse audio conditions including background noise, multiple speakers, and various accents. Whisper Large v3 supports multilingual transcription, direct translation to English, language identification, and timestamp generation. Available as an open-weight model under the MIT license, it can be run locally for privacy-sensitive applications or accessed through OpenAI's API for scalable deployment.

Strengths

  • +Near-human accuracy across 99+ languages
  • +Open-source with permissive MIT license
  • +Runs locally for privacy-sensitive use cases
  • +Handles noise, multiple speakers, and various accents

Weaknesses

  • Large model requires significant compute for real-time
  • Less accurate on very specialized domain terminology
  • No speaker diarization built-in

Best For

Multilingual audio transcription at scale

Privacy-sensitive speech processing

Content accessibility and subtitling

Voice-controlled applications and assistants

Pricing

Open Source

$0

  • Full model weights
  • Local execution
  • Unlimited use
  • Full privacy

API

From $0.006/minute

  • Scalable deployment
  • No GPU needed
  • 99+ languages
  • Translation

Benchmarks

BenchmarkWhisper Large v3Competitor
Common Voice15.1% WERGoogle Speech: 18.2% WER

Technical Specs

Parameters

~1.55B

Context Window

N/A

Modalities

audio, text

Languages

EnglishChineseSpanishArabicFrench+4

Open Source

Yes

License

MIT

Developer

OpenAI

Released: 2024-09

Share this article

Related Models