OpenAI Silently Unveils Whisper 3, A New Generation Open Source ASR Model

During its inaugural Developer Day, AI startup OpenAI released a series of open-source models. The slew of products included an upgraded version of its open-source automatic speech recognition model, Whisper large-v3. The company’s future plans involve making the model’s API accessible to users.

The models for English-only applications tend to perform better, especially for the `tiny.en` and `base.en` models as per the official page. The model’s performance varies widely depending on the language.

(Source: OpenAI)

Initially focused on English, the neural net model was released in September last year. Then it got an upgraded version 2 in December which was enhanced to support multiple languages, although specific languages were not explicitly mentioned.

Accessible on GitHub under a permissive license, Whisper large-v3 effortlessly transcribes various content for users and has been called the best transcription tool out there. The model features a unique timestamp section that facilitates its application as subtitles on platforms such as YouTube.

The tool initiates the process by segmenting audio into 30-second clips, converting them, and subsequently passing them through an encoder and decoder, which predict the corresponding text caption. Technical intricacies also involve language identification, facilitating multilingual speech transcription, and translation to English.

The model was initially expected to be integrated with ChatGPT, to let the users converse directly with the chatbot through speech. But OpenAI then decided to release the model to the public directly. Interestingly, Whisper is not aimed at the end users as of now but rather at researchers.

The reason for open-sourcing as per OpenAI was to “serve as a foundation for building useful applications and for further research on robust speech processing“. OpenAI’s AI tool was honed using an extensive dataset of 680,000 hours of meticulously supervised data sourced from the internet, with one third portion originating from non-English sources.

The post OpenAI Silently Unveils Whisper 3, A New Generation Open Source ASR Model appeared first on Analytics India Magazine.

Previous News

Sequretek Secures $8 Mn Funding To Fuel Expansion In The US And Indian Markets

Next News

Flipkart Cofounder Binny Bansal To Launch AI Startup

OpenAI Silently Unveils Whisper 3, A New Generation Open Source ASR Model

Disclaimer

Popular

The Best Smart Home Accessories to Boost Your Curb Appeal (2026)

iPhones to Get These New Satellite Features

The lobster is loose, and it’s not going back: Peter Steinberger on building OpenClaw at TED 2026

Kiwi cofounder Mohit Bedi steps down from executive role after four years

Razer Pro Type Ergo Review: Effortless Ergonomics With Gaming Roots

More Like this

Where to Shop for Vinyl Records Online (2026): Discogs, Bandcamp, Ebay

Double Dazzle: This Weekend, There Are 2 Meteor Showers in the Night Sky

It’s Peak Birding Season. Here’s the Tech I Use to Find Birds and Take Better Photos

Nvidia RTX 5070 Ti gaming laptop is on sale for 19% off — MSI’s Vector 16 has a 144 Hz screen, and comes with...

FSF to OnlyOffice: You Can’t Use the GNU (A)GPL to Take Software Freedom Away

Forget Photos and Maps, this is the Google app I can’t live without anymore

OpenAI Silently Unveils Whisper 3, A New Generation Open Source ASR Model

Disclaimer

More like this

Where to Shop for Vinyl Records Online (2026): Discogs,...

Double Dazzle: This Weekend, There Are 2 Meteor Showers...

It’s Peak Birding Season. Here’s the Tech I Use...

Popular

Block title

Never lose anything again with these Find My accessories

OpenAI Executive Kevin Weil Is Leaving the Company

Canyon Spectral:ON CF 8 Electric Mountain Bike: Beginner-Friendly, Under $5K

Today’s NYT Mini Crossword Answers for April 13

Here’s How Researchers Stole $10,000 From MKBHD’s Locked iPhone

Get Ready For ‘Godzilla Minus Zero’ By Watching All the Kaiju’s Movies in Order

You Should Be More Freaked Out by Shingles

Startup Events

Trending News

Where to Shop for Vinyl Records Online (2026): Discogs, Bandcamp, Ebay

Double Dazzle: This Weekend, There Are 2 Meteor Showers in the Night Sky

It’s Peak Birding Season. Here’s the Tech I Use to Find Birds and Take Better Photos

Nvidia RTX 5070 Ti gaming laptop is on sale for 19% off — MSI’s Vector 16 has a 144 Hz screen, and comes with...

FSF to OnlyOffice: You Can’t Use the GNU (A)GPL to Take Software Freedom Away

About

Partnership

Contact us