Openai Whisper Model Architecture Diagram

News

OpenAI Releases 1.6 Billion Parameter Multilingual Speech Recognition AI Whisper - InfoQ

OpenAI recently released Whisper, a 1.6 billion parameter AI model that can transcribe and translate speech audio from 97 different languages. Whisper was trained on 680,000 hours of audio data collec ...

VentureBeat2y

How will OpenAI’s Whisper model impact AI applications?

Last week, OpenAI released Whisper, an open-source deep learning model for speech recognition. OpenAI’s tests on Whisper show promising results in transcribing audio not only in English, but ...

6don MSN

Apple’s new AI-powered transcription outperforms OpenAI’s Whisper

Apple has several under-the-hood AI improvements in the works for iOS 26 and macOS Tahoe, including a powerful transcription ...

Apple Intelligence transcription is twice as fast as OpenAI's Whisper

Newly released to developers, Apple Intelligence's transcription tools are fast, accurate, and typically double the speed of ...

Tom's Guide7mon

OpenAI's Whisper model is reportedly 'hallucinating' in high-risk situations - Tom's Guide

Researchers have found that OpenAI's audio-powered transcription tool, Whisper, is inventing things that were never said with potentially dangerous consequences, according to a new report. As per ...

VentureBeat10mon

aiOla drops ultra-fast ‘multi-head’ speech recognition model, beats OpenAI Whisper

the model builds on Whisper but uses a novel “multi-head attention” architecture that predicts far more tokens at a time than the OpenAI offering. Its code and weights have been released on ...

Geeky Gadgets8mon

OpenAI Whisper Turbo: Advanced Speech Transcription - Geeky Gadgets

At the heart of Whisper Turbo lies its sophisticated Transformer model architecture, enhanced by a convolutional neural network encoder. This framework operates by: 1.

Business Insider5mon

ChatGPT isn't the only cool AI tool made by OpenAI — check out Sora, DALL-E, and more

OpenAI is the startup behind the viral ... on detailed text descriptions and Sora creates videos. Whisper is a speech-recognition model that can transcribe and translate audio from many languages.

The Verge7mon

Hospitals use a transcription tool powered by an error-prone OpenAI model - The Verge

Hospitals routinely use a tool powered by OpenAI’s Whisper transcription model, which researchers find can hallucinate entire passages during periods of silence. More than 30,000 clinicians and ...

Engadget7mon

OpenAI's Whisper invents parts of transcriptions — a lot - Engadget

In one case, Whisper invented that three people discussed were Black. In another, Whisper changed "He, the boy, was going to, I’m not sure exactly, take the umbrella." to "He took a big piece of ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results