AI speech recognition model for transcribing and translating audio, useful for developing applications requiring speech-to-text capabilities.
Whisper is an automatic speech recognition (ASR) system trained on 680,000 hours of multilingual and multitask supervised data collected from the web.
Open-source; API pricing at $0.006 per minute