First month for free!

Get started

  Overview

WhisperAPI.com Transcription API

Powered by Whisper v3

Speaker diarization - Automatically detect who is speaking.

Just $0.50 per 3 hours of speech - Lowest price on the market.

The WhisperAPI.com transcription API is a high-quality speech-to-text API powered by the Whisper v3 model. It is the same technology as our Speech-to-Text API, but hosted on WhisperAPI.com.

When starting from scratch, we recommend using the Speech-to-Text API as it is more feature-rich and has a more user-friendly interface.

API Usage

If you haven't already, you will need to create an API key to authenticate your requests.

const body = new FormData();
body.append('url', 'https://output.lemonfox.ai/brownfox.mp3');
// instead of providing a URL you can also upload a file object:
// body.append('file', new Blob([await fs.readFile('/path/to/audio.mp3')]));
body.append('language', 'english');

fetch('https://transcribe.whisperapi.com', {
  method: 'POST',
  headers: {
    'Authorization': 'Bearer YOUR_API_KEY'
  },
  body: body
})
.then(response => response.json()).then(data => {
  console.log(data);
})

API Response

{
  "language": "en",
  "text": "The quick brown fox jumps over the lazy dog.",
  "segments": [
    {
      "start": 0,
      "end": 2.4,
      "text": " The quick brown fox jumps over the lazy dog.",
      "whole_word_timestamps": [
        {"word": "The", "start": 0, "end": 0.16, "timestamp": 0.16, "probability": 0.77197265625},
        {"word": "quick", "start": 0.16, "end": 0.34, "timestamp": 0.34, "probability": 0.90283203125},
        {"word": "brown", "start": 0.34, "end": 0.64, "timestamp": 0.64, "probability": 0.8623046875},
        {"word": "fox", "start": 0.64, "end": 0.98, "timestamp": 0.98, "probability": 0.982421875},
        {"word": "jumps", "start": 0.98, "end": 1.32, "timestamp": 1.32, "probability": 0.99658203125},
        {"word": "over", "start": 1.32, "end": 1.64, "timestamp": 1.64, "probability": 0.99951171875},
        {"word": "the", "start": 1.64, "end": 1.78, "timestamp": 1.78, "probability": 0.98974609375},
        {"word": "lazy", "start": 1.78, "end": 2.02, "timestamp": 2.02, "probability": 0.9716796875},
        {"word": "dog.", "start": 2.02, "end": 2.4, "timestamp": 2.4, "probability": 0.994140625}
      ]
    }
  ],
  "diarization": [
    {
      "startTime": 0.5034129692832765,
      "stopTime": 2.6194539249146755,
      "speaker": "SPEAKER_00"
    }
  ]
}

API Parameters

The API POST https://transcribe.whisperapi.com takes the following parameters:

Start Your Free Trial