WhisperAPI.com Transcription API

Powered by Whisper v3

Speaker diarization - Automatically detect who is speaking.

Just $0.50 per 3 hours of speech - Lowest price on the market.

The WhisperAPI.com transcription API is a high-quality speech-to-text API powered by the Whisper v3 model. It is the same technology as our Speech-to-Text API, but hosted on WhisperAPI.com.

When starting from scratch, we recommend using the Speech-to-Text API as it is more feature-rich and has a more user-friendly interface.

API Usage

If you haven't already, you will need to create an API key to authenticate your requests.

const body = new FormData();
body.append('url', 'https://output.lemonfox.ai/brownfox.mp3');
// instead of providing a URL you can also upload a file object:
// body.append('file', new Blob([await fs.readFile('/path/to/audio.mp3')]));
body.append('language', 'english');

fetch('https://transcribe.whisperapi.com', {
  method: 'POST',
  headers: {
    'Authorization': 'Bearer YOUR_API_KEY'
  },
  body: body
})
.then(response => response.json()).then(data => {
  console.log(data);
})

API Response

{
  "language": "en",
  "text": "The quick brown fox jumps over the lazy dog.",
  "segments": [
    {
      "start": 0,
      "end": 2.4,
      "text": " The quick brown fox jumps over the lazy dog.",
      "whole_word_timestamps": [
        {"word": "The", "start": 0, "end": 0.16, "timestamp": 0.16, "probability": 0.77197265625},
        {"word": "quick", "start": 0.16, "end": 0.34, "timestamp": 0.34, "probability": 0.90283203125},
        {"word": "brown", "start": 0.34, "end": 0.64, "timestamp": 0.64, "probability": 0.8623046875},
        {"word": "fox", "start": 0.64, "end": 0.98, "timestamp": 0.98, "probability": 0.982421875},
        {"word": "jumps", "start": 0.98, "end": 1.32, "timestamp": 1.32, "probability": 0.99658203125},
        {"word": "over", "start": 1.32, "end": 1.64, "timestamp": 1.64, "probability": 0.99951171875},
        {"word": "the", "start": 1.64, "end": 1.78, "timestamp": 1.78, "probability": 0.98974609375},
        {"word": "lazy", "start": 1.78, "end": 2.02, "timestamp": 2.02, "probability": 0.9716796875},
        {"word": "dog.", "start": 2.02, "end": 2.4, "timestamp": 2.4, "probability": 0.994140625}
      ]
    }
  ],
  "diarization": [
    {
      "startTime": 0.5034129692832765,
      "stopTime": 2.6194539249146755,
      "speaker": "SPEAKER_00"
    }
  ]
}

API Parameters

The API POST https://transcribe.whisperapi.com takes the following parameters:

file

File object

url

URL

diarization

boolean, optional

numSpeakers

integer, optional

language

string, optional

task

string, optional

initialPrompt

string, optional

callbackURL

string, optional

Start Your Free Trial

Lemonfox.ai

WhisperAPI.com Transcription API

Powered by Whisper v3

Speaker diarization - Automatically detect who is speaking.

API Usage

API Response

API Parameters