💡 Studio-Quality Voiceovers   🔥 100% Free to Start

Speakatoo - Speech to Text API Documentation

Speakatoo Speech-to-Text API Documentation

This document provides full integration details for the Speakatoo Speech‑to‑Text API.

Overview

The Speakatoo Speech-to-Text API enables developers to upload audio files and receive accurate transcriptions. It supports multiple languages, minute-based usage deduction, and full logging into the user's dashboard.

This API is designed for real-time usage without queue or cron dependency.

Authentication

Speakatoo uses API-key–based authentication.

Include your API key in the header of every request:

X-API-KEY: your_api_key_here

Failure to include this header will result in a 403 error.

Endpoint

POST

https://www.speakatoo.com/api/v1/speech_to_text

Method: multipart/form-data

Request Body Parameters

Required:

- file: Audio file to be transcribed

- file_language: Language code (ISO format)

- response_format: text | json

Optional:

- prompt: Provide context or instructions to improve transcription accuracy

POSTMAN SCREENSHOT

Speech to Text API

Supported Languages (Sample List)

Below is a sample list of supported languages:

Language Code Language Name
afAfrikaans
arArabic
hyArmenian
azAzerbaijani
beBelarusian
bsBosnian
bgBulgarian
caCatalan, Valencian
zhChinese
hrCroatian
csCzech
daDanish
nlDutch, Flemish
enEnglish
etEstonian
fiFinnish
frFrench
glGalician
deGerman
elGreek, Modern
heHebrew
hiHindi
huHungarian
isIcelandic
idIndonesian
itItalian
jaJapanese
knKannada
kkKazakh
koKorean
lvLatvian
ltLithuanian
mkMacedonian
msMalay
mrMarathi
miMaori
neNepali
noNorwegian
faPersian
plPolish
ptPortuguese
roRomanian, Moldavian, Moldovan
ruRussian
srSerbian
skSlovak
slSlovenian
esSpanish, Castilian
swSwahili
svSwedish
tlTagalog
taTamil
thThai
trTurkish
ukUkrainian
urUrdu
viVietnamese
cyWelsh

For more language support please get in touch with our support team.

Note: Additional languages may work depending on input quality.

Supported Audio File Formats

The following formats are accepted: mp3, mp4, wav, flac, ogg, m4a, mpeg, webm.

Response Formats

Supported response formats:

- text → plain text transcription

- json → JSON structure { text: "..." }

Example success response:
{
  "status": true,
  "message": "Transcription generated successfully.",
  "data": {
    "file_language": "en",
    "response_format": "text",
    "transcripted_text": "Hello world",
    "transcripted_text_length": 11
  }
}
Example failure:
{
  "status": false,
  "message": "Insufficient Speech-to-Text minutes. Please recharge."
}

Error Codes

Status Code Message
400 Missing parameters
401 Unauthorized (not used normally)
402 Insufficient balance
403 Invalid or missing API key
404 Endpoint not found
500 Internal server error
502 Upstream processing error

Dashboard Logging Behavior

Each API call logs a detailed record in the user dashboard including:

  • - File path
  • - File language
  • - Duration (minutes consumed)
  • - Response format
  • - Raw payload
  • - Transcript length
  • - Success/Failed status

Best Practices

  • ✔ Compress audio before uploading, maximum size allowed over API – 20MB.
  • ✔ Keep duration under 30 minutes for best performance
  • ✔ Always check balance before high-volume jobs
  • ✔ Use stable internet connection for large uploads
  • ✔ Store logs to reprocess if needed

Support

For assistance, email: support@speakatoo.com

Website: https://www.speakatoo.com

AI Support Live