Other Endpoints
REST with Client auth
Convert speech to text using client-side authentication
POST
Convert audio to text with client-side JWT authentication. This endpoint is identical to the standardDocumentation Index
Fetch the complete documentation index at: https://api-docs.wisprflow.ai/llms.txt
Use this file to discover all available pages before exploring further.
/api endpoint but uses a client token (JWT) instead of an org-level API key.
Request Body
Base64 encoded, 16kHz wav audio. Maximum size is 25MB or 6 minutes of audio.
Optional list of (ISO 639-1) language codes that the user is expected to speak.Setting the list size to 1 forces the transcription into the specified language. Not providing an input attempts autodetection on full list of languages (less accurate).
Optional contextual information about the circumstances surrounding the user dictation.Flow can use these information to make its output more accurate by for example, getting names right, resolving speech ambiguities, etc.All properties are optional and will use default values if not provided.
Authorizations
Client-side token (format: Bearer <JWT>) for calling client endpoints like /client_api.
Body
application/json
Previous
Overview: Warm up (REST)The Warm-up REST API helps optimize latency and ensure smooth dictation sessions by maintaining a warm connection to Flow's servers.
Next