Audio dictation converts provider speech into text while a session is active. Providers can dictate during a visit or into a field in your application while your product receives partial and final transcript text. Dictation is different from ambient clinical documentation, which transcribes the visit and generates structured clinical notes, LOINC-based sections, and related outputs. Dictation gives you speech-to-text you can route into your own UI, EHR fields, or downstream workflows.Documentation Index
Fetch the complete documentation index at: https://developer.suki.ai/llms.txt
Use this file to discover all available pages before exploring further.
When to use dictation
Choose dictation when you need:- Real-time transcription without generating a full ambient clinical note
- Transcript text you control in your application (display, edit, save, or send elsewhere)
- Browser-based dictation with Suki-managed UI (Web SDK or Dictation SDK), or full control with REST and WebSocket APIs
Related guides
Audio Dictation
Capabilities, common use cases, how dictation works, and how to choose APIs, Web SDK, or Dictation SDK.
Basic Usage
Step-by-step Partner API workflow: create a session, stream audio, receive transcripts, and end the session.
Dictation Audio Streaming
WebSocket handshake, wire format, and message order for
GET /ws/transcribe.Related SDKs and APIs documentation
Refer to these guides when you already know your integration path and want to dive deeper:Dictation API References
REST and WebSocket references for transcription sessions.
Web SDK for Audio Dictation
JavaScript and React dictation with Suki Web SDK packages.
Dictation SDK
Hosted iframe dictation with Suki Dictation SDK packages.
This SDK is in Beta phase of development.