Audio dictation converts provider speech into text while a session is active. Providers can dictate during a visit or into a field in your application while your product receives partial and final transcript text. Dictation is different from ambient clinical documentation, which transcribes the visit and generates structured clinical notes, LOINC-based sections, and related outputs. Dictation gives you speech-to-text you can route into your own UI, EHR fields, or downstream workflows.
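One way to handle the partial and final transcript text described above is to keep finalized segments and let each new partial result overwrite the previous one. The sketch below is illustrative only: `TranscriptEvent`, its `text` and `is_final` fields, and `TranscriptBuffer` are hypothetical names chosen for this example, not the Suki payload or SDK types, so check the API references for the real message shape.

```python
from __future__ import annotations

from dataclasses import dataclass, field


@dataclass
class TranscriptEvent:
    """Hypothetical event shape; the real transcript payload may differ."""
    text: str
    is_final: bool


@dataclass
class TranscriptBuffer:
    """Keeps finalized segments plus the most recent partial segment."""
    final_segments: list[str] = field(default_factory=list)
    partial: str = ""

    def apply(self, event: TranscriptEvent) -> None:
        if event.is_final:
            # A final result commits the segment and clears the pending partial.
            self.final_segments.append(event.text.strip())
            self.partial = ""
        else:
            # Each partial result supersedes the previous partial.
            self.partial = event.text.strip()

    def display_text(self) -> str:
        # Show committed text followed by the in-progress partial, if any.
        parts = self.final_segments + ([self.partial] if self.partial else [])
        return " ".join(p for p in parts if p)
```

An application could call `apply` for each incoming event and render `display_text()` in an editable field, saving only `final_segments` once the session ends.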

When to use dictation

Choose dictation when you need:
  • Real-time transcription without generating a full ambient clinical note
  • Transcript text you control in your application (display, edit, save, or send elsewhere)
  • Browser-based dictation with Suki-managed UI (Web SDK or Dictation SDK), or full control with REST and WebSocket APIs

Choose ambient clinical documentation when you need visit capture, note generation, structured clinical data, and related ambient outputs.

Audio Dictation

Capabilities, common use cases, how dictation works, and how to choose APIs, Web SDK, or Dictation SDK.

Basic Usage

Step-by-step Partner API workflow: create a session, stream audio, receive transcripts, and end the session.

Dictation Audio Streaming

WebSocket handshake, wire format, and message order for GET /ws/transcribe.

Refer to these guides when you already know your integration path and want to dive deeper:

Dictation API References

REST and WebSocket references for transcription sessions.

Web SDK for Audio Dictation

JavaScript and React dictation with Suki Web SDK packages.

Dictation SDK

Hosted iframe dictation with Suki Dictation SDK packages.
This SDK is in the Beta phase of development.

Next steps

Complete the Partner onboarding process and get your credentials.
Last modified on May 22, 2026