Usage scenarios
- You need speech-to-text without the full ambient clinical note flow.
- You want to create a transcription session, open a WebSocket to stream audio, then end the session when capture is finished so resources can close cleanly.
How it works
The dictation APIs work in four steps:Authenticate
sdp_suki_token. Refer to Provider authentication and Partner authentication.Create a transcription session
transcription_session_id.Open a WebSocket to stream audio
READY or IDLE. Refer to Dictation streaming for outbound audio messages and inbound transcript frames.End the session when capture is finished