Speech to Text
Supported Audio Formats
The Speech-to-Text solution supports the following audio formats:
Audio Format | Description |
---|---|
16k_int16 | Default format: Signed 16-bit, 16 kHz sampling rate in WAV format |
16k_uint8 | Unsigned 8-bit, 16 kHz sampling rate in WAV format |
8k_int16 | Signed 16-bit, 8 kHz sampling rate in WAV format |
8k_uint8 | Unsigned 8-bit, 8 kHz sampling rate in WAV format |
opus_16k | Opus-encoded audio frames, 16 kHz sampling rate |
opus_8k | Opus-encoded audio frames, 8 kHz sampling rate |
ogg_opus | Opus-encoded audio frames in Ogg container |
16k_ulaw | µ-Law audio frames, 16 kHz sampling rate |
8k_ulaw | µ-Law audio frames, 8 kHz sampling rate |
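
Before sending a WAV file, it can help to check which of the PCM rows in the table it matches. The following is a minimal sketch using only Python's standard `wave` module; it is not part of the Speech-to-Text API. The names `PCM_FORMATS`, `pcm_format_name`, and `make_silent_wav` are hypothetical helpers introduced for illustration. The sketch assumes that 8-bit WAV samples are unsigned and 16-bit samples are signed (the usual WAV convention), and it does not inspect the channel count, which the table does not specify.

```python
import io
import wave

# Hypothetical lookup built from the PCM rows of the table:
# (sampling rate in Hz, bytes per sample) -> format name.
PCM_FORMATS = {
    (16000, 2): "16k_int16",
    (16000, 1): "16k_uint8",
    (8000, 2): "8k_int16",
    (8000, 1): "8k_uint8",
}


def pcm_format_name(wav_file):
    """Return the matching format name for a PCM WAV file, or None.

    `wav_file` may be a filesystem path or a binary file-like object.
    """
    with wave.open(wav_file, "rb") as wav:
        key = (wav.getframerate(), wav.getsampwidth())
    return PCM_FORMATS.get(key)


def make_silent_wav(rate, sample_width, seconds=0.1):
    """Build a short mono WAV of silence in memory (for demonstration only)."""
    buf = io.BytesIO()
    with wave.open(buf, "wb") as wav:
        wav.setnchannels(1)
        wav.setsampwidth(sample_width)
        wav.setframerate(rate)
        # Unsigned 8-bit silence sits at the midpoint 0x80; signed 16-bit silence is zero.
        silence = b"\x80" if sample_width == 1 else b"\x00" * sample_width
        wav.writeframes(silence * int(rate * seconds))
    buf.seek(0)
    return buf


if __name__ == "__main__":
    print(pcm_format_name(make_silent_wav(16000, 2)))
    print(pcm_format_name(make_silent_wav(44100, 2)))
```

A `None` result means the file would need resampling or re-encoding to match one of the listed PCM formats. The Opus and µ-Law rows are outside the scope of this check, since the `wave` module reads only uncompressed PCM.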