MiniMax

MiniMax provides text-to-speech (TTS) with multiple voice styles and low-latency streaming output.

Use a preset

To use MiniMax TTS with an Agora managed key, specify one of the following presets in the preset field when starting an agent:

  • minimax_speech_2_6_turbo
  • minimax_speech_2_8_turbo

When using a preset, you do not need to provide the API key, group ID, model, or endpoint URL. You can still use the tts field to configure additional settings such as voice_setting. To use a model not listed above, or to provide your own API key, see Sample configuration.

Sample configuration

The following example shows a starting tts parameter configuration you can use when you Start a conversational AI agent.


    "preset": "minimax_speech_2_8_turbo",
    "properties": {
      //...
      "tts": {
        "params": {
          "voice_setting": {
            "voice_id": "English_captivating_female1"
          }
        }
      }
    }
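
If you assemble the request body in code, the fragment above can be built programmatically. The following is a minimal Python sketch, not an official client: the helper name build_preset_fragment is hypothetical, it covers only the fields shown in the sample, and all other agent properties are left out.

```python
import json

# Presets listed on this page for MiniMax TTS with an Agora managed key.
MINIMAX_PRESETS = {"minimax_speech_2_6_turbo", "minimax_speech_2_8_turbo"}


def build_preset_fragment(preset: str, voice_id: str) -> dict:
    """Hypothetical helper: mirrors the sample fragment above, nothing more."""
    if preset not in MINIMAX_PRESETS:
        raise ValueError(f"unknown MiniMax preset: {preset!r}")
    return {
        "preset": preset,
        "properties": {
            # Other agent properties are omitted here, as in the sample.
            "tts": {"params": {"voice_setting": {"voice_id": voice_id}}},
        },
    }


fragment = build_preset_fragment(
    "minimax_speech_2_8_turbo", "English_captivating_female1"
)
print(json.dumps(fragment, indent=2))
```

Rejecting unknown preset names locally is a design choice of this sketch; it only catches typos against the two presets named on this page.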

caution

The parameters listed on this page are validated for use with Conversational AI Engine. Required parameters must be provided as documented. Any additional parameters are passed through directly to the underlying vendor without validation. For a full list of supported options, refer to the MiniMax documentation.

params (required)
  • key (string, required)

    The MiniMax API key used to authenticate requests. You must provide a valid key for the service to function.

  • group_id (string, required)

    The MiniMax group identifier associated with your account.

  • model (string, required)

    The TTS model to use. For example, speech-02-turbo.

  • voice_setting.voice_id (string, required)

    The voice style identifier to use. For example, English_captivating_female1.

  • url (string, required)

    The WebSocket endpoint for streaming TTS output. For example, wss://api-uw.minimax.io/ws/v1/t2a_v2.
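
When you supply your own credentials instead of a preset, all of the parameters above are required. The following Python sketch shows one way to check a params object locally before sending it. It is an illustration under stated assumptions: the function missing_params is hypothetical, the key and group_id values are placeholders, and the other values are the examples given on this page. It does not replace validation by the service.

```python
from __future__ import annotations

# Required parameters as documented above; dotted names denote nested keys.
REQUIRED_PARAMS = ("key", "group_id", "model", "voice_setting.voice_id", "url")


def missing_params(params: dict) -> list[str]:
    """Return dotted names of required parameters that are absent or empty."""
    missing = []
    for dotted in REQUIRED_PARAMS:
        node = params
        for part in dotted.split("."):
            # Walk one level down; stop resolving once a level is not a dict.
            node = node.get(part) if isinstance(node, dict) else None
        if not node:
            missing.append(dotted)
    return missing


params = {
    "key": "<your_minimax_key>",            # placeholder, not a real key
    "group_id": "<your_minimax_group_id>",  # placeholder
    "model": "speech-02-turbo",
    "voice_setting": {"voice_id": "English_captivating_female1"},
    "url": "wss://api-uw.minimax.io/ws/v1/t2a_v2",
}

print(missing_params(params))
```

Additional vendor parameters are deliberately not checked here, since the page states they are passed through to MiniMax without validation.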