Deepgram adds GPT-5.3, GPT-5.4, and Gemini 3.1 Flash Lite support to Voice Agent API

New LLM Model Support

Deepgram has extended the Voice Agent API with support for three new language models, enabling developers to leverage the latest AI capabilities for voice-based applications:

OpenAI GPT-5.3 Instant (gpt-5.3-chat-latest) - latest lightweight variant for efficient inference
OpenAI GPT-5.4 (gpt-5.4) - newest flagship model from OpenAI
Google Gemini 3.1 Flash Lite (gemini-3.1-flash-lite) - optimized Gemini variant for faster responses

Integration & Usage

Developers can specify these models in their Voice Agent configuration by setting the provider.model field in the agent's think settings. For example, you can now configure GPT-5.3 Instant as follows:

{
  "type": "Settings",
  "agent": {
    "think": {
      "provider": {
        "type": "open_ai",
        "model": "gpt-5.3-chat-latest"
      }
    }
  }
}

Bug Fix

The update also addresses a critical issue where the GPT-5.2 Instant model was using an incorrect model ID and pricing tier. The model now correctly uses the ID gpt-5.2-chat-latest and is properly assigned to the Advanced pricing tier. Existing implementations should verify their configurations if they're using GPT-5.2.

For complete documentation on all available models and their associated pricing tiers, refer to the Voice Agent LLM Models documentation.

New LLM Model Support

Integration & Usage

Bug Fix

Tags

Published

Source