AI Digest

# OpenAI Launches Advanced Speech-to-Speech Model with New Realtime API Features

OpenAI has announced updates to its Realtime API, introducing an improved speech-to-speech model alongside several new capabilities.

The company revealed four enhancements to the Realtime API. The centerpiece is an improved speech-to-speech model that promises more natural voice interactions. Alongside this, OpenAI is adding support for Model Context Protocol (MCP) servers, which lets the model connect to external data sources and tools.

The two remaining additions expand the API's practical applications: image input support, which allows the model to process visual information during conversations, and Session Initiation Protocol (SIP) phone calling support, which enables direct integration with traditional phone systems.

The SIP integration is especially significant for businesses, as it allows companies to deploy AI voice agents that can handle regular phone calls without requiring users to download apps or