GPT-4.1 released
GPT-4.1 and GPT-4.1-nano are now available. These are the latest models from Azure OpenAI. GPT-4.1 has a 1 million token context limit. For more information, see the models page.

gpt-4o audio models released

New audio models powered by GPT-4o are now available.

The gpt-4o-transcribe and gpt-4o-mini-transcribe speech to text models are released. Use these models via the /audio and /realtime APIs.

The gpt-4o-mini-tts text to speech model is released. Use the gpt-4o-mini-tts model for text to speech generation via the /audio API.

For more information about available models, see the models and versions documentation.

March 2025

Responses API & computer-use-preview model

The Responses API is a new stateful API from Azure OpenAI. It brings together the best capabilities of the chat completions and assistants APIs in one unified experience. The Responses API also adds support for the new computer-use-preview model, which powers the Computer use capability.

Access to computer-use-preview requires registration, and access is granted based on Microsoft's eligibility criteria. Customers who have access to other limited access models still need to request access for this model.

Request access: computer-use-preview limited access model application

For more information on model capabilities and region availability, see the models documentation.

Playwright integration demo code.

Provisioned spillover (preview)

Spillover manages traffic fluctuations on provisioned deployments by routing overages to a designated standard deployment. To learn more about how to maximize utilization for your provisioned deployments with spillover, see Manage traffic with spillover for provisioned deployments (preview).

Specify content filtering configurations

In addition to the deployment-level content filtering configuration, we now also provide a request header that allows you to specify a custom configuration at request time for each API call. For more information, see Use conten