Technology

OpenAI Introduces WebSocket-Based Execution Mode to Reduce Latency in Agentic Workflows

May 7, 2026


OpenAI has introduced a WebSocket-based execution mode for its Responses API, aimed at improving agentic workflow performance in coding agents and real-time AI systems. The update reduces latency by up to 40 percent by replacing HTTP request-response cycles with persistent connections, which improves streaming, tool execution, and multi-step orchestration in production-scale AI systems.
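
To make the intuition concrete, here is a minimal back-of-the-envelope sketch of why reusing one connection can help a multi-step agent loop. It is a simplified cost model, not OpenAI's implementation: it counts only connection setup overhead, and the function names and the numbers in the usage note are hypothetical, chosen for illustration rather than taken from the article.

```python
def per_request_total(steps: int, setup_ms: float, work_ms: float) -> float:
    """Total latency when every step pays connection setup again (assumed model)."""
    return steps * (setup_ms + work_ms)


def persistent_total(steps: int, setup_ms: float, work_ms: float) -> float:
    """Total latency when the connection is set up once and then reused."""
    return setup_ms + steps * work_ms


def relative_saving(steps: int, setup_ms: float, work_ms: float) -> float:
    """Fraction of total latency removed by reusing the connection."""
    before = per_request_total(steps, setup_ms, work_ms)
    after = persistent_total(steps, setup_ms, work_ms)
    return (before - after) / before


if __name__ == "__main__":
    # Hypothetical workload: 10 tool-calling steps, 50 ms setup, 100 ms work per step.
    print(f"{relative_saving(10, 50.0, 100.0):.0%}")
```

Under this model the saving grows with the number of steps and with the ratio of setup cost to per-step work, and it disappears for a single-step call. Real gains depend on factors the model leaves out.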

By Leela Kumili

InfoQ
