AI agents are transforming industries by automating workflows, enhancing productivity, and enabling intelligent decision-making. Businesses are leveraging AI agents to process insurance claims, manage IT service desks, optimize supply chain logistics, and even assist healthcare professionals in analyzing medical data. The potential is vast, and we’re excited to introduce two powerful innovations in Azure AI Foundry:
- Responses API: A powerful API enabling AI-powered applications to retrieve information, process data, and take action seamlessly.
- Computer-Using Agent (CUA): A breakthrough AI model that navigates software interfaces, executes tasks, and automates workflows.
Together, these capabilities empower businesses to reimagine AI not just as an assistant but as an active digital workforce. Enterprise customers will soon gain access to these innovations, driving automation, efficiency, and intelligence at scale.
Enhancing AI Agents with the Responses API
The Responses API is the key to unlocking agentic AI in Azure AI Foundry, transforming how enterprises harness AI for real-world impact. It’s the new foundation for leveraging Azure OpenAI Service’s powerful built-in tools, combining the simplicity of the Chat Completions API with the advanced capabilities available through the Assistants API and Azure AI Agent Service. The Responses API enables seamless interaction with tools like CUA, function calling, and file search, all in a single API call. This lets AI systems retrieve data, process information, and take actions, connecting agentic AI with enterprise workflows.
How the Responses API Works
The Responses API provides a structured response format that allows AI to interact with multiple tools while maintaining context across interactions. It supports:
- Tool calling in a single simple API call: Developers can seamlessly integrate AI tools, making execution more efficient.
- Computer use: Use the computer use tool within the Responses API to drive automation and execute software interactions.
- File search: Interact with enterprise data dynamically and extract relevant information.
- Function calling: Develop and invoke custom functions to enhance AI capabilities.
- Chaining responses into conversations: Keep track of interactions by linking responses together using unique response IDs, ensuring continuity in AI-driven dialogues.
- Enterprise-grade data privacy: Built with Azure’s trusted security and compliance standards, ensuring data protection for organizations.
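The tool declarations and response chaining in the list above can be sketched without calling the service. What follows is a minimal sketch, assuming a request body with `model`, `input`, `tools`, and `previous_response_id` fields. The `FakeResponsesClient` class, the `get_claim_status` function, the `vs_example` vector store ID, and the `my-deployment` name are hypothetical placeholders for illustration, not part of any Azure SDK.

```python
import itertools
from dataclasses import dataclass, field
from typing import Any, Iterator, Optional


def build_request(model: str, user_input: str,
                  tools: Optional[list[dict[str, Any]]] = None,
                  previous_response_id: Optional[str] = None) -> dict[str, Any]:
    """Assemble a Responses-style request body as a plain dict."""
    body: dict[str, Any] = {"model": model, "input": user_input}
    if tools:
        body["tools"] = tools
    if previous_response_id is not None:
        # Linking to the prior response is what carries context forward.
        body["previous_response_id"] = previous_response_id
    return body


# One custom function and one file search tool, declared side by side.
TOOLS = [
    {
        "type": "function",
        "name": "get_claim_status",  # hypothetical function name
        "description": "Look up the status of an insurance claim.",
        "parameters": {
            "type": "object",
            "properties": {"claim_id": {"type": "string"}},
            "required": ["claim_id"],
        },
    },
    {"type": "file_search", "vector_store_ids": ["vs_example"]},  # placeholder ID
]


@dataclass
class FakeResponsesClient:
    """In-memory stand-in for the service: records requests, returns fresh IDs."""
    sent: list[dict[str, Any]] = field(default_factory=list)
    _ids: Iterator[int] = field(default_factory=lambda: itertools.count(1))

    def create(self, **body: Any) -> dict[str, Any]:
        self.sent.append(body)
        return {
            "id": f"resp_{next(self._ids)}",
            "previous_response_id": body.get("previous_response_id"),
        }


client = FakeResponsesClient()
first = client.create(**build_request(
    "my-deployment", "What is the status of claim 123?", tools=TOOLS))
second = client.create(**build_request(
    "my-deployment", "Summarize that for the customer.",
    previous_response_id=first["id"]))
print(second)
```

In a real application the fake client would be replaced by the service client, and the second turn would rely on the service to recall the first turn through the linked response ID rather than resending the conversation.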
By consolidating retrieval, reasoning, and action execution into a single API, the Responses API simplifies AI agent development, reducing the complexity of orchestrating multiple AI tools within an automation pipeline.
This scalability makes it well-suited for enterprise use cases across industries such as customer service, IT operations, finance, and supply chain management, where AI-powered automation can streamline workflows and improve efficiency. For even greater flexibility and control, organizations can explore Azure AI Agent Service, which offers additional tools and models for developing and scaling AI agents. Azure AI Agent Service integrates with Semantic Kernel and AutoGen, enabling seamless multi-agent orchestration for more complex scenarios requiring multiple agents to collaborate on tasks.
Empowering AI Agents with the Computer-Using Agent
The Computer-Using Agent (CUA) is a specialized AI model in Azure OpenAI Service that allows AI to interact with graphical user interfaces (GUIs), navigate applications, and automate multi-step tasks, all through natural language instructions. Unlike traditional automation tools that rely on predefined scripts or API-based integrations, CUA can interpret visual elements, adapt dynamically, and take action based on on-screen content.
What makes the Computer-Using Agent unique?
- Autonomous UI navigation: Can open applications, click buttons, fill out forms, and navigate multi-page workflows.
- Dynamic adaptation: Interprets UI changes and adjusts actions accordingly, reducing reliance on rigid automation scripts.
- Cross-application task execution: Operates across web-based and desktop applications, integrating disparate systems without API dependencies.
- Natural language command interface: Users can describe a task in plain language, and CUA determines the correct UI interactions to execute.
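The observe, propose, act cycle implied by the list above can be sketched as a small loop. This is an illustrative sketch only: `FakeScreen` and `scripted_model` are hypothetical stand-ins for a real browser or desktop and for the CUA model, and the action dictionaries (`type`, `x`, `y`, `text`) are a simplified shape assumed for this example, not the service’s wire format.

```python
from dataclasses import dataclass, field
from typing import Callable, Optional


@dataclass
class FakeScreen:
    """Stand-in for a real browser or desktop; records what was done to it."""
    events: list[str] = field(default_factory=list)

    def click(self, x: int, y: int) -> None:
        self.events.append(f"click({x},{y})")

    def type_text(self, text: str) -> None:
        self.events.append(f"type({text!r})")

    def screenshot(self) -> str:
        # A real harness would return image bytes; a text summary suffices here.
        return f"screen after {len(self.events)} action(s)"


def execute(action: dict, screen: FakeScreen) -> None:
    """Dispatch one proposed action to the screen."""
    kind = action["type"]
    if kind == "click":
        screen.click(action["x"], action["y"])
    elif kind == "type":
        screen.type_text(action["text"])
    else:
        raise ValueError(f"unsupported action: {kind}")


def scripted_model(plan: list[dict]) -> Callable[[str], Optional[dict]]:
    """Hypothetical model stub: replays a fixed plan and ignores the observation."""
    steps = iter(plan)

    def propose(observation: str) -> Optional[dict]:
        return next(steps, None)  # None signals that the task is finished

    return propose


def run_task(propose: Callable[[str], Optional[dict]],
             screen: FakeScreen, max_steps: int = 10) -> list[str]:
    """Alternate between observing the screen and executing proposed actions."""
    observation = screen.screenshot()
    transcript: list[str] = []
    for _ in range(max_steps):  # hard cap so a confused model cannot loop forever
        action = propose(observation)
        if action is None:
            break
        execute(action, screen)
        observation = screen.screenshot()
        transcript.append(observation)
    return transcript


screen = FakeScreen()
model = scripted_model([
    {"type": "click", "x": 120, "y": 48},
    {"type": "type", "text": "Q3 report"},
])
print(run_task(model, screen))
print(screen.events)
```

The step cap and the explicit rejection of unknown action types are design choices of this sketch: they keep the harness, rather than the model, in charge of what can actually run.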
With today’s announcement, developers can start building additional agentic capabilities directly with CUA. As enterprises look to deploy this technology at scale, we’re evaluating integration with Windows 365 and Azure Virtual Desktop to enable CUA automation to run seamlessly in a managed host environment on Cloud PCs or virtual machines (VMs), ensuring consistent performance while maintaining enterprise compliance and security standards.
Ensuring secure and trustworthy AI automation
As AI systems become more autonomous, ensuring security, reliability, and alignment with human intent is critical. The CUA model is one of the first agentic AI models capable of directly interacting with software environments, bringing new challenges in misuse prevention, unintended actions, and adversarial risks. To address these, Microsoft and OpenAI have implemented a multi-layered safety approach spanning the model, system, and deployment levels.
The CUA model is developed with safeguards to refuse harmful tasks, reject unauthorized actions, and prevent misuse. At the system level, Microsoft implements enterprise-grade content filtering and execution monitoring to help detect and prevent policy violations. To minimize unintended actions, CUA is designed to request user confirmations before executing irreversible tasks and to restrict high-risk actions such as financial transactions.
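An application can mirror this confirm-or-restrict behavior in its own harness. The sketch below is an application-side analogue, not the model’s built-in safeguard: the action category names, the `classify` policy, and the `guarded_execute` helper are all hypothetical and would need to be replaced by an organization’s own policy.

```python
from enum import Enum
from typing import Callable


class Decision(Enum):
    ALLOW = "allow"
    CONFIRM = "confirm"
    BLOCK = "block"


# Hypothetical categories; a real deployment would define its own policy.
BLOCKED = {"financial_transaction"}
NEEDS_CONFIRMATION = {"delete_record", "send_email", "submit_form"}


def classify(action_kind: str) -> Decision:
    """Map an action category to a policy decision."""
    if action_kind in BLOCKED:
        return Decision.BLOCK
    if action_kind in NEEDS_CONFIRMATION:
        return Decision.CONFIRM
    return Decision.ALLOW


def guarded_execute(action_kind: str,
                    run: Callable[[], str],
                    confirm: Callable[[str], bool]) -> str:
    """Run the action only if policy allows it and, where required, a human agrees."""
    decision = classify(action_kind)
    if decision is Decision.BLOCK:
        return f"blocked: {action_kind}"
    if decision is Decision.CONFIRM and not confirm(action_kind):
        return f"declined: {action_kind}"
    return run()


def always_no(kind: str) -> bool:
    """Simulated reviewer who rejects every confirmation request."""
    return False


print(guarded_execute("scroll", lambda: "scrolled", always_no))
print(guarded_execute("delete_record", lambda: "deleted", always_no))
print(guarded_execute("financial_transaction", lambda: "paid", always_no))
```

Blocked categories never reach the confirmation prompt, so a hurried approval cannot authorize them.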
Microsoft’s Trustworthy AI framework further ensures real-time observability, logging, and compliance auditing for enterprise deployments. Automated and human-in-the-loop detection systems monitor execution patterns, identifying anomalous behaviors and enforcing governance policies. These safeguards are continuously refined based on internal red-teaming, external audits, and real-world testing to strengthen protection against prompt injections, adversarial manipulations, and unauthorized access. Given the current reliability level of the CUA model, particularly in non-browser environments, human oversight remains strongly recommended for sensitive operations.
As AI agents evolve, Microsoft is committed to transparency, security, and ongoing risk mitigation. By combining CUA’s built-in safeguards with Azure’s enterprise compliance and governance tools, organizations can deploy AI-powered automation with confidence, ensuring safe and responsible AI adoption at scale.
Getting began with CUA and Responses API
Azure AI Foundry continues to push the boundaries of AI-powered automation. Enterprise customers can gain access to the Responses API and CUA in Azure OpenAI Service.
We’re excited to see how developers and businesses innovate with these new capabilities.