Securing Agentic AI and the New Security Reality

Question

Securing Agentic AI and the New Security Reality

calendar_todayJun 10 • schedule4 min read

The most dangerous AI failures aren't dramatic. They're invisible.

With just 0.001% of poisoned data, a model can lose up to 30% of its accuracy. The drift happens gradually. The agent keeps running. Nobody gets an alert. By the time anyone notices something is wrong, the damage is done.

"Your teams don't even know it happened," said Pearl Almeida, Research Director, Security and Privacy at Info-Tech Research Group, opening her keynote at Info-Tech LIVE 2026 in Las Vegas. "We don't have enough people who understand the scope of agents. Our tooling isn't designed to catch drift. And we're not monitoring or logging transactions the way we need to."

The numbers back her up. According to Info-Tech's research, 52.9% of agents across organizations are not monitored or secured. The average cost of a shadow AI breach is $4.63 million — more than the average ransomware incident. And that figure, Almeida noted, is likely conservative.

"This isn't a technology problem," she said. "It's a governance problem."

Why Traditional Security Controls Fail

The root issue is architectural. Traditional security was built for predictable systems — static code, tightly scoped permissions, deterministic workflows. Agentic AI operates on entirely different terms: dynamic reasoning, nondeterministic decisions, shifting permissions, and autonomous loops that can change behavior from one session to the next.

Two identical inputs can produce completely different outputs depending on context. Agents can adopt permissions from other agents as their tasks evolve. They're making decisions in real time, autonomously, and the controls built for conventional software simply weren't designed to track any of that.

Almeida traced the timeline: from the ChatGPT launch in late 2022 and early prompt injection attacks, through indirect injection and RAG poisoning, to the current era of multi-agent environments and agent-to-agent protocols. Each phase created a new attack surface before the previous one was secured. The gap between adoption and governance, she said, is the attack surface.

Four Questions That Frame the Entire Problem

Almeida's framework starts not with controls but with questions — four of them, in order:

What is the agent doing? What resources does the agent require? Does the agent have predefined ranges for its decisions? And what happens when something goes wrong?

"By answering these questions," she said, "you establish autonomy boundaries for the agent. And autonomy boundaries are the new perimeter."

The Anatomy of an Attack Surface

To understand where to apply controls, you first have to understand how an agent actually works. Almeida walked through the full architecture: input arrives from a user, API, or event trigger; a RAG engine retrieves relevant context; a context manager assembles that background information; the LLM plans, reasons, and decides; and the agent executes through tool calls — API calls, database queries, application actions — before producing output.

Every one of those components is an entry point. And every action the agent takes is executed through an API call.

"APIs are a dominant attack vector," Almeida said, "and agents increase the blast radius." Her research found that 36% of AI vulnerabilities are API vulnerabilities. Most API keys in production are carrying more permissions than the task requires, stored incorrectly, and not logged or monitored.

The Three-Layer Security Architecture

Almeida's framework builds security in layers, each one addressing a different question from her opening framework.

The first layer is API security — controlling who can request an action and what actions are allowed. Every agent interaction is authenticated, authorized, and auditable. Scoped tokens define what the agent can access, what tasks it can perform, and the conditions for valid access.

API security works like a hotel key. Your key gives you access to your room, the gym, the pool, and the elevator — but nowhere else the hotel wants you to have access to. The elevator is like an MCP server: it reads your key, but only takes you to your floor. The token defines what you're allowed to reach. The MCP gateway enforces where you can actually go.

The second layer is the MCP gateway. Model Context Protocol has become the standard for how agents call tools, and the gateway creates a single enforcement point for every tool call. It handles authentication verification, routes the agent to the relevant MCP server, enforces policy requirements, filters which tools are visible to the agent, and provides centralized logging. API controls scope. MCP controls access to resources.

The third layer is the orchestration layer — where behavioral nuance lives. This is where intent-based access control, policy-based access control, and indirect prompt injection detection operate. The orchestration layer provides contextual runtime enforcement: intent consistency checking, output scanning, and behavioral anomaly detection.

Together, the three layers mean no single point of failure. Orchestrator, MCP gateway, API security — intent and context, call enforcement, credential validation — each layer catches what the previous one can't.

Identity Governance: Still Emerging

The fourth layer — identity governance for agents — adds attribution on top of access control. An identity provider takes over token issuance, rotation, and revocation. When something goes wrong, you can revoke the token immediately, which shuts the agent down. The identity layer also handles agent and MCP discovery and registration, making it possible to know what agents are running in your environment at any given time.

Almeida was candid about the maturity of this space. "Identity for agents is still in theory," she said. Vendors are in beta testing. The tooling is still developing. But that doesn't mean organizations should wait — it means they should be evaluating vendors now based on behavioral monitoring capabilities, policy controls, and the quality of the governance interface their security teams will actually use.

Governance Is the Missing Link

The final question — what happens when something goes wrong — requires organizations to think through the full NIST framework: identify, protect, detect, respond, recover. Most organizations have the protect layer partially in place. Almost none have the detect layer working properly.

"Logging isn't just a compliance checkbox," Almeida said. "You need to start baselining actions for every agent, especially agents that are touching sensitive systems. You can't secure what you can't see."

The call to action she left the room with was direct: get started today. Map your agents. Understand what they can see, what they can do, and what they can touch. Define what catastrophic failure looks like for each one. Build the revocation capability before you need it. And don't wait for the governance framework to be perfect before you start.

"We secured the internet once," Almeida said. "We can do it again. We just need to start."

3 Comments

🔥 Join developers growing publicly

Share your knowledge, build in public, and grow your developer presence with a global community.

Join CoderLegion

chevron_left

Tom Smithverified

15.4k Points • 641 Badges

Raleigh, NC • insightsfromanalytics.com

187Posts

117Comments

81Connections

LLM Training & Evaluation Specialist with hands-on experience building major AI models. As one of th... Show more

Commenters (This Week)

Contribute meaningful comments to climb the leaderboard and earn badges!

Fluxerv · Answer 1 · 2026-06-11T04:59:35+0000

Fluxerv • Jun 10

The 0.001% poisoned data stat is sobering. The invisibility of the drift is what makes it dangerous. In traditional security you can audit a system at a point in time. With agents that keep running and learning, the attack surface is constantly shifting. The governance piece feels like it's still catching up to how fast these systems are being deployed.

Tom Smithverified • Jun 11

@[Fluxerv] The point-in-time audit problem is exactly what Pearl was driving at — the tools we built for static systems don't transfer. She made the point that 52.9% of agents aren't monitored at all, which means most organizations won't know drift happened until something downstream breaks. The governance gap is real, but her argument was that you don't need to wait for the frameworks to mature before you start — baselining agent behavior now gives you something to measure against later.

Fluxerv • Jun 11

@[Tom Smith] Baselining now makes sense, even if the frameworks are not there yet. You cannot retroactively establish what normal looked like. The 52.9% unmonitored stat is the real problem though. Baselining only helps if someone is actually watching. For most organizations the monitoring infrastructure does not exist before the agent does, and by the time they think about it there is already nothing to compare against.

	Helping Clients Move from Pilot to Production: The Agentic AI Governance Playbook Tom Smithverified - Jun 8
	The Sovereign Vault — A Comprehensive Guide to Protocol-Driven AI Ken W. Algerverified - Jun 4
	MCP Is the USB-C of AI. So Why Are You Plugging Everything In? Ken W. Algerverified - Jun 10
	The Security Conversation Your Clients Aren't Having About Agentic AI Tom Smithverified - Jun 29
	Defending Against AI Worms: Securing Multi-Agent Systems from Self-Replicating Prompts alessandro_pignati - Apr 2

Securing Agentic AI and the New Security Reality

Why Traditional Security Controls Fail

Four Questions That Frame the Entire Problem

The Anatomy of an Attack Surface

The Three-Layer Security Architecture

Identity Governance: Still Emerging

Governance Is the Missing Link

3 Comments

Please log in to add a comment.

Please log in to comment on this post.

More Posts

Helping Clients Move from Pilot to Production: The Agentic AI Governance Playbook

The Sovereign Vault — A Comprehensive Guide to Protocol-Driven AI

MCP Is the USB-C of AI. So Why Are You Plugging Everything In?

The Security Conversation Your Clients Aren't Having About Agentic AI

Defending Against AI Worms: Securing Multi-Agent Systems from Self-Replicating Prompts

More From Tom Smithverified

Torq SOC Brain: Why Reasoning From Precedent Beats Retrieving It

Developer Weekly Briefing — July 25, 2026

Sophos Traces a Ransomware Operation Back to 12 AI Agents Running Inside a Coding Assistant

Related Jobs

Commenters (This Week)

Welcome to Coder Legion

Connect with 4,755 amazing developers

Don't have an account? Sign up

OR

Securing Agentic AI and the New Security Reality

Why Traditional Security Controls Fail

Four Questions That Frame the Entire Problem

The Anatomy of an Attack Surface

The Three-Layer Security Architecture

Identity Governance: Still Emerging

Governance Is the Missing Link

3 Comments

Please log in to add a comment.

Please log in to comment on this post.

More Posts

More From Tom Smithverified

Related Jobs

Commenters (This Week)