MCP Security Explained: What the Model Context Protocol Means for Enterprise Risk.

The protocol that quietly became the default integration layer for enterprise AI agents also created an attack surface most security teams aren't equipped to see. Here's what you need to know — with real CVEs, confirmed incidents, and controls that actually work.

97M
Monthly MCP SDK downloads as of March 2026
Moweb / Bonjoy, 2026
71%
Of orgs cannot properly monitor their AI agents
Cisco State of AI Security 2026
60%+
Tool poisoning success rate against major LLM agents
MCPTox / arXiv, 2026
50
Tracked MCP ecosystem vulnerabilities, 13 critical
Vulnerable MCP Project, 2026

In November 2024, Anthropic published a technical specification called the Model Context Protocol. Most enterprise security teams didn't notice. By March 2026, MCP had 97 million monthly SDK downloads, was supported by every major AI provider — OpenAI, Google, Microsoft, AWS — and had become the default architecture for connecting AI agents to enterprise systems. The Vulnerable MCP Project, run by researchers from SentinelOne, Snyk, Trail of Bits, and CyberArk, now tracks 50 known vulnerabilities across the ecosystem. Thirteen of them are classified critical.

If your organization is deploying AI agents — and statistically, it is — you are almost certainly running MCP. This article explains what it is, what attack surface it creates, what real-world incidents have already happened, and what controls actually close the gaps.

What MCP Actually Is

Before MCP, connecting an AI agent to enterprise tools was a bespoke engineering problem. Every integration required custom API code. Connecting an agent to Salesforce, Slack, GitHub, and a PostgreSQL database meant building and maintaining four separate integrations — and rebuilding them every time a provider changed their API. The technical term for this is the N×M problem: N agents multiplied by M tools equals an unmanageable number of custom connections.

MCP solves this by standardizing the interface. An MCP server exposes any tool, database, or service through a uniform protocol. Any MCP-compatible AI agent can connect to any MCP server without custom integration code. The math shifts from multiplicative to additive.

MCP Architecture: How Agents Connect to Enterprise Systems
AI Agent Claude / GPT / Gemini MCP Client MCP Gateway / Host ↳ Authentication ↳ Permission scoping ↳ Audit logging ↳ Rate limiting MCP Server GitHub / Code MCP Server Salesforce / CRM MCP Server Database / SQL MCP Server Slack / Comms Without a gateway: agent connects directly to servers — no inspection, no auth, no audit trail

MCP defines three types of primitives: Tools (executable functions — read a file, send an email, run a query), Resources (data the agent can read), and Prompts (templated instructions the server can inject into the agent's context). Each primitive creates a different attack surface.

Context

MCP was introduced by Anthropic in November 2024 and donated to the Linux Foundation's Agentic AI Foundation in December 2025. As of March 2026, it is supported by every major AI provider and has 5,800+ published servers with 97 million monthly SDK downloads.

The Adoption Curve — and Why Speed Is the Problem

Eighteen months from initial release to 97 million monthly downloads. Security frameworks, by contrast, move on a timescale measured in years. That gap between how fast MCP was adopted and how slowly security controls developed is precisely where the current incident wave is coming from.

NOVEMBER 2024
Anthropic launches MCP. ~2M monthly downloads. Most enterprise security teams haven't heard of it.
APRIL 2025
OpenAI adopts MCP. Downloads jump to 22M/month. First tool-poisoning research papers appear on arXiv.
JUNE 2025
Supabase Cursor incident. AI agent with service-role access leaks integration tokens via injected SQL in support tickets. First major confirmed MCP production breach.
JULY 2025
Microsoft integrates MCP into Copilot Studio. Downloads reach 45M/month. Enterprise deployment accelerates.
NOVEMBER 2025
AWS Bedrock adds MCP support. 68M/month. Vulnerable MCP Project launches — SentinelOne, Snyk, Trail of Bits, CyberArk.
MARCH 2026
All major providers on board. 97M downloads/month. 5,800+ servers. 50 vulnerabilities tracked, 13 critical. CIS publishes MCP Security Companion Guide.

"Adoption outpaced hardening. The CVEs and disclosures stacking up in public databases reflect that gap, and the defense market is still sorting itself into camps."

— PipeLab: State of MCP Security 2026, April 2026

Real Incidents. Not Hypotheticals.

These are confirmed or researcher-demonstrated attacks on production-realistic configurations. They happened because security architectures were not designed for agentic AI.

Supabase Cursor Agent — Researcher-Demonstrated Token Exfiltration
July 2025 · Security Research
Unpatched at disclosure
Security researchers demonstrated this attack on a production-realistic Supabase + Cursor configuration. A developer reviews support tickets via Cursor with the Supabase MCP connected using a service_role key — bypassing all Row-Level Security. A malicious ticket containing hidden SQL instructions causes the agent to SELECT every row from the integration_tokens table and INSERT the contents back into the support thread. Supabase confirmed no customer data was actually breached, but the attack mechanism works exactly as demonstrated on any similarly configured deployment.
Risk: Any deployment using service_role key + untrusted user input is vulnerable. Supabase now recommends read-only MCP access by default.
GitHub MCP — Private Repository Code Exfiltration
May 2025
Indirect prompt injection
Indirect prompt injection through public GitHub Issues caused an MCP-connected agent to exfiltrate code from private repositories. The agent had legitimate read access — the attack simply instructed it to use that access via injected content in public Issues.
Impact: Private source code exfiltrated via legitimate agent permissions.
MCP Inspector — Remote Code Execution
2025
CVE-2025-49596 · CVSS 9.4
The MCP Inspector proxy lacked authentication between the Inspector client and proxy server, allowing unauthenticated requests to launch MCP commands. The attack vector was browser-based: a developer visits a malicious website, which uses DNS rebinding or CSRF to send JavaScript that silently connects to the locally running Inspector (default port 6277) and achieves remote code execution on the developer's machine — no direct network access required. Discovered by Oligo Security, patched in version 0.14.1.
Impact: Full remote code execution on exposed systems.
Datadog Audit — 12,000+ Credentials Exposed via MCP Config Files
2026
Configuration exposure
Datadog security researchers audited MCP deployments and found over 12,000 API keys and passwords stored as plaintext in MCP server configuration files. Tool poisoning attacks can force MCP clients to read and return these files. Local MCP servers executing OS commands can further escalate to SSH keys and cloud credentials.
Impact: Mass credential exposure across thousands of enterprise MCP deployments.

The 6 MCP Attack Vectors Your Team Must Understand

  1. 1

    Indirect Prompt Injection via MCP Tool Responses

    An attacker embeds malicious instructions inside content the agent processes — a support ticket, a GitHub Issue, a document. When the agent retrieves this via MCP, it executes the embedded instructions against real systems. The critical difference from chatbot jailbreaking: agents take actions, not just produce text.

    Real incidents: GitHub MCP · Supabase Cursor (2025)
  2. 2

    Tool Poisoning — Malicious MCP Servers

    A malicious or compromised MCP server presents as legitimate but returns manipulated responses designed to alter agent behavior or exfiltrate data. Real-world benchmarks report tool poisoning success rates exceeding 60% against major LLM agents — 72% against some models.

    Source: MCPTox / arXiv benchmarks, 2026
  3. 3

    Excessive Privilege — Overpowered Agent Permissions

    The Supabase research scenario was possible because the agent had full service-role database access — bypassing all Row-Level Security — for a task requiring only read access to support ticket text. AI-related access configurations with excess permissions jumped from 12% to 39% of deployments between 2024 and 2026.

    Source: Microsoft 2026 Data Security Index
  4. 4

    Credential Exposure in MCP Config Files

    API keys and service tokens stored as plaintext in MCP server config files. Tool poisoning attacks can force clients to read and return these files. Local servers executing OS commands can further escalate by reading SSH keys and cloud provider credentials from the host environment.

    Datadog audit: 12,000+ credentials exposed (2026)
  5. 5

    Agent Memory Poisoning

    Malicious content injected into an agent's context across multiple interactions gradually shifts its behavior — with no single interaction looking suspicious. Corrupts long-term memory or RAG knowledge bases incrementally, persisting across sessions and evading single-event monitoring.

    Source: TrueFoundry MCP Security Research, 2026
  6. 6

    Zero Authentication on Development MCP Instances

    CVE-2025-49596 (CVSS 9.4) exploits a missing authentication layer between the MCP Inspector client and its proxy. The attack is browser-based: a malicious website uses DNS rebinding to connect to the Inspector running on localhost and execute arbitrary OS commands. No direct network exposure required — a developer simply visiting a malicious site while running Inspector is enough. Patched in v0.14.1; the broader OX Security research identified ~200,000 vulnerable instances across the MCP ecosystem.

    CVE-2025-49596 · CVSS 9.4 · OX Security, 2026
32%
of organizations have had at least one AI-related data security incident
Microsoft 2026 Data Security Index
29%
of organizations feel prepared to secure agentic AI applications
Cisco State of AI Security 2026
88%
of organizations reported confirmed or suspected AI agent incidents in the past year
Gravitee State of AI Agent Security 2026

Why Traditional Security Tools Don't See These Attacks

Standard enterprise security — SIEM, WAF, DLP, endpoint detection — was designed for a world where humans initiate actions and systems execute them. MCP creates a different model: an AI agent autonomously initiates tool calls, processes responses, and takes subsequent actions with no human in the loop for individual decisions.

Where Traditional Tools Fail Against MCP Attack Vectors
Attack Vector Traditional Tool Why It Fails Indirect Prompt Injection via MCP tool response WAF / Input Validation Injection arrives as trusted tool response, not user input. WAF never sees it. Tool Poisoning malicious MCP server Supply Chain Scanners No SBOM standard for MCP yet. Server integrity is not verified at runtime. Excessive Agent Privilege over-permissioned tool access IAM / RBAC IAM assigns permissions to human users. Agent non-human identities are invisible to it. Memory Poisoning gradual context corruption SIEM / Anomaly Detection Attack spans multiple sessions. No single event triggers an alert threshold. Credential Exposure in Config plaintext keys in MCP server files DLP DLP monitors data leaving the network. MCP reads credentials from local files first.

Controls That Actually Work

ControlAttack Vector AddressedImplementation NoteEffectiveness
MCP Gateway with I/O Inspection Prompt injection, tool poisoning Proxy all agent-to-server traffic through an inspection layer evaluating both tool call inputs and server responses before the agent acts High
Least Privilege Agent Permissions Privilege escalation, excessive access Scope each agent's MCP tool access to the minimum required. Service-role access for a read-only task is a configuration error. High
MCP Server Verification and Pinning Tool poisoning, supply chain attacks Hash-pin MCP server versions. Verify server identity before connection. Alert on unexpected version changes. High
Secrets Management Integration Credential exposure in config files No plaintext credentials in MCP server config. Use a secrets manager (Vault, AWS Secrets Manager) and inject credentials at runtime. High
Agent Activity Audit Logging All vectors — detection and forensics Log every tool call, every resource access, every prompt received. Structured logs with session IDs allowing full decision-chain reconstruction. Medium (detection)
Network Auth on All MCP Instances Unauthenticated inspector exposure Every MCP instance — including dev tools — requires authentication. CVE-2025-49596 exploited a missing auth layer via browser-based DNS rebinding, not direct network access. Bind proxy servers to 127.0.0.1 only and upgrade to Inspector v0.14.1+. High
On Inspection Latency

The most common objection to adding an inspection layer between agents and MCP servers is latency. If inspection adds 500ms to every tool call, agentic workflows become unusable. Effective MCP security requires sub-100ms enforcement — achievable with purpose-built Small Language Models running locally on CPU, rather than routing traffic through external cloud inspection services.

Summary: Three Actions This Week

MCP is not going away. Neither is the attack surface it creates. Three actions that materially reduce your MCP risk exposure immediately:

1. Inventory every MCP server your agents connect to. You cannot secure what you cannot see. Complete list, who deployed each server, what permissions it runs with, and whether any instance is running without authentication.

2. Audit agent permission scopes against least privilege. If any agent has write access or service-role credentials for a task that only requires reading data — that misconfiguration needs immediate correction.

3. Establish an inspection point between agents and MCP servers. This is the control that would have prevented every attack in this article. It has to exist.

The Uncomfortable Reality

88% of organizations reported confirmed or suspected AI agent security incidents in the past year (Gravitee, survey of 900+ executives and practitioners). In healthcare, that number is 92.7%. These are not organizations that ignored security. They are organizations whose existing architectures were not designed for the layer where MCP attacks occur.

Polygraf AI

See How Polygraf AI Secures MCP Deployments

Sub-100ms inspection of every MCP tool call and server response — on-premise, CPU-only, zero data leaves your network.

Request a Demo →
Air-gap ready · HIPAA · SOC 2 · Deploys in under an hour
Polygraf AI
Enterprise AI Security · Built for Regulated Industries

Polygraf AI's Behavioral Control Plane intercepts and controls every AI interaction inline — enforcing organizational policy on input and output, across user-facing and agentic AI, with zero data leaving your environment. Runs on existing infrastructure at sub-100ms latency. No GPU required.

NEWS & More

Insights & Updates from Polygraf.

Blog posts

The protocol that became a default integration layer. Here's what enterprises need to know to secure themselves.

To learn more about Polygraf, please get in touch.

At Polygraf, we envision a future where AI augments human capabilities without compromising safety, privacy, or ethical standards. Trust in our commitment to building this future with you.

Products

thank you

Your download will start now.

Thank you!

Please provide information below and
we will send you a link to download the white paper.