{"id":46041,"date":"2026-06-19T21:04:37","date_gmt":"2026-06-19T13:04:37","guid":{"rendered":"https:\/\/apacvision.com\/?p=46041"},"modified":"2026-06-19T21:04:37","modified_gmt":"2026-06-19T13:04:37","slug":"orcarouter-releases-ai-threat-report-2026-and-makes-its-security-controls-free-amid-rise-in-prompt-injection-attacks","status":"publish","type":"post","link":"https:\/\/apacvision.com\/?p=46041","title":{"rendered":"OrcaRouter Releases AI Threat Report 2026 and Makes Its Security Controls Free Amid Rise in Prompt-Injection Attacks"},"content":{"rendered":"<p>OrcaRouter has published The AI Threat Report 2026 and made its agent Firewall and input\/output Guardrails free for every user \u2014 same API key, one switch, no code changes. The report argues that AI systems have become the attack surface, with prompt injection now the #1 risk to LLM applications and one that cannot be patched. OrcaRouter&#8217;s answer is architectural: gateway-level controls that bind to credentials, so any team can enforce them without rewriting their agents.<\/p>\n<p><b>Prompt injection ranks as the top risk<br \/>\nto LLM applications and, the company says, cannot be fully patched. OrcaRouter<br \/>\nSecurity Research has made its agent Firewall and input\/output Guardrails<br \/>\navailable at no cost to all users, attached to an existing API key.<\/b><\/p>\n<p><b>SINGAPORE \u2014 June 18, 2026 \u2014 <\/b><a href=\"https:\/\/www.orcarouter.ai\/\">OrcaRouter<\/a>, the OpenAI-compatible LLM<br \/>\ngateway, today published <a href=\"https:\/\/docs.orcarouter.ai\/whitepapers\/ai-threat-report-2026\"><i>The AI<br \/>\nThreat Report 2026<\/i><\/a> and made two of its security controls available at<br \/>\nno cost to all users: the agent Firewall and input\/output Guardrails. According<br \/>\nto the company, the controls can be attached to an API key already in use,<br \/>\nwithout a separate integration or purchase.<\/p>\n<p><i>The AI Threat Report 2026 \u2014 14<br \/>\nkey risks across four threat categories.<\/i><\/p>\n<p>The report states that <b>AI systems have themselves become an<br \/>\nattack surface, and that most organizations cannot see the attacks directed<br \/>\nagainst them.<\/b> Telemetry from production LLM applications shows the average<br \/>\nsuccessful attack completing in <b>42 seconds<\/b>, with <b>90% of them leaking<br \/>\nsensitive data<\/b> (Pillar Security). Prompt-injection attacks rose <b>340%<br \/>\nyear over year<\/b> (OWASP, Q1 2026). And <b>13% of organizations<\/b> have<br \/>\nalready been breached through an AI model or application \u2014 <b>97% of those<br \/>\nlacked basic AI access controls<\/b> (IBM, 2025).<\/p>\n<p><i>By OrcaRouter Security Research \u00b7 June<br \/>\n2026<\/i><\/p>\n<p>In June 2025, attackers exfiltrated corporate data from Microsoft<br \/>\n365 Copilot. The victim did nothing wrong \u2014 no link clicked, no attachment<br \/>\nopened, no prompt approved. They received an email. Their AI assistant later<br \/>\nread it, and obeyed the instructions hidden inside. Disclosed by Aim Security<br \/>\nas <b>EchoLeak (CVE-2025-32711)<\/b>, the attack gathered sensitive context from<br \/>\nmail, files, and chat history and smuggled it out through an auto-loading image<br \/>\nURL. Zero clicks.<\/p>\n<p>According to the report, EchoLeak was not an isolated case <b>but an<br \/>\nearly example of a broader pattern.<\/b><\/p>\n<h2>A year of escalating, increasingly automated incidents<\/h2>\n<p>The report&#8217;s 2026 incident record spans cases that challenged<br \/>\nlongstanding assumptions in enterprise security:<\/p>\n<p>\u2022\u00a0\u00a0\u00a0\u00a0 <b>Chat &amp; Ask AI<\/b> left roughly 300<br \/>\nmillion private chat messages from more than 25 million users exposed through a<br \/>\nFirebase misconfiguration (404 Media; Malwarebytes, Jan 2026).<\/p>\n<p>\u2022\u00a0\u00a0\u00a0\u00a0 <b>Sears Home Services<\/b> exposed 3.7 million<br \/>\nAI chat transcripts and call recordings \u2014 names, addresses, emails \u2014 spanning<br \/>\n2024\u20132026 (ExpressVPN; Cybernews, Mar 2026).<\/p>\n<p>\u2022\u00a0\u00a0\u00a0\u00a0 An attacker chained a single CVE <b>(CVE-2026-39987<\/b> in the<br \/>\nmarimo notebook tool) into a live LLM agent that extracted cloud credentials,<br \/>\npulled an SSH key from AWS Secrets Manager, and exfiltrated an entire internal<br \/>\nPostgreSQL database in under two minutes (Sysdig; The Hacker News, May 2026).<\/p>\n<p>\u2022\u00a0\u00a0\u00a0\u00a0 <b>Microsoft and Salesforce<\/b> both shipped<br \/>\npatches for AI-agent data-leak flaws. In CVE-2026-21520, a poisoned SharePoint<br \/>\nfield steered Copilot into emailing customer data to an attacker \u2014 and the data<br \/>\nleft even after a safety mechanism flagged the attack (Dark Reading).<\/p>\n<p>\u2022\u00a0\u00a0\u00a0\u00a0 <b>Denial-of-wallet<\/b> \u2014 a hijacked or runaway<br \/>\nagent that simply spends \u2014 has been observed burning $46,000 a day (Sysdig,<br \/>\n\u201cLLMjacking\u201d). No data is stolen. There is only a bill.<\/p>\n<p><i>Three years of public<br \/>\nincidents, research, and regulation \u2014 2023 to 2026.<\/i><\/p>\n<h2>Why traditional security tools miss these attacks<\/h2>\n<p>Traditional security assumes a boundary: trusted inside, untrusted<br \/>\noutside, controls at the seam. Language models dissolve that boundary, because <b>a<br \/>\nmodel&#8217;s input is also its programming.<\/b> Every email, document, web page, and<br \/>\ntool result an agent reads can carry instructions it will follow. There is no<br \/>\nreliable, general mechanism by which today&#8217;s models separate content to process<br \/>\nfrom commands to obey.<\/p>\n<p>That is why prompt injection holds the <b>#1 position in the OWASP<br \/>\nTop 10 for LLM Applications<\/b> \u2014 and why, the company argues, it will not be<br \/>\n\u201cpatched\u201d the way a buffer overflow is. It is described as a structural<br \/>\nproperty of the medium: a web application firewall inspects the request and<br \/>\nsees a perfectly valid API call, because the attack is in the words.<br \/>\nPer-request checks pass every step of a chained attack, because the damage<br \/>\nlives in the sequence \u2014 volume, repetition, and spend against time \u2014 not in any<br \/>\none call.<\/p>\n<p>The report concludes that <b>AI security is not a model-training<br \/>\nproblem. It is an architecture problem<\/b> \u2014 and it is solvable with the same<br \/>\ndiscipline enterprises already apply to every other production system.<\/p>\n<p><i>The 14 key risks across four<br \/>\nthreat categories: content plane, action plane, economic, and trust &amp;<br \/>\nsupply chain.<\/i><\/p>\n<h2>A gateway-level approach: two planes, six layers<\/h2>\n<p>Every attack above succeeds against unscoped authority and fails<br \/>\nagainst scoped, policed, audited authority. Containing them requires<br \/>\ncontrolling <b>two distinct planes:<\/b><\/p>\n<p>\u2022\u00a0\u00a0\u00a0\u00a0 <b>The content plane<\/b> \u2014 what the model reads<br \/>\nand writes. This is the job of Guardrails.<\/p>\n<p>\u2022\u00a0\u00a0\u00a0\u00a0 <b>The action plane<\/b> \u2014 what the agent does:<br \/>\nthe tools it calls, the networks it reaches, the money it spends. This is the<br \/>\njob of the Firewall.<\/p>\n<p>The report notes that the most damaging incidents cross both planes:<br \/>\nan injection arrives as content, then executes as an action. OrcaRouter&#8217;s<br \/>\ndesign places six independent, auditable layers between a request and its<br \/>\nexecution:<\/p>\n<p>\u2022\u00a0\u00a0\u00a0\u00a0 <b>Scoped identity<\/b> \u2014 every agent calls<br \/>\nthrough its own key carrying allowed models, an IP allow-list, a hard spend<br \/>\ncap, and an expiry. An out-of-scope request dies before any content is read.<\/p>\n<p>\u2022\u00a0\u00a0\u00a0\u00a0 <b>Input guardrails<\/b> \u2014 injection and<br \/>\njailbreak rules, PII detection and masking, secret blocking, and a semantic<br \/>\nLLM-judge that catches what regex cannot.<\/p>\n<p>\u2022\u00a0\u00a0\u00a0\u00a0 <b>The action firewall<\/b> \u2014 every tool call,<br \/>\nMCP dispatch, and network egress is judged against ordered, default-deny policy<br \/>\nwith six verdicts: allow, audit, deny, sanitize, pending-approval, and<br \/>\ncap-cost. A hijacked agent cannot reach a tool, a host, or a spend limit that<br \/>\nwas not explicitly listed.<\/p>\n<p>\u2022\u00a0\u00a0\u00a0\u00a0 <b>Output guardrails<\/b> \u2014 the reply is<br \/>\nscreened on the way out for unsafe output, PII, and secrets, with grounding<br \/>\nchecks. This is the layer that catches EchoLeak&#8217;s exfiltration URL before it<br \/>\nleaves.<\/p>\n<p>\u2022\u00a0\u00a0\u00a0\u00a0 <b>Anomaly detection<\/b> \u2014 behavioral baselines<br \/>\nflag what static rules can&#8217;t predict: the same call hammered in a tight window,<br \/>\nspend spiking against a learned baseline, a tool-to-tool transition the<br \/>\nworkspace has never made.<\/p>\n<p>\u2022\u00a0\u00a0\u00a0\u00a0 <b>Signed audit<\/b> \u2014 every match, verdict,<br \/>\napproval, and policy change lands in a tamper-evident trail, correlated by<br \/>\nagent run and session, exportable as evidence.<\/p>\n<p>The decisive property is placement. These controls live at the<br \/>\ngateway, in the request path, so they bind to <b>credentials rather than<br \/>\napplication code<\/b> \u2014 enforceable across every team and framework, with no<br \/>\nagent rewrites.<\/p>\n<p><i>Observed prevalence versus<br \/>\npotential business impact, mapped by threat plane.<\/i><\/p>\n<h2>Evaluation against open red-team benchmarks<\/h2>\n<p>The company says Guardrails and Firewall ship with an evaluation<br \/>\nharness that scores them against more than 80 open-source red-team corpora,<br \/>\neach cited and licensed:<\/p>\n<p>\u2022\u00a0\u00a0\u00a0\u00a0 <b>HarmBench<\/b> (MIT; ICML 2024), <b>JailbreakBench<\/b><br \/>\n(NeurIPS 2024), and <b>AdvBench<\/b> (Zou et al., 2023) for harmful-behavior and<br \/>\njailbreak robustness;<\/p>\n<p>\u2022\u00a0\u00a0\u00a0\u00a0 <b>NVIDIA&#8217;s garak<\/b> (Apache-2.0), the open<br \/>\nLLM vulnerability scanner, for injection and encoding attacks;<\/p>\n<p>\u2022\u00a0\u00a0\u00a0\u00a0 <b>AgentDojo<\/b> (NeurIPS 2024) \u2014 the agent<br \/>\nprompt-injection benchmark the US and UK AI Safety Institutes used in joint<br \/>\nred-teaming \u2014 to grade the action-plane firewall specifically;<\/p>\n<p>\u2022\u00a0\u00a0\u00a0\u00a0 <b>TruthfulQA<\/b> and others for grounding and<br \/>\nhallucination.<\/p>\n<p>OrcaRouter integrates open tooling directly: <b>OSV<\/b> for<br \/>\ndependency CVEs and <b>Semgrep<\/b> for code that transits a prompt.<\/p>\n<h2>Aligning with incoming regulation<\/h2>\n<p>On <b>August 2, 2026, the EU AI Act becomes fully applicable<\/b>,<br \/>\nand \u201cshow me\u201d replaces \u201ctell me\u201d as the regulatory baseline. The same<br \/>\nevidentiary instinct is spreading through SOC 2 scopes, cyber-insurance<br \/>\nquestionnaires, and procurement reviews. OrcaRouter ships <b>36 compliance<br \/>\nframework packs<\/b> \u2014 including OWASP LLM Top 10, NIST AI RMF, ISO\/IEC 42001,<br \/>\nEU AI Act, SOC 2, HIPAA, PCI DSS, and GDPR \u2014 that apply controls within a<br \/>\nworkspace and generate signed evidence. According to the company, one control<br \/>\nlayer can produce attestation for all of them at once.<\/p>\n<h2>What is being released<\/h2>\n<p><b>OrcaRouter Firewall + Guardrails are now free for every user.<\/b> The controls attach to an API key already in use and do not require<br \/>\na separate integration.<\/p>\n<p>The company said it made the controls free deliberately, citing the<br \/>\nreport&#8217;s finding that restricting AI use without an approved alternative tends<br \/>\nto increase unsanctioned, or \u201cshadow,\u201d AI rather than reduce it \u2014 and that<br \/>\nshadow AI already drives <b>one in five breaches at a $670,000 premium<\/b><br \/>\n(IBM, 2025). The company argues that the response is as much economic as<br \/>\ntechnical: <b>make the governed path the easiest path.<\/b> A control that<br \/>\ncarries an extra cost, requires manual integration, and must be justified to a<br \/>\nbudget committee is, it says, one that many teams will skip.<\/p>\n<p>Guardrails and a Firewall policy attach to an existing key, and the<br \/>\ncompany recommends a staged rollout: <b>observe<\/b> (run in audit mode and let<br \/>\nreal traffic write the baseline), <b>shadow<\/b> (run the real policy in<br \/>\nwould-block mode until false positives approach zero), then <b>enforce<\/b><br \/>\n(flip verdicts live, with human approval reserved for the genuinely<br \/>\nirreversible). Most teams convert in weeks \u2014 and keep the controls on.<\/p>\n<h2>Outlook<\/h2>\n<p>The report frames the 2026 threat landscape not as a reason to slow<br \/>\nAI adoption but as a guide to managing it. Its central argument is that the<br \/>\ndocumented attacks succeed against unscoped authority and fail against scoped,<br \/>\npoliced, and audited authority \u2014 a property the company says can be implemented<br \/>\nat the gateway level.<\/p>\n<p><b>Availability: <\/b>The<br \/>\nFirewall and Guardrails are available now to all OrcaRouter users. The AI<br \/>\nThreat Report 2026 is published on the OrcaRouter documentation site.<\/p>\n<p>\u00a0<\/p>\n<p><b>About OrcaRouter<\/b><\/p>\n<p>OrcaRouter is an OpenAI-compatible LLM gateway from Continuum AI<br \/>\nPte. Ltd. (Singapore), routing across 200+ models with around 40% cost<br \/>\nreduction, sub-millisecond routing overhead, and zero token markup. A<br \/>\nself-hosted edition, OrcaRouter-Lite, is available under the MIT license.<\/p>\n<p><b>Media contact: <\/b>Yi Shi \u00b7<br \/>\nyi@continuum01.ai<\/p>\n<p>This press release has also been published on <a href=\"https:\/\/vritimes.com\/sg\/articles\/cc24febe-b707-4f66-8dd9-ffb7564a0c9c\/d2462d39-910a-494a-9883-1daeb8b634d0\">VRITIMES<\/a><\/p>\n","protected":false},"excerpt":{"rendered":"<p>OrcaRouter has published The AI Threat Report 2026 and made its agent Firewall and input\/output Guardrails free for every user \u2014 same API key, one switch, no code changes. The report argues that AI systems have become the attack surface, with prompt injection now the #1 risk to LLM applications and one that cannot be&#8230;<\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"magazine_newspaper_sidebar_layout":"","footnotes":""},"categories":[4],"tags":[],"class_list":["post-46041","post","type-post","status-publish","format-standard","hentry","category-singapore"],"_links":{"self":[{"href":"https:\/\/apacvision.com\/index.php?rest_route=\/wp\/v2\/posts\/46041","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/apacvision.com\/index.php?rest_route=\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/apacvision.com\/index.php?rest_route=\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/apacvision.com\/index.php?rest_route=\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/apacvision.com\/index.php?rest_route=%2Fwp%2Fv2%2Fcomments&post=46041"}],"version-history":[{"count":0,"href":"https:\/\/apacvision.com\/index.php?rest_route=\/wp\/v2\/posts\/46041\/revisions"}],"wp:attachment":[{"href":"https:\/\/apacvision.com\/index.php?rest_route=%2Fwp%2Fv2%2Fmedia&parent=46041"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/apacvision.com\/index.php?rest_route=%2Fwp%2Fv2%2Fcategories&post=46041"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/apacvision.com\/index.php?rest_route=%2Fwp%2Fv2%2Ftags&post=46041"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}