Why My AI Agents Needed CaneCorso as a Security Control Plane

AI agents are powerful because they can read, reason, summarize, decide, and act across a wide range of information sources.

That is also what makes them dangerous.

The more useful an agent becomes, the more likely it is to consume data I do not fully trust. Emails. Newsletters. RSS feeds. API responses. Documents sent as attachments. Social media. YouTube transcripts. Scraped search results. Web pages. Translated content. Random bits of text pulled from places where I do not control the author, the formatting, the intent, or the payload.

That is a very different security model than the one most of us are used to.

In traditional applications, we spend a lot of time separating code from data, users from administrators, trusted networks from untrusted networks, and internal systems from the internet. With LLMs and agents, all of those boundaries start to blur. Instructions, context, content, and intent all arrive in the same stream. The model has to reason over that stream, and the agent has to decide what to do with the result.

That is exactly why I wanted a security control plane in front of my own AI agents.

For me, that control plane became CaneCorso™.


The Problem Was Not Theoretical

My agents support me personally. They monitor and process a wide range of information sources, each usually aligned to a specific focus area, query, or web mission. Some are looking for security research. Some are watching industry news. Some are digesting newsletters. Some are collecting data from APIs, documents, email attachments, social media, transcripts, and scraped search results.

In other words, they spend their time eating untrusted data.

That creates a meaningful risk profile.

I wanted to protect the agents against prompt injection and malformed data attacks. I also wanted to protect upstream and downstream systems from malicious URLs, private data exposure, and unsafe content that could be carried forward into decision-making. These agents are not just producing novelty summaries. Their outputs are used to support decisions.

That matters.

If an agent reads a poisoned page, a malicious email, or a document with hidden instructions, I do not want that content passed directly to the underlying LLM. If the LLM produces something unsafe, misleading, privacy-sensitive, or operationally risky, I do not necessarily want that output passed into the next stage of logic without inspection.

Before CaneCorso, the basic pipeline looked like this:

Collect inputs → summarize/extract → reason/decide → write output.
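In Python terms, that original flow can be sketched roughly like this (the function names are illustrative, not from my actual agent code). Note that nothing between collection and the LLM call inspects the content:

```python
def collect_inputs(sources):
    # Gather raw, untrusted text from feeds, emails, APIs, documents, etc.
    return [src() for src in sources]

def run_agent(sources, summarize, decide, write_output):
    # Pre-CaneCorso flow: untrusted text goes straight to the LLM.
    raw_items = collect_inputs(sources)
    summaries = [summarize(item) for item in raw_items]   # LLM call
    decision = decide(summaries)                          # LLM call
    write_output(decision)
    return decision
```

Every stage trusts the stage before it, which is exactly the problem.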

There was some logging in place for decision analysis, KPIs, and tuning. But logging is not a trust boundary. Observability is useful after the fact. It does not, by itself, prevent hostile or malformed content from entering the LLM context window.

I needed something more like a firewall for agentic workflows.

Moving CaneCorso Into the Agent Path

CaneCorso is now the single control plane for multiple agents in my environment.

Each agent has a defined CaneCorso workflow and API key configured with specific rules and outcomes. From a security practitioner’s perspective, the model feels familiar. It is not unlike firewall or IPS policy tuning. Each workflow can be adjusted based on what the agent does, what data it sees, and what level of risk is acceptable for that mission.

Every agent now sends data through CaneCorso before that data is passed to an LLM.

That is the first and most important control point. Untrusted input does not go straight to the model anymore. It is inspected, filtered, redacted, defanged, and rated before the LLM sees it.

About half of my agents also send the LLM output corpus back through CaneCorso for a second pass before the result is allowed into downstream decision logic. That double-checking pattern has become important for workflows where the output itself may influence actions, prioritization, or further analysis.

The result is a two-layer safety pattern:

Input inspection before the LLM.

Output inspection before downstream use.
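A minimal sketch of that two-layer pattern looks like this. Here `inspect` stands in for a CaneCorso workflow call; the verdict/content shape it returns is my own illustration, not the product's documented API:

```python
def guarded_agent(raw_input, inspect, llm, downstream):
    # Layer 1: inspect untrusted input before the model sees it.
    checked = inspect(raw_input)
    if checked["verdict"] == "block":
        return None  # hostile content never reaches the LLM
    result = llm(checked["content"])  # redacted/defanged text only

    # Layer 2: inspect model output before downstream decision logic.
    checked_out = inspect(result)
    if checked_out["verdict"] == "block":
        return None
    return downstream(checked_out["content"])
```

The point of the structure is that both the model and the downstream logic only ever see content that has passed through the control plane.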

That simple architectural shift changes the trust model. I am no longer depending only on model behavior, prompt discipline, or good luck. I have a monitored, auditable control plane sitting in the path.

Token Vault Sanitization and SIEM Logging

One of the other important pieces for me has been token vault sanitization.

Private or sensitive values can be protected before they move through the workflow. That is especially important when agents are handling emails, documents, API results, and mixed internal/external content. Even personal agents can encounter sensitive material, and enterprise agents will almost certainly do so.

I am also sending full transaction details, safety ratings, and decision-making context into my SIEM logs.

That is not just for compliance theater. It gives me a way to perform forensics, review blocked or redacted content, tune policies over time, and understand how different sources behave. If a feed repeatedly triggers injection protections, I can see that. If a workflow is too permissive or too noisy, I can tune it. If something gets blocked, I can understand why.
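The logging side can be as simple as emitting one structured record per inspection and letting an existing SIEM forwarder pick it up. The field names below are my own illustration, not a documented schema:

```python
import json
import logging

log = logging.getLogger("agent.canecorso")

def log_transaction(agent, workflow, verdict, ratings, actions):
    # One structured record per inspection; a syslog or HTTP
    # forwarder ships these into the SIEM downstream.
    record = {
        "agent": agent,
        "workflow": workflow,
        "verdict": verdict,          # e.g. pass / redact / block
        "safety_ratings": ratings,   # scores returned by the inspection
        "actions": actions,          # e.g. ["pii_redaction", "url_defang"]
    }
    log.info(json.dumps(record))
    return record
```

Structured records are what make the tuning loop possible: you can query by workflow, by verdict, or by source, rather than grepping free text.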

That feedback loop is essential.

AI security is not a one-and-done configuration exercise. The attack patterns are evolving. The data sources change. The agents change. The business logic changes. The controls need to be visible enough and adjustable enough to keep up.

The Integration Experience

My agents are written in Python, and the CaneCorso documentation made the integration straightforward.

The samples were relevant, accurate, and concise. I started by building a simple API harness from the documentation. Then I tuned that harness for each agent so it used the proper workflow-specific API key. After that, I used the CaneCorso web GUI to tune each workflow.
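The harness pattern is worth showing, because it is what made repeating the integration cheap. This is a hedged sketch of the shape, not the documented CaneCorso API: the endpoint, payload fields, and auth header are assumptions, and the transport is injected so each agent only differs by its key and workflow ID:

```python
class CaneCorsoHarness:
    """Thin per-agent wrapper. The payload shape and auth header are
    illustrative assumptions, not the documented CaneCorso API."""

    def __init__(self, api_key, workflow_id, transport):
        self.api_key = api_key          # workflow-specific key
        self.workflow_id = workflow_id
        self.transport = transport      # e.g. an HTTP POST function

    def inspect(self, text):
        return self.transport(
            headers={"Authorization": f"Bearer {self.api_key}"},
            payload={"workflow": self.workflow_id, "content": text},
        )

# One harness per agent, each with its own workflow and key, e.g.:
# news_agent = CaneCorsoHarness(NEWS_KEY, "wf-news", http_post)
```

Once this class existed, onboarding another agent was mostly a matter of issuing a new key and tuning its workflow in the GUI.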

The first agent took about 30 minutes.

Each following agent took about 10 minutes.

That is an important detail for buyers. This did not turn into a rewrite of my agent stack. It felt more like adding a security middleware layer or API gateway into the agent path. Once the pattern existed, repeating it across agents was simple.

The workflow tuning was also approachable. The GUI presents functional modules in plain language. You can turn capabilities on and off and tune the behavior without needing to write complex detection logic or understand obscure heuristics. Security people will recognize the rhythm: enable controls, test, review outcomes, tune, and repeat.

It felt like firewall or IPS rule tuning, but for AI workflows.

After testing, the agents were back in service. The system has been running smoothly for weeks with no significant hiccups.

What It Has Caught

So far, I have seen multiple prompt injection redactions. That is not surprising, because some of my agents monitor discussions around LLM threats and AI security. In those environments, malicious or adversarial examples are not theoretical. They show up in the data.

I have also had excellent results with PII redaction and URL defanging.

The URL handling matters more than many people realize. Agents often collect links, summarize pages, follow references, or pass URLs into later workflows. Defanging malicious or suspicious URLs reduces the chance that a downstream system, user, or automation accidentally treats dangerous content as safe.
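For readers unfamiliar with defanging, the convention is simple: rewrite the URL so it is no longer clickable or resolvable by accident. A minimal illustration of the idea (not CaneCorso's implementation):

```python
import re

def defang(url):
    # Conventional defanging: "http" -> "hxxp" and "." -> "[.]"
    # so downstream tools and humans cannot follow the link by accident.
    url = re.sub(r"^http", "hxxp", url, flags=re.IGNORECASE)
    return url.replace(".", "[.]")
```

The content survives for analysis, but the live link does not.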

The PII redaction has also been strong. For agentic workflows, privacy protection has to be built into the pipeline. You do not want every agent team inventing its own ad hoc redaction function, especially in a regulated environment.
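To make the "ad hoc redaction" point concrete, here is the kind of function individual teams end up writing on their own: a minimal sketch that covers only email addresses, which is precisely why a shared, centrally tuned layer is preferable:

```python
import re

EMAIL = re.compile(r"[\w.+-]+@[\w-]+\.[\w.-]+")

def redact_pii(text):
    # Ad hoc example: catches email addresses only. Real PII spans
    # names, phone numbers, IDs, addresses -- far beyond one regex.
    return EMAIL.sub("[REDACTED-EMAIL]", text)
```

Multiply this by every agent team, each with different coverage and different bugs, and the inconsistency argument makes itself.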

Another pleasant surprise has been cross-language support. Some of the feeds my agents process are in languages other than English. CaneCorso has handled injection protection well even when the LLM is being used for translation. That is a big deal, because attackers do not have to limit themselves to English, and global data sources rarely cooperate with neat security assumptions.

Latency has been on the order of milliseconds per API call, even on consumer-grade hardware.

Not too shabby.

The Confidence Gain

The biggest practical gain has been confidence.

CaneCorso does not make untrusted data magically trustworthy. No tool does that. But it significantly raises the trust level of the workflow, even when some of the data is known to be hostile or suspicious.

That confidence matters when agents are used for decision support. I am more comfortable letting agents process messy public data because I know the underlying LLMs and downstream systems have another layer of protection. I am not relying solely on system prompts, model alignment, or careful source selection.

The web is untrusted. Email is untrusted. Documents are untrusted. Social media is untrusted. Scraped content is untrusted.

Agent architectures need to be designed with that assumption in mind.

Why Potential Buyers Should Care

Prompt injection is real, prevalent, and dangerous.

We are still early in the evolution of LLM attacks. The patterns are changing quickly, and the impact will grow as agents gain access to more tools, more data, and more authority. It does not take much imagination to see these attacks evolving into deeper compromise, exfiltration, fraud, and ransomware-style workflows.

That is why I think anyone experimenting with or implementing AI agents should be looking closely at this class of control.

If your agents consume data that is not 100% trusted, you need a plan.

That applies to security teams, automation teams, developers building RAG applications, MSPs, MSSPs, executives using personal agents, and organizations building internal agentic workflows. It applies even more strongly to regulated organizations.

In my opinion, regulated organizations implementing agentic workflows without this level of protection are asking for trouble.

The enterprise argument is especially straightforward. It makes sense to have a single, monitored, auditable control plane for agents so every team does not have to roll its own controls. Without that shared layer, each agent team makes its own decisions about redaction, prompt injection protection, URL handling, logging, blocking, alerting, and auditability.

That is expensive.

It is inconsistent.

It is hard to defend.

A shared control plane reduces time, cost, and mistrust. It makes agent adoption safer and helps organizations move toward ROI without pretending the risks are not there.

The Buyer’s Note

CaneCorso is not magic.

No product can provide 100% trust in untrusted data. That is not how security works, and it is definitely not how AI security works.

But the right control can raise the trust level significantly. It can provide a consistent inspection point. It can enforce privacy protections. It can defang URLs. It can redact prompt injection attempts. It can generate logs. It can give security teams something concrete to monitor, tune, and audit.

That is the point.

The organizations that succeed with AI agents will not be the ones that simply connect models to everything and hope for the best. They will be the ones that build control points, observe behavior, tune policies, and treat agentic workflows like the high-impact systems they are becoming.

For my own agents, CaneCorso became that control point.

And once it was in place, I would not want to run them without it.

How to Learn More or Leverage MSI Expertise

If you want to discuss our experience with CaneCorso in more detail, or pilot the tool in your own environment, just get in touch. You can reach us at info@microsolved.com, or give us a call at +1.614.351.1237. We’d be happy to have a zero-pressure discussion with you. Thanks for reading, and stay safe out there!