I built a Safety Kernel for Crews - blocks dangerous file ops automatically

Hey CrewAI community! :waving_hand:

I’ve been working on kernel-level safety for AI agents and wanted to share a demo specifically for CrewAI.

The Problem: Agents can hallucinate dangerous operations like rm -rf or DROP TABLE. Prompt engineering alone can’t reliably prevent this.

The Solution: Agent OS intercepts these at the kernel level - before they execute.
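
To make "intercepts before they execute" concrete, here is a minimal sketch of a pre-execution policy check in plain Python. It is not the actual Agent OS implementation; DENY_PATTERNS and guarded_run are made-up names for illustration only.

import re
import subprocess

# Illustrative blocklist only; a real policy set would be much richer.
DENY_PATTERNS = [
    r"\brm\s+-rf\b",
    r"\bsudo\b",
    r"\bchmod\s+777\b",
    r"\bDROP\s+TABLE\b",
]

def guarded_run(command: str):
    """Refuse to run a shell command that matches a deny pattern."""
    for pattern in DENY_PATTERNS:
        if re.search(pattern, command, flags=re.IGNORECASE):
            raise PermissionError(f"blocked by policy {pattern!r}: {command!r}")
    return subprocess.run(command, shell=True, capture_output=True, text=True)

With this, guarded_run("rm -rf /tmp/scratch") raises instead of reaching the shell, and the agent's tool sees an ordinary Python error it can report back.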

Demo

I just submitted a PR to crewAI-examples: "feat: Add Agent OS safety governance example" (crewAIInc/crewAI-examples, Pull Request #300 on GitHub).

Run it yourself:

git clone https://github.com/imran-siddique/agent-os
cd agent-os/examples/crewai-safe-mode
python crewai_safe_mode.py

What it does:

  • :white_check_mark: Wraps your CrewAI agents in a safety kernel
  • :white_check_mark: Blocks operations like rm -rf, sudo, chmod 777
  • :white_check_mark: Maintains a full audit log of all agent actions (minimal sketch of a log entry after this list)
  • :white_check_mark: Zero code changes to your existing crews
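
On the audit-log point above, here is roughly what one entry could contain per tool call. This is a hedged sketch, not the Agent OS API: audited and AUDIT_LOG are hypothetical names, and in the actual demo the kernel does this wrapping for you rather than requiring a decorator in your crew code.

import datetime
import functools
import json

AUDIT_LOG = "agent_audit.jsonl"  # hypothetical path, not an Agent OS setting

def audited(tool_fn):
    """Append a JSON line recording every call to a wrapped agent tool."""
    @functools.wraps(tool_fn)
    def wrapper(*args, **kwargs):
        entry = {
            "timestamp": datetime.datetime.now(datetime.timezone.utc).isoformat(),
            "tool": tool_fn.__name__,
            "args": [repr(a) for a in args],
        }
        try:
            result = tool_fn(*args, **kwargs)
            entry["outcome"] = "allowed"
            return result
        except Exception as exc:  # e.g. PermissionError raised by the safety check
            entry["outcome"] = f"blocked or failed: {exc}"
            raise
        finally:
            with open(AUDIT_LOG, "a") as log_file:
                log_file.write(json.dumps(entry) + "\n")
    return wrapper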

Screenshot

Demo screenshot: agent-os/examples/crewai-safe-mode/demo.svg (master branch of imran-siddique/agent-os)

Would love feedback on:

  1. What other operations should we block by default?
  2. Would this be useful as a native CrewAI integration?

Happy to contribute upstream if there’s interest! :shield:

Really interesting approach. Intercepting dangerous operations at the kernel level makes a lot of sense, especially as agents start executing real system commands.

One thing I’ve been thinking about in a similar space is what happens after the execution layer — how we verify what the agent actually did.

Blocking rm -rf or DROP TABLE is important, but in more complex multi-agent systems we also start needing something like verifiable execution logs: a structured record of the agent’s decisions and actions that can be audited later.

We’ve been experimenting with this idea as part of an execution-integrity layer for agents, where actions are recorded and verifiable rather than just logged.
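
To make "verifiable rather than just logged" concrete: one simple version is a hash-chained record, where each entry commits to the hash of the previous entry, so any after-the-fact edit breaks verification. A rough sketch of the general idea (my own illustration, not our actual layer):

import hashlib
import json
import time

def append_record(log: list, action: dict) -> dict:
    """Append an action record whose hash chains to the previous record."""
    prev_hash = log[-1]["hash"] if log else "0" * 64
    body = {"timestamp": time.time(), "action": action, "prev_hash": prev_hash}
    digest = hashlib.sha256(json.dumps(body, sort_keys=True).encode()).hexdigest()
    record = {**body, "hash": digest}
    log.append(record)
    return record

def verify_chain(log: list) -> bool:
    """Recompute every hash; editing any earlier record breaks the chain."""
    prev_hash = "0" * 64
    for record in log:
        body = {k: v for k, v in record.items() if k != "hash"}
        digest = hashlib.sha256(json.dumps(body, sort_keys=True).encode()).hexdigest()
        if record["prev_hash"] != prev_hash or record["hash"] != digest:
            return False
        prev_hash = record["hash"]
    return True

Blocking and this kind of record are complementary: the kernel decides what an agent may do, and the chained log lets someone else check what it actually did.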

Curious if you’ve thought about that side of the problem — not just preventing dangerous actions, but also making agent behavior provable afterwards.

I’ve been exploring something related here: