Hero Image

Mitigations/ web articles

This page describes mitigations and AI model safety.

Topic Description Rating
Dual LLM prompt filterer This explores the topic of using 2 models 1 supervisory and privileged and 1 non-privileged for prompt safety managemnt
Centralized MCP governeror Lets you do all your MCP enforcement from a single place.
LLM code reviwer lets you use llm to review changes in code
Agents rule of Two Paper rule of two to help with the lethal trifecta
SELF-SUPERVISED INFERENCE OF AGENTS IN TRUSTLESS ENVIRONMENTS good paper to read on self-supervised agents
why models hallucinate good paper on why ai models are somewhat untrustworthy