Mitigations/ web articles | Thomas Munn's Homepage

This page describes mitigations and AI model safety.

Topic	Description	Rating
Dual LLM prompt filterer	This explores the topic of using 2 models 1 supervisory and privileged and 1 non-privileged for prompt safety managemnt
Centralized MCP governeror	Lets you do all your MCP enforcement from a single place.
LLM code reviwer	lets you use llm to review changes in code
Agents rule of Two Paper	rule of two to help with the lethal trifecta
SELF-SUPERVISED INFERENCE OF AGENTS IN TRUSTLESS ENVIRONMENTS	good paper to read on self-supervised agents
why models hallucinate	good paper on why ai models are somewhat untrustworthy