Question 1

What is adversarial AI security?

Accepted Answer

Adversarial AI security is the study and testing of how machine-learning systems — especially large language models and AI agents — fail under deliberate attack. It covers prompt injection, jailbreaking, model and data poisoning, agent hijacking, and the supply-chain and trust-boundary gaps that emerge when LLMs are wired into tools, memory, and execution contexts. SnailSploit researches it from the offensive side: identifying the underlying principle that makes a class of failures possible, then proving it reproduces across targets.

Question 2

What is AI agent security?

Accepted Answer

AI agent security concerns the new attack surface created when an LLM is given tools, memory, skills, and the ability to execute actions. Unlike a chatbot, an agent reads untrusted content, calls tools, and runs code — so an attacker who controls any input it ingests can redirect its behavior. Key classes include indirect prompt injection, malicious skill packages (see SKILBin), memory poisoning, and the MCP/agent tool-execution attack surface.

Question 3

What is AATMF?

Accepted Answer

AATMF (the Adversarial AI Tactics, Techniques and Mitigations Framework) is SnailSploit's operational taxonomy for testing AI systems: 15 tactics, 240 techniques, 2,152 executable procedures, and 4,980 adversarial prompts. Where MITRE ATLAS documents what AI attacks have happened, AATMF is the artifact a red-teamer actually uses against a live LLM, agent, or RAG pipeline to prove a system is vulnerable.

Question 4

What is the difference between prompt injection and jailbreaking?

Accepted Answer

Jailbreaking manipulates a model into ignoring its own safety policy through crafted input in the user's own turn. Prompt injection places attacker instructions in content the model later ingests — a web page, a document, a tool result, an email — so the model follows instructions the user never wrote. Jailbreaking targets the model's alignment; prompt injection targets the trust boundary between the model and the data it processes.

Question 5

Who produces this AI security research?

Accepted Answer

SnailSploit is an independent adversarial-AI research group. The AI security work is led by Kai Aizen — creator of AATMF, author of Adversarial Minds, NVD contributor, with 84 published CVEs and 5 Linux kernel mainline patches across the group. The research is original, coordinated-disclosure-based, and mapped to AATMF, MITRE ATT&CK, and MITRE ATLAS.

AI Security Research.

AI security — the essentials