// MODEL OPTIMIZATION AND PROMPT SYNTAX TERM

Prompt Injection

A security vulnerability where a malicious user manipulates an AI model's behavior by inserting hidden or adversarial instructions into a prompt, overriding its original purpose.

TECHNICAL DEFINITION

Prompt injection is a security vulnerability where an attacker manipulates a large language model's (LLM) behavior or overrides its system instructions by embedding adversarial or conflicting directives within user input, potentially leading to unintended actions, data exfiltration, or harmful content generation.

BACKGROUND

Prompt engineering is the process of structuring natural language inputs to produce specified outputs from a generative artificial intelligence (GenAI) model. Context engineering is the related area of software engineering that focuses on the management of non-prompt and prompt contexts supplied to the GenAI model, such as system instructions, metadata, API tools and tokens.

SYNONYMS & ALIASES

Jailbreaking
Adversarial prompting
Prompt hijacking
Instruction overriding

USAGE NOTE

Prompt injection is a significant security concern for applications built on LLMs, requiring robust defense mechanisms.

DEVELOPERS

Organizations developing technology related to Prompt Injection.

Anthropic
Develops leading large language models and focuses heavily on AI safety and alignment, including mitigating prompt injection and other adversarial attacks.
OpenAI
Creator of prominent large language models like GPT-4 and ChatGPT, deeply involved in researching and developing defenses against prompt injection to secure their products.
Google DeepMind
A major AI research lab developing advanced large language models and actively researching and implementing robust defenses against various adversarial attacks, including prompt injection.
Microsoft Research / Azure AI
Engaged in cutting-edge AI research and product development for Azure AI, including robust security measures for large language models against vulnerabilities like prompt injection.
Lakera AI
Specializes in securing large language models, offering platforms and tools specifically designed to detect and prevent prompt injection attacks.
Robust Intelligence
Provides an AI security platform that helps enterprises secure their AI systems, including large language models, against vulnerabilities like prompt injection.
Giskard AI
Offers an open-source and enterprise platform for testing and evaluating AI models, including identifying and mitigating security vulnerabilities such as prompt injection in LLMs.
OWASP (Open Worldwide Application Security Project)
A non-profit foundation that works to improve software security. They publish the OWASP Top 10 for LLM Applications, which prominently features Prompt Injection as a critical vulnerability, influencing industry standards and development.
Preamble AI
Focuses on AI safety and security for enterprises using large language models, providing solutions to prevent prompt injection and other adversarial attacks.
HiddenLayer
An AI security company that provides a platform to detect and prevent adversarial attacks, including prompt injection, on machine learning models across various deployments.

RELATED TERMS IN PROMPTING & LOGIC

BACK TO AI ENGINEERING & PROMPT DESIGN LEXICON

TECHNICAL DEFINITION

BACKGROUND

SYNONYMS & ALIASES

USAGE NOTE

DEVELOPERS

Anthropic

OpenAI

Google DeepMind

Microsoft Research / Azure AI

Lakera AI

Robust Intelligence

Giskard AI

OWASP (Open Worldwide Application Security Project)

Preamble AI

HiddenLayer

RELATED TERMS IN PROMPTING & LOGIC