// THREAT DETECTION AND DATA PRIVACY TERM

Data Poisoning

Data poisoning is an attack where a malicious actor intentionally corrupts the data used to train a machine learning model. This causes the model to learn the wrong things, leading to inaccurate or biased results once it's deployed.

TECHNICAL DEFINITION

Data poisoning is a machine learning security attack in which an adversary modifies existing samples in a model's training dataset, or injects malicious ones, to compromise the learning process. This adversarial contamination degrades model accuracy and can introduce targeted backdoors or biases that affect classifications and predictions during inference.
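
The injection variant described above can be illustrated with a small, self-contained sketch. Everything in it is an assumption made for illustration: the one-dimensional synthetic data, the nearest-centroid classifier, and the helper names (make_data, inject_poison, train_centroids) are hypothetical and do not come from any particular library or real attack. It trains the same toy model on clean data and on data with a few fabricated, mislabelled records appended, then compares accuracy on clean test data.

```python
import random

def make_data(rng, n_per_class):
    """Synthetic 1-D data: label 0 clusters near 0.0, label 1 near 4.0."""
    data = []
    for _ in range(n_per_class):
        data.append((rng.gauss(0.0, 1.0), 0))
        data.append((rng.gauss(4.0, 1.0), 1))
    return data

def inject_poison(samples, n_injected, value=40.0, label=0):
    """Hypothetical injection attack: append fabricated points that lie
    far from class 0 but carry the class-0 label."""
    return samples + [(value, label)] * n_injected

def train_centroids(samples):
    """'Training' here is just computing the mean feature value per label."""
    sums, counts = {}, {}
    for x, y in samples:
        sums[y] = sums.get(y, 0.0) + x
        counts[y] = counts.get(y, 0) + 1
    return {y: sums[y] / counts[y] for y in sums}

def predict(centroids, x):
    """Assign x to the label whose centroid is closest."""
    return min(centroids, key=lambda y: abs(x - centroids[y]))

def accuracy(centroids, samples):
    """Fraction of samples whose predicted label matches the true label."""
    correct = sum(1 for x, y in samples if predict(centroids, x) == y)
    return correct / len(samples)

# Fixed seeds keep the demonstration reproducible.
train = make_data(random.Random(0), 200)
test = make_data(random.Random(1), 200)   # clean, held-out data

clean_model = train_centroids(train)
poisoned_model = train_centroids(inject_poison(train, 40))

clean_acc = accuracy(clean_model, test)
poisoned_acc = accuracy(poisoned_model, test)
print(f"clean accuracy:    {clean_acc:.2f}")
print(f"poisoned accuracy: {poisoned_acc:.2f}")
```

In this toy setup, fabricated records making up roughly 9% of the training set should be enough to pull the class-0 centroid beyond the class-1 centroid, so nearly every genuine class-0 point is misclassified. Real models and real attacks are far more complex, and a backdoor attack would aim to leave overall accuracy intact rather than lower it.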

BACKGROUND

Data poisoning is often discussed alongside prompt injection, a related but distinct attack. Prompt injection is a cybersecurity exploit in which innocuous-looking inputs are crafted to cause unintended behavior in machine learning models, particularly large language models (LLMs). It takes advantage of the model's inability to distinguish developer-defined prompts from user inputs, which lets an attacker bypass safeguards and influence model behavior. While LLMs are designed to follow trusted instructions, carefully crafted inputs can manipulate them into producing unintended responses. Prompt injection acts on a deployed model at inference time, whereas data poisoning corrupts the model earlier, during training.

SYNONYMS & ALIASES

adversarial contamination
dataset poisoning
training data attack
model poisoning
data manipulation attack
input poisoning

USAGE NOTE

This attack targets the integrity of the model during its training phase, which makes it much harder to detect than attacks on a live system.
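
Because tampering at training time leaves no trace in the live system, one general way to make it visible is to fingerprint a vetted snapshot of the training data and compare later versions against it. The sketch below is a minimal illustration under stated assumptions, not a method described in this entry: the record format (an ID mapped to a text string) and the function names fingerprint and diff_against_manifest are hypothetical. It uses only hashlib from the Python standard library.

```python
import hashlib

def fingerprint(records):
    """Map each record ID to the SHA-256 hex digest of its content."""
    return {
        rid: hashlib.sha256(text.encode("utf-8")).hexdigest()
        for rid, text in records.items()
    }

def diff_against_manifest(manifest, records):
    """Report records added, removed, or modified since the manifest was made."""
    current = fingerprint(records)
    shared = set(current) & set(manifest)
    return {
        "added": sorted(set(current) - set(manifest)),
        "removed": sorted(set(manifest) - set(current)),
        "modified": sorted(r for r in shared if current[r] != manifest[r]),
    }

# Hypothetical vetted snapshot of a tiny sentiment dataset.
trusted = {
    "r1": "great product|positive",
    "r2": "broke in a day|negative",
}
manifest = fingerprint(trusted)

# Later copy: one label flipped, one record injected.
later = {
    "r1": "great product|positive",
    "r2": "broke in a day|positive",
    "r3": "buy now at example.test|positive",
}
report = diff_against_manifest(manifest, later)
print(report)
```

A check like this only flags changes made after the snapshot was taken. It cannot reveal poisoned records that were already present when the manifest was created, which is part of why this class of attack is hard to detect.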

DEVELOPERS

Organizations developing technology to detect or defend against data poisoning.

Robust Intelligence
A startup focused on AI security that provides a platform to test, validate, and protect machine learning models from vulnerabilities, including data poisoning and adversarial attacks.
HiddenLayer
An AI security company that develops a Machine Learning Security (MLSec) platform designed to detect and respond to adversarial attacks against machine learning models, including data poisoning techniques.
MITRE Corporation
A not-for-profit organization that manages federally funded research and development centers (FFRDCs). It developed the Adversarial Threat Landscape for Artificial-Intelligence Systems (ATLAS) framework, which catalogues and analyzes attacks such as data poisoning to help build defenses.
DARPA
The Defense Advanced Research Projects Agency, a research and development agency of the U.S. Department of Defense. DARPA's GARD (Guaranteeing AI Robustness against Deception) program specifically funds the development of defenses against data poisoning and other adversarial ML attacks.
IBM Research
The research and development division of IBM. It actively researches and publishes on 'Trusted AI,' developing novel algorithms and toolkits to detect and mitigate data poisoning attacks on machine learning models.
Northrop Grumman
A major aerospace and defense technology company that develops AI-enabled systems for military applications. Its work includes research and development on trusted, resilient AI that can resist adversarial manipulation, including data poisoning.
Microsoft Research
The research subsidiary of Microsoft. It has dedicated teams working on 'Responsible AI' and 'Trustworthy Machine Learning,' whose work includes building frameworks and defenses to secure AI systems against data poisoning and other adversarial threats.
Bosch Center for Artificial Intelligence (BCAI)
The corporate research lab of Bosch, focusing on foundational AI research. It publishes studies and develops methods for building machine learning models that are resilient to data poisoning, particularly for safety-critical applications such as autonomous driving.
