// MODEL OPTIMIZATION AND PROMPT SYNTAX TERM

Human Oversight

The practice of keeping humans in charge of monitoring, intervening in, and ultimately controlling AI systems.

TECHNICAL DEFINITION

Human Oversight refers to the principle and practice of ensuring that human agents retain the capacity to monitor, intervene in, and ultimately control the decisions and actions of artificial intelligence systems, preventing full autonomy and ensuring accountability.

BACKGROUND

Prompt injection is a cybersecurity exploit and an attack vector in which innocuous-looking inputs are designed to cause unintended behavior in machine learning models, particularly large language models (LLMs). The attack takes advantage of the model's inability to distinguish between developer-defined prompts and user inputs to bypass safeguards and influence model behaviour. While LLMs are designed to follow trusted instructions, they can be manipulated into carrying out unintended responses through carefully crafted inputs.

READ MORE ON WIKIPEDIA

SYNONYMS & ALIASES

  • Human control
  • human supervision
  • human intervention
  • human governance

USAGE NOTE

Human oversight is critical in high-stakes AI applications like autonomous vehicles or medical diagnostics.

DEVELOPERS

Organizations developing technology related to Human Oversight.

  • Arthur AI

    Develops an AI performance monitoring platform that provides visibility into model behavior, detects issues like bias and drift, and offers explainability features to empower human oversight of AI systems in production.

  • Scale AI

    Provides human-in-the-loop data labeling, validation, and AI evaluation services, directly enabling human oversight and feedback for training, fine-tuning, and testing AI models and prompts.

  • Weights & Biases

    Offers an MLOps platform for tracking, visualizing, and comparing machine learning experiments, enabling human engineers to monitor and oversee the entire AI development lifecycle, including prompt engineering iterations.

  • Anthropic

    A research company focused on AI safety and alignment, developing techniques like 'Constitutional AI' and other methods to ensure AI systems are steerable, interpretable, and align with human values, facilitating effective human oversight.

  • Google

    Through Google Cloud AI, DeepMind, and Responsible AI initiatives, Google develops MLOps platforms, AI safety research, and tools that incorporate human-in-the-loop mechanisms, monitoring, and governance frameworks for robust human oversight.

  • Microsoft

    With Azure AI and its Responsible AI principles, Microsoft provides tools and platforms that support AI governance, explainability (XAI), and human-in-the-loop processes, ensuring human oversight throughout the AI lifecycle.

  • OpenAI

    Conducts extensive research into AI alignment and safety, including the use of Reinforcement Learning from Human Feedback (RLHF) and other techniques to imbue models with desired behaviors and enable human steering and oversight.

  • IBM

    Offers enterprise AI solutions with strong emphasis on AI governance, explainability (e.g., within Watson Studio), and tools for managing AI ethics and compliance, empowering human teams to maintain oversight of AI systems.

  • Gantry

    Provides a platform for ML observability and monitoring, allowing human teams to track the performance, data quality, and behavior of AI models in production, enabling informed oversight and intervention.

RELATED TERMS IN AI ETHICS & SAFETY