// MODEL OPTIMIZATION AND PROMPT SYNTAX TERM

Model Training

The process where a machine learning model learns patterns from data to make accurate predictions.

TECHNICAL DEFINITION

The iterative procedure of adjusting a machine learning model's internal parameters (weights and biases) by exposing it to a labeled dataset, minimizing a loss function, and enabling it to learn underlying data distributions.

BACKGROUND

Prompt engineering is the process of structuring natural language inputs to produce specified outputs from a generative artificial intelligence (GenAI) model. Context engineering is the related area of software engineering that focuses on the management of non-prompt contexts supplied to the GenAI model, such as metadata, API tools, and tokens.

READ MORE ON WIKIPEDIA

SYNONYMS & ALIASES

  • Training
  • model fitting
  • learning phase
  • parameter optimization

USAGE NOTE

During model training, the algorithm learns from the input data to map features to targets.

DEVELOPERS

Organizations developing technology related to Model Training.

  • Google Cloud / Vertex AI

    Provides a unified platform for building, training, and deploying ML models, including managed datasets, training pipelines, and experiment tracking.

  • AWS SageMaker

    Offers a comprehensive suite of services that enable developers to build, train, and deploy machine learning models at scale, featuring managed training environments.

  • Microsoft Azure Machine Learning

    A cloud-based platform for building, training, and deploying machine learning models, supporting various ML frameworks and MLOps capabilities.

  • Databricks

    Offers a unified data and AI platform, leveraging Apache Spark and Delta Lake, with MLOps capabilities (including MLflow) for model development, training, and lifecycle management.

  • Hugging Face

    Develops tools and platforms (like Transformers and Accelerate) that facilitate the training, fine-tuning, and deployment of state-of-the-art machine learning models, especially for natural language processing.

  • Weights & Biases

    Provides MLOps tools for experiment tracking, visualization, and collaboration during the deep learning model training process, helping optimize performance and manage runs.

  • NVIDIA

    Designs and manufactures GPUs and develops software platforms (e.g., CUDA, cuDNN, TensorRT) that provide the computational backbone and acceleration libraries essential for training large-scale AI models.

  • Comet ML

    Offers an MLOps platform for tracking, comparing, and managing machine learning experiments and models, providing visibility and debugging tools specifically for the training phase.

RELATED TERMS IN DATA SCIENCE