// MODEL OPTIMIZATION AND PROMPT SYNTAX TERM

Model Selection

Choosing the best machine learning model from a set of candidates for a specific task.

Model Selection — illustration from Wikipedia
Image via Wikipedia

TECHNICAL DEFINITION

The process of evaluating and selecting the optimal machine learning model, often based on performance metrics on a validation set, to generalize well to unseen data.

BACKGROUND

Generative artificial intelligence (GenAI) is a subfield of artificial intelligence (AI) that uses generative models to generate text, images, videos, audio, software code or other forms of data. These models learn the underlying patterns and structures of their training data, and use them to generate new data in response to input, which often takes the form of natural language prompts.

READ MORE ON WIKIPEDIA

SYNONYMS & ALIASES

  • Algorithm choice
  • model picking
  • best model selection

USAGE NOTE

Model selection is crucial to ensure the deployed model performs effectively in real-world scenarios.

DEVELOPERS

Organizations developing technology related to Model Selection.

  • Hugging Face

    Hugging Face offers a vast Hub of pre-trained models, datasets, and demos, serving as a central platform where developers discover, share, and select models for various AI tasks. Their libraries (e.g., Transformers) facilitate fine-tuning and deploying these models, making model selection a core part of their ecosystem.

  • OpenAI

    OpenAI provides a suite of powerful foundational models like GPT-3.5 and GPT-4 through their API. Users engage in model selection by choosing the most appropriate model variant based on complexity, cost, and specific application requirements for prompt engineering and AI development.

  • Anthropic

    Anthropic develops and offers its family of Claude large language models (e.g., Claude 3 Opus, Sonnet, Haiku) via API. AI engineers and prompt designers must select among these models based on their distinct capabilities, speed, and cost-efficiency for their specific use cases.

  • Google Cloud Vertex AI

    Google Cloud's Vertex AI is an end-to-end platform for building, deploying, and scaling machine learning models. It provides tools for comparing different models, managing experiments, and selecting the optimal model for deployment, including access to Google's own foundational models like Gemini.

  • Microsoft Azure AI

    Azure AI offers a comprehensive portfolio of AI services, including Azure Machine Learning and Azure OpenAI Service. It provides tools and services for training, deploying, and managing various AI models, enabling developers to compare and select the best models for their applications based on performance and cost.

  • Weights & Biases (W&B)

    W&B provides a developer tool for machine learning experiment tracking, model optimization, and collaboration. It is crucial for comparing the performance of different models, hyperparameter configurations, and prompt strategies, thereby directly aiding in the data-driven selection of the best model.

  • Vellum.ai

    Vellum is a platform designed to help developers build and manage AI applications using large language models. It often includes features for A/B testing different prompts across multiple LLMs, evaluating their outputs, and facilitating the selection of the best-performing model-and-prompt combination for a given task.

RELATED TERMS IN DATA SCIENCE