// MODEL OPTIMIZATION AND PROMPT SYNTAX TERM

Git

Git is a widely used version control system that helps developers track changes in their code, coordinate work with others, and manage different versions of projects.

TECHNICAL DEFINITION

Git is a distributed version control system (DVCS) designed for tracking changes in source code during software development, enabling efficient collaboration, branching, merging, and historical revision management for projects of any size.

BACKGROUND

A large language model (LLM) is a neural network trained on a vast amount of text for natural language processing tasks, especially language generation. LLMs can typically generate, summarize, translate, and analyze text in many contexts, and are a foundational technology behind modern chatbots. Biased or inaccurate training data can make an LLM's output less reliable.

READ MORE ON WIKIPEDIA

SYNONYMS & ALIASES

  • Distributed VCS
  • Git VCS
  • Code repository
  • GitHub/GitLab

USAGE NOTE

Git is the de facto standard for version controlling code in almost all software and machine learning projects.

DEVELOPERS

Organizations developing technology related to Git.

  • GitHub (Microsoft)

    As the world's leading platform for Git-based version control, GitHub is foundational for AI engineering, providing tools for code collaboration, MLOps CI/CD pipelines via GitHub Actions, and hosting repositories for models, datasets, and prompt engineering artifacts. Microsoft also develops AI-powered tools like GitHub Copilot, deeply integrated with Git workflows.

  • GitLab

    GitLab offers a comprehensive DevSecOps platform, deeply integrated with Git, which is extensively used for AI engineering. It provides Git-based version control for code, data, models, and prompt scripts, alongside robust CI/CD capabilities for MLOps, experiment tracking, and model deployment.

  • Iterative.ai (DVC, CML)

    Iterative.ai develops DVC (Data Version Control) and CML (Continuous Machine Learning), which extend Git's capabilities for versioning large datasets and machine learning models. These tools enable reproducible AI engineering by allowing data, models, and prompt-related code to be managed and tracked alongside Git repositories.

  • Hugging Face

    Hugging Face utilizes Git and Git LFS extensively for its Hub, which serves as a central repository for sharing and versioning AI models, datasets, and 'Spaces' (interactive ML apps). This Git-centric approach is critical for collaboration, reproducibility, and managing various artifacts in AI engineering, including prompt templates and evaluations.

  • Databricks (MLflow)

    Databricks is behind MLflow, an open-source platform for managing the end-to-end machine learning lifecycle. MLflow integrates with Git for source code versioning, linking experiments, models, and prompt engineering efforts to specific code commits for reproducibility and tracking.

  • Weights & Biases (W&B)

    Weights & Biases provides an MLOps platform for experiment tracking, model versioning, and collaboration. It deeply integrates with Git to link machine learning experiments and prompt engineering iterations to specific code commits, ensuring traceability and reproducibility in AI development.

  • Comet ML

    Comet ML offers an MLOps platform for experiment tracking, model production monitoring, and dataset versioning. It integrates with Git to automatically capture code versions, enabling AI engineers and prompt designers to track changes, reproduce results, and collaborate effectively.

RELATED TERMS IN MLOPS & DEPLOYMENT