// MODEL OPTIMIZATION AND PROMPT SYNTAX TERM

Kubernetes

Kubernetes is an open-source system for automating the deployment, scaling, and management of containerized applications, often used to orchestrate many Docker containers.

Kubernetes — illustration from Wikipedia
Image via Wikipedia

TECHNICAL DEFINITION

Kubernetes (K8s) is an open-source container orchestration platform designed to automate the deployment, scaling, and management of containerized AI applications and microservices, providing robust capabilities for resource scheduling, self-healing, and declarative configuration.

BACKGROUND

Gemini is a generative artificial intelligence chatbot and virtual assistant developed by Google. It is powered by the family of large language models (LLMs) of the same name, after previously being based on LaMDA and PaLM 2.

READ MORE ON WIKIPEDIA

SYNONYMS & ALIASES

  • K8s
  • container orchestrator
  • cluster manager

USAGE NOTE

Kubernetes is the de-facto standard for managing large-scale AI microservices and inference clusters in production.

RELATED TERMS IN MLOPS & DEPLOYMENT