용어집

Kubernetes

확장 가능한 모델 배포, 분산 교육, 효율적인 리소스 관리를 통해 Kubernetes가 어떻게 AI/ML 워크로드를 간소화하는지 알아보세요.

Kubernetes, often abbreviated as K8s, is an open-source platform designed to automate the deployment, scaling, and management of containerized applications. Originally developed by Google and now maintained by the Cloud Native Computing Foundation (CNCF), Kubernetes provides a robust framework for running distributed systems resiliently. For those working in Artificial Intelligence (AI) and Machine Learning (ML), Kubernetes offers powerful tools to manage the complex lifecycle of models, from training to deployment and inference. It helps bridge the gap between developing ML models and reliably running them in production environments.

Core Concepts Simplified

Kubernetes orchestrates containers, which are lightweight, standalone packages containing software and its dependencies. Key concepts include:

Pods: The smallest deployable units in Kubernetes, typically holding one or more containers that share resources and network. Think of a Pod as a wrapper around your ML application or inference server container.
Nodes: Worker machines (virtual or physical) where Pods run. Kubernetes manages distributing Pods across available Nodes.
Services: An abstraction that defines a logical set of Pods and a policy to access them, often providing a stable IP address or DNS name for dynamic Pods. Essential for exposing ML inference endpoints.
Deployments: Describe the desired state for your application, managing ReplicaSets (groups of identical Pods) to ensure availability and handle updates. Useful for rolling out new model versions without downtime.

Understanding these building blocks helps in designing scalable and resilient ML systems.

AI 및 머신 러닝의 관련성

Kubernetes has become a cornerstone of modern Machine Learning Operations (MLOps) due to several advantages:

Scalability: ML tasks like training large models or serving inference requests often have fluctuating resource demands. Kubernetes can automatically scale the number of containers (Pods) up or down based on load, ensuring efficient use of resources like GPUs.
Resource Management: It allows fine-grained control over CPU and memory allocation for containers, preventing resource contention and ensuring performance, especially critical when managing expensive GPU resources across multiple experiments or services.
Portability and Consistency: Kubernetes provides a consistent environment across different infrastructures, whether on-premises servers or various cloud computing platforms like Amazon EKS, Google GKE, or Azure AKS. This simplifies moving ML workflows between development, testing, and production. You can often start with a Docker setup and scale up with Kubernetes.
Automation and Orchestration: It automates complex tasks like service discovery, load balancing, self-healing (restarting failed containers), and configuration management, reducing manual overhead for ML teams.

실제 AI/ML 애플리케이션

Distributed Model Training: Training large deep learning (DL) models, such as complex Ultralytics YOLO variants for object detection, often requires immense computational power. Kubernetes can manage a cluster of machines for distributed training using frameworks like Kubeflow or native integrations with PyTorch or TensorFlow. It handles scheduling training jobs, managing data access, and allocating GPUs efficiently across nodes.
Scalable Inference Services: Deploying ML models like those for real-time inference requires high availability and low latency. Kubernetes can host inference servers (e.g., using NVIDIA Triton Inference Server, which integrates with Ultralytics models - see the Triton guide) behind a load balancer. It automatically scales the number of inference server Pods based on incoming traffic, ensuring responsiveness even during peak loads for tasks like image classification or natural language processing (NLP).

쿠버네티스 대 관련 기술

Kubernetes vs. Docker: Docker is a tool for creating, shipping, and running individual containers (containerization). Kubernetes is an orchestrator for containers, managing potentially thousands of containers across many machines. They work together: you typically build container images with Docker and then deploy and manage them using Kubernetes. Check the Docker Quickstart guide for container basics.
Kubernetes vs. Serverless Computing: Serverless platforms (like AWS Lambda or Google Cloud Functions) abstract away server management entirely, focusing on event-driven functions. Kubernetes provides more control over the underlying infrastructure and is better suited for long-running applications or complex stateful services, though serverless frameworks can run on Kubernetes (e.g., Knative).

도구 및 에코시스템

The Kubernetes ecosystem includes many tools to simplify management:

Helm: A package manager for Kubernetes, helping define, install, and upgrade complex applications.
Prometheus & Grafana: Popular open-source tools for monitoring Kubernetes clusters and applications.
Cloud Provider Integrations: Managed Kubernetes services (EKS, GKE, AKS) simplify cluster setup and maintenance.
ML Platforms: Tools like Kubeflow build on Kubernetes to provide ML-specific workflows. Platforms like Ultralytics HUB aim to simplify the deployment pipeline, sometimes abstracting Kubernetes complexities for easier model deployment.

Kubernetes provides a powerful foundation for building, deploying, and managing scalable and reliable AI/ML applications in diverse environments, making it a crucial skill in the MLOps landscape.

Kubernetes

YOLO 모델을 Ultralytics HUB로 간단히
훈련

혁신을 지원하는 유연한 엔터프라이즈 라이선싱 솔루션

다음을 사용하여 몇 초 만에 AI 모델을 훈련하세요. Ultralytics YOLO

Ultralytics HUB로 간단히 YOLO 모델 교육

Core Concepts Simplified

AI 및 머신 러닝의 관련성

실제 AI/ML 애플리케이션

쿠버네티스 대 관련 기술

도구 및 에코시스템

블로그 더 보기

Ultralytics 커뮤니티 가입하기

Kubernetes

YOLO 모델을 Ultralytics HUB로 간단히훈련

혁신을 지원하는 유연한 엔터프라이즈 라이선싱 솔루션

다음을 사용하여 몇 초 만에 AI 모델을 훈련하세요. Ultralytics YOLO

Ultralytics HUB로 간단히 YOLO 모델 교육

Core Concepts Simplified

AI 및 머신 러닝의 관련성

실제 AI/ML 애플리케이션

쿠버네티스 대 관련 기술

도구 및 에코시스템

블로그 더 보기

Ultralytics 커뮤니티 가입하기

YOLO 모델을 Ultralytics HUB로 간단히
훈련