Glossary

GPU (Graphics Processing Unit)

Discover how GPUs revolutionize AI and machine learning by accelerating deep learning, optimizing workflows, and enabling real-world applications.

Train YOLO models simply
with Ultralytics HUB

Learn more

A Graphics Processing Unit (GPU) is a specialized electronic circuit designed to rapidly manipulate and alter memory to accelerate the creation of images intended for output to a display device. Originally developed for rendering graphics in video games and professional design applications, GPUs have become fundamental components in the fields of Artificial Intelligence (AI) and Machine Learning (ML). Their architecture, featuring thousands of processing cores, allows them to perform many calculations simultaneously, making them exceptionally efficient for the complex mathematical operations required by deep learning algorithms and enabling rapid real-time inference. You can explore the history of the GPU to understand its evolution.

Importance in AI and Machine Learning

The parallel processing power of GPUs has been a key driver behind the recent advancements in AI. Training deep neural networks involves vast amounts of data and computationally intensive operations like matrix multiplications. GPUs excel at these tasks, significantly reducing the time required to train complex models compared to traditional Central Processing Units (CPUs). This acceleration allows researchers and developers to iterate faster, experiment with larger datasets, and tackle problems like object detection and image segmentation with greater accuracy and speed. For example, Ultralytics YOLO models heavily rely on GPUs to achieve high performance in real-time vision tasks. Access to powerful GPUs, often via cloud computing platforms or dedicated hardware, is crucial for modern AI development.

Key Differences From CPUs and TPUs

While often working together in a system, GPUs, CPUs, and Tensor Processing Units (TPUs) have distinct architectures and optimal use cases:

  • CPU (Central Processing Unit): Designed for general-purpose computing, CPUs excel at sequential tasks and managing system operations. They have a few powerful cores optimized for low latency on a wide variety of workloads. See a comparison of CPU vs GPU.
  • GPU (Graphics Processing Unit): Optimized for parallel processing, GPUs have thousands of simpler cores designed to handle many tasks simultaneously. This makes them ideal for the data-parallel computations common in deep learning, graphics rendering, and high-performance computing (HPC). Major providers include NVIDIA and AMD.
  • TPU (Tensor Processing Unit): Developed by Google, TPUs are Application-Specific Integrated Circuits (ASICs) specifically designed to accelerate ML workloads, particularly those using the TensorFlow framework. They offer high performance for specific tensor operations but may be less versatile than GPUs. More details can be found on Google's TPU page.

GPUs offer a balance of high performance for parallel tasks and broad applicability, supported by mature software ecosystems like NVIDIA's CUDA and frameworks such as PyTorch. Setting up GPU environments can be simplified using tools like Docker; see the Docker Quickstart guide for details.

Real-World Applications in AI

GPUs are integral to many cutting-edge AI applications:

  • Autonomous Vehicles: GPUs process vast streams of data from cameras, LiDAR, and radar in real-time to enable perception, path planning, and decision-making for AI in self-driving cars. Companies like Tesla rely heavily on GPUs for their autonomous systems.
  • Medical Image Analysis: In healthcare, GPUs accelerate the analysis of complex medical images like MRIs, CT scans, and X-rays, aiding radiologists in detecting anomalies, segmenting tissues, and improving diagnostic accuracy. This is a key area for medical image analysis, with platforms like NVIDIA Clara providing specialized tools.

GPUs are also crucial for training models deployed on edge devices, such as those using the NVIDIA Jetson platform. Training these models often occurs on powerful GPUs, possibly using platforms like Ultralytics HUB for streamlined workflows.

Read all