용어집

비최대 억제(NMS)

물체 감지를 위한 비최대 억제(NMS)에 대해 알아보세요. 결과를 개선하고 정확도를 높이며 YOLO 과 같은 AI 애플리케이션을 지원하는 방법을 알아보세요.

YOLO 모델을 Ultralytics HUB로 간단히
훈련

자세히 알아보기

Non-Maximum Suppression (NMS) is a crucial post-processing technique used widely in computer vision (CV), especially within object detection pipelines. Its main role is to refine the raw output generated by detection models, which often identify multiple, overlapping bounding boxes for the same object instance. By intelligently filtering these redundant boxes, NMS ensures that each distinct object in an image or video frame is represented by a single, optimal bounding box. This significantly improves the clarity and accuracy of the final detection results, making them more useful for subsequent tasks.

비 최대 억제 작동 방식

Object detection models, such as various Ultralytics YOLO versions, typically scan an image and propose numerous potential bounding boxes around detected objects. Each proposed box comes with a confidence score, indicating the model's certainty that the box contains an object and belongs to a specific class. NMS operates by systematically reducing these proposals based on their confidence scores and spatial overlap.

The process generally follows these steps:

  1. All proposed bounding boxes are sorted based on their confidence scores, usually in descending order.
  2. The bounding box with the highest confidence score is selected as a definitive detection.
  3. All other bounding boxes that have a significant overlap with this selected box are suppressed or removed. The overlap is measured using the Intersection over Union (IoU) metric, and suppression occurs if the IoU exceeds a predefined threshold (e.g., 0.5). A detailed explanation of this core concept can be found in resources like PyImageSearch's NMS guide.
  4. The process repeats iteratively: the next highest-scoring box among the remaining ones is selected, and overlapping boxes are suppressed.
  5. This continues until all boxes have been either selected as final detections or suppressed.

This ensures that only the most confident, non-overlapping boxes remain, providing a much cleaner and more interpretable output, as visualized in many computer vision tutorials.

AI 및 머신 러닝의 중요성

In the broader fields of Artificial Intelligence (AI) and Machine Learning (ML), NMS is fundamental for achieving reliable object detection performance. Without NMS, the output of a detector like YOLO11 would be cluttered with multiple boxes for single objects. This redundancy can lead to errors in downstream applications, such as counting objects (object counting guide), object tracking, or complex scene understanding in robotics.

By eliminating these redundant detections (often contributing to false positives), NMS significantly enhances the precision of the model's predictions. This refinement is critical for applications demanding high reliability and accuracy. The impact of NMS is reflected in evaluation metrics like Mean Average Precision (mAP), which are typically calculated after NMS has been applied, as detailed in the YOLO Performance Metrics guide.

실제 애플리케이션

NMS is a cornerstone technology enabling numerous practical AI applications:

관련 기술과의 비교

NMS is specifically a post-processing step applied after an object detection model has generated its initial set of candidate bounding boxes. It should not be confused with the detection architecture itself, such as the difference between anchor-based detectors and anchor-free detectors. These architectures define how potential boxes are proposed, while NMS refines these proposals.

Interestingly, the computational cost and potential bottlenecks associated with NMS have spurred research into NMS-free object detectors. Models like YOLOv10 integrate mechanisms during training (like consistent dual assignments) to inherently avoid predicting redundant boxes, aiming to reduce inference latency and enable truly end-to-end detection (YOLOv10 NMS-free approach). This contrasts with traditional approaches like Ultralytics YOLOv8 or YOLOv5, where NMS remains a standard and essential part of the inference pipeline. You can explore technical comparisons, such as YOLOv10 vs YOLOv8, in our documentation. Variants like Soft-NMS (paper on Soft-NMS) offer alternative approaches that decay the scores of overlapping boxes instead of eliminating them entirely.

Ultralytics 도구와 통합

NMS is seamlessly integrated within the Ultralytics ecosystem. Ultralytics YOLO models automatically apply NMS during the prediction (predict)validation (val) modes, ensuring users receive clean and accurate detection outputs by default. The parameters controlling NMS behavior (like the IoU threshold and confidence threshold) can often be tuned for specific application needs.

Platforms like Ultralytics HUB further abstract these details, allowing users to train models (cloud training guide) and deploy them where NMS is handled automatically as part of the optimized pipeline. This integration ensures that users, regardless of their deep technical expertise in MLOps, can benefit from state-of-the-art object detection results for various computer vision tasks. The specific implementation details within the Ultralytics framework can be explored in the Ultralytics utilities reference. For more definitions, check out the main Ultralytics Glossary.

모두 보기