Green check
Link copied to clipboard

The benefits of Ultralytics YOLO11 being an anchor-free detector

Understand how Ultralytics YOLO11 supports anchor-free object detection and the benefits that this model architecture brings to various applications.

If we take a look back at the history of Vision AI models, the concept of object detection - a core computer vision task that involves identifying and locating objects within an image or video - has been around since the 1960s. However, the key reason for its significance in cutting-edge innovations today is that object detection techniques and model architectures have advanced and rapidly improved since then. 

In a previous article, we discussed the evolution of object detection and the road that has led to the Ultralytics YOLO models. Today, we’ll focus on exploring a more specific milestone in this journey: the jump from anchor-based detectors to anchor-free detectors. 

Anchor-based detectors rely on predefined boxes, called "anchors," to predict where objects are in an image. In contrast, anchor-free detectors skip these predefined boxes and instead predict object locations directly.

While this shift may seem like a simple, logical change, it has actually led to major improvements in object detection accuracy and efficiency. In this article, we’ll understand how anchor-free detectors have reshaped computer vision through advancements like Ultralytics YOLO11.

What are Anchor-Based Detectors?

Anchor-based detectors use predefined boxes, known as anchors, to help locate objects in an image. Think of these anchors as a grid of boxes of different sizes and shapes placed over the image. The model then adjusts these boxes to fit the objects it detects. For example, if the model identifies a car, it will modify the anchor box to match the car’s position and size more accurately.

Each anchor is associated with a possible object in the image, and during training, the model learns how to tweak the anchor boxes to better match the object’s location, size, and aspect ratio. This allows the model to detect objects at different scales and orientations. However, selecting the right set of anchor boxes can be time-consuming, and the process of fine-tuning them can be prone to errors.

Fig 1. What is an Anchor Box?

While anchor-based detectors, like YOLOv4, have worked well in many applications, they do have some drawbacks. For example, anchor boxes don’t always align well with objects of different shapes or sizes, making it harder for the model to detect small or irregularly shaped objects. The process of selecting and fine-tuning anchor box sizes can also be time-consuming and requires a lot of manual effort. Aside from this, anchor-based models often struggle with detecting objects that are occluded or overlapping, as the predefined boxes may not adapt well to these more complex scenarios.

The Shift to Anchor-Free Object Detection

Anchor-free detectors started gaining attention in 2018 with models like CornerNet and CenterNet, which took a fresh approach to object detection by eliminating the need for predefined anchor boxes. Unlike traditional models that rely on anchor boxes of different sizes and shapes to predict where objects are, anchor-free models predict the locations of objects directly. They focus on key points or features of the object, like the center, which simplifies the detection process and makes it faster and more accurate.

Here’s how anchor-free models generally work:

  • Keypoint detection: Instead of using predefined boxes, some models identify important points on an object, like the center or specific corners. These key points help models figure out where the object is and how big it is.
  • Center prediction: Some models focus on predicting the center of an object. Once the center is located, the model can predict the size and position of the entire object from there.
  • Heatmap regression: Many anchor-free models use heatmaps, where each pixel represents a possible location of an object. Stronger heatmap values indicate higher confidence that an object is present at that point.
Fig 2. Anchor-Based Detection Vs. Anchor-Free Detection.

Because anchor-free models don’t rely on anchor boxes, they have a simpler design. This means they are more computationally efficient. Since they don’t have to process multiple anchor boxes, they can detect objects more quickly - an important advantage in real-time applications like autonomous driving and video surveillance. 

Anchor-free models are also much better at handling small, irregular, or occluded objects. Since they focus on detecting key points rather than trying to fit anchor boxes, they are much more flexible. This enables them to detect objects accurately in cluttered or complex environments where anchor-based models may fail.

Ultralytics YOLO11: An Anchor-Free Detector

Originally designed for speed and efficiency, YOLO models have gradually shifted from anchor-based methods to anchor-free detection, making models like YOLO11 faster, more flexible, and better suited for a wide range of real-time applications.

Here’s a quick look at how the anchor-free design has evolved across different YOLO versions:

  • Ultralytics YOLOv5u: Introduced the Anchor-Free Split Ultralytics Head, removing the need for predefined anchor boxes. Instead, the model directly predicts where objects are in an image, simplifying the process and improving flexibility and speed.
  • YOLOv6: A new method called Anchor-Aided Training (AAT) was used, where anchors were used only during training. This allowed the model to benefit from the structure of anchor-based methods during training, while still using anchor-free detection at runtime for better speed and adaptability.
  • Ultralytics YOLOv8: Fully switched to anchor-free detection by using the Anchor-Free Split Ultralytics Head. This made the model faster and more accurate, especially for small or oddly shaped objects that don’t fit well with anchor boxes.
  • Ultralytics YOLO11: Builds on YOLOv8’s anchor-free approach, optimizing detection even further by eliminating anchor boxes entirely. This results in faster, more accurate detection for real-time applications like animal behavior monitoring and retail analytics.
Fig 3. Comparing Ultralytics YOLOv8 and Ultralytics YOLO11.

Real-World Applications of YOLO11

A great example of the benefits of anchor-free detection using YOLO11 is in autonomous vehicles. In self-driving cars, detecting pedestrians, other vehicles, and obstacles quickly and accurately is crucial for safety. YOLO11's anchor-free approach simplifies the detection process by directly predicting the key points of objects, like the center of a pedestrian or the boundaries of another vehicle, rather than relying on predefined anchor boxes. 

Fig 4. Benefits of Anchor-Free Detection in YOLO11 (Image By Author).

YOLO11 doesn't need to adjust or fit a grid of anchors to each object, which can be computationally expensive and slow. Instead, it focuses on key features, making it faster and more efficient. For example, when a pedestrian steps into the vehicle's path, YOLO11 can quickly identify its location by pinpointing key points, even if the person is partially hidden or moving. The ability to adapt to varying shapes and sizes without anchor boxes allows YOLO11 to detect objects more reliably and at higher speeds, which is vital for real-time decision-making in autonomous driving systems.

Other applications where YOLO11’s anchor-free abilities really stand out include:

  • Retail and inventory management: YOLO11 makes it easier to monitor products on shelves, even when they’re stacked or partially blocked. This helps with quicker, more accurate inventory tracking and reduces errors.
  • Medical imaging: YOLO11 is also effective in healthcare, where it can detect tumors or other abnormalities in medical scans. Its ability to work with irregularly shaped objects helps improve accuracy in diagnosing complex conditions.
  • Wildlife monitoring: In wildlife research, YOLO11 can track animals in dense forests or tough terrain, helping researchers monitor behavior or protect endangered species.
  • Sports analytics: YOLO11 can be used to track players, ball movements, or other elements in real-time during sports events to provide valuable insights for teams, coaches, and broadcasters.

Considerations to Make When Working With Anchor-Free Models

While anchor-free models like YOLO11 offer many advantages, they do come with certain limitations. One of the main practical considerations to make is that even anchor-free models can struggle with occlusions or highly overlapping objects. The rationale behind this is that computer vision aims to replicate human vision, and just as we sometimes struggle to identify occluded objects, AI models can face similar challenges.

Another interesting factor is related to the processing of model predictions. Although the architecture of anchor-free models is simpler than anchor-based, additional refinement becomes necessary in certain cases. For example, post-processing techniques like non-maximum suppression (NMS) may be required to clean up overlapping predictions or improve accuracy in crowded scenes.

Anchoring Down on the Future of AI with YOLO11

The shift from anchor-based to anchor-free detection has been a significant advancement in object detection. With anchor-free models like YOLO11, the process is simplified, leading to improvements in both accuracy and speed.

Through YOLO11, we’ve seen how anchor-free object detection excels in real-time applications like self-driving cars, video surveillance, and medical imaging, where fast and precise detection is crucial. This approach enables YOLO11 to adapt more easily to varying object sizes and complex scenes, providing better performance across diverse environments.

As computer vision continues to evolve, object detection will only become faster, more flexible, and more efficient.

Explore our GitHub repository and join our engaging community to stay updated on all things AI. Check out how Vision AI is impacting sectors like manufacturing and agriculture.

Facebook logoTwitter logoLinkedIn logoCopy-link symbol

Read more in this category

Let’s build the future
of AI together!

Begin your journey with the future of machine learning