YOLOv5 Architecture and Algorithm for Object Detection
Architectural Components
Neck (PANet and SPPF): YOLOv5's neck consists of a PANet (Path Aggregation Network)
structure combined with a Spatial Pyramid Pooling - Fast (SPPF) block. The SPPF block
sequentially applies three 5x5 max-pooling operations to simulate larger receptive fields
efficiently. PANet fuses features across multiple scales both top-down and bottom-up,
enhancing detection across object sizes. Feature maps from the backbone are upsampled
and concatenated in FPN style along the top-down path, then downsampled again along a
bottom-up path so that fine-grained localization detail from the high-resolution levels also
reaches the deeper detection layers.
Head (Detection Layer): The head is responsible for producing final predictions across three
different scales (P3, P4, P5). Each scale corresponds to detecting small, medium, and large
objects respectively. Each cell predicts bounding boxes using predefined anchor boxes.
YOLOv5 uses a total of 9 anchors (3 per scale), and each prediction includes 4 box
coordinates, 1 objectness score, and N class scores (N+5 outputs per anchor). The decoding
formulas let the predicted box center shift slightly beyond its own grid cell and bound the
predicted width and height to at most four times the anchor size, keeping boxes reasonably
sized while still covering objects near cell borders.
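For concreteness, a minimal sketch of this decoding for a single scale is shown below; the tensor names, shapes, and the helper function are illustrative rather than the repository's exact code.

```python
import torch

def decode_yolov5(raw, grid_xy, anchor_wh, stride):
    """Decode raw head outputs for one scale into pixel-space (cx, cy, w, h) boxes.

    raw:       (..., 4+) tensor holding tx, ty, tw, th (plus objectness/class logits)
    grid_xy:   (..., 2) integer cell coordinates of each prediction
    anchor_wh: (..., 2) anchor width/height in pixels for this scale
    stride:    scalar stride of the scale (8, 16, or 32)
    """
    p = raw.sigmoid()
    # Center offsets span (-0.5, 1.5), so a box center may drift into a neighboring cell.
    xy = (p[..., 0:2] * 2.0 - 0.5 + grid_xy) * stride
    # Width/height are bounded to (0, 4) times the anchor, keeping box sizes reasonable.
    wh = (p[..., 2:4] * 2.0) ** 2 * anchor_wh
    return torch.cat((xy, wh), dim=-1)
```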
Training Process
Loss Functions: YOLOv5 uses a composite loss function made of three main components: (1)
Classification loss (Binary Cross Entropy), (2) Objectness loss (Binary Cross Entropy), and (3)
Localization loss (CIoU loss - Complete Intersection over Union). Each scale’s loss is weighted
differently to prioritize small object detection. The total loss guides the optimization during
backpropagation.
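Schematically, the weighted combination can be sketched as below; the gain values and the per-scale objectness balance mirror YOLOv5's default hyperparameters but should be read as illustrative assumptions, not exact repository code.

```python
def total_loss(l_box, l_obj_per_scale, l_cls, batch_size,
               box_gain=0.05, obj_gain=1.0, cls_gain=0.5,
               balance=(4.0, 1.0, 0.4)):
    # Objectness loss is balanced per scale (P3, P4, P5); the small-object
    # scale (P3) receives the largest weight.
    l_obj = sum(b * l for b, l in zip(balance, l_obj_per_scale))
    # CIoU box loss + BCE objectness loss + BCE classification loss,
    # each scaled by its gain and by the batch size.
    return (box_gain * l_box + obj_gain * l_obj + cls_gain * l_cls) * batch_size
```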
Learning Process: During training, the model performs a forward pass by feeding input
images through the backbone, neck, and head to generate predictions. These predictions are
then compared with the ground truth labels using the defined loss functions. The error (loss)
is propagated backward using backpropagation, and the weights of the neural network are
updated using an optimizer such as SGD or Adam. YOLOv5 often benefits from transfer
learning, where pre-trained weights (e.g., trained on COCO) are fine-tuned on the target
dataset.
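The loop below is a generic sketch of one such training epoch; model, compute_loss, and the data loader are placeholders rather than YOLOv5 internals, and the SGD settings shown are only indicative.

```python
import torch

def train_one_epoch(model, loader, compute_loss, device="cuda"):
    # Illustrative optimizer settings; YOLOv5 also supports Adam variants.
    optimizer = torch.optim.SGD(model.parameters(), lr=0.01,
                                momentum=0.937, weight_decay=5e-4, nesterov=True)
    model.train()
    for images, targets in loader:
        preds = model(images.to(device))                 # forward pass: backbone -> neck -> head
        loss = compute_loss(preds, targets.to(device))   # BCE + CIoU composite loss
        optimizer.zero_grad()
        loss.backward()                                  # backpropagate the error
        optimizer.step()                                 # update the weights
```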
Anchor Optimization and Target Assignment: Before training starts, YOLOv5 automatically
adjusts anchor boxes using the AutoAnchor algorithm to match the object sizes in the
dataset. During training, each ground-truth box is assigned to one or more anchors at
different scales; positive samples are selected based on anchor-to-box size ratios and on the
proximity of the box center to neighboring grid cells, with the remaining predictions treated
as negatives.
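The size-ratio test can be sketched as follows; the tensor layout is assumed for illustration, and the threshold corresponds to YOLOv5's anchor_t hyperparameter (default 4.0).

```python
import torch

def anchor_matches(gt_wh, anchor_wh, anchor_t=4.0):
    """Boolean mask of which anchors each ground-truth box may be assigned to.

    gt_wh:     (N, 2) ground-truth widths/heights
    anchor_wh: (A, 2) anchor widths/heights for one scale
    """
    r = gt_wh[:, None, :] / anchor_wh[None, :, :]     # (N, A, 2) size ratios
    worst = torch.max(r, 1.0 / r).max(dim=2).values   # worst mismatch over w and h
    return worst < anchor_t                           # (N, A) positive anchor pairs
```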
Model Evaluation During Training: The training pipeline includes periodic evaluation on a
validation set using metrics like Precision, Recall, and mean Average Precision (mAP@0.5
and mAP@0.5:0.95). YOLOv5 selects the best-performing weights based on the highest
validation mAP.
Grid Flexibility in Box Assignment: Because the decoded box center can shift slightly beyond
its own cell, each ground-truth box can be matched not only to the cell containing its center
but also to the nearest neighboring cells, improving detection of objects whose centers lie
near cell borders.
Model Variants
YOLOv5 is released in multiple size variants, allowing trade-offs between speed and
accuracy:
YOLOv5n (nano): With a depth multiplier of 0.33 and a width multiplier of 0.25, this
is the smallest YOLOv5 model. It has approximately 1.9 million parameters, making it
extremely lightweight. It offers a speed advantage for mobile and embedded devices,
though its accuracy is lower compared to larger models.
YOLOv5s (small): This version has a depth multiplier of 0.33 and a width multiplier of
0.50, totaling around 7.2 million parameters. It is capable of detecting small objects
and is a popular choice for real-time applications due to its balanced speed and
accuracy.
YOLOv5m (medium): With a depth multiplier of 0.67 and width multiplier of 0.75,
this model contains approximately 21.2 million parameters. It runs slower than
YOLOv5s but scores roughly 8 COCO mAP points higher (e.g., YOLOv5s ~37.4 mAP
vs. YOLOv5m ~45.4 mAP).
YOLOv5l (large): Defined with a depth multiplier of 1.0 and width multiplier of 1.0,
YOLOv5l has around 46.5 million parameters. Due to its greater depth and width, it
may fall below real-time speeds even on mid-range GPUs, but provides higher
accuracy (COCO mAP ~49.0).
YOLOv5x (x-large): The largest variant, with a depth multiplier of 1.33 and width
multiplier of 1.25, reaching approximately 86.7 million parameters. It achieves the
highest accuracy (COCO mAP ~50.7), but also requires the most computational
resources. As such, it tends to be slower in real-time applications.
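The variants above can be pulled directly from PyTorch Hub; the snippet below is illustrative (it downloads pretrained weights on first use) and simply compares parameter counts.

```python
import torch

for name in ("yolov5n", "yolov5s", "yolov5m"):
    model = torch.hub.load("ultralytics/yolov5", name, pretrained=True)
    n_params = sum(p.numel() for p in model.parameters())
    print(f"{name}: {n_params / 1e6:.1f}M parameters")
```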
Inference Pipeline
1. Preprocessing: Input image is resized and letterboxed to preserve aspect ratio, then
normalized.
2. Forward Pass: The image passes through the backbone, neck, and head producing
predictions.
3. Decoding: Raw head outputs are converted into box coordinates, objectness scores, and
class probabilities.
4. Non-Maximum Suppression (NMS): Detections below the confidence threshold are
discarded, and overlapping boxes are suppressed by IoU.
5. Rescaling and Output: Final boxes are mapped back to the original image size.
YOLOv5 inference is optimized for real-time performance, capable of running at 30+ FPS
depending on model size and hardware.
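As a minimal end-to-end example, the PyTorch Hub interface wraps all five steps; the image path and thresholds below are placeholders.

```python
import torch

model = torch.hub.load("ultralytics/yolov5", "yolov5s", pretrained=True)
model.conf = 0.25          # confidence threshold applied before NMS
model.iou = 0.45           # IoU threshold used by NMS
results = model("image.jpg", size=640)   # letterbox, forward pass, decode, NMS, rescale
print(results.xyxy[0])     # boxes in original-image pixels: x1, y1, x2, y2, conf, class
```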
Optimization Techniques
Half Precision (FP16) Computation: YOLOv5 supports mixed-precision (AMP) training, in
which most activations and computations use 16-bit floating point while master weights are
kept in FP32. This significantly speeds up training and inference by leveraging the Tensor
Cores available in modern GPUs. Similarly, during inference, running the model in FP16
mode (e.g., using model.half()) can reduce memory usage and increase throughput. When
exporting the model to TorchScript or TensorRT using Ultralytics tools, setting half=True
converts the model weights into half precision. This typically results in only a minor drop in
accuracy while achieving substantial speed gains (1.5–2x), particularly on compatible GPUs.
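A minimal sketch of FP16 inference is shown below; it loads the raw model without the AutoShape wrapper, casts weights and input to half precision, and assumes a CUDA device (wrapper behavior can differ between YOLOv5 releases).

```python
import torch

model = torch.hub.load("ultralytics/yolov5", "yolov5s", autoshape=False)
model = model.cuda().eval().half()                        # cast weights to FP16
img = torch.zeros(1, 3, 640, 640, device="cuda").half()   # dummy letterboxed input
with torch.no_grad():
    pred = model(img)                                      # FP16 forward pass
```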
Model Compression via Quantization (INT8): A more aggressive optimization
approach is converting the model to 8-bit integer format through quantization.
YOLOv5 supports Post-Training Quantization (PTQ) when exporting to formats like
TensorRT or TFLite. With the int8=True option, both model weights and optionally
activations are quantized to INT8. This considerably reduces the model size and
accelerates inference on supported hardware (e.g., NVIDIA GPUs with TensorRT, Intel
VNNI, ARM NPUs). For instance, an INT8-quantized model can achieve 2–3x speedup
on CPUs and reduce memory usage by almost half. While this may slightly reduce
accuracy, with proper calibration, the loss is generally minimal (typically just a few
percentage points). Ultralytics documentation emphasizes that INT8 quantization
offers major performance gains with only minor accuracy trade-offs, making it
suitable for deployment on edge devices and latency-sensitive environments.
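A hedged sketch of such an export is shown below; it assumes you are working inside a clone of the ultralytics/yolov5 repository so that export.py is importable, and the parameter names follow that script's run() interface, which may differ between versions.

```python
import export  # ultralytics/yolov5's export.py, importable from the repo root

export.run(
    weights="yolov5s.pt",    # FP32 checkpoint to convert
    include=("tflite",),     # target format; TensorRT ("engine") also supports INT8
    int8=True,               # enable post-training INT8 quantization
    data="coco128.yaml",     # small representative dataset used for calibration
    imgsz=(640, 640),
)
```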
Model Pruning: YOLOv5 models can be pruned during or after training to reduce
complexity. Pruning involves removing less important weights (e.g., filters with small
magnitude values), effectively making the network sparser. Experiments with YOLOv5
show that pruning up to 30% of the parameters results in only a small drop in
accuracy. For example, pruning 30% of the YOLOv5x model reduced its mAP from
0.507 to 0.489 (a ~3.6% relative drop) while keeping inference speed nearly unchanged. In
YOLOv5, pruning is typically performed by analyzing the scale (gamma) parameters in
BatchNorm layers and removing filters below a certain threshold. The pruned model
is then fine-tuned to recover any lost accuracy. This technique is especially valuable
for deployment in environments with limited memory and compute resources, such
as embedded systems.
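As an illustration of magnitude-based pruning in plain PyTorch (a related but simpler technique than the BatchNorm-gamma channel pruning described above), the sketch below zeroes the smallest 30% of weights in every Conv2d layer; the pruned model would still need fine-tuning afterwards.

```python
import torch.nn as nn
import torch.nn.utils.prune as prune

def prune_conv_weights(model, amount=0.3):
    for m in model.modules():
        if isinstance(m, nn.Conv2d):
            prune.l1_unstructured(m, name="weight", amount=amount)  # zero smallest 30%
            prune.remove(m, "weight")  # bake the pruning mask into the weight tensor
```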
Layer Fusion and Other Fine-Tuning: YOLOv5 can be further optimized with minor
pre-inference enhancements. For example, calling model.fuse() merges every Conv2d
+ BatchNorm pair into a single Conv layer, eliminating one memory access and
compute step per pair. Additionally, compiling the model with PyTorch JIT, or
exporting to ONNX Runtime or TensorRT, can yield backend-specific speed
improvements. Input size optimization is another useful strategy: training the model
with smaller resolutions like 512 or 416 instead of 640 can significantly boost
inference speed with only a slight decrease in accuracy. Therefore, adjusting input
resolution and model variant based on application requirements helps achieve the
best balance between speed and performance.
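The sketch below illustrates these tweaks together (a guarded fuse() call plus a smaller input); depending on the YOLOv5 release, the hub wrapper may already fuse layers internally, so treat this as indicative rather than exact usage.

```python
import torch

model = torch.hub.load("ultralytics/yolov5", "yolov5s", autoshape=False).eval()
if hasattr(model, "fuse"):
    model.fuse()                      # merge each Conv2d + BatchNorm2d pair
img = torch.zeros(1, 3, 512, 512)     # 512x512 input instead of the default 640x640
with torch.no_grad():
    pred = model(img)
```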
These optimizations allow YOLOv5 to be deployed on edge devices, mobile platforms, and
GPUs with varying capabilities.
Architectural Differences Between YOLOv5 and YOLOv8
YOLOv8 replaces YOLOv5's C3 block with a C2f module, a CSP-style bottleneck with two
convolutions and additional skip connections, allowing for more efficient feature reuse.
YOLOv8 uses a decoupled, anchor-free head that predicts classification and box regression in
separate branches (dropping the separate objectness prediction and predefined anchor
boxes), improving detection accuracy.
It also includes native support for task-specific variants (e.g., classification,
segmentation, pose estimation) and adopts a more modern and simplified PyTorch
implementation.
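For context, the ultralytics package exposes these task variants through a single Python API; the snippet below is illustrative (pip install ultralytics; the image path is a placeholder).

```python
from ultralytics import YOLO

det = YOLO("yolov8n.pt")         # object detection
seg = YOLO("yolov8n-seg.pt")     # instance segmentation
pose = YOLO("yolov8n-pose.pt")   # pose estimation
results = det("image.jpg")       # list of Results objects (boxes, confidences, class ids)
```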
Performance Comparison
The attached figure plots COCO mAP50-95 (mean Average Precision) against latency on an
NVIDIA T4 GPU using TensorRT 10 in FP16 mode. This offers a clear view of the accuracy-
latency trade-off among different YOLO models.
Key observations:
The curve representing YOLOv8 is higher and steeper than YOLOv5, indicating a
better accuracy-to-latency efficiency. In other words, YOLOv8 provides higher
detection accuracy for a given inference time.
YOLOv5n and YOLOv8n both offer extremely low latency (~2ms), but YOLOv8n
exhibits a slight accuracy advantage (~38.5 vs. 37 mAP).
As we move from nano to x-large variants, the performance gap widens, showcasing
the scalability and architectural efficiency of YOLOv8.
The chart also includes other models like YOLOv6, PP-YOLOE, and EfficientDet, but
YOLOv8 achieves state-of-the-art performance on the COCO dataset while
maintaining real-time inference capability.
Summary
YOLOv8 significantly enhances the YOLO architecture through its C2f feature-fusion blocks,
its anchor-free decoupled head, and its native multi-task support.
While YOLOv5 remains highly popular due to its stability and extensive deployment tools,
YOLOv8 is technically superior for new projects prioritizing detection accuracy, model
generalization, and modularity.