Woodlog Inventory Optimization Using Object Detection and Object Tracking
Fig 1 CRISP-ML (Q) Methodological Framework, Outlining its Key Components and Steps Visually.
Source: Open-Source ML Workflow Tool- 360DigiTMG
II. LITERATURE REVIEW

Inventory management is essential to the success of any business and plays an important role in ensuring profitability. Small and medium-sized enterprises (SMEs), which typically operate on limited budgets, often rely on manual methods of inventory management. While this manual process is common, it can be labor intensive, error prone, and generally inefficient. The use of an inventory management system has been shown to improve the efficiency and accuracy of these methods.[4]

Manual inventory work presents many challenges. These include time-consuming inventory counts, difficulty in tracking inventory levels, increased risk of error, and increased costs associated with excess inventory. Salih et al. (2023) highlighted these problems in their study on adopting automated inventory management systems in SMEs, emphasizing the potential for such systems to streamline operations and improve accuracy in inventory tracking.[4]

In an SQL- and PHP-based implementation of an automated inventory management system, Mascarenhas et al. (2020) demonstrated the practical advantages of such technologies. Their system was designed to monitor inventory in real time, significantly reducing the manual effort required and minimizing the possibility of error.[5]

Accurate demand forecasting is another important component of effective inventory management. Inaccurate forecasts can lead to excess inventory or inventory shortages, both of which have important consequences. Albayrak et al. (2023) reviewed the applications of artificial intelligence in inventory management, noting that AI can enhance the precision of demand forecasts, thereby improving the overall efficiency of inventory management processes.[6]

Object tracking, particularly in the context of inventory management, also benefits from advances in technology. Han et al. (2004) proposed a multiple hypothesis approach for tracking multiple objects, which integrates object detection and tracking to ensure a robust and efficient tracking algorithm. This technique uses a neural network-based object detection module to identify and track objects, offering real-time updates and feedback for improved accuracy.[7]

Sun et al. (2021) introduced a Deep Affinity Network (DAN) that learns target object appearances and their affinities in pairs of video frames. This technique enhances object tracking by incorporating hierarchical feature learning, which improves the accuracy of tracking over multiple levels of abstraction.[8]

Behrendt et al. (2017) presented a deep learning approach for real-time detection and tracking with OpenCV. Their pipeline includes steps for detecting, classifying and tracking objects, with applications ranging from traffic light detection to more complex situations. Similarly, Liu et al. (2018) developed an effective fruit counting algorithm that combines deep learning, tracking, and structure from motion (SfM). Their system proved to be highly accurate in fruit counting in image sequences, and demonstrated the use of deep learning in practical experiments.[9,10]

In conclusion, the combination of automated systems and advanced tracking technology can considerably improve the efficiency and accuracy of inventory management, especially for SMEs. By decreasing manual effort and minimizing mistakes, these technologies allow companies to better manage their resources, forecast demand accurately, and ultimately enhance their overall operational efficiency.

III. METHODOLOGY AND TECHNIQUES

A. Object Detection
Object detection is a computer vision technique used to identify and locate objects in an image or video. Unlike simple image recognition, which assigns a single label to an entire image, object detection not only labels objects but also draws bounding boxes around them. This provides precise information about the location of objects within a given scene. For instance, while image recognition might label an entire image as containing a "dog," object detection would draw individual bounding boxes around each dog in the image and label each box accordingly. This allows for accurate detection of the location and number of objects present.

To leverage the capabilities of object detection techniques like YOLO (You Only Look Once) [11] and address their limitations, we propose a new workflow that combines YOLO's efficiency with additional methods [12] (Fig 2). This workflow involves a multi-step approach where YOLO's initial detections are refined using sophisticated post-processing algorithms. Through feature-based refinement and context analysis, our proposed workflow aims to improve detection accuracy, particularly for scenarios involving small objects or fine classification. In the following sections, we detail the components of this workflow and present experimental results demonstrating its effectiveness in enhancing object detection performance.

Overview of YOLO Algorithm:
YOLOv8 is an advanced version of the YOLO object detection algorithm that uses a deep neural network to analyze entire images, identifying objects with class probabilities and bounding boxes. It is faster, more accurate, and better at detecting small objects compared to traditional methods. YOLOv8 employs a convolutional neural network (CNN) to extract features, predict bounding boxes, and refine these predictions with offsets, calculating confidence by combining class probability and bounding box confidence score.

The YOLO algorithm has significantly impacted computer vision, particularly in real-time object detection, due to its speed and efficiency. It is popular in applications like video surveillance, self-driving cars, and augmented reality. One of its main advantages is the ability to process images quickly, enabling real-time detection with relatively little training data.

However, YOLO also has limitations. It may struggle with accurately detecting small objects and performing fine-grained classification. Despite these challenges, YOLO has advanced object detection and created new opportunities for computer vision research and applications. As the field evolves, improvements to YOLO and other algorithms will likely address these limitations and meet new challenges.
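The confidence calculation described in the YOLO overview can be illustrated with a short sketch. It is a simplified reading of that description, not YOLOv8's actual detection head: the input format, the function name `score_detections`, the class names, and the 0.5 threshold are all assumptions made for this example.

```python
def score_detections(raw_detections, class_names, threshold=0.5):
    """Combine box confidence with class probability, then filter by threshold.

    raw_detections: list of dicts with hypothetical keys
        "box"         -> (x1, y1, x2, y2) in pixels
        "box_conf"    -> confidence that the box contains an object
        "class_probs" -> one probability per entry in class_names
    """
    kept = []
    for det in raw_detections:
        probs = det["class_probs"]
        # Index of the most likely class for this box.
        best = max(range(len(probs)), key=probs.__getitem__)
        # Final confidence = box confidence * class probability.
        score = det["box_conf"] * probs[best]
        if score >= threshold:
            kept.append(
                {"box": det["box"], "label": class_names[best], "score": score}
            )
    return kept


# Hypothetical raw outputs for two candidate boxes.
raw = [
    {"box": (10, 20, 110, 220), "box_conf": 0.9, "class_probs": [0.8, 0.2]},
    {"box": (300, 40, 360, 90), "box_conf": 0.5, "class_probs": [0.4, 0.6]},
]
detections = score_detections(raw, ["log_stack", "other"])
for det in detections:
    print(det["label"], round(det["score"], 2), det["box"])
```

Only the first box survives the threshold here, because the second box's combined score (0.5 × 0.6) falls below 0.5. Lowering the threshold trades fewer missed objects for more false positives.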
Fig 2 ML Workflow Architecture: A comprehensive overview of the ML pipeline for Wood Logs Inventory Optimization.
Source: Open-Source ML Workflow Tool- 360DigiTMG
Online Real Time Tracking) algorithm, DeepSORT adds a deep learning-based appearance descriptor to reduce identity switches and enhance tracking efficiency. DeepSORT uses the Kalman filter to predict an object's location in the next frame and associates the detection with the predicted location using a combination of the Intersection over Union (IoU) metric and the appearance descriptor. The Kalman filter’s state is then updated using the associated detection, and the process is repeated for the subsequent frames. Fig 3 shows the overview of the DeepSORT algorithm.
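The association step described above can be sketched as follows. This is a minimal illustration under stated assumptions, not the DeepSORT implementation: the cost is a simple weighted sum of IoU distance and cosine distance between appearance vectors, matching is greedy rather than optimal, and the names `associate`, `weight`, and `max_cost` are hypothetical.

```python
import math


def iou(a, b):
    """Intersection over Union of two boxes given as (x1, y1, x2, y2)."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def cosine_distance(u, v):
    """1 - cosine similarity between two appearance vectors."""
    dot = sum(x * y for x, y in zip(u, v))
    norm = math.sqrt(sum(x * x for x in u)) * math.sqrt(sum(y * y for y in v))
    return 1.0 - dot / norm if norm > 0 else 1.0


def associate(predicted_tracks, detections, weight=0.5, max_cost=0.7):
    """Greedily match predicted track boxes to new detections.

    predicted_tracks: {track_id: (predicted_box, appearance_vector)}
    detections:       list of (box, appearance_vector)
    Returns {track_id: detection_index} for accepted matches.
    """
    costs = []
    for track_id, (t_box, t_feat) in predicted_tracks.items():
        for det_idx, (d_box, d_feat) in enumerate(detections):
            cost = weight * (1.0 - iou(t_box, d_box)) + (
                1.0 - weight
            ) * cosine_distance(t_feat, d_feat)
            costs.append((cost, track_id, det_idx))

    matches, used_dets = {}, set()
    for cost, track_id, det_idx in sorted(costs):  # cheapest pairs first
        if cost > max_cost:
            break
        if track_id not in matches and det_idx not in used_dets:
            matches[track_id] = det_idx
            used_dets.add(det_idx)
    return matches


# Two predicted tracks and two detections that have each shifted by one pixel.
tracks = {
    1: ((0, 0, 10, 10), (1.0, 0.0)),
    2: ((20, 20, 30, 30), (0.0, 1.0)),
}
dets = [((21, 21, 31, 31), (0.0, 1.0)), ((1, 0, 11, 10), (1.0, 0.0))]
matches = associate(tracks, dets)
print(matches)
```

In a full tracker, each matched detection would then be used to update the corresponding Kalman filter state, while unmatched detections would start new tracks.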
Model Evaluation (MOTA, MOTP, IDF1)
The model's performance is evaluated using metrics such as Multiple Object Tracking Accuracy (MOTA), Multiple Object Tracking Precision (MOTP), and Identity F1 Score (IDF1). These metrics assess the model's detection and tracking capabilities, ensuring it meets the required standards.

Best Model Selected and DeepSORT Tracker Updated with the IDs and Bounding Boxes
Based on the results of the analysis, the best performing model was selected. The selected model is then combined with the DeepSORT tracker, which updates the IDs and bounding boxes for each known log stack.

Bounding Boxes Along with Their Classes Shown
The system displays bounding boxes with their class labels around each detected wood log stack. This visual output helps verify the detection and tracking process.

Desired Output where Stacks are Labeled Based on When They Come into the Frame Obtained
The system tracks and registers the stacks of logs based on the time they enter the frame, following FIFO (First-In-First-Out) principles. The combination of the YOLOv8 model with DeepSORT ensures that each stack is constantly monitored and accurately identified. As the stacks enter the frame, they are assigned unique IDs and their positions are updated in real time. This approach ensures that the inventory record is maintained and that older stacks are processed before newer ones, which is key to FIFO handling.

Deployment
The fully trained and validated model, integrated with a tracking algorithm, has been successfully deployed on the Streamlit platform for real-time monitoring. This deployment currently focuses on monitoring wood log stacks using an AI-driven camera system. Future work includes extending this deployment to a warehouse environment for broader application.
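The FIFO labeling of stacks described above can be sketched in a few lines. This is an illustrative assumption about how arrival-order numbering might be kept, not the deployed code: the class name `FifoStackRegistry` and the track IDs are hypothetical, and the tracker is assumed to supply a list of active track IDs for every frame.

```python
class FifoStackRegistry:
    """Numbers log stacks in the order their track IDs first appear."""

    def __init__(self):
        self._arrival = {}  # track_id -> stack number (1 = first seen)

    def update(self, track_ids):
        """Register any new track IDs and return labels for this frame."""
        for track_id in track_ids:
            if track_id not in self._arrival:
                self._arrival[track_id] = len(self._arrival) + 1
        return {track_id: self._arrival[track_id] for track_id in track_ids}

    def oldest_first(self):
        """All known track IDs, earliest arrival first (FIFO order)."""
        return sorted(self._arrival, key=self._arrival.get)


# Hypothetical per-frame tracker output: the IDs visible in each frame.
frames = [["t7"], ["t7", "t3"], ["t3", "t9", "t7"]]
registry = FifoStackRegistry()
labels_per_frame = [registry.update(ids) for ids in frames]
print(labels_per_frame[-1])
print(registry.oldest_first())
```

Because a stack keeps the number it received on first appearance, its label stays stable across frames as long as the tracker preserves its ID. An identity switch in the tracker would therefore show up as a spurious new stack number.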
Fig 6 Views of Object Detection Bounding Boxes, Contributing to a Comprehensive Understanding of Detection Accuracy.
Roboflow [17] is a computer vision tool used by developers for various deep learning tasks, including annotation, model building, and deployment. Roboflow Annotate is an online tool for labeling images for object detection, classification, and segmentation. For this project, Roboflow was used for annotation and pre-processing. During the pre-processing stage, we applied auto-orientation, auto-resizing, and automatic contrast adjustment to the images. In the augmentation process, we applied:

Flip: Horizontal, Vertical
Rotate: Clockwise, Counter-Clockwise, Upside Down
Rotation: Between -10° and +10°
Shear: ±10° Horizontal, ±10° Vertical
Hue: Between -15° and +15°
Saturation: Between -25% and +25%
Brightness: Between -15% and +15%
Noise: Up to 0.1% of pixels

Ref. [Fig: 7, 8, 9, 10, 11, 12] These figures collectively provide a comprehensive understanding of both the dataset and the model's performance. They cover aspects from data composition and annotation patterns to the model's training progression and final evaluation metrics, offering a holistic view of the entire process from data preparation to model validation and testing.
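To make the augmentation ranges listed above concrete, the sketch below draws one random set of parameters per image. It only illustrates the ranges and is not how Roboflow applies them internally: the dictionary keys, the function name `sample_augmentation`, the 50% flip probability, and the inclusion of a "none" option for the fixed rotations are assumptions.

```python
import random

# Continuous ranges taken from the augmentation list (names are hypothetical).
AUGMENTATION_RANGES = {
    "rotation_deg": (-10.0, 10.0),
    "shear_horizontal_deg": (-10.0, 10.0),
    "shear_vertical_deg": (-10.0, 10.0),
    "hue_deg": (-15.0, 15.0),
    "saturation_pct": (-25.0, 25.0),
    "brightness_pct": (-15.0, 15.0),
    "noise_pct_of_pixels": (0.0, 0.1),
}

# Fixed rotations from the "Rotate" entry; "none" is an added assumption.
FIXED_ROTATIONS = ("none", "clockwise", "counter-clockwise", "upside-down")


def sample_augmentation(rng):
    """Draw one random augmentation setting within the listed ranges."""
    params = {
        name: rng.uniform(low, high)
        for name, (low, high) in AUGMENTATION_RANGES.items()
    }
    params["flip_horizontal"] = rng.random() < 0.5  # assumed probability
    params["flip_vertical"] = rng.random() < 0.5    # assumed probability
    params["fixed_rotation"] = rng.choice(FIXED_ROTATIONS)
    return params


params = sample_augmentation(random.Random(0))  # seeded for repeatability
for name, value in params.items():
    print(name, value)
```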
Fig 7 Dataset's Images, Annotations, and Average Image Sizes, Offering Insight into its Composition and Characteristics
Fig 8 Distribution of Image Sizes within our Dataset, Highlighting Variations in Dimensions Across the Data.
Fig 9 Annotation Heatmap, Providing a Visual Representation of the Density of Object Annotations within Images
Fig 10 Histogram Illustrating how the Number of Objects per Image is Distributed Across the Dataset
Fig 11 Validation Set Results for Average Precision, Compactly Conveying the Model's Performance.
Fig 12 Test Set Results for Average Precision are Depicted, Showcasing the Model's Performance.
Fig 13 Training Graphs for the YOLO Model, Presenting its Learning Progress and Performance.
Fig 14 The Reduction of Object Detection Precision Losses via Improved Labeling of Boxes, Classes, and Objects
MOTA: Evaluates tracking accuracy, considering false positives, false negatives, and identity switches.
MOTP: Measures average error between predicted and actual positions.
IDF1: Balances precision and recall for identity tracking.
IDP & IDR: Assess accuracy of identity assignments.
FP & FN: Count incorrect and missed detections.
IDSW: Tracks changes in object identities.
Track Fragmentation: Measures trajectory interruptions.

Together, these parameters provide a comprehensive basis for comparing DeepSORT, OC-SORT and ByteTrack. For more detailed investigation, resources such as the statistical indicators and evaluation methods on MOTChallenge [18] provide guidelines for evaluating and selecting the best tracking methods for specific projects.

These metrics provide a comprehensive evaluation of a tracking algorithm's performance, balancing accuracy, precision, and consistency.
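The metrics listed above follow the standard multi-object tracking definitions, which can be written out as a short sketch. The function names and the example counts are illustrative only and are not results from this study.

```python
def mota(false_negatives, false_positives, id_switches, num_gt_objects):
    """MOTA = 1 - (FN + FP + IDSW) / total ground-truth objects over all frames."""
    errors = false_negatives + false_positives + id_switches
    return 1.0 - errors / num_gt_objects


def motp(total_match_error, num_matches):
    """MOTP = summed position error of matched pairs / number of matches."""
    return total_match_error / num_matches


def identity_scores(idtp, idfp, idfn):
    """Return (IDP, IDR, IDF1) from identity-level TP, FP and FN counts."""
    idp = idtp / (idtp + idfp)
    idr = idtp / (idtp + idfn)
    idf1 = 2 * idtp / (2 * idtp + idfp + idfn)
    return idp, idr, idf1


# Made-up counts, purely to show the arithmetic.
example_mota = mota(false_negatives=10, false_positives=5, id_switches=5,
                    num_gt_objects=100)
example_motp = motp(total_match_error=12.0, num_matches=80)
example_idp, example_idr, example_idf1 = identity_scores(idtp=80, idfp=20, idfn=20)
print(example_mota, example_motp, example_idp, example_idr, example_idf1)
```

Note that MOTA can become negative when the number of errors exceeds the number of ground-truth objects, and that MOTP as defined here is an error, so lower is better, whereas overlap-based variants of MOTP treat higher values as better.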
Given these metrics in Table 1, DeepSORT performs the best among the three algorithms. It has the highest MOTA, MOTP, IDF1, IDP, Precision, Recall, and MT, indicating better overall tracking performance.

Model Deployment
We deployed our model using a Streamlit application designed for video processing and displaying annotated outputs. The application identifies and labels wood log stacks as they appear in the video stream, numbering each stack based on its appearance order for easy identification and management. Users upload warehouse videos via the Streamlit interface, where the application processes them frame by frame using YOLOv8 for object detection and the DeepSORT algorithm for object tracking. The application provides real-time annotations, displaying each detected and tracked stack with a unique identifier. The annotated video output serves as a visual guide for inventory management, ensuring accurate tracking and retrieval of stacks for shipment. The user-friendly Streamlit interface allows users to upload videos and observe real-time annotations, enhancing operational efficiency and decision-making. This deployment approach optimizes inventory management by leveraging advanced detection and tracking technologies to streamline warehouse operations.

IV. IMPLICATIONS FOR FUTURE RESEARCH

The findings from this study open up several avenues for future research in the domain of wood log inventory optimization leveraging advanced object detection and tracking algorithms:

Algorithm Improvement:
While YOLOv8 and DeepSORT have shown promising outcomes, further refinement of these algorithms could improve their precision and efficiency. Exploring novel models or hybrid approaches that combine the strengths of different algorithms could potentially enhance overall performance.

Real-time Processing:
Implementing and fine-tuning real-time processing for large-scale industrial applications remains a priority. Handling substantial data volumes in real time, without compromising accuracy, is a critical requirement for practical deployment scenarios.

Integration with IoT:
Future investigations may explore the integration of these algorithms with Internet of Things (IoT) infrastructure, aimed at establishing a more interconnected and automated system for inventory management. This could include using sensors and smart devices to provide real-time inputs to detection and tracking algorithms.

Scalability and Adaptability:
Evaluating the scalability of these systems across diverse inventory sizes and types, alongside their adaptability to varying environmental conditions and logistical intricacies, holds significant promise. Such research could yield valuable insights by testing these systems across different warehouse settings and product types.

Energy Efficiency:
Given the computational demands of these algorithms, exploring ways to improve their energy efficiency is increasingly relevant, particularly in resource-constrained operational environments.

Robustness and Error Handling:
Further research could concentrate on strengthening the robustness of these systems, with error-handling mechanisms that effectively manage challenges such as occlusions, fluctuating lighting conditions, and other real-world complexities.

In essence, addressing these research avenues promises to advance the efficacy and applicability of advanced object detection and tracking technologies in optimizing wood log inventory management, thereby driving efficiencies and operational enhancements in relevant industrial sectors.

V. RESULTS AND DISCUSSION

This study investigated the impact of combining YOLOv8 for object detection and DeepSORT for tracking to optimize wood log inventory management in warehouses. We conducted a comparative evaluation against other tracking algorithms, including OC-SORT and ByteTrack, focusing on key metrics such as Multiple Object Tracking Accuracy (MOTA), Multiple Object Tracking Precision (MOTP), and IDF1 scores. Our results consistently show that the YOLOv8-DeepSORT combination outperforms the alternatives, with the high degree of accuracy and reliability required for effective inventory management.

DeepSORT's performance advantage over OC-SORT and ByteTrack highlights its robustness in handling complex tracking scenarios in dynamic warehouse environments. By using YOLOv8 for precise object detection, our integrated approach not only boosts tracking accuracy but also enhances real-time processing capabilities. This is vital for adapting to variable lighting conditions, occlusions, and varied warehouse layouts, as evidenced by the robustness visible in the screenshots from our video outputs (Ref [Fig 16]).

The practical implications of our findings are significant for advancing inventory management systems in industrial settings. Integrating advanced object detection and tracking technology such as YOLOv8 and DeepSORT can streamline warehouse operations, reduce errors related to manual tracking, and optimize resource allocation. The demonstrated effectiveness of our approach underscores its potential to enhance operational efficiency and accuracy in warehouse logistics.
Fig 15 Output Tracking Results using Deep SORT, OC-SORT and ByteTrack with Bounding Boxes on the Object.