ResNet 152
by
Pranay Mandadapu

A Thesis Submitted in
Partial Fulfillment of the
Requirements for the Degree of
Master of Science
in Computer Science
at
The University of Wisconsin-Milwaukee
December 2023
ABSTRACT
by
Pranay Mandadapu
This thesis explores deep learning methods for Human Activity Recognition (HAR) to automate the annotation of human activities in videos. The research is particularly relevant for continuous monitoring in healthcare settings such as nursing homes and hospitals. The innovative part of the approach lies in using YOLO models to first detect humans in video frames and then isolate them from the rest of the image for activity recognition, which leads to an improvement in accuracy. The study employs pre-trained deep residual networks, such as ResNet50, ResNet152V2, and Inception-ResNetV2, which were found to work better than custom CNN-based models. The methodology involved extracting frames at one-minute intervals from 12-hour-long videos of 18 subjects and using this data for training and testing the models for human activity recognition. This thesis contributes to HAR research by demonstrating the effectiveness of combining deep learning with advanced image processing, suggesting new directions for healthcare monitoring applications.
© Copyright by Pranay Mandadapu, 2023
All Rights Reserved
TABLE OF CONTENTS
LIST OF FIGURES ...................................................................................................................... vi
LIST OF TABLES ....................................................................................................................... vii
LIST OF ABBREVIATIONS ..................................................................................................... viii
ACKNOWLEDGEMENTS .......................................................................................................... ix
CHAPTER 1 ................................................................................................................................... 1
1 INTRODUCTION ....................................................................................................................... 1
1.1 Background and Research Challenge ......................................................................... 1
1.2 Significance of Research ............................................................................................. 2
1.3 Objectives and Methodology ....................................................................................... 2
1.4 Hypothesis Testing and Model Development .............................................................. 3
CHAPTER 2 ................................................................................................................................... 4
2 LITERATURE REVIEW ............................................................................................................ 4
CHAPTER 3 ................................................................................................................................... 7
3 METHODOLOGY AND MATERIALS ..................................................................................... 7
3.1 Data Source ....................................................................................................................... 7
3.2 Machine Learning and Deep Learning Techniques .......................................................... 8
3.2.1 Classification .............................................................................................................. 8
3.2.2 Neural Networks ......................................................................................................... 9
3.2.3 Convolutional Neural Networks ............................................................................... 10
3.2.4 Pre-trained Image Processing Models ...................................................................... 10
3.2.4.1 ResNet50 ........................................................................................................... 10
3.2.4.2 ResNet152V2 .................................................................................................... 11
3.2.4.3 Inception-ResNetV2 ......................................................................................... 12
LIST OF FIGURES
Figure 3.1: Collage of different subjects doing different activities ............................................... 8
Figure 3.2: YOLO model object and human detection with probabilities ................................... 15
Figure 3.3: Data distribution of uncropped images among different classes across different subjects ......................................................................................................................................... 21
Figure 3.4: From top to bottom: image frame from the video, human detected with YOLOv8, and cropped human subject .......................................................................................................... 22
Figure 3.6: Architecture overview ............................................................................................... 23
Figure 4.1: Test subject 1031 – sitting position ............................................................................ 31
Figure 4.2: Train subject 1002 – sitting position .......................................................................... 32
Figure 4.3: Test subject 1073 – standing position ........................................................................ 33
Figure 4.4: Train subject 1025 – standing position ...................................................................... 33
LIST OF TABLES
Table 3.1: Pre-trained models' performance on the ImageNet dataset ........................................ 13
Table 3.2: Data distribution of uncropped images among different classes and subjects ........... 20
Table 4.1: Confusion matrix of Inception-ResNetV2 without YOLO image pre-processing ..... 26
Table 4.2: YOLO detection rate from the original dataset ........................................................... 28
Table 4.3: Confusion matrix of Inception-ResNetV2 with YOLO image pre-processing .......... 29
Table 4.4: Subject-wise accuracy without YOLO image pre-processing .................................... 30
Table 4.5: Subject-wise accuracy with YOLO image pre-processing ......................................... 34
LIST OF ABBREVIATIONS
HAR Human Activity Recognition
CNN Convolutional Neural Network
YOLO You Only Look Once
IoHT Internet of Healthcare Things
IoT Internet of Things
ML Machine Learning
ResNet Residual Network
ACKNOWLEDGEMENTS
I extend my heartfelt thanks to my advisor, Prof. Rohit J. Kate, for his invaluable guidance and
support throughout my thesis research. His endless patience, encouragement, and dedication have
shaped my research journey. His mentorship has been instrumental in the completion of my work.
I am also grateful to Prof. Scott Strath and the Department of Kinesiology at the University
of Wisconsin-Milwaukee for their generosity in providing the experimental data for this study.
Thanks to Prof. Jun Zhang and Prof. Scott Strath for their willingness to serve on my thesis
committee.
Lastly, my most profound appreciation goes to my parents. Their constant love,
unwavering support, and encouragement have been the bedrock of my academic pursuits. I am
eternally grateful for their guidance, faith in me, and all the sacrifices they have made on my behalf.
Chapter 1
1 Introduction
1.1 Background and Research Challenge
This thesis explores the use of deep learning models to annotate human activities in videos automatically. The central research motivation is the inefficiency and lack of scalability of manual annotation for video datasets. For instance, in our dataset, human annotators meticulously labeled every second of 12-hour-long videos for each of the 18 subjects. These annotations span diverse activities, including sitting, walking, standing, lying, crouching/kneeling/squatting, and other less frequent postures like stepping and dark/obscured/off-frame (oof) scenarios. This manual process is time-consuming, labor-intensive, and costly, thus highlighting the need for an automated solution.

The motivation for this research is deeply rooted in the desire to enhance the efficiency and accuracy of activity recognition in settings where continuous monitoring is crucial. One of the driving inspirations behind this work is the potential application of automated HAR systems in nursing homes and hospitals [1]. In such environments, continuous monitoring is vital for patient safety and care, yet resource constraints and the impracticality of round-the-clock manual observation often hinder it. By automating the activity recognition process, this research aims to provide a scalable solution that could significantly improve patient monitoring, ensuring timely intervention and care.
1.2 Significance of Research
The novelty of this research is in going beyond the conventional use of Convolutional Neural
Networks (CNNs) in Human Activity Recognition (HAR). While employing CNNs and pre-trained
models like ResNet50 [6] in HAR is not novel, this research introduces a unique
application of these deep-learning techniques. The novelty lies in the integration of advanced
image processing using YOLOv8 [13] to detect and isolate humans in the video frames before
activity recognition.

Specifically, this study employs two separate models: one trained on the
original, unaltered dataset and another trained on a subset in which humans are isolated from their
environment. This bifurcated approach is designed to enhance the accuracy and efficiency
of activity recognition. The choice of model is made dynamically: when a human is detected in a
frame, the model trained on the isolated subjects is used, allowing for a more focused and precise
annotation of human activities; otherwise, the model trained on the unaltered dataset is employed.
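To make the dispatch concrete, the following is a minimal sketch of this two-model routing, not the thesis's actual code. The model filenames, the 224x224 input size, and the 0-1 scaling are illustrative assumptions; the YOLOv8 calls follow the Ultralytics API.

```python
# Sketch of the two-model dispatch described above.
# Assumptions (not from the thesis): model filenames, 224x224 input size,
# and simple 0-1 scaling as preprocessing.
import cv2
import numpy as np
from tensorflow import keras
from ultralytics import YOLO  # YOLOv8

detector = YOLO("yolov8n.pt")                                  # pre-trained YOLOv8 detector
cropped_model = keras.models.load_model("har_cropped.h5")      # trained on YOLO-cropped humans
fullframe_model = keras.models.load_model("har_fullframe.h5")  # trained on unaltered frames

def prep(img):
    """Resize to the classifier's input size and scale to [0, 1]."""
    x = cv2.resize(img, (224, 224)).astype("float32") / 255.0
    return x[np.newaxis]  # add batch dimension

def classify_frame(frame):
    """Route a frame to the classifier matching the detection outcome."""
    boxes = detector(frame)[0].boxes
    persons = [b for b in boxes if int(b.cls) == 0]  # class 0 is 'person' in COCO
    if persons:
        # Human detected: crop it and use the model trained on isolated subjects.
        x1, y1, x2, y2 = map(int, persons[0].xyxy[0])
        return cropped_model.predict(prep(frame[y1:y2, x1:x2]))
    # No human detected: fall back to the model trained on unaltered frames.
    return fullframe_model.predict(prep(frame))
```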
Such an approach has not been extensively explored in existing HAR research.

1.3 Objectives and Methodology
The primary objective of this research is to develop an accurate model capable of automatically
annotating human activities from video frames. This study utilizes a dataset comprising 18
subjects, each captured in extensive 12-hour-long video sessions performing activities of daily living in
a metabolic chamber. The methodology involves initially extracting frames from these videos at
one-minute intervals. These frames are then used for training and evaluating the deep learning
models for human activity recognition.
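The one-frame-per-minute sampling step can be sketched as follows. This is a minimal sketch only; the thesis does not specify its extraction tooling, so the use of OpenCV and the output file naming are assumptions.

```python
# Sketch of sampling one frame per minute from a long video.
# Assumptions (not from the thesis): OpenCV as the extraction tool and
# the output file naming scheme.
import cv2

def extract_frames(video_path, out_dir, interval_s=60):
    """Save one frame every interval_s seconds of video time."""
    cap = cv2.VideoCapture(video_path)
    fps = cap.get(cv2.CAP_PROP_FPS)
    step = max(1, int(round(fps * interval_s)))  # frames between samples
    idx = saved = 0
    while True:
        ok, frame = cap.read()
        if not ok:  # end of video
            break
        if idx % step == 0:
            cv2.imwrite(f"{out_dir}/frame_{saved:05d}.jpg", frame)
            saved += 1
        idx += 1
    cap.release()
    return saved
```

For a 12-hour video, this yields roughly 720 sampled frames per subject, which matches the one-minute interval described above.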