
Video-based action recognition

Balgynbek Dikhan
School of Engineering and Digital Sciences
Data Science
[email protected]

1. Introduction

Human action recognition in videos stands at the confluence of several challenges that underscore the complexity and dynamism of interpreting human movements through computational models. At its core, the task involves accurately identifying and categorizing a wide array of human actions captured in video streams, ranging from simple gestures to complex sequences of movements. This process is fraught with difficulties stemming from variations in human appearance, actions performed under diverse environmental conditions, and the presence of occlusions and rapid motion. Moreover, the temporal dimension of video data introduces additional complexity, requiring the extraction and analysis of features over time to discern patterns indicative of specific actions. The multifaceted nature of this problem demands innovative solutions that can navigate the nuances of spatial and temporal variability, adapt to different contexts, and accurately reflect the intent and nuances of human actions. This intricate tapestry of challenges not only highlights the task's inherent complexity but also its potential as a fertile ground for advancing computational understanding of human behavior in the visual domain. (A minimal code sketch of such a spatio-temporal model is given at the end of Section 2.)

2. Motivation

2.1. Scientific Impact

The scientific exploration within human action recognition extends the boundaries of understanding how computational models can interpret complex human behaviors in varied contexts and environments. This exploration not only challenges existing algorithms in terms of robustness, accuracy, and efficiency but also pushes the envelope in the development of novel computational techniques that can mimic or even surpass human perceptual capabilities. By delving into the intricacies of spatio-temporal feature extraction, deep learning architectures, and multimodal data integration, researchers are uncovering new insights into the fundamentals of visual cognition and machine interpretation, setting the stage for significant breakthroughs in computer vision and artificial intelligence.

2.2. Technological Advancements

Technological advancements play a pivotal role in the progression of human action recognition. The advent of deep learning and the exponential increase in computational power have revolutionized the field, enabling the analysis of complex video data in ways that were previously unattainable. High-resolution cameras, sophisticated sensors, and enhanced data storage capabilities contribute to the creation of comprehensive datasets that fuel the development of more accurate and efficient recognition systems. These technological leaps forward not only improve existing applications in surveillance, healthcare, and entertainment but also open up new possibilities for human-computer interaction, augmented reality, and autonomous systems, making the integration of action recognition technologies into daily life increasingly feasible.

2.3. Societal Impact

The societal impact of human action recognition is profound and multifaceted, addressing critical needs across various sectors. In security, it enhances surveillance systems, enabling more responsive and effective measures to ensure public safety. In healthcare, it offers innovative solutions for patient monitoring, physical therapy, and elder care, promising to improve quality of life and reduce healthcare burdens. Furthermore, in the realm of sports and entertainment, it enriches user experiences through interactive gaming and detailed performance analytics, engaging a wider audience. The potential for these technologies to transform everyday activities, bolster safety and security, and support health and wellbeing underscores the importance of continued research and development in this field.
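To make the preceding discussion concrete, the following is a minimal, illustrative sketch, not drawn from the works discussed in this document, of how 3D convolutions extract features jointly over space and time for clip-level action classification. It is written in PyTorch; the layer sizes, the assumed 16-frame 112x112 clip shape, and the ten action classes are arbitrary placeholders chosen only for illustration.

# Minimal sketch (illustrative only): a tiny 3D-CNN action classifier in PyTorch.
# Assumed input: clips of shape (batch, channels=3, frames=16, height=112, width=112);
# `num_classes` is a placeholder for the number of action categories in a given dataset.
import torch
import torch.nn as nn

class Tiny3DConvNet(nn.Module):
    """Toy spatio-temporal model: 3D convolutions filter over space and time jointly."""
    def __init__(self, num_classes: int = 10):
        super().__init__()
        self.features = nn.Sequential(
            nn.Conv3d(3, 32, kernel_size=3, padding=1),  # spatio-temporal filters
            nn.ReLU(inplace=True),
            nn.MaxPool3d(kernel_size=(1, 2, 2)),         # downsample space, keep all frames
            nn.Conv3d(32, 64, kernel_size=3, padding=1),
            nn.ReLU(inplace=True),
            nn.AdaptiveAvgPool3d(1),                     # collapse time and space to one value per channel
        )
        self.classifier = nn.Linear(64, num_classes)

    def forward(self, clip: torch.Tensor) -> torch.Tensor:
        x = self.features(clip)    # (B, 64, 1, 1, 1)
        x = x.flatten(1)           # (B, 64)
        return self.classifier(x)  # unnormalized scores, one per action class

if __name__ == "__main__":
    model = Tiny3DConvNet(num_classes=10)
    dummy_clip = torch.randn(2, 3, 16, 112, 112)  # two random 16-frame RGB clips
    print(model(dummy_clip).shape)                # torch.Size([2, 10])

In practice such a toy network would be replaced by the far deeper architectures of the kind surveyed in Section 3, for example two-stream or recurrent convolutional models trained on large labelled video datasets; the sketch only illustrates the spatio-temporal principle.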

3. Overview of Main Sources

The landscape of human action recognition research is richly documented across a variety of prestigious journals and conference proceedings, constituting a diverse repository of knowledge and advancements in the field. Prominent among these sources are the International Journal of Computer Vision and IEEE Transactions on Pattern Analysis and Machine Intelligence, which are esteemed for their rigorous peer-reviewed articles that span the depth and breadth of computer vision and machine learning techniques. These journals have been pivotal in disseminating cutting-edge research, including foundational theories and innovative methodologies that underpin current action recognition technologies.

Conference proceedings, notably from the IEEE Conference on Computer Vision and Pattern Recognition (CVPR) and the International Conference on Computer Vision (ICCV), serve as critical platforms for the exchange of ideas and the unveiling of breakthrough technologies in real time. These conferences bring together the global community of researchers, practitioners, and thought leaders, facilitating a vibrant discourse on the latest developments, practical challenges, and future directions in action recognition. Papers presented at these venues often set new benchmarks, propose novel architectures such as convolutional neural networks for video analysis, and introduce comprehensive datasets that become standard resources for the research community.

The contributions within these sources are not only technical in nature but also encompass a broad range of application areas, from surveillance and security to healthcare and entertainment. This breadth underscores the interdisciplinary appeal of action recognition, drawing insights from computer science, psychology, and neuroscience to enhance algorithmic interpretations of human motion. The evolution of research in this field, as chronicled by these main sources, reflects a trajectory of increasing sophistication and applicability, highlighting both the challenges overcome and the vast potential for future explorations.

Moreover, specialized workshops and symposia that accompany major conferences, as well as focused issues in journals, underline the dynamic sub-domains within action recognition, such as the analysis of complex actions in sports videos, the role of sensor fusion in enhancing recognition accuracy, and the exploration of deep learning models for spatial-temporal feature extraction. These venues offer nuanced insights into specific challenges and innovations, enriching the broader discourse with targeted investigations and specialized knowledge.

In summary, the overview of main sources in the human action recognition domain reveals a vibrant and evolving field. The depth and diversity of these sources reflect the ongoing dialogue within the research community, the continuous refinement of methodologies, and the persistent quest to bridge the gap between human cognitive capabilities and computational models. Through this extensive body of work, the field progresses towards more nuanced, accurate, and versatile action recognition systems, capable of understanding and interpreting human behavior in its myriad forms.
