Detection of Human Motion: Adopting Machine and Deep Learning
Abstract
Human motion recognition has long challenged researchers because of its fundamental
difficulties. Surveillance systems range from simple motion detection to the understanding of
complex behavior in motion, which has driven major developments in techniques for human
motion representation and recognition. This paper discusses the applications and the general
framework of human motion detection, along with the details of each of its components. It
emphasizes human motion representation and recognition methods together with their merits
and demerits, surveys the popular datasets, and concludes with the open difficulties in the
domain and a direction for future work. The domain has been active for more than two decades.
First, we present a method for human action spotting and classification based on multi-scale
and multi-modal deep learning. Our method does not rely on labels for the real data, and no
explicit transfer function is defined or learned between synthetic and real data.
In this project, the data are captured by inertial sensors (such as accelerometers and gyroscopes)
built into mobile devices. Having explored existing temporal models (RNN, LSTM, clockwork
RNN), we show how the convolutional clockwork RNN can be extended in a way that makes
the learned features shift-invariant, and we propose a more efficient training strategy for this
architecture. Finally, we incorporate the learned deep features into a probabilistic biometric
framework for real-time user authentication.
Introduction
Effective techniques for human detection are of special interest in computer vision since many
applications involve people's locations and movements. Thus, significant research has been
devoted to detecting, locating and tracking people in images and videos. Over the last few years,
the problem of detecting humans in single images has received considerable interest. Variations
in illumination, shadows, and pose, as well as frequent inter- and intra-person occlusion render
this a challenging task. Figure 1 shows an image of a particularly challenging scene with a large
number of persons, overlaid with the results of our system.
Two main approaches to human detection have been explored over the last few years. The first
class of methods consists of a generative process where detected parts of the human body are
combined according to a prior human model. The second class relies on purely statistical
analyses that combine a set of low-level features within a detection window to classify the
window as containing a human or not. The method presented in this paper belongs to the
latter category.
Automated visual surveillance systems that observe designated areas have recently become an
important research topic in computer vision. Conventional surveillance systems are already
installed in many areas, ranging from traffic surveillance to security-relevant scenarios.
However, these systems have limitations that make them unsuitable in many situations. On the
one hand, the systems archive huge volumes of video for eventual offline human inspection. On
the other hand, for the system to be effective, security areas must be monitored by human
operators located in a control room containing a bank of screens streaming live video from each
camera. CVL’s contribution to visual surveillance is in the area of image sequence analysis,
focusing on the topics of motion detection, object tracking and scene analysis:
Motion Detection
Object Tracking
Scene Analysis
Motion Detection
Motion detection algorithms form the basis of a wide range of applications in computer vision,
such as visual surveillance, object recognition and tracking, and the compression of video
streams. The most common approach to motion detection in surveillance systems with static
cameras is the so-called background subtraction algorithm. In these algorithms, a (moving)
foreground object is detected by comparing the current image with the static background of the
scene. Acquiring this background image is the main challenge of background subtraction, since
the background might not be static but has to adapt to several kinds of change, such as the
following (a minimal code sketch appears below):
1. Illumination changes
   sudden changes (e.g., clouds, a light switch)
   gradual changes (e.g., the position of the sun changing during the day)
2. Background motion
   e.g., waving trees, water waves
3. Changes in the background geometry
   e.g., parked cars, moved items
Fig. 1
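As an illustration only (this sketch is not part of the original system), background subtraction
can be prototyped with OpenCV's MOG2 model, which maintains an adaptive per-pixel mixture
of Gaussians and thus absorbs the gradual changes listed above; the file name 'video.avi' is a
hypothetical placeholder:

import cv2

# Adaptive background model: each pixel is modeled as a mixture of
# Gaussians, so gradual illumination changes and small background
# motion are slowly absorbed into the background estimate.
subtractor = cv2.createBackgroundSubtractorMOG2(history=500,
                                                varThreshold=16,
                                                detectShadows=True)

cap = cv2.VideoCapture('video.avi')  # hypothetical input file
while True:
    ok, frame = cap.read()
    if not ok:
        break
    # Pixels that deviate from the background model become foreground.
    mask = subtractor.apply(frame)
    # Drop shadow pixels (marked as gray, value 127, by MOG2).
    _, mask = cv2.threshold(mask, 200, 255, cv2.THRESH_BINARY)
    cv2.imshow('foreground', mask)
    if cv2.waitKey(30) & 0xFF == 27:  # Esc key quits
        break
cap.release()
cv2.destroyAllWindows()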
Object Tracking
Object tracking can be described as a correspondence problem: finding which object in a video
frame relates to which object in the next frame (a simple illustrative sketch follows the list
below). Tracking methods can be classified into four major categories:
Model-based tracking
Active contour-based tracking
Feature-based tracking
Region-based tracking
Fig. 2
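To make the correspondence view of tracking concrete, below is a minimal illustrative sketch
(not from the original system) of a greedy matcher that links bounding boxes across consecutive
frames by intersection-over-union (IoU), a simple instance of region-based tracking; all
function names are hypothetical:

def iou(a, b):
    # Boxes are (x1, y1, x2, y2); returns intersection-over-union.
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0, ix2 - ix1) * max(0, iy2 - iy1)
    if inter == 0:
        return 0.0
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    return inter / float(area_a + area_b - inter)

def match_frames(prev_boxes, curr_boxes, threshold=0.3):
    # Greedy correspondence: each box from the previous frame is
    # matched to the unused current box with the highest IoU above
    # the threshold; unmatched current boxes start new tracks.
    matches, used = {}, set()
    for i, p in enumerate(prev_boxes):
        best_j, best_iou = None, threshold
        for j, c in enumerate(curr_boxes):
            if j not in used and iou(p, c) > best_iou:
                best_j, best_iou = j, iou(p, c)
        if best_j is not None:
            matches[i] = best_j
            used.add(best_j)
    return matches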
Scene Analysis
The aim of this type of algorithm is to recognize activities in a scene. Our recognition
algorithms are mainly based on statistical analysis of the scene. Rule-based approaches are
applied to identify, for example, abnormal behavior, and the system then reports the behavior of
the person (an illustrative rule is sketched below).
Fig. 3
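As one hedged illustration of such a rule (the rule and all names below are hypothetical, not
taken from the original system), a scene analysis module might flag loitering when a tracked
person remains inside a restricted zone for too many consecutive frames:

def loitering_rule(track, zone, max_frames=150):
    # track: list of (x, y) centroids, one per frame.
    # zone: restricted area given as (x1, y1, x2, y2).
    inside = [zone[0] <= x <= zone[2] and zone[1] <= y <= zone[3]
              for x, y in track]
    # Find the longest consecutive run of frames spent inside the zone.
    run = best = 0
    for flag in inside:
        run = run + 1 if flag else 0
        best = max(best, run)
    return best > max_frames  # True -> report abnormal behavior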
Related work
Human detection is closely related to general object recognition techniques. It involves two
steps, feature extraction and classifier training, as shown in the figure below.
Fig. 4: Components of a human detection system.
The image features that are extracted should be the most relevant ones for object detection or
classification, while providing invariance to changes in illumination, changes in viewpoint and
shifts in object contours. Such features can be based on points [1] and [2], blobs
(Laplacian of Gaussian [3] or Difference of Gaussian [4]), intensities [5], gradients [6] and [7],
color, texture, or combinations of several or all of these [8]. The final descriptors need to
characterize the image sufficiently well for the detection and classification task at hand. We will
divide the various approaches to descriptor selection into two broad categories:
Sparse representations are based on local descriptors of relevant local image regions. The regions
can be selected using either key point detectors, image fragments or parts detectors. On the other
hand, dense representations are based on image intensities, gradients or higher order differential
operators. Image features are often extracted densely (often pixel-wise) over an entire image or
detection window and collected into a high-dimensional descriptor vector that can be used for
discriminative image classification or labeling the window as object or non-object.
Edge Detection Techniques
Sobel Operator
The operator consists of a pair of 3×3 convolution kernels, as shown in Table 1. One kernel is
simply the other rotated by 90°.
Table 1: Masks used by the Sobel operator

    Gx:              Gy:
   -1   0  +1       +1  +2  +1
   -2   0  +2        0   0   0
   -1   0  +1       -1  -2  -1
These kernels are designed to respond maximally to edges running vertically and horizontally
relative to the pixel grid, one kernel for each of the two perpendicular orientations. The kernels
can be applied separately to the input image, to produce separate measurements of the gradient
component in each orientation (call these Gx and Gy). These can then be combined together to
find the absolute magnitude of the gradient at each point and the orientation of that gradient. The
gradient magnitude is given by:

|G| = √(Gx² + Gy²)

Typically, an approximate magnitude is computed using:

|G| ≈ |Gx| + |Gy|

which is much faster to compute. The angle of orientation of the edge (relative to the pixel grid)
giving rise to the spatial gradient is given by:

θ = arctan(Gy / Gx)
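As an illustrative sketch (assuming NumPy and SciPy are available; this code is not from the
original text), the Sobel response can be computed by convolving the kernels in Table 1 with
the image:

import numpy as np
from scipy import ndimage

# The standard 3x3 Sobel kernels from Table 1.
KX = np.array([[-1, 0, 1], [-2, 0, 2], [-1, 0, 1]], dtype=float)
KY = np.array([[1, 2, 1], [0, 0, 0], [-1, -2, -1]], dtype=float)

def sobel_edges(gray):
    # gray: 2-D float array of pixel intensities.
    gx = ndimage.convolve(gray, KX)
    gy = ndimage.convolve(gray, KY)
    magnitude = np.hypot(gx, gy)       # |G| = sqrt(Gx^2 + Gy^2)
    approx = np.abs(gx) + np.abs(gy)   # the faster approximation
    orientation = np.arctan2(gy, gx)   # angle of the spatial gradient
    return magnitude, approx, orientation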
Roberts cross operator:
The Roberts cross operator performs a simple, quick-to-compute, 2-D spatial gradient
measurement on an image. Pixel values at each point in the output represent the estimated
absolute magnitude of the spatial gradient of the input image at that point. The operator consists
of a pair of 2×2 convolution kernels, as shown below. One kernel is simply the other rotated by
90°. In this respect it is very similar to the Sobel operator.
Table 2: Masks used by the Roberts operator

    Gx:          Gy:
   +1   0        0  +1
    0  -1       -1   0
These kernels are designed to respond maximally to edges running at 45° to the pixel grid, one
kernel for each of the two perpendicular orientations. The kernels can be applied separately to
the input image, to produce separate measurements of the gradient component in each orientation
(call these Gx and Gy). These can then be combined together to find the absolute magnitude of
the gradient at each point and the orientation of that gradient. The gradient magnitude is given
by:

|G| = √(Gx² + Gy²)

although typically, an approximate magnitude is computed using:

|G| ≈ |Gx| + |Gy|

which is much faster to compute.
The angle of orientation of the edge giving rise to the spatial gradient (relative to the pixel
grid) is given by:

θ = arctan(Gy / Gx) − 3π/4
Prewitt’s operator:
The Prewitt operator is similar to the Sobel operator and is used for detecting vertical and
horizontal edges in images.
Fig: Masks for the Prewitt gradient edge detector

    Gx:              Gy:
   -1   0  +1       +1  +1  +1
   -1   0  +1        0   0   0
   -1   0  +1       -1  -1  -1
Laplacian of Gaussian:
The Laplacian is a 2-D isotropic measure of the 2nd spatial derivative of an image. The
Laplacian of an image highlights regions of rapid intensity change and is therefore often used
for edge detection. The Laplacian is often applied to an image that has first been smoothed with
something approximating a Gaussian Smoothing filter in order to reduce its sensitivity to noise.
The operator normally takes a single gray level image as input and produces another gray level
image as output.
The Laplacian L(x,y) of an image with pixel intensity values I(x,y) is given by:

L(x,y) = ∂²I/∂x² + ∂²I/∂y²
Since the input image is represented as a set of discrete pixels, we have to find a discrete
convolution kernel that can approximate the second derivatives in the definition of the Laplacian.
Three commonly used small kernels are shown below:

    0   1   0        1   1   1        0  -1   0
    1  -4   1        1  -8   1       -1   4  -1
    0   1   0        1   1   1        0  -1   0
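As a minimal sketch (assuming SciPy; not part of the original text), the combined
smoothing-and-Laplacian operator is available directly as gaussian_laplace, and a crude edge
map can be obtained by thresholding the magnitude of its response:

import numpy as np
from scipy import ndimage

def log_edges(gray, sigma=2.0, rel_thresh=0.05):
    # Smooth with a Gaussian of the given sigma and apply the
    # Laplacian in a single pass, reducing sensitivity to noise.
    response = ndimage.gaussian_laplace(gray, sigma=sigma)
    # Strong responses of either sign mark rapid intensity change;
    # rel_thresh is a hypothetical tuning parameter.
    return np.abs(response) > rel_thresh * np.abs(response).max()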
Proposed Method
Previous studies have shown that significant improvement in human detection can be achieved
using different types (or combinations) of low-level features. A strong set of features provides
high discriminatory power, reducing the need for complex classification methods.
Humans in standing positions have distinguishing characteristics. First, strong vertical edges are
present along the boundaries of the body. Second, clothing is generally uniform. Clothing
textures are different from natural textures observed outside of the body due to constraints on the
manufacturing of printed cloth. Third, the ground is composed mostly of uniform textures.
Finally, discriminatory color information is found in the face/head regions.
Thus, edges, colors and textures capture important cues for discriminating humans from the
background. To capture these cues, the low-level features we employ are the original HOG
descriptors with additional color information, called color frequency, and texture features
computed from co-occurrence matrices.
To handle the high dimensionality resulting from the combination of features, PLS is employed
as a dimensionality reduction technique. PLS is a powerful technique that provides
dimensionality reduction for even hundreds of thousands of variables, accounting for class labels
in the process. The latter point is in contrast to traditional dimensionality reduction techniques
such as Principal Component Analysis (PCA).
The steps performed in our detection method are the following. For each detection window in the
image, features extracted using original HOG, color frequency, and co-occurrence matrices are
concatenated and analyzed by the PLS model to reduce dimensionality, resulting in a low
dimensional vector. Then, a simple and efficient classifier is used to classify this vector as either
a human or non-human. These steps are explained in the following subsections.
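The pipeline can be sketched as follows. This is a minimal illustration, not the authors'
implementation: scikit-image's HOG stands in for the full HOG + color frequency +
co-occurrence feature set, PLSRegression on ±1 labels performs the label-aware projection, and
quadratic discriminant analysis plays the role of the simple, efficient classifier; detection
windows are assumed to have a fixed size so all descriptors have equal length.

import numpy as np
from skimage.feature import hog
from sklearn.cross_decomposition import PLSRegression
from sklearn.discriminant_analysis import QuadraticDiscriminantAnalysis

def describe(window):
    # High-dimensional descriptor for one fixed-size detection window;
    # HOG alone stands in for the full combined feature set.
    return hog(window, pixels_per_cell=(8, 8), cells_per_block=(2, 2))

def train_detector(windows, labels, n_factors=20):
    # labels: +1 for human, -1 for non-human.
    X = np.array([describe(w) for w in windows])
    # PLS projects to a few latent factors while accounting for the
    # class labels, unlike PCA.
    pls = PLSRegression(n_components=n_factors)
    pls.fit(X, np.asarray(labels, dtype=float))
    clf = QuadraticDiscriminantAnalysis().fit(pls.transform(X), labels)
    return pls, clf

def classify_window(pls, clf, window):
    z = pls.transform(describe(window).reshape(1, -1))
    return clf.predict(z)[0]  # +1 = human, -1 = non-human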
The flow diagram below shows the basic architecture of the proposed human detection system.
In this proposed system, images are captured using a digital camera and passed through the
human detection module. In this module, input RGB images are converted into gray-scale
images; the normalized boundary is then compared with predefined templates, and if a sufficient
match is found, the human is bounded by a rectangular box. After detecting a human in the
real-time image, the system can take several actions, such as signalling the presence of the
human by sounding an alarm or displaying a light signal (a sketch of the template-comparison
step follows).
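A hedged sketch of the template-comparison step described above, assuming OpenCV, with
hypothetical file names ('scene.jpg', 'template.jpg') and an assumed match threshold:

import cv2

scene = cv2.imread('scene.jpg')                              # camera capture
template = cv2.imread('template.jpg', cv2.IMREAD_GRAYSCALE)  # predefined template

# Convert the RGB capture to gray-scale, as in the detection module.
gray = cv2.cvtColor(scene, cv2.COLOR_BGR2GRAY)

# Slide the template over the image and score each position with
# normalized cross-correlation.
result = cv2.matchTemplate(gray, template, cv2.TM_CCOEFF_NORMED)
_, max_val, _, max_loc = cv2.minMaxLoc(result)

if max_val > 0.7:  # "enough match is found" threshold (assumed value)
    h, w = template.shape
    top_left = max_loc
    bottom_right = (top_left[0] + w, top_left[1] + h)
    # Bound the detected human with a rectangular box.
    cv2.rectangle(scene, top_left, bottom_right, (0, 255, 0), 2)
    print('Human detected: trigger alarm or light signal')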
A further topic of current interest is human detection using closest- and shortest-path
algorithms that bind two or more plots (detections) together.
Code (gray-scale conversion step of the detection module):

from PIL import Image

def black_and_white(input_image_path, output_image_path):
    # Open the color image and convert it to 8-bit gray-scale ('L' mode).
    color_image = Image.open(input_image_path)
    bw = color_image.convert('L')
    bw.save(output_image_path)

if __name__ == '__main__':
    black_and_white('test.jpg', 'bw_test.jpg')