
Human Activity Recognition using Signal Feature Extraction and Machine Learning: A Code Explanation

This Python code implements human activity recognition using signal processing techniques and
machine learning classifiers. It processes data from a CSV file, extracts time and frequency
domain features, and trains various classifiers to predict human activities. This explanation is
structured to help students understand the code and the underlying theory.

1. Libraries and Data Loading:


import pandas as pd
import numpy as np
from scipy.signal import find_peaks
from scipy.fft import fft
from sklearn.model_selection import KFold
from sklearn.preprocessing import StandardScaler
from sklearn.metrics import accuracy_score
# ... (other sklearn imports)

The code starts by importing the necessary libraries: pandas for data manipulation, numpy for
numerical operations, scipy.signal for signal processing, scipy.fft for Fourier transforms, and
various modules from sklearn for machine learning tasks. Google Drive mounting code is also
included (commented out); it is useful if your data is stored in Drive.
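If you are running the script in Google Colab, the mounting code typically looks like the sketch
below. The path shown is hypothetical; point file_path at your own CSV.

# Typical (commented-out) Google Drive mount for Colab
# from google.colab import drive
# drive.mount('/content/drive')
file_path = '/content/drive/MyDrive/activity_data.csv'  # hypothetical path to the CSV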
try:
    df = pd.read_csv(file_path)
    print("File loaded successfully!")
except FileNotFoundError:
    print(f"Error: file not found at {file_path}")  # ... (other error handling)

This block reads the data from a CSV file into a pandas DataFrame. Robust error handling is
included to catch potential issues like the file not being found.

2. Feature Extraction:
The core of the code lies in the feature extraction functions. Features are calculated from the raw
sensor data to represent the underlying activity.
2.1 Time Domain Features:
def calculate_time_features(signal):
    abs_mean = np.mean(np.abs(signal))            # mean absolute value, reused below
    rms = np.sqrt(np.mean(signal**2))             # Root Mean Square
    shape_factor = rms / abs_mean if abs_mean != 0 else 0
    peak_value = np.max(np.abs(signal))
    crest_factor = peak_value / rms if rms != 0 else 0
    sqrt_mean = np.mean(np.sqrt(np.abs(signal)))  # mean of the square roots
    clearance_factor = peak_value / sqrt_mean**2 if sqrt_mean != 0 else 0
    impulse_factor = peak_value / abs_mean if abs_mean != 0 else 0
    return (rms, shape_factor, peak_value, crest_factor,
            clearance_factor, impulse_factor)

This function calculates several time-domain features:


• RMS (Root Mean Square): Measures the effective magnitude of the signal. It's a good
indicator of the signal's energy.
• Shape Factor: The ratio of the RMS value to the mean absolute value. It characterizes the
shape of the waveform independently of its overall amplitude.
• Peak Value: The maximum absolute value of the signal.
• Crest Factor: The ratio of the peak value to the RMS value. It's sensitive to peaks and
outliers in the signal.
• Clearance Factor: Similar to the crest factor, but divides the peak value by the squared
mean of the square roots of the absolute signal values.
• Impulse Factor: Ratio of the peak value to the mean absolute value.
These features capture characteristics of the signal in the time domain.
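As a quick sanity check (illustrative only, not part of the original script), the function can be
applied to a synthetic sine wave, whose RMS is known analytically:

# Synthetic check: a unit-amplitude sine has RMS ~ 0.707 and crest factor ~ 1.414
t = np.linspace(0, 1, 100)
test_signal = np.sin(2 * np.pi * 5 * t)
rms, shape_f, peak, crest_f, clearance_f, impulse_f = calculate_time_features(test_signal)
print(f"RMS: {rms:.3f}, Crest Factor: {crest_f:.3f}")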
2.2 Frequency Domain Features:

def calculate_frequency_features(signal):
    N = len(signal)
    yf = fft(signal)                   # Fast Fourier Transform
    xf = np.linspace(0.0, fs/2, N//2)  # Frequency axis (fs: sampling frequency, defined elsewhere)

    # Locate peaks in the one-sided magnitude spectrum
    peaks, _ = find_peaks(np.abs(yf[0:N//2]), height=0,
                          distance=max(1, int(0.25*N/fs)))  # find_peaks requires distance >= 1
    peak_amplitude = np.abs(yf[peaks[0]]) if peaks.size > 0 else 0
    peak_location = xf[peaks[0]] if peaks.size > 0 else 0

    # Magnitude-weighted mean frequency
    mean_frequency = (np.sum(xf[0:N//2] * np.abs(yf[0:N//2])) / np.sum(np.abs(yf[0:N//2]))
                      if np.sum(np.abs(yf[0:N//2])) != 0 else 0)

    # Band power in the 0.5-4 Hz band (sum of squared magnitudes)
    f_low = 0.5
    f_high = 4
    band_indices = np.where((xf >= f_low) & (xf <= f_high))[0]
    band_power = np.sum(np.abs(yf[band_indices])**2)

    power_bandwidth = (xf[band_indices[-1]] - xf[band_indices[0]]
                       if band_indices.size > 1 else 0)

    return peak_amplitude, peak_location, mean_frequency, band_power, power_bandwidth

This function calculates frequency-domain features using the Fast Fourier Transform (FFT):
• FFT (Fast Fourier Transform): The FFT decomposes the signal into its constituent
frequencies. yf contains the frequency components, and xf represents the corresponding
frequencies.
• Peak Amplitude and Location: The code finds peaks in the one-sided magnitude spectrum
using find_peaks (with a minimum height and a minimum spacing between peaks) and extracts
the amplitude and frequency of the first detected peak.
• Mean Frequency: The average frequency weighted by the magnitude of the frequency
components.
• Band Power: The power within a specific frequency band (e.g., 0.5 Hz to 4 Hz). This can
be a good indicator of the energy in that particular frequency range.
• Power Bandwidth: The width of the frequency band used for the band power calculation.
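As a quick illustrative check of the frequency mapping (not part of the original script; a
sampling frequency of fs = 50 Hz is assumed here):

# Synthetic check: the strongest FFT component of a 2 Hz sine should sit near 2 Hz
fs = 50                               # sampling frequency the feature function relies on
t = np.arange(0, 2, 1/fs)             # 2 seconds of samples, N = 100
sig = np.sin(2 * np.pi * 2 * t)       # pure 2 Hz sine
N = len(sig)
spectrum = np.abs(fft(sig))[:N//2]
freqs = np.linspace(0.0, fs/2, N//2)  # same axis construction as the function above
print(f"Strongest component at {freqs[np.argmax(spectrum)]:.2f} Hz")  # ~2 Hz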

2.3 Mean Feature:


The code also calculates the mean of the signal as a simple but often informative feature:

mean_signal = np.mean(signal)

3. Data Preparation:
features = []
labels = []
for index, row in df.iterrows():
    signal = row.iloc[1:45].values  # input sample values for this row
    label = row.iloc[-1]            # activity label in the last column
    time_features = calculate_time_features(signal)
    frequency_features = calculate_frequency_features(signal)
    mean_signal = np.mean(signal)
    features.append(np.concatenate([time_features, frequency_features,
                                    [mean_signal]]))
    labels.append(label)

X = np.array(features)
y = np.array(labels)

This loop iterates through each row of the DataFrame, extracts the signal data and label,
calculates the time and frequency features, and combines them into a single feature vector using
np.concatenate. The features are stored in X, and the corresponding labels are stored in y.

4. Classifiers and K-Fold Cross-Validation:

classifiers = {  # Dictionary of classifiers
    # ... (classifier instantiations)
}

k = 5  # Number of folds
kf = KFold(n_splits=k, shuffle=True, random_state=42)  # KFold object

for name, clf in classifiers.items():
    accuracies = []
    for train_index, test_index in kf.split(X):
        X_train, X_test = X[train_index], X[test_index]
        y_train, y_test = y[train_index], y[test_index]
        scaler = StandardScaler()           # fit on the training folds only
        X_train = scaler.fit_transform(X_train)
        X_test = scaler.transform(X_test)   # apply the same scaling to the test fold
        clf.fit(X_train, y_train)
        y_pred = clf.predict(X_test)
        accuracy = accuracy_score(y_test, y_pred)
        accuracies.append(accuracy)
    print(f"{name} - Mean Accuracy: {np.mean(accuracies):.4f} "
          f"(+/- {np.std(accuracies):.4f})")

This section trains and evaluates several classifiers using k-fold cross-validation:
• Classifiers: A dictionary stores the different classifiers to be used.
• K-Fold: KFold splits the data into k folds. The code iterates through each fold, using one
fold for testing and the remaining k-1 folds for training. shuffle=True shuffles the data
before splitting, and random_state ensures reproducibility.
• Feature Scaling: StandardScaler scales the features to have zero mean and unit variance.
This is crucial for many machine learning algorithms. It's done inside the cross-validation
loop to prevent data leakage.
• Training and Evaluation: The code trains each classifier on the training data and evaluates
its performance on the test data using accuracy as the metric. The mean and standard
deviation of the accuracy across all folds are printed.

Key Concepts and Theory:


• Feature Engineering: The process of extracting relevant features from raw data is crucial
for machine learning. The time and frequency domain features calculated in this code are
designed to capture different aspects of the signal related to human activity.
• Fast Fourier Transform (FFT): The FFT is an efficient algorithm for computing the
discrete Fourier transform (DFT). The DFT decomposes a signal into its constituent
frequencies, allowing for analysis in the frequency domain (see the formula after this list).
• K-Fold Cross-Validation: A robust technique for evaluating machine learning models. It
helps to estimate the model's performance on unseen data and reduces the risk of
overfitting.
• Feature Scaling: Many machine learning algorithms perform better when the features are
scaled to a similar range. This prevents features with larger values from dominating the
learning process.
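For reference, the DFT computed by fft maps the N signal samples x_n to N complex
coefficients X_k:

$$X_k = \sum_{n=0}^{N-1} x_n \, e^{-2\pi i k n / N}, \qquad k = 0, 1, \dots, N-1.$$

For a real-valued signal the coefficients are conjugate-symmetric, which is why the code above
inspects only the first N//2 bins of yf.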
Tasks for students
1. Feature Engineering Exploration:
Task: Investigate the importance of different features.
Exercises:
• Feature Ablation: Systematically remove each feature (or group of features – time,
frequency, mean) and observe the impact on classifier performance. This helps
understand which features are most discriminative.
• Feature Visualization: Plot the different features for each activity class (e.g., box plots,
histograms). This can help visualize which features are most separable and understand
their distributions.
• Feature Selection: Implement feature selection techniques (e.g., using SelectKBest or
SelectFromModel from sklearn.feature_selection) to find the optimal subset of features.
This can improve performance and reduce computational cost.
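As a starting point, a feature-selection sketch might look like the following (the value of k is
an arbitrary example; X and y come from Section 3):

# Hypothetical starter sketch for the feature-selection exercise
from sklearn.feature_selection import SelectKBest, f_classif

selector = SelectKBest(score_func=f_classif, k=6)  # k=6 chosen only for illustration
X_selected = selector.fit_transform(X, y)
print("Selected feature indices:", selector.get_support(indices=True))

In a complete experiment, fit the selector inside the cross-validation loop, just like the
scaler, so the test fold does not leak into the feature selection.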

2. Classifier Comparison and Tuning:


Task: Compare and tune different classifiers.
Exercises:
• More Classifiers: Experiment with other classifiers available in scikit-learn, such as
Support Vector Machines (SVMs), Gradient Boosting Machines (GBM), or Naive Bayes.
• Hyperparameter Tuning: Use techniques like GridSearchCV or RandomizedSearchCV
from sklearn.model_selection to find the optimal hyperparameters for each classifier.
This significantly impacts performance. Focus on understanding what each
hyperparameter controls and how it affects the model.
• Performance Metrics: Explore other relevant performance metrics beyond accuracy, such
as precision, recall, F1-score, and confusion matrices. Understand the trade-offs between
these metrics and when each is most appropriate. Use classification_report from
sklearn.metrics.
• Cross-Validation Strategies: Experiment with different cross-validation strategies, such as
StratifiedKFold (for imbalanced datasets) or LeaveOneOut cross-validation.
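One possible starting point for the tuning exercise is sketched below (Random Forest and the
grid values are illustrative only; substitute any classifier from the dictionary):

# Hypothetical starter sketch for hyperparameter tuning
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import GridSearchCV

param_grid = {
    "n_estimators": [50, 100, 200],  # illustrative grid values
    "max_depth": [None, 5, 10],
}
grid = GridSearchCV(RandomForestClassifier(random_state=42),
                    param_grid, cv=5, scoring="accuracy")
grid.fit(X, y)
print("Best parameters:", grid.best_params_)
print(f"Best CV accuracy: {grid.best_score_:.4f}")

Note that GridSearchCV performs its own internal cross-validation, so it replaces the manual
KFold loop for the model being tuned.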

3. Signal Processing Deep Dive:


Task: Explore signal processing techniques in more detail.
Exercises:
• Windowing: Experiment with different windowing functions (e.g., Hamming, Hanning,
Blackman) before calculating the FFT. Understand the effects of windowing on the
frequency spectrum.
• FFT Length: Investigate the impact of different FFT lengths on the frequency resolution.
A longer FFT length provides finer frequency resolution but requires more computation.
• Band Power Refinement: Experiment with different frequency bands for band power
calculation. Research which frequency bands are most relevant for distinguishing
between the activities.
• Filtering: Try applying digital filters (e.g., bandpass, lowpass, highpass) to the raw signal
before feature extraction. This can help remove noise or isolate specific frequency
components.
• Signal Segmentation: Explore different methods for segmenting the signal into windows
for feature extraction. Consider overlapping windows or adaptive windowing techniques.
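For instance, the windowing and filtering exercises could begin with sketches like these
(fs is the sampling frequency assumed earlier; signal is one segment from the data-preparation
loop):

# Hypothetical starter sketch: Hamming window before the FFT
from scipy.signal.windows import hamming
windowed = signal * hamming(len(signal))  # taper the segment to reduce spectral leakage
spectrum = np.abs(fft(windowed))[:len(signal)//2]

# Hypothetical starter sketch: 0.5-4 Hz Butterworth bandpass filter
from scipy.signal import butter, filtfilt
b, a = butter(4, [0.5, 4], btype="bandpass", fs=fs)  # 4th-order bandpass
filtered = filtfilt(b, a, signal)                    # zero-phase filtering

Comparing features extracted from the raw, windowed, and filtered versions of the same segment
is a quick way to see how each preprocessing step changes the spectrum.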
