IOT PROJECT REPORT

By Group 9

ABHINAV CHERUVU MANIKANTHA SAI - 2021B1A33128H

SHAHEEN ALI - 2021B4A33044H
ROHIT YALAVARTHY - 2021A8TS2294H
KASHISH AGGARWAL - 2021B4AA2916H
YESHWANTH MURTHY - 2022AAPS2024H

BIRLA INSTITUTE OF TECHNOLOGY AND SCIENCE,

PILANI (Rajasthan)

(November, 2024)

Individual Contributions:

Table Of Contents

Introduction
Datasets Overview
Exploratory Data Analysis (EDA)
    Key Points: activity_trimmed
    Statistical Summary
    Activity Categories
    Visualizations
    EDA Summary for data_for_weka_aw.csv
        1. Data Overview
        2. Missing Values
        3. Summary Statistics
    Key Observations
Machine Learning Pipeline Implementation
    a. Training and Testing Framework
    b. Pipeline Construction and Preprocessing
    c. Hyperparameter Tuning with Grid Search
    d. Evaluation and Visualization
        1. Evaluation Metrics
        2. Confusion Matrix
        Results
Custom LSTM Model
    Predicting Heart Rate Using Apple Watch Data with LSTM Model
        Introduction
    Preprocessing
        Preprocessing Steps
    Model Design
        Model Architecture
        Training Process
    Results and Analysis
        Performance Metrics
        Visualization of Results
Comparison with State of the Art

Introduction
The rapid evolution of wearable devices, such as the latest iterations of the Apple Watch, has
significantly enhanced health monitoring and fitness tracking capabilities. These devices
generate rich physiological datasets, enabling advanced machine learning and deep learning
techniques to analyze and predict metrics such as heart rate and activity levels. This project,
utilizing data from the Harvard Dataverse, is divided into three phases, each exploring different
aspects of wearable data analysis.

Phases of the Project:

Phase 1: Exploratory Data Analysis (EDA)

This initial phase focused on understanding the structure and distribution of the data, sourced
from wearable devices such as Apple Watch and Fitbit. Key metrics like heart rate, steps,
calories, and activity intensity (measured in METs) were analyzed. Statistical summaries and
visualizations were used to uncover patterns, correlations, and outliers, laying a foundation for
modeling.

Phase 2: Activity Classification

In the second phase, machine learning models were employed to classify physical activity
types, such as lying, walking, and running at varying MET levels. Using data from wearable
devices, models such as Random Forest, Gradient Boosting, and K-Nearest Neighbors (KNN)
were trained and evaluated. This phase identified the best-performing model for activity
classification based on metrics derived from wearable data.

Phase 3: Heart Rate Prediction with LSTM

The final phase focused on developing a deep learning model to predict heart rate from Apple
Watch data recorded during different activities. A Long Short-Term Memory (LSTM) neural
network was designed to capture the temporal dependencies in the dataset. Preprocessing
steps included scaling features such as steps and activity intensity and one-hot encoding the
activity types. The model was trained to predict heart rate over an 8-minute interval,
demonstrating the potential of deep learning to model heart rate across the different activity
levels a person goes through.

This phased approach showcases the utility of wearable device data and machine learning in
enabling advanced health and fitness tracking solutions.

Datasets Overview

The datasets come from a convenience sample of 46 participants (26 women) who wore two
devices, an Apple Watch Series 2 and a Fitbit Charge HR2. Participants completed a 65-minute
protocol with 40 minutes of total treadmill time and 25 minutes of sitting or lying time. Indirect
calorimetry was used to measure energy expenditure. The outcome variable for the study was
the activity class: lying, sitting, self-paced walking, 3 METs, 5 METs, and 7 METs.
Minute-by-minute heart rate, steps, distance, and calories from the Apple Watch and Fitbit were
included in four different machine learning models. The analysis dataset includes 3656 and
2608 minutes of Apple Watch and Fitbit data, respectively.

Three datasets are used here:

● aw_fb_data:
○ Shape: 6264 rows × 20 columns
○ Columns: Includes general demographic data (example: age, gender, height,
weight), physical activity metrics (example: steps, heart rate, calories, distance),
entropy-based metrics (example: entropy_heart), and device-specific information.
Columns are labeled more generically (example: steps, hear_rate, device).

● data_for_weka_aw:
○ Shape: 3656 rows × 18 columns
○ Columns: Similar to aw_fb_data but focuses specifically on Apple Watch data.
Columns carry an Apple Watch prefix (example: Applewatch.Steps_LE,
Applewatch.Heart_LE). Includes additional entropy-based metrics and trimmed
activity labels.

● data_for_weka_fb:
○ Shape: 2608 rows × 18 columns
○ Columns: Structured similarly to data_for_weka_aw but focuses on Fitbit data.
Columns carry a Fitbit prefix (example: Fitbit.Steps_LE, Fitbit.Heart_LE).

Here, aw_fb_data serves as a master dataset combining both devices, suitable for
comparisons or general analyses. data_for_weka_aw and data_for_weka_fb each focus on a
single device and appear to have been cleaned and formatted for specific analyses or machine
learning tasks.

The main differences between the datasets are as follows:

1. Size:
○ aw_fb_data has the largest number of rows (6264 = 3656 + 2608), consistent
with combining the two device-specific datasets.
○ data_for_weka_aw and data_for_weka_fb have fewer rows because each is a
device-specific subset.

2. Column Names:
○ aw_fb_data uses generic column names applicable to any device.
○ data_for_weka_aw and data_for_weka_fb prefix column names with Apple
Watch or Fitbit, indicating device-specific datasets.

3. Focus:
○ aw_fb_data is a combined or general dataset, encompassing data from multiple
devices.

○ data_for_weka_aw and data_for_weka_fb are tailored for individual devices,
focusing on Apple Watch and Fitbit, respectively.
4. Activity Label:
○ In aw_fb_data, the activity label is activity.
○ In the device-specific datasets, the activity label is activity_trimmed.
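
The naming differences listed above matter as soon as the device-specific files are compared with the combined file. The sketch below shows one possible way to harmonize them. The small in-memory frames stand in for the real CSV files, and the helper `harmonize`, the prefix-stripping rule, and the device names are assumptions made for illustration, not the report's own code.

```python
import pandas as pd

# Tiny stand-ins for the device-specific files (the real files have 18 columns).
aw = pd.DataFrame({
    "Applewatch.Steps_LE": [10.0, 95.0],
    "Applewatch.Heart_LE": [70.0, 110.0],
    "activity_trimmed": ["Lying", "Running 3 METs"],
})
fb = pd.DataFrame({
    "Fitbit.Steps_LE": [1.0, 80.0],
    "Fitbit.Heart_LE": [68.0, 105.0],
    "activity_trimmed": ["Sitting", "Running 5 METs"],
})


def harmonize(df, prefix, device):
    """Hypothetical helper: strip the device prefix and use a common label name."""
    out = df.rename(columns=lambda c: c.replace(prefix + ".", ""))
    out = out.rename(columns={"activity_trimmed": "activity"})
    out["device"] = device  # placeholder device name
    return out


combined = pd.concat(
    [harmonize(aw, "Applewatch", "apple watch"), harmonize(fb, "Fitbit", "fitbit")],
    ignore_index=True,
)
print(combined.columns.tolist())
print(combined.groupby("device").size())
```

After this step both subsets share the same column names, so per-device comparisons reduce to a simple group-by on the `device` column.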

To further understand the datasets, we can look at some high-level summary statistics.

In the combined dataset, aw_fb_data, the step count has a mean of 109.56 and a variance of
49,638.91, while heart rate has a mean of 86.14 and a variance of 820.73, so steps vary far
more relative to their mean than heart rate does. Notably, the "steps times distance" metric
shows extreme variability, with a mean of 590.04 and a variance exceeding 16 million,
suggesting significant outliers or skewed data. Entropy metrics for heart rate and steps are
centered around 6 with low variability, indicating consistent patterns across participants.

The Apple Watch dataset, data_for_weka_aw, shows higher step counts (mean: 180.25)
compared to Fitbit but also greater variability (variance: 72,596.79). Heart rate measurements
have a mean of 91.25 and lower variability, demonstrating stability in readings. Entropy metrics
for steps and heart rate are consistent at approximately 6, mirroring patterns in the combined
dataset. Resting heart rate in the Apple Watch dataset shows less variability (variance: 142.32),
highlighting more consistent readings.

In contrast, the Fitbit dataset, data_for_weka_fb, demonstrates significantly lower mean step
counts (10.47) but higher variability in distance traveled (variance: 4,433.80), reflecting a
broader range of physical activity levels. The Fitbit dataset also exhibits a stronger correlation
between steps and heart rate (mean correlation: 0.727) compared to the Apple Watch dataset
(mean correlation: 0.006), suggesting better synchronization between these metrics. Entropy
metrics in the Fitbit dataset are slightly lower than in the Apple Watch dataset, with higher
variability, which may indicate differences in activity patterns or measurement algorithms.

Overall, the Apple Watch dataset tends to provide more consistent readings with lower
variability, making it suitable for analyses requiring stability. In contrast, the Fitbit dataset
captures a wider range of activities, as evidenced by its higher variance in several metrics like
steps, calories, and distance. The combined dataset, aw_fb_data, integrates data from both
devices but introduces variability due to differences in measurement approaches. Each dataset
offers unique strengths, allowing researchers to select the most appropriate data source based
on the specific objectives of their analysis.
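
Variances are hard to compare directly because they are in squared units. As a rough aid, the sketch below converts some of the mean and variance pairs quoted above into a standard deviation and a coefficient of variation (standard deviation divided by mean). The helper name `spread` is an assumption for illustration.

```python
import math

# (mean, variance) pairs quoted in the text above.
quoted = {
    "aw_fb_data steps": (109.56, 49638.91),
    "aw_fb_data heart rate": (86.14, 820.73),
    "data_for_weka_aw steps": (180.25, 72596.79),
}


def spread(mean, variance):
    """Return the standard deviation and the coefficient of variation (std / mean)."""
    std = math.sqrt(variance)
    return std, std / mean


for name, (mean, variance) in quoted.items():
    std, cv = spread(mean, variance)
    print(f"{name}: std = {std:.2f}, cv = {cv:.2f}")
```

On these figures the step count has a standard deviation of roughly twice its mean in the combined dataset, whereas heart rate varies by about a third of its mean.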

Exploratory Data Analysis (EDA)

Key Points: activity_trimmed

● Numerical Columns: Most columns are numerical (example: age, height).

● Categorical Columns: Only one column, activity_trimmed, is categorical.
● No Missing Values: All columns have complete data.

Statistical Summary:

● Age: Ranges from 18 to 56 years, with an average of ~28.8.

● Gender: Encoded as 0 (likely female) and 1 (male), with a roughly balanced distribution.
● Physical Metrics:
○ Height: 143–191 cm, mean ~169.47 cm.
○ Weight: 43–115 kg, mean ~68.22 kg.
● Fitbit Metrics: Show large variability, example: FitbitStepsXDistance_LE ranges from 1
to 51520.

Activity Categories:

The dataset contains six activity types:

1. Lying
2. Self Pace walk
3. Running 3 METs
4. Running 5 METs
5. Sitting
6. Running 7 METs

Visualizations:

1. Age Distribution: The data skews slightly towards younger ages, with a peak around
25–30 years.
2. Height and Weight Distribution:
○ Height clusters around 160–180 cm.
○ Weight peaks between 60–70 kg, with a smaller spread than height.
3. Activity Class Distribution:
○ Some activities (example: "Lying" and "Self Pace walk") dominate the dataset.
○ Activities like "Running 7 METs" occur less frequently, indicating a potential class
imbalance.
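
Class imbalance of the kind noted above can be quantified before modeling. The following is a minimal sketch using made-up labels; the counts do not reproduce the real dataset, and `class_shares` is a hypothetical helper.

```python
from collections import Counter

# Made-up labels for illustration only; they do not reproduce the real class counts.
labels = (
    ["Lying"] * 6
    + ["Self Pace walk"] * 5
    + ["Sitting"] * 4
    + ["Running 3 METs"] * 3
    + ["Running 5 METs"] * 3
    + ["Running 7 METs"] * 1
)


def class_shares(values):
    """Return each class's share of the samples, most frequent class first."""
    counts = Counter(values)
    total = sum(counts.values())
    return {label: count / total for label, count in counts.most_common()}


shares = class_shares(labels)
# Ratio between the most and least frequent class; 1.0 would mean perfectly balanced.
imbalance_ratio = max(shares.values()) / min(shares.values())
for label, share in shares.items():
    print(f"{label}: {share:.1%}")
print(f"imbalance ratio: {imbalance_ratio:.1f}")
```

A large ratio is a hint that per-class precision and recall, rather than accuracy alone, should be used when comparing classifiers.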

EDA Summary for data_for_weka_aw.csv

1. Data Overview

● Shape: (3656 rows, 18 columns)

● Columns:
○ 3 integer columns: Unnamed: 0, age, gender.
○ 14 float columns: Metrics related to Apple Watch (example: steps, heart rate,
calories).
○ 1 object column: activity_trimmed.
● Memory Usage: 514.3 KB.

2. Missing Values

● No missing values in any column.

3. Summary Statistics

● Age range: 18–56 years.

● Height: 143–191 cm, average ~169.9 cm.
● Weight: 43–115 kg, average ~70.6 kg.
● Apple Watch Metrics:
○ Steps (per minute): Mean ~180, max ~1714.
○ Heart rate: Mean ~91.3 bpm, max ~194.3 bpm.
○ Calories: Mean ~5.78, max ~29.24.
● Other metrics like entropy, intensity, and normalized heart rates show varying
distributions.
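
The overview items above (shape, column types, missing values, memory usage, summary statistics) can be gathered in a few lines of pandas. This is a sketch on a small synthetic frame that stands in for data_for_weka_aw.csv; the values are invented.

```python
import pandas as pd

# Small synthetic frame standing in for data_for_weka_aw.csv.
df = pd.DataFrame({
    "age": [20, 35, 28],
    "Applewatch.Heart_LE": [72.5, 110.0, 95.3],
    "activity_trimmed": ["Lying", "Running 5 METs", "Sitting"],
})

overview = {
    "shape": df.shape,
    # Number of columns per dtype (integer, float, text).
    "dtype_counts": df.dtypes.astype(str).value_counts().to_dict(),
    # Missing values per column; all zeros means the data is complete.
    "missing_per_column": df.isna().sum().to_dict(),
    "memory_bytes": int(df.memory_usage(deep=False).sum()),
}
print(overview)
print(df.describe())  # count, mean, std, min, quartiles, max for numeric columns
```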

The plot above shows the distributions of key numeric variables from the data_for_weka_aw.csv
dataset.

● Observations:
○ Age is fairly evenly distributed, with a peak around 25–30 years.
○ Height and weight have a normal distribution centered around their means (~170
cm and ~70 kg).
○ Apple Watch metrics, like steps and calories, show skewed distributions,
indicating most users fall into lower activity ranges.
○ Heart rate data clusters between 60–100 bpm, with fewer outliers at higher rates.

EDA Summary of aw_fb_data.csv

The dataset contains 6264 entries and 20 columns. Here's a quick summary of its structure:

● Numeric Columns: Includes metrics such as age, height, weight, steps, heart rate,
calories, distance, and others.
● Categorical Columns: device, activity.
● Target/Relevant Insights: Possible correlations between activity type, health metrics,
and device usage.

Here are the visual insights:

1. Age, Height, and Weight Distributions:

○ Age distribution is centered, likely targeting adults.
○ Height and weight show normal distributions with slight variance, suitable for
health data analysis.
2. Correlation Heatmap:
○ Strong correlations between steps, distance, and calories.
○ hear_rate has a notable relationship with resting_heart and intensity_karvonen.
3. Heart Rate by Activity:
○ Different activities show varying distributions of heart rates.
○ Activities like "Running" likely show higher heart rate ranges, while "Lying" has
lower values.
4. Average Steps by Device:
○ Clear differences in step counts among devices, which could indicate usage
patterns or activity tracking precision.
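
The grouped views described above can be computed along the following lines. The frame is synthetic and only reuses the column names mentioned in this report (`device`, `activity`, `steps`, `hear_rate`); the values and device names are placeholders.

```python
import pandas as pd

# Synthetic rows; values are invented for illustration.
df = pd.DataFrame({
    "device": ["apple watch", "apple watch", "fitbit", "fitbit"],
    "activity": ["Lying", "Running 7 METs", "Lying", "Running 7 METs"],
    "steps": [2.0, 150.0, 1.0, 120.0],
    "hear_rate": [65.0, 150.0, 68.0, 140.0],
})

# Heart rate by activity: summary of the distribution per activity class.
hr_by_activity = df.groupby("activity")["hear_rate"].agg(["mean", "min", "max"])

# Average steps by device.
steps_by_device = df.groupby("device")["steps"].mean()

# Pairwise Pearson correlations, the numbers behind a correlation heatmap.
corr = df[["steps", "hear_rate"]].corr()

print(hr_by_activity)
print(steps_by_device)
print(corr)
```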

Machine Learning Pipeline Implementation
To ensure a streamlined and reproducible approach for model training, testing, and
hyperparameter tuning, we designed and implemented a custom Python class,
ClassifierPipeLine. This class allowed us to efficiently build, evaluate, and optimize machine
learning models while incorporating necessary preprocessing steps. Below, we outline its key
functionalities and the process followed for our analysis.

a. Training and Testing Framework

The ClassifierPipeLine class provided a structured framework for model training and testing. It
facilitated the seamless integration of the training data, validation through cross-validation, and
evaluation on test data. The class ensured that models were trained using the optimal
hyperparameters and that their performance was tested under consistent conditions, thereby
enabling reliable comparisons between different classifiers.

b. Pipeline Construction and Preprocessing

The ClassifierPipeLine class includes the capability to build machine learning pipelines with
preprocessing steps and classifiers integrated into a unified structure. This ensures that
necessary data transformations, such as standardization or feature engineering, are
consistently applied to both the training and testing phases.

For example, during our analysis, scaling transformations were integrated into the pipeline for
models like K-Nearest Neighbors (KNN) to ensure optimal performance. This modular approach
ensured that preprocessing steps were reusable across different models and configurations.
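
To illustrate why scaling is bundled with KNN, here is a minimal sketch using scikit-learn's `Pipeline`. It is not the ClassifierPipeLine class itself, whose code is not shown in this report; the data are synthetic, with one informative feature and one large-scale noise feature.

```python
import numpy as np
from sklearn.neighbors import KNeighborsClassifier
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import StandardScaler

rng = np.random.default_rng(0)

# Synthetic two-class data: feature 0 separates the classes, feature 1 is large-scale noise.
X = np.column_stack([
    np.concatenate([rng.normal(0, 1, 100), rng.normal(6, 1, 100)]),
    rng.normal(0, 1000, 200),
])
y = np.array([0] * 100 + [1] * 100)
idx = rng.permutation(200)
train, test = idx[:150], idx[150:]

# The scaler is fitted on the training rows only and reused unchanged on the test rows.
pipe = Pipeline([
    ("scale", StandardScaler()),
    ("knn", KNeighborsClassifier(n_neighbors=5)),
])
pipe.fit(X[train], y[train])
scaled_acc = pipe.score(X[test], y[test])

# Same classifier without scaling: distances are dominated by the noise feature.
raw_acc = KNeighborsClassifier(n_neighbors=5).fit(X[train], y[train]).score(X[test], y[test])
print(f"with scaling: {scaled_acc:.2f}, without scaling: {raw_acc:.2f}")
```

Because the scaler lives inside the pipeline, the same transformation is applied at training and prediction time without any extra bookkeeping.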

c. Hyperparameter Tuning with Grid Search

To achieve optimal performance, hyperparameter tuning was conducted using the
create_grid_search method of the ClassifierPipeLine class. This method combines the pipeline
with a grid search algorithm, evaluating multiple combinations of hyperparameters through
cross-validation.

For each classifier, specific hyperparameters were tuned:

● Random Forest: Parameters like the number of estimators, maximum depth, and feature
selection strategies were explored.
● KNN: The number of neighbors and distance metrics were optimized.
● Gradient Boosting: Learning rate, number of estimators, and tree depth were fine-tuned.

The best-performing hyperparameters were automatically identified, and the models were
retrained with these settings on the entire training dataset.
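
The tuning procedure described in this subsection can be sketched with scikit-learn's `GridSearchCV`. The function below only borrows the name create_grid_search; its signature, the parameter grid, and the synthetic data are assumptions for illustration, since the report's own implementation is not shown.

```python
import numpy as np
from sklearn.model_selection import GridSearchCV
from sklearn.neighbors import KNeighborsClassifier
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import StandardScaler


def create_grid_search(pipeline, param_grid, cv=5):
    """Hypothetical stand-in for the method of the same name in the report's class."""
    # refit=True retrains the best configuration on all of the training data.
    return GridSearchCV(pipeline, param_grid, cv=cv, scoring="accuracy", refit=True)


rng = np.random.default_rng(0)
# Synthetic, well-separated two-class data.
X = np.vstack([rng.normal(0, 1, (60, 3)), rng.normal(3, 1, (60, 3))])
y = np.array([0] * 60 + [1] * 60)

pipe = Pipeline([("scale", StandardScaler()), ("clf", KNeighborsClassifier())])
# Keys use the "<step name>__<parameter>" convention to reach into the pipeline.
grid = {"clf__n_neighbors": [3, 5, 7], "clf__metric": ["euclidean", "manhattan"]}

search = create_grid_search(pipe, grid).fit(X, y)
print(search.best_params_, round(search.best_score_, 3))
```

After fitting, `search.best_estimator_` is the pipeline retrained with the best parameters and can be used directly for evaluation on held-out test data.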

d. Evaluation and Visualization

Evaluating the performance of the models is crucial to understanding their effectiveness in
classifying activities.

1. Evaluation Metrics

● Accuracy: Measures the proportion of correctly classified instances out of the total
instances.

\[ \text{Accuracy} = \frac{\text{True Positives} + \text{True Negatives}}{\text{Total Instances}} \]

● Precision: Focuses on the quality of positive predictions. It is defined as:

\[ \text{Precision} = \frac{\text{True Positives}}{\text{True Positives} + \text{False Positives}} \]

High precision indicates a low false positive rate.

● Recall (Sensitivity or True Positive Rate): Measures the model’s ability to capture all
relevant positive cases.

\[ \text{Recall} = \frac{\text{True Positives}}{\text{True Positives} + \text{False Negatives}} \]

High recall means fewer false negatives.

● F1 Score: The harmonic mean of precision and recall. It’s useful when you want a
balance between precision and recall, especially with imbalanced classes.

\[ \text{F1 Score} = 2 \times \frac{\text{Precision} \times \text{Recall}}{\text{Precision} + \text{Recall}} \]

● The metrics for the four classification models are as follows:

2. Confusion Matrix

● A confusion matrix gives a detailed breakdown of correct and incorrect classifications.
For a given activity class:
○ True Positives (TP) are instances of the activity that were correctly predicted as
that activity.
○ False Positives (FP) are instances predicted as the activity that actually belonged
to another activity.
○ True Negatives (TN) are instances of other activities that were correctly not
predicted as the activity.
○ False Negatives (FN) are instances of the activity that were predicted as another
activity.
● The confusion matrix shows which activities are confused with one another, which
helps guide further model development.
● Confusion matrices for the four models are shown below:
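
As an illustration of how the metric definitions and the confusion-matrix terms fit together in the multi-class case, the sketch below derives per-class TP, FP, FN, and TN from a small matrix and computes precision, recall, and F1 from them. The counts are made up and do not come from the report's models; `per_class_metrics` is a hypothetical helper.

```python
# Rows are actual classes, columns are predicted classes (made-up counts).
labels = ["Lying", "Sitting", "Running 7 METs"]
matrix = [
    [8, 2, 0],
    [3, 6, 1],
    [0, 1, 9],
]


def per_class_metrics(matrix, index):
    """Treat class `index` as positive and all other classes as negative."""
    total = sum(sum(row) for row in matrix)
    tp = matrix[index][index]
    fp = sum(row[index] for row in matrix) - tp  # predicted as this class, actually another
    fn = sum(matrix[index]) - tp                 # actually this class, predicted as another
    tn = total - tp - fp - fn
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    f1 = 2 * precision * recall / (precision + recall) if precision + recall else 0.0
    return {"tp": tp, "fp": fp, "fn": fn, "tn": tn,
            "precision": precision, "recall": recall, "f1": f1}


# Overall multi-class accuracy: correctly classified instances (the diagonal) over all instances.
accuracy = sum(matrix[i][i] for i in range(len(matrix))) / sum(sum(row) for row in matrix)

sitting = per_class_metrics(matrix, labels.index("Sitting"))
print(f"accuracy = {accuracy:.3f}")
print({k: round(v, 3) for k, v in sitting.items()})
```

In this toy matrix, "Sitting" is mostly confused with "Lying", which is the kind of pattern a confusion matrix makes visible and a single accuracy figure hides.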

Results:

● Random Forest
○ Accuracy (%): 82
○ Best Class: Running 7 METs (Precision = 0.91, Recall = 0.88, F1 = 0.90)
○ Worst Class: Sitting (Precision = 0.75, Recall = 0.69, F1 = 0.72)
○ Key Strengths: Excellent precision and consistent performance across a majority of
categories. Effective management of noisy or unimportant features by utilizing feature
bagging.
○ Weaknesses: Minor underachievement during stationary activities (e.g., sitting).
○ Feature Importance Insights: The predominance of heart rate metrics (such as
norm_heart, heart_rate, and others) suggests that these variables are indicative, which
is consistent with the feature importance chart.

● K-Nearest Neighbors
○ Accuracy (%): 61
○ Best Class: Running 3 METs (Precision = 0.65, Recall = 0.74, F1 = 0.69)
○ Worst Class: Sitting (Precision = 0.50, Recall = 0.35, F1 = 0.41)
○ Key Strengths: Simple algorithm, easy to interpret. Effective for high-intensity
activities where feature distances are distinct.
○ Weaknesses: Ineffective classification for low-intensity activities because of feature
overlap. High computational demands for larger datasets.
○ Feature Importance Insights: No clear method for determining feature importance
effectively addresses the subtle interactions between features.

● Naive Bayes
○ Accuracy (%): 32
○ Best Class: Self Pace Walk (Precision = 0.26, Recall = 0.69, F1 = 0.37)
○ Worst Class: Sitting (Precision = 0.19, Recall = 0.06, F1 = 0.09)
○ Key Strengths: Quick and effective on limited datasets. Presumes that features are
independent.
○ Weaknesses: Poor performance across all categories resulting from the rigid
independence assumption. Incapable of capturing intricate relationships within the data.
○ Feature Importance Insights: Not relevant because there is no clear feature weighting.

● Gradient Boosting
○ Accuracy (%): 83
○ Best Class: Running 7 METs (Precision = 0.93, Recall = 0.89, F1 = 0.90)
○ Worst Class: Sitting (Precision = 0.71, Recall = 0.71, F1 = 0.71)
○ Key Strengths: Surpassed other models in both accuracy and precision across the
majority of classes. Identifies subtle patterns through a process of iterative boosting.
○ Weaknesses: More resource-intensive than basic algorithms. Some challenges in
differentiating low-intensity activities.
○ Feature Importance Insights: In alignment with Random Forest findings, heart rate
features (norm_heart, heart_rate) are the most prominent in the rankings.

Custom LSTM Model

Predicting Heart Rate Using Apple Watch Data with LSTM Model

Introduction

Wearable devices such as the Apple Watch provide a unique opportunity to monitor and predict
physiological parameters like heart rate during various physical activities, which is helpful for
athletes. Accurate heart rate prediction can enable more effective fitness tracking and health
monitoring. In this part of the project, we use data collected from Apple Watch sensors, whose
heart rate readings are considered close to the gold-standard ECG measurement according to
this paper (mc.ncbi.nlm.nih.gov/articles/PMC6444219), to build a Long Short-Term Memory
(LSTM) neural network model that predicts heart rate over an 8-minute interval while training
on the remaining 56 minutes.

This part of the project focuses on modeling heart rate fluctuations in response to activity
intensity using time-series data collected from 46 participants. The goal is to demonstrate how
temporal dependencies in sequential data can be effectively captured using LSTM to achieve
accurate predictions.

Preprocessing

Features Utilized:

1. Applewatch.Steps_LE: Represents the number of steps taken per minute.

2. RestingApplewatchHeartrate_LE: Provides the baseline resting heart rate for each
participant.
3. ApplewatchStepsXDistance_LE: Captures the combined effect of steps and distance
traveled, representing physical exertion.
4. ApplewatchIntensity_LE: Denotes activity intensity levels as measured in METs.
5. Activity Type (in one hot encoded format)

Preprocessing Steps

1. Trimming Data:
○ Each participant’s data was trimmed to ensure a consistent number of entries (64
data points) across individuals.
2. Feature Scaling:
○ Both the input features and target heart rate values were normalized to the range
[0, 1] using Min-Max Scaling, ensuring uniformity across different units.
3. Sequence Generation:
○ Data was transformed into sequential samples, where 20 consecutive time steps
were used as input to predict the heart rate at the next time step. This sliding
window approach allowed the model to capture temporal patterns in the data.
4. One-Hot Encoding:
○ Categorical variables, in this case, activity types, were one-hot encoded to
provide distinct representations for the model.
5. Train-Test Split:
○ For each participant, 80% of the data was used for training, and 20% was
reserved for testing.
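
The preprocessing steps above can be sketched for a single participant as follows. The data are random placeholders. The chronological split and the choice to scale with the minimum and maximum of the whole series are assumptions made for brevity, and the helper names are hypothetical.

```python
import numpy as np

WINDOW = 20  # time steps per input sequence, as stated above


def min_max_scale(values, lo, hi):
    """Scale values to [0, 1] using the given minimum and maximum."""
    return (values - lo) / (hi - lo)


def make_windows(inputs, target, window=WINDOW):
    """X[i] holds `window` consecutive rows; y[i] is the target at the following step."""
    X = np.stack([inputs[i:i + window] for i in range(len(inputs) - window)])
    y = target[window:]
    return X, y


rng = np.random.default_rng(0)
n_steps = 64                                   # data points per participant after trimming
numeric = rng.uniform(0, 200, (n_steps, 4))    # placeholder for the 4 numeric features
heart = rng.uniform(60, 180, n_steps)          # placeholder heart rate in bpm
activity = np.repeat(["Lying", "Sitting", "Self Pace walk", "Running 3 METs"], 16)

# Feature scaling to [0, 1], column by column.
numeric_scaled = min_max_scale(numeric, numeric.min(axis=0), numeric.max(axis=0))
heart_scaled = min_max_scale(heart, heart.min(), heart.max())

# One-hot encoding of the activity type.
categories, codes = np.unique(activity, return_inverse=True)
one_hot = np.eye(len(categories))[codes]

inputs = np.hstack([numeric_scaled, one_hot])
X, y = make_windows(inputs, heart_scaled)

# Chronological 80/20 split of the windows (an assumption about how the split was done).
split = int(0.8 * len(X))
X_train, y_train, X_test, y_test = X[:split], y[:split], X[split:], y[split:]
print(X.shape, X_train.shape, X_test.shape)
```

With 64 points and a window of 20, each participant yields 44 windows. If the scaler were fitted on the training portion only, no information from the test windows would leak into the scaled inputs.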

Model Design

Model Architecture

The LSTM model was designed to leverage the temporal nature of the dataset with the following
components:

1. Input Layer:
○ Accepts sequences of 20 time steps, with each step containing scaled feature
values.
2. Bidirectional LSTM Layers:
○ Two stacked layers of Bidirectional LSTM units (128 and 64 units, respectively)
were used to capture temporal dependencies in both forward and backward
directions. Regularization (l2) was applied to prevent overfitting.
3. Dropout Layers:
○ A dropout probability of 0.3 was applied after each LSTM layer to further reduce
overfitting.
4. Dense Output Layer:
○ The final dense layer produces a single scalar value representing the predicted
heart rate.
5. Loss Function:
○ Mean Squared Error (MSE) was used to minimize the difference between actual
and predicted heart rate values.
6. Optimizer:
○ Adam optimizer was used with a learning rate of 0.00005, ensuring efficient
convergence during training.
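
To give a sense of scale for the architecture described above, the sketch below counts its trainable parameters. It assumes standard LSTM cells (four gates, no peephole connections), that the forward and backward outputs are concatenated, and an input of 10 features per time step (4 numeric features plus 6 one-hot activity columns). None of these details are confirmed by the report, so the total is indicative only.

```python
def lstm_params(input_dim, units):
    """One LSTM direction: 4 gates, each with input weights, recurrent weights, and a bias."""
    return 4 * (units * (input_dim + units) + units)


def bidirectional_params(input_dim, units):
    """A bidirectional layer holds two independent LSTMs (forward and backward)."""
    return 2 * lstm_params(input_dim, units)


N_FEATURES = 10  # assumption: 4 numeric features + 6 one-hot activity columns

layer1 = bidirectional_params(N_FEATURES, 128)
layer2 = bidirectional_params(2 * 128, 64)  # input is the concatenated output of layer 1
dense = 2 * 64 + 1                          # 128 concatenated outputs -> 1 value, plus a bias
total = layer1 + layer2 + dense

print(f"layer 1: {layer1}, layer 2: {layer2}, dense: {dense}, total: {total}")
```

Dropout and L2 regularization add no trainable parameters, so they do not appear in the count. With only 64 data points per participant, a model of this size makes the regularization and early stopping described in this section important.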

Training Process

1. Early Stopping:
○ The training was stopped early if validation loss did not improve for 25
consecutive epochs, preventing overfitting and unnecessary computations.
2. Training Parameters:
○ Batch Size: 32 samples.
○ Epochs: 150 maximum, with early stopping applied.
3. Sequence Input:
○ Input sequences consisted of 20 time steps, allowing the model to predict the
heart rate for the next minute.
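
The early stopping rule can be illustrated without any deep learning framework. The sketch below applies a patience of 25 and a cap of 150 epochs to a made-up sequence of validation losses; the function name and the loss curve are assumptions for illustration.

```python
def train_with_early_stopping(val_losses, patience=25, max_epochs=150):
    """Return (stopped_epoch, best_epoch, best_loss) for per-epoch validation losses."""
    best_loss, best_epoch, waited = float("inf"), 0, 0
    for epoch, loss in enumerate(val_losses[:max_epochs], start=1):
        if loss < best_loss:
            # Improvement: remember it and reset the patience counter.
            best_loss, best_epoch, waited = loss, epoch, 0
        else:
            waited += 1
            if waited >= patience:
                return epoch, best_epoch, best_loss
    return min(len(val_losses), max_epochs), best_epoch, best_loss


# Made-up loss curve: improves for 40 epochs, then plateaus at a slightly higher value.
losses = [1.0 / epoch for epoch in range(1, 41)] + [0.03] * 110
stopped, best, best_loss = train_with_early_stopping(losses)
print(f"stopped at epoch {stopped}, best epoch {best}, best loss {best_loss:.3f}")
```

In this toy run, training halts 25 epochs after the last improvement rather than running all 150 epochs.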

Results and Analysis

Performance Metrics

● Average RMSE: The model achieved an average Root Mean Squared Error (RMSE) of
2.34 bpm across all participants.

● Alignment: Predictions closely matched actual heart rate values, particularly during
low-to-moderate activity intensity phases.
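
Because the targets were Min-Max scaled, an RMSE expressed in bpm requires mapping predictions back to the original range first. The sketch below shows that step on made-up values for one participant; the heart rate range and the predictions are invented, and the helper names are hypothetical.

```python
import math


def inverse_min_max(scaled, lo, hi):
    """Undo Min-Max scaling: map a value in [0, 1] back to the original range."""
    return scaled * (hi - lo) + lo


def rmse(actual, predicted):
    """Root Mean Squared Error between two equally long sequences."""
    return math.sqrt(sum((a - p) ** 2 for a, p in zip(actual, predicted)) / len(actual))


# Hypothetical heart rate range (bpm) used to scale this participant's targets.
hr_min, hr_max = 60.0, 160.0
actual_scaled = [0.20, 0.35, 0.50, 0.40]
predicted_scaled = [0.22, 0.32, 0.51, 0.44]

actual_bpm = [inverse_min_max(v, hr_min, hr_max) for v in actual_scaled]
predicted_bpm = [inverse_min_max(v, hr_min, hr_max) for v in predicted_scaled]

rmse_scaled = rmse(actual_scaled, predicted_scaled)
rmse_bpm = rmse(actual_bpm, predicted_bpm)
print(f"RMSE (scaled) = {rmse_scaled:.4f}, RMSE (bpm) = {rmse_bpm:.2f}")
```

The RMSE in bpm equals the scaled RMSE multiplied by the width of the heart rate range. An average across participants, as reported above, would be the mean of these per-participant values.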

Visualization of Results

Figures below show the actual vs. predicted heart rate values for selected participants:

Comparison with State of the Art

The study that collected this data was conducted in 2018; since then, the hardware in both the
Fitbit and the Apple Watch has improved.

The Apple Watch Series 10 and the latest Fitbit models seem to cater to different user needs.
The Apple Watch Series 10 boasts a thinner design with the largest display yet, enhanced sleep
apnea notifications, water depth and temperature sensing, and advanced health metrics through
watchOS 11, including a new Vitals app for monitoring key overnight health data. It integrates
seamlessly with the Apple ecosystem, allowing for extensive app usage, notifications, and
contactless payments via Apple Pay.

In contrast, Fitbit emphasizes affordability and battery life, with some models lasting several
days on a single charge compared to the Apple Watch's 18-hour battery life. Fitbit devices excel
in basic fitness tracking features like heart rate monitoring and sleep analysis but offer fewer
smartwatch functionalities. They are compatible with both iOS and Android devices, making
them versatile for a broader audience. Overall, the choice between the two largely depends on
whether users prioritize comprehensive smartwatch capabilities or extended fitness tracking
features at a lower price point.

As in our findings above, the latest Apple Watch appears to outperform the Fitbit, which is no
surprise considering the cost difference between them.

From Everand
Data Science and Big Data Analytics: Discovering, Analyzing, Visualizing and Presenting Data
EMC Education Services
No ratings yet
Data Science through R. Unsupervised Learning. Dimension Reduction Techniques: Principal Components, Factor Analysis and Correspondence Analysis
From Everand
Data Science through R. Unsupervised Learning. Dimension Reduction Techniques: Principal Components, Factor Analysis and Correspondence Analysis
César Pérez López
No ratings yet
Machine Learning for Time Series Forecasting with Python
From Everand
Machine Learning for Time Series Forecasting with Python
Francesca Lazzeri
4/5 (2)
Thinking Statistically
From Everand
Thinking Statistically
Anthony Banfield
5/5 (1)
Improved Performance Research Integration Tool User Guide - Version 4.6
From Everand
Improved Performance Research Integration Tool User Guide - Version 4.6
Beth Plott
No ratings yet
Core Concepts in Statistical Learning
From Everand
Core Concepts in Statistical Learning
Tushar Gulati
No ratings yet
Data Mining Models: Techniques and Applications
From Everand
Data Mining Models: Techniques and Applications
Ravi Deshpande
No ratings yet
Introduction to Machine Learning and Neural Classification
From Everand
Introduction to Machine Learning and Neural Classification
Trilokesh Khatri
No ratings yet
Data Science with R: Beginner to Expert
From Everand
Data Science with R: Beginner to Expert
Narayana Nemani
No ratings yet
Real-Time Analytics: Techniques to Analyze and Visualize Streaming Data
From Everand
Real-Time Analytics: Techniques to Analyze and Visualize Streaming Data
Byron Ellis
No ratings yet
"Big Data Science" Basic Concepts and Applications
From Everand
"Big Data Science" Basic Concepts and Applications
Sukanta Bhattacharya
No ratings yet
DATA MINING and MACHINE LEARNING. PREDICTIVE TECHNIQUES: ENSEMBLE METHODS, BOOSTING, BAGGING, RANDOM FOREST, DECISION TREES and REGRESSION TREES.: Examples with MATLAB
From Everand
DATA MINING and MACHINE LEARNING. PREDICTIVE TECHNIQUES: ENSEMBLE METHODS, BOOSTING, BAGGING, RANDOM FOREST, DECISION TREES and REGRESSION TREES.: Examples with MATLAB
César Pérez López
No ratings yet
Data Structures I Essentials
From Everand
Data Structures I Essentials
Dennis Smolarski
No ratings yet
Introduction To Business Statistics Through R Software: Software
From Everand
Introduction To Business Statistics Through R Software: Software
Editor IJSMI
No ratings yet
Algorithms and Data Structures: An Easy Guide to Programming Skills
From Everand
Algorithms and Data Structures: An Easy Guide to Programming Skills
Rigdon Jonathan
No ratings yet
Machine Learning with Clustering: A Visual Guide for Beginners with Examples in Python
From Everand
Machine Learning with Clustering: A Visual Guide for Beginners with Examples in Python
Artem Kovera
No ratings yet
Data Analytics
From Everand
Data Analytics
Jeffery Short
1/5 (1)
A Quick and Easy Guide in Using SPSS for Linear Regression Analysis
From Everand
A Quick and Easy Guide in Using SPSS for Linear Regression Analysis
Jurex Gallo
No ratings yet
Exploring the World of Data Science and Machine Learning
From Everand
Exploring the World of Data Science and Machine Learning
NIBEDITA Sahu
No ratings yet
Machine Learning - A Comprehensive, Step-by-Step Guide to Learning and Applying Advanced Concepts and Techniques in Machine Learning: 3
From Everand
Machine Learning - A Comprehensive, Step-by-Step Guide to Learning and Applying Advanced Concepts and Techniques in Machine Learning: 3
Peter Bradley
No ratings yet
The InfluxDB Handbook: Deploying, Optimizing, and Scaling Time Series Data
From Everand
The InfluxDB Handbook: Deploying, Optimizing, and Scaling Time Series Data
Robert Johnson
No ratings yet
Learn Data Warehousing in 24 Hours
From Everand
Learn Data Warehousing in 24 Hours
Alex Nordeen
No ratings yet
