Chapter 9 BTC PRICE PRED

The document outlines a comprehensive guide for predicting Bitcoin prices using Linear Regression, covering data inspection, preprocessing, visualization, model training, and evaluation. It includes steps for reading data, visualizing relationships, understanding correlations, selecting features, and making predictions. The final sections discuss model performance metrics and real-world applications, providing insights into the practical use of the regression model for financial forecasting.

Uploaded by

Nawaz Wariya

NW INTERNSHIP 10CP

BTC Price Prediction Using Linear Regression

Step 1: Reading and Inspecting the Data
import numpy as np
import pandas as pd
import matplotlib.pyplot as plt
import seaborn as sns
from sklearn.model_selection import train_test_split
from sklearn.linear_model import LinearRegression
from sklearn.metrics import mean_squared_error, mean_absolute_error

# Reading the CSV file and loading it into a DataFrame
df = pd.read_csv('BTC-USD.csv')

# Checking basic information about the DataFrame
df.info()

Explanation:
• Imports: The required libraries are imported
(numpy, pandas, matplotlib, seaborn, sklearn).
• Data Loading: The BTC-USD.csv file is loaded
into a Pandas DataFrame (df).
• Data Inspection: df.info() provides basic
information about the DataFrame, such as
column names, data types, and non-null counts
(which reveal missing values).

DataFrame Information
The info() method gives a concise summary of the
DataFrame. It provides the following details:
• Data types of each column
• Non-null counts
• Memory usage
df.info()
This helps in understanding the structure of the
data and identifying any potential issues such as
missing values or incorrect data types.
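As a complement to info(), missing values and data types can also be checked directly. The sketch below uses a small hypothetical DataFrame (the values are made up for illustration and are not taken from BTC-USD.csv).

```python
import numpy as np
import pandas as pd

# Hypothetical stand-in for BTC-USD.csv: three rows, one missing 'Close'.
sample = pd.DataFrame({
    'Date': ['2019-01-01', '2019-01-02', '2019-01-03'],
    'Open': [3746.7, 3849.2, 3931.0],
    'Close': [3843.5, np.nan, 3836.7],
})

# Count missing values per column (info() shows only non-null counts).
missing_per_column = sample.isna().sum()
print(missing_per_column)

# Data types: 'Date' is still a text column at this point, not a datetime.
print(sample.dtypes)
```

A non-zero count in missing_per_column signals that the column needs attention before modeling.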

Step 2: Data Preprocessing and Visualization

Converting the 'Date' Column
# Converting the 'Date' column datatype from object to datetime
df['Date'] = pd.to_datetime(df['Date'])

# Checking updated information after conversion
df.info()

Explanation:
• Date Conversion: The 'Date' column is
converted from object type to datetime using
pd.to_datetime().
• Updated Information: df.info() confirms the
conversion, showing the 'Date' column now has
datetime datatype.
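If the raw file might contain malformed dates, the conversion can be made more defensive. The following is a minimal sketch on hypothetical data, assuming that unparseable rows should simply be dropped and that the rows should end up in chronological order.

```python
import pandas as pd

# Hypothetical raw dates, one of them malformed.
raw = pd.DataFrame({'Date': ['2019-01-02', 'not a date', '2019-01-01'],
                    'High': [3947.9, 3900.0, 3850.9]})

# errors='coerce' turns unparseable entries into NaT instead of raising.
raw['Date'] = pd.to_datetime(raw['Date'], errors='coerce')

# Drop rows whose date could not be parsed, then order chronologically.
clean = raw.dropna(subset=['Date']).sort_values('Date').reset_index(drop=True)
print(clean)
```

Sorting by date is useful for the time-based plots that follow, since line plots connect points in row order.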
Visualizing Data with Scatter Plots
# Visualizing data with scatter plots
plt.figure(figsize=(8, 6))
plt.scatter(df['Date'], df['High'])
plt.ylabel('High')
plt.xlabel('Date')
plt.title("Date vs. High (Scatter Plot)")
plt.show()

Explanation:
• Visualization: A scatter plot (plt.scatter) is
created to visualize the relationship between
'Date' and 'High' prices, helping to understand
the data distribution and trends.

Step 3: Exploring Data Relationships and Trends

Scatter Plot of 'Date' vs. 'Low'
# Scatter plot of 'Date' vs. 'Low'
plt.figure(figsize=(8, 6))
plt.scatter(df['Date'], df['Low'])
plt.ylabel('Low')
plt.xlabel('Date')
plt.title("Date vs. Low (Scatter Plot)")
plt.show()
Line Plot of 'Date' with 'High' and 'Low' Prices
# Line plot of 'Date' with 'High' and 'Low' prices
plt.plot(df['Date'], df['High'], label='High')
plt.plot(df['Date'], df['Low'], label='Low')
plt.xlabel('Date')
plt.ylabel('Price')
plt.title('High and Low Prices Over Time')
plt.legend()
plt.show()

Explanation:
• Visualization Continues: Another scatter plot
shows the relationship between 'Date' and 'Low'
prices.
• Price Trends: Line plots (plt.plot) are used to
visualize the trends of 'High' and 'Low' prices
over time, providing insights into price volatility
and historical movements.
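Volatility can also be put into numbers rather than only read off a plot. One simple proxy, sketched here on made-up prices, is the daily range ('High' minus 'Low') smoothed with a rolling mean. The column names 'Range' and 'Range_3d' and the 3-day window are illustrative choices, not part of the original analysis.

```python
import pandas as pd

prices = pd.DataFrame({
    'High': [110.0, 120.0, 115.0, 130.0, 125.0],
    'Low':  [100.0, 105.0, 110.0, 112.0, 120.0],
})

# Daily trading range: how far the price moved within each day.
prices['Range'] = prices['High'] - prices['Low']

# 3-day rolling mean of the range; the first two rows are NaN because
# a full window is not yet available.
prices['Range_3d'] = prices['Range'].rolling(window=3).mean()
print(prices)
```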

Step 4: Understanding Data Correlations

Heatmap of Correlations Among Numerical
Columns
# Heatmap of correlations among numerical columns
numerical_cols = ['Open', 'High', 'Low', 'Close', 'Adj Close', 'Volume']
corr_matrix = df[numerical_cols].corr()

sns.heatmap(corr_matrix, annot=True, cmap='coolwarm')
plt.title('Correlation Heatmap')
plt.show()

Explanation:
• Correlation Heatmap: sns.heatmap() creates a
heatmap to visualize correlations (corr())
among numerical columns ('Open', 'High', 'Low',
'Close', 'Adj Close', 'Volume'). This helps in
understanding how different variables are
related, which is crucial for feature selection in
modeling.

Detailed Analysis of Correlations

• Strong Positive Correlation: The heatmap is
checked for strong positive correlations, for
example between 'High' and 'Close' or between
'Open' and 'Close'.
• Weak Correlation: Columns that correlate only
weakly with 'Close' are identified, since they
might not be as useful for prediction.
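Besides reading the heatmap, the correlations with the target can be ranked numerically. The sketch below runs on synthetic data (a random-walk 'Close' with noisy companions), so the numbers it prints say nothing about the real BTC-USD.csv file; it only illustrates the technique.

```python
import numpy as np
import pandas as pd

rng = np.random.default_rng(0)
n = 200
close = 4000 + np.cumsum(rng.normal(0, 50, n))   # synthetic random-walk price
data = pd.DataFrame({
    'Open': close + rng.normal(0, 20, n),          # tracks Close closely
    'High': close + np.abs(rng.normal(0, 30, n)),
    'Low': close - np.abs(rng.normal(0, 30, n)),
    'Volume': rng.normal(5e9, 1e9, n),             # unrelated to Close here
    'Close': close,
})

# Correlation of every other column with 'Close', strongest first.
corr_with_close = (data.corr()['Close']
                   .drop('Close')
                   .sort_values(ascending=False))
print(corr_with_close)
```

Note that very high correlations among the features themselves (multicollinearity) can make individual regression coefficients hard to interpret later on.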

Step 5: Model Preparation and Feature Selection

Selecting Relevant Features for Modeling
# Selecting relevant features for modeling and defining target variable
X = df[['Open', 'High', 'Low', 'Volume']]
y = df['Close']

# Splitting the data into training and testing sets
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.3, random_state=42)

# Checking the first few rows of the training set
print(X_train.head())
print(y_train.head())

Explanation:
• Feature Selection: Features (X) such as 'Open',
'High', 'Low', and 'Volume' are selected for
modeling, while 'Close' is chosen as the target
variable (y).
• Data Splitting: train_test_split() splits the data
into training (X_train, y_train) and testing
(X_test, y_test) sets with a test size of 30% and
a fixed random state for reproducibility.
• Data Validation: head() displays the first few
rows of the training set to verify the correct
selection and splitting of data.
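The split above shuffles rows at random. For time-ordered price data, a chronological split is a common alternative, because it keeps all test rows later than all training rows. The sketch below is not the method used in this chapter; it assumes a hypothetical frame that is already sorted by date.

```python
import numpy as np
import pandas as pd
from sklearn.model_selection import train_test_split

# Hypothetical frame already sorted by date (10 consecutive days).
frame = pd.DataFrame({
    'Date': pd.date_range('2019-01-01', periods=10, freq='D'),
    'Open': np.arange(10, dtype=float),
    'Close': np.arange(10, dtype=float) + 0.5,
})

X_demo = frame[['Open']]
y_demo = frame['Close']

# shuffle=False keeps row order: earlier rows train, later rows test.
Xd_train, Xd_test, yd_train, yd_test = train_test_split(
    X_demo, y_demo, test_size=0.3, shuffle=False)

print(frame.loc[Xd_train.index, 'Date'].max())  # last training day
print(frame.loc[Xd_test.index, 'Date'].min())   # first test day
```

With shuffle=False the random_state argument is not needed, since no randomness is involved.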

Feature Engineering
• Feature Transformation: Feature
transformations (e.g., a log transformation)
may improve model performance.
• Handling Missing Values: Any missing values
must be handled before training, because
LinearRegression does not accept NaN inputs.
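Both ideas can be sketched in a few lines. The example below uses hypothetical values and two assumptions that the original does not make: gaps are filled with the previous day's value, and the log transformation is applied to 'Volume' in a new, illustrative column 'Log_Volume'.

```python
import numpy as np
import pandas as pd

raw = pd.DataFrame({
    'Close': [3843.5, np.nan, 3836.7, 3857.7],
    'Volume': [4.3e9, 5.2e9, np.nan, 5.1e9],
})

# Forward-fill carries the last known value forward; any rows that are
# still incomplete (e.g. a gap in the very first row) are dropped.
filled = raw.ffill().dropna()

# log1p compresses the large scale of 'Volume'.
filled['Log_Volume'] = np.log1p(filled['Volume'])
print(filled)
```

Whether forward-filling is appropriate depends on why the values are missing; dropping the affected rows is the more conservative option.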


Step 6: Model Training and Evaluation

Initializing and Training the Linear Regression
Model
# Initializing and training the Linear Regression model
model = LinearRegression()
model.fit(X_train, y_train)

Predicting on the Test Set
# Predicting on the test set
y_pred = model.predict(X_test)

Evaluating Model Performance
# Evaluating model performance
r_squared = model.score(X_test, y_test)
print('Coefficient of determination (R^2):', r_squared)

mse = mean_squared_error(y_test, y_pred)
rmse = np.sqrt(mse)
mae = mean_absolute_error(y_test, y_pred)

print("Mean Squared Error:", mse)
print("Root Mean Squared Error:", rmse)
print("Mean Absolute Error:", mae)

Explanation:
• Model Initialization and Training:
LinearRegression() initializes a Linear
Regression model (model) which is then trained
(fit()) on the training data (X_train, y_train).
• Prediction and Evaluation: predict() predicts
'Close' prices on the test set (X_test), and
model performance metrics such as R-squared
(score()), Mean Squared Error
(mean_squared_error()), Root Mean Squared
Error (np.sqrt() of the MSE), and Mean Absolute
Error (mean_absolute_error()) are calculated
and printed.

Detailed Performance Metrics Analysis

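• R-squared Interpretation: The coefficient of
determination is the share of the variance in
'Close' that the model explains. A value of 1
means a perfect fit on the evaluated data; on
test data it can be negative if the model does
worse than always predicting the mean.
• Error Metrics: MSE averages the squared
errors and so penalizes large errors heavily.
RMSE is its square root and is in the same units
as 'Close'. MAE averages the absolute errors
and is less sensitive to outliers, which affects
how each metric reflects model performance.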
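To make the metrics concrete, they can be computed by hand on a tiny example. The four true and predicted values below are invented for illustration; the formulas are the standard definitions of MSE, RMSE, MAE, and R-squared.

```python
import numpy as np

y_true = np.array([100.0, 200.0, 300.0, 400.0])
y_hat = np.array([110.0, 190.0, 310.0, 380.0])

errors = y_true - y_hat                      # [-10, 10, -10, 20]
mse_demo = np.mean(errors ** 2)              # (100 + 100 + 100 + 400) / 4
rmse_demo = np.sqrt(mse_demo)
mae_demo = np.mean(np.abs(errors))           # (10 + 10 + 10 + 20) / 4

ss_res = np.sum(errors ** 2)                         # residual sum of squares
ss_tot = np.sum((y_true - y_true.mean()) ** 2)       # total sum of squares
r2_demo = 1 - ss_res / ss_tot

print("MSE:", mse_demo)
print("RMSE:", rmse_demo)
print("MAE:", mae_demo)
print("R^2:", r2_demo)
```

Here the single error of 20 contributes 400 of the 700 squared-error total, which shows why MSE and RMSE react more strongly to large misses than MAE does.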

Step 7: Interpreting Model Results

Extracting Model Coefficients and Intercept
# Extracting model coefficients and intercept
coefficients = model.coef_
intercept = model.intercept_
print("Coefficients (w):", coefficients)
print("Intercept (b):", intercept)

Explanation:
• Model Coefficients: coef_ retrieves the
coefficients of the features (Open, High, Low,
Volume) in the Linear Regression model
(model), while intercept_ retrieves the intercept
(b).
• Understanding Impact: Printing these
coefficients and intercept helps understand
their impact on predicting the 'Close' price
based on the selected features.
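A linear regression prediction is the intercept plus the sum of each coefficient times its feature value. The sketch below checks this on synthetic data with two made-up features; the names X_demo, y_demo, and demo_model are illustrative and separate from the chapter's X, y, and model.

```python
import numpy as np
from sklearn.linear_model import LinearRegression

rng = np.random.default_rng(1)
X_demo = rng.normal(size=(50, 2))
# Exact linear relation with no noise: y = 3*x0 - 2*x1 + 5.
y_demo = 3.0 * X_demo[:, 0] - 2.0 * X_demo[:, 1] + 5.0

demo_model = LinearRegression().fit(X_demo, y_demo)

x_new = np.array([1.0, 2.0])
# Manual prediction: b + w . x
manual = demo_model.intercept_ + np.dot(demo_model.coef_, x_new)
# Same prediction through the model API (expects a 2-D input).
from_model = demo_model.predict(x_new.reshape(1, -1))[0]

print(manual, from_model)
```

Both values agree, which confirms that coef_ and intercept_ fully describe the fitted model.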

Interpretation of Coefficients
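• Feature Impact: Each coefficient estimates how
much the predicted 'Close' changes for a
one-unit increase in that feature while the
other features are held fixed.
• Significance Testing: Significance testing of
coefficients (e.g., p-values) indicates whether a
coefficient differs reliably from zero.
scikit-learn's LinearRegression does not report
p-values, so they have to be computed
separately.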
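One way to obtain p-values is to compute them from the classical ordinary least squares formulas. The sketch below does this with numpy and scipy on synthetic data. scipy is an assumption here (the chapter does not import it), and the resulting p-values rely on the usual OLS assumptions of independent, normally distributed errors, which daily price data may not satisfy.

```python
import numpy as np
from scipy import stats

rng = np.random.default_rng(2)
n = 200
x_signal = rng.normal(size=n)            # truly related to the target
x_noise = rng.normal(size=n)             # unrelated to the target
y_syn = 2.0 * x_signal + rng.normal(scale=0.5, size=n)

# Design matrix with an intercept column.
A = np.column_stack([np.ones(n), x_signal, x_noise])
beta, *_ = np.linalg.lstsq(A, y_syn, rcond=None)

residuals = y_syn - A @ beta
dof = n - A.shape[1]                     # degrees of freedom
sigma2 = residuals @ residuals / dof     # estimated error variance
cov = sigma2 * np.linalg.inv(A.T @ A)    # covariance of the estimates
std_err = np.sqrt(np.diag(cov))

t_stats = beta / std_err
p_values = 2 * stats.t.sf(np.abs(t_stats), dof)   # two-sided test

for name, b, p in zip(['intercept', 'x_signal', 'x_noise'], beta, p_values):
    print(f"{name}: coef={b:.3f}, p={p:.3g}")
```

A small p-value for x_signal and a typically much larger one for x_noise illustrate how the test separates informative features from uninformative ones.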

Step 8: Making a Prediction

Example Prediction Using the Model
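# Example prediction using the model.
# The model was trained on four features (Open, High, Low, Volume),
# so the input must hold exactly four values in that order. The leading
# value 1565 in the original input (apparently a row index) is dropped.
input_data = pd.DataFrame(
    [[3822.384766, 3901.908936, 3797.219238, 4770578575]],
    columns=['Open', 'High', 'Low', 'Volume'])
predicted_close_price = model.predict(input_data)
print("Predicted Closing Price:", predicted_close_price[0])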

Explanation:
• Prediction Example: An example input
(input_data) is used to predict the closing price
('Close') using the trained model
(model.predict()), providing a practical
application of the regression model for
forecasting.
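Because the model expects exactly the four training features ('Open', 'High', 'Low', 'Volume') in the training order, a small wrapper can guard against malformed inputs. The helper predict_close and the synthetic training data below are hypothetical and only serve to make the sketch runnable on its own.

```python
import numpy as np
import pandas as pd
from sklearn.linear_model import LinearRegression

FEATURES = ['Open', 'High', 'Low', 'Volume']

def predict_close(fitted_model, values):
    """Hypothetical helper: predict one 'Close' from one row of features."""
    if len(values) != len(FEATURES):
        raise ValueError(
            f"expected {len(FEATURES)} values {FEATURES}, got {len(values)}")
    # Named columns keep the input aligned with the training features.
    row = pd.DataFrame([values], columns=FEATURES)
    return float(fitted_model.predict(row)[0])

# Tiny synthetic training set so the sketch runs on its own.
rng = np.random.default_rng(3)
train = pd.DataFrame(rng.uniform(1, 10, size=(30, 4)), columns=FEATURES)
target = train[['Open', 'High', 'Low']].mean(axis=1)
demo_model = LinearRegression().fit(train, target)

print(predict_close(demo_model, [3.0, 6.0, 3.0, 5.0]))
```

Passing five values instead of four raises a ValueError with a descriptive message instead of failing inside scikit-learn.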

Real-World Application
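• Use Case Scenarios: Potential real-world
scenarios for this model include market
analysis and serving as one input to trading
strategies.
• Model Limitations: The features 'Open', 'High',
'Low', and 'Volume' come from the same day as
the target 'Close', so the model estimates a
close from same-day data rather than
forecasting a future price. The random
train/test split also ignores the time order of
the data. Features taken from earlier days and
a chronological split are potential areas for
improvement.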
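One limitation worth making concrete is that the chapter's features come from the same day as the target 'Close'. A possible improvement is to build features from the previous day only, so that a prediction uses information available in advance. The sketch below uses made-up prices, and the column names 'Prev_Close' and 'Prev_Volume' are illustrative.

```python
import pandas as pd

daily = pd.DataFrame({
    'Date': pd.date_range('2019-01-01', periods=5, freq='D'),
    'Close': [100.0, 102.0, 101.0, 105.0, 107.0],
    'Volume': [10.0, 12.0, 11.0, 15.0, 14.0],
})

# shift(1) moves each value down one row, so every row sees only the
# previous day's Close and Volume.
daily['Prev_Close'] = daily['Close'].shift(1)
daily['Prev_Volume'] = daily['Volume'].shift(1)

# The first row has no previous day and is dropped.
lagged = daily.dropna().reset_index(drop=True)

X_lagged = lagged[['Prev_Close', 'Prev_Volume']]
y_next = lagged['Close']
print(lagged)
```

X_lagged and y_next could then be passed to LinearRegression in place of the same-day features, ideally together with a chronological train/test split.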

Conclusion
By providing detailed explanations and
visualizations, this extended version helps in
understanding the process of predicting Bitcoin
prices using Linear Regression. The document
covers data inspection, preprocessing, visualization,
model training, evaluation, and practical
applications, offering a comprehensive guide for
anyone interested in applying Linear Regression for
financial forecasting.
