0% found this document useful (0 votes)

3 views

Decision Tree

The document outlines a Python script that uses a Decision Tree Classifier to predict rainy days based on historical rainfall data. It involves loading and preprocessing data, extracting relevant features, and splitting the dataset into training and test sets. The model is trained, evaluated for accuracy, and used to predict rainfall for a future date.

Uploaded by

popsthenu

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

3 views

Decision Tree

Uploaded by

popsthenu

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 2

Decision tree:

import pandas as pd

from sklearn.model_selection import train_test_split

from sklearn.tree import DecisionTreeClassifier

from sklearn.metrics import accuracy_score, classification_report

# Load and preprocess data

data = pd.read_csv('tn-rainfall.csv')

# Convert date column to datetime with explicit format or dayfirst

data['date'] = pd.to_datetime(data['date'], format='%d-%m-%Y')

# Extract features

data['Month'] = data['date'].dt.month

data['Year'] = data['date'].dt.year

# Create target variable based on threshold

threshold = 5

data['Rainy Day'] = (data['value'] >= threshold).astype(int)

# Prepare features (X) and target (y)

X = data[['Month', 'Year']]

y = data['Rainy Day']

# Split data into training and test sets

X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=42)

# Train Decision Tree model

model = DecisionTreeClassifier()

model.fit(X_train, y_train)
# Evaluate model

y_pred = model.predict(X_test)

accuracy = accuracy_score(y_test, y_pred)

report = classification_report(y_test, y_pred)

print('Accuracy:', accuracy)

print(report)

# Predict for a future date

future_date = pd.to_datetime('2025-02-03')

future_features = pd.DataFrame({'Month': [future_date.month], 'Year': [future_date.year]})

future_prediction = model.predict(future_features)

print('Predicted class for', future_date.date(), ':', future_prediction[0])

Supervised Learning
100% (1)
Supervised Learning
15 pages
Data analytics assignment solutions
No ratings yet
Data analytics assignment solutions
20 pages
import pandas as pd
No ratings yet
import pandas as pd
1 page
22b2195_E7_group5
No ratings yet
22b2195_E7_group5
4 pages
Random Forest
No ratings yet
Random Forest
2 pages
CATBOOST CODE
No ratings yet
CATBOOST CODE
2 pages
Forecast MQL Script
No ratings yet
Forecast MQL Script
1 page
Gas Price Analyzer
No ratings yet
Gas Price Analyzer
3 pages
Shap
No ratings yet
Shap
2 pages
Import Pandas As PD DF PD - Read - CSV ("Titanic - Train - CSV") DF - Head
No ratings yet
Import Pandas As PD DF PD - Read - CSV ("Titanic - Train - CSV") DF - Head
20 pages
DT_R
No ratings yet
DT_R
2 pages
ML Assignment 1 - Nageswar
No ratings yet
ML Assignment 1 - Nageswar
7 pages
Practical 6A & 6B
No ratings yet
Practical 6A & 6B
4 pages
ITERATORS
No ratings yet
ITERATORS
8 pages
ML Remaining
No ratings yet
ML Remaining
17 pages
Clp
No ratings yet
Clp
1 page
Ass1_SetB2
No ratings yet
Ass1_SetB2
1 page
liner regression chapter N1
No ratings yet
liner regression chapter N1
1 page
Expt 5
No ratings yet
Expt 5
1 page
DWM Exp 8
No ratings yet
DWM Exp 8
2 pages
Logistic Regression
No ratings yet
Logistic Regression
2 pages
AI ML - Cycle 2 Programs (1)
No ratings yet
AI ML - Cycle 2 Programs (1)
15 pages
Task1
No ratings yet
Task1
5 pages
DA Practicle Answers Easyw
No ratings yet
DA Practicle Answers Easyw
30 pages
3 Classification
No ratings yet
3 Classification
16 pages
Project
No ratings yet
Project
10 pages
ML Manual Final
No ratings yet
ML Manual Final
35 pages
Decision Tree
No ratings yet
Decision Tree
6 pages
ML Internal questions
No ratings yet
ML Internal questions
15 pages
7a
No ratings yet
7a
2 pages
If With: February 26, 2024
No ratings yet
If With: February 26, 2024
7 pages
Decision Trees.
No ratings yet
Decision Trees.
1 page
Data Analytics Program
No ratings yet
Data Analytics Program
11 pages
distfit
No ratings yet
distfit
2 pages
NTFX Price Prediction
No ratings yet
NTFX Price Prediction
5 pages
Machine NB + Lda Second Try
No ratings yet
Machine NB + Lda Second Try
5 pages
DMT Cia2
No ratings yet
DMT Cia2
11 pages
clp2
No ratings yet
clp2
1 page
PCA&CNN AD_LAB Code
No ratings yet
PCA&CNN AD_LAB Code
4 pages
21b-200-SE_LW04
No ratings yet
21b-200-SE_LW04
4 pages
LR
No ratings yet
LR
2 pages
Yall
No ratings yet
Yall
2 pages
Decision Tree
No ratings yet
Decision Tree
3 pages
aam p-4 to 6
No ratings yet
aam p-4 to 6
6 pages
ml exp-5,6 (1)[1] (1)
No ratings yet
ml exp-5,6 (1)[1] (1)
6 pages
Aiml Ex 5
No ratings yet
Aiml Ex 5
3 pages
DA_012307
No ratings yet
DA_012307
8 pages
Logistic Regression
No ratings yet
Logistic Regression
2 pages
MI_PR_5
No ratings yet
MI_PR_5
4 pages
save dist figures
No ratings yet
save dist figures
2 pages
tutorial-time-series-forecasting-with-xgboost
No ratings yet
tutorial-time-series-forecasting-with-xgboost
5 pages
allcodesml2
No ratings yet
allcodesml2
10 pages
23BCE7092_ML_Lab_Assignment[1]
No ratings yet
23BCE7092_ML_Lab_Assignment[1]
14 pages
7
No ratings yet
7
2 pages
Import Pandas As PD
No ratings yet
Import Pandas As PD
21 pages
Importing the Necessary Libraries
No ratings yet
Importing the Necessary Libraries
3 pages
ML MANUAL WITH OUTPUTS (2)
No ratings yet
ML MANUAL WITH OUTPUTS (2)
30 pages
New Chat: 1. Predicting Uber Ride Prices
No ratings yet
New Chat: 1. Predicting Uber Ride Prices
16 pages
Lecture Material 3
No ratings yet
Lecture Material 3
7 pages
Microsoft Certified: Power BI Data Analyst Associate PL 300 Practice Tests
From Everand
Microsoft Certified: Power BI Data Analyst Associate PL 300 Practice Tests
CertSquad Professional Trainers
No ratings yet

Decision Tree

Uploaded by

Decision Tree

Uploaded by

Decision tree:

from sklearn.model_selection import train_test_split

from sklearn.tree import DecisionTreeClassifier

from sklearn.metrics import accuracy_score, classification_report

# Load and preprocess data

# Convert date column to datetime with explicit format or dayfirst

data['date'] = pd.to_datetime(data['date'], format='%d-%m-%Y')

# Create target variable based on threshold

data['Rainy Day'] = (data['value'] >= threshold).astype(int)

# Prepare features (X) and target (y)

# Split data into training and test sets

X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=42)

# Train Decision Tree model

accuracy = accuracy_score(y_test, y_pred)

report = classification_report(y_test, y_pred)

# Predict for a future date

future_features = pd.DataFrame({'Month': [future_date.month], 'Year': [future_date.year]})

print('Predicted class for', future_date.date(), ':', future_prediction[0])

You might also like