Mock - Coding: Numpy NP CSV Sklearn - Linear - Model Pandas PD Matplotlib - Pyplot PLT Sklearn - Metrics

This document summarizes code for analyzing an insurance cost dataset using linear regression. It loads insurance data, cleans it by dropping null values and encoding categorical variables. It then splits the data into training and test sets, fits a linear regression model to predict costs from attributes in the training set, and evaluates the model by calculating the RMSE and R^2 on the test set.

Uploaded by

YTPUB001

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

33 views2 pages

Mock - Coding: Numpy NP CSV Sklearn - Linear - Model Pandas PD Matplotlib - Pyplot PLT Sklearn - Metrics

Uploaded by

YTPUB001

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 2

mock_coding

December 2, 2020

[340]: import numpy as np

import csv
from sklearn.linear_model import LinearRegression
import pandas as pd
import matplotlib.pyplot as plt
from sklearn.metrics import mean_squared_error

[341]: dataset = pd.read_csv('insurance-data.csv')

dataset.head()

[341]: age gender bmi childrens smoker region cost

0 19 female 27.900 0 yes southwest 16884.92400
1 18 male NaN 1 no southeast 1725.55230
2 28 male 33.000 3 no southeast 4449.46200
3 33 male 22.705 0 no northwest 21984.47061
4 32 male 28.880 0 no northwest 3866.85520

[342]: dataset.dropna(inplace=True)

[343]: dataset = pd.get_dummies(dataset, columns = [ 'gender', 'smoker', 'region'] )

[344]: dataset.head()

[344]: age bmi childrens cost gender_female gender_male smoker_no \

0 19 27.900 0 16884.92400 1 0 0
2 28 33.000 3 4449.46200 0 1 1
3 33 22.705 0 21984.47061 0 1 1
4 32 28.880 0 3866.85520 0 1 1
5 31 25.740 0 3756.62160 1 0 1

smoker_yes region_northeast region_northwest region_southeast \

0 1 0 0 0
2 0 0 0 1
3 0 0 1 0
4 0 0 1 0
5 0 0 0 1

1
region_southwest
0 1
2 0
3 0
4 0
5 0

[345]: dataset.dtypes

[345]: age int64

bmi float64
childrens int64
cost float64
gender_female uint8
gender_male uint8
smoker_no uint8
smoker_yes uint8
region_northeast uint8
region_northwest uint8
region_southeast uint8
region_southwest uint8
dtype: object

[346]: X = dataset.drop('cost', axis=1).values

y = dataset.loc[:,'cost'].values
print(X[:2])
print(y[:2])

[[19. 27.9 0. 1. 0. 0. 1. 0. 0. 0. 1. ]
[28. 33. 3. 0. 1. 1. 0. 0. 0. 1. 0. ]]
[16884.924 4449.462]

[347]: from sklearn.model_selection import train_test_split

X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

[348]: from sklearn.linear_model import LinearRegression

regressor = LinearRegression()
regressor.fit(X_train,y_train)

[348]: LinearRegression()

[349]: y_pred = regressor.predict(X_test)

print(f"RMSE: {mean_squared_error(y_test, y_pred, squared=False):.2f}")
print(f"R^2: {regressor.score(X_test, y_test):.2f}")

RMSE: 6239.55
R^2: 0.74

Key For Literacy Test
78% (9)
Key For Literacy Test
2 pages
Wheel Chair
No ratings yet
Wheel Chair
73 pages
SML Lab 1
No ratings yet
SML Lab 1
19 pages
Linear and Multilinear Regression
No ratings yet
Linear and Multilinear Regression
5 pages
Step 1
No ratings yet
Step 1
10 pages
Medical Cost Prediction
No ratings yet
Medical Cost Prediction
27 pages
Gaurav - Data Mining Lab Assignment
No ratings yet
Gaurav - Data Mining Lab Assignment
36 pages
Medical
No ratings yet
Medical
4 pages
4-10 Aiml
No ratings yet
4-10 Aiml
25 pages
RL - EX1.Ipynb - Colab
No ratings yet
RL - EX1.Ipynb - Colab
3 pages
Python Sklearn Linear Regression
No ratings yet
Python Sklearn Linear Regression
45 pages
Predicting Insurance Prices Using Machine Learning
No ratings yet
Predicting Insurance Prices Using Machine Learning
12 pages
Week 8 Lab - Linear Regression
No ratings yet
Week 8 Lab - Linear Regression
4 pages
Linear Regression: Data Exploration
No ratings yet
Linear Regression: Data Exploration
12 pages
Ass 1 Dsbda
No ratings yet
Ass 1 Dsbda
8 pages
FYMCA IDSLab A6 Submission
No ratings yet
FYMCA IDSLab A6 Submission
9 pages
Outlier Treatment - Jupyter Notebook
No ratings yet
Outlier Treatment - Jupyter Notebook
15 pages
Linear Regression in Scikit-Learn (Sklearn) - An Introduction - Datagy
No ratings yet
Linear Regression in Scikit-Learn (Sklearn) - An Introduction - Datagy
22 pages
02 B Regression Healthcare
No ratings yet
02 B Regression Healthcare
5 pages
02 B Regression Healthcare
No ratings yet
02 B Regression Healthcare
5 pages
4-R Code and PPT - Predicting Medical Expenses Using Linear Regression - New Without Prerequsit
No ratings yet
4-R Code and PPT - Predicting Medical Expenses Using Linear Regression - New Without Prerequsit
17 pages
Mi PR 5
No ratings yet
Mi PR 5
4 pages
Stroke Prediction Dataset
No ratings yet
Stroke Prediction Dataset
48 pages
Exploratory Data Analysis Main Concepts
No ratings yet
Exploratory Data Analysis Main Concepts
1 page
Group Work Assignment Supervised and Unsupervised Learning
No ratings yet
Group Work Assignment Supervised and Unsupervised Learning
10 pages
ML 7th and 10th Program
No ratings yet
ML 7th and 10th Program
8 pages
DSBDA2
No ratings yet
DSBDA2
6 pages
ML 6 7 8
No ratings yet
ML 6 7 8
10 pages
'Name-Piyush Tiwari''/n' 'Section - C'/N' 'Roll - No-2001610100142'
No ratings yet
'Name-Piyush Tiwari''/n' 'Section - C'/N' 'Roll - No-2001610100142'
28 pages
ExNo 08ml
No ratings yet
ExNo 08ml
4 pages
Heart Disease Diagnosis Using Machine Learning
No ratings yet
Heart Disease Diagnosis Using Machine Learning
26 pages
Pranav Mane Mini Project NSM
No ratings yet
Pranav Mane Mini Project NSM
7 pages
Openlab 1
No ratings yet
Openlab 1
17 pages
ML Lab Exp
No ratings yet
ML Lab Exp
7 pages
Data Science Fundamentals
No ratings yet
Data Science Fundamentals
22 pages
Capstone 1 Problem Statement
No ratings yet
Capstone 1 Problem Statement
18 pages
Diabetes Prediction - Logistic Regression - Jupyter Notebook
No ratings yet
Diabetes Prediction - Logistic Regression - Jupyter Notebook
4 pages
ML Manual Final
No ratings yet
ML Manual Final
35 pages
LAb Test 2
No ratings yet
LAb Test 2
4 pages
Logistic Regression vs. SVMs - Solution
No ratings yet
Logistic Regression vs. SVMs - Solution
7 pages
GUIDE User Manual 26.0 Department of Statistics Wisconsin-Madison
No ratings yet
GUIDE User Manual 26.0 Department of Statistics Wisconsin-Madison
247 pages
Turing Data Analysis
No ratings yet
Turing Data Analysis
30 pages
Rmprobit
No ratings yet
Rmprobit
8 pages
Medical Insurance Cost Prediction System: Dharesh Bahety EN18EL301057 Under The Guidance of Mr. Parag Ravekar Sir
0% (1)
Medical Insurance Cost Prediction System: Dharesh Bahety EN18EL301057 Under The Guidance of Mr. Parag Ravekar Sir
18 pages
Marginal Effects
No ratings yet
Marginal Effects
16 pages
DSBDA Practicals
No ratings yet
DSBDA Practicals
16 pages
Lab Manual - MachineLearningLaboratory-DR - Vaishnavi
No ratings yet
Lab Manual - MachineLearningLaboratory-DR - Vaishnavi
71 pages
Python Cod1
No ratings yet
Python Cod1
3 pages
Stroke Prediction
No ratings yet
Stroke Prediction
14 pages
Aiml Programs
No ratings yet
Aiml Programs
12 pages
Python 1
No ratings yet
Python 1
3 pages
Batch-2 Ieee DMT
No ratings yet
Batch-2 Ieee DMT
4 pages
Diabetic Prediction Using LogicalRegression
No ratings yet
Diabetic Prediction Using LogicalRegression
9 pages
Predict Health Insurance Cost by Using Machine Learning and DNN Regression Models
No ratings yet
Predict Health Insurance Cost by Using Machine Learning and DNN Regression Models
7 pages
Understanding The Data: Objective
No ratings yet
Understanding The Data: Objective
1 page
Stroke Prediction
No ratings yet
Stroke Prediction
10 pages
Medical Insurance Cost Prediction
No ratings yet
Medical Insurance Cost Prediction
16 pages
Import As From Import From Import From Import From Import From Import From Import From Import From Import From Import From Import Import As
No ratings yet
Import As From Import From Import From Import From Import From Import From Import From Import From Import From Import From Import Import As
8 pages
Brain Stroke Prediction Using ML - Jupyter Notebook
No ratings yet
Brain Stroke Prediction Using ML - Jupyter Notebook
17 pages
Medicial
No ratings yet
Medicial
13 pages
Abdul Qadir
No ratings yet
Abdul Qadir
17 pages
VLSI Interview Questions - 1
No ratings yet
VLSI Interview Questions - 1
9 pages
MAS III Review Question Prelim
No ratings yet
MAS III Review Question Prelim
17 pages
AC Motors Winding Diagram
81% (31)
AC Motors Winding Diagram
40 pages
Math 6 - Surface ARea
No ratings yet
Math 6 - Surface ARea
42 pages
Machining Solidcam
100% (1)
Machining Solidcam
18 pages
Experimental Failure Analysis of S-Polymer Gears
No ratings yet
Experimental Failure Analysis of S-Polymer Gears
8 pages
Origami Papiroflexia
No ratings yet
Origami Papiroflexia
6 pages
Table: Case - Static 1 - Load Assignments Case Loadtype Loadname Loadsf
No ratings yet
Table: Case - Static 1 - Load Assignments Case Loadtype Loadname Loadsf
23 pages
Machine Learning
No ratings yet
Machine Learning
5 pages
Demand Forecasting PDF
No ratings yet
Demand Forecasting PDF
18 pages
Lesson Plan #3 Scalar & Vector Quantity
No ratings yet
Lesson Plan #3 Scalar & Vector Quantity
3 pages
ArchE 1 - Statics of Rigid Bodies
No ratings yet
ArchE 1 - Statics of Rigid Bodies
146 pages
Chapter 5
No ratings yet
Chapter 5
11 pages
Analysis & Design of Algorithms Lab BHav
No ratings yet
Analysis & Design of Algorithms Lab BHav
7 pages
Exercises On Linear Algebra MI1036
No ratings yet
Exercises On Linear Algebra MI1036
12 pages
Question Paper - CAPS S-18 PAPER
No ratings yet
Question Paper - CAPS S-18 PAPER
4 pages
Aerial Robotics Lecture 3A - 2 3-D Quadrotor Control
No ratings yet
Aerial Robotics Lecture 3A - 2 3-D Quadrotor Control
5 pages
Coordinate Systems
100% (1)
Coordinate Systems
40 pages
Colour & Design - Unit 1
No ratings yet
Colour & Design - Unit 1
24 pages
Calculus and Linear Algebra
No ratings yet
Calculus and Linear Algebra
107 pages
Resolution Refutation
No ratings yet
Resolution Refutation
5 pages
10.1515 - Sspjce 2020 0009
No ratings yet
10.1515 - Sspjce 2020 0009
14 pages
CISSP Official Practice Tests Mike Chapple download
No ratings yet
CISSP Official Practice Tests Mike Chapple download
98 pages
Index and Log
No ratings yet
Index and Log
4 pages
Y 9 S 16 Ex
No ratings yet
Y 9 S 16 Ex
10 pages
Euler S Disk
No ratings yet
Euler S Disk
4 pages
WIMO Final Training Course S - NumberTheory
100% (2)
WIMO Final Training Course S - NumberTheory
26 pages
Aqa 73571 QP Nov20
No ratings yet
Aqa 73571 QP Nov20
32 pages