
CHE 212: Data Modelling and Analysis

Lecture No. 06

Table of Contents

1 Introduction

2 Interpolation
Linear Interpolation
Spline Interpolation
Other Types of Interpolation

3 Regression
Types of Regression
Goodness of Fit
Overfitting and Underfitting

4 Curve Fitting
Introduction
Curve Fitting Examples

Introduction

In this chapter, we study data modeling and analysis.
Topics include interpolation, regression, and curve fitting.
These techniques are essential for estimating unknown values from known data points.

Interpolation

Interpolation estimates unknown values between known data points.
E.g. estimating values from steam tables, pressure data, etc.
Methods include linear, polynomial, spline interpolation, and more.
Extrapolation refers to estimating values outside the known data range (see the sketch below).
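A minimal sketch of the distinction, using the same numpy.interp function as Example 1 below: by default np.interp does not extrapolate but clamps to the boundary values, and its optional left/right arguments control what is returned outside the known range.

import numpy as np

x = np.array([0.0, 1.0, 2.0])
y = np.array([0.0, 10.0, 20.0])

print(np.interp(1.5, x, y))                # 15.0 -- interpolation inside the range
print(np.interp(5.0, x, y))                # 20.0 -- outside the range, clamped to y[-1]
print(np.interp(5.0, x, y, right=np.nan))  # nan  -- flag extrapolation requests explicitly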

Interpolation

Figure: Interpolation vs Extrapolation

Linear Interpolation

Linear interpolation assumes a straight-line relationship between two points.
Formula:

$y = y_1 + \dfrac{y_2 - y_1}{x_2 - x_1}\,(x - x_1)$
Python’s numpy.interp() function performs linear interpolation.

Example 1: Linear Interpolation

Given data points:

x = [0, 2, 4, 6, 8, 10], y = [0, 4, 8, 16, 32, 64]

import numpy as np

x = np.array([0, 2, 4, 6, 8, 10])
y = np.array([0, 4, 8, 16, 32, 64])

x_new = 5
y_new = np.interp(x_new, x, y)

print(f"The estimated value at x = {x_new} is {y_new}.")

Example 1 (Continued): Interpolating at Multiple Values
x_new2 = [1, 3, 5, 7]
y_new2 = np.interp(x_new2, x, y)

for i in range(len(x_new2)):
    print(f"The estimated value at x = {x_new2[i]} is {y_new2[i]}.")

Figure: Linear interpolation at multiple values of x.

Spline Interpolation

Spline interpolation uses a series of polynomial functions to estimate intermediate values between data points.
Cubic splines are commonly used due to their smoothness, offering continuous first and second derivatives.
Unlike linear interpolation, cubic splines provide smooth curves, making them ideal for more complex or curved data relationships.
Ensures continuity of first and second derivatives at each data point.
Minimizes overall curvature across the dataset.

Example 2: Spline Interpolation

Given data points for heat capacity Cp of a substance at different temperatures:

Temperature (°C): [100, 125, 175, 200, 225, 275, 300, 400, 500]
Heat Capacity (J/mol·K): [25.15, 25.9, 26.7, 26.9, 27.3, 28.5, 29.1, 32, 37.0]

Estimate the heat capacity at intermediate temperatures T = [150, 250, 350, 450].
import numpy as np
from scipy.interpolate import CubicSpline

temperature = np.array([100, 125, 175, 200, 225, 275, 300, 400, 500])
heat_capacity = np.array([25.15, 25.9, 26.7, 26.9, 27.3, 28.5, 29.1, 32, 37.0])

cs = CubicSpline(temperature, heat_capacity)
temp_new = np.array([150, 250, 350, 450])
cp_interpolated = cs(temp_new)

for i in range(len(temp_new)):
    print(f"The estimated Cp at T = {temp_new[i]} °C is {cp_interpolated[i]:.2f} J/mol·K.")

Example 2: Plot

Figure: Spline interpolation for heat capacity at different temperatures.


Other Types of Interpolation

Polynomial Interpolation: Uses a single polynomial to pass through all data points; suitable for small datasets but prone to oscillation (Runge's phenomenon).
Nearest-Neighbor Interpolation: Assigns the value of the nearest data point to the interpolated point; simple, but leads to discontinuities.
Barycentric Interpolation: A stable form of polynomial interpolation, preferred for large datasets due to its numerical stability. A short sketch of all three follows.
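As a rough sketch of these alternatives, reusing the data from Example 1 (this assumes scipy is available; interp1d and BarycentricInterpolator are the scipy.interpolate tools used here):

import numpy as np
from scipy.interpolate import interp1d, BarycentricInterpolator

x = np.array([0, 2, 4, 6, 8, 10], dtype=float)
y = np.array([0, 4, 8, 16, 32, 64], dtype=float)

# Polynomial interpolation: one degree-5 polynomial through all six points
poly = np.polyfit(x, y, len(x) - 1)
print(np.polyval(poly, 5.0))

# Nearest-neighbor: piecewise constant, discontinuous between points
nearest = interp1d(x, y, kind='nearest')
print(nearest(5.0))

# Barycentric: numerically stable polynomial interpolation
bary = BarycentricInterpolator(x, y)
print(bary(5.0))

The polynomial and barycentric interpolants agree (they represent the same polynomial); barycentric evaluation is simply the better-conditioned way to compute it as the number of points grows.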

Introduction to Regression

Regression is a statistical technique to identify relationships between dependent and independent variables.
It models the conditional mean of the response variable given certain predictors.
Unlike interpolation, regression assumes noise in the data and provides a model that approximates the relationship between variables.

Regression vs Interpolation

Interpolation: Estimates unknown values between known data points; assumes no noise.
Regression: Models the entire dataset, assuming noise, and finds relationships between variables. A minimal contrast is sketched below.
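A minimal sketch of the contrast, with made-up noisy data (roughly y = 2x): the interpolant reproduces every observed point exactly, while the regression line smooths through the noise.

import numpy as np

x = np.array([0.0, 1.0, 2.0, 3.0, 4.0])
y = np.array([0.1, 1.9, 4.2, 5.8, 8.1])  # noisy samples of y ≈ 2x

# Interpolation passes through the observed point exactly
print(np.interp(2.0, x, y))  # 4.2, the noisy measurement itself

# Regression returns the fitted trend, not the noisy observation
m, c = np.polyfit(x, y, 1)
print(m * 2.0 + c)           # ≈ 4.0, the underlying linear trend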

Types of Regression

Linear Regression: Models a linear relationship between dependent and independent variables.
Polynomial Regression: Fits a polynomial of degree n to the data.
Multiple Linear Regression: Models the relationship between one dependent variable and multiple independent variables.

Linear Regression

A linear model is given by:

y = mx + c

where:
m is the slope (rate of change of y with respect to x).
c is the y-intercept (value of y when x = 0).
The goal is to find m and c that minimize the difference between predicted and actual values.
Methods: least squares, numpy.polyfit(), scikit-learn, statsmodels (a scikit-learn sketch follows).
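As a minimal sketch of the scikit-learn route, using the reaction-rate data from Example 3 below (note that scikit-learn expects a 2-D feature array):

import numpy as np
from sklearn.linear_model import LinearRegression

x = np.array([0, 20, 40, 60, 80, 100]).reshape(-1, 1)  # 2-D feature array
y = np.array([0.5, 2.5, 4.8, 7.0, 9.8, 12.5])

model = LinearRegression().fit(x, y)
print(f"Slope m = {model.coef_[0]:.2f}, intercept c = {model.intercept_:.2f}")

This recovers the same m and c as numpy.polyfit with degree 1, since both solve the same least-squares problem.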

Polynomial Regression

Extends linear regression by fitting a polynomial of degree n:

$y = a_n x^n + a_{n-1} x^{n-1} + \cdots + a_1 x + a_0$

Useful when data shows curvature or a non-linear relationship.
Methods: numpy.polyfit(), scikit-learn, statsmodels.

Multiple Linear Regression

Models the relationship between multiple independent variables and one dependent variable:

$y = m_1 x_1 + m_2 x_2 + \cdots + m_n x_n + b$

Useful when several variables affect the outcome (a least-squares sketch follows).
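No worked example is given for this case, so here is a minimal sketch with made-up process data (a hypothetical yield against temperature and pressure), solved by ordinary least squares via numpy.linalg.lstsq:

import numpy as np

# Hypothetical data: yield as a function of temperature and pressure
x1 = np.array([300.0, 320.0, 340.0, 360.0, 380.0])  # temperature (K)
x2 = np.array([1.0, 2.0, 1.5, 3.0, 2.5])            # pressure (bar)
y  = np.array([10.2, 12.5, 13.1, 16.4, 16.8])       # yield (%)

# Design matrix [x1, x2, 1]; the last coefficient is the intercept b
X = np.column_stack([x1, x2, np.ones_like(x1)])
(m1, m2, b), *_ = np.linalg.lstsq(X, y, rcond=None)
print(f"y = {m1:.3f}*x1 + {m2:.3f}*x2 + {b:.3f}")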

Types of Regression Models

Figure: Different types of regression models: (left) linear regression, (middle) polynomial regression, (right) multiple linear regression.

Example 3: Linear Regression

Given data for the relationship between temperature and reaction rate:

Temperature (°C): [0, 20, 40, 60, 80, 100]
Reaction Rate (mol/s): [0.5, 2.5, 4.8, 7.0, 9.8, 12.5]

Perform linear regression to model this relationship.
import numpy as np
import matplotlib.pyplot as plt

temperature = np.array([0, 20, 40, 60, 80, 100])
reaction_rate = np.array([0.5, 2.5, 4.8, 7.0, 9.8, 12.5])

# Perform linear regression (degree 1)
coefficients = np.polyfit(temperature, reaction_rate, 1)
slope, intercept = coefficients
print(f"Slope: {slope:.2f}, Intercept: {intercept:.2f}")

Example 3: Plotting Linear Fit
# Generate fitted values using the linear model
reaction_rate_fitted = np.polyval(coefficients, temperature)

# Plot the data and regression line
plt.figure(dpi=150)
plt.plot(temperature, reaction_rate, 'ro', label='Data')
plt.plot(temperature, reaction_rate_fitted, 'b-',
         label=f'Linear Fit: y = {slope:.2f}x + {intercept:.2f}')
plt.xlabel('Temperature (°C)')
plt.ylabel('Reaction Rate (mol/s)')
plt.legend()
plt.show()

Example 4: Polynomial Regression
We extend the linear regression example by fitting a 2nd-degree
polynomial.
# Perform quadratic regression (degree 2)
coefficients = np.polyfit(temperature, reaction_rate, 2)
a, b, c = coefficients
print(f"Quadratic Coefficients: a={a:.2f}, b={b:.2f}, c={c:.2f}")

# Generate fitted values using the quadratic model
reaction_rate_fitted = np.polyval(coefficients, temperature)

# Plot the data and regression curve (same plotting commands as in Example 3)

Evaluating Goodness of Fit

To determine how well the model fits the data, we use metrics such as:
Residuals Analysis: Measures differences between observed and
predicted values.
R-squared (R²): Proportion of variance in the dependent variable
explained by the model.
Mean Squared Error (MSE): Average squared difference between
observed and predicted values.

Python Code to Calculate R² and MSE

from sklearn.metrics import r2_score, mean_squared_error

# Linear regression (degree 1)
coef_lin = np.polyfit(temperature, reaction_rate, 1)
rate_lin_fit = np.polyval(coef_lin, temperature)

# Quadratic regression (degree 2)
coef_quad = np.polyfit(temperature, reaction_rate, 2)
rate_quad_fit = np.polyval(coef_quad, temperature)

# Calculate R-squared and MSE for both models
r2_lin = r2_score(reaction_rate, rate_lin_fit)
r2_quad = r2_score(reaction_rate, rate_quad_fit)
mse_lin = mean_squared_error(reaction_rate, rate_lin_fit)
mse_quad = mean_squared_error(reaction_rate, rate_quad_fit)

print(f"Linear Regression - R²: {r2_lin:.4f}, MSE: {mse_lin:.4f}")
print(f"Quadratic Regression - R²: {r2_quad:.4f}, MSE: {mse_quad:.4f}")

Goodness of Fit Results

Linear Regression: R² = 0.97, MSE = 0.49
Quadratic Regression: R² = 0.99, MSE = 0.09
Quadratic regression gives the better fit, with a higher R² and lower MSE.

Overfitting and Underfitting

Underfitting: Occurs when the model is too simple to capture underlying patterns in the data.
Overfitting: Happens when the model is too complex, fitting noise as well as the trend.
Use metrics like R-squared and MSE to evaluate the fit.
Start with a lower-degree polynomial and increase only as needed, as in the sketch below.
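A minimal sketch of that advice, reusing the reaction-rate data from Examples 3 and 4. Training MSE always falls as the degree rises, so a small, plateauing improvement (rather than MSE alone) is the signal to stop.

import numpy as np
from sklearn.metrics import mean_squared_error

temperature = np.array([0, 20, 40, 60, 80, 100])
reaction_rate = np.array([0.5, 2.5, 4.8, 7.0, 9.8, 12.5])

for degree in range(1, 6):
    coeffs = np.polyfit(temperature, reaction_rate, degree)
    fitted = np.polyval(coeffs, temperature)
    print(f"degree {degree}: MSE = {mean_squared_error(reaction_rate, fitted):.4f}")

# A degree-5 polynomial passes through all six points (MSE ≈ 0),
# but it is fitting noise, not the trend -- a classic overfit.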

Introduction to Curve Fitting

Curve fitting is used to find a curve (often non-linear) that best describes a dataset.
It involves using more complex functions (e.g., exponential, logarithmic) to model the data.
Curve fitting is used to precisely describe the shape of the data, while regression focuses on creating predictive models.

Steps for Curve Fitting in Python

1 Import necessary libraries like numpy, matplotlib, and scipy.optimize.curve_fit.
2 Define the function representing the model (e.g., exponential, logarithmic).
3 Define independent (x) and dependent (y) variables for the dataset.
4 Use curve_fit() to fit the model to the data.
5 Retrieve optimized parameters from the fitting process.
6 Plot the original data and fitted curve for visualization.
7 Evaluate the fit using metrics like R² or MSE.

Example 5: Curve Fitting (Arrhenius Equation)

In a chemical reaction, the reaction rate follows the Arrhenius equation:

$r(T) = A \cdot \exp\left(-\dfrac{E}{R \cdot T}\right)$

where:
r(T): reaction rate at temperature T,
A: pre-exponential factor,
E: activation energy,
R = 8.314 J/(mol·K) is the universal gas constant.
The goal is to fit the Arrhenius equation to the data and determine A and E.

Example 5: Plotting Arrhenius Fit

import numpy as np
import matplotlib.pyplot as plt
from scipy.optimize import curve_fit

T = np.array([300, 350, 400, 450, 500, 550])
rate = np.array([0.0025, 0.0048, 0.0075, 0.0112, 0.0164, 0.0235])
R = 8.314

def arrhenius(T, A, E):
    return A * np.exp(-E / (R * T))

popt, pcov = curve_fit(arrhenius, T, rate)
A, E = popt
print(f"Fitted A: {A:.4e} mol/s, Fitted E: {E:.2f} J/mol")

# Generate fitted curve
T_fit = np.linspace(min(T), max(T), 100)
rate_fit = arrhenius(T_fit, *popt)

# Plot data and fitted curve
plt.scatter(T, rate, color='red', label='Experimental Data')
plt.plot(T_fit, rate_fit, label='Fitted Arrhenius Model', color='blue')
plt.xlabel('Temperature (K)')
plt.ylabel('Reaction Rate (mol/s)')
plt.title('Arrhenius Model Fitting')

Figure: Comparison of experimental data and Arrhenius fit.

Improving Curve Fitting

Initial Guesses: Providing good initial guesses for parameters can significantly improve fit accuracy and prevent errors, especially in complex models like exponential or Gaussian.
Initial guesses help the fitting process converge faster and lead to a better fit.
Example: In Gaussian peak fitting, appropriate initial guesses for amplitude, center, and width can improve the fit.

Example 6: Gaussian Curve Fitting

In a chemical engineering process, the absorption of a compound is measured at different wavelengths. The absorption forms a peak, which can be modeled using a Gaussian function:

$A(\lambda) = a \cdot \exp\left(-\dfrac{(\lambda - \lambda_0)^2}{2\sigma^2}\right)$

where:
A(λ) is the absorption at wavelength λ,
a is the peak amplitude (maximum absorption),
λ0 is the center of the peak,
σ is the standard deviation (related to the width of the peak).
The goal is to fit the Gaussian model to the experimental data and determine a, λ0, and σ.

Experimental Data and Initial Guess

Given data:

Wavelengths (nm): [400, 420, 440, 460, 480, 500, 520, 540, 560, 580, 600]
Absorption (AU): [0.12, 0.31, 0.82, 1.50, 1.85, 1.92, 1.67, 1.10, 0.65, 0.30, 0.12]

We will use an initial guess for the Gaussian parameters:
import numpy as np
import matplotlib.pyplot as plt
from scipy.optimize import curve_fit

# Data: wavelength (nm) and absorption (AU)
wavelengths = np.array([400, 420, 440, 460, 480, 500, 520, 540, 560, 580, 600])
absorption = np.array([0.12, 0.31, 0.82, 1.50, 1.85, 1.92, 1.67, 1.10, 0.65, 0.30, 0.12])

# Gaussian model function
def gaussian(x, a, x0, sigma):
    return a * np.exp(-(x - x0)**2 / (2 * sigma**2))

# Initial guess for the parameters [a, x0, sigma]
initial_guess = [2, 500, 30]

Fitting the Gaussian Model

Perform curve fitting using the initial guess:

# Perform curve fitting to find the best-fit a, x0, sigma
popt, pcov = curve_fit(gaussian, wavelengths, absorption, p0=initial_guess)

# Extract the optimized parameters
a, x0, sigma = popt

print(f"Fitted peak amplitude a: {a:.2f} AU")
print(f"Fitted center wavelength λ0: {x0:.2f} nm")
print(f"Fitted standard deviation σ: {sigma:.2f} nm")

Output:

Fitted peak amplitude a: 1.98 AU
Fitted center wavelength λ0: 495.29 nm
Fitted standard deviation σ: 42.22 nm

Plotting the Gaussian Fit
# Generate fitted curve using the optimized parameters
wavelengths_fit = np.linspace(min(wavelengths), max(wavelengths), 100)
absorption_fit = gaussian(wavelengths_fit, *popt)

# Plot the original data and the fitted Gaussian curve
plt.scatter(wavelengths, absorption, color='red', label='Experimental Data')
plt.plot(wavelengths_fit, absorption_fit, label='Fitted Gaussian Model', color='blue')

Figure: Comparison of experimental data and Gaussian fit.


Fitting Without Initial Guess

The curve_fit() function can also work without an initial guess, but providing a good one speeds up the optimization process and ensures accurate fitting, especially for non-linear models, as the sketch below illustrates.
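A minimal sketch, reusing gaussian, wavelengths, and absorption from Example 6. When p0 is omitted, curve_fit starts every parameter at 1.0; for this Gaussian that means a unit-height peak centred at 1 nm, far from the data, so the fit may converge slowly, land in a poor local minimum, or raise a RuntimeError.

# No initial guess: every parameter starts at 1.0
try:
    popt_default, _ = curve_fit(gaussian, wavelengths, absorption)
    print("no guess  :", popt_default)
except RuntimeError as err:
    print("no guess  : failed to converge:", err)

# With a rough guess read off the plot, convergence is fast and reliable
popt_guided, _ = curve_fit(gaussian, wavelengths, absorption, p0=[2, 500, 30])
print("with guess:", popt_guided)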

Thank You!
Any questions?
