0% found this document useful (0 votes)

111 views9 pages

Ex. No: 1 Exploring The Features of Numpy, Scipy, Jupyter, Statsmodels and Pandas Date: 07/08/2024

DATA ANALYTICS 1

Uploaded by

robertdowneyrdj708

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

111 views9 pages

Ex. No: 1 Exploring The Features of Numpy, Scipy, Jupyter, Statsmodels and Pandas Date: 07/08/2024

DATA ANALYTICS 1

Uploaded by

robertdowneyrdj708

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 9

EX.

NO: 1 Exploring the features of NumPy, SciPy, Jupyter, Statsmodels and Pandas

DATE: 07/08/2024

AIM:

To download, install and explore the features of NumPy, SciPy, Jupyter, Statsmodels and
Pandas packages and to read data from text file, excel and the web.

PACKAGES :

Pandas: A powerful Python library used for data manipulation and analysis, providing data
structures like DataFrames and Series, which are excellent for handling structured data with
rows and columns. NumPy: A foundational Python library for numerical computing that
provides support for large, multidimensional arrays and matrices, along with a collection of
mathematical functions to perform operations on these arrays.

SciPy: An open-source Python library built on NumPy, used for scientific and technical
computing. It includes modules for optimization, integration, interpolation, eigenvalue
problems, and more.

Statsmodels: A Python module that provides classes and functions for statistical modeling,
including linear regression, time series analysis, and hypothesis testing, making it useful for
statistical data analysis.

Altair: A declarative statistical visualization library for Python, built on Vega and Vega-Lite,
that allows users to create interactive, concise, and high-level charts with minimal code,
making it ideal for exploratory data analysis.

Matplotlib: A widely-used Python library for creating static, interactive, and animated
visualizations in 2D and 3D. It offers extensive customization options for plots, making it
suitable for creating a wide range of charts like line plots, scatter plots, histograms, and more.

CODE:

import pandas as pd
import numpy as np

1
height = [1.87, 1.87, 1.82, 1.91, 1.90, 1.85]
weight = [81.65, 97.52, 95.25, 92.98, 86.18, 88.45]

np_height = np.array(height)
np_weight = np.array(weight)
print(height)
print(np_height)
print(type(height))
print(type(np_height))

orgarr=np_height.tolist()
print(orgarr)
print(type(orgarr))

OUTPUT:

[1.87, 1.87, 1.82, 1.91, 1.9, 1.85]

[1.87 1.87 1.82 1.91 1.9 1.85]
<class 'list'>
<class 'numpy.ndarray'>
[1.87, 1.87, 1.82, 1.91, 1.9, 1.85]
<class 'list'>

INFERENCE:
The difference is like we can do direct math operations in numpy array

CODE:

arr=np.array([1,2,3,4,5])
print(arr)
x=arr.copy()
arr[0]=42
print(x)
print(arr)

OUTPUT:

[1 2 3 4 5]
[1 2 3 4 5]
[42 2 3 4 5]

2
CODE:

sample = np.array([32,54,23,78,65,12,32,39,56])
filter_arr = sample > 50
new_arr = sample[filter_arr]
print(new_arr)

OUTPUT:

[54 78 65 56]

CODE:

a = np.array([[1,2],[3,4]])
for x in np.nditer(arr):
print(x)

OUTPUT:

42
2
3
4
5

CODE:

a = np.array([[4,5,2],[3,8,8],[12,32,6]])
b = np.array([[3,4,5],[6,5,2],[1,4,3]])
print(a+b)
print()
print(a-b)

OUTPUT:

[[ 7 9 7]
[ 9 13 10]
[13 36 9]]

[[ 1 1 -3]

3
[-3 3 6]
[11 28 3]]

CODE:

arr = np.array([1, 2, 3, 4, 5])

x = arr.view()
arr[0] = 42
print(arr)
print(x)

OUTPUT:

[42 2 3 4 5]
[42 2 3 4 5]

CODE:

arr = np.array([1, 2, 3, 4, 5])

x = arr.copy()
arr[1] = 40
print(arr)
print(x)

OUTPUT:
[ 1 40 3 4 5] [1 2 3 4 5]

INFERENCE:
COPY MAKES A DEEPCOPY WHERE VIEW IS LINKED WITH THE ORIGINAL
PLACE

PANDAS

CODE:

import pandas as pd
a = [1, 7, 2]
myvar = pd.Series(a)
print(myvar)

4
OUTPUT:

0 1
1 7
2 2

CODE:

df=pd.read_csv("W2023.csv")
df.head()

OUTPUT:

CODE:

sns.histplot(df['Birth Rate'], bins=10, kde=True)

plt.title('Distribution of Birth Rate')
plt.xlabel('Birth Rate')
plt.ylabel('Frequency')

OUTPUT:

5
CODE:

plt.figure(figsize=(10, 6))
sns.scatterplot(x='GDP', y='Co2-Emissions', data=df, hue='Country', s=100)
plt.title('GDP vs CO2 Emissions by Country')
plt.xlabel('GDP')
plt.ylabel('CO2 Emissions')
plt.legend(bbox_to_anchor=(1.05, 1), loc='upper left')

OUTPUT:

CODE:

plt.figure(figsize=(10, 6))
sns.scatterplot(x='Birth Rate', y='Fertility Rate', data=df, hue='Country', s=100)
plt.title('Birth Rate vs Fertility Rate by Country')
plt.xlabel('Birth Rate')
plt.ylabel('Fertility Rate')
plt.legend(bbox_to_anchor=(1.05, 1), loc='upper left')
plt.show()

6
OUTPUT:

CODE:

top5_co2 = df.nlargest(5, 'Co2-Emissions')

plt.figure(figsize=(8, 8))
plt.pie(top5_co2['Co2-Emissions'], labels=top5_co2['Country'], autopct='%1.1f%%',
startangle=90, colors=sns.color_palette('Set2'))
plt.title('Top 5 Countries by CO2 Emissions')
plt.show()

OUTPUT:

7
CODE:

import scipy.stats as stats

a = np.arange(10)
stats.describe(a)

OUTPUT:

DescribeResult(nobs=10, minmax=(0, 9), mean=4.5, variance=9.166666666666668,

skewness=0.0, kurtosis=-1.2242424242424244)

CODE:

import statsmodels.api as sm

# Create a sample dataset

np.random.seed(0)
X = np.random.rand(100) * 10
Y = 3 * X + np.random.randn(100) * 2

df = pd.DataFrame({'X': X, 'Y': Y})

print("Sample DataFrame:")
print(df.head())

# Linear regression can be performed using statsmodels package

X = sm.add_constant(df['X'])

model = sm.OLS(df['Y'], X)
results = model.fit()

print("\nRegression Results:")
print(results.summary())

8
OUTPUT:

RESULT:

Packages are explored and basic operations are performed on a sample dataset
successfully.

BNN Bootcamp 5 (Combination of Planets Part-3)
100% (3)
BNN Bootcamp 5 (Combination of Planets Part-3)
63 pages
V7 Adobe Acrobat Pro DC 2018 (11 - 04-11 - 10) (11 - 25)
50% (2)
V7 Adobe Acrobat Pro DC 2018 (11 - 04-11 - 10) (11 - 25)
6 pages
Organizational Change Management
100% (5)
Organizational Change Management
107 pages
Unit 5
No ratings yet
Unit 5
27 pages
Batch2 FDS Printout
No ratings yet
Batch2 FDS Printout
38 pages
Fds Record
No ratings yet
Fds Record
69 pages
CS3361 Data Science Lab Manual
No ratings yet
CS3361 Data Science Lab Manual
43 pages
EXP1-siddhant Gupta (23 - SE - 148)
No ratings yet
EXP1-siddhant Gupta (23 - SE - 148)
17 pages
Final Fds Manual
No ratings yet
Final Fds Manual
77 pages
Python Abstract
No ratings yet
Python Abstract
7 pages
Cs3361-Data Science Lab Manual
No ratings yet
Cs3361-Data Science Lab Manual
44 pages
FINAL FDS MANUAL Print
No ratings yet
FINAL FDS MANUAL Print
55 pages
BDA File
No ratings yet
BDA File
26 pages
DSF Lab Exp Full
No ratings yet
DSF Lab Exp Full
88 pages
Final Fds Manual Print
No ratings yet
Final Fds Manual Print
55 pages
FDS Lab Manual (Print)
No ratings yet
FDS Lab Manual (Print)
43 pages
Lab Manual Fds
No ratings yet
Lab Manual Fds
44 pages
Final Dev Record
No ratings yet
Final Dev Record
49 pages
DV Lab2 Updated
No ratings yet
DV Lab2 Updated
12 pages
CS3362 Data Science Laboratory Manual 2022-23
No ratings yet
CS3362 Data Science Laboratory Manual 2022-23
54 pages
Data Science Lab Manual
No ratings yet
Data Science Lab Manual
42 pages
Unit 4
No ratings yet
Unit 4
105 pages
Fds Lab Record
No ratings yet
Fds Lab Record
84 pages
Data Science Lab Manual Full
No ratings yet
Data Science Lab Manual Full
47 pages
Unit 5
No ratings yet
Unit 5
28 pages
Pythonlibraries
No ratings yet
Pythonlibraries
20 pages
EX - No: 1 Date:: Download Install Explore The Features of Numpy, Scipy, Jupiter, Statsmodels and Pandas Packages
No ratings yet
EX - No: 1 Date:: Download Install Explore The Features of Numpy, Scipy, Jupiter, Statsmodels and Pandas Packages
38 pages
Fdsa Lab Manual Final
No ratings yet
Fdsa Lab Manual Final
70 pages
Fundamentals of Data Science Lab Manual
No ratings yet
Fundamentals of Data Science Lab Manual
34 pages
ML File Updated
No ratings yet
ML File Updated
60 pages
NumPy and Pandas
No ratings yet
NumPy and Pandas
72 pages
De&v Lab Manual
No ratings yet
De&v Lab Manual
91 pages
Unit 5 PythonPackages (Matplotlib)
No ratings yet
Unit 5 PythonPackages (Matplotlib)
24 pages
Scipy, Matplotlib, Pandas
No ratings yet
Scipy, Matplotlib, Pandas
16 pages
Dav Lab
No ratings yet
Dav Lab
8 pages
Data Analysis Lab - Final - 23-24
No ratings yet
Data Analysis Lab - Final - 23-24
11 pages
Graphs Using Matplotlib
No ratings yet
Graphs Using Matplotlib
23 pages
FDS Record Last
No ratings yet
FDS Record Last
61 pages
Data Analysis and Visulaization Experiment
No ratings yet
Data Analysis and Visulaization Experiment
104 pages
NumPy, Pandas, MatplotLib, Seaborn, ScikitLearn (SkLearn)
No ratings yet
NumPy, Pandas, MatplotLib, Seaborn, ScikitLearn (SkLearn)
14 pages
PR Final File
No ratings yet
PR Final File
70 pages
DAV EXP 1 t12 31
No ratings yet
DAV EXP 1 t12 31
39 pages
Fundamentals of Data Science Students
No ratings yet
Fundamentals of Data Science Students
52 pages
Machine Learning Experiment
No ratings yet
Machine Learning Experiment
69 pages
Dev Lab Manual Org
No ratings yet
Dev Lab Manual Org
28 pages
Python Libraries
No ratings yet
Python Libraries
27 pages
FDS Lab Meterial CS3361
No ratings yet
FDS Lab Meterial CS3361
30 pages
Fds Lab Manual
No ratings yet
Fds Lab Manual
24 pages
Data Analysis and Visualisation With Python
No ratings yet
Data Analysis and Visualisation With Python
75 pages
Pandas
No ratings yet
Pandas
25 pages
Q-Step WS 06112019 Data Analysis and Visualisation With Python
No ratings yet
Q-Step WS 06112019 Data Analysis and Visualisation With Python
76 pages
Machine Learning Lab File: Submitted To: Submitted by
No ratings yet
Machine Learning Lab File: Submitted To: Submitted by
9 pages
Fds Merged
No ratings yet
Fds Merged
102 pages
FDS Lab Manual
No ratings yet
FDS Lab Manual
62 pages
PP&DS Unit Iii
No ratings yet
PP&DS Unit Iii
26 pages
3rd Semester DDM AI DAA DEV Print Pages For Spiral Record 25-1-24 - Removed
No ratings yet
3rd Semester DDM AI DAA DEV Print Pages For Spiral Record 25-1-24 - Removed
28 pages
Python Libraries Explained
No ratings yet
Python Libraries Explained
10 pages
Numpy Data Analysis and Visualisation With Python
No ratings yet
Numpy Data Analysis and Visualisation With Python
75 pages
3-Numpy Pandas
No ratings yet
3-Numpy Pandas
37 pages
l9 Scientific Python Proc
No ratings yet
l9 Scientific Python Proc
30 pages
3 - Pandas
No ratings yet
3 - Pandas
87 pages
Profound Python Data Science
From Everand
Profound Python Data Science
Onder Teker
No ratings yet
Python For Beginners
From Everand
Python For Beginners
Célio Azevedo
No ratings yet
Age of Empires Rise of Rome
No ratings yet
Age of Empires Rise of Rome
35 pages
Geol 194 Syllabus Revised
No ratings yet
Geol 194 Syllabus Revised
4 pages
FCE Sample Use of English 1, Twins, Edinburugh, Languages
No ratings yet
FCE Sample Use of English 1, Twins, Edinburugh, Languages
6 pages
A First Book Nature UK Part4
100% (1)
A First Book Nature UK Part4
13 pages
Avoid News Part1 TEXT PDF
No ratings yet
Avoid News Part1 TEXT PDF
11 pages
The Gomti Riverfront in Lucknow, India: Revitalization of A Cultural Heritage Landscape
No ratings yet
The Gomti Riverfront in Lucknow, India: Revitalization of A Cultural Heritage Landscape
20 pages
To From
No ratings yet
To From
4 pages
2ND Performance Task in Science
No ratings yet
2ND Performance Task in Science
6 pages
Assessment of Credit Management in Micro Finance Institution
No ratings yet
Assessment of Credit Management in Micro Finance Institution
42 pages
API 653 Notes
No ratings yet
API 653 Notes
3 pages
Tan ChineseLiteratureEssays 2016
No ratings yet
Tan ChineseLiteratureEssays 2016
5 pages
Newborn Care 2
No ratings yet
Newborn Care 2
2 pages
Dhupguri Report
No ratings yet
Dhupguri Report
11 pages
Information Package: Including Terms & Conditions
No ratings yet
Information Package: Including Terms & Conditions
8 pages
U2000 Northbound Performance File Interface Developer Guide (NE-Based)
No ratings yet
U2000 Northbound Performance File Interface Developer Guide (NE-Based)
79 pages
Monetary Statistics M
No ratings yet
Monetary Statistics M
42 pages
DR - AishaCv 20250422 152511 0000
No ratings yet
DR - AishaCv 20250422 152511 0000
4 pages
Benchmark Report - Voice Service Optimization For Common State, TP20160728
No ratings yet
Benchmark Report - Voice Service Optimization For Common State, TP20160728
16 pages
Kruse
No ratings yet
Kruse
25 pages
Teip7419 Mo
No ratings yet
Teip7419 Mo
22 pages
Gallup Test
No ratings yet
Gallup Test
25 pages
Experiment 6 Isolation of Eugenol From Cloves TECHNIQUE: Steam Distillation
No ratings yet
Experiment 6 Isolation of Eugenol From Cloves TECHNIQUE: Steam Distillation
2 pages
Bridges and Roads
No ratings yet
Bridges and Roads
22 pages
Playdor School Bus Schedule - 15 April 2024 To 26 July 2024-1
No ratings yet
Playdor School Bus Schedule - 15 April 2024 To 26 July 2024-1
1 page
Statistical Reasoning For Everyday Life 5th Edition Bennett Test Bank Download
100% (3)
Statistical Reasoning For Everyday Life 5th Edition Bennett Test Bank Download
40 pages
15 Advanced English Phrases For Better Expressing Emotions
No ratings yet
15 Advanced English Phrases For Better Expressing Emotions
4 pages
Formulation of Objective
No ratings yet
Formulation of Objective
16 pages

Ex. No: 1 Exploring The Features of Numpy, Scipy, Jupyter, Statsmodels and Pandas Date: 07/08/2024

Uploaded by

Ex. No: 1 Exploring The Features of Numpy, Scipy, Jupyter, Statsmodels and Pandas Date: 07/08/2024

Uploaded by

EX.

[1.87, 1.87, 1.82, 1.91, 1.9, 1.85]

arr = np.array([1, 2, 3, 4, 5])

arr = np.array([1, 2, 3, 4, 5])

sns.histplot(df['Birth Rate'], bins=10, kde=True)

top5_co2 = df.nlargest(5, 'Co2-Emissions')

import scipy.stats as stats

DescribeResult(nobs=10, minmax=(0, 9), mean=4.5, variance=9.166666666666668,

# Create a sample dataset

df = pd.DataFrame({'X': X, 'Y': Y})

# Linear regression can be performed using statsmodels package

You might also like