0% found this document useful (0 votes)

45 views8 pages

Python Data Analytics Libraries

Uploaded by

yashnikam844

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

45 views8 pages

Python Data Analytics Libraries

Uploaded by

yashnikam844

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 8

Python is a powerful tool for data analytics, thanks to its extensive libraries that support data

manipulation, analysis, visualization, and machine learning. Here’s a detailed look at some of the
most popular Python libraries used in data analytics:

### 1. **Pandas**

- Purpose: Data manipulation and analysis.

- **Key Features**: Provides DataFrame and Series objects, powerful tools for reading and writing
data, handling missing data, and more.

```python

import pandas as pd

# Create a DataFrame

df = pd.DataFrame({

'Name': ['Alice', 'Bob', 'Charlie'],

'Age': [25, 30, 35],

'Salary': [70000, 80000, 90000]

})

# Display the DataFrame

print(df)

# Perform operations

df['Salary'] = df['Salary'] * 1.1

print(df.describe())

```

### 2. **NumPy**

- Purpose: Numerical computing.

- **Key Features**: Support for large, multi-dimensional arrays and matrices, mathematical
functions.

```python

import numpy as np

# Create an array

arr = np.array([1, 2, 3, 4, 5])

# Perform operations

arr = arr * 2

print(arr)

# Statistical operations

mean = np.mean(arr)

std_dev = np.std(arr)

print(f"Mean: {mean}, Standard Deviation: {std_dev}")

```

### 3. **SciPy**

- Purpose: Scientific computing.

- Key Features: Builds on NumPy, providing additional functionality for optimization,

integration, interpolation, eigenvalue problems, and more.

```python

from scipy import stats

# Perform statistical tests

data = np.random.normal(0, 1, 1000)

t_statistic, p_value = stats.ttest_1samp(data, 0)

print(f"T-statistic: {t_statistic}, P-value: {p_value}")

```

### 4. **Matplotlib**

- Purpose: Data visualization.

- **Key Features**: Comprehensive library for creating static, animated, and interactive
visualizations.

```python

import matplotlib.pyplot as plt

# Plot data

plt.plot([1, 2, 3], [4, 5, 6])

plt.xlabel('X-axis')

plt.ylabel('Y-axis')

plt.title('Simple Plot')

plt.show()

```

### 5. **Seaborn**

- Purpose: Statistical data visualization.

- **Key Features**: Based on Matplotlib, provides a high-level interface for drawing attractive and
informative graphics.

```python

import seaborn as sns

# Load dataset

tips = sns.load_dataset("tips")

# Create a bar plot

sns.barplot(x="day", y="total_bill", data=tips)

plt.show()

```

### 6. **Plotly**

- Purpose: Interactive data visualization.

- Key Features: Supports a variety of chart types, interactive plots.

```python

import plotly.express as px

# Create an interactive line plot

fig = px.line(x=[1, 2, 3], y=[4, 5, 6], title='Interactive Line Plot')

fig.show()

```

### 7. **Scikit-learn**

- Purpose: Machine learning.

- **Key Features**: Tools for data mining and data analysis, including classification, regression,
clustering, and dimensionality reduction.

```python

from sklearn.datasets import load_iris

from sklearn.model_selection import train_test_split

from sklearn.ensemble import RandomForestClassifier

from sklearn.metrics import accuracy_score

# Load dataset

iris = load_iris()
X, y = iris.data, iris.target

# Split data

X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=42)

# Train a model

model = RandomForestClassifier()

model.fit(X_train, y_train)

# Make predictions

predictions = model.predict(X_test)

print(f"Accuracy: {accuracy_score(y_test, predictions)}")

```

### 8. **Statsmodels**

- Purpose: Statistical modeling and econometrics.

- Key Features: Tools for estimating and testing statistical models.

```python

import statsmodels.api as sm

# Load dataset

data = sm.datasets.get_rdataset("mtcars").data

# Fit a linear regression model

X = sm.add_constant(data[['hp', 'wt']])

y = data['mpg']

model = sm.OLS(y, X).fit()

print(model.summary())

```
### 9. **Dask**

- Purpose: Parallel computing and larger-than-memory computations.

- **Key Features**: Integrates with Pandas and NumPy, allows for scalable data analysis.

```python

import dask.dataframe as dd

# Load a large dataset

df = dd.read_csv('large_dataset.csv')

# Perform operations

result = df.groupby('column_name').mean().compute()

print(result)

```

### 10. TensorFlow and PyTorch

- Purpose: Deep learning and machine learning.

- **Key Features**: TensorFlow provides a comprehensive ecosystem for ML; PyTorch offers
dynamic computation graphs and is favored for research.

**TensorFlow Example:**

```python

import tensorflow as tf

# Define a simple model

model = tf.keras.models.Sequential([

tf.keras.layers.Dense(10, activation='relu'),

tf.keras.layers.Dense(1)
])

# Compile and train the model

model.compile(optimizer='adam', loss='mean_squared_error')

model.fit(X_train, y_train, epochs=10)

```

**PyTorch Example:**

```python

import torch

import torch.nn as nn

import torch.optim as optim

# Define a simple model

class SimpleModel(nn.Module):

def __init__(self):

super(SimpleModel, self).__init__()

self.fc1 = nn.Linear(10, 1)

def forward(self, x):

return self.fc1(x)

model = SimpleModel()

# Define loss and optimizer

criterion = nn.MSELoss()

optimizer = optim.Adam(model.parameters(), lr=0.01)

# Train the model

for epoch in range(10):

optimizer.zero_grad()
outputs = model(torch.tensor(X_train, dtype=torch.float32))

loss = criterion(outputs, torch.tensor(y_train, dtype=torch.float32))

loss.backward()

optimizer.step()

```

These libraries form the core of Python's data analytics ecosystem. Mastering them will enable you
to handle a wide variety of data-related tasks efficiently and effectively.

Essential Python Libraries and Functions For Data Science 1706295212
No ratings yet
Essential Python Libraries and Functions For Data Science 1706295212
12 pages
10 Essential Python Libraries For Data Professionals - by Sigli Mumuni - Medium
No ratings yet
10 Essential Python Libraries For Data Professionals - by Sigli Mumuni - Medium
6 pages
Exp 1 Dav
No ratings yet
Exp 1 Dav
3 pages
Practical 1
No ratings yet
Practical 1
8 pages
Numpy: Explanation
No ratings yet
Numpy: Explanation
21 pages
Machine Learning Document
No ratings yet
Machine Learning Document
7 pages
Essential Python Libraries For Data Science 1694045951
No ratings yet
Essential Python Libraries For Data Science 1694045951
7 pages
Staple Python Libraries For Data Science
No ratings yet
Staple Python Libraries For Data Science
26 pages
Top 20 Python Libraries For Data Science
No ratings yet
Top 20 Python Libraries For Data Science
15 pages
Dsbda Unit4
No ratings yet
Dsbda Unit4
110 pages
Lab 2 Report
No ratings yet
Lab 2 Report
6 pages
Lecture 4
No ratings yet
Lecture 4
33 pages
Introduction To Popular-1
No ratings yet
Introduction To Popular-1
15 pages
PDF 1675791423
No ratings yet
PDF 1675791423
11 pages
Machine Learning Python Packages
No ratings yet
Machine Learning Python Packages
9 pages
ML Lab File
No ratings yet
ML Lab File
33 pages
00 Dm2 Python Libraries4data Science 2020
No ratings yet
00 Dm2 Python Libraries4data Science 2020
7 pages
Python Libs For Ds
No ratings yet
Python Libs For Ds
5 pages
ML Exp
No ratings yet
ML Exp
9 pages
Python for Data Analysis
No ratings yet
Python for Data Analysis
15 pages
ENROLLMENT NO: 202203103510400: Utu/Cgpit/Ce/Sem-6/Machine Intelligence (Ce5008)
No ratings yet
ENROLLMENT NO: 202203103510400: Utu/Cgpit/Ce/Sem-6/Machine Intelligence (Ce5008)
6 pages
Basic Libraries For Data Science
No ratings yet
Basic Libraries For Data Science
4 pages
Machine Learning Experiment
No ratings yet
Machine Learning Experiment
69 pages
Deep Python for Data Analysis
No ratings yet
Deep Python for Data Analysis
4 pages
Predictive Data Analytics With Python
100% (2)
Predictive Data Analytics With Python
97 pages
Common Python Packages For FinML
No ratings yet
Common Python Packages For FinML
7 pages
Exp1ml
No ratings yet
Exp1ml
6 pages
15 Python Libraries For Data Science
No ratings yet
15 Python Libraries For Data Science
17 pages
Top 18 Python Libraries
100% (1)
Top 18 Python Libraries
11 pages
Chapter1 Notes Python Data Analysis
No ratings yet
Chapter1 Notes Python Data Analysis
2 pages
Practical 1
No ratings yet
Practical 1
2 pages
DAV Exp.1-8 Output
No ratings yet
DAV Exp.1-8 Output
19 pages
40 Most Popular Python Scientific Libraries
No ratings yet
40 Most Popular Python Scientific Libraries
9 pages
Python
No ratings yet
Python
3 pages
Python Library Functions
No ratings yet
Python Library Functions
12 pages
Project Des
No ratings yet
Project Des
52 pages
Python Libraries
No ratings yet
Python Libraries
17 pages
Python Libraries for ML
No ratings yet
Python Libraries for ML
2 pages
Data Preprocessing-AIML Algorithm1
No ratings yet
Data Preprocessing-AIML Algorithm1
47 pages
Introduction To EDA
No ratings yet
Introduction To EDA
16 pages
5 Essential Python Libraries for Every Data Scientist
No ratings yet
5 Essential Python Libraries for Every Data Scientist
10 pages
Data Analysis Library: by Muthu Priya J 19MZ06
No ratings yet
Data Analysis Library: by Muthu Priya J 19MZ06
3 pages
Data Science Tools
No ratings yet
Data Science Tools
2 pages
Sec-D ML Practical File PDF
No ratings yet
Sec-D ML Practical File PDF
19 pages
100 Must-Know PythonMl Interview Questions and Answers 2024 - Devinterview - Io
No ratings yet
100 Must-Know PythonMl Interview Questions and Answers 2024 - Devinterview - Io
1 page
Report Format (1) .Docx - 20240508 - 124537 - 0000
No ratings yet
Report Format (1) .Docx - 20240508 - 124537 - 0000
11 pages
Chapter 6 Python Libraries For Machine Learning
No ratings yet
Chapter 6 Python Libraries For Machine Learning
21 pages
Pre ML Practise
No ratings yet
Pre ML Practise
14 pages
Introduction to Python for Data Analysis and Visualization 2
No ratings yet
Introduction to Python for Data Analysis and Visualization 2
24 pages
ML Lab Manual
No ratings yet
ML Lab Manual
38 pages
Python For Data Exploration
No ratings yet
Python For Data Exploration
28 pages
Data Science
No ratings yet
Data Science
17 pages
Libraries For Data Science - CBS - PDS
No ratings yet
Libraries For Data Science - CBS - PDS
2 pages
TY FDS Workbook
No ratings yet
TY FDS Workbook
56 pages
Week 3
No ratings yet
Week 3
10 pages
Core Libraries For Machine Learning
No ratings yet
Core Libraries For Machine Learning
5 pages
PYTHON
No ratings yet
PYTHON
11 pages
Comprehensive Overview of Common ML Techniques
No ratings yet
Comprehensive Overview of Common ML Techniques
7 pages
Quick Python Guide
From Everand
Quick Python Guide
Coder1
No ratings yet
Introduction to Python Programming: Do your first steps into programming with python
From Everand
Introduction to Python Programming: Do your first steps into programming with python
Greytower Corp
No ratings yet
Cap1 Financial Accounting Screenshots
No ratings yet
Cap1 Financial Accounting Screenshots
29 pages
Sjg06-017 (11) Grace01 - На Английсском
No ratings yet
Sjg06-017 (11) Grace01 - На Английсском
33 pages
CCD Manual
No ratings yet
CCD Manual
118 pages
Object Monologue
No ratings yet
Object Monologue
8 pages
Test Chart Carta PDF
No ratings yet
Test Chart Carta PDF
1 page
Accenture
100% (1)
Accenture
5 pages
Comparison Essay
No ratings yet
Comparison Essay
10 pages
OEG Service Information Hard Disk Initialization Method For DNC-DT Function
100% (1)
OEG Service Information Hard Disk Initialization Method For DNC-DT Function
2 pages
Contoh CV
No ratings yet
Contoh CV
2 pages
Em270 DS Eng
No ratings yet
Em270 DS Eng
18 pages
Annunciation Spontaneous Annunciation - 16-03-2021 16:58:21.550
No ratings yet
Annunciation Spontaneous Annunciation - 16-03-2021 16:58:21.550
3 pages
Saes T 151
No ratings yet
Saes T 151
13 pages
SD Aqua View Brochure
No ratings yet
SD Aqua View Brochure
4 pages
Chajja
86% (7)
Chajja
14 pages
Lab 04. Relational, Logical and Conditional Operators in C++
No ratings yet
Lab 04. Relational, Logical and Conditional Operators in C++
8 pages
Workshop Manual: Technical Data
100% (3)
Workshop Manual: Technical Data
40 pages
Entropy 22 01150 v2
No ratings yet
Entropy 22 01150 v2
14 pages
WFC 4000
No ratings yet
WFC 4000
64 pages
MScThesis PepijnKessels
No ratings yet
MScThesis PepijnKessels
142 pages
Instruction Manual: VHF Marine Transceiver
No ratings yet
Instruction Manual: VHF Marine Transceiver
80 pages
BCD Code Simulation Practice
No ratings yet
BCD Code Simulation Practice
9 pages
Referensi 3
No ratings yet
Referensi 3
15 pages
Course Code CSE3011 Python Programming Course Type LP Credits 3
No ratings yet
Course Code CSE3011 Python Programming Course Type LP Credits 3
3 pages
1925 Multi Servers X.T.R.E.A.M - 2024-05-02-00-30-12
No ratings yet
1925 Multi Servers X.T.R.E.A.M - 2024-05-02-00-30-12
131 pages
1 Tune-Up and Routine Maintenance 6 Engine Oil and Filter Change (Every 5,000 Miles (8,000 KM) or 5 Months)
No ratings yet
1 Tune-Up and Routine Maintenance 6 Engine Oil and Filter Change (Every 5,000 Miles (8,000 KM) or 5 Months)
6 pages
Introduction To Presentation Software
No ratings yet
Introduction To Presentation Software
34 pages
CSBS Practical
No ratings yet
CSBS Practical
1 page
Power Bric: General Overview
No ratings yet
Power Bric: General Overview
2 pages
Foster Krolnik Creating Assertion Based IP
No ratings yet
Foster Krolnik Creating Assertion Based IP
328 pages
Sinucom Ffs 76
No ratings yet
Sinucom Ffs 76
2 pages

Python Data Analytics Libraries

Uploaded by

Python Data Analytics Libraries

Uploaded by

Python is a powerful tool for data analytics, thanks to its extensive libraries that support data

- **Purpose**: Data manipulation and analysis.

'Name': ['Alice', 'Bob', 'Charlie'],

'Age': [25, 30, 35],

'Salary': [70000, 80000, 90000]

# Display the DataFrame

df['Salary'] = df['Salary'] * 1.1

- **Purpose**: Numerical computing.

arr = np.array([1, 2, 3, 4, 5])

print(f"Mean: {mean}, Standard Deviation: {std_dev}")

- **Purpose**: Scientific computing.

- **Key Features**: Builds on NumPy, providing additional functionality for optimization,

from scipy import stats

# Perform statistical tests

data = np.random.normal(0, 1, 1000)

t_statistic, p_value = stats.ttest_1samp(data, 0)

print(f"T-statistic: {t_statistic}, P-value: {p_value}")

- **Purpose**: Data visualization.

import matplotlib.pyplot as plt

plt.plot([1, 2, 3], [4, 5, 6])

- **Purpose**: Statistical data visualization.

import seaborn as sns

# Create a bar plot

- **Purpose**: Interactive data visualization.

- **Key Features**: Supports a variety of chart types, interactive plots.

# Create an interactive line plot

fig = px.line(x=[1, 2, 3], y=[4, 5, 6], title='Interactive Line Plot')

- **Purpose**: Machine learning.

from sklearn.datasets import load_iris

from sklearn.model_selection import train_test_split

from sklearn.ensemble import RandomForestClassifier

from sklearn.metrics import accuracy_score

X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=42)

print(f"Accuracy: {accuracy_score(y_test, predictions)}")

- **Purpose**: Statistical modeling and econometrics.

- **Key Features**: Tools for estimating and testing statistical models.

# Fit a linear regression model

model = sm.OLS(y, X).fit()

- **Purpose**: Parallel computing and larger-than-memory computations.

# Load a large dataset

### 10. **TensorFlow and PyTorch**

- **Purpose**: Deep learning and machine learning.

# Define a simple model

# Compile and train the model

model.fit(X_train, y_train, epochs=10)

import torch.optim as optim

# Define a simple model

def forward(self, x):

# Define loss and optimizer

optimizer = optim.Adam(model.parameters(), lr=0.01)

# Train the model

for epoch in range(10):

loss = criterion(outputs, torch.tensor(y_train, dtype=torch.float32))

You might also like

- Purpose: Data manipulation and analysis.

- Purpose: Numerical computing.

- Purpose: Scientific computing.

- Key Features: Builds on NumPy, providing additional functionality for optimization,

- Purpose: Data visualization.

- Purpose: Statistical data visualization.

- Purpose: Interactive data visualization.

- Key Features: Supports a variety of chart types, interactive plots.

- Purpose: Machine learning.

- Purpose: Statistical modeling and econometrics.

- Key Features: Tools for estimating and testing statistical models.

- Purpose: Parallel computing and larger-than-memory computations.

### 10. TensorFlow and PyTorch

- Purpose: Deep learning and machine learning.