Chapter 2. Data Analysis and Processing - Full

Uploaded by

schlaggen

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

52 views49 pages

Chapter 2. Data Analysis and Processing - Full

Uploaded by

schlaggen

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 49

DATA ANALYSIS AND PROCESSING

DR. PHẠM MINH HOÀN – [email protected]

OBJECTIVES OF CHAPTER 2
• Understanding different types of data sources and how to access and manipulate
them.
• Data analysis is all about extracting meaningful insights from your data.
• Data exploration is about getting familiar with your data and identifying patterns
and trends.
• Data
visualization is about creating visual representations of your data to
communicate insights effectively.
• Most data analysis tasks involve using specialized libraries that provide functions
and tools for working with data.
CONTENTS
2.1. Introduce and work with data sources
2.2. Data Analysis
2.3. Data Exploration
2.4. Data Visualization
2.5. Working with library
2.5.1. Pandas
2.5.2. SciPy and Numpy
2.5.3. Matplotlib
2.5.4. Scikit-learn
INTRODUCE AND WORK WITH DATA SOURCES
• Datasources: A data source is a location or system that stores and
manages data. This data can be anything from numbers and text to
images and audio files.
• Databases: These are structured collections of data that allow for easy access
and manipulation.
• Spreadsheets: Familiar programs like Microsoft Excel that store data in tables
with rows and columns.
• Cloud-based platforms: Services like Google Drive or Dropbox that store data
online and allow access from anywhere.
INTRODUCE AND WORK WITH DATA SOURCES
• Working with data sources:
• Identify the data source: Determine what kind of data you need and where it's
stored.
• Connect to the data source: This will involve using specific tools or software
depending on the data source type.
• Extractthe data: Use tools or write queries to extract the specific data you
need for your project.
INTRODUCE AND WORK WITH DATA SOURCES
• Working with data sources:
• Clean and transform the data: Real-world data often has inconsistencies or
errors. This stage involves cleaning the data and transforming it into a usable
format for analysis.
• Analyze the data: After the data is prepared, can use various techniques to
analyze it and extract insights.
DATA ANALYSIS
• Dataanalysis is the process of extracting meaningful information
from data.
• Goals of data analysis:
• Uncover patterns and trends: Data analysis helps identify relationships
between different pieces of data. This can reveal trends or patterns.
• Make informed decisions: Data-driven insights can inform better choices in
various fields, from business strategy to scientific research.
• Solve problems: Data analysis is a powerful tool for identifying and solving
problems. By examining data, can pinpoint root causes and develop solutions.
DATA ANALYSIS
• Main types of data analysis:
• Descriptive Analysis: This is the foundation for further analysis. It provides a
summary of the data, describing its central tendencies (like average or
median) and variability.
• Diagnostic Analysis: Understand why things are happening. Identify factors
influencing specific outcomes or behaviors. Using data to diagnose the root
cause of a problem.
• PredictiveAnalysis: Uses historical data to forecast future trends or events.
The goal is to make predictions about what might happen based on patterns
observed in the data.
DATA ANALYSIS
• Popular Python Libraries for Data Analysis:
• NumPy: The foundation for numerical computing in Python. It offers efficient
arrays, linear algebra operations, and mathematical functions for data
manipulation.
• Pandas: Builds on top of NumPy and provides high-performance data
structures like DataFrames (think spreadsheet on steroids) for handling tabular
data. It excels in data cleaning, transformation, and analysis.
• SciPy: Offers a collection of algorithms and functions for advanced scientific
computing and data analysis tasks like optimization, integration, and
statistical modeling.
EXPLORATORY DATA ANALYSIS
EDA is a phenomenon under data analysis used for gaining a better
understanding of data aspects like:
• main features of data
• variables and relationships that hold between them
• Identifying which variables are important for our problem
EXPLORATORY DATA ANALYSIS
EDA process:
• Reading dataset
• Analyzing the data
• Checking for the duplicates
• Missing Values Calculation
• Exploratory Data Analysis
• Univariate Analysis
• Bivariate Analysis
• Multivariate Analysis
EXPLORATORY DATA ANALYSIS
EDA process:
• Step 0: Install Libraries
# íntall Libraries
python pip install pandas
EXPLORATORY DATA ANALYSIS
EDA process:
• Step 1: Importing Required Libraries
# importting Libraries
import pandas as pd
import numpy as np
import matplotlib.pyplot as plt
import seaborn as sns
import warnings as wr
wr.filterwarnings('ignore')
EXPLORATORY DATA ANALYSIS
EDA process:
• Step 2: Reading Dataset
# loading and reading dataset
df = pd.read_csv("winequality-red.csv")
print(df.head())
EXPLORATORY DATA ANALYSIS
EDA process:
• Step 3: Analyzing the Data
# shape of the data
df.shape
#data information
df.info()
# describing the data
df.describe()
#column to list
df.columns.tolist()
EXPLORATORY DATA ANALYSIS
EDA process:
• Step 3: Analyzing the Data
# check for missing values:
df.isnull().sum()
#checking duplicate values
df.nunique()
EXPLORATORY DATA ANALYSIS
EDA process:
• Step 4: Univariate Analysis
# Assuming 'df' is your DataFrame
quality_counts = df['quality'].value_counts()
# Using Matplotlib to create a count plot
plt.figure(figsize=(8, 6))
plt.bar(quality_counts.index, quality_counts, color='darpink')
plt.title('Count Plot of Quality')
plt.xlabel('Quality')
plt.ylabel('Count')
plt.show()
EXPLORATORY DATA ANALYSIS
EDA process:
• Step 4: Univariate Analysis
# Set Seaborn style
sns.set_style("darkgrid")

# Identify numerical columns

numerical_columns = df.select_dtypes(include=["int64", "float64"]).columns

# Plot distribution of each numerical feature

plt.figure(figsize=(14, len(numerical_columns) * 3))
EXPLORATORY DATA ANALYSIS
EDA process:
• Step 4: Univariate Analysis
for idx, feature in enumerate(numerical_columns, 1):
plt.subplot(len(numerical_columns), 2, idx)
sns.histplot(df[feature], kde=True)
plt.title(f"{feature} | Skewness: {round(df[feature].skew(), 2)}")

# Adjust layout and show plots

plt.tight_layout()
plt.show()
EXPLORATORY DATA ANALYSIS
EDA process:
• Step 4: Univariate Analysis
# Assuming 'df' is your DataFrame
plt.figure(figsize=(10, 8))
# Using Seaborn to create a swarm plot
sns.swarmplot(x="quality", y="alcohol", data=df, palette='viridis')
plt.title('Swarm Plot for Quality and Alcohol')
plt.xlabel('Quality')
plt.ylabel('Alcohol')
plt.show()
EXPLORATORY DATA ANALYSIS
EDA process:
• Step 5: Bivariate Analysis
# Set the color palette
sns.set_palette("Pastel1")
# Assuming 'df' is your DataFrame
plt.figure(figsize=(10, 6))
# Using Seaborn to create a pair plot with the specified color palette
sns.pairplot(df)
plt.suptitle('Pair Plot for DataFrame')
plt.show()
EXPLORATORY DATA ANALYSIS
EDA process:
• Step 5: Bivariate Analysis
# Assuming 'df' is your DataFrame
df['quality'] = df['quality'].astype(str) # Convert 'quality' to categorical
plt.figure(figsize=(10, 8))
# Using Seaborn to create a violin plot
sns.violinplot(x="quality", y="alcohol", data=df, palette={
'3': 'lightcoral', '4': 'lightblue', '5': 'lightgreen', '6': 'gold', '7': 'lightskyblue',
'8': 'lightpink'}, alpha=0.7)
EXPLORATORY DATA ANALYSIS
EDA process:
• Step 5: Bivariate Analysis
plt.title('Violin Plot for Quality and Alcohol')
plt.xlabel('Quality')
plt.ylabel('Alcohol')
plt.show()
EXPLORATORY DATA ANALYSIS
EDA process:
• Step 5: Bivariate Analysis
#plotting box plot between alcohol and quality
sns.boxplot(x='quality', y='alcohol', data=df)
EXPLORATORY DATA ANALYSIS
EDA process:
• Step 6: Multivariate Analysis
# Assuming 'df' is your DataFrame
plt.figure(figsize=(15, 10))

# Using Seaborn to create a heatmap

sns.heatmap(df.corr(), annot=True, fmt='.2f', cmap='Pastel2', linewidths=2)

plt.title('Correlation Heatmap')
plt.show()
DATA VISUALIZATION
• Data visualization is a powerful tool to understand and communicate
insights from data.
• Data visualization helps you see patterns, trends, and relationships
within your data that might be difficult to identify just by looking at
raw numbers. It's a great way to:
• Summarize large datasets
• Find correlations between variables
• Communicate complex ideas to others
• Python Libraries for Data Visualization: Matplotlib, Seaborn, …
DATA VISUALIZATION
• Install and load libraries.
• Pandas.
• SciPy and NumPy.
• Scikit-learn.
INSTALL AND LOAD LIBRARIES
• Install libraries in Python
py -m pip install [package_name]
Ex:
py -m pip install numpy
py -m pip install pandas
py -m pip install matplotlit
INSTALL AND LOAD LIBRARIES
• Install libraries in Pycharm
Step 1. File\Settings… (Ctrl+Alt+S)
Step 2. Project: pythonProject…\Python Interpreter
Step 3. Install (+) (Alt+S)
DATA VISUALIZATION WITH PANDAS
• Pandas, while not a dedicated visualization library, offers built-in
plotting functionalities that are great for exploratory data analysis
(EDA).
DATA VISUALIZATION WITH PANDAS
• Scatter Plots: Show relationships between two numerical variables.
• Line Plots: Useful for visualizing trends over time or along a sequence.
• Histograms: Depict the distribution of a single numerical variable.
• Box Plots: Summarize the distribution of a numerical variable, highlighting
outliers.
• Bar Plots: Represent categorical data or compare values between categories.
• Area Plots: Similar to line plots, but emphasize the area between the line and the
axis.
• Pie Charts: Represent proportions of a whole using slices.
DATA VISUALIZATION WITH PANDAS
• Ex:
import pandas as pd
import matplotlib.pyplot as plt
# Sample Dataframe
data = {'col1': [1, 2, 3, 4], 'col2': ['A', 'B', 'A', 'C']}
df = pd.DataFrame(data)
# Create a scatter plot
df.plot(kind='scatter', x='col1', y='col2')
plt.show()
DATA VISUALIZATION WITH NUMPY
• WhileNumPy are fundamental libraries for scientific computing in
Python, data visualization isn't their primary focus.
• However, they can be a powerful foundation for creating custom
visualizations.
DATA VISUALIZATION WITH NUMPY
• Ex:
import numpy as np
import matplotlib.pyplot as plt
x = np.linspace(0, 5, 100) # Create x-axis data (100 points from 0 to 5)
y = x**2 # Create y-axis data (square the x values)
plt.plot(x, y) # Plot the line
plt.xlabel('X-axis')
plt.ylabel('Y-axis')
plt.title('Line Plot (NumPy Data)')
plt.show()
DATA VISUALIZATION WITH SCIPY
• While SciPy is a powerful library for scientific computing in Python,
it's not primarily designed for data visualization.
• SciPy's Role in Data Visualization Workflow:
• Data Preparation: SciPy functions can help you clean, transform, and analyze
your data before visualization. For instance, you can use SciPy for outlier
detection, filtering, and data smoothing.
• Statistical Analysis: SciPy provides functions for statistical calculations like
finding correlations or fitting distributions. These results can be incorporated
into your visualizations to add context and insights.
DATA VISUALIZATION WITH SCIPY
• Ex:
import matplotlib.pyplot as plt
from scipy.signal import savgol_filter
# Generate noisy data
x = range(100)
y = np.sin(x) + np.random.randn(100)
# Smooth the data using Savitzky-Golay filter
y_smooth = savgol_filter(y, 51, 3)
# Plot the original and smoothed data
plt.plot(x, y, label='Original Data')
plt.plot(x, y_smooth, label='Smoothed Data')
plt.legend()
plt.show()
DATA VISUALIZATION WITH MATPLOTLIB
• Line Charts: Ideal for showcasing trends or changes over time (e.g.,
temperature fluctuations, stock prices).
• Bar Charts: Effective for comparing categories or quantities (e.g.,
sales figures across regions, customer satisfaction ratings).
• Scatter Plots: Used to explore relationships between two variables
(e.g., correlation between weight and height, relationship between
study hours and exam scores).
• PieCharts: Useful for representing proportions of a whole (e.g.,
budget allocation, market share distribution).
DATA VISUALIZATION WITH MATPLOTLIB
• Ex: Data visualization with Matplotlib that creates a line chart.
import matplotlib.pyplot as plt
# Sample temperature data for a week
days = ['Mon', 'Tue', 'Wed', 'Thu', 'Fri', 'Sat', 'Sun']
temperatures = [18, 21, 24, 22, 20, 19, 17]
# Create the line chart
plt.plot(days, temperatures)
# Add labels and title
plt.xlabel('Days')
plt.ylabel('Temperature (°C)')
plt.title('Weekly Temperature Variation')
# Display the chart
plt.show()
DATA ANALYSIS AND VISUALIZATION
• Data Loading: Reading data from CSV files.
• Data Cleaning: Checking for missing values and data types.
• Data Analysis: Grouping, sorting, and summarizing data.
• Data Visualization: Creating charts to explore trends.
DATA ANALYSIS AND VISUALIZATION
• Ex:
import pandas as pd
# Read the data from the CSV file
df = pd.read_csv("website_traffic.csv")
print(df.head())
print(df.info())
DATA ANALYSIS AND VISUALIZATION
• Ex:
# Group data by source and sort by visits in descending order
source_visits =
df.groupby('source')['visits'].sum().sort_values(ascending=False)
# Get the source with the most visits
top_source = source_visits.index[0]
# Get the number of visits from that source
top_visits = source_visits.iloc[0]
print(f"Top traffic source: {top_source} with {top_visits} visits")
DATA ANALYSIS AND VISUALIZATION
• Ex:
import matplotlib.pyplot as plt
# Plot a bar graph of source vs visits
source_visits.plot(kind='bar')
plt.xlabel("Traffic Source")
plt.ylabel("Visits")
plt.title("Website Traffic by Source")
plt.show()
SCIKIT-LEARN
• Free and open-source library for machine learning in Python
• User-friendly interface for various machine learning tasks
• Wide range of algorithms for classification, regression, clustering, and
more
SCIKIT-LEARN
• Classification: Categorizes data points (e.g., spam filtering)
• Regression: Predicts continuous values (e.g., stock price prediction)
• Clustering: Groups similar data points (e.g., customer segmentation)
• Dimensionality Reduction: Reduces features while preserving
information (e.g., image compression)
DATA VISUALIZATION WITH SCIKIT-LEARN
• Ex: Visualizing K-Means Clustering with scikit-learn
#Import libraries:
import matplotlib.pyplot as plt
from sklearn.datasets import load_iris
from sklearn.cluster import KMeans
from sklearn.metrics import pairplot
DATA VISUALIZATION WITH SCIKIT-LEARN
• Ex: Visualizing K-Means Clustering with scikit-learn
#Load data:
iris = load_iris()
x = iris.data # Features
y = iris.target # Target labels (species)
DATA VISUALIZATION WITH SCIKIT-LEARN
• Ex: Visualizing K-Means Clustering with scikit-learn
#Perform K-Means clustering:
kmeans = KMeans(n_clusters=3, random_state=0)
kmeans.fit(X)
DATA VISUALIZATION WITH SCIKIT-LEARN
• Ex: Visualizing K-Means Clustering with scikit-learn
#Visualize clusters:
plt.figure(figsize=(8, 6))
pairplot(X, labels=kmeans.labels_, hue=kmeans.labels_)
plt.title("Iris Dataset - KMeans Clustering")
plt.show()
SUMMARY

Aids Lab
No ratings yet
Aids Lab
45 pages
Telecom Customer Churn Project Report
50% (2)
Telecom Customer Churn Project Report
25 pages
Exploratory Data Analysis
No ratings yet
Exploratory Data Analysis
23 pages
Iso 11462-2-2010
No ratings yet
Iso 11462-2-2010
18 pages
Dev Lab Manual Org
No ratings yet
Dev Lab Manual Org
28 pages
Python Data Analyst Handbook Guide - Byom - Cybertechie
No ratings yet
Python Data Analyst Handbook Guide - Byom - Cybertechie
57 pages
Internship Report - K
No ratings yet
Internship Report - K
30 pages
Data Analysis With Python
No ratings yet
Data Analysis With Python
10 pages
2A - Python+Data Analysis For Pyhton2 v2
No ratings yet
2A - Python+Data Analysis For Pyhton2 v2
38 pages
Data Analytics Fundamentals-2
No ratings yet
Data Analytics Fundamentals-2
34 pages
Comprehensive Guide Data Exploration Sas Using Python Numpy Scipy Matplotlib Pandas
100% (1)
Comprehensive Guide Data Exploration Sas Using Python Numpy Scipy Matplotlib Pandas
12 pages
FRA Assignment
100% (1)
FRA Assignment
31 pages
Unit 2
No ratings yet
Unit 2
36 pages
Unit 6
No ratings yet
Unit 6
3 pages
EDA Unit V
No ratings yet
EDA Unit V
28 pages
Krishna Kumar BTP2 Report
No ratings yet
Krishna Kumar BTP2 Report
23 pages
Labdev
No ratings yet
Labdev
57 pages
Chap - 24
No ratings yet
Chap - 24
61 pages
Introduction To EDA
No ratings yet
Introduction To EDA
16 pages
MLS 1 - Python For Data Science
No ratings yet
MLS 1 - Python For Data Science
33 pages
Robust Statistical Methods For Empirical Software Engineering
No ratings yet
Robust Statistical Methods For Empirical Software Engineering
52 pages
Datascience 3
No ratings yet
Datascience 3
40 pages
Introduction To Data Science
No ratings yet
Introduction To Data Science
33 pages
Ccs346 Eda Unit 1
No ratings yet
Ccs346 Eda Unit 1
139 pages
1.3.1. Exploratory Data Analysis
No ratings yet
1.3.1. Exploratory Data Analysis
24 pages
Python For Data Analysts - Quick Summary
No ratings yet
Python For Data Analysts - Quick Summary
6 pages
DS Module 1 Notes
No ratings yet
DS Module 1 Notes
25 pages
Data Science Notes - Hamza
No ratings yet
Data Science Notes - Hamza
110 pages
Dav Exps - Merged - Merged
No ratings yet
Dav Exps - Merged - Merged
99 pages
Rushikesh Mane EDA Capstone Project On Hotel Booking Analysis
No ratings yet
Rushikesh Mane EDA Capstone Project On Hotel Booking Analysis
22 pages
Documentation & Report For Flyzy Flight Cancellation Project
No ratings yet
Documentation & Report For Flyzy Flight Cancellation Project
25 pages
PDF Experiments-1 DADV
No ratings yet
PDF Experiments-1 DADV
41 pages
Unit 42 Statistic For Management
No ratings yet
Unit 42 Statistic For Management
29 pages
Project Cardio Good Fitness
No ratings yet
Project Cardio Good Fitness
29 pages
BA64 Group6 Geo-Economy
No ratings yet
BA64 Group6 Geo-Economy
30 pages
AUTOMATED EDA Libraries
No ratings yet
AUTOMATED EDA Libraries
12 pages
5 Data Analytics Projects For Beginners - CourseraG
No ratings yet
5 Data Analytics Projects For Beginners - CourseraG
6 pages
The Landscape of R Packages For Automated Exploratory Data Analysis
No ratings yet
The Landscape of R Packages For Automated Exploratory Data Analysis
19 pages
PythonDASE - 2025 Version1
No ratings yet
PythonDASE - 2025 Version1
44 pages
Lesson 1 WINDOWS AND INTERNET
No ratings yet
Lesson 1 WINDOWS AND INTERNET
18 pages
Stat 100 - Statistics 1
No ratings yet
Stat 100 - Statistics 1
95 pages
Lesson 2 WORD 2010
No ratings yet
Lesson 2 WORD 2010
14 pages
Big Data (Imp-Questions)
No ratings yet
Big Data (Imp-Questions)
17 pages
AirBnB Data Analysis - HLD
No ratings yet
AirBnB Data Analysis - HLD
10 pages
Linear Regression Merged
No ratings yet
Linear Regression Merged
38 pages
2) Theoretical Background: 2.1 EDA (Exploratory Data Analysis)
No ratings yet
2) Theoretical Background: 2.1 EDA (Exploratory Data Analysis)
7 pages
Ai For IT Non Coders
No ratings yet
Ai For IT Non Coders
14 pages
Edap Lab
No ratings yet
Edap Lab
47 pages
Data Analytics
No ratings yet
Data Analytics
34 pages
Dev Answer Key
No ratings yet
Dev Answer Key
21 pages
Generative AI Foundations Certificate Brochure
No ratings yet
Generative AI Foundations Certificate Brochure
10 pages
Anurag's Resume - 6
No ratings yet
Anurag's Resume - 6
1 page
Exploratory Data Analysis: by Neha Mathur
No ratings yet
Exploratory Data Analysis: by Neha Mathur
14 pages
Python For Data Analysis
No ratings yet
Python For Data Analysis
84 pages
Unit I - Part I Notes
100% (7)
Unit I - Part I Notes
33 pages
Python For Data Exploration
No ratings yet
Python For Data Exploration
28 pages
DEV Manual - ESEC
No ratings yet
DEV Manual - ESEC
27 pages
Exploratory Data Analysis: by Neha Mathur
No ratings yet
Exploratory Data Analysis: by Neha Mathur
14 pages
DSP Unit - Ii
No ratings yet
DSP Unit - Ii
14 pages
Data Analysis
No ratings yet
Data Analysis
42 pages
Datascience
No ratings yet
Datascience
26 pages
Machine
No ratings yet
Machine
10 pages
Data Analytics Lab Manual Final1
No ratings yet
Data Analytics Lab Manual Final1
32 pages
Unit 1 - Intro To EDA
No ratings yet
Unit 1 - Intro To EDA
40 pages
Dev Record Final
No ratings yet
Dev Record Final
34 pages
IOT-Domain Analyst
No ratings yet
IOT-Domain Analyst
11 pages
SyamilFakhruddin - DS - Summary - Data Analysis
No ratings yet
SyamilFakhruddin - DS - Summary - Data Analysis
17 pages
Getting Started With Python Data Analysis - Sample Chapter
0% (1)
Getting Started With Python Data Analysis - Sample Chapter
17 pages
IBMK64B - Déjà Vu - Research Proposal 2
No ratings yet
IBMK64B - Déjà Vu - Research Proposal 2
6 pages
Data Science
No ratings yet
Data Science
42 pages
Ngan Thanh Tran Tran Thanh Ngan-11214243 415062 1736623253
No ratings yet
Ngan Thanh Tran Tran Thanh Ngan-11214243 415062 1736623253
2 pages
Data Analyst Course
No ratings yet
Data Analyst Course
8 pages
Data Analysis Resume
No ratings yet
Data Analysis Resume
2 pages
Unit - Iii - Eda
No ratings yet
Unit - Iii - Eda
25 pages
Project 11 Hotel Management
No ratings yet
Project 11 Hotel Management
3 pages
Final Dev Record
No ratings yet
Final Dev Record
49 pages
Dev Lab Manual
No ratings yet
Dev Lab Manual
35 pages
Complete Guide To Exploratory Data Analysis With Python Plotly - by Anar Abiyev - Mar, 2022 - Medium
No ratings yet
Complete Guide To Exploratory Data Analysis With Python Plotly - by Anar Abiyev - Mar, 2022 - Medium
11 pages
Learneverythingai
No ratings yet
Learneverythingai
9 pages
Summer Internship 2024
No ratings yet
Summer Internship 2024
2 pages
Project 03 Sales Management
No ratings yet
Project 03 Sales Management
3 pages
Project 07 Inventory
No ratings yet
Project 07 Inventory
3 pages
Unit 2
No ratings yet
Unit 2
58 pages
Unit 2, 3
No ratings yet
Unit 2, 3
9 pages
Pandas Complete + Visualisation Summary of IBM Visualization
No ratings yet
Pandas Complete + Visualisation Summary of IBM Visualization
21 pages
DEV Lab Manual-1
No ratings yet
DEV Lab Manual-1
27 pages
Ad3364 Data Exploration and Visualization Laboratory Syllabus L T P C
No ratings yet
Ad3364 Data Exploration and Visualization Laboratory Syllabus L T P C
2 pages
Data Mining Vs Data Exploration UNIT-II
No ratings yet
Data Mining Vs Data Exploration UNIT-II
11 pages
SameerBohra. Resume. Data Analyst
No ratings yet
SameerBohra. Resume. Data Analyst
1 page
Summary: Introduction To Data Visualization Tools
No ratings yet
Summary: Introduction To Data Visualization Tools
13 pages
Data Analysis With Python
No ratings yet
Data Analysis With Python
29 pages
Exploratory Data Analysis-1
No ratings yet
Exploratory Data Analysis-1
10 pages
Data Analyst Nanodegree Program - Syllabus
No ratings yet
Data Analyst Nanodegree Program - Syllabus
7 pages
Data Exploration Preparation
No ratings yet
Data Exploration Preparation
12 pages
Efficient Data Preparation: With Python
No ratings yet
Efficient Data Preparation: With Python
19 pages
Data Minds - Data Science Curriculum 2023 V2
No ratings yet
Data Minds - Data Science Curriculum 2023 V2
15 pages
Data Science Workflow
No ratings yet
Data Science Workflow
7 pages