0% found this document useful (0 votes)

102 views58 pages

Data Set Exploration in Python - v1 - Students

This document outlines a class on data exploration using Python. It discusses importing data, identifying variables, handling missing data through various techniques like deletion and imputation, and exploring data through univariate and bivariate analysis. The class uses ICU clinical data from MIMIC-III and Python with libraries like Pandas, Numpy, and Matplotlib. [END SUMMARY]

Uploaded by

Shawn Wenren

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

102 views58 pages

Data Set Exploration in Python - v1 - Students

Uploaded by

Shawn Wenren

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 58

Data Exploration with

Python
Andrew Michelson, MD
Pulmonary/Critical Care
Institute for Informatics
Washington University School of Medicine in St. Louis

February 17, 2020

Institute for Informatics (I 2)

Disclosures
No relevant financial disclosures.

Many topics could be their own courses, so this will be a brief overview

The best techniques to analyze and clean your data will depend on the question your
asking and data you have

Institute for Informatics (I 2)

Class Structure

Institute for Informatics (I 2)

Objectives
1. Learn how to import data into Python

2. Discuss variable identification

3. Explore missing data and discuss its management

4. Explore univariate & bivariate analyses

5. Discuss outlier assessment and management

6. Explore data transformation

Institute for Informatics (I 2)

The Data
Source: MIMIC-III Demo Data

Contents:
• Vital Signs: Blood pressure, heart rate, respiratory rate, etc…

• Laboratory Values: White Blood Cell Count, Potassium, etc…

• And more, but we won’t use any of that today

Institute for Informatics (I 2)

The Working Environment
1. Python

2. jupyter-notebook

3. Import libraries
A. Pandas
B. Numpy
C. Seaborn
D. Datetime
E. Matplotlib
F. Scipy.stats

Institute for Informatics (I 2)

Importing Data Into Python
1. Python is a versatile and powerful language that can accept data from
many formats

2. In this class we import CSV documents from the MIMIC-III demo data

3. Use: dfNAME = pd.read_csv(filepath/filename, sep = ’,’)

Institute for Informatics (I 2)

Importing Data Into Python

Jupyer-Notebook
• Open Jupyter-Notebook
• Run Section 2: Import Libraries for DataSet Exploration
• Fill in the blank to import the following files:
• ICUSTAYS.csv
• PATIENTS.csv
• D_ITEMS.csv
• D_LABITEMS.csv

Institute for Informatics (I 2)

Variable Identification
Variable Name: Variable name

Variable type:
• Continuous (ex, age)
• Categorical (ex, sex)

Data Type:
• String
• Category
• Integer
• Float
• ManyString

Independent vs Dependent:

Institute for Informatics (I 2)

Variable Identification
Identify your variables:

>> DataFrame.head( )

Patients dataframe

Note: you can use >> DataFrame.tail( ) to view the tail rows of the data frame

By adding in a number within the parenthesis you can specify how many rows to view

Institute for Informatics (I 2)

Variable Identification
View your data frame

ICU Stays

Institute for Informatics (I 2)

Variable Identification
How do we know how many rows and columns we have in total?

>> DataFrame.shape

How do we know the type of the data type?

>> DataFrame.info()

Institute for Informatics (I 2)

Variable Identification
Remove Extraneous Information that takes up space (visible and memory)

>> DataFrame.drop(items, axis, inplace)

Institute for Informatics (I 2)

Variable identification in Python

Go to section 3.0.1 and fill in the *** to start identifying your

variables

Complete until section 3.2: Merge Patients & ICU Data to

Create a single DataFrame

Institute for Informatics (I 2)

Manipulating Data in Python
Often data is collected from different sources and then
merged together for analysis.

>> DataFrame1.merge(DataFrame2, how = “left/right”,

on=[‘’])

After a merge, double check the shape, to make sure you

merged correctly

Institute for Informatics (I 2)

Variable identification in Python

Go to section 3.2: Merge Patients & ICU Data to create a single

DataFrame

Check the size of the new DataFrame to confirm a successful

merge

Institute for Informatics (I 2)

Missing Data
Very Common in clinical data

Why is data missing?

• Data extraction
• Data collection

Institute for Informatics (I 2)

Missing Data Categorization
1. Missing completely at random:
• The propensity for a data point to be missing is completely
random and not dependent on observed or unobserved data

2. Missing at random:
• Systematic differences between the missing and observed values,
but these can be entirely explained by other observed variables

Institute for Informatics (I 2)

Missing Data Categorization
3. Missing not at random
• There is a relationship between the propensity of a value to be
missing and it’s values

Institute for Informatics (I 2)

Missing Data Treatment

Adapted from: https://medium.com/ibm-data-science-experience/missing-data-conundrum-exploration-and-imputation-techniques-9f40abe0fd87

Institute for Informatics (I 2)

Missing Data: Case Deletion

List Wise Pair Wise

Delete all data Analyze all cases

where any where data is
missing available
value is present

Institute for Informatics (I 2)

Missing Data: Imputation
Goal is to fill missing data with estimated values

Most common methods: mean/median/mode:

• Population-wide
• Cohort-wide

Institute for Informatics (I 2)

Missing Data: Statistical-Model Imputation
Linear Regression
• Limitations:
• Reduces variability
• Overestimates the model fit and correlation coefficient

K-nearest Neighbor Imputation

• Limitations:
• The choice of k critical in getting desired results
• Very slow

Institute for Informatics (I 2)

Missing Data: Statistical-Model Imputation
Multiple Imputation by Chained Equations (MICE)
• Assumes data is missing at random
• Runs multiple regression models
• Each value is modeled conditionally
• Multiple data sets are made (usually at least 10)

Institute for Informatics (I 2)

Assessing Missing data in Python
Look for null entries
>>DataFrame.isnull( ).sum

Look for non-null entries

>>DataFrame.notnull( ).sum

Institute for Informatics (I 2)

Assessing Missing Data

Go to section 3.3: Assess Missing Data in NEW Patients

DataFrame and complete UP TO, but not including Import Vital
Signs

Institute for Informatics (I 2)

Data Mapping
Process of extracting and unifying data for further analysis

Measurements of interest could be mixed with measurements

not of interest

The same value can have different names

• Sometimes the differences in names is important, other
times its not

Occurs in many data sets, including MIMIC-III

Institute for Informatics (I 2)

Data Mapping
Vital Signs:
• Blood Pressure (systolic/diastolic)
• Heart Rate
• Respiratory Rate
• Oxygen saturation (%)
• Temperature

In MIMIC-III vital signs are mixed with other measurements in

the CHARTEVENTS.CSV

Institute for Informatics (I 2)

Data Mapping with Vital Signs
Systolic Blood Pressure Synonyms in THIS dataset:
• Non Invasive Blood Pressure systolic',
• 'Arterial Blood Pressure systolic',
• 'Manual Blood Pressure Systolic Left',
• 'Manual Blood Pressure Systolic Right’,

Institute for Informatics (I 2)

Data Mapping with Vital Signs
Count variable frequency
>> DataFrame.series.value_counts( )

Institute for Informatics (I 2)

Data Mapping with Dictionaries

Dictionaries are data structures

that consist of an unordered
collections of key-value pairs
that can be changed

Dictionary = {
<key>: <value>
}

Institute for Informatics (I 2)

Data Mapping with Vital Signs
To accommodate synonyms, or extract items of interest from a
larger data set, you can use a dictionary

Institute for Informatics (I 2)

Import the remaining data and assess
missingness

Go to section 4.2 Import Vital Signs complete up to section 5:

Univariate & Bivariate Analysis

Institute for Informatics (I 2)

Univariate Analysis
Explore variables individually

Basic descriptive analysis

Central Tendency Measure Dispersion Visualization

Mean Interquartile Range Histogram
Median Standard Deviation/ Box plot
Variance
Mode Skewness
Min Kurtosis
Max

Institute for Informatics (I 2)

Univariate Analysis: Skewness
Measure of the asymmetry of the probability distribution of a variable
• Positive or Right
• Negative or Left

Grading Skewness Severity

• Minimal: -0.5 and 0.5
• Moderate: -1 and -0.5 or 0.5 and 1
• Severe: < -1 or >1
https://en.wikipedia.org/wiki/Skewness

Institute for Informatics (I 2)

Univariate Analysis: Kurtosis
“The kurtosis parameter is a measure of the combined weight of the tails relative to the rest
of the distribution.”

Kurtosis >3: Positive

No Kurtosis/Normal

Kurtosis <3: Negative

https://www.spcforexcel.com/knowledge/basic-statistics/are-skewness-and-kurtosis-useful-statistics#kurtosis
https://bishalbanksonfinance.wordpress.com/tag/probabality-distribution/

Institute for Informatics (I 2)

Bivariate Analysis
A method to determine the relationship between 2 variables

1. Visualization: Scatter plots

2. Regression analysis: Find the equation for the line or curve that best fits the data

3. Correlation coefficients: A measure of association between two data points

Institute for Informatics (I 2)

Outliers
What is an outlier?

• A data point that appears far away and diverges from the overall pattern in a sample
• Can be univariate or bivariate

Institute for Informatics (I 2)

Outliers
How do outliers occur?
• Natural
• Sampling error
• Data entry error
• Data processing error
• Measurement error
• Intentional outlier
• Experimental error

Institute for Informatics (I 2)

Outliers
Why are they important?

• Alters population variance, leading to non-normal data distributions

• Alters performance of downstream analyses
• Biases results

How do you detect outliers?

• Visualization
• Bar charts
• Box plots
• Scatter plots (looking for bivariate outliers)
• There are many, many ways, but we will focus on visualization today!

Institute for Informatics (I 2)

Outliers: Univariate

Institute for Informatics (I 2)

Outliers: Univariate

Institute for Informatics (I 2)

Outliers: Bivariate

Institute for Informatics (I 2)

Outliers
How do you treat outliers? (Subject for an entire course!)

• Delete observations:
• Data entry error
• Data processing error
• Very few (subjective)

• Transform values
• Log conversion
• Binning
• Differential observation weights

• Impute
• Would avoid with natural outliers

• Treat outliers as a separate category

Institute for Informatics (I 2)

Assessing Data in Python: Pivot Tables
DataFrames must be properly structured before they can be plotted

Patient Label Value

John Smith Heart Rate 75
John Smith Respiratory Rate 15

Patient Heart Rate Respiratory Rate

John Smith 75 15

DataFrame.pivot_table(values = 'value', index = [‘columns’], columns='label')

Institute for Informatics (I 2)

Visualize Data Within Python
Declare the graph properties
>> fig, ax = plt.subplots(rows,columns, figsize = (width,height))

Locate a subset of data from within the larger dataframe

>> DataFrame.loc[DataFrame.column == ‘columnname’, ‘return column name']

Use Seaborn to make distribution and boxplots

>> sns.distplot(data, ax=ax[ X ])

>> sns.boxplot(x = data, ax = ax[ X ])

Pivot your dfce

>>DataFrame.pivot_table(values = 'value', index = [‘columns’],
columns='label').reset_index()

Use Seaborn to plot bivariate data

>>sns.pairplot(pivoted table)

Institute for Informatics (I 2)

Visualize Data Within Python
Seaborn can make a heatmap to help you more rapidly identify correlations
>> sns.heatmap(dflabs.corr(), vmax = 1)

Institute for Informatics (I 2)

Univariate & Bivariate Visualization with
Vital Signs

Go to section 5: Univariate & Bivariate Analysis and complete

until section 6: Data Transformation

Institute for Informatics (I 2)

Data Transformation
Skewed data
• Skewed data can violate model assumptions (logistic regression)
• Amplify a class imbalance, degrading model performance towards the tail of the
distribution

Heteroskedasticity
• The relationship between two variables shows increasing scatter (non-constant standard
error) at extremes of measurement of the dependent variable
• Two forms:
• Conditional: Unpredictable volatility
• Unconditional: Predictable volatility

Institute for Informatics (I 2)

Data Transformation: Heteroskedasticity
Conditional

Institute for Informatics (I 2)

Data Transformation: Heteroskedasticity
Unconditional

Institute for Informatics (I 2)

Data Transformation
Way to improve skewness and heteroskedasticity is to normalize your data
• Remove/manage outliers
• Log
• Cube Root
• Binning
• Normalization
• Sigmoid
• Hyperbolic tangent
• Etc…

Again, there are many different ways to do this and the best way will depend on your
planned analyses and the question you are answering

Institute for Informatics (I 2)

Data Transformation
To perform the log function on data, you take a Pandas Series as such:
>> DataFrame.Column = np.log(DataFrame.column)

To raise a value to the cube root

>> DataFrame.Column = DataFrame.column**(1/3)

Institute for Informatics (I 2)

Data Transformation

Go to section 6: Data Transformation and go until the end!

Institute for Informatics (I 2)

Questions?
Thank you!

Institute for Informatics (I 2)

References:
1. Grus, Joel. Data Science from Scratch. O’Reilly Media;2015.
2. Marcellino, P. Comprehensive data exploration with python.
https://www.kaggle.com/pmarcelino/comprehensive-data-exploration-with-python. 2/2018. Accessed:
2/12/2020.
3. Sheridan, E. Un-bottling the data. 12/2/2019.
https://towardsdatascience.com/un-bottling-the-data-2da3187fb186. Accessed: 2/12/2020.
4. Ojeda, T. Data exploration with python, part 3.
https://www.districtdatalabs.com/data-exploration-with-python-3. Accessed: 2/12/20.
5. Sunil, R. A comprehensive guide to data exploration.
https://www.analyticsvidhya.com/blog/2016/01/guide-data-exploration/#two. Accessed: 2/12/2020.
6. Bratkovics, C. Exploratory data analysis tutorial in Python.
https://towardsdatascience.com/exploratory-data-analysis-tutorial-in-python-15602b417445. 6/16/19.
Accessed: 2/12/20.
7. Sunil, R. Ultiamte guide for data exomploration in Python using Numpy, Matplotlib and Pandas.
https://www.analyticsvidhya.com/blog/2015/04/comprehensive-guide-data-exploration-sas-using-python-nump
y-scipy-matplotlib-pandas/
. 4/9/2015. Accessed: 2/12/2020.
8. Akinfaderin, W. Missing data conundrum: exploration and imputation techniques.
https://medium.com/ibm-data-science-experience/missing-data-conundrum-exploration-and-imputation-techni
ques-9f40abe0fd87
. 9/11/2017. Accessed: 2/12/20.
9. Wade, C. Transforming skewed data. https://towardsdatascience.com/transforming-skewed-data-73da4c2d0d16.
8/21/2019. Accessed: 2/20/20.
10. Chow, J. Log transformation base for data linearization does not matter.
https://towardsdatascience.com/log-transformation-base-for-data-linearization-does-not-matter-22eb3c1463d0.
6/27/2019. Accessed: 2/12/20. Institute for Informatics (I 2)
11. Azur MJ, Stuart EA, Franggakis C, Leaf PJ. Multiple imputation by chained equations: what is it and how does it
Thank you!

Institute for Informatics (I 2)

(Monographs On Statistics and Applied Probability 113) Lang Wu-Mixed Effects Models For Complex Data-CRC Press (2010)
No ratings yet
(Monographs On Statistics and Applied Probability 113) Lang Wu-Mixed Effects Models For Complex Data-CRC Press (2010)
440 pages
Barclays Data Engineer Interview Questions
No ratings yet
Barclays Data Engineer Interview Questions
17 pages
Naveen Python - For - Data-Science-Report
100% (1)
Naveen Python - For - Data-Science-Report
24 pages
AWS Machine Learning Specialty
100% (1)
AWS Machine Learning Specialty
67 pages
ML0101EN Clus K Means Customer Seg Py v1
100% (1)
ML0101EN Clus K Means Customer Seg Py v1
8 pages
Course Material Tableau
No ratings yet
Course Material Tableau
54 pages
HTML Tables and Forms (PDFDrive)
100% (1)
HTML Tables and Forms (PDFDrive)
68 pages
OS by JJsir
No ratings yet
OS by JJsir
269 pages
Data Mining:: Concepts and Techniques
100% (1)
Data Mining:: Concepts and Techniques
63 pages
Knime Bigdata Energy Timeseries Whitepaper
No ratings yet
Knime Bigdata Energy Timeseries Whitepaper
37 pages
HTML5 Tag Reference PDF
No ratings yet
HTML5 Tag Reference PDF
278 pages
Natural Language Toolkit NLTK PDF
No ratings yet
Natural Language Toolkit NLTK PDF
23 pages
Python Notes
No ratings yet
Python Notes
279 pages
Python CS1002 PDF
No ratings yet
Python CS1002 PDF
210 pages
Mongo Performance Tuning MongoSeattle 2012
100% (1)
Mongo Performance Tuning MongoSeattle 2012
20 pages
PedsQL Scoring
No ratings yet
PedsQL Scoring
158 pages
SMOTE For Imbalanced Classification With Python
No ratings yet
SMOTE For Imbalanced Classification With Python
75 pages
Social Media Tourism - Capstone Project
No ratings yet
Social Media Tourism - Capstone Project
13 pages
Azure ML Tutorial 1
No ratings yet
Azure ML Tutorial 1
32 pages
11.CSS Margins, Padding, Height
No ratings yet
11.CSS Margins, Padding, Height
81 pages
Python For Non-Programmers Final
No ratings yet
Python For Non-Programmers Final
218 pages
Data Science ML Full Stack 2022 GitHub
No ratings yet
Data Science ML Full Stack 2022 GitHub
9 pages
SENG419-python 98745
No ratings yet
SENG419-python 98745
103 pages
KNIME Introduction 2023-07
No ratings yet
KNIME Introduction 2023-07
52 pages
Machine Learning: Linear Models For Classification 1
No ratings yet
Machine Learning: Linear Models For Classification 1
30 pages
M1 - Introducing Google Cloud v5.2 - ILT
No ratings yet
M1 - Introducing Google Cloud v5.2 - ILT
69 pages
Data Quality and Cleaning
No ratings yet
Data Quality and Cleaning
9 pages
Chapter8 Structuring System Data Requirements
No ratings yet
Chapter8 Structuring System Data Requirements
64 pages
Data Manipulation With Pandas
No ratings yet
Data Manipulation With Pandas
39 pages
XML Tutorial
No ratings yet
XML Tutorial
33 pages
Technologies Every Web Developer Should Be Able To Explain
No ratings yet
Technologies Every Web Developer Should Be Able To Explain
4 pages
Weka Lab
No ratings yet
Weka Lab
11 pages
Data Mining N Business Intelligence
No ratings yet
Data Mining N Business Intelligence
63 pages
Read & Download (PDF Kindle)
No ratings yet
Read & Download (PDF Kindle)
5 pages
Data Visualization For Industry 4
No ratings yet
Data Visualization For Industry 4
3 pages
2017-Asec-Thomas Darimont-Open Source Identity Management Mit Keycloak-Praesentation
No ratings yet
2017-Asec-Thomas Darimont-Open Source Identity Management Mit Keycloak-Praesentation
39 pages
Data Science Course Content
No ratings yet
Data Science Course Content
8 pages
Data Mining 101
No ratings yet
Data Mining 101
50 pages
SETLabs Briefings Software Validation
No ratings yet
SETLabs Briefings Software Validation
75 pages
Data Mining Dan Bigdata
No ratings yet
Data Mining Dan Bigdata
38 pages
Python For Multivariate Analysis
No ratings yet
Python For Multivariate Analysis
47 pages
DATA Mining
No ratings yet
DATA Mining
55 pages
Socket Programming in Python
No ratings yet
Socket Programming in Python
7 pages
Silabus Sekolah Fullstack
No ratings yet
Silabus Sekolah Fullstack
21 pages
Css
No ratings yet
Css
22 pages
.. ML Lab 07
No ratings yet
.. ML Lab 07
25 pages
Bigquery
No ratings yet
Bigquery
25 pages
AWID For IntrusionCISS2019
No ratings yet
AWID For IntrusionCISS2019
6 pages
Python Django Presentation
No ratings yet
Python Django Presentation
16 pages
Introduction To Object Detection
No ratings yet
Introduction To Object Detection
24 pages
Data Mining - Density Based Clustering
No ratings yet
Data Mining - Density Based Clustering
8 pages
Big Data Tools 2 - Apache Spark With PySpark
No ratings yet
Big Data Tools 2 - Apache Spark With PySpark
33 pages
Image Classification Using Pre-Trained Convolutional Neural Network in COLAB
No ratings yet
Image Classification Using Pre-Trained Convolutional Neural Network in COLAB
6 pages
AHDAdv Cust Guide
No ratings yet
AHDAdv Cust Guide
361 pages
Anomaly Detection: Course: Data Mining II
No ratings yet
Anomaly Detection: Course: Data Mining II
12 pages
? Class 10 AI Part B Most Important Questions For Board Exam Barkha
No ratings yet
? Class 10 AI Part B Most Important Questions For Board Exam Barkha
233 pages
GCLUTO - An Interactive Clustering, Visualization, and Analysis System
No ratings yet
GCLUTO - An Interactive Clustering, Visualization, and Analysis System
10 pages
PHP Syllabus
No ratings yet
PHP Syllabus
3 pages
Data Preprocessing Python 1
No ratings yet
Data Preprocessing Python 1
3 pages
Linking Information Systems To The Business Plan
No ratings yet
Linking Information Systems To The Business Plan
10 pages
Django - Overview: MVC Pattern
No ratings yet
Django - Overview: MVC Pattern
3 pages
Inquiries, Investigation and Immersion
No ratings yet
Inquiries, Investigation and Immersion
31 pages
Big Data and Spark Developers
No ratings yet
Big Data and Spark Developers
5 pages
Python Pyramid Program
No ratings yet
Python Pyramid Program
4 pages
Software Testing Notes
No ratings yet
Software Testing Notes
3 pages
Technical Report Writing For Ca2 Examination: Topic: Introduction To Data Science
No ratings yet
Technical Report Writing For Ca2 Examination: Topic: Introduction To Data Science
7 pages
Jurnal Cluster Randomised CT Dan CONSORT (Shelly, Yuliarni, Anita, Dwi Mayang, Dicky, Alamsyah)
No ratings yet
Jurnal Cluster Randomised CT Dan CONSORT (Shelly, Yuliarni, Anita, Dwi Mayang, Dicky, Alamsyah)
11 pages
Joint Engagement Is A Potential Mechanism Leading To Increased Initiations of Joint Attention and Downstream Effects On Language: JASPER Early Intervention For Children With ASD
No ratings yet
Joint Engagement Is A Potential Mechanism Leading To Increased Initiations of Joint Attention and Downstream Effects On Language: JASPER Early Intervention For Children With ASD
8 pages
A Randomised Clinical Pilot Trial To Test The Effectiveness of Parent Training With Video Modelling To Improve Functioni
No ratings yet
A Randomised Clinical Pilot Trial To Test The Effectiveness of Parent Training With Video Modelling To Improve Functioni
16 pages
Geldium EDA Summary Report Template Filled
No ratings yet
Geldium EDA Summary Report Template Filled
2 pages
Data Management Under Spss
No ratings yet
Data Management Under Spss
41 pages
Effects of An Intervention Designed To Enhance Romantic Relationship Excitement: A Randomized-Control Trial
No ratings yet
Effects of An Intervention Designed To Enhance Romantic Relationship Excitement: A Randomized-Control Trial
14 pages
Stata
No ratings yet
Stata
33 pages
CS ELEC 4 Midterm Module
No ratings yet
CS ELEC 4 Midterm Module
59 pages
Anderson 1975
No ratings yet
Anderson 1975
19 pages
Do Educated Leaders Matter - Besley, Montalvo, Reynal-Querol
No ratings yet
Do Educated Leaders Matter - Besley, Montalvo, Reynal-Querol
20 pages
Abhishek Tripathi
No ratings yet
Abhishek Tripathi
13 pages
Analysis and Interpretation of Censored Cost Data Using Real-World Evidence: A Step-By-Step Approach
No ratings yet
Analysis and Interpretation of Censored Cost Data Using Real-World Evidence: A Step-By-Step Approach
31 pages
2 Data Preperation
No ratings yet
2 Data Preperation
21 pages
CIEA Term Project
No ratings yet
CIEA Term Project
19 pages
Integrating Data From Different Sources
No ratings yet
Integrating Data From Different Sources
11 pages
ET - Project Presentation Solution
No ratings yet
ET - Project Presentation Solution
29 pages
Tourist Attractiveness Measuring Residents Perception of Tourists PDF
No ratings yet
Tourist Attractiveness Measuring Residents Perception of Tourists PDF
20 pages
Meyer Et Al (2009)
No ratings yet
Meyer Et Al (2009)
23 pages
9580 ANIL PANDEY Anil DATA Week 4 Assignment 4 833985 1072711078
No ratings yet
9580 ANIL PANDEY Anil DATA Week 4 Assignment 4 833985 1072711078
16 pages
Lab Assessment 2 - Question
No ratings yet
Lab Assessment 2 - Question
2 pages
PCA in R
No ratings yet
PCA in R
11 pages
IAT-II FDS-Answer Key
No ratings yet
IAT-II FDS-Answer Key
11 pages
Additional MCQs Data and Collection
No ratings yet
Additional MCQs Data and Collection
5 pages
Optimizing Hadoop for MapReduce
From Everand
Optimizing Hadoop for MapReduce
Khaled Tannir
No ratings yet