Case Study 1

Uploaded by

mahesh Kumar

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

60 views4 pages

Case Study 1

Uploaded by

mahesh Kumar

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 4

Python certification training

Module 4: Introduction to NumPy,

Pandas & Matplotlib
Case Study

© Brain4ce Education Solutions Pvt. Ltd.

Module 4 – Introduction to NumPy, Pandas & Matplotlib

Case Study

1. Extract data from the given SalaryGender CSV file and store the data from each
column in a separate NumPy array

2. Find:
1. The number of men with a PhD
2. The number of women with a PhD

3. Use SalaryGender CSV file. Store the “Age” and “PhD” columns in one DataFrame
and delete the data of all people who don’t have a PhD

4. Calculate the total number of people who have a PhD degree from SalaryGender
CSV file.

5. How do you Count The Number Of Times Each Value Appears In An Array Of
Integers?
[0, 5, 4, 0, 4, 4, 3, 0, 0, 5, 2, 1, 1, 9]
Answer should be array([4, 2, 1, 1, 3, 2, 0, 0, 0, 1]) which means 0 comes 4 times,
1 comes 2 times, 2 comes 1 time, 3 comes 1 time and so on.

6. Create a numpy array [[0, 1, 2], [ 3, 4, 5], [ 6, 7, 8],[ 9, 10, 11]]) and filter the elements
greater than 5.

7. Create a numpy array having NaN (Not a Number) and print it.
array([ nan, 1., 2., nan, 3., 4., 5.])
Print the same array omitting all elements which are nan

8. Create a 10x10 array with random values and find the minimum and maximum
values.

9. Create a random vector of size 30 and find the mean value.

Module 4 – Introduction to NumPy, Pandas & Matplotlib

10. Create numpy array having elements 0 to 10 And negate all the elements between
3 and 9

11. Create a random array of 3 rows and 3 columns and sort it according to 1 st column,
2nd column or 3rd column.

12. Create a four dimensions array get sum over the last two axis at once.

13. Create a random array and swap two rows of an array.

14. Create a random matrix and Compute a matrix rank.

15. Analyse various school outcomes in Tennessee using pandas. Suppose you are a
public school administrator. Some schools in your state of Tennessee are
performing below average academically. Your superintendent, under pressure
from frustrated parents and voters, approached you with the task of understanding
why these schools are under-performing. To improve school performance, you
need to learn more about these schools and their students, just as a business needs
to understand its own strengths and weaknesses and its customers. Though you is
eager to build an impressive explanatory model, you know the importance of
conducting preliminary research to prevent possible pitfalls or blind spots. Thus,
you engages in a thorough exploratory analysis, which includes: a lit review, data
collection, descriptive and inferential statistics, and data visualization.
Phase 1 - Data Collection
Here is a data of every public school in middle Tennessee. The data also includes
various demographic, school faculty, and income variables. You need to convert the
data into useful information.
• Read the data in pandas data frame
• Describe the data to find more details
Phase 2 - Group data by school ratings
Chooses indicators that describe the student body (for example, reduced_lunch) or
school administration (stu_teach_ratio) hoping they will
explain school_rating. reduced_lunch is a variable measuring the average percentage
of students per school enrolled in a federal program that provides lunches for students
from lower-income households. In short, reduced_lunch is a good proxy for household
income.

Module 4 – Introduction to NumPy, Pandas & Matplotlib

Isolates ‘reduced_lunch’ and groups the data by ‘school_rating’ using pandas groupby
method and then uses describe on the re-shaped data
Phase 3 – Correlation analysis
Find the correlation between ‘reduced_lunch’ and ‘school_rating’. The values in the
correlation matrix table will be between -1 and 1. A value of -1 indicates the strongest
possible negative correlation, meaning as one variable decreases the other increases.
And a value of 1 indicates the opposite.
Phase 4 – Scatter Plot
Find the relationship between school_rating and reduced_lunch, Plot a graph with the
two variables on a scatter plot. Each dot represents a school. The placement of the dot
represents that school's rating (Y-axis) and the percentage of its students on reduced
lunch (x-axis). The downward trend line shows the negative correlation
between school_rating and reduced_lunch (as one increases, the other decreases). The
slope of the trend line indicates how much school_rating decreases
as reduced_lunch increases. A steeper slope would indicate that a small change
in reduced_lunch has a big impact on school_rating while a more horizontal slope
would indicate that the same small change in reduced_lunch has a smaller impact
on school_rating.
Phase 5 – Correlation Matrix
An efficient graph for assessing relationships is the correlation matrix, as seen below;
its color-coded cells make it easier to interpret than the tabular correlation matrix
above. Red cells indicate positive correlation; blue cells indicate negative correlation;
white cells indicate no correlation. The darker the colors, the stronger the correlation
(positive or negative) between those two variables. Draw a graph of correlation matrix
having all important fields of data frame.

Module1 DS
No ratings yet
Module1 DS
61 pages
Apuntes Azure Data Scientist
No ratings yet
Apuntes Azure Data Scientist
397 pages
Stock Card Drug Management
75% (4)
Stock Card Drug Management
4 pages
Student Analysis
No ratings yet
Student Analysis
16 pages
Options With Python
100% (1)
Options With Python
203 pages
Fdsa Lab Manual
No ratings yet
Fdsa Lab Manual
53 pages
Python Notes
No ratings yet
Python Notes
55 pages
Python For Data Science .
100% (4)
Python For Data Science .
112 pages
Fds Merged
No ratings yet
Fds Merged
102 pages
AL Notes
No ratings yet
AL Notes
61 pages
Karel Robot Book
100% (1)
Karel Robot Book
161 pages
CS1010S Lecture 11 - Visualising Data
No ratings yet
CS1010S Lecture 11 - Visualising Data
68 pages
Microsoft Ai Automate
No ratings yet
Microsoft Ai Automate
259 pages
Dser
No ratings yet
Dser
35 pages
Student Performance Analysis and Prediction
No ratings yet
Student Performance Analysis and Prediction
19 pages
Pds Record Document Ds II
No ratings yet
Pds Record Document Ds II
36 pages
De-Mystifying Math and Stats for Machine Learning: Mastering the Fundamentals of Mathematics and Statistics for Machine Learning
From Everand
De-Mystifying Math and Stats for Machine Learning: Mastering the Fundamentals of Mathematics and Statistics for Machine Learning
Seaport AI Madhavan
No ratings yet
Administare Netwrok and Peripheral Devices Information Sheet
88% (16)
Administare Netwrok and Peripheral Devices Information Sheet
54 pages
Dsa Lab Manual
No ratings yet
Dsa Lab Manual
35 pages
Udacity Enterprise Syllabus Data Analyst nd002
No ratings yet
Udacity Enterprise Syllabus Data Analyst nd002
16 pages
DAV Practicle File
No ratings yet
DAV Practicle File
28 pages
Python Practical Guide 2
No ratings yet
Python Practical Guide 2
18 pages
Data Science
No ratings yet
Data Science
42 pages
Machine Learning Project Report
No ratings yet
Machine Learning Project Report
65 pages
Practical List 2022-23
100% (1)
Practical List 2022-23
4 pages
Explorotary Data Analysis
100% (1)
Explorotary Data Analysis
30 pages
DAL EXT 1 and 2
No ratings yet
DAL EXT 1 and 2
125 pages
Machine Learning
No ratings yet
Machine Learning
30 pages
Data Preprocessing Python Tome III
No ratings yet
Data Preprocessing Python Tome III
12 pages
Notebook 2 - Linear Regression
No ratings yet
Notebook 2 - Linear Regression
11 pages
Data Preprocessing Python Tome I
No ratings yet
Data Preprocessing Python Tome I
10 pages
Data Analyst Nanodegree Program - Syllabus
50% (2)
Data Analyst Nanodegree Program - Syllabus
7 pages
Lab Manual: 18CS3262S Data Modelling and Visualization Techniques
33% (3)
Lab Manual: 18CS3262S Data Modelling and Visualization Techniques
17 pages
00 - Project - Your First Data Science Project - Jupyter Notebook
No ratings yet
00 - Project - Your First Data Science Project - Jupyter Notebook
8 pages
Visualization Worksheet
No ratings yet
Visualization Worksheet
8 pages
DALab Part-B BCU&BU
No ratings yet
DALab Part-B BCU&BU
12 pages
Chatgpt Prompt Engineering
50% (2)
Chatgpt Prompt Engineering
12 pages
Python Lab 9
No ratings yet
Python Lab 9
8 pages
Lab 3 & 4
No ratings yet
Lab 3 & 4
10 pages
Analyzing Student Performance in Exams Using Python
No ratings yet
Analyzing Student Performance in Exams Using Python
11 pages
AI Assignment 1&2 PDF
No ratings yet
AI Assignment 1&2 PDF
12 pages
MachineLearning Algorithm - Hope
No ratings yet
MachineLearning Algorithm - Hope
125 pages
Week2 Lab
No ratings yet
Week2 Lab
8 pages
Singh Project1 Report
No ratings yet
Singh Project1 Report
12 pages
Lab 2 - Basic Statistical Analysis
No ratings yet
Lab 2 - Basic Statistical Analysis
7 pages
Eda 4 5
No ratings yet
Eda 4 5
7 pages
DS Practical 01
No ratings yet
DS Practical 01
9 pages
Numpy and Pandas
No ratings yet
Numpy and Pandas
11 pages
Lab 13
No ratings yet
Lab 13
5 pages
Data Science in Society Cat
No ratings yet
Data Science in Society Cat
5 pages
9 Supervised Learning - II
No ratings yet
9 Supervised Learning - II
55 pages
Data Exploration and Analysis With Python
No ratings yet
Data Exploration and Analysis With Python
9 pages
Chatgpt Prompt Engineering
0% (1)
Chatgpt Prompt Engineering
9 pages
00 - Lesson - Data Science Workflow - Jupyter Notebook
No ratings yet
00 - Lesson - Data Science Workflow - Jupyter Notebook
6 pages
Case Study DSBDA
No ratings yet
Case Study DSBDA
12 pages
Manual Configuracao Honeywell Eclipse Ms 5145
No ratings yet
Manual Configuracao Honeywell Eclipse Ms 5145
117 pages
Feature Selection Engineering
No ratings yet
Feature Selection Engineering
72 pages
IDS Syllabus
No ratings yet
IDS Syllabus
5 pages
Data Analysis Lab - Final - 23-24
No ratings yet
Data Analysis Lab - Final - 23-24
11 pages
Data Analyst: Nanodegree Program Syllabus
No ratings yet
Data Analyst: Nanodegree Program Syllabus
16 pages
TAFJ JBC Remote Debugger
No ratings yet
TAFJ JBC Remote Debugger
10 pages
Data Science Syllabus
No ratings yet
Data Science Syllabus
4 pages
Home Assignment Dataliteracy
No ratings yet
Home Assignment Dataliteracy
4 pages
Dav End Sem
No ratings yet
Dav End Sem
2 pages
Data Analyst Nanodegree Program - Syllabus
No ratings yet
Data Analyst Nanodegree Program - Syllabus
7 pages
DAV Guidelines
No ratings yet
DAV Guidelines
4 pages
Univarite Hope
No ratings yet
Univarite Hope
103 pages
FDS Iat-2 Part-B
No ratings yet
FDS Iat-2 Part-B
4 pages
11 Association Rules Mining and Recommendation Systems
No ratings yet
11 Association Rules Mining and Recommendation Systems
70 pages
Nd002 Syllabus 2018 June v9
No ratings yet
Nd002 Syllabus 2018 June v9
5 pages
DAV Guidelines
No ratings yet
DAV Guidelines
4 pages
03 CP PDF
No ratings yet
03 CP PDF
8 pages
Https WWW - Irctc.co - in Cgi-Bin Bv60
No ratings yet
Https WWW - Irctc.co - in Cgi-Bin Bv60
1 page
2nd Quarter Exam Mil
100% (2)
2nd Quarter Exam Mil
3 pages
Numpy Advanced Functional Analysis Questions
No ratings yet
Numpy Advanced Functional Analysis Questions
1 page
Painless Statistics
From Everand
Painless Statistics
Barron's Educational Series
No ratings yet
Semi-Automated Exploratory Data Analysis (EDA) in Python - by Destin Gong - Mar, 2021 - Towards Data
No ratings yet
Semi-Automated Exploratory Data Analysis (EDA) in Python - by Destin Gong - Mar, 2021 - Towards Data
3 pages
Integrative Programming and Technology 1
No ratings yet
Integrative Programming and Technology 1
4 pages
IoT-Version2 FR
No ratings yet
IoT-Version2 FR
143 pages
Portable Jammer Vip Price List
No ratings yet
Portable Jammer Vip Price List
9 pages
MaheshKumar ApplicationForm
No ratings yet
MaheshKumar ApplicationForm
8 pages
Key Fact Statement
No ratings yet
Key Fact Statement
7 pages
SymphonyAI Overview
No ratings yet
SymphonyAI Overview
8 pages
Streaming Data Sample
No ratings yet
Streaming Data Sample
5 pages
31 R20 M1-Sept-2023
No ratings yet
31 R20 M1-Sept-2023
7 pages
Module 4 String Data Structure
No ratings yet
Module 4 String Data Structure
9 pages
CV 2022081018080367
No ratings yet
CV 2022081018080367
2 pages
Case Study 1
No ratings yet
Case Study 1
2 pages
Case Study 3
No ratings yet
Case Study 3
2 pages
Facerecognition Results Metrics
No ratings yet
Facerecognition Results Metrics
3 pages
Buying Guides Best Tablets For Photo Editing
No ratings yet
Buying Guides Best Tablets For Photo Editing
15 pages
Lab01 - Classical Cryptography
No ratings yet
Lab01 - Classical Cryptography
10 pages
HP DL380 G8: Hardware Module Description
No ratings yet
HP DL380 G8: Hardware Module Description
6 pages
004N - UG EVO 3 IP ENG 15 - 04 - 2021 - Compressed
No ratings yet
004N - UG EVO 3 IP ENG 15 - 04 - 2021 - Compressed
52 pages
50watts Audio Amplifier Using TDA7265
No ratings yet
50watts Audio Amplifier Using TDA7265
26 pages
Prepare a Database of Standardized Datasets Which Can Be Used for Training and Evolution of Models
No ratings yet
Prepare a Database of Standardized Datasets Which Can Be Used for Training and Evolution of Models
6 pages
04 Five Senses Printables
No ratings yet
04 Five Senses Printables
30 pages
Mathematical Physics A Modern Introduction To Its Foundations 2nd Edition 2024 Scribd Download
100% (3)
Mathematical Physics A Modern Introduction To Its Foundations 2nd Edition 2024 Scribd Download
28 pages
Magazine, Servo 11 2008
No ratings yet
Magazine, Servo 11 2008
85 pages
Booklet Primer Grado Insps 2024
No ratings yet
Booklet Primer Grado Insps 2024
42 pages
Image Steganography: Protection of Digital Properties Against Eavesdropping
No ratings yet
Image Steganography: Protection of Digital Properties Against Eavesdropping
8 pages
Digital Learning Resources and Support Features Matrix
No ratings yet
Digital Learning Resources and Support Features Matrix
9 pages
Genetic Algorithms
No ratings yet
Genetic Algorithms
14 pages
Introduction To Optimization: Class Notes On: Mathematical Foundations in Engineering, ECEG 6209
No ratings yet
Introduction To Optimization: Class Notes On: Mathematical Foundations in Engineering, ECEG 6209
34 pages
Database Deign UG - G Assignment 1 Semester 1 2021
No ratings yet
Database Deign UG - G Assignment 1 Semester 1 2021
4 pages
Bread, Milk Bread, Diapers, Beer, Eggs Bread, Diapers, Beer, Cola Bread, Milk, Diapers, Beer Bread, Milk, Diapers, Cola
No ratings yet
Bread, Milk Bread, Diapers, Beer, Eggs Bread, Diapers, Beer, Cola Bread, Milk, Diapers, Beer Bread, Milk, Diapers, Cola
4 pages
Moscad-L: SCADA Remote Terminal Unit
No ratings yet
Moscad-L: SCADA Remote Terminal Unit
2 pages
Kamal Sir Cabin: S.No. Item Reuse in 206 Where and How
No ratings yet
Kamal Sir Cabin: S.No. Item Reuse in 206 Where and How
2 pages
Choose An OTA For The Apple Watch Series 3 (42mm) IPSW Downloads
No ratings yet
Choose An OTA For The Apple Watch Series 3 (42mm) IPSW Downloads
1 page

Case Study 1

Uploaded by

Case Study 1

Uploaded by

Python certification training

Module 4: Introduction to NumPy,

© Brain4ce Education Solutions Pvt. Ltd.

9. Create a random vector of size 30 and find the mean value.

©Brain4ce Education Solutions Pvt. Ltd Page 1

13. Create a random array and swap two rows of an array.

14. Create a random matrix and Compute a matrix rank.

©Brain4ce Education Solutions Pvt. Ltd Page 2

©Brain4ce Education Solutions Pvt. Ltd Page 3

You might also like