
A2 Midterm QP

The document outlines the Mid-Term Lab Examination for the Department of Data Science and Computer Applications, scheduled for September 10, 2024. It includes meta data information regarding COVID-19 statistics and provides a series of tasks for exploratory data analysis using a dataset named 'Covid.csv'. The tasks involve data cleaning, analysis, and visualization using Python and pandas.

Uploaded by

batmanflyinsky

Department of Data Science and Computer Applications

Mid-Term Lab Examination


Section A, Batch 2        Date: 10th September 2024

Metadata Information


1. Serial Number: Serial number of the record
2. Country: Name of the country
3. Total Cases: Total number of COVID-19 cases
4. Total Deaths: Total number of deaths due to COVID-19
5. Total Recovered: Total number of patients recovered
6. Active Cases: Number of active cases
7. Total Test: Total number of medical tests conducted
8. Population: Total population of the country

Perform exploratory data analysis on “Covid.csv” by answering the following questions using Python.

For Writing

1. Determine whether the dataset contains missing values and get the total count of missing values. (1)
2. Using an appropriate pandas function, determine the total number of records in the DataFrame. (1)
3. Obtain the details of the different countries affected by COVID-19. (1)
4. Identify the countries that may contain noisy values for the attribute “Active Cases” using the IQR method. (3)
5. Calculate the total number of active cases across all countries. (2)
6. What is the total number of COVID-19 cases reported in India? (2)
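
One possible way to approach the writing questions above is sketched below. It assumes the CSV column headers match the metadata names exactly ("Country", "Total Cases", "Active Cases"), which the paper does not confirm. The helper function names are made up for illustration, question 3 is read here as "list the distinct countries", and the 1.5 × IQR fence is the conventional choice rather than something the paper specifies.

```python
import pandas as pd


def missing_value_count(df: pd.DataFrame) -> int:
    """Q1: total number of missing cells across all columns."""
    return int(df.isna().sum().sum())


def record_count(df: pd.DataFrame) -> int:
    """Q2: number of rows (records); df.shape[0] gives the same value."""
    return len(df)


def affected_countries(df: pd.DataFrame) -> list:
    """Q3: distinct country names, in order of first appearance."""
    return df["Country"].dropna().unique().tolist()


def iqr_outlier_countries(df: pd.DataFrame,
                          column: str = "Active Cases",
                          k: float = 1.5) -> list:
    """Q4: countries whose value lies outside [Q1 - k*IQR, Q3 + k*IQR]."""
    q1 = df[column].quantile(0.25)
    q3 = df[column].quantile(0.75)
    iqr = q3 - q1
    lower, upper = q1 - k * iqr, q3 + k * iqr
    mask = (df[column] < lower) | (df[column] > upper)
    return df.loc[mask, "Country"].tolist()


def total_active_cases(df: pd.DataFrame):
    """Q5: sum of active cases over all countries (missing values are skipped)."""
    return df["Active Cases"].sum()


def total_cases_for(df: pd.DataFrame, country: str = "India"):
    """Q6: total cases for one country, matched by exact name."""
    return df.loc[df["Country"] == country, "Total Cases"].sum()
```

With the exam file in the working directory, `df = pd.read_csv("Covid.csv")` followed by, for example, `iqr_outlier_countries(df)` would list the candidate countries. If the real headers differ in case or spacing, the column names in the sketch would need adjusting.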

For Execution

1. Drop the attribute “Serial Number” from the dataset. (1)
2. Replace the missing values in the attribute “Total Deaths” with the median of the attribute. (2)
3. Find the country with the highest number of deaths. (2)
4. Visualize the top 10 countries with the highest “Total Cases” where “Total Deaths” is greater than 100000. (2)
5. Analyse the relationship between “Total Cases”, “Total Deaths” and “Total Recovered” using an appropriate plotting method. (3)
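
The execution tasks above could be sketched along the following lines. This is only one reasonable reading: the column names are assumed to match the metadata list exactly, the function names are hypothetical, and a bar chart and a scatter matrix are chosen here as the plotting methods, although the paper leaves that choice open.

```python
import pandas as pd
import matplotlib.pyplot as plt


def clean(df: pd.DataFrame) -> pd.DataFrame:
    """Tasks 1-2: drop 'Serial Number', fill missing 'Total Deaths' with the median."""
    out = df.drop(columns=["Serial Number"], errors="ignore")  # returns a new frame
    median_deaths = out["Total Deaths"].median()  # NaN values are skipped by default
    out["Total Deaths"] = out["Total Deaths"].fillna(median_deaths)
    return out


def deadliest_country(df: pd.DataFrame) -> str:
    """Task 3: country in the row holding the maximum 'Total Deaths'."""
    return df.loc[df["Total Deaths"].idxmax(), "Country"]


def plot_top10_cases(df: pd.DataFrame, min_deaths: int = 100_000):
    """Task 4: bar chart of the 10 largest 'Total Cases' among high-death countries."""
    subset = df[df["Total Deaths"] > min_deaths].nlargest(10, "Total Cases")
    ax = subset.plot.bar(x="Country", y="Total Cases", legend=False)
    ax.set_ylabel("Total Cases")
    ax.set_title(f"Top 10 countries by total cases (deaths > {min_deaths})")
    return ax


def plot_relationships(df: pd.DataFrame):
    """Task 5: pairwise scatter plots of cases, deaths and recoveries."""
    cols = ["Total Cases", "Total Deaths", "Total Recovered"]
    return pd.plotting.scatter_matrix(df[cols], figsize=(8, 8), diagonal="hist")
```

A typical session would call `clean(pd.read_csv("Covid.csv"))` first, pass the result to the other helpers, and finish with `plt.show()` to display the figures. A correlation heatmap would be an equally defensible answer to the last task.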
