Covid Data Report
Covid Data Report
Maharashtra Mahavidyalaya,
Nilanga
NAAC Re-accredited “B+” Grade College
Affilated to
Swami Ramanand Teerth Marathwada University-Nanded
A Project
Report on
Covid Data Analysis and visualization
Master of Science
By:
Baheti Amruta Ishwarprsad
In
Year: 2024 – 2025
Maharashtra Shikshan Samiti’s
Maharashtra Mahavidyalaya,
Nilanga
CERTIFICATE
This is to certify that the project entitled “Covid Data Analysis
and visualization” has been carried out by Baheti Amruta
Ishwarprasad under my guidance in partial fulfillment of the
degree i.e. Bachelor of Computer Science of SRTMU, Nanded
during the academic year 2023-2024
Besides we would like to thank all staff members who helped us by giving
advice and providing equipment which we needed.
Last but not in least we would like to thank all who helped and motivated us.
COVID-2019 has been recognized as a global threat, and several studies are being
conducted in order to contribute to the fight and prevention of this pandemic. This work
presents a scholarly production dataset focused on COVID-19, providing an overview of
scientific research activities, making it possible to identify countries, total test & death cases .
The dataset is composed of 40,212 records of articles’ metadata collected from Scopus,
PubMed, arXiv and bioRxiv databases from January 2019 to July 2020.
Those data were extracted by using the techniques of Python Web Scraping and
preprocessed with Pandas Data Wrangling. It is visualized using Plotly Express in Python. It is
used to create dozens of bar charts, line graphs, bubble charts, scatter plots. Envisioning
COVID-19 will primarily be using Plotly Express for this project. The analysis and
visualization enable people to understand complex scenarios and make predictions about the
future from the current situation.
This analysis summarizes the modeling, simulation, and analytics work around the
COVID-19 outbreak around the world from the perspective of data science and visual
analytics. It examines the impact of best practices and preventive measures in various sectors
and enables outbreaks to be managed with available health resources.
1
2. Introduction:-
Coronaviruses are a family of viruses that can cause respiratory illness in humans.
They are called “corona” because of crown-like spikes on the surface of the virus. Severe acute
respiratory syndrome (SARS), Middle East respiratory syndrome (MERS) and the common
cold are examples of coronaviruses that cause illness in humans.
The new strain of coronavirus — SARS-CoV-2 — was first reported in Wuhan,
China in December 2019. It has since spread to every country around the world.
This analysis summarizes the modeling, simulation, and analytics work around the
COVID-19 outbreak around the world from the perspective of data science and visual
analytics. It examines the impact of best practices and preventive measures in various sectors
and enables outbreaks to be managed with available health resources.
This project will introduce learners to an array of skills as they strive to create a data
visualization dashboard focusing on COVID-19 data using Python. Data visualization is a
quintessential part of any data science project and offers us valuable insights for understanding
and translating data.
7
2.2) Project Plan:-
Firstly we will ex the analysis of COVID-19 data and visualize it utilizing
Plotly Express in Python. The focus will be on generating a variety of visual
representations, including bar charts, line graphs, bubble charts, and scatter plots. The
visualizations produced in this project will be of high quality.
The primary tool for visualizing COVID-19 data will be Plotly Express.
Through this analysis and visualization, individuals will gain insights into complex
scenarios and be able to make informed predictions regarding future developments
based on the current data.
This analysis encapsulates the modeling, simulation, and analytical efforts
related to the global COVID-19 pandemic from a data science and visual analytics
perspective. It assesses the effectiveness of best practices and preventive measures
across different sectors, facilitating the management of outbreaks with the health
resources available.
Gantt Chart:-
Sr.no Task Name 21-Sept 5-Oct 24-Oct 11-Nov 27-Nov
1 Requirement
Gathering
2 Planning
3 Designing
4 Coding
5 Testing and
Deployment
8
3. Project Requirement:-
3.1) Hardware Requirements:-
9
3.3) IDE :-
10
4. System Design:-
4.1) ER Diagram:-
6
4.2) Data Flow Diagram:-
START
Date Preparation
Data Preprocessing
7
5. Designing:-
Home Page:
8
9
6. Coding-
data for Canada [36] captured by Public Health Agency of Canada (PHAC)
exposure methods, for all 107,916 captured cases from January 25 (when
Our tool conducts visual analytics on the data to discover frequent patterns
information in the form of a pie chart for each frequent 1-itemset (and its
related information) and a sunburst diagram for each frequent k-itemset
(and its related information, for k > 1). As previewed in Example 1, our tool
(i.e., community exposures). See Fig. 9(a), which also shows that, among
NULL).
To avoid distraction from NULL values, our tool provides users with
previewed in Example 2, our tool reveals that 90/94 ≈ 96% of cases with
international travel.
8. Conclusion-
To avoid distraction from NULL values, our tool provides users with
flexibility of visualizing non-NULL values. See Fig. 9(b), which focuses on
the 90%+4% = 94% of cases (i.e., those with stated/known values). As
previewed in Example 2, our tool reveals that 90/94 ≈ 96% of cases with
stated/known transmission methods were transmitted through domestic
acquisition, whereas the remaining 4/94 ≈ 4% were transmitted through
international travel.
9.Bibliography-
9.1) Books:
Published in: 2024 5th International Conference on Innovative Trends in
Information Technology (ICITIIT)
The novel coronavirus, 2019-nCoV, is highly contagious and more infectious than
initially estimate