DV Chapter 1

The document provides an overview of data visualization, emphasizing its importance in analyzing and interpreting data through graphical representations like charts and graphs. It discusses various data classification methods, data collection types, and transformation techniques, highlighting their roles in enhancing decision-making and understanding trends. Additionally, it covers data mining, processing, analysis, reporting, and cleaning, underscoring the significance of high-quality data for effective insights.

Uploaded by

Prasanna Kumar

Data Visualization

Dr. B. SUJATHA
ASSOCIATE PROFESSOR
UNIT 1 - INTRODUCTION TO DATA VISUALIZATION
Data is a collection of raw facts in any format, such as numbers,
text, sound, or images.
Data classification is the process of organizing data into groups
based on shared characteristics.

The main objectives of Classification of Data are as follows:

• Explain the similarities and differences in the data
• Simplify and condense the mass of data
• Facilitate comparisons
• Study relationships within the data
• Prepare data for tabular presentation
• Present a mental picture of the data
Basis of Classification of Data
The classification of statistical data is done
after considering the scope, nature, and purpose
of an investigation and is generally done on four
bases; viz., geographical location, chronology,
qualitative characteristics, and quantitative
characteristics.
1. Geographical Classification
The classification of data on the basis of geographical location or
region is known as Geographical or Spatial Classification. For
example, presenting the population of different states of a country is
done on the basis of geographical location or region.
2. Chronological Classification
The classification of data with respect to different time
periods is known as Chronological or Temporal
Classification. For example, the number of students in a
school in different years can be presented on the basis of a
time period.
3. Qualitative Classification
The classification of data on the basis of descriptive or qualitative characteristics like region,
caste, sex, gender, education, etc., is known as Qualitative Classification. Such
characteristics cannot be measured numerically; they can only be recorded as present or absent,
or as belonging to a category. Qualitative classification is of two types; viz., Simple
Classification and Manifold Classification.
Simple Classification
When the given data is classified into two classes on the basis of only one attribute, it is
known as Simple Classification. For example, dividing the population into literate and
illiterate is a simple classification.

Manifold Classification
When the given data is classified on the basis of more than one attribute, so that each class
is sub-divided into further sub-classes, it is known as Manifold Classification. For
example, dividing the population into literate and illiterate, then sub-dividing each group
into male and female, and further sub-dividing those into married and unmarried, is a
manifold classification.
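The difference between simple and manifold classification can be sketched in a few lines of Python. The records below are made-up illustrative data, and counting with `collections.Counter` is just one convenient way to tabulate the classes.

```python
from collections import Counter

# Hypothetical records: (literacy, sex, marital status) for each person.
people = [
    ("literate", "female", "married"),
    ("literate", "male", "unmarried"),
    ("illiterate", "female", "married"),
    ("literate", "female", "married"),
    ("illiterate", "male", "unmarried"),
]

# Simple classification: one attribute, two classes.
simple = Counter(literacy for literacy, _sex, _status in people)

# Manifold classification: each class is sub-divided by further attributes.
manifold = Counter(people)

print(simple)
print(manifold)
```

Each key of `manifold` is one sub-class (for example, literate, female, married), which is what a nested table of the same data would show.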
4. Quantitative Classification
The classification of data on the basis of characteristics
that can be measured in quantity, such as age, height,
weight, income, etc., is known as Quantitative
Classification. For example, grouping the students of a
class by weight is a quantitative classification.
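As a rough illustration, quantitative classification usually means placing each measurement into a class interval and counting how many fall in each. The weights and the interval width of 10 kg below are assumptions chosen for the example.

```python
from collections import Counter

# Hypothetical weights of students, in kilograms.
weights_kg = [42, 47, 51, 55, 58, 60, 63, 68, 71]
WIDTH = 10  # assumed class-interval width


def class_interval(value, width):
    """Return the (lower, upper) interval containing value; upper is exclusive."""
    lower = (value // width) * width
    return (lower, lower + width)


frequency = Counter(class_interval(w, WIDTH) for w in weights_kg)

for (lower, upper), count in sorted(frequency.items()):
    print(f"{lower}-{upper} kg: {count} students")
```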
Data Collection refers to the systematic process of
gathering, measuring, and analyzing data from various sources to
get a complete and accurate picture of an area of interest.
Primary data refers to information collected directly from first-
hand sources specifically for a particular research purpose. This type of
data is gathered through various methods, including surveys, interviews,
experiments, observations, and focus groups. One of the main
advantages of primary data is that it provides current, relevant, and
specific information tailored to the researcher’s needs, offering a high
level of accuracy and control over data quality.
Secondary data refers to information that has already been
collected, processed, and published by others. This type of data can be
sourced from existing research papers, government reports, books,
statistical databases, and company records. The advantage of secondary
data is that it is readily available and often free or less expensive to
obtain compared to primary data. It saves time and resources since the
data collection phase has already been completed.
What is Data Visualization and Why is It Important in Analyzing Data?

Data visualization is the graphical representation of information and
data. By using visual elements like charts, graphs, and maps, data
visualization tools provide an accessible way to see and understand
trends, outliers, and patterns in data. Data visualization translates
complex data sets into visual formats that are easier for the
human brain to comprehend. This can include a variety of visual
tools such as:
• Charts: Bar charts, line charts, pie charts, etc.
• Graphs: Scatter plots, histograms, etc.
• Maps: Geographic maps, heat maps, etc.
• Dashboards: Interactive platforms that combine
multiple visualizations.
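To make the idea concrete without depending on any plotting library, here is a minimal sketch that draws a bar chart as plain text. The sales figures and the `text_bar_chart` helper are hypothetical; a real project would normally use a charting tool instead.

```python
# Hypothetical units sold per region.
sales = {"North": 12, "South": 7, "East": 15, "West": 4}


def text_bar_chart(data, symbol="#"):
    """Return one line per category: a padded label, a bar, and the value."""
    width = max(len(label) for label in data)
    return [
        f"{label.ljust(width)} | {symbol * value} {value}"
        for label, value in data.items()
    ]


chart_lines = text_bar_chart(sales)
print("\n".join(chart_lines))
```

Even in this crude form, the longest and shortest bars are easier to spot than the largest and smallest numbers in a table.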
The Role of Data Visualization in Decision Making
Data visualization plays an integral role in the decision-
making process, as it helps stakeholders understand trends,
patterns, relationships, and outliers within data. By presenting data
in an easily digestible format, decision-makers can grasp the
implications of the information, leading to more informed choices
and better outcomes.
Furthermore, effective data visualization can foster
collaboration and facilitate communication between team members
by presenting information in a universally understandable manner.
For example, a sales team might use a data visualization tool to track
their progress toward their monthly targets. By presenting this
information in a clear and concise manner, the team can identify
areas where they need to improve and take action accordingly. This
can lead to increased sales, higher revenue, and better overall
performance.
Examples of data visualizations
1. Traditional visuals: Time-tested data visualization tools like
charts (bar, line, pie), graphs (scatter plots, histograms) and
maps remain incredibly powerful for conveying information
quickly and clearly.
2. Infographics: Combine visuals, text and data to present
complex information in a compelling and easy-to-follow
way.
3. Data dashboards: Interactive dashboards consolidate real-
time key performance indicators (KPIs), providing an at-a-
glance overview of business health.
4. Advanced visual techniques: Techniques like heatmaps,
network diagrams and treemaps are used to visualize
complex relationships or hierarchical data.
Benefits of Effective Data Visualization

Effective data visualization offers several benefits, such as:

• Improved comprehension of complex data

• Increased ability to identify trends and patterns
• Enhanced decision-making and problem-solving
capabilities
• Streamlined communication, collaboration, and sharing
of insights
• Reduced time, effort, and resources required to interpret
data
Data transformation techniques
Data transformation techniques refer to all the
actions that help you transform your raw data into a clean
and ready-to-use dataset. The process of data
transformation involves converting, cleansing, and
structuring data into a usable format that can be
analyzed to support decision-making processes.
It includes modifying the format, organization, or
values of data to prepare it for consumption by an
application or for analysis. This crucial process is
undertaken by organizations seeking to leverage their data
to provide timely business insights, ensuring that the
information is accessible, consistent, secure, and ultimately
trusted by the intended business users.
Different types of data transformation techniques
Data Smoothing:
Problem solved: Smoothing removes noise and short-term fluctuations from data,
making it easier to analyze and interpret.
Use case scenarios: Smoothing can be useful in scenarios where the data is noisy or
contains fluctuations that obscure the underlying patterns.
How it works: Techniques include moving averages, exponential smoothing, and
kernel smoothing. Each replaces a value with a weighted combination of nearby
values, so that random variation cancels out and the underlying trend remains.
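Two of the smoothing techniques named above can be sketched as follows. The function names, the window of 3, and the sample series are assumptions for illustration; the exponential version uses the common recurrence s[0] = x[0], s[t] = alpha * x[t] + (1 - alpha) * s[t-1].

```python
def moving_average(values, window):
    """Mean of each run of `window` consecutive values."""
    return [
        sum(values[i:i + window]) / window
        for i in range(len(values) - window + 1)
    ]


def exponential_smoothing(values, alpha):
    """Blend each new value with the previous smoothed value (0 < alpha <= 1)."""
    smoothed = [values[0]]
    for x in values[1:]:
        smoothed.append(alpha * x + (1 - alpha) * smoothed[-1])
    return smoothed


noisy = [10, 14, 9, 15, 11, 16, 12]  # hypothetical noisy series
print(moving_average(noisy, window=3))
print(exponential_smoothing(noisy, alpha=0.5))
```

A larger window, or a smaller `alpha`, smooths more aggressively at the cost of reacting more slowly to real changes.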
Attribute Construction (Feature Engineering):
Problem solved: Attribute construction creates new features or modifies existing
ones to improve the performance of machine learning models.
Use case scenarios: Feature engineering can be useful in various scenarios, such as
combining or aggregating features to capture higher-level patterns, applying
mathematical transformations (e.g., log, square root) to address skewed
distributions, or extracting new information from existing features (e.g., creating a
day of the week from a timestamp).
How it works: Feature engineering can be accomplished through various methods,
such as mathematical transformations, aggregation, binning, and dimensionality
reduction techniques. The goal is to create new data attributes that are more
representative of the underlying patterns in the data and that help to improve the
performance of the machine learning model.
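The two examples mentioned above (a day of the week derived from a timestamp, and a log transform for a skewed value) might look like this in practice. The field names and rows are hypothetical, and base-10 logarithm is just one possible choice of transform.

```python
import math
from datetime import datetime

DAY_NAMES = ["Monday", "Tuesday", "Wednesday", "Thursday",
             "Friday", "Saturday", "Sunday"]

# Hypothetical raw rows with a timestamp and a heavily skewed income value.
rows = [
    {"timestamp": "2024-01-01T09:30:00", "income": 1000.0},
    {"timestamp": "2024-01-06T18:00:00", "income": 100000.0},
]


def add_features(row):
    """Return a copy of row with three constructed attributes added."""
    ts = datetime.fromisoformat(row["timestamp"])
    return {
        **row,
        "day_of_week": DAY_NAMES[ts.weekday()],    # extracted from timestamp
        "is_weekend": ts.weekday() >= 5,           # Saturday or Sunday
        "log_income": math.log10(row["income"]),   # compresses large values
    }


engineered = [add_features(r) for r in rows]
for row in engineered:
    print(row)
```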
Generalization:
Problem solved: Generalization reduces the complexity of data by replacing
low-level attributes with high-level concepts.
Use case scenarios: Generalization can be useful in scenarios where the
dataset is too complex to analyze, such as in image or speech recognition.
How it works: Techniques include abstraction, summarization, and clustering.
The goal is to reduce the complexity of the data by identifying patterns and
replacing low-level attributes with high-level concepts that are easier to
understand and analyze.
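One simple form of generalization is climbing a concept hierarchy: an exact age becomes an age group, and a city becomes a country. The mapping, the age thresholds, and the records below are all assumptions made for this sketch.

```python
# Hypothetical concept hierarchy: city -> country.
CITY_TO_COUNTRY = {"Chennai": "India", "Mumbai": "India", "Paris": "France"}


def age_group(age):
    """Replace an exact age with a higher-level concept (assumed thresholds)."""
    if age < 18:
        return "minor"
    if age < 60:
        return "adult"
    return "senior"


records = [
    {"age": 15, "city": "Chennai"},
    {"age": 34, "city": "Paris"},
    {"age": 72, "city": "Mumbai"},
]

generalized = [
    {"age_group": age_group(r["age"]), "country": CITY_TO_COUNTRY[r["city"]]}
    for r in records
]
print(generalized)
```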

Data Aggregation:
Problem solved: Aggregation combines data at different levels of granularity,
making it easier to analyze and understand.
Use case scenarios: Aggregation can be useful in scenarios where data needs
to be analyzed at different levels of detail, such as in financial analysis or sales
forecasting.
How it works: Techniques include summarization, averaging, and grouping.
The goal is to combine data at different levels of granularity, creating
summaries or averages that are more representative of the underlying
patterns in the data.
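As an illustration of grouping and summarizing, the sketch below rolls hypothetical daily sales up to monthly totals and averages. The data and the choice of month as the coarser level of granularity are assumptions.

```python
from collections import defaultdict

# Hypothetical daily sales: (ISO date, amount).
daily_sales = [
    ("2024-01-05", 120.0),
    ("2024-01-20", 80.0),
    ("2024-02-03", 200.0),
    ("2024-02-17", 100.0),
    ("2024-02-25", 60.0),
]

# Group: collect amounts under their "YYYY-MM" month key.
by_month = defaultdict(list)
for date, amount in daily_sales:
    by_month[date[:7]].append(amount)

# Summarize: one total and one average per month.
monthly_total = {month: sum(v) for month, v in by_month.items()}
monthly_mean = {month: sum(v) / len(v) for month, v in by_month.items()}

print(monthly_total)
print(monthly_mean)
```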
Normalization:
Problem solved: Data normalization scales numerical features to a standard
range, typically [0, 1] or [-1, 1]. This prevents features with larger scales
from dominating the model and causing biased results.
Use case scenarios: Normalization is particularly important when working
with machine learning algorithms that are sensitive to the scale of input
features.
How it works: Techniques include min-max scaling and z-score
standardization, which transform the original feature values to a standard
range or distribution, making them more suitable for analysis and modeling.
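The two techniques named above can be written out directly. Min-max scaling maps values to [0, 1] via (x - min) / (max - min); z-score standardization computes (x - mean) / standard deviation, which centers the data at 0 but does not bound it to a fixed range. The heights are made-up data, and the sketch assumes the values are not all identical (otherwise it would divide by zero).

```python
from statistics import mean, pstdev


def min_max_scale(values):
    """Rescale values linearly so the smallest is 0 and the largest is 1."""
    lo, hi = min(values), max(values)
    return [(v - lo) / (hi - lo) for v in values]


def z_score(values):
    """Center on the mean and divide by the population standard deviation."""
    mu, sigma = mean(values), pstdev(values)
    return [(v - mu) / sigma for v in values]


heights_cm = [150, 160, 170, 180, 190]  # hypothetical feature values
scaled = min_max_scale(heights_cm)
standardized = z_score(heights_cm)
print(scaled)
print(standardized)
```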

Filters and slicers
Slicing is the process of extracting a part of a collection (like a list or
a string) by specifying a range. It's like cutting a piece out of a larger
set. You can decide where to start, where to end, and how to step
through the collection.
Key Points:
Start: Where to begin the slice (inclusive).
End: Where to stop the slice (exclusive).
Step: The spacing between elements to include in the slice (optional).
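The start, end, and step parts described above map directly onto Python's slice syntax. The list and string below are arbitrary sample values.

```python
numbers = [0, 1, 2, 3, 4, 5, 6, 7, 8, 9]

first_three = numbers[0:3]    # start inclusive, end exclusive
middle = numbers[2:7]         # items at index 2 up to, not including, 7
every_second = numbers[::2]   # step of 2 over the whole list
backwards = numbers[::-1]     # negative step walks from the end

prefix = "visualization"[0:6]  # strings are sliced the same way

print(first_three, middle, every_second, backwards, prefix)
```

Slicing returns a new list or string and leaves the original collection unchanged.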
Filtering is the process of picking out specific items from a
collection based on a condition. You can think of it like a sieve: only
the items that match your criteria pass through.
There are two main ways to filter:
Using filter(): This function selects items based on a condition.
Using List Comprehension: A more flexible way to create a new list by
filtering items that meet certain criteria.
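Both ways of filtering can be shown side by side. The condition used here (keeping even numbers) is only an example; any test that returns true or false would work.

```python
numbers = list(range(10))

# filter() takes a function and keeps items for which it returns True.
# It returns a lazy iterator, so list() is used to materialize the result.
evens_with_filter = list(filter(lambda n: n % 2 == 0, numbers))

# A list comprehension expresses the same condition inline.
evens_with_comprehension = [n for n in numbers if n % 2 == 0]

# A comprehension can also transform the items it keeps.
squares_of_odds = [n * n for n in numbers if n % 2 == 1]

print(evens_with_filter)
print(evens_with_comprehension)
print(squares_of_odds)
```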
Filters and slicers
Aspect | Slicing | Filtering
Purpose | Extract a specific range of items based on position. | Select items based on a condition.
Works by | Using start, end, and step (indices). | Testing each item against a condition.
Result | A subset of the collection, based on index positions. | A subset of the collection, based on conditions.
Syntax | iterable[start:end:step] | List comprehension or filter() with a condition.
Flexibility | Less flexible (fixed range by index). | More flexible (custom conditions can be applied).
Use case | When you need a specific range of elements. | When you need elements that satisfy a condition (e.g., even numbers).
Data mining
Data mining is the process of discovering patterns,
trends, and useful information from large datasets using
statistical, mathematical, and computational techniques. It
involves analyzing vast amounts of data to extract valuable
insights that can help organizations make data-driven
decisions.
In simpler terms, data mining is like digging through
a pile of data to find hidden gems of information that can
be used for various purposes, such as improving business
operations, predicting future trends, or understanding
customer behavior.
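As one small, hedged illustration of pattern discovery, the sketch below counts which pairs of items are bought together most often, a simplified version of the idea behind market-basket analysis. The baskets and the minimum support of 2 are assumptions; real data mining tools use more scalable algorithms.

```python
from collections import Counter
from itertools import combinations

# Hypothetical shopping baskets, one set of items per transaction.
baskets = [
    {"bread", "milk"},
    {"bread", "butter", "milk"},
    {"butter", "jam"},
    {"bread", "milk", "jam"},
]

# Count how often each unordered pair of items appears in the same basket.
pair_counts = Counter()
for basket in baskets:
    for pair in combinations(sorted(basket), 2):
        pair_counts[pair] += 1

MIN_SUPPORT = 2  # assumed threshold: keep pairs seen in at least 2 baskets
frequent_pairs = {p: c for p, c in pair_counts.items() if c >= MIN_SUPPORT}
print(frequent_pairs)
```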
Data processing
Data processing refers to the collection,
transformation, and manipulation of raw data into
meaningful information. The process involves a series
of steps to convert data into a usable format, and often
involves cleaning, organizing, and structuring the data
before it can be analyzed or used for decision-making.
In short, data processing turns raw data into
useful insights or outcomes. This is an essential activity
in fields like business analytics, data science, research,
and artificial intelligence.
Data analysis
Data analysis refers to the process of inspecting,
cleaning, transforming, and modeling data with the
goal of discovering useful information, drawing
conclusions, and supporting decision-making. It
involves applying statistical, mathematical, or
computational techniques to extract insights from
data. Data analysis is used to make sense of raw
data, uncover patterns, and provide actionable
insights that guide business, scientific, or
operational strategies.
Data report
A data report is a structured presentation of data
analysis results, often accompanied by insights,
conclusions, and recommendations. It is typically
used to communicate findings to stakeholders such
as managers, executives, clients, or teams, helping
them make informed decisions. Data reports can
take many forms, including tables, charts, graphs,
and narrative explanations, depending on the
audience and the complexity of the information.
Data cleaning
Data cleaning (also called data cleansing or data
scrubbing) is the process of identifying and rectifying (or
removing) errors, inconsistencies, and inaccuracies in raw
data to improve its quality. Data cleaning is a crucial step
in the data analysis pipeline because clean, accurate, and
consistent data leads to more reliable analysis and better
decision-making.
Raw data collected from various sources may be
incomplete, inconsistent, or erroneous, and cleaning the
data ensures that the data used for analysis is of high
quality.
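A minimal cleaning pass might handle the three kinds of problems named above: incomplete, inconsistent, and erroneous values. The records, field names, and the accepted age range of 0 to 120 are assumptions for this sketch, and dropping bad rows is only one option (correcting or imputing them is another).

```python
# Hypothetical raw records with typical quality problems.
raw = [
    {"name": " Asha ", "age": "21"},   # stray whitespace
    {"name": "Ravi", "age": ""},       # incomplete: missing age
    {"name": "asha", "age": "21"},     # inconsistent case, duplicate of first
    {"name": "Meena", "age": "-4"},    # erroneous: impossible age
    {"name": "Kiran", "age": "34"},
]


def clean(records, min_age=0, max_age=120):
    """Normalize names, drop invalid ages, and remove duplicate records."""
    seen = set()
    cleaned = []
    for rec in records:
        name = rec["name"].strip().title()
        age_text = rec["age"].strip()
        if not age_text.lstrip("-").isdigit():
            continue  # missing or non-numeric age
        age = int(age_text)
        if not min_age <= age <= max_age:
            continue  # value outside the plausible range
        if (name, age) in seen:
            continue  # duplicate after normalization
        seen.add((name, age))
        cleaned.append({"name": name, "age": age})
    return cleaned


cleaned = clean(raw)
print(cleaned)
```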
