
Cda U2 Visualization

Computational data analytics

Visualization

• Visualization is a technique for representing information in pictorial or visual form.
• Anything represented in pictorial or graphical form, with the help of diagrams, charts, pictures, flowcharts, etc., is known as visualization.
• Data represented graphically can generally be analyzed more easily than data presented in words.
Ways of Representing Visual Data
• The data is first analyzed, and the result of the analysis is then visualized. There are two ways to do this:
– 1. Infographics
– 2. Data visualization

• 1. Infographics are visual representations of information or data that convey it rapidly and accurately.
• Using colorful graphics in charts and graphs improves the interpretation of the data.
• 2. Data visualization is a different approach from infographics.
• It is the study of representing data or information in a visual form.
• With advanced technologies such as multimedia, the scope of data visualization has increased manifold.
• Visual representations in the form of graphs, images, diagrams, or animations have proliferated across the media industry and the internet.
• The human mind generally comprehends information more easily when it is presented in the form of visuals.
Why Is Visualization an Excellent Medium of Analysis?
• Visual images transmit a large amount of information to the human brain at a glance.
• Visual images make it easy to establish relationships and distinctions between different patterns or processes.
• Visual interpretation helps in exploring data from different angles, which helps in gaining insights.
• Visualization helps in identifying problems and understanding trends and outliers.
• Visualizations highlight key or interesting findings in a large dataset.
Classification of data
• Data can be classified using three criteria, irrespective of whether it is presented as a data visualization or an infographic:

• 1. Method of creation: the type of content used while creating the graphical representation.
• 2. Quantity of data displayed: the amount of data that is represented, e.g. a geographical map or a company's financial data.
• 3. Degree of creativity applied: the extent to which the data is designed graphically, for example in a colourful way, or simply shown as important data in black-and-white diagrams.

On the basis of this evaluation, we can determine the correct form of representation for a given data type.
Various Content Types:
• Graph: A representation in which X and Y axes are used to depict the meaning of the information.
• Diagram: A two-dimensional representation of information that shows how something works.
• Timeline: A representation of important events in sequence with the help of self-explanatory visual material.
• Template: A layout design for presenting information.
• Checklist: A list of items for comparison and verification.
• Flowchart: A representation of instructions that shows how something works or gives a step-by-step procedure to perform a task.
• Mind Map: A type of diagram used to visualize information.
Techniques Used for Visual Data Representation
• 1. Isoline: A 2D representation in which a curved line runs continuously across the surface of a graph, connecting points of equal value (for example, isobars connect points of equal pressure). An isoline is plotted from the arrangement of the data rather than from a data visualization.
Figure 1.17: Isobars on a weather map depicting the pressure pattern over the United States
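The idea of an isoline can be sketched in a few lines of Python. The example below is only an illustration under stated assumptions: the function name `isoline_crossings`, the 3x3 grid, and the pressure readings are all hypothetical, and the crossing points are located by simple linear interpolation along grid edges rather than by a full contouring algorithm.

```python
def isoline_crossings(grid, level):
    """Return (x, y) points where the field equals `level` on grid edges.

    grid[r][c] is the value at x=c, y=r. Each crossing is located by
    linear interpolation between two neighbouring grid nodes.
    """
    points = []
    rows, cols = len(grid), len(grid[0])
    for r in range(rows):
        for c in range(cols):
            v = grid[r][c]
            # Edge to the right-hand neighbour.
            if c + 1 < cols:
                w = grid[r][c + 1]
                if (v - level) * (w - level) < 0:
                    t = (level - v) / (w - v)
                    points.append((c + t, float(r)))
            # Edge to the neighbour below.
            if r + 1 < rows:
                w = grid[r + 1][c]
                if (v - level) * (w - level) < 0:
                    t = (level - v) / (w - v)
                    points.append((float(c), r + t))
    return points


# Hypothetical pressure readings (hPa) on a 3x3 grid.
pressure = [
    [1000.0, 1004.0, 1008.0],
    [1002.0, 1006.0, 1010.0],
    [1004.0, 1008.0, 1012.0],
]
crossings = isoline_crossings(pressure, 1005.0)
for x, y in crossings:
    print(f"1005 hPa isobar crosses a grid edge at x={x:.2f}, y={y:.2f}")
```

Joining the crossing points in order gives an approximation of the 1005 hPa isobar; a plotting library would normally do this step and draw the curve.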
• 2. Isosurface: The 3D counterpart of an isoline. An isosurface represents the points that share a constant value within a volume of space, i.e. in a domain that covers 3D space.
• 3. Direct Volume Rendering (DVR): A method for obtaining a 2D projection of a 3D dataset. The 3D data is projected into 2D form through DVR for a clearer and more transparent visualization.
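A minimal sketch of how a 3D dataset can be projected to 2D is shown below. It assumes a tiny volume stored as nested lists, viewing rays that run straight along the z axis, and a toy transfer function in which opacity is proportional to the sample value; the function name `render_volume` and the `opacity_scale` parameter are hypothetical choices for illustration, not part of any particular rendering library.

```python
def render_volume(volume, opacity_scale=0.5):
    """Project volume[z][y][x] (values in 0..1) to a 2D image[y][x].

    Each pixel accumulates samples front to back along z, so nearer
    samples partly hide the ones behind them.
    """
    depth = len(volume)
    height = len(volume[0])
    width = len(volume[0][0])
    image = [[0.0] * width for _ in range(height)]
    for y in range(height):
        for x in range(width):
            color = 0.0
            transmittance = 1.0  # fraction of light still passing through
            for z in range(depth):  # z = 0 is closest to the viewer
                sample = volume[z][y][x]
                alpha = sample * opacity_scale  # toy transfer function
                color += transmittance * alpha * sample
                transmittance *= 1.0 - alpha
            image[y][x] = color
    return image


# Hypothetical 2-slice volume, one row high and two columns wide.
volume = [
    [[1.0, 0.0]],  # front slice
    [[1.0, 1.0]],  # back slice
]
image = render_volume(volume)
print(image)
```

The left pixel ends up brighter than the right one because two dense samples lie along its ray, while the right ray passes through empty space before reaching its single dense sample.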
• 4. Streamline: A field line that results from the velocity vector field description of a flow; it is tangent to the velocity vector at every point.
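One way to make the streamline idea concrete is to trace a path through a velocity field numerically. The sketch below assumes a simple forward Euler integrator and a hypothetical rotating flow; `trace_streamline` and `rotation` are illustrative names, and real tools typically use more accurate integration schemes.

```python
def trace_streamline(velocity, start, step=0.1, n_steps=100):
    """Follow the velocity field from `start` using forward Euler steps.

    `velocity` is a function (x, y) -> (vx, vy). The returned list of
    points approximates the streamline through `start`.
    """
    x, y = start
    path = [(x, y)]
    for _ in range(n_steps):
        vx, vy = velocity(x, y)
        x, y = x + step * vx, y + step * vy
        path.append((x, y))
    return path


def rotation(x, y):
    """Hypothetical flow that circulates counter-clockwise around the origin."""
    return (-y, x)


path = trace_streamline(rotation, start=(1.0, 0.0), step=0.05, n_steps=20)
for x, y in path[::5]:
    print(f"({x:.3f}, {y:.3f})")
```

With forward Euler the traced curve drifts slowly outward from the true circle, which is why a smaller step or a higher-order integrator is usually preferred in practice.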
• 5. Map: A visual representation of locations within a specific area, depicted on a planar surface, e.g. Google Maps.
• 6. Parallel Coordinate Plot: A visualization technique for representing multidimensional data. Each dimension gets its own parallel axis, and each record is drawn as a line crossing all the axes.
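The data preparation behind a parallel coordinate plot can be sketched as follows. This is only an illustration: the function name `to_polylines` and the sample records are hypothetical, and min-max scaling per axis is one common choice among several.

```python
def to_polylines(records):
    """Turn multidimensional records into polylines for parallel axes.

    Every dimension is rescaled to 0..1 independently, and each record
    becomes a list of (axis_index, position_on_axis) vertices.
    """
    n_axes = len(records[0])
    lows = [min(r[i] for r in records) for i in range(n_axes)]
    highs = [max(r[i] for r in records) for i in range(n_axes)]
    polylines = []
    for record in records:
        line = []
        for i in range(n_axes):
            span = highs[i] - lows[i]
            # A constant dimension has no spread, so place it mid-axis.
            position = (record[i] - lows[i]) / span if span else 0.5
            line.append((i, position))
        polylines.append(line)
    return polylines


# Hypothetical records with three dimensions each.
records = [(10, 200, 3.0), (20, 100, 4.0), (30, 150, 5.0)]
polylines = to_polylines(records)
for line in polylines:
    print(line)
```

Drawing each list of vertices as a connected line across three vertical axes produces the plot; records with similar profiles show up as lines that run close together.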
• 7. Venn Diagram: Used to represent logical relations between finite collections of sets.
• 8. Timeline: Used to represent a chronological display of events, e.g. Google Calendar.
• 9. Euler Diagram: A representation of the relationships between sets.
• 10. Hyperbolic Trees: Graphs that are drawn using hyperbolic geometry.
• 11. Cluster Diagram: A representation of a cluster of related items, such as a cluster of astronomical entities.
• 12. Ordinogram: Used to analyze various sets of multivariate objects.
Types of Data Visualization
• Data can be viewed in many ways, such as 1D, 2D, or 3D structures, as shown below.

| Name | Description | Tool |
|---|---|---|
| 1D/Linear | e.g. a list of items organized in a predefined manner | Generally, no tool is used for 1D visualization |
| 2D/Planar | e.g. choropleth, cartogram, dot distribution map, and proportional symbol map | GeoCommons, Google Fusion Tables, Google Maps API, Polymaps, Many Eyes, Google Charts, and Tableau Public |
| 3D/Volumetric | e.g. 3D computer models, surface rendering, volume rendering, and computer simulations | AC3D, AutoQ3D, TrueSpace |

| Name | Description | Tool |
|---|---|---|
| Temporal | e.g. timeline, time series, Gantt chart, Sankey diagram, alluvial diagram, and connected scatter plot | TimeFlow, Timeline JS, Excel, Timeplot, TimeSearcher, Google Charts, Tableau Public, and Google Fusion Tables |
| Multidimensional | e.g. pie chart, histogram, tag cloud, bubble cloud, bar chart, scatter plot, heat map, etc. | Many Eyes, Google Charts, Tableau Public, and Google Fusion Tables |
| Tree/Hierarchical | e.g. dendrogram, radial tree, hyperbolic tree, and wedge stack graph | D3, Google Charts, and Network Workbench/Sci2 |
| Network | e.g. matrix, node-link diagram, hive plot, and tube map | Pajek, Gephi, NodeXL, VOSviewer, UCINET, GUESS, Network Workbench/Sci2, sigma.js, d3/Protovis, Many Eyes, and Google Fusion Tables |
Applications of Data Visualization
• 1. Education: Visualization is applied to teach a topic that requires simulation or modeling of an object or process. E.g. an organ system or the structure of an atom is best described with the help of diagrams or animations.
• 2. Information: Visualization is applied to transform abstract data into visual forms for easy interpretation and further exploration.
• 3. Production: Various applications are used to create 3D models of products for better viewing and manipulation. The real estate, communication, and automobile industries extensively use 3D advertisements to provide a better look and feel for their products.
• 4. Science: Every field of science, including fluid dynamics, astrophysics, and medicine, uses visual representation of information. Isosurfaces and direct volume rendering are typically used to explain scientific concepts.
• 5. Systems visualization: A relatively new concept that integrates visual techniques to better describe complex systems.
• 6. Visual communication: The multimedia and entertainment industries use visuals to communicate their ideas and information.
• 7. Visual analytics: The science of analytical reasoning supported by an interactive visual interface. The data generated by social media interaction is interpreted using visual analytics techniques.
Visualizing Big Data
• Big Data comprises both structured and unstructured forms of data collected from various sources.
• Big Data is highly dynamic, and therefore most traditional tools are not able to generate quality results. The response time of traditional tools is high, which makes them unfit for quality interaction.
• Challenges faced in Big Data analytics:
– Most data is in unstructured form
– Data is not analyzed in real time
– The amount of data generated is huge
– There is a lack of efficient tools and techniques
• Turning Data into Information
• Data reduction and abstraction are generally applied during data mining to extract valuable information.
• The visual data reduction process involves automated data analysis to measure density, outliers, and their differences.
• These measures are then used as quality metrics to evaluate the data-reduction activity.
• Visual quality metrics can be categorized as:
– Size metrics (e.g. number of data points)
– Visual effectiveness metrics (e.g. data density, collisions)
– Feature preservation metrics (e.g. discovering and preserving data density differences)
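As a rough sketch of how the size and visual effectiveness metrics above might be computed, the example below maps points onto a small pixel grid and counts how many land on an already occupied pixel. The function name `visual_metrics`, the definition of density as the fraction of occupied pixels, and the sample points are all assumptions made for illustration.

```python
def visual_metrics(points, width, height):
    """Compute simple quality metrics for a scatter of points on a pixel grid.

    `points` hold x and y values in the range [0, 1). A collision is
    counted whenever a point lands on a pixel that is already occupied.
    """
    occupied = {}
    for x, y in points:
        pixel = (int(x * width), int(y * height))
        occupied[pixel] = occupied.get(pixel, 0) + 1
    n_points = len(points)
    return {
        "size": n_points,                             # size metric
        "collisions": n_points - len(occupied),       # overplotted points
        "density": len(occupied) / (width * height),  # share of pixels used
    }


# Hypothetical data points drawn on a 4x4 pixel display.
points = [(0.10, 0.10), (0.12, 0.13), (0.60, 0.60), (0.90, 0.20)]
metrics = visual_metrics(points, width=4, height=4)
print(metrics)
```

Comparing these numbers before and after a data-reduction step gives a simple indication of whether the reduced view is less cluttered.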
Visual analytics tools should be:
– Simple enough that even non-technical users can operate them
– Interactive, so that they can connect with different sources of data
– Competent to create appropriate visuals for interpretation
– Able to interpret Big Data and share information
Beyond representing data, a visualization tool must be able to establish links between different data values, restore missing data, and polish data for further analysis.
Tools used in data Visualization
• Excel: Used for data analysis.
• It helps to track and visualize data for deriving better insights.
• It provides various ways to share data and analytical conclusions within and across organizations.
• Last.Forward: Open source software provided by Last.fm for analyzing and visualizing social music networks.
• Digg.com: Provides some of the best Web-based visualization tools, including:
– Pics: Used to track the activity of images on the website.
– Arc: Used to display topics and stories in a spherical form. A sphere displays the stories and topics, and bunches of stories are aligned along its outer circumference. Larger stories have more diggs, and the arc becomes thicker as more users digg the story.
• Google Charts API: Allows users to create dynamic charts to be embedded in a Web page. A chart built from the data and formatting parameters supplied in a HyperText Transfer Protocol (HTTP) request is converted into a Portable Network Graphics (PNG) image by Google to simplify the embedding process.
• TwittEarth: Shows live tweets from all over the world on a 3D globe. It is an effort to improve social media visualization and to provide a global mapping of tweets.
• Tag Galaxy: Provides a striking way of finding a collection of Flickr images. It is an unusual site whose search tool makes the process of combing through images online a memorable visual experience. To search for a picture, enter a tag of your choice and the tool finds matching pictures. The central star contains all the images directly relating to the initial tag, and the revolving planets consist of similar or corresponding tags. Click on a planet and additional sub-categories appear. Click on the central star and the Flickr images gather and land on a gigantic 3D sphere.
• D3: Allows you to bind arbitrary data to a Document Object Model (DOM) and then apply data-driven transformations to the document. E.g. you can use D3 to generate an HTML table from an array of numbers, or use the same data to create an interactive SVG bar chart with smooth transitions and interactions.
• Rootzmap Mapping the Internet: A tool that generates a series of maps on the basis of datasets provided by the National Aeronautics and Space Administration (NASA).
Open source Data Visualization Tools
• Big Data analytics requires the implementation of advanced tools and technologies. Due to economic and infrastructural limitations, not every organization can purchase all the applications required for analyzing data. Therefore, to fulfill their requirement for advanced tools and technologies, organizations often turn to open-source libraries. These libraries can be defined as pools of freely available applications and analytical tools.
• Open source tools are easy to use, consistent, and reusable.
• They deliver high-quality performance and are compliant with Web as well as mobile Web security.
• They provide multichannel analytics for modeling as well as customized business solutions that can be altered with changing business demands.
• Examples of open source tools for data visualization: VTK, Cave5D, ELKI, Tulip, Gephi, IBM OpenDX, Tableau Public, and Vis5D.
Analytical Techniques Used in Big Data Visualization
• Analytical techniques are used to analyze complex relationships among variables. The following are some commonly used analytical techniques for Big Data solutions:
• 1. Regression analysis: A statistical tool used for prediction. Regression analysis predicts a continuous dependent variable from one or more independent variables, so we can find the effect of one variable on another, e.g. sales increase when prices decrease.
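The price and sales example can be worked through with a small least-squares fit. The sketch below assumes a single independent variable and uses made-up figures; `fit_line` is an illustrative helper, not a function from a statistics package.

```python
def fit_line(xs, ys):
    """Fit y = intercept + slope * x by ordinary least squares."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


# Hypothetical observations: unit price and units sold.
prices = [10.0, 12.0, 14.0, 16.0]
units_sold = [100.0, 90.0, 80.0, 70.0]

slope, intercept = fit_line(prices, units_sold)
predicted = intercept + slope * 11.0  # predicted sales at a price of 11
print(f"slope={slope}, intercept={intercept}, predicted={predicted}")
```

A negative slope here expresses the relationship described above: in this made-up data, each unit increase in price goes with a drop in units sold.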
• Types of regression analysis:
– Ordinary least squares regression: Used when the dependent variable is continuous and there is some relationship between the dependent variable and the independent variable.
– Logistic regression: Used when the dependent variable has only two possible outcomes.
– Hierarchical linear modeling: Used when the data is in nested form.
– Duration models: Used to measure the length of a process.
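To illustrate the two-outcome case handled by logistic regression, the sketch below shows how an already fitted model turns an input into the probability of one of the two outcomes. The coefficients are hypothetical and chosen only for the example; estimating them from data is a separate step that is not shown.

```python
import math


def predict_probability(x, intercept, slope):
    """Return the modelled probability of the positive outcome for input x."""
    z = intercept + slope * x
    return 1.0 / (1.0 + math.exp(-z))  # logistic (sigmoid) function


# Hypothetical coefficients: probability that a customer buys,
# as a function of the number of site visits.
INTERCEPT = -4.0
SLOPE = 0.5

p_few_visits = predict_probability(2, INTERCEPT, SLOPE)
p_boundary = predict_probability(8, INTERCEPT, SLOPE)
p_many_visits = predict_probability(12, INTERCEPT, SLOPE)
print(p_few_visits, p_boundary, p_many_visits)
```

The output always lies between 0 and 1, so it can be read as a probability and turned into one of the two outcomes by comparing it with a threshold such as 0.5.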
• 2. Grouping methods: The technique of categorizing observations into significant or purposeful blocks is called grouping. The recognition of features that create a distinction between groups is called discriminant analysis.
• 3. Multiple equation models: Used to analyze causal pathways from independent variables to dependent variables. Types of multiple equation models are as follows:
– Path analysis
– Structural equation modeling
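A causal pathway in path analysis can be illustrated with a very small linear model. The sketch assumes one independent variable X that influences the dependent variable Y both directly and through an intermediate variable M; the function name `path_effects` and the coefficient values are hypothetical.

```python
def path_effects(a, b, c):
    """Decompose effects in the linear path model X -> M -> Y plus X -> Y.

    a: effect of X on M, b: effect of M on Y, c: direct effect of X on Y.
    """
    indirect = a * b  # effect carried along the pathway through M
    return {"direct": c, "indirect": indirect, "total": c + indirect}


# Hypothetical standardized path coefficients.
effects = path_effects(a=0.5, b=0.4, c=0.3)
print(effects)
```

In a linear model of this kind the indirect effect is the product of the coefficients along the pathway, and the total effect is the sum of the direct and indirect effects.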
