0% found this document useful (0 votes)
55 views63 pages

BDT UNIT - 4 Text Note

notes of bdt of R18 regulation of JNTUH

Uploaded by

N.C.Yashaswini
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF or read online on Scribd
0% found this document useful (0 votes)
55 views63 pages

BDT UNIT - 4 Text Note

notes of bdt of R18 regulation of JNTUH

Uploaded by

N.C.Yashaswini
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF or read online on Scribd
You are on page 1/ 63
Chapter 26 te es ee _ — resented i Visualization is pictorial or visual representation technique? Anything: tt a is known as pictorial o graphical form, with the help of diagrams, charts, pictures flowchart the data presented visualization, Data presented in the form of graphics can be analyzed ter tha FA. in words, Be Ag sich © Data viulzaton sa itor rvs erento of data wt ee Pe of data graphs, bar, histograms, tables, pie charts, mind maps, ete Deper ing UF (one-/two- and the aspects from which it salve visuals a ‘vary in terms of their dimensions ( are used /multi-dimensional) or types, such as temporal, hierarchical, network, Ale the market £07 for presenting different types of datasets, Different types of tools are yaad Tisch is visualizing data. One of the most commonly used data visualization He taal as well 25 available for free as Tableau Public, We can use Tableau products for both a fa the use of corporate purposes, creating several types of 1D, 2D, and 3D visualizations. But Wt ia og data visuali n in Big Data? Is it necessary to use it? Let's first track down the real visualization in the context of Big Data analytics. eT es you with the concept of data visualization and the et Big Data analytics. You also learn about different types of data visualizations. ‘Next, you learn a eS various types of tools using which data or information can be in a visual format. The chapter also discusses the basics and the installation process of the fableau Public tool. This chapter fami Ways of Representing Visual Data “ ‘The data is first analyzed and then the result of that analysis is visualized in different ways as discussed above. There are two ways to visualize a data —infographics and data visual Infographics are the visual representation of information or data rapidly and accurately. The use of colorful graphics in drawing charts and graphs helps in improving the interpretation of a given data. Figure 26.1 shows an example of infographics: f HesvenawietK sae Toes Saas OS 716 ae Data Visualization-1 Mr. Robert Smith, the technical head of A data that is stored in their databy ea ‘abases as Comma Separated Valu _ Of various data visualization tools Prat oo ne v2tue (CSV) fos, Mr. Simth loarned about the his team members about data Visualization, its tec! the market. M1 be used to visually present the stored data, He tells hnkues, and various data visualization tools available i, uctional designers focus on abstract and model-based scientific visualizations to make the ore ting and easy to understand. Nowadays, scientific data is alse Presented through digitally constructed images, These images are generally created with the help of “sual analvics combines the power of akorthm nieligence and Compitaionar power of modem computers. Visualization is an excellent medium to analyze, comprehend, and share information. Let's see why 9 Visual images help to transmit a huge amount of information to the human brain at a glance 9 Visual images help in establishing relationships and distinction between different patterns or Processes easily. Visual interpretations help in exploring data from different angles, which help gain insights 9 Visualization helps in identifying problems and understanding trends and outliers, 9 Visualizations point out key or interesting breakthroughs in a large dataset. Data can be classified on the basis of the following three criteria irrespective of whether it is Presented as data visualization or infographics; 9 Method of cteation—It refers to the type of content used while creating any graphical representation. 2 Quantity of data displayed—It refers to the amount of data which is represented. For example, geographical map, companies financial data, ete, & Degree of creativity applied—It refers to the extent to which the data is created! graphically or designed in a colorful way or it is just showing some important data in black and white ‘ oncipe 5 ae ~ On the basis ‘evaluation, we can understand which is the correct form of representation for a oe po Er scutes cnet pat oe F or a Gi oa: in which X and Y axes are used to depict the meaning 0 information ks. ae i 4 . ‘ something worl _ Diagram — A two-dimensional representation of information to show how ie 2 Parallel Coordinate Plot—It is a visualization technique of representing multidimensiona! «. Figure 26.7 shows a parallel coordinate plot: Data Visualization-| i : Figure 2688 shows a Venn diagram for a set of Taatore Ane a Figure 26.8: Venn Diagrams t Q Timeline—Itis used to represent a chronological display of events. - Figure 269 shows an example of timeline fr some eritcl event sets: * Critical Events Timeline Pence. Coser anCane! —meesanderonns ae ‘roigr sary Figure 26.9: Timeline for Some Critical Events It is a representation of the relationships between sets. 721 @ Hyperbolic Trees They represent graphs that are drawn using the hyperbolic geometry Figure 26.11 shows a hyperbolic tree: Figure 26.11: Hyperbolic Tree 9 Cluster Diagram—It represents a cluster, such as a cluster of astronomic entities. Figure 26.12 shows a cluster diagram: Data Visualization-| Figure 26.13 shows an ordinogram: 282 283 284 285 236 gh A nr gredient ena 3 Figure 26.13: Ordinogram Types of Data Visualization ‘You already know that data can be visualized in man i = n ‘ y ways, such as in the forms of 1D, 2D, or 3D structures. Table 26.1 briefly describes the different types of data visualization: SSPE Gmc | Generally, no tool is used for iD visualization GeoCommons, Google Fusion Tables, Google Maps APL Polymaps, Many Eves. Google Charts, and Tableau Public For example, choropleth, cartogram, dot distribution map, and AG3D, AutoQ3D, TrueSpace ‘TimeFlow, Timeline 1S, L Timeplot, TimeSearcher, Google Charts, Tableau Public, and Google Fusion Tables Many Eyes, Google Char Tableau Public, and Google Fusion a Charts, and Network | or Sci2 Table 26.1: Data Visualization Types Pajek, Gephi, NodeX! For example, matrix, node link L, VOSviewer, diagram, hive plot, and tube map UCINET, GUESS, Network Workbench/Sci2, sigmajs d3/Protovis, Many Eyes, and Google Fusion Tables As shown in Table 26.1, the simplest type of data visualization is 1D representation and the mo= complex data visualization is the network representation. The following is a brief description of of these data visualizations: a 1D (Linear) Data Visualization—In the linear data visualization, data is presented in the form of lists, Hence, we cannot term it as visualization. It is rather a data organization technique ‘Therefore, no tool is required to visualize data in a linear manner. 2D (Planar) Data Visualization —This technique presents data in the form of images, diagrams ‘or charts on a plane surface. Cartogram and dot distribution map are examples of 2D visualization. Some tools used to create 2D data visualization patterns are GeoComm Google Fusion Tables, Google Maps API, Polymaps, Tableau Public, etc. 3D (Volumetric) Data Visualization—In this method, data presentation involves exactly three dimensions to show simulations, surface and volume rendering, etc. Generally, it is used in scientific studies. Today, many organizations use 3D computer modeling and volume render in advertisements to provide users a better feel of their products. To create 3D visualizations, we ‘use some visualization tools that involve AC3D, AutoQ3D, TrueSpace, etc. Temporal Data Visualization—Sometimes, visualizations are time dependent. To visualize the dependence of analyses on time, the temporal data visualization is used, which includes Ga chart, time series, sanky diagram, etc. TimeFlow, Timeline JS, Excel, Timeplot, TimeSearcher Google Charts, Tableau Public, Google Fusion Tables, etc. are some tools used to create tempor data visualization. Multidimensional Data Visualization —In this type of data visualization, numerous dimersio» are used to present data, We have pie charts, histograms, bar charts, etc to pif multidimensional data visualization, re sore —— Boe Dies Gooeie Charts Tebtew Pein, te are some fam of Newtldetaye Sometimes, data relationships need to be shown in the visualizations. Applications of Data Visualization Data visualization tools and techniques are used in various applications. Some of the areas in which we apply data visualization are as follows: Education Visualization is applied to teach a topic that requires simulation or modeling of any ‘object or process. Have you ever wondered how difficult it would be to explain any organ or organ system: without any visuals? Organ system or structure of an atom is best described with the help of diagrams or animation: Information— Visualization is applied to transform abstract data into visual forms for easy interpretation and further exploration. Production Various applications are used to create 3D models of products for better viewing and manipulation. Real estate, communication, and automobile industry extensively use 3D advertisements to provide a better look and feel to their products. Science—Every field of science including fluid dynamics, astrophysics, and medicine use visual representation of information. Isosurfaces and direct volume rendering are typically used to ‘explain scientific concepts. Systems visualization—Systems visualization is a relatively new concept that integrates visual techniques to better describe complex systems. Visual communication—Multimedia and entertainment industry use visuals to communicate their ideas and information. Visual analytics—It refers to the science of analytical reasoning supported by the interactive visual interface. The data generated by social media interaction is interpreted using visual analytics techniques. Visualizing Big Data Visual analysis of data is not a new thing. _ Visualization tools and techniques to interpret For years, statisticians and analysts have been using t and present the outcomes of their analyses. ‘Almost every organization today is struggling to tackle the huge amount of data pouring in every day, Data visualization is a great way to reduce the turn-around time consumed in interpreting Big Data. Traditional visualization techniques are not efficient enough to capture or interpret the information that Big Data possesses. For example, such techniques are not able to interpret videos, audios, and complex sentences. Apart from the type of data, the volume and speed with which it is ‘generating pose a great challenge. Most of the traditional analytics techniques are unable to cater to any of these problems. 5 oth structured as well as unstructured forms of data collected from various neity of data sources, data streaming, and real-time data are also difficult to handle tools. Traditional tools are developed by using relational models that work best ig Data is highly dynamic in function and therefore, most traditional tools are aaa ele The response time of taonal tools is quite high, making it shapter 26 Deriving Business Solutions The most common notation used for Big Data is 3Vs—volume, velocity, and variety. But, the exciting feature is the way in which value is filtered from the haystack of data. Big Data gener: through social media sites is a valuable source of information to understand consumer sen and demographics. Almost every company nowadays is working with Big Data and fac: following challenges: Q Most data is in unstructured form @ Data is not analyzed in real time The amount of data generated is huge a Thereisa lack of efficient tools and techniques Considering all these factors, IT companies are focusing more on research and developme robust algorithms, software, and tools to analyze the data that is scattered in the Internet Tools such as Hadoop are providing the state-of-the-art technology to store and process Bi Analytical tools are now able to produce interpretations on smartphones and tablets. It is poss because of the advanced visual analytics that is enabling business owners and researchers to xo data for finding out trends and patterns. Turning Data into Information The most exciting part of any analytical study is to find useful information from a plethora Visualization facilitates identification of patterns in the form of graphs or charts, which in tu to derive useful information. Data reduction and abstraction are generally followed dur mining to get valuable information. Visual data mining also works on the same principle as simple data mining; however, it involv integration of information visualization and human-computer interaction. Visualization of Produces cluttered images that are filtered with the help of clutter-reduction techniques. U sampling and dimension recuiction are two commonly used clutter-reduction techniques. Visual data reduction process involves automated data analysi i P lysis to measure density, outliers _their differences. These measures are then used as quality metrics to evaluate data-redction activ Visual quality metrics can be categorized as: a ‘Size metrics (e.g, number of data points) & Visual effectiveness metrics (e.g, data density, collisions) a Feahure preservation metrics (e.g, discovering and preserving data density differences) In genera, we can conclude that a visual analytics tool should be: Simple enouigh $0 that even non-technical users can operate it a : ea : Ses Qa re = ¥ Apart from representing data, a visualization 4 . - ; data values, restore the missing data, and oligden fae be able ae links between differe" 726 Data Visuatization-1 Tools Used in Data Visualization ‘Some useful visualization tools are listed as follows: 2 Excel—It is a new tool that is used for data analysis. It helps you to track and visualize data for deriving better insights. This tool provides various ways to share data and analytical conclusions within and across organizations. Figure 26.14 shows an example of Excel sheet: Cen a -_ Are=Itis used to display the topics and stories in a spherical form, He stisplay stories and topic, and bunches of stories are aligned at the outer circumference of spher Bigure 26.16 shows Digg Are: sphere is used to Figure 26.16: Digg Arc Visual Larger stories have more diggs, as shown in Figure 26.16. The arc becomes thicker with number of times users dig the story. 2 Google Charts API—This tool allows a user to create dynamic charts to be embedded in a Wet page. A chart obtained from the data and formatting parameters supplied in a HyperText Transfer Protocol (HTTP) request is converted into a Portable Network Graphics (PNG) im by Google to simplify the embedding process. Figure 26.17 shows some charts created by using Google Charts API: Column Chore Area Chart Candlestick Chart Timetine Bubble Chart Donut Chart vow | re 2 = me 3 ' | 1790 1800, i a Figure 26.17; Charts Obtained from Google Charts API See Earth—This tool is cap. is an effort to improve social media visual, Figure 26.18 shows an example Data Visualization-1 able of showing live tweets from all over the worle ation and provide a ofa TwittBarth visual 1 ona 3D globe. It lobal image mapping in tweet Figure 26.18: TwittEarth Visual ‘Source: htp:/eybergyaen.com/2010/01/10.supercook-ways-to-isualizo-itemet nimi Tag Galaxy—Tag Galaxy provides a stunning way of findi ig A collection of Flickr images. an unusual site which provides search tool which makes the online combs, P memorable visual experience, If you want to search a picture, you have to enter » ta 2 of choice and it will find the picture. The central (core) star contains all the images direct! to the initial tag and the revolving planets consist of similar or corresponding tase. Click Planet and additional sub-categories will appear. Click on the central star and f lickr ima; gather and land on a gigantic 3D sphere. Figure 26.19 shows a visual created by Tag Galaxy: relatin Figure 26.19; Tag Galaxy Visual Source: Taggalaxy de 729 Chapter 26 2. -D3—D3 allows you to bind arbitrary data to a Document Object Model ( data-driven transformations to the document, For example, you can use D3 to HTML table from an array of numbers. Or, use the same data to ¢ reate an interactive SVG ba: chart with smooth transitions and interactions. DOM) and then applic generate an Seer ia Tee Zeta ee 7 ee FD Figure 26.20: Some Visuals Obtained from D3 ‘Source: http://d3js.org/ a tool to generate a series of maps on the basis of t a Rootzmap Mapping the Internet—It is datasets provided by the National Aeronautics and Space Administration (NASA). Figure shows an example of the Internet mapping through Rootzmap: Internet Mapping Visual ‘Source: hity/iww.sysctorgrootzmap/e-nap pa Open-Source Data Visualization Tools We already know that Big Data analytics requires the implementation of advanced tools an! technologies. Due to economic and infrastructural apace, cegentzntionigannot purchas all the applications required for analyzing data. Therefore, to fulfil their requirement of adva" tools and technologies, organizations often turn, to open-source libraries, These libraries can defined as pools of freely available applications and analytical tools. Some examples of open-sou’ tools available for data visualization are VTK, CaveSD, ELKI, Tulip, Gephi, IBM OpenDX Public, and Vis5D. ableau 730 . es ae Misnalizationss Nmsistent, and reusable ‘They deliver high-quality performance ML AS COUR WiRh Uh Wats a Well as mobile Web security, In addition, they provide Aalvtive fr MUNtoLing as wll ae Customized business solutions that can be altered WAH CARING business domanits, QpRISOAEN H00b8 ATW aRY fa se, Sig Data Visualization Analytleal teehiviques are asect to analyze complex relationships among variables. The fol lowing are AE MANMVONLY: eRd analytical feo niques for Big Data solutions: A Regroaston analysis tr is 9 Malistical too! used for prediction, Regression analysis is used to Wreatlet Continuous clopenctent Variables from independent variables In this, we try to find the ‘sfivet of ane vatiable an other yariatie For exany ple, sales increase when prices decrease. Apes of TeBTESAION ANalysis are as follows: Y Ortinary least squares Tegression—It is used when dependent variable is continuous and ANE ONE SOME Avlationship between the dependent variable and independent variable =I is used when dependent Variable has only two potential results. Modeling It is used when data is in nested form. to measure length of process, 9% Grouping methods The technique of categorizing observation . Si into Significant or purposeful ‘Dhocks is called Srouping, The recognition of features to create a distinction between groups is ‘Salles discriminant analysis, on models It is used to analyze causal pathways irom independent variables ‘aviables Types of multiple equation models are as fellowes a analysis several tools and techniques used in Big Data visualization hat are most widely used for Big Data visualization. Some ‘Visualization are listed as follows along, with their use: Chapter 26 ben en ne SO * Collaborate with others Q Tableau Reader ‘+ Share visualizations and dashboards on the desktop + Filter, sort, and page through views © Use “Acrobat for Data” * Canbe downloaded free @ Tableau Public * Create and publish interactive visualizations and dashboards * Embed in websites and blogs * Get free download and free hosting service Installation of Tableau Public Tableau Public is an open-source tool that is used for data visualization. You ca: Product to install it on your computer system. We have downloaded Tableau Public our Windows 8.1 operating system. Perform the following steps to download and Public on a Windows desktop: 1. Open the following link in your Web browser: hitp://www.tableau.com/ public/ The home page of Tableau Public will appear, as shown in Figure 26.22: Data in. Brilliance out, ere mn vow obleov' pub Figure 26.23: The Download Tableau Public Window 3. Enter your e-mail ID in the Email Address text box and click the Download Tableau Public button (Figure 26.23). ‘The Thanks for choosing Tableau Public window appears, as shown in Figure 26.24: Figure 26.24; The Thanks for choosing Tableau Public Window 4. Go to the location in your system where you have saved the Tableau Public setup and double click the executable file, The Welcome to Tableau! dialog box appeats, as shown in Figure 26.25: Figure 26.25: The Welcome to Tableau! Dialog Box 733 Chapter 26 2 ts 5. Click the checkbox to accept the License Agreement (Figure 26.25). The Install button will be enabled, as shown in Figure 26.26: +ableau*public ctr tet Figure 26.26: Enabling the Install Button 6. Click the Install button. The Installing Tableau Public dialog box ap depicting the progress of the setup installation, as shown in Figure 26.27: ; Figure 26.27: The Installing Tableau Public 8.3 Dialog Box After the installation is complete, the Tableau Public window appears, as shown in Figure 2 26.28: Now youn eee Ease ae Tableau Public Window to represent the results of data analyses in vi ae # that You have downloaded and installed 734 Data Visualization! 7, Click the Open Data button (Figure 26. 28) to start loading data in Tableau Public. The connectivity window appears, as shown in Figure 26.29; Figure 26.29: The Data Connectivity Window You can load data stored either in a file (Microsoft Excel, Microsoft Access, or Text file) or on a server. In this case, we are loading a sample'set of data, stored in an Excel file, by clicking the Microsoft Excel link (Figure 26.29). You will be prompted to select the file. Once the data is loaded, the next window appears, as shown in Figure 26.30: ray sees hire te S| 735 nt ‘sheet to see the next window, as shown in Figure 26.31: Figure 26.31: Showing the Loaded Data Sheet = 9) Click the Go to Worksheet button (Figure 26.31). = “The window showing the datasheet view of the loaded data appears, as shown in Figure 26. ieee ceaeeicanet ~» 8 Drag and drop your data sheets from the left frame to the right frame or double click w dato . In our case, we hav? a. Ea ‘A scatter plot is created between longitude and latitude, as shown in Figure 26, Figure 26.33: Showing Scatter Plot for Country Name 11. You can also select any other type of graphical view for the sheet from the Show Me selection box that appears at the right-hand corner of the screen. In our case, we have selected the map view, as shown in Figure 26.34: Figure 26.34: Showing Map View for Country Name Thus; you can easily create various types of visuals for your datain Tableau Publi. Summary In this chapter, you the various techniques used for vi ovis iques in various Jeamed about the basics of data visualization, types of data visualization, and isualizing data. Next, you learned about the application of data industries. Towards the end, you learned to install the Tableau 737 Chepter 26 Quick Revise Multiple-Choice Questions Ql. Which of the following machine learning technique uses input to study the effects on Ans. Q Fe? 8g Ans, outputs? a. Supervised learning b. Unsupervised learning c, Semi-supervised learning d, Transduction The correct option is d. Visual analytics is the science of supported by the interactive visual interface. a. Computers b. Analytical reasoning c. Optics d. Technical tools The correct option is b. Which of the following tools provides state-of-the-art technology to store and process Big Data? b. Google Analytics a. Yahoo c. Twitter Map d. Hadoop . The correct option is d. Visualization of data produces images that are filtered with the help of techniques. a. Clutter reduction b. Predictive analytical ©. Logical reasoning d. Open source The correct option is a. Which of echoing Ris eT ee methods is used for categorizing a. Regression analysis b. Grouping methods © Multiple equation models d. Structural equation modeling The corre isb. Subjective Questions Ans, Jury thi gatements to analyze, comprehend, and share information’ vs wena este technique for representing the result of data analysis as hee 2 a c Fa huge amount of information into the human bron Visual images help in Patterns or processes easily, fei and ‘distinction between different Data Visualization-| a Visual interpretations help in exploring data from different angles, which help gain insights. @ Visualization helps in identifying problems and understanding trends and outliers. @ Visualization points out key or interesting breakthroughs in a large dataset. Which technique is used to analyze multivariate sets? ‘Ordinogram is used in the analysis of multivariate sets. Name a few data visualization tools. Some commonly used data visualization tools are: Excel Last-Forward Arc Google Charts API ‘TwittEarth Tag Galaxy D3 Rootzmap Q4. Discuss some applications of data visualization. Ans. Data visualization tools and techniques are used in various applications. Some of the areas in which we apply data visualization are as follows: 2 Education—Visualization is applied to teach a topic that requires simulation or modeling >of any object or process. Have you ever wondered how difficult it would be to explain "any organ or organ system without any visuals? Organ system or structure of an atom is best described with the help of diagrams or animations. Q Information—Visualization is applied to transform abstract data into visual forms for / easy interpretation and further exploration. ae Production—Various applications are used to create 3D models of products for better and manipulation. Real estate, communication, and automobile industry ‘extensively use 3D advertisements to provide a better look and feel to their products. - field of science, including fluid dynamics, astrophysics, and medicine use representation of information. Isosurfaces and direct volume rendering are to explain scientific concepts. af Systems visualizationSystems visualization is a relatively new concept that integrates visual techniques to better describe complex systems, ‘Visual communication—Multimedia and entertainment industry use visuals to communicate their ideas and information, PER - oooooocoao ter 26 @ Visual analytics—It refers to the science of analytical reasoning supported by the interactive visual interface. The data generated by social media interaction is interpreted using visual analytics techniques. %5. List and discuss various types of data visualizations. ins. The different types of data visualizations are as follows: Q 1D (Linear) Data Visualization—In the linear data visualization, data is presented ir form of lists. Hence, we cannot term it as visualization. It is rather a data organization technique. Therefore, no tool is required to visualize data in a linear manner. Q 2D (Planar) Data Visualization—This technique presents data in the form of imag diagrams, or charts on a plane surface. Cartogram and dot distribution map are exam of 2D data visualization. Some tools used to create 2D data visualization patterns ‘GeoCommons, Google Fusion Tables, Google Maps API, Polymaps, Tableau Public, etc Q 3D (Volumetric) Data Visualization—In this method, data presentation involves exac three dimensions to show simulations, surface and volume rendering, etc. General used in scientific studies. Today, many organizations use 3D computer modeli volume rendering in advertisements to provide users a better feel of their products create 3D visualizations, we use some visualization tools that involve AC3D, Auto TrueSpace, etc. Q ‘Temporal Data Visualization—Sometimes, visualizations are time dependent. To visualize the dependence of analyses on time, the temporal data visualization is us which includes Gantt chart, time series, sanky diagram, etc. TimeFlow, Timeline JS, Ex Timeplot, TimeSearcher, Google Charts, Tableau Public, Google Fusion Tables, etc. are some tools used to create temporal data visualization, a Maltidimensional Data Visualization—In this type of data visualization, numerous dimensions are used to present data. We have pie charts, histograms, bar charts, ets. to exemplify multidimensional data visualization: Many Eyes, Google Charts, Tableau da Public, etc. are some tools used to create multidimensional data visualization. @ Tree/Hierarchical Data Visualization—Sometimes, data relationshiy need to be shown in the form of hierarchies, To represent such kind of ene we use tree or hie data visualizations. Examples of tree/hierarchical data visualization inclw:e ah ttee, wedge-stack graph, etc. Some tools to create hierarchical data are d3, Google Charts, and Network Workbench/Sci2. eas 9 Network i, Visualization—It is used to represent data relations that pl tobe hte form of hierarchies, Some ex: i include matrix, me examples of network data visualization UCINET GUESS Nor glnk diagram hive pot Pajek, Gephi, NodeXt, VOSvie=t Fusion Tables,ete. nh Ses sigmais, d3/Protovis, Many Eyes, Goosle arate Chapter 27 In the previous chapter, you learned about data visualization and its importance in f analytics. You also learned about the various types of tools and techniques used to visu, Among thé various tools and techniques used for analyzing data, Tableau is the most commo; a software tool that can generate various kinds of visuals for data representation. Various 'ableatt Products are available in the market, which include Tableau Desktop, Tableau Ser, Tableau Reader, Tableau Public, etc. We have already discussed the basics and the installa: Process of the Tableau Public tool in the previous chapter. Tableau tools are easy to use create complex visualizations from given datasets, Along with this, they involve sharin, filtering features as well. Additionally, you can access multiple file formats and types for presen... different visual patterns, such as graphs, maps, charts, plots, ete. © Invthis chapter, you learn about the Tableau software, especially the installation and usage fableau Desktop and Tableati-Public software tools..'The chapter also outlines some + nical specifications of Tableau and its use in various departments and data environments. You alsc learn «- about the Tableau workspace and how to present data visually in Tableau. =z : Introduction to Tableau Software Tableau products are the most widely used various mmonly known products include aaa eon popula Sleeps one Heat software tools is Tableau Desktop, which provides | through approach used i ri Desktop tool takes: breakthr i to build the Tableas users in spoting meuenge Converts them into optimized database queries, which tar While working with tho Ta trends, and deriving logical conclusions and insights can. instead, be discovered by jst Sona ata analyst need not write any code; all the insights can easily connect te ee ea a towng ibe thoughts that strike the to direct) An Or on the server. peed OF Table ne? Sata from the server or load it in the Tableau data B-and-drop techng a fast as the thoughts of human beings, a! cise tho dak eh can line in a guided for the an also help you create story ® insights obtained from data, Moreover, you can we and Tableau Server tools, for content. tude the following: The important features of Tableau Q Single-click data analytics in visual form — 42 Hot Spots, 2002-2012 n iM ay Laryent fr0e by lneation : oe | elt ‘wn L y) Data Visualization with Tableau (Data visualization) z ws the ‘Tableau can display the structure and organization of a sports team. Figure 27.3 sho organization of a college football t Reese cout see ponmam | tagms | ee Figure 27.3: Displaying Organization of a College Football Team Tableau can be used to compare the performance of various business enterprises in the Figure 27.4 compares the growth of top 100 entrepreneurs: Tale of 100 Entrepreneurs Grom etry by Company 745 Ghagter MI sy Scatter plots in Tableau help to determine relationships between different elements An example j shown in Figure 27.5; Figure 27.5: Determining Relationships by Using Scatter Plot ‘Tableau software tool helps to create dashboards, as shown in Figure 27.6: a ‘After learning about a few of t to discuss Tableau Desktop Workipa (46 Data Visualization with Tableau (Data Visualization-l) Tableau Desktop Workspace Tableat's desktop efvironment is simple and easy to leam almost for anyone. However, those who have already worked on spreadsheets or other analytical tools would find it even simpler. The/Tableau data analytics tool analyzes data with almost the speed of thought. In other words, Tableau isan extremely efficient tool, which can answer your questions about deta analytics quickly. se cman Desktop, you first need to download and install the tool on your computer. Perform the following steps to download and install the Tableau software: 1. Open the following link in a Web browser: Iittp crow tableau.com/products/desktop Po “St anit Spon Pts ot Pe ee scien: ae s» TABLEAU DESKTOP ches naps eee 2 te. > analytics anyone oF Se eS at. a i Figure 27.7: The Tableau Desktop Home Page the trial version of the Tableau Desktop by clicking the TRY IT FREE button are 27.7). The downloading process will start and the Thanks for choosing a free trial of ‘screen appears, as shown in Figure 27.8: 7aT ‘Thanks for choosing a free tral of Tableau Desktop. ‘Mh en hd a any vo Figure 27.8: The Froe Trial Page 3. Go to the directory of your computer where you have stored the Tableau Desktop se’ d double'click the executable file, The Tableau Desktop installer, showing the Tableau version number on top, will appear, a8 shown in Figure 27.9; Bene ‘ Figure 27.9: The Tableau Desktop installer Click the Ihave read and accept the terms of this License Agreement check box (Figure 27.9) to i ee a 748 ta Visualization-l) Data Visualization with Tableau (Da! 5, Click the Install button to start the installation of the Tableau Desktop. The progress of installation appears, as shown in Figure 27.11; ae en 3 en tie) artes te netemaraee ns missin om Figure 27.11: Showing the Progress of Tableau Installation The installation of Tableau gets completed. The Activate Tableau dialog box appears, as shown in Figure 27.12: Be) nee MES ca Ca Figure 27.12: The Activate Tableau Dialog Box ‘The Activate Tableau dialog box contains three options: Start trial now, Start trial later, and activate the product. To activate the product, you require a product key, which needs to be __ Purchased from the Tableau website. However, you are eligible to continue the trial by clicking __ theStart trial now option. ‘ trial now option (Figure 27.12). This will ask for registration, as shown in Figure 27. 749 Chapter 27 de all the required details. 7. Click the Register button to open the registration form and prov Figure 27.14 shows the completion of the registration process: Figure 27.14: The Dialog Box after Completing Registration ick the Continue button (Figure 27-14). The open page of Tableau tool appears, as shows in Figure 27.15: : "Figure 27.15; The Open Pago of Tableau Desktop ‘The Connect to data button inthe Open page takes you tothe data connection workspace, wher data along with the four sample data sources that 2 sources “the Home button to view the Workbooks area. Tableau s*\°° provided by Tableau, by the last opened nine wo ig The Getting Started area at the screen provides learning and promotional materials ‘The Sample Workbos me sample workbooks. For more samples, you co" visit sampleslink, fies Hania PSE Te age per 7 ‘The ixons, shown in Figure 27.18, are discussed as follows: 2 Umidefitedio“Scrofis backward or forward on the screen. You can retrieve any step by clicking the undo/redo bation. @ File Save—Saves your work You need to click this button frequently as Tableau does not have the automated eave functor, D Commect to a New Data Source—Connects you toa data source. 2 New Dashboard or Worksheet—Adds new pages to your worksheet. 2 Duplicate SheetCreates an exact/duplicate copy of a worksheet as well as of the dashboard page that you are working on. 2 Cheer Sheet—Allows you to clear data of a sheet. 2 AutofManaal Update—Generates visual. It is particularly helpful for large datasets where 2 Group—Allows you to group data by selecting more than one headers in a table or values in a 3 Pivot Worksheet—Allows you to create a pivot table on a new worksheet. @ Ascending/Desceading SortSorts selected items in an ascending or descending order. (Label Marks—Turns on or off screen elements. 5 Presentation Modé—Hides/unhides design shelves. It is particularly used during presentations bea use Tablean as a presentation Slide deck to keep the slides of the 2 ar ac Copier ce emer 2 ea oe You can fit the screen either Fit Axis—Fixes the axis of view. ‘You can zoom in/out charts with this button. igure 27-19, are discussed as follows: meh Ope Sr and Save As. Other functions are Print t© and change the default location of the a Website. The Edit Relationships option is i Provi nn umesare aden ‘erosstab, and Duplicate as crossta ) with Tableau (Data Visualization-) Dashboard—Provides the Ac: tions me : Peet pera seerell:ths actions Re ee is the most important option on the Dashboard ESF Yl Tie Actions cn to Tableau worksheets and dashboards are defined within * a, fons menu is present under the Worksheet menu as well. Story- me vides the New Story option that is used f ir Bar aitecetiesk to certin events, shows used for explaining the relationship among facts, a aes ei nts, showing the dependency between decisions and outcomes, link various events, occurring one after another: a Analysis—Provides the Aggre; gate Measures and Stack Mark options. To creat ae fe new measures 07 dimensions, use Create Calculated Field or Edit Calculated Field ao a Map—Provides opti Ma p—I asp ions to change the color scheme and replace the default maps. You can also \port geocoding for custom locations by using the Geocoding menu. a evened spices Be Size and Workbook Theme. It is one of the least used menus of its features by simply right click i F ry ching yy simply right clicking at the point where you want to a Server—Provides options to publish your work on the Tableau server. This menu is available for those who are not using Tableau professionally. 2 Window—Provides the Bookmark menu, which is used to create .thm files that can be shared with different users. a Help—Provides options to access Tableau’s online manual, training videos, and sample workbooks. Tableau Server Tableau Server is a great tool for those who need to upload their files regularly and share their work with others. It can automatically refresh data extracts published in it by the users. Tableau Server can be accessed either by having a named user license or a core license. A named user license is ideal for smaller deployments (> 150 users), whereas a cor license is good for those who need dynamic access. Core license is a cost-effective solution because it drastically reduces administrative tasks. Tableau Server is compliant with security laws and permits (as defined by the server administrator). Its user interface is and organize reports. Data Window Data source in Tableau appears its customers to access tailored reports intuitive and provides functions to find You can open different data connections in a as a data window. data connections ina single data window: single data window. Figure 27.20 shows two different 753 tiaras Data Visualization with Tal ahtzationrH) Tableau Workbook and Data Source Files Depending on their utility and the files as: @ Tableau Workbook—It is the default save type when you save your work on the desktop. The een of e files will be . twb. The files with extension .twby can be shared with people not ae Bh leau Desktop license or those who cannot access the data source. The . tut files are small in size and include information to visualize data “Pébleaw Data Source—If you frequently connect to a specific data source or if you have manipulated the metadata of any data source, saving the file as Tableau data source is of great use. The extension of such a file will be .tds, and it includes server address, password, and metadata. 9 Tableau Bookmark—If you want to share any specific file with others, use Tableau bookmark. You can create a bookmark by following the path— File menu/Window/Bookmark/Create bookmark. The extension will be . thm amount of information they contain, Tableau saves and shares a Tableau Data Extract—It compresses your extracted data and improves performance by incorporating more formulas and functions. The extension of a Tableau Data Extract file is .tde. Tableau Charts Tableau can create different types of univariate, bivariate, and multivariate charts, depending on your requirements. The following are some of the common chart types that Tableau can create: Q Tables—Tables are an excellent choice of presenting data as they preserve all the information, which in turn minimize the chances of misinterpretation. 2 Scatter Plots—Scatter plots are used to describe the relationship between two variables. These plots cannot explain a causal relationship. Trend Lines—Trend lines are used to analyze the relationship between variables as well as predict the future outcome. 2 Bullet Graph—Bullet graph is just like a bar graph and is generally used in qualitative analysis Ithelps in measuring the same variable in different categories. Box Plot—Box plot represents distribution of data and is used in the comparison of multiple sets of data. It can effectively compute: ~ Minimum and maximum value * Median Begs and 75% quarsle pest compact techniques to visualize the part-to-whole 2 Treemap—Treemap is one of the relationships as well as hierarchical models. i ii i Vi df 2 Bubble Charts—Bubble charts help in categorizing and comparing different values and factors in the data with the help of bubbles. 755 ‘thapter 27 ‘An example of a bubble chart is shown in Figure 27.21: + 7 + ‘aren pe 100,000 pepatn Figure 27.21: Bubble Chart 8 Word Cloud—Word cloud is an interesting type of visualization that Tableau has incorporated in its 80 version. Similar to bubble charts, the words in a word cloud are sized according to the frequency at which they appear in the content. "Antexample of a word cloud is shown in Figure 2722: 756 Data Visualization with Tableau (Data. visualization-l) Operations on Data lata involving complex and intricate patterns is 4 tedious task. chniques for Analyzing large amounts of d. Representing the outcomes of the analysis is even more difficult. We need some tes i manner 80 that it is easy to interpret analyzing data and representing outcomes in a systemati terns and trends. Creating visualizations on a given dataset makes it easy fo interpret patterns 1s software pal eee looking at the figures and charts prepared from the dataset. Among the variow! oe for or fectien of visualizations and dashboards, Tableau Public is used by ctatisticians to represent the outcomes of data analysis carried out on large d imilar to oth lytical ; ; ys ge datasets. Similar to other sralytical tools, Tableau also supports different types of mathematical and statistical data operations such as: Sum Average Median Standard deviation Standard deviation of a population Variance Variance of a population Attribute Dimension In Tableau, you can also use conditional statement such as i else Additionally, Tableau permits you to alter the default aggregation of data and change its level Data Analytics in Tableau Public Fequremanis outined by Mr. Richard Taylor, Me Rachel Keith takes @ sample dataset n Excel sheet file named OlympicAthietes_0.x!sx. proved with the Tableau Public setup. The dataset is 2 Cr evrece tho data to create some graphs and dashboards of Tableau Public software that she earlier Prpws 8.1 operating system. The steps performed by Ms. Ket in downloaded and installed on her Wi visuals are discussed as follows. ©. Open the Hone page of the Tableau Public tool Fe cane wo Data option from the Dats meni fo connect to Fee vce dotset 26 shown Boke! oo Gum bebe te asalece po 757 bleoupublic =o Open. Create. Share. eee Figure 27.23: Connecting to Data in Tableau Public A window appears with the Connect frame on its left-hand side (Figure 27.24). You can alsc establish the connection to the data by using the Open Data button (Figure 27 23). 3. Select the file type in which the data is stored from the Connect frame. In our case, we have Selected Microsoft Excel, as shown in Figure 27,24: | The Open dialog box appears (Figure 27.25). i me ining the sample data used by Ms. Keith and double click it, as shown in Figure 27.25: Selecting a Particular File for Analysis in Tableau connection establishment process completes after the file has been ut Public screen appears, as shown in Figure 27.26: selected. The 759 SE ee e _ 5. Glick the Go to Worksheet button (Figure 27.26) to get the data in the Worksheet form, as shy in Figure 27,27; Data Visualization with Tableau (Data Visualization-l) You can add more dimensions thuthe Col he Y dimension, as shown in Figure 27.29: lumns field. In our case, we have added the Year "ies Me sis Figure 27.29: Adding More Dimensions for Analysis. 7 Specify particular analytical measures by dragging individual dimensions from the Measures ~~~ Section and.dropping them in the Rows field. In our case, we have selected Total Medals - as shown in Figure 27.30: _ ms Ea 8. Select a different kind of chart from the Show ‘corner, to view a different pattern, In our case, Me selection box, appearing on the upper righ we have selected the lines chart, as shown in Figure 27.31: Line Chart for Selected Dimensions ‘A line chart cannot be drawn for dimensions involving more than one column. 9. “Rename the sheet by selecting the Rename Sheet option after right clicking the panel given at the o ‘of the window. In our case, we have right clicked Sheet 1, as shown in Figure 27.32: tt Visuatization with Tabtoan (Data Visuslizeionr i) You can create more visuals, includit . ing bar chart, discrete line chart, bubble chart, etc., by clicking different options from the Show Me selection box. 10. Create a bar chart for the bronze medals the bar chart option, as shown in Figure ~ by different countries over the years by clicking, Figure 27.33: Creating a Bar Chart for Selected Dimensions is possible to create dashboards. To create a dashboard for your analysis, vou need » New Dashboard icon, the right-most icon given on the status bar of the Tableau w. A new dashboard appears, showing the sheets of the dataset being used, as the clarity of vision by selecting the respective range | section on the lower left corner, as shown in “Through this, we were able to see problem areas on a map and prioritise which jobs need to be attended first,” says Wilson. "It also meant we could group work in one particular area to make sure it all happens in the same work package in the same year This saw us save time and money as well being much more efficient.” Finally, Fahey applauds the user experience and ease of implementation Tableau has provided. “One thing that stands out for me is Tableau’s not an engineering tool, it’s not a financial tool and it's not even an IT tool.” I's something that is easily understood everyone can implement it in their work. Rather than buying separate tools we're able to use software that works for all employees within the company. Tableau has been more helpful than we could have imagined. ‘Source: https:/houro.tablenu.com/learn/stories/mainpower spreadsheets-speed-and-oi Using Visual Controls in Tableau Public Tableau Public software provides a number of controls for creating visualizations for a given data For example, weare taking a sample dataset for the temperature of some cities. The sample dataset i stored in an excel sheet, named city temp data.x1s, which contains certain information «i _ includes the city name in the City column, average temperature of the city in the month of January in the JanTemp column, and latitude and longitude values in the Lat and Long columns, You can "connect to this data in Tableau in the same way as you did earlier to load the excel sheet data. The __ dataset, in our case, appears as shown in Table 27.1: Table 27.1: Sample Dataset Containing Average City Temperature Data Visualization with Tableau (Data Visualization) 86.9 39.8 418 il | [| tT _We are using the Tableau Publi Ulto show how visual controls are applied on the given dataset When you visual controls in Tableau Public, the window with the eager: as shown in Figure 27. 37: menu bar, toolbar, data connection fram? frame (showing pages, filters, and marks), and ———— Date Vieuatization with Tableau (Data Visualizationrll) Click the Detail option of the Marks section and then select a bar from the bar chart to see the det about the selected bar, as shown in Figure 27.41; ails Figure 27.41: Getting Details You can also present the visuals in other format. For example, Figure 27.42 shows a bubble chart created for the same dimensions as above: 3 th ks os faa bis i | rer ncstananagpaactins Figure 27.42: ‘Showing a Bubble Chart is provided in the advanced tion features designed for the user 1 eee Pata Visualization with Tabteau (Data Visualization-t) * Parallel queries * Panand zoom * Persisted query cache for Tesuming the previous task o: Tableau 9.0 grabs the better Part of source databases toe time for faster and efficient dashboard pers technology for database connections called them, and converts them in fey Smart maps—Tableau 9.0 introduces Rew selection controls that include geocoding data and geographic search capabilities, which make it easier for end users to answer questions that circulate around the notion of “where?” For example, new radial and lasso selection controls Admin views—In Tableau 9.0, a powerful admin view is introduced in which an admin can track on activities of the user like providing Fesources, identifying bugs, and at the same time admin can debug it and report a log for the same issue. n next window opening xecute more powerful queries at a given ‘Srmance on desktop and server. It introduces a new Query Fusion that takes all the queries and simplifies 'Y Queries that increases throughput with less work Let's have the first look of Tableau 9.0. 1 Start Tableau 9.0 on Windows 7 Figure 27.44 shows the Start-up of the Tableau 9 public edition on Windows 7. itableau Tableau Desktop z Version 9.0) 27.44; The Start-up of Tableau (V et, the first window which we get is shown in Figure 27.45: Figure 27.45: Opening of Tableau 9.0 3. Figure 27.46 shows how the data is uploaded on Tableau 9.0 for the data analy Documents library Data Visualization with Tableau (Data V 4, The screen appears after loading the data in Tableau 9,0, as shown in Figure 27.47: 5. Now, ‘le Oma Widow Heo eon Sheet1 (census_ data) Woreat Sheth ee 23484788 sae aaaso7 anon fhe Gor wornert sai ose NE TESTE view ews nit Fees Wiener rere mason asr531 106308 aor ga09ss soz73 504 546657 e017 10965200 sussas 250552 21380 Figure 27.47: Showing how Data is represented in Tableau 9.0 after Loading the data is represented in the bar graph, which is shown in Figure 27.48: Figure 27.48: Showing Data in Bar Graph in Tableau 9.0 apter 27 Figure 27.49 shows the Treemap of census data, which is given state-wise in Fabheau Figure 27.49: Showing a Treemap of the Data Loaded in Tableau 9.0 Figure 27.50 shows the state-wise population in Tableau 9.0 public edition: a Representation in Horizontal Bar of Tableau 9.0 § data, which we have used in the analysis. Tableau 9.0 has its owr rt he ti form, which helps the user to identify the null value in Fie Hae rr __Data Visualization with Tableau (Data Visualization-I) > Figure 27.51 gives the field information of the data: jain ot to Figure 27.51: Dataset used for the Tableau Analysis esa? Sa ji = d you to the basics of Tableau software, especially the installation and usage and Tableau Public software tools. It started with an overview of the Tableau t outlined some technica 2 QS, In Tableau, which of the following charts will you use for comparing multiple sets of data and representing thelr minimum and maximum values? 4. Scatter plot b Trend fines Bullet graph Box plot Ans. The correct option isc. Q4, Which of the following Tableau software is suitable for data sharing? & Tableau Desktop b, Tableau Server Tableau Reader d, Tableau Public Ana. The correct option in b, QS. Which of the following toolbar icon» allows you to clear data of the sheet? 4 Undo/Redo b. File Save c. Duplicate Sheet d, Clear Sheet Ans. The correct option in d. ‘Widely used software tools for data visualization. There are - various types of Tableau products | wattle in the market, Some of the commonly known Tableau Desktop, Tableau Server, ‘Tableau Online, Tableau Reader, and w of Tableau software tools is Tableau Desktop, which create visualizations, The breakthrough approach used to build tal Gf data and converts them into optimized database ch h ‘users in spotting Patterns, identifying trends, and deriving logical and insights, While working with the Tableau Desktop, the data analyst need not ‘code; all the insights can, instead, be discovered by just connecting to the data and p thoughts that strike the mind naturally, You can easily connect to your data, in the Tableau data engine from a disk. The speed of ‘of human beings, and everything can be done with in the form of dashboards, which can be d filtering data, Dashboards can also help you ing the insights obtained from data, Moreover, 4 Online and Tableau Server tools, for Data Visualization with Tableau (Data Visualization-t 2, List some important features of Tableau software. ins, The important features of Tableau software include the following: Q Single-click data analytics in visual form In-depth statistical analysis Management of metadata In-built, top-class data analytic practices In-built data engine Big Data analytics Quick and accurate data discovery Business dashboards creation Various types of data visualization Social media analytics, including Facebook and Twitter Easy and quick integration of R Business intelligence through mobile Q Analysis of time series data 9 Analysis of data from surveys Mio 0 oe ob Dae oP Q3. How would you open a new workspace page in Tableau? Ans. To open a workspace page, drag the desired data source icon and drop it on the Tableau icon. You can open several connections in Tableau by going to the start or data connection page ___and selecting a new connection. Q4 List and discuss the icons present on the Tableau toolbar. Ans. The following icons are present on the Tableau toolbar: Undo/Redo—Scrolls backward or forward on the screen. You can retrieve any step by clicking the undo/redo button. File Save—Saves your work. You need to click this button frequently as Tableau does not haye the automated save function. ‘Connect to a New Data Source—Connects you to a data source. New Dashbéard or Worksheet—Adds new pages to your worksheet, : Skea Duplicate Sheet—Creates an exact/duplicate copy of a worksheet as well as of the dashboard. page that you are working on. Clear Sheet—Allows you to clear data of a sheet, “Aufo/Manual Update—Generates visuals. It is particularly helpful for large datasets ore oe dropping items consume time, naptor 27 a Group—Allows you to group data by selecting more than one headers in a table or values ina legend. Pivot Worksheet—Allows you to create a pivot table on a new worksheet. Ascending/Descending Sort—Sorts selected items in an ascending, or descending order. Label Marks—Turns on or off screen elements. Presentation Mode—Hides/unhides design shelves. It is particularly used during presentations where you want to use Tableau as a presentation slide deck to keep the slides of the presentation. ooosnd @ Reset Cards—Provides a menu to turn on or the screen elements, such as Caption or Summary. @ Fit Menu—Allows different views of the Tableau screen. You can fit the screen either horizontally or vertically. Fit Axis—Fixes the axis of view. You can zoom in/out charts with this button @ Highlight Control—Compares the selected combinations of dimensions. What is a Tableau Server? @. Ans, Tableau Server is a great tool for those who need to upload their files regularly and share their work with others. It can automatically refresh data extracts published in it by the users. Tableau Server can be accessed either by having a named user license or a core license. A named user license is ideal for smaller deployments (> 150 users), whereas a core license is _ ~~» good for those who need dynamic access. Core license is a cost-effective solution because it _ drastically reduces administrative tasks, Tableau Server is compliant with security laws and permits its customers to access tailored reports (as defined by the server administrator). Its user interface is intuitive and provides functions to find and organize reports. eave landien ae

You might also like