ITISA1 Ch06 PowerPoint
Stair/Reynolds, Principles of Information Systems, 14th Edition. © 2021 Cengage. All Rights Reserved. May not be scanned,
copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Objectives (1 of 3)
Objectives (3 of 3)
Why Learn about Big Data and
Analytics?
• New data coming from all directions
• Nearly a zettabyte per year
• 1 zettabyte = 1 trillion gigabytes, or a 1 followed by 21 zeros
(1 000 000 000 000 000 000 000 bytes)
• Must analyze large amounts of data
• Measure past and current performance
• Predict the future
• Forecasts drive anticipatory actions
• Improve business strategies
• Strengthen business operations
• Enrich decision making
• Organization will become more competitive
Big Data (1 of 2)
• Big data
• Enormous (terabytes or more)
• Complex
• Too large and complex for traditional data management
processes to handle
• Key characteristics
• Volume
• Velocity
• Value
• Variety
• Veracity
Sources of Big Data
Big Data Uses
Technologies Used to Manage and
Process Big Data
• Technologies used to manage and process
big data
• Data warehouses
• Extract Transform Load process
• Data marts
• Data lakes
• NoSQL databases
• Hadoop
• In-Memory databases
Data Warehouses, Data Marts, and
Data Lakes (1 of 5)
Data Warehouses, Data Marts, and
Data Lakes (2 of 5)
Characteristic | Description
Large | Holds billions of records and petabytes of data
Multiple sources | Data comes from many sources, both internal and external; thus an extract, transform, load process is required to ensure quality data
Historical | Typically 5 years of data or more
Cross-organizational access and analysis | Data accessed, used, and analyzed by users across the organization to support multiple business processes and decision making
Supports various types of analyses and reporting | Drill-down analysis, development of metrics, identification of trends
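The extract, transform, load process named in the table can be sketched in a few lines. This is a minimal illustration, not a production pipeline: the two source lists, the `sales` table, and the cleaning rules are all hypothetical, and an in-memory SQLite database stands in for a real data warehouse.

```python
import sqlite3

# Hypothetical raw records from an internal and an external source.
internal_sales = [{"customer": " Alice ", "amount": "120.50"},
                  {"customer": "bob", "amount": "80"}]
external_sales = [{"customer": "Carol", "amount": "n/a"},
                  {"customer": "ALICE", "amount": "30.00"}]


def extract():
    """Extract: gather raw rows from every source."""
    return internal_sales + external_sales


def transform(rows):
    """Transform: normalize names and drop rows that fail a quality check."""
    clean = []
    for row in rows:
        try:
            amount = float(row["amount"])
        except ValueError:
            continue  # reject rows whose amount is not numeric
        clean.append((row["customer"].strip().title(), amount))
    return clean


def load(rows, conn):
    """Load: write the cleaned rows into the warehouse table."""
    conn.execute("CREATE TABLE IF NOT EXISTS sales (customer TEXT, amount REAL)")
    conn.executemany("INSERT INTO sales VALUES (?, ?)", rows)
    conn.commit()


conn = sqlite3.connect(":memory:")  # stand-in for the warehouse
load(transform(extract()), conn)

# Analysis across the combined, cleaned data.
totals = dict(conn.execute(
    "SELECT customer, SUM(amount) FROM sales GROUP BY customer"))
print(totals)
```

The transform step is where data quality is enforced: the row with a non-numeric amount never reaches the warehouse, and the two spellings of the same customer are merged.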
NoSQL Databases (1 of 3)
• NoSQL database
• Differs from a relational database
• Data modeled without two-dimensional tabular
relations
• Uses horizontal scaling
• Does not require a predefined schema
• Does not conform to true ACID properties when
processing transactions
• Structures used by NoSQL databases
• More flexible than relational database tables
• Provide improved access speed and redundancy
NoSQL Databases (2 of 3)
• Four categories
• Key-value NoSQL databases
• Two columns (“key” and “value”)
• Document NoSQL databases
• Store, retrieve, and manage document-oriented
information
• Graph NoSQL databases
• Well-suited for analyzing interconnections
• Column NoSQL databases
• Store data in columns
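The four categories above can be pictured with plain Python data structures. This is only a conceptual sketch: the records, keys, and the `friends_of_friends` helper are invented for illustration and do not correspond to the API of any particular NoSQL product.

```python
# Key-value: two "columns", a key and a value the database does not interpret.
key_value = {"session:42": "user=alice;cart=3"}

# Document: self-describing records; the two documents have different fields,
# which is possible because no predefined schema is required.
documents = [
    {"_id": 1, "name": "Alice", "orders": [{"sku": "A1", "qty": 2}]},
    {"_id": 2, "name": "Bob", "email": "bob@example.com"},
]

# Graph: nodes and the connections between them.
graph = {
    "alice": {"bob", "carol"},
    "bob": {"dave"},
    "carol": set(),
    "dave": set(),
}

# Column: all values of one attribute are stored together.
columns = {"name": ["Alice", "Bob"], "age": [34, 29]}


def friends_of_friends(g, person):
    """Follow two hops of connections, the kind of query graphs suit well."""
    direct = g[person]
    two_hops = set()
    for friend in direct:
        two_hops |= g[friend]
    return two_hops - direct - {person}


def column_average(cols, name):
    """Aggregate one attribute without touching the others."""
    values = cols[name]
    return sum(values) / len(values)


print(friends_of_friends(graph, "alice"))
print(column_average(columns, "age"))
```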
Hadoop (1 of 3)
• Hadoop
• Open-source software framework
• Includes several software modules
• Stores and processes extremely large data
sets
• Hadoop Distributed File System (HDFS)
• Distributed file system
• Used for data storage
• Divides the data into subsets
• Distributes the subsets onto different servers for
processing
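The divide-and-distribute idea can be illustrated as follows. This sketch does not use the HDFS API; the function names, the tiny block size, and the round-robin placement are assumptions made for illustration.

```python
def split_into_blocks(data, block_size):
    """Divide the data into fixed-size subsets (the last one may be shorter)."""
    return [data[i:i + block_size] for i in range(0, len(data), block_size)]


def distribute(blocks, servers):
    """Assign block indexes to servers in round-robin order (an assumed policy)."""
    placement = {server: [] for server in servers}
    for index in range(len(blocks)):
        placement[servers[index % len(servers)]].append(index)
    return placement


blocks = split_into_blocks(b"abcdefghij", block_size=4)
placement = distribute(blocks, ["server1", "server2"])
print(blocks)
print(placement)
```

Each server can then work on its own blocks in parallel, which is what makes the processing of very large data sets practical.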
Hadoop (3 of 3)
• MapReduce program
• Consists of two components
• Map procedure performs filtering and sorting
• Reduce procedure performs a summary operation
• Hadoop limitation
• Can only perform batch processing
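The two MapReduce components can be imitated on a single machine with a word count, a common teaching example. The sketch below runs in one Python process and is not Hadoop code; the function names and the explicit `shuffle` step that groups the sorted pairs by key are assumptions made for illustration.

```python
from collections import defaultdict


def map_phase(line):
    """Map: turn one input line into (word, 1) pairs."""
    return [(word.lower(), 1) for word in line.split()]


def shuffle(pairs):
    """Sort the pairs and group the values that share a key."""
    groups = defaultdict(list)
    for key, value in sorted(pairs):
        groups[key].append(value)
    return groups


def reduce_phase(key, values):
    """Reduce: summarize all values for one key (here, add them up)."""
    return key, sum(values)


def word_count(lines):
    pairs = [pair for line in lines for pair in map_phase(line)]
    return dict(reduce_phase(key, values)
                for key, values in shuffle(pairs).items())


print(word_count(["big data", "Big analytics"]))
```

In a real cluster the map calls run on the servers that hold the data blocks, and the job works through a complete batch of input before producing results.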
In-Memory Databases (1 of 2)
Analytics and Business Intelligence
The Role of a Data Scientist
• Data scientist
• Combines several skills
• Strong business acumen
• Deep understanding of analytics
• Healthy appreciation of the limitations of data, tools,
and techniques
• Delivers real improvements in decision making
• Highly inquisitive person
• Educational requirements: quite rigorous
• Job outlook: extremely bright
Components Required for Effective BI
and Analytics
• Three key components
• Existence of a solid data management program
• Includes governance
• Creative data scientists
• Strong commitment to data-driven decision
making
Business Intelligence and Analytics
Tools
Descriptive Analysis | Predictive Analytics | Optimization | Simulation | Text and Video Analysis
Regression analysis | Data mining | Linear programming | Monte Carlo simulation | Video analysis
Descriptive Analysis (1 of 6)
• Descriptive analysis
• Preliminary data processing stage
• Identifies data patterns
• Answers questions
• Who, what, where, when, and to what extent
• Two types
• Visual analytics
• Regression analysis
Descriptive Analysis (2 of 6)
• Visual analytics
• Presentation of data pictorially or graphically
• Word cloud
• Visual depiction of a set of words
• Words grouped together
▶ Based on frequency of their occurrence
• Conversion funnel
• Graphical representation
• Example: Summary of steps a consumer takes in
making the decision to buy a product and become a
customer
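Both visuals start from simple counts. The sketch below computes the numbers that a word cloud and a conversion funnel would display; it does not draw anything, and the sample text, stage names, and visitor counts are hypothetical.

```python
from collections import Counter


def word_frequencies(text, top=3):
    """Count how often each word occurs; a word cloud sizes words by this."""
    words = [word.strip(".,!?").lower() for word in text.split()]
    return Counter(word for word in words if word).most_common(top)


def funnel_rates(stages):
    """For each step after the first, the share of the previous step retained."""
    rates = []
    for (_, previous), (name, current) in zip(stages, stages[1:]):
        rates.append((name, current / previous))
    return rates


frequencies = word_frequencies("data data big data big analytics", top=2)

# Hypothetical counts of consumers reaching each step toward a purchase.
funnel = [("visit", 1000), ("add to cart", 200), ("purchase", 50)]
rates = funnel_rates(funnel)

print(frequencies)
print(rates)
```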
Descriptive Analysis (3 of 6)
Descriptive Analysis (4 of 6)
Descriptive Analysis (5 of 6)
Descriptive Analysis (6 of 6)
• Regression analysis
• Determines the relationship between a
dependent variable and one or more
independent variables
• Produces a regression equation
• Coefficients represent a relationship
▶ Between each independent variable and the
dependent variable
• Used to make predictions
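For a single independent variable, the regression equation y = intercept + slope * x can be fitted with ordinary least squares. The sketch below assumes that simple one-variable case, and the advertising and sales figures are invented for illustration.

```python
def fit_line(xs, ys):
    """Ordinary least squares for one independent variable."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx                    # coefficient of the independent variable
    intercept = mean_y - slope * mean_x
    return slope, intercept


def predict(slope, intercept, x):
    """Apply the regression equation to a new value of x."""
    return intercept + slope * x


# Hypothetical data: advertising spend (x) and sales (y).
ad_spend = [1, 2, 3, 4]
sales = [3, 5, 7, 9]

slope, intercept = fit_line(ad_spend, sales)
print(slope, intercept)
print(predict(slope, intercept, 5))
```

Here the coefficient says that each additional unit of advertising spend is associated with two additional units of sales in this made-up data set.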
Predictive Analytics (1 of 3)
• Predictive analytics
• Techniques to analyze current data
• Identifies future probabilities and trends
• Makes predictions about the future
• Time series analysis
• Uses statistical methods
• Analyzes time series data
• Extracts meaningful statistics and
characteristics
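One of the simplest statistics extracted from time series data is a moving average, which smooths short-term fluctuations so the trend is easier to see. The sketch below assumes that technique; the monthly figures are hypothetical, and using the last average as a forecast is a deliberately naive choice.

```python
def moving_average(series, window):
    """Average of each run of `window` consecutive observations."""
    return [
        sum(series[i - window + 1:i + 1]) / window
        for i in range(window - 1, len(series))
    ]


def naive_forecast(series, window):
    """Use the most recent moving average as the next-period estimate."""
    return moving_average(series, window)[-1]


monthly_sales = [10, 12, 14, 16, 18]  # hypothetical observations
smoothed = moving_average(monthly_sales, window=3)
print(smoothed)
print(naive_forecast(monthly_sales, window=3))
```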
Predictive Analytics (2 of 3)
• Data mining
• BI analytics tool
• Explores large amounts of data for hidden
patterns
• Predicts future trends and behaviors
• Used in decision making
• Three common data mining techniques
• Association analysis
• Neural computing
• Case-based reasoning
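Association analysis looks for items that tend to occur together. A common way to quantify such a pattern, assumed here because the slide does not name specific measures, is through support and confidence. The shopping baskets below are hypothetical.

```python
def support(transactions, items):
    """Share of transactions that contain every item in `items`."""
    items = set(items)
    return sum(items <= basket for basket in transactions) / len(transactions)


def confidence(transactions, antecedent, consequent):
    """Of the transactions containing the antecedent, the share that also
    contain the consequent."""
    both = set(antecedent) | set(consequent)
    return support(transactions, both) / support(transactions, antecedent)


# Hypothetical market baskets.
baskets = [
    {"bread", "milk"},
    {"bread", "butter"},
    {"bread", "milk", "butter"},
    {"milk"},
]

print(support(baskets, {"bread", "milk"}))
print(confidence(baskets, {"bread"}, {"milk"}))
```

A rule such as "customers who buy bread also buy milk" is considered interesting only when both measures are high enough for the business question at hand.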
Predictive Analytics (3 of 3)
Optimization (1 of 2)
Simulation
Self-Service Analytics
• Self-service analytics
• Training, techniques, and processes
• Empower end users to work independently
• Access data from approved sources
• Perform their own analyses
• Use an endorsed set of tools
• Advantages
• Gets valuable data into the hands of end users
• Encourages fact-based decision making
• Accelerates decision making
• Provides a solution to the shortage of data scientists
Summary