Unit Ii-Ba (2) - 1
Unit Ii-Ba (2) - 1
BUSINESS INTELLIGENCE
Data Warehouses and Data Mart - Knowledge Management –Types of Decisions - Decision
functions
Data warehouses and databases both are relative data systems, but both are made to
serve different purposes. A data warehouse is built to store a huge amount of historical data and
empowers fast requests over all the data, typically using Online Analytical Processing (OLAP).
A database is made to store current transactions and allow quick access to specific
transactions for ongoing business processes, commonly known as Online Transaction
Processing (OLTP).
1. Subject Oriented
2. Time-Variant:
The different data present in the data warehouse provides information for a specific period.
3. Integrated
A data warehouse is built by joining data from heterogeneous sources, such as social databases,
level documents, etc.
4. Non- Volatile
Operational System
An operational system is a method used in data warehousing to refer to a system that is used to
process the day-to-day transactions of an organization.
Flat Files
A Flat file system is a system of files in which transactional data is stored, and every file in the
system must have a different name.
Meta Data
A set of data that defines and gives information about other data.
Meta Data summarizes necessary information about data, which can make finding and work with
particular instances of data more accessible. For example, author, data build, and data changed,
and file size are examples of very basic document metadata.
The area of the data warehouse saves all the predefined lightly and highly summarized
(aggregated) data generated by the warehouse manager.
The goals of the summarized information are to speed up query performance. The summarized
record is updated continuously as new information is loaded into the warehouse.
The principal purpose of a data warehouse is to provide information to the business managers for
strategic decision-making. These customers interact with the warehouse using end-client access
tools.
Data Mart
Data marts are derived from subsets of data in a data warehouse, though in the bottom-
up data warehouse design methodology, the data warehouse is created from the union of
organizational data marts.
The fundamental use of a data mart is Business Intelligence (BI) applications. BI is used
to gather, store, access, and analyze record. It can be used by smaller businesses to utilize the
data they have accumulated since it is less expensive than implementing a data warehouse.
The second approach is Independent data marts (IDM) Here, firstly independent data
marts are created, and then a data warehouse is designed using these independent multiple data
marts. In this approach, as all the data marts are designed independently; therefore, the
integration of data marts is required. It is also termed as a bottom-up approach as the data marts
are integrated develop a data warehouse.
Other than these two categories, one more type exists that is called "Hybrid Data Marts."
Hybrid Data Marts
It allows us to combine input from sources other than a data warehouse. This could be
helpful for many situations; especially when Adhoc integrations are needed, such as after a new
group or product is added to the organizations
In Data Warehouse Data comes from In Data Mart data comes from very few
Source
many sources. sources.
The size of the Data Warehouse may The Size of Data Mart is less than 100
Size
range from 100 GB to 1 TB+. GB.
KNOWLEDGE MANAGEMENT
Types of knowledge
The definition of knowledge management also includes three types of knowledge—tacit,
implicit, and explicit knowledge. These types of knowledge are largely distinguished by the
codification of the information.
Tacit knowledge: This type of knowledge is typically acquired through experience, and
it is intuitively understood. As a result, it is challenging to articulate and codify, making it
1. Knowledge Creation: During this step, organizations identify and document any
existing or new knowledge that they want to circulate across the company.
2. Knowledge Storage: During this stage, an information technology system is typically
used to host organizational knowledge for distribution. Information may need to be
formatted in a particular way to meet the requirements of that repository.
3. Knowledge Sharing: In this final stage, processes to share knowledge are communicated
broadly across the organization. The rate in which information spreads will vary
depending on organizational culture. Companies that encourage and reward this behavior
will certainly have a competitive advantage over other ones in their industry.
Knowledge management tools:
There are a number tools that organizations utilize to reap the benefits of knowledge
management. Examples of knowledge management systems can include:
● Content management systems (CMS) are applications which manage web content
where end users can edit and publish content. These are commonly confused with
document management systems, but CMSs can support other media types, such as audio
and video.
● Intranets are private networks that exist solely within an organization, which enable the
sharing of enablement, tools, and processes within internal stakeholders. While they can
be time-consuming and costly to maintain, they provide a number of groupware services,
such as internal directories and search, which facilitate collaboration.
● Wikis can be a popular knowledge management tool given its ease of use. They make it
easy to upload and edit information, but this ease can lead to concerns about
misinformation as workers may update them with incorrect or outdated information.
● Data warehouses aggregate data from different sources into a single, central, consistent
data store to support data analysis, data mining, artificial intelligence (AI), and machine
learning. Data is extracted from these repositories so that companies can derive insights,
empowering employees to make data-driven decisions.
Benefits of knowledge management
Companies experience a number of benefits when they embrace knowledge management
strategies. Some key advantages include:
● Identification of skill gaps: When teams create relevant documentation around implicit
or tacit knowledge or consolidate explicit knowledge, it can highlight gaps in core
competencies across teams. This provides valuable information to management to form
new organizational structures or hire additional resources.
● Make better informed decisions: Knowledge management systems arm individuals and
departments with knowledge. By improving accessibility to current and historical
enterprise knowledge, your teams can upskill and make more information-driven
decisions that support business goals.
● Maintains enterprise knowledge: If your most knowledgeable employees left
tomorrow, what would your business do? Practicing internal knowledge management
enables businesses to create an organizational memory. Knowledge held by your long-
term employees and other experts, then make it accessible to your wider team.
● Operational efficiencies: Knowledge management systems create a go-to place that
enable knowledge workers to find relevant information more quickly. This, in turn,
reduces the amount of time on research, leading to faster decision-making and cost-
savings through operational efficiencies. Increase productivity not only saves time, but
also reduces costs.
● Increased collaboration and communication: Knowledge management systems and
organizational cultures work together to build trust among team members. These
information systems provide more transparency among workers, creating more
understanding and alignment around common goals. Engaged leadership and open
communication create an environment for teams to embrace innovation and feedback.
● Data Security: Knowledge management systems enable organizations to customize
permission control, viewership control and the level of document-security to ensure that
information is shared only in the correct channels or with selected individuals. Give your
employees the autonomy access knowledge safely and with confidence.
TYPES OF DECISIONS
Decision-making is the action or process of thinking through possible options and
selecting one. It is important to recognize that managers are continually making decisions, and
that the quality of their decision-making has an impact—sometimes quite significant—on the
effectiveness of the organization and its stakeholders. Stakeholders are all the individuals or
groups that are affected by an organization (such as customers, employees, shareholders, etc.)
Members of the top management team regularly make decisions that affect the future of the
organization and all its stakeholders, such as deciding whether to pursue a new technology or
product line. A good decision can enable the organization to thrive and survive long-term, while
a poor decision can lead a business into bankruptcy. Managers at lower levels of the organization
generally have a smaller impact on the organization’s survival, but can still have a tremendous
impact on their department and its workers. Poor decision-making by lower-level managers is
unlikely to drive the entire firm out of existence, but it can lead to many adverse outcomes.
Therefore, increasing effectiveness in decision making is critical, and using a model can help.
DECISION TYPES
⮚ structured decision
⮚ unstructured decision
⮚ semi-structured decision
⮚ Strategic decisions
⮚ operational decisions
Structured decision
An organization has a wide variety of decisions to make, ranging from highly structured
decisions to unstructured decisions. A structured decision is one that is made quite often, and
one in which the decision is based directly on the inputs. With structured decisions, once you
know the necessary information you also know the decision that needs to be made. For example,
inventory reorder levels can be structured decisions. Once your inventory of widgets gets below
a specific threshold, there is a need to order more. Processes like this, based on structured
decisions, are good candidates for automation. They can also be referred to as programmed
decisions.
For programmed decisions, managers often develop heuristics, or mental shortcuts, to help reach
a decision. For example, the retail store manager may not know how busy the store will be the
week of a big sale, but might routinely increase staff by 30% every time there is a big sale
(because this has been fairly effective in the past). Heuristics are efficient—they save time for
the decision maker by generating an adequate solution quickly. Heuristics don’t necessarily yield
the optimal solution, but a good solution. Heuristics are often used for programmed decisions ,
because experience in making the decision over and over helps the decision maker know what
to expect and how to react. Programmed decision-making can also be taught fairly easily to
another person.
Unstructured decision
Semi-structured decision
A semi-structured decision is one in which most of the factors needed for making the
decision are known but human experience and other outside factors may still impact the decision.
A good example of a semi-structured decision is the hiring process. Part of the decision is
structured (years of experience, education, etc.) and part of the decision is based on human
experience (for example: social skills, problem solving skills etc.) Take a look at how companies
are trying to program this decision and use algorithms. Semi-structured and unstructured
decisions are more challenging and systems may not be able to assist in the process fully.
However, advances
this.
Decision Type and System Type Pyramid
Decisions can also be classified into three categories based on the level at which they occur.
Finally, operational decisions refer to decisions that employees make each day to make
the organization run. For example, think about the restaurant that routinely offers a free dessert
when a customer complaint is received. The owner of the restaurant made a strategic decision to
have great customer service. The manager of the restaurant implemented the free dessert policy
as a way to handle customer complaints, which is a tactical decision. Finally, the servers at the
restaurant are making individual decisions each day by evaluating whether each customer
complaint received is legitimate and warrants a free dessert. Different information systems are
used at each level of the company structure to support the different decision types.
Members of the top management team regularly make decisions that affect the future of
the organization and all its stakeholders, such as deciding whether to pursue a new technology or
product line. A good decision can enable the organization to thrive and survive long-term, while
a poor decision can lead a business into bankruptcy. Managers at lower levels of the organization
generally have a smaller impact on the organization’s survival, but can still have a tremendous
impact on their department and its workers. Poor decision-making by lower-level managers is
unlikely to drive the entire firm out of existence, but it can lead to many adverse outcomes.
Therefore, increasing effectiveness in decision making is critical, and using a model can help.
Decision makers should use a systematic process for making decisions. The decision-making
process can be broken down into a series of six steps, as follows:
While these steps may seem straightforward, individuals often skip steps or spend too little time
on some steps. In fact, sometimes people will refuse to acknowledge a problem (Step 1) because
they aren’t sure how to address it.
Steps in Decision-Making Process
DECISION SUPPORT SYSTEMS
A decision support system (DSS) helps managers make decisions using interactive
computer models that describe real-world processes. These systems are designed to take inputs
regarding a known (or partially-known) decision making process and provide the information
necessary to make a decision. The DSS uses data from the internal database but looks for
specific data that relate to the problems at hand.
DSS’s can come in many different formats. A nicely designed spreadsheet that allows for
input of specific variables and then calculates required outputs could be considered a DSS. A
manager could create a spreadsheet to show the amount of overtime required if the number of
workers increases or decreases. Another DSS might be one that assists in determining which
products a company should develop. Input into the system could include market research on the
product, competitor information, and product development costs. The system would then analyze
these inputs based on the specific rules and concepts programmed into it. The system would
report its results with recommendations and/or key indicators to be used in making a decision.
Companies can use a predictive analytics program to improve their inventory management
system and use big data to target customer segments for new products and line extensions.
Goal Seek Analysis Goal seeking analysis-finds the inputs necessary to achieve a goal
Optimization Optimization analysis -finds the optimum value for a target variable
Analysis by repeatedly changing other variables
Most EIS’s have a business intelligence (BI) dashboard. A BI dashboard is a tool that
shows all of an organization’s key metrics and performance indicators on one screen. This
makes it easy for management to quickly see how the company is performing and address any
outliers or negative trends while also being able to identify positive trends. It also provides a
consolidated overview with the ability to drill down or filter when needed. During the COVID
pandemic many dashboards were created to track the virus as well as vaccine rates. One
example is the GIS Hub provided by Esri Canada that shares all information about the pandemic.
Expert Systems
An expert system gives managers advice similar to what they would get from a human
consultant. Artificial intelligence enables computers to reason and learn to solve problems in
much the same way humans do, using what-if reasoning. Although they are expensive and
difficult to create, expert systems are finding their way into more companies as more
applications are found. Lower-end expert systems can even run on mobile devices. Top-of-the-
line systems help airlines appropriately deploy aircraft and crews, critical to the carriers’ efficient
operations. The cost of hiring enough people to do these ongoing analytical tasks would be
prohibitively expensive. Expert systems have also been used to help explore for oil, schedule
employee work shifts, and diagnose illnesses. Some expert systems take the place of human
experts, whereas others assist them.
BUSINESS INTELLIGENCE
With the rise of Big Data and a myriad of new tools and techniques at their disposal,
businesses are learning how to use information to their advantage. The term business
intelligence describes the process that organizations take to collect and analyze data in the hopes
of obtaining a competitive advantage.
Business intelligence is about getting the right information, to the right decision makers, at the
right time.
Business intelligence is really an organizational process as much as it is a set of technologies.
Besides using their own data, stored in data warehouses (more on this later), firms often purchase
information from data brokers to get a big-picture understanding of their industries and the
economy. The results of analysis can drive organizational strategies and provide competitive
advantage.
BI is used for
Data brokers collect and sell data based on online activity. Digital marketing
company WebFX estimates that there are more than 4,000 data brokers worldwide. Digital
marketers use this information to help direct their advertising and product development efforts.
The data broker market in Canada is flourishing due to less safeguards on personal
information. [1]
All the terminology around business intelligence can be confusing, as often the terms are
used interchangeably. What is the difference between business intelligence, data science and data
analytics? Much like information technology and information systems, the terms are
differentiated on the components involved. In the data world, the components consist of
engineering, analysis, and communication. Combining the terms in different ways will yield
various ‘areas of focus’.
Data Engineering Refers to the design, creation, and structure of data and datasets.
Data
Is defined as the flow of data from a source(s) to the user.
Communication
Venn Diagram of Data Engineering, Analysis and Communication.
Data Science
Combining all three areas (engineering, communication and analysis) results in the field
of data science. Data Science is the analysis of large data sets to find new knowledge. The field
of data science is constantly changing, and data scientists are on the cutting edge of work in areas
such as artificial intelligence and neural networks. While a data scientist does many different
things, their focus is generally on analyzing large data sets using various programming methods
and software tools to create new knowledge for their organization. For example, data scientists
look at building new models and algorithms to extract data in different ways. Data scientists are
skilled in machine learning.
Data Analytics
Combining analysis and communication results in data analytics. Data analytics looks at the
past data to try and understand what happened and can also make predictions for the future. Data
analytics is focused on finding answers to questions by analyzing trends and creating visual
representations, like dashboards whereas data science is a broad concept that focuses on figuring
out the questions that should be asked of the data. Whereas business intelligence focuses on what
happened in the past up until now. It helps to identify trends in the past, and data analysis can
look to the future. To learn more about the history of business intelligence and its evolution see
this see this infographic.
Data Analytics Types
Analytics is a very broad term, but no matter the type of data, or size of data you are
using, generally all analytics can be categorized in four ways: describe, diagnose, predict, and
prescribe. These four categories range from summarizing historical information to providing
insight for the future by investigating the data and attempting to answer the following questions:
Descriptive
What happened?
Analytics
Diagnostic
Why did it happen?
Analytics
Prescriptive
What to do next?
Analytics
Calculating some summary statistics such as mean, median, mode and standard deviation
is an example of descriptive analytics. Similarly matching, or creating a scatter plot diagram are
examples of diagnostic techniques that can help identify outliers. Predictive techniques involve
using regression analysis and attempts to find similarities between historical data sets to be able
to forecast into the future. Lastly, prescriptive analytics involve using decision support systems,
machine learning and artificial intelligence to analyze the information and recommend a course
of action. Multiple predictions are generated and an optimized solution is recommended. For
example, if you use a GPS on a long road trip, you may have experienced ‘dynamic route
updates’, or the suggestion of new alternate routes that may be quicker due to slowing traffic or
an accident.
OLAP – ANALYTIC FUNCTIONS
The idea behind analysis tools that allow queries and reporting is to present users with a
subset of requested data, selected, sorted, ordered, calculated, and compared, as needed.
Managers use these tools to see and explore what’s happening inside their
organizations. Canned reports provide regular summaries of information in a predetermined
format. They’re often developed by information systems staff and formats can be difficult to
alter. By contrast, ad hoc reporting tools allow users to dive in and create their own reports,
selecting fields, ranges, and other parameters to build their own reports on the fly.
A manager using an OLAP tool can quickly explore and compare data across multiple
factors such as time, geography, product lines, and so on. In fact, OLAP users often talk about
how they can “slice and dice” their data, “drilling down” inside the data to uncover new insights.
And while conventional reports are usually presented as a summarized list of information, OLAP
results look more like a spreadsheet, with the various dimensions of analysis in rows and
columns, with summary values at the intersection.
Data Visualization
Data visualization is the graphical representation of information and data. These graphical
representations (such as charts, graphs, and maps) can quickly summarize data in a way that is
more intuitive and can lead to new insights and understandings. Just as a picture of a landscape
can convey much more than a paragraph of text attempting to describe it, graphical
representation of data can quickly make meaning of large amounts of data. Many times,
visualizing data is the first step towards a deeper analysis and understanding of the data collected
by an organization. Examples of data visualization software include Tableau and Google Data
Studio.
Dashboards are a type of data visualization that provide a heads-up display of critical indicators,
letting managers get a graphical glance at key performance metrics. Some tools may allow data
to be exported into spreadsheets. Even the lowly spreadsheet can be a powerful tool for
modelling “what if” scenarios and creating additional reports. However, if data can be easily
exported, then it can potentially leave the firm dangerously exposed, raising privacy, security,
legal, and competitive concerns.
Data Mining
Data mining is the process of analyzing data to find previously unknown and interesting trends,
patterns, and associations in order to make decisions. Click to enlarge. Data Mining
Process adapted from Matthew Pauley CC-BY-NC-SA 4.0
While reporting tools can help users explore data, modern data sets can be so large that it might
be impossible for humans to spot underlying trends. That’s where data mining can help. Data
mining is the process of analyzing data to find previously unknown and interesting trends,
patterns, and associations in order to make decisions. Generally, data mining is accomplished
through automated means against extremely large data sets, such as a data warehouse. Some of
the key areas where businesses are leveraging data mining include the following:
Marketing and promotion Identifying which customers will respond to which offers at
targeting which price at what time.
Model Considerations
1. Clean and consistent data. Deceive your systems with bad data and your models are worthless.
It is important that the data that is input into the system is accurate and consistent
2. Data should reflect current and future trends. As well historical consistency should be
considered. Computer-driven investment models can be very effective when the market behaves
as it has in the past. But models are blind when faced with extreme and unusual events that never
occurred in the data used to build the model.
Data mining presents a host of other perils, as well. It’s possible to over-engineer a model,
building it with so many variables that the solution arrived at might only work on the subset of
data you’ve used to create it. You might also be looking at a random but meaningless statistical
fluke. One way to test to see if you’re looking at a random occurrence in the numbers is to divide
your data, building your model with one portion of the data, and using another portion to verify
your results. This is the approach Netflix has used to test results achieved by teams in the Netflix
Prize, the firm’s million-dollar contest for improving the predictive accuracy of its movie
recommendation engine. Finally, sometimes a pattern is uncovered but determining the best
choice for a response is less clear. First findings don’t always reveal an optimal course of action.
Skills Required
These modelling considerations highlight the importance of recruiting a data mining and
business analytics team that possesses three critical skills:
Information For understanding how to pull together data, and for selecting
technology analysis tools.