Business Data Analytics
(CMA, B.Com)

Index

Module 8: Introduction to Data Science for Business Decision-Making
8.1 Meaning, Nature, Properties, Scope of Data
8.2 Types of Data in Finance and Costing
8.3 Digitization of Data and Information
8.4 Transformation of Data to Decision Relevant Information
8.5 Communication of Information for Quality Decision-making
8.6 Professional Scepticism Regarding Data
8.7 Ethical Use of Data and Information
Module 9: Data Processing, Organisation, Cleaning and Validation
9.1 Development of Data Processing
9.2 Functions of Data Processing
9.3 Data Organisation and Distribution
9.4 Data Cleaning and Validation
Module 10: Data Presentation: Visualisation and Graphical Presentation
10.1 Data Visualisation of Financial and Non-Financial Data
10.2 Objective and Function of Data Presentation
10.3 Data Presentation Architecture
10.4 Dashboard, Graphs, Diagrams, Tables, Report Design
10.5 Tools and Techniques of Visualisation and Graphical Presentation
Module 11: Data Analysis and Modelling
11.1 Process, Benefits and Types of Data Analysis
11.2 Data Mining and Implementation of Data Mining
11.3 Analytics and Model Building
11.4 Standards for Data Tagging and Reporting (XML, XBRL)
11.5 Cloud Computing, BI, AI, RPA and Machine Learning
11.6 Model vs. Data-driven Decision-making
Weightage: Modules 8 to 11 carry a weightage of 5% each.
Introduction to Data Science for Business Decision-Making
8.1 Meaning, Nature, Properties, Scope of Data
Data is a source of information, and information must be processed to generate knowledge. Any 'data' on its own does not convey any meaning. When that information is used to solve a problem, we speak of the use of knowledge.
Nature of Data
• Quantitative financial data: By the term 'quantitative data', we mean data expressed in numbers. A large share of the data available in finance is quantitative, e.g. stock price data and financial statements.
• Qualitative financial data: Some data in financial studies may, however, appear in a qualitative format, e.g. text, videos, audio etc. These types of data can be very useful for financial analysis, e.g. the 'management discussion and analysis' section presented as part of a company's annual report.
Types of data
(i) Nominal scale: The nominal scale is used for categorising data. Under this scale, observations are classified based on certain characteristics. The category labels may contain numbers, but these have no numerical value.
(ii) Ordinal scale: The ordinal scale is used to classify observations and put them in order. The numbers only indicate an order; they do not specify how much better or worse a stock at a given price is compared with one at a lower price.
(iii) Interval scale: The interval scale is used for categorising and ranking using equal intervals. Equal intervals separate neighbouring scale values, but because the scale's zero point is arbitrary, ratios cannot be calculated.
(iv) Ratio scale: The ratio scale possesses all the characteristics of the nominal, ordinal and interval scales. Data on a ratio scale can not only be classified and ranked but also have equal intervals. A ratio scale has a true zero, meaning that zero has a significant value.
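To make the four measurement scales concrete, here is a minimal Python sketch. The use of pandas and the sample values (sectors, credit ratings, temperatures, revenue figures) are illustrative assumptions, not part of the syllabus text.

```python
import pandas as pd

# Nominal: category labels with no order (e.g. industry sector)
sector = pd.Categorical(["Banking", "IT", "Pharma", "IT"], ordered=False)

# Ordinal: ordered categories with no fixed spacing (e.g. credit ratings)
rating = pd.Categorical(["AA", "A", "AAA", "A"],
                        categories=["A", "AA", "AAA"], ordered=True)

# Interval: equal intervals but an arbitrary zero (e.g. temperature in Celsius);
# differences are meaningful, ratios are not
temperature_c = pd.Series([10.0, 20.0, 30.0])

# Ratio: a true zero, so ratios are meaningful (e.g. revenue figures)
revenue = pd.Series([0.0, 50.0, 100.0])
print(revenue[2] / revenue[1])   # 2.0 -- a valid "twice as much" statement
```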
8.3 Digitization of Data and Information
Digitization of data and information serves two broad purposes:
i. To provide widespread access to data and information for a very large group of users simultaneously.
ii. To preserve data for a longer period.
Why do we digitize?
• Improves classification and indexing of documents, which helps in the retrieval of records.
• Digitized records may be accessed by more than one person simultaneously.
• It becomes easier to reuse data that are difficult to reuse in their present format, e.g. very large maps or data recorded on microfilm.
• Helps in work processing.
• Higher integration with business information systems.
• Easier to keep back-up files and retrieve them during any unexpected disaster.
• Can be accessed from multiple locations through networked systems.
• Increased scope for improving organisational productivity.
• Requires less physical storage space.
How do we digitize?
Digitization is typically carried out in six phases, beginning with the identification of the factors relevant to the project.
Phase 2: Assessment
In any institution, not all records are digitized. The data that requires digitization is decided on the basis of content and context. Some data may be digitized in a consolidated format and some in a detailed format. The files, tables, documents, expected future use, etc. are assessed and evaluated at this stage. The hardware and software requirements for digitization are also assessed, and the human resources required for executing the digitization project are planned. A risk assessment, e.g. of the possibility of natural disasters and/or cyber attacks, also needs to be completed at this level.
Phase 3: Planning
Successful execution of a digitization project needs meticulous planning. The institution may decide to complete the digitization in-house or through an outsourced agency. It may also be done on demand or in batches.
Upon completion of the assessment and planning phases, the digitization activities start.
Once the digitization of records is complete, a few additional requirements arise which relate to the administration of records. Permission for accessing the data, intellectual control (over the data), classification (if necessary), and the upkeep and maintenance of the data are further requirements of data management.
Phase 6: Evaluation
Once the digitization project is implemented, the final phase should be a systematic determination of the project's merit, worth and significance using objective criteria. The primary purpose is to enable reflection and help identify changes that would improve future digitization processes.
8.4 Transformation of Data to Decision Relevant Information
The transformation of raw data into decision-relevant information typically involves the following steps:
1. Collection of data: The collection of data may be done with standardized systems in place. Appropriate software and hardware may be used for this purpose. Appointing trained staff also plays an important role in collecting accurate and relevant data.
2. Organising the data: The raw data needs to be organised in an appropriate manner to generate relevant information. The data may be grouped and arranged in a manner that creates useful information for the target user groups.
3. Data processing: At this step, data needs to be cleaned to remove unnecessary elements. If any data point is missing or not available, that also needs to be addressed, and the presentation format for the data needs to be decided.
4. Integration of data: Data integration is the process of combining data from various sources into a single, unified form. This step includes the creation of data network sources, a master server, and user access to the data from the master server. Data integration eventually enables analytics tools to produce effective, actionable business intelligence (a brief illustration follows this list).
5. Data reporting: The data reporting stage involves translating the data into a consumable format to make it accessible to users.
6. Data utilization: At this final step, data is utilised to support corporate activities and enhance operational efficiency and productivity for the growth of the business. This makes corporate decision-making truly 'data driven'.
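The integration and reporting steps above can be illustrated with a small, hypothetical pandas sketch; the table names and columns are invented for the example.

```python
import pandas as pd

# Two hypothetical sources: sales records and a customer master
sales = pd.DataFrame({"cust_id": [1, 2, 1], "amount": [1200, 800, 400]})
customers = pd.DataFrame({"cust_id": [1, 2], "region": ["East", "West"]})

# Integration: combine the sources into a single, unified view
unified = sales.merge(customers, on="cust_id", how="left")

# Reporting: translate the unified data into a consumable summary for users
report = unified.groupby("region")["amount"].sum().reset_index()
print(report)
```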
8.5 Communication of Information for Quality Decision-making
By transforming information into a process for quality decision-making, the firm should acquire the following abilities:
1. Logically understand wide-ranging structured and unstructured data and apply that information to corporate planning, budgeting, forecasting and decision support.
2. Predict outcomes more effectively than conventional forecasting techniques based on historical financial reports.
3. Spot emerging opportunities, and also capability gaps, in real time.
4. Make strategies for responding to uncertain events, such as market volatility and 'black swan' events, through simulation.
5. Diagnose, filter and extract value from financial and operational information for making better business decisions.
6. Recognise viable advantages to serve customers in a better manner.
7. Identify possible frauds on the basis of data analytics.
8. Build impressive and useful dashboards to measure and demonstrate success, leading to effective strategies.
8.6 Professional Scepticism Regarding Data
Professional scepticism is an important focus area for practitioners, researchers, regulators and standard setters. At the same time, professional scepticism may result in additional costs, e.g. strained client relationships and budget overruns. Under such circumstances, it is important to identify and understand the conditions in which finance and audit professionals should apply professional scepticism.
8.7 Ethical Use of Data and Information
1. Regarding ownership: The first principle is that ownership of any personal information belongs to the person. It is unlawful and unethical to collect someone's personal data without their consent. It is always advisable to ask for permission beforehand to avoid future legal and ethical complications.
2. Regarding transparency: The objective with which the company is collecting a user's data should be known to the user. While collecting financial data from clients, the purpose for which the data will be used should be clearly stated.
3. Regarding privacy: Even if the user allows the company to collect, store and analyse personally identifiable information (PII), that does not imply it should be made publicly available. For companies, it is mandatory to publish some financial information to the public, e.g. through annual reports. However, there may be much confidential information which, if it falls into the wrong hands, may create problems and financial loss. To protect the privacy of data, a data security process should be in place. This may include file encryption, dual-authentication passwords, etc.
4. Regarding intention: The intention of data analysis should never be to make profits out of others' weaknesses or to hurt others. Collecting data that is unnecessary for the analysis is unethical and should be avoided.
5. Regarding outcomes: In some cases, even if the intentions are good, the result of data analysis may inadvertently hurt the clients and data providers. This is called disparate impact, and it is unethical.
Data Processing, Organisation, Cleaning and Validation
9.1 Development of Data Processing
Data processing (DP) has developed through three broad stages:
1. Manual DP: Manual DP involves processing data without much assistance from machines. Prior to the phase of mechanical DP, only small-scale data processing was possible using manual effort.
2. Mechanical DP: Mechanical DP processes data using mechanical tools and technologies rather than modern computers.
3. Electronic DP: Data is processed electronically using computers and other cutting-edge electronic devices.
1. Risk analytics: Business inevitably involves risk, particularly in the financial industry. It is crucial to determine the risk factor before making any decisions. Once a risk has been recognised, it may be prioritised and its recurrence closely watched.
2. Real-time analytics: With modern advancements, businesses can provide the optimal user experience and respond quickly to consumer interactions. With real-time analysis, there are no delays in establishing a customer's worth to an organisation, and credit ratings and transactions are far more precise.
3. Customer data management: Data science enables effective management of client data.
Using methods such as text analytics, data mining, and natural language processing, data
science is well equipped to deal with massive volumes of unstructured new data.
Consequently, despite the fact that data availability has been enhanced, data science implies
that a company’s analytical capabilities may also be upgraded, leading to a greater
understanding of market patterns and client behaviour.
4. Consumer Analytics: It is as important to ensure that each client receives a customised service
as it is to process their data swiftly and efficiently, without time-intensive individualised
analysis.
5. Customer segmentation: Customers are frequently segmented based on socioeconomic factors, such as geography, age, and buying patterns.
6. Personalized services: Major organisations strive to provide customised service to their
consumers as a method of enhancing their reputation and increasing customer lifetime value.
This is also true for businesses in the finance sector.
7. Advanced customer service: Data science’s capacity to give superior customer service goes
hand in hand with its ability to provide customised services. As client interactions may be
evaluated in real-time, more effective recommendations can be offered to the customer care
agent managing the customer’s case throughout the conversation.
8. Predictive Analytics: Predictive analytics enables organisations in the financial sector to
extrapolate from existing data and anticipate what may occur in the future, including how
patterns may evolve. When prediction is necessary, machine learning is utilised. Using
machine learning techniques, pre-processed data may be input into the system in order for it
to learn how to anticipate future occurrences accurately.
9. Fraud detection: With a rise in financial transactions, the risk of fraud also increases. Tracking incidents of fraud, such as identity theft and credit card scams, and limiting the resulting harm is a primary responsibility of financial institutions. As the technologies used to analyse big data become more sophisticated, so does their capacity to detect fraud early on.
10. Anomaly detection: Financial services have long placed a premium on detecting abnormalities
in a customer’s bank account activities, partly because anomalies are only proved to be
anomalous after the event happens. Although data science can provide real-time insights, it
cannot anticipate singular incidents of credit card fraud or identity theft.
11. Algorithmic trading: Algorithmic trading is one of the key uses of data science in finance. It happens when an unsupervised computer, using the intelligence supplied by an algorithm, trades on the stock market based on the algorithm's suggestions. As a consequence, it eliminates the risk of loss caused by indecision and human error. (A brief anomaly-detection sketch relevant to items 9 and 10 follows this list.)
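As a rough illustration of the fraud and anomaly detection ideas in items 9 and 10, the following sketch flags an unusually large transaction using scikit-learn's IsolationForest. The transaction amounts and the contamination setting are assumptions chosen for demonstration only.

```python
import numpy as np
from sklearn.ensemble import IsolationForest

# Hypothetical transaction amounts; the last one is an unusually large payment
amounts = np.array([[120.0], [95.0], [130.0], [110.0], [105.0], [9800.0]])

# IsolationForest labels observations that are easy to isolate as anomalies (-1)
model = IsolationForest(contamination=0.2, random_state=0).fit(amounts)
print(model.predict(amounts))   # 1 = normal, -1 = potential anomaly to investigate
```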
The principal functions of data processing include the following:
1. Validation: Data validation may be defined as 'an activity aimed at verifying whether the value of a data item comes from the given (finite or infinite) set of acceptable values'. Operationally, it is a process which ensures the correspondence of the final (published) data with a number of quality characteristics. Data validation is thus a decision-making process that leads to the acceptance or rejection of data.
2. Sorting: Data sorting is any procedure that organises data into a meaningful order to make it simpler to comprehend, analyse and visualise. Sorting is a typical strategy for presenting research data in a manner that facilitates comprehension of the story being told by the data. Sorting is also frequently used to rank or prioritise records, and sorting functions are simple to understand and apply.
3. Aggregation: Data aggregation refers to any process in which data is collected and summarised. When data is aggregated, individual data rows, which are often compiled from several sources, are replaced with summaries or totals; groups of observed values are replaced with statistical summaries based on those observations. A data warehouse often contains aggregate data, since it can answer analytical inquiries and drastically cut the time required to query massive data sets. (A short pandas sketch after this list illustrates sorting, aggregation and classification.)
4. Analysis: Data analysis is described as the process of cleaning, converting, and modelling data
to obtain actionable business intelligence. The objective of data analysis is to extract relevant
information from data and make decisions based on this knowledge.
5. Reporting: Data reporting is the act of gathering and structuring raw data and turning it into
a consumable format in order to evaluate the organisation’s continuous performance. The
data reports can provide answers to fundamental inquiries regarding the status of the firm.
This gives an up-to-date record of the company’s financial health or a portion of the finances
6. Classification: Data classification is the process of classifying data according to important
categories so that it may be utilised and safeguarded more effectively. The categorization
process makes data easier to identify and access on a fundamental level. Regarding risk management, compliance, and data security, the classification of data is of special relevance. Classifying data entails labelling it to make it searchable and trackable. It also avoids multiple duplications of data, which can minimise storage and backup expenses and accelerate the search procedure. It is standard practice to divide data and systems into three risk categories.
i. Low risk: If data is accessible to the public and recovery is simple, then this data
collection and the mechanisms around it pose a smaller risk than others.
ii. Moderate risk: Essentially, this is non-public or internal (to a business or its partners) data. However, it is unlikely to be mission-critical or sensitive enough to be considered "high risk". The intermediate category may include proprietary operating processes, cost of products, and certain corporate paperwork.
iii. High risk: Anything even vaguely sensitive or critical to operational security falls under
the category of high risk. Additionally, data that is incredibly difficult to retrieve (if lost).
All secret, sensitive, and essential data falls under the category of high risk.
The following steps help in putting data classification into practice:
1. Understanding the current setup: Taking a comprehensive look at where the organisation's data currently resides and at any applicable legislation is likely the best starting point for successfully classifying data. Before one classifies data, one must know what data one has.
2. Creation of a data classification policy: Without an adequate policy, maintaining compliance with data protection standards in an organisation is practically difficult. Creating a policy should therefore be the first priority.
3. Prioritise and organise data: Now that a data classification policy is in place, it is time to categorise the data. The optimal method for tagging the data should be chosen based on its sensitivity and privacy.
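The following pandas sketch, built on invented cost records, illustrates the sorting, aggregation and classification functions described above; it is an illustration under assumed data, not a prescribed procedure.

```python
import pandas as pd

# Hypothetical cost records from two departments
costs = pd.DataFrame({"department": ["Prod", "Sales", "Prod", "Sales"],
                      "amount": [500, 300, 700, 200]})

# Sorting: arrange the records in a meaningful order
ranked = costs.sort_values("amount", ascending=False)

# Aggregation: replace individual rows with group summaries
totals = costs.groupby("department")["amount"].agg(["sum", "mean"])

# Classification: label each record with a simple category
costs["category"] = pd.cut(costs["amount"],
                           bins=[0, 300, 600, float("inf")],
                           labels=["low", "moderate", "high"])
print(ranked, totals, costs, sep="\n\n")
```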
9.3 Data Organisation and Distribution
Data organisation is the classification of unstructured data into distinct groups. This raw data comprises observations of variables. Data organisation allows us to arrange data in a manner that is easy to understand and manipulate; raw data is otherwise challenging to deal with or analyse.
Data distribution
Data distribution is a function that identifies and quantifies all potential values for a variable, as well
as their relative frequency (probability of how often they occur). Any population with dispersed data
is categorised as a distribution. It is necessary to establish the population’s distribution type in order
to analyse it using the appropriate statistical procedures.
Types of distribution
Distributions are basically classified based on the type of data:
1. Discrete distributions: A discrete distribution results from countable data and has a finite number of potential values. Discrete distributions may be displayed in tables, and the values of the random variable can be counted. Example: rolling dice, counting a specific number of heads, etc.
i. Binomial distribution: The binomial distribution quantifies the chance of obtaining a specific number of successes or failures in a fixed number of trials. It applies to attributes that are categorised into two mutually exclusive and exhaustive classes, such as the number of successes/failures or the number of acceptances/rejections.
Example: When a fair coin is tossed ten times, the number of heads follows a binomial distribution; on each toss the likelihood of a head is one-half and the likelihood of a tail is one-half.
ii. Poisson distribution: The Poisson distribution is the discrete probability distribution that quantifies the chance of a certain number of events occurring in a given time period, where the events occur at a known average rate and independently of one another. It applies to attributes that can potentially take on huge values but in practice take on small ones.
Example: the number of customer complaints received by a branch office per day.
2. Continuous distributions: A continuous distribution has an unlimited number of (variable) data points that may be represented on a continuous measurement scale. A continuous random variable is a random variable with an unlimited and uncountable set of potential values. It is more than a simple count and is often described using a probability density function (pdf). The probability density function describes the characteristics of a random variable; the frequencies are typically clustered around a central value, and the pdf can be viewed as the distribution's "shape".
i. Normal distribution: The Gaussian distribution is another name for the normal distribution. It is a bell-shaped curve with a greater frequency (probability density) around the central point. As values move away from the central value on each side, the frequency drops sharply.
ii. Lognormal distribution: A continuous random variable x follows a lognormal distribution if the distribution of its natural logarithm, ln(x), is normal. (By the central limit theorem, as the sample size rises the distribution of the sum of random variables approaches a normal distribution, independent of the distribution of the individual variables.)
iii. F distribution: The F distribution is often employed to examine the equality of variances between two normal populations. It is an asymmetric distribution with no maximum value and a minimum value of 0.
iv. Chi square distributions: When independent variables with standard normal distribution are
squared and added, the chi square distribution occurs.
v. Exponential distribution: The exponential distribution is a probability distribution and one of
the most often employed continuous distributions. Used frequently to represent products
with a consistent failure rate.
vi. Student's t distribution: The t distribution, or Student's t distribution, is a probability distribution with a bell shape that is symmetrical about its mean. It is used frequently for testing hypotheses and building confidence intervals for means, and is substituted for the normal distribution when the population standard deviation is unknown.
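The distributions listed above can be explored numerically with scipy.stats; the parameter values below are arbitrary examples chosen for illustration, not values taken from the text.

```python
from scipy import stats

# Discrete distributions
print(stats.binom.pmf(k=3, n=10, p=0.5))   # P(exactly 3 heads in 10 fair coin tosses)
print(stats.poisson.pmf(k=2, mu=4))        # P(2 events when 4 are expected per period)

# Continuous distributions
print(stats.norm.cdf(1.96))                # ~0.975 for the standard normal
print(stats.lognorm.mean(s=0.5))           # mean of a lognormal with shape parameter 0.5
print(stats.expon.sf(2, scale=1))          # P(X > 2) for an exponential with mean 1
print(stats.t.ppf(0.975, df=20))           # t critical value for a 95% confidence interval
```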
9.4 Data Cleaning and Validation
Data cleaning is the process of correcting or deleting inaccurate, corrupted, improperly formatted, duplicate, or incomplete data from a dataset. When several data sources are combined, there are numerous chances for data duplication and mislabelling. It is essential to build a template for the data cleaning process so that you can be certain you are always performing the steps correctly.
Eliminate unnecessary observations from the dataset, such as duplicate or irrelevant observations. Most duplicate observations arise during data collection, and de-duplication is one of the most important considerations in this procedure. Observations are deemed irrelevant when they do not pertain to the specific topic you are attempting to study. Removing them makes analysis more effective, reduces distraction from the core objective, and produces a more manageable dataset.
When measuring or transferring data, you may detect unusual naming conventions, typos, or incorrect capitalisation. These inconsistencies may lead to mislabelled classes or groups.
Occasionally, you will encounter observations that, at first look, do not appear to fit within the data you are evaluating. If you have a valid reason to eliminate an outlier, such as erroneous data entry, doing so will improve the quality of the data you are analysing. Occasionally, though, the presence of an outlier will support a hypothesis you are working on. Remember that the existence of an outlier does not imply that it is erroneous; this step is required to validate whether the value is genuine. Consider deleting an outlier only if it appears to be unrelated to the analysis or to be an error.
Many algorithms do not accept missing values, hence missing data cannot be ignored. There are two main approaches to handling missing data; neither is ideal, but both should be considered.
As the first alternative, the observations with missing values may be dropped, but doing so may result in the loss of information. This should be kept in mind before doing so.
As a second alternative, the missing numbers may be entered based on other observations. Again,
there is a chance that the data’s integrity may be compromised, as action may be based on
assumptions rather than real observations.
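A minimal pandas sketch of the cleaning steps discussed above is given below; the invoice data, the median imputation and the IQR outlier rule are assumptions chosen for illustration.

```python
import numpy as np
import pandas as pd

raw = pd.DataFrame({"invoice": [101, 101, 102, 103, 104],
                    "amount": [250.0, 250.0, np.nan, 300.0, 90000.0]})

clean = raw.drop_duplicates()                        # remove duplicate observations

dropped = clean.dropna(subset=["amount"])            # option 1: drop missing values
imputed = clean.fillna({"amount": clean["amount"].median()})   # option 2: impute them

# Flag potential outliers (IQR rule) for review rather than deleting them blindly
q1, q3 = imputed["amount"].quantile([0.25, 0.75])
imputed["possible_outlier"] = imputed["amount"] > q3 + 1.5 * (q3 - q1)
print(imputed)
```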
As part of basic validation, one should be able to answer the following questions at the conclusion of the data cleaning process:
(a) Does the data make sense?
(b) Does the data adhere to the rules applicable to its field?
(c) Does it verify or contradict your working hypothesis, or does it shed any light on it?
(d) Can data patterns assist you in formulating your next theory?
False assumptions based on inaccurate or "dirty" data can lead to ineffective company strategies and decisions. The quality of cleaned data can be judged against the following characteristics:
(i) Validity
(ii) Accuracy
(iii) Completeness
(iv) Consistency
Data validation
Data validation is a crucial component of any data management process. If the initial data is not valid,
the outcomes will not be accurate either. It is therefore vital to check and validate data before using
it.
1. Data type check: A data type check verifies that the entered data has the appropriate data
type.
2. Code check: A code check verifies that a field’s value is picked from a legitimate set of options
or that it adheres to specific formatting requirements.
3. Range check: A range check determines whether or not input data falls within a specified range. For instance, latitude and longitude values in geographic data must fall within fixed ranges.
4. Format check: Numerous data types adhere to a set format, for example date columns stored in a fixed format. A data validation technique that ensures dates are in the correct format contributes to data and temporal consistency.
5. Consistency check: A consistency check is a form of logical check that verifies that the data has been entered in a consistent manner. Checking whether a package's delivery date is later than its shipment date is one example.
6. Uniqueness check: A uniqueness check guarantees that an item is not entered into a database multiple times.
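The checks listed above can be scripted; the sketch below applies a few of them to an invented orders table, so the column names and valid ranges are assumptions for illustration.

```python
import pandas as pd

orders = pd.DataFrame({
    "order_id": [1, 2, 2],
    "status": ["SHIPPED", "PENDING", "UNKNOWN"],
    "ship_date": ["2023-01-05", "2023-01-07", "2023-01-09"],
    "delivery_date": ["2023-01-08", "2023-01-06", "2023-01-12"],
    "latitude": [22.57, 91.00, 19.07],
})

ship = pd.to_datetime(orders["ship_date"], errors="coerce")          # format check
delivery = pd.to_datetime(orders["delivery_date"], errors="coerce")

checks = pd.DataFrame({
    "code": orders["status"].isin({"SHIPPED", "PENDING", "CANCELLED"}),  # code check
    "range": orders["latitude"].between(-90, 90),                        # range check
    "consistency": delivery >= ship,                                     # consistency check
    "uniqueness": ~orders["order_id"].duplicated(keep=False),            # uniqueness check
})
print(checks)
```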
Data Presentation: Visualisation and Graphical Presentation
Finance professionals who are investigating how data visualisation might help their analytics efforts
and communication should keep the following in mind:
• Know the objective: First establish whether the information is conceptual or data-driven (i.e. whether it relies on qualitative or quantitative data), and specify whether the objective is exploratory or declarative. Answering these questions helps determine the tools and formats required.
• Always keep the audience in mind: Who views the data visualisations will determine the
degree of detail required.
• Invest in the best technology: The firm should first implement an ERP that removes data silos and develops a centralised information repository. Then, look for tools that allow users to instantly display data by dragging and dropping assets, charts, and graphs; that offer search options and guided navigation to assist in answering queries; and that enable any member of the finance team to generate graphics.
• Improve the team's ability to visualise data: Find ways to incorporate user training on data visualisation tools, so that staff are aware of the options the technology affords. Additionally, when recruiting, look for individuals with proficiency in data analytics and extensive data visualisation experience.
According to an article published by the Harvard Business Review (HBR), data visualisations most often fail because the analyst overlooks a few essential points.
Before incorporating the data into a visualisation, the objective should be fixed: to present large volumes of information in a way that decision-makers can readily digest. A great visualisation relies on the designer comprehending the intended audience and executing on three essential points:
1. Understanding the audience
i. Who will read and understand the material, and how will they do so? Can it be presumed that the audience understands the words and ideas employed, or is there a need to provide visual cues (e.g. a green arrow indicating that an increase is good)? A specialist audience will have different expectations than the broader public.
ii. What are the expectations of the audience, and what information is most beneficial to them?
iii. What is the functional role of the visualisation, and how may users take action based on it?
A visualisation that is exploratory should leave viewers with questions to investigate, but
visualisations that are instructional or confirmatory should not.
2. Setting up a clear framework
The designer must guarantee that all viewers have the same understanding of what the visualisation
represents. To do this, the designer must establish a framework consisting of the semantics and syntax
within which the data information is intended to be understood. The semantics pertain to the meaning
of the words and images employed, whereas the syntax is concerned with the form of the
communication. For instance, when utilising an icon, the element should resemble the object it
symbolises, with size, colour, and placement all conveying significance to the viewer.
Ensure that the data is clean and that the analyst understands its peculiarities before doing anything
else.
3. Telling a story
Storytelling assists the audience in gaining understanding from facts. Information visualisation is a
technique that turns data and knowledge into a form that is perceivable by the human visual system.
The objective is to enable the audience to see, comprehend, and interpret the information. Design
strategies that favour specific interpretations in visuals that “tell a narrative” can have a substantial
impact on the interpretation of the end user.
10.3 Data Presentation Architecture
The scope of data presentation architecture (DPA) includes:
(i) Defining significant meaning (relevant information) required by each audience member in
every scenario.
(ii) Obtaining the proper data (focus area, historic reach, extensiveness, level of detail, etc.)
(iii) Determining the needed frequency of data refreshes (the currency of the data)
(iv) Determining the optimal presentation moment (how frequently the user needs to view the data)
(v) Using suitable analysis, categorization, visualisation, and other display styles
(vi) Developing appropriate delivery techniques for each audience member based on their job,
duties, locations, and technological accesses
10.4 Dashboard, Graphs, Diagrams, Tables, Report Design
A data visualisation dashboard is an interactive dashboard that enables users to manage important metrics across numerous financial channels, visualise the data points, and generate reports for customers that summarise the results.
i. Bar Chart: It may be used to easily compare data across categories, highlight discrepancies,
demonstrate trends and outliers, and illustrate historical highs and lows. Bar graphs are very useful
when the data can be divided into distinct categories
ii. Line chart: The line chart or line graph joins various data points, displaying them as a continuous
progression. Utilize line charts to observe trends in data, often over time.
iii. Pie Chart: A pie chart (or circle chart) is a circular graphical representation of statistical data that is
segmented to demonstrate numerical proportion. In a pie chart, the arc length of each slice (and, by
extension, its centre angle and area) is proportionate to the value it depicts
iv. Scatter plots: Scatter plots are a useful tool for examining the connection between many variables,
revealing whether one variable is a good predictor of another or whether they tend to vary
independently. A scatter plot displays several unique data points on a single graph.
v. Bubble chart: Although bubbles are not exactly their own sort of visualisation, utilising them as a
method enhances scatter plots and maps that illustrate the link between three or more variables. By
varying the size and colour of circles, charts display enormous amounts of data in an aesthetically
engaging manner
vi. Histogram: Histograms illustrate the distribution of the data among various groups. Histograms divide
data into discrete categories (sometimes known as “bins”) and provide a bar proportionate to the
number of entries inside each category
vii. Map: For displaying any type of location data, including postal codes, state abbreviations, country
names, and custom geocoding, maps are a no-brainer. If the data is related with geographic
information, maps are a simple and effective approach to illustrate the relationship.
viii. Density map: Density maps indicate patterns or relative concentrations that might otherwise be
obscured by overlapping marks on a map, allowing to identify areas with a larger or lesser number of
data points
ix. Gantt Chart: Gantt charts represent a project’s timeline or activity changes across time. A Gantt chart
depicts tasks that must be accomplished before others may begin, as well as the allocation of
resources.
Tables
Tables, often known as “crosstabs” or “matrices,” emphasise individual values above aesthetic
formatting. They are one of the most prevalent methods for showing data and, thus, one of the most
essential methods for analysing data
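The chart types described above can be produced with any charting tool; the following matplotlib sketch, using invented quarterly figures, shows a bar, line, pie and scatter plot side by side.

```python
import matplotlib.pyplot as plt

quarters = ["Q1", "Q2", "Q3", "Q4"]
revenue = [120, 150, 140, 180]        # hypothetical figures
costs = [90, 100, 95, 110]

fig, axes = plt.subplots(2, 2, figsize=(10, 8))

axes[0, 0].bar(quarters, revenue)                 # bar chart: compare categories
axes[0, 0].set_title("Revenue by quarter")

axes[0, 1].plot(quarters, revenue, marker="o")    # line chart: trend over time
axes[0, 1].set_title("Revenue trend")

axes[1, 0].pie(revenue, labels=quarters, autopct="%1.0f%%")   # pie chart: proportions
axes[1, 0].set_title("Revenue share")

axes[1, 1].scatter(costs, revenue)                # scatter plot: relationship of variables
axes[1, 1].set_title("Costs vs revenue")

plt.tight_layout()
plt.show()
```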
How to use data Visualisation in report design?
➢ Find a story in the data: Data-driven storytelling is a powerful tool. Finding a story that
connects with the reader can help to create an effective report.
➢ Create a narrative: When some individuals hear the term "data storytelling", they believe it consists of presenting a few statistics and the task is complete. This is a frequent misconception. Strong data storytelling comprises an engaging narrative that takes the audience through the facts and aids their comprehension, together with an explanation of why the insights are significant. To compose an excellent story, one must:
(i) Engage the viewer with a catchy title and subheadings.
(ii) Incorporate context into the data.
(iii) Create a consistent and logical flow.
(iv) Highlight significant discoveries and insights from the data.
➢ Choose the most suitable data visualisation: Data visualisation is not limited to the creation of charts and graphs. It involves presenting the facts in the most comprehensible chart possible. Applying basic design principles and utilising features such as form, size, colour, and labelling may have a significant impact on how people comprehend the data.
➢ Follow the visual language: It is essential to adhere to data visualisation principles in order to
achieve both uniformity and comprehension. A strategic methodology assists in
implementation.
➢ Publicize the report: Some reports are not intended for public consumption. However, since
they include so much essential information, they may contain knowledge that is of interest to
individuals or media outside of the business.
Data Analysis and Modelling
11.1 Process, Benefits and Types of Data Analysis
Data may be segmented by a variety of parameters, including age, population, income, and sex. The data values might be either numeric or categorical.
Data may be gathered from several sources, including internet sources, computers, personnel, and
community sources.
After collecting the data, it must be arranged so that it can be analysed. Statistical data can be
organised on a spreadsheet or other programme capable of handling statistical data.
The data is initially cleansed to verify that there are no duplicates or errors. The document is then
examined to ensure that it is comprehensive. Before data is sent to a data analyst for analysis, it is
beneficial to rectify or eliminate any errors by cleaning the data.
There are four types of data analytics: (i) descriptive analytics, (ii) diagnostic analytics, (iii) predictive analytics and (iv) prescriptive analytics.
Companies can use the information gained from data analytics to base their decisions, resulting in
enhanced outcomes. Using data analytics significantly reduces the amount of guesswork involved in
preparing marketing plans, deciding what materials to produce, and more.
Data analytics assists firms in streamlining their processes, conserving resources, and increasing their
profitability. When firms have a better understanding of their audience’s demands, they spend less
time creating advertising that do not fulfil those needs.
Data analytics gives organisations a more in-depth understanding of their customers, employees and other stakeholders.
11.2 Data Mining and Implementation of Data Mining
Data mining, also known as knowledge discovery in data (KDD), is the extraction of patterns and other
useful information from massive data sets.
(i) Setting the business objectives: Data scientists and business stakeholders must identify the business challenge, which informs the data queries and parameters for a specific project. Analysts may also need to conduct further study to adequately comprehend the business environment.
(ii) Preparation of data: Once the scope of the problem has been established, it is simpler for data scientists to determine which collection of data will assist the company in answering crucial questions. Once the pertinent data has been collected, it is cleansed by eliminating any noise, such as repetitions, missing values and outliers.
(iii) Model building and pattern mining: Data scientists may study any intriguing relationships in the data, such as frequent patterns, clusters or correlations, depending on the sort of research. While high-frequency patterns have wider applicability, deviations in the data can often be more interesting, exposing possible areas of fraud. Depending on the available data, deep learning algorithms may also be utilised to categorise or cluster a data collection.
(iv) Evaluation of results and implementation of knowledge: After aggregating the data, the findings must be analysed and interpreted. Final results should be valid, original, practical, and comprehensible. When this criterion is satisfied, companies can execute new strategies based on this understanding and thereby attain their intended goals.
Using various methods and approaches, data mining transforms vast quantities of data into valuable
information. Here are a few of the most prevalent:
An association rule is a rule-based technique for discovering associations between variables inside a
given dataset. These methodologies are commonly employed for market basket analysis, enabling
businesses to better comprehend the linkages between various items. Understanding client
consumption patterns helps organisations to create more effective cross-selling tactics and
recommendation engines.
Primarily utilised for deep learning algorithms, neural networks replicate the interconnection of the
human brain through layers of nodes to process training data. Every node has inputs, weights, a bias
(or threshold), as well as an output. If the output value exceeds a predetermined threshold, the node
"fires" and passes data to the subsequent network layer. Neural networks acquire this mapping function through supervised learning and gradient descent, adjusting the weights based on the loss function. When the cost function is zero or close to it, we may have confidence in the model's ability to produce the correct answer.
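To make the description of a single node concrete, here is a small numpy sketch of one node's forward pass; the input values, weights and bias are arbitrary, and a real network learns its weights rather than fixing them by hand.

```python
import numpy as np

def node_fires(inputs, weights, bias, threshold=0.0):
    """One node: weighted sum of inputs plus bias, compared against a threshold."""
    output = np.dot(inputs, weights) + bias
    return output, output > threshold   # the node "fires" if output exceeds the threshold

inputs = np.array([0.5, 0.8])       # hypothetical feature values
weights = np.array([0.4, -0.2])     # weights a trained network would have learned
print(node_fires(inputs, weights, bias=0.1))   # output is roughly 0.14, so the node fires
```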
Using classification or regression algorithms, the decision tree methodology classifies or predicts likely outcomes based on a collection of decisions. As its name implies, it employs a tree-like representation to depict the potential results of these decisions.
K-nearest neighbour, also known as the KNN algorithm, classifies data points based on their proximity to, and correlation with, other available data. The technique assumes that similar data points lie close to one another; it therefore measures the distance between data points, usually by Euclidean distance, and then assigns each point the most common category (or the average value) of its nearest neighbours.
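A brief scikit-learn sketch of KNN classification follows; the applicant features and labels are invented purely to show the mechanics.

```python
from sklearn.neighbors import KNeighborsClassifier

# Hypothetical training data: [income, outstanding debt] -> repayment label
X_train = [[12, 1], [15, 2], [4, 6], [5, 7], [14, 1], [3, 8]]
y_train = ["no_default", "no_default", "default", "default", "no_default", "default"]

knn = KNeighborsClassifier(n_neighbors=3)   # Euclidean distance by default
knn.fit(X_train, y_train)

# New applicants are assigned the most common label among their 3 nearest neighbours
print(knn.predict([[13, 2], [4, 7]]))
```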
Descriptive analytics: Focuses on what has happened. It organises and summarises historical data to provide insights into past performance and to track patterns and trends. The process involves data aggregation, preparation, analysis, and presentation, often through visual tools such as graphs and charts. It is the simplest form of analytics and serves as a foundation for more advanced types of analysis; while it offers insight into what has happened, it does not provide conclusions or predictions.
Diagnostic analytics: Explores why something happened. It goes beyond descriptive analysis and digs deeper into the data to identify the reasons for particular outcomes, using techniques such as data discovery, drill-down, data mining and correlation analysis. It is often used when businesses need to understand the underlying factors driving changes in performance. Diagnostic analytics develops solutions that may be used to discover answers to data-related problems and to communicate insights within the organisation; it enables value to be derived from the data by asking the relevant questions and doing in-depth analyses of the responses.
Predictive analytics: Forecasts what might happen in the future. By examining past trends, predictive analytics applies techniques such as data mining, machine learning and statistical modelling to historical data in order to estimate potential future outcomes, giving businesses the foresight to anticipate trends and customer behaviour. It serves as a crucial tool for forecasting probable future occurrences and informing future corporate strategy. Examples include forecasting inventory needs or predicting customer churn.
Prescriptive analytics: Advises what actions should be taken. This is the most advanced form of analytics, offering recommendations based on predictive models and data analysis. It often uses machine learning and complex algorithms to evaluate various future scenarios and suggest the decisions that optimise business performance. Applications range from route optimisation in GPS systems to strategic decision-making in healthcare, manufacturing, and finance.
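As a minimal illustration of predictive analytics, the sketch below fits a linear regression to invented monthly sales figures and extrapolates two months ahead; real forecasting would require far more data and validation.

```python
import numpy as np
from sklearn.linear_model import LinearRegression

# Hypothetical sales history for months 1..6
months = np.arange(1, 7).reshape(-1, 1)
units = np.array([100, 110, 125, 130, 145, 155])

model = LinearRegression().fit(months, units)

# Extrapolate from the past trend to anticipate months 7 and 8
print(model.predict(np.array([[7], [8]])))
```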
11.4 Standards for Data Tagging and Reporting (XML, XBRL)
XML is a file format and markup language for storing, transferring, and recreating arbitrary data. It specifies a set of standards for encoding documents in a format that is readable by both humans and machines. It is a textual data format with strong support for many human languages via Unicode. Although XML's architecture is centred on documents, the language is commonly used to represent arbitrary data structures, such as those employed by web services. Serialization, i.e. storing, sending, and rebuilding arbitrary data, is the primary function of XML. In order for two dissimilar systems to share data, they must agree on a file format; XML standardises this procedure and is comparable to a universal language for describing information.
As a markup language, XML labels, categorises, and arranges information systematically. The data
structure is represented by XML tags, which also contain information. The information included within
the tags is encoded according to the XML standard
Application of XML
XML is now widely utilised for the exchange of data via the Internet. There have been hundreds of
document formats created using XML syntax, including RSS, Atom, Office Open XML, OpenDocument,
SVG, and XHTML
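A short Python sketch using the standard library's xml.etree.ElementTree shows how XML tags label and arrange data; the invoice structure and values are invented for illustration.

```python
import xml.etree.ElementTree as ET

# Build a small XML document: tags label, categorise and arrange the data
invoice = ET.Element("invoice", attrib={"currency": "INR"})
ET.SubElement(invoice, "customer").text = "ABC Ltd"
ET.SubElement(invoice, "revenue").text = "250000"

xml_text = ET.tostring(invoice, encoding="unicode")
print(xml_text)
# <invoice currency="INR"><customer>ABC Ltd</customer><revenue>250000</revenue></invoice>

# Any system that agrees on this format can parse the same data back
parsed = ET.fromstring(xml_text)
print(parsed.find("revenue").text)   # 250000
```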
XBRL is a data description language that facilitates the interchange of standard, comprehensible corporate data. It is based on XML and enables the automated interchange and reliable extraction of financial data across all types of software and advanced technologies, including the Internet. XBRL allows organisations to arrange data using tags: when a piece of data is labelled as "revenue", for instance, XBRL-enabled applications know that it pertains to revenue.
Benefits of XBRL
1. All reports are automatically created from a single source of information, which reduces the
chance of erroneous data entry and hence increases data reliability.
2. Reduces expenses by simplifying and automating the preparation and production of reports
for various clients.
3. Accelerates the decision-making of financial entities such as banks and rating services.
4. Facilitates the publication of analyst and investor reports
5. Access, comparison, and analytic capabilities for information are unparalleled.
11.5 Cloud Computing, BI, AI, RPA and Machine Learning
Simply described, cloud computing is the delivery of a variety of services through the Internet, or "the cloud". It involves storing and accessing data via remote servers as opposed to local hard drives and private data centres.
1. Private cloud: Private cloud offers a cloud environment that is exclusive to a single corporate
organisation, with physical components housed on-premises or in a vendor’s data center. This
solution gives a high level of control due to the fact that the private cloud is available to just
one enterprise.
2. Public cloud: The public cloud stores and manages access to data and applications through
the internet. It is fully virtualized, enabling an environment in which shared resources may be
utilised as necessary. Because these resources are offered through the web, the public cloud
deployment model enables enterprises to grow with more ease; the option to pay for cloud
services on an as-needed basis is a significant benefit over local servers.
3. Hybrid cloud: The hybrid cloud blends the private and public cloud models. This architecture enables businesses to store sensitive data on-premises and access it through apps hosted in the public cloud. In order to comply with privacy rules, an organisation may, for instance, keep sensitive user data in a private cloud and execute resource-intensive computations in a public cloud.
Business Intelligence:
Business intelligence (BI) includes business analytics, data mining, data visualisation, data tools and infrastructure, and best practices to assist businesses in making choices that are more data-driven. When an organisation has a complete picture of its data and uses it to drive change, remove inefficiencies, and swiftly adjust to market or supply changes, it is practising contemporary business intelligence.
BI Methods:
(i) Data mining: Large datasets may be mined for patterns using databases, analytics, and
machine learning (ML).
(ii) Reporting: The dissemination of data analysis to stakeholders in order for them to form
conclusions and make decisions.
(iii) Performance metrics and benchmarking: Comparing current performance data to
previous performance data in order to measure performance versus objectives, generally
utilising customised dashboards.
(iv) Descriptive analytics: Utilizing basic data analysis to determine what transpired
(v) Querying: BI extracts responses from data sets in response to data-specific queries.
(vi) Statistical analysis: Taking the results of descriptive analytics and using statistics to further explore the data, such as how and why a pattern occurred.
(vii) Data Visualization: Data consumption is facilitated by transforming data analysis into
visual representations such as charts, graphs, and histograms.
(viii) Visual Analysis: Exploring data using visual storytelling to share findings in real-time and
maintain the flow of analysis.
(ix) Data Preparation: Multiple data source compilation, dimension and measurement
identification, and data analysis preparation.
Artificial Intelligence (AI): In the words of John McCarthy, who coined the term, "It is the science and engineering of making intelligent machines, especially intelligent computer programs. It is related to the similar task of using computers to understand human intelligence, but AI does not have to confine itself to methods that are biologically observable."
AI definitions are often framed around two approaches:
Human approach: systems that think like humans; systems that act like humans.
Ideal approach: systems that think rationally; systems that act rationally.
Weak AI, also known as Narrow AI or Artificial Narrow Intelligence (ANI), is AI that has been trained and honed to perform particular tasks. This form of artificial intelligence is anything but feeble; it powers sophisticated applications such as Apple's Siri and Amazon's Alexa.
Strong AI comprises Artificial General Intelligence (AGI) and Artificial Super Intelligence (ASI). Artificial General Intelligence (AGI), sometimes known as general AI, is a hypothetical form of artificial intelligence in which a machine possesses human-level intellect, a self-aware consciousness, and the ability to solve problems, learn, and plan for the future. Superintelligence, or Artificial Super Intelligence (ASI), would transcend the intelligence and capabilities of the human brain.
Deep learning and machine learning differ in how their respective algorithms learn. Deep learning
automates a significant portion of the feature extraction step, reducing the need for manual human
involvement and enabling the usage of bigger data sets
Robotic Process Automation (RPA): With RPA, software users develop software robots, or "bots", that are capable of learning, simulating, and executing rules-based business processes. RPA automation enables users to construct bots by studying human digital behaviour: give the bots instructions, then let them complete the task. RPA bots can communicate with any application or system in the same manner that humans can, except that they can function continuously, around the clock, and with 100 per cent accuracy and dependability.
Benefits of RPA
RPA reduces manual errors, speeds up processing, lowers operating costs, operates around the clock, and frees employees for higher-value work.
Machine learning
Machine learning (ML) is a branch of study devoted to understanding and developing systems that "learn", i.e. methods that use data to improve performance on a set of tasks. It is considered a component of artificial intelligence. In order to generate predictions or conclusions without being explicitly programmed to do so, machine learning algorithms construct a model based on training data. Machine learning techniques are used in applications such as medicine, email filtering, speech recognition, and computer vision, where it is difficult or impractical to create traditional algorithms to perform the required tasks.
1. Supervised Learning:
• In supervised learning, a model is trained on labeled data, meaning that the training
examples include both inputs and their corresponding outputs. The goal is to learn a
function that can make accurate predictions when given new, unseen inputs.
• Applications: Classification (e.g., email spam detection) and regression (e.g., predicting
house prices).
• Key concepts: Feature vectors, optimization of an objective function, and learning from
labeled data.
2. Unsupervised Learning:
• Unsupervised learning deals with data that does not have labeled outputs. The goal is to find
hidden structures, such as groups or patterns, in the data. Clustering and density estimation
are key techniques in this approach.
• Applications: Market segmentation, anomaly detection, and clustering in customer analysis.
• Key techniques: Cluster analysis, dimensionality reduction, and similarity detection.
3. Semi-Supervised Learning:
• Semi-supervised learning uses a small amount of labeled data combined with a large amount
of unlabeled data. This approach is useful when acquiring labeled data is expensive or time-
consuming, but large amounts of unlabeled data are available.
• Applications: Natural language processing, image recognition, and medical diagnoses.
• Key idea: The combination of labeled and unlabeled data improves the model's
performance.
4. Reinforcement Learning:
• In reinforcement learning, an agent learns by interacting with an environment: it receives rewards or penalties for its actions and gradually improves its decision-making policy.
• Applications: game playing, robotics, recommendation systems, and dynamic pricing.
5. Dimensionality Reduction:
• Dimensionality reduction techniques, such as principal component analysis (PCA), reduce the number of input variables while retaining most of the information in the data, simplifying models and speeding up learning.
• Applications: data compression, visualisation of high-dimensional data, and noise reduction.
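The contrast between supervised and unsupervised learning can be shown in a few lines of scikit-learn; the toy feature matrix and labels below are assumptions for demonstration only.

```python
from sklearn.cluster import KMeans
from sklearn.linear_model import LogisticRegression

# Supervised learning: inputs paired with labelled outputs (hypothetical spam features)
X = [[0, 1], [1, 1], [0, 0], [1, 0], [0, 1], [1, 1]]
y = [1, 1, 0, 0, 1, 1]                       # 1 = spam, 0 = not spam
clf = LogisticRegression().fit(X, y)
print(clf.predict([[1, 1]]))                 # predict the label of a new input

# Unsupervised learning: no labels; find hidden groups in the same data
segments = KMeans(n_clusters=2, n_init=10, random_state=0).fit_predict(X)
print(segments)                              # cluster assignment for each observation
```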
11.6 Model vs. Data-driven Decision-making
Data-driven strategies focus on improving data quality, governance, and management to enhance AI performance. This involves ensuring that the data used is reliable, well organised, and clean, which is crucial for accurate and meaningful outputs.
Model-driven strategies focus on developing better algorithms and models to boost performance, regardless of the quality of the data. Historically, these methods have advanced significantly more than data-driven approaches.
ALL THE VERY BEST, FUTURE CMAs!