Ba Unit 1
Ba Unit 1
Course Owner :
Prof. Asif Naikawadi
Analytics:
Analytics is the process of discovering, interpreting, and
communicating significant patterns in data.
Business Analytics is the process of transforming data into
insights to improve business decisions.
• Analytics 1.0
• The practice as we recognize it started in the 1950s, with
individual companies collecting their own offline data and
manually analyzing it to look for ways to improve their
operations
• Data analyst’s role was mostly as a technician
• The work was slow, heavily manual, and only informed by
limited data sources.
• Analytics 2.0
• The start of the 2000s saw the era of early big data initiatives,
as computers and connectivity started to become pervasive
among businesses.
• Any business that was working with a digital infrastructure
could start digging into large data sets to find meaningful,
more timely insights.
• analysts became key personnel who could report on progress
and trends, and make recommendations based on the data
they were processing
• Analytics 3.0
• The range of possible data sources grew hugely in the late
2000s and early 2010s, driven by the explosion of connected
devices, widespread data collection initiatives, and
increasingly intelligent analytics technologies
• In this era, organizations started to think critically about how
their data and analytics practice could translate into
meaningful change for employees, customers and business
processes.
• Analytics 4.0
• Analytics 4.0, though still in its early days, has seen
organizations adopt predictive analytics and advanced
decision simulations at scale
• sources into highly sophisticated analytics programs and
deploy largely automated decision-making tools using cloud
and big data technologies.
• Analytics 4.0 is enabling them to make intelligent, insight-
driven recommendations for how businesses can adapt their
market strategies, digital transformation initiatives, and more
Netflix
Content analysis: They analyze data to determine which content to produce and
license, including genre, budget, and target audience.
Customer behavior: They track viewing habits, search and browsing behavior,
and preferences to make recommendations and personalize the user experience.
Pricing: Amazon uses data and analytics to determine optimal pricing for
products and to track competitor pricing.
• Predictive Analytics: Predictive analytics involves using historical data and statistical
techniques to predict future events or outcomes. It utilizes patterns and relationships found
in the data to estimate what might happen in the future.
• Concept of Insights
• Definition:
Insights refer to the valuable understanding or interpretations derived from data analysis.
They are not just the raw data itself but the actionable takeaways that help businesses make
informed decisions.
• What Insights Provide:
• Clarity on trends and patterns in customer behavior, operations, or market conditions.
• Actionable recommendations that can guide strategic decisions.
• Understanding of causal relationships between different business activities.
• Role in Business:
• Insights help in refining business strategies, improving customer experience, and
optimizing operational processes.
• By transforming data into insights, businesses can find opportunities for growth and
areas of improvement.
Data Utilization
1)Customer Purchase Data:.
2)Web Analytics
3)Customer Feedback
Outcome
1)Targeted Marketing
2)Inventory Management
3)Improved User Experience
Data
Definition: Raw facts and figures without context or meaning.
Characteristics: Unprocessed, unorganized, quantitative or qualitative.
Example: Numbers like 23, 45, 67 or survey responses like "yes," "no," "maybe.“
Information
Definition: Data that has been processed or organized to provide meaning.
Characteristics: Contextualized, structured, relevant to a specific situation.
Example: "The average score is 45," or "70% of survey respondents said 'yes.'“
Knowledge
Definition: Information that has been understood and internalized, allowing for informed decision-making or problem-solving.
Characteristics: Insightful, actionable, based on experience and understanding.
Example: A manager decides to increase product inventory based on the knowledge that customer preferences favor certain
products.
Ensuring high data quality involves monitoring these dimensions to support effective analysis,
reporting, and decision-making. High-quality data leads to better insights and outcomes.
➢ The term “Big Data” was first introduced to the computing world by Roger Magoulas from
O’Reilly media in 2005.
➢ Big Data as: "data that’s too big, too fast, or too hard for existing tools to process.”
Volume
business processes, machines, social media platforms, networks, human interactions, and many more.
Facebook (billion messages) (4.5 billion "Like“) (350 million new posts)
Variety
Big Data can be structured, unstructured, and semi-structured
Data will only be collected from databases and sheets in the past, But these days the data will comes in array forms,
that are PDFs, Emails, audios, SM posts, photos, videos, etc.
Structured data: In Structured schema, along with all the required columns. It is in a tabular form. Structured
Data is stored in the relational database management system.
Semi-structured: In Semi-structured, the schema is not appropriately defined, e.g., JSON, XML, CSV, TSV,
and email. OLTP (Online Transaction Processing) systems are built to work with semi-structured data. It is
stored in relations, i.e., tables.
Unstructured Data: All the unstructured files, log files, audio files, and image files are included in the
unstructured data. Some organizations have much data available, but they did not know how to derive the value
of data since the data is raw.
Quasi-structured Data:The data format contains textual data with inconsistent data formats that are formatted
with effort and time with some tools.Example: Web server logs, i.e., the log file is created and maintained by
some server that contains a list of activities.
Veracity
Veracity means how much the data is reliable. It has many ways to filter or translate the data. Veracity is the
process of being able to handle and manage data efficiently. Big Data is also essential in business development.
For example, Facebook posts with hashtags.
Value
Value is an essential characteristic of big data. It is not the data that we process or store. It
is valuable and reliable data that we store, process, and also analyze.
Velocity
real-time. It contains the linking of incoming data sets speeds, rate of change,
and activity bursts. The primary aspect of Big Data is to provide demanding data rapidly.
Big data velocity deals with the speed at the data flows from sources like application
logs, business processes, networks, and social media sites, sensors, mobile
devices, etc.
Ethical considerations : Privacy Concerns, Informed Consent, Data Ownership, Bias and
Discrimination, Transparency and Accountability, Security and Data Breaches
External Sources
1) Market data 2) Third Party data 3) Social data 4) Govt data/Public data
1) Questionnaires 1) Interviews