Unit 1 Topic 1 Intro
Unit 1 Topic 1 Intro
Analytics
Dr. Anil Kumar Dubey
Associate Professor,
Computer Science & Engineering Department,
ABES EC, Ghaziabad
Affiliated to Dr. A.P.J. Abdul Kalam Technical University,
Uttar Pradesh, Lucknow
Basic
Data analytics is the process of storing,
organizing, and analyzing raw data to answer
questions or gain important insights. Data
analytics is integral to business because it allows
leadership to create evidence-based strategy,
understand customers to better target marketing
initiatives, and increase overall productivity.
◦ Structured
◦ Semi-structured
◦ Unstructured
Structured data
Structured data is data whose elements are addressable
for effective analysis.
It has been organized into a formatted repository that is
typically a database.
It concerns all data which can be stored in
database SQL in a table with rows and columns.
They have relational keys and can easily be mapped
into pre-designed fields. Today, those data are most
processed in the development and simplest way to
manage information.
Example: Relational data.
Semi-Structured data
Semi-structured data is information that does not reside
in a relational database but that has some
organizational properties that make it easier to analyze.
It is based on
It is based on Relational It is based on character
Technology XML/RDF(Resource Description
database table and binary data
Framework).
Query Structured query allow Queries over anonymous nodes Only textual queries are
performance complex joining are possible possible
Need of data analytics
Implementing data analytics into the
business model means companies can help
reduce costs by identifying more efficient
ways of doing business.