Introduction to Data Analytics
Analytics
Terminologies
• Data – a value or set of values, raw and unorganized.
• Data is extracted from various sources, then cleaned and categorized so that
behavioral patterns can be analyzed.
• The techniques and the tools used vary according to the organization or individual.
Role of Data Analytics
• Gather Hidden Insights – Hidden insights from data are gathered and then
analyzed with respect to business requirements.
• Generate Reports – Reports are generated from the data and passed on to
the relevant teams and individuals, who act on them to grow the business.
• Perform Market Analysis – Market Analysis can be performed to understand the
strengths and weaknesses of competitors.
• Improve Business Requirements – Analyzing data gives a better understanding
of business requirements and, in turn, of the customer experience.
Data Analytics vs. Data Science
• Data analysts typically work with structured data to solve tangible
business problems using tools like SQL, R or Python programming
languages, data visualization software, and statistical analysis.
• Data scientists often deal with the unknown by using more advanced
data techniques to make predictions about the future. They might
automate their own machine learning algorithms or design predictive
modeling processes that can handle both structured and unstructured
data. This role is generally considered a more advanced version of a
data analyst.
*Source: Coursera
• Data analytics answers specific questions or addresses challenges that have
already been identified and are known to the business.
• Data science considers what questions the business should or could be asking.
It involves designing new processes for data modeling, such as predictive models
and custom analyses.
Data Analytics vs. Data Science
Data Analytics:
• Collaborating with organizational leaders to identify informational needs
• Acquiring data from primary and secondary sources
• Cleaning and reorganizing data for analysis
• Analyzing data sets to spot trends and patterns that can be translated into
actionable insights
• Presenting findings in an easy-to-understand way to inform data-driven decisions
Data Science:
• Gathering, cleaning, and processing raw data
• Designing predictive models and machine learning algorithms to mine big data sets
• Developing tools and processes to monitor and analyze data accuracy
• Building data visualization tools, dashboards, and reports
• Writing programs to automate data collection and processing
Data Analytics vs. Data Science
                    Data Analytics                              Data Science
Mathematics         Foundational math, statistics               Advanced statistics, predictive analytics
Programming         Basic fluency in R, Python, SQL             Advanced object-oriented programming
Software and tools  SAS, Excel, business intelligence software  Hadoop, MySQL, TensorFlow, Spark
Other skills        Analytical thinking, data visualization     Machine learning, data modeling
Types of Data Analytics
• Descriptive analytics
• Diagnostic analytics
• Predictive analytics
• Prescriptive analytics
Descriptive analytics
• Descriptive analytics helps answer questions about what happened.
• Key Performance Indicators (KPIs) are used to keep track of successes and failures.
• E.g., ROI (Return on Investment)
• This process requires the collection of relevant data, processing of the data, data
analysis and data visualization.
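As a rough illustration of the collect, process, analyze, and visualize sequence, the sketch below computes ROI as a KPI for a few campaigns. The campaign names and figures are made up for the example, and ROI is taken here as (gain - cost) / cost, which is a common definition but is an assumption rather than something stated on these slides.

```python
from statistics import mean

# Collected data (hypothetical): (campaign, cost, revenue attributed to it)
campaigns = [
    ("email", 1000.0, 1500.0),
    ("search", 2000.0, 2600.0),
    ("social", 500.0, 400.0),
]

def roi(cost, gain):
    """Return on investment as a fraction, assuming ROI = (gain - cost) / cost."""
    return (gain - cost) / cost

# Processing and analysis: one KPI value per campaign, plus the average
roi_by_campaign = {name: roi(cost, gain) for name, cost, gain in campaigns}
average_roi = mean(roi_by_campaign.values())

# Visualization: a crude text bar chart (negative ROI gets an empty bar)
for name, value in roi_by_campaign.items():
    bar = "#" * max(0, round(value * 20))
    print(f"{name:<8}{value:>8.1%} {bar}")
print(f"average {average_roi:>8.1%}")
```

A negative value, as for the hypothetical "social" campaign, marks a failure in KPI terms: it cost more than it returned.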
Diagnostic analytics
• Diagnostic techniques take the findings from descriptive analytics and dig deeper to find the cause.
Predictive analytics
• Predictive techniques use historical data to identify trends and determine if they are
likely to recur.
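One simple way to identify a trend in historical data is to fit a straight line by least squares and extend it one step forward. The sketch below does this with made-up monthly sales figures; the data and the function names are hypothetical, and a linear fit is only one of many possible techniques.

```python
def fit_line(xs, ys):
    """Ordinary least-squares fit of y = slope * x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept

# Historical data (hypothetical): sales for months 0..3
months = [0, 1, 2, 3]
sales = [100.0, 110.0, 120.0, 130.0]

slope, intercept = fit_line(months, sales)

# A positive slope suggests an upward trend; extrapolate to the next month
next_month = 4
forecast = slope * next_month + intercept
print(f"trend: {slope:+.1f} per month, forecast for month {next_month}: {forecast:.1f}")
```

Real data is rarely this clean, so a forecast like this should be read as an estimate of where the trend points, not as a guarantee.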
Prescriptive analytics
• Prescriptive analytics techniques rely on machine learning strategies that can find
patterns in large datasets.
• By analyzing past decisions and events, the likelihood of different outcomes can
be estimated.
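The idea of estimating outcome likelihoods from past decisions can be sketched with plain relative frequencies. This is a deliberately simple stand-in for the machine learning strategies mentioned above, not an instance of them; the decisions, outcomes, and records are all hypothetical.

```python
from collections import Counter, defaultdict

# Past decisions and what followed (hypothetical records)
history = [
    ("discount", "sale"), ("discount", "sale"), ("discount", "no_sale"),
    ("free_shipping", "sale"), ("free_shipping", "no_sale"),
    ("free_shipping", "no_sale"), ("discount", "sale"),
]

def outcome_likelihoods(records):
    """Estimate P(outcome | decision) as a relative frequency."""
    counts = defaultdict(Counter)
    for decision, outcome in records:
        counts[decision][outcome] += 1
    return {
        decision: {o: c / sum(outcomes.values()) for o, c in outcomes.items()}
        for decision, outcomes in counts.items()
    }

likelihoods = outcome_likelihoods(history)

# Prescribe the decision with the highest estimated chance of a sale
best = max(likelihoods, key=lambda d: likelihoods[d].get("sale", 0.0))
print(best, likelihoods[best])
```

With so few records the estimates are unreliable; the point is only to show how past events turn into a recommendation.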
Steps in Data Analytics
• The primary steps in the data analytics process are:
• Data extraction
• Data management/warehousing
• Statistical analysis
• Data presentation/visualization
• The importance and balance of these steps depend on the data being
used and the goal of the analysis.
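The four steps above can be sketched end to end using only the Python standard library. The CSV content, table name, and column names are invented for the example, and an in-memory SQLite database stands in for a real data warehouse.

```python
import csv
import io
import sqlite3
import statistics

# Raw input (hypothetical); in practice this would come from a file or an API
RAW_CSV = """region,sales
north,120
south,95
north,130
south,105
"""

# 1. Data extraction: parse the raw text into typed records
rows = [(r["region"], float(r["sales"])) for r in csv.DictReader(io.StringIO(RAW_CSV))]

# 2. Data management/warehousing: load the records into a queryable store
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE sales (region TEXT, amount REAL)")
conn.executemany("INSERT INTO sales VALUES (?, ?)", rows)

# 3. Statistical analysis: mean and sample standard deviation per region
summary = {}
regions = conn.execute("SELECT DISTINCT region FROM sales ORDER BY region").fetchall()
for (region,) in regions:
    amounts = [a for (a,) in conn.execute(
        "SELECT amount FROM sales WHERE region = ?", (region,))]
    summary[region] = (statistics.mean(amounts), statistics.stdev(amounts))

# 4. Data presentation/visualization: a small text report
for region, (avg, sd) in summary.items():
    print(f"{region:<6} mean={avg:6.1f} stdev={sd:5.2f}")

conn.close()
```

In this toy case every step is a few lines; with messier sources, extraction and warehousing usually take most of the effort.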
Data Extraction
• It involves extracting data from unstructured data sources, e.g. text, large
complex databases, or raw sensor data.
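As a small illustration, the sketch below pulls structured readings out of free-form sensor log text with a regular expression. The log format, sensor names, and field names are assumptions made up for the example; real sources typically need patterns tailored to their own format.

```python
import re

# Raw, unstructured input (hypothetical log format)
RAW_LOG = """\
2024-01-05 08:00 sensor A reported temp=21.5C, all normal
garbled line with no reading
2024-01-05 08:05 sensor B reported temp=22.1C (door open)
"""

# Named groups capture the sensor id and the temperature value
PATTERN = re.compile(
    r"sensor (?P<sensor>\w+) reported temp=(?P<temp>-?\d+(?:\.\d+)?)C"
)

def extract_readings(text):
    """Return one dict per line that contains a recognizable reading."""
    readings = []
    for line in text.splitlines():
        match = PATTERN.search(line)
        if match:  # lines without a reading are skipped, not treated as errors
            readings.append({"sensor": match["sensor"], "temp_c": float(match["temp"])})
    return readings

readings = extract_readings(RAW_LOG)
for reading in readings:
    print(reading)
```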