Data Analytics Exam Notes
- Definition: Data analytics involves the collection, organization, and analysis of raw data to answer
- Key Components:
* Data analytics is a multidisciplinary field that uses techniques from mathematics, statistics, and
computer science.
* The goal is to analyze data to draw meaningful conclusions, make predictions, and inform
business decisions.
1. Descriptive Analytics:
* Purpose: To understand what has happened in the past by analyzing historical data.
2. Diagnostic Analytics:
* Purpose: To explain why something happened, often using historical data to identify patterns and
correlations.
* Example: A business might use diagnostic analytics to understand why sales dropped in a
particular quarter by finding patterns in customer data.
3. Predictive Analytics:
* Purpose: To predict future trends based on historical data using statistical techniques like
* Techniques: Linear regression, time series forecasting, data mining, predictive modeling.
* Use Case: Businesses use predictive analytics to forecast sales, customer behaviors, or risk
factors.
4. Prescriptive Analytics:
* Example: In healthcare, prescriptive analytics can suggest treatment plans by analyzing patient
* The first step is to identify how the data will be categorized. Categories can include demographic factors
2. Data Collection:
* Data is gathered from various sources such as computers, sensors, or online platforms.
3. Data Organization:
* Once collected, data is organized into structured formats, such as spreadsheets or databases, to
4. Data Cleaning:
* This involves removing errors, duplicates, and incomplete data to ensure the dataset is accurate
and reliable.
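The cleaning step above can be sketched in plain Python. This is a minimal illustration, not a standard routine: the field names (customer_id, region, sales), the helper names, and the rule that negative sales count as an error are all assumptions made for the example.

```python
def is_missing(value):
    """Treat None and blank strings as missing."""
    return value is None or (isinstance(value, str) and value.strip() == "")


def clean_records(records, required_fields, is_valid=lambda row: True):
    """Drop incomplete rows, invalid rows, and duplicates (first copy is kept)."""
    seen = set()
    cleaned = []
    for row in records:
        # Incomplete data: any required field is absent or blank.
        if any(is_missing(row.get(field)) for field in required_fields):
            continue
        # Errors: the row fails a domain rule supplied by the caller.
        if not is_valid(row):
            continue
        # Duplicates: the same values in all required fields were already seen.
        key = tuple(row[field] for field in required_fields)
        if key in seen:
            continue
        seen.add(key)
        cleaned.append(row)
    return cleaned


# Hypothetical raw data with one duplicate, two incomplete rows, and one error.
raw = [
    {"customer_id": 1, "region": "North", "sales": 120.0},
    {"customer_id": 1, "region": "North", "sales": 120.0},  # duplicate
    {"customer_id": 2, "region": "", "sales": 80.0},        # blank region
    {"customer_id": 3, "region": "South", "sales": None},   # missing sales
    {"customer_id": 5, "region": "West", "sales": -40.0},   # assumed error
    {"customer_id": 4, "region": "East", "sales": 95.5},
]

cleaned = clean_records(
    raw,
    required_fields=["customer_id", "region", "sales"],
    is_valid=lambda row: row["sales"] >= 0,  # assumed rule: sales cannot be negative
)
print(cleaned)
```

What counts as an "error" depends on the dataset, which is why the validity rule is passed in rather than hard-coded.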
- NoSQL Databases: Useful for handling large volumes of data that don't fit neatly into structured
formats.
- R Programming: Popular in data science for building statistical models and running complex
analyses.
- Data Lakes: Store vast amounts of raw, unstructured, or semi-structured data for future analysis.
- Microsoft Excel: A common tool for data aggregation and dashboard creation.
1. Primary Data:
2. Secondary Data:
6. Classification of Data
2. Semi-Structured Data: Contains tags or markers that separate elements but does not follow a fixed schema (e.g., JSON, XML).
3. Unstructured Data: Data with no predefined format, which makes it hard to search and analyze (e.g., videos, images).
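The difference between these classes can be illustrated with a short sketch using only the Python standard library. The sample records and names below are invented for illustration: a CSV table stands in for fixed-column structured data, a JSON document for semi-structured data, and a free-text sentence for unstructured data.

```python
import csv
import io
import json

# Structured-style data: every row has the same fixed columns.
table_text = "id,name,city\n1,Asha,Pune\n2,Ben,Leeds\n"
rows = list(csv.DictReader(io.StringIO(table_text)))
cities = [row["city"] for row in rows]  # a column lookup works on every row

# Semi-structured: keys (tags) separate the elements, but fields vary per record.
json_text = (
    '[{"id": 1, "name": "Asha", "tags": ["vip"]},'
    ' {"id": 2, "name": "Ben", "address": {"city": "Leeds"}}]'
)
docs = json.loads(json_text)
doc_keys = [sorted(doc) for doc in docs]  # each record carries its own set of keys

# Unstructured: free text with no fields; only crude searches are possible
# without further processing.
note = "Asha called on Monday to say the delivery to Pune arrived late."
mentions_pune = "Pune" in note

print(cities)
print(doc_keys)
print(mentions_pune)
```

The table can be queried by column name directly, the JSON records have to be inspected one by one because their keys differ, and the free text supports nothing beyond a substring search.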
7. Need for Data Analytics in Business
9. Prescriptive Analytics
- Not only predicts future outcomes but also suggests actions to optimize results.
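The step from prediction to a suggested action can be sketched as follows. This is a toy example under stated assumptions: the monthly sales figures, the candidate actions, and their uplift and cost values are all hypothetical. The forecast uses ordinary least-squares linear regression, one of the predictive techniques listed earlier in these notes.

```python
def fit_line(xs, ys):
    """Ordinary least-squares fit of y = intercept + slope * x."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return intercept, slope


# Predictive part: forecast next month's sales from hypothetical history.
months = [1, 2, 3, 4]
sales = [100.0, 110.0, 120.0, 130.0]
intercept, slope = fit_line(months, sales)
forecast = intercept + slope * 5

# Prescriptive part: score each candidate action and suggest the best one.
# Each action maps to (assumed relative sales uplift, assumed fixed cost).
actions = {
    "no promotion": (0.00, 0.0),
    "email campaign": (0.05, 3.0),
    "discount": (0.10, 20.0),
}


def expected_outcome(action):
    uplift, cost = actions[action]
    return forecast * (1 + uplift) - cost


best_action = max(actions, key=expected_outcome)
print(f"forecast for month 5: {forecast:.1f}")
print(f"suggested action: {best_action}")
```

Real prescriptive systems typically rely on optimization or simulation over many more options. The sketch only shows the basic idea of ranking actions by their predicted result.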