Class 2 - Lifecycle ML Concepts in Ds
• Definition: The Data Science Life Cycle defines how information is
processed through the successive phases of a project by the
professionals working on it.
• Example: A retail store wants to predict sales for the next quarter
to optimize inventory management and reduce waste of perishable
products. This involves analyzing historical sales data and external
factors like seasonal trends or economic conditions.
Data Extraction, Transformation, and Loading (ETL)
• Definition: Extract relevant data from various sources, transform
it into a machine-readable format, and load it into the system for
analysis.
• Example: For the retail store example, gather past sales data,
weather patterns, holidays, and any promotional activities.
Transform this data into a structured format suitable for analysis.
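The three ETL steps for the retail example can be sketched as follows. This is a minimal illustration using only the Python standard library, not a production pipeline: the sample CSV text, the column names, the `HOLIDAYS` set, and the `sales` table are all hypothetical stand-ins for whatever sources and storage a real store would use.

```python
import csv
import io
import sqlite3
from datetime import date

# Hypothetical raw export; a real pipeline would read files, APIs, or databases.
RAW_SALES_CSV = """date,product,units_sold,promo
2024-01-01,milk,120,yes
2024-01-02,milk,95,no
2024-01-03,milk,,no
"""

HOLIDAYS = {date(2024, 1, 1)}  # assumed holiday calendar


def extract(raw_text):
    """Extract: read the raw rows as dictionaries of strings."""
    return list(csv.DictReader(io.StringIO(raw_text)))


def transform(rows):
    """Transform: cast types, add a holiday flag, skip rows with no sales figure."""
    records = []
    for row in rows:
        if not row["units_sold"]:
            continue  # no sales figure recorded for this day
        day = date.fromisoformat(row["date"])
        records.append((
            day.isoformat(),
            row["product"],
            int(row["units_sold"]),
            int(row["promo"] == "yes"),
            int(day in HOLIDAYS),
        ))
    return records


def load(records, conn):
    """Load: write the structured records into a table ready for analysis."""
    conn.execute(
        "CREATE TABLE sales (date TEXT, product TEXT, units_sold INTEGER, "
        "promo INTEGER, holiday INTEGER)"
    )
    conn.executemany("INSERT INTO sales VALUES (?, ?, ?, ?, ?)", records)
    conn.commit()


conn = sqlite3.connect(":memory:")
load(transform(extract(RAW_SALES_CSV)), conn)
total_units = conn.execute("SELECT SUM(units_sold) FROM sales").fetchone()[0]
print(total_units)  # 215
```

Keeping extract, transform, and load as separate functions mirrors the definition above and lets each step be swapped out (for example, a different data source) without touching the others.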
Data Preprocessing
• Definition: Clean, manipulate, and prepare the data for modeling
using techniques like exploratory data analysis (EDA), statistical
analysis, and data wrangling.
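A small sketch of what this cleaning and exploration step might look like, using only the standard `statistics` module. The `daily_units` values are invented for illustration, and filling gaps with the median is just one possible wrangling choice, not a recommended default.

```python
import statistics

# Hypothetical daily unit sales; None marks a missing reading.
daily_units = [120, 95, None, 110, 105, None, 98]

# Data wrangling: fill missing readings with the median of the observed values.
observed = [u for u in daily_units if u is not None]
median_units = statistics.median(observed)
filled = [median_units if u is None else u for u in daily_units]

# Exploratory / statistical analysis: a few summary figures for the cleaned series.
summary = {
    "count": len(filled),
    "missing": len(daily_units) - len(observed),
    "mean": statistics.mean(filled),
    "stdev": statistics.stdev(filled),
    "min": min(filled),
    "max": max(filled),
}
print(summary)
```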
• Fraud detection using machine learning models (e.g., supervised and unsupervised techniques).
• Personalized financial advice through customer segmentation and sentiment analysis.
Retail
Healthcare
Construction
Track task completion times, material costs, and resource usage for better decision-making.
Manufacturing
Analyze untapped data from manufacturing processes to optimize efficiency and reduce waste.
Public Services
• Use cases include health research, fraud detection, energy exploration, environmental protection,
and financial market analysis.
Business Intelligence Vs Data Science
Criterion: Skills
• Business Intelligence: Statistics and Visualization are the two skills required for business intelligence.
• Data Science: Statistics, Visualization, and Machine Learning are the required skills for data science.
Applications:
• Forecasting sales or trends.
• Customer segmentation.
• Fraud detection.
Key ML Algorithms
• Regression
• Decision Tree
• Clustering
• Classification
• Outlier Analysis
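To make the first item in this list concrete, here is a minimal sketch of regression applied to sales forecasting: a straight line fitted by ordinary least squares. The quarterly figures and the function name `fit_line` are hypothetical, and a real forecast would use more data and features such as seasonality.

```python
def fit_line(xs, ys):
    """Fit y = slope * x + intercept by ordinary least squares."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Covariance-like and variance-like sums (the 1/n factors cancel out).
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


# Hypothetical sales (in thousands) for four past quarters.
quarters = [1, 2, 3, 4]
sales = [100, 110, 120, 130]

slope, intercept = fit_line(quarters, sales)
forecast_q5 = slope * 5 + intercept
print(slope, intercept, forecast_q5)  # 10.0 90.0 140.0
```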
Key Algorithms and Their Applications
Definition: Machine learning is a subset of artificial intelligence that
enables systems to learn and make decisions without explicit programming.
Supervised Learning
• Definition: Learns from labeled data (input-output pairs).
• Goal: Predict an outcome (classification or regression).
• Example Algorithms: Linear Regression, Decision Trees, Neural Networks.
• Use Cases: Spam detection, medical diagnosis, price prediction.
Unsupervised Learning
• Definition: Learns from unlabeled data by finding patterns.
• Goal: Discover hidden patterns or groupings in data.
• Example Algorithms: K-Means Clustering, Principal Component Analysis (PCA), Autoencoders.
• Use Cases: Customer segmentation, anomaly detection, recommendation systems.
Reinforcement Learning
• Definition: Learns through trial and error by interacting with the environment.
• Goal: Maximize rewards through continuous interactions.
• Example Algorithms: Q-Learning, Deep Q-Networks (DQN), Policy Gradient.
• Use Cases: Robotics, game playing (AlphaGo), self-driving cars.
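As one illustration of learning from unlabeled data, the sketch below runs K-Means clustering on a single feature to segment customers. The spend values, the starting centroids, and the helper name `kmeans_1d` are assumptions made for the example; practical segmentation would use several features and a library implementation.

```python
def kmeans_1d(values, centroids, max_iters=100):
    """Cluster 1-D values with K-Means, starting from the given centroids."""
    centroids = list(centroids)
    labels = []
    for _ in range(max_iters):
        # Assignment step: each value joins its nearest centroid.
        labels = [
            min(range(len(centroids)), key=lambda j: abs(v - centroids[j]))
            for v in values
        ]
        # Update step: each centroid moves to the mean of its members.
        new_centroids = []
        for j in range(len(centroids)):
            members = [v for v, lab in zip(values, labels) if lab == j]
            new_centroids.append(sum(members) / len(members) if members else centroids[j])
        if new_centroids == centroids:
            break  # assignments are stable, so the algorithm has converged
        centroids = new_centroids
    return labels, centroids


# Hypothetical monthly spend per customer; no labels are provided.
monthly_spend = [10, 12, 11, 50, 52, 49]
labels, centers = kmeans_1d(monthly_spend, centroids=[10, 49])
print(labels)  # [0, 0, 0, 1, 1, 1]
```

No output labels are given to the algorithm; the two groups (low and high spenders) emerge from the data alone, which is what distinguishes this from the labeled input-output setting.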