Untitled Document
Untitled Document
---
Example: In retail, data science helps optimize inventory by analyzing sales trends and
predicting future demand.
---
Example: Netflix collects billions of data points on user behavior (watch history, pause/skip
habits) to recommend shows by cleaning, processing, and applying algorithms on the data.
---
Python: Used for data manipulation (pandas), analysis (numpy), and ML (scikit-learn).
Apache Spark: Processes big data in distributed systems, enabling faster computation.
---
Example: In healthcare, patient data helps predict disease outbreaks or personalize treatment
plans.
---
Semi-structured: JSON format data from a web API containing user profiles.
---
Example: In banking, data science is used for credit scoring and fraud detection using
transaction history and behavioral data.
---
Example Workflow:
5. Deployment: Implementing the model into the website for real-time prediction.
---
---
Binomial Distribution: Used to calculate the probability of getting heads in 10 coin flips.
Monte Carlo Simulation: Simulating thousands of possible investment returns to assess financial
risk.
---
Supervised Learning (Regression): Predicting house prices based on area, location, etc.
Unsupervised Learning (Clustering): Grouping customers based on shopping habits for targeted
marketing.
---
11. Use Cases in Various Domains
---
Example: Market basket analysis in retail finds that people who buy diapers often buy beer. This
is derived using association rule mining.
---
Association Rule Mining: Finding that customers who buy bread also often buy butter.
Anomaly Detection: Spotting unusual server activity that may indicate a cyberattack.
---
Example: A company uses a dashboard to track sales KPIs and make informed decisions on
inventory and marketing strategy.
---
Example: Twitter processes real-time tweets across millions of users using distributed
frameworks like Hadoop and Spark for trend analysis.
---
---
---
Would you like a visual mind map or organized table for these details?