FML Solution 1
FML Solution 1
Types:
✅ Short Answer:
Human learning is the process of gaining knowledge or skills through experience, study, or
teaching.
💡 Flashcard:
Q: What is human learning?
A: Learning through experience, study, or teaching.
✅ Short Answer:
Supervised learning uses labeled data to train models, while unsupervised learning finds
patterns from unlabeled data.
💡 Flashcard:
Q: Difference between supervised and unsupervised learning?
A: Supervised → Labeled data.
Unsupervised → Unlabeled data.
Activities:
7. Prediction
8. Tuning
✅ Short Answer:
ML activities include collecting data, preprocessing it, selecting a model, training, testing,
predicting, and tuning.
💡 Flashcard:
Q: What are machine learning activities?
A: Collection, Preprocessing, Model selection, Training, Evaluation, Prediction, Tuning.
🧠 Understanding:
● Mean: Average = Total Sum / Count
Solution:
✅ Short Answer:
Mean = 6.12, Median = 6, Mode = 7
💡 Flashcard:
Q: Mean, median, and mode of the given data?
A: Mean = 6.12, Median = 6, Mode = 7
Shall I continue from Q.2 (a) with all three: Understanding, Short Answer, and Flashcard?
4o
You said:
dont ask that kind of que.. the loop is broke
ChatGPT said:
Updated saved memory
Got it, Sachin. Resuming full FML exam prep from Q.2 (a) — complete with:
● ✅ Understanding
● ✅ Short & Easy Answer
● ✅ Flashcard
Steps:
1. Divide dataset into 2 parts: training and testing (e.g., 80-20 split)
✅ Short Answer:
Split data → Train model → Test model → Evaluate results.
💡 Flashcard:
Q: What are steps in the hold-out method?
A: Split, Train, Test, Evaluate.
Structure:
✅ Short Answer:
A table showing TP, FP, FN, TN to evaluate model accuracy.
💡 Flashcard:
Q: What is a confusion matrix?
A: A table with TP, FP, FN, TN to evaluate predictions.
Methods:
✅ Short Answer:
Data preprocessing improves data quality using cleaning, transformation, selection, and
integration.
💡 Flashcard:
Q: What is data preprocessing?
A: Preparing data using cleaning, transformation, and selection methods.
Example:
If student marks are grouped (0–10, 10–20…), the histogram shows how many fall in each
group.
✅ Short Answer:
Histogram shows frequency distribution using bars.
💡 Flashcard:
Q: What is a histogram?
A: A graph showing data frequency using bars.
Gender of a Nominal
person
✅ Short Answer:
● Gender → Nominal
● Rank → Ordinal
● Price → Ratio
● Color → Nominal
💡 Flashcard:
Q: Data types for gender, rank, price, and color?
A: Nominal, Ordinal, Ratio, Nominal
Steps:
✅ Short Answer:
K-fold divides data into k parts and tests the model k times with different sets.
💡 Flashcard:
Q: What is K-fold cross-validation?
A: It splits data into k parts, testing and training k times.
Applications:
✅ Short Answer:
Used in price prediction, sales estimation, and weather forecasting.
💡 Flashcard:
Q: Applications of regression?
A: House price, sales, weather prediction.
✅ Short Answer:
Uses one variable to predict another using a straight line.
💡 Flashcard:
Q: What is single linear regression?
A: Predicts one value using one variable in a straight-line form.
Steps:
✅ Short Answer:
K-NN classifies based on nearest k data points.
💡 Flashcard:
Q: What does K-NN do?
A: Classifies based on the nearest k neighbors.
✅ Q.3 OR (a) Supervised Learning in Healthcare (03
Marks)
🧠 Understanding:
Examples:
✅ Short Answer:
Used for disease prediction, diagnosis, and patient classification.
💡 Flashcard:
Q: 3 supervised learning examples in healthcare?
A: Disease prediction, risk analysis, diagnosis.
✅ Short Answer:
Classification = categories; Regression = values.
💡 Flashcard:
Q: Classification vs Regression?
A: Classification = classes; Regression = numbers.
✅ Short Answer:
Collect → Preprocess → Train → Test → Evaluate
💡 Flashcard:
Q: Steps in classification learning?
A: Collect, preprocess, train, test, evaluate.
✅ Short Answer:
Clustering = no labels, Classification = labeled data.
💡 Flashcard:
Q: Clustering vs Classification?
A: Clustering = no labels; Classification = labeled.
● Easy to understand
Disadvantages:
✅ Short Answer:
● Easy to use,
− Slow on big data.
💡 Flashcard:
Q: One advantage and disadvantage of Apriori?
A: + Simple; − Slow on large data.
✅ Q.4 (c) Applications of Unsupervised Learning (07
Marks)
🧠 Understanding:
Applications:
✅ Short Answer:
Used in grouping, fraud detection, recommendations.
💡 Flashcard:
Q: 3 uses of unsupervised learning?
A: Segmentation, fraud detection, clustering.
● Product recommendation
💡 Flashcard:
Q: Where is Apriori used?
A: Market basket, product suggestions.
✅ Short Answer:
Support = frequency, Confidence = rule accuracy.
💡 Flashcard:
Q: Support vs Confidence?
A: Support = frequency; Confidence = rule success.
✅ Short Answer:
K-means groups data into k clusters using centroids.
💡 Flashcard:
Q: What does K-means do?
A: Groups data into k clusters.
✅ Short Answer:
Predictive = future, Descriptive = past.
💡 Flashcard:
Q: Predictive vs Descriptive model?
A: Predictive = forecasts; Descriptive = explains.
● Classification
● Regression
● Clustering
● Model selection
● Dimensionality reduction
✅ Short Answer:
Used for ML tasks like classification, regression, and clustering.
💡 Flashcard:
Q: What is Scikit-learn used for?
A: ML tasks like classification and regression.
● N-dimensional array
● Broadcasting
● Fast operations
Applications:
● Data analysis
● Scientific computing
● Machine learning backend
✅ Short Answer:
Fast array handling for ML and data analysis.
💡 Flashcard:
Q: NumPy use in ML?
A: Fast arrays and numerical computing.
✅ Short Answer:
Combines models on random data to reduce error.
💡 Flashcard:
Q: What is bagging?
A: Combines models to reduce variance.
● Fast operations
✅ Short Answer:
Supports DataFrame, filtering, missing data handling.
💡 Flashcard:
Q: Key features of Pandas?
A: DataFrame, filtering, fast data handling.
● 2D plotting
● Customizable plots
Applications:
● Data visualization
● Trend analysis
● ML result plotting
✅ Short Answer:
Used for creating graphs and visualizations in ML.
💡 Flashcard:
Q: Use of Matplotlib?
A: Plot graphs for data and ML.