ML Module 1
ML Module 1
I. Data Quality and Quantity: Machine learning models heavily rely on data for
training. Poor quality data or insufficient data can lead to inaccurate or biased models.
Cleaning and preprocessing data is often a time-consuming and critical step.
Ii. Bias and Fairness: Models can inadvertently learn biases present in the training
data, leading to unfair or discriminatory outcomes. Ensuring fairness and mitigating
bias is a significant ethical concern in machine learning.
iii. Overfitting and Underfitting: Overfitting occurs when a model learns the training
data too well, capturing noise and leading to poor generalization on new data.
Underfitting, on the other hand, happens when a model is too simple to capture the
underlying patterns in the data.
iv. Feature Engineering: Selecting and creating relevant features from raw data is
crucial for model performance. Poor feature selection can lead to suboptimal results,
and manual feature engineering can be time-intensive.
vi . Interpretable and Explainable Models: Complex models like deep neural networks
can be highly accurate, but they are often difficult to interpret. Understanding why a
model makes a certain prediction is important for building trust and explaining
decisions to stakeholders.
2 Explain any five applications of Machine Learning?
4. Recommendation Systems:
ML powers recommendation engines in platforms like Netflix, Amazon, and Spotify.
These systems analyze user behavior to suggest relevant content, products, or music,
enhancing user experience and driving engagement.
In the real world, we are surrounded by humans who can learn everything from
their experiences with their learning capability, and we have computers or
machines which work on our instructions. But can a machine also learn from
experiences or past data like a human does? So here comes the role of Machine
Learning.
explicitly programmed.”
4. Explain the steps of developing Machine Learning applications
steps involved in developing Machine Learning applications:
Problem Definition:
● Clearly define the problem you want to solve with Machine
Learning.
● Identify the input data (features) and the desired output (target).
Data Collection:
● Gather relevant and sufficient data for training and testing the
model.
● Ensure data quality and perform data preprocessing (cleaning,
handling missing values, etc.).
Feature Engineering:
● Select and transform appropriate features from the raw data.
● Create new features if necessary to enhance the model's
performance.
Model Selection and Training:
● Choose a suitable Machine Learning algorithm based on the
problem type (classification, regression, etc.).
● Split the data into training and testing sets.
● Train the chosen model using the training data.
Evaluation and Validation:
● Evaluate the model's performance using metrics like accuracy,
precision, recall, or mean squared error, depending on the problem.
● Validate the model's performance on unseen data (testing set) to
assess its generalization ability.
Fine-Tuning and Deployment:
● Fine-tune the model by adjusting hyperparameters to improve its
performance.
● Once satisfied with the model's performance, deploy it to a
production environment for real-world use.