ML Viva Questions
ML Viva Questions
2. Define Outliers.
Outliers are data points that are significantly different from most other data,
which can affect analysis accuracy.
Assignment B-2 : Classify the email using the binary classification method
1. Data Preprocessing
Data preprocessing involves cleaning, transforming, and organizing raw data to
make it suitable for analysis or machine learning models.
2. Binary Classification
Binary classification is a type of classification where there are only two
possible outcomes, such as "yes" or "no," "true" or "false."
3. K-Nearest Neighbours (K-NN)
K-NN is a simple algorithm that classifies a data point based on the majority
class among its closest K neighbors in the dataset.
2. Keras
Keras is a high-level neural network library in Python, built on top of
TensorFlow, that simplifies building and training deep learning models.
3. TensorFlow
TensorFlow is an open-source machine learning framework that
supports building, training, and deploying large-scale machine learning
and deep learning models.
4. Normalization
Normalization is the process of scaling data to a standard range (usually
0 to 1) to ensure each feature contributes equally to model performance.
5. Confusion Matrix
A confusion matrix is a table that shows the performance of a
classification model by comparing actual vs. predicted values for each
class, helping to evaluate accuracy, precision, and recall.
Assignment B-4 : Implement K-Nearest Neighbors algorithm on diabetes.csv+
1. What is data preprocessing, and why is it important?
Data preprocessing prepares raw data by cleaning, transforming, and organizing
it to improve model performance and accuracy.
7. Define accuracy_score.
Accuracy score is the percentage of correct predictions out of all predictions.