Exam - HND
Exam - HND
9. What type of chart is most appropriate for visualizing the relationship between two
continuous variables?
○ a) Bar chart
○ b) Pie chart
○ c) Scatter plot
○ d) Line chart
10. Which cloud platform is widely used for storing large-scale data?
○ a) AWS
○ b) WordPress
○ c) GitHub
○ d) Canva
You are given a dataset containing features like Study Hours, Attendance, and Past
Performance, and a target column Result (1 for pass, 0 for fail).
1. Explain how you would preprocess this dataset before applying a machine learning
model. (5 marks)
2. Select a suitable algorithm for this task and justify your choice. (5 marks)
3. Write down the steps for splitting the data into training and testing sets and training
the model. (5 marks)
4. Propose metrics to evaluate the model's performance and explain their importance.
(5 marks)
5. Discuss how you could improve the model's accuracy. (5 marks)
A company wants to segment its customers based on their purchasing behavior. You have
features like Purchase Amount, Frequency, and Age.