Machine Learning-Assignments PDF
Machine Learning-Assignments PDF
Students may select one assignment from the options provided and submit softcopy
by April 30, 2024.
This is a group assignment, with each group allowed a maximum of five students.
Split the data into training and testing sets (e.g., 80%/20%).
Train the models on the training set and evaluate their performance on the unseen testing
set.
Use metrics like accuracy, precision, recall, F1-score to assess the models' ability to
correctly classify digits.
Plot ROC curves to visualize the trade-off between true positive rate and false positive
rate for each model.
5. Write a report discussing the models' performance, insights obtained, and potential improvements.
Report:
Discuss the performance of both models (accuracy, precision, recall, F1-score, ROC
curves).
Analyze which model performed better and why.
Explore potential reasons for errors (e.g., similar-looking digits like 6 and 9).
Suggest potential improvements for future iterations:
Hyperparameter tuning for each model.
Assignment 2: Natural Language Processing (NLP)
Objective: Develop a model to perform sentiment analysis on text data.
Dataset: IMDb Reviews, Twitter data, or any other text corpus with sentiment labels.
Tasks:
Perform text preprocessing (tokenization, stopword removal, stemming/lemmatization).
Vectorize the text using techniques like TF-IDF or word embeddings.
Implement a machine learning model (e.g., logistic regression, random forest) and a deep learning
model (e.g., LSTM).
Compare the performance of the models on the basis of precision, recall, and F1-score.
Analyze the errors and discuss potential improvements.