Machine Learning-Assignments PDF

ml assignment questions

Uploaded by

Abel Sisay

CSEg4308 Introduction to Machine Learning – Assignment – 2024.

• Students may select one assignment from the options below and submit a soft copy
by April 30, 2024.
• This is a group assignment; each group may have at most five students.

Assignment 1: Supervised Learning in ML


Objective: Implement and evaluate a supervised learning model on a real-world dataset.
Dataset: Choose a dataset from a platform such as the UCI Machine Learning Repository or Kaggle,
or use any public dataset relevant to social, economic, or environmental issues.
Tasks:
1. Perform exploratory data analysis to understand the dataset.
2. Preprocess the data: handle missing values, normalize/standardize features, and encode categorical
variables.
3. Implement at least two supervised learning algorithms (e.g., logistic regression, support vector
machines, decision trees, etc.).
4. Evaluate model performance using appropriate metrics (accuracy, precision, recall, F1-score,
ROC curve).
Model Evaluation:

• Split the data into training and testing sets (e.g., 80%/20%).
• Train the models on the training set and evaluate their performance on the unseen testing
set.
• Use metrics such as accuracy, precision, recall, and F1-score to assess the models' ability to
correctly classify the test examples.
• Plot ROC curves to visualize the trade-off between true positive rate and false positive
rate for each model.
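The preprocessing, training, and evaluation steps above can be sketched as follows. This is only an illustration under stated assumptions: the toy data frame, its column names (income, age, region), and the choice of logistic regression and a decision tree are hypothetical stand-ins, and scikit-learn and pandas are assumed to be installed. A real submission would load the chosen dataset instead.

```python
import numpy as np
import pandas as pd
from sklearn.base import clone
from sklearn.compose import ColumnTransformer
from sklearn.impute import SimpleImputer
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import (accuracy_score, precision_recall_fscore_support,
                             roc_auc_score, roc_curve)
from sklearn.model_selection import train_test_split
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import OneHotEncoder, StandardScaler
from sklearn.tree import DecisionTreeClassifier

# Hypothetical toy data; replace with the dataset chosen for the assignment.
rng = np.random.default_rng(0)
n = 400
income = rng.normal(50, 15, n)
region = rng.choice(["north", "south", "east"], n)
y = (income + 10 * (region == "north") + rng.normal(0, 10, n) > 55).astype(int)
df = pd.DataFrame({
    "income": income,
    "age": rng.integers(18, 70, n).astype(float),
    "region": region,
})
df.loc[rng.choice(n, 30, replace=False), "income"] = np.nan  # inject missing values

# Impute and standardize numeric columns; one-hot encode the categorical column.
preprocess = ColumnTransformer([
    ("num", Pipeline([("impute", SimpleImputer(strategy="median")),
                      ("scale", StandardScaler())]), ["income", "age"]),
    ("cat", OneHotEncoder(handle_unknown="ignore"), ["region"]),
])

# 80%/20% split, stratified so both sets keep the same class balance.
X_train, X_test, y_train, y_test = train_test_split(
    df, y, test_size=0.2, stratify=y, random_state=0)

models = {
    "logistic_regression": LogisticRegression(max_iter=1000),
    "decision_tree": DecisionTreeClassifier(max_depth=4, random_state=0),
}
results = {}
for name, clf in models.items():
    pipe = Pipeline([("prep", clone(preprocess)), ("clf", clf)])
    pipe.fit(X_train, y_train)           # preprocessing is fit on training data only
    pred = pipe.predict(X_test)
    proba = pipe.predict_proba(X_test)[:, 1]
    precision, recall, f1, _ = precision_recall_fscore_support(
        y_test, pred, average="binary", zero_division=0)
    fpr, tpr, _ = roc_curve(y_test, proba)  # points for the ROC plot
    results[name] = {
        "accuracy": accuracy_score(y_test, pred),
        "precision": precision,
        "recall": recall,
        "f1": f1,
        "roc_auc": roc_auc_score(y_test, proba),
        "roc_points": (fpr, tpr),
    }
    print(name, {k: round(float(v), 3) for k, v in results[name].items()
                 if k != "roc_points"})
```

The (fpr, tpr) arrays stored under "roc_points" can be passed to a plotting library such as matplotlib to draw one ROC curve per model. Wrapping the preprocessing in a pipeline keeps the imputer and scaler from seeing the test set, which is one common way to avoid data leakage.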
5. Write a report discussing the models' performance, insights obtained, and potential improvements.
Report:

• Discuss the performance of both models (accuracy, precision, recall, F1-score, ROC
curves).
• Analyze which model performed better and why.
• Explore potential reasons for errors (e.g., classes that are easily confused, such as the
similar-looking digits 6 and 9 in a handwritten-digit dataset).
• Suggest potential improvements for future iterations:
   • Hyperparameter tuning for each model.
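Hyperparameter tuning and error analysis could look like the sketch below. It assumes scikit-learn is installed and uses its bundled handwritten-digits dataset and an SVM purely as an example; the parameter grid is a small illustrative choice, not a recommended search space.

```python
import numpy as np
from sklearn.datasets import load_digits
from sklearn.metrics import accuracy_score, confusion_matrix
from sklearn.model_selection import GridSearchCV, train_test_split
from sklearn.svm import SVC

# Bundled 8x8 handwritten-digit images (illustrative dataset choice).
X, y = load_digits(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, stratify=y, random_state=0)

# Small, illustrative grid; cross-validation runs on the training set only.
param_grid = {"C": [1, 10], "gamma": ["scale", 0.001]}
search = GridSearchCV(SVC(), param_grid, cv=3, scoring="f1_macro")
search.fit(X_train, y_train)

pred = search.predict(X_test)  # uses the best estimator, refit on all training data
test_accuracy = accuracy_score(y_test, pred)
print("best parameters:", search.best_params_)
print("test accuracy:", round(test_accuracy, 3))

# Error analysis: find the most frequent off-diagonal (true, predicted) pair.
cm = confusion_matrix(y_test, pred)
errors = cm.copy()
np.fill_diagonal(errors, 0)
true_digit, predicted_digit = np.unravel_index(errors.argmax(), errors.shape)
print("most common confusion: true", true_digit, "predicted", predicted_digit,
      "count", errors[true_digit, predicted_digit])
```

Inspecting the off-diagonal entries of the confusion matrix is one simple way to ground the "reasons for errors" discussion in the report.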

Assignment 2: Natural Language Processing (NLP)

Objective: Develop a model to perform sentiment analysis on text data.
Dataset: IMDb Reviews, Twitter data, or any other text corpus with sentiment labels.
Tasks:
• Perform text preprocessing (tokenization, stopword removal, stemming/lemmatization).
• Vectorize the text using techniques such as TF-IDF or word embeddings.
• Implement a machine learning model (e.g., logistic regression, random forest) and a deep learning
model (e.g., LSTM).
• Compare the performance of the models based on precision, recall, and F1-score.
• Analyze the errors and discuss potential improvements.
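A minimal sketch of the classical half of this assignment is shown below, assuming scikit-learn is installed. The twelve training sentences and four test sentences are made up for illustration and are far too few for a real evaluation; stemming/lemmatization and the deep learning model (e.g., LSTM) are deliberately left out.

```python
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import precision_recall_fscore_support
from sklearn.pipeline import make_pipeline

# Hypothetical mini-corpus; 1 = positive sentiment, 0 = negative sentiment.
train_texts = [
    "A wonderful film with great acting",
    "I loved this movie, it was fantastic",
    "Great story and wonderful characters",
    "Fantastic direction, I loved every minute",
    "An enjoyable and great experience",
    "Loved the soundtrack, wonderful pacing",
    "A terrible film with awful acting",
    "I hated this movie, it was boring",
    "Awful story and terrible characters",
    "Boring direction, I hated every minute",
    "A dull and awful experience",
    "Hated the soundtrack, terrible pacing",
]
train_labels = [1] * 6 + [0] * 6
test_texts = [
    "wonderful and great movie",
    "terrible boring film",
    "I loved the fantastic story",
    "awful, dull and boring",
]
test_labels = [1, 0, 1, 0]

# TfidfVectorizer lowercases, tokenizes, and drops English stopwords,
# then weights the remaining terms by TF-IDF.
model = make_pipeline(
    TfidfVectorizer(lowercase=True, stop_words="english"),
    LogisticRegression(max_iter=1000),
)
model.fit(train_texts, train_labels)

# Inspect what the preprocessing step keeps for one review.
analyzer = model.named_steps["tfidfvectorizer"].build_analyzer()
tokens = analyzer("I loved this movie, it was fantastic")
print("tokens after stopword removal:", tokens)

predictions = model.predict(test_texts)
precision, recall, f1, _ = precision_recall_fscore_support(
    test_labels, predictions, average="binary", zero_division=0)
print("precision:", precision, "recall:", recall, "f1:", f1)
```

The same precision, recall, and F1 computation can be applied to the deep learning model's predictions so that both models are compared on identical test data.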

Assignment 3: Unsupervised Learning and Dimensionality Reduction


Objective: Explore unsupervised learning techniques to discover patterns and reduce dimensionality.
Dataset: Any high-dimensional dataset (e.g., gene expression data, customer purchase data).
Tasks:
1. Apply clustering algorithms like K-means or DBSCAN to identify distinct groups within the data.
2. Use dimensionality reduction techniques such as PCA or t-SNE to visualize data clusters.
3. Interpret and report the findings from the clustering and dimensionality reduction.
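The three tasks above can be sketched as follows, assuming scikit-learn is installed. The synthetic 20-dimensional blobs stand in for a real high-dimensional dataset, and K-means with three clusters plus PCA are just one possible pairing; with real data the number of clusters is usually unknown and has to be chosen, for example by comparing silhouette scores.

```python
from sklearn.cluster import KMeans
from sklearn.datasets import make_blobs
from sklearn.decomposition import PCA
from sklearn.metrics import silhouette_score
from sklearn.preprocessing import StandardScaler

# Synthetic stand-in for a high-dimensional dataset (300 samples, 20 features).
X, _ = make_blobs(n_samples=300, n_features=20, centers=3, random_state=0)

# Standardize so that no single feature dominates the distance computations.
X_scaled = StandardScaler().fit_transform(X)

# Cluster in the full feature space.
kmeans = KMeans(n_clusters=3, n_init=10, random_state=0)
labels = kmeans.fit_predict(X_scaled)
silhouette = silhouette_score(X_scaled, labels)
print("silhouette score:", round(float(silhouette), 3))

# Project to two dimensions for visualization only.
pca = PCA(n_components=2)
X_2d = pca.fit_transform(X_scaled)
explained = pca.explained_variance_ratio_.sum()
print("variance explained by 2 components:", round(float(explained), 3))
```

A scatter plot of X_2d colored by labels (for example with matplotlib) gives the cluster visualization for the report; the explained-variance figure indicates how much structure the 2-D view can be expected to preserve.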
