Semester Project Description and Instructions
Semester Project Description and Instructions
Introduction:
This project delves into the application of advanced neural network architectures for classification
tasks in artificial intelligence. You will explore the power of combining (hybrid) or ensembling
different neural network models to achieve superior results compared to single models.
Throughout the project, you will utilize a dataset of your choice (text, images, etc.) and implement
your chosen approach to solve a classification problem.
Objectives:
1. Dataset Selection:
• Choose a publicly available dataset that aligns with your interests. Examples include:
o Image classification (MNIST handwritten digits, CIFAR-10 object classification)
o Text classification (sentiment analysis datasets, spam detection)
o Other domains (sensor data analysis, financial forecasting)
• Ensure that you know all details about the dataset (total instances, features, and other
details)
2. Model Design and Implementation:
• Select two or more neural network architectures (variants of CNNs, RNNs, etc.) that are
NOT covered in your class curriculum. Research and understand the chosen architectures.
• Explore approaches like:
o Feature extraction with one model and classification with another.
o Stacking models where one model's output becomes the input for another.
o Bagging or boosting techniques to create an ensemble of multiple models.
• Implement your model architecture using a deep learning framework like TensorFlow or
PyTorch.
• Perform all preprocessing steps as well.
• Preprocess the chosen dataset for your model's requirements (normalization, data
augmentation, etc.)
• Train the hybrid/ensemble model, splitting the data into training, validation, and testing
sets.
• Evaluate the model's performance using various metrics:
o Accuracy: Overall correctness of classification.
o Precision: Proportion of true positives among predicted positives.
o Recall: Proportion of true positives identified by the model.
o F1-score: Harmonic mean of precision and recall.
o AUC-ROC (Area Under the Receiver Operating Characteristic Curve): Measures
model performance for imbalanced classes.
o Confusion Matrix: Visualizes the model's performance in each class.
o Heatmap (for image classification): Visualizes model predictions for specific data
points.
• Analyze the evaluation results and interpret them in the context of your chosen dataset and
classification task.
4. Reporting:
o Total: 10%
o 5% for Implementation
o 5% for Report and Presentation