ISB_Assignment 2
ISB_Assignment 2
Objective:
This assignment assesses your practical skills in data acquisition, data preprocessing,
and the application of SEM using a programming language (preferably Python). It tests
your ability to replicate published research and critically analyze the results.
Task:
Part 0: Paper Selection:
1. Choose one of the five papers you reviewed in Assignment 1. Clearly state which
paper you have chosen and explain your reasoning. Factors to consider might
include:
a. The feasibility of replicating the analysis with the data you can scrape.
b. The clarity and completeness of the methodology described in the paper.
c. Your personal interest in the research topic.
d. The availability of code or detailed model specifications from the original
authors (this is a bonus, but not required).
Deliverables:
Datafile: The scraped and preprocessed data in CSV format.
Source Code: Your Python script(s) for data scraping, SEM analysis and, ML model
training and evaluation, well-commented and organized.
Scraping Workflow PPT: The single-slide PowerPoint presentation describing your
scraping workflow.
Modeling Pipeline Documentation: A detailed, written description of your modeling
pipeline (as a separate document, e.g., a Markdown or text file).
Results and Discussion Report: A 2-3 page report in LaTeX format (PDF).
Evaluation Criteria:
Data Scraping: Completeness, accuracy, and efficiency of the data scraping process.
Effective handling of potential issues.
Data Preprocessing: Appropriateness and thoroughness of preprocessing steps. Clear
justification for choices made.
SEM Implementation: Correct implementation of the SEM model, including model
specification, estimation, and fit assessment.
ML Model Development: Appropriate choice of ML models, proper training and
evaluation procedures, and clear justification of choices.
SEM and ML Comparison: Thoughtful and insightful comparison of the two
approaches, focusing on relevant aspects like predictive power and feature
importance.
Result Replication: Degree of success in replicating the key findings of the original
study.
Analysis and Interpretation: Thoughtful and insightful comparison of results,
discussion of limitations, and identification of potential areas for future research.
Documentation and Reporting: Clear, concise, and well-organized documentation of
all steps, including code comments, workflow descriptions, and the final report.
Code Quality: Readability, efficiency, and adherence to good coding practices.
Latex Report Quality: Proper use of Latex syntax, well-formatted, structed report with
professional look.