Hackathon Notes
Hackathon Notes
Dataset Format
● Format: CSV
● Dictionary: A README.md file that explains:
○ What each column in the dataset represents.
○ The data type of each column (e.g., integer, string, float).
○ Any known missing values and how they are represented (e.g., NaN, null).
○ The target variable that students need to predict.
● Starter Notebook: Provide a basic Google Colab:
○ Load the dataset using Pandas.
○ Build a very simple baseline model using Scikit-learn.
○ Generate a submission.
Questions
● Provide multiple questions that could be answered and let students decided
○ Choosing the important/impactful problem to solve is often more important than
simply having a good model.
4 (Very Good) The chosen question is clearly and precisely articulated, demonstrating
a thoughtful selection from the given list. Strong and convincing
justification for why this particular question is important and relevant to
the dataset. Shows a deep understanding of how their approach will
directly answer the question and deliver significant potential value.
3 (Good) The technical approach is appropriate for the problem, with generally
sound data preprocessing and feature engineering. Model selection is
reasonable. Code is mostly organized and reproducible, though minor
improvements could be made. Demonstrates a good understanding of
core ML principles and their application.
Score Description
Score Description
2 (Fair) Presentation has some structure but lacks flow or clarity. Some
important aspects of the project are not fully communicated.
Explanation of technical concepts is basic or somewhat confusing. The
team answers some questions but struggles with deeper inquiries.
3 (Good) Presentation is structured and mostly clear. All key elements of the
project are presented, though some details might be rushed. Technical
concepts are explained adequately for the audience. The team answers
most questions reasonably well.
4 (Very Good) Presentation is well-structured, clear, and engaging. All key elements of
the project are communicated effectively and concisely. Technical
concepts are explained clearly and accessibly to both technical and
non-technical audiences. The team handles questions confidently and
articulately, demonstrating strong understanding.