Project EE331 2019S
Project EE331 2019S
Policy
This course will introduce many essential concepts and algorithms in machine learning through
lectures and homeworks. Students will learn that machine learning is data driven, and the task
performance depends on the quality of the dataset in terms of size and label. This project will provide
students the opportunity (1) to think about an interesting machine learning task that differentiates
itself from all other tasks that have been considered before (recommend students to reference Kaggle,
http:
www.kaggle.com), (2) to collect data and appropriately label it to perform the task (the more labeled
data, the more valuable your dataset becomes),(3) to apply benchmark algorithm to set the bar for
performance and finally (4) choose a dataset from a list of selected dataset and compete with others
in terms of performance. Students are encouraged to form a group for this project - the
size of the group should be no larger than 3.
1
Schedule & grading
1. Each group will submit 4-page task proposal in pdf format including a plan for data collection
The project proposal should include the following items.
(a) Clear description of the task with discussion on (1) how it might be interesting/important
and also (2) the uniqueness of the task.
(b) Detailed description of the data which you will be providing e.g. format, size, number of
data, label.
(c) Any prior knowledge required to understand task/data.
(d) Provide measure for performance. You may have multiple measures.
(e) Any algorithm that you are providing for reference.
(f) Introduce group members and their responsibilities.
2. For selection, each group will make a 10-minute presentation of the task and data. Each
member of the selected groups will receive 7% extra points on the overall final
grade. Members of selected groups must participate in task other than their own. Dataset
and task will be evaluated in terms of uniqueness, datasize and data quality. Selection will be
announced next day. Each group will choose a task from the selection and work on task as a
team. Leaderboard will be operated to inform student of best results.
3. Each student will turn in a 6-page final report describing the algorithm and experimental result
of the group. Student may individually design an algorithm separate from the group and report
its performance. The analysis of the experiment is expected to be different from student to
student regardless of the whether the student has submited an individual or group algorithm.
The final report should be emailed to ([email protected], [email protected] until June
15. Each group must submit video presentation of their algorithm using video recording soft-
ware described below.