Hackathon Problem Statement
For each user, predict the top 3 product categories that they are most
likely to purchase from in the future.
This file contains the detailed purchasing history for every user. It has
order value and the category of the product.
This file contains data, for some users, about the categories of items they
bought in the future.
Test Data
This file contains the detailed purchasing history for some users. It has
the order value and the category of the product. You have to predict
the top 3 categories that the users with these user_ids will purchase
from in the future.
Evaluation
Measurements will be based on mean reciprocal rank
(MRR) and precision. Both measurements are explained here.
User id | Products in the order shown                   | Product bought                                  | Reciprocal Rank
--------|-----------------------------------------------|-------------------------------------------------|----------------
1       | E-readers, Kitchen Supplies, Technology books | Phones, Comics, Technology Books                | 1/3
2       | Phones, Comics, Fruits                        | None                                            | 0
3       | Groceries, Fruits, Phones                     | None                                            | 0
4       | Phones, Home Decor, readers                   | Fruits, Home Decor, readers                     | 1/2
5       | Phones, Books, Fruits                         | Home Decor, Home Furnishings, Kitchen Supplies  | 0
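The reciprocal rank values in the table can be reproduced with a short Python sketch. It assumes that the reciprocal rank is 1 divided by the position of the first shown category that the user actually bought (0 if none match), that category names are compared case-insensitively, and that the mean is taken over all five rows. None of these details is stated explicitly in the problem statement, and the function name `reciprocal_rank` is illustrative only.

```python
from fractions import Fraction


def reciprocal_rank(shown, bought):
    """Return 1/rank of the first shown category that was bought, else 0."""
    # Assumption: names are matched case-insensitively
    # ("Technology books" vs "Technology Books").
    bought_set = {name.casefold() for name in bought}
    for rank, category in enumerate(shown, start=1):
        if category.casefold() in bought_set:
            return Fraction(1, rank)
    return Fraction(0)


# (products in the order shown, products bought), transcribed from the table.
rows = [
    (["E-readers", "Kitchen Supplies", "Technology books"],
     ["Phones", "Comics", "Technology Books"]),
    (["Phones", "Comics", "Fruits"], []),
    (["Groceries", "Fruits", "Phones"], []),
    (["Phones", "Home Decor", "readers"],
     ["Fruits", "Home Decor", "readers"]),
    (["Phones", "Books", "Fruits"],
     ["Home Decor", "Home Furnishings", "Kitchen Supplies"]),
]

ranks = [reciprocal_rank(shown, bought) for shown, bought in rows]

# Assumption: the mean is taken over every row, including rows scored 0.
mrr = sum(ranks) / len(ranks)
print("reciprocal ranks:", [str(r) for r in ranks])
print("MRR:", mrr)
```

If the organisers average only over users with a known purchase, the denominator would change, so treat the final number as indicative rather than official.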
Precision or Accuracy
For each row, we first count the number of products in the prediction that
match the products bought by that user_id. We then average this number
across all valid predictions. For the above table, precision would look
like this:
User id | Products in the order shown                   | Product bought                                  | Precision
--------|-----------------------------------------------|-------------------------------------------------|----------
1       | E-readers, Kitchen Supplies, Technology books | Phones, Comics, Technology Books                | 1
2       | Phones, Comics, Fruits                        | None                                            | NA
3       | Groceries, Fruits, Phones                     | None                                            | NA
4       | Phones, Home Decor, readers                   | Fruits, Home Decor, readers                     | 2
5       | Phones, Books, Fruits                         | Home Decor, Home Furnishings, Kitchen Supplies  | NA
Recall, in this case, is the fraction of rows for which there is a valid
prediction: 2/5 = 0.4
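The precision and recall figures can be checked with a similar sketch. It assumes that a row counts as a valid prediction only when at least one shown category matches a bought one (which is how the NA entries in the table appear to behave), and that names are compared case-insensitively. The helper name `match_count` is hypothetical.

```python
def match_count(shown, bought):
    """Count shown categories that also appear among the bought ones."""
    # Assumption: names are matched case-insensitively.
    bought_set = {name.casefold() for name in bought}
    return sum(1 for category in shown if category.casefold() in bought_set)


# (products in the order shown, products bought), transcribed from the table.
rows = [
    (["E-readers", "Kitchen Supplies", "Technology books"],
     ["Phones", "Comics", "Technology Books"]),
    (["Phones", "Comics", "Fruits"], []),
    (["Groceries", "Fruits", "Phones"], []),
    (["Phones", "Home Decor", "readers"],
     ["Fruits", "Home Decor", "readers"]),
    (["Phones", "Books", "Fruits"],
     ["Home Decor", "Home Furnishings", "Kitchen Supplies"]),
]

counts = [match_count(shown, bought) for shown, bought in rows]

# Assumption: rows with zero matches are the "NA" rows and are excluded
# from the precision average.
valid = [count for count in counts if count > 0]
precision = sum(valid) / len(valid) if valid else None

# Recall as described in the text: rows with a valid prediction / all rows.
recall = len(valid) / len(counts)

print("match counts:", counts)
print("average over valid rows:", precision)
print("recall:", recall)
```

The per-row counts match the Precision column (1 and 2, with the remaining rows treated as NA), and the recall matches the 2/5 stated in the text.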
Sample Submission
Submissions should be made in the same format as the sample
provided.
The prediction dataset should be a .csv file with 19,981 rows (plus one
header row) and the columns user_id and pred3, in the same format as the
file below.
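A minimal sketch of producing such a file in Python follows. The exact encoding of the pred3 column is defined by the sample file, which is not reproduced here, so joining the three categories with ", " inside a single field is purely an assumption and should be adjusted to match the sample. The function name `format_submission` and the user ids 101 and 102 are hypothetical.

```python
import csv
import io

EXPECTED_ROWS = 19_981  # row count stated in the submission instructions


def format_submission(predictions, expected_rows=EXPECTED_ROWS):
    """Build CSV text with a header row and one row per user.

    predictions maps user_id -> list of exactly three category names.
    Pass expected_rows=None to skip the row-count check (e.g. in a demo).
    """
    if expected_rows is not None and len(predictions) != expected_rows:
        raise ValueError(
            f"expected {expected_rows} rows, got {len(predictions)}"
        )
    buffer = io.StringIO()
    writer = csv.writer(buffer, lineterminator="\n")
    writer.writerow(["user_id", "pred3"])
    for user_id, categories in predictions.items():
        if len(categories) != 3:
            raise ValueError(f"user {user_id} needs exactly 3 categories")
        # Assumption: pred3 holds the three categories joined by ", ".
        writer.writerow([user_id, ", ".join(categories)])
    return buffer.getvalue()


# Hypothetical two-user example; a real submission has 19,981 users.
example = {
    101: ["Phones", "Comics", "Fruits"],
    102: ["Groceries", "Fruits", "Phones"],
}
csv_text = format_submission(example, expected_rows=None)
print(csv_text)
```

To write the result to disk, the returned text can be saved with `open("submission.csv", "w", newline="")`; the file name here is also just a placeholder.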