Shopee Python-Pandas Test (45 Mins)
Shopee Python-Pandas Test (45 Mins)
In this task, you'll be analysing listings data from our Shopee Platform.
You may use the Pycharm IDE installed, Sublime or other windows native text editors. Please save
your python source code on the desktop. You may use the internet for help.
The dataset is stored in the Test_Pandas.xlsx file. It contains listing information posted on Shopee.
One single listing corresponds to one row in the dataset.
4. Find Top 3 shopid with the highest revenue (Assumption: the product price has not been
changed.)
5. Find number of products that have more than 3 variations (do not include products with 3 or
fewer variations)
6. Use pandas function to identify duplicated listings within each shop (If listing A and B in shop
S have the exactly same product title, product detailed description, and price, both listing A
and B are considered as duplicated listings)
a. Mark those duplicated listings with True otherwise False and store the marking
result in a new column named “is_duplicated”
b. Find duplicate listings that has less than 2 sold count and store the result in a new
excel file named “duplicated_listings.xlsx”
c. Find the preferred shop shopid that have the most number of duplicated listings