Python - Project3 Problem Statement

The document outlines a project focused on predicting sales figures for counterfeit medicines, which pose significant health risks and are prevalent in developing countries. It includes a formal problem statement, data files for training and testing, and evaluation criteria based on mean absolute error (MAE). The project is divided into two parts: a quiz based on data exploration and the creation of machine learning models to minimize MAE for successful submission.

Uploaded by

Hem Kuniyal

16/01/2019 Project3

Counterfeit Medicines Sales Prediction

Counterfeit medicines are fake medicines that are contaminated, contain the wrong active ingredient, or
contain no active ingredient at all. Some have the right active ingredient but at the wrong dose. Counterfeit
drugs are illegal and harmful to health. 10% of the world's medicine is counterfeit, and the problem is even
worse in developing countries, where up to 30% of medicines are counterfeit.

Millions of pills, bottles and sachets of counterfeit and illegal medicines are being traded across the world. The
World Health Organization (WHO) is working with the International Criminal Police Organization (Interpol) to
dislodge the criminal networks raking in billions of dollars from this cynical trade.

Despite all these efforts, counterfeit medicine rackets keep appearing. It has become a challenge to deploy
resources against them without spreading those resources so thin that they become ineffective. The
government has decided to focus on illegal operations of high net worth first instead of trying to control all of
them. To do that, it has collected data that will help predict sales figures from an illegal operation's
characteristics.

Data Files

Train Dataset = counterfeit_train.csv

Test Dataset = counterfeit_test.csv

Formal Problem Statement

Variable names are self-explanatory.

Your task is to build a model that predicts sales figures from the other information available about counterfeit
medicine selling operations. Build your model on the train dataset. The test dataset does not have a response
column; you need to predict those values and submit them in CSV format.
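Before fitting anything more elaborate, it can help to have a reference point to beat. The sketch below is one
possible baseline, not a prescribed method: it predicts the training median of the response for every test row,
since the median is the constant prediction that minimizes mean absolute error. The function name and the
`target_col` and `id_col` parameters are hypothetical; the real column names have to be read from the CSV
headers, which are not listed in this document.

```python
import csv
import statistics


def median_baseline(train_path, test_path, out_path, target_col, id_col):
    """Write a submission that predicts the training median for every test row.

    target_col and id_col are placeholders: pass the actual response and
    identifier column names found in the train and test CSV headers.
    """
    # Read the response column from the training file.
    with open(train_path, newline="") as f:
        sales = [float(row[target_col]) for row in csv.DictReader(f)]
    baseline = statistics.median(sales)

    # Read the identifiers of the rows that need a prediction.
    with open(test_path, newline="") as f:
        test_ids = [row[id_col] for row in csv.DictReader(f)]

    # Write one prediction per test row, in the same order as the test file.
    with open(out_path, "w", newline="") as f:
        writer = csv.writer(f)
        writer.writerow([id_col, target_col])
        for row_id in test_ids:
            writer.writerow([row_id, baseline])
    return baseline
```

Any model worth submitting should achieve a lower MAE than this constant prediction on held-out training rows.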

Evaluation Criterion

Part 1:

You will first attempt Part 1 of this project, which is a quiz. You can access it through the LMS. The quiz is
based on exploration of the given dataset and on some generic questions about algorithms discussed in the
course. Consider only the training dataset for data cleaning and exploration when answering the quiz questions.
There will be 10 questions, of which you need to get at least 7 correct in order to pass the project.

Part 2:

Here you create machine learning models and choose the one that gives the best performance. You can refer
to the Project Process Guides provided in the LMS to understand how to approach and work on a project.

For this project, score will be calculated as:

Score = 1-(MAE/1660)

where MAE is the mean absolute error on the test file. You need to score more than 0.5 in order to pass the
project submission. Don't read too much into the score formulation; it only rescales MAE. You just need to
focus on minimizing MAE.
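The scoring rule is simple enough to check by hand. The sketch below assumes MAE is computed in the usual
way, as the average of the absolute differences between actual and predicted values; the function names are
illustrative and not part of any course tooling. Rearranging the formula, a score above 0.5 is the same as an
MAE below 830.

```python
def mean_absolute_error(actual, predicted):
    """Average absolute difference between actual and predicted values."""
    if len(actual) != len(predicted):
        raise ValueError("actual and predicted must have the same length")
    if not actual:
        raise ValueError("need at least one value")
    return sum(abs(a - p) for a, p in zip(actual, predicted)) / len(actual)


def project_score(mae, scale=1660.0):
    """Score = 1 - (MAE / 1660), as stated in the problem statement."""
    return 1.0 - mae / scale


def passes(mae):
    """The submission passes only if the score is strictly above 0.5."""
    return project_score(mae) > 0.5
```

Since the test responses are not provided, this can only be evaluated on a validation split held out from the
training data.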

Submission:

Submission CSV should resemble the file:

Sample Submission = 'sample_submission.csv'

Column names and value types should be exactly the same. The number of rows in the submission CSV should
also be exactly the same as in the test data. If this is not taken care of, your submission will not be graded.
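These requirements can be checked locally before uploading. The following is a minimal sketch under stated
assumptions: the grader's actual checks are not documented here, so the function below (a hypothetical helper,
`check_submission`) only compares the header against the sample file, compares the row count against the test
file, and treats a column as numeric when every value in the sample file parses as a number.

```python
import csv


def _read(path):
    """Return (header, data_rows) for a CSV file."""
    with open(path, newline="") as f:
        rows = list(csv.reader(f))
    if not rows:
        raise ValueError(f"{path} is empty")
    return rows[0], rows[1:]


def _is_number(text):
    try:
        float(text)
        return True
    except ValueError:
        return False


def check_submission(submission_path, sample_path, test_path):
    """Return a list of problems; an empty list means these checks passed."""
    problems = []
    sub_header, sub_rows = _read(submission_path)
    sample_header, sample_rows = _read(sample_path)
    _, test_rows = _read(test_path)

    if sub_header != sample_header:
        problems.append(f"header {sub_header} differs from sample {sample_header}")
    if len(sub_rows) != len(test_rows):
        problems.append(f"{len(sub_rows)} rows, but test data has {len(test_rows)}")

    width = len(sample_header)
    if any(len(r) != width for r in sub_rows):
        problems.append("some rows have the wrong number of columns")
    elif sub_header == sample_header and sample_rows:
        # Assumed type rule: numeric in the sample means numeric in the submission.
        for i, name in enumerate(sample_header):
            sample_numeric = all(len(r) > i and _is_number(r[i]) for r in sample_rows)
            if sample_numeric and not all(_is_number(r[i]) for r in sub_rows):
                problems.append(f"column {name!r} has non-numeric values")
    return problems
```

An empty result does not guarantee that the submission will be graded; it only rules out the mismatches listed
above.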

You can make as many submissions as you want. [We might ask you to submit the script that was used to
generate the submission at any time.]

In order to clear this project, you are required to clear both Part 1 and Part 2 of this assignment.

Wish you all the best!
