0% found this document useful (0 votes)
139 views2 pages

Assignment 2, Quiz 2 & Quiz 3 PDF

Assignment 2 involves a group presentation and data science project. Students will form groups of up to 3 members. The presentation must not plagiarize other groups and will be evaluated on content, delivery, and response to questions. The data science project involves importing a wine dataset, preprocessing with feature selection and extraction, and classification with various models to compare performance. Quizzes 2 and 3 cover topics related to probability, modeling techniques, and Python libraries for data science.

Uploaded by

Saif Ali Khan
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
139 views2 pages

Assignment 2, Quiz 2 & Quiz 3 PDF

Assignment 2 involves a group presentation and data science project. Students will form groups of up to 3 members. The presentation must not plagiarize other groups and will be evaluated on content, delivery, and response to questions. The data science project involves importing a wine dataset, preprocessing with feature selection and extraction, and classification with various models to compare performance. Quizzes 2 and 3 cover topics related to probability, modeling techniques, and Python libraries for data science.

Uploaded by

Saif Ali Khan
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 2

Assignment 2

 Assignment 2 is based on group based activities


 Make a group of maximum three students for the assignments (both Part 1 & Part 2).
 Plagiarism is strictly prohibited. If your work (Jupiter notebook (Part2) and PPT
(Part1)) is found to be plagiarised from other groups, strict action will be taken against
respective group of students. They will be awarded zero mark based on evidence.
 Fill your group details in Google file shared by one of the Class Representatives.

Assignment2_Part1: 15 Marks [Presentation will start from 9th November, 2020]

Instructions:

 It will be a group presentation in the allocated lecture slot as per assigned time
slot.
 Each group will be allotted 15 minutes for the presentation.
 Each member in the group must participate in the presentation. Prepare a PPT for
your presentation. PPT must contain Name and Roll Number of the group members
in the first page. After preparing PPT for the presentation, send it to my email –
[email protected]. Evaluation will be done based on quality of the content,
delivery of presentation and response to queries if any. Marks are awarded to
individual based on individual contribution to the group. You will be assessed based
on the individual performance in the group.
 The presentation PPT is to be submitted in Eduserver.

Topics of Presentation:

Group Name Contents of Presentation Remarks


Group1 to Group 5 Part1, Part2, Part3, Part4, Part 5 of See Attached
attached File1 (Topics from 4th Module) File1
Group 6 to Group 9 CHAPTER 2: Reference Book
Introduction to NumPy
Group 10 to Group 16 CHAPTER 3: Reference Book
Data Manipulation with Pandas
Group 17 to Group 23 CHAPTER 4: Reference Book
Visualization with Matplotlib
Group 24 to Group 33 CHAPTER 5 Reference Book
Machine Learning
Group 34-Group 36 Latest Data science technologies and tools
in the Medical science
Group 37-Group 39 Latest Data science technologies and tools
in the Agriculture
Group 40-Group 42 Latest Data science technologies and tools
for Natural Language processing
Group 43-Group 45 Latest Data science technologies and tools
in the Education
Group 46-Group 48 Latest Data science technologies and tools
for societal benefit
Group 48-Group the Emerging Trends in Data Science
end

Assignment 2_Part 2: 15 Marks [To be submitted by 15th November, 2020]

Instructions:

 Group evaluation will be done based on the submitted assignment.


 The files to be uploaded are: (1) python notebook with its original extension (2)
exported PDF of notebook (along with outputs).
 Name of file should be “GroupNo_Name of members”
 The python notebook should contain proper comments and details in the code with
bold title of appropriate sub section.

Implementation:
i. Import red wine data set

ii. Data Pre-processing: (1) Feature selection (Ranking of feature using variance,
Information Gain (IG) attribute evaluation, (2) Feature Forward method (3) Feature
extraction using Principle Component Analysis (PCA).

iii. Perform Classification task with and without feature selection using classifiers
(Bayes theorem, Decision Tree, Support Vector Machine (SVM), Random Forest). Do
comparative study on mentioned classifiers using evaluation metrics (Precision,
Recall, F1 Score) in tabular form.

%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%

Syllabus for Quiz 2: (10 Marks) [Tentative schedule: 16-11-2020]


Syllabus: Basic concepts of probability for Data science, Feature selection & Extraction,
Linear Regression, k-Nearest Neighbours (k-NN),Decision Trees (ID3& CART) , Basic
knowledge about Python Programming libraries for data science.

Syllabus for Quiz 3: (10 Marks)[Tentative scheduled: 07-12-2020]


Syllabus: Random Forests, Linear Support Vector Machines, Recommendation Systems,
Ideas and Tools for Data Visualization, Singular Value Decomposition.

You might also like